putative gene coding: Topics by Science.gov

Sample records for putative gene coding

A Catalogue of Putative cis-Regulatory Interactions Between Long Non-coding RNAs and Proximal Coding Genes Based on Correlative Analysis Across Diverse Human Tumors.

PubMed

Basu, Swaraj; Larsson, Erik

2018-05-31

Antisense transcripts and other long non-coding RNAs are pervasive in mammalian cells, and some of these molecules have been proposed to regulate proximal protein-coding genes in cis. For example, non-coding transcription can contribute to inactivation of tumor suppressor genes in cancer, and antisense transcripts have been implicated in the epigenetic inactivation of imprinted genes. However, our knowledge is still limited and more such regulatory interactions likely await discovery. Here, we make use of available gene expression data from a large compendium of human tumors to generate hypotheses regarding non-coding-to-coding cis-regulatory relationships with emphasis on negative associations, as these are less likely to arise for reasons other than cis-regulation. We document a large number of possible regulatory interactions, including 193 coding/non-coding pairs that show expression patterns compatible with negative cis-regulation. Importantly, by this approach we capture several known cases, and many of the involved coding genes have known roles in cancer. Our study provides a large catalog of putative non-coding/coding cis-regulatory pairs that may serve as a basis for further experimental validation and characterization. Copyright © 2018 Basu and Larsson.
Identification and Characterization of Long Non-Coding RNAs Related to Mouse Embryonic Brain Development from Available Transcriptomic Data

PubMed Central

He, Hongjuan; Xiu, Youcheng; Guo, Jing; Liu, Hui; Liu, Qi; Zeng, Tiebo; Chen, Yan; Zhang, Yan; Wu, Qiong

2013-01-01

Long non-coding RNAs (lncRNAs) as a key group of non-coding RNAs have gained widely attention. Though lncRNAs have been functionally annotated and systematic explored in higher mammals, few are under systematical identification and annotation. Owing to the expression specificity, known lncRNAs expressed in embryonic brain tissues remain still limited. Considering a large number of lncRNAs are only transcribed in brain tissues, studies of lncRNAs in developmental brain are therefore of special interest. Here, publicly available RNA-sequencing (RNA-seq) data in embryonic brain are integrated to identify thousands of embryonic brain lncRNAs by a customized pipeline. A significant proportion of novel transcripts have not been annotated by available genomic resources. The putative embryonic brain lncRNAs are shorter in length, less spliced and show less conservation than known genes. The expression of putative lncRNAs is in one tenth on average of known coding genes, while comparable with known lncRNAs. From chromatin data, putative embryonic brain lncRNAs are associated with active chromatin marks, comparable with known lncRNAs. Embryonic brain expressed lncRNAs are also indicated to have expression though not evident in adult brain. Gene Ontology analysis of putative embryonic brain lncRNAs suggests that they are associated with brain development. The putative lncRNAs are shown to be related to possible cis-regulatory roles in imprinting even themselves are deemed to be imprinted lncRNAs. Re-analysis of one knockdown data suggests that four regulators are associated with lncRNAs. Taken together, the identification and systematic analysis of putative lncRNAs would provide novel insights into uncharacterized mouse non-coding regions and the relationships with mammalian embryonic brain development. PMID:23967161
Biodegradation of the Organic Disulfide 4,4′-Dithiodibutyric Acid by Rhodococcus spp.

PubMed Central

Khairy, Heba; Wübbeler, Jan Hendrik

2015-01-01

Four Rhodococcus spp. exhibited the ability to use 4,4′-dithiodibutyric acid (DTDB) as a sole carbon source for growth. The most important step for the production of a novel polythioester (PTE) using DTDB as a precursor substrate is the initial cleavage of DTDB. Thus, identification of the enzyme responsible for this step was mandatory. Because Rhodococcus erythropolis strain MI2 serves as a model organism for elucidation of the biodegradation of DTDB, it was used to identify the genes encoding the enzymes involved in DTDB utilization. To identify these genes, transposon mutagenesis of R. erythropolis MI2 was carried out using transposon pTNR-TA. Among 3,261 mutants screened, 8 showed no growth with DTDB as the sole carbon source. In five mutants, the insertion locus was mapped either within a gene coding for a polysaccharide deacetyltransferase, a putative ATPase, or an acetyl coenzyme A transferase, 1 bp upstream of a gene coding for a putative methylase, or 176 bp downstream of a gene coding for a putative kinase. In another mutant, the insertion was localized between genes encoding a putative transcriptional regulator of the TetR family (noxR) and an NADH:flavin oxidoreductase (nox). Moreover, in two other mutants, the insertion loci were mapped within a gene encoding a hypothetical protein in the vicinity of noxR and nox. The interruption mutant generated, R. erythropolis MI2 noxΩtsr, was unable to grow with DTDB as the sole carbon source. Subsequently, nox was overexpressed and purified, and its activity with DTDB was measured. The specific enzyme activity of Nox amounted to 1.2 ± 0.15 U/mg. Therefore, we propose that Nox is responsible for the initial cleavage of DTDB into 2 molecules of 4-mercaptobutyric acid (4MB). PMID:26407888
Intrinsic and extrinsic approaches for detecting genes in a bacterial genome.

PubMed Central

Borodovsky, M; Rudd, K E; Koonin, E V

1994-01-01

The unannotated regions of the Escherichia coli genome DNA sequence from the EcoSeq6 database, totaling 1,278 'intergenic' sequences of the combined length of 359,279 basepairs, were analyzed using computer-assisted methods with the aim of identifying putative unknown genes. The proposed strategy for finding new genes includes two key elements: i) prediction of expressed open reading frames (ORFs) using the GeneMark method based on Markov chain models for coding and non-coding regions of Escherichia coli DNA, and ii) search for protein sequence similarities using programs based on the BLAST algorithm and programs for motif identification. A total of 354 putative expressed ORFs were predicted by GeneMark. Using the BLASTX and TBLASTN programs, it was shown that 208 ORFs located in the unannotated regions of the E. coli chromosome are significantly similar to other protein sequences. Identification of 182 ORFs as probable genes was supported by GeneMark and BLAST, comprising 51.4% of the GeneMark 'hits' and 87.5% of the BLAST 'hits'. 73 putative new genes, comprising 20.6% of the GeneMark predictions, belong to ancient conserved protein families that include both eubacterial and eukaryotic members. This value is close to the overall proportion of highly conserved sequences among eubacterial proteins, indicating that the majority of the putative expressed ORFs that are predicted by GeneMark, but have no significant BLAST hits, nevertheless are likely to be real genes. The majority of the putative genes identified by BLAST search have been described since the release of the EcoSeq6 database, but about 70 genes have not been detected so far. Among these new identifications are genes encoding proteins with a variety of predicted functions including dehydrogenases, kinases, several other metabolic enzymes, ATPases, rRNA methyltransferases, membrane proteins, and different types of regulatory proteins. PMID:7984428
Emerging Putative Associations between Non-Coding RNAs and Protein-Coding Genes in Neuropathic Pain: Added Value from Reusing Microarray Data

PubMed Central

Raju, Hemalatha B.; Tsinoremas, Nicholas F.; Capobianco, Enrico

2016-01-01

Regeneration of injured nerves is likely occurring in the peripheral nervous system, but not in the central nervous system. Although protein-coding gene expression has been assessed during nerve regeneration, little is currently known about the role of non-coding RNAs (ncRNAs). This leaves open questions about the potential effects of ncRNAs at transcriptome level. Due to the limited availability of human neuropathic pain (NP) data, we have identified the most comprehensive time-course gene expression profile referred to sciatic nerve (SN) injury and studied in a rat model using two neuronal tissues, namely dorsal root ganglion (DRG) and SN. We have developed a methodology to identify differentially expressed bioentities starting from microarray probes and repurposing them to annotate ncRNAs, while analyzing the expression profiles of protein-coding genes. The approach is designed to reuse microarray data and perform first profiling and then meta-analysis through three main steps. First, we used contextual analysis to identify what we considered putative or potential protein-coding targets for selected ncRNAs. Relevance was therefore assigned to differential expression of neighbor protein-coding genes, with neighborhood defined by a fixed genomic distance from long or antisense ncRNA loci, and of parental genes associated with pseudogenes. Second, connectivity among putative targets was used to build networks, in turn useful to conduct inference at interactomic scale. Last, network paths were annotated to assess relevance to NP. We found significant differential expression in long-intergenic ncRNAs (32 lincRNAs in SN and 8 in DRG), antisense RNA (31 asRNA in SN and 12 in DRG), and pseudogenes (456 in SN and 56 in DRG). In particular, contextual analysis centered on pseudogenes revealed some targets with known association to neurodegeneration and/or neurogenesis processes. 
While modules of the olfactory receptors were clearly identified in protein–protein interaction networks, other connectivity paths were identified between proteins already investigated in studies on disorders, such as Parkinson, Down syndrome, Huntington disease, and Alzheimer. Our findings suggest the importance of reusing gene expression data by meta-analysis approaches. PMID:27803687
Emerging Putative Associations between Non-Coding RNAs and Protein-Coding Genes in Neuropathic Pain: Added Value from Reusing Microarray Data.

PubMed

Raju, Hemalatha B; Tsinoremas, Nicholas F; Capobianco, Enrico

2016-01-01

Regeneration of injured nerves is likely occurring in the peripheral nervous system, but not in the central nervous system. Although protein-coding gene expression has been assessed during nerve regeneration, little is currently known about the role of non-coding RNAs (ncRNAs). This leaves open questions about the potential effects of ncRNAs at transcriptome level. Due to the limited availability of human neuropathic pain (NP) data, we have identified the most comprehensive time-course gene expression profile referred to sciatic nerve (SN) injury and studied in a rat model using two neuronal tissues, namely dorsal root ganglion (DRG) and SN. We have developed a methodology to identify differentially expressed bioentities starting from microarray probes and repurposing them to annotate ncRNAs, while analyzing the expression profiles of protein-coding genes. The approach is designed to reuse microarray data and perform first profiling and then meta-analysis through three main steps. First, we used contextual analysis to identify what we considered putative or potential protein-coding targets for selected ncRNAs. Relevance was therefore assigned to differential expression of neighbor protein-coding genes, with neighborhood defined by a fixed genomic distance from long or antisense ncRNA loci, and of parental genes associated with pseudogenes. Second, connectivity among putative targets was used to build networks, in turn useful to conduct inference at interactomic scale. Last, network paths were annotated to assess relevance to NP. We found significant differential expression in long-intergenic ncRNAs (32 lincRNAs in SN and 8 in DRG), antisense RNA (31 asRNA in SN and 12 in DRG), and pseudogenes (456 in SN and 56 in DRG). In particular, contextual analysis centered on pseudogenes revealed some targets with known association to neurodegeneration and/or neurogenesis processes. 
While modules of the olfactory receptors were clearly identified in protein-protein interaction networks, other connectivity paths were identified between proteins already investigated in studies on disorders, such as Parkinson, Down syndrome, Huntington disease, and Alzheimer. Our findings suggest the importance of reusing gene expression data by meta-analysis approaches.
Characterization of a Theta-Type Plasmid from Lactobacillus sakei: a Potential Basis for Low-Copy-Number Vectors in Lactobacilli

PubMed Central

Alpert, Carl-Alfred; Crutz-Le Coq, Anne-Marie; Malleret, Christine; Zagorec, Monique

2003-01-01

The complete nucleotide sequence of the 13-kb plasmid pRV500, isolated from Lactobacillus sakei RV332, was determined. Sequence analysis enabled the identification of genes coding for a putative type I restriction-modification system, two genes coding for putative recombinases of the integrase family, and a region likely involved in replication. The structural features of this region, comprising a putative ori segment containing 11- and 22-bp repeats and a repA gene coding for a putative initiator protein, indicated that pRV500 belongs to the pUCL287 subfamily of theta-type replicons. A 3.7-kb fragment encompassing this region was fused to an Escherichia coli replicon to produce the shuttle vector pRV566 and was observed to be functional in L. sakei for plasmid replication. The L. sakei replicon alone could not support replication in E. coli. Plasmid pRV500 and its derivative pRV566 were determined to be at very low copy numbers in L. sakei. pRV566 was maintained at a reasonable rate over 20 generations in several lactobacilli, such as Lactobacillus curvatus, Lactobacillus casei, and Lactobacillus plantarum, in addition to L. sakei, making it an interesting basis for developing vectors. Sequence relationships with other plasmids are described and discussed. PMID:12957947
Biodegradation of the organic disulfide 4,4'-dithiodibutyric acid by Rhodococcus spp.

PubMed

Khairy, Heba; Wübbeler, Jan Hendrik; Steinbüchel, Alexander

2015-12-01

Four Rhodococcus spp. exhibited the ability to use 4,4'-dithiodibutyric acid (DTDB) as a sole carbon source for growth. The most important step for the production of a novel polythioester (PTE) using DTDB as a precursor substrate is the initial cleavage of DTDB. Thus, identification of the enzyme responsible for this step was mandatory. Because Rhodococcus erythropolis strain MI2 serves as a model organism for elucidation of the biodegradation of DTDB, it was used to identify the genes encoding the enzymes involved in DTDB utilization. To identify these genes, transposon mutagenesis of R. erythropolis MI2 was carried out using transposon pTNR-TA. Among 3,261 mutants screened, 8 showed no growth with DTDB as the sole carbon source. In five mutants, the insertion locus was mapped either within a gene coding for a polysaccharide deacetyltransferase, a putative ATPase, or an acetyl coenzyme A transferase, 1 bp upstream of a gene coding for a putative methylase, or 176 bp downstream of a gene coding for a putative kinase. In another mutant, the insertion was localized between genes encoding a putative transcriptional regulator of the TetR family (noxR) and an NADH:flavin oxidoreductase (nox). Moreover, in two other mutants, the insertion loci were mapped within a gene encoding a hypothetical protein in the vicinity of noxR and nox. The interruption mutant generated, R. erythropolis MI2 noxΩtsr, was unable to grow with DTDB as the sole carbon source. Subsequently, nox was overexpressed and purified, and its activity with DTDB was measured. The specific enzyme activity of Nox amounted to 1.2 ± 0.15 U/mg. Therefore, we propose that Nox is responsible for the initial cleavage of DTDB into 2 molecules of 4-mercaptobutyric acid (4MB). Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Transcriptional landscapes of Axolotl (Ambystoma mexicanum).

PubMed

Caballero-Pérez, Juan; Espinal-Centeno, Annie; Falcon, Francisco; García-Ortega, Luis F; Curiel-Quesada, Everardo; Cruz-Hernández, Andrés; Bako, Laszlo; Chen, Xuemei; Martínez, Octavio; Alberto Arteaga-Vázquez, Mario; Herrera-Estrella, Luis; Cruz-Ramírez, Alfredo

2018-01-15

The axolotl (Ambystoma mexicanum) is the vertebrate model system with the highest regeneration capacity. Experimental tools established over the past 100 years have been fundamental to start unraveling the cellular and molecular basis of tissue and limb regeneration. In the absence of a reference genome for the Axolotl, transcriptomic analysis become fundamental to understand the genetic basis of regeneration. Here we present one of the most diverse transcriptomic data sets for Axolotl by profiling coding and non-coding RNAs from diverse tissues. We reconstructed a population of 115,906 putative protein coding mRNAs as full ORFs (including isoforms). We also identified 352 conserved miRNAs and 297 novel putative mature miRNAs. Systematic enrichment analysis of gene expression allowed us to identify tissue-specific protein-coding transcripts. We also found putative novel and conserved microRNAs which potentially target mRNAs which are reported as important disease candidates in heart and liver. Copyright © 2017 Elsevier Inc. All rights reserved.
Novel insights into the response of Atlantic salmon (Salmo salar) to Piscirickettsia salmonis: Interplay of coding genes and lncRNAs during bacterial infection.

PubMed

Valenzuela-Miranda, Diego; Gallardo-Escárate, Cristian

2016-12-01

Despite the high prevalence and impact to Chilean salmon aquaculture of the intracellular bacterium Piscirickettsia salmonis, the molecular underpinnings of host-pathogen interactions remain unclear. Herein, the interplay of coding and non-coding transcripts has been proposed as a key mechanism involved in immune response. Therefore, the aim of this study was to evidence how coding and non-coding transcripts are modulated during the infection process of Atlantic salmon with P. salmonis. For this, RNA-seq was conducted in brain, spleen, and head kidney samples, revealing different transcriptional profiles according to bacterial load. Additionally, while most of the regulated genes annotated for diverse biological processes during infection, a common response associated with clathrin-mediated endocytosis and iron homeostasis was present in all tissues. Interestingly, while endocytosis-promoting factors and clathrin inductions were upregulated, endocytic receptors were mainly downregulated. Furthermore, the regulation of genes related to iron homeostasis suggested an intracellular accumulation of iron, a process in which heme biosynthesis/degradation pathways might play an important role. Regarding the non-coding response, 918 putative long non-coding RNAs were identified, where 425 were newly characterized for S. salar. Finally, co-localization and co-expression analyses revealed a strong correlation between the modulations of long non-coding RNAs and genes associated with endocytosis and iron homeostasis. These results represent the first comprehensive study of putative interplaying mechanisms of coding and non-coding RNAs during bacterial infection in salmonids. Copyright © 2016 Elsevier Ltd. All rights reserved.
Analysis of the Genome of the Sexually Transmitted Insect Virus Helicoverpa zea Nudivirus 2

PubMed Central

Burand, John P.; Kim, Woojin; Afonso, Claudio L.; Tulman, Edan R.; Kutish, Gerald F.; Lu, Zhiqiang; Rock, Daniel L.

2012-01-01

The sexually transmitted insect virus Helicoverpa zea nudivirus 2 (HzNV-2) was determined to have a circular double-stranded DNA genome of 231,621 bp coding for an estimated 113 open reading frames (ORFs). HzNV-2 is most closely related to the nudiviruses, a sister group of the insect baculoviruses. Several putative ORFs that share homology with the baculovirus core genes were identified in the viral genome. However, HzNV-2 lacks several key genetic features of baculoviruses including the late transcriptional regulation factor, LEF-1 and the palindromic hrs, which serve as origins of replication. The HzNV-2 genome was found to code for three ORFs that had significant sequence homology to cellular genes which are not generally found in viral genomes. These included a presumed juvenile hormone esterase gene, a gene coding for a putative zinc-dependent matrix metalloprotease, and a major facilitator superfamily protein gene; all of which are believed to play a role in the cellular proliferation and the tissue hypertrophy observed in the malformation of reproductive organs observed in HzNV-2 infected corn earworm moths, Helicoverpa zea. PMID:22355451
Analysis of the genome of the sexually transmitted insect virus Helicoverpa zea nudivirus 2.

PubMed

Burand, John P; Kim, Woojin; Afonso, Claudio L; Tulman, Edan R; Kutish, Gerald F; Lu, Zhiqiang; Rock, Daniel L

2012-01-01

The sexually transmitted insect virus Helicoverpa zea nudivirus 2 (HzNV-2) was determined to have a circular double-stranded DNA genome of 231,621 bp coding for an estimated 113 open reading frames (ORFs). HzNV-2 is most closely related to the nudiviruses, a sister group of the insect baculoviruses. Several putative ORFs that share homology with the baculovirus core genes were identified in the viral genome. However, HzNV-2 lacks several key genetic features of baculoviruses including the late transcriptional regulation factor, LEF-1 and the palindromic hrs, which serve as origins of replication. The HzNV-2 genome was found to code for three ORFs that had significant sequence homology to cellular genes which are not generally found in viral genomes. These included a presumed juvenile hormone esterase gene, a gene coding for a putative zinc-dependent matrix metalloprotease, and a major facilitator superfamily protein gene; all of which are believed to play a role in the cellular proliferation and the tissue hypertrophy observed in the malformation of reproductive organs observed in HzNV-2 infected corn earworm moths, Helicoverpa zea.
Systematic asymmetric nucleotide exchanges produce human mitochondrial RNAs cryptically encoding for overlapping protein coding genes.

PubMed

Seligmann, Hervé

2013-05-07

GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges. Copyright © 2013 Elsevier Ltd. All rights reserved.
Network perturbation by recurrent regulatory variants in cancer

PubMed Central

Cho, Ara; Lee, Insuk; Choi, Jung Kyoon

2017-01-01

Cancer driving genes have been identified as recurrently affected by variants that alter protein-coding sequences. However, a majority of cancer variants arise in noncoding regions, and some of them are thought to play a critical role through transcriptional perturbation. Here we identified putative transcriptional driver genes based on combinatorial variant recurrence in cis-regulatory regions. The identified genes showed high connectivity in the cancer type-specific transcription regulatory network, with high outdegree and many downstream genes, highlighting their causative role during tumorigenesis. In the protein interactome, the identified transcriptional drivers were not as highly connected as coding driver genes but appeared to form a network module centered on the coding drivers. The coding and regulatory variants associated via these interactions between the coding and transcriptional drivers showed exclusive and complementary occurrence patterns across tumor samples. Transcriptional cancer drivers may act through an extensive perturbation of the regulatory network and by altering protein network modules through interactions with coding driver genes. PMID:28333928
Influence of putative exopolysaccharide genes on Pseudomonas putida KT2440 biofilm stability.

PubMed

Nilsson, Martin; Chiang, Wen-Chi; Fazli, Mustafa; Gjermansen, Morten; Givskov, Michael; Tolker-Nielsen, Tim

2011-05-01

We report a study of the role of putative exopolysaccharide gene clusters in the formation and stability of Pseudomonas putida KT2440 biofilm. Two novel putative exopolysaccharide gene clusters, pea and peb, were identified, and evidence is provided that they encode products that stabilize P. putida KT2440 biofilm. The gene clusters alg and bcs, which code for proteins mediating alginate and cellulose biosynthesis, were found to play minor roles in P. putida KT2440 biofilm formation and stability under the conditions tested. A P. putida KT2440 derivative devoid of any identifiable exopolysaccharide genes was found to form biofilm with a structure similar to wild-type biofilm, but with a stability lower than that of wild-type biofilm. Based on our data, we suggest that the formation of structured P. putida KT2440 biofilm can occur in the absence of exopolysaccharides; however, exopolysaccharides play a role as structural stabilizers. © 2011 Society for Applied Microbiology and Blackwell Publishing Ltd.
Molecular cloning of the mouse gene coding for α2-macroglobulin and targeting of the gene in embryonic stem cells

DOE Office of Scientific and Technical Information (OSTI.GOV)

Umans, L.; Serneels, L.; Hilliker, C.

1994-08-01

The authors have cloned the mouse gene coding for α2-macroglobulin in overlapping λ clones and have analyzed its structure. The gene contains 36 exons, coding for the 4.8-kb cDNA that we cloned previously. Including putative control elements in the 5′ flanking region, the gene covers about 45 kb. A region of 3.8 kb, stretching from 835 bases upstream of the cDNA start site to exon 4, including all intervening sequences, was sequenced completely. The analysis demonstrated that the putative promoter region of the mouse A2M gene differed considerably from the known promoter sequences of the human A2M gene and of the rat acute-phase A2M gene. Comparison of the exon-intron structure of all known genes of the A2M family confirmed that the rat acute-phase A2M gene is more closely related to the human gene than to the mouse A2M gene. To generate mice with the A2M gene inactivated, an insertion type of construct containing 7.5 kb of genomic DNA of the mouse strain 129/J, encompassing exons 16 to 19, was synthesized. A hygromycin marker gene was embedded in intron 17. After electroporation, 198 hygromycin-resistant ES cell lines were isolated and analyzed by Southern blotting. Five ES cell lines were obtained with one allele of the mouse A2M gene targeted by this insertion construct, demonstrating that the position and the characteristics of the vector served the intended goal.
Molecular Evolution of the Non-Coding Eosinophil Granule Ontogeny Transcript

PubMed Central

Rose, Dominic; Stadler, Peter F.

2011-01-01

Eukaryotic genomes are pervasively transcribed. A large fraction of the transcriptional output consists of long, mRNA-like, non-protein-coding transcripts (mlncRNAs). The evolutionary history of mlncRNAs is still largely uncharted territory. In this contribution, we explore in detail the evolutionary traces of the eosinophil granule ontogeny transcript (EGOT), an experimentally confirmed representative of an abundant class of totally intronic non-coding transcripts (TINs). EGOT is located antisense to an intron of the ITPR1 gene. We computationally identify putative EGOT orthologs in the genomes of 32 different amniotes, including orthologs from primates, rodents, ungulates, carnivores, afrotherians, and xenarthrans, as well as putative candidates from basal amniotes, such as opossum or platypus. We investigate the EGOT gene phylogeny, analyze patterns of sequence conservation, and the evolutionary conservation of the EGOT gene structure. We show that EGO-B, the spliced isoform, may be present throughout the placental mammals, but most likely dates back even further. We demonstrate here for the first time that the whole EGOT locus is highly structured, containing several evolutionary conserved, and thermodynamic stable secondary structures. Our analyses allow us to postulate novel functional roles of a hitherto poorly understood region at the intron of EGO-B which is highly conserved at the sequence level. The region contains a novel ITPR1 exon and also conserved RNA secondary structures together with a conserved TATA-like element, which putatively acts as a promoter of an independent regulatory element. PMID:22303364
Isolation and molecular identification of Sunshine virus, a novel paramyxovirus found in Australian snakes.

PubMed

Hyndman, Timothy H; Marschang, Rachel E; Wellehan, James F X; Nicholls, Philip K

2012-10-01

This paper describes the isolation and molecular identification of a novel paramyxovirus found during an investigation of an outbreak of neurorespiratory disease in a collection of Australian pythons. Using Illumina® high-throughput sequencing, a 17,187 nucleotide sequence was assembled from RNA extracts from infected viper heart cells (VH2) displaying widespread cytopathic effects in the form of multinucleate giant cells. The sequence appears to contain all the coding regions of the genome, including the following predicted paramyxoviral open reading frames (ORFs): 3'--Nucleocapsid (N)--putative Phosphoprotein (P)--Matrix (M)--Fusion (F)--putative attachment protein--Polymerase (L)--5'. There is also a 540 nucleotide ORF between the N and putative P genes that may be an additional coding region. Phylogenetic analyses of the complete N, M, F and L genes support the clustering of this virus within the family Paramyxoviridae but outside both of the current subfamilies: Paramyxovirinae and Pneumovirinae. We propose to name this new virus, Sunshine virus, after the geographic origin of the first isolate--the Sunshine Coast of Queensland, Australia. Copyright © 2012 Elsevier B.V. All rights reserved.
Structural and functional studies of a family of Dictyostelium discoideum developmentally regulated, prestalk genes coding for small proteins.

PubMed

Vicente, Juan J; Galardi-Castilla, María; Escalante, Ricardo; Sastre, Leandro

2008-01-03

The social amoeba Dictyostelium discoideum executes a multicellular development program upon starvation. This morphogenetic process requires the differential regulation of a large number of genes and is coordinated by extracellular signals. The MADS-box transcription factor SrfA is required for several stages of development, including slug migration and spore terminal differentiation. Subtractive hybridization allowed the isolation of a gene, sigN (SrfA-induced gene N), that was dependent on the transcription factor SrfA for expression at the slug stage of development. Homology searches detected the existence of a large family of sigN-related genes in the Dictyostelium discoideum genome. The 13 most similar genes are grouped in two regions of chromosome 2 and have been named Group1 and Group2 sigN genes. The putative encoded proteins are 87-89 amino acids long. All these genes have a similar structure, composed of a first exon containing a 13-nucleotide-long open reading frame and a second exon comprising the remainder of the putative coding region. The expression of these genes is induced at 10 hours of development. Analyses of their promoter regions indicate that these genes are expressed in the prestalk region of developing structures. The addition of antibodies raised against SigN Group 2 proteins induced disintegration of multi-cellular structures at the mound stage of development. A large family of genes coding for small proteins has been identified in D. discoideum. Two groups of very similar genes from this family have been shown to be specifically expressed in prestalk cells during development. Functional studies using antibodies raised against Group 2 SigN proteins indicate that these genes could play a role during multicellular development.
Complete mitochondrial DNA sequence of the Eastern keelback mullet Liza affinis.

PubMed

Gong, Xiaoling; Zhu, Wenjia; Bao, Baolong

2016-05-01

Eastern keelback mullet (Liza affinis) inhabits inlet waters and estuaries of rivers. In this paper, we initially determined the complete mitochondrial genome of Liza affinis. The entire mtDNA sequence is 16,831 bp in length, including 2 rRNA genes, 22 tRNA genes, 13 protein-coding genes and 1 putative control region. Its gene order and gene number are similar to those of most bony fishes.

Cloning, characterization and sequence comparison of the gene coding for IMP dehydrogenase from Pyrococcus furiosus.

PubMed

Collart, F R; Osipiuk, J; Trent, J; Olsen, G J; Huberman, E

1996-10-03

We have cloned and characterized the gene encoding inosine monophosphate dehydrogenase (IMPDH) from Pyrococcus furiosus (Pf), a hyperthermophilic archaeon. Sequence analysis of the Pf gene indicated an open reading frame specifying a protein of 485 amino acids (aa) with a calculated M(r) of 52900. Canonical Archaea promoter elements, Box A and Box B, are located -49 and -17 nucleotides (nt), respectively, upstream of the putative start codon. The sequence of the putative active-site region conforms to the IMPDH signature motif and contains a putative active-site cysteine. Phylogenetic relationships derived by using all available IMPDH sequences are consistent with trees developed for other molecules; they do not precisely resolve the history of Pf IMPDH but indicate a close similarity to bacterial IMPDH proteins. The phylogenetic analysis indicates that a gene duplication occurred prior to the division between rodents and humans, accounting for the Type I and II isoforms identified in mice and humans.
The complete mitogenome of the whale shark parasitic copepod Pandarus rhincodonicus Norman, Newbound & Knott (Crustacea; Siphonostomatoida; Pandaridae)--a new gene order for the Copepoda.

PubMed

Austin, Christopher M; Tan, Mun Hua; Lee, Yin Peng; Croft, Laurence J; Meekan, Mark G; Pierce, Simon J; Gan, Han Ming

2016-01-01

The complete mitochondrial genome of the parasitic copepod Pandarus rhincodonicus was obtained from a partial genome scan using the HiSeq sequencing system. The Pandarus rhincodonicus mitogenome has 14,480 base pairs (62% A+T content) made up of 12 protein-coding genes, 2 ribosomal subunit genes, 22 transfer RNAs, and a putative 384 bp non-coding AT-rich region. This Pandarus mitogenome sequence is the first for the family Pandaridae, the second for the order Siphonostomatoida and the sixth for the Copepoda.
Mu-Like Prophage in Serogroup B Neisseria meningitidis Coding for Surface-Exposed Antigens

PubMed Central

Masignani, Vega; Giuliani, Marzia Monica; Tettelin, Hervé; Comanducci, Maurizio; Rappuoli, Rino; Scarlato, Vincenzo

2001-01-01

Sequence analysis of the genome of Neisseria meningitidis serogroup B revealed the presence of an ∼35-kb region inserted within a putative gene coding for an ABC-type transporter. The region contains 46 open reading frames, 29 of which are colinear and homologous to the genes of Escherichia coli Mu phage. Two prophages with similar organizations were also found in serogroup A meningococcus, and one was found in Haemophilus influenzae. Early and late phage functions are well preserved in this family of Mu-like prophages. Several regions of atypical nucleotide content were identified. These likely represent genes acquired by horizontal transfer. Three of the acquired genes are shown to code for surface-associated antigens, and the encoded proteins are able to induce bactericidal antibodies. PMID:11254622
Identification of the Operon for the Sorbitol (Glucitol) Phosphoenolpyruvate:Sugar Phosphotransferase System in Streptococcus mutans

PubMed Central

Boyd, David A.; Thevenot, Tracy; Gumbmann, Markus; Honeyman, Allen L.; Hamilton, Ian R.

2000-01-01

Transposon mutagenesis and marker rescue were used to isolate and identify an 8.5-kb contiguous region containing six open reading frames constituting the operon for the sorbitol P-enolpyruvate phosphotransferase transport system (PTS) of Streptococcus mutans LT11. The first gene, srlD, codes for sorbitol-6-phosphate dehydrogenase, followed downstream by srlR, coding for a transcriptional regulator; srlM, coding for a putative activator; and the srlA, srlE, and srlB genes, coding for the EIIC, EIIBC, and EIIA components of the sorbitol PTS, respectively. Among all sorbitol PTS operons characterized to date, the srlD gene is found after the genes coding for the EII components; thus, the location of the gene in S. mutans is unique. The SrlR protein is similar to several transcriptional regulators found in Bacillus spp. that contain PTS regulator domains (J. Stülke, M. Arnaud, G. Rapoport, and I. Martin-Verstraete, Mol. Microbiol. 28:865–874, 1998), and its gene overlaps the srlM gene by 1 bp. The arrangement of these two regulatory genes is unique, having not been reported for other bacteria. PMID:10639465
Cloning and identification of bacteriophage T4 gene 2 product gp2 and action of gp2 on infecting DNA in vivo.

PubMed Central

Lipinska, B; Rao, A S; Bolten, B M; Balakrishnan, R; Goldberg, E B

1989-01-01

We sequenced bacteriophage T4 genes 2 and 3 and the putative C-terminal portion of gene 50. They were found to have appropriate open reading frames directed counterclockwise on the T4 map. Mutations in genes 2 and 64 were shown to be in the same open reading frame, which we now call gene 2. This gene codes for a protein of 27,068 daltons. The open reading frame corresponding to gene 3 codes for a protein of 20,634 daltons. Appropriate bands on polyacrylamide gels were identified at 30 and 20 kilodaltons, respectively. We found that the product of the cloned gene 2 can protect T4 DNA double-stranded ends from exonuclease V action. PMID:2644202
Complete mitochondrial genome of Platevindex sp. (Gastropoda: Pulmonata: Systellommatophora: Onchidiidae).

PubMed

Liu, Chen; Shen, He Ding; Zhou, Na

2016-01-01

The complete mitochondrial genome sequence of Platevindex sp. is described for the first time in this article. The mitogenome (13,908 bp) contains 22 tRNA genes, 2 ribosomal RNA genes, 13 protein-coding genes, and 1 putative control region (CR). The CR is not well characterized owing to the lack of discrete conserved sequence blocks. This characteristic is similar to that of the CRs of other invertebrate mitochondrial genomes. The characteristic is the typical bivalvia mitochondrial gene composition.
Genetic and molecular characterization of a gene encoding a wide specificity purine permease of Aspergillus nidulans reveals a novel family of transporters conserved in prokaryotes and eukaryotes.

PubMed

Diallinas, G; Gorfinkiel, L; Arst, H N; Cecchetto, G; Scazzocchio, C

1995-04-14

In Aspergillus nidulans, loss-of-function mutations in the uapA and azgA genes, encoding the major uric acid-xanthine and hypoxanthine-adenine-guanine permeases, respectively, result in impaired utilization of these purines as sole nitrogen sources. The residual growth of the mutant strains is due to the activity of a broad specificity purine permease. We have identified uapC, the gene coding for this third permease through the isolation of both gain-of-function and loss-of-function mutations. Uptake studies with wild-type and mutant strains confirmed the genetic analysis and showed that the UapC protein contributes 30% and 8-10% to uric acid and hypoxanthine transport rates, respectively. The uapC gene was cloned, its expression studied, its sequence and transcript map established, and the sequence of its putative product analyzed. uapC message accumulation is: (i) weakly induced by 2-thiouric acid; (ii) repressed by ammonium; (iii) dependent on functional uaY and areA regulatory gene products (mediating uric acid induction and nitrogen metabolite repression, respectively); (iv) increased by uapC gain-of-function mutations which specifically, but partially, suppress a leucine to valine mutation in the zinc finger of the protein coded by the areA gene. The putative uapC gene product is a highly hydrophobic protein of 580 amino acids (M(r) = 61,251) including 12-14 putative transmembrane segments. The UapC protein is highly similar (58% identity) to the UapA permease and significantly similar (23-34% identity) to a number of bacterial transporters. Comparisons of the sequences and hydropathy profiles of members of this novel family of transporters yield insights into their structure, functionally important residues, and possible evolutionary relationships.
Protein and gene structure of a blue laccase from Pleurotus ostreatus.

PubMed Central

Giardina, P; Palmieri, G; Scaloni, A; Fontanella, B; Faraco, V; Cennamo, G; Sannia, G

1999-01-01

A new laccase isoenzyme (POXA1b, where POX is phenol oxidase), produced by Pleurotus ostreatus in cultures supplemented with copper sulphate, has been purified and fully characterized. The main characteristics of this protein (molecular mass in native and denaturing conditions, pI and catalytic properties) are almost identical to the previously studied laccase POXA1w. However, POXA1b contains four copper atoms per molecule instead of one copper, two zinc and one iron atom per molecule of POXA1w. Furthermore, POXA1b shows an unusually high stability at alkaline pH. The gene and cDNA coding for POXA1b have been cloned and sequenced. The gene coding sequence contains 1599 bp, interrupted by 15 introns. Comparison of the structure of the poxa1b gene with the two previously studied P. ostreatus laccase genes (pox1 and poxc) suggests that these genes belong to two different subfamilies. The amino acid sequence of POXA1b deduced from the cDNA sequence has been almost completely verified by means of matrix-assisted laser desorption ionization MS. It has been demonstrated that three out of six putative glycosylation sites are post-translationally modified and the structure of the bound glycosidic moieties has been determined, whereas two other putative glycosylation sites are unmodified. PMID:10417329
The Interactions between the Long Non-coding RNA NERDL and Its Target Gene Affect Wood Formation in Populus tomentosa

PubMed Central

Shi, Wan; Quan, Mingyang; Du, Qingzhang; Zhang, Deqiang

2017-01-01

Long non-coding RNAs (lncRNAs) are important regulatory factors for plant growth and development, but little is known about the allelic interactions of lncRNAs with mRNA in perennial plants. Here, we analyzed the interaction of the NERD (Needed for RDR2-independent DNA methylation) Populus tomentosa gene PtoNERD with its putative regulator, the lncRNA NERDL (NERD-related lncRNA), which partially overlaps with the promoter region of this gene. Expression analysis in eight tissues showed a positive correlation between NERDL and PtoNERD (r = 0.62), suggesting that the interaction of NERDL with its putative target might be involved in wood formation. We conducted association mapping in a natural population of P. tomentosa (435 unrelated individuals) to evaluate genetic variation and the interaction of the lncRNA NERDL with PtoNERD. Using additive and dominant models, we identified 30 SNPs (P < 0.01) associated with five tree growth and wood property traits. Each SNP explained 3.90–8.57% of phenotypic variance, suggesting that NERDL and its putative target play a common role in wood formation. Epistasis analysis uncovered nine SNP-SNP association pairs between NERDL and PtoNERD, with an information gain of -7.55 to 2.16%, reflecting the strong interactions between NERDL and its putative target. This analysis provides a powerful method for deciphering the genetic interactions of lncRNAs with mRNA and dissecting the complex genetic network of quantitative traits in trees. PMID:28674544
Interplay between cardiac transcription factors and non-coding RNAs in predisposing to atrial fibrillation.

PubMed

Mikhailov, Alexander T; Torrado, Mario

2018-05-12

There is growing evidence that putative gene regulatory networks including cardio-enriched transcription factors, such as PITX2, TBX5, ZFHX3, and SHOX2, and their effector/target genes along with downstream non-coding RNAs can play a potentially important role in the process of adaptive and maladaptive atrial rhythm remodeling. In turn, expression of atrial fibrillation-associated transcription factors is under the control of upstream regulatory non-coding RNAs. This review broadly explores gene regulatory mechanisms associated with susceptibility to atrial fibrillation, with key examples from both animal models and patients, within the context of both cardiac transcription factors and non-coding RNAs. These two systems appear to have multiple levels of cross-regulation and act coordinately to achieve effective control of atrial rhythm effector gene expression. Perturbations of a dynamic expression balance between transcription factors and corresponding non-coding RNAs can provoke the development or promote the progression of atrial fibrillation. We also outline deficiencies in current models and discuss ongoing studies to clarify remaining mechanistic questions. An understanding of the function of transcription factors and non-coding RNAs in gene regulatory networks associated with atrial fibrillation risk will enable the development of innovative therapeutic strategies.
Molecular Cloning, Characterization, and Differential Expression of a Glucoamylase Gene from the Basidiomycetous Fungus Lentinula edodes

PubMed Central

Zhao, J.; Chen, Y. H.; Kwan, H. S.

2000-01-01

The complete nucleotide sequence of putative glucoamylase gene gla1 from the basidiomycetous fungus Lentinula edodes strain L54 is reported. The coding region of the genomic glucoamylase sequence, which is preceded by eukaryotic promoter elements CAAT and TATA, spans 2,076 bp. The gla1 gene sequence codes for a putative polypeptide of 571 amino acids and is interrupted by seven introns. The open reading frame sequence of the gla1 gene shows strong homology with those of other fungal glucoamylase genes and encodes a protein with an N-terminal catalytic domain and a C-terminal starch-binding domain. The similarity between the Gla1 protein and other fungal glucoamylases is from 45 to 61%, with the region of highest conservation found in catalytic domains and starch-binding domains. We compared the kinetics of glucoamylase activity and levels of gene expression in L. edodes strain L54 grown on different carbon sources (glucose, starch, cellulose, and potato extract) and in various developmental stages (mycelium growth, primordium appearance, and fruiting body formation). Quantitative reverse transcription PCR utilizing pairs of primers specific for gla1 gene expression shows that expression of gla1 was induced by starch and increased during the process of fruiting body formation, which indicates that glucoamylases may play an important role in the morphogenesis of the basidiomycetous fungus. PMID:10831434
Adaptation, ecology, and evolution of the halophilic stromatolite archaeon Halococcus hamelinensis inferred through genome analyses.

PubMed

Gudhka, Reema K; Neilan, Brett A; Burns, Brendan P

2015-01-01

Halococcus hamelinensis was the first archaeon isolated from stromatolites. These geomicrobial ecosystems are thought to be some of the earliest known on Earth, yet, despite their evolutionary significance, the role of Archaea in these systems is still not well understood. Detailed here is the genome sequencing and analysis of an archaeon isolated from stromatolites. The genome of H. hamelinensis consisted of 3,133,046 base pairs with an average G+C content of 60.08% and contained 3,150 predicted coding sequences or ORFs, 2,196 (68.67%) of which were protein-coding genes with functional assignments and 954 (29.83%) of which were of unknown function. Codon usage of the H. hamelinensis genome was consistent with a highly acidic proteome, a major adaptive mechanism towards high salinity. Amino acid transport and metabolism, inorganic ion transport and metabolism, energy production and conversion, ribosomal structure, and unknown function COG genes were overrepresented. The genome of H. hamelinensis also revealed characteristics reflecting its survival in its extreme environment, including putative genes/pathways involved in osmoprotection, oxidative stress response, and UV damage repair. Finally, genome analyses indicated the presence of putative transposases as well as positive matches of genes of H. hamelinensis against various genomes of Bacteria, Archaea, and viruses, suggesting the potential for horizontal gene transfer.
From Genomes to Protein Models and Back

NASA Astrophysics Data System (ADS)

Tramontano, Anna; Giorgetti, Alejandro; Orsini, Massimiliano; Raimondo, Domenico

2007-12-01

The alternative splicing mechanism allows genes to generate more than one product. When the splicing events occur within protein coding regions they can modify the biological function of the protein. Alternative splicing has been suggested as one way of explaining the discrepancy between the number of human genes and functional complexity. We analysed the putative structure of the alternatively spliced gene products annotated in the ENCODE pilot project and discovered that many of the potential alternative gene products are unlikely to produce stable functional proteins.
De Novo ORFs in Drosophila Are Important to Organismal Fitness and Evolved Rapidly from Previously Non-coding Sequences

PubMed Central

Reinhardt, Josephine A.; Wanjiru, Betty M.; Brant, Alicia T.; Saelao, Perot; Begun, David J.; Jones, Corbin D.

2013-01-01

How non-coding DNA gives rise to new protein-coding genes (de novo genes) is not well understood. Recent work has revealed the origins and functions of a few de novo genes, but common principles governing the evolution or biological roles of these genes are unknown. To better define these principles, we performed a parallel analysis of the evolution and function of six putatively protein-coding de novo genes described in Drosophila melanogaster. Reconstruction of the transcriptional history of de novo genes shows that two de novo genes emerged from novel long non-coding RNAs that arose at least 5 MY prior to evolution of an open reading frame. In contrast, four other de novo genes evolved a translated open reading frame and transcription within the same evolutionary interval suggesting that nascent open reading frames (proto-ORFs), while not required, can contribute to the emergence of a new de novo gene. However, none of the genes arose from proto-ORFs that existed long before expression evolved. Sequence and structural evolution of de novo genes was rapid compared to nearby genes and the structural complexity of de novo genes steadily increases over evolutionary time. Despite the fact that these genes are transcribed at a higher level in males than females, and are most strongly expressed in testes, RNAi experiments show that most of these genes are essential in both sexes during metamorphosis. This lethality suggests that protein coding de novo genes in Drosophila quickly become functionally important. PMID:24146629
Network analysis of S. aureus response to ramoplanin reveals modules for virulence factors and resistance mechanisms and characteristic novel genes.

PubMed

Subramanian, Devika; Natarajan, Jeyakumar

2015-12-10

Staphylococcus aureus is a major human pathogen, and ramoplanin is an antimicrobial credited with effective treatment. The goal of this study was to examine the transcriptomic profiles of ramoplanin-sensitive and ramoplanin-resistant S. aureus to identify putative modules responsible for virulence and resistance mechanisms and their characteristic novel genes. The dysregulated genes were used to reconstruct protein functional association networks for virulence factors and resistance mechanisms individually. A strong link between metabolic pathways and the development of virulence/resistance is suggested. We identified 15 putative modules of virulence factors. Six hypothetical genes were annotated with novel virulence activity, among which SACOL0281 was discovered to be an essential virulence factor, EsaD. The roles of the MazEF toxin-antitoxin system, the SACOL0202/SACOL0201 two-component system and amino-sugar and nucleotide-sugar metabolism in virulence are also suggested. In addition, 14 putative modules of resistance mechanisms, including modules of ribosomal protein-coding genes and metabolic pathways such as biotin synthesis, the TCA cycle, riboflavin biosynthesis and peptidoglycan biosynthesis, are also indicated. Copyright © 2015 Elsevier B.V. All rights reserved.
A resource for characterizing genome-wide binding and putative target genes of transcription factors expressed during secondary growth and wood formation in Populus.

PubMed

Liu, Lijun; Ramsay, Trevor; Zinkgraf, Matthew; Sundell, David; Street, Nathaniel Robert; Filkov, Vladimir; Groover, Andrew

2015-06-01

Identifying transcription factor target genes is essential for modeling the transcriptional networks underlying developmental processes. Here we report a chromatin immunoprecipitation sequencing (ChIP-seq) resource consisting of genome-wide binding regions and associated putative target genes for four Populus homeodomain transcription factors expressed during secondary growth and wood formation. Software code (programs and scripts) for processing the Populus ChIP-seq data is provided within a publicly available iPlant image, including tools for ChIP-seq data quality control and evaluation adapted from the human Encyclopedia of DNA Elements (ENCODE) project. Basic information on the binding of each transcription factor (including members of Class I KNOX, Class III HD ZIP, BEL1-like families) is summarized, including the number and location of binding regions, distribution of binding regions relative to gene features, associated putative target genes, and enriched functional categories of putative target genes. These ChIP-seq data have been integrated within the Populus Genome Integrative Explorer (PopGenIE) where they can be analyzed using a variety of web-based tools. We present an example analysis that shows preferential binding of transcription factor ARBORKNOX1 to the nearest neighbor genes in a pre-calculated co-expression network module, and enrichment for meristem-related genes within this module including multiple orthologs of Arabidopsis KNOTTED-like Arabidopsis 2/6. © 2015 Society for Experimental Biology and John Wiley & Sons Ltd This article has been contributed to by US Government employees and their work is in the public domain in the USA.
Delineation of the Caffeine C-8 Oxidation Pathway in Pseudomonas sp. Strain CBB1 via Characterization of a New Trimethyluric Acid Monooxygenase and Genes Involved in Trimethyluric Acid Metabolism

PubMed Central

Mohanty, Sujit Kumar; Yu, Chi-Li; Das, Shuvendu; Louie, Tai Man; Gakhar, Lokesh

2012-01-01

The molecular basis of the ability of bacteria to live on caffeine via the C-8 oxidation pathway is unknown. The first step of this pathway, caffeine to trimethyluric acid (TMU), has been attributed to poorly characterized caffeine oxidases and a novel quinone-dependent caffeine dehydrogenase. Here, we report the detailed characterization of the second enzyme, a novel NADH-dependent trimethyluric acid monooxygenase (TmuM), a flavoprotein that catalyzes the conversion of TMU to 1,3,7-trimethyl-5-hydroxyisourate (TM-HIU). This product spontaneously decomposes to racemic 3,6,8-trimethylallantoin (TMA). TmuM prefers trimethyluric acids and, to a lesser extent, dimethyluric acids as substrates, but it exhibits no activity on uric acid. Homology models of TmuM against uric acid oxidase HpxO (which catalyzes uric acid to 5-hydroxyisourate) reveal a much bigger and hydrophobic cavity to accommodate the larger substrates. Genes involved in the caffeine C-8 oxidation pathway are located in a 25.2-kb genomic DNA fragment of CBB1, including cdhABC (coding for caffeine dehydrogenase) and tmuM (coding for TmuM). Comparison of this gene cluster to the uric acid-metabolizing gene cluster and pathway of Klebsiella pneumoniae revealed two major open reading frames coding for the conversion of TM-HIU to S-(+)-trimethylallantoin [S-(+)-TMA]. The first one, designated tmuH, codes for a putative TM-HIU hydrolase, which catalyzes the conversion of TM-HIU to 3,6,8-trimethyl-2-oxo-4-hydroxy-4-carboxy-5-ureidoimidazoline (TM-OHCU). The second one, designated tmuD, codes for a putative TM-OHCU decarboxylase which catalyzes the conversion of TM-OHCU to S-(+)-TMA. Based on a combination of enzymology and gene-analysis, a new degradative pathway for caffeine has been proposed via TMU, TM-HIU, TM-OHCU to S-(+)-TMA. PMID:22609920
DOE Office of Scientific and Technical Information (OSTI.GOV)

Villard, L.; Lossi, A.M.; Fontes, M.

We have previously reported the isolation of a gene from Xq13 that codes for a putative regulator of transcription (XNP) and has now been shown to be the gene involved in the X-linked α-thalassemia with mental retardation (ATR-X) syndrome. The widespread expression and numerous domains present in the putative protein suggest that this gene could be involved in other phenotypes. The predominant expression of the gene in the developing brain, as well as its association with neuron differentiation, indicates that mutations of this gene might result in a mental retardation (MR) phenotype. In this paper we present a family with a splice junction mutation in XNP that results in the skipping of an exon and in the introduction of a stop codon in the middle of the XNP-coding sequence. Only the abnormal transcript is expressed in two first cousins presenting the classic ATR-X phenotype (with α-thalassemia and HbH inclusions). In a distant cousin presenting a similar dysmorphic MR phenotype but not having thalassemia, approximately 30% of the XNP transcripts are normal. These data demonstrate that the mode of action of the XNP gene product on globin expression is distinct from its mode of action in brain development and facial morphogenesis and suggest that other dysmorphic mental retardation phenotypes, such as Juberg-Marsidi or some sporadic cases of Coffin-Lowry, could be due to mutations in XNP. 20 refs., 5 figs., 2 tabs.
An operon from Lactobacillus helveticus composed of a proline iminopeptidase gene (pepI) and two genes coding for putative members of the ABC transporter family of proteins.

PubMed

Varmanen, P; Rantanen, T; Palva, A

1996-12-01

A proline iminopeptidase gene (pepI) of an industrial Lactobacillus helveticus strain was cloned and found to be organized in an operon-like structure of three open reading frames (ORF1, ORF2 and ORF3). ORF1 was preceded by a typical prokaryotic promoter region, and a putative transcription terminator was found downstream of ORF3, identified as the pepI gene. Using primer-extension analyses, only one transcription start site, upstream of ORF1, was identifiable in the predicted operon. Although the size of mRNA could not be judged by Northern analysis either with ORF1-, ORF2- or pepI-specific probes, reverse transcription-PCR analyses further supported the operon structure of the three genes. ORF1, ORF2 and ORF3 had coding capacities for 50.7, 24.5 and 33.8 kDa proteins, respectively. The ORF3-encoded PepI protein showed 65% identity with the PepI proteins from Lactobacillus delbrueckii subsp. bulgaricus and Lactobacillus delbrueckii subsp. lactis. The ORF1-encoded protein had significant homology with several members of the ABC transporter family but, with two distinct putative ATP-binding sites, it would represent an unusual type among the bacterial ABC transporters. ORF2 encoded a putative integral membrane protein also characteristic of the ABC transporter family. The pepI gene was overexpressed in Escherichia coli. Purified PepI hydrolysed only di- and tripeptides with proline in the first position. Optimum PepI activity was observed at pH 7.5 and 40 °C. A gel filtration analysis indicated that PepI is a dimer of M(r) 53,000. PepI was shown to be a metal-independent serine peptidase having thiol groups at or near the active site. Kinetic studies with proline-p-nitroanilide as substrate revealed Km and Vmax values of 0.8 mM and 350 mmol min⁻¹ mg⁻¹, respectively, and a very high turnover number of 135,000 s⁻¹.
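As an illustrative aside, the reported Km and Vmax can be read through the standard Michaelis-Menten rate law, v = Vmax·[S] / (Km + [S]). The sketch below assumes that PepI follows this simple law, which the abstract does not state explicitly; the function name `michaelis_menten_rate` is hypothetical, and the units are those given in the abstract.

```python
def michaelis_menten_rate(substrate_mM, km_mM=0.8, vmax=350.0):
    """Initial rate under the Michaelis-Menten rate law.

    Defaults are the Km (mM) and Vmax (mmol min^-1 mg^-1) reported in the
    abstract; applying this rate law to PepI is an assumption.
    """
    if substrate_mM < 0:
        raise ValueError("substrate concentration must be non-negative")
    return vmax * substrate_mM / (km_mM + substrate_mM)


# At [S] = Km the rate is half of Vmax by definition of Km.
half_max = michaelis_menten_rate(0.8)
```

By construction, the rate at a substrate concentration equal to Km is half of Vmax, and the rate approaches Vmax as the substrate concentration grows.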
Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.

PubMed

Mayer, K; Schüller, C; Wambutt, R; Murphy, G; Volckaert, G; Pohl, T; Düsterhöft, A; Stiekema, W; Entian, K D; Terryn, N; Harris, B; Ansorge, W; Brandt, P; Grivell, L; Rieger, M; Weichselgartner, M; de Simone, V; Obermaier, B; Mache, R; Müller, M; Kreis, M; Delseny, M; Puigdomenech, P; Watson, M; Schmidtheini, T; Reichert, B; Portatelle, D; Perez-Alonso, M; Boutry, M; Bancroft, I; Vos, P; Hoheisel, J; Zimmermann, W; Wedler, H; Ridley, P; Langham, S A; McCullagh, B; Bilham, L; Robben, J; Van der Schueren, J; Grymonprez, B; Chuang, Y J; Vandenbussche, F; Braeken, M; Weltjens, I; Voet, M; Bastiaens, I; Aert, R; Defoor, E; Weitzenegger, T; Bothe, G; Ramsperger, U; Hilbert, H; Braun, M; Holzer, E; Brandt, A; Peters, S; van Staveren, M; Dirske, W; Mooijman, P; Klein Lankhorst, R; Rose, M; Hauf, J; Kötter, P; Berneiser, S; Hempel, S; Feldpausch, M; Lamberth, S; Van den Daele, H; De Keyser, A; Buysshaert, C; Gielen, J; Villarroel, R; De Clercq, R; Van Montagu, M; Rogers, J; Cronin, A; Quail, M; Bray-Allen, S; Clark, L; Doggett, J; Hall, S; Kay, M; Lennard, N; McLay, K; Mayes, R; Pettett, A; Rajandream, M A; Lyne, M; Benes, V; Rechmann, S; Borkova, D; Blöcker, H; Scharfe, M; Grimm, M; Löhnert, T H; Dose, S; de Haan, M; Maarse, A; Schäfer, M; Müller-Auer, S; Gabel, C; Fuchs, M; Fartmann, B; Granderath, K; Dauner, D; Herzl, A; Neumann, S; Argiriou, A; Vitale, D; Liguori, R; Piravandi, E; Massenet, O; Quigley, F; Clabauld, G; Mündlein, A; Felber, R; Schnabl, S; Hiller, R; Schmidt, W; Lecharny, A; Aubourg, S; Chefdor, F; Cooke, R; Berger, C; Montfort, A; Casacuberta, E; Gibbons, T; Weber, N; Vandenbol, M; Bargues, M; Terol, J; Torres, A; Perez-Perez, A; Purnelle, B; Bent, E; Johnson, S; Tacon, D; Jesse, T; Heijnen, L; Schwarz, S; Scholler, P; Heber, S; Francs, P; Bielke, C; Frishman, D; Haase, D; Lemcke, K; Mewes, H W; Stocker, S; Zaccaria, P; Bevan, M; Wilson, R K; de la Bastide, M; Habermann, K; Parnell, L; Dedhia, N; Gnoj, L; Schutz, K; Huang, E; Spiegel, L; Sehkon, M; 
Murray, J; Sheet, P; Cordes, M; Abu-Threideh, J; Stoneking, T; Kalicki, J; Graves, T; Harmon, G; Edwards, J; Latreille, P; Courtney, L; Cloud, J; Abbott, A; Scott, K; Johnson, D; Minx, P; Bentley, D; Fulton, B; Miller, N; Greco, T; Kemp, K; Kramer, J; Fulton, L; Mardis, E; Dante, M; Pepin, K; Hillier, L; Nelson, J; Spieth, J; Ryan, E; Andrews, S; Geisel, C; Layman, D; Du, H; Ali, J; Berghoff, A; Jones, K; Drone, K; Cotton, M; Joshu, C; Antonoiu, B; Zidanic, M; Strong, C; Sun, H; Lamar, B; Yordan, C; Ma, P; Zhong, J; Preston, R; Vil, D; Shekher, M; Matero, A; Shah, R; Swaby, I K; O'Shaughnessy, A; Rodriguez, M; Hoffmann, J; Till, S; Granat, S; Shohdy, N; Hasegawa, A; Hameed, A; Lodhi, M; Johnson, A; Chen, E; Marra, M; Martienssen, R; McCombie, W R

1999-12-16

The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.

Using a Euclid distance discriminant method to find protein coding genes in the yeast genome.

PubMed

Zhang, Chun-Ting; Wang, Ju; Zhang, Ren

2002-02-01

The Euclid distance discriminant method is used to find protein coding genes in the yeast genome, based on the single nucleotide frequencies at three codon positions in the ORFs. The method is extremely simple and may be extended to find genes in prokaryotic genomes or eukaryotic genomes with fewer introns. Six-fold cross-validation tests have demonstrated that the accuracy of the algorithm is better than 93%. Based on this, it is found that the total number of protein coding genes in the yeast genome is at most 5579, about 3.8-7.0% fewer than the currently widely accepted 5800-6000. The base compositions at three codon positions are analyzed in detail using a graphic method. The result shows that the preferred codons adopted by yeast genes are of the RGW type, where R, G and W indicate the bases of purine, non-G and A/T, whereas the 'codons' in the intergenic sequences are of the form NNN, where N denotes any base. This fact constitutes the basis of the algorithm to distinguish between coding and non-coding ORFs in the yeast genome. The names of putative non-coding ORFs are listed here in detail.
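The idea of discriminating ORFs by the single nucleotide frequencies at the three codon positions can be illustrated with a minimal sketch. This is not the authors' implementation: the nearest-centroid decision rule, the function names, and the toy training sequences are all assumptions made for illustration.

```python
from collections import Counter
from math import dist  # Euclidean distance, Python 3.8+

BASES = "ACGT"

def codon_position_freqs(orf):
    """Return 12 numbers: frequency of A, C, G, T at codon positions 1, 2 and 3."""
    vec = []
    for pos in range(3):
        column = orf[pos::3]           # every base sitting at this codon position
        counts = Counter(column)
        total = len(column) or 1       # avoid division by zero on very short input
        vec.extend(counts[b] / total for b in BASES)
    return vec

def centroid(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def classify(orf, coding_center, noncoding_center):
    """Assign the ORF to whichever class centroid is closer (assumed rule)."""
    v = codon_position_freqs(orf)
    if dist(v, coding_center) < dist(v, noncoding_center):
        return "coding"
    return "non-coding"

# Hypothetical toy training sets: RGW-like codons vs. position-independent sequence.
coding_train = ["GAAGCTACTAAA", "ACAGATGCAATT"]
noncoding_train = ["CGTACGTACGTA", "TGCATGCATGCA"]
coding_center = centroid([codon_position_freqs(s) for s in coding_train])
noncoding_center = centroid([codon_position_freqs(s) for s in noncoding_train])

print(classify("GCTAAAGATACA", coding_center, noncoding_center))
```

In practice the centroids would be estimated from large sets of known coding and non-coding ORFs, and accuracy would be checked by cross-validation as the abstract describes.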
Promoter analysis reveals globally differential regulation of human long non-coding RNA and protein-coding genes

DOE PAGES

Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui; ...

2014-10-02

Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptional regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.
Promoter analysis reveals globally differential regulation of human long non-coding RNA and protein-coding genes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui

Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptional regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.
Discovery of numerous novel small genes in the intergenic regions of the Escherichia coli O157:H7 Sakai genome

PubMed Central

Hücker, Sarah M.; Ardern, Zachary; Goldberg, Tatyana; Schafferhans, Andrea; Bernhofer, Michael; Vestergaard, Gisle; Nelson, Chase W.; Schloter, Michael; Rost, Burkhard; Scherer, Siegfried

2017-01-01

In the past, short protein-coding genes were often disregarded by genome annotation pipelines. Transcriptome sequencing (RNAseq) signals outside of annotated genes have usually been interpreted to indicate either ncRNA or pervasive transcription. Therefore, in addition to the transcriptome, the translatome (RIBOseq) of the enteric pathogen Escherichia coli O157:H7 strain Sakai was determined at two optimal growth conditions and a severe stress condition combining low temperature and high osmotic pressure. All intergenic open reading frames potentially encoding a protein of ≥ 30 amino acids were investigated with regard to coverage by transcription and translation signals and their translatability expressed by the ribosomal coverage value. This led to discovery of 465 unique, putative novel genes not yet annotated in this E. coli strain, which are evenly distributed over both DNA strands of the genome. For 255 of the novel genes, annotated homologs in other bacteria were found, and a machine-learning algorithm, trained on small protein-coding E. coli genes, predicted that 89% of these translated open reading frames represent bona fide genes. The remaining 210 putative novel genes without annotated homologs were compared to the 255 novel genes with homologs and to 250 short annotated genes of this E. coli strain. All three groups turned out to be similar with respect to their translatability distribution, fractions of differentially regulated genes, secondary structure composition, and the distribution of evolutionary constraint, suggesting that both novel groups represent legitimate genes. However, the machine-learning algorithm only recognized a small fraction of the 210 genes without annotated homologs. It is possible that these genes represent a novel group of genes, which have unusual features dissimilar to the genes of the machine-learning algorithm training set. PMID:28902868
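The first step described above, enumerating open reading frames that could encode a protein of at least 30 amino acids, can be sketched as follows. This is an illustrative simplification, not the study's pipeline: it accepts only ATG as a start codon (bacteria also use alternatives such as GTG and TTG), scans a single linear sequence, and the function names are hypothetical.

```python
STOPS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def find_orfs(seq, min_aa=30):
    """Return (strand, start, end, n_aa) for each ATG...stop ORF of >= min_aa codons.

    Coordinates are 0-based, half-open, and relative to the scanned strand;
    n_aa counts the start codon but not the stop codon.
    """
    results = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i                      # first in-frame start codon
                elif start is not None and codon in STOPS:
                    n_aa = (i - start) // 3
                    if n_aa >= min_aa:
                        results.append((strand, start, i + 3, n_aa))
                    start = None                   # look for the next ORF in this frame
    return results

# Hypothetical 30-codon ORF followed by a stop codon.
toy = "ATG" + "GCT" * 29 + "TAA"
print(find_orfs(toy))
```

Candidate ORFs found this way would then be filtered by their RNAseq and RIBOseq coverage, which is the part of the analysis that requires the experimental data.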
Multiplexed pyrosequencing of nine sea anemone (Cnidaria: Anthozoa: Hexacorallia: Actiniaria) mitochondrial genomes.

PubMed

Foox, Jonathan; Brugler, Mercer; Siddall, Mark Edward; Rodríguez, Estefanía

2016-07-01

Six complete and three partial actiniarian mitochondrial genomes were amplified in two semi-circles using long-range PCR and pyrosequenced in a single run on a 454 GS Junior, doubling the number of complete mitogenomes available within the order. Typical metazoan mtDNA features included circularity, 13 protein-coding genes, 2 ribosomal RNA genes, and length ranging from 17,498 to 19,727 bp. Several typical anthozoan mitochondrial genome features were also observed including the presence of only two transfer RNA genes, elevated A + T richness ranging from 54.9 to 62.4%, large intergenic regions, and group 1 introns interrupting NADH dehydrogenase subunit 5 and cytochrome c oxidase subunit I, the latter of which possesses a homing endonuclease gene. Within the sea anemone Alicia sansibarensis, we report the first mitochondrial gene order rearrangement within the Actiniaria, as well as putative novel non-canonical protein-coding genes. Phylogenetic analyses of all 13 protein-coding and 2 ribosomal genes largely corroborated current hypotheses of sea anemone interrelatedness, with a few lower-level differences.
Molecular cloning of chitinase 33 (chit33) gene from Trichoderma atroviride

PubMed Central

Matroudi, S.; Zamani, M.R.; Motallebi, M.

2008-01-01

In this study Trichoderma atroviride was selected as an overproducer of chitinase among 30 different isolates of Trichoderma sp. on the basis of chitinase specific activity. From this isolate the genomic and cDNA clones encoding chit33 were isolated and sequenced. Comparison of the genomic and cDNA sequences to define the gene structure indicates that this gene contains three short introns and an open reading frame coding for a protein of 321 amino acids. The deduced amino acid sequence includes a 19 aa putative signal peptide. Homology between this sequence and other reported Trichoderma Chit33 proteins is discussed. The coding sequence of the chit33 gene was cloned in the pEt26b(+) expression vector and expressed in E. coli. PMID:24031242
Identification of Putative Nuclear Receptors and Steroidogenic Enzymes in Murray-Darling Rainbowfish (Melanotaenia fluviatilis) Using RNA-Seq and De Novo Transcriptome Assembly.

PubMed

Bain, Peter A; Papanicolaou, Alexie; Kumar, Anupama

2015-01-01

Murray-Darling rainbowfish (Melanotaenia fluviatilis [Castelnau, 1878]; Atheriniformes: Melanotaeniidae) is a small-bodied teleost currently under development in Australasia as a test species for aquatic toxicological studies. To date, efforts towards the development of molecular biomarkers of contaminant exposure have been hindered by the lack of available sequence data. To address this, we sequenced messenger RNA from brain, liver and gonads of mature male and female fish and generated a high-quality draft transcriptome using a de novo assembly approach. 149,742 clusters of putative transcripts were obtained, encompassing 43,841 non-redundant protein-coding regions. Deduced amino acid sequences were annotated by functional inference based on similarity with sequences from manually curated protein sequence databases. The draft assembly contained protein-coding regions homologous to 95.7% of the complete cohort of predicted proteins from the taxonomically related species, Oryzias latipes (Japanese medaka). The mean length of rainbowfish protein-coding sequences relative to their medaka homologues was 92.1%, indicating that despite the limited number of tissues sampled a large proportion of the total expected number of protein-coding genes was captured in the study. Because of our interest in the effects of environmental contaminants on endocrine pathways, we manually curated subsets of coding regions for putative nuclear receptors and steroidogenic enzymes in the rainbowfish transcriptome, revealing 61 candidate nuclear receptors encompassing all known subfamilies, and 41 putative steroidogenic enzymes representing all major steroidogenic enzymes occurring in teleosts. The transcriptome presented here will be a valuable resource for researchers interested in biomarker development, protein structure and function, and contaminant-response genomics in Murray-Darling rainbowfish.
[Detection of putative polysaccharide biosynthesis genes in Azospirillum brasilense strains from serogroups I and II].

PubMed

Petrova, L P; Prilipov, A G; Katsy, E I

2017-01-01

It is known that in Azospirillum brasilense strains Sp245 and SR75 included in serogroup I, the repeat units of their O-polysaccharides consist of five residues of D-rhamnose, and in strain SR15, of four; and the heteropolymeric O-polysaccharide of A. brasilense type strain Sp7 from serogroup II contains not less than five types of repeat units. In the present work, a complex of nondegenerate primers to the genes of A. brasilense Sp245 plasmids AZOBR_p6, AZOBR_p3, and AZOBR_p2, which encode putative enzymes for the biosynthesis of core oligosaccharide and O-polysaccharide of lipopolysaccharide, capsular polysaccharides, and exopolysaccharides, was proposed. By using the designed primers, products of the expected sizes were synthesized in polymerase chain reactions on genomic DNA of A. brasilense Sp245, SR75, SR15, and Sp7 in 36, 29, 23, and 12 cases, respectively. As a result of sequencing of a number of amplicons, a high (86–99%) level of identity of the corresponding putative polysaccharide biosynthesis genes in three A. brasilense strains from serogroup I was detected. In a blotting-hybridization reaction with the biotin-labeled DNA of the A. brasilense gene AZOBR_p60122 coding for putative permease of the ABC transporter of polysaccharides, localization of the homologous gene in ~120-MDa plasmids of the bacteria A. brasilense SR15 and SR75 was revealed.
Complete genome sequence of lymphocystis disease virus isolated from China.

PubMed

Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang

2004-07-01

Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. 
The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 do not belong to the same species and that LCDV-C should be considered a distinct species.
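Summary statistics such as the G+C base composition and the percent coding density quoted above are simple to compute once a sequence and gene coordinates are available. The sketch below shows one way to do it; the function names and the interval convention are assumptions for illustration, and the abstract does not say exactly how its coding density was calculated.

```python
def gc_percent(seq):
    """Percentage of G and C bases in a DNA string."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)

def coding_density(genome_length, gene_intervals):
    """Percentage of the genome covered by at least one gene.

    gene_intervals holds (start, end) pairs, 0-based and half-open.
    Overlapping genes are counted once.
    """
    covered = 0
    last_end = 0
    for start, end in sorted(gene_intervals):
        start = max(start, last_end)   # skip the part already counted
        if end > start:
            covered += end - start
            last_end = end
    return 100.0 * covered / genome_length

print(gc_percent("ATGC"))
print(coding_density(100, [(0, 30), (20, 50), (70, 80)]))
```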
Complete Genome Sequence of Lymphocystis Disease Virus Isolated from China

PubMed Central

Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang

2004-01-01

Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. 
The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 do not belong to the same species and that LCDV-C should be considered a distinct species. PMID:15194775
Genome sequence of Plasmopara viticola and insight into the pathogenic mechanism

PubMed Central

Yin, Ling; An, Yunhe; Qu, Junjie; Li, Xinlong; Zhang, Yali; Dry, Ian; Wu, Huijuan; Lu, Jiang

2017-01-01

Plasmopara viticola causes downy mildew disease of grapevine, which is one of the most devastating diseases of viticulture worldwide. Here we report a 101.3 Mb whole genome sequence of P. viticola isolate ‘JL-7-2’ obtained by a combination of Illumina and PacBio sequencing technologies. The P. viticola genome contains 17,014 putative protein-coding genes and has ~26% repetitive sequences. A total of 1,301 putative secreted proteins, including 100 putative RXLR effectors and 90 CRN effectors, were identified in this genome. In the secretome, 261 potential pathogenicity genes and 95 carbohydrate-active enzymes were predicted. Transcriptional analysis revealed that most of the RXLR effectors, pathogenicity genes and carbohydrate-active enzymes were significantly up-regulated during infection. Comparative genomic analysis revealed that P. viticola evolved independently from the Arabidopsis downy mildew pathogen Hyaloperonospora arabidopsidis. The availability of the P. viticola genome not only provides a valuable resource for comparative genomic analysis and evolutionary studies among oomycetes, but also enhances our knowledge of the mechanism of interactions between this biotrophic pathogen and its host. PMID:28417959
Deciphering the Genome Sequences of the Hydrophobic Cyanobacterium Scytonema tolypothrichoides VB-61278

PubMed Central

Das, Abhishek; Panda, Arijit; Singh, Deeksha; Chandrababunaidu, Mathu Malar; Mishra, Gyan Prakash; Bhan, Sushma

2015-01-01

Scytonema tolypothrichoides VB-61278, a terrestrial cyanobacterium, can be exploited to produce commercially important products. Here, we report for the first time a 10-Mb draft genome assembly of S. tolypothrichoides VB-61278, with 214 scaffolds and 7,148 putative protein-coding genes. PMID:25838486
The Glucuronic Acid Utilization Gene Cluster from Bacillus stearothermophilus T-6

PubMed Central

Shulami, Smadar; Gat, Orit; Sonenshein, Abraham L.; Shoham, Yuval

1999-01-01

A λ-EMBL3 genomic library of Bacillus stearothermophilus T-6 was screened for hemicellulolytic activities, and five independent clones exhibiting β-xylosidase activity were isolated. The clones overlap each other and together represent a 23.5-kb chromosomal segment. The segment contains a cluster of xylan utilization genes, which are organized in at least three transcriptional units. These include the gene for the extracellular xylanase, xylanase T-6; part of an operon coding for an intracellular xylanase and a β-xylosidase; and a putative 15.5-kb-long transcriptional unit, consisting of 12 genes involved in the utilization of α-d-glucuronic acid (GlcUA). The first four genes in the potential GlcUA operon (orf1, -2, -3, and -4) code for a putative sugar transport system with characteristic components of the binding-protein-dependent transport systems. The most likely natural substrate for this transport system is aldotetraouronic acid [2-O-α-(4-O-methyl-α-d-glucuronosyl)-xylotriose] (MeGlcUAXyl3). The following two genes code for an intracellular α-glucuronidase (aguA) and a β-xylosidase (xynB). Five more genes (kdgK, kdgA, uxaC, uxuA, and uxuB) encode proteins that are homologous to enzymes involved in galacturonate and glucuronate catabolism. The gene cluster also includes a potential regulatory gene, uxuR, the product of which resembles repressors of the GntR family. The apparent transcriptional start point of the cluster was determined by primer extension analysis and is located 349 bp from the initial ATG codon. The potential operator site is a perfect 12-bp inverted repeat located downstream from the promoter between nucleotides +170 and +181. Gel retardation assays indicated that UxuR binds specifically to this sequence and that this binding is efficiently prevented in vitro by MeGlcUAXyl3, the most likely molecular inducer. PMID:10368143
Prevalence of transcription promoters within archaeal operons and coding sequences.

PubMed

Koide, Tie; Reiss, David J; Bare, J Christopher; Pang, Wyming Lee; Facciotti, Marc T; Schmid, Amy K; Pan, Min; Marzolf, Bruz; Van, Phu T; Lo, Fang-Yin; Pratap, Abhishek; Deutsch, Eric W; Peterson, Amelia; Martin, Dan; Baliga, Nitin S

2009-01-01

Despite the knowledge of complex prokaryotic-transcription mechanisms, generalized rules, such as the simplified organization of genes into operons with well-defined promoters and terminators, have had a significant role in systems analysis of regulatory logic in both bacteria and archaea. Here, we have investigated the prevalence of alternate regulatory mechanisms through genome-wide characterization of transcript structures of approximately 64% of all genes, including putative non-coding RNAs in Halobacterium salinarum NRC-1. Our integrative analysis of transcriptome dynamics and protein-DNA interaction data sets showed widespread environment-dependent modulation of operon architectures, transcription initiation and termination inside coding sequences, and extensive overlap in 3' ends of transcripts for many convergently transcribed genes. A significant fraction of these alternate transcriptional events correlate to binding locations of 11 transcription factors and regulators (TFs) inside operons and annotated genes, events usually considered spurious or non-functional. Using experimental validation, we illustrate the prevalence of overlapping genomic signals in archaeal transcription, casting doubt on the general perception of rigid boundaries between coding sequences and regulatory elements.
Long non-coding RNAs are associated with spatiotemporal gene expression profiles in the marine gastropod Tegula atra.

PubMed

Détrée, Camille; Núñez-Acuña, Gustavo; Tapia, Fabian; Gallardo-Escárate, Cristian

2017-06-01

Increasing evidence suggests that long non-coding RNAs (lncRNAs) play diverse roles in cellular processes, including in the regulation of embryogenesis and growth. However, little is known about the role of lncRNAs in marine invertebrates inhabiting changing environments. Therefore, the aim of this study was to present the first characterization of lncRNAs in an intertidal marine gastropod. Specifically, Tegula atra individuals were sampled in four sites of the central-northern Chilean coastline (28-31°) during summer and winter. A pipeline was constructed, and 3524 putative lncRNAs were identified from transcriptome databases specific to T. atra. These lncRNAs exhibited characteristics common to known lncRNAs, including a length shorter than coding sequences, low GC-content, and low sequence conservation. Expression analyses revealed that lncRNAs varied more in the summer. Furthermore, a majority of the differentially expressed lncRNAs were found in the southernmost population, the seasonal temperatures of which varied the greatest among all groups. Additionally, co-expression analysis found some lncRNAs strongly correlated with coding genes involved in the environmental stress response, such as heat shock proteins and metalloproteins. In contrast, other lncRNA expressions were strongly uncorrelated with genes involved in lipid/carbohydrates metabolism and cell-cell communication. This study provides the first large-scale characterization of lncRNAs in a marine gastropod, with results suggesting a putative role of lncRNAs in thermal tolerance, as well as an association with molecular mechanisms involved in the local adaptations of marine invertebrate populations. Copyright © 2017 Elsevier B.V. All rights reserved.
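A co-expression analysis of the kind mentioned above can be sketched as a pairwise correlation between lncRNA and coding-gene expression profiles across samples. The abstract does not state which correlation measure or cutoff was used, so the Pearson coefficient, the 0.9 threshold, and all names and values below are assumptions for illustration only.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def coexpressed_pairs(lnc_expr, gene_expr, threshold=0.9):
    """Return (lncRNA, gene, r) for every pair with |r| >= threshold."""
    pairs = []
    for lnc, x in lnc_expr.items():
        for gene, y in gene_expr.items():
            r = pearson(x, y)
            if abs(r) >= threshold:
                pairs.append((lnc, gene, round(r, 3)))
    return pairs

# Hypothetical expression values across four samples.
lnc_expr = {"lnc_1": [1, 2, 3, 4]}
gene_expr = {"hsp_like": [2, 4, 6, 8], "other_gene": [5, 1, 4, 2]}
print(coexpressed_pairs(lnc_expr, gene_expr))
```

Using the absolute value of r keeps both positively and negatively correlated pairs; a real analysis would also correct for multiple testing across the many pairs compared.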
Mutant phenotypes for thousands of bacterial genes of unknown function

DOE PAGES

Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan; ...

2018-05-16

One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because they are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.
Mutant phenotypes for thousands of bacterial genes of unknown function

DOE Office of Scientific and Technical Information (OSTI.GOV)

Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan

One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because they are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.
The chloroplast tRNALys(UUU) gene from mustard (Sinapis alba) contains a class II intron potentially coding for a maturase-related polypeptide.

PubMed

Neuhaus, H; Link, G

1987-01-01

The trnK gene encoding the tRNALys(UUU) has been located on mustard (Sinapis alba) chloroplast DNA, 263 bp upstream of the psbA gene on the same strand. The nucleotide sequence of the trnK gene and its flanking regions as well as the putative transcription start and termination sites are shown. The 5' end of the transcript lies 121 bp upstream of the 5' tRNA coding region and is preceded by procaryotic-type "-10" and "-35" sequence elements, while the 3' end maps 2.77 kb downstream to a DNA region with possible stem-loop secondary structure. The anticodon loop of the tRNALys is interrupted by a 2,574 bp intron containing a long open reading frame, which codes for 524 amino acids. Based on conserved stem and loop structures, this intron has characteristic features of a class II intron. A region near the carboxyl terminus of the derived polypeptide appears structurally related to maturases.
Draft Genome Sequence of Janthinobacterium sp. Strain ROICE36, a Putative Secondary Metabolite-Synthesizing Bacterium Isolated from Antarctic Snow

PubMed Central

Chiriac, Cecilia; Baricz, Andreea

2018-01-01

The draft genome assembly of Janthinobacterium sp. strain ROICE36 has 207 contigs, with a total genome size of 5,977,006 bp and a G+C content of 62%. Preliminary genome analysis identified 5,363 protein-coding genes and a total of 7 secondary metabolic gene clusters (encoding bacteriocins, nonribosomal peptide-synthetase [NRPS], terpene, hserlactone, and other ketide synthases). PMID:29650588
Improving the genome annotation of the acarbose producer Actinoplanes sp. SE50/110 by sequencing enriched 5'-ends of primary transcripts.

PubMed

Schwientek, Patrick; Neshat, Armin; Kalinowski, Jörn; Klein, Andreas; Rückert, Christian; Schneiker-Bekel, Susanne; Wendler, Sergej; Stoye, Jens; Pühler, Alfred

2014-11-20

Actinoplanes sp. SE50/110 is the producer of the alpha-glucosidase inhibitor acarbose, which is an economically relevant and potent drug in the treatment of type-2 diabetes mellitus. In this study, we present the detection of transcription start sites on this genome by sequencing enriched 5'-ends of primary transcripts. Altogether, 1427 putative transcription start sites were initially identified. With help of the annotated genome sequence, 661 transcription start sites were found to belong to the leader region of protein-coding genes with the surprising result that roughly 20% of these genes rank among the class of leaderless transcripts. Next, conserved promoter motifs were identified for protein-coding genes with and without leader sequences. The mapped transcription start sites were finally used to improve the annotation of the Actinoplanes sp. SE50/110 genome sequence. Concerning protein-coding genes, 41 translation start sites were corrected and 9 novel protein-coding genes could be identified. In addition to this, 122 previously undetermined non-coding RNA (ncRNA) genes of Actinoplanes sp. SE50/110 were defined. Focusing on antisense transcription start sites located within coding genes or their leader sequences, it was discovered that 96 of those ncRNA genes belong to the class of antisense RNA (asRNA) genes. The remaining 26 ncRNA genes were found outside of known protein-coding genes. Four chosen examples of prominent ncRNA genes, namely the transfer messenger RNA gene ssrA, the ribonuclease P class A RNA gene rnpB, the cobalamin riboswitch RNA gene cobRS, and the selenocysteine-specific tRNA gene selC, are presented in more detail. This study demonstrates that sequencing of enriched 5'-ends of primary transcripts and the identification of transcription start sites are valuable tools for advanced genome annotation of Actinoplanes sp. SE50/110 and most probably also for other bacteria. Copyright © 2014 Elsevier B.V. All rights reserved.
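The distinction drawn above between leadered and leaderless transcripts can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the 1-based coordinate convention, and the `max_leaderless` threshold are all assumptions made for illustration, since the abstract does not state how close a transcription start site must be to the start codon for a transcript to count as leaderless.

```python
def leader_length(tss: int, start_codon: int, strand: str) -> int:
    """Transcribed bases upstream of the start codon (1-based coordinates)."""
    # On the minus strand, upstream means a larger genomic coordinate.
    return start_codon - tss if strand == "+" else tss - start_codon


def classify_tss(tss: int, start_codon: int, strand: str,
                 max_leaderless: int = 0) -> str:
    """Label a TSS relative to a gene's annotated translation start.

    max_leaderless is a hypothetical cutoff: leaders of this length or
    shorter are called leaderless. The default of 0 requires the TSS to
    coincide with the first base of the start codon.
    """
    n = leader_length(tss, start_codon, strand)
    if n < 0:
        return "internal"  # TSS lies downstream of the start codon
    return "leaderless" if n <= max_leaderless else "leadered"


print(classify_tss(100, 100, "+"))  # TSS at the start codon
print(classify_tss(100, 135, "+"))  # 35-base leader
print(classify_tss(500, 465, "-"))  # 35-base leader on the minus strand
```

A TSS labeled "internal" against one gene's start codon might instead indicate a misannotated translation start, which is the kind of case the abstract describes correcting.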

Complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi (Rajiformes: Rajidae).

PubMed

Li, Weidong; Chen, Xiao; Liu, Wenai; Sun, Renjie; Zhou, Haolang

2016-07-01

The complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi was determined in this study. It is 16,974 bp in length and contains 13 protein-coding genes, two rRNA genes, 22 tRNA genes, and one putative control region. The overall base composition is 30.5% A, 27.8% C, 14.0% G, and 27.8% T. There are 28 bp short intergenic spaces located in 12 gene junctions and 31 bp overlaps located in nine gene junctions in the whole mitogenome. Two start codons (ATG and GTG) and two stop codons (TAG and TAA/T) were used in the protein-coding genes. The lengths of 22 tRNA genes range from 68 (tRNA-Ser2) to 75 (tRNA-Leu1) bp. The origin of L-strand replication (OL) sequence (37 bp) was identified between the tRNA-Asn and tRNA-Cys genes. The control region is 1311 bp in length with high A + T and poor G content.
The complete mitochondrial genome sequence of Aesopia cornuta (Pleuronectiformes: Soleidae).

PubMed

Wang, Shu-Ying; Shi, Wei; Wang, Zhong-Ming; Gong, Li; Kong, Xiao-Yu

2015-02-01

Aesopia cornuta belongs to the family Soleidae of Pleuronectiformes, and its morphological characters are very similar to those of Zebrias. In this article, we sequenced, characterized, and compared the complete mitogenome of A. cornuta for the first time. The genome is 16,737 base pairs in length and consists of the typical 37 genes, including 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, as well as a putative L-strand replication origin and a putative control region. The gene organization is identical to that of typical bony fishes. The overall base composition is 29.1, 28.3, 26.8 and 15.8% for C, A, T and G, respectively, with a slight AT bias of 55.1%. This result is expected to contribute to understanding the systematic evolution of the genus Aesopia and further taxonomic and phylogenetic studies of Soleidae and Pleuronectiformes.
Identification and analysis of unitary loss of long-established protein-coding genes in Poaceae shows evidences for biased gene loss and putatively functional transcription of relics.

PubMed

Zhao, Yi; Tang, Liang; Li, Zhe; Jin, Jinpu; Luo, Jingchu; Gao, Ge

2015-04-18

Long-established protein-coding genes may lose their coding potential during evolution ("unitary gene loss"). Members of the Poaceae family are a major food source and represent an ideal model clade for plant evolution research. However, the global pattern of unitary gene loss in Poaceae genomes as well as the evolutionary fate of lost genes are still little investigated and remain largely elusive. Using a locally developed pipeline, we identified 129 unitary gene loss events for long-established protein-coding genes from four representative species of Poaceae, i.e. brachypodium, rice, sorghum and maize. Functional annotation suggested that the genes lost in all or most Poaceae species are enriched for genes involved in development and response to endogenous stimulus. We also found that 44 mutated genomic loci of lost genes, which we refer to as relics, were still actively transcribed, of which 84% (37 of 44) showed significantly differential expression across different tissues. More interestingly, we found a total of five expressed relics that may function as competitive endogenous RNAs in the brachypodium, rice and sorghum genomes. Based on comparative genomics and transcriptome data, we compiled the first comprehensive catalogue of unitary gene loss events in Poaceae species, characterized a statistically significant functional preference for these lost genes, and showed the potential of relics to function as competitive endogenous RNAs in Poaceae genomes.
Deciphering the Genome Sequences of the Hydrophobic Cyanobacterium Scytonema tolypothrichoides VB-61278.

PubMed

Das, Abhishek; Panda, Arijit; Singh, Deeksha; Chandrababunaidu, Mathu Malar; Mishra, Gyan Prakash; Bhan, Sushma; Adhikary, Siba Prasad; Tripathy, Sucheta

2015-04-02

Scytonema tolypothrichoides VB-61278, a terrestrial cyanobacterium, can be exploited to produce commercially important products. Here, we report for the first time a 10-Mb draft genome assembly of S. tolypothrichoides VB-61278, with 214 scaffolds and 7,148 putative protein-coding genes. Copyright © 2015 Das et al.
The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)

Treesearch

G.A. Tuskan; S. DiFazio; S. Jansson; J. Bohlmann; I. Grigoriev; U. Hellsten; N. Putnam; S. Ralph; S. Rombauts; A. Salamov; J. Schein; L. Sterck; A. Aerts; R.R. Bhalerao; R.P. Bhalerao; D. Blaudez; W. Boerjan; A. Brun; A. Brunner; V. Busov; M. Campbell; J. Carlson; M. Chalot; J. Chapman; G.-L. Chen; D. Cooper; P.M. Coutinho; J. Couturier; S. Covert; Q. Cronk; R. Cunningham; J. Davis; S. Degroeve; A. Dejardin; C. dePamphilis; J. Detter; B. Dirks; U. Dubchak; S. Duplessis; J. Ehlting; B. Ellis; K. Gendler; D. Goodstein; M. Gribskov; J. Grimwood; A. Groover; L. Gunter; B. Hamberger; B. Heinze; Y. Helariutta; B. Henrissat; D. Holligan; R. Holt; W. Huang; N. Islam-Faridi; S. Jones; M. Jones-Rhoades; R. Jorgensen; C. Joshi; J. Kangasjarvi; J. Karlsson; C. Kelleher; R. Kirkpatrick; M. Kirst; A. Kohler; U. Kalluri; F. Larimer; J. Leebens-Mack; J.-C. Leple; P. Locascio; Y. Lou; S. Lucas; F. Martin; B. Montanini; C. Napoli; D.R. Nelson; C. Nelson; K. Nieminen; O. Nilsson; V. Pereda; G. Peter; R. Philippe; G. Pilate; A. Poliakov; J. Razumovskaya; P. Richardson; C. Rinaldi; K. Ritland; P. Rouze; D. Ryaboy; J. Schumtz; J. Schrader; B. Segerman; H. Shin; A. Siddiqui; F. Sterky; A. Terry; C.-J. Tsai; E. Uberbacher; P. Unneberg; J. Vahala; K. Wall; S. Wessler; G. Yang; T. Yin; C. Douglas; M. Marra; G. Sandberg; Y. Van de Peer; D. Rokhsar

2006-01-01

We report the draft genome of the black cottonwood tree, Populus trichocarpa. Integration of shotgun sequence assembly with genetic mapping enabled chromosome-scale reconstruction of the genome. More than 45,000 putative protein-coding genes were identified. Analysis of the assembled genome revealed a whole-genome duplication event; about 8000 pairs...
Tenebrio molitor antifreeze protein gene identification and regulation.

PubMed

Qin, Wensheng; Walker, Virginia K

2006-02-15

The yellow mealworm, Tenebrio molitor, is a freeze susceptible, stored product pest. Its winter survival is facilitated by the accumulation of antifreeze proteins (AFPs), encoded by a small gene family. We have now isolated 11 different AFP genomic clones from 3 genomic libraries. All the clones had a single coding sequence, with no evidence of intervening sequences. Three genomic clones were further characterized. All have putative TATA box sequences upstream of the coding regions and multiple potential poly(A) signal sequences downstream of the coding regions. A TmAFP regulatory region, B1037, conferred transcriptional activity when ligated to a luciferase reporter sequence and after transfection into an insect cell line. A 143 bp core promoter including a TATA box sequence was identified. Its promoter activity was increased 4.4 times by inserting an exotic 245 bp intron into the construct, similar to the enhancement of transgenic expression seen in several other systems. The addition of a duplication of the first 120 bp sequence from the 143 bp core promoter decreased promoter activity by half. Although putative hormonal response sequences were identified, none of the five hormones tested enhanced reporter activity. These studies on the mechanisms of AFP transcriptional control are important for the consideration of any transfer of freeze-resistance phenotypes to beneficial hosts.
LncRNAs in Secondary Hair Follicle of Cashmere Goat: Identification, Expression, and Their Regulatory Network in Wnt Signaling Pathway.

PubMed

Bai, Wen L; Zhao, Su J; Wang, Ze Y; Zhu, Yu B; Dang, Yun L; Cong, Yu Y; Xue, Hui L; Wang, Wei; Deng, Liang; Guo, Dan; Wang, Shi Q; Zhu, Yan X; Yin, Rong H

2018-07-03

Long noncoding RNAs (lncRNAs) are a novel class of eukaryotic transcripts. They are thought to act as critical regulators of protein-coding gene expression. Herein, we identified and characterized 13 putative lncRNAs from the expressed sequence tags from secondary hair follicle of Cashmere goat. Furthermore, we investigated their transcriptional pattern in secondary hair follicle of Liaoning Cashmere goat during telogen and anagen phases. Also, we generated intracellular regulatory networks of lncRNAs upregulated at anagen in the Wnt signaling pathway based on bioinformatics analysis. The relative expression of six putative lncRNAs (lncRNA-599618, -599556, -599554, -599547, -599531, and -599509) at the anagen phase is significantly higher than that at telogen. Compared with anagen, the relative expression of four putative lncRNAs (lncRNA-599528, -599518, -599511, and -599497) was found to be significantly upregulated at telogen phase. The network generated showed a rich and complex regulatory relationship of the putative lncRNAs and related miRNAs with their target genes in the Wnt signaling pathway. Our results from the present study provide a foundation for further elucidating the functional and regulatory mechanisms of these putative lncRNAs in the development of secondary hair follicle and cashmere fiber growth of Cashmere goat.
A global analysis of protein expression profiles in Sinorhizobium meliloti: discovery of new genes for nodule occupancy and stress adaptation.

PubMed

Djordjevic, Michael A; Chen, Han Cai; Natera, Siria; Van Noorden, Giel; Menzel, Christian; Taylor, Scott; Renard, Clotilde; Geiger, Otto; Weiller, Georg F

2003-06-01

A proteomic examination of Sinorhizobium meliloti strain 1021 was undertaken using a combination of 2-D gel electrophoresis, peptide mass fingerprinting, and bioinformatics. Our goal was to identify (i) putative symbiosis- or nutrient-stress-specific proteins, (ii) the biochemical pathways active under different conditions, (iii) potential new genes, and (iv) the extent of posttranslational modifications of S. meliloti proteins. In total, we identified the protein products of 810 genes (13.1% of the genome's coding capacity). The 810 genes generated 1,180 gene products, with chromosomal genes accounting for 78% of the gene products identified (18.8% of the chromosome's coding capacity). The activity of 53 metabolic pathways was inferred from bioinformatic analysis of proteins with assigned Enzyme Commission numbers. Of the remaining proteins that did not encode enzymes, ABC-type transporters composed 12.7% and regulatory proteins 3.4% of the total. Proteins with up to seven transmembrane domains were identified in membrane preparations. A total of 27 putative nodule-specific proteins and 35 nutrient-stress-specific proteins were identified and used as a basis to define genes and describe processes occurring in S. meliloti cells in nodules and under stress. Several nodule proteins from the plant host were present in the nodule bacteria preparations. We also identified seven potentially novel proteins not predicted from the DNA sequence. Post-translational modifications such as N-terminal processing could be inferred from the data. The posttranslational addition of UMP to the key regulator of nitrogen metabolism, PII, was demonstrated. This work demonstrates the utility of combining mass spectrometry with protein arraying or separation techniques to identify candidate genes involved in important biological processes and niche occupations that may be intransigent to other methods of gene expression profiling.
The Human Cell Surfaceome of Breast Tumors

PubMed Central

da Cunha, Júlia Pinheiro Chagas; Galante, Pedro Alexandre Favoretto; de Souza, Jorge Estefano Santana; Pieprzyk, Martin; Carraro, Dirce Maria; Old, Lloyd J.; Camargo, Anamaria Aranha; de Souza, Sandro José

2013-01-01

Introduction. Cell surface proteins are ideal targets for cancer therapy and diagnosis. We have identified a set of more than 3700 genes that code for transmembrane proteins believed to be at the human cell surface. Methods. We used a high-throughput qPCR system for the analysis of 573 cell surface protein-coding genes in 12 primary breast tumors, 8 breast cell lines, and 21 normal human tissues including breast. To better understand the role of these genes in breast tumors, we used a series of bioinformatics strategies to integrate different types of datasets, such as KEGG, protein-protein interaction databases, ONCOMINE, and data from the literature. Results. We found that at least 77 genes are overexpressed in breast primary tumors while at least 2 of them also have a restricted expression pattern in normal tissues. We found common signaling pathways that may be regulated in breast tumors through the overexpression of these cell surface protein-coding genes. Furthermore, a comparison was made between the genes found in this report and other genes associated with features clinically relevant for breast tumorigenesis. Conclusions. The expression profiling generated in this study, together with an integrative bioinformatics analysis, allowed us to identify putative targets for breast tumors. PMID:24195083
A genome-wide survey of maternal and embryonic transcripts during Xenopus tropicalis development.

PubMed

Paranjpe, Sarita S; Jacobi, Ulrike G; van Heeringen, Simon J; Veenstra, Gert Jan C

2013-11-06

Dynamics of polyadenylation vs. deadenylation determine the fate of several developmentally regulated genes. Decay of a subset of maternal mRNAs and new transcription define the maternal-to-zygotic transition, but the full complement of polyadenylated and deadenylated coding and non-coding transcripts has not yet been assessed in Xenopus embryos. To analyze the dynamics and diversity of coding and non-coding transcripts during development, both polyadenylated mRNA and ribosomal RNA-depleted total RNA were harvested across six developmental stages and subjected to high throughput sequencing. The maternally loaded transcriptome is highly diverse and consists of both polyadenylated and deadenylated transcripts. Many maternal genes show peak expression in the oocyte and include genes which are known to be the key regulators of events like oocyte maturation and fertilization. Of all the transcripts that increase in abundance between early blastula and larval stages, about 30% of the embryonic genes are induced by fourfold or more by the late blastula stage and another 35% by late gastrulation. Using a gene model validation and discovery pipeline, we identified novel transcripts and putative long non-coding RNAs (lncRNA). These lncRNA transcripts were stringently selected as spliced transcripts generated from independent promoters, with limited coding potential and a codon bias characteristic of noncoding sequences. Many lncRNAs are conserved and expressed in a developmental stage-specific fashion. These data reveal dynamics of transcriptome polyadenylation and abundance and provide a high-confidence catalogue of novel and long non-coding RNAs.
Characterization of mitochondrial genome of sea cucumber Stichopus horrens: a novel gene arrangement in Holothuroidea.

PubMed

Fan, SiGang; Hu, ChaoQun; Wen, Jing; Zhang, LvPing

2011-05-01

The complete mitochondrial DNA sequence contains useful information for phylogenetic analyses of metazoa. In this study, the complete mitochondrial DNA sequence of sea cucumber Stichopus horrens (Holothuroidea: Stichopodidae: Stichopus) is presented. The complete sequence was determined using normal and long PCRs. The mitochondrial genome of Stichopus horrens is a circular molecule 16257 bps long, composed of 13 protein-coding genes, two ribosomal RNA genes and 22 transfer RNA genes. Most of these genes are coded on the heavy strand except for one protein-coding gene (nad6) and five tRNA genes (tRNA(Ser(UCN)), tRNA(Gln), tRNA(Ala), tRNA(Val), tRNA(Asp)) which are coded on the light strand. The composition of the heavy strand is 30.8% A, 23.7% C, 16.2% G, and 29.3% T bases (AT skew=0.025; GC skew=-0.188). A non-coding region of 675 bp was identified as a putative control region because of its location and AT richness. The intergenic spacers range from 1 to 50 bp in size, totaling 227 bp. A total of 25 overlapping nucleotides, ranging from 1 to 10 bp in size, exist among 11 genes. All 13 protein-coding genes are initiated with an ATG. The TAA codon is used as the stop codon in all the protein coding genes except nad3 and nad4 that use TAG as their termination codon. The most frequently used amino acids are Leu (16.29%), Ser (10.34%) and Phe (8.37%). All of the tRNA genes have the potential to fold into typical cloverleaf secondary structures. We also compared the order of the genes in the mitochondrial DNA from the five holothurians that are now available and found a novel gene arrangement in the mitochondrial DNA of Stichopus horrens.
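The skew values quoted in this abstract can be checked from the reported base composition. The short Python sketch below assumes the conventional definitions, AT skew = (A − T)/(A + T) and GC skew = (G − C)/(G + C); the abstract does not spell the formulas out, so treat them as an assumption that happens to reproduce the published figures.

```python
def skew(x: float, y: float) -> float:
    """Return the skew (x - y) / (x + y) for two base frequencies."""
    return (x - y) / (x + y)


# Heavy-strand base composition (%) as reported in the abstract.
composition = {"A": 30.8, "C": 23.7, "G": 16.2, "T": 29.3}

at_skew = skew(composition["A"], composition["T"])
gc_skew = skew(composition["G"], composition["C"])

print(f"AT skew = {at_skew:.3f}")
print(f"GC skew = {gc_skew:.3f}")
```

A positive AT skew means slightly more A than T on the strand, and a negative GC skew means more C than G.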
Genetic basis for mycophenolic acid production and strain-dependent production variability in Penicillium roqueforti.

PubMed

Gillot, Guillaume; Jany, Jean-Luc; Dominguez-Santos, Rebeca; Poirier, Elisabeth; Debaets, Stella; Hidalgo, Pedro I; Ullán, Ricardo V; Coton, Emmanuel; Coton, Monika

2017-04-01

Mycophenolic acid (MPA) is a secondary metabolite produced by various Penicillium species including Penicillium roqueforti. The MPA biosynthetic pathway was recently described in Penicillium brevicompactum. In this study, an in silico analysis of the P. roqueforti FM164 genome sequence localized a 23.5-kb putative MPA gene cluster. The cluster contains seven genes putatively coding seven proteins (MpaA, MpaB, MpaC, MpaDE, MpaF, MpaG, MpaH) and is highly similar (i.e. gene synteny, sequence homology) to the P. brevicompactum cluster. To confirm the involvement of this gene cluster in MPA biosynthesis, gene silencing using RNA interference targeting mpaC, encoding a putative polyketide synthase, was performed in a high MPA-producing P. roqueforti strain (F43-1). In the obtained transformants, decreased MPA production (measured by LC-Q-TOF/MS) was correlated to reduced mpaC gene expression by Q-RT-PCR. In parallel, mycotoxin quantification on multiple P. roqueforti strains suggested strain-dependent MPA-production. Thus, the entire MPA cluster was sequenced for P. roqueforti strains with contrasted MPA production and a 174bp deletion in mpaC was observed in low MPA-producers. PCRs directed towards the deleted region among 55 strains showed an excellent correlation with MPA quantification. Our results indicated the clear involvement of mpaC gene as well as surrounding cluster in P. roqueforti MPA biosynthesis. Copyright © 2016 Elsevier Ltd. All rights reserved.
Non-parent of Origin Expression of Numerous Effector Genes Indicates a Role of Gene Regulation in Host Adaption of the Hybrid Triticale Powdery Mildew Pathogen.

PubMed

Praz, Coraline R; Menardo, Fabrizio; Robinson, Mark D; Müller, Marion C; Wicker, Thomas; Bourras, Salim; Keller, Beat

2018-01-01

Powdery mildew is an important disease of cereals. It is caused by one species, Blumeria graminis, which is divided into formae speciales each of which is highly specialized to one host. Recently, a new form capable of growing on triticale (B.g. triticale) has emerged through hybridization between wheat and rye mildews (B.g. tritici and B.g. secalis, respectively). In this work, we used RNA sequencing to study the molecular basis of host adaptation in B.g. triticale. We analyzed gene expression in three B.g. tritici isolates, two B.g. secalis isolates and two B.g. triticale isolates and identified a core set of putative effector genes that are highly expressed in all formae speciales. We also found that the genes differentially expressed between isolates of the same form as well as between different formae speciales were enriched in putative effectors. Their coding genes belong to several families including some which contain known members of mildew avirulence (Avr) and suppressor (Svr) genes. Based on these findings we propose that effectors play an important role in host adaptation that is mechanistically based on Avr-Resistance gene-Svr interactions. We also found that gene expression in the B.g. triticale hybrid is mostly conserved with the parent-of-origin, but some genes inherited from B.g. tritici showed a B.g. secalis-like expression. Finally, we identified 11 unambiguous cases of putative effector genes with hybrid-specific, non-parent of origin gene expression, and we propose that they are possible determinants of host specialization in triticale mildew. These data suggest that altered expression of multiple effector genes, in particular Avr and Svr related factors, might play a role in mildew host adaptation based on hybridization.
Genome sequence of an enhancin gene-rich nucleopolyhedrovirus (NPV) from Agrotis segetum: collinearity with Spodoptera exigua multiple NPV.

PubMed

Jakubowska, Agata K; Peters, Sander A; Ziemnicka, Jadwiga; Vlak, Just M; van Oers, Monique M

2006-03-01

The genome sequence of a Polish isolate of Agrotis segetum nucleopolyhedrovirus (AgseNPV-A) was determined and analysed. The circular genome is composed of 147,544 bp and has a G+C content of 45.7 mol%. It contains 153 putative, non-overlapping open reading frames (ORFs) encoding predicted proteins of more than 50 aa, together making up 89.8 % of the genome. The remaining 10.2 % of the DNA constitutes non-coding regions and homologous-repeat regions. One hundred and forty-three AgseNPV-A ORFs are homologues of previously reported baculovirus gene sequences. There are ten unique ORFs and they account for 3 % of the genome in total. All 62 lepidopteran baculovirus genes, including the 29 core baculovirus genes, were found in the AgseNPV-A genome. The gene content and gene order of AgseNPV-A are most similar to those of Spodoptera exigua (Se) multiple NPV and their shared homologous genes are 100 % collinear. Three putative enhancin genes were identified in the AgseNPV-A genome. In phylogenetic analysis, the AgseNPV-A enhancins form a cluster separated from enhancins of the Mamestra species NPVs.
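The ORF criterion mentioned above (predicted proteins of more than 50 aa) can be illustrated with a minimal six-frame scan in Python. This is a sketch under stated assumptions, not the annotation procedure used for AgseNPV-A: it assumes ATG start codons, the standard stop codons, and a linear sequence. It does not handle the circular genome junction or filter overlapping ORFs, and the function names are hypothetical.

```python
STOPS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq: str, min_aa: int = 50):
    """List ATG-to-stop ORFs encoding more than min_aa residues.

    Returns (strand, frame, start, end, aa_len) tuples. Coordinates are
    0-based, end-exclusive, and relative to the scanned strand.
    """
    seq = seq.upper()
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first in-frame ATG opens the ORF
                elif start is not None and codon in STOPS:
                    aa_len = (i - start) // 3  # residues, stop excluded
                    if aa_len > min_aa:
                        orfs.append((strand, frame, start, i + 3, aa_len))
                    start = None
    return orfs


# Toy sequence: ATG + 60 alanine codons + stop, i.e. a 61-residue ORF.
toy = "ATG" + "GCT" * 60 + "TAA"
print(find_orfs(toy))
```

Real annotation pipelines add further evidence, such as homology to known genes and promoter motifs, before calling an ORF a putative gene.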
Characterization of a human X-linked gene from the DXS732E locus in the candidate region for the anhidrotic ectodermal dysplasia (EDA) gene (Xq13.1)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gault, J.; Zonana, J.; Zeltinger, J.

A conserved mouse genomic clone was used to identify a homologous human genomic clone (the DXS732E locus), which was subsequently employed to isolate cDNAs from a human fetal brain library. Nine unique overlapping cDNAs were isolated, and sequence analysis of 3.9 kb identified a putative 1 kb ORF. GRAIL analysis of the sequence supported the hypothesis that the putative ORF was coding sequence, and Prosite analysis of the putative ORF identified potential glycosylation and phosphorylation sites. The 5' end of the gene maps within a CpG island, and comparison of cDNA sequences indicates the gene is alternatively spliced at its 3' end. Northern analysis and RT-PCR indicate that two different sized messages appear to be expressed with the gene expressed in human fetal kidney, intestine, brain, and muscle. The gene is expressed in 77 day human skin, a time when hair follicle formation occurs. Anhidrotic ectodermal dysplasia (EDA) results in the abnormal morphogenesis of hair, teeth and eccrine sweat glands. A positional cloning strategy towards cloning the EDA gene had been used, and deletion and X-autosome translocation patients have been useful in further delimiting the EDA region. The present gene at the DXS732E locus is partially deleted in one EDA patient who does not have other apparent abnormalities. No rearrangements of the gene have been detected in two female X-autosome translocation EDA patients, nor in four additional male patients with submicroscopic molecular deletions.
Induction of multixenobiotic defense mechanisms in resistant Daphnia magna clones as a general cellular response to stress.

PubMed

Jordão, Rita; Campos, Bruno; Lemos, Marco F L; Soares, Amadeu M V M; Tauler, Romà; Barata, Carlos

2016-06-01

Multixenobiotic resistance mechanisms (MXR) were recently identified in Daphnia magna. Previous results characterized gene transcripts of genes encoding and efflux activities of four putative ABCB1 and ABCC transporters that were chemically induced but showed low specificity against model transporter substrates and inhibitors, thus preventing us from distinguishing between activities of different efflux transporter types. In this study we report on the specificity of induction of ABC transporters and of the stress protein hsp70 in clones selected to be genetically resistant to ABCB1 chemical substrates. Clones resistant to mitoxantrone, ivermectin and pentachlorophenol showed distinctive transcriptional responses of transporter protein coding genes and of putative transporter dye activities. Expression of hsp70 proteins also varied across resistant clones. Clones resistant to mitoxantrone and pentachlorophenol showed high constitutive levels of hsp70. Transcriptional levels of the abcb1 gene transporter and of putative dye transporter activity were also induced to a greater extent in the pentachlorophenol resistant clone. Observed higher dye transporter activities in individuals from clones resistant to mitoxantrone and ivermectin were unrelated with transcriptional levels of the studied four abcc and abcb1 transporter genes. These findings suggest that Abcb1 induction in D. magna may be a part of a general cellular stress response. Copyright © 2016 Elsevier B.V. All rights reserved.
Polymerization of non-complementary RNA: systematic symmetric nucleotide exchanges mainly involving uracil produce mitochondrial RNA transcripts coding for cryptic overlapping genes.

PubMed

Seligmann, Hervé

2013-03-01

Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, self-hybridization increases with length, suggesting endoribonuclease-limited elongation. Blast detects stop codon depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA coding potential. 
Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Cloning and sequencing of a laccase gene from the lignin-degrading basidiomycete Pleurotus ostreatus.

PubMed Central

Giardina, P; Cannio, R; Martirani, L; Marzullo, L; Palmieri, G; Sannia, G

1995-01-01

The gene (pox1) encoding a phenol oxidase from Pleurotus ostreatus, a lignin-degrading basidiomycete, was cloned and sequenced, and the corresponding pox1 cDNA was also synthesized and sequenced. The isolated gene consists of 2,592 bp, with the coding sequence being interrupted by 19 introns and flanked by an upstream region in which putative CAAT and TATA consensus sequences could be identified at positions -174 and -84, respectively. The isolation of a second cDNA (pox2 cDNA), showing 84% similarity, and of the corresponding truncated genomic clones demonstrated the existence of a multigene family coding for isoforms of laccase in P. ostreatus. PCR amplifications of specific regions on the DNA of isolated monokaryons proved that the two genes are not allelic forms. The POX1 amino acid sequence deduced was compared with those of other known laccases from different fungi. PMID:7793961
Prevalence of transcription promoters within archaeal operons and coding sequences

PubMed Central

Koide, Tie; Reiss, David J; Bare, J Christopher; Pang, Wyming Lee; Facciotti, Marc T; Schmid, Amy K; Pan, Min; Marzolf, Bruz; Van, Phu T; Lo, Fang-Yin; Pratap, Abhishek; Deutsch, Eric W; Peterson, Amelia; Martin, Dan; Baliga, Nitin S

2009-01-01

Despite the knowledge of complex prokaryotic-transcription mechanisms, generalized rules, such as the simplified organization of genes into operons with well-defined promoters and terminators, have had a significant role in systems analysis of regulatory logic in both bacteria and archaea. Here, we have investigated the prevalence of alternate regulatory mechanisms through genome-wide characterization of transcript structures of ∼64% of all genes, including putative non-coding RNAs in Halobacterium salinarum NRC-1. Our integrative analysis of transcriptome dynamics and protein–DNA interaction data sets showed widespread environment-dependent modulation of operon architectures, transcription initiation and termination inside coding sequences, and extensive overlap in 3′ ends of transcripts for many convergently transcribed genes. A significant fraction of these alternate transcriptional events correlate to binding locations of 11 transcription factors and regulators (TFs) inside operons and annotated genes—events usually considered spurious or non-functional. Using experimental validation, we illustrate the prevalence of overlapping genomic signals in archaeal transcription, casting doubt on the general perception of rigid boundaries between coding sequences and regulatory elements. PMID:19536208
The putative drug efflux systems of the Bacillus cereus group

PubMed Central

Elbourne, Liam D. H.; Vörös, Aniko; Kroeger, Jasmin K.; Simm, Roger; Tourasse, Nicolas J.; Finke, Sarah; Henderson, Peter J. F.; Økstad, Ole Andreas; Paulsen, Ian T.; Kolstø, Anne-Brit

2017-01-01

The Bacillus cereus group of bacteria includes seven closely related species, three of which, B. anthracis, B. cereus and B. thuringiensis, are pathogens of humans, animals and/or insects. Preliminary investigations into the transport capabilities of different bacterial lineages suggested that genes encoding putative efflux systems were unusually abundant in the B. cereus group compared to other bacteria. To explore the drug efflux potential of the B. cereus group, all putative efflux systems were identified in the genomes of prototypical strains of B. cereus, B. anthracis and B. thuringiensis using our Transporter Automated Annotation Pipeline. More than 90 putative drug efflux systems were found within each of these strains, accounting for up to 2.7% of their protein coding potential. Comparative analyses demonstrated that the efflux systems are highly conserved between these species; 70–80% of the putative efflux pumps were shared between all three strains studied. Furthermore, 82% of the putative efflux system proteins encoded by the prototypical B. cereus strain ATCC 14579 (type strain) were found to be conserved in at least 80% of 169 B. cereus group strains that have high quality genome sequences available. However, only a handful of these efflux pumps have been functionally characterized. Deletion of individual efflux pump genes from B. cereus typically had little impact on drug resistance phenotypes or the general fitness of the strains, possibly because of the large numbers of alternative efflux systems that may have overlapping substrate specificities. Therefore, to gain insight into the possible transport functions of efflux systems in B. cereus, we undertook large-scale qRT-PCR analyses of efflux pump gene expression following drug shocks and other stress treatments. Clustering of gene expression changes identified several groups of similarly regulated systems that may have overlapping drug resistance functions. In this article we review current knowledge of the small molecule efflux pumps encoded by the B. cereus group and suggest the likely functions of numerous uncharacterised pumps. PMID:28472044

Molecular cloning and characterization of a gene encoding glutaminase from Aspergillus oryzae.

PubMed

Koibuchi, K; Nagasaki, H; Yuasa, A; Kataoka, J; Kitamoto, K

2000-07-01

A glutaminase from Aspergillus oryzae was purified and its molecular weight was determined to be 82,091 by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Purified glutaminase catalysed the hydrolysis not only of L-glutamine but also of D-glutamine. Both the molecular weight and the substrate specificity of this glutaminase were different from those reported previously [Yano et al. (1998) J Ferment Technol 66: 137-143]. On the basis of its internal amino acid sequences, we have isolated and characterized the glutaminase gene (gtaA) from A. oryzae. The gtaA gene had an open reading frame coding for 690 amino acid residues, including a signal peptide of 20 amino acid residues and a mature protein of 670 amino acid residues. In the 5'-flanking region of the gene, there were three putative CreAp binding sequences and one putative AreAp binding sequence. The gtaA structural gene was introduced into A. oryzae NS4 and a marked increase in activity was detected in comparison with the control strain. The gtaA gene was also isolated from Aspergillus nidulans on the basis of the determined nucleotide sequence of the gtaA gene from A. oryzae.
Deep transcriptome annotation enables the discovery and functional characterization of cryptic small proteins

PubMed Central

Delcourt, Vivian; Lucier, Jean-François; Gagnon, Jules; Beaudoin, Maxime C; Vanderperre, Benoît; Breton, Marc-André; Motard, Julie; Jacques, Jean-François; Brunelle, Mylène; Gagnon-Arsenault, Isabelle; Fournier, Isabelle; Ouangraoua, Aida; Hunting, Darel J; Cohen, Alan A; Landry, Christian R; Scott, Michelle S

2017-01-01

Recent functional, proteomic and ribosome profiling studies in eukaryotes have concurrently demonstrated the translation of alternative open-reading frames (altORFs) in addition to annotated protein coding sequences (CDSs). We show that a large number of small proteins could in fact be coded by these altORFs. The putative alternative proteins translated from altORFs have orthologs in many species and contain functional domains. Evolutionary analyses indicate that altORFs often show more extreme conservation patterns than their CDSs. Thousands of alternative proteins are detected in proteomic datasets by reanalysis using a database containing predicted alternative proteins. This is illustrated with specific examples, including altMiD51, a 70 amino acid mitochondrial fission-promoting protein encoded in MiD51/Mief1/SMCR7L, a gene encoding an annotated protein promoting mitochondrial fission. Our results suggest that many genes are multicoding genes and code for a large protein and one or several small proteins. PMID:29083303
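The idea of alternative open reading frames overlapping an annotated CDS in a different frame can be illustrated with a small frame scan. The following is a minimal sketch only, not the annotation pipeline used in the study: the function name `find_orfs`, the `min_codons` threshold, the restriction to ATG starts on the forward strand, and the toy sequence are all assumptions made for illustration.

```python
# Hypothetical sketch: list ATG...stop ORFs in the three forward reading frames.
# Not the authors' method; names, thresholds and the toy sequence are illustrative.
STOPS = {"TAA", "TAG", "TGA"}

def find_orfs(seq, min_codons=30):
    """Return (frame, start, end) tuples; end is the index just past the stop codon.

    min_codons counts codons from the ATG up to, but not including, the stop.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None  # index of the first ATG seen since the last stop in this frame
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOPS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((frame, start, i + 3))
                start = None
    return orfs

# Toy sequence with two short ORFs that overlap in different frames.
print(find_orfs("ATGGCATGACCTAGTAA", min_codons=2))
# -> [(0, 0, 9), (2, 5, 14)]
```

In the toy output, the second ORF starts inside the first and is read in a different frame, which is the situation the abstract calls a multicoding gene. A real analysis would also scan the reverse strand and work on spliced transcripts rather than raw sequence.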
Arabidopsis Polycomb Repressive Complex 2 binding sites contain putative GAGA factor binding motifs within coding regions of genes

PubMed Central

2013-01-01

Background: Polycomb Repressive Complex 2 (PRC2) is an essential regulator of gene expression that maintains genes in a repressed state by marking chromatin with trimethylated Histone H3 lysine 27 (H3K27me3). In Arabidopsis, loss of PRC2 function leads to pleiotropic effects on growth and development thought to be due to ectopic expression of seed and embryo-specific genes. While there is some understanding of the mechanisms by which specific genes are targeted by PRC2 in animal systems, it is still not clear how PRC2 is recruited to specific regions of plant genomes. Results: We used ChIP-seq to determine the genome-wide distribution of hemagglutinin (HA)-tagged FERTILIZATION INDEPENDENT ENDOSPERM (FIE-HA), the Extra Sex Combs homolog protein present in all Arabidopsis PRC2 complexes. We found that the FIE-HA binding sites co-locate with a subset of the H3K27me3 sites in the genome and that the associated genes were more likely to be de-repressed in mutants of PRC2 components. The FIE-HA binding sites are enriched for three sequence motifs including a putative GAGA factor binding site that is also found in Drosophila Polycomb Response Elements (PREs). Conclusions: Our results suggest that PRC2 binding sites in plant genomes share some sequence features with Drosophila PREs. However, unlike Drosophila PREs which are located in promoters and devoid of H3K27me3, Arabidopsis FIE binding sites tend to be in gene coding regions and co-localize with H3K27me3. PMID:24001316
Genetic polymorphism in three glutathione s-transferase genes and breast cancer risk

DOE Office of Scientific and Technical Information (OSTI.GOV)

Woldegiorgis, S.; Ahmed, R.C.; Zhen, Y.

The role of the glutathione S-transferase (GST) enzyme family is to detoxify environmental toxins and carcinogens and to protect organisms from their adverse effects, including cancer. The genes GSTM1, GSTP1, and GSTT1 code for three GSTs involved in the detoxification of carcinogens, such as polycyclic aromatic hydrocarbons (PAHs) and benzene. In humans, GSTM1 is deleted in about 50% of the population, GSTT1 is absent in about 20%, whereas the GSTP1 gene has a single base polymorphism resulting in an enzyme with reduced activity. Epidemiological studies indicate that GST polymorphisms increase the level of carcinogen-induced DNA damage, and several studies have found a correlation between polymorphisms in one of the GST genes and an increased risk for certain cancers. We examined the role of polymorphisms in genes coding for these three GST enzymes in breast cancer. A breast tissue collection consisting of specimens of breast cancer patients and non-cancer controls was analyzed by polymerase chain reaction (PCR) for the presence or absence of the GSTM1 and GSTT1 genes and for GSTP1 single base polymorphism by PCR/RFLP. We found that GSTM1 and GSTT1 deletions occurred more frequently in cases than in controls, and GSTP1 polymorphism was more frequent in controls. The effective detoxifier (putative low-risk) genotype (defined as presence of both GSTM1 and GSTT1 genes and GSTP1 wild type) was less frequent in cases than controls (16% vs. 23%, respectively). The poor detoxifier (putative high-risk) genotype was more frequent in cases than controls. However, the sample size of this study was too small to provide conclusive results.
Two novel heat shock genes encoding proteins produced in response to heterologous protein expression in Escherichia coli.

PubMed Central

Allen, S P; Polazzi, J O; Gierse, J K; Easton, A M

1992-01-01

In Escherichia coli high-level production of some heterologous proteins (specifically, human prorenin, renin, and bovine insulin-like growth factor 2) resulted in the induction of two new E. coli heat shock proteins, both of which have molecular masses of 16 kDa and are tightly associated with inclusion bodies formed during heterologous protein production. We named these inclusion body-associated proteins IbpA and IbpB. The coding sequences for IbpA and IbpB were identified and isolated from the Kohara E. coli gene bank. The genes for these proteins (ibpA and ibpB) are located at 82.5 min on the chromosome. Nucleotide sequencing of the two genes revealed that they are transcribed in the same direction and are separated by 110 bp. Putative Shine-Dalgarno sequences are located upstream from the initiation codons of both genes. A putative heat shock promoter is located upstream from ibpA, and a putative transcription terminator is located downstream from ibpB. A temperature upshift experiment in which we used a wild-type E. coli strain and an isogenic rpoH mutant strain indicated that a sigma 32-containing RNA polymerase is involved in the regulation of expression of these genes. There is 57.5% identity between the genes at the nucleotide level and 52.2% identity at the amino acid level. A search of the protein data bases showed that both of these 16-kDa proteins exhibit low levels of homology to low-molecular-weight heat shock proteins from eukaryotic species. PMID:1356969
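Percent identity figures such as those quoted for ibpA and ibpB are computed from a pairwise alignment. The abstract does not say which alignment method or denominator was used, so the sketch below makes its own assumptions: the two sequences are already aligned to equal length, `-` marks a gap, and gapped columns are excluded from the count. The function name `percent_identity` and the toy sequences are hypothetical.

```python
# Hypothetical sketch: percent identity over the ungapped columns of a pairwise alignment.
# The choice of denominator (ungapped columns only) is an assumption, not taken from the source.
def percent_identity(aln_a, aln_b, gap="-"):
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aln_a.upper(), aln_b.upper()):
        if a == gap or b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared

# Toy alignment: 4 matches across 5 ungapped columns.
print(percent_identity("ATGC-A", "ATGGTA"))  # -> 80.0
```

Other conventions divide by the full alignment length or by the shorter sequence length, so identity values are only comparable when the same convention is used throughout.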
Global functional atlas of Escherichia coli encompassing previously uncharacterized proteins.

PubMed

Hu, Pingzhao; Janga, Sarath Chandra; Babu, Mohan; Díaz-Mejía, J Javier; Butland, Gareth; Yang, Wenhong; Pogoutse, Oxana; Guo, Xinghua; Phanse, Sadhna; Wong, Peter; Chandran, Shamanta; Christopoulos, Constantine; Nazarians-Armavil, Anaies; Nasseri, Negin Karimi; Musso, Gabriel; Ali, Mehrab; Nazemof, Nazila; Eroukova, Veronika; Golshani, Ashkan; Paccanaro, Alberto; Greenblatt, Jack F; Moreno-Hagelsieb, Gabriel; Emili, Andrew

2009-04-28

One-third of the 4,225 protein-coding genes of Escherichia coli K-12 remain functionally unannotated (orphans). Many map to distant clades such as Archaea, suggesting involvement in basic prokaryotic traits, whereas others appear restricted to E. coli, including pathogenic strains. To elucidate the orphans' biological roles, we performed an extensive proteomic survey using affinity-tagged E. coli strains and generated comprehensive genomic context inferences to derive a high-confidence compendium for virtually the entire proteome consisting of 5,993 putative physical interactions and 74,776 putative functional associations, most of which are novel. Clustering of the respective probabilistic networks revealed putative orphan membership in discrete multiprotein complexes and functional modules together with annotated gene products, whereas a machine-learning strategy based on network integration implicated the orphans in specific biological processes. We provide additional experimental evidence supporting orphan participation in protein synthesis, amino acid metabolism, biofilm formation, motility, and assembly of the bacterial cell envelope. This resource provides a "systems-wide" functional blueprint of a model microbe, with insights into the biological and evolutionary significance of previously uncharacterized proteins.
Detection of non-coding RNA in bacteria and archaea using the DETR'PROK Galaxy pipeline.

PubMed

Toffano-Nioche, Claire; Luo, Yufei; Kuchly, Claire; Wallon, Claire; Steinbach, Delphine; Zytnicki, Matthias; Jacq, Annick; Gautheret, Daniel

2013-09-01

RNA-seq experiments are now routinely used for the large scale sequencing of transcripts. In bacteria or archaea, such deep sequencing experiments typically produce 10-50 million fragments that cover most of the genome, including intergenic regions. In this context, the precise delineation of the non-coding elements is challenging. Non-coding elements include untranslated regions (UTRs) of mRNAs, independent small RNA genes (sRNAs) and transcripts produced from the antisense strand of genes (asRNA). Here we present a computational pipeline (DETR'PROK: detection of ncRNAs in prokaryotes) based on the Galaxy framework that takes as input a mapping of deep sequencing reads and performs successive steps of clustering, comparison with existing annotation and identification of transcribed non-coding fragments classified into putative 5' UTRs, sRNAs and asRNAs. We provide a step-by-step description of the protocol using real-life example data sets from Vibrio splendidus and Escherichia coli. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
High-throughput sequencing and analysis of the gill tissue transcriptome from the deep-sea hydrothermal vent mussel Bathymodiolus azoricus

PubMed Central

2010-01-01

Background: Bathymodiolus azoricus is a deep-sea hydrothermal vent mussel found in association with large faunal communities living in chemosynthetic environments at the bottom of the sea floor near the Azores Islands. Investigation of the exceptional physiological reactions that vent mussels have adopted in their habitat, including responses to environmental microbes, remains a difficult challenge for deep-sea biologists. In an attempt to reveal genes potentially involved in the deep-sea mussel innate immunity we carried out a high-throughput sequence analysis of the freshly collected B. azoricus transcriptome, using gill tissue as the primary source of immune transcripts given its strategic role in filtering the surrounding waterborne, potentially infectious microorganisms. Additionally, a substantial EST data set was produced, from which a comprehensive collection of genes coding for putative proteins was organized in a dedicated database, "DeepSeaVent", the first deep-sea vent animal transcriptome database based on the 454 pyrosequencing technology. Results: A normalized cDNA library from gill tissue was sequenced in a full 454 GS-FLX run, producing 778,996 sequencing reads. Assembly of the high quality reads resulted in 75,407 contigs of which 3,071 were singletons. A total of 39,425 transcripts were conceptually translated into amino acid sequences, of which 22,023 matched known proteins in the NCBI non-redundant protein database, 15,839 revealed conserved protein domains through InterPro functional classification and 9,584 were assigned Gene Ontology terms. Queries conducted within the database enabled the identification of genes putatively involved in immune and inflammatory reactions which had not been previously evidenced in the vent mussel. Their physical counterpart was confirmed by semi-quantitative reverse transcription-polymerase chain reaction (RT-PCR) and their RNA transcription level by quantitative PCR (qPCR) experiments. Conclusions: We have established the first tissue transcriptional analysis of a deep-sea hydrothermal vent animal and generated a searchable catalog of genes that provides a direct method of identifying and retrieving vast numbers of novel coding sequences which can be applied in gene expression profiling experiments from a non-conventional model organism. This provides the most comprehensive sequence resource for identifying novel genes currently available for a deep-sea vent organism, in particular, genes putatively involved in immune and inflammatory reactions in vent mussels. The characterization of the B. azoricus transcriptome will facilitate research into biological processes underlying physiological adaptations to hydrothermal vent environments and will provide a basis for expanding our understanding of genes putatively involved in adaptation processes during post-capture long term acclimatization experiments, at "sea-level" conditions, using B. azoricus as a model organism. PMID:20937131
Vru (Sub0144) controls expression of proven and putative virulence determinants and alters the ability of Streptococcus uberis to cause disease in dairy cattle

PubMed Central

Egan, Sharon A.; Ward, Philip N.; Watson, Michael; Field, Terence R.

2012-01-01

The regulation and control of gene expression in response to differing environmental stimuli is crucial for successful pathogen adaptation and persistence. The regulatory gene vru of Streptococcus uberis encodes a stand-alone response regulator with similarity to the Mga of group A Streptococcus. Mga controls expression of a number of important virulence determinants. Experimental intramammary challenge of dairy cattle with a mutant of S. uberis carrying an inactivating lesion in vru showed reduced ability to colonize the mammary gland and an inability to induce clinical signs of mastitis compared with the wild-type strain. Analysis of transcriptional differences of gene expression in the mutant, determined by microarray analysis, identified a number of coding sequences with altered expression in the absence of Vru. These consisted of known and putative virulence determinants, including Lbp (Sub0145), SclB (Sub1095), PauA (Sub1785) and hasA (Sub1696). PMID:22383474
Structural organization and classification of cytochrome P450 genes in flax (Linum usitatissimum L.).

PubMed

Babu, Peram Ravindra; Rao, Khareedu Venkateswara; Reddy, Vudem Dashavantha

2013-01-15

Flax CYPome analysis resulted in the identification of 334 putative cytochrome P450 (CYP450) genes in the cultivated flax genome. Classification of flax CYP450 genes based on the sequence similarity with Arabidopsis orthologs and CYP450 nomenclature, revealed 10 clans representing 44 families and 98 subfamilies. CYP80, CYP83, CYP92, CYP702, CYP705, CYP708, CYP728, CYP729, CYP733 and CYP736 families are absent in the flax genome. The subfamily members exhibited conserved sequences, length of exons and phasing of introns. Similarity search of the genomic resources of wild flax species Linum bienne with CYP450 coding sequences of the cultivated flax, revealed the presence of 127 CYP450 gene orthologs, indicating amplification of novel CYP450 genes in the cultivated flax. Seven families CYP73, 74, 75, 76, 77, 84 and 709, coding for enzymes associated with phenylpropanoid/fatty acid metabolism, showed extensive gene amplification in the flax. About 59% of the flax CYP450 genes were present in the EST libraries. Copyright © 2012 Elsevier B.V. All rights reserved.
Evidence for regulation of columnar habit in apple by a putative 2OG-Fe(II) oxygenase.

PubMed

Wolters, Pieter J; Schouten, Henk J; Velasco, Riccardo; Si-Ammour, Azeddine; Baldi, Paolo

2013-12-01

Understanding the genetic mechanisms controlling columnar-type growth in the apple mutant 'Wijcik' will provide insights on how tree architecture and growth are regulated in fruit trees. In apple, columnar-type growth is controlled by a single major gene at the Columnar (Co) locus. By comparing the genomic sequence of the Co region of 'Wijcik' with its wild-type 'McIntosh', a novel non-coding DNA element of 1956 bp specific to Pyreae was found to be inserted in an intergenic region of 'Wijcik'. Expression analysis of selected genes located in the vicinity of the insertion revealed the upregulation of the MdCo31 gene encoding a putative 2OG-Fe(II) oxygenase in axillary buds of 'Wijcik'. Constitutive expression of MdCo31 in Arabidopsis thaliana resulted in compact plants with shortened floral internodes, a phenotype reminiscent of the one observed in columnar apple trees. We conclude that MdCo31 is a strong candidate gene for the control of columnar growth in 'Wijcik'. No claim to original European Union works. New Phytologist © 2013 New Phytologist Trust.
Regulation of the alpha-glucuronidase-encoding gene ( aguA) from Aspergillus niger.

PubMed

de Vries, R P; van de Vondervoort, P J I; Hendriks, L; van de Belt, M; Visser, J

2002-09-01

The alpha-glucuronidase gene aguA from Aspergillus niger was cloned and characterised. Analysis of the promoter region of aguA revealed the presence of four putative binding sites for the major carbon catabolite repressor protein CREA and one putative binding site for the transcriptional activator XLNR. In addition, a sequence motif was detected which differed only in the last nucleotide from the XLNR consensus site. A construct in which part of the aguA coding region was deleted still resulted in production of a stable mRNA upon transformation of A. niger. The putative XLNR binding sites and two of the putative CREA binding sites were mutated individually in this construct and the effects on expression were examined in A. niger transformants. Northern analysis of the transformants revealed that the consensus XLNR site is not actually functional in the aguA promoter, whereas the sequence that diverges from the consensus at a single position is functional. This indicates that XLNR is also able to bind to the sequence GGCTAG, and the XLNR binding site consensus should therefore be changed to GGCTAR. Both CREA sites are functional, indicating that CREA has a strong influence on aguA expression. A detailed expression analysis of aguA in four genetic backgrounds revealed a second regulatory system involved in activation of aguA gene expression. This system responds to the presence of glucuronic and galacturonic acids, and is not dependent on XLNR.
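The proposed change of the XLNR consensus to GGCTAR uses the IUPAC ambiguity code R, which stands for A or G. The sketch below shows how such a degenerate motif can be expanded into a regular expression and searched in a sequence. It is illustrative only: the function name `find_sites` and the toy sequence are hypothetical, the earlier consensus is taken to be GGCTAA (implied by the abstract but not stated in it), and only the given strand is scanned.

```python
import re

# IUPAC nucleotide ambiguity codes expanded to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}

def find_sites(seq, motif):
    """Return 0-based start positions of a degenerate motif on the given strand.

    A lookahead is used so that overlapping occurrences are all reported.
    """
    pattern = re.compile("(?=" + "".join(IUPAC[base] for base in motif.upper()) + ")")
    return [m.start() for m in pattern.finditer(seq.upper())]

# Toy promoter fragment (hypothetical) containing GGCTAA at 2 and GGCTAG at 10.
fragment = "TTGGCTAATTGGCTAGCC"
print(find_sites(fragment, "GGCTAA"))  # -> [2]
print(find_sites(fragment, "GGCTAR"))  # -> [2, 10]
```

Searching with the stricter motif misses the GGCTAG site, which is the kind of site the abstract reports as functional in the aguA promoter. To cover both strands, the same search would be repeated on the reverse complement.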
Functional Characterization of PaLAX1, a Putative Auxin Permease, in Heterologous Plant Systems

PubMed Central

Hoyerová, Klára; Perry, Lucie; Hand, Paul; Laňková, Martina; Kocábek, Tomáš; May, Sean; Kottová, Jana; Pačes, Jan; Napier, Richard; Zažímalová, Eva

2008-01-01

We have isolated the cDNA of the gene PaLAX1 from a wild cherry tree (Prunus avium). The gene and its product are highly similar in sequence to both the cDNAs and the corresponding protein products of AUX/LAX-type genes, coding for putative auxin influx carriers. We have prepared and characterized transformed Nicotiana tabacum and Arabidopsis thaliana plants carrying the gene PaLAX1. We have proved that constitutive overexpression of PaLAX1 is accompanied by changes in the content and distribution of free indole-3-acetic acid, the major endogenous auxin. The increase in free indole-3-acetic acid content in transgenic plants resulted in various phenotype changes typical of auxin-overproducing plants. The uptake of the synthetic auxin 2,4-dichlorophenoxyacetic acid was 3 times higher in transgenic lines than in wild-type lines, and treatment with the auxin uptake inhibitor 1-naphthoxyacetic acid reverted the changes caused by the expression of PaLAX1. Moreover, the agravitropic response could be restored by expression of PaLAX1 in the mutant aux1 plants, which are deficient in auxin influx carrier activity. Based on our data, we have concluded that the product of the gene PaLAX1 promotes the uptake of auxin into cells, and, as a putative auxin influx carrier, it affects the content and distribution of free endogenous auxin in transgenic plants. PMID:18184737
Comparative Transcriptome Analysis Identifies Putative Genes Involved in the Biosynthesis of Xanthanolides in Xanthium strumarium L.

PubMed

Li, Yuanjun; Gou, Junbo; Chen, Fangfang; Li, Changfu; Zhang, Yansheng

2016-01-01

Xanthium strumarium L. is a traditional Chinese herb belonging to the Asteraceae family. The major bioactive components of this plant are sesquiterpene lactones (STLs), which include the xanthanolides. To date, the biogenesis of xanthanolides, especially their downstream pathway, remains largely unknown. In X. strumarium, xanthanolides primarily accumulate in its glandular trichomes. To identify putative gene candidates involved in the biosynthesis of xanthanolides, three X. strumarium transcriptomes, which were derived from the young leaves of two different cultivars and the purified glandular trichomes from one of the cultivars, were constructed in this study. In total, 157 million clean reads were generated and assembled into 91,861 unigenes, of which 59,858 unigenes were successfully annotated. All the genes coding for known enzymes in the upstream pathway to the biosynthesis of xanthanolides were present in the X. strumarium transcriptomes. From a comparative analysis of the X. strumarium transcriptomes, this study identified a number of gene candidates that are putatively involved in the downstream pathway to the synthesis of xanthanolides, such as four unigenes encoding CYP71 P450s, 50 unigenes for dehydrogenases, and 27 genes for acetyltransferases. The possible functions of these four CYP71 candidates are extensively discussed. In addition, 116 transcription factors that are highly expressed in X. strumarium glandular trichomes were also identified. Their possible regulatory roles in the biosynthesis of STLs are discussed. The global transcriptomic data for X. strumarium should provide a valuable resource for further research into the biosynthesis of xanthanolides.
Comparative analysis of long non-coding RNAs in Atlantic and Coho salmon reveals divergent transcriptome responses associated with immunity and tissue repair during sea lice infestation.

PubMed

Valenzuela-Muñoz, Valentina; Valenzuela-Miranda, Diego; Gallardo-Escárate, Cristian

2018-05-24

The increasing capacity of transcriptomic analysis by high throughput sequencing has highlighted the presence of a large proportion of transcripts that do not encode proteins. In particular, long non-coding RNAs (lncRNAs) are sequences with low coding potential and conservation among species. Moreover, cumulative evidence has revealed important roles in post-transcriptional gene modulation in several taxa. In fish, the role of lncRNAs has been scarcely studied and even less so during the immune response against sea lice. In the present study we mined for lncRNAs in Atlantic salmon (Salmo salar) and Coho salmon (Oncorhynchus kisutch), which are affected by the sea louse Caligus rogercresseyi, evaluating the degree of sequence conservation between these two fish species and their putative roles during the infection process. Herein, Atlantic and Coho salmon were infected with 35 lice/fish and evaluated at 7 and 14 days post-infestation (dpi). For RNA sequencing, samples from skin and head kidney were collected. A total of 5658/4140 and 3678/2123 lncRNAs were identified in uninfected/infected Atlantic and Coho salmon transcriptomes, respectively. Species-specific transcription patterns were observed in exclusive lncRNAs according to the tissue analyzed. Furthermore, neighbor gene GO enrichment analysis of the top 100 highly regulated lncRNAs in Atlantic salmon showed that lncRNAs were localized near genes related to the immune response. On the other hand, in Coho salmon the highly regulated lncRNAs were localized near genes involved in tissue repair processes. This study revealed high regulation of lncRNAs closely localized to immune and tissue repair-related genes in Atlantic and Coho salmon, respectively, suggesting putative roles for lncRNAs in salmon against sea lice infestation. Copyright © 2018 Elsevier Ltd. All rights reserved.
In Silico Pattern-Based Analysis of the Human Cytomegalovirus Genome

PubMed Central

Rigoutsos, Isidore; Novotny, Jiri; Huynh, Tien; Chin-Bow, Stephen T.; Parida, Laxmi; Platt, Daniel; Coleman, David; Shenk, Thomas

2003-01-01

More than 200 open reading frames (ORFs) from the human cytomegalovirus genome have been reported as potentially coding for proteins. We have used two pattern-based in silico approaches to analyze this set of putative viral genes. With the help of an objective annotation method that is based on the Bio-Dictionary, a comprehensive collection of amino acid patterns that describes the currently known natural sequence space of proteins, we have reannotated all of the previously reported putative genes of the human cytomegalovirus. Also, with the help of MUSCA, a pattern-based multiple sequence alignment algorithm, we have reexamined the original human cytomegalovirus gene family definitions. Our analysis of the genome shows that many of the coded proteins comprise amino acid combinations that are unique to either the human cytomegalovirus or the larger group of herpesviruses. We have confirmed that a surprisingly large portion of the analyzed ORFs encode membrane proteins, and we have discovered a significant number of previously uncharacterized proteins that are predicted to be G-protein-coupled receptor homologues. The analysis also indicates that many of the encoded proteins undergo posttranslational modifications such as hydroxylation, phosphorylation, and glycosylation. ORFs encoding proteins with similar functional behavior appear in neighboring regions of the human cytomegalovirus genome. All of the results of the present study can be found and interactively explored online (http://cbcsrv.watson.ibm.com/virus/). PMID:12634390
In silico pattern-based analysis of the human cytomegalovirus genome.

PubMed

Rigoutsos, Isidore; Novotny, Jiri; Huynh, Tien; Chin-Bow, Stephen T; Parida, Laxmi; Platt, Daniel; Coleman, David; Shenk, Thomas

2003-04-01

More than 200 open reading frames (ORFs) from the human cytomegalovirus genome have been reported as potentially coding for proteins. We have used two pattern-based in silico approaches to analyze this set of putative viral genes. With the help of an objective annotation method that is based on the Bio-Dictionary, a comprehensive collection of amino acid patterns that describes the currently known natural sequence space of proteins, we have reannotated all of the previously reported putative genes of the human cytomegalovirus. Also, with the help of MUSCA, a pattern-based multiple sequence alignment algorithm, we have reexamined the original human cytomegalovirus gene family definitions. Our analysis of the genome shows that many of the coded proteins comprise amino acid combinations that are unique to either the human cytomegalovirus or the larger group of herpesviruses. We have confirmed that a surprisingly large portion of the analyzed ORFs encode membrane proteins, and we have discovered a significant number of previously uncharacterized proteins that are predicted to be G-protein-coupled receptor homologues. The analysis also indicates that many of the encoded proteins undergo posttranslational modifications such as hydroxylation, phosphorylation, and glycosylation. ORFs encoding proteins with similar functional behavior appear in neighboring regions of the human cytomegalovirus genome. All of the results of the present study can be found and interactively explored online (http://cbcsrv.watson.ibm.com/virus/).
Biochemical Characterization of Putative Adenylate Dimethylallyltransferase and Cytokinin Dehydrogenase from Nostoc sp. PCC 7120.

PubMed

Frébortová, Jitka; Greplová, Marta; Seidl, Michael F; Heyl, Alexander; Frébort, Ivo

2015-01-01

Cytokinins, a class of phytohormones, are adenine derivatives common to many different organisms. In plants, these play a crucial role as regulators of plant development and the reaction to abiotic and biotic stress. Key enzymes in cytokinin synthesis and degradation in modern land plants are the isopentenyl transferases and the cytokinin dehydrogenases, respectively. The genes encoding them were probably introduced into the plant lineage during the primary endosymbiosis. To shed light on the evolution of these proteins, the genes homologous to plant adenylate isopentenyl transferase and cytokinin dehydrogenase were amplified from the genomic DNA of the cyanobacterium Nostoc sp. PCC 7120 and expressed in Escherichia coli. The putative isopentenyl transferase was shown to be functional in a biochemical assay. In contrast, no enzymatic activity was detected for the putative cytokinin dehydrogenase, even though the principal domains necessary for its function are present. Several mutant variants, in which conserved amino acids in land plant cytokinin dehydrogenases had been restored, were inactive. A combination of experimental data with phylogenetic analysis indicates that adenylate-type isopentenyl transferases might have evolved several times independently. While the Nostoc genome contains a gene coding for a protein with characteristics of cytokinin dehydrogenase, the organism is not able to break down cytokinins in the way shown for land plants.
Biochemical Characterization of Putative Adenylate Dimethylallyltransferase and Cytokinin Dehydrogenase from Nostoc sp. PCC 7120

PubMed Central

Frébortová, Jitka; Greplová, Marta; Seidl, Michael F.; Heyl, Alexander; Frébort, Ivo

2015-01-01

Cytokinins, a class of phytohormones, are adenine derivatives common to many different organisms. In plants, these play a crucial role as regulators of plant development and the reaction to abiotic and biotic stress. Key enzymes in the cytokinin synthesis and degradation in modern land plants are the isopentenyl transferases and the cytokinin dehydrogenases, respectively. Their encoding genes were probably introduced into the plant lineage during the primary endosymbiosis. To shed light on the evolution of these proteins, the genes homologous to plant adenylate isopentenyl transferase and cytokinin dehydrogenase were amplified from the genomic DNA of the cyanobacterium Nostoc sp. PCC 7120 and expressed in Escherichia coli. The putative isopentenyl transferase was shown to be functional in a biochemical assay. In contrast, no enzymatic activity was detected for the putative cytokinin dehydrogenase, even though the principal domains necessary for its function are present. Several mutant variants, in which conserved amino acids in land plant cytokinin dehydrogenases had been restored, were inactive. A combination of experimental data with phylogenetic analysis indicates that adenylate-type isopentenyl transferases might have evolved several times independently. While the Nostoc genome contains a gene coding for a protein with characteristics of cytokinin dehydrogenase, the organism is not able to break down cytokinins in the way shown for land plants. PMID:26376297
First comparative insight into the architecture of COI mitochondrial minicircle molecules of dicyemids reveals marked inter-species variation.

PubMed

Catalano, Sarah R; Whittington, Ian D; Donnellan, Stephen C; Bertozzi, Terry; Gillanders, Bronwyn M

2015-07-01

Dicyemids, poorly known parasites of benthic cephalopods, are one of the few phyla in which mitochondrial (mt) genome architecture departs from the typical ~16 kb circular metazoan genome. In addition to a putative circular genome, a series of mt minicircles that each comprise the mt-encoded units (I-III) of the cytochrome c oxidase complex have been reported. Whether the structure of the mt minicircles is a consistent feature among dicyemid species is unknown. Here we analyse the complete cytochrome c oxidase subunit I (COI) minicircle molecule, containing the COI gene and an associated non-coding region (NCR), for ten dicyemid species, allowing for the first time comparisons between species of minicircle architecture, NCR function and inferences of minicircle replication. Divergence in COI nucleotide sequences between dicyemid species was high (average net divergence = 31.6%) while within-species diversity was lower (average net divergence = 0.2%). The NCR and putative 5' section of the COI gene were highly divergent between dicyemid species (average net nucleotide divergence of putative 5' COI section = 61.1%). No tRNA genes were found in the NCR, although palindrome sequences with the potential to form stem-loop structures were identified in some species, which may play a role in transcription or other biological processes.
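The divergence figures quoted in this abstract can be illustrated with a small sketch. The abstract does not say which estimator or substitution model was used, so the code below is only an assumption: it uses the uncorrected p-distance (proportion of differing aligned sites) and the common definition of net divergence as the mean between-group distance minus the average of the two within-group means. The function names and toy sequences are hypothetical.

```python
from itertools import combinations, product


def p_distance(a, b):
    """Proportion of differing sites between two aligned sequences (gaps skipped)."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper()) if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def mean_pairwise(group_a, group_b=None):
    """Mean p-distance within one group, or between two groups."""
    if group_b is None:
        pairs = list(combinations(group_a, 2))
    else:
        pairs = list(product(group_a, group_b))
    if not pairs:
        return 0.0  # a single sequence has no within-group diversity
    return sum(p_distance(x, y) for x, y in pairs) / len(pairs)


def net_divergence(group_a, group_b):
    """Between-group mean distance minus the average within-group mean distance."""
    within = (mean_pairwise(group_a) + mean_pairwise(group_b)) / 2
    return mean_pairwise(group_a, group_b) - within


# Toy aligned sequences, not real COI data.
species_1 = ["ACGT", "ACGA"]
species_2 = ["TCGT", "TCGT"]
print(net_divergence(species_1, species_2))
```

A model-corrected distance would normally replace `p_distance` when divergence is as high as the values reported here, because uncorrected distances saturate.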

Genome-wide identification and expression analysis of MAPK and MAPKK gene family in Malus domestica.

PubMed

Zhang, Shizhong; Xu, Ruirui; Luo, Xiaocui; Jiang, Zesheng; Shu, Huairui

2013-12-01

MAPK signal transduction modules, which are composed of three classes of hierarchically organized protein kinases (MAPKKKs, MAPKKs, and MAPKs), play crucial roles in regulating many biological processes in plants. Although genome-wide analysis of this family has been carried out in some species, little is known about MAPK and MAPKK genes in apple (Malus domestica). In this study, a total of 26 putative apple MAPK genes (MdMPKs) and 9 putative apple MAPKK genes (MdMKKs) have been identified and located within the apple genome. Phylogenetic analysis revealed that MdMAPKs and MdMAPKKs could be divided into 4 subfamilies (groups A, B, C and D), respectively. The predicted MdMAPKs and MdMAPKKs were distributed across 13 out of 17 chromosomes with different densities. In addition, analysis of exon-intron junctions and of intron phase inside the predicted coding region of each candidate gene has revealed high levels of conservation within and between phylogenetic groups. According to the microarray and expressed sequence tag (EST) analysis, the different expression patterns indicate that they may play different roles during fruit development and the rootstock-scion interaction process. Moreover, expression profile analyses of MAPK and MAPKK genes were performed in different tissues (root, stem, leaf, flower and fruit), and all of the selected genes were expressed in at least one of the tissues tested, indicating that the MAPKs and MAPKKs are involved in various aspects of physiological and developmental processes of apple. To our knowledge, this is the first report of a genome-wide analysis of the apple MAPK and MAPKK gene family. This study provides valuable information for understanding the classification and putative functions of the MAPK signal in apple. © 2013.
Transcriptional and Functional Studies of Acidithiobacillus ferrooxidans Genes Related to Survival in the Presence of Copper

PubMed Central

Navarro, Claudio A.; Orellana, Luis H.; Mauriaca, Cecilia; Jerez, Carlos A.

2009-01-01

The acidophilic Acidithiobacillus ferrooxidans can resist exceptionally high copper (Cu) concentrations. This property is important for its use in biomining processes, where Cu and other metal levels usually range between 15 and 100 mM. To learn about the mechanisms that allow A. ferrooxidans cells to survive in this environment, a bioinformatic search of its genome was carried out and showed the presence of at least 10 genes that are possibly related to Cu homeostasis. Among them are three genes coding for putative ATPases related to the transport of Cu (A. ferrooxidans copA1 [copA1Af], copA2Af, and copBAf), three genes related to a system of the resistance nodulation cell division family involved in the extraction of Cu from the cell (cusAAf, cusBAf, and cusCAf), and two genes coding for periplasmic chaperones for this metal (cusFAf and copCAf). The expression of most of these open reading frames was studied by real-time reverse transcriptase PCR using A. ferrooxidans cells adapted for growth in the presence of high concentrations of Cu. The putative A. ferrooxidans Cu resistance determinants were found to be upregulated when this bacterium was exposed to Cu in the range of 5 to 25 mM. These A. ferrooxidans genes conferred on Escherichia coli a greater Cu resistance than that of wild-type cells, supporting their functionality. The results reported here and previously published data strongly suggest that the high resistance of the extremophilic A. ferrooxidans to Cu may be due to part or all of the following key elements: (i) a wide repertoire of Cu resistance determinants, (ii) the duplication of some of these Cu resistance determinants, (iii) the existence of novel Cu chaperones, and (iv) a polyP-based Cu resistance system. PMID:19666734
Decoding sORF translation - from small proteins to gene regulation.

PubMed

Cabrera-Quio, Luis Enrique; Herberg, Sarah; Pauli, Andrea

2016-11-01

Translation is best known as the fundamental mechanism by which the ribosome converts a sequence of nucleotides into a string of amino acids. Extensive research over many years has elucidated the key principles of translation, and the majority of translated regions were thought to be known. The recent discovery of widespread translation outside of annotated protein-coding open reading frames (ORFs) therefore came as a surprise, raising the intriguing possibility that these newly discovered translated regions might have unrecognized protein-coding or gene-regulatory functions. Here, we highlight recent findings that provide evidence that some of these newly discovered translated short ORFs (sORFs) encode functional, previously missed small proteins, while others have regulatory roles. Based on known examples, we will also speculate about putative additional roles and the potentially much wider impact that these translated regions might have on cellular homeostasis and gene regulation.
Genomic evidence for genes encoding leucine-rich repeat receptors linked to resistance against the eukaryotic extra- and intracellular Brassica napus pathogens Leptosphaeria maculans and Plasmodiophora brassicae.

PubMed

Stotz, Henrik U; Harvey, Pascoe J; Haddadi, Parham; Mashanova, Alla; Kukol, Andreas; Larkan, Nicholas J; Borhan, M Hossein; Fitt, Bruce D L

2018-01-01

Genes coding for nucleotide-binding leucine-rich repeat (LRR) receptors (NLRs) control resistance against intracellular (cell-penetrating) pathogens. However, evidence for a role of genes coding for proteins with LRR domains in resistance against extracellular (apoplastic) fungal pathogens is limited. Here, the distribution of genes coding for proteins with eLRR domains but lacking kinase domains was determined for the Brassica napus genome. Predictions of signal peptide and transmembrane regions divided these genes into 184 coding for receptor-like proteins (RLPs) and 121 coding for secreted proteins (SPs). Together with previously annotated NLRs, a total of 720 LRR genes were found. Leptosphaeria maculans-induced expression during a compatible interaction with cultivar Topas differed between RLP, SP and NLR gene families; NLR genes were induced relatively late, during the necrotrophic phase of pathogen colonization. Seven RLP, one SP and two NLR genes were found in Rlm1 and Rlm3/Rlm4/Rlm7/Rlm9 loci for resistance against L. maculans on chromosome A07 of B. napus. One NLR gene at the Rlm9 locus was positively selected, as was the RLP gene on chromosome A10 with LepR3 and Rlm2 alleles conferring resistance against L. maculans races with corresponding effectors AvrLm1 and AvrLm2, respectively. Known loci for resistance against L. maculans (extracellular hemi-biotrophic fungus), Sclerotinia sclerotiorum (necrotrophic fungus) and Plasmodiophora brassicae (intracellular, obligate biotrophic protist) were examined for presence of RLPs, SPs and NLRs in these regions. Whereas loci for resistance against P. brassicae were enriched for NLRs, no such signature was observed for the other pathogens. These findings demonstrate involvement of (i) NLR genes in resistance against the intracellular pathogen P. brassicae and (ii) a putative NLR gene in Rlm9-mediated resistance against the extracellular pathogen L. maculans.
Establishing the role of rare coding variants in known Parkinson's disease risk loci.

PubMed

Jansen, Iris E; Gibbs, J Raphael; Nalls, Mike A; Price, T Ryan; Lubbe, Steven; van Rooij, Jeroen; Uitterlinden, André G; Kraaij, Robert; Williams, Nigel M; Brice, Alexis; Hardy, John; Wood, Nicholas W; Morris, Huw R; Gasser, Thomas; Singleton, Andrew B; Heutink, Peter; Sharma, Manu

2017-11-01

Many common genetic factors have been identified as contributing to Parkinson's disease (PD) susceptibility, improving our understanding of the related underlying biological mechanisms. The involvement of rarer variants in these loci has been poorly studied. Using International Parkinson's Disease Genomics Consortium data sets, we performed a comprehensive study to determine the impact of rare variants in 23 previously published genome-wide association study (GWAS) loci in PD. We applied Prix fixe, which is based on underlying functional similarities, to select the putative causal genes underneath the GWAS peaks. The Sequence Kernel Association Test was used to analyze the joint effect of rare, common, or both types of variants on PD susceptibility. All genes were tested simultaneously as a gene set and each gene individually. We observed a moderate association of common variants, confirming the involvement of the known PD risk loci within our genetic data sets. Focusing on rare variants, we identified additional association signals for LRRK2, STBD1, and SPATA19. Our study suggests an involvement of rare variants within several putatively causal genes underneath previously identified PD GWAS peaks. Copyright © 2017 Elsevier Inc. All rights reserved.
Genomewide identification and expression analysis of the ARF gene family in apple.

PubMed

Luo, Xiao-Cui; Sun, Mei-Hong; Xu, Rui-Rui; Shu, Huai-Rui; Wang, Jia-Wei; Zhang, Shi-Zhong

2014-12-01

Auxin response factors (ARF) are transcription factors that regulate auxin responses in plants. Although the genomewide analysis of this family has been performed in some species, little is known regarding ARF genes in apple (Malus domestica). In this study, 31 putative apple ARF genes have been identified and located within the apple genome. The phylogenetic analysis revealed that MdARFs could be divided into three subfamilies (groups I, II and III). The predicted MdARFs were distributed across 15 of 17 chromosomes with different densities. In addition, the analysis of exon-intron junctions and of the intron phase inside the predicted coding region of each candidate gene has revealed high levels of conservation within and between phylogenetic groups. Expression profile analyses of MdARF genes were performed in different tissues (root, stem, leaf, flower and fruit), and all the selected genes were expressed in at least one of the tissues that were tested, which indicated that MdARFs are involved in various aspects of physiological and developmental processes of apple. To our knowledge, this report is the first to provide a genomewide analysis of the apple ARF gene family. This study provides valuable information for understanding the classification and putative functions of the ARF signal in apple.
Plant U13 orthologues and orphan snoRNAs identified by RNomics of RNA from Arabidopsis nucleoli

PubMed Central

Kim, Sang Hyon; Spensley, Mark; Choi, Seung Kook; Calixto, Cristiane P. G.; Pendle, Ali F.; Koroleva, Olga; Shaw, Peter J.; Brown, John W. S.

2010-01-01

Small nucleolar RNAs (snoRNAs) and small Cajal body-specific RNAs (scaRNAs) are non-coding RNAs whose main function in eukaryotes is to guide the modification of nucleotides in ribosomal and spliceosomal small nuclear RNAs, respectively. Full-length sequences of Arabidopsis snoRNAs and scaRNAs have been obtained from cDNA libraries of capped and uncapped small RNAs using RNA from isolated nucleoli from Arabidopsis cell cultures. We have identified 31 novel snoRNA genes (9 box C/D and 22 box H/ACA) and 15 new variants of previously described snoRNAs. Three related capped snoRNAs with a distinct gene organization and structure were identified as orthologues of animal U13 snoRNAs. In addition, eight of the novel genes had no complementarity to rRNAs or snRNAs and are therefore putative orphan snoRNAs, potentially reflecting wider functions for these RNAs. The nucleolar localization of a number of the snoRNAs and the localization to nuclear bodies of two putative scaRNAs were confirmed by in situ hybridization. The majority of the novel snoRNA genes were found in new gene clusters or as part of previously described clusters. These results expand the repertoire of Arabidopsis snoRNAs to 188 snoRNA genes with 294 gene variants. PMID:20081206
Comprehensive analysis of single molecule sequencing-derived complete genome and whole transcriptome of Hyposidra talaca nuclear polyhedrosis virus.

PubMed

Nguyen, Thong T; Suryamohan, Kushal; Kuriakose, Boney; Janakiraman, Vasantharajan; Reichelt, Mike; Chaudhuri, Subhra; Guillory, Joseph; Divakaran, Neethu; Rabins, P E; Goel, Ridhi; Deka, Bhabesh; Sarkar, Suman; Ekka, Preety; Tsai, Yu-Chih; Vargas, Derek; Santhosh, Sam; Mohan, Sangeetha; Chin, Chen-Shan; Korlach, Jonas; Thomas, George; Babu, Azariah; Seshagiri, Somasekar

2018-06-12

We sequenced the Hyposidra talaca NPV (HytaNPV) double-stranded circular DNA genome using PacBio single molecule sequencing technology. We found that the HytaNPV genome is 139,089 bp long with a GC content of 39.6%. It encodes 141 open reading frames (ORFs) including the 37 baculovirus core genes, 25 genes conserved among lepidopteran baculoviruses, 72 genes known in baculovirus, and 7 genes unique to the HytaNPV genome. It is a group II alphabaculovirus that codes for the F protein and lacks the gp64 gene found in group I alphabaculoviruses. Using RNA-seq, we confirmed the expression of the ORFs identified in the HytaNPV genome. Phylogenetic analysis showed HytaNPV to be closest to BusuNPV, SujuNPV and EcobNPV that infect other tea pests, Buzura suppressaria, Sucra jujuba, and Ectropis obliqua, respectively. We identified repeat elements and a conserved non-coding baculovirus element in the genome. Analysis of the putative promoter sequences identified motifs consistent with the temporal expression of the genes observed in the RNA-seq data.
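As a rough illustration of what identifying ORFs in a genome sequence involves, the sketch below scans the three forward reading frames for ATG-to-stop stretches. It is not the annotation method used in the study, which the abstract does not describe; the function name, the `min_codons` threshold and the toy sequence are hypothetical, and a real baculovirus annotation would also need the reverse strand, the circular topology and a biologically sensible length cutoff.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_codons=2):
    """Return (start, end, frame) tuples for forward-strand ATG..stop ORFs.

    `end` is exclusive and includes the stop codon; `min_codons` counts the
    codons before the stop, including the ATG.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first start codon opens the ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None  # look for the next ORF in this frame
    return orfs


# Toy sequence, not viral data.
toy = "CCATGAAATTTTAGGG"
for start, end, frame in find_orfs(toy):
    print(frame, toy[start:end])
```

On the toy sequence the only hit is `ATGAAATTTTAG` in frame 2; the `TGA` in frame 0 is ignored because no start codon precedes it in that frame.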
The Nitrogen-Fixation Island Insertion Site Is Conserved in Diazotrophic Pseudomonas stutzeri and Pseudomonas sp. Isolated from Distal and Close Geographical Regions

PubMed Central

Venieraki, Anastasia; Dimou, Maria; Vezyri, Eleni; Vamvakas, Alexandros; Katinaki, Pagona-Artemis; Chatzipavlidis, Iordanis; Tampakaki, Anastasia; Katinakis, Panagiotis

2014-01-01

The presence of nitrogen fixers within the genus Pseudomonas has been established and so far most isolated strains are phylogenetically affiliated to Pseudomonas stutzeri. A gene ortholog neighborhood analysis of the nitrogen fixation island (NFI) in four diazotrophic P. stutzeri strains and Pseudomonas azotifigens revealed that all are flanked by genes coding for cobalamin synthase (cobS) and glutathione peroxidase (gshP). The putative NFIs lack all the features characterizing a mobilizable genomic island. Nevertheless, bioinformatic analysis of the P. stutzeri DSM 4166 NFI demonstrated the presence of short inverted and/or direct repeats within both flanking regions. The other P. stutzeri strains carry only one set of repeats. The genetic diversity of eleven diazotrophic Pseudomonas isolates was also investigated. Multilocus sequence typing grouped nine isolates along with P. stutzeri, and two isolates grouped in a separate clade. A Rep-PCR fingerprinting analysis grouped the eleven isolates into four distinct genotypes. We also provided evidence that the putative NFI in our diazotrophic Pseudomonas isolates is flanked by cobS and gshP genes. Furthermore, we demonstrated that the putative NFI of Pseudomonas sp. Gr65 is flanked by inverted repeats identical to those found in P. stutzeri DSM 4166, while the other P. stutzeri isolates harbor the repeats located in the intergenic region between cobS and glutaredoxin genes, as in the case of P. stutzeri A1501. Taken together, these data suggest that all putative NFIs of diazotrophic Pseudomonas isolates are anchored in an intergenic region between cobS and gshP genes and that their flanking regions are designated by distinct repeat patterns. Moreover, the presence of almost identical NFIs in diazotrophic Pseudomonas strains isolated from distal geographical locations around the world suggested that this horizontal gene transfer event may have taken place early in evolution. PMID:25251496
Non-parent of Origin Expression of Numerous Effector Genes Indicates a Role of Gene Regulation in Host Adaption of the Hybrid Triticale Powdery Mildew Pathogen

PubMed Central

Praz, Coraline R.; Menardo, Fabrizio; Robinson, Mark D.; Müller, Marion C.; Wicker, Thomas; Bourras, Salim; Keller, Beat

2018-01-01

Powdery mildew is an important disease of cereals. It is caused by one species, Blumeria graminis, which is divided into formae speciales each of which is highly specialized to one host. Recently, a new form capable of growing on triticale (B.g. triticale) has emerged through hybridization between wheat and rye mildews (B.g. tritici and B.g. secalis, respectively). In this work, we used RNA sequencing to study the molecular basis of host adaptation in B.g. triticale. We analyzed gene expression in three B.g. tritici isolates, two B.g. secalis isolates and two B.g. triticale isolates and identified a core set of putative effector genes that are highly expressed in all formae speciales. We also found that the genes differentially expressed between isolates of the same form as well as between different formae speciales were enriched in putative effectors. Their coding genes belong to several families including some which contain known members of mildew avirulence (Avr) and suppressor (Svr) genes. Based on these findings we propose that effectors play an important role in host adaptation that is mechanistically based on Avr-Resistance gene-Svr interactions. We also found that gene expression in the B.g. triticale hybrid is mostly conserved with the parent-of-origin, but some genes inherited from B.g. tritici showed a B.g. secalis-like expression. Finally, we identified 11 unambiguous cases of putative effector genes with hybrid-specific, non-parent of origin gene expression, and we propose that they are possible determinants of host specialization in triticale mildew. These data suggest that altered expression of multiple effector genes, in particular Avr and Svr related factors, might play a role in mildew host adaptation based on hybridization. PMID:29441081
Characterisation of the paralytic shellfish toxin biosynthesis gene clusters in Anabaena circinalis AWQC131C and Aphanizomenon sp. NH-5.

PubMed

Mihali, Troco K; Kellmann, Ralf; Neilan, Brett A

2009-03-30

Saxitoxin and its analogues, collectively known as the paralytic shellfish toxins (PSTs), are neurotoxic alkaloids and are the cause of the syndrome named paralytic shellfish poisoning. PSTs are produced by a unique biosynthetic pathway, which involves reactions that are rare in microbial metabolic pathways. Nevertheless, distantly related organisms such as dinoflagellates and cyanobacteria appear to produce these toxins using the same pathway. Hypothesised explanations for such an unusual phylogenetic distribution of this shared uncommon metabolic pathway include a polyphyletic origin, an involvement of symbiotic bacteria, and horizontal gene transfer. We describe the identification, annotation and bioinformatic characterisation of the putative paralytic shellfish toxin biosynthesis clusters in an Australian isolate of Anabaena circinalis and an American isolate of Aphanizomenon sp., both members of the Nostocales. These putative PST gene clusters span approximately 28 kb and contain genes coding for the biosynthesis and export of the toxin. A putative insertion/excision site in the Australian Anabaena circinalis AWQC131C was identified, and the organization and evolution of the gene clusters are discussed. A biosynthetic pathway leading to the formation of saxitoxin and its analogues in these organisms is proposed. The PST biosynthesis gene cluster presents a mosaic structure, whereby genes have apparently transposed in segments of varying size, resulting in different gene arrangements in all three sxt clusters sequenced so far. The gene cluster organizational structure and sequence similarity seem to reflect the phylogeny of the producer organisms, indicating that the gene clusters have an ancient origin, or that their lateral transfer was also an ancient event.
The knowledge we gain from the characterisation of the PST biosynthesis gene clusters, including the identity and sequence of the genes involved in the biosynthesis, may also afford the identification of these gene clusters in dinoflagellates, the cause of human mortalities and significant financial loss to the tourism and shellfish industries.
Characterisation of the paralytic shellfish toxin biosynthesis gene clusters in Anabaena circinalis AWQC131C and Aphanizomenon sp. NH-5

PubMed Central

Mihali, Troco K; Kellmann, Ralf; Neilan, Brett A

2009-01-01

Background: Saxitoxin and its analogues, collectively known as the paralytic shellfish toxins (PSTs), are neurotoxic alkaloids and are the cause of the syndrome named paralytic shellfish poisoning. PSTs are produced by a unique biosynthetic pathway, which involves reactions that are rare in microbial metabolic pathways. Nevertheless, distantly related organisms such as dinoflagellates and cyanobacteria appear to produce these toxins using the same pathway. Hypothesised explanations for such an unusual phylogenetic distribution of this shared uncommon metabolic pathway include a polyphyletic origin, an involvement of symbiotic bacteria, and horizontal gene transfer. Results: We describe the identification, annotation and bioinformatic characterisation of the putative paralytic shellfish toxin biosynthesis clusters in an Australian isolate of Anabaena circinalis and an American isolate of Aphanizomenon sp., both members of the Nostocales. These putative PST gene clusters span approximately 28 kb and contain genes coding for the biosynthesis and export of the toxin. A putative insertion/excision site in the Australian Anabaena circinalis AWQC131C was identified, and the organization and evolution of the gene clusters are discussed. A biosynthetic pathway leading to the formation of saxitoxin and its analogues in these organisms is proposed. Conclusion: The PST biosynthesis gene cluster presents a mosaic structure, whereby genes have apparently transposed in segments of varying size, resulting in different gene arrangements in all three sxt clusters sequenced so far. The gene cluster organizational structure and sequence similarity seem to reflect the phylogeny of the producer organisms, indicating that the gene clusters have an ancient origin, or that their lateral transfer was also an ancient event.
The knowledge we gain from the characterisation of the PST biosynthesis gene clusters, including the identity and sequence of the genes involved in the biosynthesis, may also afford the identification of these gene clusters in dinoflagellates, the cause of human mortalities and significant financial loss to the tourism and shellfish industries. PMID:19331657
Lineage-Specific Genome Architecture Links Enhancers and Non-coding Disease Variants to Target Gene Promoters.

PubMed

Javierre, Biola M; Burren, Oliver S; Wilder, Steven P; Kreuzhuber, Roman; Hill, Steven M; Sewitz, Sven; Cairns, Jonathan; Wingett, Steven W; Várnai, Csilla; Thiecke, Michiel J; Burden, Frances; Farrow, Samantha; Cutler, Antony J; Rehnström, Karola; Downes, Kate; Grassi, Luigi; Kostadima, Myrto; Freire-Pritchett, Paula; Wang, Fan; Stunnenberg, Hendrik G; Todd, John A; Zerbino, Daniel R; Stegle, Oliver; Ouwehand, Willem H; Frontini, Mattia; Wallace, Chris; Spivakov, Mikhail; Fraser, Peter

2016-11-17

Long-range interactions between regulatory elements and gene promoters play key roles in transcriptional regulation. The vast majority of interactions are uncharted, constituting a major missing link in understanding genome control. Here, we use promoter capture Hi-C to identify interacting regions of 31,253 promoters in 17 human primary hematopoietic cell types. We show that promoter interactions are highly cell type specific and enriched for links between active promoters and epigenetically marked enhancers. Promoter interactomes reflect lineage relationships of the hematopoietic tree, consistent with dynamic remodeling of nuclear architecture during differentiation. Interacting regions are enriched in genetic variants linked with altered expression of genes they contact, highlighting their functional role. We exploit this rich resource to connect non-coding disease variants to putative target promoters, prioritizing thousands of disease-candidate genes and implicating disease pathways. Our results demonstrate the power of primary cell promoter interactomes to reveal insights into genomic regulatory mechanisms underlying common diseases. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
The complete mitochondrial genome of Sika deer Cervus nippon hortulorum (Artiodactyla: Cervidae) and phylogenetic studies.

PubMed

Liu, Yan-Hua; Liu, Xin-Xin; Zhang, Ming-Hai

2016-07-01

Sika deer (Cervus nippon Temminck 1836) are classified in the order Artiodactyla, family Cervidae, subfamily Cervinae. At present, the phylogenetic studies of C. nippon are problematic. In this study, we first determined and described the complete mitochondrial sequence of the wild C. nippon hortulorum. The complete mitogenome sequence is 16 566 bp in length, including 13 protein-coding genes, two rRNA genes, 22 tRNA genes, a putative control region (CR) and a light-strand replication origin (OL). The overall base composition was 33.4% A, 28.6% T, 24.5% C, 13.5% G, with a 62.0% AT bias. The 13 protein-coding genes encode 3782 amino acids in total. To further validate the newly determined sequences and phylogeny of Sika deer, phylogenetic trees involving the 15 most closely related species available in the GenBank database were constructed. These results are expected to provide useful molecular data for deer species identification and further phylogenetic studies of Artiodactyla.
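The base-composition figures in this abstract follow from simple counting, and the reported AT bias is just the sum of the A and T percentages (33.4 + 28.6 = 62.0). The sketch below shows that calculation on a toy string; the function name and the example sequence are hypothetical, and ambiguous bases such as N are simply ignored here, which is an assumption rather than anything the abstract states.

```python
from collections import Counter


def base_composition(seq):
    """Return the percentage of A, C, G and T among unambiguous bases."""
    counts = Counter(base for base in seq.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return {base: 100 * counts[base] / total for base in "ACGT"}


# Toy sequence, not the C. nippon hortulorum mitogenome.
comp = base_composition("AAAATTTCCG")
print(comp, "AT% =", comp["A"] + comp["T"])

# Consistency check on the percentages quoted in the abstract.
reported = {"A": 33.4, "T": 28.6, "C": 24.5, "G": 13.5}
print(round(reported["A"] + reported["T"], 1))
```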
Molecular cloning, structural analysis, and expression in Escherichia coli of a chitinase gene from Enterobacter agglomerans.

PubMed Central

Chernin, L S; De la Fuente, L; Sobolev, V; Haran, S; Vorgias, C E; Oppenheim, A B; Chet, I

1997-01-01

The gene chiA, which codes for endochitinase, was cloned from a soilborne Enterobacter agglomerans. Its complete sequence was determined, and the deduced amino acid sequence of the enzyme designated Chia_Entag yielded an open reading frame coding for 562 amino acids of a 61-kDa precursor protein with a putative leader peptide at its N terminus. The nucleotide and polypeptide sequences of Chia_Entag showed 86.8 and 87.7% identity with the corresponding gene and enzyme, Chia_Serma, of Serratia marcescens, respectively. Homology modeling of Chia_Entag's three-dimensional structure demonstrated that most amino acid substitutions are at solvent-accessible sites. Escherichia coli JM109 carrying the E. agglomerans chiA gene produced and secreted Chia_Entag. The antifungal activity of the secreted endochitinase was demonstrated in vitro by inhibition of Fusarium oxysporum spore germination. The transformed strain inhibited Rhizoctonia solani growth on plates and the root rot disease caused by this fungus in cotton seedlings under greenhouse conditions. PMID:9055404
Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae.

PubMed

Michel, Christian J; Ngoune, Viviane Nguefack; Poch, Olivier; Ripp, Raymond; Thompson, Julie D

2017-12-03

A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae. Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae. We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae, but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence.
These results, taken together, represent the first evidence for a significant enrichment of X motifs in the genes of an extant organism. They raise two hypotheses: the X motifs may be evolutionary relics of the primitive codes used for translation, or they may continue to play a functional role in the complex processes of genome decoding and protein synthesis.
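The frame comparison described in this abstract amounts to counting how often trinucleotides from a given set occur in the reading frame versus the two shifted frames. A minimal sketch of that count is given below. The two-member set used here is a hypothetical toy set, not the 20-trinucleotide code X, and the sketch does not reproduce the authors' X-motif definition or their comparison against random codes.

```python
def frame_counts(cds: str, code: set[str]) -> list[int]:
    """Count trinucleotides belonging to `code` in frames 0, 1 and 2 of cds.

    Frame 0 is taken to be the reading frame; frames 1 and 2 are the shifted frames.
    """
    cds = cds.upper()
    counts = [0, 0, 0]
    for frame in range(3):
        # Step through complete, non-overlapping trinucleotides in this frame.
        for i in range(frame, len(cds) - 2, 3):
            if cds[i:i + 3] in code:
                counts[frame] += 1
    return counts


# Hypothetical toy set and toy coding sequence, for illustration only.
toy_code = {"GAA", "GTC"}
print(frame_counts("GAAGTCGAA", toy_code))  # frame 0 dominates for this toy input
```

A set that behaves like X would show a consistently higher count in frame 0 than in frames 1 and 2 when summed over many genes.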
Evolution of coding and non-coding genes in HOX clusters of a marsupial.

PubMed

Yu, Hongshi; Lindsay, James; Feng, Zhi-Ping; Frankenberg, Stephen; Hu, Yanqiu; Carone, Dawn; Shaw, Geoff; Pask, Andrew J; O'Neill, Rachel; Papenfuss, Anthony T; Renfree, Marilyn B

2012-06-18

The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predates the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial.
Evolution of coding and non-coding genes in HOX clusters of a marsupial

PubMed Central

2012-01-01

Background The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Results Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. Conclusions This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predates the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial. PMID:22708672
The complete sequences and gene organisation of the mitochondrial genomes of the heterodont bivalves Acanthocardia tuberculata and Hiatella arctica – and the first record for a putative Atpase subunit 8 gene in marine bivalves

PubMed Central

Dreyer, Hermann; Steiner, Gerhard

2006-01-01

Background Mitochondrial (mt) gene arrangement is highly variable among molluscs and especially among bivalves. Of the 30 complete molluscan mt-genomes published to date, only one is of a heterodont bivalve, although this is the most diverse taxon in terms of species numbers. We determined the complete sequence of the mitochondrial genomes of Acanthocardia tuberculata and Hiatella arctica (Mollusca, Bivalvia, Heterodonta) and describe their gene contents and genome organisations to assess the variability of these features among the Bivalvia and their value for phylogenetic inference. Results The size of the mt-genome in Acanthocardia tuberculata is 16,104 base pairs (bp), and in Hiatella arctica 18,244 bp. The Acanthocardia mt-genome contains 12 of the typical protein coding genes, lacking the Atpase subunit 8 (atp8) gene, as in all published marine bivalves. In contrast, a complete atp8 gene is present in Hiatella arctica. In addition, we found a putative truncated atp8 gene when re-annotating the mt-genome of Venerupis philippinarum. Both mt-genomes reported here encode all genes on the same strand and have an additional trnM. In Acanthocardia several large non-coding regions are present. One of these contains 3.5 nearly identical copies of a 167 bp motif. In Hiatella, the 3' end of the NADH dehydrogenase subunit (nad)6 gene is duplicated together with the adjacent non-coding region. The gene arrangement of Hiatella is markedly different from all other known molluscan mt-genomes; that of Acanthocardia shows few identities with that of Venerupis philippinarum. Phylogenetic analyses on amino acid and nucleotide levels robustly support the Heterodonta and the sister group relationship of Acanthocardia and Venerupis. Monophyletic Bivalvia are resolved only by a Bayesian inference of the nucleotide data set. In all other analyses the two unionid species, being the only ones with genes located on both strands, do not group with the remaining bivalves.
Conclusion The two mt-genomes reported here add to and underline the high variability of gene order and presence of duplications in bivalve and molluscan taxa. Some genomic traits like the loss of the atp8 gene or the encoding of all genes on the same strand are homoplastic among the Bivalvia. These characters, gene order, and the nucleotide sequence data show considerable potential for resolving phylogenetic patterns at lower taxonomic levels. PMID:16948842

Arrangement of the Clostridium baratii F7 Toxin Gene Cluster with Identification of a σ Factor That Recognizes the Botulinum Toxin Gene Cluster Promoters

DOE PAGES

Dover, Nir; Barash, Jason R.; Burke, Julianne N.; ...

2014-05-22

Botulinum neurotoxin (BoNT) is the most poisonous substance known, and its eight toxin types (A to H) are distinguished by the inability of polyclonal antibodies that neutralize one toxin type to neutralize any of the other seven toxin types. Infant botulism, an intestinal toxemia orphan disease, is the most common form of human botulism in the United States. It results from swallowed spores of Clostridium botulinum (or rarely, neurotoxigenic Clostridium butyricum or Clostridium baratii) that germinate and temporarily colonize the lumen of the large intestine, where, as vegetative cells, they produce botulinum toxin. Botulinum neurotoxin is encoded by the bont gene that is part of a toxin gene cluster that includes several accessory genes. In this paper, we sequenced for the first time the complete botulinum neurotoxin gene cluster of nonproteolytic C. baratii type F7. Like the type E and the nonproteolytic type F6 botulinum toxin gene clusters, the C. baratii type F7 had an orfX toxin gene cluster that lacked the regulatory botR gene which is found in proteolytic C. botulinum strains and codes for an alternative σ factor. In the absence of botR, we identified a putative alternative regulatory gene located upstream of the C. baratii type F7 toxin gene cluster. This putative regulatory gene codes for a predicted σ factor that contains DNA-binding-domain homologues to the DNA-binding domains both of BotR and of other members of the TcdR-related group 5 of the σ70 family that are involved in the regulation of toxin gene expression in clostridia. We showed that this TcdR-related protein in association with RNA polymerase core enzyme specifically binds to the C. baratii type F7 botulinum toxin gene cluster promoters. Finally, this TcdR-related protein may therefore be involved in regulating the expression of the genes of the botulinum toxin gene cluster in neurotoxigenic C. baratii.
Genome of Epinotia aporema granulovirus (EpapGV), a polyorganotropic fast killing betabaculovirus with a novel thymidylate kinase gene

PubMed Central

2012-01-01

Background Epinotia aporema (Lepidoptera: Tortricidae) is an important pest of legume crops in South America. Epinotia aporema granulovirus (EpapGV) is a baculovirus that causes a polyorganotropic infection in the host larva. Its high pathogenicity and host specificity make EpapGV an excellent candidate to be used as a biological control agent. Results The genome of Epinotia aporema granulovirus (EpapGV) was sequenced and analyzed. Its circular double-stranded DNA genome is 119,082 bp in length and codes for 133 putative genes. It contains the 31 baculovirus core genes and a set of 19 genes that are GV exclusive. Seventeen ORFs were unique to EpapGV in comparison with other baculoviruses. Of these, 16 found no homologues in GenBank, and one encoded a thymidylate kinase. Analysis of nucleotide sequence repeats revealed the presence of 16 homologous regions (hrs) interspersed throughout the genome. Each hr was characterized by the presence of 1 to 3 clustered imperfect palindromes which are similar to previously described palindromes of tortricid-specific GVs. Also, one of the hrs (hr4) has flanking sequences suggestive of a putative non-hr ori. Interestingly, two more complex hrs were found in opposite loci, dividing the circular dsDNA genome into two halves. Gene synteny maps showed the great colinearity of sequenced GVs, with EpapGV being the most dissimilar, as it has a 20 kb-long gene block inversion. Phylogenetic study performed with 31 core genes of 58 baculoviral genomes suggests that EpapGV is the baculovirus isolate closest to the putative common ancestor of tortricid specific betabaculoviruses. Conclusions This study, along with previous characterization of EpapGV infection, is useful for the better understanding of the pathology caused by this virus and its potential utilization as a bioinsecticide. PMID:23051685
Cloning, overexpression and interaction of recombinant Fur from the cyanobacterium Anabaena PCC 7119 with isiB and its own promoter.

PubMed

Bes, M T; Hernández, J A; Peleato, M L; Fillat, M F

2001-01-15

A gene coding for a Fur (ferric uptake regulation) protein from the cyanobacterium Anabaena PCC 7119 has been cloned and overexpressed in Escherichia coli. DNA sequence analysis confirmed the presence of a 151-amino-acid open reading frame that showed homology with the Fur proteins reported for the unicellular cyanobacteria Synechococcus 7942 and Synechocystis PCC 6803. Two putative Fur-binding sites were detected in the promoter regions of the fur gene from Anabaena. Partially purified recombinant Fur binds to the flavodoxin promoter as well as its own promoter. This suggests that the Fur gene is autoregulated in Anabaena.
Evidence for an ergot alkaloid gene cluster in Claviceps purpurea.

PubMed

Tudzynski, P; Hölter, K; Correia, T; Arntz, C; Grammel, N; Keller, U

1999-02-01

A gene (cpd1) coding for the dimethylallyltryptophan synthase (DMATS) that catalyzes the first specific step in the biosynthesis of ergot alkaloids, was cloned from a strain of Claviceps purpurea that produces alkaloids in axenic culture. The derived gene product (CPD1) shows only 70% similarity to the corresponding gene previously isolated from Claviceps strain ATCC 26245, which is likely to be an isolate of C. fusiformis. Therefore, the related cpd1 most probably represents the first C. purpurea gene coding for an enzymatic step of the alkaloid biosynthetic pathway to be cloned. Analysis of the 3'-flanking region of cpd1 revealed a second, closely linked ergot alkaloid biosynthetic gene named cpps1, which codes for a 356-kDa polypeptide showing significant similarity to fungal modular peptide synthetases. The protein contains three amino acid-activating modules, and in the second module a sequence is found which matches that of an internal peptide (17 amino acids in length) obtained from a tryptic digest of lysergyl peptide synthetase 1 (LPS1) of C. purpurea, thus confirming that cpps1 encodes LPS1. LPS1 activates the three amino acids of the peptide portion of ergot peptide alkaloids during D-lysergyl peptide assembly. Chromosome walking revealed the presence of additional genes upstream of cpd1 which are probably also involved in ergot alkaloid biosynthesis: cpox1 probably codes for an FAD-dependent oxidoreductase (which could represent the chanoclavine cyclase), and a second putative oxidoreductase gene, cpox2, is closely linked to it in inverse orientation. RT-PCR experiments confirm that all four genes are expressed under conditions of peptide alkaloid biosynthesis. These results strongly suggest that at least some genes of ergot alkaloid biosynthesis in C. purpurea are clustered, opening the way for a detailed molecular genetic analysis of the pathway.
High quality draft genome sequence of Olivibacter sitiensis type strain (AW-6T), a diphenol degrader with genes involved in the catechol pathway

PubMed Central

Ntougias, Spyridon; Lapidus, Alla; Han, James; Mavromatis, Konstantinos; Pati, Amrita; Chen, Amy; Klenk, Hans-Peter; Woyke, Tanja; Fasseas, Constantinos; Kyrpides, Nikos C.; Zervakis, Georgios I.

2014-01-01

Olivibacter sitiensis Ntougias et al. 2007 is a member of the family Sphingobacteriaceae, phylum Bacteroidetes. Members of the genus Olivibacter are phylogenetically diverse and of significant interest. They occur in diverse habitats, such as rhizosphere and contaminated soils, viscous wastes, composts, biofilter clean-up facilities on contaminated sites and cave environments, and they are involved in the degradation of complex and toxic compounds. Here we describe the features of O. sitiensis AW-6T, together with the permanent-draft genome sequence and annotation. The organism was sequenced under the Genomic Encyclopedia for Bacteria and Archaea (GEBA) project at the DOE Joint Genome Institute and is the first genome sequence of a species within the genus Olivibacter. The genome is 5,053,571 bp long and comprises 110 scaffolds with an average GC content of 44.61%. Of the 4,565 genes predicted, 4,501 were protein-coding genes and 64 were RNA genes. Most protein-coding genes (68.52%) were assigned to a putative function. The identification of 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase-coding genes indicates involvement of this organism in the catechol catabolic pathway. In addition, genes encoding β-1,4-xylanases and β-1,4-xylosidases reveal the xylanolytic action of O. sitiensis. PMID:25197463
Conserved Non-Coding Regulatory Signatures in Arabidopsis Co-Expressed Gene Modules

PubMed Central

Spangler, Jacob B.; Ficklin, Stephen P.; Luo, Feng; Freeling, Michael; Feltus, F. Alex

2012-01-01

Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome. PMID:23024789
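A co-expression network of the kind described above links genes whose expression profiles are strongly correlated across samples. The sketch below builds such a network from a toy expression table by thresholding pairwise Pearson correlation. The abstract does not state the similarity measure or cutoff the authors used, so the choice of Pearson correlation, the 0.9 threshold, the function names and the toy data are all assumptions.

```python
from itertools import combinations
from math import sqrt


def pearson(x: list[float], y: list[float]) -> float:
    """Pearson correlation of two equal-length profiles (0.0 if either is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0
    return cov / (sd_x * sd_y)


def coexpression_edges(expr: dict[str, list[float]], threshold: float = 0.9):
    """Return gene pairs whose absolute correlation meets the (assumed) threshold."""
    edges = []
    for g1, g2 in combinations(sorted(expr), 2):
        if abs(pearson(expr[g1], expr[g2])) >= threshold:
            edges.append((g1, g2))
    return edges


# Hypothetical expression values across four samples, not Arabidopsis data.
toy_expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.0],
    "geneC": [5.0, 1.0, 4.0, 2.0],
}
print(coexpression_edges(toy_expr))
```

Modules would then be obtained by clustering the resulting graph; that step is omitted here because the abstract gives no detail on how modules were circumscribed.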
Conserved non-coding regulatory signatures in Arabidopsis co-expressed gene modules.

PubMed

Spangler, Jacob B; Ficklin, Stephen P; Luo, Feng; Freeling, Michael; Feltus, F Alex

2012-01-01

Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome.
Nucleotide sequences of two genomic DNAs encoding peroxidase of Arabidopsis thaliana.

PubMed

Intapruk, C; Higashimura, N; Yamamoto, K; Okada, N; Shinmyo, A; Takano, M

1991-02-15

The peroxidase (EC 1.11.1.7)-encoding gene of Arabidopsis thaliana was screened from a genomic library using a cDNA encoding a neutral isozyme of horseradish, Armoracia rusticana, peroxidase (HRP) as a probe, and two positive clones were isolated. From the comparison with the sequences of the HRP-encoding genes, we concluded that two clones contained peroxidase-encoding genes, and they were named prxCa and prxEa. Both genes consisted of four exons and three introns; the introns had consensus nucleotides, GT and AG, at the 5' and 3' ends, respectively. The lengths of each putative exon of the prxEa gene were the same as those of the HRP-basic-isozyme-encoding gene, prxC3, and coded for 349 amino acids (aa) with a sequence homology of 89% to that encoded by prxC3. The prxCa gene was very close to the HRP-neutral-isozyme-encoding gene, prxC1b, and coded for 354 aa with 91% homology to that encoded by prxC1b. The aa sequence homology was 64% between the two peroxidases encoded by prxCa and prxEa.
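The abstract notes that each intron carries the consensus dinucleotides GT at the 5' end and AG at the 3' end. A minimal check for that GT-AG rule is sketched below; the function name and the toy intron strings are assumptions for illustration, and no sequence from prxCa or prxEa is used.

```python
def has_gt_ag_boundaries(intron: str) -> bool:
    """Return True if the intron starts with GT and ends with AG (GT-AG rule)."""
    intron = intron.upper()
    # At least four bases are needed so the GT and AG dinucleotides do not overlap.
    return len(intron) >= 4 and intron.startswith("GT") and intron.endswith("AG")


# Hypothetical toy introns.
print(has_gt_ag_boundaries("GTAAGTTTTCAG"))
print(has_gt_ag_boundaries("GCAAGTTTTCAG"))
```

A check like this only tests the boundary dinucleotides; it says nothing about branch points or other splicing signals.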
Localization of the lysine epsilon-aminotransferase (lat) and delta-(L-alpha-aminoadipyl)-L-cysteinyl-D-valine synthetase (pcbAB) genes from Streptomyces clavuligerus and production of lysine epsilon-aminotransferase activity in Escherichia coli.

PubMed Central

Tobin, M B; Kovacevic, S; Madduri, K; Hoskins, J A; Skatrud, P L; Vining, L C; Stuttard, C; Miller, J R

1991-01-01

Lysine epsilon-aminotransferase (LAT) in the beta-lactam-producing actinomycetes is considered to be the first step in the antibiotic biosynthetic pathway. Cloning of restriction fragments from Streptomyces clavuligerus, a beta-lactam producer, into Streptomyces lividans, a nonproducer that lacks LAT activity, led to the production of LAT in the host. DNA sequencing of restriction fragments containing the putative lat gene revealed a single open reading frame encoding a polypeptide with an Mr of approximately 49,000. Expression of this coding sequence in Escherichia coli led to the production of LAT activity. Hence, LAT activity in S. clavuligerus is derived from a single polypeptide. A second open reading frame began immediately downstream from lat. Comparison of this partial sequence with the sequences of delta-(L-alpha-aminoadipyl)-L-cysteinyl-D-valine (ACV) synthetases from Penicillium chrysogenum and Cephalosporium acremonium and with nonribosomal peptide synthetases (gramicidin S and tyrocidine synthetases) found similarities among the open reading frames. Since mapping of the putative N and C termini of S. clavuligerus pcbAB suggests that the coding region occupies approximately 12 kbp and codes for a polypeptide related in size to the fungal ACV synthetases, the molecular characterization of the beta-lactam biosynthetic cluster between pcbC and cefE (approximately 25 kbp) is nearly complete. PMID:1917855
A global transcriptional analysis of Plasmodium falciparum malaria reveals a novel family of telomere-associated lncRNAs

PubMed Central

2011-01-01

Background Mounting evidence suggests a major role for epigenetic feedback in Plasmodium falciparum transcriptional regulation. Long non-coding RNAs (lncRNAs) have recently emerged as a new paradigm in epigenetic remodeling. We therefore set out to investigate putative roles for lncRNAs in P. falciparum transcriptional regulation. Results We used a high-resolution DNA tiling microarray to survey transcriptional activity across 22.6% of the P. falciparum strain 3D7 genome. We identified 872 protein-coding genes and 60 putative P. falciparum lncRNAs under developmental regulation during the parasite's pathogenic human blood stage. Further characterization of lncRNA candidates led to the discovery of an intriguing family of lncRNA telomere-associated repetitive element transcripts, termed lncRNA-TARE. We have quantified lncRNA-TARE expression at 15 distinct chromosome ends and mapped putative transcriptional start and termination sites of lncRNA-TARE loci. Remarkably, we observed coordinated and stage-specific expression of lncRNA-TARE on all chromosome ends tested, and two dominant transcripts of approximately 1.5 kb and 3.1 kb transcribed towards the telomere. Conclusions We have characterized a family of 22 telomere-associated lncRNAs in P. falciparum. Homologous lncRNA-TARE loci are coordinately expressed after parasite DNA replication, and are poised to play an important role in P. falciparum telomere maintenance, virulence gene regulation, and potentially other processes of parasite chromosome end biology. Further study of lncRNA-TARE and other promising lncRNA candidates may provide mechanistic insight into P. falciparum transcriptional regulation. PMID:21689454
Cloning, Expression, and Nucleotide Sequence of the Pseudomonas aeruginosa 142 ohb Genes Coding for Oxygenolytic ortho Dehalogenation of Halobenzoates

PubMed Central

Tsoi, Tamara V.; Plotnikova, Elena G.; Cole, James R.; Guerin, William F.; Bagdasarian, Michael; Tiedje, James M.

1999-01-01

We have cloned and characterized novel oxygenolytic ortho-dehalogenation (ohb) genes from 2-chlorobenzoate (2-CBA)- and 2,4-dichlorobenzoate (2,4-dCBA)-degrading Pseudomonas aeruginosa 142. Among 3,700 Escherichia coli recombinants, two clones, DH5αF′(pOD22) and DH5αF′(pOD33), converted 2-CBA to catechol and 2,4-dCBA and 2,5-dCBA to 4-chlorocatechol. A subclone of pOD33, plasmid pE43, containing the 3,687-bp minimized ohb DNA region conferred to P. putida PB2440 the ability to grow on 2-CBA as a sole carbon source. Strain PB2440(pE43) also oxidized but did not grow on 2,4-dCBA, 2,5-dCBA, or 2,6-dCBA. Terminal oxidoreductase ISPOHB structural genes ohbA and ohbB, which encode polypeptides with molecular masses of 20,253 Da (β-ISP) and 48,243 Da (α-ISP), respectively, were identified; these proteins are in accord with the 22- and 48-kDa (as determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis) polypeptides synthesized in E. coli and P. aeruginosa parental strain 142. The ortho-halobenzoate 1,2-dioxygenase activity was manifested in the absence of ferredoxin and reductase genes, suggesting that the ISPOHB utilized electron transfer components provided by the heterologous hosts. ISPOHB formed a new phylogenetic cluster that includes aromatic oxygenases featuring atypical structural-functional organization and is distant from the other members of the family of primary aromatic oxygenases. A putative IclR-type regulatory gene (ohbR) was located upstream of the ohbAB genes. An open reading frame (ohbC) of unknown function that overlaps lengthwise with ohbB but is transcribed in the opposite direction was found. The ohbC gene codes for a 48,969-Da polypeptide, in accord with the 49-kDa protein detected in E. coli. The ohb genes are flanked by an IS1396-like sequence containing a putative gene for a 39,715-Da transposase A (tnpA) at positions 4731 to 5747 and a putative gene for a 45,247-Da DNA topoisomerase I/III (top) at positions 346 to 1563. 
The ohb DNA region is bordered by 14-bp imperfect inverted repeats at positions 56 to 69 and 5984 to 5997. PMID:10224014
The Putative C2H2 Transcription Factor MtfA Is a Novel Regulator of Secondary Metabolism and Morphogenesis in Aspergillus nidulans

PubMed Central

Ramamoorthy, Vellaisamy; Dhingra, Sourabh; Kincaid, Alexander; Shantappa, Sourabha; Feng, Xuehuan; Calvo, Ana M.

2013-01-01

Secondary metabolism in the model fungus Aspergillus nidulans is controlled by the conserved global regulator VeA, which also governs morphological differentiation. Among the secondary metabolites regulated by VeA is the mycotoxin sterigmatocystin (ST). The presence of VeA is necessary for the biosynthesis of this carcinogenic compound. We identified a revertant mutant able to synthesize ST intermediates in the absence of VeA. The point mutation occurred at the coding region of a gene encoding a novel putative C2H2 zinc finger domain transcription factor that we denominated mtfA. The A. nidulans mtfA gene product localizes at nuclei independently of the illumination regime. Deletion of the mtfA gene restores mycotoxin biosynthesis in the absence of veA, but drastically reduced mycotoxin production when mtfA gene expression was altered, by deletion or overexpression, in A. nidulans strains with a veA wild-type allele. Our study revealed that mtfA regulates ST production by affecting the expression of the specific ST gene cluster activator aflR. Importantly, mtfA is also a regulator of other secondary metabolism gene clusters, such as genes responsible for the synthesis of terrequinone and penicillin. As in the case of ST, deletion or overexpression of mtfA was also detrimental for the expression of terrequinone genes. Deletion of mtfA also decreased the expression of the genes in the penicillin gene cluster, reducing penicillin production. However, in this case, over-expression of mtfA enhanced the transcription of penicillin genes, increasing penicillin production more than 5 fold with respect to the control. Importantly, in addition to its effect on secondary metabolism, mtfA also affects asexual and sexual development in A. nidulans. Deletion of mtfA results in a reduction of conidiation and sexual stage. We found mtfA putative orthologs conserved in other fungal species. PMID:24066102
Repression of YdaS Toxin Is Mediated by Transcriptional Repressor RacR in the Cryptic rac Prophage of Escherichia coli K-12.

PubMed

Krishnamurthi, Revathy; Ghosh, Swagatha; Khedkar, Supriya; Seshasayee, Aswin Sai Narain

2017-01-01

Horizontal gene transfer is a major driving force behind the genomic diversity seen in prokaryotes. The cryptic rac prophage in Escherichia coli K-12 carries the gene for a putative transcription factor RacR, whose deletion is lethal. We have shown that the essentiality of racR in E. coli K-12 is attributed to its role in transcriptionally repressing toxin gene(s) called ydaS and ydaT, which are adjacent to and coded divergently to racR. IMPORTANCE Transcription factors in the bacterium E. coli are rarely essential, and when they are essential, they are largely toxin-antitoxin systems. While studying transcription factors encoded in horizontally acquired regions in E. coli, we realized that the protein RacR, a putative transcription factor encoded by a gene on the rac prophage, is an essential protein. Here, using genetics, biochemistry, and bioinformatics, we show that its essentiality derives from its role as a transcriptional repressor of the ydaS and ydaT genes, whose products are toxic to the cell. Unlike type II toxin-antitoxin systems in which transcriptional regulation involves complexes of the toxin and antitoxin, repression by RacR is sufficient to keep ydaS transcriptionally silent.
A comparative genomics perspective on the genetic content of the alkaliphilic haloarchaeon Natrialba magadii ATCC 43099T

PubMed Central

2012-01-01

Background Natrialba magadii is an aerobic chemoorganotrophic member of the Euryarchaeota and is a dual extremophile requiring alkaline conditions and hypersalinity for optimal growth. The genome sequence of Nab. magadii type strain ATCC 43099 was deciphered to obtain a comprehensive insight into the genetic content of this haloarchaeon and to understand the basis of some of the cellular functions necessary for its survival. Results The genome of Nab. magadii consists of four replicons with a total sequence of 4,443,643 bp and encodes 4,212 putative proteins, some of which contain peptide repeats of various lengths. Comparative genome analyses facilitated the identification of genes encoding putative proteins involved in adaptation to hypersalinity, stress response, glycosylation, and polysaccharide biosynthesis. A proton-driven ATP synthase and a variety of putative cytochromes and other proteins supporting aerobic respiration and electron transfer were encoded by one or more of Nab. magadii replicons. The genome encodes a number of putative proteases/peptidases as well as protein secretion functions. Genes encoding putative transcriptional regulators, basal transcription factors, signal perception/transduction proteins, and chemotaxis/phototaxis proteins were abundant in the genome. Pathways for the biosynthesis of thiamine, riboflavin, heme, cobalamin, coenzyme F420 and other essential co-factors were deduced by in depth sequence analyses. However, approximately 36% of Nab. magadii protein coding genes could not be assigned a function based on Blast analysis and have been annotated as encoding hypothetical or conserved hypothetical proteins. Furthermore, despite extensive comparative genomic analyses, genes necessary for survival in alkaline conditions could not be identified in Nab. magadii. Conclusions Based on genomic analyses, Nab. magadii is predicted to be metabolically versatile and it could use different carbon and energy sources to sustain growth. Nab. magadii has the genetic potential to adapt to its milieu by intracellular accumulation of inorganic cations and/or neutral organic compounds. The identification of Nab. magadii genes involved in coenzyme biosynthesis is a necessary step toward further reconstruction of the metabolic pathways in halophilic archaea and other extremophiles. The knowledge gained from the genome sequence of this haloalkaliphilic archaeon is highly valuable in advancing the applications of extremophiles and their enzymes. PMID:22559199
Divergent evolutionary rates in vertebrate and mammalian specific conserved non-coding elements (CNEs) in echolocating mammals.

PubMed

Davies, Kalina T J; Tsagkogeorga, Georgia; Rossiter, Stephen J

2014-12-19

The majority of DNA contained within vertebrate genomes is non-coding, with a certain proportion of this thought to play regulatory roles during development. Conserved Non-coding Elements (CNEs) are an abundant group of putative regulatory sequences that are highly conserved across divergent groups and thus assumed to be under strong selective constraint. Many CNEs may contain regulatory factor binding sites, and their frequent spatial association with key developmental genes - such as those regulating sensory system development - suggests crucial roles in regulating gene expression and cellular patterning. Yet surprisingly little is known about the molecular evolution of CNEs across diverse mammalian taxa or their role in specific phenotypic adaptations. We examined 3,110 vertebrate-specific and ~82,000 mammalian-specific CNEs across 19 and 9 mammalian orders respectively, and tested for changes in the rate of evolution of CNEs located in the proximity of genes underlying the development or functioning of auditory systems. As we focused on CNEs putatively associated with genes underlying the development/functioning of auditory systems, we incorporated echolocating taxa in our dataset because of their highly specialised and derived auditory systems. Phylogenetic reconstructions of concatenated CNEs broadly recovered accepted mammal relationships despite high levels of sequence conservation. We found that CNE substitution rates were highest in rodents and lowest in primates, consistent with previous findings. Comparisons of CNE substitution rates from several genomic regions containing genes linked to auditory system development and hearing revealed differences between echolocating and non-echolocating taxa. Wider taxonomic sampling of four CNEs associated with the homeobox genes Hmx2 and Hmx3 - which are required for inner ear development - revealed family-wise variation across diverse bat species. 
Specifically, within one family of echolocating bats that utilise frequency-modulated echolocation calls varying widely in frequency and intensity, high levels of sequence divergence were found. Levels of selective constraint acting on CNEs differed both across genomic locations and taxa, with observed variation in substitution rates of CNEs among bat species. More work is needed to determine whether this variation can be linked to echolocation, and wider taxonomic sampling is necessary to fully document levels of conservation in CNEs across diverse taxa.
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production.

PubMed

Roth, Melissa S; Cokus, Shawn J; Gallaher, Sean D; Walter, Andreas; Lopez, David; Erickson, Erika; Endelman, Benjamin; Westcott, Daniel; Larabell, Carolyn A; Merchant, Sabeeha S; Pellegrini, Matteo; Niyogi, Krishna K

2017-05-23

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ∼58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (∼6%), and a high fraction of protein-coding sequence (∼39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (∼73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.
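As a back-of-the-envelope reading of the summary figures quoted in this abstract, the sketch below derives the implied gene density and the mean amount of coding sequence per gene. The inputs are the approximate values from the abstract (about 58 Mbp, 15,274 predicted genes, about 39% protein-coding sequence); the function name `genome_summary` is hypothetical and the calculation is an illustration, not part of the published analysis.

```python
def genome_summary(genome_bp, n_genes, coding_fraction):
    """Return (genes per Mbp, mean coding bp per gene) from summary figures."""
    genes_per_mbp = n_genes / (genome_bp / 1_000_000)
    mean_coding_bp = genome_bp * coding_fraction / n_genes
    return genes_per_mbp, mean_coding_bp


# Approximate values taken from the abstract; both are rounded in the source.
density, mean_cds = genome_summary(58_000_000, 15_274, 0.39)
print(f"{density:.0f} genes per Mbp, about {mean_cds:.0f} coding bp per gene")
```

Because the genome size and coding fraction are themselves rounded, the derived numbers should be read as order-of-magnitude estimates only.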
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production

DOE PAGES

Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.; ...

2017-05-08

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. Here, to advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. Finally, the high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production

DOE Office of Scientific and Technical Information (OSTI.GOV)

Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. Here, to advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. Finally, the high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production

PubMed Central

Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.; Walter, Andreas; Lopez, David; Erickson, Erika; Endelman, Benjamin; Westcott, Daniel; Larabell, Carolyn A.; Merchant, Sabeeha S.; Pellegrini, Matteo

2017-01-01

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ∼58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (∼6%), and a high fraction of protein-coding sequence (∼39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (∼73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production. PMID:28484037
Prediction of plant lncRNA by ensemble machine learning classifiers.

PubMed

Simopoulos, Caitlin M A; Weretilnyk, Elizabeth A; Golding, G Brian

2018-05-02

In plants, long non-protein coding RNAs are believed to have essential roles in development and stress responses. However, relative to advances on discerning biological roles for long non-protein coding RNAs in animal systems, this RNA class in plants is largely understudied. With comparatively few validated plant long non-coding RNAs, research on this potentially critical class of RNA is hindered by a lack of appropriate prediction tools and databases. Supervised learning models trained on data sets of mostly non-validated, non-coding transcripts have been previously used to identify this enigmatic RNA class with applications largely focused on animal systems. Our approach uses a training set comprised only of empirically validated long non-protein coding RNAs from plant, animal, and viral sources to predict and rank candidate long non-protein coding gene products for future functional validation. Individual stochastic gradient boosting and random forest classifiers trained on only empirically validated long non-protein coding RNAs were constructed. In order to use the strengths of multiple classifiers, we combined multiple models into a single stacking meta-learner. This ensemble approach benefits from the diversity of several learners to effectively identify putative plant long non-coding RNAs from transcript sequence features. When the predicted genes identified by the ensemble classifier were compared to those listed in GreeNC, an established plant long non-coding RNA database, overlap for predicted genes from Arabidopsis thaliana, Oryza sativa and Eutrema salsugineum ranged from 51 to 83% with the highest agreement in Eutrema salsugineum. Most of the highest ranking predictions from Arabidopsis thaliana were annotated as potential natural antisense genes, pseudogenes, transposable elements, or simply computationally predicted hypothetical protein. 
Due to the nature of this tool, the model can be updated as new long non-protein coding transcripts are identified and functionally verified. This ensemble classifier is an accurate tool that can be used to rank long non-protein coding RNA predictions for use in conjunction with gene expression studies. Selection of plant transcripts with a high potential for regulatory roles as long non-protein coding RNAs will advance research in the elucidation of long non-protein coding RNA function.
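The abstract above says the classifiers work from transcript sequence features but does not list them. As a minimal sketch of what such a feature vector might look like, the code below computes three features often used to separate coding from non-coding transcripts: length, GC content, and the longest forward-strand open reading frame. The feature choice and the function names `longest_orf_length` and `transcript_features` are assumptions for illustration, not the authors' feature set.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf_length(seq):
    """Length in nt of the longest ATG-to-stop ORF on the forward strand."""
    seq = seq.upper()
    best = 0
    for frame in range(3):
        start = None  # index of the first ATG of the ORF currently open
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                best = max(best, i + 3 - start)  # include the stop codon
                start = None
    return best


def transcript_features(seq):
    """Return a small dict of illustrative sequence features for one transcript."""
    seq = seq.upper()
    n = len(seq)
    orf = longest_orf_length(seq)
    return {
        "length": n,
        "gc_content": (seq.count("G") + seq.count("C")) / n if n else 0.0,
        "longest_orf": orf,
        "orf_coverage": orf / n if n else 0.0,
    }


print(transcript_features("CCATGAAATTTTAGGG"))
```

A dictionary like this could be passed as one row to whichever base learners are being stacked; a short longest ORF relative to transcript length is one common hint that a transcript is non-coding.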

Putative function of hypothetical proteins expressed by Clostridium perfringens type A strains and their protective efficacy in mouse model.

PubMed

Alam, Syed Imteyaz; Dwivedi, Pratistha

2016-10-01

The whole genome sequencing and annotation of Clostridium perfringens strains revealed several genes coding for proteins of unknown function with no significant similarities to genes in other organisms. Our previous studies clearly demonstrated that hypothetical proteins CPF_2500, CPF_1441, CPF_0876, CPF_0093, CPF_2002, CPF_2314, CPF_1179, CPF_1132, CPF_2853, CPF_0552, CPF_2032, CPF_0438, CPF_1440, CPF_2918, CPF_0656, and CPF_2364 are genuine proteins of C. perfringens expressed in high abundance. This study explored the putative role of these hypothetical proteins using bioinformatic tools and evaluated their potential as putative candidates for prophylaxis. Apart from a group of eight hypothetical proteins (HPs), a putative function was predicted for the rest of the hypothetical proteins using one or more of the algorithms used. The phylogenetic analysis did not suggest evidence of a horizontal gene transfer event except for HP CPF_0876. HP CPF_2918 is an abundant extracellular protein, unique to C. perfringens species with maximum strain coverage and did not show any significant match in the database. CPF_2918 was cloned, recombinant protein was purified to near homogeneity, and probing with mouse anti-CPF_2918 serum revealed surface localization of the protein in C. perfringens ATCC13124 cultures. The purified recombinant CPF_2918 protein induced antibody production, a mixed Th1 and Th2 kind of response, and provided partial protection to immunized mice in direct C. perfringens challenge. Copyright © 2016 Elsevier B.V. All rights reserved.
TCOF1 gene encodes a putative nucleolar phosphoprotein that exhibits mutations in Treacher Collins Syndrome throughout its coding region.

PubMed

Wise, C A; Chiang, L C; Paznekas, W A; Sharma, M; Musy, M M; Ashley, J A; Lovett, M; Jabs, E W

1997-04-01

Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development.
Genome of Rhodnius prolixus, an insect vector of Chagas disease, reveals unique adaptations to hematophagy and parasite infection

PubMed Central

Mesquita, Rafael D.; Vionette-Amaral, Raquel J.; Lowenberger, Carl; Rivera-Pomar, Rolando; Monteiro, Fernando A.; Minx, Patrick; Spieth, John; Carvalho, A. Bernardo; Panzera, Francisco; Lawson, Daniel; Torres, André Q.; Ribeiro, Jose M. C.; Sorgine, Marcos H. F.; Waterhouse, Robert M.; Abad-Franch, Fernando; Alves-Bezerra, Michele; Amaral, Laurence R.; Araujo, Helena M.; Aravind, L.; Atella, Georgia C.; Azambuja, Patricia; Berni, Mateus; Bittencourt-Cunha, Paula R.; Braz, Gloria R. C.; Calderón-Fernández, Gustavo; Carareto, Claudia M. A.; Christensen, Mikkel B.; Costa, Igor R.; Costa, Samara G.; Dansa, Marilvia; Daumas-Filho, Carlos R. O.; De-Paula, Iron F.; Dias, Felipe A.; Dimopoulos, George; Emrich, Scott J.; Esponda-Behrens, Natalia; Fampa, Patricia; Fernandez-Medina, Rita D.; da Fonseca, Rodrigo N.; Fontenele, Marcio; Fronick, Catrina; Fulton, Lucinda A.; Gandara, Ana Caroline; Garcia, Eloi S.; Genta, Fernando A.; Giraldo-Calderón, Gloria I.; Gomes, Bruno; Gondim, Katia C.; Granzotto, Adriana; Guarneri, Alessandra A.; Guigó, Roderic; Harry, Myriam; Hughes, Daniel S. T.; Jablonka, Willy; Jacquin-Joly, Emmanuelle; Juárez, M. Patricia; Koerich, Leonardo B.; Lange, Angela B.; Latorre-Estivalis, José Manuel; Lavore, Andrés; Lawrence, Gena G.; Lazoski, Cristiano; Lazzari, Claudio R.; Lopes, Raphael R.; Lorenzo, Marcelo G.; Lugon, Magda D.; Marcet, Paula L.; Mariotti, Marco; Masuda, Hatisaburo; Megy, Karine; Missirlis, Fanis; Mota, Theo; Noriega, Fernando G.; Nouzova, Marcela; Nunes, Rodrigo D.; Oliveira, Raquel L. L.; Oliveira-Silveira, Gilbert; Ons, Sheila; Orchard, Ian; Pagola, Lucia; Paiva-Silva, Gabriela O.; Pascual, Agustina; Pavan, Marcio G.; Pedrini, Nicolás; Peixoto, Alexandre A.; Pereira, Marcos H.; Pike, Andrew; Polycarpo, Carla; Prosdocimi, Francisco; Ribeiro-Rodrigues, Rodrigo; Robertson, Hugh M.; Salerno, Ana Paula; Salmon, Didier; Santesmasses, Didac; Schama, Renata; Seabra-Junior, Eloy S.; Silva-Cardoso, Livia; Silva-Neto, Mario A. C.; Souza-Gomes, Matheus; Sterkel, Marcos; Taracena, Mabel L.; Tojo, Marta; Tu, Zhijian Jake; Tubio, Jose M. C.; Ursic-Bedoya, Raul; Venancio, Thiago M.; Walter-Nuno, Ana Beatriz; Wilson, Derek; Warren, Wesley C.; Wilson, Richard K.; Huebner, Erwin; Dotson, Ellen M.; Oliveira, Pedro L.

2015-01-01

Rhodnius prolixus not only has served as a model organism for the study of insect physiology, but also is a major vector of Chagas disease, an illness that affects approximately seven million people worldwide. We sequenced the genome of R. prolixus, generated assembled sequences covering 95% of the genome (∼702 Mb), including 15,456 putative protein-coding genes, and completed comprehensive genomic analyses of this obligate blood-feeding insect. Although immune-deficiency (IMD)-mediated immune responses were observed, R. prolixus putatively lacks key components of the IMD pathway, suggesting a reorganization of the canonical immune signaling network. Although both Toll and IMD effectors controlled intestinal microbiota, neither affected Trypanosoma cruzi, the causal agent of Chagas disease, implying the existence of evasion or tolerance mechanisms. R. prolixus has experienced an extensive loss of selenoprotein genes, with its repertoire reduced to only two proteins, one of which is a selenocysteine-based glutathione peroxidase, the first found in insects. The genome contained actively transcribed, horizontally transferred genes from Wolbachia sp., which showed evidence of codon use evolution toward the insect use pattern. Comparative protein analyses revealed many lineage-specific expansions and putative gene absences in R. prolixus, including tandem expansions of genes related to chemoreception, feeding, and digestion that possibly contributed to the evolution of a blood-feeding lifestyle. The genome assembly and these associated analyses provide critical information on the physiology and evolution of this important vector species and should be instrumental for the development of innovative disease control methods. PMID:26627243
Comparative Transcriptome Analysis Identifies Putative Genes Involved in the Biosynthesis of Xanthanolides in Xanthium strumarium L.

PubMed Central

Li, Yuanjun; Gou, Junbo; Chen, Fangfang; Li, Changfu; Zhang, Yansheng

2016-01-01

Xanthium strumarium L. is a traditional Chinese herb belonging to the Asteraceae family. The major bioactive components of this plant are sesquiterpene lactones (STLs), which include the xanthanolides. To date, the biogenesis of xanthanolides, especially their downstream pathway, remains largely unknown. In X. strumarium, xanthanolides primarily accumulate in its glandular trichomes. To identify putative gene candidates involved in the biosynthesis of xanthanolides, three X. strumarium transcriptomes, which were derived from the young leaves of two different cultivars and the purified glandular trichomes from one of the cultivars, were constructed in this study. In total, 157 million clean reads were generated and assembled into 91,861 unigenes, of which 59,858 unigenes were successfully annotated. All the genes coding for known enzymes in the upstream pathway to the biosynthesis of xanthanolides were present in the X. strumarium transcriptomes. From a comparative analysis of the X. strumarium transcriptomes, this study identified a number of gene candidates that are putatively involved in the downstream pathway to the synthesis of xanthanolides, such as four unigenes encoding CYP71 P450s, 50 unigenes for dehydrogenases, and 27 genes for acetyltransferases. The possible functions of these four CYP71 candidates are extensively discussed. In addition, 116 transcription factors that are highly expressed in X. strumarium glandular trichomes were also identified. Their possible regulatory roles in the biosynthesis of STLs are discussed. The global transcriptomic data for X. strumarium should provide a valuable resource for further research into the biosynthesis of xanthanolides. PMID:27625674
Genome of Rhodnius prolixus, an insect vector of Chagas disease, reveals unique adaptations to hematophagy and parasite infection.

PubMed

Mesquita, Rafael D; Vionette-Amaral, Raquel J; Lowenberger, Carl; Rivera-Pomar, Rolando; Monteiro, Fernando A; Minx, Patrick; Spieth, John; Carvalho, A Bernardo; Panzera, Francisco; Lawson, Daniel; Torres, André Q; Ribeiro, Jose M C; Sorgine, Marcos H F; Waterhouse, Robert M; Montague, Michael J; Abad-Franch, Fernando; Alves-Bezerra, Michele; Amaral, Laurence R; Araujo, Helena M; Araujo, Ricardo N; Aravind, L; Atella, Georgia C; Azambuja, Patricia; Berni, Mateus; Bittencourt-Cunha, Paula R; Braz, Gloria R C; Calderón-Fernández, Gustavo; Carareto, Claudia M A; Christensen, Mikkel B; Costa, Igor R; Costa, Samara G; Dansa, Marilvia; Daumas-Filho, Carlos R O; De-Paula, Iron F; Dias, Felipe A; Dimopoulos, George; Emrich, Scott J; Esponda-Behrens, Natalia; Fampa, Patricia; Fernandez-Medina, Rita D; da Fonseca, Rodrigo N; Fontenele, Marcio; Fronick, Catrina; Fulton, Lucinda A; Gandara, Ana Caroline; Garcia, Eloi S; Genta, Fernando A; Giraldo-Calderón, Gloria I; Gomes, Bruno; Gondim, Katia C; Granzotto, Adriana; Guarneri, Alessandra A; Guigó, Roderic; Harry, Myriam; Hughes, Daniel S T; Jablonka, Willy; Jacquin-Joly, Emmanuelle; Juárez, M Patricia; Koerich, Leonardo B; Lange, Angela B; Latorre-Estivalis, José Manuel; Lavore, Andrés; Lawrence, Gena G; Lazoski, Cristiano; Lazzari, Claudio R; Lopes, Raphael R; Lorenzo, Marcelo G; Lugon, Magda D; Majerowicz, David; Marcet, Paula L; Mariotti, Marco; Masuda, Hatisaburo; Megy, Karine; Melo, Ana C A; Missirlis, Fanis; Mota, Theo; Noriega, Fernando G; Nouzova, Marcela; Nunes, Rodrigo D; Oliveira, Raquel L L; Oliveira-Silveira, Gilbert; Ons, Sheila; Orchard, Ian; Pagola, Lucia; Paiva-Silva, Gabriela O; Pascual, Agustina; Pavan, Marcio G; Pedrini, Nicolás; Peixoto, Alexandre A; Pereira, Marcos H; Pike, Andrew; Polycarpo, Carla; Prosdocimi, Francisco; Ribeiro-Rodrigues, Rodrigo; Robertson, Hugh M; Salerno, Ana Paula; Salmon, Didier; Santesmasses, Didac; Schama, Renata; Seabra-Junior, Eloy S; Silva-Cardoso, Livia; Silva-Neto, Mario A C; 
Souza-Gomes, Matheus; Sterkel, Marcos; Taracena, Mabel L; Tojo, Marta; Tu, Zhijian Jake; Tubio, Jose M C; Ursic-Bedoya, Raul; Venancio, Thiago M; Walter-Nuno, Ana Beatriz; Wilson, Derek; Warren, Wesley C; Wilson, Richard K; Huebner, Erwin; Dotson, Ellen M; Oliveira, Pedro L

2015-12-01

Rhodnius prolixus not only has served as a model organism for the study of insect physiology, but also is a major vector of Chagas disease, an illness that affects approximately seven million people worldwide. We sequenced the genome of R. prolixus, generated assembled sequences covering 95% of the genome (∼ 702 Mb), including 15,456 putative protein-coding genes, and completed comprehensive genomic analyses of this obligate blood-feeding insect. Although immune-deficiency (IMD)-mediated immune responses were observed, R. prolixus putatively lacks key components of the IMD pathway, suggesting a reorganization of the canonical immune signaling network. Although both Toll and IMD effectors controlled intestinal microbiota, neither affected Trypanosoma cruzi, the causal agent of Chagas disease, implying the existence of evasion or tolerance mechanisms. R. prolixus has experienced an extensive loss of selenoprotein genes, with its repertoire reduced to only two proteins, one of which is a selenocysteine-based glutathione peroxidase, the first found in insects. The genome contained actively transcribed, horizontally transferred genes from Wolbachia sp., which showed evidence of codon use evolution toward the insect use pattern. Comparative protein analyses revealed many lineage-specific expansions and putative gene absences in R. prolixus, including tandem expansions of genes related to chemoreception, feeding, and digestion that possibly contributed to the evolution of a blood-feeding lifestyle. The genome assembly and these associated analyses provide critical information on the physiology and evolution of this important vector species and should be instrumental for the development of innovative disease control methods.
Complete Genome Sequence of the Symbiotic Strain Bradyrhizobium icense LMTR 13T, Isolated from Lima Bean (Phaseolus lunatus) in Peru

PubMed Central

Rogel, Marco A.; Zúñiga-Dávila, Doris; Martínez-Romero, Esperanza

2018-01-01

ABSTRACT The complete genome sequence of Bradyrhizobium icense LMTR 13T, a root nodule bacterium isolated from the legume Phaseolus lunatus, is reported here. The genome consists of a circular 8,322,773-bp chromosome which codes for a large and novel symbiotic island as well as genes putatively involved in soil and root colonization. PMID:29519840
A computational search for box C/D snoRNA genes in the Drosophila melanogaster genome.

PubMed

Accardo, M C; Giordano, E; Riccardo, S; Digilio, F A; Iazzetti, G; Calogero, R A; Furia, M

2004-12-12

In eukaryotes, the family of non-coding RNA genes includes a number of genes encoding small nucleolar RNAs (mainly C/D and H/ACA snoRNAs), which act as guides in the maturation or post-transcriptional modifications of target RNA molecules. Since in Drosophila melanogaster (Dm) only a few examples of snoRNAs have been identified so far by cDNA library screening, integration of the molecular data with in silico identification of these types of genes could throw light on their organization in the Dm genome. We have performed a computational screening of the Dm genome for C/D snoRNA genes, followed by experimental validation of the putative candidates. Few of the 26 confirmed snoRNAs had been recognized by cDNA library analysis. Organization of the Dm genome was also found to be more variegated than previously suspected, with snoRNA genes nested in both the introns and exons of protein-coding genes. This finding suggests the presence of additional mechanisms of snoRNA biogenesis based on the alternative production of overlapping mRNA/snoRNA molecules. Additional information is available at http://www.bioinformatica.unito.it/bioinformatics/snoRNAs.
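The abstract does not describe the search algorithm, so the following is only a minimal sketch of how a first-pass computational screen for box C/D snoRNA candidates might look. It relies on the commonly cited consensus motifs, box C (RUGAUGA, written here as DNA `[AG]TGATGA`) near the 5' end and box D (CUGA, as DNA `CTGA`) near the 3' end. The function name `find_cd_candidates` and the length window of 60 to 150 nt are assumptions chosen for illustration, not the authors' parameters.

```python
import re

# Box C consensus RUGAUGA as DNA; the lookahead lets overlapping hits be found.
BOX_C = re.compile(r"(?=[AG]TGATGA)")
BOX_D = "CTGA"  # box D consensus CUGA as DNA


def find_cd_candidates(dna, min_len=60, max_len=150):
    """Return (start, end) spans that begin at a box C and end with a box D.

    Only the forward strand is scanned; min_len and max_len bound the
    total candidate length and are illustrative assumptions.
    """
    dna = dna.upper()
    hits = []
    for match in BOX_C.finditer(dna):
        start = match.start()
        # Earliest box D position that still gives a span of at least min_len.
        pos = dna.find(BOX_D, start + min_len - len(BOX_D))
        while pos != -1 and pos + len(BOX_D) - start <= max_len:
            hits.append((start, pos + len(BOX_D)))
            pos = dna.find(BOX_D, pos + 1)
    return hits


# Toy sequence: box C, 60 nt of filler, then box D.
toy = "TT" + "ATGATGA" + "A" * 60 + "CTGA" + "GG"
print(find_cd_candidates(toy))
```

Motif matching alone produces many false positives in a real genome, which is consistent with the abstract's emphasis on experimental validation of the putative candidates; a fuller screen would also check for guide sequences complementary to target RNAs and for terminal stem pairing.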
Genomic Structure of the Luciferase Gene from the Bioluminescent Beetle, Nyctophila cf. Caucasica

PubMed Central

Day, John C.; Chaichi, Mohammad J.; Najafil, Iraj; Whiteley, Andrew S.

2006-01-01

The gene coding for beetle luciferase, the enzyme responsible for bioluminescence in over two thousand coleopteran species has, to date, only been characterized from one Palearctic species of Lampyridae. Here we report the characterization of the luciferase gene from a female beetle of an Iranian lampyrid species, Nyctophila cf. caucasica (Coleoptera:Lampyridae). The luciferase gene was composed of seven exons, coding for 547 amino acids, separated by six introns spanning 1976 bp of genomic DNA. The deduced amino acid sequences of the luciferase gene of N. caucasica showed 98.9% homology to that of the Palearctic species Lampyris noctiluca. Analysis of the 810 bp upstream region of the luciferase gene revealed three TATA boxes and several other consensus transcriptional factor recognition sequences presenting evidence for a putative core promoter region conserved in Lampyrinae from -190 through to -155 upstream of the luciferase start codon. Along with the core promoter region the luciferase gene was compared with orthologous sequences from other lampyrid species and found to have greatest identity to Lampyris turkistanicus and Lampyris noctiluca. The significant sequence identity to the former is discussed in relation to taxonomic issues of Iranian lampyrids. PMID:20298115
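The abstract reports promoter features as coordinates relative to the luciferase start codon (for example, -190 through -155). As a small sketch of how such coordinates can be derived, the code below scans an upstream sequence for TATA-like motifs and reports each hit as a negative offset from the start codon. The pattern `TATA[AT]A[AT]` is the commonly cited TATA box consensus; the function name `tata_positions` and the toy sequence are hypothetical, and this is not the analysis used in the paper.

```python
import re

# Commonly cited TATA box consensus (TATAWAW); lookahead allows overlapping hits.
TATA = re.compile(r"(?=TATA[AT]A[AT])")


def tata_positions(upstream):
    """Return start positions of TATA-like motifs relative to the start codon.

    `upstream` is the sequence immediately 5' of the start codon, so its
    last base is position -1 and its first base is -len(upstream).
    """
    upstream = upstream.upper()
    n = len(upstream)
    return [m.start() - n for m in TATA.finditer(upstream)]


# Toy upstream region (not the N. caucasica sequence).
toy_upstream = "GC" * 5 + "TATAAAA" + "GC" * 10
print(tata_positions(toy_upstream))
```

Applied to a real 810 bp upstream region, each returned value would fall between -810 and -7, matching the negative-coordinate convention used in the abstract.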
Origin and Functional Prediction of Pollen Allergens in Plants

PubMed Central

Chen, Miaolin; Xu, Jie; Ren, Kang; Searle, Iain

2016-01-01

Pollen allergies have long been a major pandemic health problem for humans. However, the evolutionary events and biological function of pollen allergens in plants remain largely unknown. Here, we report the genome-wide prediction of pollen allergens and their biological function in the dicotyledonous model plant Arabidopsis (Arabidopsis thaliana) and the monocotyledonous model plant rice (Oryza sativa). In total, 145 and 107 pollen allergens were predicted from rice and Arabidopsis, respectively. These pollen allergens are putatively involved in stress responses and metabolic processes such as cell wall metabolism during pollen development. Interestingly, these putative pollen allergen genes were derived from large gene families and became diversified during evolution. Sequence analysis across 25 plant species from green alga to angiosperms suggests that about 40% of putative pollen allergenic proteins existed in both lower and higher plants, while other allergens emerged during evolution. Although a high proportion of gene duplication has been observed among allergen-coding genes, our data show that these genes might have undergone purifying selection during evolution. We also observed that epitopes of an allergen might have a biological function, as revealed by comprehensive analysis of two known allergens, expansin and profilin. This implies a crucial role of conserved amino acid residues in both in planta biological function and allergenicity. Finally, a model explaining how pollen allergens were generated and maintained in plants is proposed. Prediction and systematic analysis of pollen allergens in model plants suggest that pollen allergens evolved by gene duplication and then functional specification. This study provides insight into the phylogenetic and evolutionary scenario of pollen allergens that will be helpful to future characterization and epitope screening of pollen allergens. PMID:27436829
Identification of Putative Precursor Genes for the Biosynthesis of Cannabinoid-Like Compound in Radula marginata

PubMed Central

Hussain, Tajammul; Plunkett, Blue; Ejaz, Mahwish; Espley, Richard V.; Kayser, Oliver

2018-01-01

The liverwort Radula marginata belongs to the bryophyte division of land plants and is a prospective alternate source of cannabinoid-like compounds. However, mechanistic insights into the molecular pathways directing the synthesis of these cannabinoid-like compounds have been hindered due to the lack of genetic information. This prompted us to do deep sequencing, de novo assembly and annotation of R. marginata transcriptome, which resulted in the identification and validation of the genes for cannabinoid biosynthetic pathway. In total, we have identified 11,421 putative genes encoding 1,554 enzymes from 145 biosynthetic pathways. Interestingly, we have identified all the upstream genes of the central precursor of cannabinoid biosynthesis, cannabigerolic acid (CBGA), including its two first intermediates, stilbene acid (SA) and geranyl diphosphate (GPP). Expression of all these genes was validated using quantitative real-time PCR. We have characterized the protein structure of stilbene synthase (STS), which is considered as a homolog of olivetolic acid in R. marginata. Moreover, the metabolomics approach enabled us to identify CBGA-analogous compounds using electrospray ionization mass spectrometry (ESI-MS/MS) and gas chromatography mass spectrometry (GC-MS). Transcriptomic analysis revealed 1085 transcription factors (TF) from 39 families. Comparative analysis showed that six TF families have been uniquely predicted in R. marginata. In addition, the bioinformatics analysis predicted a large number of simple sequence repeats (SSRs) and non-coding RNAs (ncRNAs). Our results collectively provide mechanistic insights into the putative precursor genes for the biosynthesis of cannabinoid-like compounds and a novel transcriptomic resource for R. marginata. The large-scale transcriptomic resource generated in this study would further serve as a reference transcriptome to explore the Radulaceae family.
Origin and Functional Prediction of Pollen Allergens in Plants.

PubMed

Chen, Miaolin; Xu, Jie; Devis, Deborah; Shi, Jianxin; Ren, Kang; Searle, Iain; Zhang, Dabing

2016-09-01

Pollen allergies have long been a major pandemic health problem for humans. However, the evolutionary events and biological function of pollen allergens in plants remain largely unknown. Here, we report the genome-wide prediction of pollen allergens and their biological function in the dicotyledonous model plant Arabidopsis (Arabidopsis thaliana) and the monocotyledonous model plant rice (Oryza sativa). In total, 145 and 107 pollen allergens were predicted from rice and Arabidopsis, respectively. These pollen allergens are putatively involved in stress responses and metabolic processes such as cell wall metabolism during pollen development. Interestingly, these putative pollen allergen genes were derived from large gene families and became diversified during evolution. Sequence analysis across 25 plant species from green alga to angiosperms suggests that about 40% of putative pollen allergenic proteins existed in both lower and higher plants, while other allergens emerged during evolution. Although a high proportion of gene duplication has been observed among allergen-coding genes, our data show that these genes might have undergone purifying selection during evolution. We also observed that epitopes of an allergen might have a biological function, as revealed by comprehensive analysis of two known allergens, expansin and profilin. This implies a crucial role of conserved amino acid residues in both in planta biological function and allergenicity. Finally, a model explaining how pollen allergens were generated and maintained in plants is proposed. Prediction and systematic analysis of pollen allergens in model plants suggest that pollen allergens evolved by gene duplication and then functional specification. This study provides insight into the phylogenetic and evolutionary scenario of pollen allergens that will be helpful to future characterization and epitope screening of pollen allergens. © 2016 American Society of Plant Biologists. All rights reserved.
Global Identification and Characterization of Transcriptionally Active Regions in the Rice Genome

PubMed Central

Stolc, Viktor; Deng, Wei; He, Hang; Korbel, Jan; Chen, Xuewei; Tongprasit, Waraporn; Ronald, Pamela; Chen, Runsheng; Gerstein, Mark; Wang Deng, Xing

2007-01-01

Genome tiling microarray studies have consistently documented rich transcriptional activity beyond the annotated genes. However, systematic characterization and transcriptional profiling of the putative novel transcripts on the genome scale are still lacking. We report here the identification of 25,352 and 27,744 transcriptionally active regions (TARs) not encoded by annotated exons in the rice (Oryza sativa) subspecies japonica and indica, respectively. The non-exonic TARs account for approximately two thirds of the total TARs detected by tiling arrays and represent transcripts likely conserved between japonica and indica. Transcription of 21,018 (83%) japonica non-exonic TARs was verified through expression profiling in 10 tissue types using a re-array in which annotated genes and TARs were each represented by five independent probes. Subsequent analyses indicate that about 80% of the japonica TARs that were not assigned to annotated exons can be assigned to various putatively functional or structural elements of the rice genome, including splice variants, uncharacterized portions of incompletely annotated genes, antisense transcripts, duplicated gene fragments, and potential non-coding RNAs. These results provide a systematic characterization of non-exonic transcripts in rice and thus expand the current view of the complexity and dynamics of the rice transcriptome. PMID:17372628
RADH, a gene of Saccharomyces cerevisiae encoding a putative DNA helicase involved in DNA repair. Characteristics of radH mutants and sequence of the gene.

PubMed

Aboussekhra, A; Chanet, R; Zgaga, Z; Cassier-Chauvat, C; Heude, M; Fabre, F

1989-09-25

A new type of radiation-sensitive mutant of S. cerevisiae is described. The recessive radH mutation sensitizes to the lethal effect of UV radiations haploids in the G1 but not in the G2 mitotic phase. Homozygous diploids are as sensitive as G1 haploids. The UV-induced mutagenesis is depressed, while the induction of gene conversion is increased. The mutation is believed to channel the repair of lesions engaged in the mutagenic pathway into a recombination process, successful if the events involve sister-chromatids but lethal if they involve homologous chromosomes. The sequence of the RADH gene reveals that it may code for a DNA helicase, with a Mr of 134 kDa. All the consensus domains of known DNA helicases are present. Besides these consensus regions, strong homologies with the Rep and UvrD helicases of E. coli were found. The RadH putative helicase appears to belong to the set of proteins involved in the error-prone repair mechanism, at least for UV-induced lesions, and could act in coordination with the Rev3 error-prone DNA polymerase.
The Complete Mitochondrial Genome and Novel Gene Arrangement of the Unique-Headed Bug Stenopirates sp. (Hemiptera: Enicocephalidae)

PubMed Central

Li, Hu; Liu, Hui; Shi, Aimin; Štys, Pavel; Zhou, Xuguo; Cai, Wanzhi

2012-01-01

Many true bugs are important insect pests of cultivated crops and some are important vectors of human diseases, but few cladistic analyses have addressed relationships among the seven infraorders of Heteroptera. The Enicocephalomorpha and Nepomorpha are considered the basal groups of Heteroptera, but the basal-most lineage remains unresolved. Here we report the mitochondrial genome of the unique-headed bug Stenopirates sp., the first mitochondrial genome sequenced from Enicocephalomorpha. The Stenopirates sp. mitochondrial genome is a typical circular DNA molecule of 15,384 bp in length, and contains 37 genes and a large non-coding fragment. The gene order differs substantially from other known insect mitochondrial genomes, with rearrangements of both tRNA genes and protein-coding genes. The overall AT content (82.5%) of Stenopirates sp. is the highest among all the known heteropteran mitochondrial genomes. The strand bias is consistent with other true bugs with negative GC-skew and positive AT-skew for the J-strand. The heteropteran mitochondrial atp8 exhibits the highest evolutionary rate, whereas cox1 appears to have the lowest rate. Furthermore, a negative correlation was observed between the variation of nucleotide substitutions and the GC content of each protein-coding gene. A microsatellite was identified in the putative control region. Finally, phylogenetic reconstruction suggests that Enicocephalomorpha is the sister group to all the remaining Heteroptera. PMID:22235294
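The AT content and strand-skew statistics quoted in the abstract above can be illustrated with a minimal Python sketch. It assumes the commonly used definitions AT-skew = (A − T)/(A + T) and GC-skew = (G − C)/(G + C), computed on a single strand; the abstract does not state which formulas the authors used, and the function name and toy sequence are hypothetical, not taken from the Stenopirates sp. genome.

```python
from collections import Counter


def composition_stats(seq: str) -> dict:
    """Return AT content, AT-skew and GC-skew for one DNA strand.

    Assumed definitions (not stated in the abstract):
      AT-skew = (A - T) / (A + T)
      GC-skew = (G - C) / (G + C)
    Characters other than A, C, G, T are ignored.
    """
    counts = Counter(seq.upper())
    a, c, g, t = (counts[base] for base in "ACGT")
    total = a + c + g + t
    if total == 0 or (a + t) == 0 or (g + c) == 0:
        raise ValueError("sequence needs at least one A/T and one G/C base")
    return {
        "at_content": (a + t) / total,
        "at_skew": (a - t) / (a + t),
        "gc_skew": (g - c) / (g + c),
    }


# Hypothetical toy sequence: A=4, T=3, G=1, C=2.
toy = "AAAATTTGCC"
stats = composition_stats(toy)
print(stats)
```

For this toy strand the AT-skew is positive and the GC-skew is negative, the same sign pattern the abstract reports for the J-strand.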
Putative Nonribosomal Peptide Synthetase and Cytochrome P450 Genes Responsible for Tentoxin Biosynthesis in Alternaria alternata ZJ33.

PubMed

Li, You-Hai; Han, Wen-Jin; Gui, Xi-Wu; Wei, Tao; Tang, Shuang-Yan; Jin, Jian-Ming

2016-08-02

Tentoxin, a cyclic tetrapeptide produced by several Alternaria species, inhibits the F₁-ATPase activity of chloroplasts, resulting in chlorosis in sensitive plants. In this study, we report two clustered genes, encoding a putative non-ribosome peptide synthetase (NRPS) TES and a cytochrome P450 protein TES1, that are required for tentoxin biosynthesis in Alternaria alternata strain ZJ33, which was isolated from blighted leaves of Eupatorium adenophorum. Using a pair of primers designed according to the consensus sequences of the adenylation domain of NRPSs, two fragments containing putative adenylation domains were amplified from A. alternata ZJ33, and subsequent PCR analyses demonstrated that these fragments belonged to the same NRPS coding sequence. With no introns, TES consists of a single 15,486 base pair open reading frame encoding a predicted 5161 amino acid protein. Meanwhile, the TES1 gene is predicted to contain five introns and encode a 506 amino acid protein. The TES protein is predicted to be comprised of four peptide synthase modules with two additional N-methylation domains, and the number and arrangement of the modules in TES were consistent with the number and arrangement of the amino acid residues of tentoxin, respectively. Notably, both TES and TES1 null mutants generated via homologous recombination failed to produce tentoxin. This study provides the first evidence concerning the biosynthesis of tentoxin in A. alternata.
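The relationship between the TES open reading frame length and the predicted protein length reported above can be checked with simple arithmetic. The following Python sketch assumes that the 15,486 bp figure includes the stop codon, which the abstract does not state explicitly; the function name is hypothetical.

```python
def protein_length_from_orf(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an intron-free ORF of orf_bp bases.

    Assumes the ORF is in frame (length divisible by 3). If the length
    includes the stop codon, that codon adds no residue to the protein.
    """
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


# 15,486 bp / 3 = 5162 codons; minus one stop codon = 5161 residues.
tes_aa = protein_length_from_orf(15486)
print(tes_aa)
```

Under that assumption, the result agrees with the 5161 amino acid protein predicted in the abstract.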
Parallel evolution of chordate cis-regulatory code for development.

PubMed

Doglio, Laura; Goode, Debbie K; Pelleri, Maria C; Pauls, Stefan; Frabetti, Flavia; Shimeld, Sebastian M; Vavouri, Tanya; Elgar, Greg

2013-11-01

Urochordates are the closest relatives of vertebrates and at the larval stage, possess a characteristic bilateral chordate body plan. In vertebrates, the genes that orchestrate embryonic patterning are in part regulated by highly conserved non-coding elements (CNEs), yet these elements have not been identified in urochordate genomes. Consequently the evolution of the cis-regulatory code for urochordate development remains largely uncharacterised. Here, we use genome-wide comparisons between C. intestinalis and C. savignyi to identify putative urochordate cis-regulatory sequences. Ciona conserved non-coding elements (ciCNEs) are associated with largely the same key regulatory genes as vertebrate CNEs. Furthermore, some of the tested ciCNEs are able to activate reporter gene expression in both zebrafish and Ciona embryos, in a pattern that at least partially overlaps that of the gene they associate with, despite the absence of sequence identity. We also show that the ability of a ciCNE to up-regulate gene expression in vertebrate embryos can in some cases be localised to short sub-sequences, suggesting that functional cross-talk may be defined by small regions of ancestral regulatory logic, although functional sub-sequences may also be dispersed across the whole element. We conclude that the structure and organisation of cis-regulatory modules is very different between vertebrates and urochordates, reflecting their separate evolutionary histories. However, functional cross-talk still exists because the same repertoire of transcription factors has likely guided their parallel evolution, exploiting similar sets of binding sites but in different combinations.
Complete mitochondrial genome of the giant African snail, Achatina fulica (Mollusca: Achatinidae): a novel location of putative control regions (CR) in the mitogenome within Pulmonate species.

PubMed

He, Zhang-Ping; Dai, Xia-Bin; Zhang, Shuai; Zhi, Ting-Ting; Lun, Zhao-Rong; Wu, Zhong-Dao; Yang, Ting-Bao

2016-01-01

The whole sequence (15,057 bp) of the mitochondrial DNA (mtDNA) of the terrestrial snail Achatina fulica (order Stylommatophora) was determined. The mitogenome, like typical metazoan mtDNA, contains 13 protein-coding genes (PCG), 2 ribosomal RNA genes (rRNA) and 22 transfer RNA genes (tRNA). The tRNA genes include two trnS without standard secondary structure. Interestingly, we characterized, for the first time among the known mitogenomes of Pulmonata species, an unassigned lengthy sequence (551 bp) between cox1 and trnV that may be the CR, given its AT bias (65.70%) and potential hairpin structure.
Insights from the genome of a high alkaline cellulase producing Aspergillus fumigatus strain obtained from Peruvian Amazon rainforest.

PubMed

Paul, Sujay; Zhang, Angel; Ludeña, Yvette; Villena, Gretty K; Yu, Fengan; Sherman, David H; Gutiérrez-Correa, Marcel

2017-06-10

Here, we report the complete genome sequence of a high alkaline cellulase producing Aspergillus fumigatus strain LMB-35Aa isolated from soil of Peruvian Amazon rainforest. The genome is ∼27.5 Mb in size, comprises 228 scaffolds with an average GC content of 50%, and is predicted to contain a total of 8660 protein-coding genes, of which 6156 have known functions; it codes for 607 putative CAZyme families potentially involved in carbohydrate metabolism. Several important cellulose degrading genes, such as endoglucanase A, endoglucanase B, endoglucanase D and beta-glucosidase, are also identified. The genome of A. fumigatus strain LMB-35Aa represents the first whole sequenced genome of a non-clinical, high cellulase producing A. fumigatus strain isolated from forest soil. Copyright © 2017 Elsevier B.V. All rights reserved.
Tetrahymena thermophila acidic ribosomal protein L37 contains an archaebacterial type of C-terminus.

PubMed

Hansen, T S; Andreasen, P H; Dreisig, H; Højrup, P; Nielsen, H; Engberg, J; Kristiansen, K

1991-09-15

We have cloned and characterized a Tetrahymena thermophila macronuclear gene (L37) encoding the acidic ribosomal protein (A-protein) L37. The gene contains a single intron located in the 3'-part of the coding region. Two major and three minor transcription start points (tsp) were mapped 39 to 63 nucleotides upstream from the translational start codon. The uppermost tsp mapped to the first T in a putative T. thermophila RNA polymerase II initiator element, TATAA. The coding region of L37 predicts a protein of 109 amino acid (aa) residues. A substantial part of the deduced aa sequence was verified by protein sequencing. The T. thermophila L37 clearly belongs to the P1-type family of eukaryotic A-proteins, but the C-terminal region has the hallmarks of archaebacterial A-proteins.
Positional cloning of a gene responsible for the cts mutation of the silkworm, Bombyx mori.

PubMed

Ito, Katsuhiko; Kidokoro, Kurako; Katsuma, Susumu; Shimada, Toru; Yamamoto, Kimiko; Mita, Kazuei; Kadono-Okuda, Keiko

2012-07-01

The larval head cuticle and anal plates of the silkworm mutant cheek and tail spot (cts) have chocolate-colored spots, unlike the entirely white appearance of the wild-type (WT) strain. We report the identification and characterization of the gene responsible for the cts mutation. Positional cloning revealed a cts candidate on chromosome 16, designated BmMFS, based on the high similarity of the deduced amino acid sequence between the candidate gene from the WT strain and the major facilitator superfamily (MFS) protein. BmMFS likely encodes a membrane protein with 11 putative transmembrane domains, while the putative structure deduced from the cts-type allele possesses only 10-pass transmembrane domains owing to a deletion in its coding region. Quantitative RT-PCR analysis showed that BmMFS mRNA was strongly expressed in the integument of the head and tail, where the cts phenotype is observed; expression markedly increased at the molting and newly ecdysed stages. These results indicate that the novel BmMFS gene is cts and the membrane structure of its protein accounts for the cts phenotype. These expression profiles and the cts phenotype are quite similar to those of melanin-related genes, such as Bmyellow-e and Bm-iAANAT, suggesting that BmMFS is involved in the melanin synthesis pathway.

Structure and chromosomal localization of the human PD-1 gene (PDCD1)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Shinohara, T.; Ishida, Y.; Kawaichi, M.

1994-10-01

A cDNA encoding mouse PD-1, a member of the immunoglobulin superfamily, was previously isolated from apoptosis-induced cells by subtractive hybridization. To determine the structure and chromosomal location of the human PD-1 gene, we screened a human T cell cDNA library by mouse PD-1 probe and isolated a cDNA coding for the human PD-1 protein. The deduced amino acid sequence of human PD-1 was 60% identical to the mouse counterpart, and a putative tyrosine kinase-association motif was well conserved. The human PD-1 gene was mapped to 2q37.3 by chromosomal in situ hybridization. 7 refs., 3 figs.
Complete Genome Sequence of the Symbiotic Strain Bradyrhizobium icense LMTR 13T, Isolated from Lima Bean (Phaseolus lunatus) in Peru.

PubMed

Ormeño-Orrillo, Ernesto; Rogel, Marco A; Zúñiga-Dávila, Doris; Martínez-Romero, Esperanza

2018-03-08

The complete genome sequence of Bradyrhizobium icense LMTR 13T, a root nodule bacterium isolated from the legume Phaseolus lunatus, is reported here. The genome consists of a circular 8,322,773-bp chromosome which codes for a large and novel symbiotic island as well as genes putatively involved in soil and root colonization. Copyright © 2018 Ormeño-Orrillo et al.
A Potato cDNA Encoding a Homologue of Mammalian Multidrug Resistant P-Glycoprotein

NASA Technical Reports Server (NTRS)

Wang, W.; Takezawa, D.; Poovaiah, B. W.

1996-01-01

A homologue of the multidrug resistance (MDR) gene was obtained while screening a potato stolon tip cDNA expression library with S-15-labeled calmodulin. The mammalian MDR gene codes for a membrane-bound P-glycoprotein (170-180 kDa) which imparts multidrug resistance to cancerous cells. The potato cDNA (PMDR1) codes for a polypeptide of 1313 amino acid residues (ca. 144 kDa) and its structural features are very similar to the MDR P-glycoprotein. The N-terminal half of the PMDR1-encoded protein shares striking homology with its C-terminal half, and each half contains a conserved ATP-binding site and six putative transmembrane domains. Southern blot analysis indicated that potato has one or two MDR-like genes. PMDR1 mRNA is constitutively expressed in all organs studied with higher expression in the stem and stolon tip. The PMDR1 expression was highest during tuber initiation and decreased during tuber development.
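The approximate mass quoted for the PMDR1 polypeptide (1313 residues, ca. 144 kDa) is consistent with a common rule of thumb of roughly 110 Da per amino acid residue. The Python sketch below uses that average as an explicit assumption; it is not stated in the abstract, and an exact mass would require the actual sequence. The constant and function names are hypothetical.

```python
# Rule-of-thumb average mass of one amino acid residue in a protein.
# This value is an assumption for illustration, not taken from the abstract.
AVG_RESIDUE_MASS_DA = 110.0


def approx_mass_kda(n_residues: int) -> float:
    """Rough protein mass in kDa estimated from residue count alone."""
    if n_residues <= 0:
        raise ValueError("residue count must be positive")
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0


# 1313 residues * 110 Da = 144,430 Da, i.e. about 144 kDa.
pmdr1_kda = approx_mass_kda(1313)
print(round(pmdr1_kda, 2))
```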
Low-coverage, whole-genome sequencing of Artocarpus camansi (Moraceae) for phylogenetic marker development and gene discovery

PubMed Central

Gardner, Elliot M.; Johnson, Matthew G.; Ragone, Diane; Wickett, Norman J.; Zerega, Nyree J. C.

2016-01-01

Premise of the study: We used moderately low-coverage (17×) whole-genome sequencing of Artocarpus camansi (Moraceae) to develop genomic resources for Artocarpus and Moraceae. Methods and Results: A de novo assembly of Illumina short reads (251,378,536 pairs, 2 × 100 bp) accounted for 93% of the predicted genome size. Predicted coding regions were used in a three-way orthology search with published genomes of Morus notabilis and Cannabis sativa. Phylogenetic markers for Moraceae were developed from 333 inferred single-copy exons. Ninety-eight putative MADS-box genes were identified. Analysis of all predicted coding regions resulted in preliminary annotation of 49,089 genes. An analysis of synonymous substitutions for pairs of orthologs (Ks analysis) in M. notabilis and A. camansi strongly suggested a lineage-specific whole-genome duplication in Artocarpus. Conclusions: This study substantially increases the genomic resources available for Artocarpus and Moraceae and demonstrates the value of low-coverage de novo assemblies for nonmodel organisms with moderately large genomes. PMID:27437173
Identification of differentially expressed small non-coding RNAs in the legume endosymbiont Sinorhizobium meliloti by comparative genomics

PubMed Central

del Val, Coral; Rivas, Elena; Torres-Quesada, Omar; Toro, Nicolás; Jiménez-Zurdo, José I

2007-01-01

Bacterial small non-coding RNAs (sRNAs) are being recognized as novel widespread regulators of gene expression in response to environmental signals. Here, we present the first search for sRNA-encoding genes in the nitrogen-fixing endosymbiont Sinorhizobium meliloti, performed by a genome-wide computational analysis of its intergenic regions. Comparative sequence data from eight related α-proteobacteria were obtained, and the interspecies pairwise alignments were scored with the programs eQRNA and RNAz as complementary predictive tools to identify conserved and stable secondary structures corresponding to putative non-coding RNAs. Northern experiments confirmed that eight of the predicted loci, selected among the original 32 candidates as most probable sRNA genes, expressed small transcripts. This result supports the combined use of eQRNA and RNAz as a robust strategy to identify novel sRNAs in bacteria. Furthermore, seven of the transcripts accumulated differentially in free-living and symbiotic conditions. Experimental mapping of the 5′-ends of the detected transcripts revealed that their encoding genes are organized in autonomous transcription units with recognizable promoter and, in most cases, termination signatures. These findings suggest novel regulatory functions for sRNAs related to the interactions of α-proteobacteria with their eukaryotic hosts. PMID:17971083
Complete genome sequence of Enterobacter sp. IIT-BT 08: A potential microbial strain for high rate hydrogen production.

PubMed

Khanna, Namita; Ghosh, Ananta Kumar; Huntemann, Marcel; Deshpande, Shweta; Han, James; Chen, Amy; Kyrpides, Nikos; Mavrommatis, Kostas; Szeto, Ernest; Markowitz, Victor; Ivanova, Natalia; Pagani, Ioanna; Pati, Amrita; Pitluck, Sam; Nolan, Matt; Woyke, Tanja; Teshima, Hazuki; Chertkov, Olga; Daligault, Hajnalka; Davenport, Karen; Gu, Wei; Munk, Christine; Zhang, Xiaojing; Bruce, David; Detter, Chris; Xu, Yan; Quintana, Beverly; Reitenga, Krista; Kunde, Yulia; Green, Lance; Erkkila, Tracy; Han, Cliff; Brambilla, Evelyne-Marie; Lang, Elke; Klenk, Hans-Peter; Goodwin, Lynne; Chain, Patrick; Das, Debabrata

2013-12-20

Enterobacter sp. IIT-BT 08 belongs to Phylum: Proteobacteria, Class: Gammaproteobacteria, Order: Enterobacteriales, Family: Enterobacteriaceae. The organism was isolated from the leaves of a local plant near the Kharagpur railway station, Kharagpur, West Bengal, India. It has been extensively studied for fermentative hydrogen production because of its high hydrogen yield. For further enhancement of hydrogen production by strain development, complete genome sequence analysis was carried out. Sequence analysis revealed that the genome was linear, 4.67 Mbp long and had a GC content of 56.01%. The genome encodes 4,393 protein-coding and 179 RNA genes. Additionally, a putative pathway of hydrogen production was suggested based on the presence of formate hydrogen lyase complex and other related genes identified in the genome. Thus, in the present study we describe the specific properties of the organism and the generation, annotation and analysis of its genome sequence, and discuss the putative pathway of hydrogen production by this organism.
An open reading frame in intron seven of the sea urchin DNA-methyltransferase gene codes for a functional AP1 endonuclease.

PubMed

Cioffi, Anna Valentina; Ferrara, Diana; Cubellis, Maria Vittoria; Aniello, Francesco; Corrado, Marcella; Liguori, Francesca; Amoroso, Alessandro; Fucci, Laura; Branno, Margherita

2002-08-01

Analysis of the genome structure of the Paracentrotus lividus (sea urchin) DNA methyltransferase (DNA MTase) gene showed the presence of an open reading frame, named METEX, in intron 7 of the gene. METEX expression is developmentally regulated, showing no correlation with DNA MTase expression. In fact, DNA MTase transcripts are present at high concentrations in the early developmental stages, while METEX is expressed at late stages of development. Two METEX cDNA clones (Met1 and Met2) that are different in the 3' end have been isolated in a cDNA library screening. The putative translated protein from Met2 cDNA clone showed similarity with Escherichia coli endonuclease III on the basis of sequence and predictive three-dimensional structure. The protein, overexpressed in E. coli and purified, had functional properties similar to the endonuclease specific for apurinic/apyrimidinic (AP) sites on the basis of the lyase activity. Therefore the open reading frame, present in intron 7 of the P. lividus DNA MTase gene, codes for a functional AP endonuclease designated SuAP1.
Palindromic Genes in the Linear Mitochondrial Genome of the Nonphotosynthetic Green Alga Polytomella magna

PubMed Central

Smith, David Roy; Hua, Jimeng; Archibald, John M.; Lee, Robert W.

2013-01-01

Organelle DNA is no stranger to palindromic repeats. But never has a mitochondrial or plastid genome been described in which every coding region is part of a distinct palindromic unit. While sequencing the mitochondrial DNA of the nonphotosynthetic green alga Polytomella magna, we uncovered precisely this type of genic arrangement. The P. magna mitochondrial genome is linear and made up entirely of palindromes, each containing 1–7 unique coding regions. Consequently, every gene in the genome is duplicated and in an inverted orientation relative to its partner. And when these palindromic genes are folded into putative stem-loops, their predicted translational start sites are often positioned in the apex of the loop. Gel electrophoresis results support the linear, 28-kb monomeric conformation of the P. magna mitochondrial genome. Analyses of other Polytomella taxa suggest that palindromic mitochondrial genes were present in the ancestor of the Polytomella lineage and lost or retained to various degrees in extant species. The possible origins and consequences of this bizarre genomic architecture are discussed. PMID:23940100
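The palindromic arrangement described above, in which each unit carries a gene and its inverted duplicate, can be illustrated with a short Python sketch that tests whether the two arms of a sequence are reverse complements of each other. The arm-loop-arm model, the function names and the toy sequence are hypothetical simplifications for illustration; they are not the authors' method and are not taken from the P. magna genome.

```python
# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def is_inverted_repeat_unit(seq: str, arm_len: int) -> bool:
    """True if the first arm_len bases equal the reverse complement
    of the last arm_len bases, i.e. the arms could pair as a stem."""
    seq = seq.upper()
    if arm_len <= 0 or 2 * arm_len > len(seq):
        raise ValueError("arm_len must be positive and fit twice in seq")
    return seq[:arm_len] == reverse_complement(seq[-arm_len:])


# Hypothetical toy unit: left arm + loop + right arm (inverted copy of left arm).
toy_unit = "ATGGCC" + "TTTT" + "GGCCAT"
print(is_inverted_repeat_unit(toy_unit, 6))
```

In this toy unit the two 6-base arms can fold back on each other as a stem, leaving the 4-base spacer as the loop.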
MiR-34a regulates the invasive capacity of canine osteosarcoma cell lines

PubMed Central

Lopez, Cecilia M.; Yu, Peter Y.; Zhang, Xiaoli; Yilmaz, Ayse Selen; London, Cheryl A.

2018-01-01

Background: Osteosarcoma (OSA) is the most common bone tumor in children and dogs; however, no substantial improvement in clinical outcome has occurred in either species over the past 30 years. MicroRNAs (miRNAs) are small non-coding RNAs that regulate gene expression and play a fundamental role in cancer. The purpose of this study was to investigate the potential contribution of miR-34a loss to the biology of canine OSA, a well-established spontaneous model of the human disease. Methodology and principal findings: RT-qPCR demonstrated that miR-34a expression levels were significantly reduced in primary canine OSA tumors and canine OSA cell lines as compared to normal canine osteoblasts. In canine OSA cell lines stably transduced with empty vector or pre-miR-34a lentiviral constructs, overexpression of miR-34a inhibited cellular invasion and migration but had no effect on cell proliferation or cell cycle distribution. Transcriptional profiling of canine OSA8 cells possessing enforced miR-34a expression demonstrated dysregulation of numerous genes, including significant down-regulation of multiple putative targets of miR-34a. Moreover, gene ontology analysis of down-regulated miR-34a target genes showed enrichment of several biological processes related to cell invasion and motility. Lastly, we validated changes in miR-34a putative target gene expression, including decreased expression of KLF4, SEM3A, and VEGFA transcripts in canine OSA cells overexpressing miR-34a and identified KLF4 and VEGFA as direct target genes of miR-34a. Concordant with these data, primary canine OSA tumor tissues demonstrated increased expression levels of putative miR-34a target genes. Conclusions: These data demonstrate that miR-34a contributes to invasion and migration in canine OSA cells and suggest that loss of miR-34a may promote a pattern of gene expression contributing to the metastatic phenotype in canine OSA. PMID:29293555
MiR-34a regulates the invasive capacity of canine osteosarcoma cell lines.

PubMed

Lopez, Cecilia M; Yu, Peter Y; Zhang, Xiaoli; Yilmaz, Ayse Selen; London, Cheryl A; Fenger, Joelle M

2018-01-01

Osteosarcoma (OSA) is the most common bone tumor in children and dogs; however, no substantial improvement in clinical outcome has occurred in either species over the past 30 years. MicroRNAs (miRNAs) are small non-coding RNAs that regulate gene expression and play a fundamental role in cancer. The purpose of this study was to investigate the potential contribution of miR-34a loss to the biology of canine OSA, a well-established spontaneous model of the human disease. RT-qPCR demonstrated that miR-34a expression levels were significantly reduced in primary canine OSA tumors and canine OSA cell lines as compared to normal canine osteoblasts. In canine OSA cell lines stably transduced with empty vector or pre-miR-34a lentiviral constructs, overexpression of miR-34a inhibited cellular invasion and migration but had no effect on cell proliferation or cell cycle distribution. Transcriptional profiling of canine OSA8 cells possessing enforced miR-34a expression demonstrated dysregulation of numerous genes, including significant down-regulation of multiple putative targets of miR-34a. Moreover, gene ontology analysis of down-regulated miR-34a target genes showed enrichment of several biological processes related to cell invasion and motility. Lastly, we validated changes in miR-34a putative target gene expression, including decreased expression of KLF4, SEM3A, and VEGFA transcripts in canine OSA cells overexpressing miR-34a and identified KLF4 and VEGFA as direct target genes of miR-34a. Concordant with these data, primary canine OSA tumor tissues demonstrated increased expression levels of putative miR-34a target genes. These data demonstrate that miR-34a contributes to invasion and migration in canine OSA cells and suggest that loss of miR-34a may promote a pattern of gene expression contributing to the metastatic phenotype in canine OSA.
Accumulation of multiple mutations in linezolid-resistant Staphylococcus epidermidis causing bloodstream infections; in silico analysis of L3 amino acid substitutions that might confer high-level linezolid resistance.

PubMed

Ikonomidis, Alexandros; Grapsa, Anastasia; Pavlioglou, Charikleia; Demiri, Antonia; Batarli, Alexandra; Panopoulou, Maria

2016-12-01

Fifty-six Staphylococcus epidermidis clinical isolates, showing high-level linezolid resistance and causing bacteremia in critically ill patients, were studied. All isolates belonged to the ST22 clone and carried the T2504A and C2534T mutations in the gene coding for 23S rRNA as well as the C189A, G208A, C209T and G384C missense mutations in L3 protein which resulted in Asp159Tyr, Gly152Asp and Leu94Val substitutions. Other silent mutations were also detected in genes coding for ribosomal proteins L3 and L22. In silico analysis of missense mutations showed that although L3 protein retained the sequence of secondary motifs, the tertiary structure was influenced. The observed alteration in L3 protein folding provides an indication of the putative role of L3-coding gene mutations in high-level linezolid resistance. Furthermore, linezolid pressure in health care settings where linezolid consumption is high might lead to the selection of resistant mutants possessing L3 mutations that might confer high-level linezolid resistance.
A new polymorphic and multicopy MHC gene family related to nonmammalian class I

DOE Office of Scientific and Technical Information (OSTI.GOV)

Leelayuwat, C.; Degli-Esposti, M.A.; Abraham, L.J.

1994-12-31

The authors have used genomic analysis to characterize a region of the central major histocompatibility complex (MHC) spanning approximately 300 kilobases (kb) between TNF and HLA-B. This region has been suggested to carry genetic factors relevant to the development of autoimmune diseases such as myasthenia gravis (MG) and insulin dependent diabetes mellitus (IDDM). Genomic sequence was analyzed for coding potential, using two neural network programs, GRAIL and GeneParser. A genomic probe, JAB, containing putative coding sequences (PERB11) located 60 kb centromeric of HLA-B, was used for northern analysis of human tissues. Multiple transcripts were detected. Southern analysis of genomic DNA and overlapping YAC clones, covering the region from BAT1 to HLA-F, indicated that there are at least five copies of PERB11, four of which are located within this region of the MHC. The partial cDNA sequence of PERB11 was obtained from poly-A RNA derived from skeletal muscle. The putative amino acid sequence of PERB11 shares approximately 30% identity to MHC class I molecules from various species, including reptiles, chickens, and frogs, as well as to other MHC class I-like molecules, such as the IgG FcR of the mouse and rat and the human Zn-α2-glycoprotein. From direct comparison of amino acid sequences, it is concluded that PERB11 is a distinct molecule more closely related to nonmammalian than known mammalian MHC class I molecules. Genomic sequence analysis of PERB11 from five MHC ancestral haplotypes (AH) indicated that the gene is polymorphic at both DNA and protein level. The results suggest that the authors have identified a novel polymorphic gene family with multiple copies within the MHC. 48 refs., 10 figs., 2 tabs.
Genome-Wide Classification and Evolutionary and Expression Analyses of Citrus MYB Transcription Factor Families in Sweet Orange

PubMed Central

Hou, Xiao-Jin; Li, Si-Bei; Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi

2014-01-01

MYB family genes are widely distributed in plants and comprise one of the largest transcription factor families involved in various developmental processes and defense responses of plants. To date, few MYB genes and little expression profiling have been reported for citrus. Here, we describe and classify 177 members of the sweet orange MYB gene (CsMYB) family in terms of their genomic gene structures and similarity to their putative Arabidopsis orthologs. According to these analyses, these CsMYBs were categorized into four groups (4R-MYB, 3R-MYB, 2R-MYB and 1R-MYB). Gene structure analysis revealed that 1R-MYB genes possess relatively more introns as compared with 2R-MYB genes. Investigation of their chromosomal localizations revealed that these CsMYBs are distributed across nine chromosomes. Sweet orange includes a relatively small number of MYB genes compared with the 198 members in Arabidopsis, presumably due to a paralog reduction related to repetitive sequence insertion into promoter and non-coding transcribed regions of the genes. Comparative studies of CsMYBs and Arabidopsis showed that CsMYBs had fewer gene duplication events. Expression analysis revealed that the MYB gene family has a wide expression profile in sweet orange development and plays important roles in development and stress responses. In addition, 337 new putative microsatellites with flanking sequences sufficient for primer design were also identified from the 177 CsMYBs. These results provide a useful reference for the selection of candidate MYB genes for cloning and further functional analysis for citrus. PMID:25375352
Enzymes involved in the anaerobic degradation of ortho-phthalate by the nitrate-reducing bacterium Azoarcus sp. strain PA01.

PubMed

Junghare, Madan; Spiteller, Dieter; Schink, Bernhard

2016-09-01

The pathway of anaerobic degradation of o-phthalate was studied in the nitrate-reducing bacterium Azoarcus sp. strain PA01. Differential two-dimensional protein gel profiling allowed the identification of specifically induced proteins in o-phthalate-grown compared to benzoate-grown cells. The genes encoding o-phthalate-induced proteins were found in a 9.9 kb gene cluster in the genome of Azoarcus sp. strain PA01. The o-phthalate-induced gene cluster codes for proteins homologous to a dicarboxylic acid transporter, putative CoA-transferases and a UbiD-like decarboxylase that were assigned to be specifically involved in the initial steps of anaerobic o-phthalate degradation. We propose that o-phthalate is first activated to o-phthalyl-CoA by a putative succinyl-CoA-dependent succinyl-CoA:o-phthalate CoA-transferase, and o-phthalyl-CoA is subsequently decarboxylated to benzoyl-CoA by a putative o-phthalyl-CoA decarboxylase. Results from in vitro enzyme assays with cell-free extracts of o-phthalate-grown cells demonstrated the formation of o-phthalyl-CoA from o-phthalate and succinyl-CoA as CoA donor, and its subsequent decarboxylation to benzoyl-CoA. The putative succinyl-CoA:o-phthalate CoA-transferase showed high substrate specificity for o-phthalate and did not accept isophthalate, terephthalate or 3-fluoro-o-phthalate whereas the putative o-phthalyl-CoA decarboxylase converted fluoro-o-phthalyl-CoA to fluoro-benzoyl-CoA. No decarboxylase activity was observed with isophthalyl-CoA or terephthalyl-CoA. Both enzyme activities were oxygen-insensitive and inducible only after growth with o-phthalate. Further degradation of benzoyl-CoA proceeds analogous to the well-established anaerobic benzoyl-CoA degradation pathway of nitrate-reducing bacteria. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.
The unique genomic landscape surrounding the EPSPS gene in glyphosate resistant Amaranthus palmeri: a repetitive path to resistance.

PubMed

Molin, William T; Wright, Alice A; Lawton-Rauh, Amy; Saski, Christopher A

2017-01-17

The expanding number and global distributions of herbicide resistant weedy species threaten food, fuel, fiber and bioproduct sustainability and agroecosystem longevity. Amongst the most competitive weeds, Amaranthus palmeri S. Wats has rapidly evolved resistance to glyphosate primarily through massive amplification and insertion of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene across the genome. Increased EPSPS gene copy numbers result in higher titers of the EPSPS enzyme, the target of glyphosate, and confer resistance to glyphosate treatment. To understand the genomic unit and mechanism of EPSPS gene copy number proliferation, we developed and used a bacterial artificial chromosome (BAC) library from a highly resistant biotype to sequence the local genomic landscape flanking the EPSPS gene. By sequencing overlapping BACs, a 297 kb sequence was generated, hereafter referred to as the "EPSPS cassette." This region included several putative genes, dense clusters of tandem and inverted repeats, putative helitron and autonomous replication sequences, and regulatory elements. Whole genome shotgun sequencing (WGS) of two biotypes exhibiting high and no resistance to glyphosate was performed to compare genomic representation across the EPSPS cassette. Mapping of sequences for both biotypes to the reference EPSPS cassette revealed significant differences in upstream and downstream sequences relative to EPSPS with regard to both repetitive units and coding content between these biotypes. The differences in sequence may have resulted from a compounded-building mechanism such as repetitive transpositional events. The association of putative helitron sequences with the cassette suggests a possible amplification and distribution mechanism. Flow cytometry revealed that the EPSPS cassette added measurable genomic content.
The adoption of glyphosate resistant cropping systems in major crops such as corn, soybean, cotton and canola coupled with excessive use of glyphosate herbicide has led to evolved glyphosate resistance in several important weeds. In Amaranthus palmeri, the amplification of the EPSPS cassette, characterized by a complex array of repetitive elements and putative helitron sequences, suggests an adaptive structural genomic mechanism that drives amplification and distribution around the genome. The added genomic content not found in glyphosate sensitive plants may be driving evolution through genome expansion.
Identification of an Na(+)-dependent transporter associated with saxitoxin-producing strains of the cyanobacterium Anabaena circinalis.

PubMed

Pomati, Francesco; Burns, Brendan P; Neilan, Brett A

2004-08-01

Blooms of the freshwater cyanobacterium Anabaena circinalis are recognized as an important health risk worldwide due to the production of a range of toxins such as saxitoxin (STX) and its derivatives. In this study we used HIP1 octameric-palindrome repeated-sequence PCR to compare the genomic structure of phylogenetically similar Australian isolates of A. circinalis. STX-producing and nontoxic cyanobacterial strains showed different HIP1 (highly iterated octameric palindrome 1) DNA patterns, and characteristic interrepeat amplicons for each group were identified. Suppression subtractive hybridization (SSH) was performed using HIP1 PCR-generated libraries to further identify toxic-strain-specific genes. An STX-producing strain and a nontoxic strain of A. circinalis were chosen as testers in two distinct experiments. The two categories of SSH putative tester-specific sequences were characterized by different families of encoded proteins that may be representative of the differences in metabolism between STX-producing and nontoxic A. circinalis strains. DNA-microarray hybridization and genomic screening revealed a toxic-strain-specific HIP1 fragment coding for a putative Na(+)-dependent transporter. Analysis of this gene demonstrated analogy to the mrpF gene of Bacillus subtilis, whose encoded protein is involved in Na(+)-specific pH homeostasis. The application of this gene as a molecular probe in laboratory and environmental screening for STX-producing A. circinalis strains was demonstrated. The possible role of this putative Na(+)-dependent transporter in the toxic cyanobacterial phenotype is also discussed, in light of recent physiological studies of STX-producing cyanobacteria.
The putative protein methyltransferase LAE1 controls cellulase gene expression in Trichoderma reesei

PubMed Central

Seiboth, Bernhard; Karimi, Razieh Aghcheh; Phatale, Pallavi A; Linke, Rita; Hartl, Lukas; Sauer, Dominik G; Smith, Kristina M; Baker, Scott E; Freitag, Michael; Kubicek, Christian P

2012-01-01

Summary: Trichoderma reesei is an industrial producer of enzymes that degrade lignocellulosic polysaccharides to soluble monomers, which can be fermented to biofuels. Here we show that the expression of genes for lignocellulose degradation is controlled by the orthologous T. reesei protein methyltransferase LAE1. In a lae1 deletion mutant we observed a complete loss of expression of all seven cellulases; auxiliary factors for cellulose degradation, β-glucosidases and xylanases were also no longer expressed. Conversely, enhanced expression of lae1 resulted in significantly increased cellulase gene transcription. Lae1-modulated cellulase gene expression was dependent on the function of the general cellulase regulator XYR1, but xyr1 expression was also LAE1-dependent. LAE1 was also essential for conidiation of T. reesei. Chromatin immunoprecipitation followed by high-throughput sequencing (‘ChIP-seq’) showed that lae1 expression was not obviously correlated with H3K4 di- or trimethylation (indicative of active transcription) or H3K9 trimethylation (typical for heterochromatin regions) in CAZyme coding regions, suggesting that LAE1 does not affect CAZyme gene expression by directly modulating H3K4 or H3K9 methylation. Our data demonstrate that the putative protein methyltransferase LAE1 is essential for cellulase gene expression in T. reesei through mechanisms that remain to be identified. PMID:22554051
Transcriptional and functional studies of Acidithiobacillus ferrooxidans genes related to survival in the presence of copper.

PubMed

Navarro, Claudio A; Orellana, Luis H; Mauriaca, Cecilia; Jerez, Carlos A

2009-10-01

The acidophilic Acidithiobacillus ferrooxidans can resist exceptionally high copper (Cu) concentrations. This property is important for its use in biomining processes, where Cu and other metal levels range usually between 15 and 100 mM. To learn about the mechanisms that allow A. ferrooxidans cells to survive in this environment, a bioinformatic search of its genome showed the presence of at least 10 genes that are possibly related to Cu homeostasis. Among them are three genes coding for putative ATPases related to the transport of Cu (A. ferrooxidans copA1 [copA1(Af)], copA2(Af), and copB(Af)), three genes related to a system of the resistance nodulation cell division family involved in the extraction of Cu from the cell (cusA(Af), cusB(Af), and cusC(Af)), and two genes coding for periplasmic chaperones for this metal (cusF(Af) and copC(Af)). The expression of most of these open reading frames was studied by real-time reverse transcriptase PCR using A. ferrooxidans cells adapted for growth in the presence of high concentrations of Cu. The putative A. ferrooxidans Cu resistance determinants were found to be upregulated when this bacterium was exposed to Cu in the range of 5 to 25 mM. These A. ferrooxidans genes conferred to Escherichia coli a greater Cu resistance than wild-type cells, supporting their functionality. The results reported here and previously published data strongly suggest that the high resistance of the extremophilic A. ferrooxidans to Cu may be due to part or all of the following key elements: (i) a wide repertoire of Cu resistance determinants, (ii) the duplication of some of these Cu resistance determinants, (iii) the existence of novel Cu chaperones, and (iv) a polyP-based Cu resistance system.
Reranking candidate gene models with cross-species comparison for improved gene prediction

PubMed Central

Liu, Qian; Crammer, Koby; Pereira, Fernando CN; Roos, David S

2008-01-01

Background: Most gene finders score candidate gene models with state-based methods, typically HMMs, by combining local properties (coding potential, splice donor and acceptor patterns, etc). Competing models with similar state-based scores may be distinguishable with additional information. In particular, functional and comparative genomics datasets may help to select among competing models of comparable probability by exploiting features likely to be associated with the correct gene models, such as conserved exon/intron structure or protein sequence features. Results: We have investigated the utility of a simple post-processing step for selecting among a set of alternative gene models, using global scoring rules to rerank competing models for more accurate prediction. For each gene locus, we first generate the K best candidate gene models using the gene finder Evigan, and then rerank these models using comparisons with putative orthologous genes from closely-related species. Candidate gene models with lower scores in the original gene finder may be selected if they exhibit strong similarity to probable orthologs in coding sequence, splice site location, or signal peptide occurrence. Experiments on Drosophila melanogaster demonstrate that reranking based on cross-species comparison outperforms the best gene models identified by Evigan alone, and also outperforms the comparative gene finders GeneWise and Augustus+. Conclusion: Reranking gene models with cross-species comparison improves gene prediction accuracy. This straightforward method can be readily adapted to incorporate additional lines of evidence, as it requires only a ranked source of candidate gene models. PMID:18854050
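The reranking step described in the abstract above can be illustrated with a minimal Python sketch. This is not the authors' scoring rule or Evigan's interface: the `CandidateModel` fields, the `WEIGHTS` values, and the toy candidates are all hypothetical, chosen only to show how a candidate with a lower gene-finder score can be promoted when it agrees better with a putative ortholog.

```python
from dataclasses import dataclass


@dataclass
class CandidateModel:
    """One of the K best gene models at a locus (all fields are illustrative)."""
    name: str
    finder_score: float           # score from the original gene finder, higher is better
    coding_similarity: float      # 0..1 similarity to the ortholog's coding sequence
    splice_site_agreement: float  # 0..1 fraction of splice sites shared with the ortholog
    signal_peptide_match: bool    # signal peptide occurrence agrees with the ortholog


# Hypothetical weights; a real system would learn or tune these.
WEIGHTS = {"finder": 1.0, "coding": 2.0, "splice": 1.5, "signal": 0.5}


def rerank_score(model, weights=WEIGHTS):
    """Combine the gene-finder score with cross-species evidence into one global score."""
    return (
        weights["finder"] * model.finder_score
        + weights["coding"] * model.coding_similarity
        + weights["splice"] * model.splice_site_agreement
        + weights["signal"] * (1.0 if model.signal_peptide_match else 0.0)
    )


def rerank(candidates, weights=WEIGHTS):
    """Return the candidates sorted from best to worst combined score."""
    return sorted(candidates, key=lambda m: rerank_score(m, weights), reverse=True)


# Toy K-best list: model_1 has the top gene-finder score but weak ortholog support.
k_best = [
    CandidateModel("model_1", 0.90, 0.40, 0.50, False),
    CandidateModel("model_2", 0.85, 0.95, 1.00, True),
    CandidateModel("model_3", 0.60, 0.70, 0.75, True),
]

reranked_names = [m.name for m in rerank(k_best)]
print(reranked_names)
```

With these made-up weights, model_2 moves ahead of model_1 because its ortholog agreement outweighs the small difference in gene-finder score.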
Pre-Bilaterian Origins of the Hox Cluster and the Hox Code: Evidence from the Sea Anemone, Nematostella vectensis

PubMed Central

Ryan, Joseph F.; Mazza, Maureen E.; Pang, Kevin; Matus, David Q.; Baxevanis, Andreas D.; Martindale, Mark Q.; Finnerty, John R.

2007-01-01

Background: Hox genes were critical to many morphological innovations of bilaterian animals. However, early Hox evolution remains obscure. Phylogenetic, developmental, and genomic analyses on the cnidarian sea anemone Nematostella vectensis challenge recent claims that the Hox code is a bilaterian invention and that no “true” Hox genes exist in the phylum Cnidaria. Methodology/Principal Findings: Phylogenetic analyses of 18 Hox-related genes from Nematostella identify putative Hox1, Hox2, and Hox9+ genes. Statistical comparisons among competing hypotheses bolster these findings, including an explicit consideration of the gene losses implied by alternate topologies. In situ hybridization studies of 20 Hox-related genes reveal that multiple Hox genes are expressed in distinct regions along the primary body axis, supporting the existence of a pre-bilaterian Hox code. Additionally, several Hox genes are expressed in nested domains along the secondary body axis, suggesting a role in “dorsoventral” patterning. Conclusions/Significance: A cluster of anterior and posterior Hox genes, as well as a ParaHox cluster of genes, evolved prior to the cnidarian-bilaterian split. There is evidence to suggest that these clusters were formed from a series of tandem gene duplication events and played a role in patterning both the primary and secondary body axes in a bilaterally symmetrical common ancestor. Cnidarians and bilaterians shared a common ancestor some 570 to 700 million years ago, and as such, are derived from a common body plan. Our work reveals several conserved genetic components that are found in both of these diverse lineages. This finding is consistent with the hypothesis that a set of developmental rules established in the common ancestor of cnidarians and bilaterians is still at work today. PMID:17252055

Real-time multiplex PCR assay for detection of Yersinia pestis and Yersinia pseudotuberculosis.

PubMed

Matero, Pirjo; Pasanen, Tanja; Laukkanen, Riikka; Tissari, Päivi; Tarkka, Eveliina; Vaara, Martti; Skurnik, Mikael

2009-01-01

A multiplex real-time polymerase chain reaction (PCR) assay was developed for the detection of Yersinia pestis and Yersinia pseudotuberculosis. The assay includes four primer pairs, two of which are specific for Y. pestis, one for Y. pestis and Y. pseudotuberculosis and one for bacteriophage lambda; the latter was used as an internal amplification control. The Y. pestis-specific target genes in the assay were ypo2088, a gene coding for a putative methyltransferase, and the pla gene coding for the plasminogen activator. In addition, the wzz gene was used as a target to specifically identify both Y. pestis and the closely related Y. pseudotuberculosis group. The primer and probe sets described for the different genes can be used either in single or in multiplex PCR assays because the individual probes were designed with different fluorochromes. The assays were found to be both sensitive and specific; the lower limit of the detection was 10-100 fg of extracted Y. pestis or Y. pseudotuberculosis total DNA. The sensitivity of the tetraplex assay was determined to be 1 cfu for the ypo2088 and pla probe labelled with FAM and JOE fluorescent dyes, respectively.
Complete mitochondrial genome of the Asian paddle crab Charybdis japonica (Crustacea: Decapoda: Portunidae): gene rearrangement of the marine brachyurans and phylogenetic considerations of the decapods.

PubMed

Liu, Yuan; Cui, Zhaoxia

2010-06-01

Given the commercial and ecological importance of the Asian paddle crab, Charybdis japonica, there is a clear need for genetic and molecular research on this species. Here, we present the complete mitochondrial genome sequence of C. japonica, determined by the long-polymerase chain reaction and primer walking sequencing method. The entire genome is 15,738 bp in length, encoding a standard set of 13 protein-coding genes, two ribosomal RNA genes, and 22 transfer RNA genes, plus the putative control region, which is typical for metazoans. The total A+T content of the genome is 69.2%, lower than that of the other brachyuran crabs except for Callinectes sapidus. The gene order is identical to that of the published marine brachyurans and differs from the ancestral pancrustacean order only in the position of the tRNA(His) gene. Phylogenetic analyses using the concatenated nucleotide and amino acid sequences of 13 protein-coding genes strongly support the monophyly of Dendrobranchiata and Pleocyemata, which is consistent with the previous taxonomic classification. However, the systematic status of Charybdis within subfamily Thalamitinae of family Portunidae is not supported. C. japonica, as the first species of Charybdis with complete mitochondrial genome available, will provide important information on both genomics and molecular ecology of the group.
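As a brief illustration of the A+T content statistic quoted in the abstract above, the following Python sketch computes the percentage of A and T bases in a nucleotide string and tallies the gene counts given in the text. The function name `at_content` and the toy sequences are hypothetical; the actual 15,738 bp genome sequence is not reproduced here.

```python
def at_content(sequence: str) -> float:
    """Return the A+T percentage of a DNA sequence, ignoring non-ACGT symbols."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    at_count = sum(1 for b in bases if b in "AT")
    return 100.0 * at_count / len(bases)


# Toy sequences only, not taken from the C. japonica genome.
balanced = at_content("ATGC")     # two of four bases are A or T
at_only = at_content("aattNN")    # lower case is accepted, ambiguous bases skipped

# Gene tally from the abstract: protein-coding + rRNA + tRNA genes.
total_genes = 13 + 2 + 22

print(balanced, at_only, total_genes)
```

Applied to a full mitochondrial genome, the same calculation gives the kind of whole-genome figure reported in the abstract (69.2%).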
Evidence of cellulose metabolism by the giant panda gut microbiome.

PubMed

Zhu, Lifeng; Wu, Qi; Dai, Jiayin; Zhang, Shanning; Wei, Fuwen

2011-10-25

The giant panda genome codes for all necessary enzymes associated with a carnivorous digestive system but lacks genes for enzymes needed to digest cellulose, the principal component of their bamboo diet. It has been posited that this iconic species must therefore possess microbial symbionts capable of metabolizing cellulose, but these symbionts have remained undetected. Here we examined 5,522 prokaryotic ribosomal RNA gene sequences in wild and captive giant panda fecal samples. We found lower species richness of the panda microbiome than of mammalian microbiomes for herbivores and nonherbivorous carnivores. We detected 13 operational taxonomic units closely related to Clostridium groups I and XIVa, both of which contain taxa known to digest cellulose. Seven of these 13 operational taxonomic units were unique to pandas compared with other mammals. Metagenomic analysis using ~37-Mbp contig sequences from gut microbes recovered putative genes coding for two cellulose-digesting enzymes and one hemicellulose-digesting enzyme (cellulase, β-glucosidase, and xylan 1,4-β-xylosidase) in Clostridium group I. Comparing glycoside hydrolase profiles of pandas with those of herbivores and omnivores, we found a moderate abundance of oligosaccharide-degrading enzymes for pandas (36%), close to that for humans (37%), and the lowest abundance of cellulases and endohemicellulases (2%), which may reflect low digestibility of cellulose and hemicellulose in the panda's unique bamboo diet. The presence of putative cellulose-digesting microbes, in combination with adaptations related to feeding, physiology, and morphology, shows that giant pandas have evolved a number of traits to overcome the anatomical and physiological challenge of digesting a diet high in fibrous matter.
Draft Genome Sequences of Two Bacillus thuringiensis Strains and Characterization of a Putative 41.9-kDa Insecticidal Toxin

PubMed Central

Palma, Leopoldo; Muñoz, Delia; Berry, Colin; Murillo, Jesús; Caballero, Primitivo

2014-01-01

In this work, we report the genome sequencing of two Bacillus thuringiensis strains using Illumina next-generation sequencing technology (NGS). Strain Hu4-2, toxic to many lepidopteran pest species and to some mosquitoes, encoded genes for two insecticidal crystal (Cry) proteins, cry1Ia and cry9Ea, and a vegetative insecticidal protein (Vip) gene, vip3Ca2. Strain Leapi01 contained genes coding for seven Cry proteins (cry1Aa, cry1Ca, cry1Da, cry2Ab, cry9Ea and two cry1Ia gene variants) and a vip3 gene (vip3Aa10). A putative novel insecticidal protein gene 1143 bp long was found in both strains, whose sequences exhibited 100% nucleotide identity. The predicted protein showed 57 and 100% pairwise identity to protein sequence 72 from a patented Bt strain (US8318900) and to a putative 41.9-kDa insecticidal toxin from Bacillus cereus, respectively. The 41.9-kDa protein, containing a C-terminal 6× HisTag fusion, was expressed in Escherichia coli and tested for the first time against four lepidopteran species (Mamestra brassicae, Ostrinia nubilalis, Spodoptera frugiperda and S. littoralis) and the green-peach aphid Myzus persicae at doses as high as 4.8 µg/cm² and 1.5 mg/mL, respectively. At these protein concentrations, the recombinant 41.9-kDa protein caused no mortality or symptoms of impaired growth against any of the insects tested, suggesting that these species are outside the protein’s target range or that the protein may not, in fact, be toxic. While the use of the polymerase chain reaction has allowed a significant increase in the number of Bt insecticidal genes characterized to date, novel NGS technologies promise much faster, cheaper and more efficient screening of Bt pesticidal proteins. PMID:24784323
Isolation and sequencing of the gene encoding Sp23, a structural protein of spermatophore of the mealworm beetle, Tenebrio molitor.

PubMed

Feng, X; Happ, G M

1996-11-14

The cDNA for Sp23, a structural protein of the spermatophore of Tenebrio molitor, had been previously cloned and characterized (Paesen, G.C., Schwartz, M.B., Peferoen, M., Weyda, F. and Happ, G.M. (1992a) Amino acid sequence of Sp23, a structure protein of the spermatophore of the mealworm beetle, Tenebrio molitor. J. Biol. Chem. 257, 18852-18857). Using the labeled cDNA for Sp23 as a probe to screen a library of genomic DNA from Tenebrio molitor, we isolated a genomic clone for Sp23. A 5373-base pair (bp) restriction fragment containing the Sp23 gene was sequenced. The coding region is separated by a 55-bp intron which is located close to the translation start site. Three putative ecdysone response elements (EcRE) are identified in the 5' flanking region of the Sp23 gene. Comparison of the flanking regions of the Sp23 gene with those of the D-protein gene expressed in the accessory glands of Tenebrio reveals similar sequences present in the flanking regions of the two genes. The genomic organization of the coding region of the Sp23 gene shares similarities with that of the D-protein gene, three Drosophila accessory gland genes and two Drosophila 20-OH ecdysone-responsive genes.
Putative Nonribosomal Peptide Synthetase and Cytochrome P450 Genes Responsible for Tentoxin Biosynthesis in Alternaria alternata ZJ33

PubMed Central

Li, You-Hai; Han, Wen-Jin; Gui, Xi-Wu; Wei, Tao; Tang, Shuang-Yan; Jin, Jian-Ming

2016-01-01

Tentoxin, a cyclic tetrapeptide produced by several Alternaria species, inhibits the F1-ATPase activity of chloroplasts, resulting in chlorosis in sensitive plants. In this study, we report two clustered genes, encoding a putative nonribosomal peptide synthetase (NRPS) TES and a cytochrome P450 protein TES1, that are required for tentoxin biosynthesis in Alternaria alternata strain ZJ33, which was isolated from blighted leaves of Eupatorium adenophorum. Using a pair of primers designed according to the consensus sequences of the adenylation domain of NRPSs, two fragments containing putative adenylation domains were amplified from A. alternata ZJ33, and subsequent PCR analyses demonstrated that these fragments belonged to the same NRPS coding sequence. With no introns, TES consists of a single 15,486 base pair open reading frame encoding a predicted 5161 amino acid protein. Meanwhile, the TES1 gene is predicted to contain five introns and encode a 506 amino acid protein. The TES protein is predicted to comprise four peptide synthase modules with two additional N-methylation domains, and the number and arrangement of the modules in TES were consistent with the number and arrangement of the amino acid residues of tentoxin. Notably, both TES and TES1 null mutants generated via homologous recombination failed to produce tentoxin. This study provides the first evidence concerning the biosynthesis of tentoxin in A. alternata. PMID:27490569
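The relationship between the TES open reading frame length and the predicted protein length quoted above can be checked with a few lines of Python. This is a worked arithmetic sketch, assuming the reported 15,486 bp includes the terminal stop codon (an assumption, since the abstract does not say so explicitly); the function name is hypothetical.

```python
def protein_length_from_orf(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an intron-free ORF of the given length."""
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    # The stop codon occupies one codon but adds no residue (assumed included here).
    return codons - 1 if includes_stop else codons


tes_residues = protein_length_from_orf(15486)
print(tes_residues)  # prints 5161
```

Under that assumption, 15,486 bp corresponds to 5162 codons, one of which is the stop codon, which matches the 5161 residues reported for TES.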
Genome-wide identification and characterization of putative lncRNAs in the diamondback moth, Plutella xylostella (L.).

PubMed

Wang, Yue; Xu, Tingting; He, Weiyi; Shen, Xiujing; Zhao, Qian; Bai, Jianlin; You, Minsheng

2018-01-01

Long non-coding RNAs (lncRNAs) are of particular interest because of their contributions to many biological processes. Here, we present the genome-wide identification and characterization of putative lncRNAs in a global insect pest, the diamondback moth (DBM), Plutella xylostella. A total of 8096 lncRNAs were identified and classified into three groups. The average length of exons in lncRNAs was longer than that in coding genes and the GC content was lower than that in mRNAs. Most lncRNAs were flanked by canonical splice sites, similar to mRNAs. Expression profiling identified 114 differentially expressed lncRNAs during DBM development and found that the majority were temporally specific. While the biological functions of lncRNAs remain uncharacterized, many are microRNA precursors or competing endogenous RNAs involved in microRNA regulatory pathways. This work provides a valuable resource for further studies on the molecular bases of DBM development and lays the foundation for discovery of lncRNA functions in P. xylostella. Copyright © 2017 Elsevier Inc. All rights reserved.
Comprehensive analysis of coding-lncRNA gene co-expression network uncovers conserved functional lncRNAs in zebrafish.

PubMed

Chen, Wen; Zhang, Xuan; Li, Jing; Huang, Shulan; Xiang, Shuanglin; Hu, Xiang; Liu, Changning

2018-05-09

Zebrafish is a well-developed model system for studying developmental processes and human disease. Recent deep sequencing studies have discovered a large number of long non-coding RNAs (lncRNAs) in zebrafish. However, only a few of them have been functionally characterized. How to take advantage of the mature zebrafish system to investigate lncRNA function and conservation in depth is therefore an intriguing question. We systematically collected and analyzed a series of zebrafish RNA-seq datasets, then combined them with resources from known databases and the literature. As a result, we obtained by far the most complete dataset of zebrafish lncRNAs, containing 13,604 lncRNA genes (21,128 transcripts) in total. Based on that, a co-expression network of zebrafish coding and lncRNA genes was constructed and analyzed, and used to predict the Gene Ontology (GO) and KEGG annotation of lncRNAs. Meanwhile, we performed a conservation analysis of zebrafish lncRNAs, identifying 1828 conserved zebrafish lncRNA genes (1890 transcripts) that have putative mammalian orthologs. We also found that zebrafish lncRNAs play important roles in regulating the development and function of the nervous system; these conserved lncRNAs show significant sequence and functional conservation with their mammalian counterparts. By integrative data analysis and construction of a coding-lncRNA gene co-expression network, we obtained the most comprehensive dataset of zebrafish lncRNAs to date, as well as systematic annotations and comprehensive analyses of their function and conservation. Our study provides a reliable zebrafish-based platform to explore lncRNA function and mechanism in depth, as well as the lncRNA commonality between zebrafish and human.
Complete plastid genome of Astragalus mongholicus var. nakaianus (Fabaceae).

PubMed

Choi, In-Su; Kim, Joo-Hwan; Choi, Byoung-Hee

2016-07-01

The first complete plastid genome (plastome) of the largest angiosperm genus, Astragalus, was sequenced for the Korean endangered endemic species A. mongholicus var. nakaianus. Its genome is relatively short (123,633 bp) because it lacks an Inverted Repeat (IR) region. It comprises 110 genes, including four unique rRNAs, 30 tRNAs, and 76 protein-coding genes. Similar to other closely related plastomes, rpl22 and rps16 are absent. The putative pseudogene with abnormal stop codons is atpE. This plastome has no additional inversions when compared with highly variable plastomes from IRLC tribes Fabeae and Trifolieae. Our phylogenetic analysis confirms the non-monophyly of Galegeae.
MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity

PubMed Central

Wang, Yupeng; Tang, Haibao; DeBarry, Jeremy D.; Tan, Xu; Li, Jingping; Wang, Xiyin; Lee, Tae-ho; Jin, Huizhe; Marler, Barry; Guo, Hui; Kissinger, Jessica C.; Paterson, Andrew H.

2012-01-01

MCScan is an algorithm able to scan multiple genomes or subgenomes in order to identify putative homologous chromosomal regions, and align these regions using genes as anchors. The MCScanX toolkit implements an adjusted MCScan algorithm for detection of synteny and collinearity that extends the original software by incorporating 14 utility programs for visualization of results and additional downstream analyses. Applications of MCScanX to several sequenced plant genomes and gene families are shown as examples. MCScanX can be used to effectively analyze chromosome structural changes, and reveal the history of gene family expansions that might contribute to the adaptation of lineages and taxa. An integrated view of various modes of gene duplication can supplement the traditional gene tree analysis in specific families. The source code and documentation of MCScanX are freely available at http://chibba.pgml.uga.edu/mcscan2/. PMID:22217600
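The anchor-based notion of collinearity described above can be sketched as a small dynamic program. This is a simplified illustration and not the MCScanX implementation: it handles only same-orientation chains, and the parameter names and values (`match_score`, `gap_penalty`, `max_gaps`) are assumptions chosen for the example. Anchors are homologous gene pairs given as gene-order indices in two genomes, and the function returns the highest-scoring chain in which indices increase in both genomes.

```python
from typing import List, Tuple

Anchor = Tuple[int, int]  # (gene index in genome A, gene index in genome B)


def best_collinear_chain(anchors: List[Anchor],
                         match_score: int = 50,
                         gap_penalty: int = 1,
                         max_gaps: int = 25) -> List[Anchor]:
    """Return the best-scoring chain of anchors increasing in both genomes."""
    pts = sorted(set(anchors))
    if not pts:
        return []
    score = [match_score] * len(pts)  # best chain score ending at each anchor
    prev = [-1] * len(pts)            # back-pointer for chain reconstruction
    for i, (ai, bi) in enumerate(pts):
        for j in range(i):
            aj, bj = pts[j]
            da, db = ai - aj, bi - bj
            if da <= 0 or db <= 0:
                continue              # not collinear in the forward direction
            gaps = max(da, db) - 1    # intervening genes skipped by this step
            if gaps > max_gaps:
                continue
            cand = score[j] + match_score - gap_penalty * gaps
            if cand > score[i]:
                score[i], prev[i] = cand, j
    end = max(range(len(pts)), key=score.__getitem__)
    chain = []
    while end != -1:
        chain.append(pts[end])
        end = prev[end]
    return chain[::-1]


# Toy anchors: three consecutive pairs, one off-diagonal pair, one near pair.
chain = best_collinear_chain([(1, 1), (2, 2), (3, 3), (4, 10), (5, 4), (9, 2)])
```

A fuller treatment would also chain anchors in reverse orientation and report every chain above a score threshold, not just the single best one.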
dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts

PubMed Central

Vincent, Jonathan; Dai, Zhanwu; Ravel, Catherine; Choulet, Frédéric; Mouzeyar, Said; Bouzidi, M. Fouad; Agier, Marie; Martre, Pierre

2013-01-01

The functional annotation of genes based on sequence homology with genes from model species genomes is time-consuming because it is necessary to mine several unrelated databases. The aim of the present work was to develop a functional annotation database for common wheat Triticum aestivum (L.). The database, named dbWFA, is based on the reference NCBI UniGene set, an expressed gene catalogue built by expressed sequence tag clustering, and on full-length coding sequences retrieved from the TriFLDB database. Information from good-quality heterogeneous sources, including annotations for model plant species Arabidopsis thaliana (L.) Heynh. and Oryza sativa L., was gathered and linked to T. aestivum sequences through BLAST-based homology searches. Even though the complexity of the transcriptome cannot yet be fully appreciated, we developed a tool to easily and promptly obtain information from multiple functional annotation systems (Gene Ontology, MapMan bin codes, MIPS Functional Categories, PlantCyc pathway reactions and TAIR gene families). The use of dbWFA is illustrated here with several query examples. We were able to assign a putative function to 45% of the UniGenes and 81% of the full-length coding sequences from TriFLDB. Moreover, comparison of the annotation of the whole T. aestivum UniGene set along with curated annotations of the two model species assessed the accuracy of the annotation provided by dbWFA. To further illustrate the use of dbWFA, genes specifically expressed during the early cell division or late storage polymer accumulation phases of T. aestivum grain development were identified using a clustering analysis and then annotated using dbWFA. The annotation of these two sets of genes was consistent with previous analyses of T. aestivum grain transcriptomes and proteomes. Database URL: urgi.versailles.inra.fr/dbWFA/ PMID:23660284
Genome-Wide Analysis of Mycoplasma bovirhinis GS01 Reveals Potential Virulence Factors and Phylogenetic Relationships.

PubMed

Chen, Shengli; Hao, Huafang; Zhao, Ping; Liu, Yongsheng; Chu, Yuefeng

2018-05-04

Mycoplasma bovirhinis is a significant etiological agent in bovine pneumonia and mastitis, but our knowledge about the genetic and pathogenic mechanisms of M. bovirhinis is very limited. In this study, we sequenced the complete genome of M. bovirhinis strain GS01 isolated from the nasal swab of pneumonic calves in Gansu, China, and we found that its genome forms an 847,985 bp single circular chromosome with a GC content of 27.57% and with 707 protein-coding genes. The putative virulence determinants of M. bovirhinis were then analyzed. Results showed that three genomic islands and 16 putative virulence genes, including one adhesion gene enolase, seven surface lipoproteins, proteins involved in glycerol metabolism, and cation transporters, might be potential virulence factors. Glycerol and pyruvate metabolic pathways were defective. Comparative analysis revealed remarkable genome variations between GS01 and a recently reported HAZ141_2 strain, and extremely low homology with other Mycoplasma species. Phylogenetic analysis demonstrated that M. bovirhinis was genetically closest to M. canis and distant from other bovine Mycoplasma species. Genomic dissection may provide useful information on the pathogenic mechanisms and genetics of M. bovirhinis. Copyright © 2018 Chen et al.
Transcriptome Sequencing, and Rapid Development and Application of SNP Markers for the Legume Pod Borer Maruca vitrata (Lepidoptera: Crambidae)

PubMed Central

Margam, Venu M.; Coates, Brad S.; Bayles, Darrell O.; Hellmich, Richard L.; Agunbiade, Tolulope; Seufferheld, Manfredo J.; Sun, Weilin; Kroemer, Jeremy A.; Ba, Malick N.; Binso-Dabire, Clementine L.; Baoua, Ibrahim; Ishiyaku, Mohammad F.; Covas, Fernando G.; Srinivasan, Ramasamy; Armstrong, Joel; Murdock, Larry L.; Pittendrigh, Barry R.

2011-01-01

The legume pod borer, Maruca vitrata (Lepidoptera: Crambidae), is an insect pest species of crops grown by subsistence farmers in tropical regions of Africa. We present the de novo assembly of 3729 contigs from 454- and Sanger-derived sequencing reads for midgut, salivary, and whole adult tissues of this non-model species. Functional annotation predicted that 1320 M. vitrata protein coding genes are present, of which 631 have orthologs within the Bombyx mori gene model. A homology-based analysis assigned M. vitrata genes into a group of paralogs, but these were subsequently partitioned into putative orthologs following phylogenetic analyses. Following sequence quality filtering, a total of 1542 putative single nucleotide polymorphisms (SNPs) were predicted within M. vitrata contig assemblies. Seventy one of 1078 designed molecular genetic markers were used to screen M. vitrata samples from five collection sites in West Africa. Population substructure may be present with significant implications in the insect resistance management recommendations pertaining to the release of biological control agents or transgenic cowpea that express Bacillus thuringiensis crystal toxins. Mutation data derived from transcriptome sequencing is an expeditious and economical source for genetic markers that allow evaluation of ecological differentiation. PMID:21754987
Identification of SNPs associated with muscle yield and quality traits using allelic-imbalance analyses of pooled RNA-Seq samples in rainbow trout.

PubMed

Al-Tobasei, Rafet; Ali, Ali; Leeds, Timothy D; Liu, Sixin; Palti, Yniv; Kenney, Brett; Salem, Mohamed

2017-08-07

Coding/functional SNPs change the biological function of a gene and, therefore, could serve as "large-effect" genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body weight, muscle yield, muscle fat content, shear force, and whiteness. Phenotypic data were collected for approximately 500 fish, representing 98 families (5 fish/family), from a growth-selected line, and the muscle transcriptome was sequenced from 22 families with divergent phenotypes (4 low- versus 4 high-ranked families per trait). GATK detected 59,112 putative SNPs; of these SNPs, 4798 showed allelic imbalances (>2.0 as an amplification and <0.5 as loss of heterozygosity). SAMtools detected 87,066 putative SNPs; and of them, 4962 had allelic imbalances between the low- and high-ranked families. Only 1829 SNPs with allelic imbalances were common between the two datasets, indicating significant differences in algorithms. The two datasets contained 7930 non-redundant SNPs of which 4439 mapped to 1498 protein-coding genes (with 6.4% non-synonymous SNPs) and 684 mapped to 295 lncRNAs. Validation of a subset of 92 SNPs revealed 1) 86.7-93.8% success rate in calling polymorphic SNPs and 2) 95.4% consistent matching between DNA and cDNA genotypes indicating a high rate of identifying SNPs with allelic imbalances. In addition, 4.64% SNPs revealed random monoallelic expression. Genome distribution of the SNPs with allelic imbalances exhibited high density for all five traits in several chromosomes, especially chromosome 9, 20 and 28. Most of the SNP-harboring genes were assigned to important growth-related metabolic pathways. These results demonstrate utility of RNA-Seq in assessing phenotype-associated allelic imbalances in pooled RNA-Seq samples. 
The SNPs identified in this study were included in a new SNP-Chip design (available from Affymetrix) for genomic and genetic analyses in rainbow trout.
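The allelic-imbalance thresholds quoted above (>2.0 as amplification, <0.5 as loss of heterozygosity) can be illustrated with a short sketch. The abstract does not define how the ratio is computed, so the definition used here, the alternate-allele read fraction in the high-ranked pool divided by that in the low-ranked pool, is an assumption, as are the function names and the "balanced" label.

```python
def allele_fraction_ratio(alt_high: int, depth_high: int,
                          alt_low: int, depth_low: int) -> float:
    """Alternate-allele read fraction in the high pool over that in the low pool.

    This ratio definition is an assumption made for illustration.
    """
    if depth_high <= 0 or depth_low <= 0 or alt_low <= 0:
        raise ValueError("read depths and the low-pool allele count must be positive")
    return (alt_high / depth_high) / (alt_low / depth_low)


def classify_imbalance(ratio: float, high: float = 2.0, low: float = 0.5) -> str:
    """Apply the two cutoffs quoted in the abstract to a single SNP."""
    if ratio > high:
        return "amplification"
    if ratio < low:
        return "loss_of_heterozygosity"
    return "balanced"


# Toy read counts (hypothetical): 30/100 alt reads in high pool, 10/100 in low.
call = classify_imbalance(allele_fraction_ratio(30, 100, 10, 100))
```

SNPs called this way by two different pipelines can then be intersected by position and allele to find the calls common to both.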
Characteristics of functional enrichment and gene expression level of human putative transcriptional target genes.

PubMed

Osato, Naoki

2018-01-19

Transcriptional target genes show functional enrichment of genes. However, how many and how significantly transcriptional target genes include functional enrichments are still unclear. To address these issues, I predicted human transcriptional target genes using open chromatin regions, ChIP-seq data and DNA binding sequences of transcription factors in databases, and examined functional enrichment and gene expression level of putative transcriptional target genes. Gene Ontology annotations showed four times larger numbers of functional enrichments in putative transcriptional target genes than gene expression information alone, independent of transcriptional target genes. To compare the number of functional enrichments of putative transcriptional target genes between cells or search conditions, I normalized the number of functional enrichment by calculating its ratios in the total number of transcriptional target genes. With this analysis, native putative transcriptional target genes showed the largest normalized number of functional enrichments, compared with target genes including 5-60% of randomly selected genes. The normalized number of functional enrichments was changed according to the criteria of enhancer-promoter interactions such as distance from transcriptional start sites and orientation of CTCF-binding sites. Forward-reverse orientation of CTCF-binding sites showed significantly higher normalized number of functional enrichments than the other orientations. Journal papers showed that the top five frequent functional enrichments were related to the cellular functions in the three cell types. The median expression level of transcriptional target genes changed according to the criteria of enhancer-promoter assignments (i.e. interactions) and was correlated with the changes of the normalized number of functional enrichments of transcriptional target genes. Human putative transcriptional target genes showed significant functional enrichments. 
Functional enrichments were related to the cellular functions. The normalized number of functional enrichments of human putative transcriptional target genes changed according to the criteria of enhancer-promoter assignments and correlated with the median expression level of the target genes. These analyses and characters of human putative transcriptional target genes would be useful to examine the criteria of enhancer-promoter assignments and to predict the novel mechanisms and factors such as DNA binding proteins and DNA sequences of enhancer-promoter interactions.
Horizontal gene acquisitions contributed to genome expansion in insect-symbiotic Spiroplasma clarkii.

PubMed

Tsai, Yi-Ming; Chang, An; Kuo, Chih-Horng

2018-06-01

Genome reduction is a recurring theme of symbiont evolution. The genus Spiroplasma contains species that are mostly facultative insect symbionts. The typical genome sizes of those species within the Apis clade were estimated to be ∼1.0-1.4 Mb. Intriguingly, Spiroplasma clarkii was found to have a genome size that is > 30% larger than the median of other species within the same clade. To investigate the molecular evolution events that led to the genome expansion of this bacterium, we determined its complete genome sequence and inferred the evolutionary origin of each protein-coding gene based on the phylogenetic distribution of homologs. Among the 1,346 annotated protein-coding genes, 641 were originated from within the Apis clade while 233 were putatively acquired from outside of the clade (including 91 high-confidence candidates). Additionally, 472 were specific to S. clarkii without homologs in the current database (i.e., the origins remained unknown). The acquisition of protein-coding genes, rather than mobile genetic elements, appeared to be a major contributing factor of genome expansion. Notably, >50% of the high-confidence acquired genes are related to carbohydrate transport and metabolism, suggesting that these acquired genes contributed to the expansion of both genome size and metabolic capability. The findings of this work provided an interesting case against the general evolutionary trend observed among symbiotic bacteria and further demonstrated the flexibility of Spiroplasma genomes. For future studies, investigation on the functional integration of these acquired genes, as well as the inference of their contribution to fitness could improve our knowledge of symbiont evolution.
Unraveling the molecular mechanisms of nitrogenase conformational protection against oxygen in diazotrophic bacteria.

PubMed

Lery, Letícia M S; Bitar, Mainá; Costa, Mauricio G S; Rössle, Shaila C S; Bisch, Paulo M

2010-12-22

G. diazotrophicus and A. vinelandii are aerobic nitrogen-fixing bacteria. Although oxygen is essential for the survival of these organisms, it irreversibly inhibits nitrogenase, the complex responsible for nitrogen fixation. Both microorganisms deal with this paradox through compensatory mechanisms. In A. vinelandii a conformational protection mechanism occurs through the interaction between the nitrogenase complex and the FeSII protein. Previous studies suggested the existence of a similar system in G. diazotrophicus, but the putative protein involved had not yet been described. This study aims to identify the protein-coding gene in the recently sequenced genome of G. diazotrophicus and also to provide detailed structural information on nitrogenase conformational protection in both organisms. Genomic analysis of G. diazotrophicus sequences revealed a protein-coding ORF (Gdia0615) enclosing a conserved "fer2" domain, typical of the ferredoxin family and found in A. vinelandii FeSII. Comparative models of both FeSII and Gdia0615 disclosed a conserved beta-grasp fold. Cysteine residues that coordinate the 2[Fe-S] cluster are in conserved positions towards the metallocluster. Analysis of solvent-accessible residues and electrostatic surfaces unveiled a hydrophobic dimerization interface. Dimers assembled by molecular docking remained stable, with a proper accommodation of regions possibly involved in binding of FeSII to nitrogenase, throughout molecular dynamics simulations in aqueous solution. Molecular modeling of the nitrogenase complex of G. diazotrophicus was performed and models were compared to the crystal structure of A. vinelandii nitrogenase. Docking experiments of FeSII and Gdia0615 with their corresponding nitrogenase complexes pointed out in both systems a putative binding site presenting shape and charge complementarities at the Fe-protein/MoFe-protein complex interface. The identification of the putative FeSII coding gene in the G. diazotrophicus genome represents a large step towards the understanding of the conformational protection mechanism of nitrogenase against oxygen. In addition, this is the first study regarding the structural complementarities of FeSII-nitrogenase interactions in diazotrophic bacteria. The combination of bioinformatic tools for genome analysis, comparative protein modeling, docking calculations and molecular dynamics provided a powerful strategy for the elucidation of molecular mechanisms and structural features of FeSII-nitrogenase interaction.
High-quality-draft genome sequence of the fermenting bacterium Anaerobium acetethylicum type strain GluBS11T (DSM 29698)

DOE PAGES

Patil, Yogita; Müller, Nicolai; Schink, Bernhard; ...

2017-02-20

Anaerobium acetethylicum strain GluBS11T belongs to the family Lachnospiraceae within the order Clostridiales. It is a Gram-positive, non-motile and strictly anaerobic bacterium isolated from biogas slurry that was originally enriched with gluconate as carbon source (Patil, et al., Int J Syst Evol Microbiol 65:3289-3296, 2015). Here we describe the draft genome sequence of strain GluBS11T and provide a detailed insight into its physiological and metabolic features. The draft genome sequence generated 4,609,043 bp, distributed among 105 scaffolds assembled using the SPAdes genome assembler method. It comprises in total 4,132 genes, of which 4,008 were predicted to be protein-coding genes, 124 RNA genes and 867 pseudogenes. The G+C content was 43.51 mol %. The annotated genome of strain GluBS11T contains putative genes coding for the pentose phosphate pathway, the Embden-Meyerhoff-Parnas pathway, the Entner-Doudoroff pathway and the tricarboxylic acid cycle. The genome revealed the presence of most of the necessary genes required for the fermentation of glucose and gluconate to acetate, ethanol, and hydrogen gas. However, a candidate gene for production of formate was not identified.
Multiple copies of a bile acid-inducible gene in Eubacterium sp. strain VPI 12708.

PubMed Central

Gopal-Srivastava, R; Mallonee, D H; White, W B; Hylemon, P B

1990-01-01

Eubacterium sp. strain VPI 12708 is an anaerobic intestinal bacterium which possesses inducible bile acid 7-dehydroxylation activity. Several new polypeptides are produced in this strain following induction with cholic acid. Genes coding for two copies of a bile acid-inducible 27,000-dalton polypeptide (baiA1 and baiA2) have been previously cloned and sequenced. We now report on a gene coding for a third copy of this 27,000-dalton polypeptide (baiA3). The baiA3 gene has been cloned in lambda DASH on an 11.2-kilobase DNA fragment from a partial Sau3A digest of the Eubacterium DNA. DNA sequence analysis of the baiA3 gene revealed 100% homology with the baiA1 gene within the coding region of the 27,000-dalton polypeptides. The baiA2 gene shares 81% sequence identity with the other two genes at the nucleotide level. The flanking nucleotide sequences associated with the baiA1 and baiA3 genes are identical for 930 bases in the 5' direction from the initiation codon and for at least 325 bases in the 3' direction from the stop codon, including the putative promoter regions for the genes. An additional open reading frame (occupying from 621 to 648 bases, depending on the correct start codon) was found in the identical 5' regions associated with the baiA1 and baiA3 clones. The 5' sequence 930 bases upstream from the baiA1 and baiA3 genes was totally divergent. The baiA2 gene, which is part of a large bile acid-inducible operon, showed no homology with the other two genes either in the 5' or 3' direction from the polypeptide coding region, except for a 15-base-pair presumed ribosome-binding site in the 5' region. These studies strongly suggest that a gene duplication (baiA1 and baiA3) has occurred and is stably maintained in this bacterium. PMID:2376563
Putative and unique gene sequence utilization for the design of species specific probes as modeled by Lactobacillus plantarum

USDA-ARS's Scientific Manuscript database

The concept of utilizing putative and unique gene sequences for the design of species specific probes was tested. The abundance profile of assigned functions within the Lactobacillus plantarum genome was used for the identification of the putative and unique gene sequence, csh. The targeted gene (cs...
Transcriptome Analysis of Leaves, Flowers and Fruits Perisperm of Coffea arabica L. Reveals the Differential Expression of Genes Involved in Raffinose Biosynthesis

PubMed Central

dos Santos, Tiago Benedito; de Oliveira, Fernanda Freitas; Pot, David; Leroy, Thierry; Vieira, Luiz Gonzaga Esteves; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães

2017-01-01

Coffea arabica L. is an important crop in several developing countries. Despite its economic importance, minimal transcriptome data are available for fruit tissues, especially during fruit development where several compounds related to coffee quality are produced. To understand the molecular aspects related to coffee fruit and grain development, we report a large-scale transcriptome analysis of leaf, flower and perisperm fruit tissue development. Illumina sequencing yielded 41,881,572 high-quality filtered reads. De novo assembly generated 65,364 unigenes with an average length of 1,264 bp. A total of 24,548 unigenes were annotated as protein coding genes, including 12,560 full-length sequences. In the annotation process, we identified nine candidate genes related to the biosynthesis of raffinose family oligosaccharides (RFOs). These sugars confer osmoprotection and are accumulated during initial fruit development. Four genes from this pathway had their transcriptional pattern validated by quantitative reverse transcription polymerase chain reaction (qRT-PCR). Furthermore, we identified ~24,000 putative target sites for microRNAs (miRNAs) and 134 putative transcriptionally active transposable element (TE) sequences in our dataset. This C. arabica transcriptomic atlas provides an important step for identifying candidate genes related to several coffee metabolic pathways, especially those related to fruit chemical composition and therefore beverage quality. Our results are the starting point for enhancing our knowledge about the coffee genes that are transcribed during the flowering and initial fruit development stages. PMID:28068432
Rise of Microbial Culturomics: Noncontiguous Finished Genome Sequence and Description of Beduini massiliensis gen. nov., sp. nov.

PubMed Central

Mourembou, Gaël; Yasir, Muhammad; Azhar, Esam Ibraheem; Lagier, Jean Christophe; Bibi, Fehmida; Jiman-Fatani, Asif Ahmad; Helmy, Nayel; Robert, Catherine; Rathored, Jaishriram; Fournier, Pierre-Edouard; Raoult, Didier

2015-01-01

Microbial culturomics is a new field of omics sciences that examines the bacterial diversity of the human gut coupled with a taxono-genomic strategy. Using microbial culturomics, we report here for the first time a novel Gram-negative, catalase- and oxidase-negative, strictly anaerobic bacillus named Beduini massiliensis gen. nov., sp. nov. strain GM1 (= CSUR P1440 = DSM 100188), isolated from the stools of a female nomadic Bedouin from Saudi Arabia. With a length of 2,850,586 bp, the Beduini massiliensis genome exhibits a G + C content of 35.9%, and contains 2819 genes (2744 protein-coding and 75 RNA genes including 57 tRNA and 18 rRNA genes). It is composed of 6 scaffolds (composed of 6 contigs). A total of 1859 genes (67.75%) were assigned a putative function (by COGs or by NR blast). At least 1457 (53%) orthologous proteins were not shared with the closest phylogenetic species. 274 genes (10.0%) were identified as ORFans. These results show that microbial culturomics can dramatically improve the characterization of the human microbiota repertoire, deciphering new bacterial species and new genes. Further studies will clarify the geographic specificity and the putative role of these new microbes and their related functional genetic content in health and disease. Microbial culturomics is an emerging frontier of omics systems sciences and integrative biology and thus warrants further consideration as part of the postgenomics methodology toolbox. PMID:26669711
Identification of Putative Olfactory Genes from the Oriental Fruit Moth Grapholita molesta via an Antennal Transcriptome Analysis

PubMed Central

Li, Yiping; Wu, Junxiang

2015-01-01

Background: The oriental fruit moth, Grapholita molesta, is an extremely important oligophagous pest species of stone and pome fruits throughout the world. As a host-switching species, adult moths, especially females, depend on olfactory cues to a large extent in locating host plants, finding mates, and selecting oviposition sites. The identification of olfactory genes can facilitate investigation of the mechanisms of chemical communication. Methodology/Principal Finding: We generated a transcriptome of female antennae of G. molesta using next-generation sequencing, and assembled transcripts from RNA-seq reads using the Trinity, SOAPdenovo-trans and Abyss-trans assemblers. We identified 124 putative olfactory genes. Among the identified olfactory genes, 118 were novel to this species, including 28 transcripts encoding odorant binding proteins, 17 chemosensory proteins, 48 odorant receptors, four gustatory receptors, 24 ionotropic receptors, two sensory neuron membrane proteins, and one odor degrading enzyme. The identified genes were further confirmed through semi-quantitative reverse transcription PCR for transcripts coding for 26 OBPs and 17 CSPs. OBP transcripts showed an obvious antenna bias, whereas CSP transcripts were detected in different tissues. Conclusion: Antennal transcriptome data derived from the oriental fruit moth constitute an abundant molecular resource for the identification of genes potentially involved in the olfaction process of the species. This study provides a foundation for future research on the molecules involved in olfactory recognition of this insect pest, and in particular, the feasibility of using semiochemicals to control this pest. PMID:26540284
Complex organisation and structure of the ghrelin antisense strand gene GHRLOS, a candidate non-coding RNA gene

PubMed Central

Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K

2008-01-01

Background The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. Results We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. Conclusion GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. 
The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis. PMID:18954468
Complex organisation and structure of the ghrelin antisense strand gene GHRLOS, a candidate non-coding RNA gene.

PubMed

Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K

2008-10-28

The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. 
The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis.
Identification and qualification of 500 nuclear, single-copy, orthologous genes for the Eupulmonata (Gastropoda) using transcriptome sequencing and exon capture.

PubMed

Teasdale, Luisa C; Köhler, Frank; Murray, Kevin D; O'Hara, Tim; Moussalli, Adnan

2016-09-01

The qualification of orthology is a significant challenge when developing large, multiloci phylogenetic data sets from assembled transcripts. Transcriptome assemblies have various attributes, such as fragmentation, frameshifts and mis-indexing, which pose problems to automated methods of orthology assessment. Here, we identify a set of orthologous single-copy genes from transcriptome assemblies for the land snails and slugs (Eupulmonata) using a thorough approach to orthology determination involving manual alignment curation, gene tree assessment and sequencing from genomic DNA. We qualified the orthology of 500 nuclear, protein-coding genes from the transcriptome assemblies of 21 eupulmonate species to produce the most complete phylogenetic data matrix for a major molluscan lineage to date, both in terms of taxon and character completeness. Exon capture targeting 490 of the 500 genes (those with at least one exon >120 bp) from 22 species of Australian Camaenidae successfully captured sequences of 2825 exons (representing all targeted genes), with only a 3.7% reduction in the data matrix due to the presence of putative paralogs or pseudogenes. The automated pipeline Agalma retrieved the majority of the manually qualified 500 single-copy gene set and identified a further 375 putative single-copy genes, although it failed to account for fragmented transcripts resulting in lower data matrix completeness when considering the original 500 genes. This could potentially explain the minor inconsistencies we observed in the supported topologies for the 21 eupulmonate species between the manually curated and 'Agalma-equivalent' data set (sharing 458 genes). Overall, our study confirms the utility of the 500 gene set to resolve phylogenetic relationships at a range of evolutionary depths and highlights the importance of addressing fragmentation at the homolog alignment stage for probe design. © 2016 John Wiley & Sons Ltd.
Gapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite gene clusters.

PubMed

Dallery, Jean-Félix; Lapalu, Nicolas; Zampounis, Antonios; Pigné, Sandrine; Luyten, Isabelle; Amselem, Joëlle; Wittenberg, Alexander H J; Zhou, Shiguo; de Queiroz, Marisa V; Robin, Guillaume P; Auger, Annie; Hainaut, Matthieu; Henrissat, Bernard; Kim, Ki-Tae; Lee, Yong-Hwan; Lespinet, Olivier; Schwartz, David C; Thon, Michael R; O'Connell, Richard J

2017-08-29

The ascomycete fungus Colletotrichum higginsianum causes anthracnose disease of brassica crops and the model plant Arabidopsis thaliana. Previous versions of the genome sequence were highly fragmented, causing errors in the prediction of protein-coding genes and preventing the analysis of repetitive sequences and genome architecture. Here, we re-sequenced the genome using single-molecule real-time (SMRT) sequencing technology and, in combination with optical map data, this provided a gapless assembly of all twelve chromosomes except for the ribosomal DNA repeat cluster on chromosome 7. The more accurate gene annotation made possible by this new assembly revealed a large repertoire of secondary metabolism (SM) key genes (89) and putative biosynthetic pathways (77 SM gene clusters). The two mini-chromosomes differed from the ten core chromosomes in being repeat- and AT-rich and gene-poor but were significantly enriched with genes encoding putative secreted effector proteins. Transposable elements (TEs) were found to occupy 7% of the genome by length. Certain TE families showed a statistically significant association with effector genes and SM cluster genes and were transcriptionally active at particular stages of fungal development. All 24 subtelomeres were found to contain one of three highly-conserved repeat elements which, by providing sites for homologous recombination, were probably instrumental in four segmental duplications. The gapless genome of C. higginsianum provides access to repeat-rich regions that were previously poorly assembled, notably the mini-chromosomes and subtelomeres, and allowed prediction of the complete SM gene repertoire. It also provides insights into the potential role of TEs in gene and genome evolution and host adaptation in this asexual pathogen.
Molecular cloning of the potato Gro1-4 gene conferring resistance to pathotype Ro1 of the root cyst nematode Globodera rostochiensis, based on a candidate gene approach.

PubMed

Paal, Jürgen; Henselewski, Heike; Muth, Jost; Meksem, Khalid; Menéndez, Cristina M; Salamini, Francesco; Ballvora, Agim; Gebhardt, Christiane

2004-04-01

The endoparasitic root cyst nematode Globodera rostochiensis causes considerable damage in potato cultivation. In the past, major genes for nematode resistance have been introgressed from related potato species into cultivars. Elucidating the molecular basis of resistance will contribute to the understanding of nematode-plant interactions and assist in breeding nematode-resistant cultivars. The Gro1 resistance locus to G. rostochiensis on potato chromosome VII co-localized with a resistance-gene-like (RGL) DNA marker. This marker was used to isolate from genomic libraries 15 members of a closely related candidate gene family. Analysis of inheritance, linkage mapping, and sequencing reduced the number of candidate genes to three. Complementation analysis by stable potato transformation showed that the gene Gro1-4 conferred resistance to G. rostochiensis pathotype Ro1. Gro1-4 encodes a protein of 1136 amino acids that contains Toll-interleukin 1 receptor (TIR), nucleotide-binding (NB), leucine-rich repeat (LRR) homology domains and a C-terminal domain with unknown function. The deduced Gro1-4 protein differed by 29 amino acid changes from susceptible members of the Gro1 gene family. Sequence characterization of 13 members of the Gro1 gene family revealed putative regulatory elements and a variable microsatellite in the promoter region, insertion of a retrotransposon-like element in the first intron, and a stop codon in the NB coding region of some genes. Sequence analysis of RT-PCR products showed that Gro1-4 is expressed, among other members of the family including putative pseudogenes, in non-infected roots of nematode-resistant plants. RT-PCR also demonstrated that members of the Gro1 gene family are expressed in most potato tissues.
Non-coding RNAs—Novel targets in neurotoxicity

PubMed Central

Tal, Tamara L.; Tanguay, Robert L.

2012-01-01

Over the past ten years non-coding RNAs (ncRNAs) have emerged as pivotal players in fundamental physiological and cellular processes and have been increasingly implicated in cancer, immune disorders, and cardiovascular, neurodegenerative, and metabolic diseases. MicroRNAs (miRNAs) represent a class of ncRNA molecules that function as negative regulators of post-transcriptional gene expression. miRNAs are predicted to regulate 60% of all human protein-coding genes and as such, play key roles in cellular and developmental processes, human health, and disease. Relative to counterparts that lack binding sites for miRNAs, genes encoding proteins that are post-transcriptionally regulated by miRNAs are twice as likely to be sensitive to environmental chemical exposure. Not surprisingly, miRNAs have been recognized as targets or effectors of nervous system, developmental, hepatic, and carcinogenic toxicants, and have been identified as putative regulators of phase I xenobiotic-metabolizing enzymes. In this review, we give an overview of the types of ncRNAs and highlight their roles in neurodevelopment, neurological disease, activity-dependent signaling, and drug metabolism. We then delve into specific examples that illustrate their importance as mediators, effectors, or adaptive agents of neurotoxicants or neuroactive pharmaceutical compounds. Finally, we identify a number of outstanding questions regarding ncRNAs and neurotoxicity. PMID:22394481
Chromosomal localization and partial genomic structure of the human peroxisome proliferator activated receptor-gamma (hPPAR gamma) gene.

PubMed

Beamer, B A; Negri, C; Yen, C J; Gavrilova, O; Rumberger, J M; Durcan, M J; Yarnall, D P; Hawkins, A L; Griffin, C A; Burns, D K; Roth, J; Reitman, M; Shuldiner, A R

1997-04-28

We determined the chromosomal localization and partial genomic structure of the coding region of the human PPAR gamma gene (hPPAR gamma), a nuclear receptor important for adipocyte differentiation and function. Sequence analysis and long PCR of human genomic DNA with primers that span putative introns revealed that intron positions and sizes of hPPAR gamma are similar to those previously determined for the mouse PPAR gamma gene[13]. Fluorescent in situ hybridization localized hPPAR gamma to chromosome 3, band 3p25. Radiation hybrid mapping with two independent primer pairs was consistent with hPPAR gamma being within 1.5 Mb of marker D3S1263 on 3p25-p24.2. These sequences of the intron/exon junctions of the 6 coding exons shared by hPPAR gamma 1 and hPPAR gamma 2 will facilitate screening for possible mutations. Furthermore, D3S1263 is a suitable polymorphic marker for linkage analysis to evaluate PPAR gamma's potential contribution to genetic susceptibility to obesity, lipoatrophy, insulin resistance, and diabetes.
Transcriptional regulation of the operon encoding stress-responsive ECF sigma factor SigH and its anti-sigma factor RshA, and control of its regulatory network in Corynebacterium glutamicum

PubMed Central

2012-01-01

Background The expression of genes in Corynebacterium glutamicum, a Gram-positive non-pathogenic bacterium used mainly for the industrial production of amino acids, is regulated by seven different sigma factors of RNA polymerase, including the stress-responsive ECF-sigma factor SigH. The sigH gene is located in a gene cluster together with the rshA gene, putatively encoding an anti-sigma factor. The aim of this study was to analyze the transcriptional regulation of the sigH and rshA gene cluster and the effects of RshA on the SigH regulon, in order to refine the model describing the role of SigH and RshA during stress response. Results Transcription analyses revealed that the sigH gene and rshA gene are cotranscribed from four sigH housekeeping promoters in C. glutamicum. In addition, a SigH-controlled rshA promoter was found to only drive the transcription of the rshA gene. To test the role of the putative anti-sigma factor gene rshA under normal growth conditions, a C. glutamicum rshA deletion strain was constructed and used for genome-wide transcription profiling with DNA microarrays. In total, 83 genes organized in 61 putative transcriptional units, including those previously detected using sigH mutant strains, exhibited increased transcript levels in the rshA deletion mutant compared to its parental strain. The genes encoding proteins related to disulphide stress response, heat stress proteins, components of the SOS-response to DNA damage and proteasome components were the most markedly upregulated gene groups. Altogether six SigH-dependent promoters upstream of the identified genes were determined by primer extension and a refined consensus promoter consisting of 45 original promoter sequences was constructed. Conclusions The rshA gene codes for an anti-sigma factor controlling the function of the stress-responsive sigma factor SigH in C. glutamicum. 
Transcription of rshA from a SigH-dependent promoter may serve to quickly shut down the SigH-dependent stress response after the cells have overcome the stress condition. Here we propose a model of the regulation of oxidative and heat stress response including redox homeostasis by SigH, RshA and the thioredoxin system. PMID:22943411
Transcriptional regulation of the operon encoding stress-responsive ECF sigma factor SigH and its anti-sigma factor RshA, and control of its regulatory network in Corynebacterium glutamicum.

PubMed

Busche, Tobias; Silar, Radoslav; Pičmanová, Martina; Pátek, Miroslav; Kalinowski, Jörn

2012-09-03

The expression of genes in Corynebacterium glutamicum, a Gram-positive non-pathogenic bacterium used mainly for the industrial production of amino acids, is regulated by seven different sigma factors of RNA polymerase, including the stress-responsive ECF-sigma factor SigH. The sigH gene is located in a gene cluster together with the rshA gene, putatively encoding an anti-sigma factor. The aim of this study was to analyze the transcriptional regulation of the sigH and rshA gene cluster and the effects of RshA on the SigH regulon, in order to refine the model describing the role of SigH and RshA during stress response. Transcription analyses revealed that the sigH gene and rshA gene are cotranscribed from four sigH housekeeping promoters in C. glutamicum. In addition, a SigH-controlled rshA promoter was found to only drive the transcription of the rshA gene. To test the role of the putative anti-sigma factor gene rshA under normal growth conditions, a C. glutamicum rshA deletion strain was constructed and used for genome-wide transcription profiling with DNA microarrays. In total, 83 genes organized in 61 putative transcriptional units, including those previously detected using sigH mutant strains, exhibited increased transcript levels in the rshA deletion mutant compared to its parental strain. The genes encoding proteins related to disulphide stress response, heat stress proteins, components of the SOS-response to DNA damage and proteasome components were the most markedly upregulated gene groups. Altogether six SigH-dependent promoters upstream of the identified genes were determined by primer extension and a refined consensus promoter consisting of 45 original promoter sequences was constructed. The rshA gene codes for an anti-sigma factor controlling the function of the stress-responsive sigma factor SigH in C. glutamicum. 
Transcription of rshA from a SigH-dependent promoter may serve to quickly shut down the SigH-dependent stress response after the cells have overcome the stress condition. Here we propose a model of the regulation of oxidative and heat stress response including redox homeostasis by SigH, RshA and the thioredoxin system.
Rapid Quantification of Mutant Fitness in Diverse Bacteria by Sequencing Randomly Bar-Coded Transposons

PubMed Central

Wetmore, Kelly M.; Price, Morgan N.; Waters, Robert J.; Lamson, Jacob S.; He, Jennifer; Hoover, Cindi A.; Blow, Matthew J.; Bristow, James; Butland, Gareth

2015-01-01

ABSTRACT Transposon mutagenesis with next-generation sequencing (TnSeq) is a powerful approach to annotate gene function in bacteria, but existing protocols for TnSeq require laborious preparation of every sample before sequencing. Thus, the existing protocols are not amenable to the throughput necessary to identify phenotypes and functions for the majority of genes in diverse bacteria. Here, we present a method, random bar code transposon-site sequencing (RB-TnSeq), which increases the throughput of mutant fitness profiling by incorporating random DNA bar codes into Tn5 and mariner transposons and by using bar code sequencing (BarSeq) to assay mutant fitness. RB-TnSeq can be used with any transposon, and TnSeq is performed once per organism instead of once per sample. Each BarSeq assay requires only a simple PCR, and 48 to 96 samples can be sequenced on one lane of an Illumina HiSeq system. We demonstrate the reproducibility and biological significance of RB-TnSeq with Escherichia coli, Phaeobacter inhibens, Pseudomonas stutzeri, Shewanella amazonensis, and Shewanella oneidensis. To demonstrate the increased throughput of RB-TnSeq, we performed 387 successful genome-wide mutant fitness assays representing 130 different bacterium-carbon source combinations and identified 5,196 genes with significant phenotypes across the five bacteria. In P. inhibens, we used our mutant fitness data to identify genes important for the utilization of diverse carbon substrates, including a putative d-mannose isomerase that is required for mannitol catabolism. RB-TnSeq will enable the cost-effective functional annotation of diverse bacteria using mutant fitness profiling. PMID:25968644
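The abstract does not describe how mutant fitness is computed from bar code counts. As a simplified illustration only, the sketch below assumes fitness is summarized as a log2 ratio of a bar code's count after growth to its count before growth, averaged over the insertions in a gene. The function names, the pseudocount, and the plain averaging are assumptions and are not taken from the RB-TnSeq publication.

```python
import math


def strain_fitness(count_after, count_before, pseudocount=1.0):
    """log2 ratio of one bar code's abundance after vs. before growth.

    The pseudocount (an assumption) avoids division by zero for bar codes
    that drop out of the sample entirely.
    """
    return math.log2((count_after + pseudocount) / (count_before + pseudocount))


def gene_fitness(barcode_counts):
    """Unweighted mean strain fitness over the insertions in one gene.

    barcode_counts: list of (count_before, count_after) pairs, one per
    bar-coded insertion mapped to the gene.
    """
    values = [strain_fitness(after, before) for before, after in barcode_counts]
    return sum(values) / len(values)


# Hypothetical counts: both insertions in this gene lose half their abundance.
print(gene_fitness([(7, 3), (15, 7)]))
```

A negative value indicates that mutants in the gene became less abundant during growth under the tested condition; a real analysis would also normalize for sequencing depth across samples.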
Genome-wide identification and characterization of the SBP-box gene family in Petunia.

PubMed

Zhou, Qin; Zhang, Sisi; Chen, Feng; Liu, Baojun; Wu, Lan; Li, Fei; Zhang, Jiaqi; Bao, Manzhu; Liu, Guofeng

2018-03-12

SQUAMOSA PROMOTER BINDING PROTEIN (SBP)-box genes encode a family of plant-specific transcription factors (TFs) that play important roles in many growth and development processes including phase transition, leaf initiation, shoot and inflorescence branching, fruit development and ripening etc. The SBP-box gene family has been identified and characterized in many species, but has not been well studied in Petunia, an important ornamental genus. We identified 21 putative SPL genes of Petunia axillaris and P. inflata from the reference genome of P. axillaris N and P. inflata S6, respectively, which were supported by the transcriptome data. For further confirmation, all the 21 genes were also cloned from P. hybrida line W115 (Mitchell diploid). Phylogenetic analysis based on the highly conserved SBP domains arranged PhSPLs in eight groups, analogous to those from Arabidopsis and tomato. Furthermore, the Petunia SPL genes had similar exon-intron structure and the deduced proteins contained very similar conserved motifs within the same subgroup. Out of 21 PhSPL genes, fourteen were predicted to be potential targets of PhmiR156/157, and the putative miR156/157 response elements (MREs) were located in the coding region of group IV, V, VII and VIII genes, but in the 3'-UTR regions of group VI genes. SPL genes were also identified from another two wild Petunia species, P. integrifolia and P. exserta, based on their transcriptome databases to investigate the origin of PhSPLs. Phylogenetic analysis and multiple alignments of the coding sequences of PhSPLs and their orthologs from wild species indicated that PhSPLs originated mainly from P. axillaris. qRT-PCR analysis demonstrated differential spatiotemporal expression patterns of PhSPL genes in petunia and many were expressed predominantly in the axillary buds and/or inflorescences. 
In addition, overexpression of PhSPL9a and PhSPL9b in Arabidopsis suggested that these genes play a conserved role in promoting the vegetative-to-reproductive phase transition. Petunia genome contains at least 21 SPL genes, and most of the genes are expressed in different tissues. The PhSPL genes may play conserved and diverse roles in plant growth and development, including flowering regulation, leaf initiation, axillary bud and inflorescence development. This work provides a comprehensive understanding of the SBP-box gene family in Petunia and lays a significant foundation for future studies on the function and evolution of SPL genes in petunia.
Fine mapping of RYMV3: a new resistance gene to Rice yellow mottle virus from Oryza glaberrima.

PubMed

Pidon, Hélène; Ghesquière, Alain; Chéron, Sophie; Issaka, Souley; Hébrard, Eugénie; Sabot, François; Kolade, Olufisayo; Silué, Drissa; Albar, Laurence

2017-04-01

A new resistance gene against Rice yellow mottle virus was identified and mapped in a 15-kb interval. The best candidate is a CC-NBS-LRR gene. Rice yellow mottle virus (RYMV) disease is a serious constraint to the cultivation of rice in Africa and selection for resistance is considered to be the most effective management strategy. The aim of this study was to characterize the resistance of Tog5307, a highly resistant accession belonging to the African cultivated rice species (Oryza glaberrima), that has none of the previously identified resistance genes to RYMV. The specificity of Tog5307 resistance was analyzed using 18 RYMV isolates. While three of them were able to infect Tog5307 very rapidly, resistance against the others was effective despite infection events attributed to resistance-breakdown or incomplete penetrance of the resistance. Segregation of resistance in an interspecific backcross population derived from a cross between Tog5307 and the susceptible Oryza sativa variety IR64 showed that resistance is dominant and is controlled by a single gene, named RYMV3. RYMV3 was mapped in an approximately 15-kb interval in which two candidate genes, coding for a putative transmembrane protein and a CC-NBS-LRR domain-containing protein, were annotated. Sequencing revealed non-synonymous polymorphisms between Tog5307 and the O. glaberrima susceptible accession CG14 in both candidate genes. An additional resistant O. glaberrima accession, Tog5672, was found to have the Tog5307 genotype for the CC-NBS-LRR gene but not for the putative transmembrane protein gene. Analysis of the cosegregation of Tog5672 resistance with the RYMV3 locus suggests that RYMV3 is also involved in Tog5672 resistance, thereby supporting the CC-NBS-LRR gene as the best candidate for RYMV3.
A comparison of complete mitochondrial genomes of silver carp Hypophthalmichthys molitrix and bighead carp Hypophthalmichthys nobilis: Implications for their taxonomic relationship and phylogeny

USGS Publications Warehouse

Li, S.-F.; Xu, J.-W.; Yang, Q.-L.; Wang, C.H.; Chen, Q.; Chapman, D.C.; Lu, G.

2009-01-01

Based upon morphological characters, Silver carp Hypophthalmichthys molitrix and bighead carp Hypophthalmichthys nobilis (or Aristichthys nobilis) have been classified into either the same genus or two distinct genera. Consequently, the taxonomic relationship of the two species at the generic level remains equivocal. This issue is addressed by sequencing complete mitochondrial genomes of H. molitrix and H. nobilis, comparing their mitogenome organization, structure and sequence similarity, and conducting a comprehensive phylogenetic analysis of cyprinid species. As with other cyprinid fishes, the mitogenomes of the two species were structurally conserved, containing 37 genes including 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA (tRNAs) genes and a putative control region (D-loop). Sequence similarity between the two mitogenomes varied in different genes or regions, being highest in the tRNA genes (98.8%), lowest in the control region (89.4%) and intermediate in the protein-coding genes (94.2%). Analyses of the sequence comparison and phylogeny using concatenated protein sequences support the view that the two species belong to the genus Hypophthalmichthys. Further studies using nuclear markers and involving more closely related species, and the systematic combination of traditional biology and molecular biology are needed in order to confirm this conclusion. © 2009 The Fisheries Society of the British Isles.
Isolation and Characterization of EstC, a New Cold-Active Esterase from Streptomyces coelicolor A3(2)

PubMed Central

Brault, Guillaume; Shareck, François; Hurtubise, Yves; Lépine, François; Doucet, Nicolas

2012-01-01

The genome sequence of Streptomyces coelicolor A3(2) contains more than 50 genes coding for putative lipolytic enzymes. Many studies have shown the capacity of this actinomycete to store important reserves of intracellular triacylglycerols in nutrient depletion situations. In the present study, we used genome mining of S. coelicolor to identify genes coding for putative, non-secreted esterases/lipases. Two genes were cloned and successfully overexpressed in E. coli as His-tagged fusion proteins. One of the recombinant enzymes, EstC, showed interesting cold-active esterase activity with a strong potential for the production of valuable esters. The purified enzyme displayed optimal activity at 35°C and was cold-active with retention of 25% relative activity at 10°C. Its optimal pH was 8.5–9 but the enzyme kept more than 75% of its maximal activity between pH 7.5 and 10. EstC also showed remarkable tolerance over a wide range of pH values, retaining almost full residual activity between pH 6–11. The enzyme was active toward short-chain p-nitrophenyl esters (C2–C12), displaying optimal activity with the valerate (C5) ester (kcat/Km = 737 ± 77 s⁻¹ mM⁻¹). The enzyme was also very active toward short chain triglycerides such as triacetin (C2:0) and tributyrin (C4:0), in addition to showing good primary alcohol and organic solvent tolerance, suggesting it could function as an interesting candidate for organic synthesis of short-chain esters such as flavors. PMID:22396747

Draft genome sequence of Trametes villosa (Sw.) Kreisel CCMB561, a tropical white-rot Basidiomycota from the semiarid region of Brazil.

PubMed

Ferreira, Dalila Souza Santos; Kato, Rodrigo Bentes; Miranda, Fábio Malcher; da Costa Pinheiro, Kenny; Fonseca, Paula Luize Camargos; Tomé, Luiz Marcelo Ribeiro; Vaz, Aline Bruna Martins; Badotti, Fernanda; Ramos, Rommel Thiago Jucá; Brenig, Bertram; Azevedo, Vasco Ariston de Carvalho; Benevides, Raquel Guimarães; Góes-Neto, Aristóteles

2018-06-01

Herein, we present the draft genome of Trametes villosa isolate CCMB561, a wood-decaying Basidiomycota commonly found in tropical semiarid climate. The genome assembly was 57.98 Mb in size with an L50 of 691. A total of 16,711 putative protein-encoding genes was predicted, including 590 genes coding for carbohydrate-active enzymes (CAZy), directly involved in the decomposition of lignocellulosic materials. This is the first genome of this species of high interest in bioenergy research. The draft genome of Trametes villosa isolate CCMB561 will provide an important resource for future investigations in biofuel production, bioremediation and other green technologies.
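The Trametes villosa abstract reports an L50 of 691 for a 57.98 Mb assembly. For readers unfamiliar with the statistic, the sketch below shows the usual definition: sort contigs from longest to shortest, accumulate their lengths until half of the assembly size is reached, and report the number of contigs used (L50) together with the length of the last one added (N50). The contig lengths in the example are hypothetical and are not taken from the T. villosa assembly.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of contig or scaffold lengths.

    N50 is the length of the contig at which the running total, taken from
    longest to shortest, first reaches half of the assembly size. L50 is the
    number of contigs needed to get there.
    """
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            return length, count
    raise ValueError("empty assembly")


# Hypothetical contig lengths in bp.
example = [100, 80, 60, 40, 20]
n50, l50 = n50_l50(example)
print(n50, l50)
```

A smaller L50 for the same assembly size means the sequence is concentrated in fewer, longer contigs.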
A direct molecular link between the autism candidate gene RORa and the schizophrenia candidate MIR137

NASA Astrophysics Data System (ADS)

Devanna, Paolo; Vernes, Sonja C.

2014-02-01

Retinoic acid-related orphan receptor alpha gene (RORa) and the microRNA MIR137 have both recently been identified as novel candidate genes for neuropsychiatric disorders. RORa encodes a ligand-dependent orphan nuclear receptor that acts as a transcriptional regulator and miR-137 is a brain enriched small non-coding RNA that interacts with gene transcripts to control protein levels. Given the mounting evidence for RORa in autism spectrum disorders (ASD) and MIR137 in schizophrenia and ASD, we investigated if there was a functional biological relationship between these two genes. Herein, we demonstrate that miR-137 targets the 3'UTR of RORa in a site specific manner. We also provide further support for MIR137 as an autism candidate by showing that a large number of previously implicated autism genes are also putatively targeted by miR-137. This work supports the role of MIR137 as an ASD candidate and demonstrates a direct biological link between these previously unrelated autism candidate genes.
Structure and evolution of the mitochondrial genome of Exorista sorbillans: the Tachinidae (Diptera: Calyptratae) perspective.

PubMed

Shao, Yuan-jun; Hu, Xian-qiong; Peng, Guang-da; Wang, Rui-xian; Gao, Rui-na; Lin, Chao; Shen, Wei-de; Li, Rui; Li, Bing

2012-12-01

The first complete mitochondrial genome (mitogenome) of the tachinid Exorista sorbillans (Diptera) was sequenced using a PCR-based approach. The circular mitogenome is 14,960 bp long and has the gene organization and order typical of dipteran mitochondrial (mt) genomes. All protein-coding sequences are initiated with an ATN codon except the Cox I gene, which has a 4-bp ATCG putative start codon. Ten of the thirteen protein-coding genes have a complete termination codon (TAA); the rest are located on the H strand and have incomplete codons. The mitogenome of E. sorbillans is biased toward A+T, with an A+T content of 78.4%. The strand-specific bias is reflected in the third codon positions of the mt genes, and their T/C ratios, used as a strand indicator, are higher on the H strand than on the L strand in each of the seven dipteran flies compared. The A+T-rich region of E. sorbillans is 106 bp long and includes three tandem copies of a 13-bp fragment. Compared with Haematobia irritans, E. sorbillans is more distantly related to Drosophila. Phylogenetic topologies based on the amino acid sequences support the clustering of E. sorbillans (Tachinidae) with members of the Calliphoridae and Oestridae, and indicate that the superfamily Oestroidea is polyphyletic, grouping with Muscidae in one clade.
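The A+T content and third-codon-position statistics mentioned in this record can be computed along the following lines. This is a minimal sketch rather than the authors' pipeline: the function names `at_content` and `third_positions` and the toy sequences are hypothetical, and ambiguous bases are simply ignored.

```python
def at_content(seq):
    """Percentage of A and T among the unambiguous bases (A, C, G, T)."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    at = sum(1 for b in bases if b in "AT")
    return 100.0 * at / len(bases)


def third_positions(cds):
    """Return the bases at the third position of each complete codon."""
    usable = len(cds) - len(cds) % 3  # drop a trailing partial codon
    return cds[2:usable:3]


# Toy sequences (hypothetical, not from the E. sorbillans mitogenome).
print(at_content("ATATGCAT"))        # 6 of 8 bases are A or T
print(third_positions("ATGGCATTA"))  # third bases of ATG, GCA, TTA
```

Applying `at_content` to the output of `third_positions` for each protein-coding gene gives a per-gene third-position A+T value, which is one way to examine the codon-position bias described above.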
Structure-based activity prediction of CYP21A2 stability variants: A survey of available gene variations.

PubMed

Bruque, Carlos D; Delea, Marisol; Fernández, Cecilia S; Orza, Juan V; Taboas, Melisa; Buzzalino, Noemí; Espeche, Lucía D; Solari, Andrea; Luccerini, Verónica; Alba, Liliana; Nadra, Alejandro D; Dain, Liliana

2016-12-14

Congenital adrenal hyperplasia due to 21-hydroxylase deficiency accounts for 90-95% of CAH cases. In this work we performed an extensive survey of mutations and SNPs modifying the coding sequence of the CYP21A2 gene. Using bioinformatic tools and two plausible CYP21A2 structures as templates, we initially classified all known mutants (n = 343) according to their putative functional impacts, which were either reported in the literature or inferred from structural models. We then performed a detailed analysis on the subset of mutations believed to exclusively impact protein stability. For those mutants, the predicted stability was calculated and correlated with the variant's expected activity. A high concordance was obtained when comparing our predictions with available in vitro residual activities and/or the patient's phenotype. The predicted stability and derived activity of all reported mutations and SNPs lacking functional assays (n = 108) were assessed. As expected, most of the SNPs (52/76) showed no biological implications. Moreover, this approach was applied to evaluate the putative synergy that could emerge when two mutations occurred in cis. In addition, we propose a putative pathogenic effect of five novel mutations, p.L107Q, p.L122R, p.R132H, p.P335L and p.H466fs, found in 21-hydroxylase deficient patients of our cohort.
Structure-based activity prediction of CYP21A2 stability variants: A survey of available gene variations

PubMed Central

Bruque, Carlos D.; Delea, Marisol; Fernández, Cecilia S.; Orza, Juan V.; Taboas, Melisa; Buzzalino, Noemí; Espeche, Lucía D.; Solari, Andrea; Luccerini, Verónica; Alba, Liliana; Nadra, Alejandro D.; Dain, Liliana

2016-01-01

Congenital adrenal hyperplasia due to 21-hydroxylase deficiency accounts for 90–95% of CAH cases. In this work we performed an extensive survey of mutations and SNPs modifying the coding sequence of the CYP21A2 gene. Using bioinformatic tools and two plausible CYP21A2 structures as templates, we initially classified all known mutants (n = 343) according to their putative functional impacts, which were either reported in the literature or inferred from structural models. We then performed a detailed analysis on the subset of mutations believed to exclusively impact protein stability. For those mutants, the predicted stability was calculated and correlated with the variant’s expected activity. A high concordance was obtained when comparing our predictions with available in vitro residual activities and/or the patient’s phenotype. The predicted stability and derived activity of all reported mutations and SNPs lacking functional assays (n = 108) were assessed. As expected, most of the SNPs (52/76) showed no biological implications. Moreover, this approach was applied to evaluate the putative synergy that could emerge when two mutations occurred in cis. In addition, we propose a putative pathogenic effect of five novel mutations, p.L107Q, p.L122R, p.R132H, p.P335L and p.H466fs, found in 21-hydroxylase deficient patients of our cohort. PMID:27966633
Long non-coding RNA discovery across the genus Anopheles reveals conserved secondary structures within and beyond the Gambiae complex.

PubMed

Jenkins, Adam M; Waterhouse, Robert M; Muskavitch, Marc A T

2015-04-23

Long non-coding RNAs (lncRNAs) have been defined as mRNA-like transcripts longer than 200 nucleotides that lack significant protein-coding potential, and many of them constitute scaffolds for ribonucleoprotein complexes with critical roles in epigenetic regulation. Various lncRNAs have been implicated in the modulation of chromatin structure, transcriptional and post-transcriptional gene regulation, and regulation of genomic stability in mammals, Caenorhabditis elegans, and Drosophila melanogaster. The purpose of this study is to identify the lncRNA landscape in the malaria vector An. gambiae and assess the evolutionary conservation of lncRNAs and their secondary structures across the Anopheles genus. Using deep RNA sequencing of multiple Anopheles gambiae life stages, we have identified 2,949 lncRNAs and more than 300 previously unannotated putative protein-coding genes. The lncRNAs exhibit differential expression profiles across life stages and adult genders. We find that across the genus Anopheles, lncRNAs display much lower sequence conservation than protein-coding genes. Additionally, we find that lncRNA secondary structure is highly conserved within the Gambiae complex, but diverges rapidly across the rest of the genus Anopheles. This study offers one of the first lncRNA secondary structure analyses in vector insects. Our description of lncRNAs in An. gambiae offers the most comprehensive genome-wide insights to date into lncRNAs in this vector mosquito, and defines a set of potential targets for the development of vector-based interventions that may further curb the human malaria burden in disease-endemic countries.
Proliferation of group II introns in the chloroplast genome of the green alga Oedocladium carolinianum (Chlorophyceae).

PubMed

Brouard, Jean-Simon; Turmel, Monique; Otis, Christian; Lemieux, Claude

2016-01-01

The chloroplast genome sustained extensive changes in architecture during the evolution of the Chlorophyceae, a morphologically and ecologically diverse class of green algae belonging to the Chlorophyta; however, the forces driving these changes are poorly understood. The five orders recognized in the Chlorophyceae form two major clades: the CS clade consisting of the Chlamydomonadales and Sphaeropleales, and the OCC clade consisting of the Oedogoniales, Chaetophorales, and Chaetopeltidales. In the OCC clade, considerable variations in chloroplast DNA (cpDNA) structure, size, gene order, and intron content have been observed. The large inverted repeat (IR), an ancestral feature characteristic of most green plants, is present in Oedogonium cardiacum (Oedogoniales) but is lacking in the examined members of the Chaetophorales and Chaetopeltidales. Remarkably, the Oedogonium 35.5-kb IR houses genes that were putatively acquired through horizontal DNA transfer. To better understand the dynamics of chloroplast genome evolution in the Oedogoniales, we analyzed the cpDNA of a second representative of this order, Oedocladium carolinianum. The Oedocladium cpDNA was sequenced and annotated. The evolutionary distances separating Oedocladium and Oedogonium cpDNAs and two other pairs of chlorophycean cpDNAs were estimated using a 61-gene data set. Phylogenetic analysis of an alignment of group IIA introns from members of the OCC clade was performed. Secondary structures and insertion sites of oedogonialean group IIA introns were analyzed. The 204,438-bp Oedocladium genome is 7.9 kb larger than the Oedogonium genome, but its repertoire of conserved genes is remarkably similar and gene order differs by only one reversal. Although the 23.7-kb IR is missing the putative foreign genes found in Oedogonium, it contains sequences coding for a putative phage or bacterial DNA primase and a hypothetical protein. Intergenic sequences are 1.5-fold longer and dispersed repeats are more abundant, but a smaller fraction of the Oedocladium genome is occupied by introns. Six additional group II introns are present, five of which lack ORFs and carry highly similar sequences to that of the ORF-less IIA intron shared with Oedogonium. Secondary structure analysis of the group IIA introns disclosed marked differences in the exon-binding sites; however, each intron showed perfect or nearly perfect base pairing interactions with its target site. Our results suggest that chloroplast genes rearrange more slowly in the Oedogoniales than in the Chaetophorales and raise questions as to what was the nature of the foreign coding sequences in the IR of the common ancestor of the Oedogoniales. They provide the first evidence for intragenomic proliferation of group IIA introns in the Viridiplantae, revealing that intron spread in the Oedocladium lineage likely occurred by retrohoming after sequence divergence of the exon-binding sites.
A two-locus global DNA barcode for land plants: the coding rbcL gene complements the non-coding trnH-psbA spacer region.

PubMed

Kress, W John; Erickson, David L

2007-06-06

A useful DNA barcode requires sufficient sequence variation to distinguish between species and ease of application across a broad range of taxa. Discovery of a DNA barcode for land plants has been limited by intrinsically lower rates of sequence evolution in plant genomes than that observed in animals. This low rate has complicated the trade-off in finding a locus that is universal and readily sequenced and has sufficiently high sequence divergence at the species-level. Here, a global plant DNA barcode system is evaluated by comparing universal application and degree of sequence divergence for nine putative barcode loci, including coding and non-coding regions, singly and in pairs across a phylogenetically diverse set of 48 genera (two species per genus). No single locus could discriminate among species in a pair in more than 79% of genera, whereas discrimination increased to nearly 88% when the non-coding trnH-psbA spacer was paired with one of three coding loci, including rbcL. In silico trials were conducted in which DNA sequences from GenBank were used to further evaluate the discriminatory power of a subset of these loci. These trials supported the earlier observation that trnH-psbA coupled with rbcL can correctly identify and discriminate among related species. A combination of the non-coding trnH-psbA spacer region and a portion of the coding rbcL gene is recommended as a two-locus global land plant barcode that provides the necessary universality and species discrimination.
Whole genome annotation and comparative genomic analyses of bio-control fungus Purpureocillium lilacinum.

PubMed

Prasad, Pushplata; Varshney, Deepti; Adholeya, Alok

2015-11-25

The fungus Purpureocillium lilacinum is widely known as a biological control agent against plant parasitic nematodes. This research article presents the genomic annotation of the first draft of the whole genome sequence of P. lilacinum. The study aims to decipher the putative genetic components of the fungus involved in nematode pathogenesis by performing comparative genomic analysis with nine closely related fungal species in the Hypocreales. De novo genome assembly was performed, and a total of 301 scaffolds were constructed for P. lilacinum genomic DNA. By employing structural genome prediction models, 13,266 genes coding for proteins were predicted in the genome. Approximately 73% of the predicted genes were functionally annotated using Blastp, InterProScan and Gene Ontology. A 14.7% fraction of the predicted genes shared significant homology with genes in the Pathogen Host Interactions (PHI) database. The phylogenomic analysis carried out using the maximum likelihood RAxML algorithm provided insight into the evolutionary relationship of P. lilacinum. In congruence with other closely related species in the Hypocreales, namely Metarhizium spp., Pochonia chlamydosporia, Cordyceps militaris, Trichoderma reesei and Fusarium spp., P. lilacinum has large gene sets coding for G-protein coupled receptors (GPCRs), proteases, glycoside hydrolases and carbohydrate esterases that are required for degradation of nematode-egg shell components. Screening of the genome by the Antibiotics & Secondary Metabolite Analysis Shell (AntiSMASH) pipeline indicated that the genome potentially codes for a variety of secondary metabolites, possibly required for adaptation to the heterogeneous lifestyles reported for P. lilacinum. Significant up-regulation of subtilisin-like serine protease genes in the presence of nematode eggs in quantitative real-time analyses suggested a potential role of serine proteases in nematode pathogenesis. The data offer a better understanding of the Purpureocillium lilacinum genome and will enhance our understanding of the molecular mechanisms involved in nematophagy.
A Cluster of Cuticle Protein Genes of Drosophila melanogaster at 65A: Sequence, Structure and Evolution

PubMed Central

Charles, J. P.; Chihara, C.; Nejad, S.; Riddiford, L. M.

1997-01-01

A 36-kb genomic DNA segment of the Drosophila melanogaster genome containing 12 clustered cuticle genes has been mapped and partially sequenced. The cluster maps at 65A 5-6 on the left arm of the third chromosome, in agreement with the previously determined location of a putative cluster encompassing the genes for the third instar larval cuticle proteins LCP5, LCP6 and LCP8. This cluster is the largest cuticle gene cluster discovered to date and shows a number of surprising features that explain in part the genetic complexity of the LCP5, LCP6 and LCP8 loci. The genes encoding LCP5 and LCP8 are multiple copy genes and the presence of extensive similarity in their coding regions gives the first evidence for gene conversion in cuticle genes. In addition, five genes in the cluster are intronless. Four of these five have arisen by retroposition. The other genes in the cluster have a single intron located at an unusual location for insect cuticle genes. PMID:9383064
A long natural-antisense RNA is accumulated in the conidia of Aspergillus oryzae.

PubMed

Tsujii, Masaru; Okuda, Satoshi; Ishi, Kazutomo; Madokoro, Kana; Takeuchi, Michio; Yamagata, Youhei

2016-01-01

Analysis of expressed sequence tag libraries from various culture conditions revealed the existence of conidia-specific transcripts that assembled to the putative conidiation-specific reductase gene (csrA) in Aspergillus oryzae. However, all of these transcripts were transcribed in the direction opposite to that of csrA. Sequence analysis revealed that the transcript overlapped the csrA mRNA at its 3' end and did not encode a protein longer than 60 amino acid residues. We designated the transcript Conidia Specific Long Natural-antisense RNA (CSLNR). Real-time PCR analysis demonstrated that CSLNR is a conidia-specific transcript that cannot be transcribed in the absence of brlA, and that CSLNR was much more abundant than the csrA transcript in conidia. Furthermore, deletion of csrA, which also removed the region coding for CSLNR, reduced the number of conidia in A. oryzae. Overexpression of CsrA inhibited growth and conidiation, whereas CSLNR did not affect conidiation.
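The criterion that a transcript encodes no protein longer than 60 amino acid residues can be checked with a simple open reading frame (ORF) scan. The sketch below is an illustration under stated assumptions, not the method used in the study: it scans only the three forward frames, requires an ATG start and an in-frame stop codon from the standard genetic code, and counts residues from the start codon up to (but not including) the stop. The names `longest_orf_aa` and `MAX_PEPTIDE_AA` are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumed)
MAX_PEPTIDE_AA = 60                  # threshold quoted in the abstract


def longest_orf_aa(seq):
    """Length in residues of the longest ATG-to-stop ORF in the forward frames."""
    seq = seq.upper().replace("U", "T")
    best = 0
    for frame in range(3):
        length = None  # None means we are not currently inside an ORF
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if length is None:
                if codon == "ATG":
                    length = 1  # the start codon contributes one residue
            elif codon in STOP_CODONS:
                best = max(best, length)
                length = None
            else:
                length += 1
    return best


# Toy transcript (hypothetical): ATG AAA TTT TAA encodes a 3-residue peptide.
toy = "ATGAAATTTTAA"
print(longest_orf_aa(toy), longest_orf_aa(toy) <= MAX_PEPTIDE_AA)
```

A fuller check would also scan the reverse complement and might tolerate ORFs that run off the end of a partial transcript; both are left out here to keep the sketch short.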
Transcriptional Profiles of Mating-Responsive Genes from Testes and Male Accessory Glands of the Mediterranean Fruit Fly, Ceratitis capitata

PubMed Central

Scolari, Francesca; Gomulski, Ludvik M.; Ribeiro, José M. C.; Siciliano, Paolo; Meraldi, Alice; Falchetto, Marco; Bonomi, Angelica; Manni, Mosè; Gabrieli, Paolo; Malovini, Alberto; Bellazzi, Riccardo; Aksoy, Serap; Gasperi, Giuliano; Malacrida, Anna R.

2012-01-01

Background: Insect seminal fluid is a complex mixture of proteins, carbohydrates and lipids, produced in the male reproductive tract. This seminal fluid is transferred together with the spermatozoa during mating and induces post-mating changes in the female. Molecular characterization of seminal fluid proteins in the Mediterranean fruit fly, Ceratitis capitata, is limited, although studies suggest that some of these proteins are biologically active. Methodology/Principal Findings: We report on the functional annotation of 5914 high quality expressed sequence tags (ESTs) from the testes and male accessory glands, to identify transcripts encoding putative secreted peptides that might elicit post-mating responses in females. The ESTs were assembled into 3344 contigs, of which over 33% produced no hits against the nr database, and thus may represent novel or rapidly evolving sequences. Extraction of the coding sequences resulted in a total of 3371 putative peptides. The annotated dataset is available as a hyperlinked spreadsheet. Four hundred peptides were identified with putative secretory activity, including odorant binding proteins, protease inhibitor domain-containing peptides, antigen 5 proteins, mucins, and immunity-related sequences. Quantitative RT-PCR-based analyses of a subset of putative secretory protein-encoding transcripts from accessory glands indicated changes in their abundance after one or more copulations when compared to virgin males of the same age. These changes in abundance, particularly evident after the third mating, may be related to the requirement to replenish proteins to be transferred to the female. Conclusions/Significance: We have developed the first large-scale dataset for novel studies on functions and processes associated with the reproductive biology of Ceratitis capitata. The identified genes may help study genome evolution, in light of the high adaptive potential of the medfly. In addition, studies of male recovery dynamics in terms of accessory gland gene expression profiles and correlated remating inhibition mechanisms may permit the improvement of pest management approaches. PMID:23071645
Are plant formins integral membrane proteins?

PubMed

Cvrcková, F

2000-01-01

The formin family of proteins has been implicated in signaling pathways of cellular morphogenesis in both animals and fungi; in the latter case, at least, they participate in communication between the actin cytoskeleton and the cell surface. Nevertheless, they appear to be cytoplasmic or nuclear proteins, and it is not clear whether they communicate with the plasma membrane, and if so, how. Because nothing is known about formin function in plants, I performed a systematic search for putative Arabidopsis thaliana formin homologs. I found eight putative formin-coding genes in the publicly available part of the Arabidopsis genome sequence and analyzed their predicted protein sequences. Surprisingly, some of them lack parts of the conserved formin-homology 2 (FH2) domain, and the majority of them seem to have signal sequences and putative transmembrane segments that are not found in yeast or animal formins. Plant formins define a distinct subfamily. The presence in most Arabidopsis formins of sequence motifs typical of transmembrane proteins suggests a mechanism of membrane attachment that may be specific to plant formins, and indicates an unexpected evolutionary flexibility of the conserved formin domain.
A 2,4-dichlorophenoxyacetic acid degradation plasmid pM7012 discloses distribution of an unclassified megaplasmid group across bacterial species.

PubMed

Sakai, Yoriko; Ogawa, Naoto; Shimomura, Yumi; Fujii, Takeshi

2014-03-01

Analysis of the complete nucleotide sequence of plasmid pM7012 from 2,4-dichlorophenoxyacetic-acid (2,4-D)-degrading bacterium Burkholderia sp. M701 revealed that the plasmid had 582 142 bp, with 541 putative protein-coding sequences and 39 putative tRNA genes for the transport of the standard 20 aa. pM7012 contains sequences homologous to the regions involved in conjugal transfer and plasmid maintenance found in plasmids byi_2p from Burkholderia sp. YI23 and pBVIE01 from Burkholderia sp. G4. No relaxase gene was found in any of these plasmids, although genes for a type IV secretion system and type IV coupling proteins were identified. Plasmids with no relaxase gene have been classified as non-mobile plasmids. However, nucleotide sequences with a high level of similarity to the genes for plasmid transfer, plasmid maintenance, 2,4-D degradation and arsenic resistance contained on pM7012 were also detected in eight other megaplasmids (~600 or 900 kb) found in seven Burkholderia strains and a strain of Cupriavidus, which were isolated as 2,4-D-degrading bacteria in Japan and the United States. These results suggested that the 2,4-D degradation megaplasmids related to pM7012 are mobile and distributed across various bacterial species worldwide, and that the plasmid group could be distinguished from known mobile plasmid groups.
The ANKK1 kinase gene and psychiatric disorders.

PubMed

Ponce, Guillermo; Pérez-González, Rocío; Aragüés, María; Palomo, Tomás; Rodríguez-Jiménez, Roberto; Jiménez-Arriero, Miguel Angel; Hoenicka, Janet

2009-07-01

The TaqIA single nucleotide polymorphism (SNP, rs1800497), which is located in the gene that codes for the putative kinase ANKK1 (ANKK1) near the termination codon of the D2 dopamine receptor gene (DRD2; chromosome 11q22-q23), is the most studied genetic variation in a broad range of psychiatric disorders and personality traits. A large number of individual genetic association studies have found that the TaqIA SNP is linked to alcoholism and antisocial traits. In addition, it has also been related to other conditions such as schizophrenia, eating disorders, and some behavioral childhood disorders. The TaqIA A1 allele is mainly associated with addictions, antisocial disorders, eating disorders, and attention-deficit/hyperactivity disorders, while the A2 allele occurs more frequently in schizophrenic and obsessive-compulsive patients. Current data show that the TaqIA polymorphism may be a marker of both DRD2 and ANKK1 genetic variants. ANKK1 would belong to a family of kinases involved in signal transduction. This raises the question of whether signaling players intervene in the pathophysiology of psychiatric disorders. Basic research on the ANKK1 protein and its putative interaction with the D2 dopamine receptor could shed light on this issue.
Unique features of a global human ectoparasite identified through sequencing of the bed bug genome.

PubMed

Benoit, Joshua B; Adelman, Zach N; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C; Szuter, Elise M; Hagan, Richard W; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M; Nelson, David R; Rosendale, Andrew J; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R; Ioannidis, Panagiotis; Waterhouse, Robert M; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J Spencer; Gondhalekar, Ameya D; Scharf, Michael E; Peterson, Brittany F; Raje, Kapil R; Hottel, Benjamin A; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S T; Duncan, Elizabeth J; Murali, Shwetha C; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C; Muzny, Donna M; Wheeler, David; Panfilio, Kristen A; Vargas Jentzsch, Iris M; Vargo, Edward L; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T; Anderson, Michelle A E; Jones, Jeffery W; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D; Attardo, Geoffrey M; Robertson, Hugh M; Zdobnov, Evgeny M; Ribeiro, Jose M C; Gibbs, Richard A; Werren, John H; Palli, Subba R; Schal, Coby; Richards, Stephen

2016-02-02

The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host-symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human-bed bug and symbiont-bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite.
Unique features of a global human ectoparasite identified through sequencing of the bed bug genome

PubMed Central

Benoit, Joshua B.; Adelman, Zach N.; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C.; Szuter, Elise M.; Hagan, Richard W.; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M.; Nelson, David R.; Rosendale, Andrew J.; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M.; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R.; Ioannidis, Panagiotis; Waterhouse, Robert M.; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J. Spencer; Gondhalekar, Ameya D.; Scharf, Michael E.; Peterson, Brittany F.; Raje, Kapil R.; Hottel, Benjamin A.; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S. T.; Duncan, Elizabeth J.; Murali, Shwetha C.; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L.; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C.; Muzny, Donna M.; Wheeler, David; Panfilio, Kristen A.; Vargas Jentzsch, Iris M.; Vargo, Edward L.; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T.; Anderson, Michelle A. E.; Jones, Jeffery W.; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D.; Attardo, Geoffrey M.; Robertson, Hugh M.; Zdobnov, Evgeny M.; Ribeiro, Jose M. C.; Gibbs, Richard A.; Werren, John H.; Palli, Subba R.; Schal, Coby; Richards, Stephen

2016-01-01

The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host–symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human–bed bug and symbiont–bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite. PMID:26836814
Loss of heterozygosity at 7q22 and mutation analysis of the CDP gene in human epithelial ovarian tumors.

PubMed

Neville, P J; Thomas, N; Campbell, I G

2001-02-01

Many tumor types including that of the ovary show loss of heterozygosity (LOH) on chromosome arm 7q, which suggests the existence of at least one tumor suppressor gene (TSG) on this chromosome arm. We have studied the region surrounding the putative tumor suppressor gene CUTL1 at 7q22 in 127 epithelial ovarian tumors. LOH was found across 7q22 in 31% of malignant and 14% of benign ovarian tumors. In 16% of the tumors the LOH appeared to be centered on the CUTL1 gene. This gene has been implicated previously as a TSG in both uterine leiomyomas and breast carcinoma. However, mutation analysis of the CUTL1 gene in 47 tumors with 7q22 LOH failed to identify any somatic alterations in the coding regions. This finding suggests that CUTL1 may not be the target of the 7q22 LOH in ovarian cancers.
The electron transfer system of syntrophically grown Desulfovibrio vulgaris

DOE Office of Scientific and Technical Information (OSTI.GOV)

Walker, C.B.; He, Z.; Yang, Z.K.

2009-05-01

Interspecies hydrogen transfer between organisms producing and consuming hydrogen promotes the decomposition of organic matter in most anoxic environments. Although syntrophic couplings between hydrogen producers and consumers are a major feature of the carbon cycle, mechanisms for energy recovery at the extremely low free energies of reactions typical of these anaerobic communities have not been established. In this study, comparative transcriptional analysis of a model sulfate-reducing microbe, Desulfovibrio vulgaris Hildenborough, suggested the use of alternative electron transfer systems dependent upon growth modality. During syntrophic growth on lactate with a hydrogenotrophic methanogen, D. vulgaris up-regulated numerous genes involved in electron transfer and energy generation when compared with sulfate-limited monocultures. In particular, genes coding for the putative membrane-bound Coo hydrogenase, two periplasmic hydrogenases (Hyd and Hyn) and the well-characterized high-molecular weight cytochrome (Hmc) were among the most highly expressed and up-regulated. Additionally, a predicted operon coding for genes involved in lactate transport and oxidation exhibited up-regulation, further suggesting an alternative pathway for electrons derived from lactate oxidation during syntrophic growth. Mutations in a subset of genes coding for Coo, Hmc, Hyd and Hyn impaired or severely limited syntrophic growth but had little effect on growth via sulfate-respiration. These results demonstrate that syntrophic growth and sulfate-respiration use largely independent energy generation pathways and imply that understanding of microbial processes sustaining nutrient cycling must consider lifestyles not captured in pure culture.
The Electron Transfer System of Syntrophically Grown Desulfovibrio vulgaris

DOE Office of Scientific and Technical Information (OSTI.GOV)

PBD; ENIGMA; GTL

2009-06-22

Interspecies hydrogen transfer between organisms producing and consuming hydrogen promotes the decomposition of organic matter in most anoxic environments. Although syntrophic couplings between hydrogen producers and consumers are a major feature of the carbon cycle, mechanisms for energy recovery at the extremely low free energies of reactions typical of these anaerobic communities have not been established. In this study, comparative transcriptional analysis of a model sulfate-reducing microbe, Desulfovibrio vulgaris Hildenborough, suggested the use of alternative electron transfer systems dependent upon growth modality. During syntrophic growth on lactate with a hydrogenotrophic methanogen, D. vulgaris up-regulated numerous genes involved in electron transfer and energy generation when compared with sulfate-limited monocultures. In particular, genes coding for the putative membrane-bound Coo hydrogenase, two periplasmic hydrogenases (Hyd and Hyn) and the well-characterized high-molecular weight cytochrome (Hmc) were among the most highly expressed and up-regulated. Additionally, a predicted operon coding for genes involved in lactate transport and oxidation exhibited up-regulation, further suggesting an alternative pathway for electrons derived from lactate oxidation during syntrophic growth. Mutations in a subset of genes coding for Coo, Hmc, Hyd and Hyn impaired or severely limited syntrophic growth but had little effect on growth via sulfate-respiration. These results demonstrate that syntrophic growth and sulfate-respiration use largely independent energy generation pathways and imply that understanding of microbial processes sustaining nutrient cycling must consider lifestyles not captured in pure culture.
Identification and allelic dissection uncover roles of lncRNAs in secondary growth of Populus tomentosa.

PubMed

Zhou, Daling; Du, Qingzhang; Chen, Jinhui; Wang, Qingshi; Zhang, Deqiang

2017-10-01

Long non-coding RNAs (lncRNAs) function in various biological processes. However, their roles in secondary growth of plants remain poorly understood. Here, 15,691 lncRNAs were identified from vascular cambium, developing xylem, and mature xylem of Populus tomentosa with high and low biomass using RNA-seq, including 1,994 lncRNAs that were differentially expressed (DE) among the six libraries. 3,569 cis-regulated and 3,297 trans-regulated protein-coding genes were predicted as potential target genes (PTGs) of the DE lncRNAs to participate in biological regulation. Then, 476 and 28 lncRNAs were identified as putative targets and endogenous target mimics (eTMs) of Populus known microRNAs (miRNAs), respectively. Genome re-sequencing of 435 individuals from a natural population of P. tomentosa found 34,015 single nucleotide polymorphisms (SNPs) within 178 lncRNA loci and 522 PTGs. Single-SNP association analysis detected 2,993 associations with 10 growth and wood-property traits under additive and dominance models. Epistasis analysis identified 17,656 epistatic SNP pairs, providing evidence for potential regulatory interactions between lncRNAs and their PTGs. Furthermore, a reconstructed epistatic network, representing interactions of 8 lncRNAs and 15 PTGs, might enrich regulation roles of genes in the phenylpropanoid pathway. These findings may enhance our understanding of non-coding genes in plants. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Characterization and engineering of the biosynthesis gene cluster for antitumor macrolides PM100117 and PM100118 from a marine actinobacteria: generation of a novel improved derivative.

PubMed

Salcedo, Raúl García; Olano, Carlos; Gómez, Cristina; Fernández, Rogelio; Braña, Alfredo F; Méndez, Carmen; de la Calle, Fernando; Salas, José A

2016-02-22

PM100117 and PM100118 are glycosylated polyketides with remarkable antitumor activity, which derive from the marine symbiotic actinobacteria Streptomyces caniferus GUA-06-05-006A. Structurally, PM100117 and PM100118 are composed of a macrocyclic lactone, three deoxysugar units and a naphthoquinone (NQ) chromophore that shows a clear structural similarity to menaquinone. Whole-genome sequencing of S. caniferus GUA-06-05-006A has enabled the identification of PM100117 and PM100118 biosynthesis gene cluster, which has been characterized on the basis of bioinformatics and genetic engineering data. The product of four genes shows high identity to proteins involved in the biosynthesis of menaquinone via futalosine. Deletion of one of these genes led to a decay in PM100117 and PM100118 production, and to the accumulation of several derivatives lacking NQ. Likewise, five additional genes have been genetically characterized to be involved in the biosynthesis of this moiety. Moreover, the generation of a mutant in a gene coding for a putative cytochrome P450 has led to the production of PM100117 and PM100118 structural analogues showing an enhanced in vitro cytotoxic activity relative to the parental products. Although a number of compounds structurally related to PM100117 and PM100118 has been discovered, this is, to our knowledge, the first insight reported into their biosynthesis. The structural resemblance of the NQ moiety to menaquinone, and the presence in the cluster of four putative menaquinone biosynthetic genes, suggests a connection between the biosynthesis pathways of both compounds. The availability of the PM100117 and PM100118 biosynthetic gene cluster will surely pave a way to the combinatorial engineering of more derivatives.
Sugarcane genes differentially expressed in response to Puccinia melanocephala infection: identification and transcript profiling.

PubMed

Oloriz, María I; Gil, Víctor; Rojas, Luis; Portal, Orelvis; Izquierdo, Yovanny; Jiménez, Elio; Höfte, Monica

2012-05-01

Brown rust caused by the fungus Puccinia melanocephala is a major disease of sugarcane (Saccharum spp.). A sugarcane mutant, obtained by chemical mutagenesis of the susceptible variety B4362, showed a post-haustorial hypersensitive response (HR)-mediated resistance to the pathogen and was used to identify genes differentially expressed in response to P. melanocephala via suppression subtractive hybridization (SSH). Tester cDNA was derived from the brown rust-resistant mutant after inoculation with P. melanocephala, while driver cDNAs were obtained from the non-inoculated resistant mutant and the inoculated susceptible donor variety B4362. Database comparisons of the sequences of the SSH recombinant clones revealed that, of a subset of 89 non-redundant sequences, 88% had similarity to known functional genes, while 12% were of unknown function. Thirteen genes were selected for transcript profiling in the resistant mutant and the susceptible donor variety. Genes involved in glycolysis and C4 carbon fixation were up-regulated in both interactions probably due to disturbance of sugarcane carbon metabolism by the pathogen. Genes related with the nascent polypeptide associated complex, post-translational proteome modulation and autophagy were transcribed at higher levels in the compatible interaction. Up-regulation of a putative L-isoaspartyl O-methyltransferase S-adenosylmethionine gene in the compatible interaction may point to fungal manipulation of the cytoplasmatic methionine cycle. Genes coding for a putative no apical meristem protein, S-adenosylmethionine decarboxylase, non-specific lipid transfer protein, and GDP-L-galactose phosphorylase involved in ascorbic acid biosynthesis were up-regulated in the incompatible interaction at the onset of haustorium formation, and may contribute to the HR-mediated defense response in the rust-resistant mutant.
Genomic identification, characterization and differential expression analysis of SBP-box gene family in Brassica napus.

PubMed

Cheng, Hongtao; Hao, Mengyu; Wang, Wenxiang; Mei, Desheng; Tong, Chaobo; Wang, Hui; Liu, Jia; Fu, Li; Hu, Qiong

2016-09-08

SBP-box genes belong to one of the largest families of transcription factors. Though members of this family have been characterized to be important regulators of diverse biological processes, information of SBP-box genes in the third most important oilseed crop Brassica napus is largely undefined. In the present study, by whole genome bioinformatics analysis and transcriptional profiling, 58 putative members of SBP-box gene family in oilseed rape (Brassica napus L.) were identified and their expression pattern in different tissues as well as possible interaction with miRNAs were analyzed. In addition, B. napus lines with contrasting branch angle were used for investigating the involvement of SBP-box genes in plant architecture regulation. Detailed gene information, including genomic organization, structural feature, conserved domain and phylogenetic relationship of the genes were systematically characterized. By phylogenetic analysis, BnaSBP proteins were classified into eight distinct groups representing the clear orthologous relationships to their family members in Arabidopsis and rice. Expression analysis in twelve tissues including vegetative and reproductive organs showed different expression patterns among the SBP-box genes and a number of the genes exhibit tissue specific expression, indicating their diverse functions involved in the developmental process. Forty-four SBP-box genes were ascertained to contain the putative miR156 binding site, with 30 and 14 of the genes targeted by miR156 at the coding and 3'UTR region, respectively. Relative expression level of miR156 is varied across tissues. Different expression pattern of some BnaSBP genes and the negative correlation of transcription levels between miR156 and its target BnaSBP gene were observed in lines with different branch angle. Taken together, this study represents the first systematic analysis of the SBP-box gene family in Brassica napus. 
The data presented here provide a foundation for understanding the crucial roles of BnaSBP genes in plant development and other biological processes.
Avian sarcoma virus 17 carries the jun oncogene.

PubMed Central

Maki, Y; Bos, T J; Davis, C; Starbuck, M; Vogt, P K

1987-01-01

Biologically active molecular clones of avian sarcoma virus 17 (ASV 17) contain a replication-defective proviral genome of 3.5 kilobases (kb). The genome retains partial gag and env sequences, which flank a cell-derived putative oncogene of 0.93 kb, termed jun. The jun gene lacks preserved coding domains of tyrosine-specific protein kinases. It also shows no significant nucleic acid homology with other known oncogenes. The probable transformation-specific protein in ASV 17-transformed cells is a 55-kDa gag-jun fusion product. PMID:3033666
Cloning and characterization of an inulinase gene from the marine yeast Candida membranifaciens subsp. flavinogenie W14-3 and its expression in Saccharomyces sp. W0 for ethanol production.

PubMed

Zhang, Lin-Lin; Tan, Mei-Juan; Liu, Guang-Lei; Chi, Zhe; Wang, Guang-Yuan; Chi, Zhen-Ming

2015-04-01

The INU1 gene encoding an exo-inulinase from the marine-derived yeast Candida membranifaciens subsp. flavinogenie W14-3 was cloned and characterized. It had an open reading frame 1,536 bp long encoding an inulinase. Its coding region was not interrupted by any intron. The cloned gene encoded 512 amino acid residues of a protein with a putative signal peptide of 23 amino acids and a calculated molecular mass of 57.8 kDa. The protein sequence deduced from the inulinase gene contained the inulinase consensus sequences (WMNDPNGL), (RDP), ECP FS and Q. The protein also had six conserved putative N-glycosylation sites. The deduced inulinase from the yeast strain W14-3 was found to be closely related to that from Candida kutaonensis sp. nov. KRF1, Kluyveromyces marxianus, and Cryptococcus aureus G7a. The inulinase gene with its signal peptide encoding sequence was subcloned into the pMIRSC11 expression vector and expressed in Saccharomyces sp. W0. The recombinant yeast strain W14-3-INU-112 obtained could produce 16.8 U/ml of inulinase activity and 12.5 % (v/v) ethanol from 250 g/l of inulin within 168 h. The monosaccharides were detected after the hydrolysis of inulin with the crude inulinase (the yeast culture). All the results indicated that the cloned gene and the recombinant yeast strain W14-3-INU-112 had potential applications in biotechnology.
Horizontal gene transfer in Histophilus somni and its role in the evolution of pathogenic strain 2336, as determined by comparative genomic analyses

PubMed Central

2011-01-01

Background: Pneumonia and myocarditis are the most commonly reported diseases due to Histophilus somni, an opportunistic pathogen of the reproductive and respiratory tracts of cattle. Thus far only a few genes involved in metabolic and virulence functions have been identified and characterized in H. somni using traditional methods. Analyses of the genome sequences of several Pasteurellaceae species have provided insights into their biology and evolution. In view of the economic and ecological importance of H. somni, the genome sequence of pneumonia strain 2336 has been determined and compared to that of commensal strain 129Pt and other members of the Pasteurellaceae. Results: The chromosome of strain 2336 (2,263,857 bp) contained 1,980 protein coding genes, whereas the chromosome of strain 129Pt (2,007,700 bp) contained only 1,792 protein coding genes. Although the chromosomes of the two strains differ in size, their average GC content, gene density (total number of genes predicted on the chromosome), and percentage of sequence (number of genes) that encodes proteins were similar. The chromosomes of these strains also contained a number of discrete prophage regions and genomic islands. One of the genomic islands in strain 2336 contained genes putatively involved in copper, zinc, and tetracycline resistance. Using the genome sequence data and comparative analyses with other members of the Pasteurellaceae, several H. somni genes that may encode proteins involved in virulence (e.g., filamentous haemagglutinins, adhesins, and polysaccharide biosynthesis/modification enzymes) were identified. The two strains contained a total of 17 ORFs that encode putative glycosyltransferases and some of these ORFs had characteristic simple sequence repeats within them. 
Most of the genes/loci common to both the strains were located in different regions of the two chromosomes and occurred in opposite orientations, indicating genome rearrangement since their divergence from a common ancestor. Conclusions: Since the genome of strain 129Pt was ~256,000 bp smaller than that of strain 2336, these genomes provide yet another paradigm for studying evolutionary gene loss and/or gain in regard to virulence repertoire and pathogenic ability. Analyses of the complete genome sequences revealed that bacteriophage- and transposon-mediated horizontal gene transfer had occurred at several loci in the chromosomes of strains 2336 and 129Pt. It appears that these mobile genetic elements have played a major role in creating genomic diversity and phenotypic variability among the two H. somni strains. PMID:22111657
Horizontal gene transfer in Histophilus somni and its role in the evolution of pathogenic strain 2336, as determined by comparative genomic analyses.

PubMed

Siddaramappa, Shivakumara; Challacombe, Jean F; Duncan, Alison J; Gillaspy, Allison F; Carson, Matthew; Gipson, Jenny; Orvis, Joshua; Zaitshik, Jeremy; Barnes, Gentry; Bruce, David; Chertkov, Olga; Detter, J Chris; Han, Cliff S; Tapia, Roxanne; Thompson, Linda S; Dyer, David W; Inzana, Thomas J

2011-11-23

Pneumonia and myocarditis are the most commonly reported diseases due to Histophilus somni, an opportunistic pathogen of the reproductive and respiratory tracts of cattle. Thus far only a few genes involved in metabolic and virulence functions have been identified and characterized in H. somni using traditional methods. Analyses of the genome sequences of several Pasteurellaceae species have provided insights into their biology and evolution. In view of the economic and ecological importance of H. somni, the genome sequence of pneumonia strain 2336 has been determined and compared to that of commensal strain 129Pt and other members of the Pasteurellaceae. The chromosome of strain 2336 (2,263,857 bp) contained 1,980 protein coding genes, whereas the chromosome of strain 129Pt (2,007,700 bp) contained only 1,792 protein coding genes. Although the chromosomes of the two strains differ in size, their average GC content, gene density (total number of genes predicted on the chromosome), and percentage of sequence (number of genes) that encodes proteins were similar. The chromosomes of these strains also contained a number of discrete prophage regions and genomic islands. One of the genomic islands in strain 2336 contained genes putatively involved in copper, zinc, and tetracycline resistance. Using the genome sequence data and comparative analyses with other members of the Pasteurellaceae, several H. somni genes that may encode proteins involved in virulence (e.g., filamentous haemagglutinins, adhesins, and polysaccharide biosynthesis/modification enzymes) were identified. The two strains contained a total of 17 ORFs that encode putative glycosyltransferases and some of these ORFs had characteristic simple sequence repeats within them. Most of the genes/loci common to both the strains were located in different regions of the two chromosomes and occurred in opposite orientations, indicating genome rearrangement since their divergence from a common ancestor. 
Since the genome of strain 129Pt was ~256,000 bp smaller than that of strain 2336, these genomes provide yet another paradigm for studying evolutionary gene loss and/or gain in regard to virulence repertoire and pathogenic ability. Analyses of the complete genome sequences revealed that bacteriophage- and transposon-mediated horizontal gene transfer had occurred at several loci in the chromosomes of strains 2336 and 129Pt. It appears that these mobile genetic elements have played a major role in creating genomic diversity and phenotypic variability among the two H. somni strains.
Genomic Organization and Molecular Analysis of Virulent Bacteriophage 2972 Infecting an Exopolysaccharide-Producing Streptococcus thermophilus Strain

PubMed Central

Lévesque, Céline; Duplessis, Martin; Labonté, Jessica; Labrie, Steve; Fremaux, Christophe; Tremblay, Denise; Moineau, Sylvain

2005-01-01

The Streptococcus thermophilus virulent pac-type phage 2972 was isolated from a yogurt made in France in 1999. It is a representative of several phages that have emerged with the industrial use of the exopolysaccharide-producing S. thermophilus strain RD534. The genome of phage 2972 has 34,704 bp with an overall G+C content of 40.15%, making it the shortest S. thermophilus phage genome analyzed so far. Forty-four open reading frames (ORFs) encoding putative proteins of 40 or more amino acids were identified, and bioinformatic analyses led to the assignment of putative functions to 23 ORFs. Comparative genomic analysis of phage 2972 with the six other sequenced S. thermophilus phage genomes confirmed that the replication module is conserved and that cos- and pac-type phages have distinct structural and packaging genes. Two group I introns were identified in the genome of 2972. They interrupted the genes coding for the putative endolysin and the terminase large subunit. Phage mRNA splicing was demonstrated for both introns, and the secondary structures were predicted. Eight structural proteins were also identified by N-terminal sequencing and/or matrix-assisted laser desorption ionization—time-of-flight mass spectrometry. Detailed analysis of the putative minor tail proteins ORF19 and ORF21 as well as the putative receptor-binding protein ORF20 showed the following interesting features: (i) ORF19 is a hybrid protein, because it displays significant identity with both pac- and cos-type phages; (ii) ORF20 is unique; and (iii) a protein similar to ORF21 of 2972 was also found in the structure of the cos-type phage DT1, indicating that this structural protein is present in both S. thermophilus phage groups. The implications of these findings for phage classification are discussed. PMID:16000821
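The two sequence statistics quoted in the abstract above (overall G+C content, and ORFs encoding putative proteins of 40 or more amino acids) can be illustrated with a minimal sketch. The abstract does not say which ORF-calling software or start-codon rules were used, so everything here is an assumption made for illustration: the function names are hypothetical, only ATG is treated as a start codon, and the length threshold is applied to the number of codons before the stop.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_content(seq: str) -> float:
    """Return the G+C percentage of a DNA sequence (0.0 for an empty one)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_orfs(seq: str, min_aa: int = 40):
    """Yield (strand, start, end, n_codons) for simple ATG...stop ORFs.

    Assumptions (not taken from the abstract): ATG is the only start codon,
    all six reading frames are scanned, and n_codons counts the start codon
    but not the stop codon. Coordinates are 0-based, end-exclusive, and are
    given on the strand that was scanned.
    """
    for strand, s in (("+", seq.upper()), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first in-frame start since the last stop
                elif start is not None and codon in STOP_CODONS:
                    n_codons = (i - start) // 3
                    if n_codons >= min_aa:
                        yield strand, start, i + 3, n_codons
                    start = None


# Toy sequence (not phage 2972 data): one start codon, 40 alanine codons, one stop.
toy = "ATG" + "GCT" * 40 + "TAA"
orfs = list(find_orfs(toy, min_aa=40))
```

Real annotation pipelines typically also consider alternative start codons and ribosome-binding sites, which this sketch deliberately ignores.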
Genome wide discovery of long intergenic non-coding RNAs in Diamondback moth (Plutella xylostella) and their expression in insecticide resistant strains

PubMed Central

Etebari, Kayvan; Furlong, Michael J.; Asgari, Sassan

2015-01-01

Long non-coding RNAs (lncRNAs) play important roles in genomic imprinting, cancer, differentiation and regulation of gene expression. Here, we identified 3844 long intergenic ncRNAs (lincRNA) in Plutella xylostella, which is a notorious pest of cruciferous plants that has developed field resistance to all classes of insecticides, including Bacillus thuringiensis (Bt) endotoxins. Further, we found that some of those lincRNAs may potentially serve as precursors for the production of small ncRNAs. We found 280 and 350 lincRNAs that are differentially expressed in Chlorpyrifos and Fipronil resistant larvae. A survey on P. xylostella midgut transcriptome data from Bt-resistant populations revealed 59 altered lincRNA in two resistant strains compared with the susceptible population. We validated the transcript levels of a number of putative lincRNAs in deltamethrin-resistant larvae that were exposed to deltamethrin, which indicated that this group of lincRNAs might be involved in the response to xenobiotics in this insect. To functionally characterize DBM lincRNAs, gene ontology (GO) enrichment of their associated protein-coding genes was extracted and showed over representation of protein, DNA and RNA binding GO terms. The data presented here will facilitate future studies to unravel the function of lincRNAs in insecticide resistance or the response to xenobiotics of eukaryotic cells. PMID:26411386
Chamber Specific Gene Expression Landscape of the Zebrafish Heart

PubMed Central

Singh, Angom Ramcharan; Sivadas, Ambily; Sabharwal, Ankit; Vellarikal, Shamsudheen Karuthedath; Jayarajan, Rijith; Verma, Ankit; Kapoor, Shruti; Joshi, Adita; Scaria, Vinod; Sivasubbu, Sridhar

2016-01-01

The organization of structure and function of cardiac chambers in vertebrates is defined by chamber-specific distinct gene expression. This peculiarity and uniqueness of the genetic signatures demonstrates functional resolution attributed to the different chambers of the heart. Altered expression of the cardiac chamber genes can lead to individual chamber related dysfunctions and disease patho-physiologies. Information on transcriptional repertoire of cardiac compartments is important to understand the spectrum of chamber specific anomalies. We have carried out a genome wide transcriptome profiling study of the three cardiac chambers in the zebrafish heart using RNA sequencing. We have captured the gene expression patterns of 13,396 protein coding genes in the three cardiac chambers: atrium, ventricle and bulbus arteriosus. Of these, 7,260 known protein coding genes are highly expressed (≥10 FPKM) in the zebrafish heart. Thus, this study provides a nearly comprehensive view of the zebrafish cardiac transcriptome. In this study, a total of 96 differentially expressed genes across the three cardiac chambers in zebrafish were identified. The atrium, ventricle and bulbus arteriosus displayed 20, 32 and 44 uniquely expressing genes respectively. We validated the expression of predicted chamber-restricted genes using independent semi-quantitative and qualitative experimental techniques. In addition, we identified 23 putative novel protein coding genes that are specifically restricted to the ventricle and not in the atrium or bulbus arteriosus. To our knowledge, these 23 novel genes have either not been investigated in detail or are sparsely studied. The transcriptome identified in this study includes 68 differentially expressing zebrafish cardiac chamber genes that have a human ortholog. 
We also carried out spatiotemporal gene expression profiling of the 96 differentially expressed genes throughout the three cardiac chambers in 11 developmental stages and 6 tissue types of zebrafish. We hypothesize that clustering the differentially expressed genes with both known and unknown functions will deliver detailed insights on fundamental gene networks that are important for the development and specification of the cardiac chambers. It is also postulated that this transcriptome atlas will help utilize zebrafish in a better way as a model for studying cardiac development and to explore functional role of gene networks in cardiac disease pathogenesis. PMID:26815362
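The "highly expressed (≥10 FPKM)" cutoff in the abstract above can be made concrete with the standard definition of FPKM (fragments per kilobase of transcript per million mapped fragments). The sketch below is only an illustration: the function names and the example counts are hypothetical, and the abstract does not describe how the authors' pipeline computed or filtered FPKM values.

```python
def fpkm(fragments: int, gene_length_bp: int, total_mapped_fragments: int) -> float:
    """Standard FPKM: 1e9 * fragments / (gene length in bp * total mapped fragments)."""
    if gene_length_bp <= 0 or total_mapped_fragments <= 0:
        raise ValueError("gene length and library size must be positive")
    return 1e9 * fragments / (gene_length_bp * total_mapped_fragments)


def is_highly_expressed(value: float, threshold: float = 10.0) -> bool:
    """Apply the >=10 FPKM cutoff quoted in the abstract (threshold is adjustable)."""
    return value >= threshold


# Hypothetical gene: 500 fragments on a 1,000 bp transcript in a library
# of 50 million mapped fragments.
example = fpkm(500, 1_000, 50_000_000)
```

With these made-up numbers the gene sits exactly at the cutoff, which shows that the threshold depends on both transcript length and sequencing depth, not on raw counts alone.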
Nucleotide sequence and transcriptional start site of the Methylobacterium organophilum XX methanol dehydrogenase structural gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Machlin, S.M.; Hanson, R.S.

The nucleotide sequence of a cloned 2.5-kilobase-pair SmaI fragment containing the methanol dehydrogenase (MDH) structural gene from Methylobacterium organophilum XX was determined. A single open reading frame with a coding capacity of 626 amino acids (molecular weight, 66,000) was identified on one strand, and N-terminal sequencing of purified MDH revealed that 27 of these residues constituted a putative signal peptide. Primer extension mapping of in vivo transcripts indicated that the start of mRNA synthesis was 160 to 170 base pairs upstream of the ATG codon. Northern (RNA) blot analysis further demonstrated that the transcript was 2.1 kilobase pairs in length and therefore appeared to encode only MDH.
Evolution and Variation of Renin Genes in Mice

PubMed Central

Dickinson, Douglas P.; Gross, Kenneth W.; Piccini, Nina; Wilson, Carol M.

1984-01-01

Inbred strains of mice carry Ren-1, a gene encoding the thermostable Renin-1 isozyme. Ren-1 is expressed at relatively low levels in mouse submandibular gland and kidney. Some strains also carry Ren-2, a gene encoding the thermolabile Renin-2 isozyme. Ren-2 is expressed at high levels in the mouse submandibular gland and at very low levels, if at all, in the kidney. Ren-1 and Ren-2 are closely linked on mouse chromosome 1, show extensive homology in coding and noncoding regions and provide a model for studying the regulation of gene expression. An investigation of renin genes and enzymatic activity in wild-derived mice identified several restriction site polymorphisms as well as putative variants in renin gene expression and protein structure. The number of renin genes carried by different subpopulations of wild-derived mice is consistent with the occurrence of a gene duplication event prior to the divergence of M. spretus (2.75–5.5 million yr ago). This conclusion is in agreement with a prior estimate based upon comparative sequence analysis of Ren-1 and Ren-2 from inbred laboratory mice. PMID:6389258
The Salivary Protein Repertoire of the Polyphagous Spider Mite Tetranychus urticae: A Quest for Effectors.

PubMed

Jonckheere, Wim; Dermauw, Wannes; Zhurov, Vladimir; Wybouw, Nicky; Van den Bulcke, Jan; Villarroel, Carlos A; Greenhalgh, Robert; Grbić, Mike; Schuurink, Rob C; Tirry, Luc; Baggerman, Geert; Clark, Richard M; Kant, Merijn R; Vanholme, Bartel; Menschaert, Gerben; Van Leeuwen, Thomas

2016-12-01

The two-spotted spider mite Tetranychus urticae is an extremely polyphagous crop pest. Alongside an unparalleled detoxification potential for plant secondary metabolites, it has recently been shown that spider mites can attenuate or even suppress plant defenses. Salivary constituents, notably effectors, have been proposed to play an important role in manipulating plant defenses and might determine the outcome of plant-mite interactions. Here, the proteomic composition of saliva from T. urticae lines adapted to various host plants (bean, maize, soy, and tomato) was analyzed using a custom-developed feeding assay coupled with nano-LC tandem mass spectrometry. About 90 putative T. urticae salivary proteins were identified. Many are of unknown function, and in numerous cases belong to multimembered gene families. RNAseq expression analysis revealed that many genes coding for these salivary proteins were highly expressed in the proterosoma, the mite body region that includes the salivary glands. A subset of genes encoding putative salivary proteins was selected for whole-mount in situ hybridization, and these genes were found to be expressed in the anterior and dorsal podocephalic glands. Strikingly, host plant dependent expression was evident for putative salivary proteins, and was further studied in detail by micro-array based genome-wide expression profiling. This meta-analysis revealed for the first time the salivary protein repertoire of a phytophagous chelicerate. The availability of this salivary proteome will assist in unraveling the molecular interface between phytophagous mites and their host plants, and may ultimately facilitate the development of mite-resistant crops. Furthermore, the technique used in this study is a time- and resource-efficient method to examine the salivary protein composition of other small arthropods for which saliva or salivary glands cannot be isolated easily. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
The Salivary Protein Repertoire of the Polyphagous Spider Mite Tetranychus urticae: A Quest for Effectors*

PubMed Central

Jonckheere, Wim; Zhurov, Vladimir; Villarroel, Carlos A.; Greenhalgh, Robert; Grbić, Mike; Schuurink, Rob C.; Tirry, Luc; Kant, Merijn R.; Vanholme, Bartel

2016-01-01

The two-spotted spider mite Tetranychus urticae is an extremely polyphagous crop pest. Alongside an unparalleled detoxification potential for plant secondary metabolites, it has recently been shown that spider mites can attenuate or even suppress plant defenses. Salivary constituents, notably effectors, have been proposed to play an important role in manipulating plant defenses and might determine the outcome of plant-mite interactions. Here, the proteomic composition of saliva from T. urticae lines adapted to various host plants (bean, maize, soy, and tomato) was analyzed using a custom-developed feeding assay coupled with nano-LC tandem mass spectrometry. About 90 putative T. urticae salivary proteins were identified. Many are of unknown function, and in numerous cases belong to multimembered gene families. RNAseq expression analysis revealed that many genes coding for these salivary proteins were highly expressed in the proterosoma, the mite body region that includes the salivary glands. A subset of genes encoding putative salivary proteins was selected for whole-mount in situ hybridization, and these genes were found to be expressed in the anterior and dorsal podocephalic glands. Strikingly, host plant dependent expression was evident for putative salivary proteins, and was further studied in detail by micro-array based genome-wide expression profiling. This meta-analysis revealed for the first time the salivary protein repertoire of a phytophagous chelicerate. The availability of this salivary proteome will assist in unraveling the molecular interface between phytophagous mites and their host plants, and may ultimately facilitate the development of mite-resistant crops. Furthermore, the technique used in this study is a time- and resource-efficient method to examine the salivary protein composition of other small arthropods for which saliva or salivary glands cannot be isolated easily. PMID:27703040
Rapid quantification of mutant fitness in diverse bacteria by sequencing randomly bar-coded transposons

DOE PAGES

Wetmore, Kelly M.; Price, Morgan N.; Waters, Robert J.; ...

2015-05-12

Transposon mutagenesis with next-generation sequencing (TnSeq) is a powerful approach to annotate gene function in bacteria, but existing protocols for TnSeq require laborious preparation of every sample before sequencing. Thus, the existing protocols are not amenable to the throughput necessary to identify phenotypes and functions for the majority of genes in diverse bacteria. Here, we present a method, random bar code transposon-site sequencing (RB-TnSeq), which increases the throughput of mutant fitness profiling by incorporating random DNA bar codes into Tn5 and mariner transposons and by using bar code sequencing (BarSeq) to assay mutant fitness. RB-TnSeq can be used with any transposon, and TnSeq is performed once per organism instead of once per sample. Each BarSeq assay requires only a simple PCR, and 48 to 96 samples can be sequenced on one lane of an Illumina HiSeq system. We demonstrate the reproducibility and biological significance of RB-TnSeq with Escherichia coli, Phaeobacter inhibens, Pseudomonas stutzeri, Shewanella amazonensis, and Shewanella oneidensis. To demonstrate the increased throughput of RB-TnSeq, we performed 387 successful genome-wide mutant fitness assays representing 130 different bacterium-carbon source combinations and identified 5,196 genes with significant phenotypes across the five bacteria. In P. inhibens, we used our mutant fitness data to identify genes important for the utilization of diverse carbon substrates, including a putative D-mannose isomerase that is required for mannitol catabolism. RB-TnSeq will enable the cost-effective functional annotation of diverse bacteria using mutant fitness profiling. A large challenge in microbiology is the functional assessment of the millions of uncharacterized genes identified by genome sequencing. Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach to assign phenotypes and functions to genes. 
However, the current strategies for TnSeq are too laborious to be applied to hundreds of experimental conditions across multiple bacteria. Here, we describe an approach, random bar code transposon-site sequencing (RB-TnSeq), which greatly simplifies the measurement of gene fitness by using bar code sequencing (BarSeq) to monitor the abundance of mutants. We performed 387 genome-wide fitness assays across five bacteria and identified phenotypes for over 5,000 genes. RB-TnSeq can be applied to diverse bacteria and is a powerful tool to annotate uncharacterized genes using phenotype data.« less
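The idea of monitoring mutant abundance through bar code counts can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the exact-match bar code lookup, the pseudocount, and the plain log2 ratio are all assumptions chosen for illustration.

```python
import math
from collections import Counter

def barcode_counts(reads, known_barcodes):
    """Count reads whose bar code exactly matches a previously mapped mutant."""
    return Counter(bc for bc in reads if bc in known_barcodes)

def strain_fitness(before, after, pseudocount=1.0):
    """Hypothetical fitness score: log2 change in relative bar code abundance."""
    total_before = sum(before.values()) or 1
    total_after = sum(after.values()) or 1
    fitness = {}
    for bc in set(before) | set(after):
        # The pseudocount avoids log(0) for bar codes that drop out entirely.
        b = (before.get(bc, 0) + pseudocount) / total_before
        a = (after.get(bc, 0) + pseudocount) / total_after
        fitness[bc] = math.log2(a / b)
    return fitness
```

In this sketch the bar code to insertion-site mapping is assumed to have been built once per organism, so each additional condition only needs a new list of bar code reads.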
Rapid quantification of mutant fitness in diverse bacteria by sequencing randomly bar-coded transposons

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wetmore, Kelly M.; Price, Morgan N.; Waters, Robert J.

Transposon mutagenesis with next-generation sequencing (TnSeq) is a powerful approach to annotate gene function in bacteria, but existing protocols for TnSeq require laborious preparation of every sample before sequencing. Thus, the existing protocols are not amenable to the throughput necessary to identify phenotypes and functions for the majority of genes in diverse bacteria. Here, we present a method, random bar code transposon-site sequencing (RB-TnSeq), which increases the throughput of mutant fitness profiling by incorporating random DNA bar codes into Tn5 and mariner transposons and by using bar code sequencing (BarSeq) to assay mutant fitness. RB-TnSeq can be used with any transposon, and TnSeq is performed once per organism instead of once per sample. Each BarSeq assay requires only a simple PCR, and 48 to 96 samples can be sequenced on one lane of an Illumina HiSeq system. We demonstrate the reproducibility and biological significance of RB-TnSeq with Escherichia coli, Phaeobacter inhibens, Pseudomonas stutzeri, Shewanella amazonensis, and Shewanella oneidensis. To demonstrate the increased throughput of RB-TnSeq, we performed 387 successful genome-wide mutant fitness assays representing 130 different bacterium-carbon source combinations and identified 5,196 genes with significant phenotypes across the five bacteria. In P. inhibens, we used our mutant fitness data to identify genes important for the utilization of diverse carbon substrates, including a putative D-mannose isomerase that is required for mannitol catabolism. RB-TnSeq will enable the cost-effective functional annotation of diverse bacteria using mutant fitness profiling. A large challenge in microbiology is the functional assessment of the millions of uncharacterized genes identified by genome sequencing. Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach to assign phenotypes and functions to genes. However, the current strategies for TnSeq are too laborious to be applied to hundreds of experimental conditions across multiple bacteria. Here, we describe an approach, random bar code transposon-site sequencing (RB-TnSeq), which greatly simplifies the measurement of gene fitness by using bar code sequencing (BarSeq) to monitor the abundance of mutants. We performed 387 genome-wide fitness assays across five bacteria and identified phenotypes for over 5,000 genes. RB-TnSeq can be applied to diverse bacteria and is a powerful tool to annotate uncharacterized genes using phenotype data.
Analysis of resistance genes of clinical Pannonibacter phragmitetus strain 31801 by complete genome sequencing.

PubMed

Ming, De-Song; Chen, Qing-Qing; Chen, Xiao-Tin

2018-05-14

To clarify the resistance mechanisms of Pannonibacter phragmitetus 31801, isolated from the blood of a liver abscess patient, at the genomic level, we performed whole genomic sequencing using a PacBio RS II single-molecule real-time long-read sequencer. Bioinformatic analysis of the resulting sequence was then carried out to identify any possible resistance genes. Analyses included Basic Local Alignment Search Tool searches against the Antibiotic Resistance Genes Database, ResFinder analysis of the genome sequence, and Resistance Gene Identifier analysis within the Comprehensive Antibiotic Resistance Database. Prophages, clustered regularly interspaced short palindromic repeats (CRISPR), and other putative virulence factors were also identified using PHAST, CRISPRfinder, and the Virulence Factors Database, respectively. The circular chromosome and single plasmid of P. phragmitetus 31801 contained multiple antibiotic resistance genes, including those coding for three different types of β-lactamase [NPS β-lactamase (EC 3.5.2.6), β-lactamase class C, and a metal-dependent hydrolase of β-lactamase superfamily I]. In addition, genes coding for subunits of several multidrug-resistance efflux pumps were identified, including those targeting macrolides (adeJ, cmeB), tetracycline (acrB, adeAB), fluoroquinolones (acrF, ceoB), and aminoglycosides (acrD, amrB, ceoB, mexY, smeB). However, apart from the tripartite macrolide efflux pump macAB-tolC, the genome did not appear to contain the complete complement of subunit genes required for production of most of the major multidrug-resistance efflux pumps.
Molecular Characterization of the Llamas (Lama glama) Casein Cluster Genes Transcripts (CSN1S1, CSN2, CSN1S2, CSN3) and Regulatory Regions

PubMed Central

Pauciullo, Alfredo; Erhardt, Georg

2015-01-01

In the present paper, we report for the first time the characterization of llama (Lama glama) caseins at the transcriptomic and genetic levels. A total of 288 casein clone transcripts were analysed from two lactating llamas. The most represented mRNA populations were those correctly assembled (85.07%), and they encoded mature proteins of 215, 217, 187 and 162 amino acids for the CSN1S1, CSN2, CSN1S2 and CSN3 genes, respectively. The exonic subdivision showed a structure of 21, 9, 17 and 6 exons for the αs1-, β-, αs2- and κ-casein genes, respectively. Exon skipping and duplication events were observed. Two variants, A and B, were identified in the αs1-casein gene as a result of the alternative out-splicing of exon 18. An additional exon coding for a novel hexapeptide was found to be cryptic in the κ-casein gene, whereas one extra exon was found in the αs2-casein gene by comparison with the Camelus dromedarius sequence. A total of 28 putative phosphorylation motifs highlighted a complex heterogeneity and a potentially variable degree of post-translational modification. Ninety-six polymorphic sites were found through comparison of the llama casein cDNAs with the homologous camel sequences, whereas the first description and characterization of the 5'- and 3'-regulatory regions allowed identification of the main putative consensus sequences involved in casein gene expression, thus opening the way to investigations not previously carried out in this species. PMID:25923814
Molecular Characterization of the Llamas (Lama glama) Casein Cluster Genes Transcripts (CSN1S1, CSN2, CSN1S2, CSN3) and Regulatory Regions.

PubMed

Pauciullo, Alfredo; Erhardt, Georg

2015-01-01

In the present paper, we report for the first time the characterization of llama (Lama glama) caseins at the transcriptomic and genetic levels. A total of 288 casein clone transcripts were analysed from two lactating llamas. The most represented mRNA populations were those correctly assembled (85.07%), and they encoded mature proteins of 215, 217, 187 and 162 amino acids for the CSN1S1, CSN2, CSN1S2 and CSN3 genes, respectively. The exonic subdivision showed a structure of 21, 9, 17 and 6 exons for the αs1-, β-, αs2- and κ-casein genes, respectively. Exon skipping and duplication events were observed. Two variants, A and B, were identified in the αs1-casein gene as a result of the alternative out-splicing of exon 18. An additional exon coding for a novel hexapeptide was found to be cryptic in the κ-casein gene, whereas one extra exon was found in the αs2-casein gene by comparison with the Camelus dromedarius sequence. A total of 28 putative phosphorylation motifs highlighted a complex heterogeneity and a potentially variable degree of post-translational modification. Ninety-six polymorphic sites were found through comparison of the llama casein cDNAs with the homologous camel sequences, whereas the first description and characterization of the 5'- and 3'-regulatory regions allowed identification of the main putative consensus sequences involved in casein gene expression, thus opening the way to investigations not previously carried out in this species.

A single base substitution in the coding region for neurophysin II associated with familial central diabetes insipidus.

PubMed Central

Ito, M; Mori, Y; Oiso, Y; Saito, H

1991-01-01

To elucidate the molecular mechanism of familial central diabetes insipidus (FDI), we sequenced the arginine vasopressin-neurophysin II (AVP-NPII) gene in 2 patients belonging to a pedigree that is consistent with an autosomal dominant mode of inheritance. Ten patients with idiopathic central diabetes insipidus (IDI) and 5 normal subjects were also studied. The AVP-NPII gene, located on chromosome 20, consists of three exons that encode a putative signal peptide, AVP, NPII, and glycoprotein. Using polymerase chain reaction, fragments including the promoter region and all coding regions were amplified from genomic DNA and subjected to direct sequencing. Sequences of the 10 patients with IDI were identical with those of the normal subjects, while in the 2 patients with FDI, a single base substitution was detected in one of two alleles of the AVP-NPII gene, indicating they were heterozygotes for this mutation. It was a G→A transition at nucleotide position 1859 in the second exon, resulting in a substitution of Ser for Gly at amino acid position 57 in the NPII moiety. It was speculated that the mutated AVP-NPII precursor or the mutated NPII molecule, through their conformational changes, might be responsible for AVP deficiency. PMID:1840604
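Which amino acid changes a single G→A transition can produce follows directly from the standard genetic code. The Python sketch below enumerates them for any residue; the actual codon at position 57 is not given in the abstract, so nothing here identifies the specific codon involved, and the function name is an illustrative assumption.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def g_to_a_outcomes(amino_acid):
    """Map (codon, position) to (mutant codon, new residue) for each G->A change."""
    outcomes = {}
    for codon, aa in CODON_TABLE.items():
        if aa != amino_acid:
            continue
        for i, base in enumerate(codon):
            if base == "G":
                mutant = codon[:i] + "A" + codon[i + 1:]
                outcomes[(codon, i + 1)] = (mutant, CODON_TABLE[mutant])
    return outcomes
```

Calling `g_to_a_outcomes("G")` shows that a first-position G→A change in the glycine codons GGT and GGC yields serine codons, whereas no G→A change in any serine codon yields glycine.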
Gene 2 of the sigma rhabdovirus genome encodes the P protein, and gene 3 encodes a protein related to the reverse transcriptase of retroelements.

PubMed

Landès-Devauchelle, C; Bras, F; Dezélée, S; Teninges, D

1995-11-10

The nucleotide sequence of the genes 2 and 3 of the Drosophila rhabdovirus sigma was determined from cDNAs to viral genome and poly(A)+ mRNAs. Gene 2 comprises 1032 nucleotides and contains a long ORF encoding a molecular weight 35,208 polypeptide present in infected cells and in virions which migrates in SDS-PAGE as a doublet of M(r) about 60 kDa. The distribution of acidic charges as well as the electrophoretic properties of the protein are characteristic of the rhabdovirus P proteins. Gene 3 comprises 923 nucleotides and contains a long ORF capable of coding a polypeptide of 298 amino acids of MW 33,790. The putative protein (PP3) is similar in size to a minor component of the virions. Computer analysis shows that the sequence of PP3 contains three motifs related to the conserved motifs of reverse transcriptases.
A Two-Locus Global DNA Barcode for Land Plants: The Coding rbcL Gene Complements the Non-Coding trnH-psbA Spacer Region

PubMed Central

Kress, W. John; Erickson, David L.

2007-01-01

Background: A useful DNA barcode requires sufficient sequence variation to distinguish between species and ease of application across a broad range of taxa. Discovery of a DNA barcode for land plants has been limited by intrinsically lower rates of sequence evolution in plant genomes than that observed in animals. This low rate has complicated the trade-off in finding a locus that is universal and readily sequenced and has sufficiently high sequence divergence at the species-level. Methodology/Principal Findings: Here, a global plant DNA barcode system is evaluated by comparing universal application and degree of sequence divergence for nine putative barcode loci, including coding and non-coding regions, singly and in pairs across a phylogenetically diverse set of 48 genera (two species per genus). No single locus could discriminate among species in a pair in more than 79% of genera, whereas discrimination increased to nearly 88% when the non-coding trnH-psbA spacer was paired with one of three coding loci, including rbcL. In silico trials were conducted in which DNA sequences from GenBank were used to further evaluate the discriminatory power of a subset of these loci. These trials supported the earlier observation that trnH-psbA coupled with rbcL can correctly identify and discriminate among related species. Conclusions/Significance: A combination of the non-coding trnH-psbA spacer region and a portion of the coding rbcL gene is recommended as a two-locus global land plant barcode that provides the necessary universality and species discrimination. PMID:17551588
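The single-locus versus two-locus comparison can be sketched in Python as follows. This is a simplified illustration, not the authors' analysis: it treats a species pair as discriminated whenever the two sequences differ at all, and the function name and data layout are assumptions.

```python
def discrimination_rate(genera, loci):
    """Fraction of genera whose two species differ at one or more of `loci`.

    `genera` maps a genus name to a pair of dicts, one per species,
    each mapping a locus name to its sequence string.
    """
    hits = sum(
        any(sp1[locus] != sp2[locus] for locus in loci)
        for sp1, sp2 in genera.values()
    )
    return hits / len(genera)
```

Evaluating `discrimination_rate(genera, ["trnH-psbA", "rbcL"])` against each locus alone shows how pairing loci can only keep or raise the rate, because a pair is resolved if either locus separates the two species.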
Association of an SNP in a novel DREB2-like gene SiDREB2 with stress tolerance in foxtail millet [Setaria italica (L.)].

PubMed

Lata, Charu; Bhutty, Sarita; Bahadur, Ranjit Prasad; Majee, Manoj; Prasad, Manoj

2011-06-01

The DREB genes code for important plant transcription factors involved in the abiotic stress response and signal transduction. Characterization of DREB genes and development of functional markers for effective alleles is important for marker-assisted selection in foxtail millet. Here the characterization of a cDNA (SiDREB2) encoding a putative dehydration-responsive element-binding protein 2 from foxtail millet and the development of an allele-specific marker (ASM) for dehydration tolerance is reported. A cDNA clone (GenBank accession no. GT090998) coding for a putative DREB2 protein was isolated as a differentially expressed gene from a 6 h dehydration stress SSH library. A 5' RACE (rapid amplification of cDNA ends) was carried out to obtain the full-length cDNA, and sequence analysis showed that SiDREB2 encoded a polypeptide of 234 amino acids with a predicted mol. wt of 25.72 kDa and a theoretical pI of 5.14. A theoretical model of the tertiary structure shows that it has a highly conserved GCC-box-binding N-terminal domain, and an acidic C-terminus that acts as an activation domain for transcription. Based on its similarity to AP2 domains, SiDREB2 was classified into the A-2 subgroup of the DREB subfamily. Quantitative real-time PCR analysis showed significant up-regulation of SiDREB2 by dehydration (polyethylene glycol) and salinity (NaCl), while its expression was less affected by other stresses. A synonymous single nucleotide polymorphism (SNP) associated with dehydration tolerance was detected at the 558th base pair (an A/G transition) in the SiDREB2 gene in a core set of 45 foxtail millet accessions used. Based on the identified SNP, three primers were designed to develop an ASM for dehydration tolerance. The ASM produced a 261 bp fragment in all the tolerant accessions and produced no amplification in the sensitive accessions. 
The use of this ASM might be faster, cheaper, and more reproducible than other SNP genotyping methods, and thus will enable marker-aided breeding of foxtail millet for dehydration tolerance.
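A quick check of where base 558 falls within a reading frame is sketched below in Python. It assumes that the position is numbered from the first base of the start codon, which the abstract does not state; under that assumption the SNP sits at a third codon position, where synonymous changes are common.

```python
def codon_position(base_number):
    """Return (codon index, position within codon), both 1-based.

    Assumes base 1 is the first base of the start codon (an assumption).
    """
    codon_index, offset = divmod(base_number - 1, 3)
    return codon_index + 1, offset + 1

# A 234-residue polypeptide implies a coding sequence of 234 codons plus a stop.
cds_length = 234 * 3 + 3
snp_codon, snp_offset = codon_position(558)
```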
Integrated DNA/RNA targeted genomic profiling of diffuse large B-cell lymphoma using a clinical assay.

PubMed

Intlekofer, Andrew M; Joffe, Erel; Batlevi, Connie L; Hilden, Patrick; He, Jie; Seshan, Venkatraman E; Zelenetz, Andrew D; Palomba, M Lia; Moskowitz, Craig H; Portlock, Carol; Straus, David J; Noy, Ariela; Horwitz, Steven M; Gerecitano, John F; Moskowitz, Alison; Hamlin, Paul; Matasar, Matthew J; Kumar, Anita; van den Brink, Marcel R; Knapp, Kristina M; Pichardo, Janine D; Nahas, Michelle K; Trabucco, Sally E; Mughal, Tariq; Copeland, Amanda R; Papaemmanuil, Elli; Moarii, Mathai; Levine, Ross L; Dogan, Ahmet; Miller, Vincent A; Younes, Anas

2018-06-12

We sought to define the genomic landscape of diffuse large B-cell lymphoma (DLBCL) by using formalin-fixed paraffin-embedded (FFPE) biopsy specimens. We used targeted sequencing of genes altered in hematologic malignancies, including DNA coding sequence for 405 genes, noncoding sequence for 31 genes, and RNA coding sequence for 265 genes (FoundationOne-Heme). Short variants, rearrangements, and copy number alterations were determined. We studied 198 samples (114 de novo, 58 previously treated, and 26 large-cell transformation from follicular lymphoma). The median number of genomic alterations (GAs) per case was 6, with 97% of patients harboring at least one alteration. Recurrent GAs were detected in genes with established roles in DLBCL pathogenesis (e.g. MYD88, CREBBP, CD79B, EZH2), as well as notable differences compared to prior studies such as inactivating mutations in TET2 (5%). Less common GAs identified potential targets for approved or investigational therapies, including BRAF, CD274 (PD-L1), IDH2, and JAK1/2. TP53 mutations were more frequently observed in relapsed/refractory DLBCL, and predicted for lack of response to first-line chemotherapy, identifying a subset of patients that could be prioritized for novel therapies. Overall, 90% (n = 169) of the patients harbored a GA which could be explored for therapeutic intervention, with 54% (n = 107) harboring more than one putative target.
The Complete Mitochondrial Genome of Ctenoptilum vasava (Lepidoptera: Hesperiidae: Pyrginae) and Its Phylogenetic Implication

PubMed Central

Hao, Jiasheng; Sun, Qianqian; Zhao, Huabin; Sun, Xiaoyan; Gai, Yonghua; Yang, Qun

2012-01-01

We here report the first complete mitochondrial (mt) genome of a skipper, Ctenoptilum vasava Moore, 1865 (Lepidoptera: Hesperiidae: Pyrginae). The mt genome of the skipper is a circular molecule of 15,468 bp, containing 2 ribosomal RNA genes, 24 putative transfer RNA (tRNA) genes, including an extra copy of trnS (AGN) and a tRNA-like insertion trnL (UUR), 13 protein-coding genes and an AT-rich region. All protein-coding genes (PCGs) are initiated by ATN codons and terminated by the typical stop codon TAA or TAG, except for COII which ends with a single T. The intergenic spacer sequence between trnS (AGN) and ND1 genes also contains the ATACTAA motif. The AT-rich region of 429 bp is comprised of nonrepetitive sequences, including the motif ATAGA followed by a 19 bp poly-T stretch, a microsatellite-like (AT)3 (TA)9 element next to the ATTTA motif, and an 11 bp poly-A adjacent to tRNAs. Phylogenetic analyses (ML and BI methods) showed that Papilionoidea is not a natural group, and Hesperioidea is placed within the Papilionoidea as a sister to ((Pieridae + Lycaenidae) + Nymphalidae) while Papilionoidae is paraphyletic to Hesperioidea. This result is remarkably different from the traditional view where Papilionoidea and Hesperioidea are considered as two distinct superfamilies. PMID:22577351
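The start and stop codon conventions described for the protein-coding genes can be expressed as a small Python check. This is an illustrative sketch with a hypothetical function name; it treats a trailing single T as an incomplete stop codon only when the sequence length leaves exactly one base beyond the last full codon.

```python
def check_pcg(cds):
    """Return (start_ok, stop_type) for a protein-coding gene sequence."""
    cds = cds.upper()
    # ATN start codon: AT followed by any of the four bases.
    start_ok = len(cds) >= 3 and cds[:2] == "AT" and cds[2] in "ACGT"
    if len(cds) % 3 == 0 and cds[-3:] in ("TAA", "TAG"):
        stop_type = "complete"
    elif len(cds) % 3 == 1 and cds.endswith("T"):
        # A lone T that is presumed to be completed to TAA after transcription.
        stop_type = "incomplete T"
    else:
        stop_type = "unrecognized"
    return start_ok, stop_type
```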
Are MAO-A deficiency states in the general population and in putative high-risk populations highly uncommon?

PubMed

Murphy, D L; Sims, K; Eisenhofer, G; Greenberg, B D; George, T; Berlin, F; Zametkin, A; Ernst, M; Breakefield, X O

1998-01-01

Lack of monoamine oxidase A (MAO-A) due to either Xp chromosomal deletions or alterations in the coding sequence of the gene for this enzyme are associated with marked changes in monoamine metabolism and appear to be associated with variable cognitive deficits and behavioral changes in humans and in transgenic mice. In mice, some of the most marked behavioral changes are ameliorated by pharmacologically-induced reductions in serotonin synthesis during early development, raising the question of possible therapeutic interventions in humans with MAO deficiency states. At the present time, only one multi-generational family and a few other individuals with marked MAO-A deficiency states have been identified and studied in detail. Although MAO deficiency states associated with Xp chromosomal deletions were identified by distinct symptoms (including blindness in infancy) produced by the contiguous Norrie disease gene, the primarily behavioral phenotype of individuals with the MAO mutation is less obvious. This paper reports a sequential research design and preliminary results from screening several hundred volunteers in the general population and from putative high-risk groups for possible MAO deficiency states. These preliminary results suggest that marked MAO deficiency states are very rare.
SNPs in putative regulatory regions identified by human mouse comparative sequencing and transcription factor binding site data

DOE Office of Scientific and Technical Information (OSTI.GOV)

Banerjee, Poulabi; Bahlo, Melanie; Schwartz, Jody R.

2002-01-01

Genome wide disease association analysis using SNPs is being explored as a method for dissecting complex genetic traits and a vast number of SNPs have been generated for this purpose. As there are cost and throughput limitations of genotyping large numbers of SNPs and statistical issues regarding the large number of dependent tests on the same data set, to make association analysis practical it has been proposed that SNPs should be prioritized based on likely functional importance. The most easily identifiable functional SNPs are coding SNPs (cSNPs) and accordingly cSNPs have been screened in a number of studies. SNPs in gene regulatory sequences embedded in noncoding DNA are another class of SNPs suggested for prioritization due to their predicted quantitative impact on gene expression. The main challenge in evaluating these SNPs, in contrast to cSNPs, is a lack of robust algorithms and databases for recognizing regulatory sequences in noncoding DNA. Approaches that have been previously used to delineate noncoding sequences with gene regulatory activity include cross-species sequence comparisons and the search for sequences recognized by transcription factors. We combined these two methods to sift through mouse and human genomic sequences to identify putative gene regulatory elements and subsequently localized SNPs within these sequences in a 1 Megabase (Mb) region of human chromosome 5q31, orthologous to mouse chromosome 11 containing the Interleukin cluster.
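Combining the two lines of evidence and then localizing SNPs amounts to an interval intersection followed by a point lookup. The Python sketch below assumes sorted, non-overlapping, half-open (start, end) coordinates on a single chromosome; the function names and the coordinate convention are illustrative assumptions, not details from the abstract.

```python
def intersect(a, b):
    """Intersect two sorted lists of non-overlapping half-open intervals."""
    out, i, j = [], 0, 0
    while i < len(a) and j < len(b):
        start, end = max(a[i][0], b[j][0]), min(a[i][1], b[j][1])
        if start < end:
            out.append((start, end))
        # Advance whichever interval finishes first.
        if a[i][1] < b[j][1]:
            i += 1
        else:
            j += 1
    return out

def snps_in(regions, snp_positions):
    """Keep SNP positions that fall inside any candidate regulatory region."""
    return [p for p in snp_positions if any(s <= p < e for s, e in regions)]
```

Here `a` would hold cross-species conserved noncoding blocks and `b` predicted transcription factor binding sites, so the intersection is the set of putative regulatory elements supported by both.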
The putative multidrug resistance protein MRP-7 inhibits methylmercury-associated animal toxicity and dopaminergic neurodegeneration in Caenorhabditis elegans

PubMed Central

VanDuyn, Natalia; Nass, Richard

2013-01-01

Parkinson’s disease (PD) is the most prevalent neurodegenerative motor disorder worldwide, and results in the progressive loss of dopamine (DA) neurons in the substantia nigra pars compacta. Gene-environment interactions are believed to play a significant role in the vast majority of PD cases, yet the toxicants and the associated genes involved in the neuropathology are largely ill-defined. Recent epidemiological and biochemical evidence suggests that methylmercury (MeHg) may be an environmental toxicant that contributes to the development of PD. Here we report that a gene coding for the putative multidrug resistance protein MRP-7 in Caenorhabditis elegans (C. elegans) modulates whole animal and DA neuron sensitivity to MeHg. In this study we demonstrate that genetic knockdown of MRP-7 results in a 2-fold increase in Hg levels and a dramatic increase in stress response proteins associated with the endoplasmic reticulum, golgi apparatus, and mitochondria, as well as an increase in MeHg-associated animal death. Chronic exposure to low concentrations of MeHg induces MRP-7 gene expression, while exposures in MRP-7 genetic knockdown animals results in a loss of DA neuron integrity without affecting whole animal viability. Furthermore, transgenic animals expressing a fluorescent reporter behind the endogenous MRP-7 promoter indicate that the transporter is expressed in DA neurons. These studies show for the first time that a multidrug resistance protein is expressed in DA neurons, and its expression inhibits MeHg-associated DA neuron pathology. PMID:24266639
The putative multidrug resistance protein MRP-7 inhibits methylmercury-associated animal toxicity and dopaminergic neurodegeneration in Caenorhabditis elegans.

PubMed

VanDuyn, Natalia; Nass, Richard

2014-03-01

Parkinson's disease (PD) is the most prevalent neurodegenerative motor disorder worldwide, and results in the progressive loss of dopamine (DA) neurons in the substantia nigra pars compacta. Gene-environment interactions are believed to play a significant role in the vast majority of PD cases, yet the toxicants and the associated genes involved in the neuropathology are largely ill-defined. Recent epidemiological and biochemical evidence suggests that methylmercury (MeHg) may be an environmental toxicant that contributes to the development of PD. Here, we report that a gene coding for the putative multidrug resistance protein MRP-7 in Caenorhabditis elegans modulates whole animal and DA neuron sensitivity to MeHg. In this study, we demonstrate that genetic knockdown of MRP-7 results in a twofold increase in Hg levels and a dramatic increase in stress response proteins associated with the endoplasmic reticulum, golgi apparatus, and mitochondria, as well as an increase in MeHg-associated animal death. Chronic exposure to low concentrations of MeHg induces MRP-7 gene expression, while exposures in MRP-7 genetic knockdown animals results in a loss of DA neuron integrity without affecting whole animal viability. Furthermore, transgenic animals expressing a fluorescent reporter behind the endogenous MRP-7 promoter indicate that the transporter is expressed in DA neurons. These studies show for the first time that a multidrug resistance protein is expressed in DA neurons, and its expression inhibits MeHg-associated DA neuron pathology. © 2013 International Society for Neurochemistry.
Characterization of the genomic organization of the region bordering the centromere of chromosome V of Podospora anserina by direct sequencing.

PubMed

Silar, Philippe; Barreau, Christian; Debuchy, Robert; Kicka, Sébastien; Turcq, Béatrice; Sainsard-Chanet, Annie; Sellem, Carole H; Billault, Alain; Cattolico, Laurence; Duprat, Simone; Weissenbach, Jean

2003-08-01

A Podospora anserina BAC library of 4800 clones has been constructed in the vector pBHYG allowing direct selection in fungi. Screening of the BAC collection for centromeric sequences of chromosome V allowed the recovery of clones localized on either side of the centromere, but no BAC clone was found to contain the centromere. Seven BAC clones containing 322,195 and 156,244 bp from either side of the centromeric region were sequenced and annotated. One 5S rRNA gene, 5 tRNA genes, and 163 putative coding sequences (CDS) were identified. Among these, only six CDS seem specific to P. anserina. The gene density in the centromeric region is approximately one gene every 2.8 kb. Extrapolation of this gene density to the whole genome of P. anserina suggests that the genome contains about 11,000 genes. Synteny analyses between P. anserina and Neurospora crassa show that co-linearity extends at most to a few genes, suggesting rapid genome rearrangements between these two species.
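The reported gene density can be reproduced from the figures in the abstract. The short Python calculation below assumes that the rRNA and tRNA genes are counted alongside the CDS, which is an inference rather than something the abstract states; the genome size is likewise back-calculated here from the 11,000-gene estimate and is not a figure from the abstract.

```python
sequenced_bp = 322_195 + 156_244   # both sides of the centromeric region
genes = 1 + 5 + 163                # 5S rRNA gene + tRNA genes + putative CDS
density_bp = sequenced_bp / genes  # base pairs per gene

# Counting CDS only gives a slightly lower density (about 2.9 kb per gene).
cds_only_density_bp = sequenced_bp / 163

# Genome size implied by extrapolating the density to about 11,000 genes.
implied_genome_bp = 11_000 * density_bp
```

With all 169 annotated genes, the density is 2,831 bp per gene, consistent with the stated 2.8 kb, and the extrapolation implies a genome of roughly 31 Mb.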
Mutations in a novel gene, NHS, cause the pleiotropic effects of Nance-Horan syndrome, including severe congenital cataract, dental anomalies, and mental retardation.

PubMed

Burdon, Kathryn P; McKay, James D; Sale, Michèle M; Russell-Eggitt, Isabelle M; Mackey, David A; Wirth, M Gabriela; Elder, James E; Nicoll, Alan; Clarke, Michael P; FitzGerald, Liesel M; Stankovich, James M; Shaw, Marie A; Sharma, Shiwani; Gajovic, Srecko; Gruss, Peter; Ross, Shelley; Thomas, Paul; Voss, Anne K; Thomas, Tim; Gécz, Jozef; Craig, Jamie E

2003-11-01

Nance-Horan syndrome (NHS) is an X-linked disorder characterized by congenital cataracts, dental anomalies, dysmorphic features, and, in some cases, mental retardation. NHS has been mapped to a 1.3-Mb interval on Xp22.13. We have confirmed the same localization in the original, extended Australian family with NHS and have identified protein-truncating mutations in a novel gene, which we have called "NHS," in five families. The NHS gene encompasses approximately 650 kb of genomic DNA, coding for a 1,630-amino acid putative nuclear protein. NHS orthologs were found in other vertebrates, but no sequence similarity to known genes was identified. The murine developmental expression profile of the NHS gene was studied using in situ hybridization and a mouse line containing a lacZ reporter-gene insertion in the Nhs locus. We found a complex pattern of temporally and spatially regulated expression, which, together with the pleiotropic features of NHS, suggests that this gene has key functions in the regulation of eye, tooth, brain, and craniofacial development.
Mutations in a Novel Gene, NHS, Cause the Pleiotropic Effects of Nance-Horan Syndrome, Including Severe Congenital Cataract, Dental Anomalies, and Mental Retardation

PubMed Central

Burdon, Kathryn P.; McKay, James D.; Sale, Michèle M.; Russell-Eggitt, Isabelle M.; Mackey, David A.; Wirth, M. Gabriela; Elder, James E.; Nicoll, Alan; Clarke, Michael P.; FitzGerald, Liesel M.; Stankovich, James M.; Shaw, Marie A.; Sharma, Shiwani; Gajovic, Srecko; Gruss, Peter; Ross, Shelley; Thomas, Paul; Voss, Anne K.; Thomas, Tim; Gécz, Jozef; Craig, Jamie E.

2003-01-01

Nance-Horan syndrome (NHS) is an X-linked disorder characterized by congenital cataracts, dental anomalies, dysmorphic features, and, in some cases, mental retardation. NHS has been mapped to a 1.3-Mb interval on Xp22.13. We have confirmed the same localization in the original, extended Australian family with NHS and have identified protein-truncating mutations in a novel gene, which we have called “NHS,” in five families. The NHS gene encompasses ∼650 kb of genomic DNA, coding for a 1,630–amino acid putative nuclear protein. NHS orthologs were found in other vertebrates, but no sequence similarity to known genes was identified. The murine developmental expression profile of the NHS gene was studied using in situ hybridization and a mouse line containing a lacZ reporter-gene insertion in the Nhs locus. We found a complex pattern of temporally and spatially regulated expression, which, together with the pleiotropic features of NHS, suggests that this gene has key functions in the regulation of eye, tooth, brain, and craniofacial development. PMID:14564667
Directional RNA-seq reveals highly complex condition-dependent transcriptomes in E. coli K12 through accurate full-length transcripts assembling.

PubMed

Li, Shan; Dong, Xia; Su, Zhengchang

2013-07-30

Although prokaryotic gene transcription has been studied over decades, many aspects of the process remain poorly understood. Particularly, recent studies have revealed that transcriptomes in many prokaryotes are far more complex than previously thought. Genes in an operon are often alternatively and dynamically transcribed under different conditions, and a large portion of genes and intergenic regions have antisense RNA (asRNA) and non-coding RNA (ncRNA) transcripts, respectively. Ironically, similar studies have not been conducted in the model bacterium E. coli K12, thus it is unknown whether or not the bacterium possesses similarly complex transcriptomes. Furthermore, although RNA-seq has become the major method for analyzing the complexity of prokaryotic transcriptomes, it is still a challenging task to accurately assemble full-length transcripts using short RNA-seq reads. To fill these gaps, we have profiled the transcriptomes of E. coli K12 under different culture conditions and growth phases using a highly specific directional RNA-seq technique that can capture various types of transcripts in the bacterial cells, combined with a highly accurate and robust algorithm and tool TruHMM (http://bioinfolab.uncc.edu/TruHmm_package/) for assembling full-length transcripts. We found that 46.9 to 63.4% of expressed operons were utilized in their putative alternative forms, 72.23 to 89.54% of genes had putative asRNA transcripts and 51.37 to 72.74% of intergenic regions had putative ncRNA transcripts under different culture conditions and growth phases. As has been demonstrated in many other prokaryotes, E. coli K12 also has highly complex and dynamic transcriptomes under different culture conditions and growth phases. Such complex and dynamic transcriptomes might play important roles in the physiology of the bacterium. TruHMM is a highly accurate and robust algorithm for assembling full-length transcripts in prokaryotes using directional RNA-seq short reads.
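Why strand information matters for detecting antisense and intergenic transcripts can be shown with a toy read classifier in Python. This is not TruHMM and does not assemble transcripts; it assumes each read has already been oriented to the strand of the RNA it came from, and the tuple layouts and function name are hypothetical.

```python
def classify_read(read, genes):
    """Label a read as sense, antisense, or intergenic.

    read:  (start, end, strand) with half-open coordinates
    genes: list of (name, start, end, strand)
    """
    for name, g_start, g_end, g_strand in genes:
        if read[0] < g_end and g_start < read[1]:  # the intervals overlap
            kind = "sense" if read[2] == g_strand else "antisense"
            return kind, name
    return "intergenic", None
```

Without strand information the first two outcomes would be indistinguishable, since a read overlapping a gene could come from either the mRNA or an asRNA.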
Directional RNA-seq reveals highly complex condition-dependent transcriptomes in E. coli K12 through accurate full-length transcripts assembling

PubMed Central

2013-01-01

Background: Although prokaryotic gene transcription has been studied for decades, many aspects of the process remain poorly understood. In particular, recent studies have revealed that transcriptomes in many prokaryotes are far more complex than previously thought. Genes in an operon are often alternatively and dynamically transcribed under different conditions, and a large portion of genes and intergenic regions have antisense RNA (asRNA) and non-coding RNA (ncRNA) transcripts, respectively. Ironically, similar studies have not been conducted in the model bacterium E. coli K12, so it is unknown whether the bacterium possesses similarly complex transcriptomes. Furthermore, although RNA-seq has become the major method for analyzing the complexity of prokaryotic transcriptomes, it is still challenging to accurately assemble full-length transcripts from short RNA-seq reads. Results: To fill these gaps, we have profiled the transcriptomes of E. coli K12 under different culture conditions and growth phases using a highly specific directional RNA-seq technique that can capture various types of transcripts in the bacterial cells, combined with a highly accurate and robust algorithm and tool, TruHMM (http://bioinfolab.uncc.edu/TruHmm_package/), for assembling full-length transcripts. We found that 46.9-63.4% of expressed operons were utilized in their putative alternative forms, 72.23-89.54% of genes had putative asRNA transcripts and 51.37-72.74% of intergenic regions had putative ncRNA transcripts under different culture conditions and growth phases. Conclusions: As has been demonstrated in many other prokaryotes, E. coli K12 also has highly complex and dynamic transcriptomes under different culture conditions and growth phases. Such complex and dynamic transcriptomes might play important roles in the physiology of the bacterium. TruHMM is a highly accurate and robust algorithm for assembling full-length transcripts in prokaryotes using directional RNA-seq short reads. PMID:23899370
Complete coding sequence characterization and comparative analysis of the putative novel human rhinovirus (HRV) species C and B

PubMed Central

2011-01-01

Background: Human Rhinoviruses (HRVs) are well-recognized viral pathogens associated with acute respiratory tract illnesses (RTIs) that are abundant worldwide. Although recent studies have phylogenetically identified the new HRV species (HRV-C), data on molecular epidemiology, genetic diversity, and clinical manifestation have been limited. Result: To gain new insight into HRV genetic diversity, we determined the complete coding sequences of putative new members of HRV species C (HRV-CU072 with 1% prevalence) and HRV-B (HRV-CU211) identified from clinical specimens collected from pediatric patients diagnosed with a symptom of acute lower RTI. Complete coding sequence and phylogenetic analysis revealed that the HRV-CU072 strain shared a recent common ancestor with the most closely related Chinese strain (N4). Comparative analysis at the protein level showed that HRV-CU072 might accumulate substitutional mutations in structural proteins, as well as in nonstructural proteins 3C and 3D. Comparative analysis of all available HRVs and HEVs indicated that HRV-C contains a relatively high G+C content and is more closely related to HEV-D. This might be correlated with their replication and capability to adapt to the high-temperature environment of the human lower respiratory tract. We herein report an infrequently occurring intra-species recombination event in HRV-B species (HRV-CU211), with a crossover having taken place at the boundary of the VP2 and VP3 genes. Moreover, we observed phylogenetic compatibility in all HRV species and suggest that dynamic mechanisms for HRV evolution seem to be related to recombination events. These findings indicated that the elementary units shaping the genetic diversity of HRV-C could be found in the nonstructural 2A and 3D genes. Conclusion: This study provides information for understanding HRV genetic diversity and insight into the role of selection pressure and recombination mechanisms influencing HRV evolution. PMID:21214911
Complete coding sequence characterization and comparative analysis of the putative novel human rhinovirus (HRV) species C and B.

PubMed

Linsuwanon, Piyada; Payungporn, Sunchai; Suwannakarn, Kamol; Chieochansin, Thaweesak; Theamboonlers, Apiradee; Poovorawan, Yong

2011-01-07

Human Rhinoviruses (HRVs) are well-recognized viral pathogens associated with acute respiratory tract illnesses (RTIs) that are abundant worldwide. Although recent studies have phylogenetically identified the new HRV species (HRV-C), data on molecular epidemiology, genetic diversity, and clinical manifestation have been limited. To gain new insight into HRV genetic diversity, we determined the complete coding sequences of putative new members of HRV species C (HRV-CU072 with 1% prevalence) and HRV-B (HRV-CU211) identified from clinical specimens collected from pediatric patients diagnosed with a symptom of acute lower RTI. Complete coding sequence and phylogenetic analysis revealed that the HRV-CU072 strain shared a recent common ancestor with the most closely related Chinese strain (N4). Comparative analysis at the protein level showed that HRV-CU072 might accumulate substitutional mutations in structural proteins, as well as in nonstructural proteins 3C and 3D. Comparative analysis of all available HRVs and HEVs indicated that HRV-C contains a relatively high G+C content and is more closely related to HEV-D. This might be correlated with their replication and capability to adapt to the high-temperature environment of the human lower respiratory tract. We herein report an infrequently occurring intra-species recombination event in HRV-B species (HRV-CU211), with a crossover having taken place at the boundary of the VP2 and VP3 genes. Moreover, we observed phylogenetic compatibility in all HRV species and suggest that dynamic mechanisms for HRV evolution seem to be related to recombination events. These findings indicated that the elementary units shaping the genetic diversity of HRV-C could be found in the nonstructural 2A and 3D genes. This study provides information for understanding HRV genetic diversity and insight into the role of selection pressure and recombination mechanisms influencing HRV evolution.
Genome sequence and comparative analysis of a putative entomopathogenic Serratia isolated from Caenorhabditis briggsae.

PubMed

Abebe-Akele, Feseha; Tisa, Louis S; Cooper, Vaughn S; Hatcher, Philip J; Abebe, Eyualem; Thomas, W Kelley

2015-07-18

Entomopathogenic associations between nematodes in the genera Steinernema and Heterorhabditis with their cognate bacteria from the bacterial genera Xenorhabdus and Photorhabdus, respectively, are extensively studied for their potential as biological control agents against invasive insect species. These two highly coevolved associations are the result of convergent evolution. Given the natural abundance of bacteria, nematodes and insects, it is surprising that only these two associations, with no intermediate forms, are widely studied in the entomopathogenic context. Discovering analogous systems involving novel bacterial and nematode species would shed light on the evolutionary processes involved in the transition from free-living organisms to obligatory partners in entomopathogenicity. We report the complete genome sequence of a new member of the enterobacterial genus Serratia that forms a putative entomopathogenic complex with Caenorhabditis briggsae. Analysis of the 5.04 Mb chromosomal genome predicts 4599 protein-coding genes, seven sets of ribosomal RNA genes, 84 tRNA genes and a 64.8 kb plasmid encoding 74 genes. Comparative genomic analysis with three of the previously sequenced Serratia species, S. marcescens DB11, S. proteamaculans 568 and Serratia sp. AS12, revealed that these four representatives of the genus share a core set of ~3100 genes and extensive structural conservation. The newly identified species shares a more recent common ancestor with S. marcescens, with 99% sequence identity in rDNA sequence and orthology across 85.6% of predicted genes. Of the 39 genes/operons implicated in virulence, symbiosis, recolonization, immune evasion and bioconversion, 21 (53.8%) were present in Serratia, while 33 (84.6%) and 35 (89%) were present in Xenorhabdus and Photorhabdus EPN bacteria, respectively. The majority of unique sequences in Serratia sp. SCBI (South African Caenorhabditis briggsae Isolate) are found in ~29 genomic islands of 5 to 65 genes and are enriched in putative functions that are biologically relevant to an entomopathogenic lifestyle, including non-ribosomal peptide synthetases, bacteriocins, fimbrial biogenesis, ushering proteins, toxins, secondary metabolite secretion and multiple drug resistance/efflux systems. By revealing the early stages of adaptation to this lifestyle, the Serratia sp. SCBI genome underscores the fact that in EPN formation the composite end result (killing, bioconversion, cadaver protection and recolonization) can be achieved by dissimilar mechanisms. This genome sequence will enable further study of the evolution of entomopathogenic nematode-bacteria complexes.
Regulatory versus coding signatures of natural selection in a candidate gene involved in the adaptive divergence of whitefish species pairs (Coregonus spp.)

PubMed Central

Jeukens, Julie; Bernatchez, Louis

2012-01-01

While gene expression divergence is known to be involved in adaptive phenotypic divergence and speciation, the relative importance of regulatory and structural evolution of genes is poorly understood. A recent next-generation sequencing experiment identified candidate genes potentially involved in the ongoing speciation of sympatric dwarf and normal lake whitefish (Coregonus clupeaformis), such as cytosolic malate dehydrogenase (MDH1), which showed both significant expression and sequence divergence. The main goal of this study was to investigate in more detail the signatures of natural selection in the regulatory and coding sequences of MDH1 in lake whitefish and to test for parallelism of these signatures with other coregonine species. Sequencing of the two regions in 118 fish from four sympatric pairs of whitefish and two cisco species revealed a total of 35 single nucleotide polymorphisms (SNPs), with more genetic diversity in European than in North American coregonine species. While the coding region was found to be under purifying selection, an SNP in the proximal promoter exhibited significant allele frequency divergence in a parallel manner among independent sympatric pairs of North American lake whitefish and European whitefish (C. lavaretus). According to transcription factor binding simulation for 22 regulatory haplotypes of MDH1, putative binding profiles were fairly conserved among species, except for the region around this SNP. Moreover, we found evidence for the role of this SNP in the regulation of MDH1 expression level. Overall, these results provide further evidence for the role of natural selection in gene regulation evolution among whitefish species pairs and suggest its possible link with patterns of phenotypic diversity observed in coregonine species. PMID:22408741
Regulatory versus coding signatures of natural selection in a candidate gene involved in the adaptive divergence of whitefish species pairs (Coregonus spp.).

PubMed

Jeukens, Julie; Bernatchez, Louis

2012-01-01

While gene expression divergence is known to be involved in adaptive phenotypic divergence and speciation, the relative importance of regulatory and structural evolution of genes is poorly understood. A recent next-generation sequencing experiment identified candidate genes potentially involved in the ongoing speciation of sympatric dwarf and normal lake whitefish (Coregonus clupeaformis), such as cytosolic malate dehydrogenase (MDH1), which showed both significant expression and sequence divergence. The main goal of this study was to investigate in more detail the signatures of natural selection in the regulatory and coding sequences of MDH1 in lake whitefish and to test for parallelism of these signatures with other coregonine species. Sequencing of the two regions in 118 fish from four sympatric pairs of whitefish and two cisco species revealed a total of 35 single nucleotide polymorphisms (SNPs), with more genetic diversity in European than in North American coregonine species. While the coding region was found to be under purifying selection, an SNP in the proximal promoter exhibited significant allele frequency divergence in a parallel manner among independent sympatric pairs of North American lake whitefish and European whitefish (C. lavaretus). According to transcription factor binding simulation for 22 regulatory haplotypes of MDH1, putative binding profiles were fairly conserved among species, except for the region around this SNP. Moreover, we found evidence for the role of this SNP in the regulation of MDH1 expression level. Overall, these results provide further evidence for the role of natural selection in gene regulation evolution among whitefish species pairs and suggest its possible link with patterns of phenotypic diversity observed in coregonine species.

PRGdb: a bioinformatics platform for plant resistance gene analysis

PubMed Central

Sanseverino, Walter; Roma, Guglielmo; De Simone, Marco; Faino, Luigi; Melito, Sara; Stupka, Elia; Frusciante, Luigi; Ercolano, Maria Raffaella

2010-01-01

PRGdb is a web accessible open-source (http://www.prgdb.org) database that represents the first bioinformatic resource providing a comprehensive overview of resistance genes (R-genes) in plants. PRGdb holds more than 16 000 known and putative R-genes belonging to 192 plant species challenged by 115 different pathogens and linked with useful biological information. The complete database includes a set of 73 manually curated reference R-genes, 6308 putative R-genes collected from NCBI and 10463 computationally predicted putative R-genes. Thanks to a user-friendly interface, data can be examined using different query tools. A home-made prediction pipeline called Disease Resistance Analysis and Gene Orthology (DRAGO), based on reference R-gene sequence data, was developed to search for plant resistance genes in public datasets such as Unigene and Genbank. New putative R-gene classes containing unknown domain combinations were discovered and characterized. The development of the PRG platform represents an important starting point to conduct various experimental tasks. The inferred cross-link between genomic and phenotypic information allows access to a large body of information to find answers to several biological questions. The database structure also permits easy integration with other data types and opens up prospects for future implementations. PMID:19906694
Innate immune activity conditions the effect of regulatory variants upon monocyte gene expression.

PubMed

Fairfax, Benjamin P; Humburg, Peter; Makino, Seiko; Naranbhai, Vivek; Wong, Daniel; Lau, Evelyn; Jostins, Luke; Plant, Katharine; Andrews, Robert; McGee, Chris; Knight, Julian C

2014-03-07

To systematically investigate the impact of immune stimulation upon regulatory variant activity, we exposed primary monocytes from 432 healthy Europeans to interferon-γ (IFN-γ) or differing durations of lipopolysaccharide and mapped expression quantitative trait loci (eQTLs). More than half of cis-eQTLs identified, involving hundreds of genes and associated pathways, are detected specifically in stimulated monocytes. Induced innate immune activity reveals multiple master regulatory trans-eQTLs including the major histocompatibility complex (MHC), coding variants altering enzyme and receptor function, an IFN-β cytokine network showing temporal specificity, and an interferon regulatory factor 2 (IRF2) transcription factor-modulated network. Induced eQTL are significantly enriched for genome-wide association study loci, identifying context-specific associations to putative causal genes including CARD9, ATM, and IRF8. Thus, applying pathophysiologically relevant immune stimuli assists resolution of functional genetic variants.
The complete mitogenome of brown trout (Salmo trutta fario) and its phylogeny.

PubMed

Sahoo, Prabhati K; Singh, Lalit; Sharma, Lata; Kumar, Rohit; Singh, Vijay K; Ali, S; Singh, Atul K; Barat, Ashoktaru

2016-11-01

The complete mitochondrial genome of Salmo trutta fario, commonly known as brown trout, was sequenced using NGS technology. The mitochondrial genome is 16 677 bp long and comprises 13 protein-coding genes (PCGs), 22 tRNA genes, 2 rRNA genes, and 1 putative control region. The overall mitogenome composition of S. trutta fario is A: 28.13%, G: 16.44%, C: 29.47%, and T: 25.96%, with an A + T content of 54.09% and a G + C content of 45.91%. The gene arrangement and order are similar to those of other vertebrates. The phylogenetic tree constructed using 42 complete mitogenomes of Salmonidae fishes confirmed the position of the present species within the genus Salmo of the subfamily Salmoninae. The NGS platform proved to be a rapid and time-saving technology for revealing complete mitogenomes.
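The base-composition figures in the record above follow from simple counting. The sketch below is illustrative only: the function name `base_composition` and the toy sequence are assumptions, not part of the cited study, and no real mitogenome data are used. It also checks that the reported percentages are internally consistent (A + T = 28.13 + 25.96 = 54.09; G + C = 16.44 + 29.47 = 45.91).

```python
from collections import Counter


def base_composition(seq: str) -> dict[str, float]:
    """Return percentages of A, C, G, T plus combined A+T and G+C.

    Hypothetical helper for illustration; ambiguous bases (e.g. N) are ignored.
    """
    counts = Counter(base for base in seq.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    pct = {base: 100.0 * counts[base] / total for base in "ACGT"}
    pct["A+T"] = pct["A"] + pct["T"]
    pct["G+C"] = pct["G"] + pct["C"]
    return pct


# Toy sequence (made up for the example, not from S. trutta fario).
toy_composition = base_composition("ATGCGCATTA")

# Consistency check on the percentages quoted in the abstract.
reported = {"A": 28.13, "G": 16.44, "C": 29.47, "T": 25.96}
reported_at = round(reported["A"] + reported["T"], 2)
reported_gc = round(reported["G"] + reported["C"], 2)
```

Applied to a full mitogenome sequence read from a FASTA file, the same function would yield the kind of summary reported in the abstract.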
Fluconazole Resistance Associated with Drug Efflux and Increased Transcription of a Drug Transporter Gene, PDH1, in Candida glabrata

PubMed Central

Miyazaki, Haruko; Miyazaki, Yoshitsugu; Geber, Antonia; Parkinson, Tanya; Hitchcock, Christopher; Falconer, Derek J.; Ward, Douglas J.; Marsden, Katherine; Bennett, John E.

1998-01-01

Sequential Candida glabrata isolates were obtained from the mouth of a patient infected with human immunodeficiency virus type 1 who was receiving high doses of fluconazole for oropharyngeal thrush. Fluconazole-susceptible colonies were replaced by resistant colonies that exhibited both increased fluconazole efflux and increased transcripts of a gene which codes for a protein with 72.5% identity to Pdr5p, an ABC multidrug transporter in Saccharomyces cerevisiae. The deduced protein had a molecular mass of 175 kDa and was composed of two homologous halves, each with six putative transmembrane domains and highly conserved sequences of ATP-binding domains. When the earliest and most azole-susceptible isolate of C. glabrata from this patient was exposed to fluconazole, increased transcripts of the PDR5 homolog appeared, linking azole exposure to regulation of this gene. PMID:9661006
Sequence of the non-phosphorylating glyceraldehyde-3-phosphate dehydrogenase from Nicotiana plumbaginifolia and phylogenetic origin of the gene family.

PubMed

Habenicht, A; Quesada, A; Cerff, R

1997-10-01

A cDNA library was constructed from Nicotiana plumbaginifolia seedlings, and the cDNA for non-phosphorylating glyceraldehyde-3-phosphate dehydrogenase (GapN, EC 1.2.1.9) was isolated by plaque hybridization using the cDNA from pea as a heterologous probe. The cDNA comprises the entire GapN coding region. A putative polyadenylation signal is identified. Phylogenetic analysis based on the deduced amino acid sequences revealed that the GapN gene family represents a separate ancient branch within the aldehyde dehydrogenase superfamily. It can be shown that the GapN gene family and other distinct branches of the superfamily have their phylogenetic origin before the separation of primary life-forms. This further demonstrates that, very early in evolution, a broad diversification of the aldehyde dehydrogenases led to the formation of the superfamily.
Phylogenetic distribution and expression of a penicillin-binding protein homologue, Ear and its significance in virulence of Staphylococcus aureus.

PubMed

Singh, Vineet K; Ring, Robert P; Aswani, Vijay; Stemper, Mary E; Kislow, Jennifer; Ye, Zhan; Shukla, Sanjay K

2017-12-01

Staphylococcus aureus is an opportunistic human pathogen that can cause serious infections in humans. A plethora of known and putative virulence factors are produced by staphylococci that collectively orchestrate pathogenesis. Ear protein (Escherichia coli ampicillin resistance) in S. aureus is an exoprotein in the COL strain, predicted to be a superantigen, and speculated to play roles in antibiotic resistance and virulence. The goal of this study was to determine whether expression of ear is modulated by single nucleotide polymorphisms in its promoter and coding sequences and whether this gene plays roles in antibiotic resistance and virulence. Promoter, coding sequences and expression of the ear gene in clinical and carriage S. aureus strains with distinct genetic backgrounds were analysed. The JE2 strain and its isogenic ear mutant were used in a systemic infection mouse model to determine the competitiveness of the ear mutant. Results/Key findings: The ear gene showed variable expression, with USA300FPR3757 showing high-level expression compared to many of the other strains tested, including some showing negligible expression. Higher expression was associated with agr type 1 but was not correlated with phylogenetic relatedness of the ear gene based upon single nucleotide polymorphisms in the promoter or coding regions, suggesting complex regulation. An isogenic JE2 (USA300 background) ear mutant showed no significant difference in its growth, antibiotic susceptibility or virulence in a mouse model. Our data suggest that despite being highly expressed in a USA300 genetic background, Ear is not a significant contributor to virulence in that strain.
Regulation of neural macroRNAs by the transcriptional repressor REST

PubMed Central

Johnson, Rory; Teh, Christina Hui-Leng; Jia, Hui; Vanisri, Ravi Raj; Pandey, Tridansh; Lu, Zhong-Hao; Buckley, Noel J.; Stanton, Lawrence W.; Lipovich, Leonard

2009-01-01

The essential transcriptional repressor REST (repressor element 1-silencing transcription factor) plays central roles in development and human disease by regulating a large cohort of neural genes. These have conventionally fallen into the class of known, protein-coding genes; recently, however, several noncoding microRNA genes were identified as REST targets. Given the widespread transcription of messenger RNA-like, noncoding RNAs (“macroRNAs”), some of which are functional and implicated in disease in mammalian genomes, we sought to determine whether this class of noncoding RNAs can also be regulated by REST. By applying a new, unbiased target gene annotation pipeline to computationally discovered REST binding sites, we find that 23% of mammalian REST genomic binding sites are within 10 kb of a macroRNA gene. These putative target genes were overlooked by previous studies. Focusing on a set of 18 candidate macroRNA targets from mouse, we experimentally demonstrate that two are regulated by REST in neural stem cells. Flanking protein-coding genes are, at most, weakly repressed, suggesting specific targeting of the macroRNAs by REST. Similar to the majority of known REST target genes, both of these macroRNAs are induced during nervous system development and have neurally restricted expression profiles in adult mouse. We observe a similar phenomenon in human: the DiGeorge syndrome-associated noncoding RNA, DGCR5, is repressed by REST through a proximal upstream binding site. Therefore neural macroRNAs represent an additional component of the REST regulatory network. These macroRNAs are new candidates for understanding the role of REST in neuronal development, neurodegeneration, and cancer. PMID:19050060
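The "within 10 kb of a macroRNA gene" criterion in the record above is a proximity test between binding-site positions and gene intervals. The following is a minimal sketch under stated assumptions: the function names, the tuple layouts, and the toy coordinates are all hypothetical, and this is not the annotation pipeline used by the authors.

```python
def distance_to_gene(site: int, gene_start: int, gene_end: int) -> int:
    """Distance in bp from a point site to a gene interval (0 if the site is inside)."""
    if gene_start > gene_end:
        raise ValueError("gene_start must not exceed gene_end")
    if site < gene_start:
        return gene_start - site
    if site > gene_end:
        return site - gene_end
    return 0


def fraction_near_genes(sites, genes, window=10_000):
    """Fraction of (chrom, pos) sites lying within `window` bp of any (chrom, start, end) gene."""
    if not sites:
        return 0.0
    near = sum(
        1
        for chrom, pos in sites
        if any(
            g_chrom == chrom and distance_to_gene(pos, start, end) <= window
            for g_chrom, start, end in genes
        )
    )
    return near / len(sites)


# Toy coordinates, invented for illustration only.
toy_sites = [("chr1", 5_000), ("chr1", 50_000), ("chr2", 120_000), ("chr2", 300_000)]
toy_genes = [("chr1", 12_000, 15_000), ("chr2", 100_000, 110_000)]
toy_fraction = fraction_near_genes(toy_sites, toy_genes)
```

The nested scan is quadratic in the worst case; for genome-scale inputs a sorted-interval or interval-tree lookup would be the usual choice, but the simple form keeps the criterion explicit.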
Regulation of neural macroRNAs by the transcriptional repressor REST.

PubMed

Johnson, Rory; Teh, Christina Hui-Leng; Jia, Hui; Vanisri, Ravi Raj; Pandey, Tridansh; Lu, Zhong-Hao; Buckley, Noel J; Stanton, Lawrence W; Lipovich, Leonard

2009-01-01

The essential transcriptional repressor REST (repressor element 1-silencing transcription factor) plays central roles in development and human disease by regulating a large cohort of neural genes. These have conventionally fallen into the class of known, protein-coding genes; recently, however, several noncoding microRNA genes were identified as REST targets. Given the widespread transcription of messenger RNA-like, noncoding RNAs ("macroRNAs"), some of which are functional and implicated in disease in mammalian genomes, we sought to determine whether this class of noncoding RNAs can also be regulated by REST. By applying a new, unbiased target gene annotation pipeline to computationally discovered REST binding sites, we find that 23% of mammalian REST genomic binding sites are within 10 kb of a macroRNA gene. These putative target genes were overlooked by previous studies. Focusing on a set of 18 candidate macroRNA targets from mouse, we experimentally demonstrate that two are regulated by REST in neural stem cells. Flanking protein-coding genes are, at most, weakly repressed, suggesting specific targeting of the macroRNAs by REST. Similar to the majority of known REST target genes, both of these macroRNAs are induced during nervous system development and have neurally restricted expression profiles in adult mouse. We observe a similar phenomenon in human: the DiGeorge syndrome-associated noncoding RNA, DGCR5, is repressed by REST through a proximal upstream binding site. Therefore neural macroRNAs represent an additional component of the REST regulatory network. These macroRNAs are new candidates for understanding the role of REST in neuronal development, neurodegeneration, and cancer.
Genetic basis of stage-specific melanism: a putative role for a cysteine sulfinic acid decarboxylase in insect pigmentation.

PubMed

Saenko, S V; Jerónimo, M A; Beldade, P

2012-06-01

Melanism, the overall darkening of the body, is a widespread form of animal adaptation to particular environments, and includes textbook examples of evolution by natural selection, such as industrial melanism in the peppered moth. The major components of the melanin biosynthesis pathway have been characterized in model insects, but little is known about the genetic basis of life-stage specific melanism such as cases described in some lepidopteran species. Here, we investigate two melanic mutations of Bicyclus anynana butterflies, called Chocolate and melanine, that exclusively affect pigmentation of the larval and adult stages, respectively. Our analysis of Mendelian segregation patterns reveals that the larval and adult melanic phenotypes are due to alleles at different, independently segregating loci. Our linkage mapping analysis excludes the pigmentation candidate gene black as the melanine locus, and implicates a gene encoding a putative pyridoxal phosphate-dependent cysteine sulfinic acid decarboxylase as the Chocolate locus. We show variation in coding sequence and in expression levels for this candidate larval melanism locus. This is the first study that suggests a biological function for this gene in insects. Our findings open up exciting opportunities to study the role of this locus in the evolution of adaptive variation in pigmentation, and the uncoupling of regulation of pigment biosynthesis across developmental stages with different ecologies and pressures on body coloration.
Genetic basis of stage-specific melanism: a putative role for a cysteine sulfinic acid decarboxylase in insect pigmentation

PubMed Central

Saenko, S V; Jerónimo, M A; Beldade, P

2012-01-01

Melanism, the overall darkening of the body, is a widespread form of animal adaptation to particular environments, and includes textbook examples of evolution by natural selection, such as industrial melanism in the peppered moth. The major components of the melanin biosynthesis pathway have been characterized in model insects, but little is known about the genetic basis of life-stage specific melanism such as cases described in some lepidopteran species. Here, we investigate two melanic mutations of Bicyclus anynana butterflies, called Chocolate and melanine, that exclusively affect pigmentation of the larval and adult stages, respectively. Our analysis of Mendelian segregation patterns reveals that the larval and adult melanic phenotypes are due to alleles at different, independently segregating loci. Our linkage mapping analysis excludes the pigmentation candidate gene black as the melanine locus, and implicates a gene encoding a putative pyridoxal phosphate-dependent cysteine sulfinic acid decarboxylase as the Chocolate locus. We show variation in coding sequence and in expression levels for this candidate larval melanism locus. This is the first study that suggests a biological function for this gene in insects. Our findings open up exciting opportunities to study the role of this locus in the evolution of adaptive variation in pigmentation, and the uncoupling of regulation of pigment biosynthesis across developmental stages with different ecologies and pressures on body coloration. PMID:22234245
Genetic heterogeneity of the dnaK gene locus including transcription terminator region (TTR) in Campylobacter lari.

PubMed

Shitara, M; Tsuboi, Y; Sekizuka, T; Tazumi, A; Moorei, J E; Millar, B C; Taneike, I; Matsuda, M

2008-01-01

Nucleotide sequences of approximately 3.1 kbp consisting of the full-length open reading frame (ORF) for grpE, a non-coding (NC) region and a putative ORF for the full-length dnaK gene (1860 bp) were identified from a urease-positive thermophilic Campylobacter (UPTC) CF89-12 isolate. Then, following the construction of a new degenerate polymerase chain reaction (PCR) primer pair for amplification of the dnaK structural gene, including the transcription terminator region of C. lari isolates, the dnaK region was amplified successfully, TA-cloned and sequenced in nine C. lari isolates. The dnaK gene sequences commenced with an ATG and terminated with a TAA in all 10 isolates, including CF89-12. In addition, the putative ORFs for the dnaK gene locus from the seven UPTC isolates consisted of 1860 bases, whereas those from the four urease-negative (UN) C. lari isolates, including the C. lari RM2100 reference strain, consisted of 1866 bases. Interestingly, different probable ribosome binding sites and hypothetical intrinsic ρ-independent terminator structures were identified between the seven UPTC and four UN C. lari isolates, respectively. Moreover, 20 of the 28 polymorphic sites found among the amino acid sequences of the dnaK ORF from the 11 C. lari isolates were either UPTC-specific or UN C. lari-specific. In the neighbour-joining tree based on the nucleotide sequence information of the dnaK gene, C. lari forms two major distinct clusters consisting of UPTC and UN C. lari isolates, respectively, with UN C. lari being more closely related to other thermophilic campylobacters than to UPTC.
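The ORF properties described in the record above (an ATG start, a TAA stop, and lengths of 1860 or 1866 bases, both divisible by three) can be checked mechanically. The sketch below is a generic illustration, not the authors' procedure: the function name `is_complete_orf` is hypothetical, only ATG is accepted as a start codon, and the three standard stop codons are assumed. The sequences are toy strings, not dnaK data.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard stop codons (assumed to apply here)


def is_complete_orf(seq: str) -> bool:
    """True if seq starts with ATG, ends with a stop codon, has a length
    divisible by 3, and contains no in-frame stop before the final codon."""
    seq = seq.upper()
    if len(seq) < 6 or len(seq) % 3 != 0:
        return False
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] != "ATG" or codons[-1] not in STOP_CODONS:
        return False
    return not any(codon in STOP_CODONS for codon in codons[1:-1])


def encoded_residues(orf_length_bp: int) -> int:
    """Number of amino acids encoded by an ORF of the given length,
    excluding the terminal stop codon."""
    if orf_length_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return orf_length_bp // 3 - 1


# Toy sequences, invented for the example.
good_orf = is_complete_orf("ATGAAATAA")
internal_stop = is_complete_orf("ATGTAAAAATAA")
bad_length = is_complete_orf("ATGAAATA")
```

Under these assumptions, an 1860-base ORF corresponds to 619 residues and an 1866-base ORF to 621 residues.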
Polyketide synthases of Diaporthe helianthi and involvement of DhPKS1 in virulence on sunflower.

PubMed

Ruocco, Michelina; Baroncelli, Riccardo; Cacciola, Santa Olga; Pane, Catello; Monti, Maurilia Maria; Firrao, Giuseppe; Vergara, Mariarosaria; Magnano di San Lio, Gaetano; Vannacci, Giovanni; Scala, Felice

2018-01-06

The early phases of Diaporthe helianthi pathogenesis on sunflower are characterized by the production of phytotoxins that may play a role in host colonisation. In previous studies, phytotoxins of a polyketidic nature were isolated and purified from culture filtrates of virulent strains of D. helianthi isolated from sunflower. A highly aggressive isolate (7/96) from France contained a gene fragment of a putative nonaketide synthase (lovB) which was conserved in a virulent D. helianthi population. In order to investigate the role of polyketide synthases in D. helianthi 7/96, a draft genome of this isolate was examined. We were able to find and phylogenetically analyse 40 genes putatively coding for polyketide synthases (PKSs). Analysis of their domains revealed that most PKS genes of D. helianthi are reducing PKSs, whereas only eight lacked reducing domains. Most of the identified PKSs have orthologs shown to be virulence factors or genetic determinants for toxin production in other pathogenic fungi. One of the genes (DhPKS1) corresponded to the previously cloned D. helianthi lovB gene fragment and clustered with a nonribosomal peptide synthetase (NRPS)-PKS hybrid/lovastatin nonaketide like A. nidulans LovB. We used DhPKS1 as a case study and carried out its disruption through Agrobacterium-mediated transformation in the isolate 7/96. D. helianthi DhPKS1-deleted mutants were less virulent to sunflower than the wild type, indicating a role for this gene in the pathogenesis of the fungus. The PKS sequences analysed and reported here constitute a new genomic resource that will be useful for further research on the biology, ecology and evolution of D. helianthi and of fungal plant pathogens in general.
Deep sequencing and genome-wide analysis reveals the expansion of MicroRNA genes in the gall midge Mayetiola destructor

PubMed Central

2013-01-01

Background MicroRNAs (miRNAs) are small non-coding RNAs that play critical roles in regulating post-transcriptional gene expression. Gall midges encompass a large group of insects that are of economic importance and also possess fascinating biological traits. The gall midge Mayetiola destructor, commonly known as the Hessian fly, is a destructive pest of wheat and a model organism for studying gall midge biology and insect – host plant interactions. Results In this study, we systematically analyzed miRNAs from the Hessian fly. Deep-sequencing a Hessian fly larval transcriptome led to the identification of 89 miRNA species that are either identical or very similar to known miRNAs from other insects, and 184 novel miRNAs that have not been reported from other species. A genome-wide search through a draft Hessian fly genome sequence identified a total of 611 putative miRNA-encoding genes based on sequence similarity and the existence of a stem-loop structure for miRNA precursors. Analysis of the 611 putative genes revealed a striking feature: the dramatic expansion of several miRNA gene families. The largest family contained 91 genes that encoded 20 different miRNAs. Microarray analyses revealed that the expression of miRNA genes was strictly regulated during Hessian fly larval development and that the abundance of many miRNA genes was affected by host genotypes. Conclusion The identification of a large number of miRNAs for the first time from a gall midge provides a foundation for further studies of miRNA functions in gall midge biology and behavior. The dramatic expansion of identical or similar miRNAs provides a unique system to study functional relations among miRNA iso-genes as well as changes in sequence specificity due to small changes in miRNAs and in their mRNA targets. These results may also facilitate the identification of miRNA genes for potential pest control through transgenic approaches. PMID:23496979
Secretome Characterization and Correlation Analysis Reveal Putative Pathogenicity Mechanisms and Identify Candidate Avirulence Genes in the Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici.

PubMed

Xia, Chongjing; Wang, Meinan; Cornejo, Omar E; Jiwan, Derick A; See, Deven R; Chen, Xianming

2017-01-01

Stripe (yellow) rust, caused by Puccinia striiformis f. sp. tritici (Pst), is one of the most destructive diseases of wheat worldwide. Planting resistant cultivars is an effective way to control this disease, but race-specific resistance can be overcome quickly due to the rapidly evolving Pst population. Studying the pathogenicity mechanisms is critical for understanding how Pst virulence changes and how to develop wheat cultivars with durable resistance to stripe rust. We re-sequenced 7 Pst isolates and included an additional 7 previously sequenced isolates to represent balanced virulence/avirulence profiles for several avirulence loci in secretome analyses. We observed an uneven distribution of heterozygosity among the isolates. Secretome comparison of Pst with other rust fungi identified a large portion of species-specific secreted proteins, suggesting that they may have specific roles when interacting with the wheat host. Thirty-two effectors of Pst were identified from its secretome. We identified candidates for Avr genes corresponding to six Yr genes by correlating polymorphisms for effector genes to the virulence/avirulence profiles of the 14 Pst isolates. The putative AvYr76 was present in the avirulent isolates, but absent in the virulent isolates, suggesting that deleting the coding region of the candidate avirulence gene has produced races virulent to resistance gene Yr76. We conclude that incorporating avirulence/virulence phenotypes into correlation analysis with variations in genomic structure and secretome, particularly presence/absence polymorphisms of effectors, is an efficient way to identify candidate Avr genes in Pst. The candidate effector genes provide a rich resource for further studies to determine the evolutionary history of Pst populations and the co-evolutionary arms race between Pst and wheat. The Avr candidates identified in this study will lead to cloning avirulence genes in Pst, which will enable us to understand molecular mechanisms underlying Pst-wheat interactions, to determine the effectiveness of resistance genes and further to develop durable resistance to stripe rust.
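The presence/absence correlation described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function name, the effector labels, and the isolate calls below are all hypothetical, and a real analysis would also consider sequence polymorphisms rather than presence/absence alone. The sketch only shows the core idea of flagging effectors that are present in every avirulent isolate and absent from every virulent one.

```python
from typing import Dict, List


def candidate_avr_genes(
    presence: Dict[str, List[bool]],
    avirulent: List[bool],
) -> List[str]:
    """Return effectors whose presence pattern across isolates exactly
    matches the avirulence phenotype (present in avirulent isolates,
    absent in virulent ones)."""
    hits = []
    for effector, calls in presence.items():
        if len(calls) != len(avirulent):
            raise ValueError(f"{effector}: one call per isolate is required")
        if calls == avirulent:
            hits.append(effector)
    return hits


# Hypothetical data: six isolates, phenotype on one resistance gene.
avirulent_on_yr = [True, True, False, False, True, False]
presence = {
    "effector_A": [True, True, False, False, True, False],  # matches phenotype
    "effector_B": [True, True, True, True, True, True],     # present everywhere
    "effector_C": [False, True, False, True, True, False],  # no clear pattern
}

print(candidate_avr_genes(presence, avirulent_on_yr))  # ['effector_A']
```

An exact match is the strictest possible criterion; with noisy calls one might instead rank effectors by a correlation or association statistic.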
Fine mapping and identification of candidate genes for the sy-2 locus in a temperature-sensitive chili pepper (Capsicum chinense).

PubMed

Liu, Li; Venkatesh, Jelli; Jo, Yeong Deuk; Koeda, Sota; Hosokawa, Munetaka; Kang, Jin-Ho; Goritschnig, Sandra; Kang, Byoung-Cheorl

2016-08-01

The sy-2 temperature-sensitive gene from Capsicum chinense was fine mapped to a 138.8-kb region at the distal portion of pepper chromosome 1. Based on expression analyses, two putative F-box genes were identified as sy-2 candidate genes. Seychelles-2 ('sy-2') is a temperature-sensitive natural mutant of Capsicum chinense, which exhibits an abnormal leaf phenotype when grown at temperatures below 24 °C. We previously showed that the sy-2 phenotype is controlled by a single recessive gene, sy-2, located on pepper chromosome 1. In this study, a high-resolution genetic and physical map for the sy-2 locus was constructed using two individual F2 mapping populations derived from a cross between C. chinense mutant 'sy-2' and wild-type 'No. 3341'. The sy-2 gene was fine mapped to a 138.8-kb region between markers SNP 5-5 and SNP 3-8 at the distal portion of chromosome 1, based on comparative genomic analysis and genomic information from pepper. The sy-2 target region was predicted to contain 27 genes. Expression analysis of these predicted genes showed a differential expression pattern for ORF10 and ORF20 between mutant and wild-type plants, with both having significantly lower expression in 'sy-2' than in wild-type plants. In addition, the coding sequences of both ORF10 and ORF20 contained single nucleotide polymorphisms (SNPs) causing amino acid changes, which may have important functional consequences. ORF10 and ORF20 are predicted to encode F-box proteins, which are components of the SCF complex. Based on the differential expression pattern and the presence of nonsynonymous SNPs, we suggest that these two putative F-box genes are most likely responsible for the temperature-sensitive phenotypes in pepper. Further investigation of these genes may enable a better understanding of the molecular mechanisms of low temperature sensitivity in plants.
In vivo identification of tumor suppressive PTEN ceRNAs in an oncogenic BRAF-induced mouse model of melanoma

PubMed Central

Karreth, Florian A.; Tay, Yvonne; Perna, Daniele; Ala, Ugo; Tan, Shen Mynn; Rust, Alistair G.; DeNicola, Gina; Webster, Kaitlyn A.; Weiss, Dror; Perez-Mancera, Pedro A.; Krauthammer, Michael; Halaban, Ruth; Provero, Paolo; Adams, David J.; Tuveson, David A.; Pandolfi, Pier Paolo

2011-01-01

Summary We recently proposed that competitive endogenous RNAs (ceRNAs) sequester microRNAs to regulate mRNA transcripts containing common microRNA recognition elements (MREs). However, the functional role of ceRNAs in cancer remains unknown. Loss of PTEN, a tumor suppressor regulated by ceRNA activity, frequently occurs in melanoma. Here, we report the discovery of significant enrichment of putative PTEN ceRNAs among genes whose loss accelerates tumorigenesis following Sleeping Beauty insertional mutagenesis in a mouse model of melanoma. We validated several putative PTEN ceRNAs and further characterized one, the ZEB2 transcript. We show that ZEB2 modulates PTEN protein levels in a microRNA-dependent, protein coding-independent manner. Attenuation of ZEB2 expression activates the PI3K/AKT pathway, enhances cell transformation, and commonly occurs in human melanomas and other cancers expressing low PTEN levels. Our study genetically identifies multiple putative microRNA decoys for PTEN, validates ZEB2 mRNA as a bona fide PTEN ceRNA, and demonstrates that abrogated ZEB2 expression cooperates with BRAFV600E to promote melanomagenesis. PMID:22000016
The LacI family protein GlyR3 co-regulates the celC operon and manB in Clostridium thermocellum

DOE PAGES

Choi, Jinlyung; Klingeman, Dawn M.; Brown, Steven D.; ...

2017-06-24

In this paper, we demonstrate that the GlyR3 protein mediates the regulation of manB. We first identify putative GlyR3 binding sites within or just upstream of the coding regions of manB and celT. Using an electrophoretic mobility shift assay (EMSA), we determined that a higher concentration of GlyR3 is required to effectively bind to the putative manB site in comparison to the celC site. Neither the putative celT site nor random DNA significantly binds GlyR3. While laminaribiose interfered with GlyR3 binding to the celC binding site, binding to the manB site was unaffected. In the presence of laminaribiose, in vivo transcription of the celC–glyR3–licA gene cluster increases, while manB expression is repressed, compared to in the absence of laminaribiose, consistent with the results from the EMSA. An in vitro transcription assay demonstrated that GlyR3 and laminaribiose interactions were responsible for the observed patterns of in vivo transcription.
A novel family of integrases associated with prophages and genomic islands integrated within the tRNA-dihydrouridine synthase A (dusA) gene

PubMed Central

Farrugia, Daniel N.; Elbourne, Liam D. H.; Mabbutt, Bridget C.; Paulsen, Ian T.

2015-01-01

Genomic islands play a key role in prokaryotic genome plasticity. Genomic islands integrate into chromosomal loci such as transfer RNA genes and protein coding genes, whilst retaining various cargo genes that potentially bestow novel functions on the host organism. A gene encoding a putative integrase was identified at a single site within the 5′ end of the dusA gene in the genomes of over 200 bacteria. This integrase was discovered to be a component of numerous genomic islands, which appear to share a target site within the dusA gene. dusA encodes the tRNA-dihydrouridine synthase A enzyme, which catalyses the post-transcriptional reduction of uridine to dihydrouridine in tRNA. Genomic islands encoding homologous dusA-associated integrases were found at a much lower frequency within the related dusB and dusC genes, and non-dus genes. Excision of these dusA-associated islands from the chromosome as circularized intermediates was confirmed by polymerase chain reaction. Analysis of the dusA-associated islands indicated that they were highly diverse, with the integrase gene representing the only universal common feature. PMID:25883135
THE GENOMIC LANDSCAPE OF PEDIATRIC AND YOUNG ADULT T-LINEAGE ACUTE LYMPHOBLASTIC LEUKEMIA

PubMed Central

Liu, Yu; Easton, John; Shao, Ying; Maciaszek, Jamie; Wang, Zhaoming; Wilkinson, Mark R.; McCastlain, Kelly; Edmonson, Michael; Pounds, Stanley B.; Shi, Lei; Zhou, Xin; Ma, Xiaotu; Sioson, Edgar; Li, Yongjin; Rusch, Michael; Gupta, Pankaj; Pei, Deqing; Cheng, Cheng; Smith, Malcolm A.; Auvil, Jaime Guidry; Gerhard, Daniela S.; Relling, Mary V.; Winick, Naomi J.; Carroll, Andrew J.; Heerema, Nyla A.; Raetz, Elizabeth; Devidas, Meenakshi; Willman, Cheryl L.; Harvey, Richard C.; Carroll, William L.; Dunsmore, Kimberly P.; Winter, Stuart S.; Wood, Brent L; Sorrentino, Brian P.; Downing, James R.; Loh, Mignon L.; Hunger, Stephen P; Zhang, Jinghui; Mullighan, Charles G.

2017-01-01

Genetic alterations activating NOTCH1 signaling and T cell transcription factors, coupled with inactivation of the INK4/ARF tumor suppressors are hallmarks of T-ALL, but detailed genome-wide sequencing of large T-ALL cohorts has not been performed. Using integrated genomic analysis of 264 T-ALL cases, we identify 106 putative driver genes, half of which were not previously described in childhood T-ALL (e.g. CCND3, CTCF, MYB, SMARCA4, ZFP36L2 and MYCN). We described new mechanisms of coding and non-coding alteration, and identify 10 recurrently altered pathways, with associations between mutated genes and pathways, and stage or subtype of T-ALL. For example, NRAS/FLT3 mutations were associated with immature T-ALL, JAK3/STAT5B mutations in HOX1 deregulated ALL, PTPN2 mutations in TLX1 T-ALL, and PIK3R1/PTEN mutations in TAL1 ALL, suggesting that different signaling pathways have distinct roles according to maturational stage. This genomic landscape provides a logical framework for the development of faithful genetic models and new therapeutic approaches. PMID:28671688
De novo mutations in regulatory elements in neurodevelopmental disorders

PubMed Central

Short, Patrick J.; McRae, Jeremy F.; Gallone, Giuseppe; Sifrim, Alejandro; Won, Hyejung; Geschwind, Daniel H.; Wright, Caroline F.; Firth, Helen V; FitzPatrick, David R.; Barrett, Jeffrey C.; Hurles, Matthew E.

2018-01-01

We previously estimated that 42% of patients with severe developmental disorders carry pathogenic de novo mutations in coding sequences. The role of de novo mutations in regulatory elements affecting genes associated with developmental disorders, or other genes, has been essentially unexplored. We identified de novo mutations in three classes of putative regulatory elements in almost 8,000 patients with developmental disorders. Here we show that de novo mutations in highly evolutionarily conserved fetal brain-active elements are significantly and specifically enriched in neurodevelopmental disorders. We identified a significant twofold enrichment of recurrently mutated elements. We estimate that, genome-wide, 1-3% of patients without a diagnostic coding variant carry pathogenic de novo mutations in fetal brain-active regulatory elements and that only 0.15% of all possible mutations within highly conserved fetal brain-active elements cause neurodevelopmental disorders with a dominant mechanism. Our findings represent a robust estimate of the contribution of de novo mutations in regulatory elements to this genetically heterogeneous set of disorders, and emphasize the importance of combining functional and evolutionary evidence to identify regulatory causes of genetic disorders. PMID:29562236

The draft genome sequence of Mangrovibacter sp. strain MP23, an endophyte isolated from the roots of Phragmites karka.

PubMed

Behera, Pratiksha; Vaishampayan, Parag; Singh, Nitin K; Mishra, Samir R; Raina, Vishakha; Suar, Mrutyunjay; Pattnaik, Ajit K; Rastogi, Gurdeep

2016-09-01

To date, only one draft genome has been reported within the genus Mangrovibacter. Here, we report the second draft genome shotgun sequence of a Mangrovibacter sp. strain MP23 that was isolated from the roots of Phragmites karka (P. karka), an invasive weed growing in the Chilika Lagoon, Odisha, India. Strain MP23 is a facultatively anaerobic, nitrogen-fixing endophytic bacterium that grows optimally at 37 °C, pH 7.0, and 1% NaCl concentration. The draft genome sequence of strain MP23 contains 4,947,475 bp with an estimated G + C content of 49.9% and a total of 4392 protein coding genes. The genome sequence has provided information on putative genes that code for proteins involved in oxidative stress, uptake of nutrients, and nitrogen fixation that might offer niche specific ecological fitness and explain the invasive success of P. karka in Chilika Lagoon. The draft genome sequence and annotation have been deposited at DDBJ/EMBL/GenBank under the accession number LYRP00000000.
Expression of homing endonuclease gene and insertion-like element in sea anemone mitochondrial genomes: Lesson learned from Anemonia viridis.

PubMed

Chi, Sylvia Ighem; Urbarova, Ilona; Johansen, Steinar D

2018-04-30

The mitochondrial genomes of sea anemones are dynamic in structure. Invasion by genetic elements, such as self-catalytic group I introns or insertion-like sequences, contribute to sea anemone mitochondrial genome expansion and complexity. By using next generation sequencing we investigated the complete mtDNAs and corresponding transcriptomes of the temperate sea anemone Anemonia viridis and its closer tropical relative Anemonia majano. Two versions of fused homing endonuclease gene (HEG) organization were observed among the Actiniidae sea anemones; in-frame gene fusion and pseudo-gene fusion. We provided support for the pseudo-gene fusion organization in Anemonia species, resulting in a repressed HEG from the COI-884 group I intron. orfA, a putative protein-coding gene with insertion-like features, was present in both Anemonia species. Interestingly, orfA and COI expression were significantly up-regulated upon long-term environmental stress corresponding to low seawater pH conditions. This study provides new insights to the dynamics of sea anemone mitochondrial genome structure and function. Copyright © 2018 Elsevier B.V. All rights reserved.
The complete mitochondrial genome sequence of Eimeria magna (Apicomplexa: Coccidia).

PubMed

Tian, Si-Qin; Cui, Ping; Fang, Su-Fang; Liu, Guo-Hua; Wang, Chun-Ren; Zhu, Xing-Quan

2015-01-01

In the present study, we determined the complete mitochondrial DNA (mtDNA) sequence of Eimeria magna from rabbits for the first time, and compared its gene content and genome organization with those of seven Eimeria spp. from domestic chickens. The size of the complete mt genome sequence of E. magna is 6249 bp, which consists of 3 protein-coding genes (cytb, cox1 and cox3), 12 gene fragments for the large subunit (LSU) rRNA, and 7 gene fragments for the small subunit (SSU) rRNA, without transfer RNA genes, in accordance with that of Eimeria spp. from chickens. The putative direction of translation for three genes (cytb, cox1 and cox3) was the same as those of Eimeria species from domestic chickens. The content of A + T is 65.16% for the E. magna mt genome (29.73% A, 35.43% T, 17.09% G and 17.75% C). The E. magna mt genome sequence provides novel mtDNA markers for studying the molecular epidemiology and population genetics of Eimeria spp. and has implications for the molecular diagnosis and control of rabbit coccidiosis.
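The base composition quoted in this abstract can be checked with a few lines of arithmetic. The short Python sketch below takes the four figures from the abstract, reading the G value (17.09) as a percentage like the other three, which is an assumption. It confirms that A and T add up to the stated A + T content and that the four bases account for the whole genome. The function names are illustrative only.

```python
# Base composition of the E. magna mt genome as quoted in the abstract (percent).
# The G value is assumed to be a percentage like the other three figures.
composition = {"A": 29.73, "T": 35.43, "G": 17.09, "C": 17.75}


def at_content(comp: dict) -> float:
    """A + T content in percent, rounded to two decimals."""
    return round(comp["A"] + comp["T"], 2)


def total(comp: dict) -> float:
    """Sum of all four base percentages, rounded to two decimals."""
    return round(sum(comp.values()), 2)


print(at_content(composition))  # 65.16
print(total(composition))       # 100.0
```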
Molecular modelling of the Norrie disease protein predicts a cystine knot growth factor tertiary structure.

PubMed

Meitinger, T; Meindl, A; Bork, P; Rost, B; Sander, C; Haasemann, M; Murken, J

1993-12-01

The X-linked gene for Norrie disease, which is characterized by blindness, deafness and mental retardation, has been cloned recently. This gene has been thought to code for a putative extracellular factor; its predicted amino acid sequence is homologous to the C-terminal domain of diverse extracellular proteins. Sequence pattern searches and three-dimensional modelling now suggest that the Norrie disease protein (NDP) has a tertiary structure similar to that of transforming growth factor beta (TGF beta). Our model identifies NDP as a member of an emerging family of growth factors containing a cystine knot motif, with direct implications for the physiological role of NDP. The model also sheds light on sequence related domains such as the C-terminal domain of mucins and of von Willebrand factor.
Gene finding in metatranscriptomic sequences.

PubMed

Ismail, Wazim Mohammed; Ye, Yuzhen; Tang, Haixu

2014-01-01

Metatranscriptomic sequencing is a highly sensitive bioassay of functional activity in a microbial community, providing complementary information to the metagenomic sequencing of the community. The acquisition of the metatranscriptomic sequences will enable us to refine the annotations of the metagenomes, and to study the gene activities and their regulation in complex microbial communities and their dynamics. In this paper, we present TransGeneScan, a software tool for finding genes in assembled transcripts from metatranscriptomic sequences. By incorporating several features of metatranscriptomic sequencing, including strand-specificity, short intergenic regions, and putative antisense transcripts into a Hidden Markov Model, TransGeneScan can predict a sense transcript containing one or multiple genes (in an operon) or an antisense transcript. We tested TransGeneScan on a mock metatranscriptomic data set containing three known bacterial genomes. The results showed that TransGeneScan performs better than metagenomic gene finders (MetaGeneMark and FragGeneScan) on predicting protein coding genes in assembled transcripts, and achieves comparable or even higher accuracy than gene finders for microbial genomes (Glimmer and GeneMark). These results imply that, with the assistance of metatranscriptomic sequencing, we can obtain a broad and precise picture of the genes (and their functions) in a microbial community. TransGeneScan is available as open-source software on SourceForge at https://sourceforge.net/projects/transgenescan/.
Integrative analyses of transcriptome sequencing identify novel functional lncRNAs in esophageal squamous cell carcinoma.

PubMed

Li, C-Q; Huang, G-W; Wu, Z-Y; Xu, Y-J; Li, X-C; Xue, Y-J; Zhu, Y; Zhao, J-M; Li, M; Zhang, J; Wu, J-Y; Lei, F; Wang, Q-Y; Li, S; Zheng, C-P; Ai, B; Tang, Z-D; Feng, C-C; Liao, L-D; Wang, S-H; Shen, J-H; Liu, Y-J; Bai, X-F; He, J-Z; Cao, H-H; Wu, B-L; Wang, M-R; Lin, D-C; Koeffler, H P; Wang, L-D; Li, X; Li, E-M; Xu, L-Y

2017-02-13

Long non-coding RNAs (lncRNAs) have a critical role in cancer initiation and progression, and thus may mediate oncogenic or tumor suppressing effects, as well as be a new class of cancer therapeutic targets. We performed high-throughput sequencing of RNA (RNA-seq) to investigate the expression level of lncRNAs and protein-coding genes in 30 esophageal samples, comprised of 15 esophageal squamous cell carcinoma (ESCC) samples and their 15 paired non-tumor tissues. We further developed an integrative bioinformatics method, denoted URW-LPE, to identify key functional lncRNAs that regulate expression of downstream protein-coding genes in ESCC. A number of known onco-lncRNA and many putative novel ones were effectively identified by URW-LPE. Importantly, we identified lncRNA625 as a novel regulator of ESCC cell proliferation, invasion and migration. ESCC patients with high lncRNA625 expression had significantly shorter survival time than those with low expression. LncRNA625 also showed specific prognostic value for patients with metastatic ESCC. Finally, we identified E1A-binding protein p300 (EP300) as a downstream executor of lncRNA625-induced transcriptional responses. These findings establish a catalog of novel cancer-associated functional lncRNAs, which will promote our understanding of lncRNA-mediated regulation in this malignancy.
Decoding the genome with an integrative analysis tool: combinatorial CRM Decoder.

PubMed

Kang, Keunsoo; Kim, Joomyeong; Chung, Jae Hoon; Lee, Daeyoup

2011-09-01

The identification of genome-wide cis-regulatory modules (CRMs) and characterization of their associated epigenetic features are fundamental steps toward the understanding of gene regulatory networks. Although integrative analysis of available genome-wide information can provide new biological insights, the lack of novel methodologies has become a major bottleneck. Here, we present a comprehensive analysis tool called combinatorial CRM decoder (CCD), which utilizes the publicly available information to identify and characterize genome-wide CRMs in a species of interest. CCD first defines a set of the epigenetic features which is significantly associated with a set of known CRMs as a code called 'trace code', and subsequently uses the trace code to pinpoint putative CRMs throughout the genome. Using 61 genome-wide data sets obtained from 17 independent mouse studies, CCD successfully catalogued ∼12 600 CRMs (five distinct classes) including polycomb repressive complex 2 target sites as well as imprinting control regions. Interestingly, we discovered that ∼4% of the identified CRMs belong to at least two different classes named 'multi-functional CRM', suggesting their functional importance for regulating spatiotemporal gene expression. From these examples, we show that CCD can be applied to any potential genome-wide datasets and therefore will shed light on unveiling genome-wide CRMs in various species.
Isolation and characterization of a water stress-specific genomic gene, pwsi 18, from rice.

PubMed

Joshee, N; Kisaka, H; Kitagawa, Y

1998-01-01

One of the water stress-specific cDNA clones of rice characterised previously, wsi18, was selected for further study. The wsi18 gene can be induced by water stress conditions such as mannitol, NaCl, and dryness, but not by ABA, cold, or heat. A genomic clone for wsi18, pwsi18, contained about 1.7 kbp of the 5' upstream sequence, two introns, and the full coding sequence. The 5'-upstream sequence of pwsi18 contained putative cis-acting elements, namely an ABA-responsive element (ABRE), three G-boxes, three E-boxes, a MEF-2 sequence, four direct and two inverted repeats, and four sequences similar to DRE, which is involved in the dehydration response of Arabidopsis genes. The gusA reporter gene under the control of the pwsi18 promoter showed transient expression in response to water stress. Deletion of the downstream DRE-like sequence between the distal G-boxes-2 and -3 resulted in rather low GUS expression.
Complete genome sequence analysis of the fish pathogen Flavobacterium columnare provides insights into antibiotic resistance and pathogenicity related genes.

PubMed

Zhang, Yulei; Zhao, Lijuan; Chen, Wenjie; Huang, Yunmao; Yang, Ling; Sarathbabu, V; Wu, Zaohe; Li, Jun; Nie, Pin; Lin, Li

2017-10-01

We analyzed the complete genome sequence of a highly virulent Flavobacterium columnare Pf1 strain isolated in our laboratory. The complete genome consists of a 3,171,081 bp circular DNA with 2784 predicted protein-coding genes. Among these, 286 genes were predicted as antibiotic resistance genes, including 32 RND-type efflux pump related genes which were associated with the export of aminoglycosides, indicating inducible aminoglycoside resistance in F. columnare. On the other hand, 328 genes were predicted as pathogenicity related genes which could be classified as virulence factors, gliding motility proteins, adhesins, and many putative secreted proteases. These genes were probably involved in the colonization, invasion and destruction of fish tissues during infection by F. columnare. The complete genome sequence provides a basis for explaining the interactions between F. columnare and the infected fish. The predicted antibiotic resistance and pathogenicity related genes will shed new light on the development of more efficient preventive strategies against infection by F. columnare, which is a major worldwide fish pathogen. Copyright © 2017 Elsevier Ltd. All rights reserved.
Discovery of genes related to insecticide resistance in Bactrocera dorsalis by functional genomic analysis of a de novo assembled transcriptome.

PubMed

Hsu, Ju-Chun; Chien, Ting-Ying; Hu, Chia-Cheng; Chen, Mei-Ju May; Wu, Wen-Jer; Feng, Hai-Tung; Haymer, David S; Chen, Chien-Yu

2012-01-01

Insecticide resistance has recently become a critical concern for control of many insect pest species. Genome sequencing and global quantization of gene expression through analysis of the transcriptome can provide useful information relevant to this challenging problem. The oriental fruit fly, Bactrocera dorsalis, is one of the world's most destructive agricultural pests, and recently it has been used as a target for studies of genetic mechanisms related to insecticide resistance. However, prior to this study, the molecular data available for this species was largely limited to genes identified through homology. To provide a broader pool of gene sequences of potential interest with regard to insecticide resistance, this study uses whole transcriptome analysis developed through de novo assembly of short reads generated by next-generation sequencing (NGS). The transcriptome of B. dorsalis was initially constructed using Illumina's Solexa sequencing technology. Qualified reads were assembled into contigs and potential splicing variants (isotigs). A total of 29,067 isotigs have putative homologues in the non-redundant (nr) protein database from NCBI, and 11,073 of these correspond to distinct D. melanogaster proteins in the RefSeq database. Approximately 5,546 isotigs contain coding sequences that are at least 80% complete and appear to represent B. dorsalis genes. We observed a strong correlation between the completeness of the assembled sequences and the expression intensity of the transcripts. The assembled sequences were also used to identify large numbers of genes potentially belonging to families related to insecticide resistance. A total of 90 P450-, 42 GST-and 37 COE-related genes, representing three major enzyme families involved in insecticide metabolism and resistance, were identified. In addition, 36 isotigs were discovered to contain target site sequences related to four classes of resistance genes. 
Identified sequence motifs were also analyzed to characterize putative polypeptide translational products and associate them with specific genes and protein functions.
Contribution of glutamate decarboxylase in Lactobacillus reuteri to acid resistance and persistence in sourdough fermentation

PubMed Central

2011-01-01

Background Acid stress impacts the persistence of lactobacilli in industrial sourdough fermentations, and in intestinal ecosystems. However, the contribution of glutamate to acid resistance in lactobacilli has not been demonstrated experimentally, and evidence for the contribution of acid resistance to the competitiveness of lactobacilli in sourdough is lacking. It was therefore the aim of this study to investigate the ecological role of glutamate decarboxylase in L. reuteri. Results A gene coding for a putative glutamate decarboxylase, gadB, was identified in the genome of L. reuteri 100-23. Different from the organization of genetic loci coding for glutamate decarboxylase in other lactic acid bacteria, gadB was located adjacent to a putative glutaminase gene, gls3. An isogenic deletion mutant, L. reuteri ∆gadB, was generated by a double crossover method. L. reuteri 100-23 but not L. reuteri ∆gadB converted glutamate to γ-aminobutyrate (GABA) in phosphate buffer (pH 2.5). In sourdough, both strains converted glutamine to glutamate but only L. reuteri 100-23 accumulated GABA. Glutamate addition to phosphate buffer, pH 2.5, improved survival of L. reuteri 100-23 100-fold. However, survival of L. reuteri ∆gadB remained essentially unchanged. The disruption of gadB did not affect growth of L. reuteri in mMRS or in sourdough. However, the wild type strain L. reuteri 100-23 displaced L. reuteri ∆gadB after 5 cycles of fermentation in back-slopped sourdough fermentations. Conclusions The conversion of glutamate to GABA by L. reuteri 100-23 contributes to acid resistance and to competitiveness in industrial sourdough fermentations. The organization of the gene cluster for glutamate conversion, and the availability of amino acids in cereals imply that glutamine rather than glutamate functions as the substrate for GABA formation. The exceptional coupling of glutamine deamidation to glutamate decarboxylation in L. reuteri likely reflects adaptation to cereal substrates. PMID:21995488
miRWalk--database: prediction of possible miRNA binding sites by "walking" the genes of three genomes.

PubMed

Dweep, Harsh; Sticht, Carsten; Pandey, Priyanka; Gretz, Norbert

2011-10-01

MicroRNAs are small, non-coding RNA molecules that can complementarily bind to the mRNA 3'-UTR region to regulate the gene expression by transcriptional repression or induction of mRNA degradation. Increasing evidence suggests a new mechanism by which miRNAs may regulate target gene expression by binding in promoter and amino acid coding regions. Most of the existing databases on miRNAs are restricted to mRNA 3'-UTR region. To address this issue, we present miRWalk, a comprehensive database on miRNAs, which hosts predicted as well as validated miRNA binding sites, information on all known genes of human, mouse and rat. All mRNAs, mitochondrial genes and 10 kb upstream flanking regions of all known genes of human, mouse and rat were analyzed by using a newly developed algorithm named 'miRWalk' as well as with eight already established programs for putative miRNA binding sites. An automated and extensive text-mining search was performed on PubMed database to extract validated information on miRNAs. Combined information was put into a MySQL database. miRWalk presents predicted and validated information on miRNA-target interaction. Such a resource enables researchers to validate new targets of miRNA not only on 3'-UTR, but also on the other regions of all known genes. The 'Validated Target module' is updated every month and the 'Predicted Target module' is updated every 6 months. miRWalk is freely available at http://mirwalk.uni-hd.de/. Copyright © 2011 Elsevier Inc. All rights reserved.
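To make the idea of scanning a gene region for putative miRNA binding sites concrete, here is a minimal Python sketch. It is not the miRWalk algorithm, which the abstract does not describe in detail. It assumes the widely used seed heuristic (nucleotides 2 to 8 of the miRNA pairing with a perfectly complementary site) and uses toy sequences; the function names are hypothetical.

```python
# Watson-Crick complement table for RNA.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def seed_site(mirna: str) -> str:
    """Return the target-side sequence (5'->3') that pairs perfectly with
    the miRNA seed, taken here as nucleotides 2-8 of the miRNA."""
    seed = mirna[1:8]
    return seed.translate(COMPLEMENT)[::-1]  # reverse complement


def find_seed_matches(mirna: str, region: str) -> list:
    """Walk along a region (promoter, coding sequence or 3'-UTR, written
    as RNA) and return 0-based start positions of perfect seed matches."""
    site = seed_site(mirna)
    width = len(site)
    return [
        i for i in range(len(region) - width + 1)
        if region[i:i + width] == site
    ]


# Toy sequences for illustration only.
toy_mirna = "UGAGGUAGUAGGUUGUAUAGUU"
toy_region = "AAACUACCUCGGGCUACCUCA"

print(seed_site(toy_mirna))                      # CUACCUC
print(find_seed_matches(toy_mirna, toy_region))  # [3, 13]
```

Because the scan takes any sequence string, the same function can be applied to upstream flanking regions and coding sequences as well as 3'-UTRs, which is the broader search space the abstract emphasizes.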
Molecular and biochemical characterization of two tungsten- and selenium-containing formate dehydrogenases from Eubacterium acidaminophilum that are associated with components of an iron-only hydrogenase.

PubMed

Graentzdoerffer, Andrea; Rauh, David; Pich, Andreas; Andreesen, Jan R

2003-01-01

Two gene clusters encoding similar formate dehydrogenases (FDH) were identified in Eubacterium acidaminophilum. Each cluster is composed of one gene coding for a catalytic subunit (fdhA-I, fdhA-II) and one for an electron-transferring subunit (fdhB-I, fdhB-II). Both fdhA genes contain a TGA codon for selenocysteine incorporation and the encoded proteins harbor five putative iron-sulfur clusters in their N-terminal region. Both FdhB subunits resemble the N-terminal region of FdhA on the amino acid level and contain five putative iron-sulfur clusters. Four genes thought to encode the subunits of an iron-only hydrogenase are located upstream of the FDH gene cluster I. By sequence comparison, HymA and HymB are predicted to contain one and four iron-sulfur clusters, respectively, the latter protein also binding sites for FMN and NAD(P). Thus, HymA and HymB seem to represent electron-transferring subunits, and HymC the putative catalytic subunit containing motifs for four iron-sulfur clusters and one H-cluster specific for Fe-only hydrogenases. HymD has six predicted transmembrane helices and might be an integral membrane protein. Viologen-dependent FDH activity was purified from serine-grown cells of E. acidaminophilum and the purified protein complex contained four subunits, FdhA and FdhB, encoded by FDH gene cluster II, and HymA and HymB, identified after determination of their N-terminal sequences. Thus, this complex might represent the most simple type of a formate hydrogen lyase. The purified formate dehydrogenase fraction contained iron, tungsten, a pterin cofactor, and zinc, but no molybdenum. FDH-II had a two-fold higher K(m) for formate (0.37 mM) than FDH-I and also catalyzed CO(2) reduction to formate. Reverse transcription (RT)-PCR pointed to increased expression of FDH-II in serine-grown cells, supporting the isolation of this FDH isoform. The fdhA-I gene was expressed as inactive protein in Escherichia coli.
The in-frame UGA codon for selenocysteine incorporation was read in the heterologous system only as stop codon, although its potential SECIS element exhibited a quite high similarity to that of E. coli FDH.
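To make the notion of an in-frame TGA (UGA in the mRNA) codon concrete, here is a minimal sketch that is not part of the study: it lists the codon positions at which TGA falls in the reading frame of an open reading frame. The function name and the toy sequence are hypothetical, and the sketch says nothing about whether a given TGA is actually read as selenocysteine, which depends on a SECIS element and the host machinery.

```python
def in_frame_codons(orf: str, codon: str = "TGA") -> list[int]:
    """Return 0-based codon indices where `codon` occurs in frame 0 of `orf`."""
    orf = orf.upper()
    # Step through the sequence three bases at a time, starting at position 0.
    return [i // 3 for i in range(0, len(orf) - 2, 3) if orf[i:i + 3] == codon]


# Hypothetical toy ORF (not a real fdhA sequence): ATG GCT TGA GGC TAA
toy_orf = "ATGGCTTGAGGCTAA"
hits = in_frame_codons(toy_orf)

# Ignore the final codon, which is the terminating stop of the ORF.
last_codon_index = len(toy_orf) // 3 - 1
internal = [h for h in hits if h != last_codon_index]
print(internal)
```

An internal hit such as this is a candidate selenocysteine codon in an organism with the appropriate decoding machinery, and a premature stop in a host that lacks it.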
Structural analysis of the 5′ region of mouse and human Huntington disease genes reveals conservation of putative promoter region and di- and trinucleotide polymorphisms

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lin, Biaoyang; Nasir, J.; Kalchman, M.A.

1995-02-10

We have previously cloned and characterized the murine homologue of the Huntington disease (HD) gene and shown that it maps to mouse chromosome 5 within a region of conserved synteny with human chromosome 4p16.3. Here we present a detailed comparison of the sequence of the putative promoter and the organization of the 5′ genomic region of the murine (Hdh) and human HD genes encompassing the first five exons. We show that in this region these two genes share identical exon boundaries, but have different-size introns. Two dinucleotide (CT) and one trinucleotide intronic polymorphism in Hdh and an intronic CA polymorphism in the HD gene were identified. Comparison of 940-bp sequence 5′ to the putative translation start site reveals a highly conserved region (78.8% nucleotide identity) between Hdh and the HD gene from nucleotide -56 to -206 (of Hdh). Neither Hdh nor the HD gene have typical TATA or CCAAT elements, but both show one putative AP2 binding site and numerous potential Sp1 binding sites. The high sequence identity between Hdh and the HD gene for approximately 200 bp 5′ to the putative translation start site indicates that these sequences may play a role in regulating expression of the Huntington disease gene. 30 refs., 4 figs., 2 tabs.
A Third Approach to Gene Prediction Suggests Thousands of Additional Human Transcribed Regions

PubMed Central

Glusman, Gustavo; Qin, Shizhen; El-Gewely, M. Raafat; Siegel, Andrew F; Roach, Jared C; Hood, Leroy; Smit, Arian F. A

2006-01-01

The identification and characterization of the complete ensemble of genes is a main goal of deciphering the digital information stored in the human genome. Many algorithms for computational gene prediction have been described, ultimately derived from two basic concepts: (1) modeling gene structure and (2) recognizing sequence similarity. Successful hybrid methods combining these two concepts have also been developed. We present a third orthogonal approach to gene prediction, based on detecting the genomic signatures of transcription, accumulated over evolutionary time. We discuss four algorithms based on this third concept: Greens and CHOWDER, which quantify mutational strand biases caused by transcription-coupled DNA repair, and ROAST and PASTA, which are based on strand-specific selection against polyadenylation signals. We combined these algorithms into an integrated method called FEAST, which we used to predict the location and orientation of thousands of putative transcription units not overlapping known genes. Many of the newly predicted transcriptional units do not appear to code for proteins. The new algorithms are particularly apt at detecting genes with long introns and lacking sequence conservation. They therefore complement existing gene prediction methods and will help identify functional transcripts within many apparent “genomic deserts.” PMID:16543943
The spectrum of low molecular weight alpha-amylase/protease inhibitor genes expressed in the US bread wheat cultivar Butte 86

PubMed Central

2011-01-01

Background Wheat grains accumulate a variety of low molecular weight proteins that are inhibitors of alpha-amylases and proteases and play an important protective role in the grain. These proteins have more balanced amino acid compositions than the major wheat gluten proteins and contribute important reserves for both seedling growth and human nutrition. The alpha-amylase/protease inhibitors also are of interest because they cause IgE-mediated occupational and food allergies and thereby impact human health. Results The complement of genes encoding alpha-amylase/protease inhibitors expressed in the US bread wheat Butte 86 was characterized by analysis of expressed sequence tags (ESTs). Coding sequences for 19 distinct proteins were identified. These included two monomeric (WMAI), four dimeric (WDAI), and six tetrameric (WTAI) inhibitors of exogenous alpha-amylases, two inhibitors of endogenous alpha-amylases (WASI), four putative trypsin inhibitors (CMx and WTI), and one putative chymotrypsin inhibitor (WCI). A number of the encoded proteins were identical or very similar to proteins in the NCBI database. Sequences not reported previously included variants of WTAI-CM3, three CMx inhibitors and WTI. Within the WDAI group, two different genes encoded the same mature protein. Based on numbers of ESTs, transcripts for WTAI-CM3 Bu-1, WMAI Bu-1 and WTAI-CM16 Bu-1 were most abundant in Butte 86 developing grain. Coding sequences for 16 of the inhibitors were unequivocally associated with specific proteins identified by tandem mass spectrometry (MS/MS) in a previous proteomic analysis of milled white flour from Butte 86. Proteins corresponding to WDAI Bu-1/Bu-2, WMAI Bu-1 and the WTAI subunits CM2 Bu-1, CM3 Bu-1 and CM16 Bu-1 were accumulated to the highest levels in flour. 
Conclusions Information on the spectrum of alpha-amylase/protease inhibitor genes and proteins expressed in a single wheat cultivar is central to understanding the importance of these proteins in both plant defense mechanisms and human allergies and facilitates both breeding and biotechnology approaches for manipulating the composition of these proteins in plants. PMID:21774824
Biotin protein ligase from Corynebacterium glutamicum: role for growth and L-lysine production.

PubMed

Peters-Wendisch, P; Stansen, K C; Götker, S; Wendisch, V F

2012-03-01

Corynebacterium glutamicum is a biotin auxotrophic Gram-positive bacterium that is used for large-scale production of amino acids, especially of L-glutamate and L-lysine. It is known that biotin limitation triggers L-glutamate production and that L-lysine production can be increased by enhancing the activity of pyruvate carboxylase, one of two biotin-dependent proteins of C. glutamicum. The gene cg0814 (accession number YP_225000) has been annotated to code for putative biotin protein ligase BirA, but the protein has not yet been characterized. A discontinuous enzyme assay of biotin protein ligase activity was established using a 105aa peptide corresponding to the carboxyterminus of the biotin carboxylase/biotin carboxyl carrier protein subunit AccBC of the acetyl CoA carboxylase from C. glutamicum as acceptor substrate. Biotinylation of this biotin acceptor peptide was revealed with crude extracts of a strain overexpressing the birA gene and was shown to be ATP dependent. Thus, birA from C. glutamicum codes for a functional biotin protein ligase (EC 6.3.4.15). The gene birA from C. glutamicum was overexpressed and the transcriptome was compared with the control strain revealing no significant gene expression changes of the bio-genes. However, biotin protein ligase overproduction increased the level of the biotin-containing protein pyruvate carboxylase and entailed a significant growth advantage in glucose minimal medium. Moreover, birA overexpression resulted in a twofold higher L-lysine yield on glucose as compared with the control strain.
Characterization of the Aspergillus nidulans aspnd1 gene demonstrates that the ASPND1 antigen, which it encodes, and several Aspergillus fumigatus immunodominant antigens belong to the same family.

PubMed Central

Calera, J A; Ovejero, M C; López-Medrano, R; Segurado, M; Puente, P; Leal, F

1997-01-01

For the first time, an immunodominant Aspergillus nidulans antigen (ASPND1) consistently reactive with serum samples from aspergilloma patients has been purified and characterized, and its coding gene (aspnd1) has been cloned and sequenced. ASPND1 is a glycoprotein with four N-glycosidically-bound sugar chains (around 2.1 kDa each) which are not necessary for reactivity with immune human sera. The polypeptide part is synthesized as a 277-amino-acid precursor of 30.6 kDa that after cleavage of a putative signal peptide of 16 amino acids, affords a mature protein of 261 amino acids with a molecular mass of 29 kDa and a pI of 4.24 (as deduced from the sequence). The ASPND1 protein is 53.1% identical to the AspfII allergen from Aspergillus fumigatus and 48% identical to an unpublished Candida albicans antigen. All of the cysteine residues and most of the glycosylation sites are perfectly conserved in the three proteins, suggesting a similar but yet unknown function. Analysis of the primary structure of the ASPND1 coding gene (aspnd1) has allowed the establishment of a clear relationship between several previously reported A. fumigatus and A. nidulans immunodominant antigens. PMID:9119471
A New Set of ESTs from Chickpea (Cicer arietinum L.) Embryo Reveals Two Novel F-Box Genes, CarF-box_PP2 and CarF-box_LysM, with Potential Roles in Seed Development

PubMed Central

Gupta, Shefali; Garg, Vanika; Bhatia, Sabhyata

2015-01-01

Considering the economic importance of chickpea (C. arietinum L.) seeds, it is important to understand the mechanisms underlying seed development for which a cDNA library was constructed from 6 day old chickpea embryos. A total of 8,186 ESTs were obtained from which 4,048 high quality ESTs were assembled into 1,480 unigenes that majorly encoded genes involved in various metabolic and regulatory pathways. Of these, 95 ESTs were found to be involved in ubiquitination related protein degradation pathways and 12 ESTs coded specifically for putative F-box proteins. Differential transcript accumulation of these putative F-box genes was observed in chickpea tissues as evidenced by quantitative real-time PCR. Further, to explore the role of F-box proteins in chickpea seed development, two F-box genes were selected for molecular characterization. These were named as CarF-box_PP2 and CarF-box_LysM depending on their C-terminal domains, PP2 and LysM, respectively. Their highly conserved structures led us to predict their target substrates. Subcellular localization experiment revealed that CarF-box_PP2 was localized in the cytoplasm and CarF-box_LysM was localized in the nucleus. We demonstrated their physical interactions with SKP1 protein, which validated that they function as F-box proteins in the formation of SCF complexes. Sequence analysis of their promoter regions revealed certain seed specific cis-acting elements that may be regulating their preferential transcript accumulation in the seed. Overall, the study helped in expanding the EST database of chickpea, which was further used to identify two novel F-box genes having a potential role in seed development. PMID:25803812
Spontaneous mutation reveals influence of exopolysaccharide on Lactobacillus johnsonii surface characteristics.

PubMed

Horn, Nikki; Wegmann, Udo; Dertli, Enes; Mulholland, Francis; Collins, Samuel R A; Waldron, Keith W; Bongaerts, Roy J; Mayer, Melinda J; Narbad, Arjan

2013-01-01

As a competitive exclusion agent, Lactobacillus johnsonii FI9785 has been shown to prevent the colonization of selected pathogenic bacteria from the chicken gastrointestinal tract. During growth of the bacterium a rare but consistent emergence of an altered phenotype was noted, generating smooth colonies in contrast to the wild type rough form. A smooth colony variant was isolated and two-dimensional gel analysis of both strains revealed a protein spot with different migration properties in the two phenotypes. The spot in both gels was identified as a putative tyrosine kinase (EpsC), associated with a predicted exopolysaccharide gene cluster. Sequencing of the epsC gene from the smooth mutant revealed a single substitution (G to A) in the coding strand, resulting in the amino acid change D88N in the corresponding gene product. A native plasmid of L. johnsonii was engineered to produce a novel vector for constitutive expression and this was used to demonstrate that expression of the wild type epsC gene in the smooth mutant produced a reversion to the rough colony phenotype. Both the mutant and epsC complemented strains had increased levels of exopolysaccharides compared to the wild type strain, indicating that the rough phenotype is not solely associated with the quantity of exopolysaccharide. Another gene in the cluster, epsE, that encoded a putative undecaprenyl-phosphate galactosephosphotransferase, was deleted in order to investigate its role in exopolysaccharide biosynthesis. The ΔepsE strain exhibited a large increase in cell aggregation and a reduction in exopolysaccharide content, while plasmid complementation of epsE restored the wild type phenotype. Flow cytometry showed that the wild type and derivative strains exhibited clear differences in their adhesive ability to HT29 monolayers in tissue culture, demonstrating an impact of EPS on surface properties and bacteria-host interactions.

TCOF1 gene encodes a putative nucleolar phosphoprotein that exhibits mutations in Treacher Collins Syndrome throughout its coding region

PubMed Central

Wise, Carol A.; Chiang, Lydia C.; Paznekas, William A.; Sharma, Mridula; Musy, Maurice M.; Ashley, Jennifer A.; Lovett, Michael; Jabs, Ethylin W.

1997-01-01

Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development. PMID:9096354
Genome-Wide Identification of Medicago Peptides Involved in Macronutrient Responses and Nodulation

PubMed Central

Dai, Xinbin; Zhuang, Zhaohong; Torres-Jerez, Ivone; Nogales, Joaquina

2017-01-01

Growing evidence indicates that small, secreted peptides (SSPs) play critical roles in legume growth and development, yet the annotation of SSP-coding genes is far from complete. Systematic reannotation of the Medicago truncatula genome identified 1,970 homologs of established SSP gene families and an additional 2,455 genes that are potentially novel SSPs, previously unreported in the literature. The expression patterns of known and putative SSP genes based on 144 RNA sequencing data sets covering various stages of macronutrient deficiencies and symbiotic interactions with rhizobia and mycorrhiza were investigated. Focusing on those known or suspected to act via receptor-mediated signaling, 240 nutrient-responsive and 365 nodulation-responsive Signaling-SSPs were identified, greatly expanding the number of SSP gene families potentially involved in acclimation to nutrient deficiencies and nodulation. Synthetic peptide applications were shown to alter root growth and nodulation phenotypes, revealing additional regulators of legume nutrient acquisition. Our results constitute a powerful resource enabling further investigations of specific SSP functions via peptide treatment and reverse genetics. PMID:29030416
Gene Loss and Lineage-Specific Restriction-Modification Systems Associated with Niche Differentiation in the Campylobacter jejuni Sequence Type 403 Clonal Complex

PubMed Central

Morley, Laura; McNally, Alan; Paszkiewicz, Konrad; Corander, Jukka; Méric, Guillaume; Sheppard, Samuel K.; Blom, Jochen

2015-01-01

Campylobacter jejuni is a highly diverse species of bacteria commonly associated with infectious intestinal disease of humans and zoonotic carriage in poultry, cattle, pigs, and other animals. The species contains a large number of distinct clonal complexes that vary from host generalist lineages commonly found in poultry, livestock, and human disease cases to host-adapted specialized lineages primarily associated with livestock or poultry. Here, we present novel data on the ST403 clonal complex of C. jejuni, a lineage that has not been reported in avian hosts. Our data show that the lineage exhibits a distinctive pattern of intralineage recombination that is accompanied by the presence of lineage-specific restriction-modification systems. Furthermore, we show that the ST403 complex has undergone gene decay at a number of loci. Our data provide a putative link between the lack of association with avian hosts of C. jejuni ST403 and both gene gain and gene loss through nonsense mutations in coding sequences of genes, resulting in pseudogene formation. PMID:25795671
Diversity in copy number and structure of a silkworm morphogenetic gene as a result of domestication.

PubMed

Sakudoh, Takashi; Nakashima, Takeharu; Kuroki, Yoko; Fujiyama, Asao; Kohara, Yuji; Honda, Naoko; Fujimoto, Hirofumi; Shimada, Toru; Nakagaki, Masao; Banno, Yutaka; Tsuchida, Kozo

2011-03-01

The carotenoid-binding protein (CBP) of the domesticated silkworm, Bombyx mori, a major determinant of cocoon color, is likely to have been substantially influenced by domestication of this species. We analyzed the structure of the CBP gene in multiple strains of B. mori, in multiple individuals of the wild silkworm, B. mandarina (the putative wild ancestor of B. mori), and in a number of other lepidopterans. We found the CBP gene copy number in genomic DNA to vary widely among B. mori strains, ranging from 1 to 20. The copies of CBP are of several types, based on the presence of a retrotransposon or partial deletion of the coding sequence. In contrast to B. mori, B. mandarina was found to possess a single copy of CBP without the retrotransposon insertion, regardless of habitat. Several other lepidopterans were found to contain sequences homologous to CBP, revealing that this gene is evolutionarily conserved in the lepidopteran lineage. Thus, domestication can generate significant diversity of gene copy number and structure over a relatively short evolutionary time. © 2011 by the Genetics Society of America
Diversity in Copy Number and Structure of a Silkworm Morphogenetic Gene as a Result of Domestication

PubMed Central

Sakudoh, Takashi; Nakashima, Takeharu; Kuroki, Yoko; Fujiyama, Asao; Kohara, Yuji; Honda, Naoko; Fujimoto, Hirofumi; Shimada, Toru; Nakagaki, Masao; Banno, Yutaka; Tsuchida, Kozo

2011-01-01

The carotenoid-binding protein (CBP) of the domesticated silkworm, Bombyx mori, a major determinant of cocoon color, is likely to have been substantially influenced by domestication of this species. We analyzed the structure of the CBP gene in multiple strains of B. mori, in multiple individuals of the wild silkworm, B. mandarina (the putative wild ancestor of B. mori), and in a number of other lepidopterans. We found the CBP gene copy number in genomic DNA to vary widely among B. mori strains, ranging from 1 to 20. The copies of CBP are of several types, based on the presence of a retrotransposon or partial deletion of the coding sequence. In contrast to B. mori, B. mandarina was found to possess a single copy of CBP without the retrotransposon insertion, regardless of habitat. Several other lepidopterans were found to contain sequences homologous to CBP, revealing that this gene is evolutionarily conserved in the lepidopteran lineage. Thus, domestication can generate significant diversity of gene copy number and structure over a relatively short evolutionary time. PMID:21242537
MicroRNA-181 promotes synaptogenesis and attenuates axonal outgrowth in cortical neurons

PubMed Central

Kos, Aron; Olde Loohuis, Nikkie; Meinhardt, Julia; van Bokhoven, Hans; Kaplan, Barry B; Martens, Gerard; Aschrafi, Armaz

2016-01-01

MicroRNAs (miRs) are non-coding gene transcripts abundantly expressed in both the developing and adult mammalian brain. They act as important modulators of complex gene regulatory networks during neuronal development and plasticity. miR-181c is highly abundant in cerebellar cortex and its expression is increased in autism patients as well as in an animal model of autism. To systematically identify putative targets of miR-181c, we repressed this miR in growing cortical neurons and found over 70 differentially expressed target genes using transcriptome profiling. Pathway analysis showed that the miR-181c-modulated genes converge on signaling cascades relevant to neurite and synapse developmental processes. To experimentally examine the significance of these data, we inhibited miR-181c during rat cortical neuronal maturation in vitro; this loss-of miR-181c function resulted in enhanced neurite sprouting and reduced synaptogenesis. Collectively, our findings suggest that miR-181c is a modulator of gene networks associated with cortical neuronal maturation. PMID:27017280
Effects of Halide Ions on the Carbamidocyclophane Biosynthesis in Nostoc sp. CAVN2

PubMed Central

Preisitsch, Michael; Heiden, Stefan E.; Beerbaum, Monika; Niedermeyer, Timo H. J.; Schneefeld, Marie; Herrmann, Jennifer; Kumpfmüller, Jana; Thürmer, Andrea; Neidhardt, Inga; Wiesner, Christoph; Daniel, Rolf; Müller, Rolf; Bange, Franz-Christoph; Schmieder, Peter; Schweder, Thomas; Mundt, Sabine

2016-01-01

In this study, the influence of halide ions on [7.7]paracyclophane biosynthesis in the cyanobacterium Nostoc sp. CAVN2 was investigated. In contrast to KI and KF, supplementation of the culture medium with KCl or KBr resulted not only in an increase of growth but also in an up-regulation of carbamidocyclophane production. LC-MS analysis indicated the presence of chlorinated, brominated, but also non-halogenated derivatives. In addition to 22 known cylindrocyclophanes and carbamidocyclophanes, 27 putative congeners have been detected. Nine compounds, carbamidocyclophanes M−U, were isolated, and their structural elucidation by 1D and 2D NMR experiments in combination with HRMS and ECD analysis revealed that they are brominated analogues of chlorinated carbamidocyclophanes. Quantification of the carbamidocyclophanes showed that chloride is the preferably utilized halide, but incorporation is reduced in the presence of bromide. Evaluation of the antibacterial activity of 30 [7.7]paracyclophanes and related derivatives against selected pathogenic Gram-positive and Gram-negative bacteria exhibited remarkable effects especially against methicillin- and vancomycin-resistant staphylococci and Mycobacterium tuberculosis. For deeper insights into the mechanisms of biosynthesis, the carbamidocyclophane biosynthetic gene cluster in Nostoc sp. CAVN2 was studied. The gene putatively coding for the carbamoyltransferase has been identified. Based on bioinformatic analyses, a possible biosynthetic assembly is discussed. PMID:26805858
Complex Interplay among DNA Modification, Noncoding RNA Expression and Protein-Coding RNA Expression in Salvia miltiorrhiza Chloroplast Genome

PubMed Central

Chen, Haimei; Zhang, Jianhui; Yuan, George; Liu, Chang

2014-01-01

Salvia miltiorrhiza is one of the most widely used medicinal plants. As a first step to develop a chloroplast-based genetic engineering method for the over-production of active components from S. miltiorrhiza, we have analyzed the genome, transcriptome, and base modifications of the S. miltiorrhiza chloroplast. Total genomic DNA and RNA were extracted from fresh leaves and then subjected to strand-specific RNA-Seq and Single-Molecule Real-Time (SMRT) sequencing analyses. Mapping the RNA-Seq reads to the genome assembly allowed us to determine the relative expression levels of 80 protein-coding genes. In addition, we identified 19 polycistronic transcription units and 136 putative antisense and intergenic noncoding RNA (ncRNA) genes. Comparison of the abundance of protein-coding transcripts (cRNA) with and without overlapping antisense ncRNAs (asRNA) suggests that the presence of asRNA is associated with increased cRNA abundance (p<0.05). Using the SMRT Portal software (v1.3.2), 2687 potential DNA modification sites and two potential DNA modification motifs were predicted. The two motifs include a TATA box–like motif (CPGDMM1, “TATANNNATNA”), and an unknown motif (CPGDMM2 “WNYANTGAW”). Specifically, 35 of the 97 CPGDMM1 motifs (36.1%) and 91 of the 369 CPGDMM2 motifs (24.7%) were found to be significantly modified (p<0.01). Analysis of genes downstream of the CPGDMM1 motif revealed the significantly increased abundance of ncRNA genes that are less than 400 bp away from the significantly modified CPGDMM1 motif (p<0.01). Taken together, the present study revealed a complex interplay among DNA modifications, ncRNA and cRNA expression in the chloroplast genome. PMID:24914614
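The two motifs in this abstract are written in IUPAC degenerate nucleotide notation (N = any base, W = A or T, Y = C or T). As an illustration only, and not the SMRT Portal procedure used in the study, the sketch below expands such a motif into a regular expression and reports where it occurs on the given strand of a sequence. The function names and the toy sequences are hypothetical.

```python
import re

# Standard IUPAC degenerate codes, limited to the ones needed for typical motifs.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "W": "[AT]", "S": "[CG]", "Y": "[CT]", "R": "[AG]",
    "K": "[GT]", "M": "[AC]", "N": "[ACGT]",
}


def motif_to_regex(motif: str) -> re.Pattern:
    """Translate an IUPAC motif into a regex; the lookahead permits overlapping hits."""
    body = "".join(IUPAC[base] for base in motif.upper())
    return re.compile("(?=(" + body + "))")


def find_motif(seq: str, motif: str) -> list[int]:
    """Return 0-based start positions of `motif` on the forward strand of `seq`."""
    return [m.start() for m in motif_to_regex(motif).finditer(seq.upper())]


# Hypothetical toy sequences, not taken from the chloroplast genome.
print(find_motif("GGTATACGTATCAGG", "TATANNNATNA"))
print(find_motif("ACTAGTGAT", "WNYANTGAW"))
```

Only the forward strand is scanned here; a fuller search would also scan the reverse complement.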
Complex interplay among DNA modification, noncoding RNA expression and protein-coding RNA expression in Salvia miltiorrhiza chloroplast genome.

PubMed

Chen, Haimei; Zhang, Jianhui; Yuan, George; Liu, Chang

2014-01-01

Salvia miltiorrhiza is one of the most widely used medicinal plants. As a first step to develop a chloroplast-based genetic engineering method for the over-production of active components from S. miltiorrhiza, we have analyzed the genome, transcriptome, and base modifications of the S. miltiorrhiza chloroplast. Total genomic DNA and RNA were extracted from fresh leaves and then subjected to strand-specific RNA-Seq and Single-Molecule Real-Time (SMRT) sequencing analyses. Mapping the RNA-Seq reads to the genome assembly allowed us to determine the relative expression levels of 80 protein-coding genes. In addition, we identified 19 polycistronic transcription units and 136 putative antisense and intergenic noncoding RNA (ncRNA) genes. Comparison of the abundance of protein-coding transcripts (cRNA) with and without overlapping antisense ncRNAs (asRNA) suggests that the presence of asRNA is associated with increased cRNA abundance (p<0.05). Using the SMRT Portal software (v1.3.2), 2687 potential DNA modification sites and two potential DNA modification motifs were predicted. The two motifs include a TATA box-like motif (CPGDMM1, "TATANNNATNA"), and an unknown motif (CPGDMM2 "WNYANTGAW"). Specifically, 35 of the 97 CPGDMM1 motifs (36.1%) and 91 of the 369 CPGDMM2 motifs (24.7%) were found to be significantly modified (p<0.01). Analysis of genes downstream of the CPGDMM1 motif revealed the significantly increased abundance of ncRNA genes that are less than 400 bp away from the significantly modified CPGDMM1 motif (p<0.01). Taken together, the present study revealed a complex interplay among DNA modifications, ncRNA and cRNA expression in the chloroplast genome.
Complete nucleotide sequence and annotation of the temperate corynephage ϕ16 genome.

PubMed

Lobanova, Juliya S; Gak, Evgueni R; Andreeva, Irina G; Rybak, Konstantin V; Krylov, Alexander A; Mashko, Sergey V

2017-08-01

The complete genome of ϕ16, a temperate corynephage from Corynebacterium glutamicum ATCC 21792, was sequenced and annotated (GenBank: KY250482). The electron microscopy study of ϕ16 virion confirmed that it belongs to the family Siphoviridae. The ϕ16 genome consists of a linear double-stranded DNA molecule of 58,200 bp (G+C = 52.2%) with protruding cohesive 3'-ends of 14 nt. Four major structural proteins were separated by SDS-PAGE and identified by peptide mass fingerprinting technique. Using bioinformatics analysis, 101 putative ORFs and 5 tRNA genes were predicted. Only 27 putative gene products could be assigned to known biological functions. The ϕ16 genome was divided into functional modules. Seven putative promoters and eight putative unidirectional intrinsic terminators were predicted. One site of putative «-1» programmed ribosomal frameshifting was proposed in the phage tail assembly genome region. C. glutamicum genetic tools could be broadened by exploiting the known integrase gene (gp33) and the newly identified excisionase gene (gp47), participating in site-specific recombination between ϕ16-attP/attB.
Identification of positive selection in disease response genes within members of the Poaceae.

PubMed

Rech, Gabriel E; Vargas, Walter A; Sukno, Serenella A; Thon, Michael R

2012-12-01

Millions of years of coevolution between plants and pathogens can leave footprints on their genomes, and genes involved in this interaction are expected to show patterns of positive selection in which novel, beneficial alleles are rapidly fixed within the population. Using information about upregulated genes in maize during Colletotrichum graminicola infection and resources available in the Phytozome database, we looked for evidence of positive selection in the Poaceae lineage, acting on protein coding sequences related to plant defense. We found six genes with evidence of positive selection and another eight with sites showing episodic selection. Some of them have already been described as evolving under positive selection, but others are reported here for the first time including genes encoding isocitrate lyase, dehydrogenases, a multidrug transporter, a protein containing a putative leucine-rich repeat and other proteins with unknown functions. Mapping positively selected residues onto the predicted 3-D structure of proteins showed that most of them are located on the surface, where proteins are in contact with other molecules. We present here a set of Poaceae genes that are likely to be involved in plant defense mechanisms and have evidence of positive selection. These genes are excellent candidates for future functional validation.
A Glycine Riboswitch in Streptococcus pyogenes Controls Expression of a Sodium:Alanine Symporter Family Protein Gene.

PubMed

Khani, Afsaneh; Popp, Nicole; Kreikemeyer, Bernd; Patenge, Nadja

2018-01-01

Regulatory RNAs play important roles in the control of bacterial gene expression. In this study, we investigated gene expression regulation by a putative glycine riboswitch located in the 5'-untranslated region of a sodium:alanine symporter family (SAF) protein gene in the group A Streptococcus pyogenes serotype M49 strain 591. Glycine-dependent gene expression mediated by riboswitch activity was studied using a luciferase reporter gene system. Maximal reporter gene expression was observed in the absence of glycine and in the presence of low glycine concentrations. Differences in glycine-dependent gene expression were not based on differential promoter activity. Expression of the SAF protein gene and the downstream putative cation efflux protein gene was investigated in wild-type bacteria by RT-qPCR transcript analyses. During growth in the presence of glycine (≥1 mM), expression of both genes was downregulated. Northern blot analyses revealed premature transcription termination in the presence of high glycine concentrations. Growth in the presence of 0.1 mM glycine led to the production of a full-length transcript. Furthermore, stability of the SAF protein gene transcript was drastically reduced in the presence of glycine. We conclude that the putative glycine riboswitch in S. pyogenes serotype M49 strain 591 represses expression of the SAF protein gene and the downstream putative cation efflux protein gene in the presence of high glycine concentrations. Sequence and secondary structure comparisons indicated that the streptococcal riboswitch belongs to the class of tandem aptamer glycine riboswitches.
The transcriptomic and evolutionary signature of social interactions regulating honey bee caste development.

PubMed

Vojvodic, Svjetlana; Johnson, Brian R; Harpur, Brock A; Kent, Clement F; Zayed, Amro; Anderson, Kirk E; Linksvayer, Timothy A

2015-11-01

The caste fate of developing female honey bee larvae is strictly socially regulated by adult nurse workers. As a result of this social regulation, nurse-expressed genes as well as larval-expressed genes may affect caste expression and evolution. We used a novel transcriptomic approach to identify genes with putative direct and indirect effects on honey bee caste development, and we subsequently studied the relative rates of molecular evolution at these caste-associated genes. We experimentally induced the production of new queens by removing the current colony queen, and we used RNA sequencing to study the gene expression profiles of both developing larvae and their caregiving nurses before and after queen removal. By comparing the gene expression profiles of queen-destined versus worker-destined larvae as well as nurses observed feeding these two types of larvae, we identified larval and nurse genes associated with caste development. Of 950 differentially expressed genes associated with caste, 82% were expressed in larvae with putative direct effects on larval caste, and 18% were expressed in nurses with putative indirect effects on caste. Estimated selection coefficients suggest that both nurse and larval genes putatively associated with caste are rapidly evolving, especially those genes associated with worker development. Altogether, our results suggest that indirect effect genes play important roles in both the expression and evolution of socially influenced traits such as caste.
BTKbase, mutation database for X-linked agammaglobulinemia (XLA).

PubMed Central

Vihinen, M; Brandau, O; Brandén, L J; Kwan, S P; Lappalainen, I; Lester, T; Noordzij, J G; Ochs, H D; Ollila, J; Pienaar, S M; Riikonen, P; Saha, B K; Smith, C I

1998-01-01

X-linked agammaglobulinemia (XLA) is an immunodeficiency caused by mutations in the gene coding for Bruton's agammaglobulinemia tyrosine kinase (BTK). A database (BTKbase) of BTK mutations has been compiled and the recent update lists 463 mutation entries from 406 unrelated families showing 303 unique molecular events. In addition to mutations, the database also lists variants or polymorphisms. Each patient is given a unique patient identity number (PIN). Information is included regarding the phenotype including symptoms. Mutations in all the five domains of BTK have been noticed to cause the disease, the most common event being missense mutations. The mutations appear almost uniformly throughout the molecule and frequently affect CpG sites that code for arginine residues. The putative structural implications of all the missense mutations are given in the database. The improved version of the registry having a number of new features is available at http://www.helsinki.fi/science/signal/btkbase.html PMID:9399844
Identification of Putative Coffee Rust Mycoparasites via Single-Molecule DNA Sequencing of Infected Pustules

PubMed Central

Marino, John A.; Perfecto, Ivette; Vandermeer, John

2015-01-01

The interaction of crop pests with their natural enemies is fundamental to their control. Natural enemies of fungal pathogens of crops are poorly known relative to those of insect pests, despite the diversity of fungal pathogens and their economic importance. Currently, many regions across Latin America are experiencing unprecedented epidemics of coffee rust (Hemileia vastatrix). Identification of natural enemies of coffee rust could aid in developing management strategies or in pinpointing species that could be used for biocontrol. In the present study, we characterized fungal communities associated with coffee rust lesions by single-molecule DNA sequencing of fungal rRNA gene bar codes from leaf discs (≈28 mm²) containing rust lesions and control discs with no rust lesions. The leaf disc communities were hyperdiverse in terms of fungi, with up to 69 operational taxonomic units (putative species) per control disc, and the diversity was only slightly reduced in rust-infected discs, with up to 63 putative species. However, geography had a greater influence on the fungal community than whether the disc was infected by coffee rust. Through comparisons between control and rust-infected leaf discs, as well as taxonomic criteria, we identified 15 putative mycoparasitic fungi. These fungi are concentrated in the fungal family Cordycipitaceae and the order Tremellales. These data emphasize the complexity of diverse fungi of unknown ecological function within a leaf that might influence plant disease epidemics or lead to the development of species for biocontrol of fungal disease. PMID:26567299
Genomic analysis and temperature-dependent transcriptome profiles of the rhizosphere originating strain Pseudomonas aeruginosa M18

PubMed Central

2011-01-01

Background: Our previously published reports described an effective biocontrol agent named Pseudomonas sp. M18; its 16S rDNA sequence and several regulator genes share homologous sequences with those of P. aeruginosa, but it has several unusual phenotypic features. This study aims to explore its strain-specific genomic features and gene expression patterns at different temperatures. Results: The complete M18 genome is composed of a single chromosome of 6,327,754 base pairs containing 5684 open reading frames. Seven genomic islands, including two novel prophages and five specific non-phage islands, were identified besides the conserved P. aeruginosa core genome. Each prophage contains a putative chitinase coding gene, and prophage II contains a capB gene encoding a putative cold stress protein. The non-phage genomic islands contain genes responsible for pyoluteorin biosynthesis, environmental substance degradation and type I and III restriction-modification systems. Compared with other P. aeruginosa strains, the smallest number (3) of insertion sequences and the largest number (3) of clustered regularly interspaced short palindromic repeats in the M18 genome may contribute to its relative genome stability. Although the M18 genome is most closely related to that of P. aeruginosa strain LESB58, strain M18 is more susceptible to several antimicrobial agents and more easily cleared in a mouse acute lung infection model than strain LESB58. The whole M18 transcriptomic analysis indicated that 10.6% of the expressed genes are temperature-dependent, with 22 genes up-regulated at 28°C in three non-phage genomic islands and one prophage but none at 37°C. Conclusions: The P. aeruginosa strain M18 has evolved specific genomic structures and temperature-dependent expression patterns to meet the requirements of fitness and competitiveness under the selective pressures imposed on the strain in the rhizosphere niche. PMID:21884571
A resource for characterizing genome-wide binding and putative target genes of transcription factors expressed during secondary growth and wood formation in Populus

Treesearch

Lijun Liu; Trevor Ramsay; Matthew S. Zinkgraf; David Sundell; Nathaniel Robert Street; Vladimir Filkov; Andrew Groover

2015-01-01

Identifying transcription factor target genes is essential for modeling the transcriptional networks underlying developmental processes. Here we report a chromatin immunoprecipitation sequencing (ChIP-seq) resource consisting of genome-wide binding regions and associated putative target genes for four Populus homeodomain transcription factors...
Identification of Viscum album L. miRNAs and prediction of their medicinal values

PubMed Central

Adolf, Jacob; Melzig, Matthias F.

2017-01-01

MicroRNAs (miRNAs) are a class of single-stranded non-coding RNA molecules of approximately 22 nucleotides that play crucial roles in gene expression. It has been reported that plant miRNAs might enter the mammalian bloodstream and have a functional role in human metabolism, indicating that miRNAs might be one of the hidden bioactive ingredients in medicinal plants. Viscum album L. (Loranthaceae, European mistletoe) has been widely used for the treatment of cancer and cardiovascular diseases, but its functional compounds have not been well characterized. We considered that miRNAs might be involved in the pharmacological activities of V. album. High-throughput Illumina sequencing was performed to identify the novel and conserved miRNAs of V. album. The putative human targets were predicted. In total, 699 conserved miRNAs and 1373 novel miRNAs have been identified from V. album. Based on the combined use of the TargetScan, miRanda, PITA, and RNAhybrid methods, an intersection of 30697 potential human genes was predicted as putative targets of 29 novel miRNAs, while 14559 putative targets were highly enriched in 33 KEGG pathways. Interestingly, these highly enriched KEGG pathways were associated with some human diseases, especially cancer, cardiovascular diseases and neurological disorders, which might explain the clinical use as well as folk medicine use of mistletoe. However, further experimental validation is necessary to confirm these human targets of mistletoe miRNAs. Additionally, target genes involved in bioactive components synthesis in V. album were predicted as well. A total of 68 miRNAs were predicted to be involved in terpenoid biosynthesis, while two miRNAs including val-miR152 and miR9738 were predicted to target viscotoxins and lectins, respectively, which increased the knowledge regarding miRNA-based regulation of terpenoid biosynthesis, lectin and viscotoxin expressions in V. album. PMID:29112983
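The consensus step in the abstract above, keeping only the targets on which several prediction tools agree, amounts to a set intersection. The following is a minimal Python sketch of that idea, not the authors' pipeline: the helper name `consensus_targets` and the gene symbols are hypothetical, and real tool outputs would first have to be parsed into sets of gene identifiers.

```python
from functools import reduce


def consensus_targets(predictions):
    """Return the targets predicted by every tool (set intersection).

    `predictions` maps a tool name to an iterable of gene identifiers.
    """
    sets = [set(targets) for targets in predictions.values()]
    if not sets:
        return set()
    return reduce(set.intersection, sets)


# Hypothetical per-tool predictions for a single miRNA.
# The gene symbols are illustrative only and are not taken from the study.
predictions = {
    "TargetScan": {"TP53", "EGFR", "MYC", "KRAS"},
    "miRanda": {"TP53", "EGFR", "BRCA1"},
    "PITA": {"TP53", "EGFR", "MYC"},
    "RNAhybrid": {"EGFR", "TP53", "PTEN"},
}

consensus = consensus_targets(predictions)
print(sorted(consensus))
```

Requiring agreement among all tools trades sensitivity for specificity; a looser rule (for example, agreement of any three tools) would be a different design choice.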
Gut Microbiome and Putative Resistome of Inca and Italian Nobility Mummies

PubMed Central

Santiago-Rodriguez, Tasha M.; Luciani, Stefania; Toranzos, Gary A.; Marota, Isolina; Giuffra, Valentina; Cano, Raul J.

2017-01-01

Little is still known about the microbiome resulting from the process of mummification of the human gut. In the present study, the gut microbiota, genes associated with metabolism, and putative resistome of Inca and Italian nobility mummies were characterized by using high-throughput sequencing. The Italian nobility mummies exhibited a higher bacterial diversity as compared to the Inca mummies when using 16S ribosomal (rRNA) gene amplicon sequencing, but both groups showed bacterial and fungal taxa when using shotgun metagenomic sequencing that may resemble both the thanatomicrobiome and extant human gut microbiomes. Identification of sequences associated with plants, animals, and carbohydrate-active enzymes (CAZymes) may provide further insights into the dietary habits of Inca and Italian nobility mummies. Putative antibiotic-resistance genes in the Inca and Italian nobility mummies support a human gut resistome prior to the antibiotic therapy era. The higher proportion of putative antibiotic-resistance genes in the Inca compared to Italian nobility mummies may support the hypothesis that a greater exposure to the environment may result in a greater acquisition of antibiotic-resistance genes. The present study adds knowledge of the microbiome resulting from the process of mummification of the human gut, insights into ancient dietary habits, and the preserved putative human gut resistome prior to the antibiotic therapy era. PMID:29112136
Gut Microbiome and Putative Resistome of Inca and Italian Nobility Mummies.

PubMed

Santiago-Rodriguez, Tasha M; Fornaciari, Gino; Luciani, Stefania; Toranzos, Gary A; Marota, Isolina; Giuffra, Valentina; Cano, Raul J

2017-11-07

Little is still known about the microbiome resulting from the process of mummification of the human gut. In the present study, the gut microbiota, genes associated with metabolism, and putative resistome of Inca and Italian nobility mummies were characterized by using high-throughput sequencing. The Italian nobility mummies exhibited a higher bacterial diversity as compared to the Inca mummies when using 16S ribosomal (rRNA) gene amplicon sequencing, but both groups showed bacterial and fungal taxa when using shotgun metagenomic sequencing that may resemble both the thanatomicrobiome and extant human gut microbiomes. Identification of sequences associated with plants, animals, and carbohydrate-active enzymes (CAZymes) may provide further insights into the dietary habits of Inca and Italian nobility mummies. Putative antibiotic-resistance genes in the Inca and Italian nobility mummies support a human gut resistome prior to the antibiotic therapy era. The higher proportion of putative antibiotic-resistance genes in the Inca compared to Italian nobility mummies may support the hypothesis that a greater exposure to the environment may result in a greater acquisition of antibiotic-resistance genes. The present study adds knowledge of the microbiome resulting from the process of mummification of the human gut, insights into ancient dietary habits, and the preserved putative human gut resistome prior to the antibiotic therapy era.

Using reporter gene assays to identify cis regulatory differences between humans and chimpanzees.

PubMed

Chabot, Adrien; Shrit, Ralla A; Blekhman, Ran; Gilad, Yoav

2007-08-01

Most phenotypic differences between human and chimpanzee are likely to result from differences in gene regulation, rather than changes to protein-coding regions. To date, however, only a handful of human-chimpanzee nucleotide differences leading to changes in gene regulation have been identified. To home in on differences in regulatory elements between human and chimpanzee, we focused on 10 genes that were previously found to be differentially expressed between the two species. We then designed reporter gene assays for the putative human and chimpanzee promoters of the 10 genes. Of seven promoters that we found to be active in human liver cell lines, human and chimpanzee promoters had significantly different activity in four cases, three of which recapitulated the gene expression difference seen in the microarray experiment. For these three genes, we were therefore able to demonstrate that a change in cis influences expression differences between humans and chimpanzees. Moreover, using site-directed mutagenesis on one construct, the promoter for the DDA3 gene, we were able to identify three nucleotides that together lead to a cis regulatory difference between the species. High-throughput application of this approach can provide a map of regulatory element differences between humans and our close evolutionary relatives.
Systematic Phenotyping of a Large-Scale Candida glabrata Deletion Collection Reveals Novel Antifungal Tolerance Genes

PubMed Central

Hiller, Ekkehard; Istel, Fabian; Tscherner, Michael; Brunke, Sascha; Ames, Lauren; Firon, Arnaud; Green, Brian; Cabral, Vitor; Marcet-Houben, Marina; Jacobsen, Ilse D.; Quintin, Jessica; Seider, Katja; Frohner, Ingrid; Glaser, Walter; Jungwirth, Helmut; Bachellier-Bassi, Sophie; Chauvel, Murielle; Zeidler, Ute; Ferrandon, Dominique; Gabaldón, Toni; Hube, Bernhard; d'Enfert, Christophe; Rupp, Steffen; Cormack, Brendan; Haynes, Ken; Kuchler, Karl

2014-01-01

The opportunistic fungal pathogen Candida glabrata is a frequent cause of candidiasis, causing infections ranging from superficial to life-threatening disseminated disease. The inherent tolerance of C. glabrata to azole drugs makes this pathogen a serious clinical threat. To identify novel genes implicated in antifungal drug tolerance, we have constructed a large-scale C. glabrata deletion library consisting of 619 unique, individually bar-coded mutant strains, each lacking one specific gene, all together representing almost 12% of the genome. Functional analysis of this library in a series of phenotypic and fitness assays identified numerous genes required for growth of C. glabrata under normal or specific stress conditions, as well as a number of novel genes involved in tolerance to clinically important antifungal drugs such as azoles and echinocandins. We identified 38 deletion strains displaying strongly increased susceptibility to caspofungin, 28 of which lack genes encoding proteins that have not previously been linked to echinocandin tolerance. Our results demonstrate the potential of the C. glabrata mutant collection as a valuable resource in functional genomics studies of this important fungal pathogen of humans, and to facilitate the identification of putative novel antifungal drug target and virulence genes. PMID:24945925
Spectrum of mutations in leiomyosarcomas identified by clinical targeted next-generation sequencing.

PubMed

Lee, Paul J; Yoo, Naomi S; Hagemann, Ian S; Pfeifer, John D; Cottrell, Catherine E; Abel, Haley J; Duncavage, Eric J

2017-02-01

Recurrent genomic mutations in uterine and non-uterine leiomyosarcomas have not been well established. Using a next generation sequencing (NGS) panel of common cancer-associated genes, 25 leiomyosarcomas arising from multiple sites were examined to explore genetic alterations, including single nucleotide variants (SNV), small insertions/deletions (indels), and copy number alterations (CNA). Sequencing showed 86 non-synonymous, coding region somatic variants within 151 gene targets in 21 cases, with a mean of 4.1 variants per case; 4 cases had no putative mutations in the panel of genes assayed. The most frequently altered genes were TP53 (36%), ATM and ATRX (16%), and EGFR and RB1 (12%). CNA were identified in 85% of cases, with the most frequent copy number losses observed in chromosomes 10 and 13 including PTEN and RB1; the most frequent gains were seen in chromosomes 7 and 17. Our data show that deletions in canonical cancer-related genes are common in leiomyosarcomas. Further, the spectrum of gene mutations observed shows that defects in DNA repair and chromosomal maintenance are central to the biology of leiomyosarcomas, and that activating mutations observed in other common cancer types are rare in leiomyosarcomas. Copyright © 2017 Elsevier Inc. All rights reserved.
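The summary figures in the abstract above can be cross-checked with simple arithmetic. The Python sketch below assumes, as a reading of the abstract rather than something it states, that the gene frequencies are fractions of all 25 tumors; the function name `cases_from_percent` is hypothetical.

```python
# Figures quoted in the abstract.
total_cases = 25
cases_with_variants = 21
cases_without_variants = 4
total_variants = 86


def cases_from_percent(percent, n_cases):
    """Convert a reported percentage of cases into a whole-number case count."""
    return round(percent * n_cases / 100)


# Mean variants per case, taken over the 21 variant-positive cases.
mean_per_positive_case = total_variants / cases_with_variants

# Assumption: percentages are relative to all 25 tumors.
tp53_cases = cases_from_percent(36, total_cases)
atm_or_atrx_cases = cases_from_percent(16, total_cases)
egfr_or_rb1_cases = cases_from_percent(12, total_cases)

print(round(mean_per_positive_case, 1))
print(tp53_cases, atm_or_atrx_cases, egfr_or_rb1_cases)
```

Under that assumption the reported mean of 4.1 corresponds to 86 variants spread over the 21 variant-positive cases, not over all 25 tumors.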
Genome and Proteome Analysis of Rhodococcus erythropolis MI2: Elucidation of the 4,4´-Dithiodibutyric Acid Catabolism

PubMed Central

Khairy, Heba; Meinert, Christina; Wübbeler, Jan Hendrik; Poehlein, Anja; Daniel, Rolf; Voigt, Birgit; Riedel, Katharina; Steinbüchel, Alexander

2016-01-01

Rhodococcus erythropolis MI2 has the extraordinary ability to utilize the xenobiotic 4,4´-dithiodibutyric acid (DTDB). Cleavage of DTDB by the disulfide-reductase Nox, which is the only verified enzyme involved in DTDB degradation, yields 4-mercaptobutyric acid (4MB). 4MB could act as a building block of a novel polythioester with unknown properties. To completely unravel the catabolism of DTDB, the genome of R. erythropolis MI2 was sequenced, and subsequently the proteome was analyzed. The draft genome sequence consists of approximately 7.2 Mbp with an overall G+C content of 62.25% and 6,859 predicted protein-encoding genes. The genome of strain MI2 is composed of three replicons: one chromosome and two megaplasmids with sizes of 6.45, 0.4 and 0.35 Mbp, respectively. When cells of strain MI2 were cultivated with DTDB as sole carbon source and compared to cells grown with succinate, several interesting proteins with significantly higher expression levels were identified using 2D-PAGE and MALDI-TOF mass spectrometry. A putative luciferase-like monooxygenase-class F420-dependent oxidoreductase (RERY_05640), which is encoded by one of the 126 monooxygenase-encoding genes of the MI2 genome, showed a 3-fold increased expression level. This monooxygenase could oxidize the intermediate 4MB into 4-oxo-4-sulfanylbutyric acid. Next, a desulfurization step, which forms succinic acid and volatile hydrogen sulfide, is proposed. One gene coding for a putative desulfhydrase (RERY_06500) was identified in the genome of strain MI2. However, the gene product was not detected in the proteome analyses. In contrast, a significant expression level with a ratio of up to 7.3 was determined for a putative sulfide:quinone oxidoreductase (RERY_02710), which could also be involved in the abstraction of the sulfur group. In response to the toxicity of the intermediates, several stress response proteins were strongly expressed, including a superoxide dismutase (RERY_05600) and an osmotically induced protein (RERY_02670). Accordingly, novel insights into the catabolic pathway of DTDB were gained. PMID:27977722
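The genome figures quoted in the abstract above are mutually consistent, which a few lines of Python can confirm. This is only an arithmetic check on the reported numbers; the dictionary keys are hypothetical labels, since the abstract does not name the two megaplasmids.

```python
# Replicon sizes in Mbp as reported; the key names are illustrative labels.
replicon_sizes_mbp = {
    "chromosome": 6.45,
    "megaplasmid_1": 0.4,
    "megaplasmid_2": 0.35,
}
predicted_genes = 6859

total_mbp = sum(replicon_sizes_mbp.values())

# Average gene density across the whole draft genome.
genes_per_mbp = predicted_genes / total_mbp

print(round(total_mbp, 2))
print(round(genes_per_mbp))
```

The three replicons sum to the stated draft genome size of approximately 7.2 Mbp, which works out to roughly 950 predicted protein-encoding genes per Mbp.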
Genome and Proteome Analysis of Rhodococcus erythropolis MI2: Elucidation of the 4,4´-Dithiodibutyric Acid Catabolism.

PubMed

Khairy, Heba; Meinert, Christina; Wübbeler, Jan Hendrik; Poehlein, Anja; Daniel, Rolf; Voigt, Birgit; Riedel, Katharina; Steinbüchel, Alexander

2016-01-01

Rhodococcus erythropolis MI2 has the extraordinary ability to utilize the xenobiotic 4,4´-dithiodibutyric acid (DTDB). Cleavage of DTDB by the disulfide-reductase Nox, which is the only verified enzyme involved in DTDB degradation, yields 4-mercaptobutyric acid (4MB). 4MB could act as a building block of a novel polythioester with unknown properties. To completely unravel the catabolism of DTDB, the genome of R. erythropolis MI2 was sequenced, and subsequently the proteome was analyzed. The draft genome sequence consists of approximately 7.2 Mbp with an overall G+C content of 62.25% and 6,859 predicted protein-encoding genes. The genome of strain MI2 is composed of three replicons: one chromosome and two megaplasmids with sizes of 6.45, 0.4 and 0.35 Mbp, respectively. When cells of strain MI2 were cultivated with DTDB as sole carbon source and compared to cells grown with succinate, several interesting proteins with significantly higher expression levels were identified using 2D-PAGE and MALDI-TOF mass spectrometry. A putative luciferase-like monooxygenase-class F420-dependent oxidoreductase (RERY_05640), which is encoded by one of the 126 monooxygenase-encoding genes of the MI2 genome, showed a 3-fold increased expression level. This monooxygenase could oxidize the intermediate 4MB into 4-oxo-4-sulfanylbutyric acid. Next, a desulfurization step, which forms succinic acid and volatile hydrogen sulfide, is proposed. One gene coding for a putative desulfhydrase (RERY_06500) was identified in the genome of strain MI2. However, the gene product was not detected in the proteome analyses. In contrast, a significant expression level with a ratio of up to 7.3 was determined for a putative sulfide:quinone oxidoreductase (RERY_02710), which could also be involved in the abstraction of the sulfur group. In response to the toxicity of the intermediates, several stress response proteins were strongly expressed, including a superoxide dismutase (RERY_05600) and an osmotically induced protein (RERY_02670). Accordingly, novel insights into the catabolic pathway of DTDB were gained.
Molecular cloning and evolutionary analysis of the calcium-modulated contractile protein, centrin, in green algae and land plants.

PubMed

Bhattacharya, D; Steinkötter, J; Melkonian, M

1993-12-01

Centrin (= caltractin) is a ubiquitous, cytoskeletal protein which is a member of the EF-hand superfamily of calcium-binding proteins. A centrin-coding cDNA was isolated and characterized from the prasinophyte green alga Scherffelia dubia. Centrin PCR amplification primers were used to isolate partial, homologous cDNA sequences from the green algae Tetraselmis striata and Spermatozopsis similis. Annealing analyses suggested that centrin is a single-copy-coding region in T. striata and S. similis and other green algae studied. Centrin-coding regions from S. dubia, S. similis and T. striata encode four colinear EF-hand domains which putatively bind calcium. Phylogenetic analyses, including homologous sequences from Chlamydomonas reinhardtii and the land plant Atriplex nummularia, demonstrate that the domains of centrins are congruent and arose from the two-fold duplication of an ancestral EF hand with Domains 1+3 and Domains 2+4 clustering. The domains of centrins are also congruent with those of calmodulins demonstrating that, like calmodulin, centrin is an ancient protein which arose within the ancestor of all eukaryotes via gene duplication. Phylogenetic relationships inferred from centrin-coding region comparisons mirror results of small subunit ribosomal RNA sequence analyses suggesting that centrin-coding regions are useful evolutionary markers within the green algae.
RhoA Regulation of Cardiomyocyte Differentiation

PubMed Central

Kaarbø, Mari; Crane, Denis I.; Murrell, Wayne G.

2013-01-01

Earlier findings from our laboratory implicated RhoA in heart developmental processes. To investigate factors that potentially regulate RhoA expression, RhoA gene organisation and promoter activity were analysed. Comparative analysis indicated strict conservation of both gene organisation and coding sequence of the chick, mouse, and human RhoA genes. Bioinformatics analysis of the derived promoter region of mouse RhoA identified putative consensus sequence binding sites for several transcription factors involved in heart formation and organogenesis generally. Using luciferase reporter assays, RhoA promoter activity was shown to increase in mouse-derived P19CL6 cells that were induced to differentiate into cardiomyocytes. Overexpression of a dominant negative mutant of mouse RhoA (mRhoAN19) blocked this cardiomyocyte differentiation of P19CL6 cells and led to the accumulation of the cardiac transcription factors SRF and GATA4 and the early cardiac marker cardiac α-actin. Taken together, these findings indicate a fundamental role for RhoA in the differentiation of cardiomyocytes. PMID:23935420
Isolation and characterization of the promoter sequence of a cassava gene coding for Pt2L4, a glutamic acid-rich protein differentially expressed in storage roots.

PubMed

de Souza, C R; Aragão, F J; Moreira, E C O; Costa, C N M; Nascimento, S B; Carvalho, L J

2009-03-24

Cassava is one of the most important tropical food crops for more than 600 million people worldwide. Transgenic technologies can be useful for increasing its nutritional value and its resistance to viral diseases and insect pests. However, tissue-specific promoters that guarantee correct expression of transgenes would be necessary. We used inverse polymerase chain reaction to isolate a promoter sequence of the Mec1 gene coding for Pt2L4, a glutamic acid-rich protein differentially expressed in cassava storage roots. In silico analysis revealed putative cis-acting regulatory elements within this promoter sequence, including root-specific elements that may be required for its expression in vascular tissues. Transient expression experiments showed that the Mec1 promoter is functional, since this sequence was able to drive GUS expression in bean embryonic axes. Results from our computational analysis can serve as a guide for functional experiments to identify regions with tissue-specific Mec1 promoter activity. The DNA sequence that we identified is a new promoter that could be a candidate for genetic engineering of cassava roots.
Evolutionary impact of transposable elements on genomic diversity and lineage-specific innovation in vertebrates.

PubMed

Warren, Ian A; Naville, Magali; Chalopin, Domitille; Levin, Perrine; Berger, Chloé Suzanne; Galiana, Delphine; Volff, Jean-Nicolas

2015-09-01

Since their discovery, a growing body of evidence has emerged demonstrating that transposable elements are important drivers of species diversity. These mobile elements exhibit a great variety in structure, size and mechanisms of transposition, making them important putative actors in organism evolution. The vertebrates represent a highly diverse and successful lineage that has adapted to a wide range of different environments. These animals also possess a rich repertoire of transposable elements, with highly diverse content between lineages and even between species. Here, we review how transposable elements are driving genomic diversity and lineage-specific innovation within vertebrates. We discuss the large differences in TE content between different vertebrate groups and then go on to look at how they affect organisms at a variety of levels: from the structure of chromosomes to their involvement in the regulation of gene expression, as well as in the formation and evolution of non-coding RNAs and protein-coding genes. In the process of doing this, we highlight how transposable elements have been involved in the evolution of some of the key innovations observed within the vertebrate lineage, driving the group's diversity and success.
Phylogenetic and comparative gene expression analysis of barley (Hordeum vulgare) WRKY transcription factor family reveals putatively retained functions between monocots and dicots

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mangelsen, Elke; Kilian, Joachim; Berendzen, Kenneth W.

2008-02-01

WRKY proteins belong to the WRKY-GCM1 superfamily of zinc finger transcription factors that have been subject to a large plant-specific diversification. For the cereal crop barley (Hordeum vulgare), three different WRKY proteins have been characterized so far, as regulators in sucrose signaling, in pathogen defense, and in response to cold and drought, respectively. However, their phylogenetic relationship remained unresolved. In this study, we used the available sequence information to identify a minimum number of 45 barley WRKY transcription factor (HvWRKY) genes. According to their structural features the HvWRKY factors were classified into the previously defined polyphyletic WRKY subgroups 1 to 3. Furthermore, we could assign putative orthologs of the HvWRKY proteins in Arabidopsis and rice. While in most cases clades of orthologous proteins were formed within each group or subgroup, other clades were composed of paralogous proteins for the grasses and Arabidopsis only, which is indicative of specific gene radiation events. To gain insight into their putative functions, we examined expression profiles of WRKY genes from publicly available microarray data resources and found group specific expression patterns. While putative orthologs of the HvWRKY transcription factors have been inferred from phylogenetic sequence analysis, we performed a comparative expression analysis of WRKY genes in Arabidopsis and barley. Indeed, highly correlative expression profiles were found between some of the putative orthologs. HvWRKY genes have not only undergone radiation in monocot or dicot species, but exhibit evolutionary traits specific to grasses. HvWRKY proteins exhibited not only sequence similarities with their Arabidopsis orthologs, but also relatedness in their expression patterns. This correlative expression is indicative of a putative conserved function of related WRKY proteins in mono- and dicot species.
DLEU2 encodes an antisense RNA for the putative bicistronic RFP2/LEU5 gene in humans and mouse.

PubMed

Corcoran, Martin M; Hammarsund, Marianne; Zhu, Chaoyong; Lerner, Mikael; Kapanadze, Bagrat; Wilson, Bill; Larsson, Catharina; Forsberg, Lars; Ibbotson, Rachel E; Einhorn, Stefan; Oscier, David G; Grandér, Dan; Sangfelt, Olle

2004-08-01

Our group previously identified two novel genes, RFP2/LEU5 and DLEU2, within a 13q14.3 genomic region of loss seen in various malignancies. However, no specific inactivating mutations were found in these or other genes in the vicinity of the deletion, suggesting that a nonclassical tumor-suppressor mechanism may be involved. Here, we present data showing that the DLEU2 gene encodes a putative noncoding antisense RNA, with one exon directly overlapping the first exon of the RFP2/LEU5 gene in the opposite orientation. In addition, the RFP2/LEU5 transcript can be alternatively spliced to produce either several monocistronic transcripts or a putative bicistronic transcript encoding two separate open-reading frames, adding to the complexity of the locus. The finding that these gene structures are conserved in the mouse, including the putative bicistronic RFP2/LEU5 transcript as well as the antisense relationship with DLEU2, further underlines the significance of this unusual organization and suggests a biological function for DLEU2 in the regulation of RFP2/LEU5. Copyright 2004 Wiley-Liss, Inc.
Nucleotide polymorphisms in the bovine lymphotoxin A gene and their distribution among Bos indicus zebu cattle breeds.

PubMed

Behl, Jyotsna Dhingra; Mishra, Priyanka; Verma, N K; Niranjan, S K; Dangi, P S; Sharma, Rekha; Behl, Rahul

2016-03-15

The present study characterized the genetic variation in the lymphotoxin A (LTA) gene in 40 animals of 5 diverse Bos indicus Indian zebu cattle breeds. The gene encodes the lymphotoxin A protein, also known as tumor necrosis factor beta, a cytokine produced by lymphocytes that is cytotoxic for a wide range of tumor cells both in vitro and in vivo and is essential for normal immunological development. These breeds survive under the harsh tropical climatic conditions of various parts of the Indian subcontinent. The LTA gene in the present study was observed to contain 33 SNPs and 3 small insertion/deletion polymorphisms. Four SNPs occurred in the coding regions of the gene, viz. g.1327A>G and g.1400C>T in exon 2 and g.1840C>T and g.1942C>T in exon 3, of which the SNP g.1327A>G in exon 2 resulted in a non-synonymous amino acid change G38D. This amino acid change was, however, predicted not to affect the protein function in any manner. The gene contained putative transcription factor binding sites for the c-Rel and Pax-4 transcription factors. A putative promoter region was also predicted on the reverse DNA strand from position 894 to 644. Several repeat elements and microsatellite repeats were detected across the 3.2 kb LTA gene sequence. The study showed the occurrence of 40 genotypes and 48 most probable haplotypes. The genotypes at the observed SNP positions in the LTA gene were in near Hardy-Weinberg equilibrium. A negative Tajima's D value that was not statistically significant at P>0.10 indicated that the neutral mutation hypothesis could not be excluded. The genetic variations observed in the LTA gene in the present study have not been reported earlier and could possibly be used as molecular markers for further studies involving association of the gene variability with disease resistance/tolerance traits. Copyright © 2015 Elsevier B.V. All rights reserved.
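The abstract above reports that genotypes were in near Hardy-Weinberg equilibrium but does not say which test was used. As a minimal sketch, assuming a biallelic SNP and a standard chi-square goodness-of-fit comparison, the check could look like the following. The function name and the genotype counts are hypothetical and are not taken from the study.

```python
def hardy_weinberg_chi2(n_aa, n_ab, n_bb):
    """Return (allele frequency p, chi-square statistic) for one biallelic SNP.

    n_aa, n_ab, n_bb are observed genotype counts (hypothetical inputs).
    Expected counts under Hardy-Weinberg are p^2*n, 2pq*n and q^2*n.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("at least one genotyped animal is required")
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p                      # frequency of allele B
    observed = (n_aa, n_ab, n_bb)
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    # Skip classes with zero expectation to avoid division by zero.
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    return p, chi2


if __name__ == "__main__":
    # Illustrative counts for 40 animals; not data from the abstract.
    p, chi2 = hardy_weinberg_chi2(10, 20, 10)
    print(f"p = {p:.2f}, chi2 = {chi2:.2f}")
```

With one degree of freedom, a statistic below about 3.84 is conventionally read as no significant departure from equilibrium at the 5% level.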
Whole exome sequencing in recurrent early pregnancy loss.

PubMed

Qiao, Ying; Wen, Jiadi; Tang, Flamingo; Martell, Sally; Shomer, Naomi; Leung, Peter C K; Stephenson, Mary D; Rajcan-Separovic, Evica

2016-05-01

Exome sequencing can identify genetic causes of idiopathic recurrent pregnancy loss (RPL). We identified compound heterozygous deleterious mutations affecting DYNC2H1 and ALOX15 in two out of four families with RPL. Both genes have a role in early development. Bioinformatics analysis of all genes with rare and putatively pathogenic mutations in miscarriages and couples showed enrichment in pathways relevant to pregnancy loss, including the complement and coagulation cascades pathways. Next generation sequencing (NGS) is increasingly being used to identify known and novel gene mutations in children with developmental delay and in fetuses with ultrasound-detected anomalies. In contrast, NGS is rarely used to study pregnancy loss. Chromosome microarray analysis detects putatively causative DNA copy number variants (CNVs) in ∼2% of miscarriages and CNVs of unknown significance (predominantly parental in origin) in up to 40% of miscarriages. Therefore, a large number of miscarriages still have an unknown cause. Whole exome sequencing (WES) was performed using the Illumina HiSeq 2000 platform on seven euploid miscarriages from four families with RPL. Golden Helix SVS v8.1.5 was used for data assessment and inheritance analysis for deleterious DNA variants predicted to severely disrupt protein-coding genes by introducing a frameshift, loss of the stop codon, gain of the stop codon, changes in splicing or the initial codon. Webgestalt (http://bioinfo.vanderbilt.edu/webgestalt/) was used for pathway and disease association enrichment analysis of a gene pool containing putatively pathogenic variants in miscarriages and couples in comparison to control gene pools. Compound heterozygous mutations in DYNC2H1 and ALOX15 were identified in miscarriages from two families with RPL. DYNC2H1 is involved in cilia biogenesis and has been associated with fetal lethality in humans. ALOX15 is expressed in placenta and its dysregulation has been associated with inflammation, placental dysfunction, abnormal oxidative stress response and angiogenesis. The pool of putatively pathogenic single nucleotide variants (SNVs) and small insertions and deletions (indels) detected in the miscarriages showed enrichment in 'complement and coagulation cascades pathway', and 'ciliary motility disorders'. We conclude that CNVs, individual SNVs and a pool of deleterious gene mutations identified by exome sequencing could contribute to RPL. The size of our sample cohort is small. The functional effect of candidate mutations should be evaluated to determine whether the mutations are causative. This is the first study to assess whether SNVs may contribute to the pathogenesis of miscarriage. Furthermore, our findings suggest that the collective effect of mutations in relevant biological pathways could be implicated in RPL. The study was funded by Canadian Institutes of Health Research (grant MOP 106467) and Michael Smith Foundation of Health Research Career Scholar salary award to ERS. © The Author 2016. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Whole exome sequencing in recurrent early pregnancy loss

PubMed Central

Qiao, Ying; Wen, Jiadi; Tang, Flamingo; Martell, Sally; Shomer, Naomi; Leung, Peter C.K.; Stephenson, Mary D.; Rajcan-Separovic, Evica

2016-01-01

STUDY HYPOTHESIS Exome sequencing can identify genetic causes of idiopathic recurrent pregnancy loss (RPL). STUDY FINDING We identified compound heterozygous deleterious mutations affecting DYNC2H1 and ALOX15 in two out of four families with RPL. Both genes have a role in early development. Bioinformatics analysis of all genes with rare and putatively pathogenic mutations in miscarriages and couples showed enrichment in pathways relevant to pregnancy loss, including the complement and coagulation cascades pathways. WHAT IS KNOWN ALREADY Next generation sequencing (NGS) is increasingly being used to identify known and novel gene mutations in children with developmental delay and in fetuses with ultrasound-detected anomalies. In contrast, NGS is rarely used to study pregnancy loss. Chromosome microarray analysis detects putatively causative DNA copy number variants (CNVs) in ∼2% of miscarriages and CNVs of unknown significance (predominantly parental in origin) in up to 40% of miscarriages. Therefore, a large number of miscarriages still have an unknown cause. STUDY DESIGN, SAMPLES/MATERIALS, METHODS Whole exome sequencing (WES) was performed using the Illumina HiSeq 2000 platform on seven euploid miscarriages from four families with RPL. Golden Helix SVS v8.1.5 was used for data assessment and inheritance analysis for deleterious DNA variants predicted to severely disrupt protein-coding genes by introducing a frameshift, loss of the stop codon, gain of the stop codon, changes in splicing or the initial codon. Webgestalt (http://bioinfo.vanderbilt.edu/webgestalt/) was used for pathway and disease association enrichment analysis of a gene pool containing putatively pathogenic variants in miscarriages and couples in comparison to control gene pools. MAIN RESULTS AND THE ROLE OF CHANCE Compound heterozygous mutations in DYNC2H1 and ALOX15 were identified in miscarriages from two families with RPL. DYNC2H1 is involved in cilia biogenesis and has been associated with fetal lethality in humans. ALOX15 is expressed in placenta and its dysregulation has been associated with inflammation, placental dysfunction, abnormal oxidative stress response and angiogenesis. The pool of putatively pathogenic single nucleotide variants (SNVs) and small insertions and deletions (indels) detected in the miscarriages showed enrichment in ‘complement and coagulation cascades pathway’, and ‘ciliary motility disorders’. We conclude that CNVs, individual SNVs and a pool of deleterious gene mutations identified by exome sequencing could contribute to RPL. LIMITATIONS, REASONS FOR CAUTION The size of our sample cohort is small. The functional effect of candidate mutations should be evaluated to determine whether the mutations are causative. WIDER IMPLICATIONS OF THE FINDINGS This is the first study to assess whether SNVs may contribute to the pathogenesis of miscarriage. Furthermore, our findings suggest that the collective effect of mutations in relevant biological pathways could be implicated in RPL. STUDY FUNDING AND COMPETING INTEREST(S) The study was funded by Canadian Institutes of Health Research (grant MOP 106467) and Michael Smith Foundation of Health Research Career Scholar salary award to ERS. PMID:26826164
Cloning, sequencing, and expression of the gene encoding cyclic 2, 3-diphosphoglycerate synthetase, the key enzyme of cyclic 2, 3-diphosphoglycerate metabolism in Methanothermus fervidus.

PubMed

Matussek, K; Moritz, P; Brunner, N; Eckerskorn, C; Hensel, R

1998-11-01

Cyclic 2,3-diphosphoglycerate synthetase (cDPGS) catalyzes the synthesis of cyclic 2,3-diphosphoglycerate (cDPG) by formation of an intramolecular phosphoanhydride bond in 2,3-diphosphoglycerate. cDPG is known to be accumulated to high intracellular concentrations (>300 mM) as a putative thermoadapter in some hyperthermophilic methanogens. For the first time, we have purified active cDPGS from a methanogen, the hyperthermophilic archaeon Methanothermus fervidus, sequenced the coding gene, and expressed it in Escherichia coli. cDPGS purification resulted in enzyme preparations containing two isoforms differing in their electrophoretic mobility under denaturing conditions. Since both polypeptides showed the same N-terminal amino acid sequence and Southern analyses indicate the presence of only one gene coding for cDPGS in M. fervidus, the two polypeptides originate from the same gene but differ by a not yet identified modification. The native cDPGS represents a dimer with an apparent molecular mass of 112 kDa and catalyzes the reversible formation of the intramolecular phosphoanhydride bond at the expense of ATP. The enzyme shows a clear preference for the synthetic reaction: the substrate affinity and the Vmax of the synthetic reaction are a factor of 8 to 10 higher than the corresponding values for the reverse reaction. Comparison with the kinetic properties of the electrophoretically homogeneous, apparently unmodified recombinant enzyme from E. coli revealed a twofold-higher Vmax of the enzyme from M. fervidus in the synthesizing direction.
Biocomputational identification and validation of novel microRNAs predicted from bubaline whole genome shotgun sequences.

PubMed

Manku, H K; Dhanoa, J K; Kaur, S; Arora, J S; Mukhopadhyay, C S

2017-10-01

MicroRNAs (miRNAs) are small (19-25 bases long), non-coding RNAs that regulate post-transcriptional gene expression by cleaving targeted mRNAs in several eukaryotes. The miRNAs play vital roles in multiple biological and metabolic processes, including developmental timing, signal transduction, cell maintenance and differentiation, diseases and cancers. Experimental identification of microRNAs is expensive and labor-intensive. Alternatively, computational approaches for predicting putative miRNAs from genomic or exomic sequences rely on features of miRNAs, viz. secondary structures, sequence conservation, minimum free energy index (MFEI), etc. To date, not a single miRNA has been identified in buffalo (Bubalus bubalis), which is an economically important livestock species. The present study aims at predicting the putative miRNAs of buffalo using a comparative computational approach from buffalo whole genome shotgun sequencing data (INSDC: AWWX00000000.1). The sequences were blasted against the known mammalian miRNAs. The obtained miRNAs were then passed through a series of filtration criteria to obtain the set of predicted (putative and novel) bubaline miRNAs. Eight miRNAs were selected based on lowest E-value and validated by real time PCR (SYBR green chemistry) using RNU6 as endogenous control. The results from different trials of real time PCR show that out of the selected 8 miRNAs, only 2 (hsa-miR-1277-5p; bta-miR-2285b) are not expressed in bubaline PBMCs. The potential target genes based on their sequence complementarities were then predicted using miRanda. This work is the first report on prediction of bubaline miRNA from whole genome sequencing data followed by experimental validation. The finding could pave the way to future studies in economically important traits in buffalo. Copyright © 2017 Elsevier Ltd. All rights reserved.
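The abstract above names the minimum free energy index (MFEI) as a filtering feature without defining it. A minimal sketch of one commonly used definition follows: the adjusted MFE (absolute MFE per 100 nucleotides) divided by the G+C percentage. It assumes the minimum free energy has already been obtained from an external RNA folding tool. The function names, the toy sequence and the energy value are hypothetical, and the study's own thresholds are not stated in the abstract.

```python
def gc_percent(seq):
    """Percentage of G and C bases in an RNA or DNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def mfei(seq, mfe_kcal_per_mol):
    """Minimum free energy index of a candidate precursor.

    mfe_kcal_per_mol is the folding energy reported by an external tool
    (usually negative); its absolute value is used here by convention.
    """
    amfe = abs(mfe_kcal_per_mol) / len(seq) * 100.0  # adjusted MFE per 100 nt
    gc = gc_percent(seq)
    if gc == 0:
        raise ValueError("MFEI is undefined for a sequence without G or C")
    return amfe / gc


if __name__ == "__main__":
    # Toy 80-nt sequence with 50% G+C and an assumed MFE of -32 kcal/mol.
    candidate = "GCAU" * 20
    print(f"MFEI = {mfei(candidate, -32.0):.2f}")
```

Higher MFEI values are generally taken as more hairpin-like, so a pipeline of this kind would typically keep candidates above a chosen cut-off.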
Chromosome-based survey sequencing reveals the genome organization of wild wheat progenitor Triticum dicoccoides.

PubMed

Akpinar, Bala Ani; Biyiklioglu, Sezgi; Alptekin, Burcu; Havránková, Miroslava; Vrána, Jan; Doležel, Jaroslav; Distelfeld, Assaf; Hernandez, Pilar; Budak, Hikmet

2018-05-04

Wild emmer wheat (Triticum turgidum ssp. dicoccoides) is the progenitor of wheat. We performed chromosome-based survey sequencing of the 14 chromosomes, examining repetitive sequences, protein-coding genes, miRNA/target pairs and tRNA genes, as well as syntenic relationships with related grasses. We found considerable differences in the content and distribution of repetitive sequences between the A and B subgenomes. The gene contents of individual chromosomes varied widely, not necessarily correlating with chromosome size. We catalogued candidate agronomically important loci, along with new alleles and flanking sequences that can be used to design exome sequencing. Syntenic relationships and virtual gene orders revealed several small-scale evolutionary rearrangements, in addition to providing evidence for the 4AL-5AL-7BS translocation in wild emmer wheat. Chromosome-based sequence assemblies contained five novel miRNA families, among 59 families putatively encoded in the entire genome which provide insight into the domestication of wheat and an overview of the genome content and organization. This article is protected by copyright. All rights reserved.
Genomicus update 2015: KaryoView and MatrixView provide a genome-wide perspective to multispecies comparative genomics

PubMed Central

Louis, Alexandra; Nguyen, Nga Thi Thuy; Muffato, Matthieu; Roest Crollius, Hugues

2015-01-01

The Genomicus web server (http://www.genomicus.biologie.ens.fr/genomicus) is a visualization tool allowing comparative genomics in four different phyla (Vertebrate, Fungi, Metazoan and Plants). It provides access to genomic information from extant species, as well as ancestral gene content and gene order for vertebrates and flowering plants. Here we present the new features available for vertebrate genome with a focus on new graphical tools. The interface to enter the database has been improved, two pairwise genome comparison tools are now available (KaryoView and MatrixView) and the multiple genome comparison tools (PhyloView and AlignView) propose three new kinds of representation and a more intuitive menu. These new developments have been implemented for Genomicus portal dedicated to vertebrates. This allows the analysis of 68 extant animal genomes, as well as 58 ancestral reconstructed genomes. The Genomicus server also provides access to ancestral gene orders, to facilitate evolutionary and comparative genomics studies, as well as computationally predicted regulatory interactions, thanks to the representation of conserved non-coding elements with their putative gene targets. PMID:25378326
Microbial culturomics to isolate halophilic bacteria from table salt: genome sequence and description of the moderately halophilic bacterium Bacillus salis sp. nov.

PubMed

Seck, E H; Diop, A; Armstrong, N; Delerce, J; Fournier, P-E; Raoult, D; Khelaifia, S

2018-05-01

Bacillus salis strain ES3 T (= CSUR P1478 = DSM 100598) is the type strain of B. salis sp. nov. It is an aerobic, Gram-positive, moderately halophilic, motile and spore-forming bacterium. It was isolated from commercial table salt as part of a broad culturomics study aiming to maximize the culture conditions for the in-depth exploration of halophilic bacteria in salty food. Here we describe the phenotypic characteristics of this isolate, its complete genome sequence and annotation, together with a comparison with closely related bacteria. Phylogenetic analysis based on 16S rRNA gene sequences indicated 97.5% similarity with Bacillus aquimaris, the closest species. The 8 329 771 bp long genome (one chromosome, no plasmids) exhibits a G+C content of 39.19%. It is composed of 18 scaffolds with 29 contigs. Of the 8303 predicted genes, 8109 were protein-coding genes and 194 were RNAs. A total of 5778 genes (71.25%) were assigned a putative function.
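The gene counts in the abstract above can be cross-checked with a few lines of arithmetic. This sketch only reuses the reported numbers and suggests that the 71.25% figure is relative to the protein-coding genes rather than to all predicted genes. The variable and function names are illustrative.

```python
# Figures as reported in the abstract.
PREDICTED_GENES = 8303
PROTEIN_CODING = 8109
RNA_GENES = 194
WITH_PUTATIVE_FUNCTION = 5778


def percent(part, whole):
    """Percentage of `part` in `whole`, rounded to two decimals."""
    return round(100.0 * part / whole, 2)


if __name__ == "__main__":
    # Protein-coding and RNA genes should add up to the predicted total.
    print(PROTEIN_CODING + RNA_GENES == PREDICTED_GENES)
    # Share of genes with a putative function, by two possible denominators.
    print(percent(WITH_PUTATIVE_FUNCTION, PROTEIN_CODING))
    print(percent(WITH_PUTATIVE_FUNCTION, PREDICTED_GENES))
```

Only the first denominator reproduces the percentage given in the abstract.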
Complete genome sequence of the bacteriochlorophyll a-containing Roseibacterium elongatum type strain (DSM 19469T), a representative of the Roseobacter group isolated from Australian coast sand

PubMed Central

Riedel, Thomas; Fiebig, Anne; Göker, Markus; Klenk, Hans-Peter

2014-01-01

Roseibacterium elongatum Suzuki et al. 2006 is a pink-pigmented and bacteriochlorophyll a-producing representative of the Roseobacter group within the alphaproteobacterial family Rhodobacteraceae. Representatives of the marine ‘Roseobacter group’ were found to be abundant in the ocean and play an important role in global and biogeochemical processes. In the present study we describe the features of R. elongatum strain OCh 323T together with its genome sequence and annotation. The 3,555,102 bp long genome consists of one circular chromosome with no extrachromosomal elements and is one of the smallest known Roseobacter genomes. It contains 3,540 protein-coding genes and 59 RNA genes. Genome analysis revealed the presence of a photosynthetic gene cluster, which putatively enables a photoheterotrophic lifestyle. Gene sequences associated with quorum sensing, motility, surface attachment, and thiosulfate and carbon monoxide oxidation could be detected. The genome was sequenced as part of the activities of the Transregional Collaborative Research Centre 51 (TRR51) funded by the German Research Foundation (DFG). PMID:25197467

Complete genome sequence of the bacteriochlorophyll a-containing Roseibacterium elongatum type strain (DSM 19469(T)), a representative of the Roseobacter group isolated from Australian coast sand.

PubMed

Riedel, Thomas; Fiebig, Anne; Göker, Markus; Klenk, Hans-Peter

2014-06-15

Roseibacterium elongatum Suzuki et al. 2006 is a pink-pigmented and bacteriochlorophyll a-producing representative of the Roseobacter group within the alphaproteobacterial family Rhodobacteraceae. Representatives of the marine 'Roseobacter group' were found to be abundant in the ocean and play an important role in global and biogeochemical processes. In the present study we describe the features of R. elongatum strain OCh 323(T) together with its genome sequence and annotation. The 3,555,102 bp long genome consists of one circular chromosome with no extrachromosomal elements and is one of the smallest known Roseobacter genomes. It contains 3,540 protein-coding genes and 59 RNA genes. Genome analysis revealed the presence of a photosynthetic gene cluster, which putatively enables a photoheterotrophic lifestyle. Gene sequences associated with quorum sensing, motility, surface attachment, and thiosulfate and carbon monoxide oxidation could be detected. The genome was sequenced as part of the activities of the Transregional Collaborative Research Centre 51 (TRR51) funded by the German Research Foundation (DFG).
Whole-genome sequencing reveals clonal expansion of multiresistant Staphylococcus haemolyticus in European hospitals.

PubMed

Cavanagh, Jorunn Pauline; Hjerde, Erik; Holden, Matthew T G; Kahlke, Tim; Klingenberg, Claus; Flægstad, Trond; Parkhill, Julian; Bentley, Stephen D; Sollid, Johanna U Ericson

2014-11-01

Staphylococcus haemolyticus is an emerging cause of nosocomial infections, primarily affecting immunocompromised patients. A comparative genomic analysis was performed on clinical S. haemolyticus isolates to investigate their genetic relationship and explore the coding sequences with respect to antimicrobial resistance determinants and putative hospital adaptation. Whole-genome sequencing was performed on 134 isolates of S. haemolyticus from geographically diverse origins (Belgium, 2; Germany, 10; Japan, 13; Norway, 54; Spain, 2; Switzerland, 43; UK, 9; USA, 1). Each genome was individually assembled. Protein coding sequences (CDSs) were predicted and homologous genes were categorized into three types: Type I, core genes, homologues present in all strains; Type II, unique core genes, homologues shared by only a subgroup of strains; and Type III, unique genes, strain-specific CDSs. The phylogenetic relationship between the isolates was built from variable sites in the form of single nucleotide polymorphisms (SNPs) in the core genome and used to construct a maximum likelihood phylogeny. SNPs in the genome core regions divided the isolates into one major group of 126 isolates and one minor group of isolates with highly diverse genomes. The major group was further subdivided into seven clades (A-G), of which four (A-D) encompassed isolates only from Europe. Antimicrobial multiresistance was observed in 77.7% of the collection. High levels of homologous recombination were detected in genes involved in adherence, staphylococcal host adaptation and bacterial cell communication. The presence of several successful and highly resistant clones underlines the adaptive potential of this opportunistic pathogen. © The Author 2014. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy.
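The three-way categorization of homologous genes described above (Type I core, Type II unique core, Type III unique) can be sketched as a simple presence/absence classification. This is an illustration under stated assumptions, not the authors' pipeline: homologue groups are assumed to be already clustered, and the function name, group identifiers and strain labels are hypothetical.

```python
def classify_homologue_groups(presence, all_strains):
    """Label each homologue group by how many strains carry it.

    presence: dict mapping a group id to the set of strains containing it.
    all_strains: iterable with every strain in the comparison.
    """
    all_strains = frozenset(all_strains)
    labels = {}
    for group_id, strains in presence.items():
        strains = frozenset(strains)
        if not strains or not strains <= all_strains:
            raise ValueError(f"group {group_id!r} has invalid strain membership")
        if strains == all_strains:
            labels[group_id] = "Type I (core)"           # present in all strains
        elif len(strains) == 1:
            labels[group_id] = "Type III (unique)"       # strain-specific CDS
        else:
            labels[group_id] = "Type II (unique core)"   # shared by a subgroup
    return labels


if __name__ == "__main__":
    strains = {"isolate_A", "isolate_B", "isolate_C"}
    groups = {
        "grp1": {"isolate_A", "isolate_B", "isolate_C"},
        "grp2": {"isolate_A", "isolate_B"},
        "grp3": {"isolate_C"},
    }
    for gid, label in sorted(classify_homologue_groups(groups, strains).items()):
        print(gid, label)
```

Keeping the strain list as an explicit argument avoids silently redefining "all strains" when a strain happens to carry no clustered genes.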
Whole-genome sequencing reveals clonal expansion of multiresistant Staphylococcus haemolyticus in European hospitals

PubMed Central

Cavanagh, Jorunn Pauline; Hjerde, Erik; Holden, Matthew T. G.; Kahlke, Tim; Klingenberg, Claus; Flægstad, Trond; Parkhill, Julian; Bentley, Stephen D.; Sollid, Johanna U. Ericson

2014-01-01

Objectives Staphylococcus haemolyticus is an emerging cause of nosocomial infections, primarily affecting immunocompromised patients. A comparative genomic analysis was performed on clinical S. haemolyticus isolates to investigate their genetic relationship and explore the coding sequences with respect to antimicrobial resistance determinants and putative hospital adaptation. Methods Whole-genome sequencing was performed on 134 isolates of S. haemolyticus from geographically diverse origins (Belgium, 2; Germany, 10; Japan, 13; Norway, 54; Spain, 2; Switzerland, 43; UK, 9; USA, 1). Each genome was individually assembled. Protein coding sequences (CDSs) were predicted and homologous genes were categorized into three types: Type I, core genes, homologues present in all strains; Type II, unique core genes, homologues shared by only a subgroup of strains; and Type III, unique genes, strain-specific CDSs. The phylogenetic relationship between the isolates was built from variable sites in the form of single nucleotide polymorphisms (SNPs) in the core genome and used to construct a maximum likelihood phylogeny. Results SNPs in the genome core regions divided the isolates into one major group of 126 isolates and one minor group of isolates with highly diverse genomes. The major group was further subdivided into seven clades (A–G), of which four (A–D) encompassed isolates only from Europe. Antimicrobial multiresistance was observed in 77.7% of the collection. High levels of homologous recombination were detected in genes involved in adherence, staphylococcal host adaptation and bacterial cell communication. Conclusions The presence of several successful and highly resistant clones underlines the adaptive potential of this opportunistic pathogen. PMID:25038069
Pathogenomic Inference of Virulence-Associated Genes in Leptospira interrogans

PubMed Central

Lehmann, Jason S.; Fouts, Derrick E.; Haft, Daniel H.; Cannella, Anthony P.; Ricaldi, Jessica N.; Brinkac, Lauren; Harkins, Derek; Durkin, Scott; Sanka, Ravi; Sutton, Granger; Moreno, Angelo; Vinetz, Joseph M.; Matthias, Michael A.

2013-01-01

Leptospirosis is a globally important, neglected zoonotic infection caused by spirochetes of the genus Leptospira. Since genetic transformation remains technically limited for pathogenic Leptospira, a systems biology pathogenomic approach was used to infer leptospiral virulence genes by whole genome comparison of culture-attenuated Leptospira interrogans serovar Lai with its virulent, isogenic parent. Among the 11 pathogen-specific protein-coding genes in which non-synonymous mutations were found, a putative soluble adenylate cyclase with host cell cAMP-elevating activity, and two members of a previously unstudied ∼15 member paralogous gene family of unknown function were identified. This gene family was also uniquely found in the alpha-proteobacteria Bartonella bacilliformis and Bartonella australis that are geographically restricted to the Andes and Australia, respectively. How the pathogenic Leptospira and these two Bartonella species came to share this expanded gene family remains an evolutionary mystery. In vivo expression analyses demonstrated up-regulation of 10/11 Leptospira genes identified in the attenuation screen, and profound in vivo, tissue-specific up-regulation by members of the paralogous gene family, suggesting a direct role in virulence and host-pathogen interactions. The pathogenomic experimental design here is generalizable as a functional systems biology approach to studying bacterial pathogenesis and virulence and should encourage similar experimental studies of other pathogens. PMID:24098822
Pathogenomic inference of virulence-associated genes in Leptospira interrogans.

PubMed

Lehmann, Jason S; Fouts, Derrick E; Haft, Daniel H; Cannella, Anthony P; Ricaldi, Jessica N; Brinkac, Lauren; Harkins, Derek; Durkin, Scott; Sanka, Ravi; Sutton, Granger; Moreno, Angelo; Vinetz, Joseph M; Matthias, Michael A

2013-01-01

Leptospirosis is a globally important, neglected zoonotic infection caused by spirochetes of the genus Leptospira. Since genetic transformation remains technically limited for pathogenic Leptospira, a systems biology pathogenomic approach was used to infer leptospiral virulence genes by whole genome comparison of culture-attenuated Leptospira interrogans serovar Lai with its virulent, isogenic parent. Among the 11 pathogen-specific protein-coding genes in which non-synonymous mutations were found, a putative soluble adenylate cyclase with host cell cAMP-elevating activity, and two members of a previously unstudied ∼15 member paralogous gene family of unknown function were identified. This gene family was also uniquely found in the alpha-proteobacteria Bartonella bacilliformis and Bartonella australis that are geographically restricted to the Andes and Australia, respectively. How the pathogenic Leptospira and these two Bartonella species came to share this expanded gene family remains an evolutionary mystery. In vivo expression analyses demonstrated up-regulation of 10/11 Leptospira genes identified in the attenuation screen, and profound in vivo, tissue-specific up-regulation by members of the paralogous gene family, suggesting a direct role in virulence and host-pathogen interactions. The pathogenomic experimental design here is generalizable as a functional systems biology approach to studying bacterial pathogenesis and virulence and should encourage similar experimental studies of other pathogens.
Assessment of the Antimicrobial Activity and the Entomocidal Potential of Bacillus thuringiensis Isolates from Algeria

PubMed Central

Djenane, Zahia; Nateche, Farida; Amziane, Meriam; Gomis-Cebolla, Joaquín; El-Aichar, Fairouz; Khorf, Hassiba; Ferré, Juan

2017-01-01

This work represents the first initiative to analyze the distribution of B. thuringiensis in Algeria and to evaluate the biological potential of the isolates. A total of 157 isolates were recovered, with at least one isolate in 94.4% of the samples. The highest Bt index was found in samples from rhizospheric soil (0.48) and from the Mediterranean area (0.44). Most isolates showed antifungal activity (98.5%), in contrast to the few that had antibacterial activity (29.9%). A high genetic diversity was made evident by the finding of many different crystal shapes and various combinations of shapes within a single isolate (in 58.4% of the isolates). Also, over 50% of the isolates harbored cry1, cry2, or cry9 genes, and 69.3% contained a vip3 gene. A good correlation between the presence of chitinase genes and antifungal activity was observed. More than half of the isolates with a broad spectrum of antifungal activity harbored both endochitinase and exochitinase genes. Interestingly, 15 isolates contained the two chitinase genes and all of the above cry family genes, with some of them harboring a vip3 gene as well. The combination of this large number of genes coding for entomopathogenic proteins suggests a putative wide range of entomotoxic activity. PMID:28406460
'Candidatus Phytoplasma phoenicium' associated with almond witches'-broom disease: from draft genome to genetic diversity among strain populations.

PubMed

Quaglino, Fabio; Kube, Michael; Jawhari, Maan; Abou-Jawdah, Yusuf; Siewert, Christin; Choueiri, Elia; Sobh, Hana; Casati, Paola; Tedeschi, Rosemarie; Lova, Marina Molino; Alma, Alberto; Bianco, Piero Attilio

2015-07-30

Almond witches'-broom (AlmWB), a devastating disease of almond, peach and nectarine in Lebanon, is associated with 'Candidatus Phytoplasma phoenicium'. In the present study, we generated a draft genome sequence of 'Ca. P. phoenicium' strain SA213, representative of phytoplasma strain populations from different host plants, and determined the genetic diversity among phytoplasma strain populations by phylogenetic analyses of 16S rRNA, groEL, tufB and inmp gene sequences. Sequence-based typing and phylogenetic analysis of the gene inmp, coding an integral membrane protein, distinguished AlmWB-associated phytoplasma strains originating from diverse host plants, whereas their 16S rRNA, tufB and groEL genes shared 100 % sequence identity. Moreover, dN/dS analysis indicated positive selection acting on inmp gene. Additionally, the analysis of 'Ca. P. phoenicium' draft genome revealed the presence of integral membrane proteins and effector-like proteins and potential candidates for interaction with hosts. One of the integral membrane proteins was predicted as BI-1, an inhibitor of apoptosis-promoting Bax factor. Bioinformatics analyses revealed the presence of putative BI-1 in draft and complete genomes of other 'Ca. Phytoplasma' species. The genetic diversity within 'Ca. P. phoenicium' strain populations in Lebanon suggested that AlmWB disease could be associated with phytoplasma strains derived from the adaptation of an original strain to diverse hosts. Moreover, the identification of a putative inhibitor of apoptosis-promoting Bax factor (BI-1) in 'Ca. P. phoenicium' draft genome and within genomes of other 'Ca. Phytoplasma' species suggested its potential role as a phytoplasma fitness-increasing factor by modification of the host-defense response.
Blunt Snout Bream (Megalobrama amblycephala) MyD88 and TRAF6: Characterisation, Comparative Homology Modelling and Expression

PubMed Central

Tran, Ngoc Tuan; Liu, Han; Jakovlić, Ivan; Wang, Wei-Min

2015-01-01

MyD88 and TRAF6 play an essential role in the innate immune response in most animals. This study reports the full-length MaMyD88 and MaTRAF6 genes identified from the blunt snout bream (Megalobrama amblycephala) transcriptome profile. MaMyD88 is 2501 base pairs (bp) long, encoding a putative protein of 284 amino acids (aa), including the N-terminal DEATH domain of 78 aa and the C-terminal TIR domain of 138 aa. MaTRAF6 is 2252 bp long, encoding a putative protein of 542 aa, including the N-terminal low-complexity region, RING domain (40 aa), a coiled-coil region (64 aa) and C-terminal MATH domain (147 aa). Coding regions of MaMyD88 and MaTRAF6 genomic sequences consisted of five and six exons, respectively. Physicochemical and functional characteristics of the proteins were analysed. Alpha helices were dominant in the secondary structure of the proteins. Homology models of the MaMyD88 and MaTRAF6 domains were constructed applying the comparative modelling method. RT-qPCR was used to analyse the expression of MaMyD88 and MaTRAF6 mRNA transcripts in response to Aeromonas hydrophila challenge. Both genes were highly upregulated in the liver, spleen and kidney during the first 24 h after the challenge. While MyD88 and TRAF6 have been reported in various aquatic species, this is the first report and characterisation of these genes in blunt snout bream. This research also provides evidence of the important roles of these two genes in the blunt snout bream innate immune system. PMID:25830478
Molecular diagnosis of putative Stargardt disease probands by exome sequencing

PubMed Central

2012-01-01

Background The commonest genetic form of juvenile or early adult onset macular degeneration is Stargardt Disease (STGD) caused by recessive mutations in the gene ABCA4. However, high phenotypic and allelic heterogeneity and a small but non-trivial amount of locus heterogeneity currently impede conclusive molecular diagnosis in a significant proportion of cases. Methods We performed whole exome sequencing (WES) of nine putative Stargardt Disease probands and searched for potentially disease-causing genetic variants in previously identified retinal or macular dystrophy genes. Follow-up dideoxy sequencing was performed for confirmation and to screen for mutations in an additional set of affected individuals lacking a definitive molecular diagnosis. Results Whole exome sequencing revealed seven likely disease-causing variants across four genes, providing a confident genetic diagnosis in six previously uncharacterized participants. We identified four previously missed mutations in ABCA4 across three individuals. Likely disease-causing mutations in RDS/PRPH2, ELOVL, and CRB1 were also identified. Conclusions Our findings highlight the enormous potential of whole exome sequencing in Stargardt Disease molecular diagnosis and research. WES adequately assayed all coding sequences and canonical splice sites of ABCA4 in this study. Additionally, WES enables the identification of disease-related alleles in other genes. This work highlights the importance of collecting parental genetic material for WES testing as the current knowledge of human genome variation limits the determination of causality between identified variants and disease. While larger sample sizes are required to establish the precision and accuracy of this type of testing, this study supports WES for inherited early onset macular degeneration disorders as an alternative to standard mutation screening techniques. PMID:22863181
Sequencing of GJB2 in Cameroonians and Black South Africans and comparison to 1000 Genomes Project Data Support Need to Revise Strategy for Discovery of Nonsyndromic Deafness Genes in Africans.

PubMed

Bosch, Jason; Noubiap, Jean Jacques N; Dandara, Collet; Makubalo, Nomlindo; Wright, Galen; Entfellner, Jean-Baka Domelevo; Tiffin, Nicki; Wonkam, Ambroise

2014-11-01

Mutations in the GJB2 gene, encoding connexin 26, could account for 50% of congenital, nonsyndromic, recessive deafness cases in some Caucasian/Asian populations. There is a scarcity of published data in sub-Saharan Africans. We Sanger sequenced the coding region of the GJB2 gene in 205 Cameroonian and Xhosa South Africans with congenital, nonsyndromic deafness, and performed bioinformatic analysis of variations in the GJB2 gene, incorporating data from the 1000 Genomes Project. Amongst Cameroonian patients, 26.1% were familial. The majority of patients (70%) suffered from sensorineural hearing loss. Ten GJB2 genetic variants were detected by sequencing. A previously reported pathogenic mutation, g.3741_3743delTTC (p.F142del), and a putative pathogenic mutation, g.3816G>A (p.V167M), were identified in single heterozygous samples. Amongst the eight remaining variants, two novel variants, g.3318-41G>A and g.3332G>A, were reported. There were no statistically significant differences in allele frequencies between cases and controls. Principal Components Analyses differentiated between Africans, Asians, and Europeans, but only explained 40% of the variation. The present study is the first to compare African GJB2 sequences with the data from the 1000 Genomes Project and has revealed the low variation between population groups. This finding has emphasized the hypothesis that the prevalence of mutations in GJB2 in nonsyndromic deafness amongst European and Asian populations is due to founder effects arising after these individuals migrated out of Africa, and not to a putative "protective" variant in the genomic structure of GJB2 in Africans. Our results confirm that mutations in GJB2 are not associated with nonsyndromic deafness in Africans.
Cloning and characterisation of type 4 fimbrial genes from Actinobacillus pleuropneumoniae.

PubMed

Stevenson, Andrew; Macdonald, Julie; Roberts, Mark

2003-03-20

Actinobacillus pleuropneumoniae is the cause of porcine pleuropneumonia. Little is known about the mechanisms by which A. pleuropneumoniae colonises the respiratory tract. Fimbriae are common mediators of bacterial adherence to mucosal epithelia and have been observed on the surface of A. pleuropneumoniae cells. Here we report the identification and characterisation of the type 4 fimbrial structural gene (apfA) from A. pleuropneumoniae. In addition, a number of open reading frames were identified in A. pleuropneumoniae that have significant homology to type 4 fimbrial biogenesis genes from other species, including a putative leader-specific peptidase (apfD). A. pleuropneumoniae apfA codes for a predicted polypeptide of approximately 16 kDa; removal of the leader sequence at the predicted cleavage site would yield a 14.5 kDa polypeptide. The first 30 residues of the mature polypeptide are well conserved with other members of the group A type 4 fimbriae family. The signal sequence of ApfA is 13 amino acids in length and, unusually, the residue that precedes the cleavage site is alanine rather than glycine, which is found in most other type 4 fimbriae. The C-terminus of ApfA possesses cysteine residues that are conserved in type 4 fimbriae of many species. In other type 4 fimbriae the distal C-terminal cysteines form a disulphide bond that produces a loop, which is important for the function of fimbriae and also comprises a major antigenic determinant. A motif within the predicted loop in ApfA was found to be highly conserved in type 4 fimbriae of other HAP organisms (Haemophilus, Actinobacillus, Pasteurella). The A. pleuropneumoniae type 4 fimbrial biogenesis genes showed the strongest homology to putative type 4 fimbrial genes of Haemophilus ducreyi. The A. pleuropneumoniae apfA gene was shown to be present and highly conserved in different serotypes of A. pleuropneumoniae. Recombinant ApfA was produced and used to raise anti-ApfA antisera.
Comparative Transcriptomics to Identify Novel Genes and Pathways in Dinoflagellates

NASA Astrophysics Data System (ADS)

Ryan, D.

2016-02-01

The unarmored dinoflagellate Karenia brevis is among the most prominent harmful, bloom-forming phytoplankton species in the Gulf of Mexico. During blooms, the polyketides PbTx-1 and PbTx-2 (brevetoxins) are produced by K. brevis. Brevetoxins negatively impact human health and the Gulf shellfish harvest. However, the genes underlying brevetoxin synthesis are currently unknown. Because the K. brevis genome is extremely large (1 × 10^11 base pairs long), and with a high proportion of repetitive, non-coding DNA, it has not been sequenced. In fact, large, repetitive genomes are common among the dinoflagellate group. High-throughput RNA sequencing technology enabled us to assemble Karenia transcriptomes de novo and investigate potential genes in the brevetoxin pathway through comparative transcriptomics. The brevetoxin profile varies among K. brevis clonal cultures. For example, well-documented Wilson-CCFWC268 typically produces 8-10 pg PbTx per cell, whereas SP1 produces < 2 pg PbTx/cell, and the mutant low-toxin Wilson clone produces undetectable to low (<0.05 pg/cell) amounts. Further, PbTx-2 has been measured in Karenia papilionacea but not Karenia mikimotoi. We compared the transcriptomes of four K. brevis clones (Wilson-CCFWC268, SP3, SP1, and mutant low-toxin Wilson) with K. papilionacea and K. mikimotoi to investigate nucleotide-level genetic variations and differences in gene expression. Of the 85,000 transcripts in the K. brevis transcriptome, 4,600 transcripts, including novel unannotated orthologs and putative polyketide synthases (PKSs), were only expressed by brevetoxin-producing K. brevis and K. papilionacea, not K. mikimotoi. Examination of gene expression between the typical- and low-toxin Wilson clones identified about 3,500 genes with significantly different expression levels, including 2 putative PKSs. One of the 2 PKSs was only found in the brevetoxin-producing Karenia species. These transcriptomes could not have been characterized without high-throughput RNA sequencing.
Comparative Transcriptome Profiles of Near-Isogenic Hexaploid Wheat Lines Differing for Effective Alleles at the 2DL FHB Resistance QTL

PubMed Central

Biselli, Chiara; Bagnaresi, Paolo; Faccioli, Primetta; Hu, Xinkun; Balcerzak, Margaret; Mattera, Maria G.; Yan, Zehong; Ouellet, Therese; Cattivelli, Luigi; Valè, Giampiero

2018-01-01

Fusarium head blight (FHB), caused by the fungus Fusarium graminearum, represents one of the major wheat diseases worldwide, determining severe yield losses and reduction of grain quality due to the accumulation of mycotoxins. The molecular response associated with the wheat 2DL FHB resistance QTL was mined through a comprehensive transcriptomic analysis of the early response to F. graminearum infection, at 3 days post-inoculation, in spikelets and rachis. The analyses were conducted on two near isogenic lines (NILs) differing for the presence of the 2DL QTL (2-2618, resistant 2DL+ and 2-2890, susceptible null). The general response to fungal infection in terms of mRNA accumulation trend was similar in both NILs, even though it involved a higher number of DEGs in the susceptible NIL, and included down-regulation of the primary and energy metabolism, up-regulation of enzymes implicated in lignin and phenylpropanoid biosynthesis, activation of hormone biosynthesis and signal transduction pathways and genes involved in redox homeostasis and transcriptional regulation. The search for candidate genes with expression profiles associated with the 2DL QTL for FHB resistance led to the discovery of processes differentially modulated in the R and S NILs related to cell wall metabolism, sugar and JA signaling, signal reception and transduction, regulation of the redox status and transcription factors. Wheat FHB response-related miRNAs differentially regulated were also identified as putatively implicated in the superoxide dismutase activities and affecting genes regulating responses to biotic/abiotic stresses and auxin signaling. Altered gene expression was also observed for fungal non-coding RNAs. The putative targets of two of these were represented by the wheat gene WIR1A, involved in resistance response, and a gene encoding a jacalin-related lectin protein, which participates in biotic and abiotic stress response, supporting the presence of a cross-talk between the plant and the fungus. PMID:29434615
High-quality permanent draft genome sequence of the extremely osmotolerant diphenol degrading bacterium Halotalea alkalilenta AW-7T, and emended description of the genus Halotalea

DOE PAGES

Ntougias, Spyridon; Lapidus, Alla; Copeland, Alex; ...

2015-08-13

Members of the genus Halotalea (family Halomonadaceae) are of high significance since they can tolerate the greatest glucose and maltose concentrations ever reported for known bacteria and are involved in the degradation of industrial effluents. Here, the characteristics and the permanent-draft genome sequence and annotation of Halotalea alkalilenta AW-7T are described. The microorganism was sequenced as a part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project at the DOE Joint Genome Institute, and it is the only strain within the genus Halotalea having its genome sequenced. The genome is 4,467,826 bp long and consists of 40 scaffolds with 64.62 % average GC content. A total of 4,104 genes were predicted, comprising 4,028 protein-coding and 76 RNA genes. Most protein-coding genes (87.79 %) were assigned to a putative function. Halotalea alkalilenta AW-7T encodes the catechol and protocatechuate degradation to β-ketoadipate via the β-ketoadipate and protocatechuate ortho-cleavage degradation pathway, and it possesses the genetic ability to detoxify fluoroacetate, cyanate and acrylonitrile. Lastly, an emended description of the genus Halotalea Ntougias et al. 2007 is also provided in order to describe the delayed fermentation ability of the type strain.
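The annotation figures quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration: the function name and the derived quantities (gene density, mean scaffold length) are assumptions introduced here and are not reported in the record itself.

```python
def summarize_annotation(genome_bp, scaffolds, total_genes,
                         protein_coding, rna_genes, pct_with_function):
    """Cross-check genome annotation counts (hypothetical helper, not from the paper)."""
    return {
        # Protein-coding and RNA genes should add up to the predicted total.
        "counts_consistent": protein_coding + rna_genes == total_genes,
        # Number of protein-coding genes implied by the quoted percentage.
        "genes_with_putative_function": round(protein_coding * pct_with_function / 100),
        # Derived values, not stated in the abstract.
        "genes_per_kb": total_genes / (genome_bp / 1000),
        "mean_scaffold_bp": genome_bp / scaffolds,
    }


# Values as quoted in the abstract for Halotalea alkalilenta AW-7T.
stats = summarize_annotation(
    genome_bp=4_467_826, scaffolds=40, total_genes=4_104,
    protein_coding=4_028, rna_genes=76, pct_with_function=87.79,
)

if __name__ == "__main__":
    for key, value in stats.items():
        print(f"{key}: {value}")
```

Under these inputs the two gene classes sum to the stated total, and 87.79 % of 4,028 corresponds to roughly 3,536 protein-coding genes with a putative function.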
Novel AroA from Pseudomonas putida Confers Tobacco Plant with High Tolerance to Glyphosate

PubMed Central

Yan, Hai-Qin; Chang, Su-Hua; Tian, Zhe-Xian; Zhang, Le; Sun, Yi-Cheng; Li, Yan; Wang, Jing; Wang, Yi-Ping

2011-01-01

Glyphosate is a non-selective broad-spectrum herbicide that inhibits 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS, also designated as AroA), a key enzyme in the aromatic amino acid biosynthesis pathway in microorganisms and plants. Previously, we reported that a novel AroA (PpAroA1) from Pseudomonas putida had high tolerance to glyphosate, with little homology to class I or class II glyphosate-tolerant AroA. In this study, the coding sequence of PpAroA1 was optimized for tobacco. For maturation of the enzyme in chloroplast, a chloroplast transit peptide coding sequence was fused in frame with the optimized aroA gene (PparoA1optimized) at the 5′ end. The PparoA1optimized gene was introduced into the tobacco (Nicotiana tabacum L. cv. W38) genome via Agrobacterium-mediated transformation. The transformed explants were first screened in shoot induction medium containing kanamycin. Then glyphosate tolerance was assayed in putative transgenic plants and their T1 progeny. Our results show that the PpAroA1 from Pseudomonas putida can efficiently confer tobacco plants with high glyphosate tolerance. Transgenic tobacco overexpressing the PparoA1optimized gene exhibits high tolerance to glyphosate, which suggests that the novel PpAroA1 is a good new candidate for use in future glyphosate-tolerant transgenic crops. PMID:21611121
Characterization by Suppression Subtractive Hybridization of Transcripts That Are Differentially Expressed in Leaves of Anthracnose-Resistant Ramie Cultivar.

PubMed

Xuxia, Wang; Jie, Chen; Bo, Wang; Lijun, Liu; Hui, Jiang; Diluo, Tang; Dingxiang, Peng

2012-01-01

For the purpose of screening putative anthracnose resistance-related genes of ramie (Boehmeria nivea L. Gaud), a cDNA library was constructed by suppression subtractive hybridization using anthracnose-resistant cultivar Huazhu no. 4. The cDNAs from Huazhu no. 4, which were infected with Colletotrichum gloeosporioides, were used as the tester and cDNAs from uninfected Huazhu no. 4 as the driver. Sequencing analysis and homology searching showed that these clones represented 132 single genes, which were assigned to functional categories, including 14 putative cellular functions, according to categories established for Arabidopsis. These 132 genes included 35 disease resistance and stress tolerance-related genes including putative heat-shock protein 90, metallothionein, PR-1.2 protein, catalase gene, WRKY family genes, and proteinase inhibitor-like protein. Partial disease-related genes were further analyzed by reverse transcription PCR and RNA gel blot. These expressed sequence tags are the first anthracnose resistance-related expressed sequence tags reported in ramie.
Isolation, molecular cloning and in vitro expression of rhesus monkey (Macaca mulatta) prominin-1.s1 complementary DNA encoding a potential hematopoietic stem cell antigen.

PubMed

Husain, S M; Shou, Y; Sorrentino, B P; Handgretinger, R

2006-10-01

Human prominin-1 (CD133 or AC133) is an important cell surface marker used to isolate primitive hematopoietic stem cells. The commercially available antibody to human prominin-1 does not recognize rhesus prominin-1. Therefore, we isolated, cloned and characterized the complementary DNA (cDNA) of rhesus prominin-1 gene and determined its coding potential. Following the nomenclature of prominin family of genes, we named this cDNA as rhesus prominin-1.s1. The amino acid sequence data of the putative rhesus prominin-1.s1 could be used in designing antigenic peptides to raise antibodies for use in isolation of pure populations of rhesus prominin-1(+) hematopoietic cells. To the best of our knowledge, there has been no previously published report about the isolation of a prominin-1 cDNA from rhesus monkey (Macaca mulatta).
Missing genes, multiple ORFs, and C-to-U type RNA editing in Acrasis kona (Heterolobosea, Excavata) mitochondrial DNA.

PubMed

Fu, Cheng-Jie; Sheikh, Sanea; Miao, Wei; Andersson, Siv G E; Baldauf, Sandra L

2014-08-21

Discoba (Excavata) is an ancient group of eukaryotes with great morphological and ecological diversity. Unlike the other major divisions of Discoba (Jakobida and Euglenozoa), little is known about the mitochondrial DNAs (mtDNAs) of Heterolobosea. We have assembled a complete mtDNA genome from the aggregating heterolobosean amoeba, Acrasis kona, which consists of a single circular highly AT-rich (83.3%) molecule of 51.5 kb. Unexpectedly, A. kona mtDNA is missing roughly 40% of the protein-coding genes and nearly half of the transfer RNAs found in the only other sequenced heterolobosean mtDNAs, those of Naegleria spp. Instead, over a quarter of A. kona mtDNA consists of novel open reading frames. Eleven of the 16 protein-coding genes missing from A. kona mtDNA were identified in its nuclear DNA and polyA RNA, and phylogenetic analyses indicate that at least 10 of these 11 putative nuclear-encoded mitochondrial (NcMt) proteins arose by direct transfer from the mitochondrion. Acrasis kona mtDNA also employs C-to-U type RNA editing, and 12 homologs of DYW-type pentatricopeptide repeat (PPR) proteins implicated in plant organellar RNA editing are found in A. kona nuclear DNA. A mapping of mitochondrial gene content onto a consensus phylogeny reveals a sporadic pattern of relative stasis and rampant gene loss in Discoba. Rampant loss occurred independently in the unique common lineage leading to Heterolobosea + Tsukubamonadida and later in the unique lineage leading to Acrasis. Meanwhile, mtDNA gene content appears to be remarkably stable in the Acrasis sister lineage leading to Naegleria and in their distant relatives Jakobida. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Genome-wide transcription start site profiling in biofilm-grown Burkholderia cenocepacia J2315.

PubMed

Sass, Andrea M; Van Acker, Heleen; Förstner, Konrad U; Van Nieuwerburgh, Filip; Deforce, Dieter; Vogel, Jörg; Coenye, Tom

2015-10-13

Burkholderia cenocepacia is a soil-dwelling Gram-negative Betaproteobacterium with an important role as opportunistic pathogen in humans. Infections with B. cenocepacia are very difficult to treat due to their high intrinsic resistance to most antibiotics. Biofilm formation further adds to their antibiotic resistance. B. cenocepacia harbours a large, multi-replicon genome with a high GC-content; the reference genome of strain J2315 includes 7374 annotated genes. This study aims to annotate transcription start sites and identify novel transcripts on a whole genome scale. RNA extracted from B. cenocepacia J2315 biofilms was analysed by differential RNA-sequencing and the resulting dataset compared to data derived from conventional, global RNA-sequencing. Transcription start sites were annotated and further analysed according to their position relative to annotated genes. Four thousand ten transcription start sites were mapped over the whole B. cenocepacia genome and the primary transcription start sites of 2089 genes expressed in B. cenocepacia biofilms were defined. For 64 genes a start codon alternative to the annotated one was proposed. Substantial antisense transcription for 105 genes and two novel protein coding sequences were identified. The distribution of internal transcription start sites can be used to identify genomic islands in B. cenocepacia. A potassium pump strongly induced only under biofilm conditions was found and 15 non-coding small RNAs highly expressed in biofilms were discovered. Mapping transcription start sites across the B. cenocepacia genome added relevant information to the J2315 annotation. Genes and novel regulatory RNAs putatively involved in B. cenocepacia biofilm formation were identified. These findings will help in understanding regulation of B. cenocepacia biofilm formation.
Transcriptome-Derived Tetranucleotide Microsatellites and Their Associated Genes from the Giant Panda (Ailuropoda melanoleuca).

PubMed

Song, Xuhao; Shen, Fujun; Huang, Jie; Huang, Yan; Du, Lianming; Wang, Chengdong; Fan, Zhenxin; Hou, Rong; Yue, Bisong; Zhang, Xiuyue

2016-09-01

Recently, an increasing number of microsatellites or simple sequence repeats (SSRs) have been found and characterized from transcriptomes. Such SSRs can be employed as putative functional markers to easily tag corresponding genes, which play an important role in biomedical studies and genetic analysis. However, the transcriptome-derived SSRs for giant panda (Ailuropoda melanoleuca) are not yet available. In this work, we identified and characterized 20 tetranucleotide microsatellite loci from a transcript database generated from the blood of giant panda. Furthermore, we assigned their predicted transcriptome locations: 16 loci were assigned to untranslated regions (UTRs) and 4 loci were assigned to coding regions (CDSs). Gene identities of 14 transcripts containing corresponding microsatellites were determined, which provide useful information to study the potential contribution of SSRs to gene regulation in giant panda. The polymorphic information content (PIC) values ranged from 0.293 to 0.789 with an average of 0.603 for the 16 UTR-derived SSRs. Interestingly, 4 CDS-derived microsatellites developed in our study were also polymorphic, and the instability of these 4 CDS-derived SSRs was further validated by re-genotyping and sequencing. The genes containing these 4 CDS-derived SSRs were embedded with various types of repeat motifs. The interaction of all the length-changing SSRs might provide a way against coding region frameshift caused by microsatellite instability. We hope these new gene-associated biomarkers will pave the way for genetic and biomedical studies for giant panda in the future. In sum, this set of transcriptome-derived markers complements the genetic resources available for giant panda. © The American Genetic Association. 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
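The abstract reports PIC values without stating how they were computed. A minimal sketch follows, assuming the commonly used definition of polymorphic information content, PIC = 1 - Σ p_i² - Σ_{i<j} 2 p_i² p_j², where p_i are the allele frequencies at a locus. The function name and the example frequencies are hypothetical and are not taken from the study.

```python
from typing import Sequence


def polymorphic_information_content(freqs: Sequence[float]) -> float:
    """PIC of one locus from its allele frequencies (assumed standard formula)."""
    if any(p < 0 for p in freqs) or abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must be non-negative and sum to 1")
    # Expected homozygosity: sum of squared allele frequencies.
    homozygosity = sum(p * p for p in freqs)
    # Correction for matings between identical heterozygotes,
    # summed over all unordered allele pairs i < j.
    cross = 0.0
    for i in range(len(freqs)):
        for j in range(i + 1, len(freqs)):
            cross += 2 * freqs[i] ** 2 * freqs[j] ** 2
    return 1.0 - homozygosity - cross


if __name__ == "__main__":
    # Hypothetical locus with four equally frequent alleles.
    print(polymorphic_information_content([0.25, 0.25, 0.25, 0.25]))
```

A monomorphic locus gives a PIC of 0, and the value rises toward 1 as more alleles occur at similar frequencies. That is why the reported range of 0.293 to 0.789 indicates moderately to highly informative markers.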

Molecular characterization of two serine proteases expressed in gut tissue of the African trypanosome vector, Glossina morsitans morsitans.

PubMed

Yan, J; Cheng, Q; Li, C B; Aksoy, S

2001-02-01

Serine proteases are major insect gut enzymes involved in digestion of dietary proteins, and in addition they have been implicated in the process of pathogen establishment in several vector insects. The medically important vector, tsetse fly (Diptera: Glossinidae), is involved in the transmission of African trypanosomes, which cause devastating diseases in animals and humans. Both the male and female tsetse can transmit trypanosomes and both are strict bloodfeeders throughout all stages of their development. Here, we describe the characterization of two putative serine protease-encoding genes, Glossina serine protease-1 (Gsp1) and Glossina serine protease-2 (Gsp2) from gut tissue. Both putative cDNA products represent prepro peptides with hydrophobic signal peptide sequences associated with their 5'-end terminus. The Gsp1 cDNA encodes a putative mature protein of 245 amino acids with a molecular mass of 26 428 Da, while the predicted size of the 228 amino acid mature peptide encoded by Gsp2 cDNA is 24 573 Da. Both deduced peptides contain the Asp/His/Ser catalytic triad and the conserved residues surrounding it which are characteristic of serine proteases. In addition, both proteins have the six conserved cysteine residues that form the three cysteine bonds typically present in invertebrate serine proteases. Based on the presence of substrate-specific residues, the Gsp1 gene encodes a chymotrypsin-like protease while the Gsp2 gene encodes a protein with trypsin-like activity. Both proteins are encoded by few loci in the tsetse genome, being present in one or two copies only. The mRNA expression levels for the genes do not vary extensively throughout the digestive cycle, and high levels of mRNAs can be readily detected in the gut tissue of newly emerged flies. The levels of trypsin and chymotrypsin activities in the gut lumen increase following blood feeding and change significantly in the gut cells throughout the digestion cycle. Hence, the regulation of expression for trypsin and chymotrypsin occurs at the post-transcriptional level in tsetse. Both the coding sequences and patterns of expression of Gsp1 and Gsp2 genes are similar to the serine proteases that have been reported from the bloodfeeding insect Stomoxys calcitrans.
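As a quick plausibility check on the predicted sizes quoted above, the mean mass per residue can be derived from each length and mass pair. The helper below is an illustrative sketch, not part of the study. The comparison against a rule-of-thumb average of about 110 Da per residue is an assumption introduced here.

```python
def mean_residue_mass(mass_da: float, n_residues: int) -> float:
    """Average mass per amino acid residue, in daltons (hypothetical helper)."""
    if n_residues <= 0:
        raise ValueError("n_residues must be positive")
    return mass_da / n_residues


# (mass in Da, length in residues) as quoted in the abstract for the mature proteins.
mature_proteins = {"Gsp1": (26_428, 245), "Gsp2": (24_573, 228)}

if __name__ == "__main__":
    for name, (mass, length) in mature_proteins.items():
        print(f"{name}: {mean_residue_mass(mass, length):.1f} Da per residue")
```

Both values come out close to 108 Da per residue, in line with the approximate 110 Da average often used for back-of-envelope protein mass estimates.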
Identification of lptA, lpxE, and lpxO, Three Genes Involved in the Remodeling of Brucella Cell Envelope.

PubMed

Conde-Álvarez, Raquel; Palacios-Chaves, Leyre; Gil-Ramírez, Yolanda; Salvador-Bescós, Miriam; Bárcena-Varela, Marina; Aragón-Aranda, Beatriz; Martínez-Gómez, Estrella; Zúñiga-Ripa, Amaia; de Miguel, María J; Bartholomew, Toby Leigh; Hanniffy, Sean; Grilló, María-Jesús; Vences-Guzmán, Miguel Ángel; Bengoechea, José A; Arce-Gorvel, Vilma; Gorvel, Jean-Pierre; Moriyón, Ignacio; Iriarte, Maite

2017-01-01

The brucellae are facultative intracellular bacteria that cause a worldwide extended zoonosis. One of the pathogenicity mechanisms of these bacteria is their ability to avoid rapid recognition by innate immunity because of a reduction of the pathogen-associated molecular pattern (PAMP) of the lipopolysaccharide (LPS), free-lipids, and other envelope molecules. We investigated the Brucella homologs of lptA, lpxE, and lpxO, three genes that in some pathogens encode enzymes that mask the LPS PAMP by upsetting the core-lipid A charge/hydrophobic balance. Brucella lptA, which encodes a putative ethanolamine transferase, carries a frame-shift in B. abortus but not in other Brucella spp. and phylogenetic neighbors like the opportunistic pathogen Ochrobactrum anthropi. Consistent with the genomic evidence, a B. melitensis lptA mutant lacked lipid A-linked ethanolamine and displayed increased sensitivity to polymyxin B (a surrogate of innate immunity bactericidal peptides), while B. abortus carrying B. melitensis lptA displayed increased resistance. Brucella lpxE encodes a putative phosphatase acting on lipid A or on a free-lipid that is highly conserved in all brucellae and O. anthropi. Although we found no evidence of lipid A dephosphorylation, a B. abortus lpxE mutant showed increased polymyxin B sensitivity, suggesting the existence of a hitherto unidentified free-lipid involved in bactericidal peptide resistance. Gene lpxO putatively encoding an acyl hydroxylase carries a frame-shift in all brucellae except B. microti and is intact in O. anthropi. Free-lipid analysis revealed that lpxO corresponded to olsC, the gene coding for the ornithine lipid (OL) acyl hydroxylase active in O. anthropi and B. microti, while B. abortus carrying the olsC of O. anthropi and B. microti synthesized hydroxylated OLs. Interestingly, mutants in lptA, lpxE, or olsC were not attenuated in dendritic cells or mice. This lack of an obvious effect on virulence together with the presence of the intact homolog genes in O. anthropi and B. microti but not in other brucellae suggests that LptA, LpxE, or OL β-hydroxylase do not significantly alter the PAMP properties of Brucella LPS and free-lipids and are therefore not positively selected during the adaptation to intracellular life.
"PINK1"-Linked Parkinsonism Is Associated with Lewy Body Pathology

ERIC Educational Resources Information Center

Samaranch, Lluis; Lorenzo-Betancor, Oswaldo; Arbelo, Jose M.; Ferrer, Isidre; Lorenzo, Elena; Irigoyen, Jaione; Pastor, Maria A.; Marrero, Carmen; Isla, Concepcion; Herrera-Henriquez, Joanna; Pastor, Pau

2010-01-01

Phosphatase and tensin homolog-induced putative kinase 1 gene mutations have been associated with autosomal recessive early-onset Parkinson's disease. To date, no neuropathological reports have been published from patients with Parkinson's disease with both phosphatase and tensin homolog-induced putative kinase 1 gene copies mutated. We analysed…
Cloning and characterization of Prunus serotina AGAMOUS, a putative flower homeotic gene

Treesearch

Xiaomei Liu; Joseph Anderson; Paula Pijut

2010-01-01

Members of the AGAMOUS subfamily of MADS-box transcription factors play an important role in regulating the development of reproductive organs in flowering plants. To help understand the mechanism of floral development in black cherry (Prunus serotina), PsAG (a putative flower homeotic identity gene) was isolated...
Cellulose as an extracellular matrix component present in Enterobacter sakazakii biofilms.

PubMed

Grimm, Maya; Stephan, Roger; Iversen, Carol; Manzardo, Giuseppe G G; Rattei, Thomas; Riedel, Kathrin; Ruepp, Andreas; Frishman, Dmitrij; Lehner, Angelika

2008-01-01

Cellulose was identified and characterized as an extracellular matrix component present in the biofilm of an Enterobacter sakazakii clinical isolate grown in nutrient-deficient (M9) medium. Using a bacterial artificial cloning approach in Escherichia coli and subsequent screening of transformants for fluorescence on calcofluor plates, nine genes organized in two operons were identified as putatively responsible for the biosynthesis of cellulose. In addition to the genes already described for cellulose production, two more genes were identified, putatively transcribed together with the genes from the first operon. Putative cellulose in E. sakazakii ES5 biofilm grown on glass coverslips was visualized by calcofluor staining and confocal fluorescence laser scanning microscopy. For the first time, the presence of cellulose in biofilms produced by E. sakazakii was confirmed by methylation analysis.
Signaling coupled epigenomic regulation of gene expression.

PubMed

Kumar, R; Deivendran, S; Santhoshkumar, T R; Pillai, M R

2017-10-26

Inheritance of genomic information independent of the DNA sequence, the epigenetics, as well as gene transcription are profoundly shaped by serine/threonine and tyrosine signaling kinases and components of the chromatin remodeling complexes. To precisely respond to a changing external milieu, human cells efficiently translate upstream signals into post-translational modifications (PTMs) on histones and coregulators such as corepressors, coactivators, DNA-binding factors and PTM modifying enzymes. Because a protein with multiple residues for putative PTMs is expected to undergo more than one PTM in cells stimulated with growth factors, the outcome of combinational PTM codes on histones and coregulators is profoundly shaped by regulatory interplays between PTMs. The genomic functions of signaling kinases in cancer cells are manifested by the downstream effectors of cytoplasmic signaling cascades as well as translocation of the cytoplasmic signaling kinases to the nucleus. Signaling-mediated phosphorylation of histones serves as a regulatory switch for other PTMs, and connects chromatin remodeling complexes into gene transcription and gene activity. Here, we will discuss the recent advances in signaling-dependent epigenomic regulation of gene transcription using a few representative cancer-relevant serine/threonine and tyrosine kinases and their interplay with chromatin remodeling factors in cancer cells.
Dehydration-induced WRKY genes from tobacco and soybean respond to jasmonic acid treatments in BY-2 cell culture.

PubMed

Rabara, Roel C; Tripathi, Prateek; Lin, Jun; Rushton, Paul J

2013-02-15

Drought is one of the important environmental factors affecting crop production worldwide, and understanding the molecular response of plants to stress is therefore an important step in crop improvement. WRKY transcription factors are one of the 10 largest transcription factor families across the green lineage. In this study, highly upregulated dehydration-induced WRKY and enzyme-coding genes from tobacco and soybean were selected from microarray data for promoter analyses. Putative stress-related cis-regulatory elements such as the TGACG motif, ABRE-like elements, and W- and G-like sequences were identified by in silico analysis of the promoter regions of the selected genes. GFP quantification in transgenic BY-2 cell cultures showed that these promoters direct higher expression in response to 100 μM JA treatment than to 100 μM ABA, 10% PEG, or 85 mM NaCl treatments. Thus, promoter activity upon JA treatment and enrichment of MeJA-responsive elements in the promoters of the selected genes suggest that these genes are jasmonic acid responsive and may mediate cross-talk during dehydration responses. Copyright © 2013 Elsevier Inc. All rights reserved.
Genomic organization of human fetal specific P-450IIIA7 (cytochrome P-450HFLa)-related gene(s) and interaction of transcriptional regulatory factor with its DNA element in the 5' flanking region.

PubMed

Itoh, S; Yanagimoto, T; Tagawa, S; Hashimoto, H; Kitamura, R; Nakajima, Y; Okochi, T; Fujimoto, S; Uchino, J; Kamataki, T

1992-03-24

P-450IIIA7 is a form of cytochrome P-450 which was isolated from human fetal livers and termed P-450HFLa. This form has been shown to be expressed specifically during fetal life (Komori, M., Nishio, K., Kitada, M., Shiramatsu, K., Muroya, K., Soma, M., Nagashima, K. and Kamataki, T. (1990) Biochemistry 29, 4430-4433). In the present study, we isolated five independent clones which probably corresponded to the human P-450IIIA7 gene. These clones were completely sequenced, covering all exons, exon-intron junctions and the 5' flanking region from the cap site to -869. Although the sequences in the coding region were completely identical to P-450IIIA7, it is possible that genomic fragments sequenced in this study encode portions of other P-450IIIA7-related genes since we could not obtain a complete overlapping set of genomic clones. Within its 5' flanking sequence, putative binding sites of several transcriptional regulatory factors were present. Among them, it was shown that a basic transcription element binding factor (BTEB) actually interacted with the 5' flanking region of this gene.
Molecular Diagnosis of Putative Stargardt Disease by Capture Next Generation Sequencing

PubMed Central

Shi, Wei; Huang, Ping; Min, Qingjie; Li, Minghan; Yu, Xinping; Wu, Yaming; Zhao, Guangyu; Tong, Yi; Jin, Zi-Bing; Qu, Jia; Gu, Feng

2014-01-01

Stargardt Disease (STGD) is the commonest genetic form of juvenile or early adult onset macular degeneration, which is a genetically heterogeneous disease. Molecular diagnosis of STGD remains a challenge in a significant proportion of cases. To address this, seven patients from five putative STGD families were recruited. We performed capture next generation sequencing (CNGS) of the probands and searched for potentially disease-causing genetic variants in previously identified retinal or macular dystrophy genes. Seven disease-causing mutations in ABCA4 and two in PROM1 were identified by CNGS, which provides a confident genetic diagnosis in these five families. We also provided a genetic basis to explain the differences among putative STGD due to various mutations in different genes. Meanwhile, we show for the first time that compound heterozygous mutations in PROM1 gene could cause cone-rod dystrophy. Our findings support the enormous potential of CNGS in putative STGD molecular diagnosis. PMID:24763286
Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium.

PubMed

Catania, Francesco; Lynch, Michael

2010-05-04

In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa) remains a virtually unexplored issue. By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Our observations 1) shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2) are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3) reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.
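The 8-mer screen described above can be illustrated with a minimal sketch. It only shows the counting step: for each 8-mer, how many 3' UTRs contain it at least once. The sequences, gene names and the motif below are hypothetical placeholders, not data or the conserved motif from the study, and the authors' actual pipeline is not described in this abstract.

```python
from collections import Counter

def count_kmers_by_sequence(utrs, k=8):
    """For each k-mer, count how many sequences contain it at least once."""
    counts = Counter()
    for seq in utrs.values():
        seq = seq.upper()
        # A set ensures each k-mer is counted once per sequence.
        seen = {seq[i:i + k] for i in range(len(seq) - k + 1)}
        counts.update(seen)
    return counts

def motif_presence(utrs, motif):
    """Map each sequence name to True/False for presence of the motif."""
    motif = motif.upper()
    return {name: motif in seq.upper() for name, seq in utrs.items()}

# Hypothetical toy 3' UTRs (not real Paramecium sequences).
toy_utrs = {
    "geneA": "AATTGCATGCAAT",
    "geneB": "CCTTGCATGCAGG",
    "geneC": "AAAAAAAAAAAA",
}
toy_motif = "TTGCATGC"  # hypothetical 8-mer

kmer_counts = count_kmers_by_sequence(toy_utrs)
presence = motif_presence(toy_utrs, toy_motif)
print(kmer_counts[toy_motif], presence)
```

Counting presence per sequence rather than total occurrences keeps repetitive UTRs from dominating the tally, which is one reasonable choice when comparing motif-containing and motif-free genes.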
Tobacco plants transformed with the bean αai gene express an inhibitor of insect α-amylase in their seeds. [Nicotiana tabacum; Tenebrio molitor]

DOE Office of Scientific and Technical Information (OSTI.GOV)

Altabella, T.; Chrispeels, M.J.

Bean (Phaseolus vulgaris L.) seeds contain a putative plant defense protein that inhibits insect and mammalian but not plant α-amylases. We recently presented strong circumstantial evidence that this α-amylase inhibitor (αAI) is encoded by an already-identified lectin gene whose product is referred to as lectin-like-protein (LLP). We have now made a chimeric gene consisting of the coding sequence of the lectin gene that encodes LLP and the 5′ and 3′ flanking sequences of the lectin gene that encodes phytohemagglutinin-L. When this chimeric gene was expressed in transgenic tobacco (Nicotiana tabacum), we observed in the seeds a series of polypeptides (Mr 10,000-18,000) that cross-react with antibodies to the bean α-amylase inhibitor. Most of these polypeptides bind to a pig pancreas α-amylase affinity column. An extract of the seeds of the transformed tobacco plants inhibits pig pancreas α-amylase activity as well as the α-amylase present in the midgut of Tenebrio molitor. We suggest that introduction of this lectin gene (to be called αai) into other leguminous plants may be a strategy to protect the seeds from the seed-eating larvae of Coleoptera.
Characterization of an In Vivo Z-DNA Detection Probe Based on a Cell Nucleus Accumulating Intrabody.

PubMed

Gulis, Galina; Silva, Izabel Cristina Rodrigues; Sousa, Herdson Renney; Sousa, Isabel Garcia; Bezerra, Maryani Andressa Gomes; Quilici, Luana Salgado; Maranhao, Andrea Queiroz; Brigido, Marcelo Macedo

2016-09-01

Left-handed Z-DNA is a physiologically unstable DNA conformation, and its existence in vivo can be attributed to localized torsional distress. Despite evidence for the existence of Z-DNA in vivo, its precise role in the control of gene expression is not fully understood. Here, an in vivo probe based on an anti-Z-DNA intrabody is proposed for native Z-DNA detection. The probe was used for chromatin immunoprecipitation of potential Z-DNA-forming sequences in the human genome. One of the isolated putative Z-DNA-forming sequences was cloned upstream of a reporter gene expression cassette under control of the CMV promoter. The reporter gene encoded an antibody fragment fused to GFP. Transient co-transfection of this vector along with the Z-probe coding vector improved reporter gene expression. This improvement was demonstrated by measuring reporter gene mRNA and protein levels and the amount of fluorescence in co-transfected CHO-K1 cells. These results suggest that the presence of the anti-Z-DNA intrabody can interfere with a Z-DNA-containing reporter gene expression. Therefore, this in vivo probe for the detection of Z-DNA could be used for global correlation of Z-DNA-forming sequences and gene expression regulation.
Comprehensive genomic analysis and expression profiling of diacylglycerol kinase gene family in Malus prunifolia (Willd.) Borkh.

PubMed

Li, Yali; Tan, Yanxiao; Shao, Yun; Li, Mingjun; Ma, Fengwang

2015-05-01

Diacylglycerol kinase (DGK) is a pivotal enzyme that phosphorylates diacylglycerol (DAG) to form phosphatidic acid (PA). The production of PA from phospholipase D (PLD) and the coupled phospholipase C (PLC)/DGK route is a critical signaling process in animal and plant cells. Next to PLD, DGK is the second most important generator of PA in biotic and abiotic stress responses. We identified 8 DGK members within the apple genome and all of their putative proteins contain one DGK catalytic domain and one DGK accessory domain. Four coding sequences were confirmed by cloning from Malus prunifolia. Phylogenetic and gene structure analyses showed that the apple DGK genes could be assigned to Clusters I, II, or III. Expression analysis of 6 of them revealed that their transcript levels were highest in stems. Some apple DGK genes were also significantly up-regulated in response to salt and drought stresses. This suggested their possible roles in plant defenses against environmental challenges. As a first step toward genome-wide analyses of the DGK genes in woody plants, our results imply that apple DGK genes are involved in the signaling of stress responses. These findings will contribute to further functional dissection of this gene family. Copyright © 2015 Elsevier B.V. All rights reserved.
Genetic structure and viability selection in the golden eagle (Aquila chrysaetos), a vagile raptor with a Holarctic distribution

USGS Publications Warehouse

Doyle, Jacqueline M.; Katzner, Todd E.; Roemer, Gary; Cain, James W.; Millsap, Brian; McIntyre, Carol; Sonsthagen, Sarah A.; Fernandez, Nadia B.; Wheeler, Maria; Bulut, Zafer; Bloom, Peter; DeWoody, J. Andrew

2016-01-01

Molecular markers can reveal interesting aspects of organismal ecology and evolution, especially when surveyed in rare or elusive species. Herein, we provide a preliminary assessment of golden eagle (Aquila chrysaetos) population structure in North America using novel single nucleotide polymorphisms (SNPs). These SNPs included one molecular sexing marker, two mitochondrial markers, 85 putatively neutral markers that were derived from noncoding regions within large intergenic intervals, and 74 putatively nonneutral markers found in or very near protein-coding genes. We genotyped 523 eagle samples at these 162 SNPs and quantified genotyping error rates and variability at each marker. Our samples corresponded to 344 individual golden eagles as assessed by unique multilocus genotypes. Observed heterozygosity of known adults was significantly higher than of chicks, as was the number of heterozygous loci, indicating that mean zygosity measured across all 159 autosomal markers was an indicator of fitness as it is associated with eagle survival to adulthood. Finally, we used chick samples of known provenance to test for population differentiation across portions of North America and found pronounced structure among geographic sampling sites. These data indicate that cryptic genetic population structure is likely widespread in the golden eagle gene pool, and that extensive field sampling and genotyping will be required to more clearly delineate management units within North America and elsewhere.
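The comparison of heterozygosity between adults and chicks rests on a per-individual measure: the fraction of genotyped loci that are heterozygous. A minimal sketch of that calculation follows. The genotypes are invented toy values, and the handling of missing calls (skipped, marked as None) is an assumption rather than the authors' stated procedure.

```python
def individual_heterozygosity(genotypes):
    """Fraction of genotyped loci that are heterozygous.

    genotypes: list of (allele1, allele2) tuples; None marks a missing call.
    """
    called = [g for g in genotypes if g is not None]
    if not called:
        raise ValueError("no genotyped loci")
    het = sum(1 for a, b in called if a != b)
    return het / len(called)

# Hypothetical toy genotypes at four SNP loci (not data from the study).
toy_adult = [("A", "G"), ("C", "C"), ("T", "C"), None]
toy_chick = [("A", "A"), ("C", "C"), ("T", "C"), ("G", "G")]

adult_h = individual_heterozygosity(toy_adult)  # 2 of 3 called loci
chick_h = individual_heterozygosity(toy_chick)  # 1 of 4 called loci
print(round(adult_h, 3), round(chick_h, 3))
```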
The polyadenylation code: a unified model for the regulation of mRNA alternative polyadenylation

PubMed Central

Davis, Ryan; Shi, Yongsheng

2014-01-01

The majority of eukaryotic genes produce multiple mRNA isoforms with distinct 3′ ends through a process called mRNA alternative polyadenylation (APA). Recent studies have demonstrated that APA is dynamically regulated during development and in response to environmental stimuli. A number of mechanisms have been described for APA regulation. In this review, we attempt to integrate all the known mechanisms into a unified model. This model not only explains most of previous results, but also provides testable predictions that will improve our understanding of the mechanistic details of APA regulation. Finally, we briefly discuss the known and putative functions of APA regulation. PMID:24793760
Recurrent nonsense mutations in the growth hormone receptor from patients with Laron dwarfism.

PubMed Central

Amselem, S; Sobrier, M L; Duquesnoy, P; Rappaport, R; Postel-Vinay, M C; Gourmelen, M; Dallapiccola, B; Goossens, M

1991-01-01

In addition to its classical effects on growth, growth hormone (GH) has been shown to have a number of other actions, all of which are initiated by an interaction with specific high affinity receptors present in a variety of tissues. Purification of a rabbit liver protein via its ability to bind GH has allowed the isolation of a cDNA encoding a putative human growth hormone receptor that belongs to a new class of transmembrane receptors. We have previously shown that this putative growth hormone receptor gene is genetically linked to Laron dwarfism, a rare autosomal recessive syndrome caused by target resistance to GH. Nevertheless, the inability to express the corresponding full-length coding sequence and the lack of a test for growth-promoting function have hampered a direct confirmation of its role in growth. We have now identified three nonsense mutations within this growth hormone receptor gene, lying at positions corresponding to the amino terminal extremity and causing a truncation of the molecule, thereby deleting a large portion of both the GH binding domain and the full transmembrane and intracellular domains. Three independent patients with Laron dwarfism born of consanguineous parents were homozygous for these defects. Two defects were identical and consisted of a CG to TG transition. Not only do these results confirm the growth-promoting activity of this receptor but they also suggest that CpG doublets may represent hot spots for mutations in the growth hormone receptor gene that are responsible for hereditary dwarfism. PMID:1999489
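The link between CpG doublets and nonsense mutations can be made concrete with a short sketch. A C-to-T transition at a CpG turns CG into TG on the coding strand, and the same event on the opposite strand appears as CG to CA. The code scans an in-frame coding sequence for CpG sites where either change creates a stop codon. The function name and the toy sequence are hypothetical and are not taken from the growth hormone receptor gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def cpg_nonsense_sites(cds):
    """List (position, codon, mutated_codon) where a CpG transition
    creates a stop codon. Positions are 0-based; cds is read in frame 0."""
    cds = cds.upper()
    hits = []
    for i in range(len(cds) - 1):
        if cds[i:i + 2] != "CG":
            continue
        # C->T at i (coding strand), or G->A at i+1 (C->T on the other strand).
        for pos, new_base in ((i, "T"), (i + 1, "A")):
            start = pos - pos % 3
            codon = cds[start:start + 3]
            if len(codon) < 3 or codon in STOP_CODONS:
                continue
            offset = pos - start
            mutated = codon[:offset] + new_base + codon[offset + 1:]
            if mutated in STOP_CODONS:
                hits.append((pos, codon, mutated))
    return hits

# Hypothetical toy coding sequence: ATG CGA GGC TGG TAA
toy_cds = "ATGCGAGGCTGGTAA"
nonsense_hits = cpg_nonsense_sites(toy_cds)
print(nonsense_hits)
```

In this toy sequence the only hit is the arginine codon CGA, which becomes the stop codon TGA after a CG to TG transition.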
Genome-wide identification of Hami melon miRNAs with putative roles during fruit development

PubMed Central

Wang, Guangzhi; Ma, Xinli; Li, Meihua; Wu, Haibo; Fu, Qiushi; Zhang, Yi; Yi, Hongping

2017-01-01

MicroRNAs represent a family of small endogenous, non-coding RNAs that play critical regulatory roles in plant growth, development, and environmental stress responses. Hami melon is famous for its attractive flavor and excellent nutritional value; however, the mechanisms underlying fruit development and ripening remain largely unknown. Here, we performed small RNA sequencing to investigate the roles of miRNAs during Hami melon fruit development. Two batches of flesh samples were collected at four fruit development stages. Small RNA sequencing yielded a total of 54,553,424 raw reads from eight libraries. In total, 113 conserved miRNAs belonging to 30 miRNA families and nine novel miRNAs comprising nine miRNA families were identified. The expression of 42 conserved miRNAs and three Hami melon-specific miRNAs significantly changed during fruit development. Furthermore, 484 and 124 melon genes were predicted as putative targets of 29 conserved and nine Hami melon-specific miRNA families, respectively. GO enrichment analysis was performed on the target genes; “transcription, DNA-dependent”, “rRNA processing”, “oxidation reduction”, “signal transduction”, “regulation of transcription, DNA-dependent”, and “metabolic process” were the over-represented biological process terms. Cleavage sites of six target genes were validated using 5’ RACE. Our results present a comprehensive identification and characterization of Hami melon fruit miRNAs and their potential targets, providing a valuable basis for understanding the regulatory mechanisms in the programmed process of normal Hami fruit development and ripening. Specific miRNAs could be selected for further research and applications in breeding practices. PMID:28742088
Transcriptome analysis of the couch potato (CPO) protein reveals an expression pattern associated with early development in the salmon louse Caligus rogercresseyi.

PubMed

Gallardo-Escárate, Cristian; Valenzuela-Muñoz, Valentina; Nuñez-Acuña, Gustavo; Chávez-Mardones, Jacqueline; Maldonado-Aguayo, Waleska

2014-02-15

The couch potato (CPO) protein is a key biomolecule involved in regulating diapause through the RNA-binding process of the peripheral and central nervous systems in insects and also recently discovered in a few crustacean species. As such, ectoparasitic copepods are interesting model species that have no evidence of developmental arrest. The present study is the first to report on the cloning of a putative CPO gene from the salmon louse Caligus rogercresseyi (CrCPO), as identified by high-throughput transcriptome sequencing. In addition, the transcription expression in larvae and adults was evaluated using quantitative real-time PCR. The CrCPO cDNA sequence showed 3261 base pairs (bp), consisting of 713bp of 5' UTR, 1741bp of 3' UTR, and an open reading frame of 807bp encoding for 268 amino acids. The highly conserved RNA binding regions RNP2 (LFVSGL) and RNP1 (SPVGFVTF), as well the dimerization site (LEF), were also found. Furthermore, eight single nucleotide polymorphisms located in the untranslated regions and one located in the coding region were detected. Gene transcription analysis revealed that CrCPO has ubiquitous expression across larval stages and in adult individuals, with the highest expression from nauplius to copepodid stages. The present study suggests a putative biological function of CrCPO associated with the development of the nervous system in salmon lice and contributes molecular evidence for candidate genes related to host-parasite interactions. Copyright © 2013 Elsevier B.V. All rights reserved.
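The transcript lengths reported above are internally consistent, which a few lines of arithmetic confirm. The sketch assumes that the 807 bp open reading frame includes the stop codon; the abstract does not state this, but it is the reading under which 807 bp corresponds to 268 amino acids.

```python
# Lengths reported in the abstract, in base pairs.
UTR5_BP, ORF_BP, UTR3_BP = 713, 807, 1741
REPORTED_CDNA_BP = 3261

def orf_protein_length(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of orf_bp nucleotides."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    codons = orf_bp // 3
    # Assumption: the last codon is the stop codon and encodes no residue.
    return codons - 1 if includes_stop else codons

segments_sum = UTR5_BP + ORF_BP + UTR3_BP
protein_len = orf_protein_length(ORF_BP)
print(segments_sum, protein_len)
```

The three segments sum to the reported 3261 bp, and 807 / 3 = 269 codons, one of which is the stop codon, leaving 268 amino acids.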
Plasmid AZOBR_p1-borne fabG gene for putative 3-oxoacyl-[acyl-carrier protein] reductase is essential for proper assembly and work of the dual flagellar system in the alphaproteobacterium Azospirillum brasilense Sp245.

PubMed

Filip'echeva, Yulia A; Shelud'ko, Andrei V; Prilipov, Alexei G; Burygin, Gennady L; Telesheva, Elizaveta M; Yevstigneyeva, Stella S; Chernyshova, Marina P; Petrova, Lilia P; Katsy, Elena I

2018-02-01

Azospirillum brasilense can swim and swarm owing to the activity of a constitutive polar flagellum (Fla) and inducible lateral flagella (Laf), respectively. Experimental data on the regulation of the Fla and Laf assembly in azospirilla are scarce. Here, the coding sequence (CDS) AZOBR_p1160043 (fabG1) for a putative 3-oxoacyl-[acyl-carrier protein (ACP)] reductase was found essential for the construction of both types of flagella. In an immotile leaky Fla⁻ Laf⁻ fabG1::Omegon-Km mutant, Sp245.1610, defects in flagellation and motility were fully complemented by expressing the CDS AZOBR_p1160043 from plasmid pRK415. When pRK415 with the cloned CDS AZOBR_p1160045 (fliC) for a putative 65.2 kDa Sp245 Fla flagellin was transferred into the Sp245.1610 cells, the bacteria also became able to assemble a motile single flagellum. Some cells, however, had unusual swimming behavior, probably because of the side location of the organelle. Although the assembly of Laf was not restored in Sp245.1610 (pRK415-p1160045), this strain was somewhat capable of swarming motility. We propose that the putative 3-oxoacyl-[ACP] reductase encoded by the CDS AZOBR_p1160043 plays a role in correct flagellar location in the cell envelope and (or) in flagellar modification(s), which are also required for the inducible construction of Laf and for proper swimming and swarming motility of A. brasilense Sp245.
Identification of Putative Coffee Rust Mycoparasites via Single-Molecule DNA Sequencing of Infected Pustules.

PubMed

James, Timothy Y; Marino, John A; Perfecto, Ivette; Vandermeer, John

2016-01-15

The interaction of crop pests with their natural enemies is fundamental to their control. Natural enemies of fungal pathogens of crops are poorly known relative to those of insect pests, despite the diversity of fungal pathogens and their economic importance. Currently, many regions across Latin America are experiencing unprecedented epidemics of coffee rust (Hemileia vastatrix). Identification of natural enemies of coffee rust could aid in developing management strategies or in pinpointing species that could be used for biocontrol. In the present study, we characterized fungal communities associated with coffee rust lesions by single-molecule DNA sequencing of fungal rRNA gene bar codes from leaf discs (≈28 mm²) containing rust lesions and control discs with no rust lesions. The leaf disc communities were hyperdiverse in terms of fungi, with up to 69 operational taxonomic units (putative species) per control disc, and the diversity was only slightly reduced in rust-infected discs, with up to 63 putative species. However, geography had a greater influence on the fungal community than whether the disc was infected by coffee rust. Through comparisons between control and rust-infected leaf discs, as well as taxonomic criteria, we identified 15 putative mycoparasitic fungi. These fungi are concentrated in the fungal family Cordycipitaceae and the order Tremellales. These data emphasize the complexity of diverse fungi of unknown ecological function within a leaf that might influence plant disease epidemics or lead to the development of species for biocontrol of fungal disease. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

Genome-wide identification and expression profiling of the SnRK2 gene family in Malus prunifolia.

PubMed

Shao, Yun; Qin, Yuan; Zou, Yangjun; Ma, Fengwang

2014-11-15

Sucrose non-fermenting-1-related protein kinase 2 (SnRK2) constitutes a small plant-specific serine/threonine kinase family with essential roles in the abscisic acid (ABA) signal pathway and in responses to osmotic stress. Although a genome-wide analysis of this family has been conducted in some species, little is known about SnRK2 genes in apple (Malus domestica). We identified 14 putative sequences encoding 12 deduced SnRK2 proteins within the apple genome. Gene chromosomal location and synteny analysis of the apple SnRK2 genes indicated that tandem and segmental duplications have likely contributed to the expansion and evolution of these genes. All 12 full-length coding sequences were confirmed by cloning from Malus prunifolia. The gene structure and motif compositions of the apple SnRK2 genes were analyzed. Phylogenetic analysis showed that MpSnRK2s could be classified into four groups. Profiling of these genes presented differential patterns of expression in various tissues. Under stress conditions, transcript levels for some family members were up-regulated in the leaves in response to drought, salinity, or ABA treatments. This suggested their possible roles in plant response to abiotic stress. Our findings provide essential information about SnRK2 genes in apple and will contribute to further functional dissection of this gene family. Copyright © 2014 Elsevier B.V. All rights reserved.
Genome-scale metabolic network of Cordyceps militaris useful for comparative analysis of entomopathogenic fungi.

PubMed

Vongsangnak, Wanwipa; Raethong, Nachon; Mujchariyakul, Warasinee; Nguyen, Nam Ninh; Leong, Hon Wai; Laoteng, Kobkul

2017-08-30

The first genome-scale metabolic network of Cordyceps militaris (iWV1170) was constructed to represent its whole metabolism; it consists of 894 metabolites and 1,267 metabolic reactions across five compartments: the plasma membrane, cytoplasm, mitochondria, peroxisome and extracellular space. The iWV1170 network could be exploited to explain the organism's phenotypes, including growth ability and the production of cordycepin and other metabolites on various substrates. A high number of genes encoding extracellular enzymes for degradation of complex carbohydrates, lipids and proteins exist in the C. militaris genome. By comparative genome-scale analysis, the adenine metabolic pathway towards putative cordycepin biosynthesis was reconstructed, indicating their evolutionary relationships across eleven species of entomopathogenic fungi. The overall metabolic routes involved in the putative cordycepin biosynthesis were also identified in C. militaris, including central carbon metabolism, amino acid metabolism (glycine, l-glutamine and l-aspartate) and nucleotide metabolism (adenosine and adenine). Interestingly, C. militaris lacks the sequence coding for ribonucleotide reductase inhibitor, which might contribute to its over-production of cordycepin. Copyright © 2017. Published by Elsevier B.V.
Pharmacophore screening of the protein data bank for specific binding site chemistry.

PubMed

Campagna-Slater, Valérie; Arrowsmith, Andrew G; Zhao, Yong; Schapira, Matthieu

2010-03-22

A simple computational approach was developed to screen the Protein Data Bank (PDB) for putative pockets possessing a specific binding site chemistry and geometry. The method employs two commonly used 3D screening technologies, namely identification of cavities in protein structures and pharmacophore screening of chemical libraries. For each protein structure, a pocket finding algorithm is used to extract potential binding sites containing the correct types of residues, which are then stored in a large SDF-formatted virtual library; pharmacophore filters describing the desired binding site chemistry and geometry are then applied to screen this virtual library and identify pockets matching the specified structural chemistry. As an example, this approach was used to screen all human protein structures in the PDB and identify sites having chemistry similar to that of known methyl-lysine binding domains that recognize chromatin methylation marks. The selected genes include known readers of the histone code as well as novel binding pockets that may be involved in epigenetic signaling. Putative allosteric sites were identified on the structures of TP53BP1, L3MBTL3, CHEK1, KDM4A, and CREBBP.
Two pheromone precursor genes are transcriptionally expressed in the homothallic ascomycete Sordaria macrospora.

PubMed

Pöggeler, S

2000-06-01

In order to analyze the involvement of pheromones in cell recognition and mating in a homothallic fungus, two putative pheromone precursor genes, named ppg1 and ppg2, were isolated from a genomic library of Sordaria macrospora. The ppg1 gene is predicted to encode a precursor pheromone that is processed by a Kex2-like protease to yield a pheromone that is structurally similar to the alpha-factor of the yeast Saccharomyces cerevisiae. The ppg2 gene encodes a 24-amino-acid polypeptide that contains a putative farnesylated and carboxy methylated C-terminal cysteine residue. The sequences of the predicted pheromones display strong structural similarity to those encoded by putative pheromones of heterothallic filamentous ascomycetes. Both genes are expressed during the life cycle of S. macrospora. This is the first description of pheromone precursor genes encoded by a homothallic fungus. Southern-hybridization experiments indicated that ppg1 and ppg2 homologues are also present in other homothallic ascomycetes.
A Fruit-Specific Putative Dihydroflavonol 4-Reductase Gene Is Differentially Expressed in Strawberry during the Ripening Process

PubMed Central

Moyano, Enriqueta; Portero-Robles, Ignacio; Medina-Escobar, Nieves; Valpuesta, Victoriano; Muñoz-Blanco, Juan; Luis Caballero, José

1998-01-01

A cDNA clone encoding a putative dihydroflavonol 4-reductase gene has been isolated from a strawberry (Fragaria × ananassa cv Chandler) DNA subtractive library. Northern analysis showed that the corresponding gene is predominantly expressed in fruit, where it is first detected during elongation (green stages) and then declines and sharply increases when the initial fruit ripening events occur, at the time of initiation of anthocyanin accumulation. The transcript can be induced in unripe green fruit by removing the achenes, and this induction can be partially inhibited by treatment of de-achened fruit with naphthylacetic acid, indicating that the expression of this gene is under hormonal control. We propose that the putative dihydroflavonol 4-reductase gene in strawberry plays a main role in the biosynthesis of anthocyanin during color development at the late stages of fruit ripening; during the first stages the expression of this gene could be related to the accumulation of condensed tannins. PMID:9625725
Identification of estrogen-responsive genes using a genome-wide analysis of promoter elements for transcription factor binding sites.

PubMed

Kamalakaran, Sitharthan; Radhakrishnan, Senthil K; Beck, William T

2005-06-03

We developed a pipeline to identify novel genes regulated by the steroid hormone-dependent transcription factor, estrogen receptor, through a systematic analysis of upstream regions of all human and mouse genes. We built a data base of putative promoter regions for 23,077 human and 19,984 mouse transcripts from National Center for Biotechnology Information annotation and 8793 human and 6785 mouse promoters from the Data Base of Transcriptional Start Sites. We used this data base of putative promoters to identify potential targets of estrogen receptor by identifying estrogen response elements (EREs) in their promoters. Our program correctly identified EREs in genes known to be regulated by estrogen in addition to several new genes whose putative promoters contained EREs. We validated six genes (KIAA1243, NRIP1, MADH9, NME3, TPD52L, and ABCG2) to be estrogen-responsive in MCF7 cells using reverse transcription PCR. To allow for extensibility of our program in identifying targets of other transcription factors, we have built a Web interface to access our data base and programs. Our Web-based program for Promoter Analysis of Genome, PAGen@UIC, allows a user to identify putative target genes for vertebrate transcription factors through the analysis of their upstream sequences. The interface allows the user to search the human and mouse promoter data bases for potential target genes containing one or more listed transcription factor binding sites (TFBSs) in their upstream elements, using either regular expression-based consensus or position weight matrices. The data base can also be searched for promoters harboring user-defined TFBSs given as a consensus or a position weight matrix. Furthermore, the user can retrieve putative promoter sequences for any given gene together with identified TFBSs located on its promoter. Orthologous promoters are also analyzed to determine conserved elements.
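The regular-expression style of consensus search mentioned above can be sketched in a few lines. The pattern below uses the commonly cited palindromic ERE consensus GGTCAnnnTGACC and accepts exact matches only. This is an assumption made for illustration: the abstract does not give the patterns or mismatch tolerance that PAGen@UIC uses, and the position weight matrix mode is not shown. The promoter string is a made-up toy sequence.

```python
import re

# Consensus palindromic ERE: GGTCA, any three bases, TGACC.
# The lookahead lets the scan report overlapping hits as well.
ERE_PATTERN = re.compile(r"(?=(GGTCA[ACGT]{3}TGACC))")

def find_sites(promoter, pattern=ERE_PATTERN):
    """Return (0-based start, matched site) for every hit in the sequence."""
    return [(m.start(), m.group(1)) for m in pattern.finditer(promoter.upper())]

# Hypothetical toy promoter containing two consensus EREs.
toy_promoter = "ttaGGTCAcagTGACCttgcaGGTCAtttTGACCa"
ere_hits = find_sites(toy_promoter)
print(ere_hits)
```

A user-defined binding site could be searched the same way by passing a different compiled pattern, for example one written with character classes such as `[AG]` for degenerate positions.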
Development of a set of SNP markers present in expressed genes of the apple.

PubMed

Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S

2008-11-01

Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequence tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, PCR amplification was carried out on two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5 map, using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.
Ability of secondary metabolites from trichoderma virens to mediate communication during mutualistic or pathogenic interactions

USDA-ARS's Scientific Manuscript database

A bioinformatic study was conducted to identify the putative genes in the biocontrol agent Trichoderma virens that encode for non-ribosomal peptide synthetases (NRPS). Gene expression analysis of 22 putative NRPSs and 4 NRPS/PKS (polyketide synthase) hybrid enzymes was conducted in the presence and...
Constitutive expression of a putative high-affinity nitrate transporter in Nicotiana plumbaginifolia: evidence for post-transcriptional regulation by a reduced nitrogen source.

PubMed

Fraisier, V; Gojon, A; Tillard, P; Daniel-Vedele, F

2000-08-01

The NpNRT2.1 gene encodes a putative inducible component of the high-affinity nitrate (NO3-) uptake system in Nicotiana plumbaginifolia. Here we report functional and physiological analyses of transgenic plants expressing the NpNRT2.1 coding sequence fused to the CaMV 35S or rolD promoters. Irrespective of the level of NO3- supplied, NO3- contents were found to be remarkably similar in wild-type and transgenic plants. Under specific conditions (growth on 10 mM NO3-), the steady-state NpNRT2.1 mRNA level resulting from the deregulated transgene expression was accompanied by an increase in 15NO3- influx measured in the low concentration range. This demonstrates for the first time that the NRT2.1 sequence codes a limiting element of the inducible high-affinity transport system. Both 15NO3- influx and mRNA levels decreased in the wild type after exposure to ammonium, in agreement with previous results from many species. Surprisingly, however, influx was also markedly decreased in transgenic plants, despite stable levels of transgene expression in independent transformants after ammonium addition. We conclude that the conditions associated with the supply of a reduced nitrogen source such as ammonium, or with the generation of a further downstream metabolite, probably exert a repressive effect on NO3- influx at both transcriptional and post-transcriptional levels.
Putative Porin of Bradyrhizobium sp. (Lupinus) Bacteroids Induced by Glyphosate

PubMed Central

de María, Nuria; Guevara, Ángeles; Serra, M. Teresa; García-Luque, Isabel; González-Sama, Alfonso; de Lacoba, Mario García; de Felipe, M. Rosario; Fernández-Pascual, Mercedes

2007-01-01

Application of glyphosate (N-[phosphonomethyl] glycine) to Bradyrhizobium sp. (Lupinus)-nodulated lupin plants caused modifications in the protein pattern of bacteroids. The most significant change was the presence of a 44-kDa polypeptide in bacteroids from plants treated with the higher doses of glyphosate employed (5 and 10 mM). The polypeptide has been characterized by the amino acid sequencing of its N terminus and the isolation and nucleic acid sequencing of its encoding gene. It is putatively encoded by a single gene, and the protein has been identified as a putative porin. Protein modeling revealed the existence of several domains sharing similarity to different porins, such as a transmembrane beta-barrel. The protein has been designated BLpp, for Bradyrhizobium sp. (Lupinus) putative porin, and would be the first porin described in Bradyrhizobium sp. (Lupinus). In addition, a putative conserved domain of porins has been identified which consists of 87 amino acids, located in the BLpp sequence 30 amino acids downstream of the N-terminal region. In bacteroids, mRNA of the BLpp gene shows a basal constitutive expression that increases under glyphosate treatment, and the expression of the gene is seemingly regulated at the transcriptional level. By contrast, in free-living bacteria glyphosate treatment leads to an inhibition of BLpp mRNA accumulation, indicating a different effect of glyphosate on BLpp gene expression in bacteroids and free-living bacteria. The possible role of BLpp in a metabolite interchange between Bradyrhizobium and lupin is discussed. PMID:17557843
PlantTribes: a gene and gene family resource for comparative genomics in plants

PubMed Central

Wall, P. Kerr; Leebens-Mack, Jim; Müller, Kai F.; Field, Dawn; Altman, Naomi S.; dePamphilis, Claude W.

2008-01-01

The PlantTribes database (http://fgp.huck.psu.edu/tribe.html) is a plant gene family database based on the inferred proteomes of five sequenced plant species: Arabidopsis thaliana, Carica papaya, Medicago truncatula, Oryza sativa and Populus trichocarpa. We used the graph-based clustering algorithm MCL [Van Dongen (Technical Report INS-R0010 2000) and Enright et al. (Nucleic Acids Res. 2002; 30: 1575–1584)] to classify all of these species’ protein-coding genes into putative gene families, called tribes, using three clustering stringencies (low, medium and high). For all tribes, we have generated protein and DNA alignments and maximum-likelihood phylogenetic trees. A parallel database of microarray experimental results is linked to the genes, which lets researchers identify groups of related genes and their expression patterns. Unified nomenclatures were developed, and tribes can be related to traditional gene families and conserved domain identifiers. SuperTribes, constructed through a second iteration of MCL clustering, connect distant, but potentially related gene clusters. The global classification of nearly 200 000 plant proteins was used as a scaffold for sorting ∼4 million additional cDNA sequences from over 200 plant species. All data and analyses are accessible through a flexible interface allowing users to explore the classification, to place query sequences within the classification, and to download results for further study. PMID:18073194
Altered Gene Expression in Three Plant Species in Response to Treatment with Nep1, a Fungal Protein That Causes Necrosis

PubMed Central

Keates, Sarah E.; Kostman, Todd A.; Anderson, James D.; Bailey, Bryan A.

2003-01-01

Nep1 is an extracellular fungal protein that causes necrosis when applied to many dicotyledonous plants, including invasive weed species. Using transmission electron microscopy, it was determined that application of Nep1 (1.0 μg mL–1, 0.1% [v/v] Silwet-L77) to Arabidopsis and two invasive weed species, spotted knapweed (Centaurea maculosa) and dandelion (Taraxacum officinale), caused a reduction in the thickness of the cuticle and a breakdown of chloroplasts 1 to 4 h after treatment. Membrane breakdown was most severe in cells closest to the surface of application. Differential display was used to isolate cDNA clones from the three species showing differential expression in response to Nep1 treatment. Differential gene expression was observed for a putative serpin (CmSER-1) and a calmodulin-like (CmCAL-1) protein from spotted knapweed, and a putative protein phosphatase 2C (ToPP2C-1) and cytochrome P-450 (ToCYP-1) protein from dandelion. In addition, differential expression was observed for genes coding for a putative protein kinase (AtPK-1), a homolog (AtWI-12) of wound-induced WI12, a homolog (AtLEA-1) of late embryogenesis abundant LEA-5, a WRKY-18 DNA-binding protein (AtWRKY-18), and a phospholipase D (AtPLD-1) from Arabidopsis. Genes showing elevated mRNA levels in Nep1-treated (5 μg mL–1, 0.1% [v/v] Silwet-L77) leaves 15 min after Nep1 treatment included CmSER-1 and CmCAL-1 for spotted knapweed, ToCYP-1 and CmCAL-1 for dandelion, and AtPK-1, AtWRKY-18, AtWI-12, and AtLEA-1 for Arabidopsis. Levels of mRNA for AtPLD-1 (Arabidopsis) and ToPP2C-1 (dandelion) decreased rapidly in Silwet-L77-treated plants between 15 min and 4 h of treatment, but were maintained or decreased more slowly over time in Nep1-treated (5 μg mL–1, 0.1% [v/v] Silwet-L77) leaves. In general, increases in mRNA band intensities were in the range of two to five times, with only ToCYP-1 in dandelion exceeding an increase of 10 times. The identified genes have been shown to be involved in, or are related to gene families that are involved in, plant stress responses, including wounding, drought, senescence, and disease resistance. PMID:12857840
Identification of druggable cancer driver genes amplified across TCGA datasets.

PubMed

Chen, Ying; McGee, Jeremy; Chen, Xianming; Doman, Thompson N; Gong, Xueqian; Zhang, Youyan; Hamm, Nicole; Ma, Xiwen; Higgs, Richard E; Bhagwat, Shripad V; Buchanan, Sean; Peng, Sheng-Bin; Staschke, Kirk A; Yadav, Vipin; Yue, Yong; Kouros-Mehr, Hosein

2014-01-01

The Cancer Genome Atlas (TCGA) projects have advanced our understanding of the driver mutations, genetic backgrounds, and key pathways activated across cancer types. Analysis of TCGA datasets has mostly focused on somatic mutations and translocations, with less emphasis placed on gene amplifications. Here we describe a bioinformatics screening strategy to identify putative cancer driver genes amplified across TCGA datasets. We carried out GISTIC2 analysis of TCGA datasets spanning 16 cancer subtypes and identified 486 genes that were amplified in two or more datasets. The list was narrowed to 75 cancer-associated genes with potential "druggable" properties. The majority of the genes were localized to 14 amplicons spread across the genome. To identify potential cancer driver genes, we analyzed gene copy number and mRNA expression data from individual patient samples and identified 42 putative cancer driver genes linked to diverse oncogenic processes. Oncogenic activity was further validated by siRNA/shRNA knockdown and by referencing the Project Achilles datasets. The amplified genes represented a number of gene families, including epigenetic regulators, cell cycle-associated genes, DNA damage response/repair genes, metabolic regulators, and genes linked to the Wnt, Notch, Hedgehog, JAK/STAT, NF-KB and MAPK signaling pathways. Among the 42 putative driver genes were known driver genes, such as EGFR, ERBB2 and PIK3CA. Wild-type KRAS was amplified in several cancer types, and KRAS-amplified cancer cell lines were most sensitive to KRAS shRNA, suggesting that KRAS amplification was an independent oncogenic event. A number of MAP kinase adapters were co-amplified with their receptor tyrosine kinases, such as the FGFR adapter FRS2 and the EGFR family adapters GRB2 and GRB7. The ubiquitin-like ligase DCUN1D1 and the histone methyltransferase NSD3 were also identified as novel putative cancer driver genes. We discuss the patient tailoring implications for existing cancer drug targets and we further discuss potential novel opportunities for drug discovery efforts.
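The first filtering step described in this abstract, keeping genes amplified in two or more datasets, can be sketched as follows. This is only an illustration under stated assumptions: the function name `recurrently_amplified`, the dataset labels, and the placeholder gene names are hypothetical, and the input is assumed to be a per-dataset list of genes already called as amplified (for example from a GISTIC2 run), not real TCGA results.

```python
from collections import Counter


def recurrently_amplified(amplified_by_dataset, min_datasets=2):
    """Return genes called as amplified in at least `min_datasets` datasets."""
    counts = Counter(
        gene
        for genes in amplified_by_dataset.values()
        for gene in set(genes)  # count each gene once per dataset
    )
    return sorted(gene for gene, n in counts.items() if n >= min_datasets)


# Toy input with placeholder names; these are not TCGA amplification calls.
toy_calls = {
    "dataset_A": ["GENE_1", "GENE_2"],
    "dataset_B": ["GENE_1", "GENE_3"],
    "dataset_C": ["GENE_2", "GENE_4", "GENE_1"],
}
print(recurrently_amplified(toy_calls))  # ['GENE_1', 'GENE_2']
```

The later steps in the abstract (druggability filtering, copy number versus mRNA expression, knockdown validation) need data that this sketch does not model.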
Identification of Druggable Cancer Driver Genes Amplified across TCGA Datasets

PubMed Central

Chen, Ying; McGee, Jeremy; Chen, Xianming; Doman, Thompson N.; Gong, Xueqian; Zhang, Youyan; Hamm, Nicole; Ma, Xiwen; Higgs, Richard E.; Bhagwat, Shripad V.; Buchanan, Sean; Peng, Sheng-Bin; Staschke, Kirk A.; Yadav, Vipin; Yue, Yong; Kouros-Mehr, Hosein

2014-01-01

The Cancer Genome Atlas (TCGA) projects have advanced our understanding of the driver mutations, genetic backgrounds, and key pathways activated across cancer types. Analysis of TCGA datasets has mostly focused on somatic mutations and translocations, with less emphasis placed on gene amplifications. Here we describe a bioinformatics screening strategy to identify putative cancer driver genes amplified across TCGA datasets. We carried out GISTIC2 analysis of TCGA datasets spanning 16 cancer subtypes and identified 486 genes that were amplified in two or more datasets. The list was narrowed to 75 cancer-associated genes with potential “druggable” properties. The majority of the genes were localized to 14 amplicons spread across the genome. To identify potential cancer driver genes, we analyzed gene copy number and mRNA expression data from individual patient samples and identified 42 putative cancer driver genes linked to diverse oncogenic processes. Oncogenic activity was further validated by siRNA/shRNA knockdown and by referencing the Project Achilles datasets. The amplified genes represented a number of gene families, including epigenetic regulators, cell cycle-associated genes, DNA damage response/repair genes, metabolic regulators, and genes linked to the Wnt, Notch, Hedgehog, JAK/STAT, NF-KB and MAPK signaling pathways. Among the 42 putative driver genes were known driver genes, such as EGFR, ERBB2 and PIK3CA. Wild-type KRAS was amplified in several cancer types, and KRAS-amplified cancer cell lines were most sensitive to KRAS shRNA, suggesting that KRAS amplification was an independent oncogenic event. A number of MAP kinase adapters were co-amplified with their receptor tyrosine kinases, such as the FGFR adapter FRS2 and the EGFR family adapters GRB2 and GRB7. The ubiquitin-like ligase DCUN1D1 and the histone methyltransferase NSD3 were also identified as novel putative cancer driver genes. We discuss the patient tailoring implications for existing cancer drug targets and we further discuss potential novel opportunities for drug discovery efforts. PMID:24874471
The human serotonin 5-HT2C receptor: Complete cDNA, genomic structure, and alternatively spliced variant

DOE Office of Scientific and Technical Information (OSTI.GOV)

Xie, Enzhong; Zhu, Lingyu; Zhao, Lingyun

1996-08-01

The complete 4775-nt cDNA encoding the human serotonin 5-HT2C receptor (5-HT2CR), a G-protein-coupled receptor, has been isolated. It contains a 1377-nt coding region flanked by a 728-nt 5′-untranslated region and a 2670-nt 3′-untranslated region. By using the cloned 5-HT2CR cDNA probe, the complete human gene for this receptor has been isolated and shown to contain six exons and five introns spanning at least 230 kb of DNA. The coding region of the human 5-HT2CR gene is interrupted by three introns, and the positions of the intron/exon junctions are conserved between the human and the rodent genes. In addition, an alternatively spliced 5-HT2CR RNA that contains a 95-nt deletion in the region coding for the second intracellular loop and the fourth transmembrane domain of the receptor has been identified. This deletion leads to a frameshift and premature termination so that the short isoform RNA encodes a putative protein of 248 amino acids. The ratio for the short isoform over the 5-HT2CR RNA was found to be higher in choroid plexus tumor than in normal brain tissue, suggesting the possibility of differential regulation of the 5-HT2CR gene in different neural tissues or during tumorigenesis. Transcription of the human 5-HT2CR gene was found to be initiated at multiple sites. No classical TATA-box sequence was found at the appropriate location, and the 5′-flanking sequence contains many potential transcription factor-binding sites. A 7.3-kb 5′-flanking 5-HT2CR DNA directed the efficient expression of a luciferase reporter gene in SK-N-SH and IMR32 neuroblastoma cells, indicating that it contains a functional promoter. 69 refs., 8 figs., 1 tab.
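The lengths quoted in this abstract can be checked with simple arithmetic. The sketch below uses only the figures stated above; the helper names are hypothetical, and treating the 1377-nt coding region as including the stop codon is an assumption.

```python
# Lengths in nucleotides, as quoted in the abstract.
FIVE_PRIME_UTR_NT = 728
CODING_REGION_NT = 1377
THREE_PRIME_UTR_NT = 2670
DELETION_NT = 95


def cdna_length(utr5, cds, utr3):
    """Total cDNA length as the sum of both UTRs and the coding region."""
    return utr5 + cds + utr3


def codon_count(cds_nt):
    """Number of codons in a coding region whose length is a multiple of 3."""
    if cds_nt % 3 != 0:
        raise ValueError("coding region length is not a multiple of 3")
    return cds_nt // 3


def shifts_frame(deletion_nt):
    """A deletion shifts the reading frame unless its length is a multiple of 3."""
    return deletion_nt % 3 != 0


print(cdna_length(FIVE_PRIME_UTR_NT, CODING_REGION_NT, THREE_PRIME_UTR_NT))  # 4775
print(codon_count(CODING_REGION_NT))  # 459 (458 residues if the stop codon is counted)
print(shifts_frame(DELETION_NT))  # True, since 95 = 3 * 31 + 2
```

The three parts add up to the reported 4775 nt, and a 95-nt deletion cannot preserve the reading frame, which fits the frameshift and premature termination described for the short isoform.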
Sequence characterization of S100A8 gene reveals structural differences of protein and transcriptional factor binding sites in water buffalo and yak.

PubMed

Kathiravan, P; Goyal, S; Kataria, R S; Mishra, B P; Jayakumar, S; Joshi, B K

2011-01-01

The present study was undertaken to characterize the structure of S100A8 gene and its promoter in water buffalo and yak. Sequence data of 2.067 kb, 2.071 kb, and 2.052 kb with respect to complete S100A8 gene including 5' flanking region was generated in river buffalo, swamp buffalo, and yak, respectively. BLAST analysis of coding DNA sequences (CDS) of S100A8 gene revealed 95% homology of buffalo sequence with cattle, 85% with pig and horse, 83% with dog, 72-73% with murines, and around 79% with primates and humans. Phylogenetic analysis of predicted CDS revealed distinct clustering of murines, primates, and domestic animals with bovines and bubalines forming a subcluster among farm animals. In silico translation of predicted CDS revealed a sequence of 89 amino acids with 7 amino acid changes between cattle and buffalo and 2 changes between cattle and yak. The search for Pfam family revealed the N-terminal calcium binding domain and the noncanonical EF hand domain in the carboxy terminus, with more variations being observed in the N-terminal domain among different species. Two amino acid changes observed in carboxy terminal EF hand domain resulted in altered secondary structure of yak S100A8 protein. Analysis of S100A8 gene promoter revealed 14 putative motifs for transcriptional factor binding sites. Two putative motifs viz. C/EBP and v-Myb were found to be absent in swamp buffalo as compared to river buffalo and cattle. Differences in the structure of S100A8 protein and the transcriptional factor binding sites identified in the present study need to be analyzed further for their functional significance in yak and swamp buffalo respectively. Copyright © Taylor & Francis Group, LLC
Spontaneous Mutation Reveals Influence of Exopolysaccharide on Lactobacillus johnsonii Surface Characteristics

PubMed Central

Horn, Nikki; Wegmann, Udo; Dertli, Enes; Mulholland, Francis; Collins, Samuel R. A.; Waldron, Keith W.; Bongaerts, Roy J.; Mayer, Melinda J.; Narbad, Arjan

2013-01-01

As a competitive exclusion agent, Lactobacillus johnsonii FI9785 has been shown to prevent the colonization of selected pathogenic bacteria from the chicken gastrointestinal tract. During growth of the bacterium a rare but consistent emergence of an altered phenotype was noted, generating smooth colonies in contrast to the wild type rough form. A smooth colony variant was isolated and two-dimensional gel analysis of both strains revealed a protein spot with different migration properties in the two phenotypes. The spot in both gels was identified as a putative tyrosine kinase (EpsC), associated with a predicted exopolysaccharide gene cluster. Sequencing of the epsC gene from the smooth mutant revealed a single substitution (G to A) in the coding strand, resulting in the amino acid change D88N in the corresponding gene product. A native plasmid of L. johnsonii was engineered to produce a novel vector for constitutive expression and this was used to demonstrate that expression of the wild type epsC gene in the smooth mutant produced a reversion to the rough colony phenotype. Both the mutant and epsC complemented strains had increased levels of exopolysaccharides compared to the wild type strain, indicating that the rough phenotype is not solely associated with the quantity of exopolysaccharide. Another gene in the cluster, epsE, that encoded a putative undecaprenyl-phosphate galactosephosphotransferase, was deleted in order to investigate its role in exopolysaccharide biosynthesis. The ΔepsE strain exhibited a large increase in cell aggregation and a reduction in exopolysaccharide content, while plasmid complementation of epsE restored the wild type phenotype. Flow cytometry showed that the wild type and derivative strains exhibited clear differences in their adhesive ability to HT29 monolayers in tissue culture, demonstrating an impact of EPS on surface properties and bacteria-host interactions. PMID:23544114
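The link between the G-to-A substitution and the D88N change follows from the standard genetic code: both aspartate codons (GAT, GAC) become asparagine codons (AAT, AAC) when the first base changes from G to A. A minimal sketch, assuming the substitution falls at the first codon position; the abstract does not say which aspartate codon epsC actually uses, and the names below are hypothetical.

```python
# Subset of the standard genetic code covering the codons involved.
CODON_TABLE = {"GAT": "D", "GAC": "D", "AAT": "N", "AAC": "N"}


def substitute(codon, position, base):
    """Return the codon with the base at the 0-based position replaced."""
    return codon[:position] + base + codon[position + 1:]


for wild_type in ("GAT", "GAC"):
    mutant = substitute(wild_type, 0, "A")
    print(wild_type, CODON_TABLE[wild_type], "->", mutant, CODON_TABLE[mutant])
# GAT D -> AAT N
# GAC D -> AAC N
```

Either aspartate codon gives the same amino acid change, so the reported substitution and D88N are consistent whichever codon is present.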
Identification and Characterization of Cyprinid Herpesvirus-3 (CyHV-3) Encoded MicroRNAs

PubMed Central

Donohoe, Owen H.; Henshilwood, Kathy; Way, Keith; Hakimjavadi, Roya; Stone, David M.; Walls, Dermot

2015-01-01

MicroRNAs (miRNAs) are a class of small non-coding RNAs involved in post-transcriptional gene regulation. Some viruses encode their own miRNAs and these are increasingly being recognized as important modulators of viral and host gene expression. Cyprinid herpesvirus 3 (CyHV-3) is a highly pathogenic agent that causes acute mass mortalities in carp (Cyprinus carpio carpio) and koi (Cyprinus carpio koi) worldwide. Here, bioinformatic analyses of the CyHV-3 genome suggested the presence of non-conserved precursor miRNA (pre-miRNA) genes. Deep sequencing of small RNA fractions prepared from in vitro CyHV-3 infections led to the identification of potential miRNAs and miRNA–offset RNAs (moRNAs) derived from some bioinformatically predicted pre-miRNAs. DNA microarray hybridization analysis, Northern blotting and stem-loop RT-qPCR were then used to definitively confirm that CyHV-3 expresses two pre-miRNAs during infection in vitro. The evidence also suggested the presence of an additional four high-probability and two putative viral pre-miRNAs. MiRNAs from the two confirmed pre-miRNAs were also detected in gill tissue from CyHV-3-infected carp. We also present evidence that one confirmed miRNA can regulate the expression of a putative CyHV-3-encoded dUTPase. Candidate homologues of some CyHV-3 pre-miRNAs were identified in CyHV-1 and CyHV-2. This is the first report of miRNA and moRNA genes encoded by members of the Alloherpesviridae family, a group distantly related to the Herpesviridae family. The discovery of these novel CyHV-3 genes may help further our understanding of the biology of this economically important virus and their encoded miRNAs may have potential as biomarkers for the diagnosis of latent CyHV-3. PMID:25928140
Identification and Characterization of microRNA319a and Its Putative Target Gene, PvPCF5, in the Bioenergy Grass Switchgrass (Panicum virgatum).

PubMed

Xie, Qi; Liu, Xue; Zhang, Yinbing; Tang, Jinfu; Yin, Dedong; Fan, Bo; Zhu, Lihuang; Han, Liebao; Song, Guilong; Li, Dayong

2017-01-01

Due to its high biomass yield, low environmental impact, and widespread adaptability to poor soils and harsh conditions, switchgrass (Panicum virgatum L.), a warm-region perennial herbaceous plant, has attracted much attention in recent years. However, little is known about microRNAs (miRNAs) and their functions in this bioenergy grass. Here, we identified and characterized a miRNA gene, Pvi-MIR319a, encoding microRNA319a in switchgrass. Transgenic rice lines generated by overexpressing the Pvi-MIR319a precursor gene exhibited broader leaves and delayed flowering compared with the control. Gene expression analysis indicated that at least four putative target genes were downregulated. Additionally, we cloned a putative target gene (PvPCF5) of Pvi-MIR319a from switchgrass. PvPCF5, a TCP transcription factor, is a nuclear-localized protein with transactivation activity that controls leaf development. Our results suggest that Pvi-MIR319a and its target genes may be used as potential genetic regulators for future switchgrass genetic improvement.
Improvement and Optimization of Two Engineered Phage Resistance Mechanisms in Lactococcus lactis

PubMed Central

McGrath, Stephen; Fitzgerald, Gerald F.; van Sinderen, Douwe

2001-01-01

Homologous replication module genes were identified for four P335 type phages. DNA sequence analysis revealed that all four phages exhibited more than 90% DNA homology for at least two genes, designated rep2009 and orf17. One of these genes, rep2009, codes for a putative replisome organizer protein and contains an assumed origin of phage DNA replication (ori2009), which was identical for all four phages. DNA fragments representing the ori2009 sequence confer a phage-encoded resistance (Per) phenotype on lactococcal hosts when they are supplied on a high-copy-number vector. Furthermore, cloning multiple copies of the ori2009 sequence was found to increase the effectiveness of the Per phenotype conferred. A number of antisense plasmids targeting specific genes of the replication module were constructed. Two separate plasmids targeting rep2009 and orf17 were found to efficiently inhibit proliferation of all four phages by interfering with intracellular phage DNA replication. These results represent two highly effective strategies for inhibiting bacteriophage proliferation, and they also identify a novel gene, orf17, which appears to be important for phage DNA replication. Furthermore, these results indicate that although the actual mechanisms of DNA replication are very similar, if not identical, for all four phages, expression of the replication genes is significantly different in each case. PMID:11157223

A low-pungency S3212 genotype of Capsicum frutescens caused by a mutation in the putative aminotransferase (p-AMT) gene.

PubMed

Park, Young-Jun; Nishikawa, Tomotaro; Minami, Mineo; Nemoto, Kazuhiro; Iwasaki, Tomohiro; Matsushima, Kenichi

2015-12-01

The purpose of this study was to identify the genetic mechanism underlying capsinoid biosynthesis in S3212, a low-pungency genotype of Capsicum frutescens. Screening of C. frutescens accessions for capsaicinoid and capsiate contents by high-performance liquid chromatography revealed that low-pungency S3212 contained high levels of capsiate but no capsaicin. Comparison of DNA coding sequences of pungent (T1 and Bird Eye) and low-pungency (S3212) genotypes uncovered a significant 12-bp deletion mutation in exon 7 of the p-AMT gene of S3212. In addition, p-AMT gene transcript levels in placental tissue were positively correlated with the degree of pungency. S3212, the low-pungency genotype, exhibited no significant p-AMT transcript levels, whereas T1, one of the pungent genotypes, displayed high transcript levels of this gene. We therefore conclude that the deletion mutation in the p-AMT gene is related to the loss of pungency in placental tissue and has given rise to the low-pungency S3212 C. frutescens genotype. C. frutescens S3212 represents a good natural source of capsinoids. Finally, our basic characterization of the uncovered p-AMT gene mutation should contribute to future studies of capsinoid biosynthesis in Capsicum.
Horizontal gene transfer in an acid mine drainage microbial community.

PubMed

Guo, Jiangtao; Wang, Qi; Wang, Xiaoqi; Wang, Fumeng; Yao, Jinxian; Zhu, Huaiqiu

2015-07-04

Horizontal gene transfer (HGT) has been widely identified in complete prokaryotic genomes. However, the roles of HGT among members of a microbial community and in evolution remain largely unknown. With the emergence of metagenomics, it is nontrivial to investigate such horizontal flow of genetic materials among members in a microbial community from the natural environment. Because of the lack of suitable methods for metagenomics gene transfer detection, microorganisms with near-complete genomes from a low-complexity acid mine drainage (AMD) community were used to detect possible gene transfer events and to suggest their biological significance. Using the annotation of coding regions by the current tools, a phylogenetic approach, and an approximately unbiased test, we found that HGTs in AMD organisms are not rare, and we predicted 119 putative transferred genes. Among them, 14 HGT events were determined to be transfer events among the AMD members. Further analysis of the 14 transferred genes revealed that the HGT events affected the functional evolution of archaea or bacteria in AMD, and that they probably shaped the community structure, such as the dominance of G-plasma in archaea in AMD through HGT. Our study provides a novel insight into HGT events among microorganisms in natural communities. The interconnectedness between HGT and community evolution is essential to understand microbial community formation and development.
The oestrogen receptor alpha-regulated lncRNA NEAT1 is a critical modulator of prostate cancer.

PubMed

Chakravarty, Dimple; Sboner, Andrea; Nair, Sujit S; Giannopoulou, Eugenia; Li, Ruohan; Hennig, Sven; Mosquera, Juan Miguel; Pauwels, Jonathan; Park, Kyung; Kossai, Myriam; MacDonald, Theresa Y; Fontugne, Jacqueline; Erho, Nicholas; Vergara, Ismael A; Ghadessi, Mercedeh; Davicioni, Elai; Jenkins, Robert B; Palanisamy, Nallasivam; Chen, Zhengming; Nakagawa, Shinichi; Hirose, Tetsuro; Bander, Neil H; Beltran, Himisha; Fox, Archa H; Elemento, Olivier; Rubin, Mark A

2014-11-21

The androgen receptor (AR) plays a central role in establishing an oncogenic cascade that drives prostate cancer progression. Some prostate cancers escape androgen dependence and are often associated with an aggressive phenotype. The oestrogen receptor alpha (ERα) is expressed in prostate cancers, independent of AR status. However, the role of ERα remains elusive. Using a combination of chromatin immunoprecipitation (ChIP) and RNA-sequencing data, we identified an ERα-specific non-coding transcriptome signature. Among putatively ERα-regulated intergenic long non-coding RNAs (lncRNAs), we identified nuclear enriched abundant transcript 1 (NEAT1) as the most significantly overexpressed lncRNA in prostate cancer. Analysis of two large clinical cohorts also revealed that NEAT1 expression is associated with prostate cancer progression. Prostate cancer cells expressing high levels of NEAT1 were recalcitrant to androgen or AR antagonists. Finally, we provide evidence that NEAT1 drives oncogenic growth by altering the epigenetic landscape of target gene promoters to favour transcription.
The oestrogen receptor alpha-regulated lncRNA NEAT1 is a critical modulator of prostate cancer

PubMed Central

Chakravarty, Dimple; Sboner, Andrea; Nair, Sujit S.; Giannopoulou, Eugenia; Li, Ruohan; Hennig, Sven; Mosquera, Juan Miguel; Pauwels, Jonathan; Park, Kyung; Kossai, Myriam; MacDonald, Theresa Y.; Fontugne, Jacqueline; Erho, Nicholas; Vergara, Ismael A.; Ghadessi, Mercedeh; Davicioni, Elai; Jenkins, Robert B.; Palanisamy, Nallasivam; Chen, Zhengming; Nakagawa, Shinichi; Hirose, Tetsuro; Bander, Neil H.; Beltran, Himisha; Fox, Archa H.; Elemento, Olivier; Rubin, Mark A.

2014-01-01

The androgen receptor (AR) plays a central role in establishing an oncogenic cascade that drives prostate cancer progression. Some prostate cancers escape androgen dependence and are often associated with an aggressive phenotype. The oestrogen receptor alpha (ERα) is expressed in prostate cancers, independent of AR status. However, the role of ERα remains elusive. Using a combination of chromatin immunoprecipitation (ChIP) and RNA-sequencing data, we identified an ERα-specific non-coding transcriptome signature. Among putatively ERα-regulated intergenic long non-coding RNAs (lncRNAs), we identified nuclear enriched abundant transcript 1 (NEAT1) as the most significantly overexpressed lncRNA in prostate cancer. Analysis of two large clinical cohorts also revealed that NEAT1 expression is associated with prostate cancer progression. Prostate cancer cells expressing high levels of NEAT1 were recalcitrant to androgen or AR antagonists. Finally, we provide evidence that NEAT1 drives oncogenic growth by altering the epigenetic landscape of target gene promoters to favour transcription. PMID:25415230
Discovery and Complete Genome Sequence of a Bacteriophage from an Obligate Intracellular Symbiont of a Cellulolytic Protist in the Termite Gut

PubMed Central

Pramono, Ajeng K.; Kuwahara, Hirokazu; Itoh, Takehiko; Toyoda, Atsushi; Yamada, Akinori; Hongoh, Yuichi

2017-01-01

Termites depend nutritionally on their gut microbes, and protistan, bacterial, and archaeal gut communities have been extensively studied. However, limited information is available on viruses in the termite gut. We herein report the complete genome sequence (99,517 bp) of a phage obtained during a genome analysis of “Candidatus Azobacteroides pseudotrichonymphae” phylotype ProJPt-1, which is an obligate intracellular symbiont of the cellulolytic protist Pseudotrichonympha sp. in the gut of the termite Prorhinotermes japonicus. The genome of the phage, designated ProJPt-Bp1, was circular or circularly permuted, and was not integrated into the two circular chromosomes or five circular plasmids composing the host ProJPt-1 genome. The phage was putatively affiliated with the order Caudovirales based on sequence similarities with several phage-related genes; however, most of the 52 protein-coding sequences had no significant homology to sequences in the databases. The phage genome contained a tRNA-Gln (CAG) gene, which showed the highest sequence similarity to the tRNA-Gln (CAA) gene of the host “Ca. A. pseudotrichonymphae” phylotype ProJPt-1. Since the host genome lacked a tRNA-Gln (CAG) gene, the phage tRNA gene may compensate for differences in codon usage bias between the phage and host genomes. The phage genome also contained a non-coding region with high nucleotide sequence similarity to a region in one of the host plasmids. No other phage-related sequences were found in the host ProJPt-1 genome. To the best of our knowledge, this is the first report of a phage from an obligate, mutualistic endosymbiont permanently associated with eukaryotic cells. PMID:28321010
Identification and validation of Asteraceae miRNAs by the expressed sequence tag analysis.

PubMed

Monavar Feshani, Aboozar; Mohammadi, Saeed; Frazier, Taylor P; Abbasi, Abbas; Abedini, Raha; Karimi Farsad, Laleh; Ehya, Farveh; Salekdeh, Ghasem Hosseini; Mardi, Mohsen

2012-02-10

MicroRNAs (miRNAs) are small non-coding RNA molecules that play a vital role in the regulation of gene expression. Despite their identification in hundreds of plant species, few miRNAs have been identified in the Asteraceae, a large family that comprises approximately one tenth of all flowering plants. In this study, we used expressed sequence tag (EST) analysis to identify potential conserved miRNAs and their putative target genes in the Asteraceae. We applied quantitative real-time PCR (qRT-PCR) to confirm the expression of eight potential miRNAs in Carthamus tinctorius and Helianthus annuus. We also performed qRT-PCR analysis to investigate the differential expression pattern of five newly identified miRNAs during five different cotyledon growth stages in safflower. Using these methods, we successfully identified and characterized 151 potentially conserved miRNAs, belonging to 26 miRNA families, in 11 genera of Asteraceae. EST analysis predicted that the newly identified conserved Asteraceae miRNAs target 130 total protein-coding ESTs in sunflower and safflower, as well as 433 additional target genes in other plant species. We experimentally confirmed the existence of seven predicted miRNAs (miR156, miR159, miR160, miR162, miR166, miR396, and miR398) in safflower and sunflower seedlings. We also observed that five out of eight miRNAs are differentially expressed during cotyledon development. Our results indicate that miRNAs may be involved in the regulation of gene expression during seed germination and the formation of the cotyledons in the Asteraceae. The findings of this study might ultimately help in the understanding of miRNA-mediated gene regulation in important crop species. Copyright © 2011 Elsevier B.V. All rights reserved.
Comparison of the Genome Sequence of the Poultry Pathogen Bordetella avium with Those of B. bronchiseptica, B. pertussis, and B. parapertussis Reveals Extensive Diversity in Surface Structures Associated with Host Interaction

PubMed Central

Sebaihia, Mohammed; Preston, Andrew; Maskell, Duncan J.; Kuzmiak, Holly; Connell, Terry D.; King, Natalie D.; Orndorff, Paul E.; Miyamoto, David M.; Thomson, Nicholas R.; Harris, David; Goble, Arlette; Lord, Angela; Murphy, Lee; Quail, Michael A.; Rutter, Simon; Squares, Robert; Squares, Steven; Woodward, John; Parkhill, Julian; Temple, Louise M.

2006-01-01

Bordetella avium is a pathogen of poultry and is phylogenetically distinct from Bordetella bronchiseptica, Bordetella pertussis, and Bordetella parapertussis, which are other species in the Bordetella genus that infect mammals. In order to understand the evolutionary relatedness of Bordetella species and further the understanding of pathogenesis, we obtained the complete genome sequence of B. avium strain 197N, a pathogenic strain that has been extensively studied. With 3,732,255 base pairs of DNA and 3,417 predicted coding sequences, it has the smallest genome and gene complement of the sequenced bordetellae. In this study, the presence or absence of previously reported virulence factors from B. avium was confirmed, and the genetic bases for growth characteristics were elucidated. Over 1,100 genes present in B. avium but not in B. bronchiseptica were identified, and most were predicted to encode surface or secreted proteins that are likely to define an organism adapted to the avian rather than the mammalian respiratory tracts. These include genes coding for the synthesis of a polysaccharide capsule, hemagglutinins, a type I secretion system adjacent to two very large genes for secreted proteins, and unique genes for both lipopolysaccharide and fimbrial biogenesis. Three apparently complete prophages are also present. The BvgAS virulence regulatory system appears to have polymorphisms at a poly(C) tract that is involved in phase variation in other bordetellae. A number of putative iron-regulated outer membrane proteins were predicted from the sequence, and this regulation was confirmed experimentally for five of these. PMID:16885469
Functional similarity and molecular divergence of a novel reproductive transcriptome in two male-pregnant Syngnathus pipefish species

PubMed Central

Small, Clayton M; Harlin-Cognato, April D; Jones, Adam G

2013-01-01

Evolutionary studies have revealed that reproductive proteins in animals and plants often evolve more rapidly than the genome-wide average. The causes of this pattern, which may include relaxed purifying selection, sexual selection, sexual conflict, pathogen resistance, reinforcement, or gene duplication, remain elusive. Investigative expansions to additional taxa and reproductive tissues have the potential to shed new light on this unresolved problem. Here, we embark on such an expansion, in a comparison of the brood-pouch transcriptome between two male-pregnant species of the pipefish genus Syngnathus. Male brooding tissues in syngnathid fishes represent a novel, nonurogenital reproductive trait, heretofore mostly uncharacterized from a molecular perspective. We leveraged next-generation sequencing (Roche 454 pyrosequencing) to compare transcript abundance in the male brooding tissues of pregnant with nonpregnant samples from Gulf (S. scovelli) and dusky (S. floridae) pipefish. A core set of protein-coding genes, including multiple members of astacin metalloprotease and c-type lectin gene families, is consistent between species in both the direction and magnitude of expression bias. As predicted, coding DNA sequence analysis of these putative “male pregnancy proteins” suggests rapid evolution relative to nondifferentially expressed genes and reflects signatures of adaptation similar in magnitude to those reported from Drosophila male accessory gland proteins. Although the precise drivers of male pregnancy protein divergence remain unknown, we argue that the male pregnancy transcriptome in syngnathid fishes, a clade diverse with respect to brooding morphology and mating system, represents a unique and promising object of study for understanding the perplexing evolutionary nature of reproductive molecules. PMID:24324861
Gene-enriched draft genome of the cattle tick Rhipicephalus microplus: assembly by the hybrid Pacific Biosciences/Illumina approach enabled analysis of the highly repetitive genome.

PubMed

Barrero, Roberto A; Guerrero, Felix D; Black, Michael; McCooke, John; Chapman, Brett; Schilkey, Faye; Pérez de León, Adalberto A; Miller, Robert J; Bruns, Sara; Dobry, Jason; Mikhaylenko, Galina; Stormo, Keith; Bell, Callum; Tao, Quanzhou; Bogden, Robert; Moolhuijzen, Paula M; Hunter, Adam; Bellgard, Matthew I

2017-08-01

The genome of the cattle tick Rhipicephalus microplus, an ectoparasite with global distribution, is estimated to be 7.1 Gbp in length and consists of approximately 70% repetitive DNA. We report the draft assembly of a tick genome that utilized a hybrid sequencing and assembly approach to capture the repetitive fractions of the genome. Our hybrid approach produced an assembly consisting of 2.0 Gbp represented in 195,170 scaffolds with an N50 of 60,284 bp. The Rmi v2.0 assembly is 51.46% repetitive with a large fraction of unclassified repeats, short interspersed elements, long interspersed elements and long terminal repeats. We identified 38,827 putative R. microplus gene loci, of which 24,758 were protein-coding genes (≥100 amino acids). OrthoMCL comparative analysis against 11 selected species including insects and vertebrates identified 10,835 and 3,423 protein-coding gene loci that are unique to R. microplus or common to both R. microplus and Ixodes scapularis ticks, respectively. We identified 191 microRNA loci, of which 168 have similarity to known miRNAs and 23 represent novel miRNA families. We identified the genomic loci of several highly divergent R. microplus esterases with sequence similarity to acetylcholinesterase. Additionally, we report the finding of a novel cytochrome P450 CYP41 homolog whose protein folding structure is similar to that of CYP41 proteins known to be involved in acaricide resistance. Copyright © 2017 Australian Society for Parasitology. Published by Elsevier Ltd. All rights reserved.
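The N50 quoted in the abstract above is a standard assembly contiguity statistic: the length L such that scaffolds of length L or longer together contain at least half of the assembled bases. The following is a minimal sketch of that calculation, not the authors' pipeline. The function name `n50` and the toy scaffold lengths are illustrative assumptions and do not come from the Rmi v2.0 assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths."""
    if not lengths:
        raise ValueError("at least one length is required")
    ordered = sorted(lengths, reverse=True)  # longest scaffolds first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first scaffold that pushes the running sum to half of the
        # assembly size defines the N50.
        if running >= half_total:
            return length


# Hypothetical toy lengths (total 290, so half is 145).
toy_scaffolds = [80, 70, 50, 40, 30, 20]
print(n50(toy_scaffolds))  # prints 70, because 80 + 70 = 150 >= 145
```

Because N50 is weighted by bases rather than by scaffold count, an assembly with many short scaffolds can still report a moderate N50 when a smaller number of long scaffolds holds most of the sequence.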
Cloning, Sequencing, and Expression of the Gene Encoding Cyclic 2,3-Diphosphoglycerate Synthetase, the Key Enzyme of Cyclic 2,3-Diphosphoglycerate Metabolism in Methanothermus fervidus

PubMed Central

Matussek, Karl; Moritz, Patrick; Brunner, Nina; Eckerskorn, Christoph; Hensel, Reinhard

1998-01-01

Cyclic 2,3-diphosphoglycerate synthetase (cDPGS) catalyzes the synthesis of cyclic 2,3-diphosphoglycerate (cDPG) by formation of an intramolecular phosphoanhydride bond in 2,3-diphosphoglycerate. cDPG is known to be accumulated to high intracellular concentrations (>300 mM) as a putative thermoadapter in some hyperthermophilic methanogens. For the first time, we have purified active cDPGS from a methanogen, the hyperthermophilic archaeon Methanothermus fervidus, sequenced the coding gene, and expressed it in Escherichia coli. cDPGS purification resulted in enzyme preparations containing two isoforms differing in their electrophoretic mobility under denaturing conditions. Since both polypeptides showed the same N-terminal amino acid sequence and Southern analyses indicate the presence of only one gene coding for cDPGS in M. fervidus, the two polypeptides originate from the same gene but differ by a not yet identified modification. The native cDPGS represents a dimer with an apparent molecular mass of 112 kDa and catalyzes the reversible formation of the intramolecular phosphoanhydride bond at the expense of ATP. The enzyme shows a clear preference for the synthetic reaction: the substrate affinity and the Vmax of the synthetic reaction are a factor of 8 to 10 higher than the corresponding values for the reverse reaction. Comparison with the kinetic properties of the electrophoretically homogeneous, apparently unmodified recombinant enzyme from E. coli revealed a twofold-higher Vmax of the enzyme from M. fervidus in the synthesizing direction. PMID:9811660
The complete mitochondrial genome of the common sea slater, Ligia oceanica (Crustacea, Isopoda) bears a novel gene order and unusual control region features

PubMed Central

Kilpert, Fabian; Podsiadlowski, Lars

2006-01-01

Background: Sequence data and other characters from mitochondrial genomes (gene translocations, secondary structure of RNA molecules) are useful in phylogenetic studies among metazoan animals from population to phylum level. Moreover, the comparison of complete mitochondrial sequences gives valuable information about the evolution of small genomes, e.g. about different mechanisms of gene translocation, gene duplication and gene loss, or concerning nucleotide frequency biases. The Peracarida (gammarids, isopods, etc.) comprise about 21,000 species of crustaceans, living in many environments from the deep sea floor to arid terrestrial habitats. Ligia oceanica is a terrestrial isopod living on rocky seashores of the European North Sea and Atlantic coastlines. Results: The study reveals the first complete mitochondrial DNA sequence from a peracarid crustacean. The mitochondrial genome of Ligia oceanica is a circular double-stranded DNA molecule, with a size of 15,289 bp. It shows several changes in mitochondrial gene order compared to other crustacean species. An overview of the mitochondrial gene order of all crustacean taxa sequenced so far is also presented. The largest non-coding part (the putative mitochondrial control region) of the mitochondrial genome of Ligia oceanica is unexpectedly not AT-rich compared to the remainder of the genome. It bears two repeat regions (4× 10 bp and 3× 64 bp), and a GC-rich hairpin-like secondary structure. Some of the transfer RNAs show secondary structures which deviate from the usual cloverleaf pattern. While some tRNA genes are putative targets for RNA editing, trnR could not be localized at all. Conclusion: Gene order is not conserved among Peracarida, not even among isopods. The two isopod species Ligia oceanica and Idotea baltica show a similarly derived gene order, compared to the arthropod ground pattern and to the amphipod Parhyale hawaiiensis, suggesting that most of the translocation events were already present in the last common ancestor of these isopods. Beyond that, the positions of three tRNA genes differ in the two isopod species. Strand bias in nucleotide frequency is reversed in both isopod species compared to other Malacostraca. This is probably due to a reversal of the replication origin, which is further supported by the fact that the hairpin structure typically found in the control region shows a reversed orientation in the isopod species, compared to other crustaceans. PMID:16987408
RNA Interference of NADPH-Cytochrome P450 Reductase Results in Reduced Insecticide Resistance in the Bed Bug, Cimex lectularius

PubMed Central

Zhu, Fang; Sams, Sarah; Moural, Tim; Haynes, Kenneth F.; Potter, Michael F.; Palli, Subba R.

2012-01-01

Background: NADPH-cytochrome P450 reductase (CPR) plays a central role in cytochrome P450 action. The genes coding for P450s are not yet fully identified in the bed bug, Cimex lectularius. Hence, we decided to clone the cDNA and knock down the expression of the gene coding for CPR, which is suggested to be required for the function of all P450s, to determine whether or not P450s are involved in resistance of bed bugs to insecticides. Methodology/Principal Findings: The full-length Cimex lectularius CPR (ClCPR) cDNA was isolated from a deltamethrin-resistant bed bug population (CIN-1) using a combined PCR strategy. Bioinformatics and in silico modeling were employed to identify three conserved binding domains (FMN, FAD, NADP), a FAD binding motif, and the catalytic residues. The critical amino acids involved in FMN, FAD, NADP binding and their putative functions were also analyzed. The ClCPR protein has no signal peptide but contains a membrane anchor domain of 21 amino acids that facilitates its localization on the endoplasmic reticulum. Phylogenetic analysis showed that ClCPR is closer to the CPR from the body louse, Pediculus humanus corporis, than to the CPRs from the other insect species studied. The ClCPR gene was ubiquitously expressed in all tissues tested but showed an increase in expression as immature stages develop into adults. We exploited the traumatic insemination mechanism of bed bugs to inject dsRNA and successfully knock down the expression of the gene coding for ClCPR. Suppression of the ClCPR expression increased susceptibility to deltamethrin in resistant populations but not in the susceptible population of bed bugs. Conclusions/Significance: These data suggest that P450-mediated metabolic detoxification may serve as one of the resistance mechanisms in bed bugs. PMID:22347424
RNA interference of NADPH-cytochrome P450 reductase results in reduced insecticide resistance in the bed bug, Cimex lectularius.

PubMed

Zhu, Fang; Sams, Sarah; Moural, Tim; Haynes, Kenneth F; Potter, Michael F; Palli, Subba R

2012-01-01

NADPH-cytochrome P450 reductase (CPR) plays a central role in cytochrome P450 action. The genes coding for P450s are not yet fully identified in the bed bug, Cimex lectularius. Hence, we decided to clone the cDNA and knock down the expression of the gene coding for CPR, which is suggested to be required for the function of all P450s, to determine whether or not P450s are involved in resistance of bed bugs to insecticides. The full-length Cimex lectularius CPR (ClCPR) cDNA was isolated from a deltamethrin-resistant bed bug population (CIN-1) using a combined PCR strategy. Bioinformatics and in silico modeling were employed to identify three conserved binding domains (FMN, FAD, NADP), a FAD binding motif, and the catalytic residues. The critical amino acids involved in FMN, FAD, NADP binding and their putative functions were also analyzed. The ClCPR protein has no signal peptide but contains a membrane anchor domain of 21 amino acids that facilitates its localization on the endoplasmic reticulum. Phylogenetic analysis showed that ClCPR is closer to the CPR from the body louse, Pediculus humanus corporis, than to the CPRs from the other insect species studied. The ClCPR gene was ubiquitously expressed in all tissues tested but showed an increase in expression as immature stages develop into adults. We exploited the traumatic insemination mechanism of bed bugs to inject dsRNA and successfully knock down the expression of the gene coding for ClCPR. Suppression of the ClCPR expression increased susceptibility to deltamethrin in resistant populations but not in the susceptible population of bed bugs. These data suggest that P450-mediated metabolic detoxification may serve as one of the resistance mechanisms in bed bugs.
Genotyping microsatellite DNA markers at putative disease loci in inbred/multiplex families with respiratory chain complex I deficiency allows rapid identification of a novel nonsense mutation (IVS1nt -1) in the NDUFS4 gene in Leigh syndrome.

PubMed

Bénit, Paule; Steffann, Julie; Lebon, Sophie; Chretien, Dominique; Kadhom, Noman; de Lonlay, Pascale; Goldenberg, Alice; Dumez, Yves; Dommergues, Marc; Rustin, Pierre; Munnich, Arnold; Rötig, Agnès

2003-05-01

Complex I deficiency, the most common cause of mitochondrial disorders, accounts for a variety of clinical symptoms and its genetic heterogeneity makes identification of the disease genes particularly tedious. Indeed, most of the 43 complex I subunits are encoded by nuclear genes, only seven of them being mitochondrially encoded. In order to offer urgent prenatal diagnosis, we have studied an inbred/multiplex family with complex I deficiency by using microsatellite DNA markers flanking the putative disease loci. Microsatellite DNA markers have allowed us to exclude the NDUFS7, NDUFS8, NDUFV1 and NDUFS1 genes and to find homozygosity at the NDUFS4 locus. Direct sequencing has led to identification of a homozygous splice acceptor site mutation in intron 1 of the NDUFS4 gene (IVS1nt -1, G→A); this was not found in chorion villi of the ongoing pregnancy. We suggest that genotyping microsatellite DNA markers at putative disease loci in inbred/multiplex families helps to identify the disease-causing mutation. More generally, we suggest giving consideration to a more systematic microsatellite analysis of putative disease loci for identification of disease genes in inbred/multiplex families affected with genetically heterogeneous conditions.
Insights into plant biomass conversion from the genome of the anaerobic thermophilic bacterium Caldicellulosiruptor bescii DSM 6725

PubMed Central

Dam, Phuongan; Kataeva, Irina; Yang, Sung-Jae; Zhou, Fengfeng; Yin, Yanbin; Chou, Wenchi; Poole, Farris L.; Westpheling, Janet; Hettich, Robert; Giannone, Richard; Lewis, Derrick L.; Kelly, Robert; Gilbert, Harry J.; Henrissat, Bernard; Xu, Ying; Adams, Michael W. W.

2011-01-01

Caldicellulosiruptor bescii DSM 6725 utilizes various polysaccharides and grows efficiently on untreated high-lignin grasses and hardwood at an optimum temperature of ∼80°C. It is a promising anaerobic bacterium for studying high-temperature biomass conversion. Its genome contains 2666 protein-coding sequences organized into 1209 operons. Expression of 2196 genes (83%) was confirmed experimentally. At least 322 genes appear to have been obtained by lateral gene transfer (LGT). Putative functions were assigned to 364 conserved/hypothetical protein (C/HP) genes. The genome contains 171 and 88 genes related to carbohydrate transport and utilization, respectively. Growth on cellulose led to the up-regulation of 32 carbohydrate-active (CAZy), 61 sugar transport, 25 transcription factor and 234 C/HP genes. Some C/HPs were overproduced on cellulose or xylan, suggesting their involvement in polysaccharide conversion. A unique feature of the genome is enrichment with genes encoding multi-modular, multi-functional CAZy proteins organized into one large cluster, the products of which are proposed to act synergistically on different components of plant cell walls and to aid the ability of C. bescii to convert plant biomass. The high duplication of CAZy domains coupled with the ability to acquire foreign genes by LGT may have allowed the bacterium to rapidly adapt to changing plant biomass-rich environments. PMID:21227922
Molecular identification of arsenic-resistant estuarine bacteria and characterization of their ars genotype.

PubMed

Sri Lakshmi Sunita, M; Prashant, S; Bramha Chari, P V; Nageswara Rao, S; Balaravi, Padma; Kavi Kishor, P B

2012-01-01

In the present study, 44 arsenic-resistant bacteria were isolated from the Mandovi and Zuari estuarine water systems through serial dilutions on agar plates containing ≥0.05 mM sodium arsenite and ≥10 mM sodium arsenate. The ars genotype characterization in 36 bacterial isolates (resistant to 100 mM of sodium arsenate) revealed that only 17 isolates harboured the arsA (ATPase), arsB (arsenite permease) and arsC (arsenate reductase) genes on the plasmid DNA. The arsA, arsB and arsC genes were individually detected using PCR in 16, 9 and 13 bacterial isolates, respectively. Molecular identification of the 17 isolates bearing the ars genotype was carried out using 16S rDNA sequencing. A 1300 bp full-length arsB gene encoding the arsenite efflux pump and a 409 bp fragment of the arsC gene coding for arsenate reductase were isolated from the genera Halomonas and Acinetobacter. Phylogenetic analysis of arsB and arsC genes indicated their close genetic relationship with plasmid-borne ars genes of E. coli and arsenate reductase of plant origin. The putative arsenate reductase gene isolated from Acinetobacter species complemented arsenate resistance in E. coli WC3110 and JM109, validating its function. This study, dealing with the isolation of native arsenic-resistant bacteria and characterization of their ars genes, might be useful to develop efficient arsenic detoxification strategies for arsenic-contaminated aquifers.
Evolution and expression analysis of the grape (Vitis vinifera L.) WRKY gene family.

PubMed

Guo, Chunlei; Guo, Rongrong; Xu, Xiaozhao; Gao, Min; Li, Xiaoqin; Song, Junyang; Zheng, Yi; Wang, Xiping

2014-04-01

WRKY proteins comprise a large family of transcription factors that play important roles in plant defence regulatory networks, including responses to various biotic and abiotic stresses. To date, no large-scale study of WRKY genes has been undertaken in grape (Vitis vinifera L.). In this study, a total of 59 putative grape WRKY genes (VvWRKY) were identified and renamed on the basis of their respective chromosome distribution. A multiple sequence alignment analysis using all predicted grape WRKY gene coding sequences, together with those from Arabidopsis thaliana and tomato (Solanum lycopersicum), indicated that the 59 VvWRKY genes can be classified into three main groups (I-III). An evaluation of the duplication events suggested that several WRKY genes arose before the divergence of the grape and Arabidopsis lineages. Moreover, expression profiles derived from semiquantitative PCR and real-time quantitative PCR analyses showed distinct expression patterns in various tissues and in response to different treatments. Four VvWRKY genes showed a significantly higher expression in roots or leaves, 55 responded to varying degrees to at least one abiotic stress treatment, and the expression of 38 was altered following powdery mildew (Erysiphe necator) infection. Most VvWRKY genes were downregulated in response to abscisic acid or salicylic acid treatments, while the expression of a subset was upregulated by methyl jasmonate or ethylene treatments.
Evolution and expression analysis of the grape (Vitis vinifera L.) WRKY gene family

PubMed Central

Guo, Chunlei; Guo, Rongrong; Wang, Xiping

2014-01-01

WRKY proteins comprise a large family of transcription factors that play important roles in plant defence regulatory networks, including responses to various biotic and abiotic stresses. To date, no large-scale study of WRKY genes has been undertaken in grape (Vitis vinifera L.). In this study, a total of 59 putative grape WRKY genes (VvWRKY) were identified and renamed on the basis of their respective chromosome distribution. A multiple sequence alignment analysis using all predicted grape WRKY gene coding sequences, together with those from Arabidopsis thaliana and tomato (Solanum lycopersicum), indicated that the 59 VvWRKY genes can be classified into three main groups (I–III). An evaluation of the duplication events suggested that several WRKY genes arose before the divergence of the grape and Arabidopsis lineages. Moreover, expression profiles derived from semiquantitative PCR and real-time quantitative PCR analyses showed distinct expression patterns in various tissues and in response to different treatments. Four VvWRKY genes showed a significantly higher expression in roots or leaves, 55 responded to varying degrees to at least one abiotic stress treatment, and the expression of 38 was altered following powdery mildew (Erysiphe necator) infection. Most VvWRKY genes were downregulated in response to abscisic acid or salicylic acid treatments, while the expression of a subset was upregulated by methyl jasmonate or ethylene treatments. PMID:24510937
Evolution of Synonymous Codon Usage in Neurospora tetrasperma and Neurospora discreta

PubMed Central

Whittle, C. A.; Sun, Y.; Johannesson, H.

2011-01-01

Neurospora comprises a primary model system for the study of fungal genetics and biology. In spite of this, little is known about genome evolution in Neurospora. For example, the evolution of synonymous codon usage is largely unknown in this genus. In the present investigation, we conducted a comprehensive analysis of synonymous codon usage and its relationship to gene expression and gene length (GL) in Neurospora tetrasperma and Neurospora discreta. For our analysis, we examined codon usage among 2,079 genes per organism and assessed gene expression using large-scale expressed sequence tag (EST) data sets (279,323 and 453,559 ESTs for N. tetrasperma and N. discreta, respectively). Data on relative synonymous codon usage revealed 24 codons (and two putative codons) that are more frequently used in genes with high than with low expression and thus were defined as optimal codons. Although codon-usage bias was highly correlated with gene expression, it was independent of selectively neutral base composition (introns), thus demonstrating that translational selection drives synonymous codon usage in these genomes. We also report that GL (coding sequences [CDS]) was inversely associated with optimal codon usage at each gene expression level, with highly expressed short genes having the greatest frequency of optimal codons. Optimal codon frequency was moderately higher in N. tetrasperma than in N. discreta, which might be due to variation in selective pressures and/or mating systems. PMID:21402862
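Relative synonymous codon usage (RSCU), the quantity behind the optimal-codon definition in the abstract above, is conventionally the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. The sketch below shows that calculation for a single coding sequence under the standard genetic code. The function name `rscu` and the toy sequence are illustrative assumptions. The authors' actual procedure, which compared codon usage between highly and lowly expressed gene sets, is not reproduced here.

```python
from collections import Counter
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def rscu(cds):
    """Relative synonymous codon usage for one in-frame coding sequence.

    RSCU(codon) = count(codon) * (number of synonymous codons) / (count of the amino acid).
    Amino acids absent from the sequence and stop codons are skipped.
    """
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    counts = Counter(cds[i:i + 3] for i in range(0, usable, 3))

    # Group codons into synonymous families by encoded amino acid.
    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)

    values = {}
    for aa, family in families.items():
        total = sum(counts[c] for c in family)
        if aa == "*" or total == 0:
            continue
        for codon in family:
            values[codon] = counts[codon] * len(family) / total
    return values


# Hypothetical toy CDS: four alanine codons, three GCT and one GCC.
toy = rscu("GCTGCTGCTGCC")
print(toy["GCT"], toy["GCC"], toy["GCA"])  # prints 3.0 1.0 0.0
```

An RSCU above 1 means a codon is used more often than expected under equal usage. Comparing such values between high-expression and low-expression gene sets is one common way to nominate optimal codons.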
De Novo Assembly of the Japanese Flounder (Paralichthys olivaceus) Spleen Transcriptome to Identify Putative Genes Involved in Immunity

PubMed Central

Huang, Lin; Li, Guiyang; Mo, Zhaolan; Xiao, Peng; Li, Jie; Huang, Jie

2015-01-01

Background: Japanese flounder (Paralichthys olivaceus) is an economically important marine fish in Asia that has suffered from disease outbreaks caused by various pathogens, which creates a need for more genome-level information on immune-relevant genes. However, genomic and transcriptomic data for Japanese flounder remain scarce, which limits studies on the immune system of this species. In this study, we characterized the Japanese flounder spleen transcriptome using an Illumina paired-end sequencing platform to identify putative genes involved in immunity. Methodology/Principal Findings: A cDNA library from the spleen of P. olivaceus was constructed and randomly sequenced using an Illumina technique. The removal of low-quality reads generated 12,196,968 trimmed reads, which assembled into 96,627 unigenes. A total of 21,391 unigenes (22.14%) were annotated in the NCBI Nr database, and only 1.1% of the BLASTx top hits matched P. olivaceus protein sequences. Approximately 12,503 (58.45%) unigenes were categorized into three Gene Ontology groups, 19,547 (91.38%) were classified into 26 Cluster of Orthologous Groups, and 10,649 (49.78%) were assigned to six Kyoto Encyclopedia of Genes and Genomes pathways. Furthermore, 40,928 putative simple sequence repeats and 47,362 putative single nucleotide polymorphisms were identified. Importantly, we identified 1,563 putative immune-associated unigenes that mapped to 15 immune signaling pathways. Conclusions/Significance: The P. olivaceus transcriptome data provide a rich source for discovering and identifying new genes, and the immune-relevant sequences identified here will facilitate our understanding of the mechanisms involved in the immune response. Furthermore, the plentiful potential SSRs and SNPs found in this study are important resources for the future development of a linkage map or marker-assisted breeding programs for the flounder. PMID:25723398
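The percentages in the abstract above use two different denominators, which is easy to miss on a first read: the Nr annotation rate is relative to all 96,627 unigenes, whereas the GO, COG and KEGG rates are relative to the 21,391 annotated unigenes. The short check below reproduces the reported figures from the counts given in the abstract. The helper name `percent` and the variable names are illustrative assumptions.

```python
def percent(part, whole):
    """Percentage of `whole` represented by `part`, rounded to two decimals."""
    return round(100 * part / whole, 2)


# Counts taken from the abstract.
TOTAL_UNIGENES = 96_627
NR_ANNOTATED = 21_391
GO_ASSIGNED = 12_503
COG_ASSIGNED = 19_547
KEGG_ASSIGNED = 10_649

# Nr annotation rate is relative to all assembled unigenes.
print(percent(NR_ANNOTATED, TOTAL_UNIGENES))  # prints 22.14

# GO, COG and KEGG rates are relative to the Nr-annotated subset.
print(percent(GO_ASSIGNED, NR_ANNOTATED))     # prints 58.45
print(percent(COG_ASSIGNED, NR_ANNOTATED))    # prints 91.38
print(percent(KEGG_ASSIGNED, NR_ANNOTATED))   # prints 49.78
```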

The De Novo Transcriptome and Its Functional Annotation in the Seed Beetle Callosobruchus maculatus.

PubMed

Sayadi, Ahmed; Immonen, Elina; Bayram, Helen; Arnqvist, Göran

2016-01-01

Despite their unparalleled biodiversity, the genomic resources available for beetles (Coleoptera) remain relatively scarce. We present an integrative and high-quality annotated transcriptome of the beetle Callosobruchus maculatus, an important and cosmopolitan agricultural pest as well as an emerging model species in ecology and evolutionary biology. Using Illumina sequencing technology, we sequenced 492 million read pairs generated from 51 samples of different developmental stages (larvae, pupae and adults) of C. maculatus. Reads were de novo assembled using the Trinity software, into a single combined assembly as well as into three separate assemblies based on data from the different developmental stages. The combined assembly generated 218,192 transcripts and 145,883 putative genes. Putative genes were annotated with the Blast2GO software and the Trinotate pipeline. In total, 33,216 putative genes were successfully annotated using Blastx against the Nr (non-redundant) database and 13,382 were assigned to 34,100 Gene Ontology (GO) terms. We classified 5,475 putative genes into Clusters of Orthologous Groups (COG) and 116 metabolic pathway maps were predicted based on the annotation. Our analyses suggested that transcriptional specificity increases with ontogeny. For example, out of 33,216 annotated putative genes, 51 were only expressed in larvae, 63 only in pupae and 171 only in adults. Our study illustrates the importance of including samples from several developmental stages when the aim is to provide an integrative and high-quality annotated transcriptome. Our results will represent an invaluable resource for those working with the ecology, evolution and pest control of C. maculatus, as well as for comparative studies of the transcriptomics and genomics of beetles more generally.
The De Novo Transcriptome and Its Functional Annotation in the Seed Beetle Callosobruchus maculatus

PubMed Central

Sayadi, Ahmed; Immonen, Elina; Bayram, Helen

2016-01-01

Despite their unparalleled biodiversity, the genomic resources available for beetles (Coleoptera) remain relatively scarce. We present an integrative and high quality annotated transcriptome of the beetle Callosobruchus maculatus, an important and cosmopolitan agricultural pest as well as an emerging model species in ecology and evolutionary biology. Using Illumina sequencing technology, we sequenced 492 million read pairs generated from 51 samples of different developmental stages (larvae, pupae and adults) of C. maculatus. Reads were de novo assembled using the Trinity software, into a single combined assembly as well as into three separate assemblies based on data from the different developmental stages. The combined assembly generated 218,192 transcripts and 145,883 putative genes. Putative genes were annotated with the Blast2GO software and the Trinotate pipeline. In total, 33,216 putative genes were successfully annotated using Blastx against the Nr (non-redundant) database and 13,382 were assigned to 34,100 Gene Ontology (GO) terms. We classified 5,475 putative genes into Clusters of Orthologous Groups (COG), and 116 metabolic pathway maps were predicted based on the annotation. Our analyses suggested that transcriptional specificity increases with ontogeny. For example, out of 33,216 annotated putative genes, 51 were only expressed in larvae, 63 only in pupae and 171 only in adults. Our study illustrates the importance of including samples from several developmental stages when the aim is to provide an integrative and high quality annotated transcriptome. Our results will represent an invaluable resource for those working with the ecology, evolution and pest control of C. maculatus, as well as for comparative studies of the transcriptomics and genomics of beetles more generally. PMID:27442123
Single nucleotide primer extension to detect genetic diseases: Experimental application to hemophilia B (factor IX) and cystic fibrosis genes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kuppuswamy, M.N.; Hoffmann, J.W.; Spitzer, S.G.

1991-02-15

In this report, the authors describe an approach to detect the presence of abnormal alleles in those genetic diseases in which frequency of occurrence of the same mutation is high (e.g., hemophilia B). Initially, from each subject, the DNA fragment containing the putative mutation site is amplified by the polymerase chain reaction. For each fragment two reaction mixtures are then prepared. Each contains the amplified fragment, a primer (18-mer or longer) whose sequence is identical to the coding sequence of the normal gene immediately flanking the 5′ end of the mutation site, and either an α-³²P-labeled nucleotide corresponding to the normal coding sequence at the mutation site or an α-³²P-labeled nucleotide corresponding to the mutant sequence. An essential feature of the present methodology is that the base immediately 3′ to the template-bound primer is one of those altered in the mutant, since in this way an extension of the primer by a single base will give an extended molecule characteristic of either the mutant or the wild type. The method is rapid and should be useful in carrier detection and prenatal diagnosis of every genetic disease with a known sequence variation.
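The two-reaction readout described in this abstract can be sketched in a few lines. This is only an illustrative model, not the authors' protocol: each allele is represented by its coding-strand base at the mutation site, and the function names `extension_signals` and `interpret` are hypothetical. A reaction is assumed to give a labeled, single-base-extended primer only when the supplied nucleotide matches a base present in the subject's amplified fragment.

```python
from typing import Iterable, Tuple


def extension_signals(alleles: Iterable[str], normal_base: str,
                      mutant_base: str) -> Tuple[bool, bool]:
    """Return (normal_signal, mutant_signal) for the two reaction mixtures.

    Assumption: each allele is given as the coding-strand base at the
    mutation site, and a reaction yields signal only if its labeled
    nucleotide matches at least one allele in the amplified fragment.
    """
    present = {a.upper() for a in alleles}
    return normal_base.upper() in present, mutant_base.upper() in present


def interpret(alleles: Iterable[str], normal_base: str, mutant_base: str) -> str:
    """Summarize which alleles the pair of reactions would reveal."""
    normal_signal, mutant_signal = extension_signals(alleles, normal_base, mutant_base)
    if normal_signal and mutant_signal:
        return "normal and mutant alleles detected"
    if mutant_signal:
        return "mutant allele only"
    if normal_signal:
        return "normal allele only"
    return "no extension signal"


if __name__ == "__main__":
    # Hypothetical C>T substitution; a subject carrying one allele of each kind.
    print(interpret(["C", "T"], normal_base="C", mutant_base="T"))
```

Signal in both reactions would be the pattern expected for a carrier, which is why the abstract highlights carrier detection as an application.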
Comparative genome analysis of 24 bovine-associated Staphylococcus isolates with special focus on the putative virulence genes

PubMed Central

Åvall-Jääskeläinen, Silja; Paulin, Lars; Blom, Jochen

2018-01-01

Non-aureus staphylococci (NAS) are most commonly isolated from subclinical mastitis. Different NAS species may, however, have diverse effects on the inflammatory response in the udder. We determined the genome sequences of 20 staphylococcal isolates from clinical or subclinical bovine mastitis, belonging to the NAS species Staphylococcus agnetis, S. chromogenes, and S. simulans, and focused on the putative virulence factor genes present in the genomes. For comparison we used our previously published genome sequences of four S. aureus isolates from bovine mastitis. The pan-genome and core genomes of the non-aureus isolates were characterized. After that, putative virulence factor orthologues were searched in silico. We compared the presence of putative virulence factors in the NAS species and S. aureus and evaluated the potential association between bacterial genotype and type of mastitis (clinical vs. subclinical). The NAS isolates had far fewer virulence gene orthologues than the S. aureus isolates. One third of the virulence genes were detected only in S. aureus. About 100 virulence genes were present in all S. aureus isolates, compared to about 40 to 50 in each NAS isolate. S. simulans differed the most. Several of the virulence genes detected among NAS were harbored only by S. simulans, but it also lacked a number of genes present in both S. agnetis and S. chromogenes. The type of mastitis was not associated with any specific virulence gene profile. It seems that the virulence gene profiles or cumulative number of different virulence genes are not directly associated with the type of mastitis (clinical or subclinical), indicating that host-derived factors such as the immune status play a pivotal role in the manifestation of mastitis. PMID:29610707
Analysis of gene expression provides insights into the mechanism of cadmium tolerance in Acidithiobacillus ferrooxidans.

PubMed

Chen, Minjie; Li, Yanjun; Zhang, Li; Wang, Jianying; Zheng, Chunli; Zhang, Xuefeng

2015-02-01

Acidithiobacillus ferrooxidans plays a critical role in metal solubilization in the biomining industry, and occupies an ecological niche characterized by high acidity and high concentrations of toxic heavy metal ions. In order to investigate the possible metal resistance mechanism, the cellular distribution of cadmium was tested. The result indicated that Cd(2+) entered the cells upon initial exposure resulting in increased intracellular concentrations, followed by its excretion from the cells during subsequent growth and adaptation. Sequence homology analyses were used to identify 10 genes predicted to participate in heavy metal homeostasis, and the expression of these genes was investigated in cells cultured in the presence of increasing concentrations of toxic divalent cadmium (Cd(2+)). The results suggested that one gene (cmtR A.f) encoded a putative Cd(2+)/Pb(2+)-responsive transcriptional regulator; four genes (czcA1 A.f, czcA2 A.f, czcB1 A.f, and czcC1 A.f) encoded heavy metal efflux proteins for Cd(2+); two genes (cadA1 A.f and cadB1 A.f) encoded putative cation channel proteins related to the transport of Cd(2+). No significant enhancement of gene expression was observed at low concentrations of Cd(2+) (5 mM) and most of the putative metal resistance genes were up-regulated except cmtR A.f, cadB3 A.f, and czcB1 A.f at higher concentrations (15 and 30 mM) according to real-time polymerase chain reaction. A model was developed for the mechanism of resistance to cadmium ions based on homology analyses of the predicted genes, the transcription of putative Cd(2+) resistance genes, and previous work.
Regulation of Bacteriocin Production in Streptococcus mutans by the Quorum-Sensing System Required for Development of Genetic Competence

PubMed Central

van der Ploeg, Jan R.

2005-01-01

In Streptococcus mutans, competence for genetic transformation and biofilm formation are dependent on the two-component signal transduction system ComDE together with the inducer peptide pheromone competence-stimulating peptide (CSP) (encoded by comC). Here, it is shown that the same system is also required for expression of the nlmAB genes, which encode a two-peptide nonlantibiotic bacteriocin. Expression from a transcriptional nlmAB′-lacZ fusion was highest at high cell density and was increased up to 60-fold following addition of CSP, but it was abolished when the comDE genes were interrupted. Two more genes, encoding another putative bacteriocin and a putative bacteriocin immunity protein, were also regulated by this system. The regions upstream of these genes and of two further putative bacteriocin-encoding genes and a gene encoding a putative bacteriocin immunity protein contained a conserved 9-bp repeat element just upstream of the transcription start, which suggests that expression of these genes is also dependent on the ComCDE regulatory system. Mutations in the repeat element of the nlmAB promoter region led to a decrease in CSP-dependent expression of nlmAB′-lacZ. In agreement with these results, a comDE mutant and mutants unable to synthesize or export CSP did not produce bacteriocins. It is speculated that, at high cell density, bacteriocin production is induced to liberate DNA from competing streptococci. PMID:15937160
Connection between nitrogen and manganese cycles revealed by transcriptomic analysis in Shewanella algae C6G3

NASA Astrophysics Data System (ADS)

Michotey, V.; Aigle, A.; Armougom, F.; Mejean, V.; Guasco, S.; Bonin, P.

2016-02-01

In sedimentary systems, the distribution of terminal electron-accepting molecules is often stratified on a permanent or seasonal basis. Just below the oxic zone, the suboxic zone is characterized by high concentrations of oxidized inorganic compounds such as nitrate, manganese oxides (MnIII/IV) and iron oxides that are in close vicinity. Several studies have reported unexpected anaerobic nitrite/nitrate production at the expense of ammonium mediated by MnIII/IV; however, this transient process is difficult to discern and poorly understood. In this study, the gene organization of nitrate and MnIII/IV respiration was investigated in S. algae. Additional genes were identified in S. algae compared to S. oneidensis: genes coding for nitrate and nitrite reductases (napA-a and nrfA-2) and an OMC protein (mtrH). In contrast to S. oneidensis, an anaerobic transitory nitrite accumulation at the expense of ammonium was observed in S. algae during growth with MnIII/IV, concomitantly with expression of nitrate/nitrite reductase genes (napA, nrfA, nrfA-2). Among the hypotheses explaining these data, the putative expression of an unidentified gene able to perform ammonium oxidation was not observed at the global transcriptional level; however, several signs of oxidative stress were detected and the existence of a secondary reaction generated by a putative oxidative s could not be excluded. Another option could be a reverse reaction catalyzed by an enzyme such as NrfA or NrfA-2 owing to the electron flow equilibrium. Whatever the electron acceptor (nitrate or MnIII/IV), an unexpected expression level of omcA, mtrF, mtrH and mtrC was observed, peaking at the end of the exponential phase. Different expression patterns of the omc genes were observed depending on electron acceptor and growth phase. Only the mtrF-2 gene was specifically expressed in the Mn(III/IV) condition. Nitrate and Mn(III/IV) respirations seem connected at the physiological as well as the transcriptional level.
Sequence and Role in Virulence of the Three Plasmid Complement of the Model Tumor-Inducing Bacterium Pseudomonas savastanoi pv. savastanoi NCPPB 3335

PubMed Central

Bardaji, Leire; Pérez-Martínez, Isabel; Rodríguez-Moreno, Luis; Rodríguez-Palenzuela, Pablo; Sundin, George W.; Ramos, Cayo; Murillo, Jesús

2011-01-01

Pseudomonas savastanoi pv. savastanoi NCPPB 3335 is a model for the study of the molecular basis of disease production and tumor formation in woody hosts, and its draft genome sequence has been recently obtained. Here we closed the sequence of the plasmid complement of this strain, composed of three circular molecules of 78,357 nt (pPsv48A), 45,220 nt (pPsv48B), and 42,103 nt (pPsv48C), all belonging to the pPT23A-like family of plasmids widely distributed in the P. syringae complex. A total of 152 coding sequences were predicted in the plasmid complement, of which 38 are hypothetical proteins and seven correspond to putative virulence genes. Plasmid pPsv48A contains an incomplete Type IVB secretion system, the type III secretion system (T3SS) effector gene hopAF1, gene ptz, involved in cytokinin biosynthesis, and three copies of a gene highly conserved in plant-associated proteobacteria, which is preceded by a hrp box motif. A complete Type IVA secretion system, a well conserved origin of transfer (oriT), and a homolog of the T3SS effector gene hopAO1 are present in pPsv48B, while pPsv48C contains a gene with significant homology to isopentenyl-diphosphate delta-isomerase, type 1. Several potential mobile elements were found on the three plasmids, including three types of MITE, a derivative of IS801, and a new transposon effector, ISPsy30. Although the replication regions of these three plasmids are phylogenetically closely related, their structure is diverse, suggesting that the plasmid architecture results from an active exchange of sequences. Artificial inoculations of olive plants with mutants cured of plasmids pPsv48A and pPsv48B showed that pPsv48A is necessary for full virulence and for the development of mature xylem vessels within the knots; we were unable to obtain mutants cured of pPsv48C, which contains five putative toxin-antitoxin genes. PMID:22022435
Genetic toxicology of putative nongenotoxic carcinogens

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jackson, M.A.; Stack, H.F.; Waters, M.D.

1993-01-01

The report examines a group of putative nongenotoxic carcinogens that have been cited in the published literature. Using short-term test data from the US Environmental Protection Agency/International Agency for Research on Cancer genetic activity profile (EPA/IARC GAP) database, these agents are classified on the basis of their mutagenicity, emphasizing three genetic endpoints: gene mutation, chromosomal aberration and aneuploidy. On the basis of results of short-term tests for these effects, criteria were defined for evidence of mutagenicity (and nonmutagenicity), and these criteria were applied in classifying the group of putative nongenotoxic carcinogens. The results from this evaluation based on the EPA/IARC GAP database are presented along with a summary of the short-term test data for each chemical and the relevant carcinogenicity results from the NTP, Gene-Tox and IARC databases. The data clearly demonstrate that many of the putative nongenotoxic carcinogens that have been adequately tested in short-term bioassays induce gene or chromosomal mutations or aneuploidy.
Integrated genome sequence and linkage map of physic nut (Jatropha curcas L.), a biodiesel plant.

PubMed

Wu, Pingzhi; Zhou, Changpin; Cheng, Shifeng; Wu, Zhenying; Lu, Wenjia; Han, Jinli; Chen, Yanbo; Chen, Yan; Ni, Peixiang; Wang, Ying; Xu, Xun; Huang, Ying; Song, Chi; Wang, Zhiwen; Shi, Nan; Zhang, Xudong; Fang, Xiaohua; Yang, Qing; Jiang, Huawu; Chen, Yaping; Li, Meiru; Wang, Ying; Chen, Fan; Wang, Jun; Wu, Guojiang

2015-03-01

The family Euphorbiaceae includes some of the most efficient biomass accumulators. Whole genome sequencing and the development of genetic maps of these species are important components in molecular breeding and genetic improvement. Here we report the draft genome of physic nut (Jatropha curcas L.), a biodiesel plant. The assembled genome has a total length of 320.5 Mbp and contains 27,172 putative protein-coding genes. We established a linkage map containing 1208 markers and anchored the genome assembly (81.7%) to this map to produce 11 pseudochromosomes. After gene family clustering, 15,268 families were identified, of which 13,887 existed in the castor bean genome. Analysis of the genome highlighted specific expansion and contraction of a number of gene families during the evolution of this species, including the ribosome-inactivating proteins and oil biosynthesis pathway enzymes. The genomic sequence and linkage map provide a valuable resource not only for fundamental and applied research on physic nut but also for evolutionary and comparative genomics analysis, particularly in the Euphorbiaceae. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.
Cloning, Sequencing, and Role in Virulence of Two Phospholipases (A1 and C) from Mesophilic Aeromonas sp. Serogroup O:34

PubMed Central

Merino, Susana; Aguilar, Alicia; Nogueras, Maria Mercedes; Regue, Miguel; Swift, Simon; Tomás, Juan M.

1999-01-01

Two different representative recombinant clones encoding Aeromonas hydrophila lipases were found upon screening on tributyrin (phospholipase A1) and egg yolk agar (lecithinase-phospholipase C) plates of a cosmid-based genomic library of Aeromonas hydrophila AH-3 (serogroup O34) introduced into Escherichia coli DH5α. Subcloning, nucleotide sequencing, and in vitro-coupled transcription-translation experiments showed that the phospholipase A1 (pla) and C (plc) genes code for an 83-kDa putative lipoprotein and a 65-kDa protein, respectively. Defined insertion mutants of A. hydrophila AH-3 defective in either pla or plc genes were defective in phospholipase A1 and C activities, respectively. Lecithinase (phospholipase C) was shown to be cytotoxic but nonhemolytic or poorly hemolytic. A. hydrophila AH-3 plc mutants showed a more than 10-fold increase in their 50% lethal dose on fish and mice, and complementation of the plc single gene on these mutants abolished this effect, suggesting that Plc protein is a virulence factor in the mesophilic Aeromonas sp. serogroup O:34 infection process. PMID:10417167
An in silico pipeline to filter the Toxoplasma gondii proteome for proteins that could traffic to the host cell nucleus and influence host cell epigenetic regulation.

PubMed

Syn, Genevieve; Blackwell, Jenefer M; Jamieson, Sarra E; Francis, Richard W

2018-01-01

Toxoplasma gondii uses epigenetic mechanisms to regulate both endogenous and host cell gene expression. To identify genes with putative epigenetic functions, we developed an in silico pipeline to interrogate the T. gondii proteome of 8313 proteins. Step 1 employs PredictNLS and NucPred to identify genes predicted to target eukaryotic nuclei. Step 2 uses GOLink to identify proteins of epigenetic function based on Gene Ontology terms. This resulted in 611 putative nuclear localised proteins with predicted epigenetic functions. Step 3 filtered for secretory proteins using SignalP, SecretomeP, and experimental data. This identified 57 of the 611 putative epigenetic proteins as likely to be secreted. The pipeline is freely available online, uses open access tools and software with user-friendly Perl scripts to automate and manage the results, and is readily adaptable to undertake any such in silico search for genes contributing to particular functions.
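The three filtering steps in this abstract amount to successive narrowing of a protein list. The sketch below is a minimal illustration under stated assumptions, not the published Perl pipeline: the `Protein` record and its boolean fields are hypothetical stand-ins for the outputs of PredictNLS/NucPred, GOLink, and SignalP/SecretomeP/experimental evidence, and none of those tools is actually called here.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Protein:
    protein_id: str
    nuclear_targeted: bool  # hypothetical flag: predicted to target the nucleus (step 1)
    epigenetic_go: bool     # hypothetical flag: epigenetic function by GO terms (step 2)
    secreted: bool          # hypothetical flag: predicted or shown to be secreted (step 3)


def run_pipeline(proteome: List[Protein]) -> Tuple[List[Protein], List[Protein], List[Protein]]:
    """Apply the three filters in order and return the survivors of each step."""
    step1 = [p for p in proteome if p.nuclear_targeted]
    step2 = [p for p in step1 if p.epigenetic_go]
    step3 = [p for p in step2 if p.secreted]
    return step1, step2, step3


if __name__ == "__main__":
    # Toy proteome with invented identifiers, for illustration only.
    toy = [
        Protein("prot_A", True, True, True),
        Protein("prot_B", True, True, False),
        Protein("prot_C", True, False, True),
        Protein("prot_D", False, True, True),
    ]
    s1, s2, s3 = run_pipeline(toy)
    print(len(toy), len(s1), len(s2), len(s3))
```

In the study itself the corresponding counts are 8313 proteins at the start, 611 after steps 1 and 2, and 57 after step 3.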
Identification and Characterization of Putative Integron-Like Elements of the Heavy-Metal-Hypertolerant Strains of Pseudomonas spp.

PubMed

Ciok, Anna; Adamczuk, Marcin; Bartosik, Dariusz; Dziewit, Lukasz

2016-11-28

Pseudomonas strains isolated from the heavily contaminated Lubin copper mine and Zelazny Most post-flotation waste reservoir in Poland were screened for the presence of integrons. This analysis revealed that two strains carried homologous DNA regions composed of a gene encoding a DNA_BRE_C domain-containing tyrosine recombinase (with no significant sequence similarity to other integrases of integrons) plus a three-component array of putative integron gene cassettes. The predicted gene cassettes encode three putative polypeptides with homology to (i) transmembrane proteins, (ii) GCN5 family acetyltransferases, and (iii) hypothetical proteins of unknown function (homologous proteins are encoded by the gene cassettes of several class 1 integrons). Comparative sequence analyses identified three structural variants of these novel integron-like elements within the sequenced bacterial genomes. Analysis of their distribution revealed that they are found exclusively in strains of the genus Pseudomonas.
Age-associated microbiome shows the giant panda lives on hemicelluloses, not on cellulose.

PubMed

Zhang, Wenping; Liu, Wenbin; Hou, Rong; Zhang, Liang; Schmitz-Esser, Stephan; Sun, Huaibo; Xie, Junjin; Zhang, Yunfei; Wang, Chengdong; Li, Lifeng; Yue, Bisong; Huang, He; Wang, Hairui; Shen, Fujun; Zhang, Zhihe

2018-05-01

The giant panda feeds almost exclusively on bamboo, a diet highly enriched in lignin and cellulose, but is characterized by a digestive tract similar to that of carnivores. It is still largely unknown if and how the giant panda gut microbiota contributes to lignin and cellulose degradation. Here we show that the giant panda's gut microbiota does not significantly contribute to cellulose and lignin degradation. We found that no operational taxonomic unit had a nearest neighbor identified as a cellulolytic species or strain with a significantly higher abundance in juveniles than in cubs, that a very low abundance of putative lignin and cellulose genes existed in part of the analyzed samples, and that there was a significantly higher abundance of genes involved in starch and hemicellulose degradation in juveniles than in cubs. Moreover, a significantly lower abundance of putative cellulolytic genes and a significantly higher abundance of putative α-amylase and hemicellulase gene families were present in giant pandas than in omnivores or herbivores.
Genes Involved in Anaerobic Metabolism of Phenol in the Bacterium Thauera aromatica

PubMed Central

Breinig, Sabine; Schiltz, Emile; Fuchs, Georg

2000-01-01

Genes involved in the anaerobic metabolism of phenol in the denitrifying bacterium Thauera aromatica have been studied. The first two committed steps in this metabolism appear to be phosphorylation of phenol to phenylphosphate by an unknown phosphoryl donor (“phenylphosphate synthase”) and subsequent carboxylation of phenylphosphate to 4-hydroxybenzoate under release of phosphate (“phenylphosphate carboxylase”). Both enzyme activities are strictly phenol induced. Two-dimensional gel electrophoresis allowed identification of several phenol-induced proteins. Based on N-terminal and internal amino acid sequences of such proteins, degenerate oligonucleotides were designed to identify the corresponding genes. A chromosomal DNA segment of about 14 kbp was sequenced which contained 10 genes transcribed in the same direction. These are organized in two adjacent gene clusters and include the genes coding for five identified phenol-induced proteins. Comparison with sequences in the databases revealed the following similarities: the gene products of two open reading frames (ORFs) are each similar to either the central part or the N-terminal part of phosphoenolpyruvate synthases. We propose that these ORFs are components of the phenylphosphate synthase system. Three ORFs showed similarity to the ubiD gene product, 3-octaprenyl-4-hydroxybenzoate carboxy lyase; UbiD catalyzes the decarboxylation of a 4-hydroxybenzoate analogue in ubiquinone biosynthesis. Another ORF was similar to the ubiX gene product, an isoenzyme of UbiD. We propose that (some of) these four proteins are involved in the carboxylation of phenylphosphate. A 700-bp PCR product derived from one of these ORFs cross-hybridized with DNA from different Thauera and Azoarcus strains, even from those which have not been reported to grow with phenol. One ORF showed similarity to the mutT gene product, and three ORFs showed no strong similarities to sequences in the databases.
Upstream of the first gene cluster, an ORF which is transcribed in the opposite direction codes for a protein highly similar to the DmpR regulatory protein of Pseudomonas putida. DmpR controls transcription of the genes of aerobic phenol metabolism, suggesting a similar regulation of anaerobic phenol metabolism by the putative regulator. PMID:11004186
Diversity and Divergence of Dinoflagellate Histone Proteins

PubMed Central

Marinov, Georgi K.; Lynch, Michael

2015-01-01

Histone proteins and the nucleosomal organization of chromatin are near-universal eukaryotic features, with the exception of dinoflagellates. Previous studies have suggested that histones do not play a major role in the packaging of dinoflagellate genomes, although several genomic and transcriptomic surveys have detected a full set of core histone genes. Here, transcriptomic and genomic sequence data from multiple dinoflagellate lineages are analyzed, and the diversity of histone proteins and their variants characterized, with particular focus on their potential post-translational modifications and the conservation of the histone code. In addition, the set of putative epigenetic mark readers and writers, chromatin remodelers and histone chaperones is examined. Dinoflagellates clearly express the most derived set of histones among all autonomous eukaryote nuclei, consistent with a combination of relaxation of sequence constraints imposed by the histone code and the presence of numerous specialized histone variants. The histone code itself appears to have diverged significantly in some of its components, yet others are conserved, implying conservation of the associated biochemical processes. Specifically, and with major implications for the function of histones in dinoflagellates, the results presented here strongly suggest that transcription through nucleosomal arrays happens in dinoflagellates. Finally, the plausible roles of histones in dinoflagellate nuclei are discussed. PMID:26646152
Exome sequencing in an admixed isolated population indicates NFXL1 variants confer a risk for specific language impairment.

PubMed

Villanueva, Pía; Nudel, Ron; Hoischen, Alexander; Fernández, María Angélica; Simpson, Nuala H; Gilissen, Christian; Reader, Rose H; Jara, Lillian; Echeverry, María Magdalena; Francks, Clyde; Baird, Gillian; Conti-Ramsden, Gina; O'Hare, Anne; Bolton, Patrick F; Hennessy, Elizabeth R; Palomino, Hernán; Carvajal-Carmona, Luis; Veltman, Joris A; Cazier, Jean-Baptiste; De Barbieri, Zulema; Fisher, Simon E; Newbury, Dianne F

2015-03-01

Children affected by Specific Language Impairment (SLI) fail to acquire age-appropriate language skills despite adequate intelligence and opportunity. SLI is highly heritable, but the understanding of underlying genetic mechanisms has proved challenging. In this study, we use molecular genetic techniques to investigate an admixed isolated founder population from the Robinson Crusoe Island (Chile), who are affected by a high incidence of SLI, increasing the power to discover contributory genetic factors. We utilize exome sequencing in selected individuals from this population to identify eight coding variants that are of putative significance. We then apply association analyses across the wider population to highlight a single rare coding variant (rs144169475, Minor Allele Frequency of 4.1% in admixed South American populations) in the NFXL1 gene that confers a nonsynonymous change (N150K) and is significantly associated with language impairment in the Robinson Crusoe population (p = 2.04 × 10⁻⁴, 8 variants tested). Subsequent sequencing of NFXL1 in 117 UK SLI cases identified four individuals with heterozygous variants predicted to be of functional consequence. We conclude that coding variants within NFXL1 confer an increased risk of SLI within a complex genetic model.
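The abstract reports the p-value together with the number of variants tested, which invites a quick multiple-testing check. The sketch below assumes a Bonferroni correction at a family-wise alpha of 0.05; the abstract does not state which correction or significance level the authors used, so both are assumptions made here for illustration, and the function names are hypothetical.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests


def passes_bonferroni(p_value: float, alpha: float, n_tests: int) -> bool:
    """True if p_value is below the Bonferroni-corrected threshold."""
    return p_value < bonferroni_threshold(alpha, n_tests)


if __name__ == "__main__":
    alpha = 0.05       # assumed family-wise error rate (not given in the abstract)
    n_variants = 8     # number of variants tested, from the abstract
    p_reported = 2.04e-4  # p-value reported in the abstract
    print(bonferroni_threshold(alpha, n_variants))
    print(passes_bonferroni(p_reported, alpha, n_variants))
```

Under these assumptions the per-variant threshold is 0.05 / 8 = 0.00625, and the reported p-value falls below it.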
The RNA world in the 21st century-a systems approach to finding non-coding keys to clinical questions.

PubMed

Schmitz, Ulf; Naderi-Meshkin, Hojjat; Gupta, Shailendra K; Wolkenhauer, Olaf; Vera, Julio

2016-05-01

Evidence that RNAs are a functionally rich class of molecules predates the arrival of next-generation sequencing technology. Non-coding RNAs (ncRNA) could be the key to accelerated diagnosis and enhanced prediction of disease and therapy outcomes as well as the design of advanced therapeutic strategies to overcome yet unsatisfactory approaches. In this review, we discuss the state of the art in RNA systems biology with focus on the application in the systems biomedicine field. We propose guidelines for analysing the role of microRNAs and long non-coding RNAs in human pathologies. We introduce RNA expression profiling and network approaches for the identification of stable and effective RNomics-based biomarkers, providing insights into the role of ncRNAs in disease regulation. Towards this, we discuss ways to model the dynamics of gene regulatory networks and signalling pathways that involve ncRNAs. We also describe data resources and computational methods for finding putative mechanisms of action of ncRNAs. Finally, we discuss avenues for the computer-aided design of novel RNA-based therapeutics. © The Author 2015. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
Highly tissue specific expression of Sphinx supports its male courtship related role in Drosophila melanogaster.

PubMed

Chen, Ying; Dai, Hongzheng; Chen, Sidi; Zhang, Luoying; Long, Manyuan

2011-04-26

Sphinx is a lineage-specific non-coding RNA gene involved in regulating courtship behavior in Drosophila melanogaster. The 5' flanking region of the gene is conserved across Drosophila species, with the proximal 300 bp being conserved out to D. virilis and a further 600 bp region being conserved amongst the melanogaster subgroup (D. melanogaster, D. simulans, D. sechellia, D. yakuba, and D. erecta). Using a green fluorescence protein transformation system, we demonstrated that a 253 bp region of the highly conserved segment was sufficient to drive sphinx expression in male accessory gland. GFP signals were also observed in brain, wing hairs and leg bristles. An additional ∼800 bp upstream region was able to enhance expression specifically in proboscis, suggesting the existence of enhancer elements. Using anti-GFP staining, we identified putative sphinx expression signal in the brain antennal lobe and inner antennocerebral tract, suggesting that sphinx might be involved in olfactory neuron mediated regulation of male courtship behavior. Whole genome expression profiling of the sphinx knockout mutation identified significant up-regulated gene categories related to accessory gland protein function and odor perception, suggesting sphinx might be a negative regulator of its target genes.
Highly Tissue Specific Expression of Sphinx Supports Its Male Courtship Related Role in Drosophila melanogaster

PubMed Central

Chen, Sidi; Zhang, Luoying; Long, Manyuan

2011-01-01

Sphinx is a lineage-specific non-coding RNA gene involved in regulating courtship behavior in Drosophila melanogaster. The 5′ flanking region of the gene is conserved across Drosophila species, with the proximal 300 bp being conserved out to D. virilis and a further 600 bp region being conserved amongst the melanogaster subgroup (D. melanogaster, D. simulans, D. sechellia, D. yakuba, and D. erecta). Using a green fluorescence protein transformation system, we demonstrated that a 253 bp region of the highly conserved segment was sufficient to drive sphinx expression in male accessory gland. GFP signals were also observed in brain, wing hairs and leg bristles. An additional ∼800 bp upstream region was able to enhance expression specifically in proboscis, suggesting the existence of enhancer elements. Using anti-GFP staining, we identified putative sphinx expression signal in the brain antennal lobe and inner antennocerebral tract, suggesting that sphinx might be involved in olfactory neuron mediated regulation of male courtship behavior. Whole genome expression profiling of the sphinx knockout mutation identified significant up-regulated gene categories related to accessory gland protein function and odor perception, suggesting sphinx might be a negative regulator of its target genes. PMID:21541324

Genomicus update 2015: KaryoView and MatrixView provide a genome-wide perspective to multispecies comparative genomics.

PubMed

Louis, Alexandra; Nguyen, Nga Thi Thuy; Muffato, Matthieu; Roest Crollius, Hugues

2015-01-01

The Genomicus web server (http://www.genomicus.biologie.ens.fr/genomicus) is a visualization tool allowing comparative genomics in four different phyla (Vertebrate, Fungi, Metazoan and Plants). It provides access to genomic information from extant species, as well as ancestral gene content and gene order for vertebrates and flowering plants. Here we present the new features available for vertebrate genome with a focus on new graphical tools. The interface to enter the database has been improved, two pairwise genome comparison tools are now available (KaryoView and MatrixView) and the multiple genome comparison tools (PhyloView and AlignView) propose three new kinds of representation and a more intuitive menu. These new developments have been implemented for Genomicus portal dedicated to vertebrates. This allows the analysis of 68 extant animal genomes, as well as 58 ancestral reconstructed genomes. The Genomicus server also provides access to ancestral gene orders, to facilitate evolutionary and comparative genomics studies, as well as computationally predicted regulatory interactions, thanks to the representation of conserved non-coding elements with their putative gene targets. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Soybean kinome: functional classification and gene expression patterns

PubMed Central

Liu, Jinyi; Chen, Nana; Grant, Joshua N.; Cheng, Zong-Ming (Max); Stewart, C. Neal; Hewezi, Tarek

2015-01-01

The protein kinase (PK) gene family is one of the largest and most highly conserved gene families in plants and plays a role in nearly all biological functions. While a large number of genes have been predicted to encode PKs in soybean, a comprehensive functional classification and global analysis of expression patterns of this large gene family is lacking. In this study, we identified the entire soybean PK repertoire or kinome, which comprised 2166 putative PK genes, representing 4.67% of all soybean protein-coding genes. The soybean kinome was classified into 19 groups, 81 families, and 122 subfamilies. The receptor-like kinase (RLK) group was remarkably large, containing 1418 genes. Collinearity analysis indicated that whole-genome segmental duplication events may have played a key role in the expansion of the soybean kinome, whereas tandem duplications might have contributed to the expansion of specific subfamilies. Gene structure, subcellular localization prediction, and gene expression patterns indicated extensive functional divergence of PK subfamilies. Global gene expression analysis of soybean PK subfamilies revealed tissue- and stress-specific expression patterns, implying regulatory functions over a wide range of developmental and physiological processes. In addition, tissue and stress co-expression network analysis uncovered specific subfamilies with narrow or wide interconnected relationships, indicative of their association with particular or broad signalling pathways, respectively. Taken together, our analyses provide a foundation for further functional studies to reveal the biological and molecular functions of PKs in soybean. PMID:25614662
Genome-wide characterization of pectin methyl esterase genes reveals members differentially expressed in tolerant and susceptible wheats in response to Fusarium graminearum.

PubMed

Zega, Alessandra; D'Ovidio, Renato

2016-11-01

Pectin methyl esterase (PME) genes code for enzymes that are involved in structural modifications of the plant cell wall during plant growth and development. They are also involved in plant-pathogen interaction. PME genes belong to a multigene family and in this study we report the first comprehensive analysis of the PME gene family in bread wheat (Triticum aestivum L.). Like in other species, the members of the TaPME family are dispersed throughout the genome and their encoded products retain the typical structural features of PMEs. qRT-PCR analysis showed variation in the expression pattern of TaPME genes in different tissues and revealed that these genes are mainly expressed in flowering spikes. In our attempt to identify putative TaPME genes involved in wheat defense, we revealed a strong variation in the expression of the TaPME following Fusarium graminearum infection, the causal agent of Fusarium head blight (FHB). Particularly interesting was the finding that the expression profile of some PME genes was markedly different between the FHB-resistant wheat cultivar Sumai3 and the FHB-susceptible cultivar Bobwhite, suggesting a possible involvement of these PME genes in FHB resistance. Moreover, the expression analysis of the TaPME genes during F. graminearum progression within the spike revealed those genes that responded more promptly to pathogen invasion. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Structure of the coding region and mRNA variants of the apyrase gene from pea (Pisum sativum)

NASA Technical Reports Server (NTRS)

Shibata, K.; Abe, S.; Davies, E.

2001-01-01

Partial amino acid sequences of a 49 kDa apyrase (ATP diphosphohydrolase, EC 3.6.1.5) from the cytoskeletal fraction of etiolated pea stems were used to derive oligonucleotide DNA primers to generate a cDNA fragment of pea apyrase mRNA by RT-PCR and these primers were used to screen a pea stem cDNA library. Two almost identical cDNAs differing in just 6 nucleotides within the coding regions were found, and these cDNA sequences were used to clone genomic fragments by PCR. Two nearly identical gene fragments containing 8 exons and 7 introns were obtained. One of them (H-type) encoded the mRNA sequence described by Hsieh et al. (1996) (DDBJ/EMBL/GenBank Z32743), while the other (S-type) differed by the same 6 nucleotides as the mRNAs, suggesting that these genes may be alleles. The six nucleotide differences between these two alleles were found solely in the first exon, and these mutation sites had two types of consensus sequences. These mRNAs were found with varying lengths of 3' untranslated regions (3'-UTR). There are some similarities between the 3'-UTR of these mRNAs and those of actin and actin binding proteins in plants. The putative roles of the 3'-UTR and alternative polyadenylation sites are discussed in relation to their possible role in targeting the mRNAs to different subcellular compartments.
Bioinformatic Analysis Reveals Archaeal tRNATyr and tRNATrp Identities in Bacteria

PubMed Central

Mukai, Takahito; Reynolds, Noah M.; Crnković, Ana; Söll, Dieter

2017-01-01

The tRNA identity elements for some amino acids are distinct between the bacterial and archaeal domains. Searching in recent genomic and metagenomic sequence data, we found some candidate phyla radiation (CPR) bacteria with archaeal tRNA identity for Tyr-tRNA and Trp-tRNA synthesis. These bacteria possess genes for tyrosyl-tRNA synthetase (TyrRS) and tryptophanyl-tRNA synthetase (TrpRS) predicted to be derived from DPANN superphylum archaea, while the cognate tRNATyr and tRNATrp genes reveal bacterial or archaeal origins. We identified a trace of domain fusion and swapping in the archaeal-type TyrRS gene of a bacterial lineage, suggesting that CPR bacteria may have used this mechanism to create diverse proteins. Archaeal-type TrpRS of bacteria and a few TrpRS species of DPANN archaea represent a new phylogenetic clade (named TrpRS-A). The TrpRS-A open reading frames (ORFs) are always associated with another ORF (named ORF1) encoding an unknown protein without global sequence identity to any known protein. However, our protein structure prediction identified a putative HIGH-motif and KMSKS-motif as well as many α-helices that are characteristic of class I aminoacyl-tRNA synthetase (aaRS) homologs. These results provide another example of the diversity of molecular components that implement the genetic code and provide a clue to the early evolution of life and the genetic code. PMID:28230768
Comparative Mitogenomics of the Assassin Bug Genus Peirates (Hemiptera: Reduviidae: Peiratinae) Reveal Conserved Mitochondrial Genome Organization of P. atromaculatus, P. fulvescens and P. turpis

PubMed Central

Zhao, Guangyu; Li, Hu; Zhao, Ping; Cai, Wanzhi

2015-01-01

In this study, we sequenced four new mitochondrial genomes and presented comparative mitogenomic analyses of five species in the genus Peirates (Hemiptera: Reduviidae). Mitochondrial genomes of these five assassin bugs had a typical set of 37 genes and retained the ancestral gene arrangement of insects. The A+T content, AT- and GC-skews were similar to the common base composition biases of insect mtDNA. Genomic size ranges from 15,702 bp to 16,314 bp and most of the size variation was due to length and copy number of the repeat unit in the putative control region. All of the control region sequences included large tandem repeats present in two or more copies. Our result revealed similarity in mitochondrial genomes of P. atromaculatus, P. fulvescens and P. turpis, as well as the highly conserved genomic-level characteristics of these three species, e.g., the same start and stop codons of protein-coding genes, conserved secondary structure of tRNAs, identical location and length of non-coding and overlapping regions, and conservation of structural elements and tandem repeat unit in control region. Phylogenetic analyses also supported a close relationship between P. atromaculatus, P. fulvescens and P. turpis, which might be recently diverged species. The present study indicates that mitochondrial genome has important implications on phylogenetics, population genetics and speciation in the genus Peirates. PMID:25689825
Genetic Characterization of the Carotenoid Biosynthetic Pathway in Methylobacterium extorquens AM1 and Isolation of a Colorless Mutant

PubMed Central

Van Dien, Stephen J.; Marx, Christopher J.; O'Brien, Brooke N.; Lidstrom, Mary E.

2003-01-01

Genomic searches were used to reconstruct the putative carotenoid biosynthesis pathway in the pink-pigmented facultative methylotroph Methylobacterium extorquens AM1. Four genes for putative phytoene desaturases were identified. A colorless mutant was obtained by transposon mutagenesis, and the insertion was shown to be in one of the putative phytoene desaturase genes. Mutations in the other three did not affect color. The tetracycline marker was removed from the original transposon mutant, resulting in a pigment-free strain with wild-type growth properties useful as a tool for future experiments. PMID:14660416
Genetic characterization of the carotenoid biosynthetic pathway in Methylobacterium extorquens AM1 and isolation of a colorless mutant.

PubMed

Van Dien, Stephen J; Marx, Christopher J; O'Brien, Brooke N; Lidstrom, Mary E

2003-12-01

Genomic searches were used to reconstruct the putative carotenoid biosynthesis pathway in the pink-pigmented facultative methylotroph Methylobacterium extorquens AM1. Four genes for putative phytoene desaturases were identified. A colorless mutant was obtained by transposon mutagenesis, and the insertion was shown to be in one of the putative phytoene desaturase genes. Mutations in the other three did not affect color. The tetracycline marker was removed from the original transposon mutant, resulting in a pigment-free strain with wild-type growth properties useful as a tool for future experiments.
Digital gene expression profiling of flax (Linum usitatissimum L.) stem peel identifies genes enriched in fiber-bearing phloem tissue.

PubMed

Guo, Yuan; Qiu, Caisheng; Long, Songhua; Chen, Ping; Hao, Dongmei; Preisner, Marta; Wang, Hui; Wang, Yufu

2017-08-30

To better understand the molecular mechanisms and gene expression characteristics associated with the development of bast fiber cells within flax stem phloem, the gene expression profiles of flax stem peels and leaves were screened using Illumina's Digital Gene Expression (DGE) analysis. Four DGE libraries (2 for stem peel and 2 for leaf), ranging from 6.7 to 9.2 million clean reads, were obtained, which produced 7.0 million and 6.8 million mapped reads for flax stem peel and leaf, respectively. By differential gene expression analysis, a total of 975 genes, of which 708 (73%) have protein-coding annotation, were identified as phloem-enriched genes putatively involved in the processes of polysaccharide and cell wall metabolism. Differentially expressed genes (DEGs) were validated using quantitative RT-PCR; the expression patterns of all nine genes determined by qRT-PCR agreed well with those obtained by sequencing analysis. Cluster and Gene Ontology (GO) analysis revealed that a large number of genes related to the metabolic process, catalytic activity and binding categories were expressed predominantly in the stem peels. The Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis of the phloem-enriched genes suggested approximately 111 biological pathways. The large number of genes and pathways produced from DGE sequencing will expand our understanding of the complex molecular and cellular events in flax bast fiber development and provide a foundation for future studies on fiber development in other bast fiber crops. Copyright © 2017 Elsevier B.V. All rights reserved.
[Genetic hypophosphatemia: recent advances in physiopathogenic concept].

PubMed

Beraud, G; Perimenis, P; Velayoudom, Fr-L; Wemeau, J-L; Vantyghem, M-Chr

2005-04-01

Renal proximal tubular reabsorption of phosphate and intestinal absorption both regulate phosphate homeostasis. The brush-border membrane Npt2a cotransporter is the key element in proximal tubular P(i) reabsorption. Inactivating mutations of Npt2a cause bone demineralisation and urolithiasis. An excess of a phosphaturic factor, called "phosphatonin", could modulate phosphate reabsorption by inhibiting Npt2a. An inactivating mutation of PHEX, a gene coding for a membrane endopeptidase, is responsible for X-linked Hypophosphatemia (XLH), because of impaired degradation of phosphatonin by the PHEX product. Autosomal Dominant Hypophosphatemic Rickets (ADHR) is explained by a mutation preventing the cleavage of FGF23 (one of the best identified phosphatonins). According to recent data, FGF23, MEPE (Matrix Extracellular Phosphoglycoprotein) and FRP4 (frizzled related protein-4) are three putative "phosphatonins".
Trypanosome RNA polymerases and transcription factors: sensible trypanocidal drug targets?

PubMed

Vanhamme, Luc

2008-11-01

Trypanosomes and Leishmaniae are the agents of several important parasitic diseases threatening hundreds of millions of people worldwide. As they diverged early in evolution, they display original molecular characteristics. Each of these peculiarities defines putative specific targets for anti-parasitic drugs. Transcription has its share of unique characteristics in trypanosomes and will be taken as an example to uncover these targets. Unique features of transcription in trypanosomes include constitutive and poly-cistronic transcription by RNA polymerase II as well as transcription of protein-coding genes by RNA polymerase I. It is becoming clear that these unique mechanisms are performed by dedicated molecular players. The first of them have recently been characterized. They are reviewed and their suitability as drug targets is discussed.
Search for protein partners of mitochondrial single-stranded DNA-binding protein Rim1p using a yeast two-hybrid system.

PubMed

Kucejová, B; Foury, F

2003-01-01

RIM1 is a nuclear gene of the yeast Saccharomyces cerevisiae coding for a protein with single-stranded DNA-binding activity that is essential for mitochondrial genome maintenance. No protein partners of Rim1p have been described so far in yeast. To better understand the role of this protein in mitochondrial DNA replication and recombination, a search for protein interactors by the yeast two-hybrid system was performed. This approach led to the identification of several candidates, including a putative transcription factor, Azf1p, and Mph1p, a protein with an RNA helicase domain which is known to influence the mutation rate of nuclear and mitochondrial genomes.
Comparative genome analysis reveals genetic adaptation to versatile environmental conditions and importance of biofilm lifestyle in Comamonas testosteroni.

PubMed

Wu, Yichao; Arumugam, Krithika; Tay, Martin Qi Xiang; Seshan, Hari; Mohanty, Anee; Cao, Bin

2015-04-01

Comamonas testosteroni is an important environmental bacterium capable of degrading a variety of toxic aromatic pollutants and has been demonstrated to be a promising biocatalyst for environmental decontamination. This organism is often found to be among the primary surface colonizers in various natural and engineered ecosystems, suggesting an extraordinary capability of this organism in environmental adaptation and biofilm formation. The goal of this study was to gain genetic insights into the adaption of C. testosteroni to versatile environments and the importance of a biofilm lifestyle. Specifically, a draft genome of C. testosteroni I2 was obtained. The draft genome is 5,778,710 bp in length and comprises 110 contigs. The average G+C content was 61.88 %. A total of 5365 genes with 5263 protein-coding genes were predicted, whereas 4324 (80.60 % of total genes) protein-encoding genes were associated with predicted functions. The catabolic genes responsible for biodegradation of steroid and other aromatic compounds on draft genome were identified. Plasmid pI2 was found to encode a complete pathway for aniline degradation and a partial catabolic pathway for chloroaniline. This organism was found to be equipped with a sophisticated signaling system which helps it find ideal niches and switch between planktonic and biofilm lifestyles. A large number of putative multi-drug-resistant genes coding for abundant outer membrane transporters, chaperones, and heat shock proteins for the protection of cellular function were identified in the genome of strain I2. In addition, the genome of strain I2 was predicted to encode several proteins involved in producing, secreting, and uptaking siderophores under iron-limiting conditions. The genome of strain I2 contains a number of genes responsible for the synthesis and secretion of exopolysaccharides, an extracellular component essential for biofilm formation. 
Overall, our results reveal the genomic features underlying the adaptation of C. testosteroni to versatile environments and highlight the importance of its biofilm lifestyle.
Characterization of Transcriptional Complexity during Adipose Tissue Development in Bovines of Different Ages and Sexes

PubMed Central

Zhou, Yang; Sun, Jiajie; Li, Congjun; Wang, Yanhong; Li, Lan; Cai, Hanfang; Lan, Xianyong; Lei, Chuzhao; Zhao, Xin; Chen, Hong

2014-01-01

Background Adipose tissue has long been recognized to play an extremely important role in development. In bovines, it not only serves a fundamental function but also plays a key role in the quality of beef and, consequently, has drawn much public attention. Age and sex are two key factors that affect the development of adipose tissue, and there has not yet been a global study detailing the effects of these two factors on expressional differences of adipose tissues. Results In this study, total RNA from the back fat of fetal bovines, adult bulls, adult heifers and adult steers were used to construct libraries for Illumina next-generation sequencing. We detected the expression levels of 12,233 genes, with over 3,000 differently expressed genes when comparing fetal and adult patterns and an average of 1000 differently expressed genes when comparing adult patterns. Multiple Gene Ontology terms and pathways were found to be significantly enriched for these differentially expressed genes. Of the 12,233 detected genes, a total of 4,753 genes (38.85%) underwent alternative splicing events, and over 50% were specifically expressed in each library. Over 4,000 novel transcript units were discovered for one library, whereas only approximately 30% were considered to have coding ability, which supplied a large amount of information for the lncRNA study. Additionally, we detected 56,564 (fetal bovine), 65,154 (adult bull), 78,061 (adult heifer) and 86,965 (adult steer) putative single nucleotide polymorphisms located in coding regions of the four pooled libraries. Conclusion Here, we present, for the first time, a complete dataset involving the spatial and temporal transcriptome of bovine adipose tissue using RNA-seq. These data will facilitate the understanding of the effects of age and sex on the development of adipose tissue and supply essential information towards further studies on the genomes of beef cattle and other related mammals. PMID:24983926
Diversification and Expression of the PIN, AUX/LAX, and ABCB Families of Putative Auxin Transporters in Populus

PubMed Central

Carraro, Nicola; Tisdale-Orr, Tracy Eizabeth; Clouse, Ronald Matthew; Knöller, Anne Sophie; Spicer, Rachel

2012-01-01

Intercellular transport of the plant hormone auxin is mediated by three families of membrane-bound protein carriers, with the PIN and ABCB families coding primarily for efflux proteins and the AUX/LAX family coding for influx proteins. In the last decade our understanding of gene and protein function for these transporters in Arabidopsis has expanded rapidly but very little is known about their role in woody plant development. Here we present a comprehensive account of all three families in the model woody species Populus, including chromosome distribution, protein structure, quantitative gene expression, and evolutionary relationships. The PIN and AUX/LAX gene families in Populus comprise 16 and 8 members respectively and show evidence for the retention of paralogs following a relatively recent whole genome duplication. There is also differential expression across tissues within many gene pairs. The ABCB family is previously undescribed in Populus and includes 20 members, showing a much deeper evolutionary history, including both tandem and whole genome duplication as well as probable gene loss. A striking number of these transporters are expressed in developing Populus stems and we suggest that evolutionary and structural relationships with known auxin transporters in Arabidopsis can point toward candidate genes for further study in Populus. This is especially important for the ABCBs, which is a large family and includes members in Arabidopsis that are able to transport other substrates in addition to auxin. Protein modeling, sequence alignment and expression data all point to ABCB1.1 as a likely auxin transport protein in Populus. Given that basipetal auxin flow through the cambial zone shapes the development of woody stems, it is important that we identify the full complement of genes involved in this process. This work should lay the foundation for studies targeting specific proteins for functional characterization and in situ localization. PMID:22645571
Identification of single nucleotide polymorphisms in the agouti signaling protein (ASIP) gene in some goat breeds in tropical and temperate climates.

PubMed

Adefenwa, Mufliat A; Peters, Sunday O; Agaviezor, Brilliant O; Wheto, Matthew; Adekoya, Khalid O; Okpeku, Moses; Oboh, Bola; Williams, Gabriel O; Adebambo, Olufunmilayo A; Singh, Mahipal; Thomas, Bolaji; De Donato, Marcos; Imumorin, Ikhide G

2013-07-01

The agouti-signaling protein (ASIP) plays a major role in mammalian pigmentation as an antagonist of the melanocortin-1 receptor, stimulating the synthesis of pheomelanin, a major pigment conferring mammalian coat color. We sequenced a 352 bp fragment of the ASIP gene spanning part of exon 2 and part of intron 2 in 215 animals representing six goat breeds from Nigeria and the United States: West African Dwarf, predominantly black; Red Sokoto, mostly red; and Sahel, mostly white from Nigeria; black and white Alpine, brown and white Spanish and white Saanen from the US. Twenty haplotypes from nine mutations representing three intronic, one silent and five missense (p.S19R, p.N35K, p.L36V, p.M42L and p.L45W) mutations were identified in Nigerian goats. Approximately 89 % of Nigerian goats carry haplotype 1 (TGCCATCCG), which seems to be the wild-type configuration of mutations in this region of the gene. Although we found no association between these polymorphisms in the ASIP gene and coat color in Nigerian goats, in-silico functional analysis predicts a putative deleterious functional impact of the p.L45W mutation on the basic amino-terminal domain of ASIP. In the American goats, two intronic mutations, g.293G>A and g.327C>A, were identified in the Alpine breed, although the g.293G>A mutation is common to American and Nigerian goat populations. All Saanen and Sahel goats in this study belong to haplotype 1 of both populations, which seems to be the wild-type composite ASIP haplotype. Overall, there was no clear association of the portion of the ASIP gene interrogated in this study with coat color variation. Therefore, additional genomic analyses of the promoter sequence and the entire coding and non-coding regions of the ASIP gene will be required to reach a definite conclusion.
Characterization of a gene cluster responsible for the biosynthesis of anticancer agent FK228 in Chromobacterium violaceum No. 968.

PubMed

Cheng, Yi-Qiang; Yang, Min; Matter, Andrea M

2007-06-01

A gene cluster responsible for the biosynthesis of anticancer agent FK228 has been identified, cloned, and partially characterized in Chromobacterium violaceum no. 968. First, a genome-scanning approach was applied to identify three distinctive C. violaceum no. 968 genomic DNA clones that code for portions of nonribosomal peptide synthetase and polyketide synthase. Next, a gene replacement system developed originally for Pseudomonas aeruginosa was adapted to inactivate the genomic DNA-associated candidate natural product biosynthetic genes in vivo with high efficiency. Inactivation of a nonribosomal peptide synthetase-encoding gene completely abolished FK228 production in mutant strains. Subsequently, the entire FK228 biosynthetic gene cluster was cloned and sequenced. This gene cluster is predicted to encompass a 36.4-kb DNA region that includes 14 genes. The products of nine biosynthetic genes are proposed to constitute an unusual hybrid nonribosomal peptide synthetase-polyketide synthase-nonribosomal peptide synthetase assembly line including accessory activities for the biosynthesis of FK228. In particular, a putative flavin adenine dinucleotide-dependent pyridine nucleotide-disulfide oxidoreductase is proposed to catalyze disulfide bond formation between two sulfhydryl groups of cysteine residues as the final step in FK228 biosynthesis. Acquisition of the FK228 biosynthetic gene cluster and acclimation of an efficient genetic system should enable genetic engineering of the FK228 biosynthetic pathway in C. violaceum no. 968 for the generation of structural analogs as anticancer drug candidates.
Mutually Exclusive Alterations in Secondary Metabolism Are Critical for the Uptake of Insoluble Iron Compounds by Arabidopsis and Medicago truncatula

PubMed Central

Rodríguez-Celma, Jorge; Lin, Wen-Dar; Fu, Guin-Mau; Abadía, Javier; López-Millán, Ana-Flor; Schmidt, Wolfgang

2013-01-01

The generally low bioavailability of iron in aerobic soil systems forced plants to evolve sophisticated genetic strategies to improve the acquisition of iron from sparingly soluble and immobile iron pools. To distinguish between conserved and species-dependent components of such strategies, we analyzed iron deficiency-induced changes in the transcriptome of two model species, Arabidopsis (Arabidopsis thaliana) and Medicago truncatula. Transcriptional profiling by RNA sequencing revealed a massive up-regulation of genes coding for enzymes involved in riboflavin biosynthesis in M. truncatula and phenylpropanoid synthesis in Arabidopsis upon iron deficiency. Coexpression and promoter analysis indicated that the synthesis of flavins and phenylpropanoids is tightly linked to and putatively coregulated with other genes encoding proteins involved in iron uptake. We further provide evidence that the production and secretion of phenolic compounds is critical for the uptake of iron from sources with low bioavailability but dispensable under conditions where iron is readily available. In Arabidopsis, homozygous mutations in the Fe(II)- and 2-oxoglutarate-dependent dioxygenase family gene F6′H1 and defects in the expression of PLEIOTROPIC DRUG RESISTANCE9, encoding a putative efflux transporter for products from the phenylpropanoid pathway, compromised iron uptake from an iron source of low bioavailability. Both mutants were partially rescued when grown alongside wild-type Arabidopsis or M. truncatula seedlings, presumably by secreted phenolics and flavins. We concluded that production and secretion of compounds that facilitate the uptake of iron is an essential but poorly understood aspect of the reduction-based iron acquisition strategy, which is likely to contribute substantially to the efficiency of iron uptake in natural conditions. PMID:23735511
Characterization of the biotin uptake system encoded by the biotin-inducible bioYMN operon of Corynebacterium glutamicum

PubMed Central

2012-01-01

Background The amino acid-producing Gram-positive Corynebacterium glutamicum is auxotrophic for biotin although biotin ring assembly starting from the precursor pimeloyl-CoA is still functional. It possesses AccBC, the α-subunit of the acyl-carboxylases involved in fatty acid and mycolic acid synthesis, and pyruvate carboxylase as the only biotin-containing proteins. Comparative genome analyses suggested that the putative transport system BioYMN encoded by cg2147, cg2148 and cg2149 might be involved in biotin uptake by C. glutamicum. Results Comparison of global gene expression patterns of cells grown with limiting or excess supply of biotin, or with dethiobiotin as a supplement replacing biotin, revealed that expression of genes coding for enzymes of biotin ring assembly and for the putative uptake system was regulated according to biotin availability. RT-PCR and 5'-RACE experiments demonstrated that the genes bioY, bioM, and bioN are transcribed from one promoter as a single transcript. Biochemical analyses revealed that BioYMN catalyzes the effective uptake of biotin, with a concentration of 60 nM biotin supporting a half-maximal transport rate. Maximal biotin uptake rates were at least fivefold higher in biotin-limited cells as compared to cells grown with excess biotin. Overexpression of bioYMN led to an at least 50-fold higher biotin uptake rate as compared to the empty vector control. Overproduction of BioYMN alleviated biotin limitation and interfered with triggering L-glutamate production by biotin limitation. Conclusions The operon bioYMN from C. glutamicum was shown to be induced by biotin limitation. Transport assays with radio-labeled biotin revealed that BioYMN functions as a biotin uptake system. Overexpression of bioYMN affected L-glutamate production triggered by biotin limitation. PMID:22243621
Characterization of constitutive and putative differentially expressed mRNAs by means of expressed sequence tags, differential display reverse transcriptase-PCR and randomly amplified polymorphic DNA-PCR from the sand fly vector Lutzomyia longipalpis.

PubMed

Ramalho-Ortigão, J M; Temporal, P; de Oliveira , S M; Barbosa, A F; Vilela, M L; Rangel, E F; Brazil, R P; Traub-Cseko, Y M

2001-01-01

Molecular studies of insect disease vectors are of paramount importance for understanding the parasite-vector relationship. Advances in this area have led to important findings regarding changes in vectors' physiology upon blood feeding and parasite infection. Mechanisms for interfering with the vectorial capacity of insects responsible for the transmission of diseases such as malaria, Chagas disease and dengue fever are being devised with the ultimate goal of developing transgenic insects. A primary necessity for this goal is information on gene expression and control in the target insect. Our group is investigating molecular aspects of the interaction between Leishmania parasites and Lutzomyia sand flies. As an initial step in our studies we have used random sequencing of cDNA clones from two expression libraries made from head/thorax and abdomen of sugar-fed L. longipalpis for the identification of expressed sequence tags (EST). We applied differential display reverse transcriptase-PCR and randomly amplified polymorphic DNA-PCR to characterize differentially expressed mRNA from sugar-fed and blood-fed insects, and, in one case, from a L. (V.) braziliensis-infected L. longipalpis. We identified 37 cDNAs that showed homology to known sequences from GenBank. Of these, 32 cDNAs code for constitutive proteins such as zinc finger protein, glutamine synthetase, G binding protein and ubiquitin conjugating enzyme. Three are putative differentially expressed cDNAs from blood-fed and Leishmania-infected midgut: a chitinase, a V-ATPase and a MAP kinase. Finally, two sequences are homologous to Drosophila melanogaster gene products recently discovered through the Drosophila genome initiative.

First molecular cloning and characterisation of caspase-9 gene in fish and its involvement in a gram negative septicaemia.

PubMed

Reis, Marta I R; do Vale, Ana; Pinto, Cristina; Nascimento, Diana S; Costa-Ramos, Carolina; Silva, Daniela S P; Silva, Manuel T; Dos Santos, Nuno M S

2007-03-01

Caspase-9 is an initiator caspase in the apoptotic process whose function is to activate effector caspases that are downstream in the mitochondrial pathway of apoptosis. This work reports for the first time the complete sequencing and characterisation of caspase-9 in fish. A 1924bp cDNA of sea bass caspase-9 was obtained, consisting of a 1308bp open reading frame coding for 435 amino acids, 199bp of the 5'-UTR and 417bp of the 3'-UTR, including a canonical polyadenylation signal 10 nucleotides upstream of the polyadenylation tail. The sequence retains the pentapeptide active-site motif (QACGG) and the putative cleavage sites at Asp(121), Asp(325) and Asp(343). The sequence of sea bass caspase-9 exhibits a very close homology to the sequences of caspase-9 from other vertebrates, particularly with the putative caspases-9 of Danio rerio and Tetraodon nigroviridis (77.5 and 75.4% similarity, respectively), which is consistent with the phylogenetic analysis grouping these species together with sea bass. The sea bass caspase-9 gene exists as a single copy gene and is organised in 9 introns and 10 exons. The sea bass caspase-9 showed a basal expression in all the organs analysed, although weaker in spleen. Expression of caspase-9 in the head kidney of sea bass infected with Photobacterium damselae ssp. piscicida (Phdp) strain PP3 increased from 0 to 12h and returned to control levels at 24h. Caspase-9 activity was detected in Phdp infected sea bass head kidney from 18 to 48h post-infection, when the fish had advanced septicaemia.
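The segment lengths quoted in this abstract can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the cited study; it assumes the 1308bp open reading frame includes the stop codon, and the variable and function names are hypothetical.

```python
# Lengths reported in the abstract, in base pairs.
UTR5_BP, ORF_BP, UTR3_BP = 199, 1308, 417
CDNA_TOTAL_BP = 1924
PROTEIN_LENGTH_AA = 435


def orf_codons(orf_bp: int) -> int:
    """Return the number of codons in an ORF (assumed to include the stop codon)."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return orf_bp // 3


# The 5'-UTR, ORF and 3'-UTR should add up to the full cDNA length.
segments_total = UTR5_BP + ORF_BP + UTR3_BP

# Assumption: one codon is the stop codon, so it encodes no residue.
encoded_residues = orf_codons(ORF_BP) - 1

print(segments_total == CDNA_TOTAL_BP)
print(encoded_residues == PROTEIN_LENGTH_AA)
```

Under that assumption, both the total cDNA length and the protein length agree with the reported figures.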
Cloning and characterization of indole synthase (INS) and a putative tryptophan synthase α-subunit (TSA) genes from Polygonum tinctorium.

PubMed

Jin, Zhehao; Kim, Jin-Hee; Park, Sang Un; Kim, Soo-Un

2016-12-01

Two cDNAs for indole-3-glycerol phosphate lyase homologs were cloned from Polygonum tinctorium. One encoded a cytosolic indole synthase possibly involved in indigoid synthesis, whereas the other encoded a putative tryptophan synthase α-subunit. Indigo is an old natural blue dye produced by plants such as Polygonum tinctorium. A key step in plant indigoid biosynthesis is the production of indole by indole-3-glycerol phosphate lyase (IGL). Two tryptophan synthase α-subunit (TSA) homologs, PtIGL-short and -long, were isolated by RACE PCR from P. tinctorium. The genome of the plant contained two genes coding for IGL. The short and the long forms encoded proteins of 273 and 316 amino acid residues, respectively. The short form complemented an E. coli ΔtnaA ΔtrpA mutant on tryptophan-depleted agar plates, signifying production of free indole, and was thus named the indole synthase gene (PtINS). The long form, either intact or without the transit peptide sequence, did not complement the mutant and was tentatively named PtTSA. PtTSA was delivered into the chloroplast, as predicted from its 42-residue-long targeting sequence, whereas PtINS was localized in the cytosol. Genomic structure analysis suggested that a TSA duplicate acquired splicing sites during the course of evolution toward PtINS so that the targeting sequence-containing pre-mRNA segment was deleted as an intron. PtINS had an about two- to fivefold higher transcript level than PtTSA, and treatment with 2,1,3-benzothiadiazole significantly enhanced the transcript level of PtINS relative to PtTSA in the plant. The results indicate participation of PtINS in indigoid production.
Discovery of precursor and mature microRNAs and their putative gene targets using high-throughput sequencing in pineapple (Ananas comosus var. comosus).

PubMed

Yusuf, Noor Hydayaty Md; Ong, Wen Dee; Redwan, Raimi Mohamed; Latip, Mariam Abd; Kumar, S Vijay

2015-10-15

MicroRNAs (miRNAs) are a class of small, endogenous non-coding RNAs that negatively regulate gene expression, resulting in the silencing of target mRNA transcripts through mRNA cleavage or translational inhibition. MiRNAs play significant roles in various biological and physiological processes in plants. However, the miRNA-mediated gene regulatory network in pineapple, the model tropical non-climacteric fruit, remains largely unexplored. Here, we report a complete list of pineapple mature miRNAs obtained from high-throughput small RNA sequencing and precursor miRNAs (pre-miRNAs) obtained from ESTs. Two small RNA libraries were constructed from pineapple fruits and leaves, respectively, using Illumina's Solexa technology. Sequence similarity analysis using miRBase revealed 579,179 reads homologous to 153 miRNAs from 41 miRNA families. In addition, a pineapple fruit transcriptome library consisting of approximately 30,000 EST contigs constructed using Solexa sequencing was used for the discovery of pre-miRNAs. In all, four pre-miRNAs were identified (MIR156, MIR399, MIR444 and MIR2673). Furthermore, the same pineapple transcriptome was used to dissect the function of the miRNAs in pineapple by predicting their putative targets in conjunction with their regulatory networks. In total, 23 metabolic pathways were found to be regulated by miRNAs in pineapple. The use of high-throughput sequencing in pineapples to unveil the presence of miRNAs and their regulatory pathways provides insight into the repertoire of miRNA regulation used exclusively in this non-climacteric model plant. Copyright © 2015 Elsevier B.V. All rights reserved.
Characterization of the biotin uptake system encoded by the biotin-inducible bioYMN operon of Corynebacterium glutamicum.

PubMed

Schneider, Jens; Peters-Wendisch, Petra; Stansen, K Corinna; Götker, Susanne; Maximow, Stanislav; Krämer, Reinhard; Wendisch, Volker F

2012-01-13

The amino acid-producing Gram-positive Corynebacterium glutamicum is auxotrophic for biotin although biotin ring assembly starting from the precursor pimeloyl-CoA is still functional. It possesses AccBC, the α-subunit of the acyl-carboxylases involved in fatty acid and mycolic acid synthesis, and pyruvate carboxylase as the only biotin-containing proteins. Comparative genome analyses suggested that the putative transport system BioYMN encoded by cg2147, cg2148 and cg2149 might be involved in biotin uptake by C. glutamicum. Comparison of global gene expression patterns of cells grown with limiting or excess supply of biotin, or with dethiobiotin as a supplement replacing biotin, revealed that expression of genes coding for enzymes of biotin ring assembly and for the putative uptake system was regulated according to biotin availability. RT-PCR and 5'-RACE experiments demonstrated that the genes bioY, bioM, and bioN are transcribed from one promoter as a single transcript. Biochemical analyses revealed that BioYMN catalyzes the effective uptake of biotin, with a concentration of 60 nM biotin supporting a half-maximal transport rate. Maximal biotin uptake rates were at least fivefold higher in biotin-limited cells as compared to cells grown with excess biotin. Overexpression of bioYMN led to an at least 50-fold higher biotin uptake rate as compared to the empty vector control. Overproduction of BioYMN alleviated biotin limitation and interfered with triggering L-glutamate production by biotin limitation. The operon bioYMN from C. glutamicum was shown to be induced by biotin limitation. Transport assays with radio-labeled biotin revealed that BioYMN functions as a biotin uptake system. Overexpression of bioYMN affected L-glutamate production triggered by biotin limitation.
Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium

PubMed Central

2010-01-01

Background: In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties in generating reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa) remains a virtually unexplored issue. Results: By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Conclusions: Our observations 1) shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2) are expected to facilitate the understanding of the modulation of ribosomal gene expression in Paramecium; and 3) reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes. PMID:20441586
Functional analysis of the ComK protein of Bacillus coagulans.

PubMed

Kovács, Ákos T; Eckhardt, Tom H; van Hartskamp, Mariska; van Kranenburg, Richard; Kuipers, Oscar P

2013-01-01

The genes for DNA uptake and recombination in Bacilli are commonly regulated by the transcriptional factor ComK. We have identified a ComK homologue in Bacillus coagulans, an industrially relevant organism that is recalcitrant to transformation. Introduction of the B. coagulans comK gene under its own promoter region into a Bacillus subtilis comK strain results in low transcriptional induction of the late competence gene comGA, but without bistable expression. The promoter regions of the B. coagulans comK and comGA genes are recognized in B. subtilis, and expression from these promoters is activated by B. subtilis ComK. Purified ComK protein of B. coagulans showed DNA-binding ability in gel retardation assays with B. subtilis- and B. coagulans-derived probes. These experiments suggest that the function of B. coagulans ComK is similar to that of ComK of B. subtilis. When its own comK is overexpressed in B. coagulans, comGA gene expression increases 40-fold, while the expression of another late competence gene, comC, is not elevated and no reproducible DNA uptake could be observed under these conditions. Our results demonstrate that B. coagulans ComK can recognize several B. subtilis comK-responsive elements, and vice versa, but indicate that the activation of the transcription of complete sets of genes coding for a putative DNA uptake apparatus in B. coagulans might differ from that of B. subtilis.
A cosmid and cDNA fine physical map of a human chromosome 13q14 region frequently lost in B-cell chronic lymphocytic leukemia and identification of a new putative tumor suppressor gene, Leu5.

PubMed

Kapanadze, B; Kashuba, V; Baranova, A; Rasool, O; van Everdink, W; Liu, Y; Syomov, A; Corcoran, M; Poltaraus, A; Brodyansky, V; Syomova, N; Kazakov, A; Ibbotson, R; van den Berg, A; Gizatullin, R; Fedorova, L; Sulimova, G; Zelenin, A; Deaven, L; Lehrach, H; Grander, D; Buys, C; Oscier, D; Zabarovsky, E R; Einhorn, S; Yankovsky, N

1998-04-17

B-cell chronic lymphocytic leukemia (B-CLL) is a human hematological neoplastic disease often associated with the loss of a chromosome 13 region between RB1 gene and locus D13S25. A new tumor suppressor gene (TSG) may be located in the region. A cosmid contig has been constructed between the loci D13S1168 (WI9598) and D13S25 (H2-42), which corresponds to the minimal region shared by B-CLL associated deletions. The contig includes more than 200 LANL and ICRF cosmid clones covering 620 kb. Three cDNAs likely corresponding to three different genes have been found in the minimally deleted region, sequenced and mapped against the contigged cosmids. cDNA clone 10k4 as well as a chimeric clone 13g3, codes for a zinc-finger domain of the RING type and shares homology to some known genes involved in tumorigenesis (RET finger protein, BRCA1) and embryogenesis (MID1). We have termed the gene corresponding to 10k4/13g3 clones LEU5. This is the first gene with homology to known TSGs which has been found in the region of B-CLL rearrangements.
antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters

PubMed Central

Blin, Kai; Duddela, Srikanth; Krug, Daniel; Kim, Hyun Uk; Bruccoleri, Robert; Lee, Sang Yup; Fischbach, Michael A; Müller, Rolf; Wohlleben, Wolfgang; Breitling, Rainer; Takano, Eriko

2015-01-01

Microbial secondary metabolism constitutes a rich source of antibiotics, chemotherapeutics, insecticides and other high-value chemicals. Genome mining of gene clusters that encode the biosynthetic pathways for these metabolites has become a key methodology for novel compound discovery. In 2011, we introduced antiSMASH, a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.secondarymetabolites.org. Here, we present version 3.0 of antiSMASH, which has undergone major improvements. A full integration of the recently published ClusterFinder algorithm now allows using this probabilistic algorithm to detect putative gene clusters of unknown types. Also, a new dereplication variant of the ClusterBlast module now identifies similarities of identified clusters to any of 1172 clusters with known end products. At the enzyme level, active sites of key biosynthetic enzymes are now pinpointed through a curated pattern-matching procedure and Enzyme Commission numbers are assigned to functionally classify all enzyme-coding genes. Additionally, chemical structure prediction has been improved by incorporating polyketide reduction states. Finally, in order for users to be able to organize and analyze multiple antiSMASH outputs in a private setting, a new XML output module allows offline editing of antiSMASH annotations within the Geneious software. PMID:25948579
Draft genome sequence of bitter gourd (Momordica charantia), a vegetable and medicinal plant in tropical and subtropical regions.

PubMed

Urasaki, Naoya; Takagi, Hiroki; Natsume, Satoshi; Uemura, Aiko; Taniai, Naoki; Miyagi, Norimichi; Fukushima, Mai; Suzuki, Shouta; Tarora, Kazuhiko; Tamaki, Moritoshi; Sakamoto, Moriaki; Terauchi, Ryohei; Matsumura, Hideo

2017-02-01

Bitter gourd (Momordica charantia) is an important vegetable and medicinal plant in tropical and subtropical regions globally. In this study, the draft genome sequence of a monoecious bitter gourd inbred line, OHB3-1, was analyzed. Through Illumina sequencing and de novo assembly, scaffolds of 285.5 Mb in length were generated, corresponding to ∼84% of the estimated genome size of bitter gourd (339 Mb). In this draft genome sequence, 45,859 protein-coding gene loci were identified, and transposable elements accounted for 15.3% of the whole genome. According to synteny mapping and phylogenetic analysis of conserved genes, bitter gourd was more related to watermelon (Citrullus lanatus) than to cucumber (Cucumis sativus) or melon (C. melo). Using RAD-seq analysis, 1507 marker loci were genotyped in an F2 progeny of two bitter gourd lines, resulting in an improved linkage map, comprising 11 linkage groups. By anchoring RAD tag markers, 255 scaffolds were assigned to the linkage map. Comparative analysis of genome sequences and predicted genes determined that putative trypsin-inhibitor and ribosome-inactivating genes were distinctive in the bitter gourd genome. These genes could characterize the bitter gourd as a medicinal plant. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
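As a quick consistency check on the assembly statistics above, the reported coverage can be recomputed from the scaffold length and the estimated genome size. This is a minimal sketch, not the authors' analysis; the function name is hypothetical.

```python
def assembly_coverage_percent(assembled_mb: float, genome_size_mb: float) -> float:
    """Return the assembled fraction of the estimated genome size, in percent."""
    if genome_size_mb <= 0:
        raise ValueError("genome size must be positive")
    return 100.0 * assembled_mb / genome_size_mb


# Figures reported in the abstract: 285.5 Mb of scaffolds, 339 Mb estimated genome.
coverage = assembly_coverage_percent(285.5, 339.0)
print(f"{coverage:.1f}%")
```

The result rounds to 84%, in line with the approximate value given in the abstract.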
Comparative Analysis of the Mitochondrial Genomes of Callitettixini Spittlebugs (Hemiptera: Cercopidae) Confirms the Overall High Evolutionary Speed of the AT-Rich Region but Reveals the Presence of Short Conservative Elements at the Tribal Level

PubMed Central

Liu, Jie; Bu, Cuiping; Wipfler, Benjamin; Liang, Aiping

2014-01-01

The present study compares the mitochondrial genomes of five species of the spittlebug tribe Callitettixini (Hemiptera: Cercopoidea: Cercopidae) from eastern Asia. All genomes of the five species sequenced are circular double-stranded DNA molecules and range from 15,222 to 15,637 bp in length. They contain 22 tRNA genes, 13 protein coding genes (PCGs) and 2 rRNA genes and share the putative ancestral gene arrangement of insects. The PCGs show an extreme bias of nucleotide and amino acid composition. Significant differences of the substitution rates among the different genes as well as the different codon position of each PCG are revealed by the comparative evolutionary analyses. The substitution speeds of the first and second codon position of different PCGs are negatively correlated with their GC content. Among the five species, the AT-rich region features great differences in length and pattern and generally shows a 2–5 times higher substitution rate than the fastest PCG in the mitochondrial genome, atp8. Despite the significant variability in length, short conservative segments were identified in the AT-rich region within Callitettixini, although absent from the other groups of the spittlebug superfamily Cercopoidea. PMID:25285442
The first two mitochondrial genomes from Taeniopterygidae (Insecta: Plecoptera): Structural features and phylogenetic implications.

PubMed

Chen, Zhi-Teng; Du, Yu-Zhou

2018-05-01

The complete mitochondrial genomes (mitogenomes) of Taeniopteryx ugola and Doddsia occidentalis (Plecoptera: Taeniopterygidae) were sequenced, the first from the family Taeniopterygidae. The 15,353-bp long mitogenome of T. ugola and the 16,020-bp long mitogenome of D. occidentalis each contained 37 genes, including 13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs) and two ribosomal RNA genes (rRNAs), as well as a control region (CR). The mitochondrial gene arrangement of the two taeniopterygids and other stoneflies was identical with the putative ancestral mitogenome of Drosophila yakuba. Most PCGs used standard ATN start codons and TAN termination codons. Twenty-one of the 22 tRNAs in each mitogenome could fold into the cloverleaf secondary structure, while the dihydrouridine (DHU) arm of trnSer (AGN) was reduced or absent. Stem-loop (SL) structures, a poly-T stretch, a poly-[AT]n stretch and tandem repeats were found in the CRs of the two mitogenomes. The phylogenetic analyses using Bayesian inference (BI) and maximum likelihood (ML) methods generated identical results, both supporting the monophyly of all stonefly families and the two infraorders, Systellognatha and Euholognatha. Taeniopterygidae was grouped with two other families from Euholognatha. The relationships within Plecoptera were recovered as (((Perlidae+Peltoperlidae)+((Pteronarcyidae+Chloroperlidae)+Styloperlidae))+((Capniidae+Taeniopterygidae)+Nemouridae))+Gripopterygidae. Copyright © 2017 Elsevier B.V. All rights reserved.
Genomic interval engineering of mice identified a novel modulator of triglyceride production

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhu, Y.; Jong, M.C.; Frazer, K.A.

1999-10-01

To accelerate the biological annotation of novel genes discovered in sequenced mammalian genomes, we are creating large deletions in the mouse genome targeted to include clusters of such genes. Here we describe the targeted deletion of a 450 kb region on mouse chromosome 11 which, based on computational analysis of the deleted murine sequences and human 5q orthologous sequences, codes for nine putative genes. Mice homozygous for the deletion had a variety of abnormalities including severe hypertriglyceridemia, hepatic and cardiac enlargement, growth retardation and premature mortality. Analysis of triglyceride metabolism in these animals demonstrated a several-fold increase in hepatic very-low density lipoprotein (VLDL) triglyceride secretion, the most prevalent mechanism responsible for hypertriglyceridemia in humans. A series of mouse BAC and human YAC transgenes covering different intervals of the 450 kb deleted region were assessed for their ability to complement the deletion-induced abnormalities. These studies revealed that OCTN2, a gene recently shown to play a role in carnitine transport, was able to correct the triglyceride abnormalities. The discovery of this previously unappreciated relationship between OCTN2, carnitine and hepatic triglyceride production is of particular importance due to the clinical consequence of hypertriglyceridemia and the paucity of genes known to modulate triglyceride secretion.
Differential microRNA Analysis of Glandular Trichomes and Young Leaves in Xanthium strumarium L. Reveals Their Putative Roles in Regulating Terpenoid Biosynthesis.

PubMed

Fan, Rongyan; Li, Yuanjun; Li, Changfu; Zhang, Yansheng

2015-01-01

The medicinal plant Xanthium strumarium L. (X. strumarium) is covered with glandular trichomes, which are the sites for synthesizing pharmacologically active terpenoids such as xanthatin. MicroRNAs (miRNAs) are a class of 21-24 nucleotide (nt) non-coding RNAs, most of which are identified as regulators of plant growth and development. Identification of miRNAs involved in the biosynthesis of plant secondary metabolites remains limited. In this study, high-throughput Illumina sequencing, combined with target gene prediction, was performed to discover novel and conserved miRNAs with potential roles in regulating terpenoid biosynthesis in X. strumarium glandular trichomes. Two small RNA libraries from leaves and glandular trichomes of X. strumarium were established. In total, 1,185 conserved miRNAs and 37 novel miRNAs were identified, with 494 conserved miRNAs and 18 novel miRNAs being differentially expressed between the two tissue sources. Based on the X. strumarium transcriptome data that we recently constructed, 3,307 annotated mRNA transcripts were identified as putative targets of the differentially expressed miRNAs. KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway analysis suggested that some of the differentially expressed miRNAs, including miR6435, miR5021 and miR1134, might be involved in terpenoid biosynthesis in the X. strumarium glandular trichomes. This study provides the first comprehensive analysis of miRNAs in X. strumarium, which forms the basis for further understanding of miRNA-based regulation of terpenoid biosynthesis.
Streptococcus suis serotype 9 strain GZ0565 contains a type VII secretion system putative substrate EsxA that contributes to bacterial virulence and a vanZ-like gene that confers resistance to teicoplanin and dalbavancin in Streptococcus agalactiae.

PubMed

Lai, Liying; Dai, Jiao; Tang, Huanyu; Zhang, Shouming; Wu, Chunyan; Qiu, Wancen; Lu, Chengping; Yao, Huochun; Fan, Hongjie; Wu, Zongfu

2017-06-01

Streptococcus suis (SS), an important pathogen for pigs, is not only considered as a zoonotic agent for humans, but is also recognized as a major reservoir of antimicrobial resistance contributing to the spread of resistance genes to other pathogenic Streptococcus species. In addition to serotype 2 (SS2), serotype 9 (SS9) is another prevalent serotype isolated from diseased pigs. Although many SS strains have been sequenced, the complete genome of a non-SS2 virulent strain has been unavailable to date. Here, we report the complete genome of GZ0565, a virulent strain of SS9, isolated from a pig with meningitis. Comparative genomic analysis revealed five new putative virulence or antimicrobial resistance-associated genes in strain GZ0565 but not in SS2 virulent strains. These five genes encode a putative triacylglycerol lipase, a TipAS antibiotic-recognition domain protein, a putative TetR family transcriptional repressor, a protein containing a LPXTG domain and a G5 domain, and a type VII secretion system (T7SS) putative substrate (EsxA), respectively. Western blot analysis showed that strain GZ0565 can secrete EsxA. We generated an esxA deletion mutant and showed that EsxA contributes to SS virulence in a mouse infection model. Additionally, the antibiotic resistance gene vanZ SS was identified and expression of vanZ SS conferred resistance to teicoplanin and dalbavancin in Streptococcus agalactiae. We believe this is the first experimental demonstration of the existence of the T7SS putative substrate EsxA and its contribution to bacterial virulence in SS. Together, our results contribute to further understanding of the virulence and antimicrobial resistance characteristics of SS. Copyright © 2017 Elsevier B.V. All rights reserved.
Bioinformatic analysis of the nucleotide binding site-encoding disease-resistance genes in foxtail millet (Setaria italica (L.) Beauv.).

PubMed

Zhu, Y B; Xie, X Q; Li, Z Y; Bai, H; Dong, L; Dong, Z P; Dong, J G

2014-08-28

The nucleotide-binding site (NBS) disease-resistance genes are the largest category of plant disease-resistance gene analogs. The complete set of disease-resistant candidate genes, which encode the NBS sequence, was filtered in the genomes of two varieties of foxtail millet (Yugu1 and 'Zhang gu'). This study investigated a number of characteristics of the putative NBS genes, such as structural diversity and phylogenetic relationships. A total of 269 and 281 NBS-coding sequences were identified in Yugu1 and 'Zhang gu', respectively. When the two databases were compared, 72 genes were found to be identical and 164 genes showed more than 90% similarity. Physical positioning and gene family analysis of the NBS disease-resistance genes in the genome revealed that the number of genes on each chromosome was similar in both varieties. The eighth chromosome contained the largest number of genes and the ninth chromosome contained the lowest number of genes. Exactly 34 gene clusters containing the 161 genes were found in the Yugu1 genome, with each cluster containing 4.7 genes on average. In comparison, the 'Zhang gu' genome possessed 28 gene clusters, which had 151 genes, with an average of 5.4 genes in each cluster. The largest gene cluster, located on the eighth chromosome, contained 12 genes in the Yugu1 database, whereas it contained 16 genes in the 'Zhang gu' database. The classification results showed that the CC-NBS-LRR gene made up the largest part of each chromosome in the two databases. Two TIR-NBS genes were also found in the Yugu1 genome.
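The average cluster sizes quoted above follow directly from the reported gene and cluster counts. The short sketch below recomputes them; it is an illustrative check rather than the authors' pipeline, and the function and dictionary names are hypothetical.

```python
def mean_genes_per_cluster(clustered_genes: int, clusters: int) -> float:
    """Return the average number of genes per cluster, rounded to one decimal place."""
    if clusters <= 0:
        raise ValueError("cluster count must be positive")
    return round(clustered_genes / clusters, 1)


# (clustered genes, clusters) as reported in the abstract for each variety.
reported = {
    "Yugu1": (161, 34),
    "Zhang gu": (151, 28),
}

averages = {
    variety: mean_genes_per_cluster(genes, clusters)
    for variety, (genes, clusters) in reported.items()
}
print(averages)
```

Both values match the averages stated in the abstract (4.7 for Yugu1 and 5.4 for 'Zhang gu').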
DNA sequence of a ColV plasmid and prevalence of selected plasmid-encoded virulence genes among avian Escherichia coli strains.

PubMed

Johnson, Timothy J; Siek, Kylie E; Johnson, Sara J; Nolan, Lisa K

2006-01-01

ColV plasmids have long been associated with the virulence of Escherichia coli, despite the fact that their namesake trait, ColV production, does not appear to contribute to virulence. Such plasmids or their associated sequences appear to be quite common among avian pathogenic E. coli (APEC) and are strongly linked to the virulence of these organisms. In the present study, a 180-kb ColV plasmid was sequenced and analyzed. This plasmid, pAPEC-O2-ColV, possesses a 93-kb region containing several putative virulence traits, including iss, tsh, and four putative iron acquisition and transport systems. The iron acquisition and transport systems include those encoding aerobactin and salmochelin, the sit ABC iron transport system, and a putative iron transport system novel to APEC, eit. In order to determine the prevalence of the virulence-associated genes within this region among avian E. coli strains, 595 APEC and 199 avian commensal E. coli isolates were examined for genes of this region using PCR. Results indicate that genes contained within a portion of this putative virulence region are highly conserved among APEC and that the genes of this region occur significantly more often in APEC than in avian commensal E. coli. The region of pAPEC-O2-ColV containing genes that are highly prevalent among APEC appears to be a distinguishing trait of APEC strains.
DNA Sequence of a ColV Plasmid and Prevalence of Selected Plasmid-Encoded Virulence Genes among Avian Escherichia coli Strains

PubMed Central

Johnson, Timothy J.; Siek, Kylie E.; Johnson, Sara J.; Nolan, Lisa K.

2006-01-01

ColV plasmids have long been associated with the virulence of Escherichia coli, despite the fact that their namesake trait, ColV production, does not appear to contribute to virulence. Such plasmids or their associated sequences appear to be quite common among avian pathogenic E. coli (APEC) and are strongly linked to the virulence of these organisms. In the present study, a 180-kb ColV plasmid was sequenced and analyzed. This plasmid, pAPEC-O2-ColV, possesses a 93-kb region containing several putative virulence traits, including iss, tsh, and four putative iron acquisition and transport systems. The iron acquisition and transport systems include those encoding aerobactin and salmochelin, the sit ABC iron transport system, and a putative iron transport system novel to APEC, eit. In order to determine the prevalence of the virulence-associated genes within this region among avian E. coli strains, 595 APEC and 199 avian commensal E. coli isolates were examined for genes of this region using PCR. Results indicate that genes contained within a portion of this putative virulence region are highly conserved among APEC and that the genes of this region occur significantly more often in APEC than in avian commensal E. coli. The region of pAPEC-O2-ColV containing genes that are highly prevalent among APEC appears to be a distinguishing trait of APEC strains. PMID:16385064
Sequencing and functional annotation of the whole genome of the filamentous fungus Aspergillus westerdijkiae.

PubMed

Han, Xiaolong; Chakrabortti, Alolika; Zhu, Jindong; Liang, Zhao-Xun; Li, Jinming

2016-08-15

Aspergillus westerdijkiae, a member of Aspergillus section Circumdati, produces ochratoxin A (OTA). It is responsible for the contamination of agricultural crops, fruits, and food commodities, and its secondary metabolite OTA poses a potential threat to animals and humans. As a filamentous fungus, it also has a capacity for enzymatic catalysis and secondary metabolite production that is valuable in industrial production and medicine. To understand the genetic factors underlying its pathogenicity, enzymatic degradation, and secondary metabolism, we analysed the whole genome of A. westerdijkiae and compared it with eight other sequenced Aspergillus species. We sequenced the complete genome of A. westerdijkiae and assembled approximately 36 Mb of its genomic DNA, in which we identified 10,861 putative protein-coding genes. We constructed a phylogenetic tree of A. westerdijkiae and eight other sequenced Aspergillus species and found that the sister group of A. westerdijkiae was the A. oryzae-A. flavus clade. By searching the associated databases, we identified 716 cytochrome P450 enzymes, 633 carbohydrate-active enzymes, and 377 proteases. By combining comparative analysis with Kyoto Encyclopaedia of Genes and Genomes (KEGG), Conserved Domains Database (CDD), and Pfam annotations, we predicted 228 potential carbohydrate-active enzymes related to plant polysaccharide degradation (PPD). We found a large number of secondary metabolite biosynthetic gene clusters, suggesting that A. westerdijkiae has a remarkable capacity to produce secondary metabolites. Furthermore, we obtained two more reliable and integrated gene sequences containing the reported portions of OTA biosynthesis and identified their respective secondary metabolite clusters. We also systematically annotated these two hybrid t1pks-nrps gene clusters involved in OTA biosynthesis. The two clusters were separate in the genome, and one of them contained a couple of GH3 and AA3 enzyme genes involved in sucrose and glucose metabolism. The genomic information obtained in this study is valuable for understanding the life cycle and pathogenicity of A. westerdijkiae. We identified numerous enzyme genes that are potentially involved in host invasion and pathogenicity, and we provided a preliminary prediction for each putative secondary metabolite (SM) gene cluster. In particular, for the OTA-related SM gene clusters, we report their components with domain and pathway annotations. This study sets the stage for experimental verification of the biosynthetic and regulatory mechanisms of OTA and for the discovery of new secondary metabolites.
Long Non-Coding RNAs Responsive to Salt and Boron Stress in the Hyper-Arid Lluteño Maize from Atacama Desert.

PubMed

Huanca-Mamani, Wilson; Arias-Carrasco, Raúl; Cárdenas-Ninasivincha, Steffany; Rojas-Herrera, Marcelo; Sepúlveda-Hermosilla, Gonzalo; Caris-Maldonado, José Carlos; Bastías, Elizabeth; Maracaja-Coutinho, Vinicius

2018-03-20

Long non-coding RNAs (lncRNAs) are defined as transcripts longer than 200 nucleotides that lack significant protein-coding potential, and they play critical roles in diverse cellular processes. Long non-coding RNAs have recently been functionally characterized in plant stress-response mechanisms. In the present study, we perform a comprehensive identification of lncRNAs responsive to combined stress induced by salinity and excess boron in Lluteño maize, a tolerant maize landrace from the Atacama Desert, Chile. We use deep RNA sequencing to identify a set of 48,345 different lncRNAs, of which 28,012 (58.1%) are conserved in other maize genotypes (B73, Mo17 or Palomero), with the remaining 41.9% being potentially Lluteño-exclusive lncRNA transcripts. According to the B73 maize reference genome sequence, most Lluteño lncRNAs correspond to intergenic transcripts. Interestingly, Lluteño lncRNAs show unusually higher overall expression than protein-coding genes under stress conditions. In total, we identified 1710 lncRNAs putatively responsive to the combined stress of salt and boron exposure. We also identified a set of 848 stress-responsive potential trans natural antisense transcript (trans-NAT) lncRNAs, which seem to regulate genes associated with regulation of transcription, response to stress, response to abiotic stimulus and the nicotianamine metabolic process. Reverse transcription-quantitative PCR (RT-qPCR) experiments were performed on a subset of lncRNAs, validating their existence and expression patterns. Our results suggest that a diverse set of maize lncRNAs from leaves and roots is responsive to combined salt and boron stress. This is the first effort to identify lncRNAs from a maize landrace adapted to extreme conditions such as those of the Atacama Desert. The information generated is a starting point for understanding the genomic adaptations that allow this maize to withstand this extremely stressful environment.
Long Non-Coding RNAs Responsive to Salt and Boron Stress in the Hyper-Arid Lluteño Maize from Atacama Desert

PubMed Central

Huanca-Mamani, Wilson; Arias-Carrasco, Raúl; Cárdenas-Ninasivincha, Steffany; Rojas-Herrera, Marcelo; Sepúlveda-Hermosilla, Gonzalo; Caris-Maldonado, José Carlos; Bastías, Elizabeth; Maracaja-Coutinho, Vinicius

2018-01-01

Long non-coding RNAs (lncRNAs) are defined as transcripts longer than 200 nucleotides that lack significant protein-coding potential, and they play critical roles in diverse cellular processes. Long non-coding RNAs have recently been functionally characterized in plant stress-response mechanisms. In the present study, we perform a comprehensive identification of lncRNAs responsive to combined stress induced by salinity and excess boron in Lluteño maize, a tolerant maize landrace from the Atacama Desert, Chile. We use deep RNA sequencing to identify a set of 48,345 different lncRNAs, of which 28,012 (58.1%) are conserved in other maize genotypes (B73, Mo17 or Palomero), with the remaining 41.9% being potentially Lluteño-exclusive lncRNA transcripts. According to the B73 maize reference genome sequence, most Lluteño lncRNAs correspond to intergenic transcripts. Interestingly, Lluteño lncRNAs show unusually higher overall expression than protein-coding genes under stress conditions. In total, we identified 1710 lncRNAs putatively responsive to the combined stress of salt and boron exposure. We also identified a set of 848 stress-responsive potential trans natural antisense transcript (trans-NAT) lncRNAs, which seem to regulate genes associated with regulation of transcription, response to stress, response to abiotic stimulus and the nicotianamine metabolic process. Reverse transcription-quantitative PCR (RT-qPCR) experiments were performed on a subset of lncRNAs, validating their existence and expression patterns. Our results suggest that a diverse set of maize lncRNAs from leaves and roots is responsive to combined salt and boron stress. This is the first effort to identify lncRNAs from a maize landrace adapted to extreme conditions such as those of the Atacama Desert. The information generated is a starting point for understanding the genomic adaptations that allow this maize to withstand this extremely stressful environment. PMID:29558449
