functional genes present: Topics by Science.gov

Sample records for functional genes present

GO-based functional dissimilarity of gene sets.

PubMed

Díaz-Díaz, Norberto; Aguilar-Ruiz, Jesús S

2011-09-01

The Gene Ontology (GO) provides a controlled vocabulary for describing the functions of genes and can be used to evaluate the functional coherence of gene sets. Many functional coherence measures consider each pair of gene functions in a set and produce an output based on all pairwise distances. A single gene can encode multiple proteins that may differ in function. For each functionality, other proteins that exhibit the same activity may also participate. Therefore, an identification of the most common function for all of the genes involved in a biological process is important in evaluating the functional similarity of groups of genes and a quantification of functional coherence can helps to clarify the role of a group of genes working together. To implement this approach to functional assessment, we present GFD (GO-based Functional Dissimilarity), a novel dissimilarity measure for evaluating groups of genes based on the most relevant functions of the whole set. The measure assigns a numerical value to the gene set for each of the three GO sub-ontologies. Results show that GFD performs robustly when applied to gene set of known functionality (extracted from KEGG). It performs particularly well on randomly generated gene sets. An ROC analysis reveals that the performance of GFD in evaluating the functional dissimilarity of gene sets is very satisfactory. A comparative analysis against other functional measures, such as GS2 and those presented by Resnik and Wang, also demonstrates the robustness of GFD.
Extensive gene content variation in the Brachypodium distachyon pan-genome correlates with population structure

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gordon, Sean P.; Contreras-Moreira, Bruno; Woods, Daniel P.

While prokaryotic pan-genomes have been shown to contain many more genes than any individual organism, the prevalence and functional significance of differentially present genes in eukaryotes remains poorly understood. Whole-genome de novo assembly and annotation of 54 lines of the grass Brachypodium distachyon yield a pan-genome containing nearly twice the number of genes found in any individual genome. Genes present in all lines are enriched for essential biological functions, while genes present in only some lines are enriched for conditionally beneficial functions (e.g., defense and development), display faster evolutionary rates, lie closer to transposable elements and are less likely tomore » be syntenic with orthologous genes in other grasses. Our data suggest that differentially present genes contribute substantially to phenotypic variation within a eukaryote species, these genes have a major influence in population genetics, and transposable elements play a key role in pan-genome evolution.« less
Extensive gene content variation in the Brachypodium distachyon pan-genome correlates with population structure.

PubMed

Gordon, Sean P; Contreras-Moreira, Bruno; Woods, Daniel P; Des Marais, David L; Burgess, Diane; Shu, Shengqiang; Stritt, Christoph; Roulin, Anne C; Schackwitz, Wendy; Tyler, Ludmila; Martin, Joel; Lipzen, Anna; Dochy, Niklas; Phillips, Jeremy; Barry, Kerrie; Geuten, Koen; Budak, Hikmet; Juenger, Thomas E; Amasino, Richard; Caicedo, Ana L; Goodstein, David; Davidson, Patrick; Mur, Luis A J; Figueroa, Melania; Freeling, Michael; Catalan, Pilar; Vogel, John P

2017-12-19

While prokaryotic pan-genomes have been shown to contain many more genes than any individual organism, the prevalence and functional significance of differentially present genes in eukaryotes remains poorly understood. Whole-genome de novo assembly and annotation of 54 lines of the grass Brachypodium distachyon yield a pan-genome containing nearly twice the number of genes found in any individual genome. Genes present in all lines are enriched for essential biological functions, while genes present in only some lines are enriched for conditionally beneficial functions (e.g., defense and development), display faster evolutionary rates, lie closer to transposable elements and are less likely to be syntenic with orthologous genes in other grasses. Our data suggest that differentially present genes contribute substantially to phenotypic variation within a eukaryote species, these genes have a major influence in population genetics, and transposable elements play a key role in pan-genome evolution.
Extensive gene content variation in the Brachypodium distachyon pan-genome correlates with population structure

DOE PAGES

Gordon, Sean P.; Contreras-Moreira, Bruno; Woods, Daniel P.; ...

2017-12-19

While prokaryotic pan-genomes have been shown to contain many more genes than any individual organism, the prevalence and functional significance of differentially present genes in eukaryotes remains poorly understood. Whole-genome de novo assembly and annotation of 54 lines of the grass Brachypodium distachyon yield a pan-genome containing nearly twice the number of genes found in any individual genome. Genes present in all lines are enriched for essential biological functions, while genes present in only some lines are enriched for conditionally beneficial functions (e.g., defense and development), display faster evolutionary rates, lie closer to transposable elements and are less likely tomore » be syntenic with orthologous genes in other grasses. Our data suggest that differentially present genes contribute substantially to phenotypic variation within a eukaryote species, these genes have a major influence in population genetics, and transposable elements play a key role in pan-genome evolution.« less
Homeosis and beyond. What is the function of the Hox genes?

PubMed

Deutsch, Jean S

2010-01-01

What is the function of the Hox genes? At first glance, it is a curious question. Indeed, the answer seems so obvious that several authors have spoken of 'the Hox function' about some of the Hox genes, namely Hox3/zen and Hox6/ftz that seem to have lost it during the evolution of Arthropods. What these authors meant is that these genes have lost their 'homeotic' function. Indeed, 'homeotic' refers to a functional property that is so often associated with the Hox genes. However, the word 'Hox' should not be used to refer to a function, but to a group of genes. The above examples of Hox3/zen (see Schmitt-Ott's chapter, this book) and Hox6/ftz show that the homeotic function may be not so tightly linked to the Hox genes. Reversely, many genes, not belonging to the Hox group, do present a homeotic function. In the present chapter, I will first give a definition of the Hox genes. I will then ask what is the 'function' of a gene, examining its various meanings at different levels of biological organization. I will review and revisit the relation between the Hox genes and homeosis. I will suggest that their morphological homeotic function has been secondarily derived during the evolution of the Bilateria.
Biological interpretation of genome-wide association studies using predicted gene functions.

PubMed

Pers, Tune H; Karjalainen, Juha M; Chan, Yingleong; Westra, Harm-Jan; Wood, Andrew R; Yang, Jian; Lui, Julian C; Vedantam, Sailaja; Gustafsson, Stefan; Esko, Tonu; Frayling, Tim; Speliotes, Elizabeth K; Boehnke, Michael; Raychaudhuri, Soumya; Fehrmann, Rudolf S N; Hirschhorn, Joel N; Franke, Lude

2015-01-19

The main challenge for gaining biological insights from genetic associations is identifying which genes and pathways explain the associations. Here we present DEPICT, an integrative tool that employs predicted gene functions to systematically prioritize the most likely causal genes at associated loci, highlight enriched pathways and identify tissues/cell types where genes from associated loci are highly expressed. DEPICT is not limited to genes with established functions and prioritizes relevant gene sets for many phenotypes.
Identification and function analysis of contrary genes in Dupuytren's contracture.

PubMed

Ji, Xianglu; Tian, Feng; Tian, Lijie

2015-07-01

The present study aimed to analyze the expression of genes involved in Dupuytren's contracture (DC), using bioinformatic methods. The profile of GSE21221 was downloaded from the gene expression ominibus, which included six samples, derived from fibroblasts and six healthy control samples, derived from carpal-tunnel fibroblasts. A Distributed Intrusion Detection System was used in order to identify differentially expressed genes. The term contrary genes is proposed. Contrary genes were the genes that exhibited opposite expression patterns in the positive and negative groups, and likely exhibited opposite functions. These were identified using Coexpress software. Gene ontology (GO) function analysis was conducted for the contrary genes. A network of GO terms was constructed using the reduce and visualize gene ontology database. Significantly expressed genes (801) and contrary genes (98) were screened. A significant association was observed between Chitinase-3-like protein 1 and ten genes in the positive gene set. Positive regulation of transcription and the activation of nuclear factor-κB (NF-κB)-inducing kinase activity exhibited the highest degree values in the network of GO terms. In the present study, the expression of genes involved in the development of DC was analyzed, and the concept of contrary genes proposed. The genes identified in the present study are involved in the positive regulation of transcription and activation of NF-κB-inducing kinase activity. The contrary genes and GO terms identified in the present study may potentially be used for DC diagnosis and treatment.
Biological interpretation of genome-wide association studies using predicted gene functions

PubMed Central

Pers, Tune H.; Karjalainen, Juha M.; Chan, Yingleong; Westra, Harm-Jan; Wood, Andrew R.; Yang, Jian; Lui, Julian C.; Vedantam, Sailaja; Gustafsson, Stefan; Esko, Tonu; Frayling, Tim; Speliotes, Elizabeth K.; Boehnke, Michael; Raychaudhuri, Soumya; Fehrmann, Rudolf S.N.; Hirschhorn, Joel N.; Franke, Lude

2015-01-01

The main challenge for gaining biological insights from genetic associations is identifying which genes and pathways explain the associations. Here we present DEPICT, an integrative tool that employs predicted gene functions to systematically prioritize the most likely causal genes at associated loci, highlight enriched pathways and identify tissues/cell types where genes from associated loci are highly expressed. DEPICT is not limited to genes with established functions and prioritizes relevant gene sets for many phenotypes. PMID:25597830
MorphDB: Prioritizing Genes for Specialized Metabolism Pathways and Gene Ontology Categories in Plants.

PubMed

Zwaenepoel, Arthur; Diels, Tim; Amar, David; Van Parys, Thomas; Shamir, Ron; Van de Peer, Yves; Tzfadia, Oren

2018-01-01

Recent times have seen an enormous growth of "omics" data, of which high-throughput gene expression data are arguably the most important from a functional perspective. Despite huge improvements in computational techniques for the functional classification of gene sequences, common similarity-based methods often fall short of providing full and reliable functional information. Recently, the combination of comparative genomics with approaches in functional genomics has received considerable interest for gene function analysis, leveraging both gene expression based guilt-by-association methods and annotation efforts in closely related model organisms. Besides the identification of missing genes in pathways, these methods also typically enable the discovery of biological regulators (i.e., transcription factors or signaling genes). A previously built guilt-by-association method is MORPH, which was proven to be an efficient algorithm that performs particularly well in identifying and prioritizing missing genes in plant metabolic pathways. Here, we present MorphDB, a resource where MORPH-based candidate genes for large-scale functional annotations (Gene Ontology, MapMan bins) are integrated across multiple plant species. Besides a gene centric query utility, we present a comparative network approach that enables researchers to efficiently browse MORPH predictions across functional gene sets and species, facilitating efficient gene discovery and candidate gene prioritization. MorphDB is available at http://bioinformatics.psb.ugent.be/webtools/morphdb/morphDB/index/. We also provide a toolkit, named "MORPH bulk" (https://github.com/arzwa/morph-bulk), for running MORPH in bulk mode on novel data sets, enabling researchers to apply MORPH to their own species of interest.
fabp4 is central to eight obesity associated genes: a functional gene network-based polymorphic study.

PubMed

Bag, Susmita; Ramaiah, Sudha; Anbarasu, Anand

2015-01-07

Network study on genes and proteins offers functional basics of the complexity of gene and protein, and its interacting partners. The gene fatty acid-binding protein 4 (fabp4) is found to be highly expressed in adipose tissue, and is one of the most abundant proteins in mature adipocytes. Our investigations on functional modules of fabp4 provide useful information on the functional genes interacting with fabp4, their biochemical properties and their regulatory functions. The present study shows that there are eight set of candidate genes: acp1, ext2, insr, lipe, ostf1, sncg, usp15, and vim that are strongly and functionally linked up with fabp4. Gene ontological analysis of network modules of fabp4 provides an explicit idea on the functional aspect of fabp4 and its interacting nodes. The hierarchal mapping on gene ontology indicates gene specific processes and functions as well as their compartmentalization in tissues. The fabp4 along with its interacting genes are involved in lipid metabolic activity and are integrated in multi-cellular processes of tissues and organs. They also have important protein/enzyme binding activity. Our study elucidated disease-associated nsSNP prediction for fabp4 and it is interesting to note that there are four rsID׳s (rs1051231, rs3204631, rs140925685 and rs141169989) with disease allelic variation (T104P, T126P, G27D and G90V respectively). On the whole, our gene network analysis presents a clear insight about the interactions and functions associated with fabp4 gene network. Copyright © 2014 Elsevier Ltd. All rights reserved.
Gene Fusion: A Genome Wide Survey

NASA Technical Reports Server (NTRS)

Liang, Ping; Riley, Monica

2001-01-01

As a well known fact, organisms form larger and complex multimodular (composite or chimeric) and mostly multi-functional proteins through gene fusion of two or more individual genes which have independent evolution histories and functions. We call each of these components a module. The existence of multimodular proteins may improves the efficiency in gene regulation and in cellular functions, and thus may give the host organism advantages in adaptation to environments. Analysis of all gene fusions in present-day organisms should allow us to examine the patterns of gene fusion in context with cellular functions, to trace back the evolution processes from the ancient smaller and uni-functional proteins to the present-day larger and complex multi-functional proteins, and to estimate the minimal number of ancestor proteins that existed in the last common ancestor for all life on earth. Although many multimodular proteins have been experimentally known, identification of gene fusion events systematically at genome scale had not been possible until recently when large number of completed genome sequences have been becoming available. In addition, technical difficulties for such analysis also exist due to the complexity of this biological and evolutionary process. We report from this study a new strategy to computationally identify multimodular proteins using completed genome sequences and the results surveyed from 22 organisms with the data from over 40 organisms to be presented during the meeting. Additional information is contained in the original extended abstract.
Reveal genes functionally associated with ACADS by a network study.

PubMed

Chen, Yulong; Su, Zhiguang

2015-09-15

Establishing a systematic network is aimed at finding essential human gene-gene/gene-disease pathway by means of network inter-connecting patterns and functional annotation analysis. In the present study, we have analyzed functional gene interactions of short-chain acyl-coenzyme A dehydrogenase gene (ACADS). ACADS plays a vital role in free fatty acid β-oxidation and regulates energy homeostasis. Modules of highly inter-connected genes in disease-specific ACADS network are derived by integrating gene function and protein interaction data. Among the 8 genes in ACADS web retrieved from both STRING and GeneMANIA, ACADS is effectively conjoined with 4 genes including HAHDA, HADHB, ECHS1 and ACAT1. The functional analysis is done via ontological briefing and candidate disease identification. We observed that the highly efficient-interlinked genes connected with ACADS are HAHDA, HADHB, ECHS1 and ACAT1. Interestingly, the ontological aspect of genes in the ACADS network reveals that ACADS, HAHDA and HADHB play equally vital roles in fatty acid metabolism. The gene ACAT1 together with ACADS indulges in ketone metabolism. Our computational gene web analysis also predicts potential candidate disease recognition, thus indicating the involvement of ACADS, HAHDA, HADHB, ECHS1 and ACAT1 not only with lipid metabolism but also with infant death syndrome, skeletal myopathy, acute hepatic encephalopathy, Reye-like syndrome, episodic ketosis, and metabolic acidosis. The current study presents a comprehensible layout of ACADS network, its functional strategies and candidate disease approach associated with ACADS network. Copyright © 2015 Elsevier B.V. All rights reserved.
Ensemble gene function prediction database reveals genes important for complex I formation in Arabidopsis thaliana.

PubMed

Hansen, Bjoern Oest; Meyer, Etienne H; Ferrari, Camilla; Vaid, Neha; Movahedi, Sara; Vandepoele, Klaas; Nikoloski, Zoran; Mutwil, Marek

2018-03-01

Recent advances in gene function prediction rely on ensemble approaches that integrate results from multiple inference methods to produce superior predictions. Yet, these developments remain largely unexplored in plants. We have explored and compared two methods to integrate 10 gene co-function networks for Arabidopsis thaliana and demonstrate how the integration of these networks produces more accurate gene function predictions for a larger fraction of genes with unknown function. These predictions were used to identify genes involved in mitochondrial complex I formation, and for five of them, we confirmed the predictions experimentally. The ensemble predictions are provided as a user-friendly online database, EnsembleNet. The methods presented here demonstrate that ensemble gene function prediction is a powerful method to boost prediction performance, whereas the EnsembleNet database provides a cutting-edge community tool to guide experimentalists. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Integrated pathway-based approach identifies association between genomic regions at CTCF and CACNB2 and schizophrenia.

PubMed

Juraeva, Dilafruz; Haenisch, Britta; Zapatka, Marc; Frank, Josef; Witt, Stephanie H; Mühleisen, Thomas W; Treutlein, Jens; Strohmaier, Jana; Meier, Sandra; Degenhardt, Franziska; Giegling, Ina; Ripke, Stephan; Leber, Markus; Lange, Christoph; Schulze, Thomas G; Mössner, Rainald; Nenadic, Igor; Sauer, Heinrich; Rujescu, Dan; Maier, Wolfgang; Børglum, Anders; Ophoff, Roel; Cichon, Sven; Nöthen, Markus M; Rietschel, Marcella; Mattheisen, Manuel; Brors, Benedikt

2014-06-01

In the present study, an integrated hierarchical approach was applied to: (1) identify pathways associated with susceptibility to schizophrenia; (2) detect genes that may be potentially affected in these pathways since they contain an associated polymorphism; and (3) annotate the functional consequences of such single-nucleotide polymorphisms (SNPs) in the affected genes or their regulatory regions. The Global Test was applied to detect schizophrenia-associated pathways using discovery and replication datasets comprising 5,040 and 5,082 individuals of European ancestry, respectively. Information concerning functional gene-sets was retrieved from the Kyoto Encyclopedia of Genes and Genomes, Gene Ontology, and the Molecular Signatures Database. Fourteen of the gene-sets or pathways identified in the discovery dataset were confirmed in the replication dataset. These include functional processes involved in transcriptional regulation and gene expression, synapse organization, cell adhesion, and apoptosis. For two genes, i.e. CTCF and CACNB2, evidence for association with schizophrenia was available (at the gene-level) in both the discovery study and published data from the Psychiatric Genomics Consortium schizophrenia study. Furthermore, these genes mapped to four of the 14 presently identified pathways. Several of the SNPs assigned to CTCF and CACNB2 have potential functional consequences, and a gene in close proximity to CACNB2, i.e. ARL5B, was identified as a potential gene of interest. Application of the present hierarchical approach thus allowed: (1) identification of novel biological gene-sets or pathways with potential involvement in the etiology of schizophrenia, as well as replication of these findings in an independent cohort; (2) detection of genes of interest for future follow-up studies; and (3) the highlighting of novel genes in previously reported candidate regions for schizophrenia.
A high resolution atlas of gene expression in the domestic sheep (Ovis aries)

PubMed Central

Farquhar, Iseabail L.; Young, Rachel; Lefevre, Lucas; Pridans, Clare; Tsang, Hiu G.; Afrasiabi, Cyrus; Watson, Mick; Whitelaw, C. Bruce; Freeman, Tom C.; Archibald, Alan L.; Hume, David A.

2017-01-01

Sheep are a key source of meat, milk and fibre for the global livestock sector, and an important biomedical model. Global analysis of gene expression across multiple tissues has aided genome annotation and supported functional annotation of mammalian genes. We present a large-scale RNA-Seq dataset representing all the major organ systems from adult sheep and from several juvenile, neonatal and prenatal developmental time points. The Ovis aries reference genome (Oar v3.1) includes 27,504 genes (20,921 protein coding), of which 25,350 (19,921 protein coding) had detectable expression in at least one tissue in the sheep gene expression atlas dataset. Network-based cluster analysis of this dataset grouped genes according to their expression pattern. The principle of ‘guilt by association’ was used to infer the function of uncharacterised genes from their co-expression with genes of known function. We describe the overall transcriptional signatures present in the sheep gene expression atlas and assign those signatures, where possible, to specific cell populations or pathways. The findings are related to innate immunity by focusing on clusters with an immune signature, and to the advantages of cross-breeding by examining the patterns of genes exhibiting the greatest expression differences between purebred and crossbred animals. This high-resolution gene expression atlas for sheep is, to our knowledge, the largest transcriptomic dataset from any livestock species to date. It provides a resource to improve the annotation of the current reference genome for sheep, presenting a model transcriptome for ruminants and insight into gene, cell and tissue function at multiple developmental stages. PMID:28915238
A high resolution atlas of gene expression in the domestic sheep (Ovis aries).

PubMed

Clark, Emily L; Bush, Stephen J; McCulloch, Mary E B; Farquhar, Iseabail L; Young, Rachel; Lefevre, Lucas; Pridans, Clare; Tsang, Hiu G; Wu, Chunlei; Afrasiabi, Cyrus; Watson, Mick; Whitelaw, C Bruce; Freeman, Tom C; Summers, Kim M; Archibald, Alan L; Hume, David A

2017-09-01

Sheep are a key source of meat, milk and fibre for the global livestock sector, and an important biomedical model. Global analysis of gene expression across multiple tissues has aided genome annotation and supported functional annotation of mammalian genes. We present a large-scale RNA-Seq dataset representing all the major organ systems from adult sheep and from several juvenile, neonatal and prenatal developmental time points. The Ovis aries reference genome (Oar v3.1) includes 27,504 genes (20,921 protein coding), of which 25,350 (19,921 protein coding) had detectable expression in at least one tissue in the sheep gene expression atlas dataset. Network-based cluster analysis of this dataset grouped genes according to their expression pattern. The principle of 'guilt by association' was used to infer the function of uncharacterised genes from their co-expression with genes of known function. We describe the overall transcriptional signatures present in the sheep gene expression atlas and assign those signatures, where possible, to specific cell populations or pathways. The findings are related to innate immunity by focusing on clusters with an immune signature, and to the advantages of cross-breeding by examining the patterns of genes exhibiting the greatest expression differences between purebred and crossbred animals. This high-resolution gene expression atlas for sheep is, to our knowledge, the largest transcriptomic dataset from any livestock species to date. It provides a resource to improve the annotation of the current reference genome for sheep, presenting a model transcriptome for ruminants and insight into gene, cell and tissue function at multiple developmental stages.
Functional comparison of microarray data across multiple platforms using the method of percentage of overlapping functions.

PubMed

Li, Zhiguang; Kwekel, Joshua C; Chen, Tao

2012-01-01

Functional comparison across microarray platforms is used to assess the comparability or similarity of the biological relevance associated with the gene expression data generated by multiple microarray platforms. Comparisons at the functional level are very important considering that the ultimate purpose of microarray technology is to determine the biological meaning behind the gene expression changes under a specific condition, not just to generate a list of genes. Herein, we present a method named percentage of overlapping functions (POF) and illustrate how it is used to perform the functional comparison of microarray data generated across multiple platforms. This method facilitates the determination of functional differences or similarities in microarray data generated from multiple array platforms across all the functions that are presented on these platforms. This method can also be used to compare the functional differences or similarities between experiments, projects, or laboratories.
Mutant phenotypes for thousands of bacterial genes of unknown function

DOE PAGES

Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan; ...

2018-05-16

One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because theymore » are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.« less
Mutant phenotypes for thousands of bacterial genes of unknown function

DOE Office of Scientific and Technical Information (OSTI.GOV)

Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan

One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because theymore » are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.« less
FOXP2

PubMed Central

Nudel, Ron; Newbury, Dianne F

2013-01-01

The forkhead box P2 gene, designated FOXP2, is the first gene implicated in a speech and language disorder. Since its discovery, many studies have been carried out in an attempt to explain the mechanism by which it influences these characteristically human traits. This review presents the story of the discovery of the FOXP2 gene, including early studies of the phenotypic implications of a disruption in the gene. We then discuss recent investigations into the molecular function of the FOXP2 gene, including functional and gene expression studies. We conclude this review by presenting the fascinating results of recent studies of the FOXP2 ortholog in other species that are capable of vocal communication. WIREs Cogn Sci 2013, 4:547–560. doi: 10.1002/wcs.1247 PMID:24765219

Dramatic Increases of Soil Microbial Functional Gene Diversity at the Treeline Ecotone of Changbai Mountain.

PubMed

Shen, Congcong; Shi, Yu; Ni, Yingying; Deng, Ye; Van Nostrand, Joy D; He, Zhili; Zhou, Jizhong; Chu, Haiyan

2016-01-01

The elevational and latitudinal diversity patterns of microbial taxa have attracted great attention in the past decade. Recently, the distribution of functional attributes has been in the spotlight. Here, we report a study profiling soil microbial communities along an elevation gradient (500-2200 m) on Changbai Mountain. Using a comprehensive functional gene microarray (GeoChip 5.0), we found that microbial functional gene richness exhibited a dramatic increase at the treeline ecotone, but the bacterial taxonomic and phylogenetic diversity based on 16S rRNA gene sequencing did not exhibit such a similar trend. However, the β-diversity (compositional dissimilarity among sites) pattern for both bacterial taxa and functional genes was similar, showing significant elevational distance-decay patterns which presented increased dissimilarity with elevation. The bacterial taxonomic diversity/structure was strongly influenced by soil pH, while the functional gene diversity/structure was significantly correlated with soil dissolved organic carbon (DOC). This finding highlights that soil DOC may be a good predictor in determining the elevational distribution of microbial functional genes. The finding of significant shifts in functional gene diversity at the treeline ecotone could also provide valuable information for predicting the responses of microbial functions to climate change.
Diametrical clustering for identifying anti-correlated gene clusters.

PubMed

Dhillon, Inderjit S; Marcotte, Edward M; Roshan, Usman

2003-09-01

Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive-genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i). re-partitioning the genes and (ii). computing the dominant singular vector of each gene cluster; each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteosome, along with evidence for the inverse transcriptional regulation of a number of cellular systems.
Fungal Genes in Context: Genome Architecture Reflects Regulatory Complexity and Function

PubMed Central

Noble, Luke M.; Andrianopoulos, Alex

2013-01-01

Gene context determines gene expression, with local chromosomal environment most influential. Comparative genomic analysis is often limited in scope to conserved or divergent gene and protein families, and fungi are well suited to this approach with low functional redundancy and relatively streamlined genomes. We show here that one aspect of gene context, the amount of potential upstream regulatory sequence maintained through evolution, is highly predictive of both molecular function and biological process in diverse fungi. Orthologs with large upstream intergenic regions (UIRs) are strongly enriched in information processing functions, such as signal transduction and sequence-specific DNA binding, and, in the genus Aspergillus, include the majority of experimentally studied, high-level developmental and metabolic transcriptional regulators. Many uncharacterized genes are also present in this class and, by implication, may be of similar importance. Large intergenic regions also share two novel sequence characteristics, currently of unknown significance: they are enriched for plus-strand polypyrimidine tracts and an information-rich, putative regulatory motif that was present in the last common ancestor of the Pezizomycotina. Systematic consideration of gene UIR in comparative genomics, particularly for poorly characterized species, could help reveal organisms’ regulatory priorities. PMID:23699226
Genome-Wide Gene Expression in relation to Age in Large Laboratory Cohorts of Drosophila melanogaster

PubMed Central

Carlson, Kimberly A.; Gardner, Kylee; Pashaj, Anjeza; Carlson, Darby J.; Yu, Fang; Eudy, James D.; Zhang, Chi; Harshman, Lawrence G.

2015-01-01

Aging is a complex process characterized by a steady decline in an organism's ability to perform life-sustaining tasks. In the present study, two cages of approximately 12,000 mated Drosophila melanogaster females were used as a source of RNA from individuals sampled frequently as a function of age. A linear model for microarray data method was used for the microarray analysis to adjust for the box effect; it identified 1,581 candidate aging genes. Cluster analyses using a self-organizing map algorithm on the 1,581 significant genes identified gene expression patterns across different ages. Genes involved in immune system function and regulation, chorion assembly and function, and metabolism were all significantly differentially expressed as a function of age. The temporal pattern of data indicated that gene expression related to aging is affected relatively early in life span. In addition, the temporal variance in gene expression in immune function genes was compared to a random set of genes. There was an increase in the variance of gene expression within each cohort, which was not observed in the set of random genes. This observation is compatible with the hypothesis that D. melanogaster immune function genes lose control of gene expression as flies age. PMID:26090231
Improving the measurement of semantic similarity by combining gene ontology and co-functional network: a random walk based approach.

PubMed

Peng, Jiajie; Zhang, Xuanshuo; Hui, Weiwei; Lu, Junya; Li, Qianqian; Liu, Shuhui; Shang, Xuequn

2018-03-19

Gene Ontology (GO) is one of the most popular bioinformatics resources. In the past decade, Gene Ontology-based gene semantic similarity has been effectively used to model gene-to-gene interactions in multiple research areas. However, most existing semantic similarity approaches rely only on GO annotations and structure, or incorporate only local interactions in the co-functional network. This may lead to inaccurate GO-based similarity resulting from the incomplete GO topology structure and gene annotations. We present NETSIM2, a new network-based method that allows researchers to measure GO-based gene functional similarities by considering the global structure of the co-functional network with a random walk with restart (RWR)-based method, and by selecting the significant term pairs to decrease the noise information. Based on the EC number (Enzyme Commission)-based groups of yeast and Arabidopsis, evaluation test shows that NETSIM2 can enhance the accuracy of Gene Ontology-based gene functional similarity. Using NETSIM2 as an example, we found that the accuracy of semantic similarities can be significantly improved after effectively incorporating the global gene-to-gene interactions in the co-functional network, especially on the species that gene annotations in GO are far from complete.
Prokaryotic cDNA Subtraction: A Method to Rapidly Identify Functional Gene Biomarkers

DTIC Science & Technology

2008-10-01

perchlorate-reducing bacteria (PRB) must not only be present, but they must also synthesize the enzymes that catalyze perchlorate reduction. The...synthesis of specific enzymes , termed gene expression, is often regulated by each cell in response to environmental conditions (e.g., influent water...diverse. MBT that target functional genes (e.g., genes that encode biodegradation enzymes ), might prove more useful for determining the capabilities of
Pattern Genes Suggest Functional Connectivity of Organs

NASA Astrophysics Data System (ADS)

Qin, Yangmei; Pan, Jianbo; Cai, Meichun; Yao, Lixia; Ji, Zhiliang

2016-05-01

Human organ, as the basic structural and functional unit in human body, is made of a large community of different cell types that organically bound together. Each organ usually exerts highly specified physiological function; while several related organs work smartly together to perform complicated body functions. In this study, we present a computational effort to understand the roles of genes in building functional connection between organs. More specifically, we mined multiple transcriptome datasets sampled from 36 human organs and tissues, and quantitatively identified 3,149 genes whose expressions showed consensus modularly patterns: specific to one organ/tissue, selectively expressed in several functionally related tissues and ubiquitously expressed. These pattern genes imply intrinsic connections between organs. According to the expression abundance of the 766 selective genes, we consistently cluster the 36 human organs/tissues into seven functional groups: adipose & gland, brain, muscle, immune, metabolism, mucoid and nerve conduction. The organs and tissues in each group either work together to form organ systems or coordinate to perform particular body functions. The particular roles of specific genes and selective genes suggest that they could not only be used to mechanistically explore organ functions, but also be designed for selective biomarkers and therapeutic targets.
Multifunctionality and diversity of GDSL esterase/lipase gene family in rice (Oryza sativa L. japonica) genome: new insights from bioinformatics analysis

PubMed Central

2012-01-01

Background GDSL esterases/lipases are a newly discovered subclass of lipolytic enzymes that are very important and attractive research subjects because of their multifunctional properties, such as broad substrate specificity and regiospecificity. Compared with the current knowledge regarding these enzymes in bacteria, our understanding of the plant GDSL enzymes is very limited, although the GDSL gene family in plant species include numerous members in many fully sequenced plant genomes. Only two genes from a large rice GDSL esterase/lipase gene family were previously characterised, and the majority of the members remain unknown. In the present study, we describe the rice OsGELP (Oryza sativa GDSL esterase/lipase protein) gene family at the genomic and proteomic levels, and use this knowledge to provide insights into the multifunctionality of the rice OsGELP enzymes. Results In this study, an extensive bioinformatics analysis identified 114 genes in the rice OsGELP gene family. A complete overview of this family in rice is presented, including the chromosome locations, gene structures, phylogeny, and protein motifs. Among the OsGELPs and the plant GDSL esterase/lipase proteins of known functions, 41 motifs were found that represent the core secondary structure elements or appear specifically in different phylogenetic subclades. The specification and distribution of identified putative conserved clade-common and -specific peptide motifs, and their location on the predicted protein three dimensional structure may possibly signify their functional roles. Potentially important regions for substrate specificity are highlighted, in accordance with protein three-dimensional model and location of the phylogenetic specific conserved motifs. The differential expression of some representative genes were confirmed by quantitative real-time PCR. The phylogenetic analysis, together with protein motif architectures, and the expression profiling were analysed to predict the possible biological functions of the rice OsGELP genes. Conclusions Our current genomic analysis, for the first time, presents fundamental information on the organization of the rice OsGELP gene family. With combination of the genomic, phylogenetic, microarray expression, protein motif distribution, and protein structure analyses, we were able to create supported basis for the functional prediction of many members in the rice GDSL esterase/lipase family. The present study provides a platform for the selection of candidate genes for further detailed functional study. PMID:22793791
Dramatic Increases of Soil Microbial Functional Gene Diversity at the Treeline Ecotone of Changbai Mountain

PubMed Central

Shen, Congcong; Shi, Yu; Ni, Yingying; Deng, Ye; Van Nostrand, Joy D.; He, Zhili; Zhou, Jizhong; Chu, Haiyan

2016-01-01

The elevational and latitudinal diversity patterns of microbial taxa have attracted great attention in the past decade. Recently, the distribution of functional attributes has been in the spotlight. Here, we report a study profiling soil microbial communities along an elevation gradient (500–2200 m) on Changbai Mountain. Using a comprehensive functional gene microarray (GeoChip 5.0), we found that microbial functional gene richness exhibited a dramatic increase at the treeline ecotone, but the bacterial taxonomic and phylogenetic diversity based on 16S rRNA gene sequencing did not exhibit such a similar trend. However, the β-diversity (compositional dissimilarity among sites) pattern for both bacterial taxa and functional genes was similar, showing significant elevational distance-decay patterns which presented increased dissimilarity with elevation. The bacterial taxonomic diversity/structure was strongly influenced by soil pH, while the functional gene diversity/structure was significantly correlated with soil dissolved organic carbon (DOC). This finding highlights that soil DOC may be a good predictor in determining the elevational distribution of microbial functional genes. The finding of significant shifts in functional gene diversity at the treeline ecotone could also provide valuable information for predicting the responses of microbial functions to climate change. PMID:27524983
GeoChip-Based Analysis of the Functional Gene Diversity and Metabolic Potential of Microbial Communities in Acid Mine Drainage▿ †

PubMed Central

Xie, Jianping; He, Zhili; Liu, Xinxing; Liu, Xueduan; Van Nostrand, Joy D.; Deng, Ye; Wu, Liyou; Zhou, Jizhong; Qiu, Guanzhou

2011-01-01

Acid mine drainage (AMD) is an extreme environment, usually with low pH and high concentrations of metals. Although the phylogenetic diversity of AMD microbial communities has been examined extensively, little is known about their functional gene diversity and metabolic potential. In this study, a comprehensive functional gene array (GeoChip 2.0) was used to analyze the functional diversity, composition, structure, and metabolic potential of AMD microbial communities from three copper mines in China. GeoChip data indicated that these microbial communities were functionally diverse as measured by the number of genes detected, gene overlapping, unique genes, and various diversity indices. Almost all key functional gene categories targeted by GeoChip 2.0 were detected in the AMD microbial communities, including carbon fixation, carbon degradation, methane generation, nitrogen fixation, nitrification, denitrification, ammonification, nitrogen reduction, sulfur metabolism, metal resistance, and organic contaminant degradation, which suggested that the functional gene diversity was higher than was previously thought. Mantel test results indicated that AMD microbial communities are shaped largely by surrounding environmental factors (e.g., S, Mg, and Cu). Functional genes (e.g., narG and norB) and several key functional processes (e.g., methane generation, ammonification, denitrification, sulfite reduction, and organic contaminant degradation) were significantly (P < 0.10) correlated with environmental variables. This study presents an overview of functional gene diversity and the structure of AMD microbial communities and also provides insights into our understanding of metabolic potential in AMD ecosystems. PMID:21097602
Paralogous ALT1 and ALT2 Retention and Diversification Have Generated Catalytically Active and Inactive Aminotransferases in Saccharomyces cerevisiae

PubMed Central

Peñalosa-Ruiz, Georgina; Aranda, Cristina; Ongay-Larios, Laura; Colon, Maritrini; Quezada, Hector; Gonzalez, Alicia

2012-01-01

Background Gene duplication and the subsequent divergence of paralogous pairs play a central role in the evolution of novel gene functions. S. cerevisiae possesses two paralogous genes (ALT1/ALT2) which presumably encode alanine aminotransferases. It has been previously shown that Alt1 encodes an alanine aminotransferase, involved in alanine metabolism; however the physiological role of Alt2 is not known. Here we investigate whether ALT2 encodes an active alanine aminotransferase. Principal Findings Our results show that although ALT1 and ALT2 encode 65% identical proteins, only Alt1 displays alanine aminotransferase activity; in contrast ALT2 encodes a catalytically inert protein. ALT1 and ALT2 expression is modulated by Nrg1 and by the intracellular alanine pool. ALT1 is alanine-induced showing a regulatory profile of a gene encoding an enzyme involved in amino acid catabolism, in agreement with the fact that Alt1 is the sole pathway for alanine catabolism present in S. cerevisiae. Conversely, ALT2 expression is alanine-repressed, indicating a role in alanine biosynthesis, although the encoded-protein has no alanine aminotransferase enzymatic activity. In the ancestral-like yeast L. kluyveri, the alanine aminotransferase activity was higher in the presence of alanine than in the presence of ammonium, suggesting that as for ALT1, LkALT1 expression could be alanine-induced. ALT2 retention poses the questions of whether the encoded protein plays a particular function, and if this function was present in the ancestral gene. It could be hypotesized that ALT2 diverged after duplication, through neo-functionalization or that ALT2 function was present in the ancestral gene, with a yet undiscovered function. Conclusions ALT1 and ALT2 divergence has resulted in delegation of alanine aminotransferase activity to Alt1. These genes display opposed regulatory profiles: ALT1 is alanine-induced, while ALT2 is alanine repressed. Both genes are negatively regulated by the Nrg1 repressor. Presented results indicate that alanine could act as ALT2 Nrg1-co-repressor. PMID:23049841
Comparative genome analysis of PHB gene family reveals deep evolutionary origins and diverse gene function.

PubMed

Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S

2010-10-07

PHB (Prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse function of PHB genes in plants for development, senescence, defence, and others. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried to evaluate the relatedness of PHB genes across different species. In order to better guide the gene function analysis and understand the evolution of the PHB gene family, we therefore carried out the comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that PHB genes is a relatively conserved gene family. The PHB genes can be classified into 5 classes and each class have a very deep evolutionary origin. The PHB genes within the class maintained the same motif patterns during the evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during the evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under the evolutionary pressure to derive new function. PHB gene family is a conserved gene family and accounts for diverse but important biological functions based on the similar molecular mechanisms. The highly diverse biological function indicated that more research needs to be carried out to dissect the PHB gene function. The conserved gene evolution indicated that the study in the model species can be translated to human and mammalian studies.
New Dimensions in Microbial Ecology-Functional Genes in Studies to Unravel the Biodiversity and Role of Functional Microbial Groups in the Environment.

PubMed

Imhoff, Johannes F

2016-05-24

During the past decades, tremendous advances have been made in the possibilities to study the diversity of microbial communities in the environment. The development of methods to study these communities on the basis of 16S rRNA gene sequences analysis was a first step into the molecular analysis of environmental communities and the study of biodiversity in natural habitats. A new dimension in this field was reached with the introduction of functional genes of ecological importance and the establishment of genetic tools to study the diversity of functional microbial groups and their responses to environmental factors. Functional gene approaches are excellent tools to study the diversity of a particular function and to demonstrate changes in the composition of prokaryote communities contributing to this function. The phylogeny of many functional genes largely correlates with that of the 16S rRNA gene, and microbial species may be identified on the basis of functional gene sequences. Functional genes are perfectly suited to link culture-based microbiological work with environmental molecular genetic studies. In this review, the development of functional gene studies in environmental microbiology is highlighted with examples of genes relevant for important ecophysiological functions. Examples are presented for bacterial photosynthesis and two types of anoxygenic phototrophic bacteria, with genes of the Fenna-Matthews-Olson-protein (fmoA) as target for the green sulfur bacteria and of two reaction center proteins (pufLM) for the phototrophic purple bacteria, with genes of adenosine-5'phosphosulfate (APS) reductase (aprA), sulfate thioesterase (soxB) and dissimilatory sulfite reductase (dsrAB) for sulfur oxidizing and sulfate reducing bacteria, with genes of ammonia monooxygenase (amoA) for nitrifying/ammonia-oxidizing bacteria, with genes of particulate nitrate reductase and nitrite reductases (narH/G, nirS, nirK) for denitrifying bacteria and with genes of methane monooxygenase (pmoA) for methane oxidizing bacteria.
Analysis of multiplex gene expression maps obtained by voxelation.

PubMed

An, Li; Xie, Hongbo; Chin, Mark H; Obradovic, Zoran; Smith, Desmond J; Megalooikonomou, Vasileios

2009-04-29

Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions. To analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in cortex and corpus callosum. The experimental results confirm the hypothesis that genes with similar gene expression maps might have similar gene functions. The voxelation data takes into account the location information of gene expression level in mouse brain, which is novel in related research. The proposed approach can potentially be used to predict gene functions and provide helpful suggestions to biologists.
Characterizing genes with distinct methylation patterns in the context of protein-protein interaction network: application to human brain tissues.

PubMed

Li, Yongsheng; Xu, Juan; Chen, Hong; Zhao, Zheng; Li, Shengli; Bai, Jing; Wu, Aiwei; Jiang, Chunjie; Wang, Yuan; Su, Bin; Li, Xia

2013-01-01

DNA methylation is an essential epigenetic mechanism involved in transcriptional control. However, how genes with different methylation patterns are assembled in the protein-protein interaction network (PPIN) remains a mystery. In the present study, we systematically dissected the characterization of genes with different methylation patterns in the PPIN. A negative association was detected between the methylation levels in the brain tissues and topological centralities. By focusing on two classes of genes with considerably different methylation levels in the brain tissues, namely the low methylated genes (LMGs) and high methylated genes (HMGs), we found that their organizing principles in the PPIN are distinct. The LMGs tend to be the center of the PPIN, and attacking them causes a more deleterious effect on the network integrity. Furthermore, the LMGs express their functions in a modular pattern and substantial differences in functions are observed between the two types of genes. The LMGs are enriched in the basic biological functions, such as binding activity and regulation of transcription. More importantly, cancer genes, especially recessive cancer genes, essential genes, and aging-related genes were all found more often in the LMGs. Additionally, our analysis presented that the intra-classes communications are enhanced, but inter-classes communications are repressed. Finally, a functional complementation was revealed between methylation and miRNA regulation in the human genome. We have elucidated the assembling principles of genes with different methylation levels in the context of the PPIN, providing key insights into the complex epigenetic regulation mechanisms.
Characterizing Genes with Distinct Methylation Patterns in the Context of Protein-Protein Interaction Network: Application to Human Brain Tissues

PubMed Central

Zhao, Zheng; Li, Shengli; Bai, Jing; Wu, Aiwei; Jiang, Chunjie; Wang, Yuan; Su, Bin; Li, Xia

2013-01-01

Background DNA methylation is an essential epigenetic mechanism involved in transcriptional control. However, how genes with different methylation patterns are assembled in the protein-protein interaction network (PPIN) remains a mystery. Results In the present study, we systematically dissected the characterization of genes with different methylation patterns in the PPIN. A negative association was detected between the methylation levels in the brain tissues and topological centralities. By focusing on two classes of genes with considerably different methylation levels in the brain tissues, namely the low methylated genes (LMGs) and high methylated genes (HMGs), we found that their organizing principles in the PPIN are distinct. The LMGs tend to be the center of the PPIN, and attacking them causes a more deleterious effect on the network integrity. Furthermore, the LMGs express their functions in a modular pattern and substantial differences in functions are observed between the two types of genes. The LMGs are enriched in the basic biological functions, such as binding activity and regulation of transcription. More importantly, cancer genes, especially recessive cancer genes, essential genes, and aging-related genes were all found more often in the LMGs. Additionally, our analysis presented that the intra-classes communications are enhanced, but inter-classes communications are repressed. Finally, a functional complementation was revealed between methylation and miRNA regulation in the human genome. Conclusions We have elucidated the assembling principles of genes with different methylation levels in the context of the PPIN, providing key insights into the complex epigenetic regulation mechanisms. PMID:23776563
Gene Ontology Terms and Automated Annotation for Energy-Related Microbial Genomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mukhopadhyay, Biswarup; Tyler, Brett M.; Setubal, Joao

Gene Ontology (GO) is one of the more widely used functional ontologies for describing gene functions at various levels. The project developed 660 GO terms for describing energy-related microbial processes and filled the known gaps in this area of the GO system, and then used these terms to describe functions of 179 genes to showcase the utilities of the new resources. It hosted a series of workshops and made presentations at key meetings to inform and train scientific community members on these terms and to receive inputs from them for the GO term generation efforts. The project has developed amore » website for storing and displaying the resources (http://www.mengo.biochem.vt.edu/). The outcome of the project was further disseminated through peer-reviewed publications and poster and seminar presentations.« less
A comparative analysis of the avirulence and translational transactivator functions of gene VI of Cauliflower mosaic virus.

PubMed

Palanichelvam, Karuppaiah; Schoelz, James E

2002-02-15

The primary function associated at present with the gene VI product of Cauliflower mosaic virus (CaMV) is that of a translational transactivator (TAV). In this capacity, it alters the host translational machinery to allow reinitiation of translation of other CaMV genes on the polycistronic 35S RNA of CaMV. In addition, the gene VI protein can elicit a specific type of plant defense response called the hypersensitive response (HR) in Nicotiana edwardsonii. In this study, we have adapted the agroinfiltration technique to compare the sequences of CaMV gene VI required for TAV function and elicitation of HR. To measure the activity of the TAV, we coagroinfiltrated gene VI of CaMV strain W260 with a bicistronic GUS reporter plasmid. TAV function could be assayed 4 days postinfiltration, before the onset of HR in N. edwardsonii. Through the use of the TAV and HR assays, we could show that the TAV functions of gene VI of CaMV strains W260 and D4 were equivalent, but only W260 gene VI elicited HR. A mutational analysis of W260 gene VI showed that the structural requirements for elicitation of HR were much more stringent than those for TAV function. Small deletions from either the 5' or 3' end of W260 gene VI abolished its ability to elicit HR, although the TAV function was retained in the mutant. The TAV function could also tolerate a small insertion within gene VI; this insertion abolished the elicitor function. This study provides direct evidence that the TAV function of gene VI is separate from its role as an elicitor of HR.
Comparative methods for the analysis of gene-expression evolution: an example using yeast functional genomic data.

PubMed

Oakley, Todd H; Gu, Zhenglong; Abouheif, Ehab; Patel, Nipam H; Li, Wen-Hsiung

2005-01-01

Understanding the evolution of gene function is a primary challenge of modern evolutionary biology. Despite an expanding database from genomic and developmental studies, we are lacking quantitative methods for analyzing the evolution of some important measures of gene function, such as gene-expression patterns. Here, we introduce phylogenetic comparative methods to compare different models of gene-expression evolution in a maximum-likelihood framework. We find that expression of duplicated genes has evolved according to a nonphylogenetic model, where closely related genes are no more likely than more distantly related genes to share common expression patterns. These results are consistent with previous studies that found rapid evolution of gene expression during the history of yeast. The comparative methods presented here are general enough to test a wide range of evolutionary hypotheses using genomic-scale data from any organism.
Exact time-dependent solutions for a self-regulating gene.

PubMed

Ramos, A F; Innocentini, G C P; Hornos, J E M

2011-06-01

The exact time-dependent solution for the stochastic equations governing the behavior of a binary self-regulating gene is presented. Using the generating function technique to rephrase the master equations in terms of partial differential equations, we show that the model is totally integrable and the analytical solutions are the celebrated confluent Heun functions. Self-regulation plays a major role in the control of gene expression, and it is remarkable that such a microscopic model is completely integrable in terms of well-known complex functions.

Functional Annotations of Paralogs: A Blessing and a Curse

PubMed Central

Zallot, Rémi; Harrison, Katherine J.; Kolaczkowski, Bryan; de Crécy-Lagard, Valérie

2016-01-01

Gene duplication followed by mutation is a classic mechanism of neofunctionalization, producing gene families with functional diversity. In some cases, a single point mutation is sufficient to change the substrate specificity and/or the chemistry performed by an enzyme, making it difficult to accurately separate enzymes with identical functions from homologs with different functions. Because sequence similarity is often used as a basis for assigning functional annotations to genes, non-isofunctional gene families pose a great challenge for genome annotation pipelines. Here we describe how integrating evolutionary and functional information such as genome context, phylogeny, metabolic reconstruction and signature motifs may be required to correctly annotate multifunctional families. These integrative analyses can also lead to the discovery of novel gene functions, as hints from specific subgroups can guide the functional characterization of other members of the family. We demonstrate how careful manual curation processes using comparative genomics can disambiguate subgroups within large multifunctional families and discover their functions. We present the COG0720 protein family as a case study. We also discuss strategies to automate this process to improve the accuracy of genome functional annotation pipelines. PMID:27618105
An adaptive radiation model for the origin of new genefunctions

DOE Office of Scientific and Technical Information (OSTI.GOV)

Francino, M. Pilar

2004-10-18

The evolution of new gene functions is one of the keys to evolutionary innovation. Most novel functions result from gene duplication followed by divergence. However, the models hitherto proposed to account for this process are not fully satisfactory. The classic model of neofunctionalization holds that the two paralogous gene copies resulting from a duplication are functionally redundant, such that one of them can evolve under no functional constraints and occasionally acquire a new function. This model lacks a convincing mechanism for the new gene copies to increase in frequency in the population and survive the mutational load expected to accumulatemore » under neutrality, before the acquisition of the rare beneficial mutations that would confer new functionality. The subfunctionalization model has been proposed as an alternative way to generate genes with altered functions. This model also assumes that new paralogous gene copies are functionally redundant and therefore neutral, but it predicts that relaxed selection will affect both gene copies such that some of the capabilities of the parent gene will disappear in one of the copies and be retained in the other. Thus, the functions originally present in a single gene will be partitioned between the two descendant copies. However, although this model can explain increases in gene number, it does not really address the main evolutionary question, which is the development of new biochemical capabilities. Recently, a new concept has been introduced into the gene evolution literature which is most likely to help solve this dilemma. The key point is to allow for a period of natural selection for the duplication per se, before new function evolves, rather than considering gene duplication to be neutral as in the previous models. Here, I suggest a new model that draws on the advantage of postulating selection for gene duplication, and proposes that bursts of adaptive gene amplification in response to specific selection pressures provide the raw material for the evolution of new function.« less
The Bacillus subtilis ywjI (glpX) gene encodes a class II fructose-1,6-bisphosphatase, functionally equivalent to the class III Fbp enzyme.

PubMed

Jules, Matthieu; Le Chat, Ludovic; Aymerich, Stéphane; Le Coq, Dominique

2009-05-01

We present here experimental evidence that the Bacillus subtilis ywjI gene encodes a class II fructose-1,6-bisphosphatase, functionally equivalent to the fbp-encoded class III enzyme, and constitutes with the upstream gene, murAB, an operon transcribed at the same level under glycolytic or gluconeogenic conditions.
The Bacillus subtilis ywjI (glpX) Gene Encodes a Class II Fructose-1,6-Bisphosphatase, Functionally Equivalent to the Class III Fbp Enzyme▿

PubMed Central

Jules, Matthieu; Le Chat, Ludovic; Aymerich, Stéphane; Le Coq, Dominique

2009-01-01

We present here experimental evidence that the Bacillus subtilis ywjI gene encodes a class II fructose-1,6-bisphosphatase, functionally equivalent to the fbp-encoded class III enzyme, and constitutes with the upstream gene, murAB, an operon transcribed at the same level under glycolytic or gluconeogenic conditions. PMID:19270101
Bioinformatics for spermatogenesis: annotation of male reproduction based on proteomics

PubMed Central

Zhou, Tao; Zhou, Zuo-Min; Guo, Xue-Jiang

2013-01-01

Proteomics strategies have been widely used in the field of male reproduction, both in basic and clinical research. Bioinformatics methods are indispensable in proteomics-based studies and are used for data presentation, database construction and functional annotation. In the present review, we focus on the functional annotation of gene lists obtained through qualitative or quantitative methods, summarizing the common and male reproduction specialized proteomics databases. We introduce several integrated tools used to find the hidden biological significance from the data obtained. We further describe in detail the information on male reproduction derived from Gene Ontology analyses, pathway analyses and biomedical analyses. We provide an overview of bioinformatics annotations in spermatogenesis, from gene function to biological function and from biological function to clinical application. On the basis of recently published proteomics studies and associated data, we show that bioinformatics methods help us to discover drug targets for sperm motility and to scan for cancer-testis genes. In addition, we summarize the online resources relevant to male reproduction research for the exploration of the regulation of spermatogenesis. PMID:23852026
Function-driven discovery of disease genes in zebrafish using an integrated genomics big data resource.

PubMed

Shim, Hongseok; Kim, Ji Hyun; Kim, Chan Yeong; Hwang, Sohyun; Kim, Hyojin; Yang, Sunmo; Lee, Ji Eun; Lee, Insuk

2016-11-16

Whole exome sequencing (WES) accelerates disease gene discovery using rare genetic variants, but further statistical and functional evidence is required to avoid false-discovery. To complement variant-driven disease gene discovery, here we present function-driven disease gene discovery in zebrafish (Danio rerio), a promising human disease model owing to its high anatomical and genomic similarity to humans. To facilitate zebrafish-based function-driven disease gene discovery, we developed a genome-scale co-functional network of zebrafish genes, DanioNet (www.inetbio.org/danionet), which was constructed by Bayesian integration of genomics big data. Rigorous statistical assessment confirmed the high prediction capacity of DanioNet for a wide variety of human diseases. To demonstrate the feasibility of the function-driven disease gene discovery using DanioNet, we predicted genes for ciliopathies and performed experimental validation for eight candidate genes. We also validated the existence of heterozygous rare variants in the candidate genes of individuals with ciliopathies yet not in controls derived from the UK10K consortium, suggesting that these variants are potentially involved in enhancing the risk of ciliopathies. These results showed that an integrated genomics big data for a model animal of diseases can expand our opportunity for harnessing WES data in disease gene discovery. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
A Resource of Quantitative Functional Annotation for Homo sapiens Genes.

PubMed

Taşan, Murat; Drabkin, Harold J; Beaver, John E; Chua, Hon Nian; Dunham, Julie; Tian, Weidong; Blake, Judith A; Roth, Frederick P

2012-02-01

The body of human genomic and proteomic evidence continues to grow at ever-increasing rates, while annotation efforts struggle to keep pace. A surprisingly small fraction of human genes have clear, documented associations with specific functions, and new functions continue to be found for characterized genes. Here we assembled an integrated collection of diverse genomic and proteomic data for 21,341 human genes and make quantitative associations of each to 4333 Gene Ontology terms. We combined guilt-by-profiling and guilt-by-association approaches to exploit features unique to the data types. Performance was evaluated by cross-validation, prospective validation, and by manual evaluation with the biological literature. Functional-linkage networks were also constructed, and their utility was demonstrated by identifying candidate genes related to a glioma FLN using a seed network from genome-wide association studies. Our annotations are presented-alongside existing validated annotations-in a publicly accessible and searchable web interface.
Captured metagenomics: large-scale targeting of genes based on ‘sequence capture’ reveals functional diversity in soils

PubMed Central

Manoharan, Lokeshwaran; Kushwaha, Sandeep K.; Hedlund, Katarina; Ahrén, Dag

2015-01-01

Microbial enzyme diversity is a key to understand many ecosystem processes. Whole metagenome sequencing (WMG) obtains information on functional genes, but it is costly and inefficient due to large amount of sequencing that is required. In this study, we have applied a captured metagenomics technique for functional genes in soil microorganisms, as an alternative to WMG. Large-scale targeting of functional genes, coding for enzymes related to organic matter degradation, was applied to two agricultural soil communities through captured metagenomics. Captured metagenomics uses custom-designed, hybridization-based oligonucleotide probes that enrich functional genes of interest in metagenomic libraries where only probe-bound DNA fragments are sequenced. The captured metagenomes were highly enriched with targeted genes while maintaining their target diversity and their taxonomic distribution correlated well with the traditional ribosomal sequencing. The captured metagenomes were highly enriched with genes related to organic matter degradation; at least five times more than similar, publicly available soil WMG projects. This target enrichment technique also preserves the functional representation of the soils, thereby facilitating comparative metagenomics projects. Here, we present the first study that applies the captured metagenomics approach in large scale, and this novel method allows deep investigations of central ecosystem processes by studying functional gene abundances. PMID:26490729
Conifer reproductive development involves B-type MADS-box genes with distinct and different activities in male organ primordia.

PubMed

Sundström, Jens; Engström, Peter

2002-07-01

The Norway spruce MADS-box genes DAL11, DAL12 and DAL13 are phylogenetically related to the angiosperm B-function MADS-box genes: genes that act together with A-function genes in specifying petal identity and with C-function genes in specifying stamen identity to floral organs. In this report we present evidence to suggest that the B-gene function in the specification of identity of the pollen-bearing organs has been conserved between conifers and angiosperms. Expression of DAL11 or DAL12 in transgenic Arabidopsis causes phenotypic changes which partly resemble those caused by ectopic expression of the endogenous B-genes. In similar experiments, flowers of Arabidopsis plants expressing DAL13 showed a different homeotic change in that they formed ectopic anthers in whorls one, two or four. We also demonstrate the capacity of the spruce gene products to form homodimers, and that DAL11 and DAL13 may form heterodimers with each other and with the Arabidopsis B-protein AP3, but not with PI, the second B-gene product in Arabidopsis. In situ hybridization experiments show that the conifer B-like genes are expressed specifically in developing pollen cones, but differ in both temporal and spatial distribution patterns. These results suggest that the B-function in conifers is dual and is separated into a meristem identity and an organ identity function, the latter function possibly being independent of an interaction with the C-function. Thus, even though an ancestral B-function may have acted in combination with C to specify micro- and megasporangia, the B-function has evolved differently in conifers and angiosperms.
Identifying arsenic trioxide (ATO) functions in leukemia cells by using time series gene expression profiles.

PubMed

Yang, Hong; Lin, Shan; Cui, Jingru

2014-02-10

Arsenic trioxide (ATO) is presently the most active single agent in the treatment of acute promyelocytic leukemia (APL). In order to explore the molecular mechanism of ATO in leukemia cells with time series, we adopted bioinformatics strategy to analyze expression changing patterns and changes in transcription regulation modules of time series genes filtered from Gene Expression Omnibus database (GSE24946). We totally screened out 1847 time series genes for subsequent analysis. The KEGG (Kyoto encyclopedia of genes and genomes) pathways enrichment analysis of these genes showed that oxidative phosphorylation and ribosome were the top 2 significantly enriched pathways. STEM software was employed to compare changing patterns of gene expression with assigned 50 expression patterns. We screened out 7 significantly enriched patterns and 4 tendency charts of time series genes. The result of Gene Ontology showed that functions of times series genes mainly distributed in profiles 41, 40, 39 and 38. Seven genes with positive regulation of cell adhesion function were enriched in profile 40, and presented the same first increased model then decreased model as profile 40. The transcription module analysis showed that they mainly involved in oxidative phosphorylation pathway and ribosome pathway. Overall, our data summarized the gene expression changes in ATO treated K562-r cell lines with time and suggested that time series genes mainly regulated cell adhesive. Furthermore, our result may provide theoretical basis of molecular biology in treating acute promyelocytic leukemia. Copyright © 2013 Elsevier B.V. All rights reserved.
Identification and expression profiling analysis of TCP family genes involved in growth and development in maize.

PubMed

Chai, Wenbo; Jiang, Pengfei; Huang, Guoyu; Jiang, Haiyang; Li, Xiaoyu

2017-10-01

The TCP family is a group of plant-specific transcription factors. TCP genes encode proteins harboring bHLH structure, which is implicated in DNA binding and protein-protein interactions and known as the TCP domain. TCP genes play important roles in plant development and have been evolutionarily and functionally elaborated in various plants, however, no overall phylogenetic analysis or expression profiling of TCP genes in Zea mays has been reported. In the present study, a systematic analysis of molecular evolution and functional prediction of TCP family genes in maize ( Z . mays L.) has been conducted. We performed a genome-wide survey of TCP genes in maize, revealing the gene structure, chromosomal location and phylogenetic relationship of family members. Microsynteny between grass species and tissue-specific expression profiles were also investigated. In total, 29 TCP genes were identified in the maize genome, unevenly distributed on the 10 maize chromosomes. Additionally, ZmTCP genes were categorized into nine classes based on phylogeny and purifying selection may largely be responsible for maintaining the functions of maize TCP genes. What's more, microsynteny analysis suggested that TCP genes have been conserved during evolution. Finally, expression analysis revealed that most TCP genes are expressed in the stem and ear, which suggests that ZmTCP genes influence stem and ear growth. This result is consistent with the previous finding that maize TCP genes represses the growth of axillary organs and enables the formation of female inflorescences. Altogether, this study presents a thorough overview of TCP family in maize and provides a new perspective on the evolution of this gene family. The results also indicate that TCP family genes may be involved in development stage in plant growing conditions. Additionally, our results will be useful for further functional analysis of the TCP gene family in maize.
Analysis of the Prefoldin Gene Family in 14 Plant Species

PubMed Central

Cao, Jun

2016-01-01

Prefoldin is a hexameric molecular chaperone complex present in all eukaryotes and archaea. The evolution of this gene family in plants is unknown. Here, I identified 140 prefoldin genes in 14 plant species. These prefoldin proteins were divided into nine groups through phylogenetic analysis. Highly conserved gene organization and motif distribution exist in each prefoldin group, implying their functional conservation. I also observed the segmental duplication of maize prefoldin gene family. Moreover, a few functional divergence sites were identified within each group pairs. Functional network analyses identified 78 co-expressed genes, and most of them were involved in carrying, binding and kinase activity. Divergent expression profiles of the maize prefoldin genes were further investigated in different tissues and development periods and under auxin and some abiotic stresses. I also found a few cis-elements responding to abiotic stress and phytohormone in the upstream sequences of the maize prefoldin genes. The results provided a foundation for exploring the characterization of the prefoldin genes in plants and will offer insights for additional functional studies. PMID:27014333
Protoplast isolation, transient transformation of leaf mesophyll protoplasts and improved Agrobacterium-mediated leaf disc infiltration of Phaseolus vulgaris: tools for rapid gene expression analysis.

PubMed

Nanjareddy, Kalpana; Arthikala, Manoj-Kumar; Blanco, Lourdes; Arellano, Elizabeth S; Lara, Miguel

2016-06-24

Phaseolus vulgaris is one of the most extensively studied model legumes in the world. The P. vulgaris genome sequence is available; therefore, the need for an efficient and rapid transformation system is more imperative than ever. The functional characterization of P. vulgaris genes is impeded chiefly due to the non-amenable nature of Phaseolus sp. to stable genetic transformation. Transient transformation systems are convenient and versatile alternatives for rapid gene functional characterization studies. Hence, the present work focuses on standardizing methodologies for protoplast isolation from multiple tissues and transient transformation protocols for rapid gene expression analysis in the recalcitrant grain legume P. vulgaris. Herein, we provide methodologies for the high-throughput isolation of leaf mesophyll-, flower petal-, hypocotyl-, root- and nodule-derived protoplasts from P. vulgaris. The highly efficient polyethylene glycol-mannitol magnesium (PEG-MMG)-mediated transformation of leaf mesophyll protoplasts was optimized using a GUS reporter gene. We used the P. vulgaris SNF1-related protein kinase 1 (PvSnRK1) gene as proof of concept to demonstrate rapid gene functional analysis. An RT-qPCR analysis of protoplasts that had been transformed with PvSnRK1-RNAi and PvSnRK1-OE vectors showed the significant downregulation and ectopic constitutive expression (overexpression), respectively, of the PvSnRK1 transcript. We also demonstrated an improved transient transformation approach, sonication-assisted Agrobacterium-mediated transformation (SAAT), for the leaf disc infiltration of P. vulgaris. Interestingly, this method resulted in a 90 % transformation efficiency and transformed 60-85 % of the cells in a given area of the leaf surface. The constitutive expression of YFP further confirmed the amenability of the system to gene functional characterization studies. We present simple and efficient methodologies for protoplast isolation from multiple P. vulgaris tissues. We also provide a high-efficiency and amenable method for leaf mesophyll transformation for rapid gene functional characterization studies. Furthermore, a modified SAAT leaf disc infiltration approach aids in validating genes and their functions. Together, these methods help to rapidly unravel novel gene functions and are promising tools for P. vulgaris research.
Assessing duplication and loss of APETALA1/FRUITFULL homologs in Ranunculales

PubMed Central

Pabón-Mora, Natalia; Hidalgo, Oriane; Gleissberg, Stefan; Litt, Amy

2013-01-01

Gene duplication and loss provide raw material for evolutionary change within organismal lineages as functional diversification of gene copies provide a mechanism for phenotypic variation. Here we focus on the APETALA1/FRUITFULL MADS-box gene lineage evolution. AP1/FUL genes are angiosperm-specific and have undergone several duplications. By far the most significant one is the core-eudicot duplication resulting in the euAP1 and euFUL clades. Functional characterization of several euAP1 and euFUL genes has shown that both function in proper floral meristem identity, and axillary meristem repression. Independently, euAP1 genes function in floral meristem and sepal identity, whereas euFUL genes control phase transition, cauline leaf growth, compound leaf morphogenesis and fruit development. Significant functional variation has been detected in the function of pre-duplication basal-eudicot FUL-like genes, but the underlying mechanisms for change have not been identified. FUL-like genes in the Papaveraceae encode all functions reported for euAP1 and euFUL genes, whereas FUL-like genes in Aquilegia (Ranunculaceae) function in inflorescence development and leaf complexity, but not in flower or fruit development. Here we isolated FUL-like genes across the Ranunculales and used phylogenetic approaches to analyze their evolutionary history. We identified an early duplication resulting in the RanFL1 and RanFL2 clades. RanFL1 genes were present in all the families sampled and are mostly under strong negative selection in the MADS, I and K domains. RanFL2 genes were only identified from Eupteleaceae, Papaveraceae s.l., Menispermaceae and Ranunculaceae and show relaxed purifying selection at the I and K domains. We discuss how asymmetric sequence diversification, new motifs, differences in codon substitutions and likely protein-protein interactions resulting from this Ranunculiid-specific duplication can help explain the functional differences among basal-eudicot FUL-like genes. PMID:24062757
Functional characterization of the late embryogenesis abundant (LEA) protein gene family from Pinus tabuliformis (Pinaceae) in Escherichia coli.

PubMed

Gao, Jie; Lan, Ting

2016-01-19

Late embryogenesis abundant (LEA) proteins are a large and highly diverse gene family present in a wide range of plant species. LEAs are proposed to play a role in various stress tolerance responses. Our study represents the first-ever survey of LEA proteins and their encoding genes in a widely distributed pine (Pinus tabuliformis) in China. Twenty-three LEA genes were identified from the P. tabuliformis belonging to seven groups. Proteins with repeated motifs are an important feature specific to LEA groups. Ten of 23 pine LEA genes were selectively expressed in specific tissues, and showed expression divergence within each group. In addition, we selected 13 genes representing each group and introduced theses genes into Escherichia coli to assess the protective function of PtaLEA under heat and salt stresses. Compared with control cells, the E. coli cells expressing PtaLEA fusion protein exhibited enhanced salt and heat resistance and viability, indicating the protein may play a protective role in cells under stress conditions. Furthermore, among these enhanced tolerance genes, a certain extent of function divergence appeared within a gene group as well as between gene groups, suggesting potential functional diversity of this gene family in conifers.
Simulating evolution of protein complexes through gene duplication and co-option.

PubMed

Haarsma, Loren; Nelesen, Serita; VanAndel, Ethan; Lamine, James; VandeHaar, Peter

2016-06-21

We present a model of the evolution of protein complexes with novel functions through gene duplication, mutation, and co-option. Under a wide variety of input parameters, digital organisms evolve complexes of 2-5 bound proteins which have novel functions but whose component proteins are not independently functional. Evolution of complexes with novel functions happens more quickly as gene duplication rates increase, point mutation rates increase, protein complex functional probability increases, protein complex functional strength increases, and protein family size decreases. Evolution of complexity is inhibited when the metabolic costs of making proteins exceeds the fitness gain of having functional proteins, or when point mutation rates get so large the functional proteins undergo deleterious mutations faster than new functional complexes can evolve. Copyright © 2016 Elsevier Ltd. All rights reserved.
Revealing the Strong Functional Association of adipor2 and cdh13 with adipoq: A Gene Network Study.

PubMed

Bag, Susmita; Anbarasu, Anand

2015-04-01

In the present study, we have analyzed functional gene interactions of adiponectin gene (adipoq). The key role of adipoq is in regulating energy homeostasis and it functions as a novel signaling molecule for adipose tissue. Modules of highly inter-connected genes in disease-specific adipoq network are derived by integrating gene function and protein interaction data. Among twenty genes in adipoq web, adipoq is effectively conjoined with two genes: Adiponectin receptor 2 (adipor2) and cadherin 13 (cdh13). The functional analysis is done via ontological briefing and candidate disease identification. We observed that the highly efficient-interlinked genes connected with adipoq are adipor2 and cdh13. Interestingly, the ontological aspect of adipor2 and cdh13 in the adipoq network reveal the fact that adipoq and adipor2 are involved mostly in glucose and lipid metabolic processes. The gene cdh13 indulge in cell adhesion process with adipoq and adipor2. Our computational gene web analysis also predicts potential candidate disease recognition, thus indicating the involvement of adipoq, adipor2, and cdh13 with not only with obesity but also with breast cancer, leukemia, renal cancer, lung cancer, and cervical cancer. The current study provides researchers a comprehensible layout of adipoq network, its functional strategies and candidate disease approach associated with adipoq network.
Prevalence of duodenal ulcer-promoting gene (dupA) of Helicobacter pylori in patients with duodenal ulcer in North Indian population.

PubMed

Arachchi, H S Jayasinghe; Kalra, Vijay; Lal, Banwari; Bhatia, Vikram; Baba, C S; Chakravarthy, S; Rohatgi, S; Sarma, Priyangshu M; Mishra, V; Das, Bimal; Ahuja, Vineet

2007-12-01

The duodenal ulcer (DU)-promoting gene (dupA) of Helicobacter pylori has been identified as a novel virulent marker associated with an increased risk for DU. The presence or absence of dupA gene of H. pylori present in patients with DU and functional dyspepsia in North Indian population was studied by polymerase chain reaction (PCR) and hybridization analysis. One hundred and sixty-six patients (96 DU and 70 functional dyspepsia) were included in this study. In addition, sequence diversity of dupA gene of H. pylori found in these patients was analyzed by sequencing the PCR products jhp0917 and jhp0918 on both strands with appropriate primers. PCR and hybridization analyses indicated that dupA gene was present in 37.5% (36/96) of H. pylori strains isolated from DU patients and 22.86% (16/70) of functional dyspepsia patients (p < or = .05). Of these, 35 patients with DU (97.2%) and 14 patients with functional dyspepsia (81.25%) were infected by H. pylori positive for cagA genotype. Furthermore, the presence of dupA was significantly associated with the cagA-positive genotype (p < or = .02). Results of our study have shown that significant association of dupA gene with DU in this population. The dupA gene can be considered as a novel virulent marker for DU in this population.
Expression atlas and comparative coexpression network analyses reveal important genes involved in the formation of lignified cell wall in Brachypodium distachyon.

PubMed

Sibout, Richard; Proost, Sebastian; Hansen, Bjoern Oest; Vaid, Neha; Giorgi, Federico M; Ho-Yue-Kuang, Severine; Legée, Frédéric; Cézart, Laurent; Bouchabké-Coussa, Oumaya; Soulhat, Camille; Provart, Nicholas; Pasha, Asher; Le Bris, Philippe; Roujol, David; Hofte, Herman; Jamet, Elisabeth; Lapierre, Catherine; Persson, Staffan; Mutwil, Marek

2017-08-01

While Brachypodium distachyon (Brachypodium) is an emerging model for grasses, no expression atlas or gene coexpression network is available. Such tools are of high importance to provide insights into the function of Brachypodium genes. We present a detailed Brachypodium expression atlas, capturing gene expression in its major organs at different developmental stages. The data were integrated into a large-scale coexpression database ( www.gene2function.de), enabling identification of duplicated pathways and conserved processes across 10 plant species, thus allowing genome-wide inference of gene function. We highlight the importance of the atlas and the platform through the identification of duplicated cell wall modules, and show that a lignin biosynthesis module is conserved across angiosperms. We identified and functionally characterised a putative ferulate 5-hydroxylase gene through overexpression of it in Brachypodium, which resulted in an increase in lignin syringyl units and reduced lignin content of mature stems, and led to improved saccharification of the stem biomass. Our Brachypodium expression atlas thus provides a powerful resource to reveal functionally related genes, which may advance our understanding of important biological processes in grasses. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Combining classifiers to predict gene function in Arabidopsis thaliana using large-scale gene expression measurements.

PubMed

Lan, Hui; Carson, Rachel; Provart, Nicholas J; Bonner, Anthony J

2007-09-21

Arabidopsis thaliana is the model species of current plant genomic research with a genome size of 125 Mb and approximately 28,000 genes. The function of half of these genes is currently unknown. The purpose of this study is to infer gene function in Arabidopsis using machine-learning algorithms applied to large-scale gene expression data sets, with the goal of identifying genes that are potentially involved in plant response to abiotic stress. Using in house and publicly available data, we assembled a large set of gene expression measurements for A. thaliana. Using those genes of known function, we first evaluated and compared the ability of basic machine-learning algorithms to predict which genes respond to stress. Predictive accuracy was measured using ROC50 and precision curves derived through cross validation. To improve accuracy, we developed a method for combining these classifiers using a weighted-voting scheme. The combined classifier was then trained on genes of known function and applied to genes of unknown function, identifying genes that potentially respond to stress. Visual evidence corroborating the predictions was obtained using electronic Northern analysis. Three of the predicted genes were chosen for biological validation. Gene knockout experiments confirmed that all three are involved in a variety of stress responses. The biological analysis of one of these genes (At1g16850) is presented here, where it is shown to be necessary for the normal response to temperature and NaCl. Supervised learning methods applied to large-scale gene expression measurements can be used to predict gene function. However, the ability of basic learning methods to predict stress response varies widely and depends heavily on how much dimensionality reduction is used. Our method of combining classifiers can improve the accuracy of such predictions - in this case, predictions of genes involved in stress response in plants - and it effectively chooses the appropriate amount of dimensionality reduction automatically. The method provides a useful means of identifying genes in A. thaliana that potentially respond to stress, and we expect it would be useful in other organisms and for other gene functions.

Genes Important for Schizosaccharomyces pombe Meiosis Identified Through a Functional Genomics Screen

PubMed Central

Blyth, Julie; Makrantoni, Vasso; Barton, Rachael E.; Spanos, Christos; Rappsilber, Juri; Marston, Adele L.

2018-01-01

Meiosis is a specialized cell division that generates gametes, such as eggs and sperm. Errors in meiosis result in miscarriages and are the leading cause of birth defects; however, the molecular origins of these defects remain unknown. Studies in model organisms are beginning to identify the genes and pathways important for meiosis, but the parts list is still poorly defined. Here we present a comprehensive catalog of genes important for meiosis in the fission yeast, Schizosaccharomyces pombe. Our genome-wide functional screen surveyed all nonessential genes for roles in chromosome segregation and spore formation. Novel genes important at distinct stages of the meiotic chromosome segregation and differentiation program were identified. Preliminary characterization implicated three of these genes in centrosome/spindle pole body, centromere, and cohesion function. Our findings represent a near-complete parts list of genes important for meiosis in fission yeast, providing a valuable resource to advance our molecular understanding of meiosis. PMID:29259000
Basic Helix-Loop-Helix Transcription Factor Gene Family Phylogenetics and Nomenclature

PubMed Central

Skinner, Michael K.; Rawls, Alan; Wilson-Rawls, Jeanne; Roalson, Eric H.

2010-01-01

A phylogenetic analysis of the basic helix-loop-helix (bHLH) gene superfamily was performed using seven different species (human, mouse, rat, worm, fly, yeast, and plant Arabidopsis) and involving over 600 bHLH genes [1]. All bHLH genes were identified in the genomes of the various species, including expressed sequence tags, and the entire coding sequence was used in the analysis. Nearly 15% of the gene family has been updated or added since the original publication. A super-tree involving six clades and all structural relationships was established and is now presented for four of the species. The wealth of functional data available for members of the bHLH gene superfamily provides us with the opportunity to use this exhaustive phylogenetic tree to predict potential functions of uncharacterized members of the family. This phylogenetic and genomic analysis of the bHLH gene family has revealed unique elements of the evolution and functional relationships of the different genes in the bHLH gene family. PMID:20219281
Semantics based approach for analyzing disease-target associations.

PubMed

Kaalia, Rama; Ghosh, Indira

2016-08-01

A complex disease is caused by heterogeneous biological interactions between genes and their products along with the influence of environmental factors. There have been many attempts for understanding the cause of these diseases using experimental, statistical and computational methods. In the present work the objective is to address the challenge of representation and integration of information from heterogeneous biomedical aspects of a complex disease using semantics based approach. Semantic web technology is used to design Disease Association Ontology (DAO-db) for representation and integration of disease associated information with diabetes as the case study. The functional associations of disease genes are integrated using RDF graphs of DAO-db. Three semantic web based scoring algorithms (PageRank, HITS (Hyperlink Induced Topic Search) and HITS with semantic weights) are used to score the gene nodes on the basis of their functional interactions in the graph. Disease Association Ontology for Diabetes (DAO-db) provides a standard ontology-driven platform for describing genes, proteins, pathways involved in diabetes and for integrating functional associations from various interaction levels (gene-disease, gene-pathway, gene-function, gene-cellular component and protein-protein interactions). An automatic instance loader module is also developed in present work that helps in adding instances to DAO-db on a large scale. Our ontology provides a framework for querying and analyzing the disease associated information in the form of RDF graphs. The above developed methodology is used to predict novel potential targets involved in diabetes disease from the long list of loose (statistically associated) gene-disease associations. Copyright © 2016 Elsevier Inc. All rights reserved.
Bridging the knowledge gap: from microbiome composition to function.

PubMed

Faith, Jeremiah J

2015-03-01

Despite the wealth of metagenomic sequencing data, the functions of most bacterial genes from the mammalian microbiota have remained poorly understood. In their recent study (Yaung et al 2015), Wang, Gerber, and colleagues present a platform which allows functional mining of bacterial genomes for genes that contribute to fitness in vivo and holds great potential for forward engineering microbes with enhanced colonization abilities in the microbiota.
Genome-wide analysis and expression profiling suggest diverse roles of GH3 genes during development and abiotic stress responses in legumes

PubMed Central

Singh, Vikash K.; Jain, Mukesh; Garg, Rohini

2014-01-01

Growth hormone auxin regulates various cellular processes by altering the expression of diverse genes in plants. Among various auxin-responsive genes, GH3 genes maintain endogenous auxin homeostasis by conjugating excess of auxin with amino acids. GH3 genes have been characterized in many plant species, but not in legumes. In the present work, we identified members of GH3 gene family and analyzed their chromosomal distribution, gene structure, gene duplication and phylogenetic analysis in different legumes, including chickpea, soybean, Medicago, and Lotus. A comprehensive expression analysis in different vegetative and reproductive tissues/stages revealed that many of GH3 genes were expressed in a tissue-specific manner. Notably, chickpea CaGH3-3, soybean GmGH3-8 and -25, and Lotus LjGH3-4, -5, -9 and -18 genes were up-regulated in root, indicating their putative role in root development. In addition, chickpea CaGH3-1 and -7, and Medicago MtGH3-7, -8, and -9 were found to be highly induced under drought and/or salt stresses, suggesting their role in abiotic stress responses. We also observed the examples of differential expression pattern of duplicated GH3 genes in soybean, indicating their functional diversification. Furthermore, analyses of three-dimensional structures, active site residues and ligand preferences provided molecular insights into function of GH3 genes in legumes. The analysis presented here would help in investigation of precise function of GH3 genes in legumes during development and stress conditions. PMID:25642236
Identification and Characterization of Genes That Interact with Lin-12 in Caenorhabditis Elegans

PubMed Central

Tax, F. E.; Thomas, J. H.; Ferguson, E. L.; Horvitz, H. R.

1997-01-01

We identified and characterized 14 extragenic mutations that suppressed the dominant egg-laying defect of certain lin-12 gain-of-function mutations. These suppressors defined seven genes: sup-17, lag-2, sel-4, sel-5, sel-6, sel-7 and sel-8. Mutations in six of the genes are recessive suppressors, whereas the two mutations that define the seventh gene, lag-2, are semi-dominant suppressors. These suppressor mutations were able to suppress other lin-12 gain-of-function mutations. The suppressor mutations arose at a very low frequency per gene, 10-50 times below the typical loss-of-function mutation frequency. The suppressor mutations in sup-17 and lag-2 were shown to be rare non-null alleles, and we present evidence that null mutations in these two genes cause lethality. Temperature-shift studies for two suppressor genes, sup-17 and lag-2, suggest that both genes act at approximately the same time as lin-12 in specifying a cell fate. Suppressor alleles of six of these genes enhanced a temperature-sensitive loss-of-function allele of glp-1, a gene related to lin-12 in structure and function. Our analysis of these suppressors suggests that the majority of these genes are part of a shared lin-12/glp-1 signal transduction pathway, or act to regulate the expression or stability of lin-12 and glp-1. PMID:9409830
EWS and FUS bind a subset of transcribed genes encoding proteins enriched in RNA regulatory functions.

PubMed

Luo, Yonglun; Blechingberg, Jenny; Fernandes, Ana Miguel; Li, Shengting; Fryland, Tue; Børglum, Anders D; Bolund, Lars; Nielsen, Anders Lade

2015-11-14

FUS (TLS) and EWS (EWSR1) belong to the FET-protein family of RNA and DNA binding proteins. FUS and EWS are structurally and functionally related and participate in transcriptional regulation and RNA processing. FUS and EWS are identified in translocation generated cancer fusion proteins and involved in the human neurological diseases amyotrophic lateral sclerosis and fronto-temporal lobar degeneration. To determine the gene regulatory functions of FUS and EWS at the level of chromatin, we have performed chromatin immunoprecipitation followed by next generation sequencing (ChIP-seq). Our results show that FUS and EWS bind to a subset of actively transcribed genes, that binding often is downstream the poly(A)-signal, and that binding overlaps with RNA polymerase II. Functional examinations of selected target genes identified that FUS and EWS can regulate gene expression at different levels. Gene Ontology analyses showed that FUS and EWS target genes preferentially encode proteins involved in regulatory processes at the RNA level. The presented results yield new insights into gene interactions of EWS and FUS and have identified a set of FUS and EWS target genes involved in pathways at the RNA regulatory level with potential to mediate normal and disease-associated functions of the FUS and EWS proteins.
InteGO2: A web tool for measuring and visualizing gene semantic similarities using Gene Ontology

DOE PAGES

Peng, Jiajie; Li, Hongxiang; Liu, Yongzhuang; ...

2016-08-31

Here, the Gene Ontology (GO) has been used in high-throughput omics research as a major bioinformatics resource. The hierarchical structure of GO provides users a convenient platform for biological information abstraction and hypothesis testing. Computational methods have been developed to identify functionally similar genes. However, none of the existing measurements take into account all the rich information in GO. Similarly, using these existing methods, web-based applications have been constructed to compute gene functional similarities, and to provide pure text-based outputs. Without a graphical visualization interface, it is difficult for result interpretation. As a result, we present InteGO2, a web toolmore » that allows researchers to calculate the GO-based gene semantic similarities using seven widely used GO-based similarity measurements. Also, we provide an integrative measurement that synergistically integrates all the individual measurements to improve the overall performance. Using HTML5 and cytoscape.js, we provide a graphical interface in InteGO2 to visualize the resulting gene functional association networks. In conclusion, InteGO2 is an easy-to-use HTML5 based web tool. With it, researchers can measure gene or gene product functional similarity conveniently, and visualize the network of functional interactions in a graphical interface.« less
InteGO2: a web tool for measuring and visualizing gene semantic similarities using Gene Ontology.

PubMed

Peng, Jiajie; Li, Hongxiang; Liu, Yongzhuang; Juan, Liran; Jiang, Qinghua; Wang, Yadong; Chen, Jin

2016-08-31

The Gene Ontology (GO) has been used in high-throughput omics research as a major bioinformatics resource. The hierarchical structure of GO provides users a convenient platform for biological information abstraction and hypothesis testing. Computational methods have been developed to identify functionally similar genes. However, none of the existing measurements take into account all the rich information in GO. Similarly, using these existing methods, web-based applications have been constructed to compute gene functional similarities, and to provide pure text-based outputs. Without a graphical visualization interface, it is difficult for result interpretation. We present InteGO2, a web tool that allows researchers to calculate the GO-based gene semantic similarities using seven widely used GO-based similarity measurements. Also, we provide an integrative measurement that synergistically integrates all the individual measurements to improve the overall performance. Using HTML5 and cytoscape.js, we provide a graphical interface in InteGO2 to visualize the resulting gene functional association networks. InteGO2 is an easy-to-use HTML5 based web tool. With it, researchers can measure gene or gene product functional similarity conveniently, and visualize the network of functional interactions in a graphical interface. InteGO2 can be accessed via http://mlg.hit.edu.cn:8089/ .
InteGO2: A web tool for measuring and visualizing gene semantic similarities using Gene Ontology

DOE Office of Scientific and Technical Information (OSTI.GOV)

Peng, Jiajie; Li, Hongxiang; Liu, Yongzhuang

Here, the Gene Ontology (GO) has been used in high-throughput omics research as a major bioinformatics resource. The hierarchical structure of GO provides users a convenient platform for biological information abstraction and hypothesis testing. Computational methods have been developed to identify functionally similar genes. However, none of the existing measurements take into account all the rich information in GO. Similarly, using these existing methods, web-based applications have been constructed to compute gene functional similarities, and to provide pure text-based outputs. Without a graphical visualization interface, it is difficult for result interpretation. As a result, we present InteGO2, a web toolmore » that allows researchers to calculate the GO-based gene semantic similarities using seven widely used GO-based similarity measurements. Also, we provide an integrative measurement that synergistically integrates all the individual measurements to improve the overall performance. Using HTML5 and cytoscape.js, we provide a graphical interface in InteGO2 to visualize the resulting gene functional association networks. In conclusion, InteGO2 is an easy-to-use HTML5 based web tool. With it, researchers can measure gene or gene product functional similarity conveniently, and visualize the network of functional interactions in a graphical interface.« less
Modularity and evolutionary constraints in a baculovirus gene regulatory network

PubMed Central

2013-01-01

Background The structure of regulatory networks remains an open question in our understanding of complex biological systems. Interactions during complete viral life cycles present unique opportunities to understand how host-parasite network take shape and behave. The Anticarsia gemmatalis multiple nucleopolyhedrovirus (AgMNPV) is a large double-stranded DNA virus, whose genome may encode for 152 open reading frames (ORFs). Here we present the analysis of the ordered cascade of the AgMNPV gene expression. Results We observed an earlier onset of the expression than previously reported for other baculoviruses, especially for genes involved in DNA replication. Most ORFs were expressed at higher levels in a more permissive host cell line. Genes with more than one copy in the genome had distinct expression profiles, which could indicate the acquisition of new functionalities. The transcription gene regulatory network (GRN) for 149 ORFs had a modular topology comprising five communities of highly interconnected nodes that separated key genes that are functionally related on different communities, possibly maximizing redundancy and GRN robustness by compartmentalization of important functions. Core conserved functions showed expression synchronicity, distinct GRN features and significantly less genetic diversity, consistent with evolutionary constraints imposed in key elements of biological systems. This reduced genetic diversity also had a positive correlation with the importance of the gene in our estimated GRN, supporting a relationship between phylogenetic data of baculovirus genes and network features inferred from expression data. We also observed that gene arrangement in overlapping transcripts was conserved among related baculoviruses, suggesting a principle of genome organization. Conclusions Albeit with a reduced number of nodes (149), the AgMNPV GRN had a topology and key characteristics similar to those observed in complex cellular organisms, which indicates that modularity may be a general feature of biological gene regulatory networks. PMID:24006890
nfi-1 affects behavior and life-span in C. elegans but is not essential for DNA replication or survival

PubMed Central

Lazakovitch, Elena; Kalb, John M; Matsumoto, Reiko; Hirono, Keiko; Kohara, Yuji; Gronostajski, Richard M

2005-01-01

Background The Nuclear Factor I (one) (NFI) family of transcription/replication factors plays essential roles in mammalian gene expression and development and in adenovirus DNA replication. Because of its role in viral DNA replication NFI has long been suspected to function in host DNA synthesis. Determining the requirement for NFI proteins in mammalian DNA replication is complicated by the presence of 4 NFI genes in mice and humans. Loss of individual NFI genes in mice cause defects in brain, lung and tooth development, but the presence of 4 homologous NFI genes raises the issue of redundant roles for NFI genes in DNA replication. No NFI genes are present in bacteria, fungi or plants. However single NFI genes are present in several simple animals including Drosophila and C. elegans, making it possible to test for a requirement for NFI in multicellular eukaryotic DNA replication and development. Here we assess the functions of the single nfi-1 gene in C. elegans. Results C. elegans NFI protein (CeNFI) binds specifically to the same NFI-binding site recognized by vertebrate NFIs. nfi-1 encodes alternatively-spliced, maternally-inherited transcripts that are expressed at the single cell stage, during embryogenesis, and in adult muscles, neurons and gut cells. Worms lacking nfi-1 survive but have defects in movement, pharyngeal pumping and egg-laying and have a reduced life-span. Expression of the muscle gene Ce titin is decreased in nfi-1 mutant worms. Conclusion NFI gene function is not needed for survival in C. elegans and thus NFI is likely not essential for DNA replication in multi-cellular eukaryotes. The multiple defects in motility, egg-laying, pharyngeal pumping, and reduced lifespan indicate that NFI is important for these processes. Reduction in Ce titin expression could affect muscle function in multiple tissues. The phenotype of nfi-1 null worms indicates that NFI functions in multiple developmental and behavioral systems in C. elegans, likely regulating genes that function in motility, egg-laying, pharyngeal pumping and lifespan maintenance. PMID:16242019
Meta genome-wide network from functional linkages of genes in human gut microbial ecosystems.

PubMed

Ji, Yan; Shi, Yixiang; Wang, Chuan; Dai, Jianliang; Li, Yixue

2013-03-01

The human gut microbial ecosystem (HGME) exerts an important influence on the human health. In recent researches, meta-genomics provided deep insights into the HGME in terms of gene contents, metabolic processes and genome constitutions of meta-genome. Here we present a novel methodology to investigate the HGME on the basis of a set of functionally coupled genes regardless of their genome origins when considering the co-evolution properties of genes. By analyzing these coupled genes, we showed some basic properties of HGME significantly associated with each other, and further constructed a protein interaction map of human gut meta-genome to discover some functional modules that may relate with essential metabolic processes. Compared with other studies, our method provides a new idea to extract basic function elements from meta-genome systems and investigate complex microbial environment by associating its biological traits with co-evolutionary fingerprints encoded in it.
The Association between Infants' Self-Regulatory Behavior and MAOA Gene Polymorphism

ERIC Educational Resources Information Center

Zhang, Minghao; Chen, Xinyin; Way, Niobe; Yoshikawa, Hirokazu; Deng, Huihua; Ke, Xiaoyan; Yu, Weiwei; Chen, Ping; He, Chuan; Chi, Xia; Lu, Zuhong

2011-01-01

Self-regulatory behavior in early childhood is an important characteristic that has considerable implications for the development of adaptive and maladaptive functioning. The present study investigated the relations between a functional polymorphism in the upstream region of monoamine oxidase A gene (MAOA) and self-regulatory behavior in a sample…
Lineage-specific expansion of IFIT gene family: an insight into coevolution with IFN gene family.

PubMed

Liu, Ying; Zhang, Yi-Bing; Liu, Ting-Kai; Gui, Jian-Fang

2013-01-01

In mammals, IFIT (Interferon [IFN]-induced proteins with Tetratricopeptide Repeat [TPR] motifs) family genes are involved in many cellular and viral processes, which are tightly related to mammalian IFN response. However, little is known about non-mammalian IFIT genes. In the present study, IFIT genes are identified in the genome databases from the jawed vertebrates including the cartilaginous elephant shark but not from non-vertebrates such as lancelet, sea squirt and acorn worm, suggesting that IFIT gene family originates from a vertebrate ancestor about 450 million years ago. IFIT family genes show conserved gene structure and gene arrangements. Phylogenetic analyses reveal that this gene family has expanded through lineage-specific and species-specific gene duplication. Interestingly, IFN gene family seem to share a common ancestor and a similar evolutionary mechanism; the function link of IFIT genes to IFN response is present early since the origin of both gene families, as evidenced by the finding that zebrafish IFIT genes are upregulated by fish IFNs, poly(I:C) and two transcription factors IRF3/IRF7, likely via the IFN-stimulated response elements (ISRE) within the promoters of vertebrate IFIT family genes. These coevolution features creates functional association of both family genes to fulfill a common biological process, which is likely selected by viral infection during evolution of vertebrates. Our results are helpful for understanding of evolution of vertebrate IFN system.
Genome-wide analysis of the Dof transcription factor gene family reveals soybean-specific duplicable and functional characteristics.

PubMed

Guo, Yong; Qiu, Li-Juan

2013-01-01

The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max). In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs) were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.
Homology-dependent Gene Silencing in Paramecium

PubMed Central

Ruiz, Françoise; Vayssié, Laurence; Klotz, Catherine; Sperling, Linda; Madeddu, Luisa

1998-01-01

Microinjection at high copy number of plasmids containing only the coding region of a gene into the Paramecium somatic macronucleus led to a marked reduction in the expression of the corresponding endogenous gene(s). The silencing effect, which is stably maintained throughout vegetative growth, has been observed for all Paramecium genes examined so far: a single-copy gene (ND7), as well as members of multigene families (centrin genes and trichocyst matrix protein genes) in which all closely related paralogous genes appeared to be affected. This phenomenon may be related to posttranscriptional gene silencing in transgenic plants and quelling in Neurospora and allows the efficient creation of specific mutant phenotypes thus providing a potentially powerful tool to study gene function in Paramecium. For the two multigene families that encode proteins that coassemble to build up complex subcellular structures the analysis presented herein provides the first experimental evidence that the members of these gene families are not functionally redundant. PMID:9529389
Comprehensive analysis of coding-lncRNA gene co-expression network uncovers conserved functional lncRNAs in zebrafish.

PubMed

Chen, Wen; Zhang, Xuan; Li, Jing; Huang, Shulan; Xiang, Shuanglin; Hu, Xiang; Liu, Changning

2018-05-09

Zebrafish is a full-developed model system for studying development processes and human disease. Recent studies of deep sequencing had discovered a large number of long non-coding RNAs (lncRNAs) in zebrafish. However, only few of them had been functionally characterized. Therefore, how to take advantage of the mature zebrafish system to deeply investigate the lncRNAs' function and conservation is really intriguing. We systematically collected and analyzed a series of zebrafish RNA-seq data, then combined them with resources from known database and literatures. As a result, we obtained by far the most complete dataset of zebrafish lncRNAs, containing 13,604 lncRNA genes (21,128 transcripts) in total. Based on that, a co-expression network upon zebrafish coding and lncRNA genes was constructed and analyzed, and used to predict the Gene Ontology (GO) and the KEGG annotation of lncRNA. Meanwhile, we made a conservation analysis on zebrafish lncRNA, identifying 1828 conserved zebrafish lncRNA genes (1890 transcripts) that have their putative mammalian orthologs. We also found that zebrafish lncRNAs play important roles in regulation of the development and function of nervous system; these conserved lncRNAs present a significant sequential and functional conservation, with their mammalian counterparts. By integrative data analysis and construction of coding-lncRNA gene co-expression network, we gained the most comprehensive dataset of zebrafish lncRNAs up to present, as well as their systematic annotations and comprehensive analyses on function and conservation. Our study provides a reliable zebrafish-based platform to deeply explore lncRNA function and mechanism, as well as the lncRNA commonality between zebrafish and human.
Methods for the isolation of genes encoding novel PHB cycle enzymes from complex microbial communities.

PubMed

Nordeste, Ricardo F; Trainer, Maria A; Charles, Trevor C

2010-01-01

Development of different PHAs as alternatives to petrochemically derived plastics can be facilitated by mining metagenomic libraries for diverse PHA cycle genes that might be useful for synthesis of bioplastics. The specific phenotypes associated with mutations of the PHA synthesis pathway genes in Sinorhizobium meliloti allows for the use of powerful selection and screening tools to identify complementing novel PHA synthesis genes. Identification of novel genes through their function rather than sequence facilitates finding functional proteins that may otherwise have been excluded through sequence-only screening methodology. We present here methods that we have developed for the isolation of clones expressing novel PHA metabolism genes from metagenomic libraries.
Methods for the Isolation of Genes Encoding Novel PHA Metabolism Enzymes from Complex Microbial Communities.

PubMed

Cheng, Jiujun; Nordeste, Ricardo; Trainer, Maria A; Charles, Trevor C

2017-01-01

Development of different PHAs as alternatives to petrochemically derived plastics can be facilitated by mining metagenomic libraries for diverse PHA cycle genes that might be useful for synthesis of bio-plastics. The specific phenotypes associated with mutations of the PHA synthesis pathway genes in Sinorhizobium meliloti and Pseudomonas putida, allows the use of powerful selection and screening tools to identify complementing novel PHA synthesis genes. Identification of novel genes through their function rather than sequence facilitates the functional proteins that may otherwise have been excluded through sequence-only screening methodology. We present here methods that we have developed for the isolation of clones expressing novel PHA metabolism genes from metagenomic libraries.

Characterization and Functional Analysis of PEBP Family Genes in Upland Cotton (Gossypium hirsutum L.).

PubMed

Zhang, Xiaohong; Wang, Congcong; Pang, Chaoyou; Wei, Hengling; Wang, Hantao; Song, Meizhen; Fan, Shuli; Yu, Shuxun

2016-01-01

Upland cotton (Gossypium hirsutum L.) is a naturally occurring photoperiod-sensitive perennial plant species. However, sensitivity to the day length was lost during domestication. The phosphatidylethanolamine-binding protein (PEBP) gene family, of which three subclades have been identified in angiosperms, functions to promote and suppress flowering in photoperiod pathway. Recent evidence indicates that PEBP family genes play an important role in generating mobile flowering signals. We isolated homologues of the PEBP gene family in upland cotton and examined their regulation and function. Nine PEBP-like genes were cloned and phylogenetic analysis indicated the genes belonged to four subclades (FT, MFT, TFL1 and PEBP). Cotton PEBP-like genes showed distinct expression patterns in relation to different cotton genotypes, photoperiod responsive and cultivar maturity. The GhFT gene expression of a semi-wild race of upland cotton were strongly induced under short day condition, whereas the GhPEBP2 gene expression was induced under long days. We also elucidated that GhFT but not GhPEBP2 interacted with FD-like bZIP transcription factor GhFD and promote flowering under both long- and short-day conditions. The present result indicated that GhPEBP-like genes may perform different functions. This work corroborates the involvement of PEBP-like genes in photoperiod response and regulation of flowering time in different cotton genotypes, and contributes to an improved understanding of the function of PEBP-like genes in cotton.
Characterization and Functional Analysis of PEBP Family Genes in Upland Cotton (Gossypium hirsutum L.)

PubMed Central

Wang, Congcong; Pang, Chaoyou; Wei, Hengling; Wang, Hantao; Song, Meizhen; Fan, Shuli; Yu, Shuxun

2016-01-01

Upland cotton (Gossypium hirsutum L.) is a naturally occurring photoperiod-sensitive perennial plant species. However, sensitivity to the day length was lost during domestication. The phosphatidylethanolamine-binding protein (PEBP) gene family, of which three subclades have been identified in angiosperms, functions to promote and suppress flowering in photoperiod pathway. Recent evidence indicates that PEBP family genes play an important role in generating mobile flowering signals. We isolated homologues of the PEBP gene family in upland cotton and examined their regulation and function. Nine PEBP-like genes were cloned and phylogenetic analysis indicated the genes belonged to four subclades (FT, MFT, TFL1 and PEBP). Cotton PEBP-like genes showed distinct expression patterns in relation to different cotton genotypes, photoperiod responsive and cultivar maturity. The GhFT gene expression of a semi-wild race of upland cotton were strongly induced under short day condition, whereas the GhPEBP2 gene expression was induced under long days. We also elucidated that GhFT but not GhPEBP2 interacted with FD-like bZIP transcription factor GhFD and promote flowering under both long- and short-day conditions. The present result indicated that GhPEBP-like genes may perform different functions. This work corroborates the involvement of PEBP-like genes in photoperiod response and regulation of flowering time in different cotton genotypes, and contributes to an improved understanding of the function of PEBP-like genes in cotton. PMID:27552108
Arabidopsis Ensemble Reverse-Engineered Gene Regulatory Network Discloses Interconnected Transcription Factors in Oxidative Stress[W

PubMed Central

Vermeirssen, Vanessa; De Clercq, Inge; Van Parys, Thomas; Van Breusegem, Frank; Van de Peer, Yves

2014-01-01

The abiotic stress response in plants is complex and tightly controlled by gene regulation. We present an abiotic stress gene regulatory network of 200,014 interactions for 11,938 target genes by integrating four complementary reverse-engineering solutions through average rank aggregation on an Arabidopsis thaliana microarray expression compendium. This ensemble performed the most robustly in benchmarking and greatly expands upon the availability of interactions currently reported. Besides recovering 1182 known regulatory interactions, cis-regulatory motifs and coherent functionalities of target genes corresponded with the predicted transcription factors. We provide a valuable resource of 572 abiotic stress modules of coregulated genes with functional and regulatory information, from which we deduced functional relationships for 1966 uncharacterized genes and many regulators. Using gain- and loss-of-function mutants of seven transcription factors grown under control and salt stress conditions, we experimentally validated 141 out of 271 predictions (52% precision) for 102 selected genes and mapped 148 additional transcription factor-gene regulatory interactions (49% recall). We identified an intricate core oxidative stress regulatory network where NAC13, NAC053, ERF6, WRKY6, and NAC032 transcription factors interconnect and function in detoxification. Our work shows that ensemble reverse-engineering can generate robust biological hypotheses of gene regulation in a multicellular eukaryote that can be tested by medium-throughput experimental validation. PMID:25549671
Arabidopsis ensemble reverse-engineered gene regulatory network discloses interconnected transcription factors in oxidative stress.

PubMed

Vermeirssen, Vanessa; De Clercq, Inge; Van Parys, Thomas; Van Breusegem, Frank; Van de Peer, Yves

2014-12-01

The abiotic stress response in plants is complex and tightly controlled by gene regulation. We present an abiotic stress gene regulatory network of 200,014 interactions for 11,938 target genes by integrating four complementary reverse-engineering solutions through average rank aggregation on an Arabidopsis thaliana microarray expression compendium. This ensemble performed the most robustly in benchmarking and greatly expands upon the availability of interactions currently reported. Besides recovering 1182 known regulatory interactions, cis-regulatory motifs and coherent functionalities of target genes corresponded with the predicted transcription factors. We provide a valuable resource of 572 abiotic stress modules of coregulated genes with functional and regulatory information, from which we deduced functional relationships for 1966 uncharacterized genes and many regulators. Using gain- and loss-of-function mutants of seven transcription factors grown under control and salt stress conditions, we experimentally validated 141 out of 271 predictions (52% precision) for 102 selected genes and mapped 148 additional transcription factor-gene regulatory interactions (49% recall). We identified an intricate core oxidative stress regulatory network where NAC13, NAC053, ERF6, WRKY6, and NAC032 transcription factors interconnect and function in detoxification. Our work shows that ensemble reverse-engineering can generate robust biological hypotheses of gene regulation in a multicellular eukaryote that can be tested by medium-throughput experimental validation. © 2014 American Society of Plant Biologists. All rights reserved.
A data science approach to candidate gene selection of pain regarded as a process of learning and neural plasticity.

PubMed

Ultsch, Alfred; Kringel, Dario; Kalso, Eija; Mogil, Jeffrey S; Lötsch, Jörn

2016-12-01

The increasing availability of "big data" enables novel research approaches to chronic pain while also requiring novel techniques for data mining and knowledge discovery. We used machine learning to combine the knowledge about n = 535 genes identified empirically as relevant to pain with the knowledge about the functions of thousands of genes. Starting from an accepted description of chronic pain as displaying systemic features described by the terms "learning" and "neuronal plasticity," a functional genomics analysis proposed that among the functions of the 535 "pain genes," the biological processes "learning or memory" (P = 8.6 × 10) and "nervous system development" (P = 2.4 × 10) are statistically significantly overrepresented as compared with the annotations to these processes expected by chance. After establishing that the hypothesized biological processes were among important functional genomics features of pain, a subset of n = 34 pain genes were found to be annotated with both Gene Ontology terms. Published empirical evidence supporting their involvement in chronic pain was identified for almost all these genes, including 1 gene identified in March 2016 as being involved in pain. By contrast, such evidence was virtually absent in a randomly selected set of 34 other human genes. Hence, the present computational functional genomics-based method can be used for candidate gene selection, providing an alternative to established methods.
The Genome Sequence of Bacillus cereus ATCC 10987 Reveals Metabolic Adaptations and a Large Plasmid Related to Bacillus anthracis pXO1

DTIC Science & Technology

2004-01-01

Flagellar genes Presentb Presentc Presentc Tagatose utilization genes Absent Present Partiald Functional PlcR Absente Presente Presente Mobile genetic...closely related and one that is divergent (Supplementary ®g. S3). dThere are similar tagatose utilization genes in B.cereus ATCC 14579; however, they...replacement responsible for the transport and utilization of the carbohydrate tagatose (BCE1896±BCE1912). The corres- ponding 5.0 kb region in
Bridging the knowledge gap: from microbiome composition to function

PubMed Central

Faith, Jeremiah J

2015-01-01

Despite the wealth of metagenomic sequencing data, the functions of most bacterial genes from the mammalian microbiota have remained poorly understood. In their recent study (Yaung et al 2015), Wang, Gerber, and colleagues present a platform which allows functional mining of bacterial genomes for genes that contribute to fitness in vivo and holds great potential for forward engineering microbes with enhanced colonization abilities in the microbiota. PMID:26148349
FARO server: Meta-analysis of gene expression by matching gene expression signatures to a compendium of public gene expression data.

PubMed

Manijak, Mieszko P; Nielsen, Henrik B

2011-06-11

Although, systematic analysis of gene annotation is a powerful tool for interpreting gene expression data, it sometimes is blurred by incomplete gene annotation, missing expression response of key genes and secondary gene expression responses. These shortcomings may be partially circumvented by instead matching gene expression signatures to signatures of other experiments. To facilitate this we present the Functional Association Response by Overlap (FARO) server, that match input signatures to a compendium of 242 gene expression signatures, extracted from more than 1700 Arabidopsis microarray experiments. Hereby we present a publicly available tool for robust characterization of Arabidopsis gene expression experiments which can point to similar experimental factors in other experiments. The server is available at http://www.cbs.dtu.dk/services/faro/.
MUFFINN: cancer gene discovery via network analysis of somatic mutation data.

PubMed

Cho, Ara; Shim, Jung Eun; Kim, Eiru; Supek, Fran; Lehner, Ben; Lee, Insuk

2016-06-23

A major challenge for distinguishing cancer-causing driver mutations from inconsequential passenger mutations is the long-tail of infrequently mutated genes in cancer genomes. Here, we present and evaluate a method for prioritizing cancer genes accounting not only for mutations in individual genes but also in their neighbors in functional networks, MUFFINN (MUtations For Functional Impact on Network Neighbors). This pathway-centric method shows high sensitivity compared with gene-centric analyses of mutation data. Notably, only a marginal decrease in performance is observed when using 10 % of TCGA patient samples, suggesting the method may potentiate cancer genome projects with small patient populations.
Amiloride-enhanced gene transfection of octa-arginine functionalized calcium phosphate nanoparticles.

PubMed

Vanegas Sáenz, Juan Ramón; Tenkumo, Taichi; Kamano, Yuya; Egusa, Hiroshi; Sasaki, Keiichi

2017-01-01

Nanoparticles represent promising gene delivery systems in biomedicine to facilitate prolonged gene expression with low toxicity compared to viral vectors. Specifically, nanoparticles of calcium phosphate (nCaP), the main inorganic component of human bone, exhibit high biocompatibility and good biodegradability and have been reported to have high affinity for protein or DNA, having thus been used as gene transfer vectors. On the other hand, Octa-arginine (R8), which has a high permeability to cell membrane, has been reported to improve intracellular delivery systems. Here, we present an optimized method for nCaP-mediated gene delivery using an octa-arginine (R8)-functionalized nCaP vector containing a marker or functional gene construct. nCaP particle size was between 220-580 nm in diameter and all R8-functionalized nCaPs carried a positive charge. R8 concentration significantly improved nCaP transfection efficiency with high cell compatibility in human mesenchymal stem cells (hMSC) and human osteoblasts (hOB) in particular, suggesting nCaPs as a good option for non-viral vector gene delivery. Furthermore, pre-treatment with different endocytosis inhibitors identified that the endocytic pathway differed among cell lines and functionalized nanoparticles, with amiloride increasing transfection efficiency of R8-functionalized nCaPs in hMSC and hOB.
Decoding transcriptional enhancers: Evolving from annotation to functional interpretation

PubMed Central

Engel, Krysta L.; Mackiewicz, Mark; Hardigan, Andrew A.; Myers, Richard M.; Savic, Daniel

2016-01-01

Deciphering the intricate molecular processes that orchestrate the spatial and temporal regulation of genes has become an increasingly major focus of biological research. The differential expression of genes by diverse cell types with a common genome is a hallmark of complex cellular functions, as well as the basis for multicellular life. Importantly, a more coherent understanding of gene regulation is critical for defining developmental processes, evolutionary principles and disease etiologies. Here we present our current understanding of gene regulation by focusing on the role of enhancer elements in these complex processes. Although functional genomic methods have provided considerable advances to our understanding of gene regulation, these assays, which are usually performed on a genome-wide scale, typically provide correlative observations that lack functional interpretation. Recent innovations in genome editing technologies have placed gene regulatory studies at an exciting crossroads, as systematic, functional evaluation of enhancers and other transcriptional regulatory elements can now be performed in a coordinated, high-throughput manner across the entire genome. This review provides insights on transcriptional enhancer function, their role in development and disease, and catalogues experimental tools commonly used to study these elements. Additionally, we discuss the crucial role of novel techniques in deciphering the complex gene regulatory landscape and how these studies will shape future research. PMID:27224938
Decoding transcriptional enhancers: Evolving from annotation to functional interpretation.

PubMed

Engel, Krysta L; Mackiewicz, Mark; Hardigan, Andrew A; Myers, Richard M; Savic, Daniel

2016-09-01

Deciphering the intricate molecular processes that orchestrate the spatial and temporal regulation of genes has become an increasingly major focus of biological research. The differential expression of genes by diverse cell types with a common genome is a hallmark of complex cellular functions, as well as the basis for multicellular life. Importantly, a more coherent understanding of gene regulation is critical for defining developmental processes, evolutionary principles and disease etiologies. Here we present our current understanding of gene regulation by focusing on the role of enhancer elements in these complex processes. Although functional genomic methods have provided considerable advances to our understanding of gene regulation, these assays, which are usually performed on a genome-wide scale, typically provide correlative observations that lack functional interpretation. Recent innovations in genome editing technologies have placed gene regulatory studies at an exciting crossroads, as systematic, functional evaluation of enhancers and other transcriptional regulatory elements can now be performed in a coordinated, high-throughput manner across the entire genome. This review provides insights on transcriptional enhancer function, their role in development and disease, and catalogues experimental tools commonly used to study these elements. Additionally, we discuss the crucial role of novel techniques in deciphering the complex gene regulatory landscape and how these studies will shape future research. Copyright © 2016 Elsevier Ltd. All rights reserved.
Function-Based Algorithms for Biological Sequences

ERIC Educational Resources Information Center

Mohanty, Pragyan Sheela P.

2015-01-01

Two problems at two different abstraction levels of computational biology are studied. At the molecular level, efficient pattern matching algorithms in DNA sequences are presented. For gene order data, an efficient data structure is presented capable of storing all gene re-orderings in a systematic manner. A common characteristic of presented…
Nitrogen Cycle Evaluation (NiCE) Chip for the Simultaneous Analysis of Multiple N-Cycle Associated Genes.

PubMed

Oshiki, Mamoru; Segawa, Takahiro; Ishii, Satoshi

2018-02-02

Various microorganisms play key roles in the Nitrogen (N) cycle. Quantitative PCR (qPCR) and PCR-amplicon sequencing of the N cycle functional genes allow us to analyze the abundance and diversity of microbes responsible in the N transforming reactions in various environmental samples. However, analysis of multiple target genes can be cumbersome and expensive. PCR-independent analysis, such as metagenomics and metatranscriptomics, is useful but expensive especially when we analyze multiple samples and try to detect N cycle functional genes present at relatively low abundance. Here, we present the application of microfluidic qPCR chip technology to simultaneously quantify and prepare amplicon sequence libraries for multiple N cycle functional genes as well as taxon-specific 16S rRNA gene markers for many samples. This approach, named as N cycle evaluation (NiCE) chip, was evaluated by using DNA from pure and artificially mixed bacterial cultures and by comparing the results with those obtained by conventional qPCR and amplicon sequencing methods. Quantitative results obtained by the NiCE chip were comparable to those obtained by conventional qPCR. In addition, the NiCE chip was successfully applied to examine abundance and diversity of N cycle functional genes in wastewater samples. Although non-specific amplification was detected on the NiCE chip, this could be overcome by optimizing the primer sequences in the future. As the NiCE chip can provide high-throughput format to quantify and prepare sequence libraries for multiple N cycle functional genes, this tool should advance our ability to explore N cycling in various samples. Importance. We report a novel approach, namely Nitrogen Cycle Evaluation (NiCE) chip by using microfluidic qPCR chip technology. By sequencing the amplicons recovered from the NiCE chip, we can assess diversities of the N cycle functional genes. The NiCE chip technology is applicable to analyze the temporal dynamics of the N cycle gene transcriptions in wastewater treatment bioreactors. The NiCE chip can provide high-throughput format to quantify and prepare sequence libraries for multiple N cycle functional genes. While there is a room for future improvement, this tool should significantly advance our ability to explore the N cycle in various environmental samples. Copyright © 2018 American Society for Microbiology.
MANTIS: a phylogenetic framework for multi-species genome comparisons.

PubMed

Tzika, Athanasia C; Helaers, Raphaël; Van de Peer, Yves; Milinkovitch, Michel C

2008-01-15

Practitioners of comparative genomics face huge analytical challenges as whole genome sequences and functional/expression data accumulate. Furthermore, the field would greatly benefit from a better integration of this wealth of data with evolutionary concepts. Here, we present MANTIS, a relational database for the analysis of (i) gains and losses of genes on specific branches of the metazoan phylogeny, (ii) reconstructed genome content of ancestral species and (iii) over- or under-representation of functions/processes and tissue specificity of gained, duplicated and lost genes. MANTIS estimates the most likely positions of gene losses on the true phylogeny using a maximum-likelihood function. A user-friendly interface and an extensive query system allow to investigate questions pertaining to gene identity, phylogenetic mapping and function/expression parameters. MANTIS is freely available at http://www.mantisdb.org and constitutes the missing link between multi-species genome comparisons and functional analyses.
A hitchhiker's guide to the MADS world of plants.

PubMed

Gramzow, Lydia; Theissen, Guenter

2010-01-01

Plant life critically depends on the function of MADS-box genes encoding MADS-domain transcription factors, which are present to a limited extent in nearly all major eukaryotic groups, but constitute a large gene family in land plants. There are two types of MADS-box genes, termed type I and type II, and in plants these groups are distinguished by exon-intron and domain structure, rates of evolution, developmental function and degree of functional redundancy. The type I genes are further subdivided into three groups - M alpha, M beta and M gamma - while the type II genes are subdivided into the MIKCC and MIKC* groups. The functional diversification of MIKCC genes is closely linked to the origin of developmental and morphological novelties in the sporophytic (usually diploid) generation of seed plants, most spectacularly the floral organs and fruits of angiosperms. Functional studies suggest different specializations for the different classes of genes; whereas type I genes may preferentially contribute to female gametophyte, embryo and seed development and MIKC*-group genes to male gametophyte development, the MIKCC-group genes became essential for diverse aspects of sporophyte development. Beyond the usual transcriptional regulation, including feedback and feed-forward loops, various specialized mechanisms have evolved to control the expression of MADS-box genes, such as epigenetic control and regulation by small RNAs. In future, more data from genome projects and reverse genetic studies will allow us to understand the birth, functional diversification and death of members of this dynamic and important family of transcription factors in much more detail.
DynGO: a tool for visualizing and mining of Gene Ontology and its associations

PubMed Central

Liu, Hongfang; Hu, Zhang-Zhi; Wu, Cathy H

2005-01-01

Background A large volume of data and information about genes and gene products has been stored in various molecular biology databases. A major challenge for knowledge discovery using these databases is to identify related genes and gene products in disparate databases. The development of Gene Ontology (GO) as a common vocabulary for annotation allows integrated queries across multiple databases and identification of semantically related genes and gene products (i.e., genes and gene products that have similar GO annotations). Meanwhile, dozens of tools have been developed for browsing, mining or editing GO terms, their hierarchical relationships, or their "associated" genes and gene products (i.e., genes and gene products annotated with GO terms). Tools that allow users to directly search and inspect relations among all GO terms and their associated genes and gene products from multiple databases are needed. Results We present a standalone package called DynGO, which provides several advanced functionalities in addition to the standard browsing capability of the official GO browsing tool (AmiGO). DynGO allows users to conduct batch retrieval of GO annotations for a list of genes and gene products, and semantic retrieval of genes and gene products sharing similar GO annotations. The result are shown in an association tree organized according to GO hierarchies and supported with many dynamic display options such as sorting tree nodes or changing orientation of the tree. For GO curators and frequent GO users, DynGO provides fast and convenient access to GO annotation data. DynGO is generally applicable to any data set where the records are annotated with GO terms, as illustrated by two examples. Conclusion We have presented a standalone package DynGO that provides functionalities to search and browse GO and its association databases as well as several additional functions such as batch retrieval and semantic retrieval. The complete documentation and software are freely available for download from the website . PMID:16091147
Atopic Dermatitis Susceptibility Variants in Filaggrin Hitchhike Hornerin Selective Sweep

PubMed Central

Eaaswarkhanth, Muthukrishnan; Xu, Duo; Flanagan, Colin; Rzhetskaya, Margarita; Hayes, M. Geoffrey; Blekhman, Ran; Jablonski, Nina G.; Gokcumen, Omer

2016-01-01

Human skin has evolved rapidly, leaving evolutionary signatures in the genome. The filaggrin (FLG) gene is widely studied for its skin-barrier function in humans. The extensive genetic variation in this gene, especially common loss-of-function (LoF) mutations, has been established as primary risk factors for atopic dermatitis. To investigate the evolution of this gene, we analyzed 2,504 human genomes and genotyped the copy number variation of filaggrin repeats within FLG in 126 individuals from diverse ancestral backgrounds. We were unable to replicate a recent study claiming that LoF of FLG is adaptive in northern latitudes with lower ultraviolet light exposure. Instead, we present multiple lines of evidence suggesting that FLG genetic variation, including LoF variants, have little or no effect on fitness in modern humans. Haplotype-level scrutinization of the locus revealed signatures of a recent selective sweep in Asia, which increased the allele frequency of a haplotype group (Huxian haplogroup) in Asian populations. Functionally, we found that the Huxian haplogroup carries dozens of functional variants in FLG and hornerin (HRNR) genes, including those that are associated with atopic dermatitis susceptibility, HRNR expression levels and microbiome diversity on the skin. Our results suggest that the target of the adaptive sweep is HRNR gene function, and the functional FLG variants that involve susceptibility to atopic dermatitis, seem to hitchhike the selective sweep on HRNR. Our study presents a novel case of a locus that harbors clinically relevant common genetic variation with complex evolutionary trajectories. PMID:27678121
Autophagy genes in immunity

PubMed Central

Virgin, Herbert W; Levine, Beth

2009-01-01

In its classical form, autophagy is a pathway by which cytoplasmic constituents, including intracellular pathogens, are sequestered in a double-membrane–bound autophagosome and delivered to the lysosome for degradation. This pathway has been linked to diverse aspects of innate and adaptive immunity, including pathogen resistance, production of type I interferon, antigen presentation, tolerance and lymphocyte development, as well as the negative regulation of cytokine signaling and inflammation. Most of these links have emerged from studies in which genes encoding molecules involved in autophagy are inactivated in immune effector cells. However, it is not yet known whether all of the critical functions of such genes in immunity represent ‘classical autophagy’ or possible as-yet-undefined autophagolysosome-independent functions of these genes. This review summarizes phenotypes that result from the inactivation of autophagy genes in the immune system and discusses the pleiotropic functions of autophagy genes in immunity. PMID:19381141
Gymnosperm B-sister genes may be involved in ovule/seed development and, in some species, in the growth of fleshy fruit-like structures.

PubMed

Lovisetto, Alessandro; Guzzo, Flavia; Busatto, Nicola; Casadoro, Giorgio

2013-08-01

The evolution of seeds together with the mechanisms related to their dispersal into the environment represented a turning point in the evolution of plants. Seeds are produced by gymnosperms and angiosperms but only the latter have an ovary to be transformed into a fruit. Yet some gymnosperms produce fleshy structures attractive to animals, thus behaving like fruits from a functional point of view. The aim of this work is to increase our knowledge of possible mechanisms common to the development of both gymnosperm and angiosperm fruits. B-sister genes from two gymnosperms (Ginkgo biloba and Taxus baccata) were isolated and studied. The Ginkgo gene was also functionally characterized by ectopically expressing it in tobacco. In Ginkgo the fleshy structure derives from the outer seed integument and the B-sister gene is involved in its growth. In Taxus the fleshy structure is formed de novo as an outgrowth of the ovule peduncle, and the B-sister gene is not involved in this growth. In transgenic tobacco the Ginkgo gene has a positive role in tissue growth and confirms its importance in ovule/seed development. This study suggests that B-sister genes have a main function in ovule/seed development and a subsidiary role in the formation of fleshy fruit-like structures when the latter have an ovular origin, as occurs in Ginkgo. Thus, the 'fruit function' of B-sister genes is quite old, already being present in Gymnosperms as ancient as Ginkgoales, and is also present in Angiosperms where a B-sister gene has been shown to be involved in the formation of the Arabidopsis fruit.

The Evolutionary Fate of the Genes Encoding the Purine Catabolic Enzymes in Hominoids, Birds, and Reptiles

PubMed Central

Keebaugh, Alaine C.; Thomas, James W.

2010-01-01

Gene loss has been proposed to play a major role in adaptive evolution, and recent studies are beginning to reveal its importance in human evolution. However, the potential consequence of a single gene-loss event upon the fates of functionally interrelated genes is poorly understood. Here, we use the purine metabolic pathway as a model system in which to explore this important question. The loss of urate oxidase (UOX) activity, a necessary step in this pathway, has occurred independently in the hominoid and bird/reptile lineages. Because the loss of UOX would have removed the functional constraint upon downstream genes in this pathway, these downstream genes are generally assumed to have subsequently deteriorated. In this study, we used a comparative genomics approach to empirically determine the fate of UOX itself and the downstream genes in five hominoids, two birds, and a reptile. Although we found that the loss of UOX likely triggered the genetic deterioration of the immediate downstream genes in the hominoids, surprisingly in the birds and reptiles, the UOX locus itself and some of the downstream genes were present in the genome and predicted to encode proteins. To account for the variable pattern of gene retention and loss after the inactivation of UOX, we hypothesize that although gene loss is a common fate for genes that have been rendered obsolete due to the upstream loss of an enzyme a metabolic pathway, it is also possible that same lack of constraint will foster the evolution of new functions or allow the optimization of preexisting alternative functions in the downstream genes, thereby resulting in gene retention. Thus, adaptive single-gene losses have the potential to influence the long-term evolutionary fate of functionally interrelated genes. PMID:20106906
The evolutionary fate of the genes encoding the purine catabolic enzymes in hominoids, birds, and reptiles.

PubMed

Keebaugh, Alaine C; Thomas, James W

2010-06-01

Gene loss has been proposed to play a major role in adaptive evolution, and recent studies are beginning to reveal its importance in human evolution. However, the potential consequence of a single gene-loss event upon the fates of functionally interrelated genes is poorly understood. Here, we use the purine metabolic pathway as a model system in which to explore this important question. The loss of urate oxidase (UOX) activity, a necessary step in this pathway, has occurred independently in the hominoid and bird/reptile lineages. Because the loss of UOX would have removed the functional constraint upon downstream genes in this pathway, these downstream genes are generally assumed to have subsequently deteriorated. In this study, we used a comparative genomics approach to empirically determine the fate of UOX itself and the downstream genes in five hominoids, two birds, and a reptile. Although we found that the loss of UOX likely triggered the genetic deterioration of the immediate downstream genes in the hominoids, surprisingly in the birds and reptiles, the UOX locus itself and some of the downstream genes were present in the genome and predicted to encode proteins. To account for the variable pattern of gene retention and loss after the inactivation of UOX, we hypothesize that although gene loss is a common fate for genes that have been rendered obsolete due to the upstream loss of an enzyme a metabolic pathway, it is also possible that same lack of constraint will foster the evolution of new functions or allow the optimization of preexisting alternative functions in the downstream genes, thereby resulting in gene retention. Thus, adaptive single-gene losses have the potential to influence the long-term evolutionary fate of functionally interrelated genes.
Development and use of the Cytoscape app GFD-Net for measuring semantic dissimilarity of gene networks

PubMed Central

Diaz-Montana, Juan J.; Diaz-Diaz, Norberto

2014-01-01

Gene networks are one of the main computational models used to study the interaction between different elements during biological processes being widely used to represent gene–gene, or protein–protein interaction complexes. We present GFD-Net, a Cytoscape app for visualizing and analyzing the functional dissimilarity of gene networks. PMID:25400907
Nucleotide diversity, natural variation, and evolution of Flexible culm-1 and Strong culm-2 lodging resistance genes in rice.

PubMed

Rashid, Muhammad Abdul Rehman; Zhao, Yan; Zhang, Hongliang; Li, Jinjie; Li, Zichao

2016-07-01

Lodging resistance is one of the vital traits in yield improvement and sustainability. Culm wall thickness, diameter, and strength are different traits that can govern the lodging resistance in rice. The genes SCM2 and FC1 have been isolated for culm thickness, strength, and flexibility, but their functional nucleotide variations were still unknown. We used a 13× deep sequence of 795 diverse genotypes to present the functional variation and SNP diversity in SCM2 and FC1. The major functional variant for the SCM2 gene was at position 27480181 and for the FC1 gene at position 31072992. Haplotype analysis of both genes provided their various allelic differences among haplotypes. SCM2 alleles further presented the evolution of Oryza sativa L. subsp. indica and subsp. japonica genomes from common parent in different geographical zones, while the haplotypes of FC1 suggested their evolution from different strains of the common parent Oryza rufipogon. SCM2 showed purifying selection and functional associations with rare alleles, while FC1 displayed balanced selection favored by multiple heterozygous alleles. Genotypes with an allelic combination of SCM2-3 and FC1-2 in japonica background exhibited striking resistance against lodging, which can be used in further breeding programs.
Functional annotation of the vlinc class of non-coding RNAs using systems biology approach

PubMed Central

Laurent, Georges St.; Vyatkin, Yuri; Antonets, Denis; Ri, Maxim; Qi, Yao; Saik, Olga; Shtokalo, Dmitry; de Hoon, Michiel J.L.; Kawaji, Hideya; Itoh, Masayoshi; Lassmann, Timo; Arner, Erik; Forrest, Alistair R.R.; Nicolas, Estelle; McCaffrey, Timothy A.; Carninci, Piero; Hayashizaki, Yoshihide; Wahlestedt, Claes; Kapranov, Philipp

2016-01-01

Functionality of the non-coding transcripts encoded by the human genome is the coveted goal of the modern genomics research. While commonly relied on the classical methods of forward genetics, integration of different genomics datasets in a global Systems Biology fashion presents a more productive avenue of achieving this very complex aim. Here we report application of a Systems Biology-based approach to dissect functionality of a newly identified vast class of very long intergenic non-coding (vlinc) RNAs. Using highly quantitative FANTOM5 CAGE dataset, we show that these RNAs could be grouped into 1542 novel human genes based on analysis of insulators that we show here indeed function as genomic barrier elements. We show that vlincRNAs genes likely function in cis to activate nearby genes. This effect while most pronounced in closely spaced vlincRNA–gene pairs can be detected over relatively large genomic distances. Furthermore, we identified 101 vlincRNA genes likely involved in early embryogenesis based on patterns of their expression and regulation. We also found another 109 such genes potentially involved in cellular functions also happening at early stages of development such as proliferation, migration and apoptosis. Overall, we show that Systems Biology-based methods have great promise for functional annotation of non-coding RNAs. PMID:27001520
Construction of a Bacterial Cell that Contains Only the Set of Essential Genes Necessary to Impart Life

DTIC Science & Technology

2014-05-16

native uncharacterized genes for characterized genes from Bacillus subtilis , that is presented in a constitutive expression module. If the B... subtilis gene containing M. mycoides mutant is viable than the function of the conserved hypothetical gene is the same as the input B. subtilis gene...Characterized genes from B. subtilis were swapped with similar, but not so similar as to be clearly the same, essential genes from M. mycoides. The B. subtilis
Ortholog-based screening and identification of genes related to intracellular survival.

PubMed

Yang, Xiaowen; Wang, Jiawei; Bing, Guoxia; Bie, Pengfei; De, Yanyan; Lyu, Yanli; Wu, Qingmin

2018-04-20

Bioinformatics and comparative genomics analysis methods were used to predict unknown pathogen genes based on homology with identified or functionally clustered genes. In this study, the genes of common pathogens were analyzed to screen and identify genes associated with intracellular survival through sequence similarity, phylogenetic tree analysis and the λ-Red recombination system test method. The total 38,952 protein-coding genes of common pathogens were divided into 19,775 clusters. As demonstrated through a COG analysis, information storage and processing genes might play an important role intracellular survival. Only 19 clusters were present in facultative intracellular pathogens, and not all were present in extracellular pathogens. Construction of a phylogenetic tree selected 18 of these 19 clusters. Comparisons with the DEG database and previous research revealed that seven other clusters are considered essential gene clusters and that seven other clusters are associated with intracellular survival. Moreover, this study confirmed that clusters screened by orthologs with similar function could be replaced with an approved uvrY gene and its orthologs, and the results revealed that the usg gene is associated with intracellular survival. The study improves the current understanding of intracellular pathogens characteristics and allows further exploration of the intracellular survival-related gene modules in these pathogens. Copyright © 2018. Published by Elsevier B.V.
Correlated gene expression and anatomical communication support synchronized brain activity in the mouse functional connectome.

PubMed

Mills, Brian D; Grayson, David S; Shunmugavel, Anandakumar; Miranda-Dominguez, Oscar; Feczko, Eric; Earl, Eric; Neve, Kim; Fair, Damien A

2018-05-22

Cognition and behavior depend on synchronized intrinsic brain activity that is organized into functional networks across the brain. Research has investigated how anatomical connectivity both shapes and is shaped by these networks, but not how anatomical connectivity interacts with intra-areal molecular properties to drive functional connectivity. Here, we present a novel linear model to explain functional connectivity by integrating systematically obtained measurements of axonal connectivity, gene expression, and resting state functional connectivity MRI in the mouse brain. The model suggests that functional connectivity arises from both anatomical links and inter-areal similarities in gene expression. By estimating these effects, we identify anatomical modules in which correlated gene expression and anatomical connectivity support functional connectivity. Along with providing evidence that not all genes equally contribute to functional connectivity, this research establishes new insights regarding the biological underpinnings of coordinated brain activity measured by BOLD fMRI. SIGNIFICANCE STATEMENT Efforts at characterizing the functional connectome with fMRI have risen exponentially over the last decade. Yet despite this rise, the biological underpinnings of these functional measurements are still largely unknown. The current report begins to fill this void by investigating the molecular underpinnings of the functional connectome through an integration of systematically obtained structural information and gene expression data throughout the rodent brain. We find that both white matter connectivity and similarity in regional gene expression relate to resting state functional connectivity. The current report furthers our understanding of the biological underpinnings of the functional connectome and provides a linear model that can be utilized to streamline preclinical animal studies of disease. Copyright © 2018 the authors.
Genomics and functional genomics in Chlamydomonas reinhardtii

DOE Office of Scientific and Technical Information (OSTI.GOV)

Blaby, Ian K.; Blaby-Haas, Crysten E.

The availability of the Chlamydomonas reinhardtii nuclear genome sequence continues to enable researchers to address biological questions relevant to algae, land plants and animals in unprecedented ways. As we continue to characterize and understand biological processes in C. reinhardtii and translate that knowledge to other systems, we are faced with the realization that many genes encode proteins without a defined function. The field of functional genomics aims to close this gap between genome sequence and protein function. Transcriptomes, proteomes and phenomes can each provide layers of gene-specific functional data while supplying a global snapshot of cellular behavior under different conditions.more » Herein we present a brief history of functional genomics, the present status of the C. reinhardtii genome, how genome-wide experiments can aid in supplying protein function inferences, and provide an outlook for functional genomics in C. reinhardtii.« less
Genomics and functional genomics in Chlamydomonas reinhardtii

DOE PAGES

Blaby, Ian K.; Blaby-Haas, Crysten E.

2017-03-21

The availability of the Chlamydomonas reinhardtii nuclear genome sequence continues to enable researchers to address biological questions relevant to algae, land plants and animals in unprecedented ways. As we continue to characterize and understand biological processes in C. reinhardtii and translate that knowledge to other systems, we are faced with the realization that many genes encode proteins without a defined function. The field of functional genomics aims to close this gap between genome sequence and protein function. Transcriptomes, proteomes and phenomes can each provide layers of gene-specific functional data while supplying a global snapshot of cellular behavior under different conditions.more » Herein we present a brief history of functional genomics, the present status of the C. reinhardtii genome, how genome-wide experiments can aid in supplying protein function inferences, and provide an outlook for functional genomics in C. reinhardtii.« less
Identifying metabolic enzymes with multiple types of association evidence

PubMed Central

Kharchenko, Peter; Chen, Lifeng; Freund, Yoav; Vitkup, Dennis; Church, George M

2006-01-01

Background Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding genes. Results We present a novel method for identifying genes encoding for a specific metabolic function based on a local structure of metabolic network and multiple types of functional association evidence, including clustering of genes on the chromosome, similarity of phylogenetic profiles, gene expression, protein fusion events and others. Using E. coli and S. cerevisiae metabolic networks, we illustrate predictive ability of each individual type of association evidence and show that significantly better predictions can be obtained based on the combination of all data. In this way our method is able to predict 60% of enzyme-encoding genes of E. coli metabolism within the top 10 (out of 3551) candidates for their enzymatic function, and as a top candidate within 43% of the cases. Conclusion We illustrate that a combination of genome context and other functional association evidence is effective in predicting genes encoding metabolic enzymes. Our approach does not rely on direct sequence homology to known enzyme-encoding genes, and can be used in conjunction with traditional homology-based metabolic reconstruction methods. The method can also be used to target orphan metabolic activities. PMID:16571130
Transient gene expression in epidermal cells of plant leaves by biolistic DNA delivery.

PubMed

Ueki, Shoko; Magori, Shimpei; Lacroix, Benoît; Citovsky, Vitaly

2013-01-01

Transient gene expression is a useful approach for studying the functions of gene products. In the case of plants, Agrobacterium infiltration is a method of choice for transient introduction of genes for many species. However, this technique does not work efficiently in some species, such as Arabidopsis thaliana. Moreover, the infection of Agrobacterium is known to induce dynamic changes in gene expression patterns in the host plants, possibly affecting the function and localization of the proteins to be tested. These problems can be circumvented by biolistic delivery of the genes of interest. Here, we present an optimized protocol for biolistic delivery of plasmid DNA into epidermal cells of plant leaves, which can be easily performed using the Bio-Rad Helios gene gun system. This protocol allows efficient and reproducible transient expression of diverse genes in Arabidopsis, Nicotiana benthamiana and N. tabacum, and is suitable for studies of the biological function and subcellular localization of the gene products directly in planta. The protocol also can be easily adapted to other species by optimizing the delivery gas pressure.
Annotation of gene function in citrus using gene expression information and co-expression networks

PubMed Central

2014-01-01

Background The genus Citrus encompasses major cultivated plants such as sweet orange, mandarin, lemon and grapefruit, among the world’s most economically important fruit crops. With increasing volumes of transcriptomics data available for these species, Gene Co-expression Network (GCN) analysis is a viable option for predicting gene function at a genome-wide scale. GCN analysis is based on a “guilt-by-association” principle whereby genes encoding proteins involved in similar and/or related biological processes may exhibit similar expression patterns across diverse sets of experimental conditions. While bioinformatics resources such as GCN analysis are widely available for efficient gene function prediction in model plant species including Arabidopsis, soybean and rice, in citrus these tools are not yet developed. Results We have constructed a comprehensive GCN for citrus inferred from 297 publicly available Affymetrix Genechip Citrus Genome microarray datasets, providing gene co-expression relationships at a genome-wide scale (33,000 transcripts). The comprehensive citrus GCN consists of a global GCN (condition-independent) and four condition-dependent GCNs that survey the sweet orange species only, all citrus fruit tissues, all citrus leaf tissues, or stress-exposed plants. All of these GCNs are clustered using genome-wide, gene-centric (guide) and graph clustering algorithms for flexibility of gene function prediction. For each putative cluster, gene ontology (GO) enrichment and gene expression specificity analyses were performed to enhance gene function, expression and regulation pattern prediction. The guide-gene approach was used to infer novel roles of genes involved in disease susceptibility and vitamin C metabolism, and graph-clustering approaches were used to investigate isoprenoid/phenylpropanoid metabolism in citrus peel, and citric acid catabolism via the GABA shunt in citrus fruit. Conclusions Integration of citrus gene co-expression networks, functional enrichment analysis and gene expression information provide opportunities to infer gene function in citrus. We present a publicly accessible tool, Network Inference for Citrus Co-Expression (NICCE, http://citrus.adelaide.edu.au/nicce/home.aspx), for the gene co-expression analysis in citrus. PMID:25023870
Combining Shigella Tn-seq data with gold-standard E. coli gene deletion data suggests rare transitions between essential and non-essential gene functionality.

PubMed

Freed, Nikki E; Bumann, Dirk; Silander, Olin K

2016-09-06

Gene essentiality - whether or not a gene is necessary for cell growth - is a fundamental component of gene function. It is not well established how quickly gene essentiality can change, as few studies have compared empirical measures of essentiality between closely related organisms. Here we present the results of a Tn-seq experiment designed to detect essential protein coding genes in the bacterial pathogen Shigella flexneri 2a 2457T on a genome-wide scale. Superficial analysis of this data suggested that 481 protein-coding genes in this Shigella strain are critical for robust cellular growth on rich media. Comparison of this set of genes with a gold-standard data set of essential genes in the closely related Escherichia coli K12 BW25113 revealed that an excessive number of genes appeared essential in Shigella but non-essential in E. coli. Importantly, and in converse to this comparison, we found no genes that were essential in E. coli and non-essential in Shigella, implying that many genes were artefactually inferred as essential in Shigella. Controlling for such artefacts resulted in a much smaller set of discrepant genes. Among these, we identified three sets of functionally related genes, two of which have previously been implicated as critical for Shigella growth, but which are dispensable for E. coli growth. The data presented here highlight the small number of protein coding genes for which we have strong evidence that their essentiality status differs between the closely related bacterial taxa E. coli and Shigella. A set of genes involved in acetate utilization provides a canonical example. These results leave open the possibility of developing strain-specific antibiotic treatments targeting such differentially essential genes, but suggest that such opportunities may be rare in closely related bacteria.
Biolistics-based gene silencing in plants using a modified particle inflow gun.

PubMed

Davies, Kevin M; Deroles, Simon C; Boase, Murray R; Hunter, Don A; Schwinn, Kathy E

2013-01-01

RNA interference (RNAi) is one of the most commonly used techniques for examining the function of genes of interest. In this chapter we present two examples of RNAi that use the particle inflow gun for delivery of the DNA constructs. In one example transient RNAi is used to show the function of an anthocyanin regulatory gene in flower petals. In the second example stably transformed cell cultures are produced with an RNAi construct that results in a change in the anthocyanin hydroxylation pattern.
Insights on the functional impact of microRNAs present in autism-associated copy number variants.

PubMed

Vaishnavi, Varadarajan; Manikandan, Mayakannan; Tiwary, Basant K; Munirajan, Arasambattu Kannan

2013-01-01

Autism spectrum disorder is a complex neurodevelopmental disorder that appears during the first three years of infancy and lasts throughout a person's life. Recently a large category of genomic structural variants, denoted as copy number variants (CNVs), were established to be a major contributor of the pathophysiology of autism. To date almost all studies have focussed only on the genes present in the CNV loci, but the impact of non-coding regulatory microRNAs (miRNAs) present in these regions remain largely unexplored. Hence we attempted to elucidate the biological and functional significance of miRNAs present in autism-associated CNV loci and their target genes by using a series of computational tools. We demonstrate that nearly 11% of the CNV loci harbor miRNAs and a few of these miRNAs were previously reported to be associated with autism. A systematic analysis of the CNV-miRNAs based on their interactions with the target genes enabled the identification of top 10 miRNAs namely hsa-miR-590-3p, hsa-miR-944, hsa-miR-570, hsa-miR-34a, hsa-miR-124, hsa-miR-548f, hsa-miR-429, hsa-miR-200b, hsa-miR-195 and hsa-miR-497 as hub molecules. Further, the CNV-miRNAs formed a regulatory loop with transcription factors and their downstream target genes, and annotation of these target genes indicated their functional involvement in neurodevelopment and synapse. Moreover, miRNAs present in deleted and duplicated CNV loci may explain the difference in dosage of the crucial genes controlled by them. These CNV-miRNAs can also impair the global processing and biogenesis of all miRNAs by targeting key molecules in the miRNA pathway. To our knowledge, this is the first report to highlight the significance of CNV-microRNAs and their target genes to contribute towards the genetic heterogeneity and phenotypic variability of autism.
Broad Integration of Expression Maps and Co-Expression Networks Compassing Novel Gene Functions in the Brain

PubMed Central

Okamura-Oho, Yuko; Shimokawa, Kazuro; Nishimura, Masaomi; Takemoto, Satoko; Sato, Akira; Furuichi, Teiichi; Yokota, Hideo

2014-01-01

Using a recently invented technique for gene expression mapping in the whole-anatomy context, termed transcriptome tomography, we have generated a dataset of 36,000 maps of overall gene expression in the adult-mouse brain. Here, using an informatics approach, we identified a broad co-expression network that follows an inverse power law and is rich in functional interaction and gene-ontology terms. Our framework for the integrated analysis of expression maps and graphs of co-expression networks revealed that groups of combinatorially expressed genes, which regulate cell differentiation during development, were present in the adult brain and each of these groups was associated with a discrete cell types. These groups included non-coding genes of unknown function. We found that these genes specifically linked developmentally conserved groups in the network. A previously unrecognized robust expression pattern covering the whole brain was related to the molecular anatomy of key biological processes occurring in particular areas. PMID:25382412
Functional Annotation, Genome Organization and Phylogeny of the Grapevine (Vitis vinifera) Terpene Synthase Gene Family Based on Genome Assembly, FLcDNA Cloning, and Enzyme Assays

PubMed Central

2010-01-01

Background Terpenoids are among the most important constituents of grape flavour and wine bouquet, and serve as useful metabolite markers in viticulture and enology. Based on the initial 8-fold sequencing of a nearly homozygous Pinot noir inbred line, 89 putative terpenoid synthase genes (VvTPS) were predicted by in silico analysis of the grapevine (Vitis vinifera) genome assembly [1]. The finding of this very large VvTPS family, combined with the importance of terpenoid metabolism for the organoleptic properties of grapevine berries and finished wines, prompted a detailed examination of this gene family at the genomic level as well as an investigation into VvTPS biochemical functions. Results We present findings from the analysis of the up-dated 12-fold sequencing and assembly of the grapevine genome that place the number of predicted VvTPS genes at 69 putatively functional VvTPS, 20 partial VvTPS, and 63 VvTPS probable pseudogenes. Gene discovery and annotation included information about gene architecture and chromosomal location. A dense cluster of 45 VvTPS is localized on chromosome 18. Extensive FLcDNA cloning, gene synthesis, and protein expression enabled functional characterization of 39 VvTPS; this is the largest number of functionally characterized TPS for any species reported to date. Of these enzymes, 23 have unique functions and/or phylogenetic locations within the plant TPS gene family. Phylogenetic analyses of the TPS gene family showed that while most VvTPS form species-specific gene clusters, there are several examples of gene orthology with TPS of other plant species, representing perhaps more ancient VvTPS, which have maintained functions independent of speciation. Conclusions The highly expanded VvTPS gene family underpins the prominence of terpenoid metabolism in grapevine. We provide a detailed experimental functional annotation of 39 members of this important gene family in grapevine and comprehensive information about gene structure and phylogeny for the entire currently known VvTPS gene family. PMID:20964856
Conserved syntenic clusters of protein coding genes are missing in birds.

PubMed

Lovell, Peter V; Wirthlin, Morgan; Wilhelm, Larry; Minx, Patrick; Lazar, Nathan H; Carbone, Lucia; Warren, Wesley C; Mello, Claudio V

2014-01-01

Birds are one of the most highly successful and diverse groups of vertebrates, having evolved a number of distinct characteristics, including feathers and wings, a sturdy lightweight skeleton and unique respiratory and urinary/excretion systems. However, the genetic basis of these traits is poorly understood. Using comparative genomics based on extensive searches of 60 avian genomes, we have found that birds lack approximately 274 protein coding genes that are present in the genomes of most vertebrate lineages and are for the most part organized in conserved syntenic clusters in non-avian sauropsids and in humans. These genes are located in regions associated with chromosomal rearrangements, and are largely present in crocodiles, suggesting that their loss occurred subsequent to the split of dinosaurs/birds from crocodilians. Many of these genes are associated with lethality in rodents, human genetic disorders, or biological functions targeting various tissues. Functional enrichment analysis combined with orthogroup analysis and paralog searches revealed enrichments that were shared by non-avian species, present only in birds, or shared between all species. Together these results provide a clearer definition of the genetic background of extant birds, extend the findings of previous studies on missing avian genes, and provide clues about molecular events that shaped avian evolution. They also have implications for fields that largely benefit from avian studies, including development, immune system, oncogenesis, and brain function and cognition. With regards to the missing genes, birds can be considered ‘natural knockouts’ that may become invaluable model organisms for several human diseases.
An in silico assessment of gene function and organization of the phenylpropanoid pathway metabolic networks in Arabidopsis thaliana and limitations thereof

NASA Technical Reports Server (NTRS)

Costa, Michael A.; Collins, R. Eric; Anterola, Aldwin M.; Cochrane, Fiona C.; Davin, Laurence B.; Lewis, Norman G.

2003-01-01

The Arabidopsis genome sequencing in 2000 gave to science the first blueprint of a vascular plant. Its successful completion also prompted the US National Science Foundation to launch the Arabidopsis 2010 initiative, the goal of which is to identify the function of each gene by 2010. In this study, an exhaustive analysis of The Institute for Genomic Research (TIGR) and The Arabidopsis Information Resource (TAIR) databases, together with all currently compiled EST sequence data, was carried out in order to determine to what extent the various metabolic networks from phenylalanine ammonia lyase (PAL) to the monolignols were organized and/or could be predicted. In these databases, there are some 65 genes which have been annotated as encoding putative enzymatic steps in monolignol biosynthesis, although many of them have only very low homology to monolignol pathway genes of known function in other plant systems. Our detailed analysis revealed that presently only 13 genes (two PALs, a cinnamate-4-hydroxylase, a p-coumarate-3-hydroxylase, a ferulate-5-hydroxylase, three 4-coumarate-CoA ligases, a cinnamic acid O-methyl transferase, two cinnamoyl-CoA reductases) and two cinnamyl alcohol dehydrogenases can be classified as having a bona fide (definitive) function; the remaining 52 genes currently have undetermined physiological roles. The EST database entries for this particular set of genes also provided little new insight into how the monolignol pathway was organized in the different tissues and organs, this being perhaps a consequence of both limitations in how tissue samples were collected and in the incomplete nature of the EST collections. This analysis thus underscores the fact that even with genomic sequencing, presumed to provide the entire suite of putative genes in the monolignol-forming pathway, a very large effort needs to be conducted to establish actual catalytic roles (including enzyme versatility), as well as the physiological function(s) for each member of the (multi)gene families present and the metabolic networks that are operative. Additionally, one key to identifying physiological functions for many of these (and other) unknown genes, and their corresponding metabolic networks, awaits the development of technologies to comprehensively study molecular processes at the single cell level in particular tissues and organs, in order to establish the actual metabolic context.

Comprehensive analysis of alternative splicing and functionality in neuronal differentiation of P19 cells.

PubMed

Suzuki, Hitoshi; Osaki, Ken; Sano, Kaori; Alam, A H M Khurshid; Nakamura, Yuichiro; Ishigaki, Yasuhito; Kawahara, Kozo; Tsukahara, Toshifumi

2011-02-18

Alternative splicing, which produces multiple mRNAs from a single gene, occurs in most human genes and contributes to protein diversity. Many alternative isoforms are expressed in a spatio-temporal manner, and function in diverse processes, including in the neural system. The purpose of the present study was to comprehensively investigate neural-splicing using P19 cells. GeneChip Exon Array analysis was performed using total RNAs purified from cells during neuronal cell differentiation. To efficiently and readily extract the alternative exon candidates, 9 filtering conditions were prepared, yielding 262 candidate exons (236 genes). Semiquantitative RT-PCR results in 30 randomly selected candidates suggested that 87% of the candidates were differentially alternatively spliced in neuronal cells compared to undifferentiated cells. Gene ontology and pathway analyses suggested that many of the candidate genes were associated with neural events. Together with 66 genes whose functions in neural cells or organs were reported previously, 47 candidate genes were found to be linked to 189 events in the gene-level profile of neural differentiation. By text-mining for the alternative isoform, distinct functions of the isoforms of 9 candidate genes indicated by the result of Exon Array were confirmed. Alternative exons were successfully extracted. Results from the informatics analyses suggested that neural events were primarily governed by genes whose expression was increased and whose transcripts were differentially alternatively spliced in the neuronal cells. In addition to known functions in neural cells or organs, the uninvestigated alternative splicing events of 11 genes among 47 candidate genes suggested that cell cycle events are also potentially important. These genes may help researchers to differentiate the roles of alternative splicing in cell differentiation and cell proliferation.
A novel bioinformatics pipeline to discover genes related to arbuscular mycorrhizal symbiosis based on their evolutionary conservation pattern among higher plants.

PubMed

Favre, Patrick; Bapaume, Laure; Bossolini, Eligio; Delorenzi, Mauro; Falquet, Laurent; Reinhardt, Didier

2014-12-03

Genes involved in arbuscular mycorrhizal (AM) symbiosis have been identified primarily by mutant screens, followed by identification of the mutated genes (forward genetics). In addition, a number of AM-related genes has been identified by their AM-related expression patterns, and their function has subsequently been elucidated by knock-down or knock-out approaches (reverse genetics). However, genes that are members of functionally redundant gene families, or genes that have a vital function and therefore result in lethal mutant phenotypes, are difficult to identify. If such genes are constitutively expressed and therefore escape differential expression analyses, they remain elusive. The goal of this study was to systematically search for AM-related genes with a bioinformatics strategy that is insensitive to these problems. The central element of our approach is based on the fact that many AM-related genes are conserved only among AM-competent species. Our approach involves genome-wide comparisons at the proteome level of AM-competent host species with non-mycorrhizal species. Using a clustering method we first established orthologous/paralogous relationships and subsequently identified protein clusters that contain members only of the AM-competent species. Proteins of these clusters were then analyzed in an extended set of 16 plant species and ranked based on their relatedness among AM-competent monocot and dicot species, relative to non-mycorrhizal species. In addition, we combined the information on the protein-coding sequence with gene expression data and with promoter analysis. As a result we present a list of yet uncharacterized proteins that show a strongly AM-related pattern of sequence conservation, indicating that the respective genes may have been under selection for a function in AM. Among the top candidates are three genes that encode a small family of similar receptor-like kinases that are related to the S-locus receptor kinases involved in sporophytic self-incompatibility. We present a new systematic strategy of gene discovery based on conservation of the protein-coding sequence that complements classical forward and reverse genetics. This strategy can be applied to diverse other biological phenomena if species with established genome sequences fall into distinguished groups that differ in a defined functional trait of interest.
Construction of a Bacterial Cell that Contains Only the Set of Essential Genes Necessary to Impart Life

DTIC Science & Technology

2014-08-15

characterized genes from Bacillus subtilis , that is presented in a constitutive expression module. If the B. subtilis gene containing M. mycoides mutant is...essential gene MMYC_0361 with the rlmH gene from Bacillus subtilis . Mycoplasma mycoides containing the B. subtilis rlmH was viable. This tells us the...viable than the function of the conserved hypothetical gene is the same as the input B. subtilis gene. Table of Contents: Section
Dynamic changes in gene expression during human trophoblast differentiation.

PubMed

Handwerger, Stuart; Aronow, Bruce

2003-01-01

The genetic program that directs human placental differentiation is poorly understood. In a recent study, we used DNA microarray analyses to determine genes that are dynamically regulated during human placental development in an in vitro model system in which highly purified cytotrophoblast cells aggregate spontaneously and fuse to form a multinucleated syncytium that expresses placental lactogen, human chorionic gonadotropin, and other proteins normally expressed by fully differentiated syncytiotrophoblast cells. Of the 6918 genes present on the Incyte Human GEM V microarray that we analyzed over a 9-day period, 141 were induced and 256 were downregulated by more than 2-fold. The dynamically regulated genes fell into nine distinct kinetic patterns of induction or repression, as detected by the K-means algorithm. Classifying the genes according to functional characteristics, the regulated genes could be divided into six overall categories: cell and tissue structural dynamics, cell cycle and apoptosis, intercellular communication, metabolism, regulation of gene expression, and expressed sequence tags and function unknown. Gene expression changes within key functional categories were tightly coupled to the morphological changes that occurred during trophoblast differentiation. Within several key gene categories (e.g., cell and tissue structure), many genes were strongly activated, while others with related function were strongly repressed. These findings suggest that trophoblast differentiation is augmented by "categorical reprogramming" in which the ability of induced genes to function is enhanced by diminished synthesis of other genes within the same category. We also observed categorical reprogramming in human decidual fibroblasts decidualized in vitro in response to progesterone, estradiol, and cyclic AMP. While there was little overlap between genes that are dynamically regulated during trophoblast differentiation versus decidualization, many of the categories in which genes were strongly activated also contained genes whose expression was strongly diminished. Taken together, these findings point to a fundamental role for simultaneous induction and repression of mRNAs that encode functionally related proteins during the differentiation process.
New genes as drivers of phenotypic evolution

PubMed Central

Chen, Sidi; Krinsky, Benjamin H.; Long, Manyuan

2014-01-01

During the course of evolution, genomes acquire novel genetic elements as sources of functional and phenotypic diversity, including new genes that originated in recent evolution. In the past few years, substantial progress has been made in understanding the evolution and phenotypic effects of new genes. In particular, an emerging picture is that new genes, despite being present in the genomes of only a subset of species, can rapidly evolve indispensable roles in fundamental biological processes, including development, reproduction, brain function and behaviour. The molecular underpinnings of how new genes can develop these roles are starting to be characterized. These recent discoveries yield fresh insights into our broad understanding of biological diversity at refined resolution. PMID:23949544
New genes as drivers of phenotypic evolution.

PubMed

Chen, Sidi; Krinsky, Benjamin H; Long, Manyuan

2013-09-01

During the course of evolution, genomes acquire novel genetic elements as sources of functional and phenotypic diversity, including new genes that originated in recent evolution. In the past few years, substantial progress has been made in understanding the evolution and phenotypic effects of new genes. In particular, an emerging picture is that new genes, despite being present in the genomes of only a subset of species, can rapidly evolve indispensable roles in fundamental biological processes, including development, reproduction, brain function and behaviour. The molecular underpinnings of how new genes can develop these roles are starting to be characterized. These recent discoveries yield fresh insights into our broad understanding of biological diversity at refined resolution.
A Sorghum bicolor expression atlas reveals dynamic genotype-specific expression profiles for vegetative tissues of grain, sweet and bioenergy sorghums.

PubMed

Shakoor, Nadia; Nair, Ramesh; Crasta, Oswald; Morris, Geoffrey; Feltus, Alex; Kresovich, Stephen

2014-01-23

Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specific probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e.g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community.
A Sorghum bicolor expression atlas reveals dynamic genotype-specific expression profiles for vegetative tissues of grain, sweet and bioenergy sorghums

PubMed Central

2014-01-01

Background Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. Results This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specific probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e.g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Conclusions Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community. PMID:24456189
Enhancing biological relevance of a weighted gene co-expression network for functional module identification.

PubMed

Prom-On, Santitham; Chanthaphan, Atthawut; Chan, Jonathan Hoyin; Meechai, Asawin

2011-02-01

Relationships among gene expression levels may be associated with the mechanisms of the disease. While identifying a direct association such as a difference in expression levels between case and control groups links genes to disease mechanisms, uncovering an indirect association in the form of a network structure may help reveal the underlying functional module associated with the disease under scrutiny. This paper presents a method to improve the biological relevance in functional module identification from the gene expression microarray data by enhancing the structure of a weighted gene co-expression network using minimum spanning tree. The enhanced network, which is called a backbone network, contains only the essential structural information to represent the gene co-expression network. The entire backbone network is decoupled into a number of coherent sub-networks, and then the functional modules are reconstructed from these sub-networks to ensure minimum redundancy. The method was tested with a simulated gene expression dataset and case-control expression datasets of autism spectrum disorder and colorectal cancer studies. The results indicate that the proposed method can accurately identify clusters in the simulated dataset, and the functional modules of the backbone network are more biologically relevant than those obtained from the original approach.
Informatic and genomic analysis of melanocyte cDNA libraries as a resource for the study of melanocyte development and function.

PubMed

Baxter, Laura L; Hsu, Benjamin J; Umayam, Lowell; Wolfsberg, Tyra G; Larson, Denise M; Frith, Martin C; Kawai, Jun; Hayashizaki, Yoshihide; Carninci, Piero; Pavan, William J

2007-06-01

As part of the RIKEN mouse encyclopedia project, two cDNA libraries were prepared from melanocyte-derived cell lines, using techniques of full-length clone selection and subtraction/normalization to enrich for rare transcripts. End sequencing showed that these libraries display over 83% complete coding sequence at the 5' end and 96-97% complete coding sequence at the 3' end. Evaluation of the libraries, derived from B16F10Y tumor cells and melan-c cells, revealed that they contain clones for a majority of the genes previously demonstrated to function in melanocyte biology. Analysis of genomic locations for transcripts revealed that the distribution of melanocyte genes is non-random throughout the genome. Three genomic regions identified that showed significant clustering of melanocyte-expressed genes contain one or more genes previously shown to regulate melanocyte development or function. A catalog of genes expressed in these libraries is presented, providing a valuable resource of cDNA clones and sequence information that can be used for identification of new genes important for melanocyte development, function, and disease.
SoyFN: a knowledge database of soybean functional networks.

PubMed

Xu, Yungang; Guo, Maozu; Liu, Xiaoyan; Wang, Chunyu; Liu, Yang

2014-01-01

Many databases for soybean genomic analysis have been built and made publicly available, but few of them contain knowledge specifically targeting the omics-level gene-gene, gene-microRNA (miRNA) and miRNA-miRNA interactions. Here, we present SoyFN, a knowledge database of soybean functional gene networks and miRNA functional networks. SoyFN provides user-friendly interfaces to retrieve, visualize, analyze and download the functional networks of soybean genes and miRNAs. In addition, it incorporates much information about KEGG pathways, gene ontology annotations and 3'-UTR sequences as well as many useful tools including SoySearch, ID mapping, Genome Browser, eFP Browser and promoter motif scan. SoyFN is a schema-free database that can be accessed as a Web service from any modern programming language using a simple Hypertext Transfer Protocol call. The Web site is implemented in Java, JavaScript, PHP, HTML and Apache, with all major browsers supported. We anticipate that this database will be useful for members of research communities both in soybean experimental science and bioinformatics. Database URL: http://nclab.hit.edu.cn/SoyFN.
Amiloride-enhanced gene transfection of octa-arginine functionalized calcium phosphate nanoparticles

PubMed Central

Tenkumo, Taichi; Kamano, Yuya; Egusa, Hiroshi; Sasaki, Keiichi

2017-01-01

Nanoparticles represent promising gene delivery systems in biomedicine to facilitate prolonged gene expression with low toxicity compared to viral vectors. Specifically, nanoparticles of calcium phosphate (nCaP), the main inorganic component of human bone, exhibit high biocompatibility and good biodegradability and have been reported to have high affinity for protein or DNA, having thus been used as gene transfer vectors. On the other hand, Octa-arginine (R8), which has a high permeability to cell membrane, has been reported to improve intracellular delivery systems. Here, we present an optimized method for nCaP-mediated gene delivery using an octa-arginine (R8)-functionalized nCaP vector containing a marker or functional gene construct. nCaP particle size was between 220–580 nm in diameter and all R8-functionalized nCaPs carried a positive charge. R8 concentration significantly improved nCaP transfection efficiency with high cell compatibility in human mesenchymal stem cells (hMSC) and human osteoblasts (hOB) in particular, suggesting nCaPs as a good option for non-viral vector gene delivery. Furthermore, pre-treatment with different endocytosis inhibitors identified that the endocytic pathway differed among cell lines and functionalized nanoparticles, with amiloride increasing transfection efficiency of R8-functionalized nCaPs in hMSC and hOB. PMID:29145481
PTGBase: an integrated database to study tandem duplicated genes in plants.

PubMed

Yu, Jingyin; Ke, Tao; Tehrim, Sadia; Sun, Fengming; Liao, Boshou; Hua, Wei

2015-01-01

Tandem duplication is a wide-spread phenomenon in plant genomes and plays significant roles in evolution and adaptation to changing environments. Tandem duplicated genes related to certain functions will lead to the expansion of gene families and bring increase of gene dosage in the form of gene cluster arrays. Many tandem duplication events have been studied in plant genomes; yet, there is a surprising shortage of efforts to systematically present the integration of large amounts of information about publicly deposited tandem duplicated gene data across the plant kingdom. To address this shortcoming, we developed the first plant tandem duplicated genes database, PTGBase. It delivers the most comprehensive resource available to date, spanning 39 plant genomes, including model species and newly sequenced species alike. Across these genomes, 54 130 tandem duplicated gene clusters (129 652 genes) are presented in the database. Each tandem array, as well as its member genes, is characterized in complete detail. Tandem duplicated genes in PTGBase can be explored through browsing or searching by identifiers or keywords of functional annotation and sequence similarity. Users can download tandem duplicated gene arrays easily to any scale, up to the complete annotation data set for an entire plant genome. PTGBase will be updated regularly with newly sequenced plant species as they become available. © The Author(s) 2015. Published by Oxford University Press.
Identification of genes containing expanded purine repeats in the human genome and their apparent protective role against cancer.

PubMed

Singh, Himanshu Narayan; Rajeswari, Moganty R

2016-01-01

Purine repeat sequences present in a gene are unique as they have high propensity to form unusual DNA-triple helix structures. Friedreich's ataxia is the only human disease that is well known to be associated with DNA-triplexes formed by purine repeats. The purpose of this study was to recognize the expanded purine repeats (EPRs) in human genome and find their correlation with cancer pathogenesis. We developed "PuRepeatFinder.pl" algorithm to identify non-overlapping EPRs without pyrimidine interruptions in the human genome and customized for searching repeat lengths, n ≥ 200. A total of 1158 EPRs were identified in the genome which followed Wakeby distribution. Two hundred and ninety-six EPRs were found in geneic regions of 282 genes (EPR-genes). Gene clustering of EPR-genes was done based on their cellular function and a large number of EPR-genes were found to be enzymes/enzyme modulators. Meta-analysis of 282 EPR-genes identified only 63 EPR-genes in association with cancer, mostly in breast, lung, and blood cancers. Protein-protein interaction network analysis of all 282 EPR-genes identified proteins including those in cadherins and VEGF. The two observations, that EPRs can induce mutations under malignant conditions and that identification of some EPR-gene products in vital cell signaling-mediated pathways, together suggest the crucial role of EPRs in carcinogenesis. The new link between EPR-genes and their functionally interacting proteins throws a new dimension in the present understanding of cancer pathogenesis and can help in planning therapeutic strategies. Validation of present results using techniques like NGS is required to establish the role of the EPR genes in cancer pathology.
In search of functional association from time-series microarray data based on the change trend and level of gene expression

PubMed Central

He, Feng; Zeng, An-Ping

2006-01-01

Background The increasing availability of time-series expression data opens up new possibilities to study functional linkages of genes. Present methods used to infer functional linkages between genes from expression data are mainly based on a point-to-point comparison. Change trends between consecutive time points in time-series data have been so far not well explored. Results In this work we present a new method based on extracting main features of the change trend and level of gene expression between consecutive time points. The method, termed as trend correlation (TC), includes two major steps: 1, calculating a maximal local alignment of change trend score by dynamic programming and a change trend correlation coefficient between the maximal matched change levels of each gene pair; 2, inferring relationships of gene pairs based on two statistical extraction procedures. The new method considers time shifts and inverted relationships in a similar way as the local clustering (LC) method but the latter is merely based on a point-to-point comparison. The TC method is demonstrated with data from yeast cell cycle and compared with the LC method and the widely used Pearson correlation coefficient (PCC) based clustering method. The biological significance of the gene pairs is examined with several large-scale yeast databases. Although the TC method predicts an overall lower number of gene pairs than the other two methods at a same p-value threshold, the additional number of gene pairs inferred by the TC method is considerable: e.g. 20.5% compared with the LC method and 49.6% with the PCC method for a p-value threshold of 2.7E-3. Moreover, the percentage of the inferred gene pairs consistent with databases by our method is generally higher than the LC method and similar to the PCC method. A significant number of the gene pairs only inferred by the TC method are process-identity or function-similarity pairs or have well-documented biological interactions, including 443 known protein interactions and some known cell cycle related regulatory interactions. It should be emphasized that the overlapping of gene pairs detected by the three methods is normally not very high, indicating a necessity of combining the different methods in search of functional association of genes from time-series data. For a p-value threshold of 1E-5 the percentage of process-identity and function-similarity gene pairs among the shared part of the three methods reaches 60.2% and 55.6% respectively, building a good basis for further experimental and functional study. Furthermore, the combined use of methods is important to infer more complete regulatory circuits and network as exemplified in this study. Conclusion The TC method can significantly augment the current major methods to infer functional linkages and biological network and is well suitable for exploring temporal relationships of gene expression in time-series data. PMID:16478547
Essential RNA-Based Technologies and Their Applications in Plant Functional Genomics.

PubMed

Teotia, Sachin; Singh, Deepali; Tang, Xiaoqing; Tang, Guiliang

2016-02-01

Genome sequencing has not only extended our understanding of the blueprints of many plant species but has also revealed the secrets of coding and non-coding genes. We present here a brief introduction to and personal account of key RNA-based technologies, as well as their development and applications for functional genomics of plant coding and non-coding genes, with a focus on short tandem target mimics (STTMs), artificial microRNAs (amiRNAs), and CRISPR/Cas9. In addition, their use in multiplex technologies for the functional dissection of gene networks is discussed. Copyright © 2015 Elsevier Ltd. All rights reserved.
Identification of the Core Set of Carbon-Associated Genes in a Bioenergy Grassland Soil

DOE PAGES

Howe, Adina; Yang, Fan; Williams, Ryan J.; ...

2016-11-17

Despite the central role of soil microbial communities in global carbon (C) cycling, little is known about soil microbial community structure and even less about their metabolic pathways. Efforts to characterize soil communities often focus on identifying differences in gene content across environmental gradients, but an alternative question is what genes are similar in soils. These genes may indicate critical species or potential functions that are required in all soils. Here we identified the “core” set of C cycling sequences widely present in multiple soil metagenomes from a fertilized prairie (FP). Of 226,887 sequences associated with known enzymes involved inmore » the synthesis, metabolism, and transport of carbohydrates, 843 were identified to be consistently prevalent across four replicate soil metagenomes. This core metagenome was functionally and taxonomically diverse, representing five enzyme classes and 99 enzyme families within the CAZy database. Though it only comprised 0.4% of all CAZy-associated genes identified in FP metagenomes, the core was found to be comprised of functions similar to those within cumulative soils. The FP CAZy-associated core sequences were present in multiple publicly available soil metagenomes and most similar to soils sharing geographic proximity. As a result, in soil ecosystems, where high diversity remains a key challenge for metagenomic investigations, these core genes represent a subset of critical functions necessary for carbohydrate metabolism, which can be targeted to evaluate important C fluxes in these and other similar soils.« less
Function and expression pattern of nonsyndromic deafness genes

PubMed Central

Hilgert, Nele; Smith, Richard J.H.; Van Camp, Guy

2010-01-01

Hearing loss is the most common sensory disorder, present in 1 of every 500 newborns. To date, 46 genes have been identified that cause nonsyndromic hearing loss, making it an extremely heterogeneous trait. This review provides a comprehensive overview of the inner ear function and expression pattern of these genes. In general, they are involved in hair bundle morphogenesis, form constituents of the extracellular matrix, play a role in cochlear ion homeostasis or serve as transcription factors. During the past few years, our knowledge of genes involved in hair bundle morphogenesis has increased substantially. We give an up-to-date overview of both the nonsyndromic and Usher syndrome genes involved in this process, highlighting proteins that interact to form macromolecular complexes. For every gene, we also summarize its expression pattern and impact on hearing at the functional level. Gene-specific cochlear expression is summarized in a unique table by structure/cell type and is illustrated on a cochlear cross-section, which is available online via the Hereditary Hearing Loss Homepage. This review should provide auditory scientists the most relevant information for all identified nonsyndromic deafness genes. PMID:19601806
Transcriptome analysis of phosphorus stress responsiveness in the seedlings of Dongxiang wild rice (Oryza rufipogon Griff.).

PubMed

Deng, Qian-Wen; Luo, Xiang-Dong; Chen, Ya-Ling; Zhou, Yi; Zhang, Fan-Tao; Hu, Biao-Lin; Xie, Jian-Kun

2018-03-15

Low phosphorus availability is a major factor restricting rice growth. Dongxiang wild rice (Oryza rufipogon Griff.) has many useful genes lacking in cultivated rice, including stress resistance to phosphorus deficiency, cold, salt and drought, which is considered to be a precious germplasm resource for rice breeding. However, the molecular mechanism of regulation of phosphorus deficiency tolerance is not clear. In this study, cDNA libraries were constructed from the leaf and root tissues of phosphorus stressed and untreated Dongxiang wild rice seedlings, and transcriptome sequencing was performed with the goal of elucidating the molecular mechanisms involved in phosphorus stress response. The results indicated that 1184 transcripts were differentially expressed in the leaves (323 up-regulated and 861 down-regulated) and 986 transcripts were differentially expressed in the roots (756 up-regulated and 230 down-regulated). 43 genes were up-regulated both in leaves and roots, 38 genes were up-regulated in roots but down-regulated in leaves, and only 2 genes were down-regulated in roots but up-regulated in leaves. Among these differentially expressed genes, the detection of many transcription factors and functional genes demonstrated that multiple regulatory pathways were involved in phosphorus deficiency tolerance. Meanwhile, the differentially expressed genes were also annotated with gene ontology terms and key pathways via functional classification and Kyoto Encyclopedia of Gene and Genomes pathway mapping, respectively. A set of the most important candidate genes was then identified by combining the differentially expressed genes found in the present study with previously identified phosphorus deficiency tolerance quantitative trait loci. The present work provides abundant genomic information for functional dissection of the phosphorus deficiency resistance of Dongxiang wild rice, which will be help to understand the biological regulatory mechanisms of phosphorus deficiency tolerance in Dongxiang wild rice.
Potential Functional Replacement of the Plastidic Acetyl-CoA Carboxylase Subunit (accD) Gene by Recent Transfers to the Nucleus in Some Angiosperm Lineages1[W][OA

PubMed Central

Rousseau-Gueutin, Mathieu; Huang, Xun; Higginson, Emily; Ayliffe, Michael; Day, Anil; Timmis, Jeremy N.

2013-01-01

Eukaryotic cells originated when an ancestor of the nucleated cell engulfed bacterial endosymbionts that gradually evolved into the mitochondrion and the chloroplast. Soon after these endosymbiotic events, thousands of ancestral prokaryotic genes were functionally transferred from the endosymbionts to the nucleus. This process of functional gene relocation, now rare in eukaryotes, continues in angiosperms. In this article, we show that the chloroplastic acetyl-CoA carboxylase subunit (accD) gene that is present in the plastome of most angiosperms has been functionally relocated to the nucleus in the Campanulaceae. Surprisingly, the nucleus-encoded accD transcript is considerably smaller than the plastidic version, consisting of little more than the carboxylase domain of the plastidic accD gene fused to a coding region encoding a plastid targeting peptide. We verified experimentally the presence of a chloroplastic transit peptide by showing that the product of the nuclear accD fused to green fluorescent protein was imported in the chloroplasts. The nuclear gene regulatory elements that enabled the erstwhile plastidic gene to become functional in the nuclear genome were identified, and the evolution of the intronic and exonic sequences in the nucleus is described. Relocation and truncation of the accD gene is a remarkable example of the processes underpinning endosymbiotic evolution. PMID:23435694

Inferring evolution of gene duplicates using probabilistic models and nonparametric belief propagation.

PubMed

Zeng, Jia; Hannenhalli, Sridhar

2013-01-01

Gene duplication, followed by functional evolution of duplicate genes, is a primary engine of evolutionary innovation. In turn, gene expression evolution is a critical component of overall functional evolution of paralogs. Inferring evolutionary history of gene expression among paralogs is therefore a problem of considerable interest. It also represents significant challenges. The standard approaches of evolutionary reconstruction assume that at an internal node of the duplication tree, the two duplicates evolve independently. However, because of various selection pressures functional evolution of the two paralogs may be coupled. The coupling of paralog evolution corresponds to three major fates of gene duplicates: subfunctionalization (SF), conserved function (CF) or neofunctionalization (NF). Quantitative analysis of these fates is of great interest and clearly influences evolutionary inference of expression. These two interrelated problems of inferring gene expression and evolutionary fates of gene duplicates have not been studied together previously and motivate the present study. Here we propose a novel probabilistic framework and algorithm to simultaneously infer (i) ancestral gene expression and (ii) the likely fate (SF, NF, CF) at each duplication event during the evolution of gene family. Using tissue-specific gene expression data, we develop a nonparametric belief propagation (NBP) algorithm to predict the ancestral expression level as a proxy for function, and describe a novel probabilistic model that relates the predicted and known expression levels to the possible evolutionary fates. We validate our model using simulation and then apply it to a genome-wide set of gene duplicates in human. Our results suggest that SF tends to be more frequent at the earlier stage of gene family expansion, while NF occurs more frequently later on.
Characterization of a novel gene at the Gaucher disease locus spanning the region between the glucocerebrosidase (GC) pseudogene and thrombospondin (TSP)3

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ginns, E.I.; Winfield, S.; Sidransky, E.

1994-09-01

The human GC locus on chromosome 1q21 encompasses a 7 kb functional gene encoding the enzyme deficient in Gaucher disease, and a highly homologous sequence 16 Kb downstream that has the properties of a pseudogene. A novel gene, gene X, spanning the 6 kb region between the pseudogene and TSP3 has been identified and characterized in the mouse, and appears to be critical for normal embryonic development. As in the mouse, the human gene X is located 5{prime} to the TSP3 gene and two genes are transcribed divergently from a bidirectional promoter; the direction of transcription of gene X andmore » GC is convergent. However, in the human, gene X and GC are separated by gene X and GC pseudogenes that are the consequence of a gene duplication. The gene X pseudogene lacks the first exon and part of the second exon of the functional gene and may not be transcribed. Northern blot analyses indicate that gene X is transcribed in both normal individuals and in patients with Gaucher disease, but the function of this gene is still unknown. The possibility that mutations in gene X could account for some of the diversity of symptoms encountered in individuals with the more atypical presentations of Gaucher disease is under investigation.« less
A transversal approach to predict gene product networks from ontology-based similarity

PubMed Central

Chabalier, Julie; Mosser, Jean; Burgun, Anita

2007-01-01

Background Interpretation of transcriptomic data is usually made through a "standard" approach which consists in clustering the genes according to their expression patterns and exploiting Gene Ontology (GO) annotations within each expression cluster. This approach makes it difficult to underline functional relationships between gene products that belong to different expression clusters. To address this issue, we propose a transversal analysis that aims to predict functional networks based on a combination of GO processes and data expression. Results The transversal approach presented in this paper consists in computing the semantic similarity between gene products in a Vector Space Model. Through a weighting scheme over the annotations, we take into account the representativity of the terms that annotate a gene product. Comparing annotation vectors results in a matrix of gene product similarities. Combined with expression data, the matrix is displayed as a set of functional gene networks. The transversal approach was applied to 186 genes related to the enterocyte differentiation stages. This approach resulted in 18 functional networks proved to be biologically relevant. These results were compared with those obtained through a standard approach and with an approach based on information content similarity. Conclusion Complementary to the standard approach, the transversal approach offers new insight into the cellular mechanisms and reveals new research hypotheses by combining gene product networks based on semantic similarity, and data expression. PMID:17605807
A Genome-wide CRISPR Screen in Toxoplasma Identifies Essential Apicomplexan Genes.

PubMed

Sidik, Saima M; Huet, Diego; Ganesan, Suresh M; Huynh, My-Hang; Wang, Tim; Nasamu, Armiyaw S; Thiru, Prathapan; Saeij, Jeroen P J; Carruthers, Vern B; Niles, Jacquin C; Lourido, Sebastian

2016-09-08

Apicomplexan parasites are leading causes of human and livestock diseases such as malaria and toxoplasmosis, yet most of their genes remain uncharacterized. Here, we present the first genome-wide genetic screen of an apicomplexan. We adapted CRISPR/Cas9 to assess the contribution of each gene from the parasite Toxoplasma gondii during infection of human fibroblasts. Our analysis defines ∼200 previously uncharacterized, fitness-conferring genes unique to the phylum, from which 16 were investigated, revealing essential functions during infection of human cells. Secondary screens identify as an invasion factor the claudin-like apicomplexan microneme protein (CLAMP), which resembles mammalian tight-junction proteins and localizes to secretory organelles, making it critical to the initiation of infection. CLAMP is present throughout sequenced apicomplexan genomes and is essential during the asexual stages of the malaria parasite Plasmodium falciparum. These results provide broad-based functional information on T. gondii genes and will facilitate future approaches to expand the horizon of antiparasitic interventions. Copyright © 2016 Elsevier Inc. All rights reserved.
Functional genomics analysis of low concentration of ethanol in human hepatocellular carcinoma (HepG2) cells. Role of genes involved in transcriptional and translational processes.

PubMed

Castaneda, Francisco; Rosin-Steiner, Sigrid; Jung, Klaus

2006-12-21

We previously found that ethanol at millimolar level (1 mM) activates the expression of transcription factors with subsequent regulation of apoptotic genes in human hepatocellular carcinoma (HCC) HepG2 cells. However, the role of ethanol on the expression of genes implicated in transcriptional and translational processes remains unknown. Therefore, the aim of this study was to characterize the effect of low concentration of ethanol on gene expression profiling in HepG2 cells using cDNA microarrays with especial interest in genes with transcriptional and translational function. The gene expression pattern observed in the ethanol-treated HepG2 cells revealed a relatively similar pattern to that found in the untreated control cells. The pairwise comparison analysis demonstrated four significantly up-regulated (COBRA1, ITGB4, STAU2, and HMGN3) genes and one down-regulated (ANK3) gene. All these genes exert their function on transcriptional and translational processes and until now none of these genes have been associated with ethanol. This functional genomic analysis demonstrates the reported interaction between ethanol and ethanol-regulated genes. Moreover, it confirms the relationship between ethanol-regulated genes and various signaling pathways associated with ethanol-induced apoptosis. The data presented in this study represents an important contribution toward the understanding of the molecular mechanisms of ethanol at low concentration in HepG2 cells, a HCC-derived cell line.
Functional genomics analysis of low concentration of ethanol in human hepatocellular carcinoma (HepG2) cells. Role of genes involved in transcriptional and translational processes

PubMed Central

Castaneda, Francisco; Rosin-Steiner, Sigrid; Jung, Klaus

2007-01-01

We previously found that ethanol at millimolar level (1 mM) activates the expression of transcription factors with subsequent regulation of apoptotic genes in human hepatocellular carcinoma (HCC) HepG2 cells. However, the role of ethanol on the expression of genes implicated in transcriptional and translational processes remains unknown. Therefore, the aim of this study was to characterize the effect of low concentration of ethanol on gene expression profiling in HepG2 cells using cDNA microarrays with especial interest in genes with transcriptional and translational function. The gene expression pattern observed in the ethanol-treated HepG2 cells revealed a relatively similar pattern to that found in the untreated control cells. The pairwise comparison analysis demonstrated four significantly up-regulated (COBRA1, ITGB4, STAU2, and HMGN3) genes and one down-regulated (ANK3) gene. All these genes exert their function on transcriptional and translational processes and until now none of these genes have been associated with ethanol. This functional genomic analysis demonstrates the reported interaction between ethanol and ethanol-regulated genes. Moreover, it confirms the relationship between ethanol-regulated genes and various signaling pathways associated with ethanol-induced apoptosis. The data presented in this study represents an important contribution toward the understanding of the molecular mechanisms of ethanol at low concentration in HepG2 cells, a HCC-derived cell line. PMID:17211498
Molecular analysis of the glucocerebrosidase gene locus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Winfield, S.L.; Martin, B.M.; Fandino, A.

1994-09-01

Gaucher disease is due to a deficiency in the activity of the lysosomal enzyme glucocerebrosidase. Both the functional gene for this enzyme and a pseudogene are located in close proximity on chromosome 1q21. Analysis of the mutations present in patient samples has suggested interaction between the functional gene and the pseudogene in the origin of mutant genotypes. To investigate the involvement of regions flanking the functional gene and pseudogene in the origin of mutations found in Gaucher disease, a YAC clone containing DNA from this locus has been subcloned and characterized. The original YAC containing {approximately}360 kb was truncated withmore » the use of fragmentation plasmids to about 85 kb. A lambda library derived from this YAC was screened to obtain clones containing glucocerebrosidase sequences. PCR amplification was used to identify subclones containing 5{prime}, central, or 3{prime} sequences of the functional gene or of the pseudogene. Clones spanning the entire distance from the last exon of the functional gene to intron 1 of the pseudogene, the 5{prime} end of the functional gene and 16 kb of 5{prime} flanking region and approximately 15 kb of 3{prime} flanking region of the pseudogene were sequenced. Sequence data from 48 kb of intergenic and flanking regions of the glucocerebrosidase gene and its pseudogene has been generated. A large number of Alu sequences and several simple repeats have been found. Two of these repeats exhibit fragment length polymorphism. There is almost 100% homology between the 3{prime} flanking regions of the functional gene and the pseudogene, extending to about 4 kb past the termination codons. A much lower degree of homology is observed in the 5{prime} flanking region. Patient samples are currently being screened for polymorphisms in these flanking regions.« less
An Arabidopsis Gene Regulatory Network for Secondary Cell Wall Synthesis

PubMed Central

Taylor-Teeples, M; Lin, L; de Lucas, M; Turco, G; Toal, TW; Gaudinier, A; Young, NF; Trabucco, GM; Veling, MT; Lamothe, R; Handakumbura, PP; Xiong, G; Wang, C; Corwin, J; Tsoukalas, A; Zhang, L; Ware, D; Pauly, M; Kliebenstein, DJ; Dehesh, K; Tagkopoulos, I; Breton, G; Pruneda-Paz, JL; Ahnert, SE; Kay, SA; Hazen, SP; Brady, SM

2014-01-01

Summary The plant cell wall is an important factor for determining cell shape, function and response to the environment. Secondary cell walls, such as those found in xylem, are composed of cellulose, hemicelluloses and lignin and account for the bulk of plant biomass. The coordination between transcriptional regulation of synthesis for each polymer is complex and vital to cell function. A regulatory hierarchy of developmental switches has been proposed, although the full complement of regulators remains unknown. Here, we present a protein-DNA network between Arabidopsis transcription factors and secondary cell wall metabolic genes with gene expression regulated by a series of feed-forward loops. This model allowed us to develop and validate new hypotheses about secondary wall gene regulation under abiotic stress. Distinct stresses are able to perturb targeted genes to potentially promote functional adaptation. These interactions will serve as a foundation for understanding the regulation of a complex, integral plant component. PMID:25533953
Functional profiles of orphan membrane transporters in the life cycle of the malaria parasite

PubMed Central

Kenthirapalan, Sanketha; Waters, Andrew P.; Matuschewski, Kai; Kooij, Taco W. A.

2016-01-01

Assigning function to orphan membrane transport proteins and prioritizing candidates for detailed biochemical characterization remain fundamental challenges and are particularly important for medically relevant pathogens, such as malaria parasites. Here we present a comprehensive genetic analysis of 35 orphan transport proteins of Plasmodium berghei during its life cycle in mice and Anopheles mosquitoes. Six genes, including four candidate aminophospholipid transporters, are refractory to gene deletion, indicative of essential functions. We generate and phenotypically characterize 29 mutant strains with deletions of individual transporter genes. Whereas seven genes appear to be dispensable under the experimental conditions tested, deletion of any of the 22 other genes leads to specific defects in life cycle progression in vivo and/or host transition. Our study provides growing support for a potential link between heavy metal homeostasis and host switching and reveals potential targets for rational design of new intervention strategies against malaria. PMID:26796412
Integrative and conjugative elements and their hosts: composition, distribution and organization

PubMed Central

Touchon, Marie; Rocha, Eduardo P. C.

2017-01-01

Abstract Conjugation of single-stranded DNA drives horizontal gene transfer between bacteria and was widely studied in conjugative plasmids. The organization and function of integrative and conjugative elements (ICE), even if they are more abundant, was only studied in a few model systems. Comparative genomics of ICE has been precluded by the difficulty in finding and delimiting these elements. Here, we present the results of a method that circumvents these problems by requiring only the identification of the conjugation genes and the species’ pan-genome. We delimited 200 ICEs and this allowed the first large-scale characterization of these elements. We quantified the presence in ICEs of a wide set of functions associated with the biology of mobile genetic elements, including some that are typically associated with plasmids, such as partition and replication. Protein sequence similarity networks and phylogenetic analyses revealed that ICEs are structured in functional modules. Integrases and conjugation systems have different evolutionary histories, even if the gene repertoires of ICEs can be grouped in function of conjugation types. Our characterization of the composition and organization of ICEs paves the way for future functional and evolutionary analyses of their cargo genes, composed of a majority of unknown function genes. PMID:28911112
Functional annotation of the vlinc class of non-coding RNAs using systems biology approach.

PubMed

St Laurent, Georges; Vyatkin, Yuri; Antonets, Denis; Ri, Maxim; Qi, Yao; Saik, Olga; Shtokalo, Dmitry; de Hoon, Michiel J L; Kawaji, Hideya; Itoh, Masayoshi; Lassmann, Timo; Arner, Erik; Forrest, Alistair R R; Nicolas, Estelle; McCaffrey, Timothy A; Carninci, Piero; Hayashizaki, Yoshihide; Wahlestedt, Claes; Kapranov, Philipp

2016-04-20

Functionality of the non-coding transcripts encoded by the human genome is the coveted goal of the modern genomics research. While commonly relied on the classical methods of forward genetics, integration of different genomics datasets in a global Systems Biology fashion presents a more productive avenue of achieving this very complex aim. Here we report application of a Systems Biology-based approach to dissect functionality of a newly identified vast class of very long intergenic non-coding (vlinc) RNAs. Using highly quantitative FANTOM5 CAGE dataset, we show that these RNAs could be grouped into 1542 novel human genes based on analysis of insulators that we show here indeed function as genomic barrier elements. We show that vlinc RNAs genes likely function in cisto activate nearby genes. This effect while most pronounced in closely spaced vlinc RNA-gene pairs can be detected over relatively large genomic distances. Furthermore, we identified 101 vlinc RNA genes likely involved in early embryogenesis based on patterns of their expression and regulation. We also found another 109 such genes potentially involved in cellular functions also happening at early stages of development such as proliferation, migration and apoptosis. Overall, we show that Systems Biology-based methods have great promise for functional annotation of non-coding RNAs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Cloning, Characterization, Regulation, and Function of Dormancy-Associated MADS-Box Genes from Leafy Spurge

USDA-ARS?s Scientific Manuscript database

DORMANCY-ASSOCIATED MADS-BOX (DAM) genes are SHORT VEGETATIVE PHASE–Like MADS box transcription factors linked to endodormancy induction. We have cloned and characterized several cDNA and genomic clones of DAM genes from the model perennial weed leafy spurge (Euphorbia esula). We present evidence fo...
Proteomic analysis reveals novel extracellular virulence-associated proteins and functions regulated by the diffusible signal factor (DSF) in Xanthomonas oryzae pv. oryzicola.

PubMed

Qian, Guoliang; Zhou, Yijing; Zhao, Yancun; Song, Zhiwei; Wang, Suyan; Fan, Jiaqin; Hu, Baishi; Venturi, Vittorio; Liu, Fengquan

2013-07-05

Quorum sensing (QS) in Xanthomonas oryzae pv. oryzicola (Xoc), the causal agent of bacterial leaf streak, is mediated by the diffusible signal factor (DSF). DSF-mediating QS has been shown to control virulence and a set of virulence-related functions; however, the expression profiles and functions of extracellular proteins controlled by DSF signal remain largely unclear. In the present study, 33 DSF-regulated extracellular proteins, whose functions include small-protein mediating QS, oxidative adaptation, macromolecule metabolism, cell structure, biosynthesis of small molecules, intermediary metabolism, cellular process, protein catabolism, and hypothetical function, were identified by proteomics in Xoc. Of these, 15 protein encoding genes were in-frame deleted, and 4 of them, including three genes encoding type II secretion system (T2SS)-dependent proteins and one gene encoding an Ax21 (activator of XA21-mediated immunity)-like protein (a novel small-protein type QS signal) were determined to be required for full virulence in Xoc. The contributions of these four genes to important virulence-associated functions, including bacterial colonization, extracellular polysaccharide, cell motility, biofilm formation, and antioxidative ability, are presented. To our knowledge, our analysis is the first complete list of DSF-regulated extracellular proteins and functions in a Xanthomonas species. Our results show that DSF-type QS played critical roles in regulation of T2SS and Ax21-mediating QS, which sheds light on the role of DSF signaling in Xanthomonas.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Howe, Adina; Yang, Fan; Williams, Ryan J.

Despite the central role of soil microbial communities in global carbon (C) cycling, little is known about soil microbial community structure and even less about their metabolic pathways. Efforts to characterize soil communities often focus on identifying differences in gene content across environmental gradients, but an alternative question is what genes are similar in soils. These genes may indicate critical species or potential functions that are required in all soils. Here we identified the “core” set of C cycling sequences widely present in multiple soil metagenomes from a fertilized prairie (FP). Of 226,887 sequences associated with known enzymes involved inmore » the synthesis, metabolism, and transport of carbohydrates, 843 were identified to be consistently prevalent across four replicate soil metagenomes. This core metagenome was functionally and taxonomically diverse, representing five enzyme classes and 99 enzyme families within the CAZy database. Though it only comprised 0.4% of all CAZy-associated genes identified in FP metagenomes, the core was found to be comprised of functions similar to those within cumulative soils. The FP CAZy-associated core sequences were present in multiple publicly available soil metagenomes and most similar to soils sharing geographic proximity. As a result, in soil ecosystems, where high diversity remains a key challenge for metagenomic investigations, these core genes represent a subset of critical functions necessary for carbohydrate metabolism, which can be targeted to evaluate important C fluxes in these and other similar soils.« less
An open reading frame in intron seven of the sea urchin DNA-methyltransferase gene codes for a functional AP1 endonuclease.

PubMed

Cioffi, Anna Valentina; Ferrara, Diana; Cubellis, Maria Vittoria; Aniello, Francesco; Corrado, Marcella; Liguori, Francesca; Amoroso, Alessandro; Fucci, Laura; Branno, Margherita

2002-08-01

Analysis of the genome structure of the Paracentrotus lividus (sea urchin) DNA methyltransferase (DNA MTase) gene showed the presence of an open reading frame, named METEX, in intron 7 of the gene. METEX expression is developmentally regulated, showing no correlation with DNA MTase expression. In fact, DNA MTase transcripts are present at high concentrations in the early developmental stages, while METEX is expressed at late stages of development. Two METEX cDNA clones (Met1 and Met2) that are different in the 3' end have been isolated in a cDNA library screening. The putative translated protein from Met2 cDNA clone showed similarity with Escherichia coli endonuclease III on the basis of sequence and predictive three-dimensional structure. The protein, overexpressed in E. coli and purified, had functional properties similar to the endonuclease specific for apurinic/apyrimidinic (AP) sites on the basis of the lyase activity. Therefore the open reading frame, present in intron 7 of the P. lividus DNA MTase gene, codes for a functional AP endonuclease designated SuAP1.
dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts

PubMed Central

Vincent, Jonathan; Dai, Zhanwu; Ravel, Catherine; Choulet, Frédéric; Mouzeyar, Said; Bouzidi, M. Fouad; Agier, Marie; Martre, Pierre

2013-01-01

The functional annotation of genes based on sequence homology with genes from model species genomes is time-consuming because it is necessary to mine several unrelated databases. The aim of the present work was to develop a functional annotation database for common wheat Triticum aestivum (L.). The database, named dbWFA, is based on the reference NCBI UniGene set, an expressed gene catalogue built by expressed sequence tag clustering, and on full-length coding sequences retrieved from the TriFLDB database. Information from good-quality heterogeneous sources, including annotations for model plant species Arabidopsis thaliana (L.) Heynh. and Oryza sativa L., was gathered and linked to T. aestivum sequences through BLAST-based homology searches. Even though the complexity of the transcriptome cannot yet be fully appreciated, we developed a tool to easily and promptly obtain information from multiple functional annotation systems (Gene Ontology, MapMan bin codes, MIPS Functional Categories, PlantCyc pathway reactions and TAIR gene families). The use of dbWFA is illustrated here with several query examples. We were able to assign a putative function to 45% of the UniGenes and 81% of the full-length coding sequences from TriFLDB. Moreover, comparison of the annotation of the whole T. aestivum UniGene set along with curated annotations of the two model species assessed the accuracy of the annotation provided by dbWFA. To further illustrate the use of dbWFA, genes specifically expressed during the early cell division or late storage polymer accumulation phases of T. aestivum grain development were identified using a clustering analysis and then annotated using dbWFA. The annotation of these two sets of genes was consistent with previous analyses of T. aestivum grain transcriptomes and proteomes. Database URL: urgi.versailles.inra.fr/dbWFA/ PMID:23660284
Gene Duplication and Transference of Function in the paleoAP3 Lineage of Floral Organ Identity Genes

PubMed Central

Galimba, Kelsey D.; Martínez-Gómez, Jesús; Di Stilio, Verónica S.

2018-01-01

The floral organ identity gene APETALA3 (AP3) is a MADS-box transcription factor involved in stamen and petal identity that belongs to the B-class of the ABC model of flower development. Thalictrum (Ranunculaceae), an emerging model in the non-core eudicots, has AP3 homologs derived from both ancient and recent gene duplications. Prior work has shown that petals have been lost repeatedly and independently in Ranunculaceae in correlation with the loss of a specific AP3 paralog, and Thalictrum represents one of these instances. The main goal of this study was to conduct a functional analysis of the three AP3 orthologs present in Thalictrum thalictroides, representing the paleoAP3 gene lineage, to determine the degree of redundancy versus divergence after gene duplication. Because Thalictrum lacks petals, and has lost the petal-specific AP3, we also asked whether heterotopic expression of the remaining AP3 genes contributes to the partial transference of petal function to the first whorl found in insect-pollinated species. To address these questions, we undertook functional characterization by virus-induced gene silencing (VIGS), protein–protein interaction and binding site analyses. Our results illustrate partial redundancy among Thalictrum AP3s, with deep conservation of B-class function in stamen identity and a novel role in ectopic petaloidy of sepals. Certain aspects of petal function of the lost AP3 locus have apparently been transferred to the other paralogs. A novel result is that the protein products interact not only with each other, but also as homodimers. Evidence presented here also suggests that expression of the different ThtAP3 paralogs is tightly integrated, with an apparent disruption of B function homeostasis upon silencing of one of the paralogs that codes for a truncated protein. To explain this result, we propose two testable alternative scenarios: that the truncated protein is a dominant negative mutant or that there is a compensational response as part of a back-up circuit. The evidence for promiscuous protein–protein interactions via yeast two-hybrid combined with the detection of AP3 specific binding motifs in all B-class gene promoters provide partial support for these hypotheses. PMID:29628932
Virus induced gene silencing (VIGS) for functional analysis of wheat genes involved in Zymoseptoria tritici susceptibility and resistance.

PubMed

Lee, Wing-Sham; Rudd, Jason J; Kanyuka, Kostya

2015-06-01

Virus-induced gene silencing (VIGS) has emerged as a powerful reverse genetic technology in plants supplementary to stable transgenic RNAi and, in certain species, as a viable alternative approach for gene functional analysis. The RNA virus Barley stripe mosaic virus (BSMV) was developed as a VIGS vector in the early 2000s and since then it has been used to study the function of wheat genes. Several variants of BSMV vectors are available, with some requiring in vitro transcription of infectious viral RNA, while others rely on in planta production of viral RNA from DNA-based vectors delivered to plant cells either by particle bombardment or Agrobacterium tumefaciens. We adapted the latest generation of binary BSMV VIGS vectors for the identification and study of wheat genes of interest involved in interactions with Zymoseptoria tritici and here present detailed and the most up-to-date protocols. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Intragenic Locus in Human PIWIL2 Gene Shares Promoter and Enhancer Functions.

PubMed

Skvortsova, Yulia V; Kondratieva, Sofia A; Zinovyeva, Marina V; Nikolaev, Lev G; Azhikina, Tatyana L; Gainetdinov, Ildar V

2016-01-01

Recently, more evidence supporting common nature of promoters and enhancers has been accumulated. In this work, we present data on chromatin modifications and non-polyadenylated transcription characteristic for enhancers as well as results of in vitro luciferase reporter assays suggesting that PIWIL2 alternative promoter in exon 7 also functions as an enhancer for gene PHYHIP located 60Kb upstream. This finding of an intragenic enhancer serving as a promoter for a shorter protein isoform implies broader impact on understanding enhancer-promoter networks in regulation of gene expression.
A Genomic View of the Sea Urchin Nervous System

PubMed Central

Burke, RD; Angerer, LM; Elphick, MR; Humphrey, GW; Yaguchi, S; Kiyama, T; Liang, S; Mu, X; Agca, C; Klein, WH; Brandhorst, BP; Rowe, M; Wilson, K; Churcher, AM; Taylor, JS; Chen, N; Murray, G; Wang, D; Mellott, D; Olinski, R; Hallböök, F; Thorndyke, MC

2007-01-01

The sequencing of the Strongylocentrotus purpuratus genome provides a unique opportunity to investigate the function and evolution of neural genes. The neurobiology of sea urchins is of particular interest because they have a close phylogenetic relationship with chordates, yet a distinctive pentaradiate body plan and unusual neural organization. Orthologues of transcription factors that regulate neurogenesis in other animals have been identified and several are expressed in neurogenic domains before gastrulation indicating that they may operate near the top of a conserved neural gene regulatory network. A family of genes encoding voltage-gated ion channels is present but, surprisingly, genes encoding gap junction proteins (connexins and pannexins) appear to be absent. Genes required for synapse formation and function have been identified and genes for synthesis and transport of neurotransmitters are present. There is a large family of G-protein-coupled receptors, including 874 rhodopsin-type receptors, 28 metabotropic glutamate-like receptors and a remarkably expanded group of 161 secretin receptor-like proteins. Absence of cannabinoid, lysophospholipid and melanocortin receptors indicates that this group may be unique to chordates. There are at least 37 putative G-protein coupled peptide receptors and precursors for several neuropeptides and peptide hormones have been identified, including SALMFamides, NGFFFamide, a vasotocin-like peptide, glycoprotein hormones, and insulin/insulin-like growth factors. Identification of a neurotrophin-like gene and Trk receptor in sea urchin indicates that this neural signaling system is not unique to chordates. Several hundred chemoreceptor genes have been predicted using several approaches, a number similar to that for other animals. Intriguingly, genes encoding homologues of rhodopsin, Pax6 and several other key mammalian retinal transcription factors are expressed in tube feet, suggesting tube feet function as photosensory organs. Analysis of the sea urchin genome presents a unique perspective on the evolutionary history of deuterostome nervous systems and reveals new approaches to investigate the development and neurobiology of sea urchins. PMID:16965768

Immunoglobulin λ Gene Rearrangement Can Precede κ Gene Rearrangement

DOE PAGES

Berg, Jörg; Mcdowell, Mindy; Jäck, Hans-Martin; ...

1990-01-01

Imore » mmunoglobulin genes are generated during differentiation of B lymphocytes by joining gene segments. A mouse pre-B cell contains a functional immunoglobulin heavy-chain gene, but no light-chain gene. Although there is only one heavy-chain locus, there are two lightchain loci: κ and λ .t has been reported that κ loci in the germ-line configuration are never (in man) or very rarely (in the mouse) present in cells with functionally rearranged λ -chain genes. Two explanations have been proposed to explain this: (a) the ordered rearrangement theory, which postulates that light-chain gene rearrangement in the pre-B cell is first attempted at the κ locus, and that only upon failure to produce a functional κ chain is there an attempt to rearrange the λ locus; and (b) the stochastic theory, which postulates that rearrangement at the λ locus proceeds at a rate that is intrinsically much slower than that at the κ locus. We show here that λ -chain genes are generated whether or not the κ locus has lost its germ-line arrangement, a result that is compatible only with the stochastic theory.« less
Toward an understanding of the pathophysiology of clear cell carcinoma of the ovary (Review)

PubMed Central

UEKURI, CHIHARU; SHIGETOMI, HIROSHI; ONO, SUMIRE; SASAKI, YOSHIKAZU; MATSUURA, MIYUKI; KOBAYASHI, HIROSHI

2013-01-01

Endometriosis-associated ovarian cancers demonstrate substantial morphological and genetic diversity. The transcription factor, hepatocyte nuclear factor (HNF)-1β, may be one of several key genes involved in the identity of ovarian clear cell carcinoma (CCC). The present study reviews a considerably expanded set of HNF-1β-associated genes and proteins that determine the pathophysiology of CCC. The current literature was reviewed by searching MEDLINE/PubMed. Functional interpretations of gene expression profiling in CCC are provided. Several important CCC-related genes overlap with those known to be regulated by the upregulation of HNF-1β expression, along with a lack of estrogen receptor (ER) expression. Furthermore, the genetic expression pattern in CCC resembles that of the Arias-Stella reaction, decidualization and placentation. HNF-1β regulates a subset of progesterone target genes. HNF-1β may also act as a modulator of female reproduction, playing a role in endometrial regeneration, differentiation, decidualization, glycogen synthesis, detoxification, cell cycle regulation, implantation, uterine receptivity and a successful pregnancy. In conclusion, the present study focused on reviewing the aberrant expression of CCC-specific genes and provided an update on the pathological implications and molecular functions of well-characterized CCC-specific genes. PMID:24179489
MADS-Box gene diversity in seed plants 300 million years ago.

PubMed

Becker, A; Winter, K U; Meyer, B; Saedler, H; Theissen, G

2000-10-01

MADS-box genes encode a family of transcription factors which control diverse developmental processes in flowering plants ranging from root development to flower and fruit development. Through phylogeny reconstructions, most of these genes can be subdivided into defined monophyletic gene clades whose members share similar expression patterns and functions. Therefore, the establishment of the diversity of gene clades was probably an important event in land plant evolution. In order to determine when these clades originated, we isolated cDNAs of 19 different MADS-box genes from Gnetum gnemon, a gymnosperm model species and thus a representative of the sister group of the angiosperms. Phylogeny reconstructions involving all published MADS-box genes were then used to identify gene clades containing putative orthologs from both angiosperm and gymnosperm lineages. Thus, the minimal number of MADS-box genes that were already present in the last common ancestor of extant gymnosperms and angiosperms was determined. Comparative expression studies involving pairs of putatively orthologous genes revealed a diversity of patterns that has been largely conserved since the time when the angiosperm and gymnosperm lineages separated. Taken together, our data suggest that there were already at least seven different MADS-box genes present at the base of extant seed plants about 300 MYA. These genes were probably already quite diverse in terms of both sequence and function. In addition, our data demonstrate that the MADS-box gene families of extant gymnosperms and angiosperms are of similar complexities.
Prior knowledge based mining functional modules from Yeast PPI networks with gene ontology

PubMed Central

2010-01-01

Background In the literature, there are fruitful algorithmic approaches for identification functional modules in protein-protein interactions (PPI) networks. Because of accumulation of large-scale interaction data on multiple organisms and non-recording interaction data in the existing PPI database, it is still emergent to design novel computational techniques that can be able to correctly and scalably analyze interaction data sets. Indeed there are a number of large scale biological data sets providing indirect evidence for protein-protein interaction relationships. Results The main aim of this paper is to present a prior knowledge based mining strategy to identify functional modules from PPI networks with the aid of Gene Ontology. Higher similarity value in Gene Ontology means that two gene products are more functionally related to each other, so it is better to group such gene products into one functional module. We study (i) to encode the functional pairs into the existing PPI networks; and (ii) to use these functional pairs as pairwise constraints to supervise the existing functional module identification algorithms. Topology-based modularity metric and complex annotation in MIPs will be used to evaluate the identified functional modules by these two approaches. Conclusions The experimental results on Yeast PPI networks and GO have shown that the prior knowledge based learning methods perform better than the existing algorithms. PMID:21172053
Microarray analysis reveals key genes and pathways in Tetralogy of Fallot

PubMed Central

He, Yue-E; Qiu, Hui-Xian; Jiang, Jian-Bing; Wu, Rong-Zhou; Xiang, Ru-Lian; Zhang, Yuan-Hai

2017-01-01

The aim of the present study was to identify key genes that may be involved in the pathogenesis of Tetralogy of Fallot (TOF) using bioinformatics methods. The GSE26125 microarray dataset, which includes cardiovascular tissue samples derived from 16 children with TOF and five healthy age-matched control infants, was downloaded from the Gene Expression Omnibus database. Differential expression analysis was performed between TOF and control samples to identify differentially expressed genes (DEGs) using Student's t-test, and the R/limma package, with a log2 fold-change of >2 and a false discovery rate of <0.01 set as thresholds. The biological functions of DEGs were analyzed using the ToppGene database. The ReactomeFIViz application was used to construct functional interaction (FI) networks, and the genes in each module were subjected to pathway enrichment analysis. The iRegulon plugin was used to identify transcription factors predicted to regulate the DEGs in the FI network, and the gene-transcription factor pairs were then visualized using Cytoscape software. A total of 878 DEGs were identified, including 848 upregulated genes and 30 downregulated genes. The gene FI network contained seven function modules, which were all comprised of upregulated genes. Genes enriched in Module 1 were enriched in the following three neurological disorder-associated signaling pathways: Parkinson's disease, Alzheimer's disease and Huntington's disease. Genes in Modules 0, 3 and 5 were dominantly enriched in pathways associated with ribosomes and protein translation. The Xbox binding protein 1 transcription factor was demonstrated to be involved in the regulation of genes encoding the subunits of cytoplasmic and mitochondrial ribosomes, as well as genes involved in neurodegenerative disorders. Therefore, dysfunction of genes involved in signaling pathways associated with neurodegenerative disorders, ribosome function and protein translation may contribute to the pathogenesis of TOF. PMID:28713939
Genetic Dissection of Dendritic Cell Homeostasis and Function: Lessons from Cell Type–Specific Gene Ablation

PubMed Central

Karmaus, Peer W.F.; Chi, Hongbo

2014-01-01

Dendritic cells (DCs) are a heterogeneous cell population of great importance in the immune system. The emergence of new genetic technology utilizing the CD11c promoter and Cre recombinase has facilitated the dissection of functional significance and molecular regulation of DCs in immune responses and homeostasis in vivo. For the first time, this strategy allows observation of the effects of DC-specific gene deletion on immune system function in an intact organism. In this review, we present the latest findings from studies using the Cre recombinase system for cell type–specific deletion of key molecules that mediate DC homeostasis and function. Our focus is on the molecular pathways that orchestrate DC life span, migration, antigen presentation, pattern recognition, and cytokine production and signaling. PMID:24366237
Comparative whole genome transcriptome and metabolome analyses of five Klebsiella pneumonia strains.

PubMed

Lee, Soojin; Kim, Borim; Yang, Jeongmo; Jeong, Daun; Park, Soohyun; Shin, Sang Heum; Kook, Jun Ho; Yang, Kap-Seok; Lee, Jinwon

2015-11-01

The integration of transcriptomics and metabolomics can provide precise information on gene-to-metabolite networks for identifying the function of novel genes. The goal of this study was to identify novel gene functions involved in 2,3-butanediol (2,3-BDO) biosynthesis by a comprehensive analysis of the transcriptome and metabolome of five mutated Klebsiella pneumonia strains (∆wabG = SGSB100, ∆wabG∆budA = SGSB106, ∆wabG∆budB = SGSB107, ∆wabG∆budC = SGSB108, ∆wabG∆budABC = SGSB109). First, the transcriptomes of all five mutants were analyzed and the genes exhibiting reproducible changes in expression were determined. The transcriptome was well conserved among the five strains, and differences in gene expression occurred mainly in genes coding for 2,3-BDO biosynthesis (budA, budB, and budC) and the genes involved in the degradation of reactive oxygen, biosynthesis and transport of arginine, cysteine biosynthesis, sulfur metabolism, oxidoreductase reaction, and formate dehydrogenase reaction. Second, differences in the metabolome (estimated by carbon distribution, CO2 emission, and redox balance) among the five mutant strains due to gene alteration of the 2,3-BDO operon were detected. The functional genomics approach integrating metabolomics and transcriptomics in K. Pneumonia presented here provides an innovative means of identifying novel gene functions involved in 2,3-BDO biosynthesis metabolism and whole cell metabolism.
Genome-wide analysis of the structural genes regulating defense phenylpropanoid metabolism in Populus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tschaplinski, Timothy J; Tsai, Chung-Jui; Harding, Scott A

Salicin-based phenolic glycosides, hydroxycinnamate derivatives and flavonoid-derived condensed tannins comprise up to one-third of Populus leaf dry mass. Genes regulating the abundance and chemical diversity of these substances have not been comprehensively analysed in tree species exhibiting this metabolically demanding level of phenolic metabolism. Here, shikimate-phenylpropanoid pathway genes thought to give rise to these phenolic products were annotated from the Populus genome, their expression assessed by semiquantitative or quantitative reverse transcription polymerase chain reaction (PCR), and metabolic evidence for function presented. Unlike Arabidopsis, Populus leaves accumulate an array of hydroxycinnamoyl-quinate esters, which is consistent with broadened function of the expandedmore » hydroxycinnamoyl-CoA transferase gene family. Greater flavonoid pathway diversity is also represented, and flavonoid gene families are larger. Consistent with expanded pathway function, most of these genes were upregulated during wound-stimulated condensed tannin synthesis in leaves. The suite of Populus genes regulating phenylpropanoid product accumulation should have important application in managing phenolic carbon pools in relation to climate change and global carbon cycling.« less
Decoding directional genetic dependencies through orthogonal CRISPR/Cas screens | Office of Cancer Genomics

Cancer.gov

Genetic interaction studies are a powerful approach to identify functional interactions between genes. This approach can reveal networks of regulatory hubs and connect uncharacterized genes to well-studied pathways. However, this approach has previously been limited to simple gene inactivation studies. Here, we present an orthogonal CRISPR/Cas-mediated genetic interaction approach that allows the systematic activation of one gene while simultaneously knocking out a second gene in the same cell.
A Sorghum bicolor expression atlas reveals dynamic genotype-specific expression profiles for vegetative tissues of grain, sweet and bioenergy sorghums

DOE Office of Scientific and Technical Information (OSTI.GOV)

Shakoor, N; Nair, R; Crasta, O

2014-01-23

Background: Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. Results: This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specificmore » probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e. g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Conclusions: Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community.« less
Versatile types of polysaccharide-based supramolecular polycation/pDNA nanoplexes for gene delivery

NASA Astrophysics Data System (ADS)

Hu, Yang; Zhao, Nana; Yu, Bingran; Liu, Fusheng; Xu, Fu-Jian

2014-06-01

Different polysaccharide-based supramolecular polycations were readily synthesized by assembling multiple β-cyclodextrin-cored star polycations with an adamantane-functionalized dextran via host-guest interaction in the absence or presence of bioreducible linkages. Compared with nanoplexes of the starting star polycation and pDNA, the supramolecular polycation/pDNA nanoplexes exhibited similarly low cytotoxicity, improved cellular internalization and significantly higher gene transfection efficiencies. The incorporation of disulfide linkages imparted the supramolecular polycation/pDNA nanoplexes with the advantage of intracellular bioreducibility, resulting in better gene delivery properties. In addition, the antitumor properties of supramolecular polycation/pDNA nanoplexes were also investigated using a suicide gene therapy system. The present study demonstrates that the proper assembly of cyclodextrin-cored polycations with adamantane-functionalized polysaccharides is an effective strategy for the production of new nanoplex delivery systems.Different polysaccharide-based supramolecular polycations were readily synthesized by assembling multiple β-cyclodextrin-cored star polycations with an adamantane-functionalized dextran via host-guest interaction in the absence or presence of bioreducible linkages. Compared with nanoplexes of the starting star polycation and pDNA, the supramolecular polycation/pDNA nanoplexes exhibited similarly low cytotoxicity, improved cellular internalization and significantly higher gene transfection efficiencies. The incorporation of disulfide linkages imparted the supramolecular polycation/pDNA nanoplexes with the advantage of intracellular bioreducibility, resulting in better gene delivery properties. In addition, the antitumor properties of supramolecular polycation/pDNA nanoplexes were also investigated using a suicide gene therapy system. The present study demonstrates that the proper assembly of cyclodextrin-cored polycations with adamantane-functionalized polysaccharides is an effective strategy for the production of new nanoplex delivery systems. Electronic supplementary information (ESI) available: 1H NMR assay and synthetic route of Dex-Ad and Dex-SS-Ad. See DOI: 10.1039/c4nr01590h
Nuclear functions of prefoldin

PubMed Central

Millán-Zambrano, Gonzalo; Chávez, Sebastián

2014-01-01

Prefoldin is a cochaperone, present in all eukaryotes, that cooperates with the chaperonin CCT. It is known mainly for its functional relevance in the cytoplasmic folding of actin and tubulin monomers during cytoskeleton assembly. However, both canonical and prefoldin-like subunits of this heterohexameric complex have also been found in the nucleus, and are functionally connected with nuclear processes in yeast and metazoa. Plant prefoldin has also been detected in the nucleus and physically associated with a gene regulator. In this review, we summarize the information available on the involvement of prefoldin in nuclear phenomena, place special emphasis on gene transcription, and discuss the possibility of a global coordination between gene regulation and cytoplasmic dynamics mediated by prefoldin. PMID:25008233
Genome-wide STAT3 binding analysis after histone deacetylase inhibition reveals novel target genes in dendritic cells

PubMed Central

Sun, Yaping; Iyer, Matthew; McEachin, Richard; Zhao, Meng; Wu, Yi-Mi; Cao, Xuhong; Oravecz-Wilson, Katherine; Zajac, Cynthia; Mathewson, Nathan; Wu, Shin-Rong Julia; Rossi, Corinne; Toubai, Tomomi; Qin, Zhaohui S.; Chinnaiya, Arul M.; Reddy, Pavan

2016-01-01

STAT3 is a master transcriptional regulator that plays an important role in the induction of both immune activation and immune tolerance in dendritic cells (DCs). The transcriptional targets of STAT3 in promoting DC activation are becoming increasingly understood; however, the mechanisms underpinning its role in causing DC suppression remain largely unknown. To determine the functional gene targets of STAT3, we compared the genome-wide binding of STAT3 using ChIP-seq coupled with gene expression microarrays to determine STAT3-dependent gene regulation in DCs after histone deacetylase (HDAC) inhibition. HDAC inhibition boosted the ability of STAT3 to bind to distinct DNA targets and regulate gene expression. Among the top 500 STAT3 binding sites, the frequency of canonical motifs was significantly higher than that of non-canonical motifs. Functional analysis revealed that after treatment with an HDAC inhibitor, the upregulated STAT3 target genes were those that were primarily the negative regulators of pro-inflammatory cytokines and those in the IL-10 signaling pathway. The downregulated STAT3-dependent targets were those involved in immune effector processes and antigen processing/presentation. The expression and functional relevance of these genes were validated. Specifically, functional studies confirmed that the upregulation of IL-10Ra by STAT3 contributed to the suppressive function of DCs following HDAC inhibition. PMID:27866206
Bioinformatics-Based Identification of Candidate Genes from QTLs Associated with Cell Wall Traits in Populus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ranjan, Priya; Yin, Tongming; Zhang, Xinye

2009-11-01

Quantitative trait locus (QTL) studies are an integral part of plant research and are used to characterize the genetic basis of phenotypic variation observed in structured populations and inform marker-assisted breeding efforts. These QTL intervals can span large physical regions on a chromosome comprising hundreds of genes, thereby hampering candidate gene identification. Genome history, evolution, and expression evidence can be used to narrow the genes in the interval to a smaller list that is manageable for detailed downstream functional genomics characterization. Our primary motivation for the present study was to address the need for a research methodology that identifies candidatemore » genes within a broad QTL interval. Here we present a bioinformatics-based approach for subdividing candidate genes within QTL intervals into alternate groups of high probability candidates. Application of this approach in the context of studying cell wall traits, specifically lignin content and S/G ratios of stem and root in Populus plants, resulted in manageable sets of genes of both known and putative cell wall biosynthetic function. These results provide a roadmap for future experimental work leading to identification of new genes controlling cell wall recalcitrance and, ultimately, in the utility of plant biomass as an energy feedstock.« less
Characterization and expression of the cytochrome P450 gene family in diamondback moth, Plutella xylostella (L.).

PubMed

Yu, Liying; Tang, Weiqi; He, Weiyi; Ma, Xiaoli; Vasseur, Liette; Baxter, Simon W; Yang, Guang; Huang, Shiguo; Song, Fengqin; You, Minsheng

2015-03-10

Cytochrome P450 monooxygenases are present in almost all organisms and can play vital roles in hormone regulation, metabolism of xenobiotics and in biosynthesis or inactivation of endogenous compounds. In the present study, a genome-wide approach was used to identify and analyze the P450 gene family of diamondback moth, Plutella xylostella, a destructive worldwide pest of cruciferous crops. We identified 85 putative cytochrome P450 genes from the P. xylostella genome, including 84 functional genes and 1 pseudogene. These genes were classified into 26 families and 52 subfamilies. A phylogenetic tree constructed with three additional insect species shows extensive gene expansions of P. xylostella P450 genes from clans 3 and 4. Gene expression of cytochrome P450s was quantified across multiple developmental stages (egg, larva, pupa and adult) and tissues (head and midgut) using P. xylostella strains susceptible or resistant to insecticides chlorpyrifos and fiprinol. Expression of the lepidopteran specific CYP367s predominantly occurred in head tissue suggesting a role in either olfaction or detoxification. CYP340s with abundant transposable elements and relatively high expression in the midgut probably contribute to the detoxification of insecticides or plant toxins in P. xylostella. This study will facilitate future functional studies of the P. xylostella P450s in detoxification.
Characterization and expression of the cytochrome P450 gene family in diamondback moth, Plutella xylostella (L.)

PubMed Central

Yu, Liying; Tang, Weiqi; He, Weiyi; Ma, Xiaoli; Vasseur, Liette; Baxter, Simon W.; Yang, Guang; Huang, Shiguo; Song, Fengqin; You, Minsheng

2015-01-01

Cytochrome P450 monooxygenases are present in almost all organisms and can play vital roles in hormone regulation, metabolism of xenobiotics and in biosynthesis or inactivation of endogenous compounds. In the present study, a genome-wide approach was used to identify and analyze the P450 gene family of diamondback moth, Plutella xylostella, a destructive worldwide pest of cruciferous crops. We identified 85 putative cytochrome P450 genes from the P. xylostella genome, including 84 functional genes and 1 pseudogene. These genes were classified into 26 families and 52 subfamilies. A phylogenetic tree constructed with three additional insect species shows extensive gene expansions of P. xylostella P450 genes from clans 3 and 4. Gene expression of cytochrome P450s was quantified across multiple developmental stages (egg, larva, pupa and adult) and tissues (head and midgut) using P. xylostella strains susceptible or resistant to insecticides chlorpyrifos and fiprinol. Expression of the lepidopteran specific CYP367s predominantly occurred in head tissue suggesting a role in either olfaction or detoxification. CYP340s with abundant transposable elements and relatively high expression in the midgut probably contribute to the detoxification of insecticides or plant toxins in P. xylostella. This study will facilitate future functional studies of the P. xylostella P450s in detoxification. PMID:25752830
Identification of possible genetic polymorphisms involved in cancer cachexia: a systematic review.

PubMed

Tan, Benjamin H L; Ross, James A; Kaasa, Stein; Skorpen, Frank; Fearon, Kenneth C H

2011-04-01

Cancer cachexia is a polygenic and complex syndrome. Genetic variations in regulation of the inflammatory response, muscle and fat metabolic pathways, and pathways in appetite regulation are likely to contribute to the susceptibility or resistance to developing cancer cachexia. A systematic search of Medline and EmBase databases, covering 1986-2008 was performed for potential candidate genes/genetic polymorphisms relating to cancer cachexia. Related genes were then identified using pathway functional analysis software. All candidate genes were reviewed for functional polymorphisms or clinically significant polymorphisms associated with cachexia using the OMIM and GeneRIF databases. Genes with variants which had functional or clinical associations with cachexia and replicated in at least one study were entered into pathway analysis software to reveal possible network associations between genes. A total of 184 polymorphisms with functional or clinical relevance to cancer cachexia were identified in 92 candidate genes. Of these, 42 polymorphisms (in 33 genes) were replicated in more than one study with 13 polymorphisms found to influence two or more hallmarks of cachexia (i.e. inflammation, loss of fat mass and/or lean mass and reduced survival). Thirty-three genes were found to be significantly interconnected in two major networks with four genes (ADIPOQ, IL6, NFKB1 and TLR4) interlinking both networks. Selection of candidate genes and polymorphisms is a key element of multigene study design. The present study provides an initial framework to select genes/polymorphisms for further study in cancer cachexia, and to develop their potential as susceptibility biomarkers of developing cachexia.
Literature and patent analysis of the cloning and identification of human functional genes in China.

PubMed

Xia, Yan; Tang, LiSha; Yao, Lei; Wan, Bo; Yang, XianMei; Yu, Long

2012-03-01

The Human Genome Project was launched at the end of the 1980s. Since then, the cloning and identification of functional genes has been a major focus of research across the world. In China too, the potentially profound impact of such studies on the life sciences and on human health was realized, and relevant studies were initiated in the 1990s. To advance China's involvement in the Human Genome Project, in the mid-1990s, Committee of Experts in Biology from National High Technology Research and Development Program of China (863 Program) proposed the "two 1%" goal. This goal envisaged China contributing 1% of the total sequencing work, and cloning and identifying 1% of the total human functional genes. Over the past 20 years, tremendous achievement has been accomplished by Chinese scientists. It is well known that scientists in China finished the 1% of sequencing work of the Human Genome Project, whereas, there is no comprehensive report about "whether China had finished cloning and identifying 1% of human functional genes". In the present study, the GenBank database at the National Center of Biotechnology Information, the PubMed search tool, and the patent database of the State Intellectual Property Office, China, were used to retrieve entries based on two screening standards: (i) Were the newly cloned and identified genes first reported by Chinese scientists? (ii) Were the Chinese scientists awarded the gene sequence patent? Entries were retrieved from the databases up to the cut-off date of 30 June 2011 and the obtained data were analyzed further. The results showed that 589 new human functional genes were first reported by Chinese scientists and 159 gene sequences were patented (http://gene.fudan.sh.cn/introduction/database/chinagene/chinagene.html). This study systematically summarizes China's contributions to human functional genomics research and answers the question "has China finished cloning and identifying 1% of human functional genes?" in the affirmative.
GEsture: an online hand-drawing tool for gene expression pattern search.

PubMed

Wang, Chunyan; Xu, Yiqing; Wang, Xuelin; Zhang, Li; Wei, Suyun; Ye, Qiaolin; Zhu, Youxiang; Yin, Hengfu; Nainwal, Manoj; Tanon-Reyes, Luis; Cheng, Feng; Yin, Tongming; Ye, Ning

2018-01-01

Gene expression profiling data provide useful information for the investigation of biological function and process. However, identifying a specific expression pattern from extensive time series gene expression data is not an easy task. Clustering, a popular method, is often used to classify similar expression genes, however, genes with a 'desirable' or 'user-defined' pattern cannot be efficiently detected by clustering methods. To address these limitations, we developed an online tool called GEsture. Users can draw, or graph a curve using a mouse instead of inputting abstract parameters of clustering methods. GEsture explores genes showing similar, opposite and time-delay expression patterns with a gene expression curve as input from time series datasets. We presented three examples that illustrate the capacity of GEsture in gene hunting while following users' requirements. GEsture also provides visualization tools (such as expression pattern figure, heat map and correlation network) to display the searching results. The result outputs may provide useful information for researchers to understand the targets, function and biological processes of the involved genes.
Disruption of DNA methylation-dependent long gene repression in Rett syndrome

PubMed Central

Gabel, Harrison W.; Kinde, Benyam Z.; Stroud, Hume; Gilbert, Caitlin S.; Harmin, David A.; Kastan, Nathaniel R.; Hemberg, Martin; Ebert, Daniel H.; Greenberg, Michael E.

2015-01-01

Disruption of the MECP2 gene leads to Rett syndrome (RTT), a severe neurological disorder with features of autism1. MECP2 encodes a methyl-DNA-binding protein2 that has been proposed to function as a transcriptional repressor, but despite numerous studies examining neuronal gene expression in Mecp2 mutants, no clear model has emerged for how MeCP2 regulates transcription3–9. Here we identify a genome-wide length-dependent increase in gene expression in MeCP2 mutant mouse models and human RTT brains. We present evidence that MeCP2 represses gene expression by binding to methylated CA sites within long genes, and that in neurons lacking MeCP2, decreasing the expression of long genes attenuates RTT-associated cellular deficits. In addition, we find that long genes as a population are enriched for neuronal functions and selectively expressed in the brain. These findings suggest that mutations in MeCP2 may cause neurological dysfunction by specifically disrupting long gene expression in the brain. PMID:25762136

Refinement of light-responsive transcript lists using rice oligonucleotide arrays: evaluation of gene-redundancy.

PubMed

Jung, Ki-Hong; Dardick, Christopher; Bartley, Laura E; Cao, Peijian; Phetsom, Jirapa; Canlas, Patrick; Seo, Young-Su; Shultz, Michael; Ouyang, Shu; Yuan, Qiaoping; Frank, Bryan C; Ly, Eugene; Zheng, Li; Jia, Yi; Hsia, An-Ping; An, Kyungsook; Chou, Hui-Hsien; Rocke, David; Lee, Geun Cheol; Schnable, Patrick S; An, Gynheung; Buell, C Robin; Ronald, Pamela C

2008-10-06

Studies of gene function are often hampered by gene-redundancy, especially in organisms with large genomes such as rice (Oryza sativa). We present an approach for using transcriptomics data to focus functional studies and address redundancy. To this end, we have constructed and validated an inexpensive and publicly available rice oligonucleotide near-whole genome array, called the rice NSF45K array. We generated expression profiles for light- vs. dark-grown rice leaf tissue and validated the biological significance of the data by analyzing sources of variation and confirming expression trends with reverse transcription polymerase chain reaction. We examined trends in the data by evaluating enrichment of gene ontology terms at multiple false discovery rate thresholds. To compare data generated with the NSF45K array with published results, we developed publicly available, web-based tools (www.ricearray.org). The Oligo and EST Anatomy Viewer enables visualization of EST-based expression profiling data for all genes on the array. The Rice Multi-platform Microarray Search Tool facilitates comparison of gene expression profiles across multiple rice microarray platforms. Finally, we incorporated gene expression and biochemical pathway data to reduce the number of candidate gene products putatively participating in the eight steps of the photorespiration pathway from 52 to 10, based on expression levels of putatively functionally redundant genes. We confirmed the efficacy of this method to cope with redundancy by correctly predicting participation in photorespiration of a gene with five paralogs. Applying these methods will accelerate rice functional genomics.
An Overview of Hox Genes in Lophotrochozoa: Evolution and Functionality

PubMed Central

Barucca, Marco; Canapa, Adriana; Biscotti, Maria Assunta

2016-01-01

Hox genes are regulators of animal embryonic development. Changes in the number and sequence of Hox genes as well as in their expression patterns have been related to the evolution of the body plan. Lophotrochozoa is a clade of Protostomia characterized by several phyla which show a wide morphological diversity. Despite that the works summarized in this review emphasize the fragmentary nature of the data available regarding the presence and expression of Hox genes, they also offer interesting insight into the evolution of the Hox cluster and the role played by Hox genes in several phyla. However, the number of genes involved in the cluster of the lophotrochozoan ancestor is still a question of debate. The data presented here suggest that at least nine genes were present while two other genes, Lox4 and Post-2, may either have been present in the ancestor or may have arisen as a result of duplication in the Brachiopoda-Mollusca-Annelida lineage. Spatial and temporal collinearity is a feature of Hox gene expression which was probably present in the ancestor of deuterostomes and protostomes. However, in Lophotrochozoa, it has been detected in only a few species belonging to Annelida and Mollusca. PMID:29615580
A genetic method for sex determination in Ovis spp. by interruption of the zinc finger protein, Y-linked (ZFY) gene on the Y chromosome.

PubMed

Zhang, Yong Sheng; Du, Ying Chun; Sun, Li Rong; Wang, Xu Hai; Liu, Shuai Bing; Xi, Ji Feng; Li, Chao Cheng; Ying, Rui Wen; Jiang, Song; Wang, Xiang Zu; Shen, Hong; Jia, Bin

2018-03-06

The mammalian Y chromosome plays a critical role in spermatogenesis. However, the exact functions of each gene on the Y chromosome have not been completely elucidated, due, in part, to difficulties in gene targeting analysis of the Y chromosome. The zinc finger protein, Y-linked (ZFY) gene was first proposed to be a sex determination factor, although its function in spermatogenesis has recently been elucidated. Nevertheless, ZFY gene targeting analysis has not been performed to date. In the present study, RNA interference (RNAi) was used to generate ZFY-interrupted Hu sheep by injecting short hairpin RNA (shRNA) into round spermatids. The resulting spermatozoa exhibited abnormal sperm morphology, including spermatozoa without tails and others with head and tail abnormalities. Quantitative real-time polymerase chain reaction analysis showed that ZFY mRNA expression was decreased significantly in Hu sheep with interrupted ZFY compared with wild-type Hu sheep. The sex ratio of lambs also exhibited a bias towards females. Together, the experimental strategy and findings of the present study reveal that ZFY also functions in spermatogenesis in Hu sheep and facilitate the use of RNAi in the control of sex in Hu sheep.
De novo characterisation of the greenlip abalone transcriptome (Haliotis laevigata) with a focus on the heat shock protein 70 (HSP70) family.

PubMed

Shiel, Brett P; Hall, Nathan E; Cooke, Ira R; Robinson, Nicholas A; Strugnell, Jan M

2015-02-01

Abalone (Haliotis) are economically important molluscs for fisheries and aquaculture industries worldwide. Despite this, genomic resources for abalone and molluscs are still limited. Here we present a description and functional annotation of the greenlip abalone (Haliotis laevigata) transcriptome. We present a focused analysis on the heat shock protein 70 (HSP70) family of genes with putative functions affecting temperature stress and immunity. A total of ~38 million paired end Illumina reads were obtained, resulting in a Trinity assembly of 222,172 contigs with minimum length of 200 base pairs and maximum length of 33 kilobases. The 20,702 contigs were annotated with gene descriptions by BLAST. We created a program to maximise the number of functionally annotated genes, and over 10,000 contigs were assigned Gene ontologies (GO terms). By using CateGOrizer, immunity related GO terms for stressors such as heat, hypoxia, oxidative stress and wounding received the highest counts. Twenty-six contigs with homology to the HSP70 family of genes were identified. Ninety-one putative single-nucleotide polymorphisms were observed in the abalone HSP70 contigs. Eleven of these were considered non-synonymous. The annotated transcriptome described in this study will be a useful basis for future work investigating the genetic response of abalone to stress.
Dlx homeobox gene family expression in osteoclasts.

PubMed

Lézot, F; Thomas, B L; Blin-Wakkach, C; Castaneda, B; Bolanos, A; Hotton, D; Sharpe, P T; Heymann, D; Carles, G F; Grigoriadis, A E; Berdal, A

2010-06-01

Skeletal growth and homeostasis require the finely orchestrated secretion of mineralized tissue matrices by highly specialized cells, balanced with their degradation by osteoclasts. Time- and site-specific expression of Dlx and Msx homeobox genes in the cells secreting these matrices have been identified as important elements in the regulation of skeletal morphology. Such specific expression patterns have also been reported in osteoclasts for Msx genes. The aim of the present study was to establish the expression patterns of Dlx genes in osteoclasts and identify their function in regulating skeletal morphology. The expression patterns of all Dlx genes were examined during the whole osteoclastogenesis using different in vitro models. The results revealed that Dlx1 and Dlx2 are the only Dlx family members with a possible function in osteoclastogenesis as well as in mature osteoclasts. Dlx5 and Dlx6 were detected in the cultures but appear to be markers of monocytes and their derivatives. In vivo, Dlx2 expression in osteoclasts was examined using a Dlx2/LacZ transgenic mouse. Dlx2 is expressed in a subpopulation of osteoclasts in association with tooth, brain, nerve, and bone marrow volumetric growths. Altogether the present data suggest a role for Dlx2 in regulation of skeletal morphogenesis via functions within osteoclasts. (c) 2010 Wiley-Liss, Inc.
Studying Individual Plant AOX Gene Functionality in Early Growth Regulation: A New Approach.

PubMed

Arnholdt-Schmitt, Birgit; Patil, Vinod Kumar

2017-01-01

AOX1 and AOX2 genes are thought to play different physiological roles. Whereas AOX1 is typically expected to associate to stress and growth responses, AOX2 was more often found to be linked to development and housekeeping functions. However, this view is questioned by several adverse observations. For example, co-regulated expression for DcAOX1 and DcAOX2a genes was recently reported during growth induction in carrot (Daucus carota L.). Early expression peaks for both genes during the lag phase of growth coincided with a critical time point for biomass prediction, a result achieved by applying calorespirometry. The effect of both AOX family member genes cannot easily be separated. However, separate functional analysis is required in order to identify important gene-specific polymorphisms or patterns of polymorphisms for functional marker development and its use in breeding. Specifically, a methodology is missing that enables studying functional effects of individual genes or polymorphisms/polymorphic patterns on early growth regulation.This protocol aims to provide the means for identifying plant alternative oxidase (AOX) gene variants as functional markers for early growth regulation. Prerequisite for applying this protocol is available Schizosaccharomyces pombe strains that were transformed with individual AOX genes following published protocols from Anthony Moore's group (Albury et al., J Biol Chem 271:17062-17066, 1996; Affourtit et al., J Biol Chem 274:6212-6218, 1999). The novelty of the present protocol comes by modifying yeast cell densities in a way that allows studying critical qualitative and quantitative effects of AOX gene variants (isoenzymes or polymorphic genes) during the early phase of growth. Calorimetry is used as a novel tool to confirm differences obtained by optical density measurements in early growth regulation by metabolic phenotyping (released heat rates). This protocol enables discriminating between AOX genes that inhibit growth and AOX genes that enhance growth under comparable conditions. It also allows studying dependency of AOX gene effects on gene copy number. The protocol can also be combined with laser microdissection of individual cells from target tissues for specified breeding traits.
Genome-wide investigation and expression analyses of WD40 protein family in the model plant foxtail millet (Setaria italica L.).

PubMed

Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Khan, Yusuf; Parida, Swarup Kumar; Prasad, Manoj

2014-01-01

WD40 proteins play a crucial role in diverse protein-protein interactions by acting as scaffolding molecules and thus assisting in the proper activity of proteins. Hence, systematic characterization and expression profiling of these WD40 genes in foxtail millet would enable us to understand the networks of WD40 proteins and their biological processes and gene functions. In the present study, a genome-wide survey was conducted and 225 potential WD40 genes were identified. Phylogenetic analysis categorized the WD40 proteins into 5 distinct sub-families (I-V). Gene Ontology annotation revealed the biological roles of the WD40 proteins along with its cellular components and molecular functions. In silico comparative mapping with sorghum, maize and rice demonstrated the orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of WD40 genes. Estimation of synonymous and non-synonymous substitution rates revealed its evolutionary significance in terms of gene-duplication and divergence. Expression profiling against abiotic stresses provided novel insights into specific and/or overlapping expression patterns of SiWD40 genes. Homology modeling enabled three-dimensional structure prediction was performed to understand the molecular functions of WD40 proteins. Although, recent findings had shown the importance of WD40 domains in acting as hubs for cellular networks during many biological processes, it has invited a lesser research attention unlike other common domains. Being a most promiscuous interactors, WD40 domains are versatile in mediating critical cellular functions and hence this genome-wide study especially in the model crop foxtail millet would serve as a blue-print for functional characterization of WD40s in millets and bioenergy grass species. In addition, the present analyses would also assist the research community in choosing the candidate WD40s for comprehensive studies towards crop improvement of millets and biofuel grasses.
Genome-Wide Investigation and Expression Analyses of WD40 Protein Family in the Model Plant Foxtail Millet (Setaria italica L.)

PubMed Central

Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Khan, Yusuf; Parida, Swarup Kumar; Prasad, Manoj

2014-01-01

WD40 proteins play a crucial role in diverse protein-protein interactions by acting as scaffolding molecules and thus assisting in the proper activity of proteins. Hence, systematic characterization and expression profiling of these WD40 genes in foxtail millet would enable us to understand the networks of WD40 proteins and their biological processes and gene functions. In the present study, a genome-wide survey was conducted and 225 potential WD40 genes were identified. Phylogenetic analysis categorized the WD40 proteins into 5 distinct sub-families (I–V). Gene Ontology annotation revealed the biological roles of the WD40 proteins along with its cellular components and molecular functions. In silico comparative mapping with sorghum, maize and rice demonstrated the orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of WD40 genes. Estimation of synonymous and non-synonymous substitution rates revealed its evolutionary significance in terms of gene-duplication and divergence. Expression profiling against abiotic stresses provided novel insights into specific and/or overlapping expression patterns of SiWD40 genes. Homology modeling enabled three-dimensional structure prediction was performed to understand the molecular functions of WD40 proteins. Although, recent findings had shown the importance of WD40 domains in acting as hubs for cellular networks during many biological processes, it has invited a lesser research attention unlike other common domains. Being a most promiscuous interactors, WD40 domains are versatile in mediating critical cellular functions and hence this genome-wide study especially in the model crop foxtail millet would serve as a blue-print for functional characterization of WD40s in millets and bioenergy grass species. In addition, the present analyses would also assist the research community in choosing the candidate WD40s for comprehensive studies towards crop improvement of millets and biofuel grasses. PMID:24466268
Differential network analysis reveals the genome-wide landscape of estrogen receptor modulation in hormonal cancers

PubMed Central

Hsiao, Tzu-Hung; Chiu, Yu-Chiao; Hsu, Pei-Yin; Lu, Tzu-Pin; Lai, Liang-Chuan; Tsai, Mong-Hsun; Huang, Tim H.-M.; Chuang, Eric Y.; Chen, Yidong

2016-01-01

Several mutual information (MI)-based algorithms have been developed to identify dynamic gene-gene and function-function interactions governed by key modulators (genes, proteins, etc.). Due to intensive computation, however, these methods rely heavily on prior knowledge and are limited in genome-wide analysis. We present the modulated gene/gene set interaction (MAGIC) analysis to systematically identify genome-wide modulation of interaction networks. Based on a novel statistical test employing conjugate Fisher transformations of correlation coefficients, MAGIC features fast computation and adaption to variations of clinical cohorts. In simulated datasets MAGIC achieved greatly improved computation efficiency and overall superior performance than the MI-based method. We applied MAGIC to construct the estrogen receptor (ER) modulated gene and gene set (representing biological function) interaction networks in breast cancer. Several novel interaction hubs and functional interactions were discovered. ER+ dependent interaction between TGFβ and NFκB was further shown to be associated with patient survival. The findings were verified in independent datasets. Using MAGIC, we also assessed the essential roles of ER modulation in another hormonal cancer, ovarian cancer. Overall, MAGIC is a systematic framework for comprehensively identifying and constructing the modulated interaction networks in a whole-genome landscape. MATLAB implementation of MAGIC is available for academic uses at https://github.com/chiuyc/MAGIC. PMID:26972162
Acetylcholinesterase genes within the Diptera: takeover and loss in true flies

PubMed Central

Huchard, Elise; Martinez, Michel; Alout, Haoues; Douzery, Emmanuel J.P; Lutfalla, Georges; Berthomieu, Arnaud; Berticat, Claire; Raymond, Michel; Weill, Mylène

2006-01-01

It has recently been reported that the synaptic acetylcholinesterase (AChE) in mosquitoes is encoded by the ace-1 gene, distinct and divergent from the ace-2 gene, which performs this function in Drosophila. This is an unprecedented situation within the Diptera order because both ace genes derive from an old duplication and are present in most insects and arthropods. Nevertheless, Drosophila possesses only the ace-2 gene. Thus, a secondary loss occurred during the evolution of Diptera, implying a vital function switch from one gene (ace-1) to the other (ace-2). We sampled 78 species, representing 50 families (27% of the Dipteran families) spread over all major subdivisions of the Diptera, and looked for ace-1 and ace-2 by systematic PCR screening to determine which taxonomic groups within the Diptera have this gene change. We show that this loss probably extends to all true flies (or Cyclorrhapha), a large monophyletic group of the Diptera. We also show that ace-2 plays a non-detectable role in the synaptic AChE in a lower Diptera species, suggesting that it has non-synaptic functions. A relative molecular evolution rate test showed that the intensity of purifying selection on ace-2 sequences is constant across the Diptera, irrespective of the presence or absence of ace-1, confirming the evolutionary importance of non-synaptic functions for this gene. We discuss the evolutionary scenarios for the takeover of ace-2 and the loss of ace-1, taking into account our limited knowledge of non-synaptic functions of ace genes and some specific adaptations of true flies. PMID:17002944
GFD-Net: A novel semantic similarity methodology for the analysis of gene networks.

PubMed

Díaz-Montaña, Juan J; Díaz-Díaz, Norberto; Gómez-Vela, Francisco

2017-04-01

Since the popularization of biological network inference methods, it has become crucial to create methods to validate the resulting models. Here we present GFD-Net, the first methodology that applies the concept of semantic similarity to gene network analysis. GFD-Net combines the concept of semantic similarity with the use of gene network topology to analyze the functional dissimilarity of gene networks based on Gene Ontology (GO). The main innovation of GFD-Net lies in the way that semantic similarity is used to analyze gene networks taking into account the network topology. GFD-Net selects a functionality for each gene (specified by a GO term), weights each edge according to the dissimilarity between the nodes at its ends and calculates a quantitative measure of the network functional dissimilarity, i.e. a quantitative value of the degree of dissimilarity between the connected genes. The robustness of GFD-Net as a gene network validation tool was demonstrated by performing a ROC analysis on several network repositories. Furthermore, a well-known network was analyzed showing that GFD-Net can also be used to infer knowledge. The relevance of GFD-Net becomes more evident in Section "GFD-Net applied to the study of human diseases" where an example of how GFD-Net can be applied to the study of human diseases is presented. GFD-Net is available as an open-source Cytoscape app which offers a user-friendly interface to configure and execute the algorithm as well as the ability to visualize and interact with the results(http://apps.cytoscape.org/apps/gfdnet). Copyright © 2017 Elsevier Inc. All rights reserved.
On the Use of Gene Ontology Annotations to Assess Functional Similarity among Orthologs and Paralogs: A Short Report

PubMed Central

Thomas, Paul D.; Wood, Valerie; Mungall, Christopher J.; Lewis, Suzanna E.; Blake, Judith A.

2012-01-01

A recent paper (Nehrt et al., PLoS Comput. Biol. 7:e1002073, 2011) has proposed a metric for the “functional similarity” between two genes that uses only the Gene Ontology (GO) annotations directly derived from published experimental results. Applying this metric, the authors concluded that paralogous genes within the mouse genome or the human genome are more functionally similar on average than orthologous genes between these genomes, an unexpected result with broad implications if true. We suggest, based on both theoretical and empirical considerations, that this proposed metric should not be interpreted as a functional similarity, and therefore cannot be used to support any conclusions about the “ortholog conjecture” (or, more properly, the “ortholog functional conservation hypothesis”). First, we reexamine the case studies presented by Nehrt et al. as examples of orthologs with divergent functions, and come to a very different conclusion: they actually exemplify how GO annotations for orthologous genes provide complementary information about conserved biological functions. We then show that there is a global ascertainment bias in the experiment-based GO annotations for human and mouse genes: particular types of experiments tend to be performed in different model organisms. We conclude that the reported statistical differences in annotations between pairs of orthologous genes do not reflect differences in biological function, but rather complementarity in experimental approaches. Our results underscore two general considerations for researchers proposing novel types of analysis based on the GO: 1) that GO annotations are often incomplete, potentially in a biased manner, and subject to an “open world assumption” (absence of an annotation does not imply absence of a function), and 2) that conclusions drawn from a novel, large-scale GO analysis should whenever possible be supported by careful, in-depth examination of examples, to help ensure the conclusions have a justifiable biological basis. PMID:22359495
Integrative and conjugative elements and their hosts: composition, distribution and organization.

PubMed

Cury, Jean; Touchon, Marie; Rocha, Eduardo P C

2017-09-06

Conjugation of single-stranded DNA drives horizontal gene transfer between bacteria and was widely studied in conjugative plasmids. The organization and function of integrative and conjugative elements (ICE), even if they are more abundant, was only studied in a few model systems. Comparative genomics of ICE has been precluded by the difficulty in finding and delimiting these elements. Here, we present the results of a method that circumvents these problems by requiring only the identification of the conjugation genes and the species' pan-genome. We delimited 200 ICEs and this allowed the first large-scale characterization of these elements. We quantified the presence in ICEs of a wide set of functions associated with the biology of mobile genetic elements, including some that are typically associated with plasmids, such as partition and replication. Protein sequence similarity networks and phylogenetic analyses revealed that ICEs are structured in functional modules. Integrases and conjugation systems have different evolutionary histories, even if the gene repertoires of ICEs can be grouped in function of conjugation types. Our characterization of the composition and organization of ICEs paves the way for future functional and evolutionary analyses of their cargo genes, composed of a majority of unknown function genes. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genomic-scale measurement of mRNA turnover and the mechanisms of action of the anti-cancer drug flavopiridol.

PubMed

Lam, L T; Pickeral, O K; Peng, A C; Rosenwald, A; Hurt, E M; Giltnane, J M; Averett, L M; Zhao, H; Davis, R E; Sathyamoorthy, M; Wahl, L M; Harris, E D; Mikovits, J A; Monks, A P; Hollingshead, M G; Sausville, E A; Staudt, L M

2001-01-01

Flavopiridol, a flavonoid currently in cancer clinical trials, inhibits cyclin-dependent kinases (CDKs) by competitively blocking their ATP-binding pocket. However, the mechanism of action of flavopiridol as an anti-cancer agent has not been fully elucidated. Using DNA microarrays, we found that flavopiridol inhibited gene expression broadly, in contrast to two other CDK inhibitors, roscovitine and 9-nitropaullone. The gene expression profile of flavopiridol closely resembled the profiles of two transcription inhibitors, actinomycin D and 5,6-dichloro-1-beta-D-ribofuranosyl-benzimidazole (DRB), suggesting that flavopiridol inhibits transcription globally. We were therefore able to use flavopiridol to measure mRNA turnover rates comprehensively and we found that different functional classes of genes had distinct distributions of mRNA turnover rates. In particular, genes encoding apoptosis regulators frequently had very short half-lives, as did several genes encoding key cell-cycle regulators. Strikingly, genes that were transcriptionally inducible were disproportionately represented in the class of genes with rapid mRNA turnover. The present genomic-scale measurement of mRNA turnover uncovered a regulatory logic that links gene function with mRNA half-life. The observation that transcriptionally inducible genes often have short mRNA half-lives demonstrates that cells have a coordinated strategy to rapidly modulate the mRNA levels of these genes. In addition, the present results suggest that flavopiridol may be more effective against types of cancer that are highly dependent on genes with unstable mRNAs.
SoyNet: a database of co-functional networks for soybean Glycine max.

PubMed

Kim, Eiru; Hwang, Sohyun; Lee, Insuk

2017-01-04

Soybean (Glycine max) is a legume crop with substantial economic value, providing a source of oil and protein for humans and livestock. More than 50% of edible oils consumed globally are derived from this crop. Soybean plants are also important for soil fertility, as they fix atmospheric nitrogen by symbiosis with microorganisms. The latest soybean genome annotation (version 2.0) lists 56 044 coding genes, yet their functional contributions to crop traits remain mostly unknown. Co-functional networks have proven useful for identifying genes that are involved in a particular pathway or phenotype with various network algorithms. Here, we present SoyNet (available at www.inetbio.org/soynet), a database of co-functional networks for G. max and a companion web server for network-based functional predictions. SoyNet maps 1 940 284 co-functional links between 40 812 soybean genes (72.8% of the coding genome), which were inferred from 21 distinct types of genomics data including 734 microarrays and 290 RNA-seq samples from soybean. SoyNet provides a new route to functional investigation of the soybean genome, elucidating genes and pathways of agricultural importance. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genome Wide Identification of Orthologous ZIP Genes Associated with Zinc and Iron Translocation in Setaria italica.

PubMed

Alagarasan, Ganesh; Dubey, Mahima; Aswathy, Kumar S; Chandel, Girish

2017-01-01

Genes in the ZIP family encode transcripts to store and transport bivalent metal micronutrient, particularly iron (Fe) and or zinc (Zn). These transcripts are important for a variety of functions involved in the developmental and physiological processes in many plant species, including most, if not all, Poaceae plant species and the model species Arabidopsis. Here, we present the report of a genome wide investigation of orthologous ZIP genes in Setaria italica and the identification of 7 single copy genes. RT-PCR shows 4 of them could be used to increase the bio-availability of zinc and iron content in grains. Of 36 ZIP members, 25 genes have traces of signal peptide based sub-cellular localization, as compared to those of plant species studied previously, yet translocation of ions remains unclear. In silico analysis of gene structure and protein nature suggests that these two were preeminent in shaping the functional diversity of the ZIP gene family in S. italica . NAC, bZIP and bHLH are the predominant Fe and Zn responsive transcription factors present in SiZIP genes. Together, our results provide new insights into the signal peptide based/independent iron and zinc translocation in the plant system and allowed identification of ZIP genes that may be involved in the zinc and iron absorption from the soil, and thus transporting it to the cereal grain underlying high micronutrient accumulation.
Gene context conservation of a higher order than operons.

PubMed

Lathe, W C; Snel, B; Bork, P

2000-10-01

Operons, co-transcribed and co-regulated contiguous sets of genes, are poorly conserved over short periods of evolutionary time. The gene order, gene content and regulatory mechanisms of operons can be very different, even in closely related species. Here, we present several lines of evidence which suggest that, although an operon and its individual genes and regulatory structures are rearranged when comparing the genomes of different species, this rearrangement is a conservative process. Genomic rearrangements invariably maintain individual genes in very specific functional and regulatory contexts. We call this conserved context an uber-operon.
Functional Metagenomics Reveals Previously Unrecognized Diversity of Antibiotic Resistance Genes in Gulls

PubMed Central

Martiny, Adam C.; Martiny, Jennifer B. H.; Weihe, Claudia; Field, Andrew; Ellis, Julie C.

2011-01-01

Wildlife may facilitate the spread of antibiotic resistance (AR) between human-dominated habitats and the surrounding environment. Here, we use functional metagenomics to survey the diversity and genomic context of AR genes in gulls. Using this approach, we found a variety of AR genes not previously detected in gulls and wildlife, including class A and C β-lactamases as well as six tetracycline resistance gene types. An analysis of the flanking sequences indicates that most of these genes are present in Enterobacteriaceae and various Gram-positive bacteria. In addition to finding known gene types, we detected 31 previously undescribed AR genes. These undescribed genes include one most similar to an uncharacterized gene in Verrucomicrobium and another to a putative DNA repair protein in Lactobacillus. Overall, the study more than doubled the number of clinically relevant AR gene types known to be carried by gulls or by wildlife in general. Together with the propensity of gulls to visit human-dominated habitats, this high diversity of AR gene types suggests that gulls could facilitate the spread of AR. PMID:22347872
Partnering for functional genomics research conference: Abstracts of poster presentations

DOE Office of Scientific and Technical Information (OSTI.GOV)

NONE

1998-06-01

This reports contains abstracts of poster presentations presented at the Functional Genomics Research Conference held April 16--17, 1998 in Oak Ridge, Tennessee. Attention is focused on the following areas: mouse mutagenesis and genomics; phenotype screening; gene expression analysis; DNA analysis technology development; bioinformatics; comparative analyses of mouse, human, and yeast sequences; and pilot projects to evaluate methodologies.
A statistical method for measuring activation of gene regulatory networks.

PubMed

Esteves, Gustavo H; Reis, Luiz F L

2018-06-13

Gene expression data analysis is of great importance for modern molecular biology, given our ability to measure the expression profiles of thousands of genes and enabling studies rooted in systems biology. In this work, we propose a simple statistical model for the activation measuring of gene regulatory networks, instead of the traditional gene co-expression networks. We present the mathematical construction of a statistical procedure for testing hypothesis regarding gene regulatory network activation. The real probability distribution for the test statistic is evaluated by a permutation based study. To illustrate the functionality of the proposed methodology, we also present a simple example based on a small hypothetical network and the activation measuring of two KEGG networks, both based on gene expression data collected from gastric and esophageal samples. The two KEGG networks were also analyzed for a public database, available through NCBI-GEO, presented as Supplementary Material. This method was implemented in an R package that is available at the BioConductor project website under the name maigesPack.

Functional and DNA-protein binding studies of WRKY transcription factors and their expression analysis in response to biotic and abiotic stress in wheat (Triticum aestivum L.).

PubMed

Satapathy, Lopamudra; Kumar, Dhananjay; Kumar, Manish; Mukhopadhyay, Kunal

2018-01-01

WRKY, a plant-specific transcription factor family, plays vital roles in pathogen defense, abiotic stress, and phytohormone signalling. Little is known about the roles and function of WRKY transcription factors in response to rust diseases in wheat. In the present study, three TaWRKY genes encoding complete protein sequences were cloned. They belonged to class II and III WRKY based on the number of WRKY domains and the pattern of zinc finger structures. Twenty-two DNA-protein binding docking complexes predicted stable interactions of WRKY domain with W-box. Quantitative real-time-PCR using wheat near-isogenic lines with or without Lr28 gene revealed differential up- or down-regulation in response to biotic and abiotic stress treatments which could be responsible for their functional divergence in wheat. TaWRKY62 was found to be induced upon treatment with JA, MJ, and SA and reduced after ABA treatments. Maximum induction of six out of seven genes occurred at 48 h post inoculation due to pathogen inoculation. Hence, TaWRKY (49, 50 , 52 , 55 , 57, and 62 ) can be considered as potential candidate genes for further functional validation as well as for crop improvement programs for stress resistance. The results of the present study will enhance knowledge towards understanding the molecular basis of mode of action of WRKY transcription factor genes in wheat and their role during leaf rust pathogenesis in particular.
Rare copy number variants in patients with congenital conotruncal heart defects.

PubMed

Xie, Hongbo M; Werner, Petra; Stambolian, Dwight; Bailey-Wilson, Joan E; Hakonarson, Hakon; White, Peter S; Taylor, Deanne M; Goldmuntz, Elizabeth

2017-03-01

Previous studies using different cardiac phenotypes, technologies and designs suggest a burden of large, rare or de novo copy number variants (CNVs) in subjects with congenital heart defects. We sought to identify disease-related CNVs, candidate genes, and functional pathways in a large number of cases with conotruncal and related defects that carried no known genetic syndrome. Cases and control samples were divided into two cohorts and genotyped to assess each subject's CNV content. Analyses were performed to ascertain differences in overall CNV prevalence and to identify enrichment of specific genes and functional pathways in conotruncal cases relative to healthy controls. Only findings present in both cohorts are presented. From 973 total conotruncal cases, a burden of rare CNVs was detected in both cohorts. Candidate genes from rare CNVs found in both cohorts were identified based on their association with cardiac development or disease, and/or their reported disruption in published studies. Functional and pathway analyses revealed significant enrichment of terms involved in either heart or early embryonic development. Our study tested one of the largest cohorts specifically with cardiac conotruncal and related defects. These results confirm and extend previous findings that CNVs contribute to disease risk for congenital heart defects in general and conotruncal defects in particular. As disease heterogeneity renders identification of single recurrent genes or loci difficult, functional pathway and gene regulation network analyses appear to be more informative. Birth Defects Research 109:271-295, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Roles of miR319 and TCP Transcription Factors in Leaf Development1[OPEN

PubMed Central

2017-01-01

Sophisticated regulation of gene expression, including microRNAs (miRNAs) and their target genes, is required for leaf differentiation, growth, and senescence. The impact of miR319 and its target TEOSINTE BRANCHED1, CYCLOIDEA, and PROLIFERATING CELL NUCLEAR ANTIGEN BINDING FACTOR (TCP) genes on leaf development has been extensively investigated, but the redundancies of these gene families often interfere with the evaluation of their function and regulation in the developmental context. Here, we present the genetic evidence of the involvement of the MIR319 and TCP gene families in Arabidopsis (Arabidopsis thaliana) leaf development. Single mutations in MIR319A and MIR319B genes moderately inhibited the formation of leaf serrations, whereas double mutations increased the extent of this inhibition and resulted in the formation of smooth leaves. Mutations in MIR319 and gain-of-function mutations in the TCP4 gene conferred resistance against miR319 and impaired the cotyledon boundary and leaf serration formation. These mutations functionally associated with CUP-SHAPED COTYLEDON genes, which regulate the cotyledon boundary and leaf serration formation. In contrast, loss-of-function mutations in miR319-targeted and nontargeted TCP genes cooperatively induced the formation of serrated leaves in addition to changes in the levels of their downstream gene transcript. Taken together, these findings demonstrate that the MIR319 and TCP gene families underlie robust and multilayer control of leaf development. This study also provides a framework toward future researches on redundant miRNAs and transcription factors in Arabidopsis and crop plants. PMID:28842549
Roles of miR319 and TCP Transcription Factors in Leaf Development.

PubMed

Koyama, Tomotsugu; Sato, Fumihiko; Ohme-Takagi, Masaru

2017-10-01

Sophisticated regulation of gene expression, including microRNAs (miRNAs) and their target genes, is required for leaf differentiation, growth, and senescence. The impact of miR319 and its target TEOSINTE BRANCHED1 , CYCLOIDEA , and PROLIFERATING CELL NUCLEAR ANTIGEN BINDING FACTOR ( TCP ) genes on leaf development has been extensively investigated, but the redundancies of these gene families often interfere with the evaluation of their function and regulation in the developmental context. Here, we present the genetic evidence of the involvement of the MIR319 and TCP gene families in Arabidopsis ( Arabidopsis thaliana ) leaf development. Single mutations in MIR319A and MIR319B genes moderately inhibited the formation of leaf serrations, whereas double mutations increased the extent of this inhibition and resulted in the formation of smooth leaves. Mutations in MIR319 and gain-of-function mutations in the TCP4 gene conferred resistance against miR319 and impaired the cotyledon boundary and leaf serration formation. These mutations functionally associated with CUP-SHAPED COTYLEDON genes, which regulate the cotyledon boundary and leaf serration formation. In contrast, loss-of-function mutations in miR319-targeted and nontargeted TCP genes cooperatively induced the formation of serrated leaves in addition to changes in the levels of their downstream gene transcript. Taken together, these findings demonstrate that the MIR319 and TCP gene families underlie robust and multilayer control of leaf development. This study also provides a framework toward future researches on redundant miRNAs and transcription factors in Arabidopsis and crop plants. © 2017 American Society of Plant Biologists. All Rights Reserved.
Mining, identification and function analysis of microRNAs and target genes in peanut (Arachis hypogaea L.).

PubMed

Zhang, Tingting; Hu, Shuhao; Yan, Caixia; Li, Chunjuan; Zhao, Xiaobo; Wan, Shubo; Shan, Shihua

2017-02-01

In the present investigation, a total of 60 conserved peanut (Arachis hypogaea L.) microRNA (miRNA) sequences, belonging to 16 families, were identified using bioinformatics methods. There were 392 target gene sequences, identified from 58 miRNAs with Target-align software and BLASTx analyses. Gene Ontology (GO) functional analysis suggested that these target genes were involved in mediating peanut growth and development, signal transduction and stress resistance. There were 55 miRNA sequences, verified employing a poly (A) tailing test, with a success rate of up to 91.67%. Twenty peanut target gene sequences were randomly selected, and the 5' rapid amplification of the cDNA ends (5'-RACE) method were used to validate the cleavage sites of these target genes. Of these, 14 (70%) peanut miRNA targets were verified by means of gel electrophoresis, cloning and sequencing. Furthermore, functional analysis and homologous sequence retrieval were conducted for target gene sequences, and 26 target genes were chosen as the objects for stress resistance experimental study. Real-time fluorescence quantitative PCR (qRT-PCR) technology was applied to measure the expression level of resistance-associated miRNAs and their target genes in peanut exposed to Aspergillus flavus (A. flavus) infection and drought stress, respectively. In consequence, 5 groups of miRNAs & targets were found accorded with the mode of miRNA negatively controlling the expression of target genes. This study, preliminarily determined the biological functions of some resistance-associated miRNAs and their target genes in peanut. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
A functional metagenomic approach for expanding the synthetic biology toolbox for biomass conversion

PubMed Central

Sommer, Morten OA; Church, George M; Dantas, Gautam

2010-01-01

Sustainable biofuel alternatives to fossil fuel energy are hampered by recalcitrance and toxicity of biomass substrates to microbial biocatalysts. To address this issue, we present a culture-independent functional metagenomic platform for mining Nature's vast enzymatic reservoir and show its relevance to biomass conversion. We performed functional selections on 4.7 Gb of metagenomic fosmid libraries and show that genetic elements conferring tolerance toward seven important biomass inhibitors can be identified. We select two metagenomic fosmids that improve the growth of Escherichia coli by 5.7- and 6.9-fold in the presence of inhibitory concentrations of syringaldehyde and 2-furoic acid, respectively, and identify the individual genes responsible for these tolerance phenotypes. Finally, we combine the individual genes to create a three-gene construct that confers tolerance to mixtures of these important biomass inhibitors. This platform presents a route for expanding the repertoire of genetic elements available to synthetic biology and provides a starting point for efforts to engineer robust strains for biofuel generation. PMID:20393580
Genome-wide survey and characterization of the WRKY gene family in Populus trichocarpa.

PubMed

He, Hongsheng; Dong, Qing; Shao, Yuanhua; Jiang, Haiyang; Zhu, Suwen; Cheng, Beijiu; Xiang, Yan

2012-07-01

WRKY transcription factors participate in diverse physiological and developmental processes in plants. They have highly conserved WRKYGQK amino acid sequences in their N-termini, followed by the novel zinc-finger-like motifs, Cys₂His₂ or Cys₂HisCys. To date, numerous WRKY genes have been identified and characterized in a number of herbaceous species. Survey and characterization of WRKY genes in a ligneous species would facilitate a better understanding of the evolutionary processes and functions of this gene family. In this study, 104 poplar WRKY genes (PtWRKY) were identified in the latest poplar genome sequence. According to their structural features, the predicted members were divided into the previously defined groups I-III, as described in rice. In addition, chromosomal localization of the genes demonstrated that there might be WRKY gene hot spots in 2.3 Mb regions on chromosome 14. Furthermore, approximately 83% (86 out of 104) WRKY genes participated in gene duplication events, including 69% (29 out of 42) gene pairs which exhibited segmental duplication. Using semi-quantitative RT-PCR, the expression patterns of subgroup III genes were investigated under different stresses [cold, drought, salinity and salicylic acid (SA)]. The data revealed that these genes presented different expression levels in response to various stress conditions. Expression analysis exhibited PtWRKY76 gene induced markedly in 0.1 mM SA or 25% PEG-6000 treatment. The results presented here provide a fundamental clue for cloning specific function genes in further studies and applications. This study identified 104 poplar WRKY genes and demonstrated WRKY gene hot spots on chromosome 14. Furthermore, semi-quantitative RT-PCR showed variable stress responses in subgroup III.
A functional gene cluster for toxoflavin biosynthesis in the genome of the soil bacterium Pseudomonas protegens Pf-5

USDA-ARS?s Scientific Manuscript database

Toxoflavin is a broad-spectrum toxin best known for its role in virulence of Burkholderia glumae, which causes panicle blight of rice. A gene cluster containing homologs of toxoflavin biosynthesis genes (toxA-E) of B. glumae is present in the genome of Pseudomonas protegens Pf-5, a biological contr...
Transcriptomic data analysis and differential gene expression of antioxidant pathways in king penguin juveniles (Aptenodytes patagonicus) before and after acclimatization to marine life.

PubMed

Rey, Benjamin; Dégletagne, Cyril; Duchamp, Claude

2016-12-01

In this article, we present differentially expressed gene profiles in the pectoralis muscle of wild juvenile king penguins that were either naturally acclimated to cold marine environment or experimentally immersed in cold water as compared with penguin juveniles that never experienced cold water immersion. Transcriptomic data were obtained by hybridizing penguins total cDNA on Affymetrix GeneChip Chicken Genome arrays and analyzed using maxRS algorithm , " Transcriptome analysis in non-model species: a new method for the analysis of heterologous hybridization on microarrays " (Dégletagne et al., 2010) [1] . We focused on genes involved in multiple antioxidant pathways. For better clarity, these differentially expressed genes were clustered into six functional groups according to their role in controlling redox homeostasis. The data are related to a comprehensive research study on the ontogeny of antioxidant functions in king penguins, "Hormetic response triggers multifaceted anti-oxidant strategies in immature king penguins (Aptenodytes patagonicus)" (Rey et al., 2016) [2] . The raw microarray dataset supporting the present analyses has been deposited at the Gene Expression Omnibus (GEO) repository under accessions GEO: GSE17725 and GEO: GSE82344.
Evolution and functional divergence of NLRP genes in mammalian reproductive systems

PubMed Central

2009-01-01

Background NLRPs (Nucleotide-binding oligomerization domain, Leucine rich Repeat and Pyrin domain containing Proteins) are members of NLR (Nod-like receptors) protein family. Recent researches have shown that NLRP genes play important roles in both mammalian innate immune system and reproductive system. Several of NLRP genes were shown to be specifically expressed in the oocyte in mammals. The aim of the present work was to study how these genes evolved and diverged after their duplication, as well as whether natural selection played a role during their evolution. Results By using in silico methods, we have evaluated the evolution and functional divergence of NLRP genes, in particular of mouse reproduction-related Nlrp genes. We found that (1) major NLRP genes have been duplicated before the divergence of mammals, with certain lineage-specific duplications in primates (NLRP7 and 11) and in rodents (Nlrp1, 4 and 9 duplicates); (2) tandem duplication events gave rise to a mammalian reproduction-related NLRP cluster including NLRP2, 4, 5, 7, 8, 9, 11, 13 and 14 genes; (3) the function of mammalian oocyte-specific NLRP genes (NLRP4, 5, 9 and 14) might have diverged during gene evolution; (4) recent segmental duplications concerning Nlrp4 copies and vomeronasal 1 receptor encoding genes (V1r) have been undertaken in the mouse; and (5) duplicates of Nlrp4 and 9 in the mouse might have been subjected to adaptive evolution. Conclusion In conclusion, this study brings us novel information on the evolution of mammalian reproduction-related NLRPs. On the one hand, NLRP genes duplicated and functionally diversified in mammalian reproductive systems (such as NLRP4, 5, 9 and 14). On the other hand, during evolution, different lineages adapted to develop their own NLRP genes, particularly in reproductive function (such as the specific expansion of Nlrp4 and Nlrp9 in the mouse). PMID:19682372
Identifying Novel Helix–Loop–Helix Genes in Caenorhabditis elegans through a Classroom Demonstration of Functional Genomics

PubMed Central

Griffin, Vernetta; McMiller, Tracee; Jones, Erika; Johnson, Casonya M.

2003-01-01

A 14-week, undergraduate-level Genetics and Population Biology course at Morgan State University was modified to include a demonstration of functional genomics in the research laboratory. Students performed a rudimentary sequence analysis of the Caenorhabditis elegans genome and further characterized three sequences that were predicted to encode helix–loop–helix proteins. Students then used reverse transcription–polymerase chain reaction to determine which of the three genes is normally expressed in C. elegans. At the end of this laboratory activity, students were 1) to demonstrate a rudimentary knowledge of bioinformatics, including the ability to differentiate between “having” a gene and “expressing” a gene, and 2) to understand basic approaches to functional genomics, including one specific technique for assaying for gene expression. It was also anticipated that students would increase their skills at effectively communicating their research activities through written and/or oral presentation. This article describes the laboratory activity and the assessment of the effectiveness of the activity. PMID:12822036
GenCLiP 2.0: a web server for functional clustering of genes and construction of molecular networks based on free terms.

PubMed

Wang, Jia-Hong; Zhao, Ling-Feng; Lin, Pei; Su, Xiao-Rong; Chen, Shi-Jun; Huang, Li-Qiang; Wang, Hua-Feng; Zhang, Hai; Hu, Zhen-Fu; Yao, Kai-Tai; Huang, Zhong-Xi

2014-09-01

Identifying biological functions and molecular networks in a gene list and how the genes may relate to various topics is of considerable value to biomedical researchers. Here, we present a web-based text-mining server, GenCLiP 2.0, which can analyze human genes with enriched keywords and molecular interactions. Compared with other similar tools, GenCLiP 2.0 offers two unique features: (i) analysis of gene functions with free terms (i.e. any terms in the literature) generated by literature mining or provided by the user and (ii) accurate identification and integration of comprehensive molecular interactions from Medline abstracts, to construct molecular networks and subnetworks related to the free terms. http://ci.smu.edu.cn. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
An Arabidopsis gene regulatory network for secondary cell wall synthesis

DOE PAGES

Taylor-Teeples, M.; Lin, L.; de Lucas, M.; ...

2014-12-24

The plant cell wall is an important factor for determining cell shape, function and response to the environment. Secondary cell walls, such as those found in xylem, are composed of cellulose, hemicelluloses and lignin and account for the bulk of plant biomass. The coordination between transcriptional regulation of synthesis for each polymer is complex and vital to cell function. A regulatory hierarchy of developmental switches has been proposed, although the full complement of regulators remains unknown. In this paper, we present a protein–DNA network between Arabidopsis thaliana transcription factors and secondary cell wall metabolic genes with gene expression regulated bymore » a series of feed-forward loops. This model allowed us to develop and validate new hypotheses about secondary wall gene regulation under abiotic stress. Distinct stresses are able to perturb targeted genes to potentially promote functional adaptation. Finally, these interactions will serve as a foundation for understanding the regulation of a complex, integral plant component.« less
Disturbed Glucose Metabolism in Rat Neurons Exposed to Cerebrospinal Fluid Obtained from Multiple Sclerosis Subjects

PubMed Central

Mathur, Deepali; María-Lafuente, Eva; Ureña-Peralta, Juan R.; Sorribes, Lucas; Hernández, Alberto; Casanova, Bonaventura; López-Rodas, Gerardo; Coret-Ferrer, Francisco; Burgal-Marti, Maria

2017-01-01

Axonal damage is widely accepted as a major cause of permanent functional disability in Multiple Sclerosis (MS). In relapsing-remitting MS, there is a possibility of remyelination by myelin producing cells and restoration of neurological function. The purpose of this study was to delineate the pathophysiological mechanisms underpinning axonal injury through hitherto unknown factors present in cerebrospinal fluid (CSF) that may regulate axonal damage, remyelinate the axon and make functional recovery possible. We employed primary cultures of rat unmyelinated cerebellar granule neurons and treated them with CSF obtained from MS and Neuromyelitis optica (NMO) patients. We performed microarray gene expression profiling to study changes in gene expression in treated neurons as compared to controls. Additionally, we determined the influence of gene-gene interaction upon the whole metabolic network in our experimental conditions using the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) program. Our findings revealed the downregulated expression of genes involved in glucose metabolism in MS-derived CSF-treated neurons and upregulated expression of genes in NMO-derived CSF-treated neurons. We conclude that factors in the CSF of these patients caused a perturbation in metabolic gene(s) expression and suggest that MS appears to be linked with metabolic deformity. PMID:29267205
VTCdb: a gene co-expression database for the crop species Vitis vinifera (grapevine).

PubMed

Wong, Darren C J; Sweetman, Crystal; Drew, Damian P; Ford, Christopher M

2013-12-16

Gene expression datasets in model plants such as Arabidopsis have contributed to our understanding of gene function and how a single underlying biological process can be governed by a diverse network of genes. The accumulation of publicly available microarray data encompassing a wide range of biological and environmental conditions has enabled the development of additional capabilities including gene co-expression analysis (GCA). GCA is based on the understanding that genes encoding proteins involved in similar and/or related biological processes may exhibit comparable expression patterns over a range of experimental conditions, developmental stages and tissues. We present an open access database for the investigation of gene co-expression networks within the cultivated grapevine, Vitis vinifera. The new gene co-expression database, VTCdb (http://vtcdb.adelaide.edu.au/Home.aspx), offers an online platform for transcriptional regulatory inference in the cultivated grapevine. Using condition-independent and condition-dependent approaches, grapevine co-expression networks were constructed using the latest publicly available microarray datasets from diverse experimental series, utilising the Affymetrix Vitis vinifera GeneChip (16 K) and the NimbleGen Grape Whole-genome microarray chip (29 K), thus making it possible to profile approximately 29,000 genes (95% of the predicted grapevine transcriptome). Applications available with the online platform include the use of gene names, probesets, modules or biological processes to query the co-expression networks, with the option to choose between Affymetrix or Nimblegen datasets and between multiple co-expression measures. Alternatively, the user can browse existing network modules using interactive network visualisation and analysis via CytoscapeWeb. To demonstrate the utility of the database, we present examples from three fundamental biological processes (berry development, photosynthesis and flavonoid biosynthesis) whereby the recovered sub-networks reconfirm established plant gene functions and also identify novel associations. Together, we present valuable insights into grapevine transcriptional regulation by developing network models applicable to researchers in their prioritisation of gene candidates, for on-going study of biological processes related to grapevine development, metabolism and stress responses.
Genome-wide analysis of TCP family in tobacco.

PubMed

Chen, L; Chen, Y Q; Ding, A M; Chen, H; Xia, F; Wang, W F; Sun, Y H

2016-05-23

The TCP family is a transcription factor family, members of which are extensively involved in plant growth and development as well as in signal transduction in the response against many physiological and biochemical stimuli. In the present study, 61 TCP genes were identified in tobacco (Nicotiana tabacum) genome. Bioinformatic methods were employed for predicting and analyzing the gene structure, gene expression, phylogenetic analysis, and conserved domains of TCP proteins in tobacco. The 61 NtTCP genes were divided into three diverse groups, based on the division of TCP genes in tomato and Arabidopsis, and the results of the conserved domain and sequence analyses further confirmed the classification of the NtTCP genes. The expression pattern of NtTCP also demonstrated that majority of these genes play important roles in all the tissues, while some special genes exercise their functions only in specific tissues. In brief, the comprehensive and thorough study of the TCP family in other plants provides sufficient resources for studying the structure and functions of TCPs in tobacco.
Rice choline monooxygenase (OsCMO) protein functions in enhancing glycine betaine biosynthesis in transgenic tobacco but does not accumulate in rice (Oryza sativa L. ssp. japonica).

PubMed

Luo, Di; Niu, Xiangli; Yu, Jinde; Yan, Jun; Gou, Xiaojun; Lu, Bao-Rong; Liu, Yongsheng

2012-09-01

Glycine betaine (GB) is a compatible quaternary amine that enables plants to tolerate abiotic stresses, including salt, drought and cold. In plants, GB is synthesized through two-step of successive oxidations from choline, catalyzed by choline monooxygenase (CMO) and betaine aldehyde dehydrogenase (BADH), respectively. Rice is considered as a typical non-GB accumulating species, although the entire genome sequencing revealed rice contains orthologs of both CMO and BADH. Several studies unraveled that rice has a functional BADH gene, but whether rice CMO gene (OsCMO) is functional or a pseudogene remains to be elucidated. In the present study, we report the functional characterization of rice CMO gene. The OsCMO gene was isolated from rice cv. Nipponbare (Oryza sativa L. ssp. japonica) using RT-PCR. Northern blot demonstrated the transcription of OsCMO is enhanced by salt stress. Transgenic tobacco plants overexpressing OsCMO results in increased GB content and elevated tolerance to salt stress. Immunoblotting analysis demonstrates that a functional OsCMO protein with correct size was present in transgenic tobacco but rarely accumulated in wild-type rice plants. Surprisingly, a large amount of truncated proteins derived from OsCMO was induced in the rice seedlings in response to salt stresses. This suggests that it is the lack of a functional OsCMO protein that presumably results in non-GB accumulation in the tested rice plant. Expression and transgenic studies demonstrate OsCMO is transcriptionally induced in response to salt stress and functions in increasing glycinebetaine accumulation and enhancing tolerance to salt stress. Immunoblotting analysis suggests that no accumulation of glycinebetaine in the Japonica rice plant presumably results from lack of a functional OsCMO protein.
CATH-Gene3D: Generation of the Resource and Its Use in Obtaining Structural and Functional Annotations for Protein Sequences.

PubMed

Dawson, Natalie L; Sillitoe, Ian; Lees, Jonathan G; Lam, Su Datt; Orengo, Christine A

2017-01-01

This chapter describes the generation of the data in the CATH-Gene3D online resource and how it can be used to study protein domains and their evolutionary relationships. Methods will be presented for: comparing protein structures, recognizing homologs, predicting domain structures within protein sequences, and subclassifying superfamilies into functionally pure families, together with a guide on using the webpages.
Systems Level Approaches to Understanding and Manipulating Heterocyst Differentiation in Nostoc Punctiforme: Sites of Hydrogenase and Nitrogenase Synthesis and Activity

DOE Office of Scientific and Technical Information (OSTI.GOV)

Meeks, John C.

Heterocysts are specialized cells that establish a physiologically low oxygen concentration; they function as the sites of oxygen-sensitive nitrogen fixation and hydrogen metabolism in certain filamentous cyanobacteria. They are present at a frequency of less than 10% of the cells and singly in a nonrandom spacing pattern in the filaments. The extent of differential gene expression during heterocyst differentiation was defined by DNA microarray analysis in wild type and mutant cultures of Nostoc punctiforme. The results in wild-type cultures identified two groups of genes; approximately 440 that are unique to heterocyst formation and function, and 500 that respond positively andmore » negatively to the transient stress of nitrogen starvation. Nitrogen fixation is initiated within 24 h after induction, but the cultures require another 24 h before growth is reinitiated. Microarray analyses were conducted on strains with altered expression of three genes that regulate the presence and spacing of heterocysts in the filaments; loss of function or over expression of these genes increases the heterocyst frequency 2 to 3 fold compared to the wild-type. Mutations in the genes hetR and hetF result in the inability to differentiate heterocysts, whereas over expression of each gene individually yields multiple contiguous heterocysts at sites in the filaments; they are positive regulatory elements. Mutation of the gene patN results in an increase in heterocysts frequency, but, in this case, the heterocysts are singly spaced in the filaments with a decrease in the number of vegetative cells in the interval between heterocysts; this is a negative regulatory element. However, over expression of patN resulted in the wild-type heterocyst frequency and spacing pattern. Microarray results indicated HetR and HetF influence the transcription of a common set of about 395 genes, as well as about 350 genes unique to each protein. HetR is known to be a transcriptional regulator and HetF is predicted to be a protease, perhaps operating thorough stability of HetR; thus, the influence of HetF on transcription of a unique set of genes was unanticipated. These two proteins are also found in non-heterocyst-forming filamentous cyanobacteria and the results have implications on their other physiological role(s). The PatN protein is unique to heterocyst-forming cyanobacteria. Cytological analysis indicated PatN is present in only one of the two daughter cells following division, but is present in both cell less than 8 h after division. Microarray analysis indicated only five genes were differentially transcribed in the patN mutant compared to the wild type; three up-regulated genes that are known to influence heterocyst differentiation and two down-regulated genes that have an unassigned function. Mutational analyses indicted the two down-regulated genes do not have a distinct role in heterocyst differentiation. Thus, PatN only indirectly impacts transcription. These databases provide lists of differentially transcribed genes involved in nitrogen starvation and cellular differentiation that can be mined for detailed genetic analysis of the regulation of heterocyst formation and function for subsequent photo-biohydrogen production.« less
Efficient computation of the phylogenetic likelihood function on multi-gene alignments and multi-core architectures.

PubMed

Stamatakis, Alexandros; Ott, Michael

2008-12-27

The continuous accumulation of sequence data, for example, due to novel wet-laboratory techniques such as pyrosequencing, coupled with the increasing popularity of multi-gene phylogenies and emerging multi-core processor architectures that face problems of cache congestion, poses new challenges with respect to the efficient computation of the phylogenetic maximum-likelihood (ML) function. Here, we propose two approaches that can significantly speed up likelihood computations that typically represent over 95 per cent of the computational effort conducted by current ML or Bayesian inference programs. Initially, we present a method and an appropriate data structure to efficiently compute the likelihood score on 'gappy' multi-gene alignments. By 'gappy' we denote sampling-induced gaps owing to missing sequences in individual genes (partitions), i.e. not real alignment gaps. A first proof-of-concept implementation in RAXML indicates that this approach can accelerate inferences on large and gappy alignments by approximately one order of magnitude. Moreover, we present insights and initial performance results on multi-core architectures obtained during the transition from an OpenMP-based to a Pthreads-based fine-grained parallelization of the ML function.

Improving microbial fitness in the mammalian gut by in vivo temporal functional metagenomics

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yaung, Stephanie J.; Deng, Luxue; Li, Ning

Elucidating functions of commensal microbial genes in the mammalian gut is challenging because many commensals are recalcitrant to laboratory cultivation and genetic manipulation. We present Temporal FUnctional Metagenomics sequencing (TFUMseq), a platform to functionally mine bacterial genomes for genes that contribute to fitness of commensal bacteria in vivo. Our approach uses metagenomic DNA to construct large-scale heterologous expression libraries that are tracked over time in vivo by deep sequencing and computational methods. To demonstrate our approach, we built a TFUMseq plasmid library using the gut commensal Bacteroides thetaiotaomicron (Bt) and introduced Escherichia coli carrying this library into germfree mice. Populationmore » dynamics of library clones revealed Bt genes conferring significant fitness advantages in E. coli over time, including carbohydrate utilization genes, with a Bt galactokinase central to early colonization, and subsequent dominance by a Bt glycoside hydrolase enabling sucrose metabolism coupled with co-evolution of the plasmid library and E. coli genome driving increased galactose utilization. Here, our findings highlight the utility of functional metagenomics for engineering commensal bacteria with improved properties, including expanded colonization capabilities in vivo.« less
Improving microbial fitness in the mammalian gut by in vivo temporal functional metagenomics

DOE PAGES

Yaung, Stephanie J.; Deng, Luxue; Li, Ning; ...

2015-03-11

Elucidating functions of commensal microbial genes in the mammalian gut is challenging because many commensals are recalcitrant to laboratory cultivation and genetic manipulation. We present Temporal FUnctional Metagenomics sequencing (TFUMseq), a platform to functionally mine bacterial genomes for genes that contribute to fitness of commensal bacteria in vivo. Our approach uses metagenomic DNA to construct large-scale heterologous expression libraries that are tracked over time in vivo by deep sequencing and computational methods. To demonstrate our approach, we built a TFUMseq plasmid library using the gut commensal Bacteroides thetaiotaomicron (Bt) and introduced Escherichia coli carrying this library into germfree mice. Populationmore » dynamics of library clones revealed Bt genes conferring significant fitness advantages in E. coli over time, including carbohydrate utilization genes, with a Bt galactokinase central to early colonization, and subsequent dominance by a Bt glycoside hydrolase enabling sucrose metabolism coupled with co-evolution of the plasmid library and E. coli genome driving increased galactose utilization. Here, our findings highlight the utility of functional metagenomics for engineering commensal bacteria with improved properties, including expanded colonization capabilities in vivo.« less
Functional modules by relating protein interaction networks and gene expression.

PubMed

Tornow, Sabine; Mewes, H W

2003-11-01

Genes and proteins are organized on the basis of their particular mutual relations or according to their interactions in cellular and genetic networks. These include metabolic or signaling pathways and protein interaction, regulatory or co-expression networks. Integrating the information from the different types of networks may lead to the notion of a functional network and functional modules. To find these modules, we propose a new technique which is based on collective, multi-body correlations in a genetic network. We calculated the correlation strength of a group of genes (e.g. in the co-expression network) which were identified as members of a module in a different network (e.g. in the protein interaction network) and estimated the probability that this correlation strength was found by chance. Groups of genes with a significant correlation strength in different networks have a high probability that they perform the same function. Here, we propose evaluating the multi-body correlations by applying the superparamagnetic approach. We compare our method to the presently applied mean Pearson correlations and show that our method is more sensitive in revealing functional relationships.
Functional modules by relating protein interaction networks and gene expression

PubMed Central

Tornow, Sabine; Mewes, H. W.

2003-01-01

Genes and proteins are organized on the basis of their particular mutual relations or according to their interactions in cellular and genetic networks. These include metabolic or signaling pathways and protein interaction, regulatory or co-expression networks. Integrating the information from the different types of networks may lead to the notion of a functional network and functional modules. To find these modules, we propose a new technique which is based on collective, multi-body correlations in a genetic network. We calculated the correlation strength of a group of genes (e.g. in the co-expression network) which were identified as members of a module in a different network (e.g. in the protein interaction network) and estimated the probability that this correlation strength was found by chance. Groups of genes with a significant correlation strength in different networks have a high probability that they perform the same function. Here, we propose evaluating the multi-body correlations by applying the superparamagnetic approach. We compare our method to the presently applied mean Pearson correlations and show that our method is more sensitive in revealing functional relationships. PMID:14576317
FunGene: the functional gene pipeline and repository.

PubMed

Fish, Jordan A; Chai, Benli; Wang, Qiong; Sun, Yanni; Brown, C Titus; Tiedje, James M; Cole, James R

2013-01-01

Ribosomal RNA genes have become the standard molecular markers for microbial community analysis for good reasons, including universal occurrence in cellular organisms, availability of large databases, and ease of rRNA gene region amplification and analysis. As markers, however, rRNA genes have some significant limitations. The rRNA genes are often present in multiple copies, unlike most protein-coding genes. The slow rate of change in rRNA genes means that multiple species sometimes share identical 16S rRNA gene sequences, while many more species share identical sequences in the short 16S rRNA regions commonly analyzed. In addition, the genes involved in many important processes are not distributed in a phylogenetically coherent manner, potentially due to gene loss or horizontal gene transfer. While rRNA genes remain the most commonly used markers, key genes in ecologically important pathways, e.g., those involved in carbon and nitrogen cycling, can provide important insights into community composition and function not obtainable through rRNA analysis. However, working with ecofunctional gene data requires some tools beyond those required for rRNA analysis. To address this, our Functional Gene Pipeline and Repository (FunGene; http://fungene.cme.msu.edu/) offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences. Additional FunGene tools are specialized to process coding gene amplicon data. For example, FrameBot produces frameshift-corrected protein and DNA sequences from raw reads while finding the most closely related protein reference sequence. These tools can help provide better insight into microbial communities by directly studying key genes involved in important ecological processes.
Profiling Hyporheic Microbial Community Nitrogen Cycle and Carbohydrate Active Enzyme Gene Abundances across Seasons

NASA Astrophysics Data System (ADS)

Nelson, W. C.; Graham, E.; Stegen, J.

2016-12-01

The hyporheic zone (HZ) is the permanently inundated sediment layer between a surface channel and adjacent groundwater-saturated sediments. It has been hypothesized to play a major role in macronutrient (C, N, P) cycling in rivers. The correlation between community taxonomic composition dynamics and functional gene representation is poorly understood for hyporheic communities. To explore how microbial communities respond to temporal changes in environmental conditions, metagenomes were derived from communities captured in sterile sandpacks deployed within the HZ of the Columbia River. HMM databases were used to enumerate protein families present. Functional classification of reads allowed a general assessment of community function over time, while targeted assembly of specific genes enabled investigation of the diversity of organisms encoding these functions. Preliminary analysis of nitrogen cycle pathways shows most gene families examined to have quite steady representation across seasons, with most observed changes being less than an order of magnitude. Analysis of ammonia oxidation genes showed bacterial ammonia oxidizers (AOB) to be stably present across the year, while the archaeal amoA gene increased in late summer, peaking sharply in November, mirroring results from 16S rRNA amplicon analysis which showed an increase in Thaumarcheal OTUs during that same period. Most glycosyl hydrolase GH families had low representation. Highly abundant classes of GH included the GH94 (beta-glucosidase), GH95 (1-2-alpha-L-fucosidase) and GH103 (lytic transglycosylase) families, suggesting activity on plant, fungus and insect polysaccharides and peptidoglycans. Further work is investigating the taxonomy of the sequences identified, to determine how changes in the community composition contribute to the stable gene family profiles observed. These results are intended to work towards a greater understanding of the role of species diversity and functional redundancy in the dynamics of community composition in response to changes in environmental conditions and stochastic processes. In addition, it will serve as a foundation enabling modeling of generalized microbial function in the hyporheic zone, improving our ability to predict fluxes of carbon and nitrogen through riverine systems.
System-level insights into the cellular interactome of a non-model organism: inferring, modelling and analysing functional gene network of soybean (Glycine max).

PubMed

Xu, Yungang; Guo, Maozu; Zou, Quan; Liu, Xiaoyan; Wang, Chunyu; Liu, Yang

2014-01-01

Cellular interactome, in which genes and/or their products interact on several levels, forming transcriptional regulatory-, protein interaction-, metabolic-, signal transduction networks, etc., has attracted decades of research focuses. However, such a specific type of network alone can hardly explain the various interactive activities among genes. These networks characterize different interaction relationships, implying their unique intrinsic properties and defects, and covering different slices of biological information. Functional gene network (FGN), a consolidated interaction network that models fuzzy and more generalized notion of gene-gene relations, have been proposed to combine heterogeneous networks with the goal of identifying functional modules supported by multiple interaction types. There are yet no successful precedents of FGNs on sparsely studied non-model organisms, such as soybean (Glycine max), due to the absence of sufficient heterogeneous interaction data. We present an alternative solution for inferring the FGNs of soybean (SoyFGNs), in a pioneering study on the soybean interactome, which is also applicable to other organisms. SoyFGNs exhibit the typical characteristics of biological networks: scale-free, small-world architecture and modularization. Verified by co-expression and KEGG pathways, SoyFGNs are more extensive and accurate than an orthology network derived from Arabidopsis. As a case study, network-guided disease-resistance gene discovery indicates that SoyFGNs can provide system-level studies on gene functions and interactions. This work suggests that inferring and modelling the interactome of a non-model plant are feasible. It will speed up the discovery and definition of the functions and interactions of other genes that control important functions, such as nitrogen fixation and protein or lipid synthesis. The efforts of the study are the basis of our further comprehensive studies on the soybean functional interactome at the genome and microRNome levels. Additionally, a web tool for information retrieval and analysis of SoyFGNs can be accessed at SoyFN: http://nclab.hit.edu.cn/SoyFN.
System-Level Insights into the Cellular Interactome of a Non-Model Organism: Inferring, Modelling and Analysing Functional Gene Network of Soybean (Glycine max)

PubMed Central

Xu, Yungang; Guo, Maozu; Zou, Quan; Liu, Xiaoyan; Wang, Chunyu; Liu, Yang

2014-01-01

Cellular interactome, in which genes and/or their products interact on several levels, forming transcriptional regulatory-, protein interaction-, metabolic-, signal transduction networks, etc., has attracted decades of research focuses. However, such a specific type of network alone can hardly explain the various interactive activities among genes. These networks characterize different interaction relationships, implying their unique intrinsic properties and defects, and covering different slices of biological information. Functional gene network (FGN), a consolidated interaction network that models fuzzy and more generalized notion of gene-gene relations, have been proposed to combine heterogeneous networks with the goal of identifying functional modules supported by multiple interaction types. There are yet no successful precedents of FGNs on sparsely studied non-model organisms, such as soybean (Glycine max), due to the absence of sufficient heterogeneous interaction data. We present an alternative solution for inferring the FGNs of soybean (SoyFGNs), in a pioneering study on the soybean interactome, which is also applicable to other organisms. SoyFGNs exhibit the typical characteristics of biological networks: scale-free, small-world architecture and modularization. Verified by co-expression and KEGG pathways, SoyFGNs are more extensive and accurate than an orthology network derived from Arabidopsis. As a case study, network-guided disease-resistance gene discovery indicates that SoyFGNs can provide system-level studies on gene functions and interactions. This work suggests that inferring and modelling the interactome of a non-model plant are feasible. It will speed up the discovery and definition of the functions and interactions of other genes that control important functions, such as nitrogen fixation and protein or lipid synthesis. The efforts of the study are the basis of our further comprehensive studies on the soybean functional interactome at the genome and microRNome levels. Additionally, a web tool for information retrieval and analysis of SoyFGNs can be accessed at SoyFN: http://nclab.hit.edu.cn/SoyFN. PMID:25423109
Extreme Mutation Tolerance: Nearly Half of the Archaeal Fusellovirus Sulfolobus Spindle-Shaped Virus 1 Genes Are Not Required for Virus Function, Including the Minor Capsid Protein Gene vp3

PubMed Central

Iverson, Eric A.; Goodman, David A.; Gorchels, Madeline E.

2017-01-01

ABSTRACT Viruses infecting the Archaea harbor a tremendous amount of genetic diversity. This is especially true for the spindle-shaped viruses of the family Fuselloviridae, where >90% of the viral genes do not have detectable homologs in public databases. This significantly limits our ability to elucidate the role of viral proteins in the infection cycle. To address this, we have developed genetic techniques to study the well-characterized fusellovirus Sulfolobus spindle-shaped virus 1 (SSV1), which infects Sulfolobus solfataricus in volcanic hot springs at 80°C and pH 3. Here, we present a new comparative genome analysis and a thorough genetic analysis of SSV1 using both specific and random mutagenesis and thereby generate mutations in all open reading frames. We demonstrate that almost half of the SSV1 genes are not essential for infectivity, and the requirement for a particular gene correlates well with its degree of conservation within the Fuselloviridae. The major capsid gene vp1 is essential for SSV1 infectivity. However, the universally conserved minor capsid gene vp3 could be deleted without a loss in infectivity and results in virions with abnormal morphology. IMPORTANCE Most of the putative genes in the spindle-shaped archaeal hyperthermophile fuselloviruses have no sequences that are clearly similar to characterized genes. In order to determine which of these SSV genes are important for function, we disrupted all of the putative genes in the prototypical fusellovirus, SSV1. Surprisingly, about half of the genes could be disrupted without destroying virus function. Even deletions of one of the known structural protein genes that is present in all known fuselloviruses, vp3, allows the production of infectious viruses. However, viruses lacking vp3 have abnormal shapes, indicating that the vp3 gene is important for virus structure. Identification of essential genes will allow focused research on minimal SSV genomes and further understanding of the structure of these unique, ubiquitous, and extremely stable archaeal viruses. PMID:28148789
Extreme Mutation Tolerance: Nearly Half of the Archaeal Fusellovirus Sulfolobus Spindle-Shaped Virus 1 Genes Are Not Required for Virus Function, Including the Minor Capsid Protein Gene vp3.

PubMed

Iverson, Eric A; Goodman, David A; Gorchels, Madeline E; Stedman, Kenneth M

2017-05-15

Viruses infecting the Archaea harbor a tremendous amount of genetic diversity. This is especially true for the spindle-shaped viruses of the family Fuselloviridae , where >90% of the viral genes do not have detectable homologs in public databases. This significantly limits our ability to elucidate the role of viral proteins in the infection cycle. To address this, we have developed genetic techniques to study the well-characterized fusellovirus Sulfolobus spindle-shaped virus 1 (SSV1), which infects Sulfolobus solfataricus in volcanic hot springs at 80°C and pH 3. Here, we present a new comparative genome analysis and a thorough genetic analysis of SSV1 using both specific and random mutagenesis and thereby generate mutations in all open reading frames. We demonstrate that almost half of the SSV1 genes are not essential for infectivity, and the requirement for a particular gene correlates well with its degree of conservation within the Fuselloviridae The major capsid gene vp1 is essential for SSV1 infectivity. However, the universally conserved minor capsid gene vp3 could be deleted without a loss in infectivity and results in virions with abnormal morphology. IMPORTANCE Most of the putative genes in the spindle-shaped archaeal hyperthermophile fuselloviruses have no sequences that are clearly similar to characterized genes. In order to determine which of these SSV genes are important for function, we disrupted all of the putative genes in the prototypical fusellovirus, SSV1. Surprisingly, about half of the genes could be disrupted without destroying virus function. Even deletions of one of the known structural protein genes that is present in all known fuselloviruses, vp3 , allows the production of infectious viruses. However, viruses lacking vp3 have abnormal shapes, indicating that the vp3 gene is important for virus structure. Identification of essential genes will allow focused research on minimal SSV genomes and further understanding of the structure of these unique, ubiquitous, and extremely stable archaeal viruses. Copyright © 2017 American Society for Microbiology.
Analysis of the GRNs Inference by Using Tsallis Entropy and a Feature Selection Approach

NASA Astrophysics Data System (ADS)

Lopes, Fabrício M.; de Oliveira, Evaldo A.; Cesar, Roberto M.

An important problem in the bioinformatics field is to understand how genes are regulated and interact through gene networks. This knowledge can be helpful for many applications, such as disease treatment design and drugs creation purposes. For this reason, it is very important to uncover the functional relationship among genes and then to construct the gene regulatory network (GRN) from temporal expression data. However, this task usually involves data with a large number of variables and small number of observations. In this way, there is a strong motivation to use pattern recognition and dimensionality reduction approaches. In particular, feature selection is specially important in order to select the most important predictor genes that can explain some phenomena associated with the target genes. This work presents a first study about the sensibility of entropy methods regarding the entropy functional form, applied to the problem of topology recovery of GRNs. The generalized entropy proposed by Tsallis is used to study this sensibility. The inference process is based on a feature selection approach, which is applied to simulated temporal expression data generated by an artificial gene network (AGN) model. The inferred GRNs are validated in terms of global network measures. Some interesting conclusions can be drawn from the experimental results, as reported for the first time in the present paper.
Whole-Genome Duplication and the Functional Diversification of Teleost Fish Hemoglobins

PubMed Central

Opazo, Juan C.; Butts, G. Tyler; Nery, Mariana F.; Storz, Jay F.; Hoffmann, Federico G.

2013-01-01

Subsequent to the two rounds of whole-genome duplication that occurred in the common ancestor of vertebrates, a third genome duplication occurred in the stem lineage of teleost fishes. This teleost-specific genome duplication (TGD) is thought to have provided genetic raw materials for the physiological, morphological, and behavioral diversification of this highly speciose group. The extreme physiological versatility of teleost fish is manifest in their diversity of blood–gas transport traits, which reflects the myriad solutions that have evolved to maintain tissue O2 delivery in the face of changing metabolic demands and environmental O2 availability during different ontogenetic stages. During the course of development, regulatory changes in blood–O2 transport are mediated by the expression of multiple, functionally distinct hemoglobin (Hb) isoforms that meet the particular O2-transport challenges encountered by the developing embryo or fetus (in viviparous or oviparous species) and in free-swimming larvae and adults. The main objective of the present study was to assess the relative contributions of whole-genome duplication, large-scale segmental duplication, and small-scale gene duplication in producing the extraordinary functional diversity of teleost Hbs. To accomplish this, we integrated phylogenetic reconstructions with analyses of conserved synteny to characterize the genomic organization and evolutionary history of the globin gene clusters of teleosts. These results were then integrated with available experimental data on functional properties and developmental patterns of stage-specific gene expression. Our results indicate that multiple α- and β-globin genes were present in the common ancestor of gars (order Lepisoteiformes) and teleosts. The comparative genomic analysis revealed that teleosts possess a dual set of TGD-derived globin gene clusters, each of which has undergone lineage-specific changes in gene content via repeated duplication and deletion events. Phylogenetic reconstructions revealed that paralogous genes convergently evolved similar functional properties in different teleost lineages. Consistent with other recent studies of globin gene family evolution in vertebrates, our results revealed evidence for repeated evolutionary transitions in the developmental regulation of Hb synthesis. PMID:22949522
Genome-wide Analyses of the Structural Gene Families Involved in the Legume-specific 5-Deoxyisoflavonoid Biosynthesis of Lotus japonicus

PubMed Central

Shimada, Norimoto; Sato, Shusei; Akashi, Tomoyoshi; Nakamura, Yasukazu; Tabata, Satoshi; Ayabe, Shin-ichi; Aoki, Toshio

2007-01-01

Abstract A model legume Lotus japonicus (Regel) K. Larsen is one of the subjects of genome sequencing and functional genomics programs. In the course of targeted approaches to the legume genomics, we analyzed the genes encoding enzymes involved in the biosynthesis of the legume-specific 5-deoxyisoflavonoid of L. japonicus, which produces isoflavan phytoalexins on elicitor treatment. The paralogous biosynthetic genes were assigned as comprehensively as possible by biochemical experiments, similarity searches, comparison of the gene structures, and phylogenetic analyses. Among the 10 biosynthetic genes investigated, six comprise multigene families, and in many cases they form gene clusters in the chromosomes. Semi-quantitative reverse transcriptase–PCR analyses showed coordinate up-regulation of most of the genes during phytoalexin induction and complex accumulation patterns of the transcripts in different organs. Some paralogous genes exhibited similar expression specificities, suggesting their genetic redundancy. The molecular evolution of the biosynthetic genes is discussed. The results presented here provide reliable annotations of the genes and genetic markers for comparative and functional genomics of leguminous plants. PMID:17452423
Versatile control of Plasmodium falciparum gene expression with an inducible protein-RNA interaction

PubMed Central

Goldfless, Stephen J.; Wagner, Jeffrey C.; Niles, Jacquin C.

2014-01-01

The available tools for conditional gene expression in Plasmodium falciparum are limited. Here, to enable reliable control of target gene expression, we build a system to efficiently modulate translation. We overcame several problems associated with other approaches for regulating gene expression in P. falciparum. Specifically, our system functions predictably across several native and engineered promoter contexts, and affords control over reporter and native parasite proteins irrespective of their subcellular compartmentalization. Induction and repression of gene expression are rapid, homogeneous, and stable over prolonged periods. To demonstrate practical application of our system, we used it to reveal direct links between antimalarial drugs and their native parasite molecular target. This is an important out come given the rapid spread of resistance, and intensified efforts to efficiently discover and optimize new antimalarial drugs. Overall, the studies presented highlight the utility of our system for broadly controlling gene expression and performing functional genetics in P. falciparum. PMID:25370483
Fishing for biodiversity: Novel methanopterin-linked C1 transfergenes deduced from the Sargasso Sea metagenome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kalyuzhnaya, Marina G.; Nercessian, Olivier; Lapidus, Alla

2004-07-01

The recently generated database of microbial genes from anoligotrophic environment populated by a calculated 1,800 of major phylotypes (the Sargasso Sea metagenome) presents a great source for expanding local databases of genes indicative of a specific function. In this paper we analyze the Sargasso Sea metagenome in terms of the presence of methanopterin-linked C1 transfer genes that are signature for methylotrophy. We conclude that more than 10 phylotypes possessing genes of interest are present in this environment, and a few of these are relatively abundant species. The sequences representative of the major phylotypes do not appear to belong to anymore » known microbial group capable of methanopterin-linked C1 transfer. Instead, they separate from all known sequences on phylogenetic trees, pointing towards their affiliation with a novel microbial phylum. These data imply a broader distribution of methanopterin-linked functions in the microbial world than previously known.« less
Autosomal Genes of Autosomal/X-Linked Duplicated Gene Pairs and Germ-Line Proliferation in Caenorhabditis elegans

PubMed Central

Maciejowski, John; Ahn, James Hyungsoo; Cipriani, Patricia Giselle; Killian, Darrell J.; Chaudhary, Aisha L.; Lee, Ji Inn; Voutev, Roumen; Johnsen, Robert C.; Baillie, David L.; Gunsalus, Kristin C.; Fitch, David H. A.; Hubbard, E. Jane Albert

2005-01-01

We report molecular genetic studies of three genes involved in early germ-line proliferation in Caenorhabditis elegans that lend unexpected insight into a germ-line/soma functional separation of autosomal/X-linked duplicated gene pairs. In a genetic screen for germ-line proliferation-defective mutants, we identified mutations in rpl-11.1 (L11 protein of the large ribosomal subunit), pab-1 [a poly(A)-binding protein], and glp-3/eft-3 (an elongation factor 1-α homolog). All three are members of autosome/X gene pairs. Consistent with a germ-line-restricted function of rpl-11.1 and pab-1, mutations in these genes extend life span and cause gigantism. We further examined the RNAi phenotypes of the three sets of rpl genes (rpl-11, rpl-24, and rpl-25) and found that for the two rpl genes with autosomal/X-linked pairs (rpl-11 and rpl-25), zygotic germ-line function is carried by the autosomal copy. Available RNAi results for highly conserved autosomal/X-linked gene pairs suggest that other duplicated genes may follow a similar trend. The three rpl and the pab-1/2 duplications predate the divergence between C. elegans and C. briggsae, while the eft-3/4 duplication appears to have occurred in the lineage to C. elegans after it diverged from C. briggsae. The duplicated C. briggsae orthologs of the three C. elegans autosomal/X-linked gene pairs also display functional differences between paralogs. We present hypotheses for evolutionary mechanisms that may underlie germ-line/soma subfunctionalization of duplicated genes, taking into account the role of X chromosome silencing in the germ line and analogous mammalian phenomena. PMID:15687263
Comprehensive Expression Profiling and Functional Network Analysis of Porphyra-334, One Mycosporine-Like Amino Acid (MAA), in Human Keratinocyte Exposed with UV-radiation.

PubMed

Suh, Sung-Suk; Lee, Sung Gu; Youn, Ui Joung; Han, Se Jong; Kim, Il-Chan; Kim, Sanghee

2017-06-24

Mycosporine-like amino acids (MAAs) have been highlighted as pharmacologically active secondary compounds to protect cells from harmful UV-radiation by absorbing its energy. Previous studies have mostly focused on characterizing their physiological properties such as antioxidant activity and osmotic regulation. However, molecular mechanisms underlying their UV-protective capability have not yet been revealed. In the present study, we investigated the expression profiling of porphyra-334-modulated genes or microRNA (miRNAs) in response to UV-exposure and their functional networks, using cDNA and miRNAs microarray. Based on our data, we showed that porphyra-334-regulated genes play essential roles in UV-affected biological processes such as Wnt (Wingless/integrase-1) and Notch pathways which exhibit antagonistic relationship in various biological processes; the UV-repressed genes were in the Wnt signaling pathway, while the activated genes were in the Notch signaling. In addition, porphyra-334-regulated miRNAs can target many genes related with UV-mediated biological processes such as apoptosis, cell proliferation and translational elongation. Notably, we observed that functional roles of the target genes for up-regulated miRNAs are inversely correlated with those for down-regulated miRNAs; the former genes promote apoptosis and translational elongation, whereas the latter function as inhibitors in these processes. Taken together, these data suggest that porphyra-334 protects cells from harmful UV radiation through the comprehensive modulation of expression patterns of genes involved in UV-mediated biological processes, and that provide a new insight to understand its functional molecular networks.
The insulin and islet amyloid polypeptide genes contain similar cell-specific promoter elements that bind identical beta-cell nuclear complexes.

PubMed Central

German, M S; Moss, L G; Wang, J; Rutter, W J

1992-01-01

The pancreatic beta cell makes several unique gene products, including insulin, islet amyloid polypeptide (IAPP), and beta-cell-specific glucokinase (beta GK). The functions of isolated portions of the insulin, IAPP, and beta GK promoters were studied by using transient expression and DNA binding assays. A short portion (-247 to -197 bp) of the rat insulin I gene, the FF minienhancer, contains three interacting transcriptional regulatory elements. The FF minienhancer binds at least two nuclear complexes with limited tissue distribution. Sequences similar to that of the FF minienhancer are present in the 5' flanking DNA of the human IAPP and rat beta GK genes and also the rat insulin II and mouse insulin I and II genes. Similar minienhancer constructs from the insulin and IAPP genes function as cell-specific transcriptional regulatory elements and compete for binding of the same nuclear factors, while the beta GK construct competes for protein binding but functions poorly as a minienhancer. These observations suggest that the patterns of expression of the beta-cell-specific genes result in part from sharing the same transcriptional regulators. Images PMID:1549125
[Current status of gene test market].

PubMed

Ohtani, Shinichi

2002-12-01

The technological innovation of the gene analysis makes the adaptation range of the gene test in clinical diagnosis expand. Then, gene test has popularized increasingly around the infection disease for clinical inspection. Also in the field of clinical inspection, the increase of the importance of clinical application and the inspection item new year by year have appeared with the functional analysis of a gene. Moreover, the new test method and automation analysis equipment tend to be developed by progress of gene-analysis technology, and it is going to be introduced. The spread of gene test and development of a gene test market have an important possibility of activating the present clinical inspection field.
Effects of drought stress on global gene expression profile in leaf and root samples of Dongxiang wild rice (Oryza rufipogon).

PubMed

Zhang, Fantao; Zhou, Yi; Zhang, Meng; Luo, Xiangdong; Xie, Jiankun

2017-06-30

Drought is a serious constraint to rice production throughout the world, and although Dongxiang wild rice ( Oryza rufipogon , DXWR) possesses a high degree of drought resistance, the underlying mechanisms of this trait remains unclear. In the present study, cDNA libraries were constructed from the leaf and root tissues of drought-stressed and untreated DXWR seedlings, and transcriptome sequencing was performed with the goal of elucidating the molecular mechanisms involved in drought-stress response. The results indicated that 11231 transcripts were differentially expressed in the leaves (4040 up-regulated and 7191 down-regulated) and 7025 transcripts were differentially expressed in the roots (3097 up-regulated and 3928 down-regulated). Among these differentially expressed genes (DEGs), the detection of many transcriptional factors and functional genes demonstrated that multiple regulatory pathways were involved in drought resistance. Meanwhile, the DEGs were also annotated with gene ontology (GO) terms and key pathways via functional classification and Kyoto Encyclopedia of Gene and Genomes (KEGG) pathway mapping, respectively. A set of the most interesting candidate genes was then identified by combining the DEGs with previously identified drought-resistant quantitative trait loci (QTL). The present work provides abundant genomic information for functional dissection of the drought resistance of DXWR, and findings will further help the current understanding of the biological regulatory mechanisms of drought resistance in plants and facilitate the breeding of new drought-resistant rice cultivars. © 2017 The Author(s).

Reliable cloning of functional antibody variable domains from hybridomas and spleen cell repertoires employing a reengineered phage display system.

PubMed

Krebber, A; Bornhauser, S; Burmester, J; Honegger, A; Willuda, J; Bosshard, H R; Plückthun, A

1997-02-14

A prerequisite for the use of recombinant antibody technologies starting from hybridomas or immune repertoires is the reliable cloning of functional immunoglobulin genes. For this purpose, a standard phage display system was optimized for robustness, vector stability, tight control of scFv-delta geneIII expression, primer usage for PCR amplification of variable region genes, scFv assembly strategy and subsequent directional cloning using a single rare cutting restriction enzyme. This integrated cloning, screening and selection system allowed us to rapidly obtain antigen binding scFvs derived from spleen-cell repertoires of mice immunized with ampicillin as well as from all hybridoma cell lines tested to date. As representative examples, cloning of monoclonal antibodies against a his tag, leucine zippers, the tumor marker EGP-2 and the insecticide DDT is presented. Several hybridomas whose genes could not be cloned in previous experimental setups, but were successfully obtained with the present system, expressed high amounts of aberrant heavy and light chain mRNAs, which were amplified by PCR and greatly exceeded the amount of binding antibody sequences. These contaminating variable region genes were successfully eliminated by employing the optimized phage display system, thus avoiding time consuming sequencing of non-binding scFv genes. To maximize soluble expression of functional scFvs subsequent to cloning, a compatible vector series to simplify modification, detection, multimerization and rapid purification of recombinant antibody fragments was constructed.
pico-PLAZA, a genome database of microbial photosynthetic eukaryotes.

PubMed

Vandepoele, Klaas; Van Bel, Michiel; Richard, Guilhem; Van Landeghem, Sofie; Verhelst, Bram; Moreau, Hervé; Van de Peer, Yves; Grimsley, Nigel; Piganeau, Gwenael

2013-08-01

With the advent of next generation genome sequencing, the number of sequenced algal genomes and transcriptomes is rapidly growing. Although a few genome portals exist to browse individual genome sequences, exploring complete genome information from multiple species for the analysis of user-defined sequences or gene lists remains a major challenge. pico-PLAZA is a web-based resource (http://bioinformatics.psb.ugent.be/pico-plaza/) for algal genomics that combines different data types with intuitive tools to explore genomic diversity, perform integrative evolutionary sequence analysis and study gene functions. Apart from homologous gene families, multiple sequence alignments, phylogenetic trees, Gene Ontology, InterPro and text-mining functional annotations, different interactive viewers are available to study genome organization using gene collinearity and synteny information. Different search functions, documentation pages, export functions and an extensive glossary are available to guide non-expert scientists. To illustrate the versatility of the platform, different case studies are presented demonstrating how pico-PLAZA can be used to functionally characterize large-scale EST/RNA-Seq data sets and to perform environmental genomics. Functional enrichments analysis of 16 Phaeodactylum tricornutum transcriptome libraries offers a molecular view on diatom adaptation to different environments of ecological relevance. Furthermore, we show how complementary genomic data sources can easily be combined to identify marker genes to study the diversity and distribution of algal species, for example in metagenomes, or to quantify intraspecific diversity from environmental strains. © 2013 John Wiley & Sons Ltd and Society for Applied Microbiology.
Natural killer cell receptor genes in the family Equidae: not only Ly49.

PubMed

Futas, Jan; Horin, Petr

2013-01-01

Natural killer (NK) cells have important functions in immunity. NK recognition in mammals can be mediated through killer cell immunoglobulin-like receptors (KIR) and/or killer cell lectin-like Ly49 receptors. Genes encoding highly variable NK cell receptors (NKR) represent rapidly evolving genomic regions. No single conservative model of NKR genes was observed in mammals. Single-copy low polymorphic NKR genes present in one mammalian species may expand into highly polymorphic multigene families in other species. In contrast to other non-rodent mammals, multiple Ly49-like genes appear to exist in the horse, while no functional KIR genes were observed in this species. In this study, Ly49 and KIR were sought and their evolution was characterized in the entire family Equidae. Genomic sequences retrieved showed the presence of at least five highly conserved polymorphic Ly49 genes in horses, asses and zebras. These findings confirmed that the expansion of Ly49 occurred in the entire family. Several KIR-like sequences were also identified in the genome of Equids. Besides a previously identified non-functional KIR-Immunoglobulin-like transcript fusion gene (KIR-ILTA) and two putative pseudogenes, a KIR3DL-like sequence was analyzed. In contrast to previous observations made in the horse, the KIR3DL sequence, genomic organization and mRNA expression suggest that all Equids might produce a functional KIR receptor protein molecule with a single non-mutated immune tyrosine-based inhibition motif (ITIM) domain. No evidence for positive selection in the KIR3DL gene was found. Phylogenetic analysis including rhinoceros and tapir genomic DNA and deduced amino acid KIR-related sequences showed differences between families and even between species within the order Perissodactyla. The results suggest that the order Perissodactyla and its family Equidae with expanded Ly49 genes and with a potentially functional KIR gene may represent an interesting model for evolutionary biology of NKR genes.
Natural Killer Cell Receptor Genes in the Family Equidae: Not only Ly49

PubMed Central

Futas, Jan; Horin, Petr

2013-01-01

Natural killer (NK) cells have important functions in immunity. NK recognition in mammals can be mediated through killer cell immunoglobulin-like receptors (KIR) and/or killer cell lectin-like Ly49 receptors. Genes encoding highly variable NK cell receptors (NKR) represent rapidly evolving genomic regions. No single conservative model of NKR genes was observed in mammals. Single-copy low polymorphic NKR genes present in one mammalian species may expand into highly polymorphic multigene families in other species. In contrast to other non-rodent mammals, multiple Ly49-like genes appear to exist in the horse, while no functional KIR genes were observed in this species. In this study, Ly49 and KIR were sought and their evolution was characterized in the entire family Equidae. Genomic sequences retrieved showed the presence of at least five highly conserved polymorphic Ly49 genes in horses, asses and zebras. These findings confirmed that the expansion of Ly49 occurred in the entire family. Several KIR-like sequences were also identified in the genome of Equids. Besides a previously identified non-functional KIR-Immunoglobulin-like transcript fusion gene (KIR-ILTA) and two putative pseudogenes, a KIR3DL-like sequence was analyzed. In contrast to previous observations made in the horse, the KIR3DL sequence, genomic organization and mRNA expression suggest that all Equids might produce a functional KIR receptor protein molecule with a single non-mutated immune tyrosine-based inhibition motif (ITIM) domain. No evidence for positive selection in the KIR3DL gene was found. Phylogenetic analysis including rhinoceros and tapir genomic DNA and deduced amino acid KIR-related sequences showed differences between families and even between species within the order Perissodactyla. The results suggest that the order Perissodactyla and its family Equidae with expanded Ly49 genes and with a potentially functional KIR gene may represent an interesting model for evolutionary biology of NKR genes. PMID:23724088
GEM2Net: from gene expression modeling to -omics networks, a new CATdb module to investigate Arabidopsis thaliana genes involved in stress response.

PubMed

Zaag, Rim; Tamby, Jean Philippe; Guichard, Cécile; Tariq, Zakia; Rigaill, Guillem; Delannoy, Etienne; Renou, Jean-Pierre; Balzergue, Sandrine; Mary-Huard, Tristan; Aubourg, Sébastien; Martin-Magniette, Marie-Laure; Brunaud, Véronique

2015-01-01

CATdb (http://urgv.evry.inra.fr/CATdb) is a database providing a public access to a large collection of transcriptomic data, mainly for Arabidopsis but also for other plants. This resource has the rare advantage to contain several thousands of microarray experiments obtained with the same technical protocol and analyzed by the same statistical pipelines. In this paper, we present GEM2Net, a new module of CATdb that takes advantage of this homogeneous dataset to mine co-expression units and decipher Arabidopsis gene functions. GEM2Net explores 387 stress conditions organized into 18 biotic and abiotic stress categories. For each one, a model-based clustering is applied on expression differences to identify clusters of co-expressed genes. To characterize functions associated with these clusters, various resources are analyzed and integrated: Gene Ontology, subcellular localization of proteins, Hormone Families, Transcription Factor Families and a refined stress-related gene list associated to publications. Exploiting protein-protein interactions and transcription factors-targets interactions enables to display gene networks. GEM2Net presents the analysis of the 18 stress categories, in which 17,264 genes are involved and organized within 681 co-expression clusters. The meta-data analyses were stored and organized to compose a dynamic Web resource. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
INHIBITION OF ERN1 SIGNALING ENZYME AFFECTS HYPOXIC REGULATION OF THE EXPRESSION OF E2F8, EPAS1, HOXC6, ATF3, TBX3 AND FOXF1 GENES IN U87 GLIOMA CELLS.

PubMed

Minchenko, O H; Tsymbal, D O; Minchenko, D O; Kovalevska, O V; Karbovskyi, L L; Bikfalvi, A

2015-01-01

Hypoxia as well as the endoplasmic reticulum stress are important factors of malignant tumor growth and control of the expression of genes, which regulate numerous metabolic processes and cell proliferation. Furthermore, blockade of ERN1 (endoplasmic reticulum to nucleus 1) suppresses cell proliferation and tumor growth. We studied the effect of hypoxia on the expression of genes encoding the transcription factors such as E2F8 (E2F transcription factor 8), EPAS1 (endothelial PAS domain protein 1), TBX3 (T-box 3), ATF3 (activating transcription factor 3), FOXF1 (forkhead box F), and HOXC6 (homeobox C6) in U87 glioma cells with and without ERN1 signaling enzyme function. We have established that hypoxia enhances the expression of HOXC6, E2F8, ATF3, and EPAS1 genes but does not change TBX3 and FOXF1 gene expression in glioma cells with ERNI function. At the same time, the expression level of all studied genes is strongly decreased, except for TBX3 gene, in glioma cells without ERN1 function. Moreover, the inhibition of ERN1 signaling enzyme function significantly modifies the effect of hypoxia on the expression of these transcription factor genes. removes or introduces this regulation as well as changes a direction or magnitude of hypoxic regulation. Present study demonstrates that fine-tuning of the expression of proliferation related genes depends upon hypoxia and ERN1-mediated endoplasmic reticulum stress signaling and correlates with slower proliferation rate of glioma cells without ERN1 function.
Genome-Wide Characterization of Transcriptional Patterns in High and Low Antibody Responders to Rubella Vaccination

PubMed Central

Haralambieva, Iana H.; Oberg, Ann L.; Ovsyannikova, Inna G.; Kennedy, Richard B.; Grill, Diane E.; Middha, Sumit; Bot, Brian M.; Wang, Vivian W.; Smith, David I.; Jacobson, Robert M.; Poland, Gregory A.

2013-01-01

Immune responses to current rubella vaccines demonstrate significant inter-individual variability. We performed mRNA-Seq profiling on PBMCs from high and low antibody responders to rubella vaccination to delineate transcriptional differences upon viral stimulation. Generalized linear models were used to assess the per gene fold change (FC) for stimulated versus unstimulated samples or the interaction between outcome and stimulation. Model results were evaluated by both FC and p-value. Pathway analysis and self-contained gene set tests were performed for assessment of gene group effects. Of 17,566 detected genes, we identified 1,080 highly significant differentially expressed genes upon viral stimulation (p<1.00E−15, FDR<1.00E−14), including various immune function and inflammation-related genes, genes involved in cell signaling, cell regulation and transcription, and genes with unknown function. Analysis by immune outcome and stimulation status identified 27 genes (p≤0.0006 and FDR≤0.30) that responded differently to viral stimulation in high vs. low antibody responders, including major histocompatibility complex (MHC) class I genes (HLA-A, HLA-B and B2M with p = 0.0001, p = 0.0005 and p = 0.0002, respectively), and two genes related to innate immunity and inflammation (EMR3 and MEFV with p = 1.46E−08 and p = 0.0004, respectively). Pathway and gene set analysis also revealed transcriptional differences in antigen presentation and innate/inflammatory gene sets and pathways between high and low responders. Using mRNA-Seq genome-wide transcriptional profiling, we identified antigen presentation and innate/inflammatory genes that may assist in explaining rubella vaccine-induced immune response variations. Such information may provide new scientific insights into vaccine-induced immunity useful in rational vaccine development and immune response monitoring. PMID:23658707
Gene Expression Architecture of Mouse Dorsal and Tail Skin Reveals Functional Differences in Inflammation and Cancer | Office of Cancer Genomics

Cancer.gov

Inherited germline polymorphisms can cause gene expression levels in normal tissues to differ substantially between individuals. We present an analysis of the genetic architecture of normal adult skin from 470 genetically unique mice, demonstrating the effect of germline variants, skin tissue location, and perturbation by exogenous inflammation or tumorigenesis on gene signaling pathways.
Evolution dynamics of a model for gene duplication under adaptive conflict

NASA Astrophysics Data System (ADS)

Ancliff, Mark; Park, Jeong-Man

2014-06-01

We present and solve the dynamics of a model for gene duplication showing escape from adaptive conflict. We use a Crow-Kimura quasispecies model of evolution where the fitness landscape is a function of Hamming distances from two reference sequences, which are assumed to optimize two different gene functions, to describe the dynamics of a mixed population of individuals with single and double copies of a pleiotropic gene. The evolution equations are solved through a spin coherent state path integral, and we find two phases: one is an escape from an adaptive conflict phase, where each copy of a duplicated gene evolves toward subfunctionalization, and the other is a duplication loss of function phase, where one copy maintains its pleiotropic form and the other copy undergoes neutral mutation. The phase is determined by a competition between the fitness benefits of subfunctionalization and the greater mutational load associated with maintaining two gene copies. In the escape phase, we find a dynamics of an initial population of single gene sequences only which escape adaptive conflict through gene duplication and find that there are two time regimes: until a time t* single gene sequences dominate, and after t* double gene sequences outgrow single gene sequences. The time t* is identified as the time necessary for subfunctionalization to evolve and spread throughout the double gene sequences, and we show that there is an optimum mutation rate which minimizes this time scale.
Gene finding in metatranscriptomic sequences.

PubMed

Ismail, Wazim Mohammed; Ye, Yuzhen; Tang, Haixu

2014-01-01

Metatranscriptomic sequencing is a highly sensitive bioassay of functional activity in a microbial community, providing complementary information to the metagenomic sequencing of the community. The acquisition of the metatranscriptomic sequences will enable us to refine the annotations of the metagenomes, and to study the gene activities and their regulation in complex microbial communities and their dynamics. In this paper, we present TransGeneScan, a software tool for finding genes in assembled transcripts from metatranscriptomic sequences. By incorporating several features of metatranscriptomic sequencing, including strand-specificity, short intergenic regions, and putative antisense transcripts into a Hidden Markov Model, TranGeneScan can predict a sense transcript containing one or multiple genes (in an operon) or an antisense transcript. We tested TransGeneScan on a mock metatranscriptomic data set containing three known bacterial genomes. The results showed that TranGeneScan performs better than metagenomic gene finders (MetaGeneMark and FragGeneScan) on predicting protein coding genes in assembled transcripts, and achieves comparable or even higher accuracy than gene finders for microbial genomes (Glimmer and GeneMark). These results imply, with the assistance of metatranscriptomic sequencing, we can obtain a broad and precise picture about the genes (and their functions) in a microbial community. TransGeneScan is available as open-source software on SourceForge at https://sourceforge.net/projects/transgenescan/.
Action of multiple intra-QTL genes concerted around a co-localized transcription factor underpins a large effect QTL

PubMed Central

Dixit, Shalabh; Kumar Biswal, Akshaya; Min, Aye; Henry, Amelia; Oane, Rowena H.; Raorane, Manish L.; Longkumer, Toshisangba; Pabuayon, Isaiah M.; Mutte, Sumanth K.; Vardarajan, Adithi R.; Miro, Berta; Govindan, Ganesan; Albano-Enriquez, Blesilda; Pueffeld, Mandy; Sreenivasulu, Nese; Slamet-Loedin, Inez; Sundarvelpandian, Kalaipandian; Tsai, Yuan-Ching; Raghuvanshi, Saurabh; Hsing, Yue-Ie C.; Kumar, Arvind; Kohli, Ajay

2015-01-01

Sub-QTLs and multiple intra-QTL genes are hypothesized to underpin large-effect QTLs. Known QTLs over gene families, biosynthetic pathways or certain traits represent functional gene-clusters of genes of the same gene ontology (GO). Gene-clusters containing genes of different GO have not been elaborated, except in silico as coexpressed genes within QTLs. Here we demonstrate the requirement of multiple intra-QTL genes for the full impact of QTL qDTY12.1 on rice yield under drought. Multiple evidences are presented for the need of the transcription factor ‘no apical meristem’ (OsNAM12.1) and its co-localized target genes of separate GO categories for qDTY12.1 function, raising a regulon-like model of genetic architecture. The molecular underpinnings of qDTY12.1 support its effectiveness in further improving a drought tolerant genotype and for its validity in multiple genotypes/ecosystems/environments. Resolving the combinatorial value of OsNAM12.1 with individual intra-QTL genes notwithstanding, identification and analyses of qDTY12.1has fast-tracked rice improvement towards food security. PMID:26507552
Conceptual Variation or Incoherence? Textbook Discourse on Genes in Six Countries

NASA Astrophysics Data System (ADS)

Gericke, Niklas M.; Hagberg, Mariana; dos Santos, Vanessa Carvalho; Joaquim, Leyla Mariane; El-Hani, Charbel N.

2014-02-01

The aim of this paper is to investigate in a systematic and comparative way previous results of independent studies on the treatment of genes and gene function in high school textbooks from six different countries. We analyze how the conceptual variation within the scientific domain of Genetics regarding gene function models and gene concepts is transformed via the didactic transposition into school science textbooks. The results indicate that a common textbook discourse on genes and their function exist in textbooks from the different countries. The structure of science as represented by conceptual variation and the use of multiple models was present in all the textbooks. However, the existence of conceptual variation and multiple models is implicit in these textbooks, i.e., the phenomenon of conceptual variation and multiple models are not addressed explicitly, nor its consequences and, thus, it ends up introducing conceptual incoherence about the gene concept and its function within the textbooks. We conclude that within the found textbook-discourse ontological aspects of the academic disciplines of genetics and molecular biology were retained, but without their epistemological underpinnings; these are lost in the didactic transposition. These results are of interest since students might have problems reconstructing the correct scientific understanding from the transformed school science knowledge as depicted within the high school textbooks. Implications for textbook writing as well as teaching are discussed in the paper.
HLA Immune Function Genes in Autism

PubMed Central

Torres, Anthony R.; Westover, Jonna B.; Rosenspire, Allen J.

2012-01-01

The human leukocyte antigen (HLA) genes on chromosome 6 are instrumental in many innate and adaptive immune responses. The HLA genes/haplotypes can also be involved in immune dysfunction and autoimmune diseases. It is now becoming apparent that many of the non-antigen-presenting HLA genes make significant contributions to autoimmune diseases. Interestingly, it has been reported that autism subjects often have associations with HLA genes/haplotypes, suggesting an underlying dysregulation of the immune system mediated by HLA genes. Genetic studies have only succeeded in identifying autism-causing genes in a small number of subjects suggesting that the genome has not been adequately interrogated. Close examination of the HLA region in autism has been relatively ignored, largely due to extraordinary genetic complexity. It is our proposition that genetic polymorphisms in the HLA region, especially in the non-antigen-presenting regions, may be important in the etiology of autism in certain subjects. PMID:22928105
A Different Microbiome Gene Repertoire in the Airways of Cystic Fibrosis Patients with Severe Lung Disease

PubMed Central

Bacci, Giovanni; Fiscarelli, Ersilia; Taccetti, Giovanni; Dolce, Daniela; Paganin, Patrizia; Morelli, Patrizia; Tuccio, Vanessa; De Alessandri, Alessandra; Lucidi, Vincenzina

2017-01-01

In recent years, next-generation sequencing (NGS) was employed to decipher the structure and composition of the microbiota of the airways in cystic fibrosis (CF) patients. However, little is still known about the overall gene functions harbored by the resident microbial populations and which specific genes are associated with various stages of CF lung disease. In the present study, we aimed to identify the microbial gene repertoire of CF microbiota in twelve patients with severe and normal/mild lung disease by performing sputum shotgun metagenome sequencing. The abundance of metabolic pathways encoded by microbes inhabiting CF airways was reconstructed from the metagenome. We identified a set of metabolic pathways differently distributed in patients with different pulmonary function; namely, pathways related to bacterial chemotaxis and flagellar assembly, as well as genes encoding efflux-mediated antibiotic resistance mechanisms and virulence-related genes. The results indicated that the microbiome of CF patients with low pulmonary function is enriched in virulence-related genes and in genes encoding efflux-mediated antibiotic resistance mechanisms. Overall, the microbiome of severely affected adults with CF seems to encode different mechanisms for the facilitation of microbial colonization and persistence in the lung, consistent with the characteristics of multidrug-resistant microbial communities that are commonly observed in patients with severe lung disease. PMID:28758937
[Endoplasmic reticulum stress in INS-1-3 cell associated with the expression changes of MODY gene pathway].

PubMed

Liu, Y T; Li, S R; Wang, Z; Xiao, J Z

2016-09-13

Objective: To profile the gene expression changes associated with endoplasmic reticulum stress in INS-1-3 cells induced by thapsigargin (TG) and tunicamycin (TM). Methods: Normal cultured INS-1-3 cells were used as a control. TG and TM were used to induce endoplasmic reticulum stress in INS-1-3 cells. Digital gene expression profiling technique was used to detect differentially expressed gene. The changes of gene expression were detected by expression pattern clustering analysis, gene ontology (GO) function and pathway enrichment analysis. Real time polymerase chain reaction (RT-PCR) was used to verify the key changes of gene expression. Results: Compared with the control group, there were 57 (45 up-regulated, 12 down-regulated) and 135 (99 up-regulated, 36 down-regulated) differentially expressed genes in TG and TM group, respectively. GO function enrichment analyses indicated that the main enrichment was in the endoplasmic reticulum. In signaling pathway analysis, the identified pathways were related with endoplasmic reticulum stress, antigen processing and presentation, protein export, and most of all, the maturity onset diabetes of the young (MODY) pathway. Conclusion: Under the condition of endoplasmic reticulum stress, the related expression changes of transcriptional factors in MODY signaling pathway may be related with the impaired function in islet beta cells.
Rice DB: an Oryza Information Portal linking annotation, subcellular location, function, expression, regulation, and evolutionary information for rice and Arabidopsis

PubMed Central

Narsai, Reena; Devenish, James; Castleden, Ian; Narsai, Kabir; Xu, Lin; Shou, Huixia; Whelan, James

2013-01-01

Omics research in Oryza sativa (rice) relies on the use of multiple databases to obtain different types of information to define gene function. We present Rice DB, an Oryza information portal that is a functional genomics database, linking gene loci to comprehensive annotations, expression data and the subcellular location of encoded proteins. Rice DB has been designed to integrate the direct comparison of rice with Arabidopsis (Arabidopsis thaliana), based on orthology or ‘expressology’, thus using and combining available information from two pre-eminent plant models. To establish Rice DB, gene identifiers (more than 40 types) and annotations from a variety of sources were compiled, functional information based on large-scale and individual studies was manually collated, hundreds of microarrays were analysed to generate expression annotations, and the occurrences of potential functional regulatory motifs in promoter regions were calculated. A range of computational subcellular localization predictions were also run for all putative proteins encoded in the rice genome, and experimentally confirmed protein localizations have been collated, curated and linked to functional studies in rice. A single search box allows anything from gene identifiers (for rice and/or Arabidopsis), motif sequences, subcellular location, to keyword searches to be entered, with the capability of Boolean searches (such as AND/OR). To demonstrate the utility of Rice DB, several examples are presented including a rice mitochondrial proteome, which draws on a variety of sources for subcellular location data within Rice DB. Comparisons of subcellular location, functional annotations, as well as transcript expression in parallel with Arabidopsis reveals examples of conservation between rice and Arabidopsis, using Rice DB (http://ricedb.plantenergy.uwa.edu.au). PMID:24147765
Rice DB: an Oryza Information Portal linking annotation, subcellular location, function, expression, regulation, and evolutionary information for rice and Arabidopsis.

PubMed

Narsai, Reena; Devenish, James; Castleden, Ian; Narsai, Kabir; Xu, Lin; Shou, Huixia; Whelan, James

2013-12-01

Omics research in Oryza sativa (rice) relies on the use of multiple databases to obtain different types of information to define gene function. We present Rice DB, an Oryza information portal that is a functional genomics database, linking gene loci to comprehensive annotations, expression data and the subcellular location of encoded proteins. Rice DB has been designed to integrate the direct comparison of rice with Arabidopsis (Arabidopsis thaliana), based on orthology or 'expressology', thus using and combining available information from two pre-eminent plant models. To establish Rice DB, gene identifiers (more than 40 types) and annotations from a variety of sources were compiled, functional information based on large-scale and individual studies was manually collated, hundreds of microarrays were analysed to generate expression annotations, and the occurrences of potential functional regulatory motifs in promoter regions were calculated. A range of computational subcellular localization predictions were also run for all putative proteins encoded in the rice genome, and experimentally confirmed protein localizations have been collated, curated and linked to functional studies in rice. A single search box allows anything from gene identifiers (for rice and/or Arabidopsis), motif sequences, subcellular location, to keyword searches to be entered, with the capability of Boolean searches (such as AND/OR). To demonstrate the utility of Rice DB, several examples are presented including a rice mitochondrial proteome, which draws on a variety of sources for subcellular location data within Rice DB. Comparisons of subcellular location, functional annotations, as well as transcript expression in parallel with Arabidopsis reveals examples of conservation between rice and Arabidopsis, using Rice DB (http://ricedb.plantenergy.uwa.edu.au). © 2013 The Authors The Plant Journal © 2013 John Wiley & Sons Ltd.
Evaluation of reference gene suitability for quantitative expression analysis by quantitative polymerase chain reaction in the mandibular condyle of sheep.

PubMed

Jiang, Xin; Xue, Yang; Zhou, Hongzhi; Li, Shouhong; Zhang, Zongmin; Hou, Rui; Ding, Yuxiang; Hu, Kaijin

2015-10-01

Reference genes are commonly used as a reliable approach to normalize the results of quantitative polymerase chain reaction (qPCR), and to reduce errors in the relative quantification of gene expression. Suitable reference genes belonging to numerous functional classes have been identified for various types of species and tissue. However, little is currently known regarding the most suitable reference genes for bone, specifically for the sheep mandibular condyle. Sheep are important for the study of human bone diseases, particularly for temporomandibular diseases. The present study aimed to identify a set of reference genes suitable for the normalization of qPCR data from the mandibular condyle of sheep. A total of 12 reference genes belonging to various functional classes were selected, and the expression stability of the reference genes was determined in both the normal and fractured area of the sheep mandibular condyle. RefFinder, which integrates the following currently available computational algorithms: geNorm, NormFinder, BestKeeper, and the comparative ΔCt method, was used to compare and rank the candidate reference genes. The results obtained from the four methods demonstrated a similar trend: RPL19, ACTB, and PGK1 were the most stably expressed reference genes in the sheep mandibular condyle. As determined by RefFinder comprehensive analysis, the results of the present study suggested that RPL19 is the most suitable reference gene for studies associated with the sheep mandibular condyle. In addition, ACTB and PGK1 may be considered suitable alternatives.
Unraveling transcriptional control and cis-regulatory codes using the software suite GeneACT

PubMed Central

Cheung, Tom Hiu; Kwan, Yin Lam; Hamady, Micah; Liu, Xuedong

2006-01-01

Deciphering gene regulatory networks requires the systematic identification of functional cis-acting regulatory elements. We present a suite of web-based bioinformatics tools, called GeneACT , that can rapidly detect evolutionarily conserved transcription factor binding sites or microRNA target sites that are either unique or over-represented in differentially expressed genes from DNA microarray data. GeneACT provides graphic visualization and extraction of common regulatory sequence elements in the promoters and 3'-untranslated regions that are conserved across multiple mammalian species. PMID:17064417
Interhemispheric gene expression differences in the cerebral cortex of humans and macaque monkeys.

PubMed

Muntané, Gerard; Santpere, Gabriel; Verendeev, Andrey; Seeley, William W; Jacobs, Bob; Hopkins, William D; Navarro, Arcadi; Sherwood, Chet C

2017-09-01

Handedness and language are two well-studied examples of asymmetrical brain function in humans. Approximately 90% of humans exhibit a right-hand preference, and the vast majority shows left-hemisphere dominance for language function. Although genetic models of human handedness and language have been proposed, the actual gene expression differences between cerebral hemispheres in humans remain to be fully defined. In the present study, gene expression profiles were examined in both hemispheres of three cortical regions involved in handedness and language in humans and their homologues in rhesus macaques: ventrolateral prefrontal cortex, posterior superior temporal cortex (STC), and primary motor cortex. Although the overall pattern of gene expression was very similar between hemispheres in both humans and macaques, weighted gene correlation network analysis revealed gene co-expression modules associated with hemisphere, which are different among the three cortical regions examined. Notably, a receptor-enriched gene module in STC was particularly associated with hemisphere and showed different expression levels between hemispheres only in humans.

Zebrafish atoh1 genes: classic proneural activity in the inner ear and regulation by Fgf and Notch.

PubMed

Millimaki, Bonny B; Sweet, Elly M; Dhason, Mary S; Riley, Bruce B

2007-01-01

Hair cells of the inner ear develop from an equivalence group marked by expression of the proneural gene Atoh1. In mouse, Atoh1 is necessary for hair cell differentiation, but its role in specifying the equivalence group (proneural function) has been questioned and little is known about its upstream activators. We have addressed these issues in zebrafish. Two zebrafish homologs, atoh1a and atoh1b, are together necessary for hair cell development. These genes crossregulate each other but are differentially required during distinct developmental periods, first in the preotic placode and later in the otic vesicle. Interactions with the Notch pathway confirm that atoh1 genes have early proneural function. Fgf3 and Fgf8 are upstream activators of atoh1 genes during both phases, and foxi1, pax8 and dlx genes regulate atoh1b in the preplacode. A model is presented in which zebrafish atoh1 genes operate in a complex network leading to hair cell development.
Chemical inducible promoter used to obtain transgenic plants with a silent marker and organisms and cells and methods of using same for screening for mutations

DOEpatents

Zuo, Jianru [New York, NY; Chua, Nam-Hai [Scarsdale, NY

2007-06-12

Disclosed is a chemically inducible promoter for transforming plants or plant cells with genes which are regulatable by adding the plants or cells to a medium containing an inducer or by removing them from such medium. The promoter is inducible by a glucocorticoid, estrogen or inducer not endogenous to plants. Such promoters may be used with any plant genes that can promote shoot regeneration and development to induce shoot formation in the presence of a glucocorticoid, estrogen or inducer. The promoter may be used with antibiotic or herbicide resistance genes or other genes which are regulatable by the presence or absence of a given inducer. Also presented are organisms or cells comprising a gene wherein the natural promoter of the gene is disrupted and the gene is placed under the control of a transgenic inducible promoter. These organisms and cells and their progeny are useful for screening for conditional gain of function and loss of function mutations.
A framework for list representation, enabling list stabilization through incorporation of gene exchangeabilities.

PubMed

Soneson, Charlotte; Fontes, Magnus

2012-01-01

Analysis of multivariate data sets from, for example, microarray studies frequently results in lists of genes which are associated with some response of interest. The biological interpretation is often complicated by the statistical instability of the obtained gene lists, which may partly be due to the functional redundancy among genes, implying that multiple genes can play exchangeable roles in the cell. In this paper, we use the concept of exchangeability of random variables to model this functional redundancy and thereby account for the instability. We present a flexible framework to incorporate the exchangeability into the representation of lists. The proposed framework supports straightforward comparison between any 2 lists. It can also be used to generate new more stable gene rankings incorporating more information from the experimental data. Using 2 microarray data sets, we show that the proposed method provides more robust gene rankings than existing methods with respect to sampling variations, without compromising the biological significance of the rankings.
Estimation of gene induction enables a relevance-based ranking of gene sets.

PubMed

Bartholomé, Kilian; Kreutz, Clemens; Timmer, Jens

2009-07-01

In order to handle and interpret the vast amounts of data produced by microarray experiments, the analysis of sets of genes with a common biological functionality has been shown to be advantageous compared to single gene analyses. Some statistical methods have been proposed to analyse the differential gene expression of gene sets in microarray experiments. However, most of these methods either require threshhold values to be chosen for the analysis, or they need some reference set for the determination of significance. We present a method that estimates the number of differentially expressed genes in a gene set without requiring a threshold value for significance of genes. The method is self-contained (i.e., it does not require a reference set for comparison). In contrast to other methods which are focused on significance, our approach emphasizes the relevance of the regulation of gene sets. The presented method measures the degree of regulation of a gene set and is a useful tool to compare the induction of different gene sets and place the results of microarray experiments into the biological context. An R-package is available.
Functional gene diversity of soil microbial communities from five oil-contaminated fields in China.

PubMed

Liang, Yuting; Van Nostrand, Joy D; Deng, Ye; He, Zhili; Wu, Liyou; Zhang, Xu; Li, Guanghe; Zhou, Jizhong

2011-03-01

To compare microbial functional diversity in different oil-contaminated fields and to know the effects of oil contaminant and environmental factors, soil samples were taken from typical oil-contaminated fields located in five geographic regions of China. GeoChip, a high-throughput functional gene array, was used to evaluate the microbial functional genes involved in contaminant degradation and in other major biogeochemical/metabolic processes. Our results indicated that the overall microbial community structures were distinct in each oil-contaminated field, and samples were clustered by geographic locations. The organic contaminant degradation genes were most abundant in all samples and presented a similar pattern under oil contaminant stress among the five fields. In addition, alkane and aromatic hydrocarbon degradation genes such as monooxygenase and dioxygenase were detected in high abundance in the oil-contaminated fields. Canonical correspondence analysis indicated that the microbial functional patterns were highly correlated to the local environmental variables, such as oil contaminant concentration, nitrogen and phosphorus contents, salt and pH. Finally, a total of 59% of microbial community variation from GeoChip data can be explained by oil contamination, geographic location and soil geochemical parameters. This study provided insights into the in situ microbial functional structures in oil-contaminated fields and discerned the linkages between microbial communities and environmental variables, which is important to the application of bioremediation in oil-contaminated sites.
Functional gene diversity of soil microbial communities from five oil-contaminated fields in China

PubMed Central

Liang, Yuting; Van Nostrand, Joy D; Deng, Ye; He, Zhili; Wu, Liyou; Zhang, Xu; Li, Guanghe; Zhou, Jizhong

2011-01-01

To compare microbial functional diversity in different oil-contaminated fields and to know the effects of oil contaminant and environmental factors, soil samples were taken from typical oil-contaminated fields located in five geographic regions of China. GeoChip, a high-throughput functional gene array, was used to evaluate the microbial functional genes involved in contaminant degradation and in other major biogeochemical/metabolic processes. Our results indicated that the overall microbial community structures were distinct in each oil-contaminated field, and samples were clustered by geographic locations. The organic contaminant degradation genes were most abundant in all samples and presented a similar pattern under oil contaminant stress among the five fields. In addition, alkane and aromatic hydrocarbon degradation genes such as monooxygenase and dioxygenase were detected in high abundance in the oil-contaminated fields. Canonical correspondence analysis indicated that the microbial functional patterns were highly correlated to the local environmental variables, such as oil contaminant concentration, nitrogen and phosphorus contents, salt and pH. Finally, a total of 59% of microbial community variation from GeoChip data can be explained by oil contamination, geographic location and soil geochemical parameters. This study provided insights into the in situ microbial functional structures in oil-contaminated fields and discerned the linkages between microbial communities and environmental variables, which is important to the application of bioremediation in oil-contaminated sites. PMID:20861922
Think like a sponge: The genetic signal of sensory cells in sponges.

PubMed

Mah, Jasmine L; Leys, Sally P

2017-11-01

A complex genetic repertoire underlies the apparently simple body plan of sponges. Among the genes present in poriferans are those fundamental to the sensory and nervous systems of other animals. Sponges are dynamic and sensitive animals and it is intuitive to link these genes to behaviour. The proposal that ctenophores are the earliest diverging metazoan has led to the question of whether sponges possess a 'pre-nervous' system or have undergone nervous system loss. Both lines of thought generally assume that the last common ancestor of sponges and eumetazoans possessed the genetic modules that underlie sensory abilities. By corollary extant sponges may possess a sensory cell homologous to one present in the last common ancestor, a hypothesis that has been studied by gene expression. We have performed a meta-analysis of all gene expression studies published to date to explore whether gene expression is indicative of a feature's sensory function. In sponges we find that eumetazoan sensory-neural markers are not particularly expressed in structures with known sensory functions. Instead it is common for these genes to be expressed in cells with no known or uncharacterized sensory function. Indeed, many sensory-neural markers so far studied are expressed during development, perhaps because many are transcription factors. This suggests that the genetic signal of a sponge sensory cell is dissimilar enough to be unrecognizable when compared to a bilaterian sensory or neural cell. It is possible that sensory-neural markers have as yet unknown functions in sponge cells, such as assembling an immunological synapse in the larval globular cell. Furthermore, the expression of sensory-neural markers in non-sensory cells, such as adult and larval epithelial cells, suggest that these cells may have uncharacterized sensory functions. While this does not rule out the co-option of ancestral sensory modules in later evolving groups, a distinct genetic foundation may underlie the sponge sensory system. Copyright © 2017 Elsevier Inc. All rights reserved.
Single-Cell RNA-Seq Reveals the Transcriptional Landscape and Heterogeneity of Aortic Macrophages in Murine Atherosclerosis.

PubMed

Cochain, Clément; Vafadarnejad, Ehsan; Arampatzi, Panagiota; Jaroslav, Pelisek; Winkels, Holger; Ley, Klaus; Wolf, Dennis; Saliba, Antoine-Emmanuel; Zernecke, Alma

2018-03-15

Rationale: It is assumed that atherosclerotic arteries contain several macrophage subsets endowed with specific functions. The precise identity of these subsets is poorly characterized as they ha ve been defined by the expression of a restricted number of markers. Objective: We have applied single-cell RNA-seq as an unbiased profiling strategy to interrogate and classify aortic macrophage heterogeneity at the single-cell level in atherosclerosis. Methods and Results: We performed single-cell RNA sequencing of total aortic CD45 + cells extracted from the non-diseased (chow fed) and atherosclerotic (11 weeks of high fat diet) aorta of Ldlr -/- mice. Unsupervised clustering singled out 13 distinct aortic cell clusters. Among the myeloid cell populations, Resident-like macrophages with a gene expression profile similar to aortic resident macrophages were found in healthy and diseased aortae, whereas monocytes, monocyte-derived dendritic cells (MoDC), and two populations of macrophages were almost exclusively detectable in atherosclerotic aortae, comprising Inflammatory macrophages showing enrichment in I l1b , and previously undescribed TREM2 hi macrophages. Differential gene expression and gene ontology enrichment analyses revealed specific gene expression patterns distinguishing these three macrophage subsets and MoDC, and uncovered putative functions of each cell type. Notably, TREM2 hi macrophages appeared to be endowed with specialized functions in lipid metabolism and catabolism, and presented a gene expression signature reminiscent of osteoclasts, suggesting a role in lesion calcification. TREM2 expression was moreover detected in human lesional macrophages. Importantly, these macrophage populations were present also in advanced atherosclerosis and in Apoe -/- aortae, indicating relevance of our findings in different stages of atherosclerosis and mouse models. Conclusions: These data unprecedentedly uncovered the transcriptional landscape and phenotypic heterogeneity of aortic macrophages and MoDCs in atherosclerotic and identified previously unrecognized macrophage populations and their gene expression signature, suggesting specialized functions. Our findings will open up novel opportunities to explore distinct myeloid cell populations and their functions in atherosclerosis.
Impact of Cigarette Smoke on the Human and Mouse Lungs: A Gene-Expression Comparison Study

PubMed Central

Morissette, Mathieu C.; Lamontagne, Maxime; Bérubé, Jean-Christophe; Gaschler, Gordon; Williams, Andrew; Yauk, Carole; Couture, Christian; Laviolette, Michel; Hogg, James C.; Timens, Wim; Halappanavar, Sabina; Stampfli, Martin R.; Bossé, Yohan

2014-01-01

Cigarette smoke is well known for its adverse effects on human health, especially on the lungs. Basic research is essential to identify the mechanisms involved in the development of cigarette smoke-related diseases, but translation of new findings from pre-clinical models to the clinic remains difficult. In the present study, we aimed at comparing the gene expression signature between the lungs of human smokers and mice exposed to cigarette smoke to identify the similarities and differences. Using human and mouse whole-genome gene expression arrays, changes in gene expression, signaling pathways and biological functions were assessed. We found that genes significantly modulated by cigarette smoke in humans were enriched for genes modulated by cigarette smoke in mice, suggesting a similar response of both species. Sixteen smoking-induced genes were in common between humans and mice including six newly reported to be modulated by cigarette smoke. In addition, we identified a new conserved pulmonary response to cigarette smoke in the induction of phospholipid metabolism/degradation pathways. Finally, the majority of biological functions modulated by cigarette smoke in humans were also affected in mice. Altogether, the present study provides information on similarities and differences in lung gene expression response to cigarette smoke that exist between human and mouse. Our results foster the idea that animal models should be used to study the involvement of pathways rather than single genes in human diseases. PMID:24663285
CRISPR-Cas9 and CRISPR-Cpf1 mediated targeting of a stomatal developmental gene EPFL9 in rice.

PubMed

Yin, Xiaojia; Biswal, Akshaya K; Dionora, Jacqueline; Perdigon, Kristel M; Balahadia, Christian P; Mazumdar, Shamik; Chater, Caspar; Lin, Hsiang-Chun; Coe, Robert A; Kretzschmar, Tobias; Gray, Julie E; Quick, Paul W; Bandyopadhyay, Anindya

2017-05-01

CRISPR-Cas9/Cpf1 system with its unique gene targeting efficiency, could be an important tool for functional study of early developmental genes through the generation of successful knockout plants. The introduction and utilization of systems biology approaches have identified several genes that are involved in early development of a plant and with such knowledge a robust tool is required for the functional validation of putative candidate genes thus obtained. The development of the CRISPR-Cas9/Cpf1 genome editing system has provided a convenient tool for creating loss of function mutants for genes of interest. The present study utilized CRISPR/Cas9 and CRISPR-Cpf1 technology to knock out an early developmental gene EPFL9 (Epidermal Patterning Factor like-9, a positive regulator of stomatal development in Arabidopsis) orthologue in rice. Germ-line mutants that were generated showed edits that were carried forward into the T2 generation when Cas9-free homozygous mutants were obtained. The homozygous mutant plants showed more than an eightfold reduction in stomatal density on the abaxial leaf surface of the edited rice plants. Potential off-target analysis showed no significant off-target effects. This study also utilized the CRISPR-LbCpf1 (Lachnospiracae bacterium Cpf1) to target the same OsEPFL9 gene to test the activity of this class-2 CRISPR system in rice and found that Cpf1 is also capable of genome editing and edits get transmitted through generations with similar phenotypic changes seen with CRISPR-Cas9. This study demonstrates the application of CRISPR-Cas9/Cpf1 to precisely target genomic locations and develop transgene-free homozygous heritable gene edits and confirms that the loss of function analysis of the candidate genes emerging from different systems biology based approaches, could be performed, and therefore, this system adds value in the validation of gene function studies.
GOMA: functional enrichment analysis tool based on GO modules

PubMed Central

Huang, Qiang; Wu, Ling-Yun; Wang, Yong; Zhang, Xiang-Sun

2013-01-01

Analyzing the function of gene sets is a critical step in interpreting the results of high-throughput experiments in systems biology. A variety of enrichment analysis tools have been developed in recent years, but most output a long list of significantly enriched terms that are often redundant, making it difficult to extract the most meaningful functions. In this paper, we present GOMA, a novel enrichment analysis method based on the new concept of enriched functional Gene Ontology (GO) modules. With this method, we systematically revealed functional GO modules, i.e., groups of functionally similar GO terms, via an optimization model and then ranked them by enrichment scores. Our new method simplifies enrichment analysis results by reducing redundancy, thereby preventing inconsistent enrichment results among functionally similar terms and providing more biologically meaningful results. PMID:23237213
Marsupials and monotremes possess a novel family of MHC class I genes that is lost from the eutherian lineage.

PubMed

Papenfuss, Anthony T; Feng, Zhi-Ping; Krasnec, Katina; Deakin, Janine E; Baker, Michelle L; Miller, Robert D

2015-07-22

Major histocompatibility complex (MHC) class I genes are found in the genomes of all jawed vertebrates. The evolution of this gene family is closely tied to the evolution of the vertebrate genome. Family members are frequently found in four paralogous regions, which were formed in two rounds of genome duplication in the early vertebrates, but in some species class Is have been subject to additional duplication or translocation, creating additional clusters. The gene family is traditionally grouped into two subtypes: classical MHC class I genes that are usually MHC-linked, highly polymorphic, expressed in a broad range of tissues and present endogenously-derived peptides to cytotoxic T-cells; and non-classical MHC class I genes generally have lower polymorphism, may have tissue-specific expression and have evolved to perform immune-related or non-immune functions. As immune genes can evolve rapidly and are subject to different selection pressure, we hypothesised that there may be divergent, as yet unannotated or uncharacterised class I genes. Application of a novel method of sensitive genome searching of available vertebrate genome sequences revealed a new, extensive sub-family of divergent MHC class I genes, denoted as UT, which has not previously been characterized. These class I genes are found in both American and Australian marsupials, and in monotremes, at an evolutionary chromosomal breakpoint, but are not present in non-mammalian genomes and have been lost from the eutherian lineage. We show that UT family members are expressed in the thymus of the gray short-tailed opossum and in other immune tissues of several Australian marsupials. Structural homology modelling shows that the proteins encoded by this family are predicted to have an open, though short, antigen-binding groove. We have identified a novel sub-family of putatively non-classical MHC class I genes that are specific to marsupials and monotremes. This family was present in the ancestral mammal and is found in extant marsupials and monotremes, but has been lost from the eutherian lineage. The function of this family is as yet unknown, however, their predicted structure may be consistent with presentation of antigens to T-cells.
Knowledge Driven Variable Selection (KDVS) – a new approach to enrichment analysis of gene signatures obtained from high–throughput data

PubMed Central

2013-01-01

Background High–throughput (HT) technologies provide huge amount of gene expression data that can be used to identify biomarkers useful in the clinical practice. The most frequently used approaches first select a set of genes (i.e. gene signature) able to characterize differences between two or more phenotypical conditions, and then provide a functional assessment of the selected genes with an a posteriori enrichment analysis, based on biological knowledge. However, this approach comes with some drawbacks. First, gene selection procedure often requires tunable parameters that affect the outcome, typically producing many false hits. Second, a posteriori enrichment analysis is based on mapping between biological concepts and gene expression measurements, which is hard to compute because of constant changes in biological knowledge and genome analysis. Third, such mapping is typically used in the assessment of the coverage of gene signature by biological concepts, that is either score–based or requires tunable parameters as well, limiting its power. Results We present Knowledge Driven Variable Selection (KDVS), a framework that uses a priori biological knowledge in HT data analysis. The expression data matrix is transformed, according to prior knowledge, into smaller matrices, easier to analyze and to interpret from both computational and biological viewpoints. Therefore KDVS, unlike most approaches, does not exclude a priori any function or process potentially relevant for the biological question under investigation. Differently from the standard approach where gene selection and functional assessment are applied independently, KDVS embeds these two steps into a unified statistical framework, decreasing the variability derived from the threshold–dependent selection, the mapping to the biological concepts, and the signature coverage. We present three case studies to assess the usefulness of the method. Conclusions We showed that KDVS not only enables the selection of known biological functionalities with accuracy, but also identification of new ones. An efficient implementation of KDVS was devised to obtain results in a fast and robust way. Computing time is drastically reduced by the effective use of distributed resources. Finally, integrated visualization techniques immediately increase the interpretability of results. Overall, KDVS approach can be considered as a viable alternative to enrichment–based approaches. PMID:23302187
Characterization and detection of a widely distributed gene cluster that predicts anaerobic choline utilization by human gut bacteria.

PubMed

Martínez-del Campo, Ana; Bodea, Smaranda; Hamer, Hilary A; Marks, Jonathan A; Haiser, Henry J; Turnbaugh, Peter J; Balskus, Emily P

2015-04-14

Elucidation of the molecular mechanisms underlying the human gut microbiota's effects on health and disease has been complicated by difficulties in linking metabolic functions associated with the gut community as a whole to individual microorganisms and activities. Anaerobic microbial choline metabolism, a disease-associated metabolic pathway, exemplifies this challenge, as the specific human gut microorganisms responsible for this transformation have not yet been clearly identified. In this study, we established the link between a bacterial gene cluster, the choline utilization (cut) cluster, and anaerobic choline metabolism in human gut isolates by combining transcriptional, biochemical, bioinformatic, and cultivation-based approaches. Quantitative reverse transcription-PCR analysis and in vitro biochemical characterization of two cut gene products linked the entire cluster to growth on choline and supported a model for this pathway. Analyses of sequenced bacterial genomes revealed that the cut cluster is present in many human gut bacteria, is predictive of choline utilization in sequenced isolates, and is widely but discontinuously distributed across multiple bacterial phyla. Given that bacterial phylogeny is a poor marker for choline utilization, we were prompted to develop a degenerate PCR-based method for detecting the key functional gene choline TMA-lyase (cutC) in genomic and metagenomic DNA. Using this tool, we found that new choline-metabolizing gut isolates universally possessed cutC. We also demonstrated that this gene is widespread in stool metagenomic data sets. Overall, this work represents a crucial step toward understanding anaerobic choline metabolism in the human gut microbiota and underscores the importance of examining this microbial community from a function-oriented perspective. Anaerobic choline utilization is a bacterial metabolic activity that occurs in the human gut and is linked to multiple diseases. While bacterial genes responsible for choline fermentation (the cut gene cluster) have been recently identified, there has been no characterization of these genes in human gut isolates and microbial communities. In this work, we use multiple approaches to demonstrate that the pathway encoded by the cut genes is present and functional in a diverse range of human gut bacteria and is also widespread in stool metagenomes. We also developed a PCR-based strategy to detect a key functional gene (cutC) involved in this pathway and applied it to characterize newly isolated choline-utilizing strains. Both our analyses of the cut gene cluster and this molecular tool will aid efforts to further understand the role of choline metabolism in the human gut microbiota and its link to disease. Copyright © 2015 Martínez-del Campo et al.
Functional characterization of a prokaryotic Kir channel.

PubMed

Enkvetchakul, Decha; Bhattacharyya, Jaya; Jeliazkova, Iana; Groesbeck, Darcy K; Cukras, Catherine A; Nichols, Colin G

2004-11-05

The Kir gene family encodes inward rectifying K+ (Kir) channels that are widespread and critical regulators of excitability in eukaryotic cells. A related gene family (KirBac) has recently been identified in prokaryotes. While a crystal structure of one member, Kir-Bac1.1, has been solved, there has been no functional characterization of any KirBac gene products. Here we present functional characterization of KirBac1.1 reconstituted in liposomes. Utilizing a 86Rb+ uptake assay, we demonstrate that KirBac1.1 generates a K+ -selective permeation path that is inhibited by extraliposomal Ba2+ and Ca2+ ions. In contrast to KcsA (an acid-activated bacterial potassium channel), KirBac1.1 is inhibited by extraliposomal acid (pKa approximately 6). This characterization of KirBac1.1 activity now paves the way for further correlation of structure and function in this model Kir channel.
Conservation of regulatory sequences and gene expression patterns in the disintegrating Drosophila Hox gene complex

PubMed Central

Negre, Bárbara; Casillas, Sònia; Suzanne, Magali; Sánchez-Herrero, Ernesto; Akam, Michael; Nefedov, Michael; Barbadilla, Antonio; de Jong, Pieter; Ruiz, Alfredo

2005-01-01

Homeotic (Hox) genes are usually clustered and arranged in the same order as they are expressed along the anteroposterior body axis of metazoans. The mechanistic explanation for this colinearity has been elusive, and it may well be that a single and universal cause does not exist. The Hox-gene complex (HOM-C) has been rearranged differently in several Drosophila species, producing a striking diversity of Hox gene organizations. We investigated the genomic and functional consequences of the two HOM-C splits present in Drosophila buzzatii. Firstly, we sequenced two regions of the D. buzzatii genome, one containing the genes labial and abdominal A, and another one including proboscipedia, and compared their organization with that of D. melanogaster and D. pseudoobscura in order to map precisely the two splits. Then, a plethora of conserved noncoding sequences, which are putative enhancers, were identified around the three Hox genes closer to the splits. The position and order of these enhancers are conserved, with minor exceptions, between the three Drosophila species. Finally, we analyzed the expression patterns of the same three genes in embryos and imaginal discs of four Drosophila species with different Hox-gene organizations. The results show that their expression patterns are conserved despite the HOM-C splits. We conclude that, in Drosophila, Hox-gene clustering is not an absolute requirement for proper function. Rather, the organization of Hox genes is modular, and their clustering seems the result of phylogenetic inertia more than functional necessity. PMID:15867430
BioGraph: unsupervised biomedical knowledge discovery via automated hypothesis generation

PubMed Central

2011-01-01

We present BioGraph, a data integration and data mining platform for the exploration and discovery of biomedical information. The platform offers prioritizations of putative disease genes, supported by functional hypotheses. We show that BioGraph can retrospectively confirm recently discovered disease genes and identify potential susceptibility genes, outperforming existing technologies, without requiring prior domain knowledge. Additionally, BioGraph allows for generic biomedical applications beyond gene discovery. BioGraph is accessible at http://www.biograph.be. PMID:21696594
A dual selection based, targeted gene replacement tool for Magnaporthe grisea and Fusarium oxysporum.

PubMed

Khang, Chang Hyun; Park, Sook-Young; Lee, Yong-Hwan; Kang, Seogchan

2005-06-01

Rapid progress in fungal genome sequencing presents many new opportunities for functional genomic analysis of fungal biology through the systematic mutagenesis of the genes identified through sequencing. However, the lack of efficient tools for targeted gene replacement is a limiting factor for fungal functional genomics, as it often necessitates the screening of a large number of transformants to identify the desired mutant. We developed an efficient method of gene replacement and evaluated factors affecting the efficiency of this method using two plant pathogenic fungi, Magnaporthe grisea and Fusarium oxysporum. This method is based on Agrobacterium tumefaciens-mediated transformation with a mutant allele of the target gene flanked by the herpes simplex virus thymidine kinase (HSVtk) gene as a conditional negative selection marker against ectopic transformants. The HSVtk gene product converts 5-fluoro-2'-deoxyuridine to a compound toxic to diverse fungi. Because ectopic transformants express HSVtk, while gene replacement mutants lack HSVtk, growing transformants on a medium amended with 5-fluoro-2'-deoxyuridine facilitates the identification of targeted mutants by counter-selecting against ectopic transformants. In addition to M. grisea and F. oxysporum, the method and associated vectors are likely to be applicable to manipulating genes in a broad spectrum of fungi, thus potentially serving as an efficient, universal functional genomic tool for harnessing the growing body of fungal genome sequence data to study fungal biology.
Selecting and validating reference genes for quantitative real-time PCR in Plutella xylostella (L.).

PubMed

You, Yanchun; Xie, Miao; Vasseur, Liette; You, Minsheng

2018-05-01

Gene expression analysis provides important clues regarding gene functions, and quantitative real-time PCR (qRT-PCR) is a widely used method in gene expression studies. Reference genes are essential for normalizing and accurately assessing gene expression. In the present study, 16 candidate reference genes (ACTB, CyPA, EF1-α, GAPDH, HSP90, NDPk, RPL13a, RPL18, RPL19, RPL32, RPL4, RPL8, RPS13, RPS4, α-TUB, and β-TUB) from Plutella xylostella were selected to evaluate gene expression stability across different experimental conditions using five statistical algorithms (geNorm, NormFinder, Delta Ct, BestKeeper, and RefFinder). The results suggest that different reference genes or combinations of reference genes are suitable for normalization in gene expression studies of P. xylostella according to the different developmental stages, strains, tissues, and insecticide treatments. Based on the given experimental sets, the most stable reference genes were RPS4 across different developmental stages, RPL8 across different strains and tissues, and EF1-α across different insecticide treatments. A comprehensive and systematic assessment of potential reference genes for gene expression normalization is essential for post-genomic functional research in P. xylostella, a notorious pest with worldwide distribution and a high capacity to adapt and develop resistance to insecticides.
Saccharomyces cerevisiae Bat1 and Bat2 Aminotransferases Have Functionally Diverged from the Ancestral-Like Kluyveromyces lactis Orthologous Enzyme

PubMed Central

Colón, Maritrini; Hernández, Fabiola; López, Karla; Quezada, Héctor; González, James; López, Geovani; Aranda, Cristina; González, Alicia

2011-01-01

Background Gene duplication is a key evolutionary mechanism providing material for the generation of genes with new or modified functions. The fate of duplicated gene copies has been amply discussed and several models have been put forward to account for duplicate conservation. The specialization model considers that duplication of a bifunctional ancestral gene could result in the preservation of both copies through subfunctionalization, resulting in the distribution of the two ancestral functions between the gene duplicates. Here we investigate whether the presumed bifunctional character displayed by the single branched chain amino acid aminotransferase present in K. lactis has been distributed in the two paralogous genes present in S. cerevisiae, and whether this conservation has impacted S. cerevisiae metabolism. Principal Findings Our results show that the KlBat1 orthologous BCAT is a bifunctional enzyme, which participates in the biosynthesis and catabolism of branched chain aminoacids (BCAAs). This dual role has been distributed in S. cerevisiae Bat1 and Bat2 paralogous proteins, supporting the specialization model posed to explain the evolution of gene duplications. BAT1 is highly expressed under biosynthetic conditions, while BAT2 expression is highest under catabolic conditions. Bat1 and Bat2 differential relocalization has favored their physiological function, since biosynthetic precursors are generated in the mitochondria (Bat1), while catabolic substrates are accumulated in the cytosol (Bat2). Under respiratory conditions, in the presence of ammonium and BCAAs the bat1Δ bat2Δ double mutant shows impaired growth, indicating that Bat1 and Bat2 could play redundant roles. In K. lactis wild type growth is independent of BCAA degradation, since a Klbat1Δ mutant grows under this condition. Conclusions Our study shows that BAT1 and BAT2 differential expression and subcellular relocalization has resulted in the distribution of the biosynthetic and catabolic roles of the ancestral BCAT in two isozymes improving BCAAs metabolism and constituting an adaptation to facultative metabolism. PMID:21267457

Integrated annotation and analysis of in situ hybridization images using the ImAnno system: application to the ear and sensory organs of the fetal mouse.

PubMed

Romand, Raymond; Ripp, Raymond; Poidevin, Laetitia; Boeglin, Marcel; Geffers, Lars; Dollé, Pascal; Poch, Olivier

2015-01-01

An in situ hybridization (ISH) study was performed on 2000 murine genes representing around 10% of the protein-coding genes present in the mouse genome using data generated by the EURExpress consortium. This study was carried out in 25 tissues of late gestation embryos (E14.5), with a special emphasis on the developing ear and on five distinct developing sensory organs, including the cochlea, the vestibular receptors, the sensory retina, the olfactory organ, and the vibrissae follicles. The results obtained from an analysis of more than 11,000 micrographs have been integrated in a newly developed knowledgebase, called ImAnno. In addition to managing the multilevel micrograph annotations performed by human experts, ImAnno provides public access to various integrated databases and tools. Thus, it facilitates the analysis of complex ISH gene expression patterns, as well as functional annotation and interaction of gene sets. It also provides direct links to human pathways and diseases. Hierarchical clustering of expression patterns in the 25 tissues revealed three main branches corresponding to tissues with common functions and/or embryonic origins. To illustrate the integrative power of ImAnno, we explored the expression, function and disease traits of the sensory epithelia of the five presumptive sensory organs. The study identified 623 genes (out of 2000) concomitantly expressed in the five embryonic epithelia, among which many (∼12%) were involved in human disorders. Finally, various multilevel interaction networks were characterized, highlighting differential functional enrichments of directly or indirectly interacting genes. These analyses exemplify an under-represention of "sensory" functions in the sensory gene set suggests that E14.5 is a pivotal stage between the developmental stage and the functional phase that will be fully reached only after birth.
Molecular characterization of locus of enterocyte effacement pathogenicity island in shigatoxic Escherichia coli isolated from human & cattle in West Bengal, India

PubMed Central

Das, Suresh Chandra; Ramamurthy, Thandavanaryanalu; Ghosh, Santanu; Pazhani, Gururaja Perumal; Sen, Tista; Singh, Raghubir

2017-01-01

Background & objectives: Shigatoxic Escherichia coli (STEC) recovered from dairy animals of Kolkata, India, harboured the putative virulence genes; however, the animals did not exhibit clinical symptoms. Similarly, human isolates in this locality also showed variations in degree of symptoms. Hence, this study was designed to know the presence of recognized gene(s) in the locus of enterocyte effacement (LEE) pathogenicity island in these STEC isolates and functional status of the cardinal gene (eae) related to pathogenicity. Methods: Genes were characterized using polymerase chain reaction (PCR) assays, and functional status of cardinal gene (eae) was evaluated by fluorescent actin staining (FAS) assay. Variation in eae gene was determined by intimin PCR. Results: Cattle STEC isolates carried 22 genes in LEE pathogenicity island in different frequencies ranging from 5.63 to 47.88 per cent of the isolates. In human isolates, the genes namely ler, escRSTU, orf2, escC, escV, orf3 and tir that are associated with secretory function, were found to be absent and rest of the genes were present in lower frequency. Further, the cardinal gene (eae) responsible for initiation of pathogenesis was in a very low frequency in human (n=2; 10.5%) and cattle (n=11; 15.5%) isolates. None of these eae+ STEC isolates from human and cattle revealed positivity in FAS assay. Interpretation & conclusions: Majority of human STEC isolates lacked the cardinal virulence gene (eae), and genes for secretory function that are essential for facilitating pathogenesis. This may partially be attributed to low occurrence of STEC in human clinical diarrhoea in this area. Although a few isolates (11 of 71) from cattle had eae gene, they did not express phenotypically. This could be one of the reasons for not appearing of clinical symptoms in the hosts. PMID:29205193
In-silico analysis of cis-acting regulatory elements of pathogenesis-related proteins of Arabidopsis thaliana and Oryza sativa

PubMed Central

Kaur, Amritpreet; Pati, Pratap Kumar; Pati, Aparna Maitra; Nagpal, Avinash Kaur

2017-01-01

Pathogenesis related (PR) proteins are low molecular weight family of proteins induced in plants under various biotic and abiotic stresses. They play an important role in plant-defense mechanism. PRs have wide range of functions, acting as hydrolases, peroxidases, chitinases, anti-fungal, protease inhibitors etc. In the present study, an attempt has been made to analyze promoter regions of PR1, PR2, PR5, PR9, PR10 and PR12 of Arabidopsis thaliana and Oryza sativa. Analysis of cis-element distribution revealed the functional multiplicity of PRs and provides insight into the gene regulation. CpG islands are observed only in rice PRs, which indicates that monocot genome contains more GC rich motifs than dicots. Tandem repeats were also observed in 5’ UTR of PR genes. Thus, the present study provides an understanding of regulation of PR genes and their versatile roles in plants. PMID:28910327
In-silico analysis of cis-acting regulatory elements of pathogenesis-related proteins of Arabidopsis thaliana and Oryza sativa.

PubMed

Kaur, Amritpreet; Pati, Pratap Kumar; Pati, Aparna Maitra; Nagpal, Avinash Kaur

2017-01-01

Pathogenesis related (PR) proteins are low molecular weight family of proteins induced in plants under various biotic and abiotic stresses. They play an important role in plant-defense mechanism. PRs have wide range of functions, acting as hydrolases, peroxidases, chitinases, anti-fungal, protease inhibitors etc. In the present study, an attempt has been made to analyze promoter regions of PR1, PR2, PR5, PR9, PR10 and PR12 of Arabidopsis thaliana and Oryza sativa. Analysis of cis-element distribution revealed the functional multiplicity of PRs and provides insight into the gene regulation. CpG islands are observed only in rice PRs, which indicates that monocot genome contains more GC rich motifs than dicots. Tandem repeats were also observed in 5' UTR of PR genes. Thus, the present study provides an understanding of regulation of PR genes and their versatile roles in plants.
More than anticipated - production of antibiotics and other secondary metabolites by Bacillus amyloliquefaciens FZB42.

PubMed

Chen, Xiao-Hua; Koumoutsi, Alexandra; Scholz, Romy; Borriss, Rainer

2009-01-01

The genome of environmental Bacillus amyloliquefaciens FZB42 harbors numerous gene clusters involved in synthesis of antifungal and antibacterial acting secondary metabolites. Five gene clusters, srf, bmy, fen, nrs, dhb, covering altogether 137 kb, direct non-ribosomal synthesis of the cyclic lipopeptides surfactin, bacillomycin, fengycin, an unknown peptide, and the iron siderophore bacillibactin. Bacillomycin and fengycin were shown to act against phytopathogenic fungi in a synergistic manner. Three gene clusters, mln, bae, and dif, with a total length of 199 kb were shown to direct synthesis of the antibacterial acting polyketides macrolactin, bacillaene, and difficidin. Both, non-ribosomal synthesis of cyclic lipopeptides and synthesis of polyketides are dependent on the presence of a functional sfp gene product, 4'-phosphopantetheinyl transferase, as evidenced by knockout mutation of the sfp gene resulting in complete absence of all those eight compounds. In addition, here we present evidence that a gene cluster encoding enzymes involved in synthesis and export of the antibacterial acting dipeptide bacilysin is also functional in FZB42. In summary, environmental FZB42 devoted about 340 kb, corresponding to 8.5% of its total genetic capacity, to synthesis of secondary metabolites useful to cope with other competing microorganisms present in the plant rhizosphere. Copyright (c) 2008 S. Karger AG, Basel.
Periodic, Quasi-periodic and Chaotic Dynamics in Simple Gene Elements with Time Delays

PubMed Central

Suzuki, Yoko; Lu, Mingyang; Ben-Jacob, Eshel; Onuchic, José N.

2016-01-01

Regulatory gene circuit motifs play crucial roles in performing and maintaining vital cellular functions. Frequently, theoretical studies of gene circuits focus on steady-state behaviors and do not include time delays. In this study, the inclusion of time delays is shown to entirely change the time-dependent dynamics for even the simplest possible circuits with one and two gene elements with self and cross regulations. These elements can give rise to rich behaviors including periodic, quasi-periodic, weak chaotic, strong chaotic and intermittent dynamics. We introduce a special power-spectrum-based method to characterize and discriminate these dynamical modes quantitatively. Our simulation results suggest that, while a single negative feedback loop of either one- or two-gene element can only have periodic dynamics, the elements with two positive/negative feedback loops are the minimalist elements to have chaotic dynamics. These elements typically have one negative feedback loop that generates oscillations, and another unit that allows frequent switches among multiple steady states or between oscillatory and non-oscillatory dynamics. Possible dynamical features of several simple one- and two-gene elements are presented in details. Discussion is presented for possible roles of the chaotic behavior in the robustness of cellular functions and diseases, for example, in the context of cancer. PMID:26876008
Periodic, Quasi-periodic and Chaotic Dynamics in Simple Gene Elements with Time Delays

NASA Astrophysics Data System (ADS)

Suzuki, Yoko; Lu, Mingyang; Ben-Jacob, Eshel; Onuchic, José N.

2016-02-01

Regulatory gene circuit motifs play crucial roles in performing and maintaining vital cellular functions. Frequently, theoretical studies of gene circuits focus on steady-state behaviors and do not include time delays. In this study, the inclusion of time delays is shown to entirely change the time-dependent dynamics for even the simplest possible circuits with one and two gene elements with self and cross regulations. These elements can give rise to rich behaviors including periodic, quasi-periodic, weak chaotic, strong chaotic and intermittent dynamics. We introduce a special power-spectrum-based method to characterize and discriminate these dynamical modes quantitatively. Our simulation results suggest that, while a single negative feedback loop of either one- or two-gene element can only have periodic dynamics, the elements with two positive/negative feedback loops are the minimalist elements to have chaotic dynamics. These elements typically have one negative feedback loop that generates oscillations, and another unit that allows frequent switches among multiple steady states or between oscillatory and non-oscillatory dynamics. Possible dynamical features of several simple one- and two-gene elements are presented in details. Discussion is presented for possible roles of the chaotic behavior in the robustness of cellular functions and diseases, for example, in the context of cancer.
Development of an efficient genetic manipulation strategy for sequential gene disruption and expression of different heterologous GFP genes in Candida tropicalis.

PubMed

Zhang, Lihua; Chen, Xianzhong; Chen, Zhen; Wang, Zezheng; Jiang, Shan; Li, Li; Pötter, Markus; Shen, Wei; Fan, You

2016-11-01

The diploid yeast Candida tropicalis, which can utilize n-alkane as a carbon and energy source, is an attractive strain for both physiological studies and practical applications. However, it presents some characteristics, such as rare codon usage, difficulty in sequential gene disruption, and inefficiency in foreign gene expression, that hamper strain improvement through genetic engineering. In this work, we present a simple and effective method for sequential gene disruption in C. tropicalis based on the use of an auxotrophic mutant host defective in orotidine monophosphate decarboxylase (URA3). The disruption cassette, which consists of a functional yeast URA3 gene flanked by a 0.3 kb gene disruption auxiliary sequence (gda) direct repeat derived from downstream or upstream of the URA3 gene and of homologous arms of the target gene, was constructed and introduced into the yeast genome by integrative transformation. Stable integrants were isolated by selection for Ura + and identified by PCR and sequencing. The important feature of this construct, which makes it very attractive, is that recombination between the flanking direct gda repeats occurs at a high frequency (10 -8 ) during mitosis. After excision of the URA3 marker, only one copy of the gda sequence remains at the recombinant locus. Thus, the resulting ura3 strain can be used again to disrupt a second allelic gene in a similar manner. In addition to this effective sequential gene disruption method, a codon-optimized green fluorescent protein-encoding gene (GFP) was functionally expressed in C. tropicalis. Thus, we propose a simple and reliable method to improve C. tropicalis by genetic manipulation.
Development of Novel Nonagonist PPAR-Gamma Ligands for Lung Cancer Treatment

DTIC Science & Technology

2016-08-01

Affymetrix gene expression profiling. To get the purest representation of this gene set, we generated fibroblasts from the brown adipose tissue of mice... tissues . It has been shown that p53 plays an important role in metabolism and adipose tissue function, and this may be modulated by PPARγ expression as...presentations. Poster Presentation: Melin J. Khandekar, Alex S. Banks , Dina Laznik- Bogoslavski, James P. White, Jang H. Choi, Kwok-kin Wong, Ted
Summary of mutations underlying autosomal recessive congenital ichthyoses (ARCI) in Arabs with four novel mutations in ARCI-related genes from the United Arab Emirates.

PubMed

Bastaki, Fatma; Mohamed, Madiha; Nair, Pratibha; Saif, Fatima; Mustafa, Ethar M; Bizzari, Sami; Al-Ali, Mahmoud T; Hamzeh, Abdul Rezzak

2017-05-01

Clinical and molecular heterogeneity is a prominent characteristic of congenital ichthyoses, with the involvement of numerous causative loci. Mutations in these loci feature in autosomal recessive congenital ichthyoses (ARCIs) quite variably, with certain genes/mutations being more frequently uncovered in particular populations. In this study, we used whole exome sequencing as well as direct Sanger sequencing to uncover four novel mutations in ARCI-related genes, which were found in families from the United Arab Emirates. In silico tools such as CADD and SIFT Indel were used to predict the functional consequences of these mutations. The here-presented mutations occurred in three genes (ALOX12B, TGM1, ABCA12), and these are a mixture of missense and indel variants with damaging functional consequences on their encoded proteins. This study presents an overview of the mutations that were found in ARCI-related genes in Arabs and discusses molecular and clinical details pertaining to the above-mentioned Emirati cases and their novel mutations with special emphasis on the resulting protein changes. © 2017 The International Society of Dermatology.
Genome-wide analysis of the Solanum tuberosum (potato) trehalose-6-phosphate synthase (TPS) gene family: evolution and differential expression during development and stress.

PubMed

Xu, Yingchun; Wang, Yanjie; Mattson, Neil; Yang, Liu; Jin, Qijiang

2017-12-01

Trehalose-6-phosphate synthase (TPS) serves important functions in plant desiccation tolerance and response to environmental stimuli. At present, a comprehensive analysis, i.e. functional classification, molecular evolution, and expression patterns of this gene family are still lacking in Solanum tuberosum (potato). In this study, a comprehensive analysis of the TPS gene family was conducted in potato. A total of eight putative potato TPS genes (StTPSs) were identified by searching the latest potato genome sequence. The amino acid identity among eight StTPSs varied from 59.91 to 89.54%. Analysis of d N /d S ratios suggested that regions in the TPP (trehalose-6-phosphate phosphatase) domains evolved faster than the TPS domains. Although the sequence of the eight StTPSs showed high similarity (2571-2796 bp), their gene length is highly differentiated (3189-8406 bp). Many of the regulatory elements possibly related to phytohormones, abiotic stress and development were identified in different TPS genes. Based on the phylogenetic tree constructed using TPS genes of potato, and four other Solanaceae plants, TPS genes could be categorized into 6 distinct groups. Analysis revealed that purifying selection most likely played a major role during the evolution of this family. Amino acid changes detected in specific branches of the phylogenetic tree suggests relaxed constraints might have contributed to functional divergence among groups. Moreover, StTPSs were found to exhibit tissue and treatment specific expression patterns upon analysis of transcriptome data, and performing qRT-PCR. This study provides a reference for genome-wide identification of the potato TPS gene family and sets a framework for further functional studies of this important gene family in development and stress response.
In silico experiment system for testing hypothesis on gene functions using three condition specific biological networks.

PubMed

Lee, Chai-Jin; Kang, Dongwon; Lee, Sangseon; Lee, Sunwon; Kang, Jaewoo; Kim, Sun

2018-05-25

Determining functions of a gene requires time consuming, expensive biological experiments. Scientists can speed up this experimental process if the literature information and biological networks can be adequately provided. In this paper, we present a web-based information system that can perform in silico experiments of computationally testing hypothesis on the function of a gene. A hypothesis that is specified in English by the user is converted to genes using a literature and knowledge mining system called BEST. Condition-specific TF, miRNA and PPI (protein-protein interaction) networks are automatically generated by projecting gene and miRNA expression data to template networks. Then, an in silico experiment is to test how well the target genes are connected from the knockout gene through the condition-specific networks. The test result visualizes path from the knockout gene to the target genes in the three networks. Statistical and information-theoretic scores are provided on the resulting web page to help scientists either accept or reject the hypothesis being tested. Our web-based system was extensively tested using three data sets, such as E2f1, Lrrk2, and Dicer1 knockout data sets. We were able to re-produce gene functions reported in the original research papers. In addition, we comprehensively tested with all disease names in MalaCards as hypothesis to show the effectiveness of our system. Our in silico experiment system can be very useful in suggesting biological mechanisms which can be further tested in vivo or in vitro. http://biohealth.snu.ac.kr/software/insilico/. Copyright © 2018 Elsevier Inc. All rights reserved.
Identification of Loci and Functional Characterization of Trichothecene Biosynthesis Genes in Filamentous Fungi of the Genus Trichoderma▿†

PubMed Central

Cardoza, R. E.; Malmierca, M. G.; Hermosa, M. R.; Alexander, N. J.; McCormick, S. P.; Proctor, R. H.; Tijerino, A. M.; Rumbero, A.; Monte, E.; Gutiérrez, S.

2011-01-01

Trichothecenes are mycotoxins produced by Trichoderma, Fusarium, and at least four other genera in the fungal order Hypocreales. Fusarium has a trichothecene biosynthetic gene (TRI) cluster that encodes transport and regulatory proteins as well as most enzymes required for the formation of the mycotoxins. However, little is known about trichothecene biosynthesis in the other genera. Here, we identify and characterize TRI gene orthologues (tri) in Trichoderma arundinaceum and Trichoderma brevicompactum. Our results indicate that both Trichoderma species have a tri cluster that consists of orthologues of seven genes present in the Fusarium TRI cluster. Organization of genes in the cluster is the same in the two Trichoderma species but differs from the organization in Fusarium. Sequence and functional analysis revealed that the gene (tri5) responsible for the first committed step in trichothecene biosynthesis is located outside the cluster in both Trichoderma species rather than inside the cluster as it is in Fusarium. Heterologous expression analysis revealed that two T. arundinaceum cluster genes (tri4 and tri11) differ in function from their Fusarium orthologues. The Tatri4-encoded enzyme catalyzes only three of the four oxygenation reactions catalyzed by the orthologous enzyme in Fusarium. The Tatri11-encoded enzyme catalyzes a completely different reaction (trichothecene C-4 hydroxylation) than the Fusarium orthologue (trichothecene C-15 hydroxylation). The results of this study indicate that although some characteristics of the tri/TRI cluster have been conserved during evolution of Trichoderma and Fusarium, the cluster has undergone marked changes, including gene loss and/or gain, gene rearrangement, and divergence of gene function. PMID:21642405
Design and interpretation of microRNA-reporter gene activity.

PubMed

Carroll, Adam P; Tooney, Paul A; Cairns, Murray J

2013-06-15

MicroRNAs (miRNAs) are small noncoding RNA molecules that act as sequence specificity guides to direct post-transcriptional gene silencing. In doing so, miRNAs regulate many critical developmental processes, including cellular proliferation, differentiation, migration, and apoptosis, as well as more specialized biological functions such as dendritic spine development and synaptogenesis. Interactions between miRNAs and their miRNA recognition elements occur via partial complementarity, rendering tremendous redundancy in targeting such that miRNAs are predicted to regulate 60% of the genome, with each miRNA estimated to regulate more than 200 genes. Because these predictions are prone to false positives and false negatives, there is an ever present need to provide material support to these assertions to firmly establish the biological function of specific miRNAs in both normal and pathophysiological contexts. Using schizophrenia-associated miR-181b as an example, we present detailed guidelines and novel insights for the rapid establishment of a streamlined miRNA-reporter gene assay and explore various design concepts for miRNA-reporter gene applications, including bidirectional miRNA modulation. In exemplifying this approach, we report seven novel miR-181b target sites for five schizophrenia candidate genes (DISC1, BDNF, ENKUR, GRIA1, and GRIK1) and dissect a number of vital concepts regarding future developments for miRNA-reporter gene assays and the interpretation of their results. Copyright © 2013 Elsevier Inc. All rights reserved.
Assembly of a biocompatible triazole-linked gene by one-pot click-DNA ligation

NASA Astrophysics Data System (ADS)

Kukwikila, Mikiembo; Gale, Nittaya; El-Sagheer, Afaf H.; Brown, Tom; Tavassoli, Ali

2017-11-01

The chemical synthesis of oligonucleotides and their enzyme-mediated assembly into genes and genomes has significantly advanced multiple scientific disciplines. However, these approaches are not without their shortcomings; enzymatic amplification and ligation of oligonucleotides into genes and genomes makes automation challenging, and site-specific incorporation of epigenetic information and/or modified bases into large constructs is not feasible. Here we present a fully chemical one-pot method for the assembly of oligonucleotides into a gene by click-DNA ligation. We synthesize the 335 base-pair gene that encodes the green fluorescent protein iLOV from ten functionalized oligonucleotides that contain 5ʹ-azide and 3ʹ-alkyne units. The resulting click-linked iLOV gene contains eight triazoles at the sites of chemical ligation, and yet is fully biocompatible; it is replicated by DNA polymerases in vitro and encodes a functional iLOV protein in Escherichia coli. We demonstrate the power and potential of our one-pot gene-assembly method by preparing an epigenetically modified variant of the iLOV gene.
Influence of molecular weight upon mannosylated bio-synthetic hybrids for targeted antigen presenting cell gene delivery

PubMed Central

Jones, Charles H.; Gollakota, Akhila; Chen, Mingfu; Chung, Tai-Chun; Ravikrishnan, Anitha; Zhang, Guojian; Pfeifer, Blaine A.

2015-01-01

Given the rise of antibiotic resistant microbes, genetic vaccination is a promising prophylactic strategy that enables rapid design and manufacture. Facilitating this process is the choice of vector, which is often situationally-specific and limited in engineering capacity. Furthermore, these shortcomings are usually tied to an incomplete understanding of the structure-function relationships driving vector-mediated gene delivery. Building upon our initial report of a hybrid bacterial-biomaterial gene delivery vector, a comprehensive structure-function assessment was completed using a class of mannosylated poly(beta-amino esters). Through a top-down screening methodology, an ideal polymer was selected on the basis of gene delivery efficacy and then used for the synthesis of a stratified molecular weight polymer library. By eliminating contributions of polymer chemical background, we were able to complete an in-depth assessment of gene delivery as a function of (1) polymer molecular weight, (2) relative mannose content, (3) polymer-membrane biophysical properties, (4) APC uptake specificity, and (5) serum inhibition. In summary, the flexibility and potential of the hybrid design featured in this work highlights the ability to systematically probe vector-associated properties for the development of translational gene delivery candidates. PMID:25941787
Microarray gene expression profiling analysis combined with bioinformatics in multiple sclerosis.

PubMed

Liu, Mingyuan; Hou, Xiaojun; Zhang, Ping; Hao, Yong; Yang, Yiting; Wu, Xiongfeng; Zhu, Desheng; Guan, Yangtai

2013-05-01

Multiple sclerosis (MS) is the most prevalent demyelinating disease and the principal cause of neurological disability in young adults. Recent microarray gene expression profiling studies have identified several genetic variants contributing to the complex pathogenesis of MS, however, expressional and functional studies are still required to further understand its molecular mechanism. The present study aimed to analyze the molecular mechanism of MS using microarray analysis combined with bioinformatics techniques. We downloaded the gene expression profile of MS from Gene Expression Omnibus (GEO) and analysed the microarray data using the differentially coexpressed genes (DCGs) and links package in R and Database for Annotation, Visualization and Integrated Discovery. The regulatory impact factor (RIF) algorithm was used to measure the impact factor of transcription factor. A total of 1,297 DCGs between MS patients and healthy controls were identified. Functional annotation indicated that these DCGs were associated with immune and neurological functions. Furthermore, the RIF result suggested that IKZF1, BACH1, CEBPB, EGR1, FOS may play central regulatory roles in controlling gene expression in the pathogenesis of MS. Our findings confirm the presence of multiple molecular alterations in MS and indicate the possibility for identifying prognostic factors associated with MS pathogenesis.
Handling gene and protein names in the age of bioinformatics: the special challenge of secreted multimodular bacterial enzymes such as the cbhA/cbh9A gene of Clostridium thermocellum.

PubMed

Schwarz, Wolfgang H; Brunecky, Roman; Broeker, Jannis; Liebl, Wolfgang; Zverlov, Vladimir V

2018-02-26

An increasing number of researchers working in biology, biochemistry, biotechnology, bioengineering, bioinformatics and other related fields of science are using biological molecules. As the scientific background of the members of different scientific communities is more diverse than ever before, the number of scientists not familiar with the rules for non-ambiguous designation of genetic elements is increasing. However, with biological molecules gaining importance through biotechnology, their functional and unambiguous designation is vital. Unfortunately, naming genes and proteins is not an easy task. In addition, the traditional concepts of bioinformatics are challenged with the appearance of proteins comprising different modules with a respective function in each module. This article highlights basic rules and novel solutions in designation recently used within the community of bacterial geneticists, and we discuss the present-day handling of gene and protein designations. As an example we will utilize a recent mischaracterization of gene nomenclature. We make suggestions for better handling of names in future literature as well as in databases and annotation projects. Our methodology emphasizes the hydrolytic function of multi-modular genes and extracellular proteins from bacteria.
Revealing complex function, process and pathway interactions with high-throughput expression and biological annotation data.

PubMed

Singh, Nitesh Kumar; Ernst, Mathias; Liebscher, Volkmar; Fuellen, Georg; Taher, Leila

2016-10-20

The biological relationships both between and within the functions, processes and pathways that operate within complex biological systems are only poorly characterized, making the interpretation of large scale gene expression datasets extremely challenging. Here, we present an approach that integrates gene expression and biological annotation data to identify and describe the interactions between biological functions, processes and pathways that govern a phenotype of interest. The product is a global, interconnected network, not of genes but of functions, processes and pathways, that represents the biological relationships within the system. We validated our approach on two high-throughput expression datasets describing organismal and organ development. Our findings are well supported by the available literature, confirming that developmental processes and apoptosis play key roles in cell differentiation. Furthermore, our results suggest that processes related to pluripotency and lineage commitment, which are known to be critical for development, interact mainly indirectly, through genes implicated in more general biological processes. Moreover, we provide evidence that supports the relevance of cell spatial organization in the developing liver for proper liver function. Our strategy can be viewed as an abstraction that is useful to interpret high-throughput data and devise further experiments.
Molecular Genetics of Ubiquinone Biosynthesis in Animals

PubMed Central

Wang, Ying; Hekimi, Siegfried

2014-01-01

Ubiquinone (UQ), also known as coenzyme Q (CoQ), is a redox-active lipid present in all cellular membranes where it functions in a variety of cellular processes. The best known functions of UQ are to act as a mobile electron carrier in the mitochondrial respiratory chain and to serve as a lipid soluble antioxidant in cellular membranes. All eukaryotic cells synthesize their own UQ. Most of the current knowledge on the UQ biosynthetic pathway was obtained by studying Escherichia coli and S. cerevisiae UQ-deficient mutants. The orthologues of all the genes known from yeast studies to be involved in UQ biosynthesis have subsequently been found in higher organisms. Animal mutants with different genetic defects in UQ biosynthesis display very different phenotypes, despite the fact that in all these mutants the same biosynthetic pathway is affected. This review summarizes the present knowledge of the eukaryotic biosynthesis of UQ, with focus on the biosynthetic genes identified in animals, including C. elegans, rodents and humans. Moreover, we review the phenotypes of mutants in these genes and discuss the functional consequences of UQ deficiency in general. PMID:23190198

Cornelia de Lange Syndrome: NIPBL haploinsufficiency downregulates canonical Wnt pathway in zebrafish embryos and patients fibroblasts.

PubMed

Pistocchi, A; Fazio, G; Cereda, A; Ferrari, L; Bettini, L R; Messina, G; Cotelli, F; Biondi, A; Selicorni, A; Massa, V

2013-10-17

Cornelia de Lange Syndrome is a severe genetic disorder characterized by malformations affecting multiple systems, with a common feature of severe mental retardation. Genetic variants within four genes (NIPBL (Nipped-B-like), SMC1A, SMC3, and HDAC8) are believed to be responsible for the majority of cases; all these genes encode proteins that are part of the 'cohesin complex'. Cohesins exhibit two temporally separated major roles in cells: one controlling the cell cycle and the other involved in regulating the gene expression. The present study focuses on the role of the zebrafish nipblb paralog during neural development, examining its expression in the central nervous system, and analyzing the consequences of nipblb loss of function. Neural development was impaired by the knockdown of nipblb in zebrafish. nipblb-loss-of-function embryos presented with increased apoptosis in the developing neural tissues, downregulation of canonical Wnt pathway genes, and subsequent decreased Cyclin D1 (Ccnd1) levels. Importantly, the same pattern of canonical WNT pathway and CCND1 downregulation was observed in NIPBL-mutated patient-specific fibroblasts. Finally, chemical activation of the pathway in nipblb-loss-of-function embryos rescued the adverse phenotype and restored the physiological levels of cell death.
Functional characterization of KanP, a methyltransferase from the kanamycin biosynthetic gene cluster of Streptomyces kanamyceticus.

PubMed

Nepal, Keshav Kumar; Yoo, Jin Cheol; Sohng, Jae Kyung

2010-09-20

KanP, a putative methyltransferase, is located in the kanamycin biosynthetic gene cluster of Streptomyces kanamyceticus ATCC12853. Amino acid sequence analysis of KanP revealed the presence of S-adenosyl-L-methionine binding motifs, which are present in other O-methyltransferases. The kanP gene was expressed in Escherichia coli BL21 (DE3) to generate the E. coli KANP recombinant strain. The conversion of external quercetin to methylated quercetin in the culture extract of E. coli KANP proved the function of kanP as S-adenosyl-L-methionine-dependent methyltransferase. This is the first report concerning the identification of an O-methyltransferase gene from the kanamycin gene cluster. The resistant activity assay and RT-PCR analysis demonstrated the leeway for obtaining methylated kanamycin derivatives from the wild-type strain of kanamycin producer. 2009 Elsevier GmbH. All rights reserved.
Cloning of DOG1, a quantitative trait locus controlling seed dormancy in Arabidopsis.

PubMed

Bentsink, Leónie; Jowett, Jemma; Hanhart, Corrie J; Koornneef, Maarten

2006-11-07

Genetic variation for seed dormancy in nature is a typical quantitative trait controlled by multiple loci on which environmental factors have a strong effect. Finding the genes underlying dormancy quantitative trait loci is a major scientific challenge, which also has relevance for agriculture and ecology. In this study we describe the identification of the DELAY OF GERMINATION 1 (DOG1) gene previously identified as a quantitative trait locus involved in the control of seed dormancy. This gene was isolated by a combination of positional cloning and mutant analysis and is absolutely required for the induction of seed dormancy. DOG1 is a member of a small gene family of unknown molecular function, with five members in Arabidopsis. The functional natural allelic variation present in Arabidopsis is caused by polymorphisms in the cis-regulatory region of the DOG1 gene and results in considerable expression differences between the DOG1 alleles of the accessions analyzed.
B-BOX genes: genome-wide identification, evolution and their contribution to pollen growth in pear (Pyrus bretschneideri Rehd.).

PubMed

Cao, Yunpeng; Han, Yahui; Meng, Dandan; Li, Dahui; Jiao, Chunyan; Jin, Qing; Lin, Yi; Cai, Yongping

2017-09-19

The B-BOX (BBX) proteins have important functions in regulating plant growth and development. In plants, the BBX gene family has been identified in several plants, such as rice, Arabidopsis and tomato. However, there still lack a genome-wide survey of BBX genes in pear. In the present study, a total of 25 BBX genes were identified in pear (Pyrus bretschneideri Rehd.). Subsequently, phylogenetic relationship, gene structure, gene duplication, transcriptome data and qRT-PCR were conducted on these BBX gene members. The transcript analysis revealed that twelve PbBBX genes (48%) were specifically expressed in pear pollen tubes. Furthermore, qRT-PCR analysis indicated that both PbBBX4 and PbBBX13 have potential role in pear fruit development, while PbBBX5 should be involved in the senescence of pear pollen tube. This study provided a genome-wide survey of BBX gene family in pear, and highlighted its roles in both pear fruits and pollen tubes. The results will be useful in improving our understanding of the complexity of BBX gene family and functional characteristics of its members in future study.
Conservation, Divergence, and Genome-Wide Distribution of PAL and POX A Gene Families in Plants.

PubMed

Rawal, H C; Singh, N K; Sharma, T R

2013-01-01

Genome-wide identification and phylogenetic and syntenic comparison were performed for the genes responsible for phenylalanine ammonia lyase (PAL) and peroxidase A (POX A) enzymes in nine plant species representing very diverse groups like legumes (Glycine max and Medicago truncatula), fruits (Vitis vinifera), cereals (Sorghum bicolor, Zea mays, and Oryza sativa), trees (Populus trichocarpa), and model dicot (Arabidopsis thaliana) and monocot (Brachypodium distachyon) species. A total of 87 and 1045 genes in PAL and POX A gene families, respectively, have been identified in these species. The phylogenetic and syntenic comparison along with motif distributions shows a high degree of conservation of PAL genes, suggesting that these genes may predate monocot/eudicot divergence. The POX A family genes, present in clusters at the subtelomeric regions of chromosomes, might be evolving and expanding with higher rate than the PAL gene family. Our analysis showed that during the expansion of POX A gene family, many groups and subgroups have evolved, resulting in a high level of functional divergence among monocots and dicots. These results will act as a first step toward the understanding of monocot/eudicot evolution and functional characterization of these gene families in the future.
Conservation, Divergence, and Genome-Wide Distribution of PAL and POX A Gene Families in Plants

PubMed Central

Rawal, H. C.; Singh, N. K.; Sharma, T. R.

2013-01-01

Genome-wide identification and phylogenetic and syntenic comparison were performed for the genes responsible for phenylalanine ammonia lyase (PAL) and peroxidase A (POX A) enzymes in nine plant species representing very diverse groups like legumes (Glycine max and Medicago truncatula), fruits (Vitis vinifera), cereals (Sorghum bicolor, Zea mays, and Oryza sativa), trees (Populus trichocarpa), and model dicot (Arabidopsis thaliana) and monocot (Brachypodium distachyon) species. A total of 87 and 1045 genes in PAL and POX A gene families, respectively, have been identified in these species. The phylogenetic and syntenic comparison along with motif distributions shows a high degree of conservation of PAL genes, suggesting that these genes may predate monocot/eudicot divergence. The POX A family genes, present in clusters at the subtelomeric regions of chromosomes, might be evolving and expanding with higher rate than the PAL gene family. Our analysis showed that during the expansion of POX A gene family, many groups and subgroups have evolved, resulting in a high level of functional divergence among monocots and dicots. These results will act as a first step toward the understanding of monocot/eudicot evolution and functional characterization of these gene families in the future. PMID:23671845
A remarkable synergistic effect at the transcriptomic level in peach fruits doubly infected by prunus necrotic ringspot virus and peach latent mosaic viroid.

PubMed

Herranz, Mari Carmen; Niehl, Annette; Rosales, Marlene; Fiore, Nicola; Zamorano, Alan; Granell, Antonio; Pallas, Vicente

2013-05-28

Microarray profiling is a powerful technique to investigate expression changes of large amounts of genes in response to specific environmental conditions. The majority of the studies investigating gene expression changes in virus-infected plants are limited to interactions between a virus and a model host plant, which usually is Arabidopsis thaliana or Nicotiana benthamiana. In the present work, we performed microarray profiling to explore changes in the expression profile of field-grown Prunus persica (peach) originating from Chile upon single and double infection with Prunus necrotic ringspot virus (PNRSV) and Peach latent mosaic viroid (PLMVd), worldwide natural pathogens of peach trees. Upon single PLMVd or PNRSV infection, the number of statistically significant gene expression changes was relatively low. By contrast, doubly-infected fruits presented a high number of differentially regulated genes. Among these, down-regulated genes were prevalent. Functional categorization of the gene expression changes upon double PLMVd and PNRSV infection revealed protein modification and degradation as the functional category with the highest percentage of repressed genes whereas induced genes encoded mainly proteins related to phosphate, C-compound and carbohydrate metabolism and also protein modification. Overrepresentation analysis upon double infection with PLMVd and PNRSV revealed specific functional categories over- and underrepresented among the repressed genes indicating active counter-defense mechanisms of the pathogens during infection. Our results identify a novel synergistic effect of PLMVd and PNRSV on the transcriptome of peach fruits. We demonstrate that mixed infections, which occur frequently in field conditions, result in a more complex transcriptional response than that observed in single infections. Thus, our data demonstrate for the first time that the simultaneous infection of a viroid and a plant virus synergistically affect the host transcriptome in infected peach fruits. These field studies can help to fully understand plant-pathogen interactions and to develop appropriate crop protection strategies.
A remarkable synergistic effect at the transcriptomic level in peach fruits doubly infected by prunus necrotic ringspot virus and peach latent mosaic viroid

PubMed Central

2013-01-01

Background Microarray profiling is a powerful technique to investigate expression changes of large amounts of genes in response to specific environmental conditions. The majority of the studies investigating gene expression changes in virus-infected plants are limited to interactions between a virus and a model host plant, which usually is Arabidopsis thaliana or Nicotiana benthamiana. In the present work, we performed microarray profiling to explore changes in the expression profile of field-grown Prunus persica (peach) originating from Chile upon single and double infection with Prunus necrotic ringspot virus (PNRSV) and Peach latent mosaic viroid (PLMVd), worldwide natural pathogens of peach trees. Results Upon single PLMVd or PNRSV infection, the number of statistically significant gene expression changes was relatively low. By contrast, doubly-infected fruits presented a high number of differentially regulated genes. Among these, down-regulated genes were prevalent. Functional categorization of the gene expression changes upon double PLMVd and PNRSV infection revealed protein modification and degradation as the functional category with the highest percentage of repressed genes whereas induced genes encoded mainly proteins related to phosphate, C-compound and carbohydrate metabolism and also protein modification. Overrepresentation analysis upon double infection with PLMVd and PNRSV revealed specific functional categories over- and underrepresented among the repressed genes indicating active counter-defense mechanisms of the pathogens during infection. Conclusions Our results identify a novel synergistic effect of PLMVd and PNRSV on the transcriptome of peach fruits. We demonstrate that mixed infections, which occur frequently in field conditions, result in a more complex transcriptional response than that observed in single infections. Thus, our data demonstrate for the first time that the simultaneous infection of a viroid and a plant virus synergistically affect the host transcriptome in infected peach fruits. These field studies can help to fully understand plant-pathogen interactions and to develop appropriate crop protection strategies. PMID:23710752
Molecular characterization of the apical organ of the anthozoan Nematostella vectensis

PubMed Central

Sinigaglia, Chiara; Busengdal, Henriette; Lerner, Avi; Oliveri, Paola; Rentzsch, Fabian

2015-01-01

Apical organs are sensory structures present in many marine invertebrate larvae where they are considered to be involved in their settlement, metamorphosis and locomotion. In bilaterians they are characterised by a tuft of long cilia and receptor cells and they are associated with groups of neurons, but their relatively low morphological complexity and dispersed phylogenetic distribution have left their evolutionary relationship unresolved. Moreover, since apical organs are not present in the standard model organisms, their development and function are not well understood. To provide a foundation for a better understanding of this structure we have characterised the molecular composition of the apical organ of the sea anemone Nematostella vectensis. In a microarray-based comparison of the gene expression profiles of planulae with either a wildtype or an experimentally expanded apical organ, we identified 78 evolutionarily conserved genes, which are predominantly or specifically expressed in the apical organ of Nematostella. This gene set comprises signalling molecules, transcription factors, structural and metabolic genes. The majority of these genes, including several conserved, but previously uncharacterized ones, are potentially involved in different aspects of the development or function of the long cilia of the apical organ. To demonstrate the utility of this gene set for comparative analyses, we further analysed the expression of a subset of previously uncharacterized putative orthologs in sea urchin larvae and detected expression for twelve out of eighteen of them in the apical domain. Our study provides a molecular characterization of the apical organ of Nematostella and represents an informative tool for future studies addressing the development, function and evolutionary history of apical organ cells. PMID:25478911
ATP-dependent chromatin remodeling in T cells.

PubMed

Wurster, Andrea L; Pazin, Michael J

2012-02-01

One of the best studied systems for mammalian chromatin remodeling is transcriptional regulation during T cell development. The variety of these studies have led to important findings in T cell gene regulation and cell fate determination. Importantly, these findings have also advanced our knowledge of the function of remodeling enzymes in mammalian gene regulation. First we briefly present biochemical and cell-free analysis of 3 types of ATP dependent remodeling enzymes (SWI/SNF, Mi2, and ISWI) to construct an intellectual framework to understand how these enzymes might be working. Second, we compare and contrast the function of these enzymes during early (thymic) and late (peripheral) T cell development. Finally, we examine some of the gaps in our present understanding.
Genetic basis of interindividual susceptibility to cancer cachexia: selection of potential candidate gene polymorphisms for association studies.

PubMed

Johns, N; Tan, B H; MacMillan, M; Solheim, T S; Ross, J A; Baracos, V E; Damaraju, S; Fearon, K C H

2014-12-01

Cancer cachexia is a complex and multifactorial disease. Evolving definitions highlight the fact that a diverse range of biological processes contribute to cancer cachexia. Part of the variation in who will and who will not develop cancer cachexia may be genetically determined. As new definitions, classifications and biological targets continue to evolve, there is a need for reappraisal of the literature for future candidate association studies. This review summarizes genes identified or implicated as well as putative candidate genes contributing to cachexia, identified through diverse technology platforms and model systems to further guide association studies. A systematic search covering 1986-2012 was performed for potential candidate genes / genetic polymorphisms relating to cancer cachexia. All candidate genes were reviewed for functional polymorphisms or clinically significant polymorphisms associated with cachexia using the OMIM and GeneRIF databases. Pathway analysis software was used to reveal possible network associations between genes. Functionality of SNPs/genes was explored based on published literature, algorithms for detecting putative deleterious SNPs and interrogating the database for expression of quantitative trait loci (eQTLs). A total of 154 genes associated with cancer cachexia were identified and explored for functional polymorphisms. Of these 154 genes, 119 had a combined total of 281 polymorphisms with functional and/or clinical significance in terms of cachexia associated with them. Of these, 80 polymorphisms (in 51 genes) were replicated in more than one study with 24 polymorphisms found to influence two or more hallmarks of cachexia (i.e., inflammation, loss of fat mass and/or lean mass and reduced survival). Selection of candidate genes and polymorphisms is a key element of multigene study design. The present study provides a contemporary basis to select genes and/or polymorphisms for further association studies in cancer cachexia, and to develop their potential as susceptibility biomarkers of cachexia.
Genes encoding giant danio and golden shiner ependymin.

PubMed

Adams, D S; Kiyokawa, M; Getman, M E; Shashoua, V E

1996-03-01

Ependymin (EPN) is a brain glycoprotein that functions as a neurotrophic factor in optic nerve regeneration and long-term memory consolidation in goldfish. To date, true epn genes have been characterized in one order of teleost fish, Cypriniformes. In the study presented here, polymerase chain reactions were used to analyze the complete epn genes, gd (1480 bp), and sh (2071 bp), from Cypriniformes giant danio and shiner, respectively. Southern hybridizations demonstrated the existence of one copy of each gene per corresponding haploid genome. Each gene was found to contain six exons and five introns. Gene gd encodes a predicted 218-amino acid (aa) protein GD 93 percent conserved to goldfish EPN, while sh encodes a predicted 214-aa protein SH 91 percent homologous to goldfish. Evidence is presented classifying proteins previously termed "EPNs" into two major categories: true EPNs and non-EPN cerebrospinal fluid glycoproteins. Proteins GD and SH contain all the hallmark, features of true EPNs.
SPP1 and AGER as potential prognostic biomarkers for lung adenocarcinoma.

PubMed

Zhang, Weiguo; Fan, Junli; Chen, Qiang; Lei, Caipeng; Qiao, Bin; Liu, Qin

2018-05-01

Overdue treatment and prognostic evaluation lead to low survival rates in patients with lung adenocarcinoma (LUAD). To date, effective biomarkers for prognosis are still required. The aim of the present study was to screen differentially expressed genes (DEGs) as biomarkers for prognostic evaluation of LUAD. DEGs in tumor and normal samples were identified and analyzed for Kyoto Encyclopedia of Genes and Genomes/Gene Ontology functional enrichments. The common genes that are up and downregulated were selected for prognostic analysis using RNAseq data in The Cancer Genome Atlas. Differential expression analysis was performed with 164 samples in GSE10072 and GSE7670 datasets. A total of 484 DEGs that were present in GSE10072 and GSE7670 datasets were screened, including secreted phosphoprotein 1 (SPP1) that was highly expressed and DEGs ficolin 3, advanced glycosylation end-product specific receptor (AGER), transmembrane protein 100 that were lowly expressed in tumor tissues. These four key genes were subsequently verified using an independent dataset, GSE19804. The gene expression model was consistent with GSE10072 and GSE7670 datasets. The dysregulation of highly expressed SPP1 and lowly expressed AGER significantly reduced the median survival time of patients with LUAD. These findings suggest that SPP1 and AGER are risk factors for LUAD, and these two genes may be utilized in the prognostic evaluation of patients with LUAD. Additionally, the key genes and functional enrichments may provide a reference for investigating the molecular expression mechanisms underlying LUAD.
HRGFish: A database of hypoxia responsive genes in fishes

NASA Astrophysics Data System (ADS)

Rashid, Iliyas; Nagpure, Naresh Sahebrao; Srivastava, Prachi; Kumar, Ravindra; Pathak, Ajey Kumar; Singh, Mahender; Kushwaha, Basdeo

2017-02-01

Several studies have highlighted the changes in the gene expression due to the hypoxia response in fishes, but the systematic organization of the information and the analytical platform for such genes are lacking. In the present study, an attempt was made to develop a database of hypoxia responsive genes in fishes (HRGFish), integrated with analytical tools, using LAMPP technology. Genes reported in hypoxia response for fishes were compiled through literature survey and the database presently covers 818 gene sequences and 35 gene types from 38 fishes. The upstream fragments (3,000 bp), covered in this database, enables to compute CG dinucleotides frequencies, motif finding of the hypoxia response element, identification of CpG island and mapping with the reference promoter of zebrafish. The database also includes functional annotation of genes and provides tools for analyzing sequences and designing primers for selected gene fragments. This may be the first database on the hypoxia response genes in fishes that provides a workbench to the scientific community involved in studying the evolution and ecological adaptation of the fish species in relation to hypoxia.
Negative Example Selection for Protein Function Prediction: The NoGO Database

PubMed Central

Youngs, Noah; Penfold-Brown, Duncan; Bonneau, Richard; Shasha, Dennis

2014-01-01

Negative examples – genes that are known not to carry out a given protein function – are rarely recorded in genome and proteome annotation databases, such as the Gene Ontology database. Negative examples are required, however, for several of the most powerful machine learning methods for integrative protein function prediction. Most protein function prediction efforts have relied on a variety of heuristics for the choice of negative examples. Determining the accuracy of methods for negative example prediction is itself a non-trivial task, given that the Open World Assumption as applied to gene annotations rules out many traditional validation metrics. We present a rigorous comparison of these heuristics, utilizing a temporal holdout, and a novel evaluation strategy for negative examples. We add to this comparison several algorithms adapted from Positive-Unlabeled learning scenarios in text-classification, which are the current state of the art methods for generating negative examples in low-density annotation contexts. Lastly, we present two novel algorithms of our own construction, one based on empirical conditional probability, and the other using topic modeling applied to genes and annotations. We demonstrate that our algorithms achieve significantly fewer incorrect negative example predictions than the current state of the art, using multiple benchmarks covering multiple organisms. Our methods may be applied to generate negative examples for any type of method that deals with protein function, and to this end we provide a database of negative examples in several well-studied organisms, for general use (The NoGO database, available at: bonneaulab.bio.nyu.edu/nogo.html). PMID:24922051
Population density approach for discrete mRNA distributions in generalized switching models for stochastic gene expression.

PubMed

Stinchcombe, Adam R; Peskin, Charles S; Tranchina, Daniel

2012-06-01

We present a generalization of a population density approach for modeling and analysis of stochastic gene expression. In the model, the gene of interest fluctuates stochastically between an inactive state, in which transcription cannot occur, and an active state, in which discrete transcription events occur; and the individual mRNA molecules are degraded stochastically in an independent manner. This sort of model in simplest form with exponential dwell times has been used to explain experimental estimates of the discrete distribution of random mRNA copy number. In our generalization, the random dwell times in the inactive and active states, T_{0} and T_{1}, respectively, are independent random variables drawn from any specified distributions. Consequently, the probability per unit time of switching out of a state depends on the time since entering that state. Our method exploits a connection between the fully discrete random process and a related continuous process. We present numerical methods for computing steady-state mRNA distributions and an analytical derivation of the mRNA autocovariance function. We find that empirical estimates of the steady-state mRNA probability mass function from Monte Carlo simulations of laboratory data do not allow one to distinguish between underlying models with exponential and nonexponential dwell times in some relevant parameter regimes. However, in these parameter regimes and where the autocovariance function has negative lobes, the autocovariance function disambiguates the two types of models. Our results strongly suggest that temporal data beyond the autocovariance function is required in general to characterize gene switching.
Revealing Alzheimer's disease genes spectrum in the whole-genome by machine learning.

PubMed

Huang, Xiaoyan; Liu, Hankui; Li, Xinming; Guan, Liping; Li, Jiankang; Tellier, Laurent Christian Asker M; Yang, Huanming; Wang, Jian; Zhang, Jianguo

2018-01-10

Alzheimer's disease (AD) is an important, progressive neurodegenerative disease, with a complex genetic architecture. A key goal of biomedical research is to seek out disease risk genes, and to elucidate the function of these risk genes in the development of disease. For this purpose, expanding the AD-associated gene set is necessary. In past research, the prediction methods for AD related genes has been limited in their exploration of the target genome regions. We here present a genome-wide method for AD candidate genes predictions. We present a machine learning approach (SVM), based upon integrating gene expression data with human brain-specific gene network data, to discover the full spectrum of AD genes across the whole genome. We classified AD candidate genes with an accuracy and the area under the receiver operating characteristic (ROC) curve of 84.56% and 94%. Our approach provides a supplement for the spectrum of AD-associated genes extracted from more than 20,000 genes in a genome wide scale. In this study, we have elucidated the whole-genome spectrum of AD, using a machine learning approach. Through this method, we expect for the candidate gene catalogue to provide a more comprehensive annotation of AD for researchers.
Computational analysis of microRNA function in heart development.

PubMed

Liu, Ganqiang; Ding, Min; Chen, Jiajia; Huang, Jinyan; Wang, Haiyun; Jing, Qing; Shen, Bairong

2010-09-01

Emerging evidence suggests that specific spatio-temporal microRNA (miRNA) expression is required for heart development. In recent years, hundreds of miRNAs have been discovered. In contrast, functional annotations are available only for a very small fraction of these regulatory molecules. In order to provide a global perspective for the biologists who study the relationship between differentially expressed miRNAs and heart development, we employed computational analysis to uncover the specific cellular processes and biological pathways targeted by miRNAs in mouse heart development. Here, we utilized Gene Ontology (GO) categories, KEGG Pathway, and GeneGo Pathway Maps as a gene functional annotation system for miRNA target enrichment analysis. The target genes of miRNAs were found to be enriched in functional categories and pathway maps in which miRNAs could play important roles during heart development. Meanwhile, we developed miRHrt (http://sysbio.suda.edu.cn/mirhrt/), a database aiming to provide a comprehensive resource of miRNA function in regulating heart development. These computational analysis results effectively illustrated the correlation of differentially expressed miRNAs with cellular functions and heart development. We hope that the identified novel heart development-associated pathways and the database presented here would facilitate further understanding of the roles and mechanisms of miRNAs in heart development.
Defining functional distance using manifold embeddings of gene ontology annotations

PubMed Central

Lerman, Gilad; Shakhnovich, Boris E.

2007-01-01

Although rigorous measures of similarity for sequence and structure are now well established, the problem of defining functional relationships has been particularly daunting. Here, we present several manifold embedding techniques to compute distances between Gene Ontology (GO) functional annotations and consequently estimate functional distances between protein domains. To evaluate accuracy, we correlate the functional distance to the well established measures of sequence, structural, and phylogenetic similarities. Finally, we show that manual classification of structures into folds and superfamilies is mirrored by proximity in the newly defined function space. We show how functional distances place structure–function relationships in biological context resulting in insight into divergent and convergent evolution. The methods and results in this paper can be readily generalized and applied to a wide array of biologically relevant investigations, such as accuracy of annotation transference, the relationship between sequence, structure, and function, or coherence of expression modules. PMID:17595300
Comparative transcriptional profiling-based identification of raphanusanin-inducible genes

PubMed Central

2010-01-01

Background Raphanusanin (Ra) is a light-induced growth inhibitor involved in the inhibition of hypocotyl growth in response to unilateral blue-light illumination in radish seedlings. Knowledge of the roles of Ra still remains elusive. To understand the roles of Ra and its functional coupling to light signalling, we constructed the Ra-induced gene library using the Suppression Subtractive Hybridisation (SSH) technique and present a comparative investigation of gene regulation in radish seedlings in response to short-term Ra and blue-light exposure. Results The predicted gene ontology (GO) term revealed that 55% of the clones in the Ra-induced gene library were associated with genes involved in common defence mechanisms, including thirty four genes homologous to Arabidopsis genes implicated in R-gene-triggered resistance in the programmed cell death (PCD) pathway. Overall, the library was enriched with transporters, hydrolases, protein kinases, and signal transducers. The transcriptome analysis revealed that, among the fifty genes from various functional categories selected from 88 independent genes of the Ra-induced library, 44 genes were up-regulated and 4 were down-regulated. The comparative analysis showed that, among the transcriptional profiles of 33 highly Ra-inducible genes, 25 ESTs were commonly regulated by different intensities and duration of blue-light irradiation. The transcriptional profiles, coupled with the transcriptional regulation of early blue light, have provided the functional roles of many genes expected to be involved in the light-mediated defence mechanism. Conclusions This study is the first comprehensive survey of transcriptional regulation in response to Ra. The results described herein suggest a link between Ra and cellular defence and light signalling, and thereby contribute to further our understanding of how Ra is involved in light-mediated mechanisms of plant defence. PMID:20553608

Using the TIGR gene index databases for biological discovery.

PubMed

Lee, Yuandan; Quackenbush, John

2003-11-01

The TIGR Gene Index web pages provide access to analyses of ESTs and gene sequences for nearly 60 species, as well as a number of resources derived from these. Each species-specific database is presented using a common format with a homepage. A variety of methods exist that allow users to search each species-specific database. Methods implemented currently include nucleotide or protein sequence queries using WU-BLAST, text-based searches using various sequence identifiers, searches by gene, tissue and library name, and searches using functional classes through Gene Ontology assignments. This protocol provides guidance for using the Gene Index Databases to extract information.
Study of Staphylococcus aureus N315 Pathogenic Genes by Text Mining and Enrichment Analysis of Pathways and Operons.

PubMed

Yang, Chun-Feng; Gou, Wei-Hui; Dai, Xin-Lun; Li, Yu-Mei

2018-06-01

Staphylococcus aureus (S. aureus) is a versatile pathogen found in many environments and can cause nosocomial infections in the community and hospitals. S. aureus infection is an increasingly serious threat to global public health that requires action across many government bodies, medical and health sectors, and scientific research institutions. In the present study, S. aureus N315 genes that have been shown in the literature to be pathogenic were extracted using a bibliometric method for functional enrichment analysis of pathways and operons to statistically discover novel pathogenic genes associated with S. aureus N315. A total of 383 pathogenic genes were mined from the literature using bibliometrics, and subsequently a few new pathogenic genes of S. aureus N315 were identified by functional enrichment analysis of pathways and operons. The discovery of these novel S. aureus N315 pathogenic genes is of great significance to treat S. aureus induced diseases and identify potential diagnostic markers, thus providing theoretical fundamentals for epidemiological prevention.
Orthologs, paralogs and genome comparisons

NASA Technical Reports Server (NTRS)

Gogarten, J. P.; Olendzenski, L.

1999-01-01

During the past decade, ancient gene duplications were recognized as one of the main forces in the generation of diverse gene families and the creation of new functional capabilities. New tools developed to search data banks for homologous sequences, and an increased availability of reliable three-dimensional structural information led to the recognition that proteins with diverse functions can belong to the same superfamily. Analyses of the evolution of these superfamilies promises to provide insights into early evolution but are complicated by several important evolutionary processes. Horizontal transfer of genes can lead to a vertical spread of innovations among organisms, therefore finding a certain property in some descendants of an ancestor does not guarantee that it was present in that ancestor. Complete or partial gene conversion between duplicated genes can yield phylogenetic trees with several, apparently independent gene duplications, suggesting an often surprising parallelism in the evolution of independent lineages. Additionally, the breakup of domains within a protein and the fusion of domains into multifunctional proteins makes the delineation of superfamilies a task that remains difficult to automate.
Promoting gene expression in plants by permissive histone lysine methylation

PubMed Central

Millar, Tony; Finnegan, E Jean

2009-01-01

Plants utilize sophisticated epigenetic regulatory mechanisms to coordinate changes in gene expression during development and in response to environmental stimuli. Epigenetics refers to the modification of DNA and chromatin associated proteins, which affect gene expression and cell function, without changing the DNA sequence. Such modifications are inherited through mitosis, and in rare instances through meiosis, although it can be reversible and thus regulatory. Epigenetic modifications are controlled by groups of proteins, such as the family of histone lysine methytransferases (HKMTs). The catalytic core known as the SET domain encodes HKMT activity and either promotes or represses gene expression. A large family of SET domain proteins is present in Arabidopsis where there is growing evidence that two classes of these genes are involved in promoting gene expression in a diverse range of developmental processes. This review will focus on the function of these two classes and the processes that they control, highlighting the huge potential this regulatory mechanism has in plants. PMID:19816124
Systematic bacterialization of yeast genes identifies a near-universally swappable pathway

PubMed Central

Kachroo, Aashiq H; Laurent, Jon M; Akhmetov, Azat; Szilagyi-Jones, Madelyn; McWhite, Claire D; Zhao, Alice; Marcotte, Edward M

2017-01-01

Eukaryotes and prokaryotes last shared a common ancestor ~2 billion years ago, and while many present-day genes in these lineages predate this divergence, the extent to which these genes still perform their ancestral functions is largely unknown. To test principles governing retention of ancient function, we asked if prokaryotic genes could replace their essential eukaryotic orthologs. We systematically replaced essential genes in yeast by their 1:1 orthologs from Escherichia coli. After accounting for mitochondrial localization and alternative start codons, 31 out of 51 bacterial genes tested (61%) could complement a lethal growth defect and replace their yeast orthologs with minimal effects on growth rate. Replaceability was determined on a pathway-by-pathway basis; codon usage, abundance, and sequence similarity contributed predictive power. The heme biosynthesis pathway was particularly amenable to inter-kingdom exchange, with each yeast enzyme replaceable by its bacterial, human, or plant ortholog, suggesting it as a near-universally swappable pathway. DOI: http://dx.doi.org/10.7554/eLife.25093.001 PMID:28661399
Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones

PubMed Central

Imanishi, Tadashi; Itoh, Takeshi; Suzuki, Yutaka; O'Donovan, Claire; Fukuchi, Satoshi; Koyanagi, Kanako O; Barrero, Roberto A; Tamura, Takuro; Yamaguchi-Kabata, Yumi; Tanino, Motohiko; Yura, Kei; Miyazaki, Satoru; Ikeo, Kazuho; Homma, Keiichi; Kasprzyk, Arek; Nishikawa, Tetsuo; Hirakawa, Mika; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Ashurst, Jennifer; Jia, Libin; Nakao, Mitsuteru; Thomas, Michael A; Mulder, Nicola; Karavidopoulou, Youla; Jin, Lihua; Kim, Sangsoo; Yasuda, Tomohiro; Lenhard, Boris; Eveno, Eric; Suzuki, Yoshiyuki; Yamasaki, Chisato; Takeda, Jun-ichi; Gough, Craig; Hilton, Phillip; Fujii, Yasuyuki; Sakai, Hiroaki; Tanaka, Susumu; Amid, Clara; Bellgard, Matthew; Bonaldo, Maria de Fatima; Bono, Hidemasa; Bromberg, Susan K; Brookes, Anthony J; Bruford, Elspeth; Carninci, Piero; Chelala, Claude; Couillault, Christine; de Souza, Sandro J.; Debily, Marie-Anne; Devignes, Marie-Dominique; Dubchak, Inna; Endo, Toshinori; Estreicher, Anne; Eyras, Eduardo; Fukami-Kobayashi, Kaoru; R. Gopinath, Gopal; Graudens, Esther; Hahn, Yoonsoo; Han, Michael; Han, Ze-Guang; Hanada, Kousuke; Hanaoka, Hideki; Harada, Erimi; Hashimoto, Katsuyuki; Hinz, Ursula; Hirai, Momoki; Hishiki, Teruyoshi; Hopkinson, Ian; Imbeaud, Sandrine; Inoko, Hidetoshi; Kanapin, Alexander; Kaneko, Yayoi; Kasukawa, Takeya; Kelso, Janet; Kersey, Paul; Kikuno, Reiko; Kimura, Kouichi; Korn, Bernhard; Kuryshev, Vladimir; Makalowska, Izabela; Makino, Takashi; Mano, Shuhei; Mariage-Samson, Regine; Mashima, Jun; Matsuda, Hideo; Mewes, Hans-Werner; Minoshima, Shinsei; Nagai, Keiichi; Nagasaki, Hideki; Nagata, Naoki; Nigam, Rajni; Ogasawara, Osamu; Ohara, Osamu; Ohtsubo, Masafumi; Okada, Norihiro; Okido, Toshihisa; Oota, Satoshi; Ota, Motonori; Ota, Toshio; Otsuki, Tetsuji; Piatier-Tonneau, Dominique; Poustka, Annemarie; Ren, Shuang-Xi; Saitou, Naruya; Sakai, Katsunaga; Sakamoto, Shigetaka; Sakate, Ryuichi; Schupp, Ingo; Servant, Florence; Sherry, Stephen; Shiba, Rie; Shimizu, Nobuyoshi; Shimoyama, Mary; Simpson, Andrew J; Soares, Bento; Steward, Charles; Suwa, Makiko; Suzuki, Mami; Takahashi, Aiko; Tamiya, Gen; Tanaka, Hiroshi; Taylor, Todd; Terwilliger, Joseph D; Unneberg, Per; Veeramachaneni, Vamsi; Watanabe, Shinya; Wilming, Laurens; Yasuda, Norikazu; Yoo, Hyang-Sook; Stodolsky, Marvin; Makalowski, Wojciech; Go, Mitiko; Nakai, Kenta; Takagi, Toshihisa; Kanehisa, Minoru; Sakaki, Yoshiyuki; Quackenbush, John; Okazaki, Yasushi; Hayashizaki, Yoshihide; Hide, Winston; Chakraborty, Ranajit; Nishikawa, Ken; Sugawara, Hideaki; Tateno, Yoshio; Chen, Zhu; Oishi, Michio; Tonellato, Peter; Apweiler, Rolf; Okubo, Kousaku; Wagner, Lukas; Wiemann, Stefan; Strausberg, Robert L; Isogai, Takao; Auffray, Charles; Nomura, Nobuo; Sugano, Sumio

2004-01-01

The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology. PMID:15103394
Cloning and functional characterization of the Xenopus orthologue of the Treacher Collins syndrome (TCOF1) gene product.

PubMed

Gonzales, Bianca; Yang, Hushan; Henning, Dale; Valdez, Benigno C

2005-10-10

Treacher Collins syndrome (TCS) is an autosomal dominant disorder of craniofacial development caused by mutations in the TCOF1 gene, which encodes the nucleolar phosphoprotein treacle. We previously reported a function for mammalian treacle in ribosomal DNA gene transcription by its interaction with upstream binding factor. As an initial step in the development of a TCS model for frog the cDNA that encodes the Xenopus laevis treacle was cloned. Although the derived amino acid sequence shows a poor homology with its mammalian orthologues, Xenopus treacle has 11 highly homologous direct repeats near the center of the protein molecule similar to those present in its human, dog and mouse orthologues. Comparison of their amino acid compositions indicates conservation of predominant specific amino acid residues. Antisense-mediated down-regulation of treacle expression in X. laevis oocytes resulted in inhibition of rDNA gene transcription. The results suggest evolutionary conservation of the function of treacle in ribosomal RNA biogenesis in higher eukaryotes.
Evolving phenotypic networks in silico.

PubMed

François, Paul

2014-11-01

Evolved gene networks are constrained by natural selection. Their structures and functions are consequently far from being random, as exemplified by the multiple instances of parallel/convergent evolution. One can thus ask if features of actual gene networks can be recovered from evolutionary first principles. I review a method for in silico evolution of small models of gene networks aiming at performing predefined biological functions. I summarize the current implementation of the algorithm, insisting on the construction of a proper "fitness" function. I illustrate the approach on three examples: biochemical adaptation, ligand discrimination and vertebrate segmentation (somitogenesis). While the structure of the evolved networks is variable, dynamics of our evolved networks are usually constrained and present many similar features to actual gene networks, including properties that were not explicitly selected for. In silico evolution can thus be used to predict biological behaviours without a detailed knowledge of the mapping between genotype and phenotype. Copyright © 2014 The Author. Published by Elsevier Ltd.. All rights reserved.
Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana

PubMed Central

Itoh, Takeshi; Tanaka, Tsuyoshi; Barrero, Roberto A.; Yamasaki, Chisato; Fujii, Yasuyuki; Hilton, Phillip B.; Antonio, Baltazar A.; Aono, Hideo; Apweiler, Rolf; Bruskiewich, Richard; Bureau, Thomas; Burr, Frances; Costa de Oliveira, Antonio; Fuks, Galina; Habara, Takuya; Haberer, Georg; Han, Bin; Harada, Erimi; Hiraki, Aiko T.; Hirochika, Hirohiko; Hoen, Douglas; Hokari, Hiroki; Hosokawa, Satomi; Hsing, Yue; Ikawa, Hiroshi; Ikeo, Kazuho; Imanishi, Tadashi; Ito, Yukiyo; Jaiswal, Pankaj; Kanno, Masako; Kawahara, Yoshihiro; Kawamura, Toshiyuki; Kawashima, Hiroaki; Khurana, Jitendra P.; Kikuchi, Shoshi; Komatsu, Setsuko; Koyanagi, Kanako O.; Kubooka, Hiromi; Lieberherr, Damien; Lin, Yao-Cheng; Lonsdale, David; Matsumoto, Takashi; Matsuya, Akihiro; McCombie, W. Richard; Messing, Joachim; Miyao, Akio; Mulder, Nicola; Nagamura, Yoshiaki; Nam, Jongmin; Namiki, Nobukazu; Numa, Hisataka; Nurimoto, Shin; O’Donovan, Claire; Ohyanagi, Hajime; Okido, Toshihisa; OOta, Satoshi; Osato, Naoki; Palmer, Lance E.; Quetier, Francis; Raghuvanshi, Saurabh; Saichi, Naomi; Sakai, Hiroaki; Sakai, Yasumichi; Sakata, Katsumi; Sakurai, Tetsuya; Sato, Fumihiko; Sato, Yoshiharu; Schoof, Heiko; Seki, Motoaki; Shibata, Michie; Shimizu, Yuji; Shinozaki, Kazuo; Shinso, Yuji; Singh, Nagendra K.; Smith-White, Brian; Takeda, Jun-ichi; Tanino, Motohiko; Tatusova, Tatiana; Thongjuea, Supat; Todokoro, Fusano; Tsugane, Mika; Tyagi, Akhilesh K.; Vanavichit, Apichart; Wang, Aihui; Wing, Rod A.; Yamaguchi, Kaori; Yamamoto, Mayu; Yamamoto, Naoyuki; Yu, Yeisoo; Zhang, Hao; Zhao, Qiang; Higo, Kenichi; Burr, Benjamin; Gojobori, Takashi; Sasaki, Takuji

2007-01-01

We present here the annotation of the complete genome of rice Oryza sativa L. ssp. japonica cultivar Nipponbare. All functional annotations for proteins and non-protein-coding RNA (npRNA) candidates were manually curated. Functions were identified or inferred in 19,969 (70%) of the proteins, and 131 possible npRNAs (including 58 antisense transcripts) were found. Almost 5000 annotated protein-coding genes were found to be disrupted in insertional mutant lines, which will accelerate future experimental validation of the annotations. The rice loci were determined by using cDNA sequences obtained from rice and other representative cereals. Our conservative estimate based on these loci and an extrapolation suggested that the gene number of rice is ∼32,000, which is smaller than previous estimates. We conducted comparative analyses between rice and Arabidopsis thaliana and found that both genomes possessed several lineage-specific genes, which might account for the observed differences between these species, while they had similar sets of predicted functional domains among the protein sequences. A system to control translational efficiency seems to be conserved across large evolutionary distances. Moreover, the evolutionary process of protein-coding genes was examined. Our results suggest that natural selection may have played a role for duplicated genes in both species, so that duplication was suppressed or favored in a manner that depended on the function of a gene. PMID:17210932
Functional molecular markers for crop improvement.

PubMed

Kage, Udaykumar; Kumar, Arun; Dhokane, Dhananjay; Karre, Shailesh; Kushalappa, Ajjamada C

2016-10-01

A tremendous decline in cultivable land and resources and a huge increase in food demand calls for immediate attention to crop improvement. Though molecular plant breeding serves as a viable solution and is considered as "foundation for twenty-first century crop improvement", a major stumbling block for crop improvement is the availability of a limited functional gene pool for cereal crops. Advancement in the next generation sequencing (NGS) technologies integrated with tools like metabolomics, proteomics and association mapping studies have facilitated the identification of candidate genes, their allelic variants and opened new avenues to accelerate crop improvement through development and use of functional molecular markers (FMMs). The FMMs are developed from the sequence polymorphisms present within functional gene(s) which are associated with phenotypic trait variations. Since FMMs obviate the problems associated with random DNA markers, these are considered as "the holy grail" of plant breeders who employ targeted marker assisted selections (MAS) for crop improvement. This review article attempts to consider the current resources and novel methods such as metabolomics, proteomics and association studies for the identification of candidate genes and their validation through virus-induced gene silencing (VIGS) for the development of FMMs. A number of examples where the FMMs have been developed and used for the improvement of cereal crops for agronomic, food quality, disease resistance and abiotic stress tolerance traits have been considered.
The Tomato Terpene Synthase Gene Family1[W][OA

PubMed Central

Falara, Vasiliki; Akhtar, Tariq A.; Nguyen, Thuong T.H.; Spyropoulou, Eleni A.; Bleeker, Petra M.; Schauvinhold, Ines; Matsuba, Yuki; Bonini, Megan E.; Schilmiller, Anthony L.; Last, Robert L.; Schuurink, Robert C.; Pichersky, Eran

2011-01-01

Compounds of the terpenoid class play numerous roles in the interactions of plants with their environment, such as attracting pollinators and defending the plant against pests. We show here that the genome of cultivated tomato (Solanum lycopersicum) contains 44 terpene synthase (TPS) genes, including 29 that are functional or potentially functional. Of these 29 TPS genes, 26 were expressed in at least some organs or tissues of the plant. The enzymatic functions of eight of the TPS proteins were previously reported, and here we report the specific in vitro catalytic activity of 10 additional tomato terpene synthases. Many of the tomato TPS genes are found in clusters, notably on chromosomes 1, 2, 6, 8, and 10. All TPS family clades previously identified in angiosperms are also present in tomato. The largest clade of functional TPS genes found in tomato, with 12 members, is the TPS-a clade, and it appears to encode only sesquiterpene synthases, one of which is localized to the mitochondria, while the rest are likely cytosolic. A few additional sesquiterpene synthases are encoded by TPS-b clade genes. Some of the tomato sesquiterpene synthases use z,z-farnesyl diphosphate in vitro as well, or more efficiently than, the e,e-farnesyl diphosphate substrate. Genes encoding monoterpene synthases are also prevalent, and they fall into three clades: TPS-b, TPS-g, and TPS-e/f. With the exception of two enzymes involved in the synthesis of ent-kaurene, the precursor of gibberellins, no other tomato TPS genes could be demonstrated to encode diterpene synthases so far. PMID:21813655
A molecular characterization of the choroid plexus and stress-induced gene regulation

PubMed Central

Sathyanesan, M; Girgenti, M J; Banasr, M; Stone, K; Bruce, C; Guilchicek, E; Wilczak-Havill, K; Nairn, A; Williams, K; Sass, S; Duman, J G; Newton, S S

2012-01-01

The role of the choroid plexus (CP) in brain homeostasis is being increasingly recognized and recent studies suggest that the CP has a more important role in physiological and pathological brain functions than currently appreciated. To obtain additional insight on the CP function, we performed a proteomics and transcriptomics characterization employing a combination of high resolution tandem mass spectrometry and gene expression analyses in normal rodent brain. Using multiple protein fractionation approaches, we identified 1400 CP proteins in adult CP. Microarray-based comparison of CP gene expression with the kidney, cortex and hippocampus showed significant overlap between the CP and the kidney. CP gene profiles were validated by in situ hybridization analysis of several target genes including klotho, CLIC 6, OATP 14 and Ezrin. Immunohistochemical analyses were performed for CP and enpendyma detection of several target proteins including cytokeratin, Rab7, klotho, tissue inhibitor of metalloprotease 1 (TIMP1), MMP9 and glial fibrillary acidic protein (GFAP). The molecular functions associated with various proteins of the CP proteome indicate that it is a blood–cerebrospinal fluid (CSF) barrier that exhibits high levels of metabolic activity. We also analyzed the gene expression changes induced by stress, an exacerbating factor for many illnesses, particularly mood disorders. Chronic stress altered the expression of several genes, downregulating 5HT2C, glucocorticoid receptor and the cilia genes IFT88 and smoothened while upregulating 5HT2A, BDNF, TNFα and IL-1b. The data presented here attach additional significance to the emerging importance of CP function in brain health and CNS disease states. PMID:22781172
Comparative Metagenomics Revealed Commonly Enriched Gene Sets in Human Gut Microbiomes

PubMed Central

Kurokawa, Ken; Itoh, Takehiko; Kuwahara, Tomomi; Oshima, Kenshiro; Toh, Hidehiro; Toyoda, Atsushi; Takami, Hideto; Morita, Hidetoshi; Sharma, Vineet K.; Srivastava, Tulika P.; Taylor, Todd D.; Noguchi, Hideki; Mori, Hiroshi; Ogura, Yoshitoshi; Ehrlich, Dusko S.; Itoh, Kikuji; Takagi, Toshihisa; Sakaki, Yoshiyuki; Hayashi, Tetsuya; Hattori, Masahira

2007-01-01

Numerous microbes inhabit the human intestine, many of which are uncharacterized or uncultivable. They form a complex microbial community that deeply affects human physiology. To identify the genomic features common to all human gut microbiomes as well as those variable among them, we performed a large-scale comparative metagenomic analysis of fecal samples from 13 healthy individuals of various ages, including unweaned infants. We found that, while the gut microbiota from unweaned infants were simple and showed a high inter-individual variation in taxonomic and gene composition, those from adults and weaned children were more complex but showed a high functional uniformity regardless of age or sex. In searching for the genes over-represented in gut microbiomes, we identified 237 gene families commonly enriched in adult-type and 136 families in infant-type microbiomes, with a small overlap. An analysis of their predicted functions revealed various strategies employed by each type of microbiota to adapt to its intestinal environment, suggesting that these gene sets encode the core functions of adult and infant-type gut microbiota. By analysing the orphan genes, 647 new gene families were identified to be exclusively present in human intestinal microbiomes. In addition, we discovered a conjugative transposon family explosively amplified in human gut microbiomes, which strongly suggests that the intestine is a ‘hot spot’ for horizontal gene transfer between microbes. PMID:17916580
Genome-wide analysis of the GRAS gene family in physic nut (Jatropha curcas L.).

PubMed

Wu, Z Y; Wu, P Z; Chen, Y P; Li, M R; Wu, G J; Jiang, H W

2015-12-29

GRAS proteins play vital roles in plant growth and development. Physic nut (Jatropha curcas L.) was found to have a total of 48 GRAS family members (JcGRAS), 15 more than those found in Arabidopsis. The JcGRAS genes were divided into 12 subfamilies or 15 ancient monophyletic lineages based on the phylogenetic analysis of GRAS proteins from both flowering and lower plants. The functions of GRAS genes in 9 subfamilies have been reported previously for several plants, while the genes in the remaining 3 subfamilies were of unknown function; we named the latter families U1 to U3. No member of U3 subfamily is present in Arabidopsis and Poaceae species according to public genome sequence data. In comparison with the number of GRAS genes in Arabidopsis, more were detected in physic nut, resulting from the retention of many ancient GRAS subfamilies and the formation of tandem repeats during evolution. No evidence of recent duplication among JcGRAS genes was observed in physic nut. Based on digital gene expression data, 21 of the 48 genes exhibited differential expression in four tissues analyzed. Two members of subfamily U3 were expressed only in buds and flowers, implying that they may play specific roles. Our results provide valuable resources for future studies on the functions of GRAS proteins in physic nut.
Computational Selection of Transcriptomics Experiments Improves Guilt-by-Association Analyses

PubMed Central

Bhat, Prajwal; Yang, Haixuan; Bögre, László; Devoto, Alessandra; Paccanaro, Alberto

2012-01-01

The Guilt-by-Association (GBA) principle, according to which genes with similar expression profiles are functionally associated, is widely applied for functional analyses using large heterogeneous collections of transcriptomics data. However, the use of such large collections could hamper GBA functional analysis for genes whose expression is condition specific. In these cases a smaller set of condition related experiments should instead be used, but identifying such functionally relevant experiments from large collections based on literature knowledge alone is an impractical task. We begin this paper by analyzing, both from a mathematical and a biological point of view, why only condition specific experiments should be used in GBA functional analysis. We are able to show that this phenomenon is independent of the functional categorization scheme and of the organisms being analyzed. We then present a semi-supervised algorithm that can select functionally relevant experiments from large collections of transcriptomics experiments. Our algorithm is able to select experiments relevant to a given GO term, MIPS FunCat term or even KEGG pathways. We extensively test our algorithm on large dataset collections for yeast and Arabidopsis. We demonstrate that: using the selected experiments there is a statistically significant improvement in correlation between genes in the functional category of interest; the selected experiments improve GBA-based gene function prediction; the effectiveness of the selected experiments increases with annotation specificity; our algorithm can be successfully applied to GBA-based pathway reconstruction. Importantly, the set of experiments selected by the algorithm reflects the existing literature knowledge about the experiments. [A MATLAB implementation of the algorithm and all the data used in this paper can be downloaded from the paper website: http://www.paccanarolab.org/papers/CorrGene/]. PMID:22879875
Integrating gene and protein expression data with genome-scale metabolic networks to infer functional pathways.

PubMed

Pey, Jon; Valgepea, Kaspar; Rubio, Angel; Beasley, John E; Planes, Francisco J

2013-12-08

The study of cellular metabolism in the context of high-throughput -omics data has allowed us to decipher novel mechanisms of importance in biotechnology and health. To continue with this progress, it is essential to efficiently integrate experimental data into metabolic modeling. We present here an in-silico framework to infer relevant metabolic pathways for a particular phenotype under study based on its gene/protein expression data. This framework is based on the Carbon Flux Path (CFP) approach, a mixed-integer linear program that expands classical path finding techniques by considering additional biophysical constraints. In particular, the objective function of the CFP approach is amended to account for gene/protein expression data and influence obtained paths. This approach is termed integrative Carbon Flux Path (iCFP). We show that gene/protein expression data also influences the stoichiometric balancing of CFPs, which provides a more accurate picture of active metabolic pathways. This is illustrated in both a theoretical and real scenario. Finally, we apply this approach to find novel pathways relevant in the regulation of acetate overflow metabolism in Escherichia coli. As a result, several targets which could be relevant for better understanding of the phenomenon leading to impaired acetate overflow are proposed. A novel mathematical framework that determines functional pathways based on gene/protein expression data is presented and validated. We show that our approach is able to provide new insights into complex biological scenarios such as acetate overflow in Escherichia coli.
Fine mapping and identification of a candidate gene for the barley Un8 true loose smut resistance gene.

PubMed

Zang, Wen; Eckstein, Peter E; Colin, Mark; Voth, Doug; Himmelbach, Axel; Beier, Sebastian; Stein, Nils; Scoles, Graham J; Beattie, Aaron D

2015-07-01

The candidate gene for the barley Un8 true loose smut resistance gene encodes a deduced protein containing two tandem protein kinase domains. In North America, durable resistance against all known isolates of barley true loose smut, caused by the basidiomycete pathogen Ustilago nuda (Jens.) Rostr. (U. nuda), is under the control of the Un8 resistance gene. Previous genetic studies mapped Un8 to the long arm of chromosome 5 (1HL). Here, a population of 4625 lines segregating for Un8 was used to delimit the Un8 gene to a 0.108 cM interval on chromosome arm 1HL, and assign it to fingerprinted contig 546 of the barley physical map. The minimal tilling path was identified for the Un8 locus using two flanking markers and consisted of two overlapping bacterial artificial chromosomes. One gene located close to a marker co-segregating with Un8 showed high sequence identity to a disease resistance gene containing two kinase domains. Sequence of the candidate gene from the parents of the segregating population, and in an additional 19 barley lines representing a broader spectrum of diversity, showed there was no intron in alleles present in either resistant or susceptible lines, and fifteen amino acid variations unique to the deduced protein sequence in resistant lines differentiated it from the deduced protein sequences in susceptible lines. Some of these variations were present within putative functional domains which may cause a loss of function in the deduced protein sequences within susceptible lines.
Papillae formation on trichome cell walls requires the function of the mediator complex subunit Med25.

PubMed

Fornero, Christy; Suo, Bangxia; Zahde, Mais; Juveland, Katelyn; Kirik, Viktor

2017-11-01

Glassy Hair 1 (GLH1) gene that promotes papillae formation on trichome cell walls was identified as a subunit of the transcriptional mediator complex MED25. The MED25 gene is shown to be expressed in trichomes. The expression of the trichome development marker genes GLABRA2 (GL2) and Ethylene Receptor2 (ETR2) is not affected in the glh1 mutant. Presented data suggest that Arabidopsis MED25 mediator component is likely involved in the transcription of genes promoting papillae deposition in trichomes. The plant cell wall plays an important role in communication, defense, organization and support. The importance of each of these functions varies by cell type. Specialized cells, such as Arabidopsis trichomes, exhibit distinct cell wall characteristics including papillae. To better understand the molecular processes important for papillae deposition on the cell wall surface, we identified the GLASSY HAIR 1 (GLH1) gene, which is necessary for papillae formation. We found that a splice-site mutation in the component of the transcriptional mediator complex MED25 gene is responsible for the near papillae-less phenotype of the glh1 mutant. The MED25 gene is expressed in trichomes. Reporters for trichome developmental marker genes GLABRA2 (GL2) and Ethylene Receptor2 (ETR2) were not affected in the glh1 mutant. Collectively, the presented results show that MED25 is necessary for papillae formation on the cell wall surface of leaf trichomes and suggest that the Arabidopsis MED25 mediator component is likely involved in the transcription of a subset of genes that promote papillae deposition in trichomes.
Non-functional plastid ndh gene fragments are present in the nuclear genome of Norway spruce (Picea abies L. Karsch): insights from in silico analysis of nuclear and organellar genomes.

PubMed

Ranade, Sonali Sachin; García-Gil, María Rosario; Rosselló, Josep A

2016-04-01

Many genes have been lost from the prokaryote plastidial genome during the early events of endosymbiosis in eukaryotes. Some of them were definitively lost, but others were relocated and functionally integrated to the host nuclear genomes through serial events of gene transfer during plant evolution. In gymnosperms, plastid genome sequencing has revealed the loss of ndh genes from several species of Gnetales and Pinaceae, including Norway spruce (Picea abies). This study aims to trace the ndh genes in the nuclear and organellar Norway spruce genomes. The plastid genomes of higher plants contain 11 ndh genes which are homologues of mitochondrial genes encoding subunits of the proton-pumping NADH-dehydrogenase (nicotinamide adenine dinucleotide dehydrogenase) or complex I (electron transport chain). Ndh genes encode 11 NDH polypeptides forming the Ndh complex (analogous to complex I) which seems to be primarily involved in chloro-respiration processes. We considered ndh genes from the plastidial genome of four gymnosperms (Cryptomeria japonica, Cycas revoluta, Ginkgo biloba, Podocarpus totara) and a single angiosperm species (Arabidopsis thaliana) to trace putative homologs in the nuclear and organellar Norway spruce genomes using tBLASTn to assess the evolutionary fate of ndh genes in Norway spruce and to address their genomic location(s), structure, integrity and functionality. The results obtained from tBLASTn were subsequently analyzed by performing homology search for finding ndh specific conserved domains using conserved domain search. We report the presence of non-functional plastid ndh gene fragments, excepting ndhE and ndhG genes, in the nuclear genome of Norway spruce. Regulatory transcriptional elements like promoters, TATA boxes and enhancers were detected in the upstream regions of some ndh fragments. We also found transposable elements in the flanking regions of few ndh fragments suggesting nuclear rearrangements in those regions. These evidences support the hypothesis that, at least in Picea, ndh translocations from the plastid to the nuclear genome have occurred, and that there might have been a functional machinery at some time during evolution to accommodate them within a nuclear-encoded environment, or attempts to form it.
The Chlamydomonas genome project: a decade on

PubMed Central

Blaby, Ian K.; Blaby-Haas, Crysten; Tourasse, Nicolas; Hom, Erik F. Y.; Lopez, David; Aksoy, Munevver; Grossman, Arthur; Umen, James; Dutcher, Susan; Porter, Mary; King, Stephen; Witman, George; Stanke, Mario; Harris, Elizabeth H.; Goodstein, David; Grimwood, Jane; Schmutz, Jeremy; Vallon, Olivier; Merchant, Sabeeha S.; Prochnik, Simon

2014-01-01

The green alga Chlamydomonas reinhardtii is a popular unicellular organism for studying photosynthesis, cilia biogenesis and micronutrient homeostasis. Ten years since its genome project was initiated, an iterative process of improvements to the genome and gene predictions has propelled this organism to the forefront of the “omics” era. Housed at Phytozome, the Joint Genome Institute’s (JGI) plant genomics portal, the most up-to-date genomic data include a genome arranged on chromosomes and high-quality gene models with alternative splice forms supported by an abundance of RNA-Seq data. Here, we present the past, present and future of Chlamydomonas genomics. Specifically, we detail progress on genome assembly and gene model refinement, discuss resources for gene annotations, functional predictions and locus ID mapping between versions and, importantly, outline a standardized framework for naming genes. PMID:24950814

Characteristics of functional enrichment and gene expression level of human putative transcriptional target genes.

PubMed

Osato, Naoki

2018-01-19

Transcriptional target genes show functional enrichment of genes. However, how many and how significantly transcriptional target genes include functional enrichments are still unclear. To address these issues, I predicted human transcriptional target genes using open chromatin regions, ChIP-seq data and DNA binding sequences of transcription factors in databases, and examined functional enrichment and gene expression level of putative transcriptional target genes. Gene Ontology annotations showed four times larger numbers of functional enrichments in putative transcriptional target genes than gene expression information alone, independent of transcriptional target genes. To compare the number of functional enrichments of putative transcriptional target genes between cells or search conditions, I normalized the number of functional enrichment by calculating its ratios in the total number of transcriptional target genes. With this analysis, native putative transcriptional target genes showed the largest normalized number of functional enrichments, compared with target genes including 5-60% of randomly selected genes. The normalized number of functional enrichments was changed according to the criteria of enhancer-promoter interactions such as distance from transcriptional start sites and orientation of CTCF-binding sites. Forward-reverse orientation of CTCF-binding sites showed significantly higher normalized number of functional enrichments than the other orientations. Journal papers showed that the top five frequent functional enrichments were related to the cellular functions in the three cell types. The median expression level of transcriptional target genes changed according to the criteria of enhancer-promoter assignments (i.e. interactions) and was correlated with the changes of the normalized number of functional enrichments of transcriptional target genes. Human putative transcriptional target genes showed significant functional enrichments. Functional enrichments were related to the cellular functions. The normalized number of functional enrichments of human putative transcriptional target genes changed according to the criteria of enhancer-promoter assignments and correlated with the median expression level of the target genes. These analyses and characters of human putative transcriptional target genes would be useful to examine the criteria of enhancer-promoter assignments and to predict the novel mechanisms and factors such as DNA binding proteins and DNA sequences of enhancer-promoter interactions.
Epigenetic Modifications Unlock the Milk Protein Gene Loci during Mouse Mammary Gland Development and Differentiation

PubMed Central

Rijnkels, Monique; Freeman-Zadrowski, Courtneay; Hernandez, Joseph; Potluri, Vani; Wang, Liguo; Li, Wei; Lemay, Danielle G.

2013-01-01

Background Unlike other tissues, development and differentiation of the mammary gland occur mostly after birth. The roles of systemic hormones and local growth factors important for this development and functional differentiation are well-studied. In other tissues, it has been shown that chromatin organization plays a key role in transcriptional regulation and underlies epigenetic regulation during development and differentiation. However, the role of chromatin organization in mammary gland development and differentiation is less well-defined. Here, we have studied the changes in chromatin organization at the milk protein gene loci (casein, whey acidic protein, and others) in the mouse mammary gland before and after functional differentiation. Methodology/Principal Findings Distal regulatory elements within the casein gene cluster and whey acidic protein gene region have an open chromatin organization after pubertal development, while proximal promoters only gain open-chromatin marks during pregnancy in conjunction with the major induction of their expression. In contrast, other milk protein genes, such as alpha-lactalbumin, already have an open chromatin organization in the mature virgin gland. Changes in chromatin organization in the casein gene cluster region that are present after puberty persisted after lactation has ceased, while the changes which occurred during pregnancy at the gene promoters were not maintained. In general, mammary gland expressed genes and their regulatory elements exhibit developmental stage- and tissue-specific chromatin organization. Conclusions/Significance A progressive gain of epigenetic marks indicative of open/active chromatin on genes marking functional differentiation accompanies the development of the mammary gland. These results support a model in which a chromatin organization is established during pubertal development that is then poised to respond to the systemic hormonal signals of pregnancy and lactation to achieve the full functional capacity of the mammary gland. PMID:23301053
Microarray analysis of gene expression patterns in the leaf during potato tuberization in the potato somatic hybrid Solanum tuberosum and Solanum etuberosum.

PubMed

Tiwari, Jagesh Kumar; Devi, Sapna; Sundaresha, S; Chandel, Poonam; Ali, Nilofer; Singh, Brajesh; Bhardwaj, Vinay; Singh, Bir Pal

2015-06-01

Genes involved in photoassimilate partitioning and changes in hormonal balance are important for potato tuberization. In the present study, we investigated gene expression patterns in the tuber-bearing potato somatic hybrid (E1-3) and control non-tuberous wild species Solanum etuberosum (Etb) by microarray. Plants were grown under controlled conditions and leaves were collected at eight tuber developmental stages for microarray analysis. A t-test analysis identified a total of 468 genes (94 up-regulated and 374 down-regulated) that were statistically significant (p ≤ 0.05) and differentially expressed in E1-3 and Etb. Gene Ontology (GO) characterization of the 468 genes revealed that 145 were annotated and 323 were of unknown function. Further, these 145 genes were grouped based on GO biological processes followed by molecular function and (or) PGSC description into 15 gene sets, namely (1) transport, (2) metabolic process, (3) biological process, (4) photosynthesis, (5) oxidation-reduction, (6) transcription, (7) translation, (8) binding, (9) protein phosphorylation, (10) protein folding, (11) ubiquitin-dependent protein catabolic process, (12) RNA processing, (13) negative regulation of protein, (14) methylation, and (15) mitosis. RT-PCR analysis of 10 selected highly significant genes (p ≤ 0.01) confirmed the microarray results. Overall, we show that candidate genes induced in leaves of E1-3 were implicated in tuberization processes such as transport, carbohydrate metabolism, phytohormones, and transcription/translation/binding functions. Hence, our results provide an insight into the candidate genes induced in leaf tissues during tuberization in E1-3.
Comparative genomic analysis of SET domain family reveals the origin, expansion, and putative function of the arthropod-specific SmydA genes as histone modifiers in insects.

PubMed

Jiang, Feng; Liu, Qing; Wang, Yanli; Zhang, Jie; Wang, Huimin; Song, Tianqi; Yang, Meiling; Wang, Xianhui; Kang, Le

2017-06-01

The SET domain is an evolutionarily conserved motif present in histone lysine methyltransferases, which are important in the regulation of chromatin and gene expression in animals. In this study, we searched for SET domain-containing genes (SET genes) in all of the 147 arthropod genomes sequenced at the time of carrying out this experiment to understand the evolutionary history by which SET domains have evolved in insects. Phylogenetic and ancestral state reconstruction analysis revealed an arthropod-specific SET gene family, named SmydA, that is ancestral to arthropod animals and specifically diversified during insect evolution. Considering that pseudogenization is the most probable fate of the new emerging gene copies, we provided experimental and evolutionary evidence to demonstrate their essential functions. Fluorescence in situ hybridization analysis and in vitro methyltransferase activity assays showed that the SmydA-2 gene was transcriptionally active and retained the original histone methylation activity. Expression knockdown by RNA interference significantly increased mortality, implying that the SmydA genes may be essential for insect survival. We further showed predominantly strong purifying selection on the SmydA gene family and a potential association between the regulation of gene expression and insect phenotypic plasticity by transcriptome analysis. Overall, these data suggest that the SmydA gene family retains essential functions that may possibly define novel regulatory pathways in insects. This work provides insights into the roles of lineage-specific domain duplication in insect evolution. © The Authors 2017. Published by Oxford University Press.
Comparative genomic analysis of SET domain family reveals the origin, expansion, and putative function of the arthropod-specific SmydA genes as histone modifiers in insects

PubMed Central

Jiang, Feng; Liu, Qing; Wang, Yanli; Zhang, Jie; Wang, Huimin; Song, Tianqi; Yang, Meiling

2017-01-01

Abstract The SET domain is an evolutionarily conserved motif present in histone lysine methyltransferases, which are important in the regulation of chromatin and gene expression in animals. In this study, we searched for SET domain–containing genes (SET genes) in all of the 147 arthropod genomes sequenced at the time of carrying out this experiment to understand the evolutionary history by which SET domains have evolved in insects. Phylogenetic and ancestral state reconstruction analysis revealed an arthropod-specific SET gene family, named SmydA, that is ancestral to arthropod animals and specifically diversified during insect evolution. Considering that pseudogenization is the most probable fate of the new emerging gene copies, we provided experimental and evolutionary evidence to demonstrate their essential functions. Fluorescence in situ hybridization analysis and in vitro methyltransferase activity assays showed that the SmydA-2 gene was transcriptionally active and retained the original histone methylation activity. Expression knockdown by RNA interference significantly increased mortality, implying that the SmydA genes may be essential for insect survival. We further showed predominantly strong purifying selection on the SmydA gene family and a potential association between the regulation of gene expression and insect phenotypic plasticity by transcriptome analysis. Overall, these data suggest that the SmydA gene family retains essential functions that may possibly define novel regulatory pathways in insects. This work provides insights into the roles of lineage-specific domain duplication in insect evolution. PMID:28444351
Integrating text mining, data mining, and network analysis for identifying genetic breast cancer trends.

PubMed

Jurca, Gabriela; Addam, Omar; Aksac, Alper; Gao, Shang; Özyer, Tansel; Demetrick, Douglas; Alhajj, Reda

2016-04-26

Breast cancer is a serious disease which affects many women and may lead to death. It has received considerable attention from the research community. Thus, biomedical researchers aim to find genetic biomarkers indicative of the disease. Novel biomarkers can be elucidated from the existing literature. However, the vast amount of scientific publications on breast cancer make this a daunting task. This paper presents a framework which investigates existing literature data for informative discoveries. It integrates text mining and social network analysis in order to identify new potential biomarkers for breast cancer. We utilized PubMed for the testing. We investigated gene-gene interactions, as well as novel interactions such as gene-year, gene-country, and abstract-country to find out how the discoveries varied over time and how overlapping/diverse are the discoveries and the interest of various research groups in different countries. Interesting trends have been identified and discussed, e.g., different genes are highlighted in relationship to different countries though the various genes were found to share functionality. Some text analysis based results have been validated against results from other tools that predict gene-gene relations and gene functions.
Analyses of the NAC transcription factor gene family in Gossypium raimondii Ulbr.: chromosomal location, structure, phylogeny, and expression patterns.

PubMed

Shang, Haihong; Li, Wei; Zou, Changsong; Yuan, Youlu

2013-07-01

NAC domain proteins are plant-specific transcription factors known to play diverse roles in various plant developmental processes. In the present study, we performed the first comprehensive study of the NAC gene family in Gossypium raimondii Ulbr., incorporating phylogenetic, chromosomal location, gene structure, conserved motif, and expression profiling analyses. We identified 145 NAC transcription factor (NAC-TF) genes that were phylogenetically clustered into 18 distinct subfamilies. Of these, 127 NAC-TF genes were distributed across the 13 chromosomes, 80 (55%) were preferentially retained duplicates located in both duplicated regions and six were located in triplicated chromosomal regions. The majority of NAC-TF genes showed temporal-, spatial-, and tissue-specific expression patterns based on transcriptomic and qRT-PCR analyses. However, the expression patterns of several duplicate genes were partially redundant, suggesting the occurrence of sub-functionalization during their evolution. Based on their genomic organization, we concluded that genomic duplications contributed significantly to the expansion of the NAC-TF gene family in G. raimondii. Comprehensive analysis of their expression profiles could provide novel insights into the functional divergence among members of the NAC gene family in G. raimondii. © 2013 Institute of Botany, Chinese Academy of Sciences.
Identification of functional differences in metabolic networks using comparative genomics and constraint-based models.

PubMed

Hamilton, Joshua J; Reed, Jennifer L

2012-01-01

Genome-scale network reconstructions are useful tools for understanding cellular metabolism, and comparisons of such reconstructions can provide insight into metabolic differences between organisms. Recent efforts toward comparing genome-scale models have focused primarily on aligning metabolic networks at the reaction level and then looking at differences and similarities in reaction and gene content. However, these reaction comparison approaches are time-consuming and do not identify the effect network differences have on the functional states of the network. We have developed a bilevel mixed-integer programming approach, CONGA, to identify functional differences between metabolic networks by comparing network reconstructions aligned at the gene level. We first identify orthologous genes across two reconstructions and then use CONGA to identify conditions under which differences in gene content give rise to differences in metabolic capabilities. By seeking genes whose deletion in one or both models disproportionately changes flux through a selected reaction (e.g., growth or by-product secretion) in one model over another, we are able to identify structural metabolic network differences enabling unique metabolic capabilities. Using CONGA, we explore functional differences between two metabolic reconstructions of Escherichia coli and identify a set of reactions responsible for chemical production differences between the two models. We also use this approach to aid in the development of a genome-scale model of Synechococcus sp. PCC 7002. Finally, we propose potential antimicrobial targets in Mycobacterium tuberculosis and Staphylococcus aureus based on differences in their metabolic capabilities. Through these examples, we demonstrate that a gene-centric approach to comparing metabolic networks allows for a rapid comparison of metabolic models at a functional level. Using CONGA, we can identify differences in reaction and gene content which give rise to different functional predictions. Because CONGA provides a general framework, it can be applied to find functional differences across models and biological systems beyond those presented here.
Identification of Functional Differences in Metabolic Networks Using Comparative Genomics and Constraint-Based Models

PubMed Central

Hamilton, Joshua J.; Reed, Jennifer L.

2012-01-01

Genome-scale network reconstructions are useful tools for understanding cellular metabolism, and comparisons of such reconstructions can provide insight into metabolic differences between organisms. Recent efforts toward comparing genome-scale models have focused primarily on aligning metabolic networks at the reaction level and then looking at differences and similarities in reaction and gene content. However, these reaction comparison approaches are time-consuming and do not identify the effect network differences have on the functional states of the network. We have developed a bilevel mixed-integer programming approach, CONGA, to identify functional differences between metabolic networks by comparing network reconstructions aligned at the gene level. We first identify orthologous genes across two reconstructions and then use CONGA to identify conditions under which differences in gene content give rise to differences in metabolic capabilities. By seeking genes whose deletion in one or both models disproportionately changes flux through a selected reaction (e.g., growth or by-product secretion) in one model over another, we are able to identify structural metabolic network differences enabling unique metabolic capabilities. Using CONGA, we explore functional differences between two metabolic reconstructions of Escherichia coli and identify a set of reactions responsible for chemical production differences between the two models. We also use this approach to aid in the development of a genome-scale model of Synechococcus sp. PCC 7002. Finally, we propose potential antimicrobial targets in Mycobacterium tuberculosis and Staphylococcus aureus based on differences in their metabolic capabilities. Through these examples, we demonstrate that a gene-centric approach to comparing metabolic networks allows for a rapid comparison of metabolic models at a functional level. Using CONGA, we can identify differences in reaction and gene content which give rise to different functional predictions. Because CONGA provides a general framework, it can be applied to find functional differences across models and biological systems beyond those presented here. PMID:22666308
Gene expression analysis identify a metabolic and cell function alterations as a hallmark of obesity without metabolic syndrome in peripheral blood, a pilot study.

PubMed

de Luis, Daniel Antonio; Almansa, Raquel; Aller, Rocío; Izaola, Olatz; Romero, E

2017-06-10

Understanding molecular basis involved in overweight is an important first step in developing therapeutic pathways against excess in body weight gain. The purpose of our pilot study was to evaluate the gene expression profiles in the peripheral blood of obese patients without other metabolic complications. A sample of 17 obese patients without metabolic syndrome and 15 non obese control subjects was evaluated in a prospective way. Following 'One-Color Microarray-Based Gene Expression Analysis' protocol Version 5.7 (Agilent p/n 4140-90040), cRNA was hybridized with Whole Human Genome Oligo Microarray Kit (Agilent p/n G2519F-014850) containing 41,000+ unique human genes and transcripts. The average age of the study group was 43.6 ± 19.7 years with a sex distribution of 64.7% females and 35.3% males. No statistical differences were detected with healthy controls 41.9 ± 12.3 years with a sex distribution of 70% females and 30% males. Obese patients showed 1436 genes that were differentially expressed compared to control group. Ingenuity Pathway Analysis showed that these genes participated in 13 different categories related to metabolism and cellular functions. In the gene set of cellular function, the most important genes were C-terminal region of Nel-like molecule 1 protein (NELL1) and Pigment epithelium-derived factor (SPEDF), both genes were over-expressed. In the gene set of metabolism, insulin growth factor type 1 (IGF1), ApoA5 (apolipoprotein subtype 5), Foxo4 (Forkhead transcription factor 4), ADIPOR1 (receptor of adiponectin type 1) and AQP7 (aquaporin channel proteins7) were over expressed. Moreover, PIKFYVE (PtdIns(3) P 5-kinase), and ROCK-2 (rho-kinase II) were under expressed. We showed that PBMCs from obese subjects presented significant changes in gene expression, exhibiting 1436 differentially expressed genes compared to PBMCs from non-obese subjects. Furthermore, our data showed a number of genes involved in relevant processes implicated in metabolism, with genes presenting high fold-change values (up-regulation and down regulation) associated with lipid, carbohydrate and protein metabolism. Copyright © 2017 Elsevier Ltd and European Society for Clinical Nutrition and Metabolism. All rights reserved.
GOEAST: a web-based software toolkit for Gene Ontology enrichment analysis.

PubMed

Zheng, Qi; Wang, Xiu-Jie

2008-07-01

Gene Ontology (GO) analysis has become a commonly used approach for functional studies of large-scale genomic or transcriptomic data. Although there have been a lot of software with GO-related analysis functions, new tools are still needed to meet the requirements for data generated by newly developed technologies or for advanced analysis purpose. Here, we present a Gene Ontology Enrichment Analysis Software Toolkit (GOEAST), an easy-to-use web-based toolkit that identifies statistically overrepresented GO terms within given gene sets. Compared with available GO analysis tools, GOEAST has the following improved features: (i) GOEAST displays enriched GO terms in graphical format according to their relationships in the hierarchical tree of each GO category (biological process, molecular function and cellular component), therefore, provides better understanding of the correlations among enriched GO terms; (ii) GOEAST supports analysis for data from various sources (probe or probe set IDs of Affymetrix, Illumina, Agilent or customized microarrays, as well as different gene identifiers) and multiple species (about 60 prokaryote and eukaryote species); (iii) One unique feature of GOEAST is to allow cross comparison of the GO enrichment status of multiple experiments to identify functional correlations among them. GOEAST also provides rigorous statistical tests to enhance the reliability of analysis results. GOEAST is freely accessible at http://omicslab.genetics.ac.cn/GOEAST/
De Novo Protein Structure Prediction

NASA Astrophysics Data System (ADS)

Hung, Ling-Hong; Ngan, Shing-Chung; Samudrala, Ram

An unparalleled amount of sequence data is being made available from large-scale genome sequencing efforts. The data provide a shortcut to the determination of the function of a gene of interest, as long as there is an existing sequenced gene with similar sequence and of known function. This has spurred structural genomic initiatives with the goal of determining as many protein folds as possible (Brenner and Levitt, 2000; Burley, 2000; Brenner, 2001; Heinemann et al., 2001). The purpose of this is twofold: First, the structure of a gene product can often lead to direct inference of its function. Second, since the function of a protein is dependent on its structure, direct comparison of the structures of gene products can be more sensitive than the comparison of sequences of genes for detecting homology. Presently, structural determination by crystallography and NMR techniques is still slow and expensive in terms of manpower and resources, despite attempts to automate the processes. Computer structure prediction algorithms, while not providing the accuracy of the traditional techniques, are extremely quick and inexpensive and can provide useful low-resolution data for structure comparisons (Bonneau and Baker, 2001). Given the immense number of structures which the structural genomic projects are attempting to solve, there would be a considerable gain even if the computer structure prediction approach were applicable to a subset of proteins.
Modeling Fragile X Syndrome in Drosophila

PubMed Central

Drozd, Małgorzata; Bardoni, Barbara; Capovilla, Maria

2018-01-01

Intellectual disability (ID) and autism are hallmarks of Fragile X Syndrome (FXS), a hereditary neurodevelopmental disorder. The gene responsible for FXS is Fragile X Mental Retardation gene 1 (FMR1) encoding the Fragile X Mental Retardation Protein (FMRP), an RNA-binding protein involved in RNA metabolism and modulating the expression level of many targets. Most cases of FXS are caused by silencing of FMR1 due to CGG expansions in the 5′-UTR of the gene. Humans also carry the FXR1 and FXR2 paralogs of FMR1 while flies have only one FMR1 gene, here called dFMR1, sharing the same level of sequence homology with all three human genes, but functionally most similar to FMR1. This enables a much easier approach for FMR1 genetic studies. Drosophila has been widely used to investigate FMR1 functions at genetic, cellular, and molecular levels since dFMR1 mutants have many phenotypes in common with the wide spectrum of FMR1 functions that underlay the disease. In this review, we present very recent Drosophila studies investigating FMRP functions at genetic, cellular, molecular, and electrophysiological levels in addition to research on pharmacological treatments in the fly model. These studies have the potential to aid the discovery of pharmacological therapies for FXS. PMID:29713264
Genome-wide analysis of the DNA-binding with one zinc finger (Dof) transcription factor family in bananas.

PubMed

Dong, Chen; Hu, Huigang; Xie, Jianghui

2016-12-01

DNA-binding with one finger (Dof) domain proteins are a multigene family of plant-specific transcription factors involved in numerous aspects of plant growth and development. In this study, we report a genome-wide search for Musa acuminata Dof (MaDof) genes and their expression profiles at different developmental stages and in response to various abiotic stresses. In addition, a complete overview of the Dof gene family in bananas is presented, including the gene structures, chromosomal locations, cis-regulatory elements, conserved protein domains, and phylogenetic inferences. Based on the genome-wide analysis, we identified 74 full-length protein-coding MaDof genes unevenly distributed on 11 chromosomes. Phylogenetic analysis with Dof members from diverse plant species showed that MaDof genes can be classified into four subgroups (StDof I, II, III, and IV). The detailed genomic information of the MaDof gene homologs in the present study provides opportunities for functional analyses to unravel the exact role of the genes in plant growth and development.
Perspectives: Gene Expression in Fisheries Management

USGS Publications Warehouse

Nielsen, Jennifer L.; Pavey, Scott A.

2010-01-01

Functional genes and gene expression have been connected to physiological traits linked to effective production and broodstock selection in aquaculture, selective implications of commercial fish harvest, and adaptive changes reflected in non-commercial fish populations subject to human disturbance and climate change. Gene mapping using single nucleotide polymorphisms (SNPs) to identify functional genes, gene expression (analogue microarrays and real-time PCR), and digital sequencing technologies looking at RNA transcripts present new concepts and opportunities in support of effective and sustainable fisheries. Genomic tools have been rapidly growing in aquaculture research addressing aspects of fish health, toxicology, and early development. Genomic technologies linking effects in functional genes involved in growth, maturation and life history development have been tied to selection resulting from harvest practices. Incorporating new and ever-increasing knowledge of fish genomes is opening a different perspective on local adaptation that will prove invaluable in wild fish conservation and management. Conservation of fish stocks is rapidly incorporating research on critical adaptive responses directed at the effects of human disturbance and climate change through gene expression studies. Genomic studies of fish populations can be generally grouped into three broad categories: 1) evolutionary genomics and biodiversity; 2) adaptive physiological responses to a changing environment; and 3) adaptive behavioral genomics and life history diversity. We review current genomic research in fisheries focusing on those that use microarrays to explore differences in gene expression among phenotypes and within or across populations, information that is critically important to the conservation of fish and their relationship to humans.
Selection shaped the evolution of mouse androgen-binding protein (ABP) function and promoted the duplication of Abp genes.

PubMed

Karn, Robert C; Laukaitis, Christina M

2014-08-01

In the present article, we summarize two aspects of our work on mouse ABP (androgen-binding protein): (i) the sexual selection function producing incipient reinforcement on the European house mouse hybrid zone, and (ii) the mechanism behind the dramatic expansion of the Abp gene region in the mouse genome. Selection unifies these two components, although the ways in which selection has acted differ. At the functional level, strong positive selection has acted on key sites on the surface of one face of the ABP dimer, possibly to influence binding to a receptor. A different kind of selection has apparently driven the recent and rapid expansion of the gene region, probably by increasing the amount of Abp transcript, in one or both of two ways. We have shown previously that groups of Abp genes behave as LCRs (low-copy repeats), duplicating as relatively large blocks of genes by NAHR (non-allelic homologous recombination). The second type of selection involves the close link between the accumulation of L1 elements and the expansion of the Abp gene family by NAHR. It is probably predicated on an initial selection for increased transcription of existing Abp genes and/or an increase in Abp gene number providing more transcriptional sites. Either or both could increase initial transcript production, a quantitative change similar to increasing the volume of a radio transmission. In closing, we also provide a note on Abp gene nomenclature.
Differential gene expression in Schistosoma japonicum schistosomula from Wistar rats and BALB/c mice

PubMed Central

2011-01-01

Background More than 46 species of mammals can be naturally infected with Schistosoma japonicum in the mainland of China. Mice are permissive and may act as the definitive host of the life cycle. In contrast, rats are less susceptible to S. japonicum infection, and are considered to provide an unsuitable micro-environment for parasite growth and development. Since little is known of what effects this micro-environment has on the parasite itself, we have in the present study utilised a S. japonicum oligonucleotide microarray to compare the gene expression differences of 10-day-old schistosomula maintained in Wistar rats with those maintained in BALB/c mice. Results In total 3,468 schistosome genes were found to be differentially expressed, of which the majority (3,335) were down-regulated (≤ 2 fold) and 133 were up-regulated (≥ 2 fold) in schistosomula from Wistar rats compared with those from BALB/c mice. Gene ontology (GO) analysis revealed that of the differentially expressed genes with already established functions or close homology to well characterized genes in another organisms, many are related to important biological functions or molecular processes. Among the genes that were down-regulated in schistosomula from Wistar rats, some were associated with metabolism, signal transduction and development. Of these genes related to metabolic processes, areas including translation, protein and amino acid phosphorylation, proteolysis, oxidoreductase activities, catalytic activities and hydrolase activities, were represented. KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway analysis of differential expressed genes indicated that of the 328 genes that had a specific KEGG pathway annotation, 324 were down-regulated and were mainly associated with metabolism, growth, redox pathway, oxidative phosphorylation, the cell cycle, ubiquitin-mediated proteolysis, protein export and the MAPK (mitogen-activated protein kinases) signaling pathway. Conclusions This work presents the first large scale gene expression study identifying the differences between schistosomula maintained in mice and those maintained in rats, and specifically highlights differential expression that may impact on the survival and development of the parasite within the definitive host. The research presented here provides valuable information for the better understanding of schistosome development and host-parasite interactions. PMID:21819550
Fe₃O₄ Nanoparticles in Targeted Drug/Gene Delivery Systems.

PubMed

Shen, Lazhen; Li, Bei; Qiao, Yongsheng

2018-02-23

Fe₃O₄ nanoparticles (NPs), the most traditional magnetic nanoparticles, have received a great deal of attention in the biomedical field, especially for targeted drug/gene delivery systems, due to their outstanding magnetism, biocompatibility, lower toxicity, biodegradability, and other features. Naked Fe₃O₄ NPs are easy to aggregate and oxidize, and thus are often made with various coatings to realize superior properties for targeted drug/gene delivery. In this review, we first list the three commonly utilized synthesis methods of Fe₃O₄ NPs, and their advantages and disadvantages. In the second part, we describe coating materials that exhibit noticeable features that allow functionalization of Fe₃O₄ NPs and summarize their methods of drug targeting/gene delivery. Then our efforts will be devoted to the research status and progress of several different functionalized Fe₃O₄ NP delivery systems loaded with chemotherapeutic agents, and we present targeted gene transitive carriers in detail. In the following section, we illuminate the most effective treatment systems of the combined drug and gene therapy. Finally, we propose opportunities and challenges of the clinical transformation of Fe₃O₄ NPs targeting drug/gene delivery systems.
The Genome Sequence of Mannheimia haemolytica A1: Insights into Virulence, Natural Competence, and Pasteurellaceae Phylogeny†

PubMed Central

Gioia, Jason; Qin, Xiang; Jiang, Huaiyang; Clinkenbeard, Kenneth; Lo, Reggie; Liu, Yamei; Fox, George E.; Yerrapragada, Shailaja; McLeod, Michael P.; McNeill, Thomas Z.; Hemphill, Lisa; Sodergren, Erica; Wang, Qiaoyan; Muzny, Donna M.; Homsi, Farah J.; Weinstock, George M.; Highlander, Sarah K.

2006-01-01

The draft genome sequence of Mannheimia haemolytica A1, the causative agent of bovine respiratory disease complex (BRDC), is presented. Strain ATCC BAA-410, isolated from the lung of a calf with BRDC, was the DNA source. The annotated genome includes 2,839 coding sequences, 1,966 of which were assigned a function and 436 of which are unique to M. haemolytica. Through genome annotation many features of interest were identified, including bacteriophages and genes related to virulence, natural competence, and transcriptional regulation. In addition to previously described virulence factors, M. haemolytica encodes adhesins, including the filamentous hemagglutinin FhaB and two trimeric autotransporter adhesins. Two dual-function immunoglobulin-protease/adhesins are also present, as is a third immunoglobulin protease. Genes related to iron acquisition and drug resistance were identified and are likely important for survival in the host and virulence. Analysis of the genome indicates that M. haemolytica is naturally competent, as genes for natural competence and DNA uptake signal sequences (USS) are present. Comparison of competence loci and USS in other species in the family Pasteurellaceae indicates that M. haemolytica, Actinobacillus pleuropneumoniae, and Haemophilus ducreyi form a lineage distinct from other Pasteurellaceae. This observation was supported by a phylogenetic analysis using sequences of predicted housekeeping genes. PMID:17015664
PanFP: Pangenome-based functional profiles for microbial communities

DOE PAGES

Jun, Se -Ran; Hauser, Loren John; Schadt, Christopher Warren; ...

2015-09-26

For decades there has been increasing interest in understanding the relationships between microbial communities and ecosystem functions. Current DNA sequencing technologies allows for the exploration of microbial communities in two principle ways: targeted rRNA gene surveys and shotgun metagenomics. For large study designs, it is often still prohibitively expensive to sequence metagenomes at both the breadth and depth necessary to statistically capture the true functional diversity of a community. Although rRNA gene surveys provide no direct evidence of function, they do provide a reasonable estimation of microbial diversity, while being a very cost effective way to screen samples of interestmore » for later shotgun metagenomic analyses. However, there is a great deal of 16S rRNA gene survey data currently available from diverse environments, and thus a need for tools to infer functional composition of environmental samples based on 16S rRNA gene survey data. As a result, we present a computational method called pangenome based functional profiles (PanFP), which infers functional profiles of microbial communities from 16S rRNA gene survey data for Bacteria and Archaea. PanFP is based on pangenome reconstruction of a 16S rRNA gene operational taxonomic unit (OTU) from known genes and genomes pooled from the OTU s taxonomic lineage. From this lineage, we derive an OTU functional profile by weighting a pangenome s functional profile with the OTUs abundance observed in a given sample. We validated our method by comparing PanFP to the functional profiles obtained from the direct shotgun metagenomic measurement of 65 diverse communities via Spearman correlation coefficients. These correlations improved with increasing sequencing depth, within the range of 0.8 0.9 for the most deeply sequenced Human Microbiome Project mock community samples. PanFP is very similar in performance to another recently released tool, PICRUSt, for almost all of survey data analysed here. But, our method is unique in that any OTU building method can be used, as opposed to being limited to closed reference OTU picking strategies against specific reference sequence databases. In conclusion, we developed an automated computational method, which derives an inferred functional profile based on the 16S rRNA gene surveys of microbial communities. The inferred functional profile provides a cost effective way to study complex ecosystems through predicted comparative functional metagenomes and metadata analysis. All PanFP source code and additional documentation are freely available online at GitHub.« less

PanFP: pangenome-based functional profiles for microbial communities.

PubMed

Jun, Se-Ran; Robeson, Michael S; Hauser, Loren J; Schadt, Christopher W; Gorin, Andrey A

2015-09-26

For decades there has been increasing interest in understanding the relationships between microbial communities and ecosystem functions. Current DNA sequencing technologies allows for the exploration of microbial communities in two principle ways: targeted rRNA gene surveys and shotgun metagenomics. For large study designs, it is often still prohibitively expensive to sequence metagenomes at both the breadth and depth necessary to statistically capture the true functional diversity of a community. Although rRNA gene surveys provide no direct evidence of function, they do provide a reasonable estimation of microbial diversity, while being a very cost-effective way to screen samples of interest for later shotgun metagenomic analyses. However, there is a great deal of 16S rRNA gene survey data currently available from diverse environments, and thus a need for tools to infer functional composition of environmental samples based on 16S rRNA gene survey data. We present a computational method called pangenome-based functional profiles (PanFP), which infers functional profiles of microbial communities from 16S rRNA gene survey data for Bacteria and Archaea. PanFP is based on pangenome reconstruction of a 16S rRNA gene operational taxonomic unit (OTU) from known genes and genomes pooled from the OTU's taxonomic lineage. From this lineage, we derive an OTU functional profile by weighting a pangenome's functional profile with the OTUs abundance observed in a given sample. We validated our method by comparing PanFP to the functional profiles obtained from the direct shotgun metagenomic measurement of 65 diverse communities via Spearman correlation coefficients. These correlations improved with increasing sequencing depth, within the range of 0.8-0.9 for the most deeply sequenced Human Microbiome Project mock community samples. PanFP is very similar in performance to another recently released tool, PICRUSt, for almost all of survey data analysed here. But, our method is unique in that any OTU building method can be used, as opposed to being limited to closed-reference OTU picking strategies against specific reference sequence databases. We developed an automated computational method, which derives an inferred functional profile based on the 16S rRNA gene surveys of microbial communities. The inferred functional profile provides a cost effective way to study complex ecosystems through predicted comparative functional metagenomes and metadata analysis. All PanFP source code and additional documentation are freely available online at GitHub ( https://github.com/srjun/PanFP ).
Cloning of novel rice blast resistance genes from two rapidly evolving NBS-LRR gene families in rice.

PubMed

Guo, Changjiang; Sun, Xiaoguang; Chen, Xiao; Yang, Sihai; Li, Jing; Wang, Long; Zhang, Xiaohui

2016-01-01

Most rice blast resistance genes (R-genes) encode proteins with nucleotide-binding site (NBS) and leucine-rich repeat (LRR) domains. Our previous study has shown that more rice blast R-genes can be cloned in rapidly evolving NBS-LRR gene families. In the present study, two rapidly evolving R-gene families in rice were selected for cloning a subset of genes from their paralogs in three resistant rice lines. A total of eight functional blast R-genes were identified among nine NBS-LRR genes, and some of these showed resistance to three or more blast strains. Evolutionary analysis indicated that high nucleotide diversity of coding regions served as important parameters in the determination of gene resistance. We also observed that amino-acid variants (nonsynonymous mutations, insertions, or deletions) in essential motifs of the NBS domain contribute to the blast resistance capacity of NBS-LRR genes. These results suggested that the NBS regions might also play an important role in resistance specificity determination. On the other hand, different splicing patterns of introns were commonly observed in R-genes. The results of the present study contribute to improving the effectiveness of R-gene identification by using evolutionary analysis method and acquisition of novel blast resistance genes.
The Genetic Basis for Variation in Sensitivity to Lead Toxicity in Drosophila melanogaster.

PubMed

Zhou, Shanshan; Morozova, Tatiana V; Hussain, Yasmeen N; Luoma, Sarah E; McCoy, Lenovia; Yamamoto, Akihiko; Mackay, Trudy F C; Anholt, Robert R H

2016-07-01

Lead toxicity presents a worldwide health problem, especially due to its adverse effects on cognitive development in children. However, identifying genes that give rise to individual variation in susceptibility to lead toxicity is challenging in human populations. Our goal was to use Drosophila melanogaster to identify evolutionarily conserved candidate genes associated with individual variation in susceptibility to lead exposure. To identify candidate genes associated with variation in susceptibility to lead toxicity, we measured effects of lead exposure on development time, viability and adult activity in the Drosophila melanogaster Genetic Reference Panel (DGRP) and performed genome-wide association analyses to identify candidate genes. We used mutants to assess functional causality of candidate genes and constructed a genetic network associated with variation in sensitivity to lead exposure, on which we could superimpose human orthologs. We found substantial heritabilities for all three traits and identified candidate genes associated with variation in susceptibility to lead exposure for each phenotype. The genetic architectures that determine variation in sensitivity to lead exposure are highly polygenic. Gene ontology and network analyses showed enrichment of genes associated with early development and function of the nervous system. Drosophila melanogaster presents an advantageous model to study the genetic underpinnings of variation in susceptibility to lead toxicity. Evolutionary conservation of cellular pathways that respond to toxic exposure allows predictions regarding orthologous genes and pathways across phyla. Thus, studies in the D. melanogaster model system can identify candidate susceptibility genes to guide subsequent studies in human populations. Zhou S, Morozova TV, Hussain YN, Luoma SE, McCoy L, Yamamoto A, Mackay TF, Anholt RR. 2016. The genetic basis for variation in sensitivity to lead toxicity in Drosophila melanogaster. Environ Health Perspect 124:1062-1070; http://dx.doi.org/10.1289/ehp.1510513.
Genomic Signatures of Selective Pressures and Introgression from Archaic Hominins at Human Innate Immunity Genes

PubMed Central

Deschamps, Matthieu; Laval, Guillaume; Fagny, Maud; Itan, Yuval; Abel, Laurent; Casanova, Jean-Laurent; Patin, Etienne; Quintana-Murci, Lluis

2016-01-01

Human genes governing innate immunity provide a valuable tool for the study of the selective pressure imposed by microorganisms on host genomes. A comprehensive, genome-wide study of how selective constraints and adaptations have driven the evolution of innate immunity genes is missing. Using full-genome sequence variation from the 1000 Genomes Project, we first show that innate immunity genes have globally evolved under stronger purifying selection than the remainder of protein-coding genes. We identify a gene set under the strongest selective constraints, mutations in which are likely to predispose individuals to life-threatening disease, as illustrated by STAT1 and TRAF3. We then evaluate the occurrence of local adaptation and detect 57 high-scoring signals of positive selection at innate immunity genes, variation in which has been associated with susceptibility to common infectious or autoimmune diseases. Furthermore, we show that most adaptations targeting coding variation have occurred in the last 6,000–13,000 years, the period at which populations shifted from hunting and gathering to farming. Finally, we show that innate immunity genes present higher Neandertal introgression than the remainder of the coding genome. Notably, among the genes presenting the highest Neandertal ancestry, we find the TLR6-TLR1-TLR10 cluster, which also contains functional adaptive variation in Europeans. This study identifies highly constrained genes that fulfill essential, non-redundant functions in host survival and reveals others that are more permissive to change—containing variation acquired from archaic hominins or adaptive variants in specific populations—improving our understanding of the relative biological importance of innate immunity pathways in natural conditions. PMID:26748513
Genome-Wide Identification and Comprehensive Expression Profiling of Ribosomal Protein Small Subunit (RPS) Genes and their Comparative Analysis with the Large Subunit (RPL) Genes in Rice

PubMed Central

Saha, Anusree; Das, Shubhajit; Moin, Mazahar; Dutta, Mouboni; Bakshi, Achala; Madhav, M. S.; Kirti, P. B.

2017-01-01

Ribosomal proteins (RPs) are indispensable in ribosome biogenesis and protein synthesis, and play a crucial role in diverse developmental processes. Our previous studies on Ribosomal Protein Large subunit (RPL) genes provided insights into their stress responsive roles in rice. In the present study, we have explored the developmental and stress regulated expression patterns of Ribosomal Protein Small (RPS) subunit genes for their differential expression in a spatiotemporal and stress dependent manner. We have also performed an in silico analysis of gene structure, cis-elements in upstream regulatory regions, protein properties and phylogeny. Expression studies of the 34 RPS genes in 13 different tissues of rice covering major growth and developmental stages revealed that their expression was substantially elevated, mostly in shoots and leaves indicating their possible involvement in the development of vegetative organs. The majority of the RPS genes have manifested significant expression under all abiotic stress treatments with ABA, PEG, NaCl, and H2O2. Infection with important rice pathogens, Xanthomonas oryzae pv. oryzae (Xoo) and Rhizoctonia solani also induced the up-regulation of several of the RPS genes. RPS4, 13a, 18a, and 4a have shown higher transcript levels under all the abiotic stresses, whereas, RPS4 is up-regulated in both the biotic stress treatments. The information obtained from the present investigation would be useful in appreciating the possible stress-regulatory attributes of the genes coding for rice ribosomal small subunit proteins apart from their functions as house-keeping proteins. A detailed functional analysis of independent genes is required to study their roles in stress tolerance and generating stress- tolerant crops. PMID:28966624
Genome-wide annotation of the soybean WRKY family and functional characterization of genes involved in response to Phakopsora pachyrhizi infection.

PubMed

Bencke-Malato, Marta; Cabreira, Caroline; Wiebke-Strohm, Beatriz; Bücker-Neto, Lauro; Mancini, Estefania; Osorio, Marina B; Homrich, Milena S; Turchetto-Zolet, Andreia Carina; De Carvalho, Mayra C C G; Stolf, Renata; Weber, Ricardo L M; Westergaard, Gastón; Castagnaro, Atílio P; Abdelnoor, Ricardo V; Marcelino-Guimarães, Francismar C; Margis-Pinheiro, Márcia; Bodanese-Zanettini, Maria Helena

2014-09-10

Many previous studies have shown that soybean WRKY transcription factors are involved in the plant response to biotic and abiotic stresses. Phakopsora pachyrhizi is the causal agent of Asian Soybean Rust, one of the most important soybean diseases. There are evidences that WRKYs are involved in the resistance of some soybean genotypes against that fungus. The number of WRKY genes already annotated in soybean genome was underrepresented. In the present study, a genome-wide annotation of the soybean WRKY family was carried out and members involved in the response to P. pachyrhizi were identified. As a result of a soybean genomic databases search, 182 WRKY-encoding genes were annotated and 33 putative pseudogenes identified. Genes involved in the response to P. pachyrhizi infection were identified using superSAGE, RNA-Seq of microdissected lesions and microarray experiments. Seventy-five genes were differentially expressed during fungal infection. The expression of eight WRKY genes was validated by RT-qPCR. The expression of these genes in a resistant genotype was earlier and/or stronger compared with a susceptible genotype in response to P. pachyrhizi infection. Soybean somatic embryos were transformed in order to overexpress or silence WRKY genes. Embryos overexpressing a WRKY gene were obtained, but they were unable to convert into plants. When infected with P. pachyrhizi, the leaves of the silenced transgenic line showed a higher number of lesions than the wild-type plants. The present study reports a genome-wide annotation of soybean WRKY family. The participation of some members in response to P. pachyrhizi infection was demonstrated. The results contribute to the elucidation of gene function and suggest the manipulation of WRKYs as a strategy to increase fungal resistance in soybean plants.
Developmental Regulation of Genes Encoding Universal Stress Proteins in Schistosoma mansoni

PubMed Central

Isokpehi, Raphael D.; Mahmud, Ousman; Mbah, Andreas N.; Simmons, Shaneka S.; Avelar, Lívia; Rajnarayanan, Rajendram V.; Udensi, Udensi K.; Ayensu, Wellington K.; Cohly, Hari H.; Brown, Shyretha D.; Dates, Centdrika R.; Hentz, Sonya D.; Hughes, Shawntae J.; Smith-McInnis, Dominique R.; Patterson, Carvey O.; Sims, Jennifer N.; Turner, Kelisha T.; Williams, Baraka S.; Johnson, Matilda O.; Adubi, Taiwo; Mbuh, Judith V.; Anumudu, Chiaka I.; Adeoye, Grace O.; Thomas, Bolaji N.; Nashiru, Oyekanmi; Oliveira, Guilherme

2011-01-01

The draft nuclear genome sequence of the snail-transmitted, dimorphic, parasitic, platyhelminth Schistosoma mansoni revealed eight genes encoding proteins that contain the Universal Stress Protein (USP) domain. Schistosoma mansoni is a causative agent of human schistosomiasis, a severe and debilitating Neglected Tropical Disease (NTD) of poverty, which is endemic in at least 76 countries. The availability of the genome sequences of Schistosoma species presents opportunities for bioinformatics and genomics analyses of associated gene families that could be targets for understanding schistosomiasis ecology, intervention, prevention and control. Proteins with the USP domain are known to provide bacteria, archaea, fungi, protists and plants with the ability to respond to diverse environmental stresses. In this research investigation, the functional annotations of the USP genes and predicted nucleotide and protein sequences were initially verified. Subsequently, sequence clusters and distinctive features of the sequences were determined. A total of twelve ligand binding sites were predicted based on alignment to the ATP-binding universal stress protein from Methanocaldococcus jannaschii. In addition, six USP sequences showed the presence of ATP-binding motif residues indicating that they may be regulated by ATP. Public domain gene expression data and RT-PCR assays confirmed that all the S. mansoni USP genes were transcribed in at least one of the developmental life cycle stages of the helminth. Six of these genes were up-regulated in the miracidium, a free-swimming stage that is critical for transmission to the snail intermediate host. It is possible that during the intra-snail stages, S. mansoni gene transcripts for universal stress proteins are low abundant and are induced to perform specialized functions triggered by environmental stressors such as oxidative stress due to hydrogen peroxide that is present in the snail hemocytes. This report serves to catalyze the formation of a network of researchers to understand the function and regulation of the universal stress proteins encoded in genomes of schistosomes and their snail intermediate hosts. PMID:22084571
A multilevel analysis of cognitive dysfunction and psychopathology associated with chromosome 22q11.2 deletion syndrome in children

PubMed Central

SIMON, TONY J.; BISH, JOEL P.; BEARDEN, CARRIE E.; DING, LIJUN; FERRANTE, SAMANTHA; NGUYEN, VY; GEE, JAMES C.; McDONALD–McGINN, DONNA M.; ZACKAI, ELAINE H.; EMANUEL, BEVERLY S.

2006-01-01

We present a multilevel approach to developing potential explanations of cognitive impairments and psychopathologies common to individuals with chromosome 22q11.2 deletion syndrome. Results presented support our hypothesis of posterior parietal dysfunction as a central determinant of characteristic visuospatial and numerical cognitive impairments. Converging data suggest that brain development anomalies, primarily tissue reductions in the posterior brain and changes to the corpus callosum, may affect parietal connectivity. Further findings indicate that dysfunction in “frontal” attention systems may explain some executive cognition impairments observed in affected children, and that there may be links between these domains of cognitive function and some of the serious psychiatric conditions, such as attention-deficit/hyperactivity disorder, autism, and schizophrenia, that have elevated incidence rates in the syndrome. Linking the neural structure and the cognitive processing levels in this way enabled us to develop an elaborate structure/function mapping hypothesis for the impairments that are observed. We show also, that in the case of the catechol-O-methyltransferase gene, a fairly direct relationship between gene expression, cognitive function, and psychopathology exists in the affected population. Beyond that, we introduce the idea that variation in other genes may further explain the phenotypic variation in cognitive function and possibly the anomalies in brain development. PMID:16262991
Directed evolution induces tributyrin hydrolysis in a virulence factor of Xylella fastidiosa using a duplicated gene as a template.

PubMed

Gouran, Hossein; Chakraborty, Sandeep; Rao, Basuthkar J; Asgeirsson, Bjarni; Dandekar, Abhaya

2014-01-01

Duplication of genes is one of the preferred ways for natural selection to add advantageous functionality to the genome without having to reinvent the wheel with respect to catalytic efficiency and protein stability. The duplicated secretory virulence factors of Xylella fastidiosa (LesA, LesB and LesC), implicated in Pierce's disease of grape and citrus variegated chlorosis of citrus species, epitomizes the positive selection pressures exerted on advantageous genes in such pathogens. A deeper insight into the evolution of these lipases/esterases is essential to develop resistance mechanisms in transgenic plants. Directed evolution, an attempt to accelerate the evolutionary steps in the laboratory, is inherently simple when targeted for loss of function. A bigger challenge is to specify mutations that endow a new function, such as a lost functionality in a duplicated gene. Previously, we have proposed a method for enumerating candidates for mutations intended to transfer the functionality of one protein into another related protein based on the spatial and electrostatic properties of the active site residues (DECAAF). In the current work, we present in vivo validation of DECAAF by inducing tributyrin hydrolysis in LesB based on the active site similarity to LesA. The structures of these proteins have been modeled using RaptorX based on the closely related LipA protein from Xanthomonas oryzae. These mutations replicate the spatial and electrostatic conformation of LesA in the modeled structure of the mutant LesB as well, providing in silico validation before proceeding to the laborious in vivo work. Such focused mutations allows one to dissect the relevance of the duplicated genes in finer detail as compared to gene knockouts, since they do not interfere with other moonlighting functions, protein expression levels or protein-protein interaction.
Directed evolution induces tributyrin hydrolysis in a virulence factor of Xylella fastidiosa using a duplicated gene as a template

PubMed Central

Rao, Basuthkar J.; Asgeirsson, Bjarni; Dandekar, Abhaya

2014-01-01

Duplication of genes is one of the preferred ways for natural selection to add advantageous functionality to the genome without having to reinvent the wheel with respect to catalytic efficiency and protein stability. The duplicated secretory virulence factors of Xylella fastidiosa (LesA, LesB and LesC), implicated in Pierce's disease of grape and citrus variegated chlorosis of citrus species, epitomizes the positive selection pressures exerted on advantageous genes in such pathogens. A deeper insight into the evolution of these lipases/esterases is essential to develop resistance mechanisms in transgenic plants. Directed evolution, an attempt to accelerate the evolutionary steps in the laboratory, is inherently simple when targeted for loss of function. A bigger challenge is to specify mutations that endow a new function, such as a lost functionality in a duplicated gene. Previously, we have proposed a method for enumerating candidates for mutations intended to transfer the functionality of one protein into another related protein based on the spatial and electrostatic properties of the active site residues (DECAAF). In the current work, we present in vivo validation of DECAAF by inducing tributyrin hydrolysis in LesB based on the active site similarity to LesA. The structures of these proteins have been modeled using RaptorX based on the closely related LipA protein from Xanthomonas oryzae. These mutations replicate the spatial and electrostatic conformation of LesA in the modeled structure of the mutant LesB as well, providing in silico validation before proceeding to the laborious in vivo work. Such focused mutations allows one to dissect the relevance of the duplicated genes in finer detail as compared to gene knockouts, since they do not interfere with other moonlighting functions, protein expression levels or protein-protein interaction. PMID:25717364
Genome-Wide Distribution, Organisation and Functional Characterization of Disease Resistance and Defence Response Genes across Rice Species

PubMed Central

Singh, Sangeeta; Chand, Suresh; Singh, N. K.; Sharma, Tilak Raj

2015-01-01

The resistance (R) genes and defense response (DR) genes have become very important resources for the development of disease resistant cultivars. In the present investigation, genome-wide identification, expression, phylogenetic and synteny analysis was done for R and DR-genes across three species of rice viz: Oryza sativa ssp indica cv 93-11, Oryza sativa ssp japonica and wild rice species, Oryza brachyantha. We used the in silico approach to identify and map 786 R -genes and 167 DR-genes, 672 R-genes and 142 DR-genes, 251 R-genes and 86 DR-genes in the japonica, indica and O. brachyanth a genomes, respectively. Our analysis showed that 60.5% and 55.6% of the R-genes are tandemly repeated within clusters and distributed over all the rice chromosomes in indica and japonica genomes, respectively. The phylogenetic analysis along with motif distribution shows high degree of conservation of R- and DR-genes in clusters. In silico expression analysis of R-genes and DR-genes showed more than 85% were expressed genes showing corresponding EST matches in the databases. This study gave special emphasis on mechanisms of gene evolution and duplication for R and DR genes across species. Analysis of paralogs across rice species indicated 17% and 4.38% R-genes, 29% and 11.63% DR-genes duplication in indica and Oryza brachyantha, as compared to 20% and 26% duplication of R-genes and DR-genes in japonica respectively. We found that during the course of duplication only 9.5% of R- and DR-genes changed their function and rest of the genes have maintained their identity. Syntenic relationship across three genomes inferred that more orthology is shared between indica and japonica genomes as compared to brachyantha genome. Genome wide identification of R-genes and DR-genes in the rice genome will help in allele mining and functional validation of these genes, and to understand molecular mechanism of disease resistance and their evolution in rice and related species. PMID:25902056
Expression profiles of urbilaterian genes uniquely shared between honey bee and vertebrates

PubMed Central

Matsui, Toshiaki; Yamamoto, Toshiyuki; Wyder, Stefan; Zdobnov, Evgeny M; Kadowaki, Tatsuhiko

2009-01-01

Background Large-scale comparison of metazoan genomes has revealed that a significant fraction of genes of the last common ancestor of Bilateria (Urbilateria) is lost in each animal lineage. This event could be one of the underlying mechanisms involved in generating metazoan diversity. However, the present functions of these ancient genes have not been addressed extensively. To understand the functions and evolutionary mechanisms of such ancient Urbilaterian genes, we carried out comprehensive expression profile analysis of genes shared between vertebrates and honey bees but not with the other sequenced ecdysozoan genomes (honey bee-vertebrate specific, HVS genes) as a model. Results We identified 30 honey bee and 55 mouse HVS genes. Many HVS genes exhibited tissue-selective expression patterns; intriguingly, the expression of 60% of honey bee HVS genes was found to be brain enriched, and 24% of mouse HVS genes were highly expressed in either or both the brain and testis. Moreover, a minimum of 38% of mouse HVS genes demonstrated neuron-enriched expression patterns, and 62% of them exhibited expression in selective brain areas, particularly the forebrain and cerebellum. Furthermore, gene ontology (GO) analysis of HVS genes predicted that 35% of genes are associated with DNA transcription and RNA processing. Conclusion These results suggest that HVS genes include genes that are biased towards expression in the brain and gonads. They also demonstrate that at least some of Urbilaterian genes retained in the specific animal lineage may be selectively maintained to support the species-specific phenotypes. PMID:19138430
Expression profiles of urbilaterian genes uniquely shared between honey bee and vertebrates.

PubMed

Matsui, Toshiaki; Yamamoto, Toshiyuki; Wyder, Stefan; Zdobnov, Evgeny M; Kadowaki, Tatsuhiko

2009-01-12

Large-scale comparison of metazoan genomes has revealed that a significant fraction of genes of the last common ancestor of Bilateria (Urbilateria) is lost in each animal lineage. This event could be one of the underlying mechanisms involved in generating metazoan diversity. However, the present functions of these ancient genes have not been addressed extensively. To understand the functions and evolutionary mechanisms of such ancient Urbilaterian genes, we carried out comprehensive expression profile analysis of genes shared between vertebrates and honey bees but not with the other sequenced ecdysozoan genomes (honey bee-vertebrate specific, HVS genes) as a model. We identified 30 honey bee and 55 mouse HVS genes. Many HVS genes exhibited tissue-selective expression patterns; intriguingly, the expression of 60% of honey bee HVS genes was found to be brain enriched, and 24% of mouse HVS genes were highly expressed in either or both the brain and testis. Moreover, a minimum of 38% of mouse HVS genes demonstrated neuron-enriched expression patterns, and 62% of them exhibited expression in selective brain areas, particularly the forebrain and cerebellum. Furthermore, gene ontology (GO) analysis of HVS genes predicted that 35% of genes are associated with DNA transcription and RNA processing. These results suggest that HVS genes include genes that are biased towards expression in the brain and gonads. They also demonstrate that at least some of Urbilaterian genes retained in the specific animal lineage may be selectively maintained to support the species-specific phenotypes.
Gene Therapy Rescues Cone Structure and Function in the 3-Month-Old rd12 Mouse: A Model for Midcourse RPE65 Leber Congenital Amaurosis

PubMed Central

Li, Xia; Li, Wensheng; Dai, Xufeng; Kong, Fansheng; Zheng, Qinxiang; Zhou, Xiangtian; Lü, Fan; Chang, Bo; Rohrer, Bärbel; Hauswirth, William. W.; Qu, Jia; Pang, Ji-jing

2011-01-01

Purpose. RPE65 function is necessary in the retinal pigment epithelium (RPE) to generate chromophore for all opsins. Its absence results in vision loss and rapid cone degeneration. Recent Leber congenital amaurosis type 2 (LCA with RPE65 mutations) phase I clinical trials demonstrated restoration of vision on RPE65 gene transfer into RPE cells overlying cones. In the rd12 mouse, a naturally occurring model of RPE65-LCA early cone degeneration was observed; however, some peripheral M-cones remained. A prior study showed that AAV-mediated RPE65 expression can prevent early cone degeneration. The present study was conducted to test whether the remaining cones in older rd12 mice can be rescued. Methods. Subretinal treatment with the scAAV5-smCBA-hRPE65 vector was initiated at postnatal day (P)14 and P90. After 2 months, electroretinograms were recorded, and cone morphology was analyzed by using cone-specific peanut agglutinin and cone opsin–specific antibodies. Results. Cone degeneration started centrally and spread ventrally, with cells losing cone-opsin staining before that for the PNA-lectin–positive cone sheath. Gene therapy starting at P14 resulted in almost wild-type M- and S-cone function and morphology. Delaying gene-replacement rescued the remaining M-cones, and most important, more M-cone opsin–positive cells were identified than were present at the onset of gene therapy, suggesting that opsin expression could be reinitiated in cells with cone sheaths. Conclusions. The results support and extend those of the previous study that gene therapy can stop early cone degeneration, and, more important, they provide proof that delayed treatment can restore the function and morphology of the remaining cones. These results have important implications for the ongoing LCA2 clinical trials. PMID:21169527
Automated genomic context analysis and experimental validation platform for discovery of prokaryote transcriptional regulator functions

DOE PAGES

Martí-Arbona, Ricardo; Mu, Fangping; Nowak-Lovato, Kristy L.; ...

2014-12-18

In this study, the clustering of genes in a pathway and the co-location of functionally related genes is widely recognized in prokaryotes. We used these characteristics to predict the metabolic involvement for a Transcriptional Regulator (TR) of unknown function, identified and confirmed its biological activity. software tool that identifies the genes encoded within a defined genomic neighborhood for the subject TR and its homologs was developed. The output lists of genes in the genetic neighborhoods, their annotated functions, the reactants/products, and identifies the metabolic pathway in which the encoded-proteins function. When a set of TRs of known function was analyzed,more » we observed that their homologs frequently had conserved genomic neighborhoods that co-located the metabolically related genes regulated by the subject TR. We postulate that TR effectors are metabolites in the identified pathways; indeed the known effectors were present. We analyzed Bxe_B3018 from Burkholderia xenovorans, a TR of unknown function and predicted that this TR was related to the glycine, threonine and serine degradation. We tested the binding of metabolites in these pathways and for those that bound, their ability to modulate TR binding to its specific DNA operator sequence. Using rtPCR, we confirmed that methylglyoxal was an effector of Bxe_3018. These studies provide the proof of concept and validation of a systematic approach to the discovery of the biological activity for proteins of unknown function, in this case a TR. Bxe_B3018 is a methylglyoxal responsive TR that controls the expression of an operon composed of a putative efflux system.« less
A Genome-Wide Analysis of the LBD (LATERAL ORGAN BOUNDARIES Domain) Gene Family in Malus domestica with a Functional Characterization of MdLBD11

PubMed Central

Su, Ling; Liu, Xin; Hao, Yujin

2013-01-01

The plant-specific LBD (LATERAL ORGAN BOUNDARIES domain) genes belong to a major family of transcription factor that encode a zinc finger-like domain. It has been shown that LBD genes play crucial roles in the growth and development of Arabidopsis and other plant species. However, no detailed information concerning this family is available for apple. In the present study, we analyzed the apple (Malus domestica) genome and identified 58 LBD genes. This gene family was tested for its phylogenetic relationships with homologous genes in the Arabidopsis genome, as well as its location in the genome, structure and expression. We also transformed one MdLBD gene into Arabidopsis to evaluate its function. Like Arabidopsis, apple LBD genes also have a conserved CX2CX6CX3C zinc finger-like domain in the N terminus and can be divided into two classes. The expression profile indicated that apple LBD genes exhibited a variety of expression patterns, suggesting that they have diverse functions. At the same time, the expression analysis implied that members of this apple gene family were responsive to hormones and stress and that they may participate in hormone-mediated plant organogenesis, which was demonstrated with the overexpression of the apple LBD gene MdLBD11, resulting in an abnormal phenotype. This phenotype included upward curling leaves, delayed flowering, downward-pointing flowers, siliques and other abnormal traits. Based on these data, we concluded that the MdLBD genes may play an important role in apple growth and development as in Arabidopsis and other species. PMID:23468909
Investigating a multigene prognostic assay based on significant pathways for Luminal A breast cancer through gene expression profile analysis.

PubMed

Gao, Haiyan; Yang, Mei; Zhang, Xiaolan

2018-04-01

The present study aimed to investigate potential recurrence-risk biomarkers based on significant pathways for Luminal A breast cancer through gene expression profile analysis. Initially, the gene expression profiles of Luminal A breast cancer patients were downloaded from The Cancer Genome Atlas database. The differentially expressed genes (DEGs) were identified using a Limma package and the hierarchical clustering analysis was conducted for the DEGs. In addition, the functional pathways were screened using Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses and rank ratio calculation. The multigene prognostic assay was exploited based on the statistically significant pathways and its prognostic function was tested using train set and verified using the gene expression data and survival data of Luminal A breast cancer patients downloaded from the Gene Expression Omnibus. A total of 300 DEGs were identified between good and poor outcome groups, including 176 upregulated genes and 124 downregulated genes. The DEGs may be used to effectively distinguish Luminal A samples with different prognoses verified by hierarchical clustering analysis. There were 9 pathways screened as significant pathways and a total of 18 DEGs involved in these 9 pathways were identified as prognostic biomarkers. According to the survival analysis and receiver operating characteristic curve, the obtained 18-gene prognostic assay exhibited good prognostic function with high sensitivity and specificity to both the train and test samples. In conclusion the 18-gene prognostic assay including the key genes, transcription factor 7-like 2, anterior parietal cortex and lymphocyte enhancer factor-1 may provide a new method for predicting outcomes and may be conducive to the promotion of precision medicine for Luminal A breast cancer.
Comparative symbiotic plasmid analysis indicates that symbiosis gene ancestor type affects plasmid genetic evolution.

PubMed

Wang, X; Zhao, L; Zhang, L; Wu, Y; Chou, M; Wei, G

2018-07-01

Rhizobial symbiotic plasmids play vital roles in mutualistic symbiosis with legume plants by executing the functions of nodulation and nitrogen fixation. To explore the gene composition and genetic constitution of rhizobial symbiotic plasmids, comparison analyses of 24 rhizobial symbiotic plasmids derived from four rhizobial genera was carried out. Results illustrated that rhizobial symbiotic plasmids had higher proportion of functional genes participating in amino acid transport and metabolism, replication; recombination and repair; carbohydrate transport and metabolism; energy production and conversion and transcription. Mesorhizobium amorphae CCNWGS0123 symbiotic plasmid - pM0123d had similar gene composition with pR899b and pSNGR234a. All symbiotic plasmids shared 13 orthologous genes, including five nod and eight nif/fix genes which participate in the rhizobia-legume symbiosis process. These plasmids contained nod genes from four ancestors and fix genes from six ancestors. The ancestral type of pM0123d nod genes was similar with that of Rhizobium etli plasmids, while the ancestral type of pM0123d fix genes was same as that of pM7653Rb. The phylogenetic trees constructed based on nodCIJ and fixABC displayed different topological structures mainly due to nodCIJ and fixABC ancestral type discordance. The study presents valuable insights into mosaic structures and the evolution of rhizobial symbiotic plasmids. This study compared 24 rhizobial symbiotic plasmids that included four genera and 11 species, illuminating the functional gene composition and symbiosis gene ancestor types of symbiotic plasmids from higher taxonomy. It provides valuable insights into mosaic structures and the evolution of symbiotic plasmids. © 2018 The Society for Applied Microbiology.
A genome-wide analysis of the LBD (LATERAL ORGAN BOUNDARIES domain) gene family in Malus domestica with a functional characterization of MdLBD11.

PubMed

Wang, Xiaofei; Zhang, Shizhong; Su, Ling; Liu, Xin; Hao, Yujin

2013-01-01

The plant-specific LBD (LATERAL ORGAN BOUNDARIES domain) genes belong to a major family of transcription factor that encode a zinc finger-like domain. It has been shown that LBD genes play crucial roles in the growth and development of Arabidopsis and other plant species. However, no detailed information concerning this family is available for apple. In the present study, we analyzed the apple (Malus domestica) genome and identified 58 LBD genes. This gene family was tested for its phylogenetic relationships with homologous genes in the Arabidopsis genome, as well as its location in the genome, structure and expression. We also transformed one MdLBD gene into Arabidopsis to evaluate its function. Like Arabidopsis, apple LBD genes also have a conserved CX2CX6CX3C zinc finger-like domain in the N terminus and can be divided into two classes. The expression profile indicated that apple LBD genes exhibited a variety of expression patterns, suggesting that they have diverse functions. At the same time, the expression analysis implied that members of this apple gene family were responsive to hormones and stress and that they may participate in hormone-mediated plant organogenesis, which was demonstrated with the overexpression of the apple LBD gene MdLBD11, resulting in an abnormal phenotype. This phenotype included upward curling leaves, delayed flowering, downward-pointing flowers, siliques and other abnormal traits. Based on these data, we concluded that the MdLBD genes may play an important role in apple growth and development as in Arabidopsis and other species.
Expression and phylogenetic analyses reveal paralogous lineages of putatively classical and non-classical MHC-I genes in three sparrow species (Passer).

PubMed

Drews, Anna; Strandh, Maria; Råberg, Lars; Westerdahl, Helena

2017-06-26

The Major Histocompatibility Complex (MHC) plays a central role in immunity and has been given considerable attention by evolutionary ecologists due to its associations with fitness-related traits. Songbirds have unusually high numbers of MHC class I (MHC-I) genes, but it is not known whether all are expressed and equally important for immune function. Classical MHC-I genes are highly expressed, polymorphic and present peptides to T-cells whereas non-classical MHC-I genes have lower expression, are more monomorphic and do not present peptides to T-cells. To get a better understanding of the highly duplicated MHC genes in songbirds, we studied gene expression in a phylogenetic framework in three species of sparrows (house sparrow, tree sparrow and Spanish sparrow), using high-throughput sequencing. We hypothesize that sparrows could have classical and non-classical genes, as previously indicated though never tested using gene expression. The phylogenetic analyses reveal two distinct types of MHC-I alleles among the three sparrow species, one with high and one with low level of polymorphism, thus resembling classical and non-classical genes, respectively. All individuals had both types of alleles, but there was copy number variation both within and among the sparrow species. However, the number of highly polymorphic alleles that were expressed did not vary between species, suggesting that the structural genomic variation is counterbalanced by conserved gene expression. Overall, 50% of the MHC-I alleles were expressed in sparrows. Expression of the highly polymorphic alleles was very variable, whereas the alleles with low polymorphism had uniformly low expression. Interestingly, within an individual only one or two alleles from the polymorphic genes were highly expressed, indicating that only a single copy of these is highly expressed. Taken together, the phylogenetic reconstruction and the analyses of expression suggest that sparrows have both classical and non-classical MHC-I genes, and that the evolutionary origin of these genes predate the split of the three investigated sparrow species 7 million years ago. Because only the classical MHC-I genes are involved in antigen presentation, the function of different MHC-I genes should be considered in future ecological and evolutionary studies of MHC-I in sparrows and other songbirds.

Cloning of Gossypium hirsutum Sucrose Non-Fermenting 1-Related Protein Kinase 2 Gene (GhSnRK2) and Its Overexpression in Transgenic Arabidopsis Escalates Drought and Low Temperature Tolerance

PubMed Central

Bello, Babatunde; Zhang, Xueyan; Liu, Chuanliang; Yang, Zhaoen; Yang, Zuoren; Wang, Qianhua; Zhao, Ge; Li, Fuguang

2014-01-01

The molecular mechanisms of stress tolerance and the use of modern genetics approaches for the improvement of drought stress tolerance have been major focuses of plant molecular biologists. In the present study, we cloned the Gossypium hirsutum sucrose non-fermenting 1-related protein kinase 2 (GhSnRK2) gene and investigated its functions in transgenic Arabidopsis. We further elucidated the function of this gene in transgenic cotton using virus-induced gene silencing (VIGS) techniques. We hypothesized that GhSnRK2 participates in the stress signaling pathway and elucidated its role in enhancing stress tolerance in plants via various stress-related pathways and stress-responsive genes. We determined that the subcellular localization of the GhSnRK2-green fluorescent protein (GFP) was localized in the nuclei and cytoplasm. In contrast to wild-type plants, transgenic plants overexpressing GhSnRK2 exhibited increased tolerance to drought, cold, abscisic acid and salt stresses, suggesting that GhSnRK2 acts as a positive regulator in response to cold and drought stresses. Plants overexpressing GhSnRK2 displayed evidence of reduced water loss, turgor regulation, elevated relative water content, biomass, and proline accumulation. qRT-PCR analysis of GhSnRK2 expression suggested that this gene may function in diverse tissues. Under normal and stress conditions, the expression levels of stress-inducible genes, such as AtRD29A, AtRD29B, AtP5CS1, AtABI3, AtCBF1, and AtABI5, were increased in the GhSnRK2-overexpressing plants compared to the wild-type plants. GhSnRK2 gene silencing alleviated drought tolerance in cotton plants, indicating that VIGS technique can certainly be used as an effective means to examine gene function by knocking down the expression of distinctly expressed genes. The results of this study suggested that the GhSnRK2 gene, when incorporated into Arabidopsis, functions in positive responses to drought stress and in low temperature tolerance. PMID:25393623
Multiple Multi-Copper Oxidase Gene Families in Basidiomycetes – What for?

PubMed Central

Kües, Ursula; Rühl, Martin

2011-01-01

Genome analyses revealed in various basidiomycetes the existence of multiple genes for blue multi-copper oxidases (MCOs). Whole genomes are now available from saprotrophs, white rot and brown rot species, plant and animal pathogens and ectomycorrhizal species. Total numbers (from 1 to 17) and types of mco genes differ between analyzed species with no easy to recognize connection of gene distribution to fungal life styles. Types of mco genes might be present in one and absent in another fungus. Distinct types of genes have been multiplied at speciation in different organisms. Phylogenetic analysis defined different subfamilies of laccases sensu stricto (specific to Agaricomycetes), classical Fe2+-oxidizing Fet3-like ferroxidases, potential ferroxidases/laccases exhibiting either one or both of these enzymatic functions, enzymes clustering with pigment MCOs and putative ascorbate oxidases. Biochemically best described are laccases sensu stricto due to their proposed roles in degradation of wood, straw and plant litter and due to the large interest in these enzymes in biotechnology. However, biological functions of laccases and other MCOs are generally little addressed. Functions in substrate degradation, symbiontic and pathogenic intercations, development, pigmentation and copper homeostasis have been put forward. Evidences for biological functions are in most instances rather circumstantial by correlations of expression. Multiple factors impede research on biological functions such as difficulties of defining suitable biological systems for molecular research, the broad and overlapping substrate spectrum multi-copper oxidases usually possess, the low existent knowledge on their natural substrates, difficulties imposed by low expression or expression of multiple enzymes, and difficulties in expressing enzymes heterologously. PMID:21966246
Lessons from ten years of genome-wide association studies of asthma

PubMed Central

Vicente, Cristina T; Revez, Joana A; Ferreira, Manuel A R

2017-01-01

Twenty-five genome-wide association studies (GWAS) of asthma were published between 2007 and 2016, the largest with a sample size of 157242 individuals. Across these studies, 39 genetic variants in low linkage disequilibrium (LD) with each other were reported to associate with disease risk at a significance threshold of P<5 × 10−8, including 31 in populations of European ancestry. Results from analyses of the UK Biobank data (n=380 503) indicate that at least 28 of the 31 associations reported in Europeans represent true-positive findings, collectively explaining 2.5% of the variation in disease liability (median of 0.06% per variant). We identified 49 transcripts as likely target genes of the published asthma risk variants, mostly based on LD with expression quantitative trait loci (eQTL). Of these genes, 16 were previously implicated in disease pathophysiology by functional studies, including TSLP, TNFSF4, ADORA1, CHIT1 and USF1. In contrast, at present, there is limited or no functional evidence directly implicating the remaining 33 likely target genes in asthma pathophysiology. Some of these genes have a known function that is relevant to allergic disease, including F11R, CD247, PGAP3, AAGAB, CAMK4 and PEX14, and so could be prioritized for functional follow-up. We conclude by highlighting three areas of research that are essential to help translate GWAS findings into clinical research or practice, namely validation of target gene predictions, understanding target gene function and their role in disease pathophysiology and genomics-guided prioritization of targets for drug development. PMID:29333270
Medicago truncatula contains a second gene encoding a plastid located glutamine synthetase exclusively expressed in developing seeds.

PubMed

Seabra, Ana R; Vieira, Cristina P; Cullimore, Julie V; Carvalho, Helena G

2010-08-19

Nitrogen is a crucial nutrient that is both essential and rate limiting for plant growth and seed production. Glutamine synthetase (GS), occupies a central position in nitrogen assimilation and recycling, justifying the extensive number of studies that have been dedicated to this enzyme from several plant sources. All plants species studied to date have been reported as containing a single, nuclear gene encoding a plastid located GS isoenzyme per haploid genome. This study reports the existence of a second nuclear gene encoding a plastid located GS in Medicago truncatula. This study characterizes a new, second gene encoding a plastid located glutamine synthetase (GS2) in M. truncatula. The gene encodes a functional GS isoenzyme with unique kinetic properties, which is exclusively expressed in developing seeds. Based on molecular data and the assumption of a molecular clock, it is estimated that the gene arose from a duplication event that occurred about 10 My ago, after legume speciation and that duplicated sequences are also present in closely related species of the Vicioide subclade. Expression analysis by RT-PCR and western blot indicate that the gene is exclusively expressed in developing seeds and its expression is related to seed filling, suggesting a specific function of the enzyme associated to legume seed metabolism. Interestingly, the gene was found to be subjected to alternative splicing over the first intron, leading to the formation of two transcripts with similar open reading frames but varying 5' UTR lengths, due to retention of the first intron. To our knowledge, this is the first report of alternative splicing on a plant GS gene. This study shows that Medicago truncatula contains an additional GS gene encoding a plastid located isoenzyme, which is functional and exclusively expressed during seed development. Legumes produce protein-rich seeds requiring high amounts of nitrogen, we postulate that this gene duplication represents a functional innovation of plastid located GS related to storage protein accumulation exclusive to legume seed metabolism.
Genome-Wide Identification and Expression Analysis of Homeodomain Leucine Zipper Subfamily IV (HDZ IV) Gene Family from Musa accuminata

PubMed Central

Pandey, Ashutosh; Misra, Prashant; Alok, Anshu; Kaur, Navneet; Sharma, Shivani; Lakhwani, Deepika; Asif, Mehar H.; Tiwari, Siddharth; Trivedi, Prabodh K.

2016-01-01

The homeodomain zipper family (HD-ZIP) of transcription factors is present only in plants and plays important role in the regulation of plant-specific processes. The subfamily IV of HDZ transcription factors (HD-ZIP IV) has primarily been implicated in the regulation of epidermal structure development. Though this gene family is present in all lineages of land plants, members of this gene family have not been identified in banana, which is one of the major staple fruit crops. In the present work, we identified 21 HDZIV encoding genes in banana by the computational analysis of banana genome resource. Our analysis suggested that these genes putatively encode proteins having all the characteristic domains of HDZIV transcription factors. The phylogenetic analysis of the banana HDZIV family genes further confirmed that after separation from a common ancestor, the banana, and poales lineages might have followed distinct evolutionary paths. Further, we conclude that segmental duplication played a major role in the evolution of banana HDZIV encoding genes. All the identified banana HDZIV genes expresses in different banana tissue, however at varying levels. The transcript levels of some of the banana HDZIV genes were also detected in banana fruit pulp, suggesting their putative role in fruit attributes. A large number of genes of this family showed modulated expression under drought and salinity stress. Taken together, the present work lays a foundation for elucidation of functional aspects of the banana HDZIV encoding genes and for their possible use in the banana improvement programs. PMID:26870050
Coaction of Stress and Serotonin Transporter Genotype in Predicting Aggression at the Transition to Adulthood

ERIC Educational Resources Information Center

Conway, Christopher C.; Keenan-Miller, Danielle; Hammen, Constance; Lind, Penelope A.; Najman, Jake M.; Brennan, Patricia A.

2012-01-01

Despite consistent evidence that serotonin functioning affects stress reactivity and vulnerability to aggression, research on serotonin gene-stress interactions (G x E) in the development of aggression remains limited. The present study investigated variation in the promoter region of the serotonin transporter gene (5-HTTLPR) as a moderator of the…
Calmodulin Methyltransferase Is Required for Growth, Muscle Strength, Somatosensory Development and Brain Function

PubMed Central

Haziza, Sitvanit; Magnani, Roberta; Lan, Dima; Keinan, Omer; Saada, Ann; Hershkovitz, Eli; Yanay, Nurit; Cohen, Yoram; Nevo, Yoram; Houtz, Robert L.; Sheffield, Val C.; Golan, Hava; Parvari, Ruti

2015-01-01

Calmodulin lysine methyl transferase (CaM KMT) is ubiquitously expressed and highly conserved from plants to vertebrates. CaM is frequently trimethylated at Lys-115, however, the role of CaM methylation in vertebrates has not been studied. CaM KMT was found to be homozygously deleted in the 2P21 deletion syndrome that includes 4 genes. These patients present with cystinuria, severe intellectual disabilities, hypotonia, mitochondrial disease and facial dysmorphism. Two siblings with deletion of three of the genes included in the 2P21 deletion syndrome presented with cystinuria, hypotonia, a mild/moderate mental retardation and a respiratory chain complex IV deficiency. To be able to attribute the functional significance of the methylation of CaM in the mouse and the contribution of CaM KMT to the clinical presentation of the 2p21deletion patients, we produced a mouse model lacking only CaM KMT with deletion borders as in the human 2p21deletion syndrome. No compensatory activity for CaM methylation was found. Impairment of complexes I and IV, and less significantly III, of the mitochondrial respiratory chain was more pronounced in the brain than in muscle. CaM KMT is essential for normal body growth and somatosensory development, as well as for the proper functioning of the adult mouse brain. Developmental delay was demonstrated for somatosensory function and for complex behavior, which involved both basal motor function and motivation. The mutant mice also had deficits in motor learning, complex coordination and learning of aversive stimuli. The mouse model contributes to the evaluation of the role of methylated CaM. CaM methylation appears to have a role in growth, muscle strength, somatosensory development and brain function. The current study has clinical implications for human patients. Patients presenting slow growth and muscle weakness that could result from a mitochondrial impairment and mental retardation should be considered for sequence analysis of the CaM KMT gene. PMID:26247364
Calmodulin Methyltransferase Is Required for Growth, Muscle Strength, Somatosensory Development and Brain Function.

PubMed

Haziza, Sitvanit; Magnani, Roberta; Lan, Dima; Keinan, Omer; Saada, Ann; Hershkovitz, Eli; Yanay, Nurit; Cohen, Yoram; Nevo, Yoram; Houtz, Robert L; Sheffield, Val C; Golan, Hava; Parvari, Ruti

2015-08-01

Calmodulin lysine methyl transferase (CaM KMT) is ubiquitously expressed and highly conserved from plants to vertebrates. CaM is frequently trimethylated at Lys-115, however, the role of CaM methylation in vertebrates has not been studied. CaM KMT was found to be homozygously deleted in the 2P21 deletion syndrome that includes 4 genes. These patients present with cystinuria, severe intellectual disabilities, hypotonia, mitochondrial disease and facial dysmorphism. Two siblings with deletion of three of the genes included in the 2P21 deletion syndrome presented with cystinuria, hypotonia, a mild/moderate mental retardation and a respiratory chain complex IV deficiency. To be able to attribute the functional significance of the methylation of CaM in the mouse and the contribution of CaM KMT to the clinical presentation of the 2p21deletion patients, we produced a mouse model lacking only CaM KMT with deletion borders as in the human 2p21deletion syndrome. No compensatory activity for CaM methylation was found. Impairment of complexes I and IV, and less significantly III, of the mitochondrial respiratory chain was more pronounced in the brain than in muscle. CaM KMT is essential for normal body growth and somatosensory development, as well as for the proper functioning of the adult mouse brain. Developmental delay was demonstrated for somatosensory function and for complex behavior, which involved both basal motor function and motivation. The mutant mice also had deficits in motor learning, complex coordination and learning of aversive stimuli. The mouse model contributes to the evaluation of the role of methylated CaM. CaM methylation appears to have a role in growth, muscle strength, somatosensory development and brain function. The current study has clinical implications for human patients. Patients presenting slow growth and muscle weakness that could result from a mitochondrial impairment and mental retardation should be considered for sequence analysis of the CaM KMT gene.
Transcriptomic Profiles of Brain Provide Insights into Molecular Mechanism of Feed Conversion Efficiency in Crucian Carp (Carassius auratus)

PubMed Central

Pang, Meixia; Luo, Weiwei; Yu, Xiaomu; Zhou, Ying; Tong, Jingou

2018-01-01

Feed efficiency is an economically crucial trait for cultured animals, however, progress has been scarcely made in the genetic analyses of feed conversion efficiency (FCE) in fish because of the difficulties in measurement of trait phenotypes. In the present investigation, we present the first application of RNA sequencing (RNA-Seq) combined with differentially expressed genes (DEGs) analysis for identification of functional determinants related to FCE at the gene level in an aquaculture fish, crucian carp (Carassius auratus). Brain tissues of six crucian carp with extreme FCE performances were subjected to transcriptome analysis. A total of 544,612 unigenes with a mean size of 644.38 bp were obtained from Low- and High-FCE groups, and 246 DEGs that may be involved in FCE traits were identified in these two groups. qPCR confirmed that genes previously identified as up- or down-regulated by RNA-Seq were effectively up- or down-regulated under the studied conditions. Thirteen key genes, whose functions are associated with metabolism (Dgkk, Mgst3 and Guk1b), signal transduction (Vdnccsa1b, Tgfα, Nr4a1 and Tacr2) and growth (Endog, Crebrtc2, Myh7, Myh1, Myh14 and Igfbp7) were identified according to GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) annotations. Our novel findings provide useful pathway information and candidate genes for future studies of genetic mechanisms underlying FCE in crucian carp. PMID:29538345
Genome-wide transcriptome profiling reveals novel insights into Luffa cylindrica browning.

PubMed

Chen, Xia; Tan, Taiming; Xu, Changcheng; Huang, Shuping; Tan, Jie; Zhang, Min; Wang, Chunli; Xie, Conghua

2015-08-07

Luffa cylindrica (sponge gourd) is one of the most popular vegetables in China. Production and consumption of L. cylindrica are limited due to postharvest browning; however, little is known about the genetic regulation of the browning process. In the present study, transcriptome profiles of L. cylindrica cultivars, YLB05 (browning resistant) and XTR05 (browning sensitive), were analyzed using next-generation sequencing to clarify the genes and mechanisms associated with browning. A total of 9.1 Gb of valid data including 116,703 unigenes (>200 bp) were obtained and 39,473 sequences were annotated by alignment against five public databases. Of these, there were 27,407 genes assigned to 747 Gene Ontology functional categories; and 12,350 genes were annotated with 25 Eukaryotic Orthologous Groups (KOG) categories with 343 KOG functional terms. Additionally, by searching against the Kyoto Encyclopedia of Genes and Genomes database, 8689 unigenes were mapped to 189 pathways. Furthermore, there were 24,556 sequences found to be differentially regulated, including 4344 annotated unigenes. Several genes potentially associated with phenolic oxidation, carbohydrate and hormone metabolism were found differentially regulated between the cultivars of different browning sensitivities. Our results suggest that elements involved in enzymatic processes and other pathways might be responsible for L. cylindrica browning. The present study provides a comprehensive transcriptome sequence resource, which will facilitate further studies on gene discovery and exploiting the fruit browning mechanism of L. cylindrica. Copyright © 2015 Elsevier Inc. All rights reserved.
ATP-dependent chromatin remodeling in T cells

PubMed Central

Wurster, Andrea L.; Pazin, Michael J.

2012-01-01

One of the best studied systems for mammalian chromatin remodeling is transcriptional regulation during T cell development. The variety of these studies have led to important findings in T cell gene regulation and cell fate determination. Importantly, these findings have also advanced our knowledge of the function of remodeling enzymes in mammalian gene regulation. In this review, first we briefly present biochemical/cell-free analysis of 3 types of ATP dependent remodeling enzymes (SWI/SNF, Mi2, and ISWI), to construct an intellectual framework to understand how these enzymes might be working. Second, we compare and contrast the function of these enzymes, during early (thymic) and late (peripheral) T cell development. Finally, we examine some of the gaps in our present understanding. PMID:21999456
A functional polymorphism of the MAOA gene is associated with neural responses to induced anger control.

PubMed

Denson, Thomas F; Dobson-Stone, Carol; Ronay, Richard; von Hippel, William; Schira, Mark M

2014-07-01

Aggressiveness is highly heritable. Recent experimental work has linked individual differences in a functional polymorphism of the monoamine oxidase-A gene (MAOA) to anger-driven aggression. Other work has implicated the dorsal ACC (dACC) in cognitive-emotional control and the amygdala in emotional arousal. The present imaging genetics study investigated dACC and amygdala reactivity to induced anger control as a function of MAOA genotype. A research assistant asked 38 healthy male undergraduates to control their anger in response to an insult by a rude experimenter. Men with the low-expression allele showed increased dACC and amygdala activation after the insult, but men with the high-expression allele did not. Both dACC and amygdala activation independently mediated the relationship between MAOA genotype and self-reported anger control. Moreover, following the insult, men with the high-functioning allele showed functional decoupling between the amygdala and dACC, but men with the low-functioning allele did not. These results suggest that heightened dACC and amygdala activation and their connectivity are neuroaffective mechanisms underlying anger control in participants with the low-functioning allele of the MAOA gene.
Coordinated Gene Regulation in the Initial Phase of Salt Stress Adaptation*

PubMed Central

Vanacloig-Pedros, Elena; Bets-Plasencia, Carolina; Pascual-Ahuir, Amparo; Proft, Markus

2015-01-01

Stress triggers complex transcriptional responses, which include both gene activation and repression. We used time-resolved reporter assays in living yeast cells to gain insights into the coordination of positive and negative control of gene expression upon salt stress. We found that the repression of “housekeeping” genes coincides with the transient activation of defense genes and that the timing of this expression pattern depends on the severity of the stress. Moreover, we identified mutants that caused an alteration in the kinetics of this transcriptional control. Loss of function of the vacuolar H+-ATPase (vma1) or a defect in the biosynthesis of the osmolyte glycerol (gpd1) caused a prolonged repression of housekeeping genes and a delay in gene activation at inducible loci. Both mutants have a defect in the relocation of RNA polymerase II complexes at stress defense genes. Accordingly salt-activated transcription is delayed and less efficient upon partially respiratory growth conditions in which glycerol production is significantly reduced. Furthermore, the loss of Hog1 MAP kinase function aggravates the loss of RNA polymerase II from housekeeping loci, which apparently do not accumulate at inducible genes. Additionally the Def1 RNA polymerase II degradation factor, but not a high pool of nuclear polymerase II complexes, is needed for efficient stress-induced gene activation. The data presented here indicate that the finely tuned transcriptional control upon salt stress is dependent on physiological functions of the cell, such as the intracellular ion balance, the protective accumulation of osmolyte molecules, and the RNA polymerase II turnover. PMID:25745106
Predicted Arabidopsis Interactome Resource and Gene Set Linkage Analysis: A Transcriptomic Analysis Resource.

PubMed

Yao, Heng; Wang, Xiaoxuan; Chen, Pengcheng; Hai, Ling; Jin, Kang; Yao, Lixia; Mao, Chuanzao; Chen, Xin

2018-05-01

An advanced functional understanding of omics data is important for elucidating the design logic of physiological processes in plants and effectively controlling desired traits in plants. We present the latest versions of the Predicted Arabidopsis Interactome Resource (PAIR) and of the gene set linkage analysis (GSLA) tool, which enable the interpretation of an observed transcriptomic change (differentially expressed genes [DEGs]) in Arabidopsis ( Arabidopsis thaliana ) with respect to its functional impact for biological processes. PAIR version 5.0 integrates functional association data between genes in multiple forms and infers 335,301 putative functional interactions. GSLA relies on this high-confidence inferred functional association network to expand our perception of the functional impacts of an observed transcriptomic change. GSLA then interprets the biological significance of the observed DEGs using established biological concepts (annotation terms), describing not only the DEGs themselves but also their potential functional impacts. This unique analytical capability can help researchers gain deeper insights into their experimental results and highlight prospective directions for further investigation. We demonstrate the utility of GSLA with two case studies in which GSLA uncovered how molecular events may have caused physiological changes through their collective functional influence on biological processes. Furthermore, we showed that typical annotation-enrichment tools were unable to produce similar insights to PAIR/GSLA. The PAIR version 5.0-inferred interactome and GSLA Web tool both can be accessed at http://public.synergylab.cn/pair/. © 2018 American Society of Plant Biologists. All Rights Reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Jun, Se -Ran; Hauser, Loren John; Schadt, Christopher Warren

For decades there has been increasing interest in understanding the relationships between microbial communities and ecosystem functions. Current DNA sequencing technologies allows for the exploration of microbial communities in two principle ways: targeted rRNA gene surveys and shotgun metagenomics. For large study designs, it is often still prohibitively expensive to sequence metagenomes at both the breadth and depth necessary to statistically capture the true functional diversity of a community. Although rRNA gene surveys provide no direct evidence of function, they do provide a reasonable estimation of microbial diversity, while being a very cost effective way to screen samples of interestmore » for later shotgun metagenomic analyses. However, there is a great deal of 16S rRNA gene survey data currently available from diverse environments, and thus a need for tools to infer functional composition of environmental samples based on 16S rRNA gene survey data. As a result, we present a computational method called pangenome based functional profiles (PanFP), which infers functional profiles of microbial communities from 16S rRNA gene survey data for Bacteria and Archaea. PanFP is based on pangenome reconstruction of a 16S rRNA gene operational taxonomic unit (OTU) from known genes and genomes pooled from the OTU s taxonomic lineage. From this lineage, we derive an OTU functional profile by weighting a pangenome s functional profile with the OTUs abundance observed in a given sample. We validated our method by comparing PanFP to the functional profiles obtained from the direct shotgun metagenomic measurement of 65 diverse communities via Spearman correlation coefficients. These correlations improved with increasing sequencing depth, within the range of 0.8 0.9 for the most deeply sequenced Human Microbiome Project mock community samples. PanFP is very similar in performance to another recently released tool, PICRUSt, for almost all of survey data analysed here. But, our method is unique in that any OTU building method can be used, as opposed to being limited to closed reference OTU picking strategies against specific reference sequence databases. In conclusion, we developed an automated computational method, which derives an inferred functional profile based on the 16S rRNA gene surveys of microbial communities. The inferred functional profile provides a cost effective way to study complex ecosystems through predicted comparative functional metagenomes and metadata analysis. All PanFP source code and additional documentation are freely available online at GitHub.« less
APPRIS: annotation of principal and alternative splice isoforms

PubMed Central

Rodriguez, Jose Manuel; Maietta, Paolo; Ezkurdia, Iakes; Pietrelli, Alessandro; Wesselink, Jan-Jaap; Lopez, Gonzalo; Valencia, Alfonso; Tress, Michael L.

2013-01-01

Here, we present APPRIS (http://appris.bioinfo.cnio.es), a database that houses annotations of human splice isoforms. APPRIS has been designed to provide value to manual annotations of the human genome by adding reliable protein structural and functional data and information from cross-species conservation. The visual representation of the annotations provided by APPRIS for each gene allows annotators and researchers alike to easily identify functional changes brought about by splicing events. In addition to collecting, integrating and analyzing reliable predictions of the effect of splicing events, APPRIS also selects a single reference sequence for each gene, here termed the principal isoform, based on the annotations of structure, function and conservation for each transcript. APPRIS identifies a principal isoform for 85% of the protein-coding genes in the GENCODE 7 release for ENSEMBL. Analysis of the APPRIS data shows that at least 70% of the alternative (non-principal) variants would lose important functional or structural information relative to the principal isoform. PMID:23161672
VirtualLeaf: an open-source framework for cell-based modeling of plant tissue growth and development.

PubMed

Merks, Roeland M H; Guravage, Michael; Inzé, Dirk; Beemster, Gerrit T S

2011-02-01

Plant organs, including leaves and roots, develop by means of a multilevel cross talk between gene regulation, patterned cell division and cell expansion, and tissue mechanics. The multilevel regulatory mechanisms complicate classic molecular genetics or functional genomics approaches to biological development, because these methodologies implicitly assume a direct relation between genes and traits at the level of the whole plant or organ. Instead, understanding gene function requires insight into the roles of gene products in regulatory networks, the conditions of gene expression, etc. This interplay is impossible to understand intuitively. Mathematical and computer modeling allows researchers to design new hypotheses and produce experimentally testable insights. However, the required mathematics and programming experience makes modeling poorly accessible to experimental biologists. Problem-solving environments provide biologically intuitive in silico objects ("cells", "regulation networks") required for setting up a simulation and present those to the user in terms of familiar, biological terminology. Here, we introduce the cell-based computer modeling framework VirtualLeaf for plant tissue morphogenesis. The current version defines a set of biologically intuitive C++ objects, including cells, cell walls, and diffusing and reacting chemicals, that provide useful abstractions for building biological simulations of developmental processes. We present a step-by-step introduction to building models with VirtualLeaf, providing basic example models of leaf venation and meristem development. VirtualLeaf-based models provide a means for plant researchers to analyze the function of developmental genes in the context of the biophysics of growth and patterning. VirtualLeaf is an ongoing open-source software project (http://virtualleaf.googlecode.com) that runs on Windows, Mac, and Linux.
Molecular evolution and expression profile of the chemerine encoding gene RARRES2 in baboon and chimpanzee.

PubMed

González-Alvarez, Rafael; Garza-Rodríguez, María de Lourdes; Delgado-Enciso, Iván; Treviño-Alvarado, Víctor Manuel; Canales-Del-Castillo, Ricardo; Martínez-De-Villarreal, Laura Elia; Lugo-Trampe, Ángel; Tejero, María Elizabeth; Schlabritz-Loutsevitch, Natalia E; Rocha-Pizaña, María Del Refugio; Cole, Shelley A; Reséndez-Pérez, Diana; Moises-Alvarez, Mario; Comuzzie, Anthony G; Barrera-Saldaña, Hugo Alberto; Garza-Guajardo, Raquel; Barboza-Quintana, Oralia; Rodríguez-Sánchez, Irám Pablo

2015-06-12

Chemerin, encoded by the retinoic acid receptor responder 2 (RARRES2) gene is an adipocytesecreted protein with autocrine/paracrine functions in adipose tissue, metabolism and inflammation with a recently described function in vascular tone regulation, liver, steatosis, etc. This molecule is believed to represent a critical endocrine signal linking obesity to diabetes. There are no data available regarding evolution of RARRES2 in non-human primates and great apes. Expression profile and orthology in RARRES2 genes are unknown aspects in the biology of this multigene family in primates. Thus; we attempt to describe expression profile and phylogenetic relationship as complementary knowledge in the function of this gene in primates. To do that, we performed A RT-PCR from different tissues obtained during necropsies. Also we tested the hypotheses of positive evolution, purifying selection, and neutrality. And finally a phylogenetic analysis was made between primates RARRES2 protein. RARRES2 transcripts were present in liver, lung, adipose tissue, ovary, pancreas, heart, hypothalamus and pituitary tissues. Expression in kidney and leukocytes were not detectable in either species. It was determined that the studied genes are orthologous. RARRES2 evolution fits the hypothesis of purifying selection. Expression profiles of the RARRES2 gene are similar in baboons and chimpanzees and are also phylogenetically related.
Influence of molecular weight upon mannosylated bio-synthetic hybrids for targeted antigen presenting cell gene delivery.

PubMed

Jones, Charles H; Gollakota, Akhila; Chen, Mingfu; Chung, Tai-Chun; Ravikrishnan, Anitha; Zhang, Guojian; Pfeifer, Blaine A

2015-07-01

Given the rise of antibiotic resistant microbes, genetic vaccination is a promising prophylactic strategy that enables rapid design and manufacture. Facilitating this process is the choice of vector, which is often situationally-specific and limited in engineering capacity. Furthermore, these shortcomings are usually tied to an incomplete understanding of the structure-function relationships driving vector-mediated gene delivery. Building upon our initial report of a hybrid bacterial-biomaterial gene delivery vector, a comprehensive structure-function assessment was completed using a class of mannosylated poly(beta-amino esters). Through a top-down screening methodology, an ideal polymer was selected on the basis of gene delivery efficacy and then used for the synthesis of a stratified molecular weight polymer library. By eliminating contributions of polymer chemical background, we were able to complete an in-depth assessment of gene delivery as a function of (1) polymer molecular weight, (2) relative mannose content, (3) polymer-membrane biophysical properties, (4) APC uptake specificity, and (5) serum inhibition. In summary, the flexibility and potential of the hybrid design featured in this work highlights the ability to systematically probe vector-associated properties for the development of translational gene delivery candidates. Copyright © 2015 Elsevier Ltd. All rights reserved.
Handling Gene and Protein Names in the Age of Bioinformatics: The Special Challenge of Secreted Multimodular Bacterial Enzymes such as the cbhA/cbh9A Gene of Clostridium thermocellum

DOE Office of Scientific and Technical Information (OSTI.GOV)

Brunecky, Roman; Schwarz, Wolfgang H.; Broeker, Jannis

An increasing number of researchers working in biology, biochemistry, biotechnology, bioengineering, bioinformatics and other related fields of science are using biological molecules. As the scientific background of the members of different scientific communities is more diverse than ever before, the number of scientists not familiar with the rules for non-ambiguous designation of genetic elements is increasing. However, with biological molecules gaining importance through biotechnology, their functional and unambiguous designation is vital. Unfortunately, naming genes and proteins is not an easy task. In addition, the traditional concepts of bioinformatics are challenged with the appearance of proteins comprising different modules with amore » respective function in each module. This article highlights basic rules and novel solutions in designation recently used within the community of bacterial geneticists, and we discuss the present-day handling of gene and protein designations. As an example we will utilize a recent mischaracterization of gene nomenclature. We make suggestions for better handling of names in future literature as well as in databases and annotation projects. Our methodology emphasizes the hydrolytic function of multi-modular genes and extracellular proteins from bacteria.« less

Defended to the Nines: 25 Years of Resistance Gene Cloning Identifies Nine Mechanisms for R Protein Function[OPEN

PubMed Central

2018-01-01

Plants have many, highly variable resistance (R) gene loci, which provide resistance to a variety of pathogens. The first R gene to be cloned, maize (Zea mays) Hm1, was published over 25 years ago, and since then, many different R genes have been identified and isolated. The encoded proteins have provided clues to the diverse molecular mechanisms underlying immunity. Here, we present a meta-analysis of 314 cloned R genes. The majority of R genes encode cell surface or intracellular receptors, and we distinguish nine molecular mechanisms by which R proteins can elevate or trigger disease resistance: direct (1) or indirect (2) perception of pathogen-derived molecules on the cell surface by receptor-like proteins and receptor-like kinases; direct (3) or indirect (4) intracellular detection of pathogen-derived molecules by nucleotide binding, leucine-rich repeat receptors, or detection through integrated domains (5); perception of transcription activator-like effectors through activation of executor genes (6); and active (7), passive (8), or host reprogramming-mediated (9) loss of susceptibility. Although the molecular mechanisms underlying the functions of R genes are only understood for a small proportion of known R genes, a clearer understanding of mechanisms is emerging and will be crucial for rational engineering and deployment of novel R genes. PMID:29382771
Clustering Algorithms: Their Application to Gene Expression Data

PubMed Central

Oyelade, Jelili; Isewon, Itunuoluwa; Oladipupo, Funke; Aromolaran, Olufemi; Uwoghiren, Efosa; Ameh, Faridah; Achas, Moses; Adebiyi, Ezekiel

2016-01-01

Gene expression data hide vital information required to understand the biological process that takes place in a particular organism in relation to its environment. Deciphering the hidden patterns in gene expression data proffers a prodigious preference to strengthen the understanding of functional genomics. The complexity of biological networks and the volume of genes present increase the challenges of comprehending and interpretation of the resulting mass of data, which consists of millions of measurements; these data also inhibit vagueness, imprecision, and noise. Therefore, the use of clustering techniques is a first step toward addressing these challenges, which is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. The clustering of gene expression data has been proven to be useful in making known the natural structure inherent in gene expression data, understanding gene functions, cellular processes, and subtypes of cells, mining useful information from noisy data, and understanding gene regulation. The other benefit of clustering gene expression data is the identification of homology, which is very important in vaccine design. This review examines the various clustering algorithms applicable to the gene expression data in order to discover and provide useful knowledge of the appropriate clustering technique that will guarantee stability and high degree of accuracy in its analysis procedure. PMID:27932867
Hox gene duplications correlate with posterior heteronomy in scorpions

PubMed Central

Sharma, Prashant P.; Schwager, Evelyn E.; Extavour, Cassandra G.; Wheeler, Ward C.

2014-01-01

The evolutionary success of the largest animal phylum, Arthropoda, has been attributed to tagmatization, the coordinated evolution of adjacent metameres to form morphologically and functionally distinct segmental regions called tagmata. Specification of regional identity is regulated by the Hox genes, of which 10 are inferred to be present in the ancestor of arthropods. With six different posterior segmental identities divided into two tagmata, the bauplan of scorpions is the most heteronomous within Chelicerata. Expression domains of the anterior eight Hox genes are conserved in previously surveyed chelicerates, but it is unknown how Hox genes regionalize the three tagmata of scorpions. Here, we show that the scorpion Centruroides sculpturatus has two paralogues of all Hox genes except Hox3, suggesting cluster and/or whole genome duplication in this arachnid order. Embryonic anterior expression domain boundaries of each of the last four pairs of Hox genes (two paralogues each of Antp, Ubx, abd-A and Abd-B) are unique and distinguish segmental groups, such as pectines, book lungs and the characteristic tail, while maintaining spatial collinearity. These distinct expression domains suggest neofunctionalization of Hox gene paralogues subsequent to duplication. Our data reconcile previous understanding of Hox gene function across arthropods with the extreme heteronomy of scorpions. PMID:25122224
Navigating the complex path between the oxytocin receptor gene (OXTR) and cooperation: an endophenotype approach.

PubMed

Haas, Brian W; Anderson, Ian W; Smith, Jessica M

2013-11-28

Although cooperation represents a core facet of human social behavior there exists considerable variability across people in terms of the tendency to cooperate. One factor that may contribute to individual differences in cooperation is a key gene within the oxytocin (OT) system, the OT reception gene (OXTR). In this article, we aim to bridge the gap between the OXTR gene and cooperation by using an endophenotype approach. We present evidence that the association between the OXTR gene and cooperation may in part be due to how the OXTR gene affects brain systems involved in emotion recognition, empathy/theory of mind, social communication and social reward seeking. There is evidence that the OXTR gene is associated with the functional anatomy of the amygdala, visual cortex (VC), anterior cingulate and superior temporal gyrus (STG). However, it is currently unknown how the OXTR gene may be linked to the functional anatomy of other relevant brain regions that include the fusiform gyrus (FG), superior temporal sulcus (STS), ventromedial prefrontal cortex (VMPFC), temporoparietal junction (TPJ) and nucleus accumbens (NAcc). We conclude by highlighting potential future research directions that may elucidate the path between OXTR and complex behaviors such as cooperation.
Navigating the complex path between the oxytocin receptor gene (OXTR) and cooperation: an endophenotype approach

PubMed Central

Haas, Brian W.; Anderson, Ian W.; Smith, Jessica M.

2013-01-01

Although cooperation represents a core facet of human social behavior there exists considerable variability across people in terms of the tendency to cooperate. One factor that may contribute to individual differences in cooperation is a key gene within the oxytocin (OT) system, the OT reception gene (OXTR). In this article, we aim to bridge the gap between the OXTR gene and cooperation by using an endophenotype approach. We present evidence that the association between the OXTR gene and cooperation may in part be due to how the OXTR gene affects brain systems involved in emotion recognition, empathy/theory of mind, social communication and social reward seeking. There is evidence that the OXTR gene is associated with the functional anatomy of the amygdala, visual cortex (VC), anterior cingulate and superior temporal gyrus (STG). However, it is currently unknown how the OXTR gene may be linked to the functional anatomy of other relevant brain regions that include the fusiform gyrus (FG), superior temporal sulcus (STS), ventromedial prefrontal cortex (VMPFC), temporoparietal junction (TPJ) and nucleus accumbens (NAcc). We conclude by highlighting potential future research directions that may elucidate the path between OXTR and complex behaviors such as cooperation. PMID:24348360
Role of miRNAs in CD4 T cell plasticity during inflammation and tolerance

PubMed Central

Sethi, Apoorva; Kulkarni, Neeraja; Sonar, Sandip; Lal, Girdhari

2013-01-01

Gene expression is tightly regulated in a tuneable, cell-specific and time-dependent manner. Recent advancement in epigenetics and non-coding RNA (ncRNA) revolutionized the concept of gene regulation. In order to regulate the transcription, ncRNA can promptly response to the extracellular signals as compared to transcription factors present in the cells. microRNAs (miRNAs) are ncRNA (~22 bp) encoded in the genome, and present as intergenic or oriented antisense to neighboring genes. The strategic location of miRNA in coding genes helps in the coupled regulation of its expression with host genes. miRNA together with complex machinery called RNA-induced silencing complex (RISC) interacts with target mRNA and degrade the mRNA or inhibits the translation. CD4 T cells play an important role in the generation and maintenance of inflammation and tolerance. Cytokines and chemokines present in the inflamed microenvironment controls the differentiation and function of various subsets of CD4 T cells [Th1, Th2, Th17, and regulatory CD4 T cells (Tregs)]. Recent studies suggest that miRNAs play an important role in the development and function of all subsets of CD4 T cells. In current review, we focused on how various miRNAs are regulated by cell's extrinsic and intrinsic signaling, and how miRNAs affect the transdifferentiation of subsets of CD4 T cell and controls their plasticity during inflammation and tolerance. PMID:23386861
Broad Phylogenetic Occurrence of the Oxygen-Binding Hemerythrins in Bilaterians

PubMed Central

Schrago, Carlos G.; Halanych, Kenneth M.

2017-01-01

Abstract Animal tissues need to be properly oxygenated for carrying out catabolic respiration and, as such, natural selection has presumably favored special molecules that can reversibly bind and transport oxygen. Hemoglobins, hemocyanins, and hemerythrins (Hrs) fulfill this role, with Hrs being the least studied. Knowledge of oxygen-binding proteins is crucial for understanding animal physiology. Hr genes are present in the three domains of life, Archaea, Bacteria, and Eukaryota; however, within Animalia, Hrs has been reported only in marine species in six phyla (Annelida, Brachiopoda, Priapulida, Bryozoa, Cnidaria, and Arthropoda). Given this observed Hr distribution, whether all metazoan Hrs share a common origin is circumspect. We investigated Hr diversity and evolution in metazoans, by employing in silico approaches to survey for Hrs from of 120 metazoan transcriptomes and genomes. We found 58 candidate Hr genes actively transcribed in 36 species distributed in 11 animal phyla, with new records in Echinodermata, Hemichordata, Mollusca, Nemertea, Phoronida, and Platyhelminthes. Moreover, we found that “Hrs” reported from Cnidaria and Arthropoda were not consistent with that of other metazoan Hrs. Contrary to previous suggestions that Hr genes were absent in deuterostomes, we find Hr genes present in deuterostomes and were likely present in early bilaterians, but not in nonbilaterian animal lineages. As expected, the Hr gene tree did not mirror metazoan phylogeny, suggesting that Hrs evolutionary history was complex and besides the oxygen carrying capacity, the drivers of Hr evolution may also consist of secondary functional specializations of the proteins, like immunological functions. PMID:29016798
Autoinducer-2 Plays a Crucial Role in Gut Colonization and Probiotic Functionality of Bifidobacterium breve UCC2003

PubMed Central

Bottacini, Francesca; Lanigan, Noreen; Casey, Pat G.; Huys, Geert; Nelis, Hans J.; van Sinderen, Douwe; Coenye, Tom

2014-01-01

In the present study we show that luxS of Bifidobacterium breve UCC2003 is involved in the production of the interspecies signaling molecule autoinducer-2 (AI-2), and that this gene is essential for gastrointestinal colonization of a murine host, while it is also involved in providing protection against Salmonella infection in Caenorhabditis elegans. We demonstrate that a B. breve luxS-insertion mutant is significantly more susceptible to iron chelators than the WT strain and that this sensitivity can be partially reverted in the presence of the AI-2 precursor DPD. Furthermore, we show that several genes of an iron starvation-induced gene cluster, which are downregulated in the luxS-insertion mutant and which encodes a presumed iron-uptake system, are transcriptionally upregulated under in vivo conditions. Mutation of two genes of this cluster in B. breve UCC2003 renders the derived mutant strains sensitive to iron chelators while deficient in their ability to confer gut pathogen protection to Salmonella-infected nematodes. Since a functional luxS gene is present in all tested members of the genus Bifidobacterium, we conclude that bifidobacteria operate a LuxS-mediated system for gut colonization and pathogen protection that is correlated with iron acquisition. PMID:24871429
Autoinducer-2 plays a crucial role in gut colonization and probiotic functionality of Bifidobacterium breve UCC2003.

PubMed

Christiaen, Steven E A; O'Connell Motherway, Mary; Bottacini, Francesca; Lanigan, Noreen; Casey, Pat G; Huys, Geert; Nelis, Hans J; van Sinderen, Douwe; Coenye, Tom

2014-01-01

In the present study we show that luxS of Bifidobacterium breve UCC2003 is involved in the production of the interspecies signaling molecule autoinducer-2 (AI-2), and that this gene is essential for gastrointestinal colonization of a murine host, while it is also involved in providing protection against Salmonella infection in Caenorhabditis elegans. We demonstrate that a B. breve luxS-insertion mutant is significantly more susceptible to iron chelators than the WT strain and that this sensitivity can be partially reverted in the presence of the AI-2 precursor DPD. Furthermore, we show that several genes of an iron starvation-induced gene cluster, which are downregulated in the luxS-insertion mutant and which encodes a presumed iron-uptake system, are transcriptionally upregulated under in vivo conditions. Mutation of two genes of this cluster in B. breve UCC2003 renders the derived mutant strains sensitive to iron chelators while deficient in their ability to confer gut pathogen protection to Salmonella-infected nematodes. Since a functional luxS gene is present in all tested members of the genus Bifidobacterium, we conclude that bifidobacteria operate a LuxS-mediated system for gut colonization and pathogen protection that is correlated with iron acquisition.
Curd development associated gene (CDAG1) in cauliflower (Brassica oleracea L. var. botrytis) could result in enlarged organ size and increased biomass.

PubMed

Li, Hui; Liu, Qian; Zhang, Qingli; Qin, Erjun; Jin, Chuan; Wang, Yu; Wu, Mei; Shen, Guangshuang; Chen, Chengbin; Song, Wenqin; Wang, Chunguo

2017-01-01

The curd is a specialized organ and the most important product organ of cauliflower (Brassica oleracea L. var. botrytis). However, the mechanism underlying the regulation of curd formation and development remains largely unknown. In the present study, a novel homologous gene containing the Organ Size Related (OSR) domain, namely, CDAG1 (Curd Development Associated Gene 1) was identified in cauliflower. Quantitative analysis indicated that CDAG1 showed significantly higher transcript levels in young tissues. Functional analysis demonstrated that the ectopic overexpression of CDAG1 in Arabidopsis and cauliflower could significantly promote organ growth and result in larger organ size and increased biomass. Organ enlargement was predominantly due to increased cell number. In addition, 228 genes involved in the CDAG1-mediated regulatory network were discovered by transcriptome analysis. Among these genes, CDAG1 was confirmed to inhibit the transcriptional expression of the endogenous OSR genes, ARGOS and ARL, while a series of ethylene-responsive transcription factors (ERFs) were found to increased expression in 35S:CDAG1 transgenic Arabidopsis plants. This implies that CDAG1 may function in the ethylene-mediated signal pathway. These findings provide new insight into the function of OSR genes, and suggest potential applications of CDAG1 in breeding high-yielding crops. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Cloning and characterization of the ONAC106 gene from Oryza sativa cultivar Kuku Belang

NASA Astrophysics Data System (ADS)

Basri, Khairunnisa; Sukiran, Noor Liyana; Zainal, Zamri

2016-11-01

Plants possess different mechanisms in stress response, where induction of stress-responsive genes provides tolerance to unfavorable conditions. Stress-responsive genes are characterized for functional and regulatory genes that help in overcoming stress by molecular, biochemical and morphological adaptations. NAC transcription factors are one of the regulatory proteins that involved in stress signaling pathway. A putative NAC transcription factor, ONAC016 was identified from drought transcriptomic data. Our data suggested that ONAC106 was induced by drought, but its function in abiotic stress is still unclear. In silico analysis of ONAC106 showed that this gene encodes 334 amino acids, and its protein consists of NAM (No Apical Meristem) domain. The orthologue of ONAC106 was present in several Poaceae family members, suggesting that ONAC106 is unique to monocot plants only. We found that ONAC106 was induced by salt and cold stresses, indicating that this gene involves in abiotic stress response. In addition, we also found that ONAC106 might function in defense response to pathogen invasion. The ABRE (Abscisic Acid Regulatory Element) cis-element was identified in the promoter region of ONAC106, suggesting that it may involve in the abscisic acid (ABA)-dependent signaling pathway. Based on this preliminary result, we hypothesize that ONAC106 may play a role in abiotic stress response by regulating ABA-responsive genes.
Understanding Transcription Factor Regulation by Integrating Gene Expression and DNase I Hypersensitive Sites.

PubMed

Wang, Guohua; Wang, Fang; Huang, Qian; Li, Yu; Liu, Yunlong; Wang, Yadong

2015-01-01

Transcription factors are proteins that bind to DNA sequences to regulate gene transcription. The transcription factor binding sites are short DNA sequences (5-20 bp long) specifically bound by one or more transcription factors. The identification of transcription factor binding sites and prediction of their function continue to be challenging problems in computational biology. In this study, by integrating the DNase I hypersensitive sites with known position weight matrices in the TRANSFAC database, the transcription factor binding sites in gene regulatory region are identified. Based on the global gene expression patterns in cervical cancer HeLaS3 cell and HelaS3-ifnα4h cell (interferon treatment on HeLaS3 cell for 4 hours), we present a model-based computational approach to predict a set of transcription factors that potentially cause such differential gene expression. Significantly, 6 out 10 predicted functional factors, including IRF, IRF-2, IRF-9, IRF-1 and IRF-3, ICSBP, belong to interferon regulatory factor family and upregulate the gene expression levels responding to the interferon treatment. Another factor, ISGF-3, is also a transcriptional activator induced by interferon alpha. Using the different transcription factor binding sites selected criteria, the prediction result of our model is consistent. Our model demonstrated the potential to computationally identify the functional transcription factors in gene regulation.
Complexity and specificity of the maize (Zea mays L.) root hair transcriptome.

PubMed

Hey, Stefan; Baldauf, Jutta; Opitz, Nina; Lithio, Andrew; Pasha, Asher; Provart, Nicholas; Nettleton, Dan; Hochholdinger, Frank

2017-04-01

Root hairs are tubular extensions of epidermis cells. Transcriptome profiling demonstrated that the single cell-type root hair transcriptome was less complex than the transcriptome of multiple cell-type primary roots without root hairs. In total, 831 genes were exclusively and 5585 genes were preferentially expressed in root hairs [false discovery rate (FDR) ≤1%]. Among those, the most significantly enriched Gene Ontology (GO) functional terms were related to energy metabolism, highlighting the high energy demand for the development and function of root hairs. Subsequently, the maize homologs for 138 Arabidopsis genes known to be involved in root hair development were identified and their phylogenetic relationship and expression in root hairs were determined. This study indicated that the genetic regulation of root hair development in Arabidopsis and maize is controlled by common genes, but also shows differences which need to be dissected in future genetic experiments. Finally, a maize root view of the eFP browser was implemented including the root hair transcriptome of the present study and several previously published maize root transcriptome data sets. The eFP browser provides color-coded expression levels for these root types and tissues for any gene of interest, thus providing a novel resource to study gene expression and function in maize roots. © The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology.
[Cloning, mutagenesis and symbiotic phenotype of three lipid transfer protein encoding genes from Mesorhizobium huakuii 7653R].

PubMed

Li, Yanan; Zeng, Xiaobo; Zhou, Xuejuan; Li, Youguo

2016-12-04

Lipid transfer protein superfamily is involved in lipid transport and metabolism. This study aimed to construct mutants of three lipid transfer protein encoding genes in Mesorhizobium huakuii 7653R, and to study the phenotypes and function of mutations during symbiosis with Astragalus sinicus. We used bioinformatics to predict structure characteristics and biological functions of lipid transfer proteins, and conducted semi-quantitative and fluorescent quantitative real-time PCR to analyze the expression levels of target genes in free-living and symbiotic conditions. Using pK19mob insertion mutagenesis to construct mutants, we carried out pot plant experiments to observe symbiotic phenotypes. MCHK-5577, MCHK-2172 and MCHK-2779 genes encoding proteins belonged to START/RHO alpha_C/PITP/Bet_v1/CoxG/CalC (SRPBCC) superfamily, involved in lipid transport or metabolism, and were identical to M. loti at 95% level. Gene relative transcription level of the three genes all increased compared to free-living condition. We obtained three mutants. Compared with wild-type 7653R, above-ground biomass of plants and nodulenitrogenase activity induced by the three mutants significantly decreased. Results indicated that lipid transfer protein encoding genes of Mesorhizobium huakuii 7653R may play important roles in symbiotic nitrogen fixation, and the mutations significantly affected the symbiotic phenotypes. The present work provided a basis to study further symbiotic function mechanism associated with lipid transfer proteins from rhizobia.
Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes.

PubMed

Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C

2009-02-01

Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.
A comprehensive insight into functional profiles of free-living microbial community responses to a toxic Akashiwo sanguinea bloom

NASA Astrophysics Data System (ADS)

Yang, Caiyun; Li, Yi; Zhou, Yanyan; Lei, Xueqian; Zheng, Wei; Tian, Yun; van Nostrand, Joy D.; He, Zhili; Wu, Liyou; Zhou, Jizhong; Zheng, Tianling

2016-10-01

Phytoplankton blooms are a worldwide problem and can greatly affect ecological processes in aquatic systems, but its impacts on the functional potential of microbial communities are limited. In this study, a high-throughput microarray-based technology (GeoChip) was used to profile the functional potential of free-living microbes from the Xiamen Sea Area in response to a 2011 Akashiwo sanguinea bloom. The bloom altered the overall community functional structure. Genes that were significantly (p < 0.05) increased during the bloom included carbon degradation genes and genes involved in nitrogen (N) and/or phosphorus (P) limitation stress. Such significantly changed genes were well explained by chosen environmental factors (COD, nitrite-N, nitrate-N, dissolved inorganic phosphorus, chlorophyll-a and algal density). Overall results suggested that this bloom might enhance the microbial converting of nitrate to N2 and ammonia nitrogen, decrease P removal from seawater, activate the glyoxylate cycle, and reduce infection activity of bacteriophage. This study presents new information on the relationship of algae to other microbes in aquatic systems, and provides new insights into our understanding of ecological impacts of phytoplankton blooms.
Transrepressive function of TLX requires the histone demethylase LSD1.

PubMed

Yokoyama, Atsushi; Takezawa, Shinichiro; Schüle, Roland; Kitagawa, Hirochika; Kato, Shigeaki

2008-06-01

TLX is an orphan nuclear receptor (also called NR2E1) that regulates the expression of target genes by functioning as a constitutive transrepressor. The physiological significance of TLX in the cytodifferentiation of neural cells in the brain is known. However, the corepressors supporting the transrepressive function of TLX have yet to be identified. In this report, Y79 retinoblastoma cells were subjected to biochemical techniques to purify proteins that interact with TLX, and we identified LSD1 (also called KDM1), which appears to form a complex with CoREST and histone deacetylase 1. LSD1 interacted with TLX directly through its SWIRM and amine oxidase domains. LSD1 potentiated the transrepressive function of TLX through its histone demethylase activity as determined by a luciferase assay using a genomically integrated reporter gene. LSD1 and TLX were recruited to a TLX-binding site in the PTEN gene promoter, accompanied by the demethylation of H3K4me2 and deacetylation of H3. Knockdown of either TLX or LSD1 derepressed expression of the endogenous PTEN gene and inhibited cell proliferation of Y79 cells. Thus, the present study suggests that LSD1 is a prime corepressor for TLX.
Multi-tissue transcriptomics for construction of a comprehensive gene resource for the terrestrial snail Theba pisana.

PubMed

Zhao, M; Wang, T; Adamson, K J; Storey, K B; Cummins, S F

2016-02-08

The land snail Theba pisana is native to the Mediterranean region but has become one of the most abundant invasive species worldwide. Here, we present three transcriptomes of this agriculture pest derived from three tissues: the central nervous system, hepatopancreas (digestive gland), and foot muscle. Sequencing of the three tissues produced 339,479,092 high quality reads and a global de novo assembly generated a total of 250,848 unique transcripts (unigenes). BLAST analysis mapped 52,590 unigenes to NCBI non-redundant protein databases and further functional analysis annotated 21,849 unigenes with gene ontology. We report that T. pisana transcripts have representatives in all functional classes and a comparison of differentially expressed transcripts amongst all three tissues demonstrates enormous differences in their potential metabolic activities. The genes differentially expressed include those with sequence similarity to those genes associated with multiple bacterial diseases and neurological diseases. To provide a valuable resource that will assist functional genomics study, we have implemented a user-friendly web interface, ThebaDB (http://thebadb.bioinfo-minzhao.org/). This online database allows for complex text queries, sequence searches, and data browsing by enriched functional terms and KEGG mapping.
Molecular Properties and Functional Divergence of the Dehydroascorbate Reductase Gene Family in Lower and Higher Plants.

PubMed

Zhang, Yuan-Jie; Wang, Wei; Yang, Hai-Ling; Li, Yue; Kang, Xiang-Yang; Wang, Xiao-Ru; Yang, Zhi-Ling

2015-01-01

Dehydroascorbate reductase (DHAR), which reduces oxidized ascorbate, is important for maintaining an appropriate ascorbate redox state in plant cells. To date, genome-wide molecular characterization of DHARs has only been conducted in bryophytes (Physcomitrella patens) and eudicots (e.g. Arabidopsis thaliana). In this study, to gain a general understanding of the molecular properties and functional divergence of the DHARs in land plants, we further conducted a comprehensive analysis of DHARs from the lycophyte Selaginella moellendorffii, gymnosperm Picea abies and monocot Zea mays. DHARs were present as a small gene family in all of the land plants we examined, with gene numbers ranging from two to four. All the plants contained cytosolic and chloroplastic DHARs, indicating dehydroascorbate (DHA) can be directly reduced in the cytoplasm and chloroplast by DHARs in all the plants. A novel vacuolar DHAR was found in Z. mays, indicating DHA may also be reduced in the vacuole by DHARs in Z. mays. The DHARs within each species showed extensive functional divergence in their gene structures, subcellular localizations, and enzymatic characteristics. This study provides new insights into the molecular characteristics and functional divergence of DHARs in land plants.
Divergent functional isoforms drive niche specialisation for nutrient acquisition and use in rumen microbiome.

PubMed

Rubino, Francesco; Carberry, Ciara; M Waters, Sinéad; Kenny, David; McCabe, Matthew S; Creevey, Christopher J

2017-04-01

Many microbes in complex competitive environments share genes for acquiring and utilising nutrients, questioning whether niche specialisation exists and if so, how it is maintained. We investigated the genomic signatures of niche specialisation in the rumen microbiome, a highly competitive, anaerobic environment, with limited nutrient availability determined by the biomass consumed by the host. We generated individual metagenomic libraries from 14 cows fed an ad libitum diet of grass silage and calculated functional isoform diversity for each microbial gene identified. The animal replicates were used to calculate confidence intervals to test for differences in diversity of functional isoforms between microbes that may drive niche specialisation. We identified 153 genes with significant differences in functional isoform diversity between the two most abundant bacterial genera in the rumen (Prevotella and Clostridium). We found Prevotella possesses a more diverse range of isoforms capable of degrading hemicellulose, whereas Clostridium for cellulose. Furthermore, significant differences were observed in key metabolic processes indicating that isoform diversity plays an important role in maintaining their niche specialisation. The methods presented represent a novel approach for untangling complex interactions between microorganisms in natural environments and have resulted in an expanded catalogue of gene targets central to rumen cellulosic biomass degradation.

Aging is Associated with Impaired Renal Function, INF-gamma Induced Inflammation and with Alterations in Iron Regulatory Proteins Gene Expression.

PubMed

Costa, Elísio; Fernandes, João; Ribeiro, Sandra; Sereno, José; Garrido, Patrícia; Rocha-Pereira, Petronila; Coimbra, Susana; Catarino, Cristina; Belo, Luís; Bronze-da-Rocha, Elsa; Vala, Helena; Alves, Rui; Reis, Flávio; Santos-Silva, Alice

2014-12-01

Our aim was to contribute to a better understanding of the pathophysiology of anemia in elderly, by studying how aging affects renal function, iron metabolism, erythropoiesis and the inflammatory response, using an experimental animal model. The study was performed in male Wistar, a group of young rats with 2 months age and an old one with 18 months age. Old rats presented a significant higher urea, creatinine, interferon (INF)-gamma, ferritin and soluble transferrin receptor serum levels, as well as increased counts of reticulocytes and RDW. In addition, these rats showed significant lower erythropoietin (EPO) and iron serum levels. Concerning gene expression of iron regulatory proteins, old rats presented significantly higher mRNA levels of hepcidin (Hamp), transferrin (TF), transferrin receptor 2 (TfR2) and hemojuvelin (HJV); divalent metal transporter 1 (DMT1) mRNA levels were significantly higher in duodenal tissue; EPO gene expression was significantly higher in liver and lower in kidney, and the expression of the EPOR was significantly higher in both liver and kidney. Our results showed that aging is associated with impaired renal function, which could be in turn related with the inflammatory process and with a decline in EPO renal production. Moreover, we also propose that aging may be associated with INF-gamma-induced inflammation and with alterations upon iron regulatory proteins gene expression.
Evidence for Moonlighting Functions of the θ Subunit of Escherichia coli DNA Polymerase III

PubMed Central

Dietrich, M.; Pedró, L.; García, J.; Pons, M.; Hüttener, M.; Paytubi, S.; Madrid, C.

2014-01-01

The holE gene is an enterobacterial ORFan gene (open reading frame [ORF] with no detectable homology to other ORFs in a database). It encodes the θ subunit of the DNA polymerase III core complex. The precise function of the θ subunit within this complex is not well established, and loss of holE does not result in a noticeable phenotype. Paralogs of holE are also present on many conjugative plasmids and on phage P1 (hot gene). In this study, we provide evidence indicating that θ (HolE) exhibits structural and functional similarities to a family of nucleoid-associated regulatory proteins, the Hha/YdgT-like proteins that are also encoded by enterobacterial ORFan genes. Microarray studies comparing the transcriptional profiles of Escherichia coli holE, hha, and ydgT mutants revealed highly similar expression patterns for strains harboring holE and ydgT alleles. Among the genes differentially regulated in both mutants were genes of the tryptophanase (tna) operon. The tna operon consists of a transcribed leader region, tnaL, and two structural genes, tnaA and tnaB. Further experiments with transcriptional lacZ fusions (tnaL::lacZ and tnaA::lacZ) indicate that HolE and YdgT downregulate expression of the tna operon by possibly increasing the level of Rho-dependent transcription termination at the tna operon's leader region. Thus, for the first time, a regulatory function can be attributed to HolE, in addition to its role as structural component of the DNA polymerase III complex. PMID:24375106
Gene expression during skeletal development in three osteopetrotic rat mutations. Evidence for osteoblast abnormalities.

PubMed

Shalhoub, V; Jackson, M E; Lian, J B; Stein, G S; Marks, S C

1991-05-25

Osteopetrosis is a group of metabolic bone diseases characterized by reductions in osteoclast development and/or function. These aspects of osteoclast biology are known to be influenced by osteoblasts and their products. To ascertain whether osteoblast dysfunction contributes to aberrations in the structural and functional properties of osteoclasts in osteopetrosis, we systematically examined gene expression as reflected by mRNA levels for a series of cell growth- and tissue-related genes associated with the osteoblast phenotype during skeletal development in normal and mutant rats of three different osteopetrotic stocks. We show that the methods used permit the reproducible isolation of undegraded total cellular RNA from bone and that mRNA levels can be reliably quantitated in these preparations. Each osteopetrotic mutation exhibits a distinct aberrant pattern of osteoblast gene expression that may be correlated with and explain some abnormalities in extracellular matrix composition, mineralization, osteoclast development, and effects of elevated serum levels of 1 alpha,25-dihydroxyvitamin D3, depending upon the mutation. Normal rats show minor variations in gene expression that reflect the genetic background (stock). This, the first comprehensive molecular analysis of osteoblast gene expression in osteopetrosis, suggests that some osteopetroses, particularly in the toothless rat, are associated with and potentially related to mechanisms associated with aberrations in osteoblast function. More generally, the present studies demonstrate alterations in gene expression as reflected by mRNA levels that are associated with functional properties of the osteoblast, particularly those contributing to the recruitment and/or differentiation of osteoclasts, thereby influencing skeletal modeling.
Identification, Nomenclature, and Evolutionary Relationships of Mitogen-Activated Protein Kinase (MAPK) Genes in Soybean

PubMed Central

Neupane, Achal; Nepal, Madhav P.; Piya, Sarbottam; Subramanian, Senthil; Rohila, Jai S.; Reese, R. Neil; Benson, Benjamin V.

2013-01-01

Mitogen-activated protein kinase (MAPK) genes in eukaryotes regulate various developmental and physiological processes including those associated with biotic and abiotic stresses. Although MAPKs in some plant species including Arabidopsis have been identified, they are yet to be identified in soybean. Major objectives of this study were to identify GmMAPKs, assess their evolutionary relationships, and analyze their functional divergence. We identified a total of 38 MAPKs, eleven MAPKKs, and 150 MAPKKKs in soybean. Within the GmMAPK family, we also identified a new clade of six genes: four genes with TEY and two genes with TQY motifs requiring further investigation into possible legume-specific functions. The results indicated the expansion of the GmMAPK families attributable to the ancestral polyploidy events followed by chromosomal rearrangements. The GmMAPK and GmMAPKKK families were substantially larger than those in other plant species. The duplicated GmMAPK members presented complex evolutionary relationships and functional divergence when compared to their counterparts in Arabidopsis. We also highlighted existing nomenclatural issues, stressing the need for nomenclatural consistency. GmMAPK identification is vital to soybean crop improvement, and novel insights into the evolutionary relationships will enhance our understanding about plant genome evolution. PMID:24137047
Towards the elements of successful insect RNAi.

PubMed

Scott, Jeffrey G; Michel, Kristin; Bartholomay, Lyric C; Siegfried, Blair D; Hunter, Wayne B; Smagghe, Guy; Zhu, Kun Yan; Douglas, Angela E

2013-12-01

RNA interference (RNAi), the sequence-specific suppression of gene expression, offers great opportunities for insect science, especially to analyze gene function, manage pest populations, and reduce disease pathogens. The accumulating body of literature on insect RNAi has revealed that the efficiency of RNAi varies between different species, the mode of RNAi delivery, and the genes being targeted. There is also variation in the duration of transcript suppression. At present, we have a limited capacity to predict the ideal experimental strategy for RNAi of a particular gene/insect because of our incomplete understanding of whether and how the RNAi signal is amplified and spread among insect cells. Consequently, development of the optimal RNAi protocols is a highly empirical process. This limitation can be relieved by systematic analysis of the molecular physiological basis of RNAi mechanisms in insects. An enhanced conceptual understanding of RNAi function in insects will facilitate the application of RNAi for dissection of gene function, and to fast-track the application of RNAi to both control pests and develop effective methods to protect beneficial insects and non-insect arthropods, particularly the honey bee (Apis mellifera) and cultured Pacific white shrimp (Litopenaeus vannamei) from viral and parasitic diseases. Copyright © 2013 Elsevier Ltd. All rights reserved.
Phylogenetic analysis reveals conservation and diversification of micro RNA166 genes among diverse plant species.

PubMed

Barik, Suvakanta; SarkarDas, Shabari; Singh, Archita; Gautam, Vibhav; Kumar, Pramod; Majee, Manoj; Sarkar, Ananda K

2014-01-01

Similar to the majority of the microRNAs, mature miR166s are derived from multiple members of MIR166 genes (precursors) and regulate various aspects of plant development by negatively regulating their target genes (Class III HD-ZIP). The evolutionary conservation or functional diversification of miRNA166 family members remains elusive. Here, we show the phylogenetic relationships among MIR166 precursor and mature sequences from three diverse model plant species. Despite strong conservation, some mature miR166 sequences, such as ppt-miR166m, have undergone sequence variation. Critical sequence variation in ppt-miR166m has led to functional diversification, as it targets non-HD-ZIPIII gene transcript (s). MIR166 precursor sequences have diverged in a lineage specific manner, and both precursors and mature osa-miR166i/j are highly conserved. Interestingly, polycistronic MIR166s were present in Physcomitrella and Oryza but not in Arabidopsis. The nature of cis-regulatory motifs on the upstream promoter sequences of MIR166 genes indicates their possible contribution to the functional variation observed among miR166 species. Copyright © 2013 Elsevier Inc. All rights reserved.
Characterization of the flgG operon of Rhodobacter sphaeroides WS8 and its role in flagellum biosynthesis.

PubMed

González-Pedrajo, Bertha; de la Mora, Javier; Ballado, Teresa; Camarena, Laura; Dreyfus, Georges

2002-11-13

In this work, we show evidence regarding the functionality of a large cluster of flagellar genes in Rhodobacter sphaeroides. The genes of this cluster, flgGHIJKL and orf-1, are mainly involved in the formation of the basal body, and flgK and flgL encode the hook-associated proteins HAP1 and HAP3. In general, these genes showed a good similarity as compared with those reported for Salmonella enterica. However, flgJ and flgK showed particular features that make them unique among the flagellar sequences already reported. flgJ is only a third of the size reported for flgJ from Salmonella; whereas flgK is about three times larger than any other flgK sequence previously known. Our results indicate that both genes are functional, and their products are essential for flagellar assembly. In contrast, the interruption of orf-1, did not affect motility suggesting that this sequence, if functional, is not indispensable for flagellar assembly. Finally, we present genetic evidence suggesting that the flgGHIJKL genes are expressed as a single transcriptional unit depending on the sigma-54 factor.
Gene expression during different periods of the handling-stress response in Pampus argenteus

NASA Astrophysics Data System (ADS)

Sun, Peng; Tang, Baojun; Yin, Fei

2017-11-01

Common aquaculture practices subject fish to a variety of acute and chronic stressors. Such stressors are inherent in aquaculture production but can adversely affect survival, growth, immune response, reproductive capacity, and behavior. Understanding the biological mechanisms underlying stress responses helps with methods to alleviate the negative effects through better aquaculture practices, resulting in improved animal welfare and production efficiency. In the present study, transcriptome sequencing of liver and kidney was performed in silver pomfret (Pampus argenteus) subjected to handling stress versus controls. A total of 162.19 million clean reads were assembled to 30 339 unigenes. The quality of the assembly was high, with an N50 length of 2 472 bases. For function classification and pathway assignment, the unigenes were categorized into three GO (gene ontology) categories, twenty-six clusters of eggNOG (evolutionary genealogy of genes: non-supervised orthologous groups) function categories, and thirty-eight KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways. Stress affected different functional groups of genes in the tissues studied. Differentially expressed genes were mainly involved in metabolic pathways (carbohydrate metabolism, lipid metabolism, amino-acid metabolism, uptake of cofactors and vitamins, and biosynthesis of other secondary metabolites), environmental information processing (signaling molecules and their interactions), organismal systems (endocrine system, digestive system), and disease (immune, neurodegenerative, endocrine and metabolic diseases). This is the first reported analysis of genome-wide transcriptome in P. argenteus, and the findings expand our understanding of the silver pomfret genome and gene expression in association with stress. The results will be useful to future analyses of functional genes and studies of healthy artificial breeding in P. argenteus and other related fish species.
The Chlamydomonas genome project: a decade on.

PubMed

Blaby, Ian K; Blaby-Haas, Crysten E; Tourasse, Nicolas; Hom, Erik F Y; Lopez, David; Aksoy, Munevver; Grossman, Arthur; Umen, James; Dutcher, Susan; Porter, Mary; King, Stephen; Witman, George B; Stanke, Mario; Harris, Elizabeth H; Goodstein, David; Grimwood, Jane; Schmutz, Jeremy; Vallon, Olivier; Merchant, Sabeeha S; Prochnik, Simon

2014-10-01

The green alga Chlamydomonas reinhardtii is a popular unicellular organism for studying photosynthesis, cilia biogenesis, and micronutrient homeostasis. Ten years since its genome project was initiated an iterative process of improvements to the genome and gene predictions has propelled this organism to the forefront of the omics era. Housed at Phytozome, the plant genomics portal of the Joint Genome Institute (JGI), the most up-to-date genomic data include a genome arranged on chromosomes and high-quality gene models with alternative splice forms supported by an abundance of whole transcriptome sequencing (RNA-Seq) data. We present here the past, present, and future of Chlamydomonas genomics. Specifically, we detail progress on genome assembly and gene model refinement, discuss resources for gene annotations, functional predictions, and locus ID mapping between versions and, importantly, outline a standardized framework for naming genes. Copyright © 2014 Elsevier Ltd. All rights reserved.
Xander: employing a novel method for efficient gene-targeted metagenomic assembly

DOE PAGES

Wang, Qiong; Fish, Jordan A.; Gilman, Mariah; ...

2015-08-05

Here, metagenomics can provide important insight into microbial communities. However, assembling metagenomic datasets has proven to be computationally challenging. Current methods often assemble only fragmented partial genes. We present a novel method for targeting assembly of specific protein-coding genes. This method combines a de Bruijn graph, as used in standard assembly approaches, and a protein profile hidden Markov model (HMM) for the gene of interest, as used in standard annotation approaches. These are used to create a novel combined weighted assembly graph. Xander performs both assembly and annotation concomitantly using information incorporated in this graph. We demonstrate the utility ofmore » this approach by assembling contigs for one phylogenetic marker gene and for two functional marker genes, first on Human Microbiome Project (HMP)-defined community Illumina data and then on 21 rhizosphere soil metagenomic datasets from three different crops totaling over 800 Gbp of unassembled data. We compared our method to a recently published bulk metagenome assembly method and a recently published gene-targeted assembler and found our method produced more, longer, and higher quality gene sequences. In conclusion, xander combines gene assignment with the rapid assembly of full-length or near full-length functional genes from metagenomic data without requiring bulk assembly or post-processing to find genes of interest. HMMs used for assembly can be tailored to the targeted genes, allowing flexibility to improve annotation over generic annotation pipelines.« less
Quasispecies theory for finite populations

PubMed Central

Park, Jeong-Man; Muñoz, Enrique; Deem, Michael W.

2015-01-01

We present stochastic, finite-population formulations of the Crow-Kimura and Eigen models of quasispecies theory, for fitness functions that depend in an arbitrary way on the number of mutations from the wild type. We include back mutations in our description. We show that the fluctuation of the population numbers about the average values are exceedingly large in these physical models of evolution. We further show that horizontal gene transfer reduces by orders of magnitude the fluctuations in the population numbers and reduces the accumulation of deleterious mutations in the finite population due to Muller’s ratchet. Indeed the population sizes needed to converge to the infinite population limit are often larger than those found in nature for smooth fitness functions in the absence of horizontal gene transfer. These analytical results are derived for the steady-state by means of a field-theoretic representation. Numerical results are presented that indicate horizontal gene transfer speeds up the dynamics of evolution as well. PMID:20365394
Molecular insights into the origin of the Hox-TALE patterning system

PubMed Central

Hudry, Bruno; Thomas-Chollier, Morgane; Volovik, Yael; Duffraisse, Marilyne; Dard, Amélie; Frank, Dale; Technau, Ulrich; Merabet, Samir

2014-01-01

Despite tremendous body form diversity in nature, bilaterian animals share common sets of developmental genes that display conserved expression patterns in the embryo. Among them are the Hox genes, which define different identities along the anterior–posterior axis. Hox proteins exert their function by interaction with TALE transcription factors. Hox and TALE members are also present in some but not all non-bilaterian phyla, raising the question of how Hox–TALE interactions evolved to provide positional information. By using proteins from unicellular and multicellular lineages, we showed that these networks emerged from an ancestral generic motif present in Hox and other related protein families. Interestingly, Hox-TALE networks experienced additional and extensive molecular innovations that were likely crucial for differentiating Hox functions along body plans. Together our results highlight how homeobox gene families evolved during eukaryote evolution to eventually constitute a major patterning system in Eumetazoans. DOI: http://dx.doi.org/10.7554/eLife.01939.001 PMID:24642410
Molecular insights into the origin of the Hox-TALE patterning system.

PubMed

Hudry, Bruno; Thomas-Chollier, Morgane; Volovik, Yael; Duffraisse, Marilyne; Dard, Amélie; Frank, Dale; Technau, Ulrich; Merabet, Samir

2014-03-18

Despite tremendous body form diversity in nature, bilaterian animals share common sets of developmental genes that display conserved expression patterns in the embryo. Among them are the Hox genes, which define different identities along the anterior-posterior axis. Hox proteins exert their function by interaction with TALE transcription factors. Hox and TALE members are also present in some but not all non-bilaterian phyla, raising the question of how Hox-TALE interactions evolved to provide positional information. By using proteins from unicellular and multicellular lineages, we showed that these networks emerged from an ancestral generic motif present in Hox and other related protein families. Interestingly, Hox-TALE networks experienced additional and extensive molecular innovations that were likely crucial for differentiating Hox functions along body plans. Together our results highlight how homeobox gene families evolved during eukaryote evolution to eventually constitute a major patterning system in Eumetazoans. DOI: http://dx.doi.org/10.7554/eLife.01939.001.
DIANA-microT web server: elucidating microRNA functions through target prediction.

PubMed

Maragkakis, M; Reczko, M; Simossis, V A; Alexiou, P; Papadopoulos, G L; Dalamagas, T; Giannopoulos, G; Goumas, G; Koukis, E; Kourtis, K; Vergoulis, T; Koziris, N; Sellis, T; Tsanakas, P; Hatzigeorgiou, A G

2009-07-01

Computational microRNA (miRNA) target prediction is one of the key means for deciphering the role of miRNAs in development and disease. Here, we present the DIANA-microT web server as the user interface to the DIANA-microT 3.0 miRNA target prediction algorithm. The web server provides extensive information for predicted miRNA:target gene interactions with a user-friendly interface, providing extensive connectivity to online biological resources. Target gene and miRNA functions may be elucidated through automated bibliographic searches and functional information is accessible through Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. The web server offers links to nomenclature, sequence and protein databases, and users are facilitated by being able to search for targeted genes using different nomenclatures or functional features, such as the genes possible involvement in biological pathways. The target prediction algorithm supports parameters calculated individually for each miRNA:target gene interaction and provides a signal-to-noise ratio and a precision score that helps in the evaluation of the significance of the predicted results. Using a set of miRNA targets recently identified through the pSILAC method, the performance of several computational target prediction programs was assessed. DIANA-microT 3.0 achieved there with 66% the highest ratio of correctly predicted targets over all predicted targets. The DIANA-microT web server is freely available at www.microrna.gr/microT.
Microbial Mechanisms Mediating Increased Soil C Storage under Elevated Atmospheric N Deposition

PubMed Central

Freedman, Zachary; Zak, Donald R.; Xue, Kai; He, Zhili; Zhou, Jizhong

2013-01-01

Future rates of anthropogenic N deposition can slow the cycling and enhance the storage of C in forest ecosystems. In a northern hardwood forest ecosystem, experimental N deposition has decreased the extent of forest floor decay, leading to increased soil C storage. To better understand the microbial mechanisms mediating this response, we examined the functional genes derived from communities of actinobacteria and fungi present in the forest floor using GeoChip 4.0, a high-throughput functional-gene microarray. The compositions of functional genes derived from actinobacterial and fungal communities was significantly altered by experimental nitrogen deposition, with more heterogeneity detected in both groups. Experimental N deposition significantly decreased the richness and diversity of genes involved in the depolymerization of starch (∼12%), hemicellulose (∼16%), cellulose (∼16%), chitin (∼15%), and lignin (∼16%). The decrease in richness occurred across all taxonomic groupings detected by the microarray. The compositions of genes encoding oxidoreductases, which plausibly mediate lignin decay, were responsible for much of the observed dissimilarity between actinobacterial communities under ambient and experimental N deposition. This shift in composition and decrease in richness and diversity of genes encoding enzymes that mediate the decay process has occurred in parallel with a reduction in the extent of decay and accumulation of soil organic matter. Our observations indicate that compositional changes in actinobacterial and fungal communities elicited by experimental N deposition have functional implications for the cycling and storage of carbon in forest ecosystems. PMID:23220961
Functional Gene Analysis of Freshwater Iron-Rich Flocs at Circumneutral pH and Isolation of a Stalk-Forming Microaerophilic Iron-Oxidizing Bacterium

PubMed Central

Chan, Clara; Itoh, Takashi; Ohkuma, Moriya

2013-01-01

Iron-rich flocs often occur where anoxic water containing ferrous iron encounters oxygenated environments. Culture-independent molecular analyses have revealed the presence of 16S rRNA gene sequences related to diverse bacteria, including autotrophic iron oxidizers and methanotrophs in iron-rich flocs; however, the metabolic functions of the microbial communities remain poorly characterized, particularly regarding carbon cycling. In the present study, we cultivated iron-oxidizing bacteria (FeOB) and performed clone library analyses of functional genes related to carbon fixation and methane oxidization (cbbM and pmoA, respectively), in addition to bacterial and archaeal 16S rRNA genes, in freshwater iron-rich flocs at groundwater discharge points. The analyses of 16S rRNA, cbbM, and pmoA genes strongly suggested the coexistence of autotrophic iron oxidizers and methanotrophs in the flocs. Furthermore, a novel stalk-forming microaerophilic FeOB, strain OYT1, was isolated and characterized phylogenetically and physiologically. The 16S rRNA and cbbM gene sequences of OYT1 are related to those of other microaerophilic FeOB in the family Gallionellaceae, of the Betaproteobacteria, isolated from freshwater environments at circumneutral pH. The physiological characteristics of OYT1 will help elucidate the ecophysiology of microaerophilic FeOB. Overall, this study demonstrates functional roles of microorganisms in iron flocs, suggesting several possible linkages between Fe and C cycling. PMID:23811518
Uptake and Function Studies of Maternal Milk-derived MicroRNAs*

PubMed Central

Title, Alexandra C.; Denzler, Rémy; Stoffel, Markus

2015-01-01

MicroRNAs (miRNAs) are important regulators of cell-autonomous gene expression that influence many biological processes. They are also released from cells and are present in virtually all body fluids, including blood, urine, saliva, sweat, and milk. The functional role of nutritionally obtained extracellular miRNAs is controversial, and irrefutable demonstration of exogenous miRNA uptake by cells and canonical miRNA function is still lacking. Here we show that miRNAs are present at high levels in the milk of lactating mice. To investigate intestinal uptake of miRNAs in newborn mice, we employed genetic models in which newborn miR-375 and miR-200c/141 knockout mice received milk from wild-type foster mothers. Analysis of the intestinal epithelium, blood, liver, and spleen revealed no evidence for miRNA uptake. miR-375 levels in hepatocytes were at the limit of detection and remained orders of magnitude below the threshold for target gene regulation (between 1000 and 10,000 copies/cell). Furthermore, our study revealed rapid degradation of milk miRNAs in intestinal fluid. Together, our results indicate a nutritional rather than gene-regulatory role of miRNAs in the milk of newborn mice. PMID:26240150
Loss of Function of KCNC1 is associated with intellectual disability without seizures

PubMed Central

Poirier, Karine; Viot, Géraldine; Lombardi, Laura; Jauny, Clémence; Billuart, Pierre; Bienvenu, Thierry

2017-01-01

p.(Arg320His) mutation in the KCNC1 gene in human 11p15.1 has recently been identified in patients with progressive myoclonus epilepsies, a group of rare inherited disorders manifesting with action myoclonus, myoclonic epilepsy, and ataxia. This KCNC1 variant causes a dominant-negative effect. Here we describe three patients from the same family with intellectual disability and dysmorphic features. The three affected individuals carry a c.1015C>T (p.(Arg339*)) nonsense variant in KCNC1 gene. As previously observed in the mutant mouse carrying a disrupted KCNC1 gene, these findings reveal that individuals with a KCNC1 loss-of-function variant can present intellectual disability without seizure and epilepsy. PMID:28145425
Identification of Streptococcus mitis321A vaccine antigens based on reverse vaccinology

PubMed Central

Zhang, Qiao; Lin, Kexiong; Wang, Changzheng; Xu, Zhi; Yang, Li; Ma, Qianli

2018-01-01

Streptococcus mitis (S. mitis) may transform into highly pathogenic bacteria. The aim of the present study was to identify potential antigen targets for designing an effective vaccine against the pathogenic S. mitis321A. The genome of S. mitis321A was sequenced using an Illumina Hiseq2000 instrument. Subsequently, Glimmer 3.02 and Tandem Repeat Finder (TRF) 4.04 were used to predict genes and tandem repeats, respectively, with DNA sequence function analysis using the Basic Local Alignment Search Tool (BLAST) in the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Cluster of Orthologous Groups of proteins (COG) databases. Putative gene antigen candidates were screened with BLAST ahead of phylogenetic tree analysis. The DNA sequence assembly size was 2,110,680 bp with 40.12% GC, 6 scaffolds and 9 contig. Consequently, 1,944 genes were predicted, and 119 TRF, 56 microsatellite DNA, 10 minisatellite DNA and 154 transposons were acquired. The predicted genes were associated with various pathways and functions concerning membrane transport and energy metabolism. Multiple putative genes encoding surface proteins, secreted proteins and virulence factors, as well as essential genes were determined. The majority of essential genes belonged to a phylogenetic lineage, while 321AGL000129 and 321AGL000299 were on the same branch. The current study provided useful information regarding the biological function of the S. mitis321A genome and recommends putative antigen candidates for developing a potent vaccine against S. mitis. PMID:29620181
Massive activation of archaeal defense genes during viral infection.

PubMed

Quax, Tessa E F; Voet, Marleen; Sismeiro, Odile; Dillies, Marie-Agnes; Jagla, Bernd; Coppée, Jean-Yves; Sezonov, Guennadi; Forterre, Patrick; van der Oost, John; Lavigne, Rob; Prangishvili, David

2013-08-01

Archaeal viruses display unusually high genetic and morphological diversity. Studies of these viruses proved to be instrumental for the expansion of knowledge on viral diversity and evolution. The Sulfolobus islandicus rod-shaped virus 2 (SIRV2) is a model to study virus-host interactions in Archaea. It is a lytic virus that exploits a unique egress mechanism based on the formation of remarkable pyramidal structures on the host cell envelope. Using whole-transcriptome sequencing, we present here a global map defining host and viral gene expression during the infection cycle of SIRV2 in its hyperthermophilic host S. islandicus LAL14/1. This information was used, in combination with a yeast two-hybrid analysis of SIRV2 protein interactions, to advance current understanding of viral gene functions. As a consequence of SIRV2 infection, transcription of more than one-third of S. islandicus genes was differentially regulated. While expression of genes involved in cell division decreased, those genes playing a role in antiviral defense were activated on a large scale. Expression of genes belonging to toxin-antitoxin and clustered regularly interspaced short palindromic repeat (CRISPR)-Cas systems was specifically pronounced. The observed different degree of activation of various CRISPR-Cas systems highlights the specialized functions they perform. The information on individual gene expression and activation of antiviral defense systems is expected to aid future studies aimed at detailed understanding of the functions and interplay of these systems in vivo.

A Phylogenomic Investigation of CYCLOIDEA-Like TCP Genes in the Leguminosae1

PubMed Central

Citerne, Hélène L.; Luo, Da; Pennington, R. Toby; Coen, Enrico; Cronk, Quentin C.B.

2003-01-01

Numerous TCP genes (transcription factors with a TCP domain) occur in legumes. Genes of this class in Arabidopsis (TCP1) and snapdragon (Antirrhinum majus; CYCLOIDEA) have been shown to be asymmetrically expressed in developing floral primordia, and in snapdragon, they are required for floral zygomorphy (bilaterally symmetrical flowers). These genes are therefore particularly interesting in Leguminosae, a family that is thought to have evolved zygomorphy independently from other zygomorphic angiosperm lineages. Using a phylogenomic approach, we show that homologs of TCP1/CYCLOIDEA occur in legumes and may be divided into two main classes (LEGCYC group I and II), apparently the result of an early duplication, and each class is characterized by a typical amino acid signature in the TCP domain. Furthermore, group I genes in legumes may be divided into two subclasses (LEGCYC IA and IB), apparently the result of a duplication near the base of the papilionoid legumes or below. Most papilionoid legumes investigated have all three genes present (LEGCYC IA, IB, and II), inviting further work to investigate possible functional difference between the three types. However, within these three major gene groups, the precise relationships of the paralogs between species are difficult to determine probably because of a complex history of duplication and loss with lineage sorting or heterotachy (within-site rate variation) due to functional differentiation. The results illustrate both the potential and the difficulties of orthology determination in variable gene families, on which the phylogenomic approach to formulating hypotheses of function depends. PMID:12644657
Transcriptomic Response of Porcine PBMCs to Vaccination with Tetanus Toxoid as a Model Antigen

PubMed Central

Adler, Marcel; Murani, Eduard; Brunner, Ronald; Ponsuksili, Siriluck; Wimmers, Klaus

2013-01-01

The aim of the present study was to characterize in vivo genome-wide transcriptional responses to immune stimulation in order to get insight into the resulting changes of allocation of resources. Vaccination with tetanus toxoid was used as a model for a mixed Th1 and Th2 immune response in pig. Expression profiles of PBMCs (peripheral blood mononuclear cells) before and at 12 time points over a period of four weeks after initial and booster vaccination at day 14 were studied by use of Affymetrix GeneChip microarrays and Ingenuity Pathway Analysis (IPA). The transcriptome data in total comprised more than 5000 genes with different transcript abundances (DE-genes). Within the single time stages the numbers of DE-genes were between several hundred and more than 1000. Ingenuity Pathway Analysis mainly revealed canonical pathways of cellular immune response and cytokine signaling as well as a broad range of processes in cellular and organismal growth, proliferation and development, cell signaling, biosynthesis and metabolism. Significant changes in the expression profiles of PBMCs already occurred very early after immune stimulation. At two hours after the first vaccination 679 DE-genes corresponding to 110 canonical pathways of cytokine signaling, cellular immune response and other multiple cellular functions were found. Immune competence and global disease resistance are heritable but difficult to measure and to address by breeding. Besides QTL mapping of immune traits gene expression profiling facilitates the detection of functional gene networks and thus functional candidate genes. PMID:23536793
Transcriptomic response of porcine PBMCs to vaccination with tetanus toxoid as a model antigen.

PubMed

Adler, Marcel; Murani, Eduard; Brunner, Ronald; Ponsuksili, Siriluck; Wimmers, Klaus

2013-01-01

The aim of the present study was to characterize in vivo genome-wide transcriptional responses to immune stimulation in order to get insight into the resulting changes of allocation of resources. Vaccination with tetanus toxoid was used as a model for a mixed Th1 and Th2 immune response in pig. Expression profiles of PBMCs (peripheral blood mononuclear cells) before and at 12 time points over a period of four weeks after initial and booster vaccination at day 14 were studied by use of Affymetrix GeneChip microarrays and Ingenuity Pathway Analysis (IPA). The transcriptome data in total comprised more than 5000 genes with different transcript abundances (DE-genes). Within the single time stages the numbers of DE-genes were between several hundred and more than 1000. Ingenuity Pathway Analysis mainly revealed canonical pathways of cellular immune response and cytokine signaling as well as a broad range of processes in cellular and organismal growth, proliferation and development, cell signaling, biosynthesis and metabolism. Significant changes in the expression profiles of PBMCs already occurred very early after immune stimulation. At two hours after the first vaccination 679 DE-genes corresponding to 110 canonical pathways of cytokine signaling, cellular immune response and other multiple cellular functions were found. Immune competence and global disease resistance are heritable but difficult to measure and to address by breeding. Besides QTL mapping of immune traits gene expression profiling facilitates the detection of functional gene networks and thus functional candidate genes.
A Functional and Regulatory Network Associated with PIP Expression in Human Breast Cancer

PubMed Central

Debily, Marie-Anne; Marhomy, Sandrine El; Boulanger, Virginie; Eveno, Eric; Mariage-Samson, Régine; Camarca, Alessandra; Auffray, Charles; Piatier-Tonneau, Dominique; Imbeaud, Sandrine

2009-01-01

Background The PIP (prolactin-inducible protein) gene has been shown to be expressed in breast cancers, with contradictory results concerning its implication. As both the physiological role and the molecular pathways in which PIP is involved are poorly understood, we conducted combined gene expression profiling and network analysis studies on selected breast cancer cell lines presenting distinct PIP expression levels and hormonal receptor status, to explore the functional and regulatory network of PIP co-modulated genes. Principal Findings Microarray analysis allowed identification of genes co-modulated with PIP independently of modulations resulting from hormonal treatment or cell line heterogeneity. Relevant clusters of genes that can discriminate between [PIP+] and [PIP−] cells were identified. Functional and regulatory network analyses based on a knowledge database revealed a master network of PIP co-modulated genes, including many interconnecting oncogenes and tumor suppressor genes, half of which were detected as differentially expressed through high-precision measurements. The network identified appears associated with an inhibition of proliferation coupled with an increase of apoptosis and an enhancement of cell adhesion in breast cancer cell lines, and contains many genes with a STAT5 regulatory motif in their promoters. Conclusions Our global exploratory approach identified biological pathways modulated along with PIP expression, providing further support for its good prognostic value of disease-free survival in breast cancer. Moreover, our data pointed to the importance of a regulatory subnetwork associated with PIP expression in which STAT5 appears as a potential transcriptional regulator. PMID:19262752
Identification of novel and known oocyte-specific genes using complementary DNA subtraction and microarray analysis in three different species.

PubMed

Vallée, Maud; Gravel, Catherine; Palin, Marie-France; Reghenas, Hélène; Stothard, Paul; Wishart, David S; Sirard, Marc-André

2005-07-01

The main objective of the present study was to identify novel oocyte-specific genes in three different species: bovine, mouse, and Xenopus laevis. To achieve this goal, two powerful technologies were combined: a polymerase chain reaction (PCR)-based cDNA subtraction, and cDNA microarrays. Three subtractive libraries consisting of 3456 clones were established and enriched for oocyte-specific transcripts. Sequencing analysis of the positive insert-containing clones resulted in the following classification: 53% of the clones corresponded to known cDNAs, 26% were classified as uncharacterized cDNAs, and a final 9% were classified as novel sequences. All these clones were used for cDNA microarray preparation. Results from these microarray analyses revealed that in addition to already known oocyte-specific genes, such as GDF9, BMP15, and ZP, known genes with unknown function in the oocyte were identified, such as a MLF1-interacting protein (MLF1IP), B-cell translocation gene 4 (BTG4), and phosphotyrosine-binding protein (xPTB). Furthermore, 15 novel oocyte-specific genes were validated by reverse transcription-PCR to confirm their preferential expression in the oocyte compared to somatic tissues. The results obtained in the present study confirmed that microarray analysis is a robust technique to identify true positives from the suppressive subtractive hybridization experiment. Furthermore, obtaining oocyte-specific genes from three species simultaneously allowed us to look at important genes that are conserved across species. Further characterization of these novel oocyte-specific genes will lead to a better understanding of the molecular mechanisms related to the unique functions found in the oocyte.
Oncogene 6b from Agrobacterium tumefaciens induces abaxial cell division at late stages of leaf development and modifies vascular development in petioles.

PubMed

Terakura, Shinji; Kitakura, Saeko; Ishikawa, Masaki; Ueno, Yoshihisa; Fujita, Tomomichi; Machida, Chiyoko; Wabiko, Hiroetsu; Machida, Yasunori

2006-05-01

The 6b gene in the T-DNA region of the Ti plasmids of Agrobacterium tumefaciens and A. vitis is able to generate shooty calli in phytohormone-free culture of leaf sections of tobacco transformed with 6b. In the present study, we report characteristic morphological abnormalities of the leaves of transgenic tobacco and Arabidopsis that express 6b from pTiAKE10 (AK-6b), and altered expression of genes related to cell division and meristem formation in the transgenic plants. Cotyledons and leaves of both transgenic tobacco and Arabidopsis exhibited various abnormalities including upward curling of leaf blades, and transgenic tobacco leaves produced leaf-like outgrowths from the abaxial side. Transcripts of some class 1 KNOX homeobox genes, which are thought to be related to meristem functions, and cell cycle regulating genes were ectopically accumulated in mature leaves. M phase-specific genes were also ectopically expressed at the abaxial sides of mature leaves. These results suggest that the AK-6b gene stimulates the cellular potential for division and meristematic functions preferentially in the abaxial side of leaves and that the leaf phenotypes generated by AK-6b are at least in part due to such biased cell division during polar development of leaves. The results of the present experiments with a fusion gene between the AK-6b gene and the glucocorticoid receptor gene showed that nuclear import of the AK-6b protein was essential for upward curling of leaves and hormone-free callus formation, suggesting a role for AK-6b in nuclear events.
Lamins of the sea lamprey (Petromyzon marinus) and the evolution of the vertebrate lamin protein family.

PubMed

Schilf, Paul; Peter, Annette; Hurek, Thomas; Stick, Reimer

2014-07-01

Lamin proteins are found in all metazoans. Most non-vertebrate genomes including those of the closest relatives of vertebrates, the cephalochordates and tunicates, encode only a single lamin. In teleosts and tetrapods the number of lamin genes has quadrupled. They can be divided into four sub-types, lmnb1, lmnb2, LIII, and lmna, each characterized by particular features and functional differentiations. Little is known when during vertebrate evolution these features have emerged. Lampreys belong to the Agnatha, the sister group of the Gnathostomata. They split off first within the vertebrate lineage. Analysis of the sea lamprey (Petromyzon marinus) lamin complement presented here, identified three functional lamin genes, one encoding a lamin LIII, indicating that the characteristic gene structure of this subtype had been established prior to the agnathan/gnathostome split. Two other genes encode lamins for which orthology to gnathostome lamins cannot be designated. Search for lamin gene sequences in all vertebrate taxa for which sufficient sequence data are available reveals the evolutionary time frame in which specific features of the vertebrate lamins were established. Structural features characteristic for A-type lamins are not found in the lamprey genome. In contrast, lmna genes are present in all gnathostome lineages suggesting that this gene evolved with the emergence of the gnathostomes. The analysis of lamin gene neighborhoods reveals noticeable similarities between the different vertebrate lamin genes supporting the hypothesis that they emerged due to two rounds of whole genome duplication and makes clear that an orthologous relationship between a particular vertebrate paralog and lamins outside the vertebrate lineage cannot be established. Copyright © 2014 Elsevier GmbH. All rights reserved.
Discovering semantic features in the literature: a foundation for building functional associations

PubMed Central

Chagoyen, Monica; Carmona-Saez, Pedro; Shatkay, Hagit; Carazo, Jose M; Pascual-Montano, Alberto

2006-01-01

Background Experimental techniques such as DNA microarray, serial analysis of gene expression (SAGE) and mass spectrometry proteomics, among others, are generating large amounts of data related to genes and proteins at different levels. As in any other experimental approach, it is necessary to analyze these data in the context of previously known information about the biological entities under study. The literature is a particularly valuable source of information for experiment validation and interpretation. Therefore, the development of automated text mining tools to assist in such interpretation is one of the main challenges in current bioinformatics research. Results We present a method to create literature profiles for large sets of genes or proteins based on common semantic features extracted from a corpus of relevant documents. These profiles can be used to establish pair-wise similarities among genes, utilized in gene/protein classification or can be even combined with experimental measurements. Semantic features can be used by researchers to facilitate the understanding of the commonalities indicated by experimental results. Our approach is based on non-negative matrix factorization (NMF), a machine-learning algorithm for data analysis, capable of identifying local patterns that characterize a subset of the data. The literature is thus used to establish putative relationships among subsets of genes or proteins and to provide coherent justification for this clustering into subsets. We demonstrate the utility of the method by applying it to two independent and vastly different sets of genes. Conclusion The presented method can create literature profiles from documents relevant to sets of genes. The representation of genes as additive linear combinations of semantic features allows for the exploration of functional associations as well as for clustering, suggesting a valuable methodology for the validation and interpretation of high-throughput experimental data. PMID:16438716
Characterisation of the Manduca sexta sperm proteome: Genetic novelty underlying sperm composition in Lepidoptera.

PubMed

Whittington, Emma; Zhao, Qian; Borziak, Kirill; Walters, James R; Dorus, Steve

2015-07-01

The application of mass spectrometry based proteomics to sperm biology has greatly accelerated progress in understanding the molecular composition and function of spermatozoa. To date, these approaches have been largely restricted to model organisms, all of which produce a single sperm morph capable of oocyte fertilisation. Here we apply high-throughput mass spectrometry proteomic analysis to characterise sperm composition in Manduca sexta, the tobacco hornworm moth, which produce heteromorphic sperm, including one fertilisation competent (eupyrene) and one incompetent (apyrene) sperm type. This resulted in the high confidence identification of 896 proteins from a co-mixed sample of both sperm types, of which 167 are encoded by genes with strict one-to-one orthology in Drosophila melanogaster. Importantly, over half (55.1%) of these orthologous proteins have previously been identified in the D. melanogaster sperm proteome and exhibit significant conservation in quantitative protein abundance in sperm between the two species. Despite the complex nature of gene expression across spermatogenic stages, a significant correlation was also observed between sperm protein abundance and testis gene expression. Lepidopteran-specific sperm proteins (e.g., proteins with no homology to proteins in non-Lepidopteran taxa) were present in significantly greater abundance on average than those with homology outside the Lepidoptera. Given the disproportionate production of apyrene sperm (96% of all mature sperm in Manduca) relative to eupyrene sperm, these evolutionarily novel and highly abundant proteins are candidates for possessing apyrene-specific functions. Lastly, comparative genomic analyses of testis-expressed, ovary-expressed and sperm genes identified a concentration of novel sperm proteins shared amongst Lepidoptera of potential relevance to the evolutionary origin of heteromorphic spermatogenesis. As the first published Lepidopteran sperm proteome, this whole-cell proteomic characterisation will facilitate future evolutionary genetic and developmental studies of heteromorphic sperm production and parasperm function. Furthermore, the analyses presented here provide useful annotation information regarding sex-biased gene expression, novel Lepidopteran genes and gene function in the male gamete to complement the newly sequenced and annotated Manduca genome. Copyright © 2015 Elsevier Ltd. All rights reserved.
Zebrafish models for the functional genomics of neurogenetic disorders.

PubMed

Kabashi, Edor; Brustein, Edna; Champagne, Nathalie; Drapeau, Pierre

2011-03-01

In this review, we consider recent work using zebrafish to validate and study the functional consequences of mutations of human genes implicated in a broad range of degenerative and developmental disorders of the brain and spinal cord. Also we present technical considerations for those wishing to study their own genes of interest by taking advantage of this easily manipulated and clinically relevant model organism. Zebrafish permit mutational analyses of genetic function (gain or loss of function) and the rapid validation of human variants as pathological mutations. In particular, neural degeneration can be characterized at genetic, cellular, functional, and behavioral levels. Zebrafish have been used to knock down or express mutations in zebrafish homologs of human genes and to directly express human genes bearing mutations related to neurodegenerative disorders such as spinal muscular atrophy, ataxia, hereditary spastic paraplegia, amyotrophic lateral sclerosis (ALS), epilepsy, Huntington's disease, Parkinson's disease, fronto-temporal dementia, and Alzheimer's disease. More recently, we have been using zebrafish to validate mutations of synaptic genes discovered by large-scale genomic approaches in developmental disorders such as autism, schizophrenia, and non-syndromic mental retardation. Advances in zebrafish genetics such as multigenic analyses and chemical genetics now offer a unique potential for disease research. Thus, zebrafish hold much promise for advancing the functional genomics of human diseases, the understanding of the genetics and cell biology of degenerative and developmental disorders, and the discovery of therapeutics. This article is part of a Special Issue entitled Zebrafish Models of Neurological Diseases. Copyright Â© 2010 Elsevier B.V. All rights reserved.
Identification of mycoparasitism-related genes against the phytopathogen Sclerotinia sclerotiorum through transcriptome and expression profile analysis in Trichoderma harzianum.

PubMed

Steindorff, Andrei Stecca; Ramada, Marcelo Henrique Soller; Coelho, Alexandre Siqueira Guedes; Miller, Robert Neil Gerard; Pappas, Georgios Joannis; Ulhoa, Cirano José; Noronha, Eliane Ferreira

2014-03-18

The species of T. harzianum are well known for their biocontrol activity against plant pathogens. However, few studies have been conducted to further our understanding of its role as a biological control agent against S. sclerotiorum, a pathogen involved in several crop diseases around the world. In this study, we have used RNA-seq and quantitative real-time PCR (RT-qPCR) techniques in order to explore changes in T. harzianum gene expression during growth on cell wall of S. sclerotiorum (SSCW) or glucose. RT-qPCR was also used to examine genes potentially involved in biocontrol, during confrontation between T. harzianum and S. sclerotiorum. Data obtained from six RNA-seq libraries were aligned onto the T. harzianum CBS 226.95 reference genome and compared after annotation using the Blast2GO suite. A total of 297 differentially expressed genes were found in mycelia grown for 12, 24 and 36 h under the two different conditions: supplemented with glucose or SSCW. Functional annotation of these genes identified diverse biological processes and molecular functions required during T. harzianum growth on SSCW or glucose. We identified various genes of biotechnological value encoding proteins with functions such as transporters, hydrolytic activity, adherence, appressorium development and pathogenesis. To validate the expression profile, RT-qPCR was performed using 20 randomly chosen genes. RT-qPCR expression profiles were in complete agreement with the RNA-Seq data for 17 of the genes evaluated. The other three showed differences at one or two growth times. During the confrontation assay, some genes were up-regulated during and after contact, as shown in the presence of SSCW which is commonly used as a model to mimic this interaction. The present study is the first initiative to use RNA-seq for identification of differentially expressed genes in T. harzianum strain TR274, in response to the phytopathogenic fungus S. sclerotiorum. It provides insights into the mechanisms of gene expression involved in mycoparasitism of T. harzianum against S.sclerotiorum. The RNA-seq data presented will facilitate improvement of the annotation of gene models in the draft T. harzianum genome and provide important information regarding the transcriptome during this interaction.
SinEx DB: a database for single exon coding sequences in mammalian genomes.

PubMed

Jorquera, Roddy; Ortiz, Rodrigo; Ossandon, F; Cárdenas, Juan Pablo; Sepúlveda, Rene; González, Carolina; Holmes, David S

2016-01-01

Eukaryotic genes are typically interrupted by intragenic, noncoding sequences termed introns. However, some genes lack introns in their coding sequence (CDS) and are generally known as 'single exon genes' (SEGs). In this work, a SEG is defined as a nuclear, protein-coding gene that lacks introns in its CDS. Whereas, many public databases of Eukaryotic multi-exon genes are available, there are only two specialized databases for SEGs. The present work addresses the need for a more extensive and diverse database by creating SinEx DB, a publicly available, searchable database of predicted SEGs from 10 completely sequenced mammalian genomes including human. SinEx DB houses the DNA and protein sequence information of these SEGs and includes their functional predictions (KOG) and the relative distribution of these functions within species. The information is stored in a relational database built with My SQL Server 5.1.33 and the complete dataset of SEG sequences and their functional predictions are available for downloading. SinEx DB can be interrogated by: (i) a browsable phylogenetic schema, (ii) carrying out BLAST searches to the in-house SinEx DB of SEGs and (iii) via an advanced search mode in which the database can be searched by key words and any combination of searches by species and predicted functions. SinEx DB provides a rich source of information for advancing our understanding of the evolution and function of SEGs.Database URL: www.sinex.cl. © The Author(s) 2016. Published by Oxford University Press.
Lessons from the canine Oxtr gene: populations, variants and functional aspects.

PubMed

Bence, M; Marx, P; Szantai, E; Kubinyi, E; Ronai, Z; Banlaki, Z

2017-04-01

Oxytocin receptor (OXTR) acts as a key behavioral modulator of the central nervous system, affecting social behavior, stress, affiliation and cognitive functions. Variants of the Oxtr gene are known to influence behavior both in animals and humans; however, canine Oxtr polymorphisms are less characterized in terms of possible relevance to function, selection criteria in breeding and domestication. In this report, we provide a detailed characterization of common variants of the canine Oxtr gene. In particular (1) novel polymorphisms were identified by direct sequencing of wolf and dog samples, (2) allelic distributions and pairwise linkage disequilibrium patterns of several canine populations were compared, (3) neighbor joining (NJ) tree based on common single nucleotide polymorphisms (SNPs) was constructed, (4) mRNA expression features were assessed, (5) a novel splice variant was detected and (6) in vitro functional assays were performed. Results indicate marked differences regarding Oxtr variations between purebred dogs of different breeds, free-ranging dog populations, wolf subspecies and golden jackals. This, together with existence of explicitly dog-specific alleles and data obtained from the NJ tree implies that Oxtr could indeed have been a target gene during domestication and selection for human preferred aspects of temperament and social behavior. This assumption is further supported by the present observations on gene expression patterns within the brain and luciferase reporter experiments, providing a molecular level link between certain canine Oxtr polymorphisms and differences in nervous system function and behavior. © 2016 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.
Bipartite Community Structure of eQTLs.

PubMed

Platig, John; Castaldi, Peter J; DeMeo, Dawn; Quackenbush, John

2016-09-01

Genome Wide Association Studies (GWAS) and expression quantitative trait locus (eQTL) analyses have identified genetic associations with a wide range of human phenotypes. However, many of these variants have weak effects and understanding their combined effect remains a challenge. One hypothesis is that multiple SNPs interact in complex networks to influence functional processes that ultimately lead to complex phenotypes, including disease states. Here we present CONDOR, a method that represents both cis- and trans-acting SNPs and the genes with which they are associated as a bipartite graph and then uses the modular structure of that graph to place SNPs into a functional context. In applying CONDOR to eQTLs in chronic obstructive pulmonary disease (COPD), we found the global network "hub" SNPs were devoid of disease associations through GWAS. However, the network was organized into 52 communities of SNPs and genes, many of which were enriched for genes in specific functional classes. We identified local hubs within each community ("core SNPs") and these were enriched for GWAS SNPs for COPD and many other diseases. These results speak to our intuition: rather than single SNPs influencing single genes, we see groups of SNPs associated with the expression of families of functionally related genes and that disease SNPs are associated with the perturbation of those functions. These methods are not limited in their application to COPD and can be used in the analysis of a wide variety of disease processes and other phenotypic traits.
A targeted genotyping approach enhances identification of variants in taste receptor and appetite/reward genes of potential functional importance for obesity-related porcine traits.

PubMed

Cirera, S; Clop, A; Jacobsen, M J; Guerin, M; Lesnik, P; Jørgensen, C B; Fredholm, M; Karlskov-Mortensen, P

2018-04-01

Taste receptors (TASRs) and appetite and reward (AR) mechanisms influence eating behaviour, which in turn affects food intake and risk of obesity. In a previous study, we used next generation sequencing to identify potentially functional mutations in TASR and AR genes and found indications for genetic associations between identified variants and growth and fat deposition in a subgroup of animals (n = 38) from the UNIK resource pig population. This population was created for studying obesity and obesity-related diseases. In the present study we validated results from our previous study by investigating genetic associations between 24 selected single nucleotide variants in TASR and AR gene variants and 35 phenotypes describing obesity and metabolism in the entire UNIK population (n = 564). Fifteen variants showed significant association with specific obesity-related phenotypes after Bonferroni correction. Six of the 15 genes, namely SIM1, FOS, TAS2R4, TAS2R9, MCHR2 and LEPR, showed good correlation between known biological function and associated phenotype. We verified a genetic association between potentially functional variants in TASR/AR genes and growth/obesity and conclude that the combination of identification of potentially functional variants by next generation sequencing followed by targeted genotyping and association studies is a powerful and cost-effective approach for increasing the power of genetic association studies. © 2018 Stichting International Foundation for Animal Genetics.
Comprehensive Analysis of Interaction Networks of Telomerase Reverse Transcriptase with Multiple Bioinformatic Approaches: Deep Mining the Potential Functions of Telomere and Telomerase.

PubMed

Hou, Chunyu; Wang, Fei; Liu, Xuewen; Chang, Guangming; Wang, Feng; Geng, Xin

2017-08-01

Telomerase reverse transcriptase (TERT) is the protein component of telomerase complex. Evidence has accumulated showing that the nontelomeric functions of TERT are independent of telomere elongation. However, the mechanisms governing the interaction between TERT and its target genes are not clearly revealed. The biological functions of TERT are not fully elucidated and have thus far been underestimated. To further explore these functions, we investigated TERT interaction networks using multiple bioinformatic databases, including BioGRID, STRING, DAVID, GeneCards, GeneMANIA, PANTHER, miRWalk, mirTarBase, miRNet, miRDB, and TargetScan. In addition, network diagrams were built using Cytoscape software. As competing endogenous RNAs (ceRNAs) are endogenous transcripts that compete for the binding of microRNAs (miRNAs) by using shared miRNA recognition elements, they are involved in creating widespread regulatory networks. Therefore, the ceRNA regulatory networks of TERT were also investigated in this study. Interestingly, we found that the three genes PABPC1, SLC7A11, and TP53 were present in both TERT interaction networks and ceRNAs target genes. It was predicted that TERT might play nontelomeric roles in the generation or development of some rare diseases, such as Rift Valley fever and dyscalculia. Thus, our data will help to decipher the interaction networks of TERT and reveal the unknown functions of telomerase in cancer and aging-related diseases.
Autosomal-Recessive Hypophosphatemic Rickets Is Associated with an Inactivation Mutation in the ENPP1 Gene

PubMed Central

Levy-Litan, Varda; Hershkovitz, Eli; Avizov, Luba; Leventhal, Neta; Bercovich, Dani; Chalifa-Caspi, Vered; Manor, Esther; Buriakovsky, Sophia; Hadad, Yair; Goding, James; Parvari, Ruti

2010-01-01

Human disorders of phosphate (Pi) handling and hypophosphatemic rickets have been shown to result from mutations in PHEX, FGF23, and DMP1, presenting as X-linked recessive, autosomal-dominant, and autosomal-recessive patterns, respectively. We present the identification of an inactivating mutation in the ecto-nucleotide pyrophosphatase/phosphodiesterase 1 (ENPP1) gene causing autosomal-recessive hypophosphatemic rickets (ARHR) with phosphaturia by positional cloning. ENPP1 generates inorganic pyrophosphate (PPi), an essential physiologic inhibitor of calcification, and previously described inactivating mutations in this gene were shown to cause aberrant ectopic calcification disorders, whereas no aberrant calcifications were present in our patients. Our surprising result suggests a different pathway involved in the generation of ARHR and possible additional functions for ENPP1. PMID:20137772
Unexplored Potentials of Epigenetic Mechanisms of Plants and Animals—Theoretical Considerations

PubMed Central

Seffer, Istvan; Nemeth, Zoltan; Hoffmann, Gyula; Matics, Robert; Seffer, A Gergely; Koller, Akos

2013-01-01

Morphological and functional changes of cells are important for adapting to environmental changes and associated with continuous regulation of gene expressions. Genes are regulated–in part–by epigenetic mechanisms resulting in alternating patterns of gene expressions throughout life. Epigenetic changes responding to the environmental and intercellular signals can turn on/off specific genes, but do not modify the DNA sequence. Most epigenetic mechanisms are evolutionary conserved in eukaryotic organisms, and several homologs of epigenetic factors are present in plants and animals. Moreover, in vitro studies suggest that the plant cytoplasm is able to induce a nuclear reassembly of the animal cell, whereas others suggest that the ooplasm is able to induce condensation of plant chromatin. Here, we provide an overview of the main epigenetic mechanisms regulating gene expression and discuss fundamental epigenetic mechanisms and factors functioning in both plants and animals. Finally, we hypothesize that animal genome can be reprogrammed by epigenetic factors from the plant protoplast. PMID:25512705
The Reconstruction and Analysis of Gene Regulatory Networks.

PubMed

Zheng, Guangyong; Huang, Tao

2018-01-01

In post-genomic era, an important task is to explore the function of individual biological molecules (i.e., gene, noncoding RNA, protein, metabolite) and their organization in living cells. For this end, gene regulatory networks (GRNs) are constructed to show relationship between biological molecules, in which the vertices of network denote biological molecules and the edges of network present connection between nodes (Strogatz, Nature 410:268-276, 2001; Bray, Science 301:1864-1865, 2003). Biologists can understand not only the function of biological molecules but also the organization of components of living cells through interpreting the GRNs, since a gene regulatory network is a comprehensively physiological map of living cells and reflects influence of genetic and epigenetic factors (Strogatz, Nature 410:268-276, 2001; Bray, Science 301:1864-1865, 2003). In this paper, we will review the inference methods of GRN reconstruction and analysis approaches of network structure. As a powerful tool for studying complex diseases and biological processes, the applications of the network method in pathway analysis and disease gene identification will be introduced.
Function and Phylogeny of Bacterial Butyryl Coenzyme A:Acetate Transferases and Their Diversity in the Proximal Colon of Swine.

PubMed

Trachsel, Julian; Bayles, Darrell O; Looft, Torey; Levine, Uri Y; Allen, Heather K

2016-11-15

Studying the host-associated butyrate-producing bacterial community is important, because butyrate is essential for colonic homeostasis and gut health. Previous research has identified the butyryl coenzyme A (CoA):acetate-CoA transferase (EC 2.3.8.3) as a gene of primary importance for butyrate production in intestinal ecosystems; however, this gene family (but) remains poorly defined. We developed tools for the analysis of butyrate-producing bacteria based on 12 putative but genes identified in the genomes of nine butyrate-producing bacteria obtained from the swine intestinal tract. Functional analyses revealed that eight of these genes had strong But enzyme activity. When but paralogues were found within a genome, only one gene per genome encoded strong activity, with the exception of one strain in which no gene encoded strong But activity. Degenerate primers were designed to amplify the functional but genes and were tested by amplifying environmental but sequences from DNA and RNA extracted from swine colonic contents. The results show diverse but sequences from swine-associated butyrate-producing bacteria, most of which clustered near functionally confirmed sequences. Here, we describe tools and a framework that allow the bacterial butyrate-producing community to be profiled in the context of animal health and disease. Butyrate is a compound produced by the microbiota in the intestinal tracts of animals. This compound is of critical importance for intestinal health, and yet studying its production by diverse intestinal bacteria is technically challenging. Here, we present an additional way to study the butyrate-producing community of bacteria using one degenerate primer set that selectively targets genes experimentally demonstrated to encode butyrate production. This work will enable researchers to more easily study this very important bacterial function that has implications for host health and resistance to disease. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

Integrated platform for genome-wide screening and construction of high-density genetic interaction maps in mammalian cells

PubMed Central

Kampmann, Martin; Bassik, Michael C.; Weissman, Jonathan S.

2013-01-01

A major challenge of the postgenomic era is to understand how human genes function together in normal and disease states. In microorganisms, high-density genetic interaction (GI) maps are a powerful tool to elucidate gene functions and pathways. We have developed an integrated methodology based on pooled shRNA screening in mammalian cells for genome-wide identification of genes with relevant phenotypes and systematic mapping of all GIs among them. We recently demonstrated the potential of this approach in an application to pathways controlling the susceptibility of human cells to the toxin ricin. Here we present the complete quantitative framework underlying our strategy, including experimental design, derivation of quantitative phenotypes from pooled screens, robust identification of hit genes using ultra-complex shRNA libraries, parallel measurement of tens of thousands of GIs from a single double-shRNA experiment, and construction of GI maps. We describe the general applicability of our strategy. Our pooled approach enables rapid screening of the same shRNA library in different cell lines and under different conditions to determine a range of different phenotypes. We illustrate this strategy here for single- and double-shRNA libraries. We compare the roles of genes for susceptibility to ricin and Shiga toxin in different human cell lines and reveal both toxin-specific and cell line-specific pathways. We also present GI maps based on growth and ricin-resistance phenotypes, and we demonstrate how such a comparative GI mapping strategy enables functional dissection of physical complexes and context-dependent pathways. PMID:23739767
Taxonomic and functional characteristics of microbial communities and their correlation with physicochemical properties of four geothermal springs in Odisha, India

PubMed Central

Badhai, Jhasketan; Ghosh, Tarini S.; Das, Subrata K.

2015-01-01

This study describes microbial diversity in four tropical hot springs representing moderately thermophilic environments (temperature range: 40–58°C; pH: 7.2–7.4) with discrete geochemistry. Metagenome sequence data showed a dominance of Bacteria over Archaea; the most abundant phyla were Chloroflexi and Proteobacteria, although other phyla were also present, such as Acetothermia, Nitrospirae, Acidobacteria, Firmicutes, Deinococcus-Thermus, Bacteroidetes, Thermotogae, Euryarchaeota, Verrucomicrobia, Ignavibacteriae, Cyanobacteria, Actinobacteria, Planctomycetes, Spirochaetes, Armatimonadetes, Crenarchaeota, and Aquificae. The distribution of major genera and their statistical correlation analyses with the physicochemical parameters predicted that the temperature, aqueous concentrations of ions (such as sodium, chloride, sulfate, and bicarbonate), total hardness, dissolved solids and conductivity were the main environmental variables influencing microbial community composition and diversity. Despite the observed high taxonomic diversity, there were only little variations in the overall functional profiles of the microbial communities in the four springs. Genes involved in the metabolism of carbohydrates and carbon fixation were the most abundant functional class of genes present in these hot springs. The distribution of genes involved in carbon fixation predicted the presence of all the six known autotrophic pathways in the metagenomes. A high prevalence of genes involved in membrane transport, signal transduction, stress response, bacterial chemotaxis, and flagellar assembly were observed along with genes involved in the pathways of xenobiotic degradation and metabolism. The analysis of the metagenomic sequences affiliated to the candidate phylum Acetothermia from spring TB-3 provided new insight into the metabolism and physiology of yet-unknown members of this lineage of bacteria. PMID:26579081
Taxonomic and functional characteristics of microbial communities and their correlation with physicochemical properties of four geothermal springs in Odisha, India.

PubMed

Badhai, Jhasketan; Ghosh, Tarini S; Das, Subrata K

2015-01-01

This study describes microbial diversity in four tropical hot springs representing moderately thermophilic environments (temperature range: 40-58°C; pH: 7.2-7.4) with discrete geochemistry. Metagenome sequence data showed a dominance of Bacteria over Archaea; the most abundant phyla were Chloroflexi and Proteobacteria, although other phyla were also present, such as Acetothermia, Nitrospirae, Acidobacteria, Firmicutes, Deinococcus-Thermus, Bacteroidetes, Thermotogae, Euryarchaeota, Verrucomicrobia, Ignavibacteriae, Cyanobacteria, Actinobacteria, Planctomycetes, Spirochaetes, Armatimonadetes, Crenarchaeota, and Aquificae. The distribution of major genera and their statistical correlation analyses with the physicochemical parameters predicted that the temperature, aqueous concentrations of ions (such as sodium, chloride, sulfate, and bicarbonate), total hardness, dissolved solids and conductivity were the main environmental variables influencing microbial community composition and diversity. Despite the observed high taxonomic diversity, there were only little variations in the overall functional profiles of the microbial communities in the four springs. Genes involved in the metabolism of carbohydrates and carbon fixation were the most abundant functional class of genes present in these hot springs. The distribution of genes involved in carbon fixation predicted the presence of all the six known autotrophic pathways in the metagenomes. A high prevalence of genes involved in membrane transport, signal transduction, stress response, bacterial chemotaxis, and flagellar assembly were observed along with genes involved in the pathways of xenobiotic degradation and metabolism. The analysis of the metagenomic sequences affiliated to the candidate phylum Acetothermia from spring TB-3 provided new insight into the metabolism and physiology of yet-unknown members of this lineage of bacteria.
Identification of functional modules that correlate with phenotypic difference: the influence of network topology

PubMed Central

2010-01-01

One of the important challenges to post-genomic biology is relating observed phenotypic alterations to the underlying collective alterations in genes. Current inferential methods, however, invariably omit large bodies of information on the relationships between genes. We present a method that takes account of such information - expressed in terms of the topology of a correlation network - and we apply the method in the context of current procedures for gene set enrichment analysis. PMID:20187943
A new gene for asthma: would you ADAM and Eve it?

PubMed

Cookson, William

2003-04-01

Recently, a novel gene was reported to underlie asthma. Linkage to the short arm of chromosome 20 in a genome screen was followed by positive tests of association that centre on the gene for a membrane-anchored zinc-dependent metalloproteinase known as ADAM33. The domain structure of the ADAM33 protein gives capabilities of proteolysis, adhesion, cell fusion and intracellular signalling. Although its function is at present unknown, these potential actions of ADAM33 provide many possibilities for further research.
Novel microcephalic primordial dwarfism disorder associated with variants in the centrosomal protein ninein.

PubMed

Dauber, Andrew; Lafranchi, Stephen H; Maliga, Zoltan; Lui, Julian C; Moon, Jennifer E; McDeed, Cailin; Henke, Katrin; Zonana, Jonathan; Kingman, Garrett A; Pers, Tune H; Baron, Jeffrey; Rosenfeld, Ron G; Hirschhorn, Joel N; Harris, Matthew P; Hwa, Vivian

2012-11-01

Microcephalic primordial dwarfism (MPD) is a rare, severe form of human growth failure in which growth restriction is evident in utero and continues into postnatal life. Single causative gene defects have been identified in a number of patients with MPD, and all involve genes fundamental to cellular processes including centrosome functions. The objective of the study was to find the genetic etiology of a novel presentation of MPD. The design of the study was whole-exome sequencing performed on two affected sisters in a single family. Molecular and functional studies of a candidate gene were performed using patient-derived primary fibroblasts and a zebrafish morpholino oligonucleotides knockdown model. Two sisters presented with a novel subtype of MPD, including severe intellectual disabilities. NIN, encoding Ninein, a centrosomal protein critically involved in asymmetric cell division, was identified as a candidate gene, and functional impacts in fibroblasts and zebrafish were studied. From 34,606 genomic variants, two very rare missense variants in NIN were identified. Both probands were compound heterozygotes. In the zebrafish, ninein knockdown led to specific and novel defects in the specification and morphogenesis of the anterior neuroectoderm, resulting in a deformity of the developing cranium with a small, squared skull highly reminiscent of the human phenotype. We identified a novel clinical subtype of MPD in two sisters who have rare variants in NIN. We show, for the first time, that reduction of ninein function in the developing zebrafish leads to specific deficiencies of brain and skull development, offering a developmental basis for the myriad phenotypes in our patients.
Filling gaps in PPAR-alpha signaling through comparative nutrigenomics analysis.

PubMed

Cavalieri, Duccio; Calura, Enrica; Romualdi, Chiara; Marchi, Emmanuela; Radonjic, Marijana; Van Ommen, Ben; Müller, Michael

2009-12-11

The application of high-throughput genomic tools in nutrition research is a widespread practice. However, it is becoming increasingly clear that the outcome of individual expression studies is insufficient for the comprehensive understanding of such a complex field. Currently, the availability of the large amounts of expression data in public repositories has opened up new challenges on microarray data analyses. We have focused on PPARalpha, a ligand-activated transcription factor functioning as fatty acid sensor controlling the gene expression regulation of a large set of genes in various metabolic organs such as liver, small intestine or heart. The function of PPARalpha is strictly connected to the function of its target genes and, although many of these have already been identified, major elements of its physiological function remain to be uncovered. To further investigate the function of PPARalpha, we have applied a cross-species meta-analysis approach to integrate sixteen microarray datasets studying high fat diet and PPARalpha signal perturbations in different organisms. We identified 164 genes (MDEGs) that were differentially expressed in a constant way in response to a high fat diet or to perturbations in PPARs signalling. In particular, we found five genes in yeast which were highly conserved and homologous of PPARalpha targets in mammals, potential candidates to be used as models for the equivalent mammalian genes. Moreover, a screening of the MDEGs for all known transcription factor binding sites and the comparison with a human genome-wide screening of Peroxisome Proliferating Response Elements (PPRE), enabled us to identify, 20 new potential candidate genes that show, both binding site, both change in expression in the condition studied. Lastly, we found a non random localization of the differentially expressed genes in the genome. The results presented are potentially of great interest to resume the currently available expression data, exploiting the power of in silico analysis filtered by evolutionary conservation. The analysis enabled us to indicate potential gene candidates that could fill in the gaps with regards to the signalling of PPARalpha and, moreover, the non-random localization of the differentially expressed genes in the genome, suggest that epigenetic mechanisms are of importance in the regulation of the transcription operated by PPARalpha.
Functions of bromodomain-containing proteins and their roles in homeostasis and cancer.

PubMed

Fujisawa, Takao; Filippakopoulos, Panagis

2017-04-01

Bromodomains (BRDs) are evolutionarily conserved protein-protein interaction modules that are found in a wide range of proteins with diverse catalytic and scaffolding functions and are present in most tissues. BRDs selectively recognize and bind to acetylated Lys residues - particularly in histones - and thereby have important roles in the regulation of gene expression. BRD-containing proteins are frequently dysregulated in cancer, they participate in gene fusions that generate diverse, frequently oncogenic proteins, and many cancer-causing mutations have been mapped to the BRDs themselves. Importantly, BRDs can be targeted by small-molecule inhibitors, which has stimulated many translational research projects that seek to attenuate the aberrant functions of BRD-containing proteins in disease.
Estradiol targets T cell signaling pathways in human systemic lupus.

PubMed

Walters, Emily; Rider, Virginia; Abdou, Nabih I; Greenwell, Cindy; Svojanovsky, Stan; Smith, Peter; Kimler, Bruce F

2009-12-01

The major risk factor for developing systemic lupus erythematosus (SLE) is being female. The present study utilized gene profiles of activated T cells from females with SLE and healthy controls to identify signaling pathways uniquely regulated by estradiol that could contribute to SLE pathogenesis. Selected downstream pathway genes (+/- estradiol) were measured by real time polymerase chain amplification. Estradiol uniquely upregulated six pathways in SLE T cells that control T cell function including interferon-alpha signaling. Measurement of interferon-alpha pathway target gene expression revealed significant differences (p= 0.043) in DRIP150 (+/- estradiol) in SLE T cell samples while IFIT1 expression was bimodal and correlated moderately (r= 0.55) with disease activity. The results indicate that estradiol alters signaling pathways in activated SLE T cells that control T cell function. Differential expression of transcriptional coactivators could influence estrogen-dependent gene regulation in T cell signaling and contribute to SLE onset and disease pathogenesis.
Binding and condensation of plasmid DNA onto functionalized carbon nanotubes: toward the construction of nanotube-based gene delivery vectors.

PubMed

Singh, Ravi; Pantarotto, Davide; McCarthy, David; Chaloin, Olivier; Hoebeke, Johan; Partidos, Charalambos D; Briand, Jean-Paul; Prato, Maurizio; Bianco, Alberto; Kostarelos, Kostas

2005-03-30

Carbon nanotubes (CNTs) constitute a class of nanomaterials that possess characteristics suitable for a variety of possible applications. Their compatibility with aqueous environments has been made possible by the chemical functionalization of their surface, allowing for exploration of their interactions with biological components including mammalian cells. Functionalized CNTs (f-CNTs) are being intensively explored in advanced biotechnological applications ranging from molecular biosensors to cellular growth substrates. We have been exploring the potential of f-CNTs as delivery vehicles of biologically active molecules in view of possible biomedical applications, including vaccination and gene delivery. Recently we reported the capability of ammonium-functionalized single-walled CNTs to penetrate human and murine cells and facilitate the delivery of plasmid DNA leading to expression of marker genes. To optimize f-CNTs as gene delivery vehicles, it is essential to characterize their interactions with DNA. In the present report, we study the interactions of three types of f-CNTs, ammonium-functionalized single-walled and multiwalled carbon nanotubes (SWNT-NH3+; MWNT-NH3+), and lysine-functionalized single-walled carbon nanotubes (SWNT-Lys-NH3+), with plasmid DNA. Nanotube-DNA complexes were analyzed by scanning electron microscopy, surface plasmon resonance, PicoGreen dye exclusion, and agarose gel shift assay. The results indicate that all three types of cationic carbon nanotubes are able to condense DNA to varying degrees, indicating that both nanotube surface area and charge density are critical parameters that determine the interaction and electrostatic complex formation between f-CNTs with DNA. All three different f-CNT types in this study exhibited upregulation of marker gene expression over naked DNA using a mammalian (human) cell line. Differences in the levels of gene expression were correlated with the structural and biophysical data obtained for the f-CNT:DNA complexes to suggest that large surface area leading to very efficient DNA condensation is not necessary for effective gene transfer. However, it will require further investigation to determine whether the degree of binding and tight association between DNA and nanotubes is a desirable trait to increase gene expression efficiency in vitro or in vivo. This study constitutes the first thorough investigation into the physicochemical interactions between cationic functionalized carbon nanotubes and DNA toward construction of carbon nanotube-based gene transfer vector systems.
Characterization of WRKY transcription factors in Solanum lycopersicum reveals collinearity and their expression patterns under cold treatment.

PubMed

Chen, Lin; Yang, Yang; Liu, Can; Zheng, Yanyan; Xu, Mingshuang; Wu, Na; Sheng, Jiping; Shen, Lin

2015-08-28

WRKY transcription factors play an important role in cold defense of plants. However, little information is available about the cold-responsive WRKYs in tomato (Solanum lycopersicum). In the present study, a complete characterization of this gene family was described. Eighty WRKY genes in the tomato genome were identified. Almost all WRKY genes contain putative stress-responsive cis-elements in their promoter regions. Segmental duplications contributed significantly to the expansion of the SlWRKY gene family. Transcriptional analysis revealed notable differential expression in tomato tissues and expression patterns under cold stress, which indicated wide functional divergence in this family. Ten WRKYs in tomato were strongly induced more than 2-fold during cold stress. These genes represented candidate genes for future functional analysis of WRKYs involved in the cold-related signal pathways. Our data provide valuable information about tomato WRKY proteins and form a foundation for future studies of these proteins, especially for those that play an important role in response to cold stress. Copyright © 2015 Elsevier Inc. All rights reserved.
Secondary metabolism in Fusarium fujikuroi: strategies to unravel the function of biosynthetic pathways.

PubMed

Janevska, Slavica; Tudzynski, Bettina

2018-01-01

The fungus Fusarium fujikuroi causes bakanae disease of rice due to its ability to produce the plant hormones, the gibberellins. The fungus is also known for producing harmful mycotoxins (e.g., fusaric acid and fusarins) and pigments (e.g., bikaverin and fusarubins). However, for a long time, most of these well-known products could not be linked to biosynthetic gene clusters. Recent genome sequencing has revealed altogether 47 putative gene clusters. Most of them were orphan clusters for which the encoded natural product(s) were unknown. In this review, we describe the current status of our research on identification and functional characterizations of novel secondary metabolite gene clusters. We present several examples where linking known metabolites to the respective biosynthetic genes has been achieved and describe recent strategies and methods to access new natural products, e.g., by genetic manipulation of pathway-specific or global transcritption factors. In addition, we demonstrate that deletion and over-expression of histone-modifying genes is a powerful tool to activate silent gene clusters and to discover their products.
A platform for rapid prototyping of synthetic gene networks in mammalian cells

PubMed Central

Duportet, Xavier; Wroblewska, Liliana; Guye, Patrick; Li, Yinqing; Eyquem, Justin; Rieders, Julianne; Rimchala, Tharathorn; Batt, Gregory; Weiss, Ron

2014-01-01

Mammalian synthetic biology may provide novel therapeutic strategies, help decipher new paths for drug discovery and facilitate synthesis of valuable molecules. Yet, our capacity to genetically program cells is currently hampered by the lack of efficient approaches to streamline the design, construction and screening of synthetic gene networks. To address this problem, here we present a framework for modular and combinatorial assembly of functional (multi)gene expression vectors and their efficient and specific targeted integration into a well-defined chromosomal context in mammalian cells. We demonstrate the potential of this framework by assembling and integrating different functional mammalian regulatory networks including the largest gene circuit built and chromosomally integrated to date (6 transcription units, 27kb) encoding an inducible memory device. Using a library of 18 different circuits as a proof of concept, we also demonstrate that our method enables one-pot/single-flask chromosomal integration and screening of circuit libraries. This rapid and powerful prototyping platform is well suited for comparative studies of genetic regulatory elements, genes and multi-gene circuits as well as facile development of libraries of isogenic engineered cell lines. PMID:25378321
Functional diversification of the dehydrin gene family in apple and its contribution to cold acclimation during dormancy.

PubMed

Falavigna, Vítor da Silveira; Miotto, Yohanna Evelyn; Porto, Diogo Denardi; Anzanello, Rafael; Santos, Henrique Pessoa dos; Fialho, Flávio Bello; Margis-Pinheiro, Márcia; Pasquali, Giancarlo; Revers, Luís Fernando

2015-11-01

Dehydrins (DHN) are proteins involved in plant adaptive responses to abiotic stresses, mainly dehydration. Several studies in perennial crops have linked bud dormancy progression, a process characterized by the inability to initiate growth from meristems under favorable conditions, with DHN gene expression. However, an in-depth characterization of DHNs during bud dormancy progression is still missing. An extensive in silico characterization of the apple DHN gene family was performed. Additionally, we used five different experiments that generated samples with different dormancy status, including genotypes with contrasting dormancy traits, to analyze how DHN genes are being regulated during bud dormancy progression in apple by real-time quantitative polymerase chain reaction (RT-qPCR). Duplication events took place in the diversification of apple DHN family. Additionally, MdDHN genes presented tissue- and bud dormant-specific expression patterns. Our results indicate that MdDHN genes are highly divergent in function, with overlapping levels, and that their expressions are fine-tuned by the environment during the dormancy process in apple. © 2015 Scandinavian Plant Physiology Society.
Mycobacterium tuberculosis Exploits a Molecular Off Switch of the Immune System for Intracellular Survival.

PubMed

von Both, Ulrich; Berk, Maurice; Agapow, Paul-Michael; Wright, Joseph D; Git, Anna; Hamilton, Melissa Shea; Goldgof, Greg; Siddiqui, Nazneen; Bellos, Evangelos; Wright, Victoria J; Coin, Lachlan J; Newton, Sandra M; Levin, Michael

2018-01-12

Mycobacterium tuberculosis (M. tuberculosis) survives and multiplies inside human macrophages by subversion of immune mechanisms. Although these immune evasion strategies are well characterised functionally, the underlying molecular mechanisms are poorly understood. Here we show that during infection of human whole blood with M. tuberculosis, host gene transcriptional suppression, rather than activation, is the predominant response. Spatial, temporal and functional characterisation of repressed genes revealed their involvement in pathogen sensing and phagocytosis, degradation within the phagolysosome and antigen processing and presentation. To identify mechanisms underlying suppression of multiple immune genes we undertook epigenetic analyses. We identified significantly differentially expressed microRNAs with known targets in suppressed genes. In addition, after searching regions upstream of the start of transcription of suppressed genes for common sequence motifs, we discovered novel enriched composite sequence patterns, which corresponded to Alu repeat elements, transposable elements known to have wide ranging influences on gene expression. Our findings suggest that to survive within infected cells, mycobacteria exploit a complex immune "molecular off switch" controlled by both microRNAs and Alu regulatory elements.
SoFoCles: feature filtering for microarray classification based on gene ontology.

PubMed

Papachristoudis, Georgios; Diplaris, Sotiris; Mitkas, Pericles A

2010-02-01

Marker gene selection has been an important research topic in the classification analysis of gene expression data. Current methods try to reduce the "curse of dimensionality" by using statistical intra-feature set calculations, or classifiers that are based on the given dataset. In this paper, we present SoFoCles, an interactive tool that enables semantic feature filtering in microarray classification problems with the use of external, well-defined knowledge retrieved from the Gene Ontology. The notion of semantic similarity is used to derive genes that are involved in the same biological path during the microarray experiment, by enriching a feature set that has been initially produced with legacy methods. Among its other functionalities, SoFoCles offers a large repository of semantic similarity methods that are used in order to derive feature sets and marker genes. The structure and functionality of the tool are discussed in detail, as well as its ability to improve classification accuracy. Through experimental evaluation, SoFoCles is shown to outperform other classification schemes in terms of classification accuracy in two real datasets using different semantic similarity computation approaches.
A gene catalogue of the Sprague-Dawley rat gut metagenome.

PubMed

Pan, Hudan; Guo, Ruijin; Zhu, Jie; Wang, Qi; Ju, Yanmei; Xie, Ying; Zheng, Yanfang; Wang, Zhifeng; Li, Ting; Liu, Zhongqiu; Lu, Linlin; Li, Fei; Tong, Bin; Xiao, Liang; Xu, Xun; Li, Runze; Yuan, Zhongwen; Yang, Huanming; Wang, Jian; Kristiansen, Karsten; Jia, Huijue; Liu, Liang

2018-05-01

Laboratory rats such as the Sprague-Dawley (SD) rats are an important model for biomedical studies in relation to human physiological or pathogenic processes. Here we report the first catalog of microbial genes in fecal samples from Sprague-Dawley rats. The catalog was established using 98 fecal samples from 49 SD rats, divided in 7 experimental groups, and collected at different time points 30 days apart. The established gene catalog comprises 5,130,167 non-redundant genes with an average length of 750 bp, among which 64.6% and 26.7% were annotated to phylum and genus levels, respectively. Functionally, 53.1%, 21.8%,and 31% of the genes could be annotated to KEGG orthologous groups, modules, and pathways, respectively. A comparison of rat gut metagenome catalogue with human or mouse revealed a higher pairwise overlap between rats and humans (2.47%) than between mice and humans (1.19%) at the gene level. Ninety-seven percent of the functional pathways in the human catalog were present in the rat catalogue, underscoring the potential use of rats for biomedical research.
A Compendium of Canine Normal Tissue Gene Expression

PubMed Central

Chen, Qing-Rong; Wen, Xinyu; Khan, Javed; Khanna, Chand

2011-01-01

Background Our understanding of disease is increasingly informed by changes in gene expression between normal and abnormal tissues. The release of the canine genome sequence in 2005 provided an opportunity to better understand human health and disease using the dog as clinically relevant model. Accordingly, we now present the first genome-wide, canine normal tissue gene expression compendium with corresponding human cross-species analysis. Methodology/Principal Findings The Affymetrix platform was utilized to catalogue gene expression signatures of 10 normal canine tissues including: liver, kidney, heart, lung, cerebrum, lymph node, spleen, jejunum, pancreas and skeletal muscle. The quality of the database was assessed in several ways. Organ defining gene sets were identified for each tissue and functional enrichment analysis revealed themes consistent with known physio-anatomic functions for each organ. In addition, a comparison of orthologous gene expression between matched canine and human normal tissues uncovered remarkable similarity. To demonstrate the utility of this dataset, novel canine gene annotations were established based on comparative analysis of dog and human tissue selective gene expression and manual curation of canine probeset mapping. Public access, using infrastructure identical to that currently in use for human normal tissues, has been established and allows for additional comparisons across species. Conclusions/Significance These data advance our understanding of the canine genome through a comprehensive analysis of gene expression in a diverse set of tissues, contributing to improved functional annotation that has been lacking. Importantly, it will be used to inform future studies of disease in the dog as a model for human translational research and provides a novel resource to the community at large. PMID:21655323
Genome-wide identification of the MADS-box transcription factor family in pear (Pyrus bretschneideri) reveals evolution and functional divergence.

PubMed

Wang, Runze; Ming, Meiling; Li, Jiaming; Shi, Dongqing; Qiao, Xin; Li, Leiting; Zhang, Shaoling; Wu, Jun

2017-01-01

MADS-box transcription factors play significant roles in plant developmental processes such as floral organ conformation, flowering time, and fruit development. Pear ( Pyrus ), as the third-most crucial temperate fruit crop, has been fully sequenced. However, there is limited information about the MADS family and its functional divergence in pear. In this study, a total of 95 MADS-box genes were identified in the pear genome, and classified into two types by phylogenetic analysis. Type I MADS-box genes were divided into three subfamilies and type II genes into 14 subfamilies. Synteny analysis suggested that whole-genome duplications have played key roles in the expansion of the MADS family, followed by rearrangement events. Purifying selection was the primary force driving MADS-box gene evolution in pear, and one gene pairs presented three codon sites under positive selection. Full-scale expression information for PbrMADS genes in vegetative and reproductive organs was provided and proved by transcriptional and reverse transcription PCR analysis. Furthermore, the PbrMADS11(12) gene, together with partners PbMYB10 and PbbHLH3 was confirmed to activate the promoters of the structural genes in anthocyanin pathway of red pear through dual luciferase assay. In addition, the PbrMADS11 and PbrMADS12 were deduced involving in the regulation of anthocyanin synthesis response to light and temperature changes. These results provide a solid foundation for future functional analysis of PbrMADS genes in different biological processes, especially of pigmentation in pear.
Genome-wide identification of the MADS-box transcription factor family in pear (Pyrus bretschneideri) reveals evolution and functional divergence

PubMed Central

Li, Jiaming; Shi, Dongqing; Qiao, Xin; Li, Leiting; Zhang, Shaoling

2017-01-01

MADS-box transcription factors play significant roles in plant developmental processes such as floral organ conformation, flowering time, and fruit development. Pear (Pyrus), as the third-most crucial temperate fruit crop, has been fully sequenced. However, there is limited information about the MADS family and its functional divergence in pear. In this study, a total of 95 MADS-box genes were identified in the pear genome, and classified into two types by phylogenetic analysis. Type I MADS-box genes were divided into three subfamilies and type II genes into 14 subfamilies. Synteny analysis suggested that whole-genome duplications have played key roles in the expansion of the MADS family, followed by rearrangement events. Purifying selection was the primary force driving MADS-box gene evolution in pear, and one gene pairs presented three codon sites under positive selection. Full-scale expression information for PbrMADS genes in vegetative and reproductive organs was provided and proved by transcriptional and reverse transcription PCR analysis. Furthermore, the PbrMADS11(12) gene, together with partners PbMYB10 and PbbHLH3 was confirmed to activate the promoters of the structural genes in anthocyanin pathway of red pear through dual luciferase assay. In addition, the PbrMADS11 and PbrMADS12 were deduced involving in the regulation of anthocyanin synthesis response to light and temperature changes. These results provide a solid foundation for future functional analysis of PbrMADS genes in different biological processes, especially of pigmentation in pear. PMID:28924499

Hundreds of Genes Experienced Convergent Shifts in Selective Pressure in Marine Mammals

PubMed Central

Chikina, Maria; Robinson, Joseph D.; Clark, Nathan L.

2016-01-01

Abstract Mammal species have made the transition to the marine environment several times, and their lineages represent one of the classical examples of convergent evolution in morphological and physiological traits. Nevertheless, the genetic mechanisms of their phenotypic transition are poorly understood, and investigations into convergence at the molecular level have been inconclusive. While past studies have searched for convergent changes at specific amino acid sites, we propose an alternative strategy to identify those genes that experienced convergent changes in their selective pressures, visible as changes in evolutionary rate specifically in the marine lineages. We present evidence of widespread convergence at the gene level by identifying parallel shifts in evolutionary rate during three independent episodes of mammalian adaptation to the marine environment. Hundreds of genes accelerated their evolutionary rates in all three marine mammal lineages during their transition to aquatic life. These marine-accelerated genes are highly enriched for pathways that control recognized functional adaptations in marine mammals, including muscle physiology, lipid-metabolism, sensory systems, and skin and connective tissue. The accelerations resulted from both adaptive evolution as seen in skin and lung genes, and loss of function as in gustatory and olfactory genes. In regard to sensory systems, this finding provides further evidence that reduced senses of taste and smell are ubiquitous in marine mammals. Our analysis demonstrates the feasibility of identifying genes underlying convergent organism-level characteristics on a genome-wide scale and without prior knowledge of adaptations, and provides a powerful approach for investigating the physiological functions of mammalian genes. PMID:27329977
Simple Shared Motifs (SSM) in conserved region of promoters: a new approach to identify co-regulation patterns.

PubMed

Gruel, Jérémy; LeBorgne, Michel; LeMeur, Nolwenn; Théret, Nathalie

2011-09-12

Regulation of gene expression plays a pivotal role in cellular functions. However, understanding the dynamics of transcription remains a challenging task. A host of computational approaches have been developed to identify regulatory motifs, mainly based on the recognition of DNA sequences for transcription factor binding sites. Recent integration of additional data from genomic analyses or phylogenetic footprinting has significantly improved these methods. Here, we propose a different approach based on the compilation of Simple Shared Motifs (SSM), groups of sequences defined by their length and similarity and present in conserved sequences of gene promoters. We developed an original algorithm to search and count SSM in pairs of genes. An exceptional number of SSM is considered as a common regulatory pattern. The SSM approach is applied to a sample set of genes and validated using functional gene-set enrichment analyses. We demonstrate that the SSM approach selects genes that are over-represented in specific biological categories (Ontology and Pathways) and are enriched in co-expressed genes. Finally we show that genes co-expressed in the same tissue or involved in the same biological pathway have increased SSM values. Using unbiased clustering of genes, Simple Shared Motifs analysis constitutes an original contribution to provide a clearer definition of expression networks.
Simple Shared Motifs (SSM) in conserved region of promoters: a new approach to identify co-regulation patterns

PubMed Central

2011-01-01

Background Regulation of gene expression plays a pivotal role in cellular functions. However, understanding the dynamics of transcription remains a challenging task. A host of computational approaches have been developed to identify regulatory motifs, mainly based on the recognition of DNA sequences for transcription factor binding sites. Recent integration of additional data from genomic analyses or phylogenetic footprinting has significantly improved these methods. Results Here, we propose a different approach based on the compilation of Simple Shared Motifs (SSM), groups of sequences defined by their length and similarity and present in conserved sequences of gene promoters. We developed an original algorithm to search and count SSM in pairs of genes. An exceptional number of SSM is considered as a common regulatory pattern. The SSM approach is applied to a sample set of genes and validated using functional gene-set enrichment analyses. We demonstrate that the SSM approach selects genes that are over-represented in specific biological categories (Ontology and Pathways) and are enriched in co-expressed genes. Finally we show that genes co-expressed in the same tissue or involved in the same biological pathway have increased SSM values. Conclusions Using unbiased clustering of genes, Simple Shared Motifs analysis constitutes an original contribution to provide a clearer definition of expression networks. PMID:21910886
Differential gene expression profiles of peripheral blood mononuclear cells in childhood asthma.

PubMed

Kong, Qian; Li, Wen-Jing; Huang, Hua-Rong; Zhong, Ying-Qiang; Fang, Jian-Pei

2015-05-01

Asthma is a common childhood disease with strong genetic components. This study compared whole-genome expression differences between asthmatic young children and healthy controls to identify gene signatures of childhood asthma. Total RNA extracted from peripheral blood mononuclear cells (PBMC) was subjected to microarray analysis. QRT-PCR was performed to verify the microarray results. Classification and functional characterization of differential genes were illustrated by hierarchical clustering and gene ontology analysis. Multiple logistic regression (MLR) analysis, receiver operating characteristic (ROC) curve analysis, and discriminate power were used to scan asthma-specific diagnostic markers. For fold-change>2 and p < 0.05, there were 758 named differential genes. The results of QRT-PCR confirmed successfully the array data. Hierarchical clustering divided 29 highly possible genes into seven categories and the genes in the same cluster were likely to possess similar expression patterns or functions. Gene ontology analysis presented that differential genes primarily enriched in immune response, response to stress or stimulus, and regulation of apoptosis in biological process. MLR and ROC curve analysis revealed that the combination of ADAM33, Smad7, and LIGHT possessed excellent discriminating power. The combination of ADAM33, Smad7, and LIGHT would be a reliable and useful childhood asthma model for prediction and diagnosis.
EcoGene 3.0

PubMed Central

Zhou, Jindan; Rudd, Kenneth E.

2013-01-01

EcoGene (http://ecogene.org) is a database and website devoted to continuously improving the structural and functional annotation of Escherichia coli K-12, one of the most well understood model organisms, represented by the MG1655(Seq) genome sequence and annotations. Major improvements to EcoGene in the past decade include (i) graphic presentations of genome map features; (ii) ability to design Boolean queries and Venn diagrams from EcoArray, EcoTopics or user-provided GeneSets; (iii) the genome-wide clone and deletion primer design tool, PrimerPairs; (iv) sequence searches using a customized EcoBLAST; (v) a Cross Reference table of synonymous gene and protein identifiers; (vi) proteome-wide indexing with GO terms; (vii) EcoTools access to >2000 complete bacterial genomes in EcoGene-RefSeq; (viii) establishment of a MySql relational database; and (ix) use of web content management systems. The biomedical literature is surveyed daily to provide citation and gene function updates. As of September 2012, the review of 37 397 abstracts and articles led to creation of 98 425 PubMed-Gene links and 5415 PubMed-Topic links. Annotation updates to Genbank U00096 are transmitted from EcoGene to NCBI. Experimental verifications include confirmation of a CTG start codon, pseudogene restoration and quality assurance of the Keio strain collection. PMID:23197660
Genome-Wide Evolutionary Characterization and Expression Analyses of WRKY Family Genes in Brachypodium distachyon

PubMed Central

Wen, Feng; Zhu, Hong; Li, Peng; Jiang, Min; Mao, Wenqing; Ong, Chermaine; Chu, Zhaoqing

2014-01-01

Members of plant WRKY gene family are ancient transcription factors that function in plant growth and development and respond to biotic and abiotic stresses. In our present study, we have investigated WRKY family genes in Brachypodium distachyon, a new model plant of family Poaceae. We identified a total of 86 WRKY genes from B. distachyon and explored their chromosomal distribution and evolution, domain alignment, promoter cis-elements, and expression profiles. Combining the analysis of phylogenetic tree of BdWRKY genes and the result of expression profiling, results showed that most of clustered gene pairs had higher similarities in the WRKY domain, suggesting that they might be functionally redundant. Neighbour-joining analysis of 301 WRKY domains from Oryza sativa, Arabidopsis thaliana, and B. distachyon suggested that BdWRKY domains are evolutionarily more closely related to O. sativa WRKY domains than those of A. thaliana. Moreover, tissue-specific expression profile of BdWRKY genes and their responses to phytohormones and several biotic or abiotic stresses were analysed by quantitative real-time PCR. The results showed that the expression of BdWRKY genes was rapidly regulated by stresses and phytohormones, and there was a strong correlation between promoter cis-elements and the phytohormones-induced BdWRKY gene expression. PMID:24453041
EcoGene 3.0.

PubMed

Zhou, Jindan; Rudd, Kenneth E

2013-01-01

EcoGene (http://ecogene.org) is a database and website devoted to continuously improving the structural and functional annotation of Escherichia coli K-12, one of the most well understood model organisms, represented by the MG1655(Seq) genome sequence and annotations. Major improvements to EcoGene in the past decade include (i) graphic presentations of genome map features; (ii) ability to design Boolean queries and Venn diagrams from EcoArray, EcoTopics or user-provided GeneSets; (iii) the genome-wide clone and deletion primer design tool, PrimerPairs; (iv) sequence searches using a customized EcoBLAST; (v) a Cross Reference table of synonymous gene and protein identifiers; (vi) proteome-wide indexing with GO terms; (vii) EcoTools access to >2000 complete bacterial genomes in EcoGene-RefSeq; (viii) establishment of a MySql relational database; and (ix) use of web content management systems. The biomedical literature is surveyed daily to provide citation and gene function updates. As of September 2012, the review of 37 397 abstracts and articles led to creation of 98 425 PubMed-Gene links and 5415 PubMed-Topic links. Annotation updates to Genbank U00096 are transmitted from EcoGene to NCBI. Experimental verifications include confirmation of a CTG start codon, pseudogene restoration and quality assurance of the Keio strain collection.
The molecular genetics of the telomere biology disorders.

PubMed

Bertuch, Alison A

2016-08-02

The importance of telomere function for human health is exemplified by a collection of Mendelian disorders referred to as the telomere biology disorders (TBDs), telomeropathies, or syndromes of telomere shortening. Collectively, the TBDs cover a spectrum of conditions from multisystem disease presenting in infancy to isolated disease presentations in adulthood, most notably idiopathic pulmonary fibrosis. Eleven genes have been found mutated in the TBDs to date, each of which is linked to some aspect of telomere maintenance. This review summarizes the molecular defects that result from mutations in these genes, highlighting recent advances, including the addition of PARN to the TBD gene family and the discovery of heterozygous mutations in RTEL1 as a cause of familial pulmonary fibrosis.
Coexistence of multiple globin genes conferring protection against nitrosative stress to the Antarctic bacterium Pseudoalteromonas haloplanktis TAC125.

PubMed

Coppola, Daniela; Giordano, Daniela; Milazzo, Lisa; Howes, Barry D; Ascenzi, Paolo; di Prisco, Guido; Smulevich, Giulietta; Poole, Robert K; Verde, Cinzia

2018-02-28

Despite the large number of globins recently discovered in bacteria, our knowledge of their physiological functions is restricted to only a few examples. In the microbial world, globins appear to perform multiple roles in addition to the reversible binding of oxygen; all these functions are attributable to the heme pocket that dominates functional properties. Resistance to nitrosative stress and involvement in oxygen chemistry seem to be the most prevalent functions for bacterial globins, although the number of globins for which functional roles have been studied via mutation and genetic complementation is very limited. The acquisition of structural information has considerably outpaced the physiological and molecular characterisation of these proteins. The genome of the Antarctic cold-adapted bacterium Pseudoalteromonas haloplanktis TAC125 (PhTAC125) contains genes encoding three distinct single-chain 2/2 globins, supporting the hypothesis of their crucial involvement in a number of functions, including protection against oxidative and nitrosative stress in the cold and O 2 -rich environment. In the genome of PhTAC125, the genes encoding 2/2 globins are constitutively transcribed, thus suggesting that these globins are not functionally redundant in their physiological function in PhTAC125. In the present study, the physiological role of one of the 2/2 globins, Ph-2/2HbO-2217, was investigated by integrating in vivo and in vitro results. This role includes the involvement in the detoxification of reactive nitrogen and O 2 species including NO by developing two in vivo and in vitro models to highlight the protective role of Ph-2/2HbO-2217 against reactive nitrogen species. The PSHAa2217 gene was cloned and over-expressed in the flavohemoglobin-deficient mutant of Escherichia coli and the growth properties and O 2 uptake in the presence of NO of the mutant carrying the PSHAa2217 gene were analysed. The ferric form of Ph-2/2HbO-2217 is able to catalyse peroxynitrite isomerisation in vitro, indicating its potential role in the scavenging of reactive nitrogen species. Here we present in vitro evidence for the detoxification of NO by Ph-2/2HbO-2217. Copyright © 2017. Published by Elsevier Inc.
A case report of recessive myotonia congenita and early onset cognitive impairment: Is it a causal or casual link?

PubMed

Portaro, Simona; Cacciola, Alberto; Naro, Antonino; Milardi, Demetrio; Morabito, Rosa; Corallo, Francesco; Marino, Silvia; Bramanti, Alessia; Mazzon, Emanuela; Calabrò, Rocco Salvatore

2018-06-01

Myotonia congenita (MC) is a non-dystrophic myotonia inherited either in dominant (Thomsen) or recessive (Becker) form. MC is due to an abnormal functioning of skeletal muscle voltage-gated chloride channel (CLCN1), but the genotype/phenotype correlation remains unclear. A 48-year-old man, from consanguineous parents, presented with a fixed muscle weakness, muscle atrophy, and a cognitive impairment. Notably, his brother presented the same mutation but with a different phenotype, mainly involving cognitive function. The patient was submitted to cognitive assessment, needle electromyography, brain and muscle MRI, and genetic analysis. The Milan Overall Dementia Assessment showed short-term memory, verbal fluency and verbal intelligence impairment. His genetic analysis showed a recessive splice-site mutation in the CLCN1 gene (IVS19+2T>A). Muscle MRI revealed a symmetric and bilateral fat infiltration of the tensor of fascia lata, gluteus medius, and gluteus maximus muscles, associated to mild atrophy. Recessive myotonia congenita was diagnosed. Further studies should establish if and to which extent the CLCN1 mutation is responsible for this c MC phenotype, taking into account a gene-gene and /or a gene-environment.
Functional genomics platform for pooled screening and mammalian genetic interaction maps

PubMed Central

Kampmann, Martin; Bassik, Michael C.; Weissman, Jonathan S.

2014-01-01

Systematic genetic interaction maps in microorganisms are powerful tools for identifying functional relationships between genes and defining the function of uncharacterized genes. We have recently implemented this strategy in mammalian cells as a two-stage approach. First, genes of interest are robustly identified in a pooled genome-wide screen using complex shRNA libraries. Second, phenotypes for all pairwise combinations of hit genes are measured in a double-shRNA screen and used to construct a genetic interaction map. Our protocol allows for rapid pooled screening under various conditions without a requirement for robotics, in contrast to arrayed approaches. Each stage of the protocol can be implemented in ~2 weeks, with additional time for analysis and generation of reagents. We discuss considerations for screen design, and present complete experimental procedures as well as a full computational analysis suite for identification of hits in pooled screens and generation of genetic interaction maps. While the protocols outlined here were developed for our original shRNA-based approach, they can be applied more generally, including to CRISPR-based approaches. PMID:24992097
Functional expression of a cattle MHC class II DR-like antigen on mouse L cells

DOE Office of Scientific and Technical Information (OSTI.GOV)

Fraser, D.C.; Craigmile, S.; Campbell, J.D.M.

1996-09-01

Cattle DRA and DRB genes, cloned by reverse-transcription polymerase chain reaction, were transfected into mouse L cells. The cattle DR-expressing L-cell transfectant generated was analyzed serologically, biochemically, and functionally. Sequence analysis of the transfected DRB gene clearly showed showed that it was DRB3 allele DRB3*0101, which corresponds to the 1D-IEF-determined allele DRBF3. 1D-IEF analysis of the tranfectant confirmed that the expressed DR product was DRBF3. Functional integrity of the transfected gene products was demonstrated by the ability of the transfectant cell line to present two antigens (the foot-and-mouth disease virus-derived peptide FMDV15, and ovalbumin) to antigen-specific CD4{sup +} T cellsmore » from both the original animal used to obtain the genes, and also from an unrelated DRBF3{sup +} heterozygous animal. Such transfectants will be invaluable tools, allowing us to dissect the precise contributions each locus product makes to the overall immune response in heterozygous animals, information essential for rational vaccine design. 45 refs., 5 figs., 1 tab.« less
Functional characterization of the vitellogenin promoter in the silkworm, Bombyx mori.

PubMed

Xu, J; Wang, Y Q; Li, Z Q; Ling, L; Zeng, B S; You, L; Chen, Y Z; Aslam, A F M; Huang, Y P; Tan, A J

2014-10-01

Genetic transformation and genome editing technologies have been successfully established in the lepidopteran insect model, the domesticated silkworm, Bombyx mori, providing great potential for functional genomics and practical applications. However, the current lack of cis-regulatory elements in B. mori gene manipulation research limits further exploitation in functional gene analysis. In the present study, we characterized a B. mori endogenous promoter, Bmvgp, which is a 798-bp DNA sequence adjacent to the 5'-end of the vitellogenin gene (Bmvg). PiggyBac-based transgenic analysis shows that Bmvgp precisely directs expression of a reporter gene, enhanced green fluorescent protein (EGFP), in a sex-, tissue- and stage-specific manner. In transgenic animals, EGFP expression can be detected in the female fat body from larval-pupal ecdysis to the following pupal and adult stage. Furthermore, in vitro and in vivo experiments revealed that EGFP expression can be activated by 20-hydroxyecdysone, which is consistent with endogenous Bmvg expression. These data indicate that Bmvgp is an effective endogenous cis-regulatory element in B. mori. © 2014 The Royal Entomological Society.
Synthetic analog computation in living cells.

PubMed

Daniel, Ramiz; Rubens, Jacob R; Sarpeshkar, Rahul; Lu, Timothy K

2013-05-30

A central goal of synthetic biology is to achieve multi-signal integration and processing in living cells for diagnostic, therapeutic and biotechnology applications. Digital logic has been used to build small-scale circuits, but other frameworks may be needed for efficient computation in the resource-limited environments of cells. Here we demonstrate that synthetic analog gene circuits can be engineered to execute sophisticated computational functions in living cells using just three transcription factors. Such synthetic analog gene circuits exploit feedback to implement logarithmically linear sensing, addition, ratiometric and power-law computations. The circuits exhibit Weber's law behaviour as in natural biological systems, operate over a wide dynamic range of up to four orders of magnitude and can be designed to have tunable transfer functions. Our circuits can be composed to implement higher-order functions that are well described by both intricate biochemical models and simple mathematical functions. By exploiting analog building-block functions that are already naturally present in cells, this approach efficiently implements arithmetic operations and complex functions in the logarithmic domain. Such circuits may lead to new applications for synthetic biology and biotechnology that require complex computations with limited parts, need wide-dynamic-range biosensing or would benefit from the fine control of gene expression.
Cancerouspdomains: comprehensive analysis of cancer type-specific recurrent somatic mutations in proteins and domains.

PubMed

Hashemi, Seirana; Nowzari Dalini, Abbas; Jalali, Adrin; Banaei-Moghaddam, Ali Mohammad; Razaghi-Moghadam, Zahra

2017-08-16

Discriminating driver mutations from the ones that play no role in cancer is a severe bottleneck in elucidating molecular mechanisms underlying cancer development. Since protein domains are representatives of functional regions within proteins, mutations on them may disturb the protein functionality. Therefore, studying mutations at domain level may point researchers to more accurate assessment of the functional impact of the mutations. This article presents a comprehensive study to map mutations from 29 cancer types to both sequence- and structure-based domains. Statistical analysis was performed to identify candidate domains in which mutations occur with high statistical significance. For each cancer type, the corresponding type-specific domains were distinguished among all candidate domains. Subsequently, cancer type-specific domains facilitated the identification of specific proteins for each cancer type. Besides, performing interactome analysis on specific proteins of each cancer type showed high levels of interconnectivity among them, which implies their functional relationship. To evaluate the role of mitochondrial genes, stem cell-specific genes and DNA repair genes in cancer development, their mutation frequency was determined via further analysis. This study has provided researchers with a publicly available data repository for studying both CATH and Pfam domain regions on protein-coding genes. Moreover, the associations between different groups of genes/domains and various cancer types have been clarified. The work is available at http://www.cancerouspdomains.ir .
Partial kinetoplast-mitochondrial gene organization and expression in the respiratory deficient plant trypanosomatid Phytomonas serpens.

PubMed

Maslov, D A; Nawathean, P; Scheel, J

1999-04-30

In plant-dwelling trypanosomatids from the genus Phytomonas, mitochondrial functions, such as cytochrome mediated respiration, ATP production and Krebs cycle, are missing, and cell energetics is based on the glycolysis. Using Blue Native/Tricine-SDS two-dimensional gel electrophoretic analysis, we observed that mitochondrial respiratory Complexes III (cytochrome bc1) and IV (cytochrome c oxidase) were absent in Phytomonas serpens; however, Complex V (ATPase) was present. A deletion of the genes for cytochrome c oxidase subunit III (COIII) and apocytochrome b (Cyb) was identified within the 6234 bp sequenced region of the 31 kb maxicircle kinetoplast DNA. Genes, found in this region, include 12S and 9S ribosomal RNAs, subunits 7, 8 and 9 of NADH dehydrogenase (ND7, ND8 and ND9) and subunit 6 of ATPase (A6 or MURF4), as well as the genes (MURF1, MURF5 and G3) with unknown function. Most genes are actively transcribed and some mRNAs are edited. Fully edited mRNAs for A6 and G3 were abundant, while edited ND7 transcripts were rare, and only partially edited and pre-edited transcripts for ND8 were detected. The data show that the mitochondrial genome of P. serpens is functional, although its functions may be limited to expressing the ATPase and, possibly, NADH dehydrogenase complexes.
The protocadherin 17 gene affects cognition, personality, amygdala structure and function, synapse development and risk of major mood disorders.

PubMed

Chang, H; Hoshina, N; Zhang, C; Ma, Y; Cao, H; Wang, Y; Wu, D-D; Bergen, S E; Landén, M; Hultman, C M; Preisig, M; Kutalik, Z; Castelao, E; Grigoroiu-Serbanescu, M; Forstner, A J; Strohmaier, J; Hecker, J; Schulze, T G; Müller-Myhsok, B; Reif, A; Mitchell, P B; Martin, N G; Schofield, P R; Cichon, S; Nöthen, M M; Walter, H; Erk, S; Heinz, A; Amin, N; van Duijn, C M; Meyer-Lindenberg, A; Tost, H; Xiao, X; Yamamoto, T; Rietschel, M; Li, M

2018-02-01

Major mood disorders, which primarily include bipolar disorder and major depressive disorder, are the leading cause of disability worldwide and pose a major challenge in identifying robust risk genes. Here, we present data from independent large-scale clinical data sets (including 29 557 cases and 32 056 controls) revealing brain expressed protocadherin 17 (PCDH17) as a susceptibility gene for major mood disorders. Single-nucleotide polymorphisms (SNPs) spanning the PCDH17 region are significantly associated with major mood disorders; subjects carrying the risk allele showed impaired cognitive abilities, increased vulnerable personality features, decreased amygdala volume and altered amygdala function as compared with non-carriers. The risk allele predicted higher transcriptional levels of PCDH17 mRNA in postmortem brain samples, which is consistent with increased gene expression in patients with bipolar disorder compared with healthy subjects. Further, overexpression of PCDH17 in primary cortical neurons revealed significantly decreased spine density and abnormal dendritic morphology compared with control groups, which again is consistent with the clinical observations of reduced numbers of dendritic spines in the brains of patients with major mood disorders. Given that synaptic spines are dynamic structures which regulate neuronal plasticity and have crucial roles in myriad brain functions, this study reveals a potential underlying biological mechanism of a novel risk gene for major mood disorders involved in synaptic function and related intermediate phenotypes.
Analysis of functional polymorphisms in three synaptic plasticity-related genes (BDNF, COMT AND UCHL1) in Alzheimer's disease in Colombia.

PubMed

Forero, Diego A; Benítez, Bruno; Arboleda, Gonzalo; Yunis, Juan J; Pardo, Rodrigo; Arboleda, Humberto

2006-07-01

In recent years, it has been proposed that synaptic dysfunction may be an important etiological factor for Alzheimer's disease (AD). This hypothesis has important implications for the analysis of AD genetic risk in case-control studies. In the present work, we analyzed common functional polymorphisms in three synaptic plasticity-related genes (brain-derived neurotrophic factor, BDNF Val66Met; catechol-O-methyl transferase, COMT Val158; ubiquitin carboxyl-terminal hydroxylase, UCHL1 S18Y) in a sample of 102 AD cases and 168 age and sex matched controls living in Bogotá, Colombia. There was not association between UCHL1 polymorphism and AD in our sample. We have found an initial association with BDNF polymorphism in familial cases and with COMT polymorphism in male and sporadic patients. These initial associations were lost after Bonferroni correction for multiple testing. Unadjusted results may be compatible with the expected functional effect of variations in these genes on pathological memory and cognitive dysfunction, as has been implicated in animal and cell models and also from neuropsychological analysis of normal subjects carriers of the AD associated genotypes. An exploration of functional variants in these and in other synaptic plasticity-related genes (a synaptogenomics approach) in independent larger samples will be important to discover new genes associated with AD.
Genome-wide analysis of the R2R3-MYB transcription factor gene family in sweet orange (Citrus sinensis).

PubMed

Liu, Chaoyang; Wang, Xia; Xu, Yuantao; Deng, Xiuxin; Xu, Qiang

2014-10-01

MYB transcription factor represents one of the largest gene families in plant genomes. Sweet orange (Citrus sinensis) is one of the most important fruit crops worldwide, and recently the genome has been sequenced. This provides an opportunity to investigate the organization and evolutionary characteristics of sweet orange MYB genes from whole genome view. In the present study, we identified 100 R2R3-MYB genes in the sweet orange genome. A comprehensive analysis of this gene family was performed, including the phylogeny, gene structure, chromosomal localization and expression pattern analyses. The 100 genes were divided into 29 subfamilies based on the sequence similarity and phylogeny, and the classification was also well supported by the highly conserved exon/intron structures and motif composition. The phylogenomic comparison of MYB gene family among sweet orange and related plant species, Arabidopsis, cacao and papaya suggested the existence of functional divergence during evolution. Expression profiling indicated that sweet orange R2R3-MYB genes exhibited distinct temporal and spatial expression patterns. Our analysis suggested that the sweet orange MYB genes may play important roles in different plant biological processes, some of which may be potentially involved in citrus fruit quality. These results will be useful for future functional analysis of the MYB gene family in sweet orange.
Mobile genes in the human microbiome are structured from global to individual scales

PubMed Central

Brito, IL; Jupiter, SD; Jenkins, AP; Naisilisili, W; Tamminen, M; Smillie, CS; Wortman, JR; Birren, BW; Xavier, RJ; Blainey, PC; Singh, AK; Gevers, D; Alm, EJ

2016-01-01

Recent work has underscored the importance of the microbiome in human health, largely attributing differences in phenotype to differences in the species present across individuals1,2,3,4,5. But mobile genes can confer profoundly different phenotypes on different strains of the same species. Little is known about the function and distribution of mobile genes in the human microbiome, and in particular whether the gene pool is globally homogenous or constrained by human population structure. Here, we investigate this question by comparing the mobile genes found in the microbiomes of 81 metropolitan North Americans with that of 172 agrarian Fiji islanders using a combination of single-cell genomics and metagenomics. We find large differences in mobile gene content between the Fijian and North American microbiomes, with functional variation that mirrors known dietary differences such as the excess of plant-based starch degradation genes. Remarkably, differences are also observed between the mobile gene pools of proximal Fijian villages, even though microbiome composition across villages is similar. Finally, we observe high rates of recombination leading to individual-specific mobile elements, suggesting that the abundance of some genes may reflect environmental selection rather than dispersal limitation. Together, these data support the hypothesis that human activities and behaviors provide selective pressures that shape mobile gene pools, and that acquisition of mobile genes is important to colonizing specific human populations. PMID:27409808

Some links on this page may take you to non-federal websites. Their policies may differ from this site.