Song, Hyun-Seob; McClure, Ryan S.; Bernstein, Hans C.; ...
2015-03-27
Cyanobacteria dynamically relay environmental inputs to intracellular adaptations through a coordinated adjustment of photosynthetic efficiency and carbon processing rates. The output of such adaptations is reflected through changes in transcriptional patterns and metabolic flux distributions that ultimately define growth strategy. To address interrelationships between metabolism and regulation, we performed integrative analyses of metabolic and gene co-expression networks in a model cyanobacterium, Synechococcus sp. PCC 7002. Centrality analyses using the gene co-expression network identified a set of key genes, which were defined here as ‘topologically important.’ Parallel in silico gene knock-out simulations, using the genome-scale metabolic network, classified what we termedmore » as ‘functionally important’ genes, deletion of which affected growth or metabolism. A strong positive correlation was observed between topologically and functionally important genes. Functionally important genes exhibited variable levels of topological centrality; however, the majority of topologically central genes were found to be functionally essential for growth. Subsequent functional enrichment analysis revealed that both functionally and topologically important genes in Synechococcus sp. PCC 7002 are predominantly associated with translation and energy metabolism, two cellular processes critical for growth. This research demonstrates how synergistic network-level analyses can be used for reconciliation of metabolic and gene expression data to uncover fundamental biological principles.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Song, Hyun-Seob; McClure, Ryan S.; Bernstein, Hans C.
Cyanobacteria dynamically relay environmental inputs to intracellular adaptations through a coordinated adjustment of photosynthetic efficiency and carbon processing rates. The output of such adaptations is reflected through changes in transcriptional patterns and metabolic flux distributions that ultimately define growth strategy. To address interrelationships between metabolism and regulation, we performed integrative analyses of metabolic and gene co-expression networks in a model cyanobacterium, Synechococcus sp. PCC 7002. Centrality analyses using the gene co-expression network identified a set of key genes, which were defined here as ‘topologically important.’ Parallel in silico gene knock-out simulations, using the genome-scale metabolic network, classified what we termedmore » as ‘functionally important’ genes, deletion of which affected growth or metabolism. A strong positive correlation was observed between topologically and functionally important genes. Functionally important genes exhibited variable levels of topological centrality; however, the majority of topologically central genes were found to be functionally essential for growth. Subsequent functional enrichment analysis revealed that both functionally and topologically important genes in Synechococcus sp. PCC 7002 are predominantly associated with translation and energy metabolism, two cellular processes critical for growth. This research demonstrates how synergistic network-level analyses can be used for reconciliation of metabolic and gene expression data to uncover fundamental biological principles.« less
Niu, Sheng-Yong; Yang, Jinyu; McDermaid, Adam; Zhao, Jing; Kang, Yu; Ma, Qin
2017-05-08
Metagenomic and metatranscriptomic sequencing approaches are more frequently being used to link microbiota to important diseases and ecological changes. Many analyses have been used to compare the taxonomic and functional profiles of microbiota across habitats or individuals. While a large portion of metagenomic analyses focus on species-level profiling, some studies use strain-level metagenomic analyses to investigate the relationship between specific strains and certain circumstances. Metatranscriptomic analysis provides another important insight into activities of genes by examining gene expression levels of microbiota. Hence, combining metagenomic and metatranscriptomic analyses will help understand the activity or enrichment of a given gene set, such as drug-resistant genes among microbiome samples. Here, we summarize existing bioinformatics tools of metagenomic and metatranscriptomic data analysis, the purpose of which is to assist researchers in deciding the appropriate tools for their microbiome studies. Additionally, we propose an Integrated Meta-Function mapping pipeline to incorporate various reference databases and accelerate functional gene mapping procedures for both metagenomic and metatranscriptomic analyses. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Evolutionary analyses of hedgehog and Hoxd-10 genes in fish species closely related to the zebrafish
Zardoya, Rafael; Abouheif, Ehab; Meyer, Axel
1996-01-01
The study of development has relied primarily on the isolation of mutations in genes with specific functions in development and on the comparison of their expression patterns in normal and mutant phenotypes. Comparative evolutionary analyses can complement these approaches. Phylogenetic analyses of Sonic hedgehog (Shh) and Hoxd-10 genes from 18 cyprinid fish species closely related to the zebrafish provide novel insights into the functional constraints acting on Shh. Our results confirm and extend those gained from expression and crystalline structure analyses of this gene. Unexpectedly, exon 1 of Shh is found to be almost invariant even in third codon positions among these morphologically divergent species suggesting that this exon encodes for a functionally important domain of the hedgehog protein. This is surprising because the main functional domain of Shh had been thought to be that encoded by exon 2. Comparisons of Shh and Hoxd-10 gene sequences and of resulting gene trees document higher evolutionary constraints on the former than on the latter. This might be indicative of more general evolutionary patterns in networks of developmental regulatory genes interacting in a hierarchical fashion. The presence of four members of the hedgehog gene family in cyprinid fishes was documented and their homologies to known hedgehog genes in other vertebrates were established. PMID:8917540
Zardoya, R; Abouheif, E; Meyer, A
1996-11-12
The study of development has relied primarily on the isolation of mutations in genes with specific functions in development and on the comparison of their expression patterns in normal and mutant phenotypes. Comparative evolutionary analyses can complement these approaches. Phylogenetic analyses of Sonic hedgehog (Shh) and Hoxd-10 genes from 18 cyprinid fish species closely related to the zebrafish provide novel insights into the functional constraints acting on Shh. Our results confirm and extend those gained from expression and crystalline structure analyses of this gene. Unexpectedly, exon 1 of Shh is found to be almost invariant even in third codon positions among these morphologically divergent species suggesting that this exon encodes for a functionally important domain of the hedgehog protein. This is surprising because the main functional domain of Shh had been thought to be that encoded by exon 2. Comparisons of Shh and Hoxd-10 gene sequences and of resulting gene trees document higher evolutionary constraints on the former than on the latter. This might be indicative of more general evolutionary patterns in networks of developmental regulatory genes interacting in a hierarchical fashion. The presence of four members of the hedgehog gene family in cyprinid fishes was documented and their homologies to known hedgehog genes in other vertebrates were established.
Qiu, Ying-Hua; Deng, Fei-Yan; Tang, Zai-Xiang; Jiang, Zhen-Huan; Lei, Shu-Feng
2015-10-01
Type 1 diabetes mellitus (type 1 DM) is an autoimmune disease. Although genome-wide association studies (GWAS) and meta-analyses have successfully identified numerous type 1 DM-associated susceptibility loci, the underlying mechanisms for these susceptibility loci are currently largely unclear. Based on publicly available datasets, we performed integrative analyses (i.e., integrated gene relationships among implicated loci, differential gene expression analysis, functional prediction and functional annotation clustering analysis) and combined with expression quantitative trait loci (eQTL) results to further explore function mechanisms underlying the associations between genetic variants and type 1 DM. Among a total of 183 type 1 DM-associated SNPs, eQTL analysis showed that 17 SNPs with cis-regulated eQTL effects on 9 genes. All the 9 eQTL genes enrich in immune-related pathways or Gene Ontology (GO) terms. Functional prediction analysis identified 5 SNPs located in transcription factor (TF) binding sites. Of the 9 eQTL genes, 6 (TAP2, HLA-DOB, HLA-DQB1, HLA-DQA1, HLA-DRB5 and CTSH) were differentially expressed in type 1 DM-associated related cells. Especially, rs3825932 in CTSH has integrative functional evidence supporting the association with type 1 DM. These findings indicated that integrative analyses can yield important functional information to link genetic variants and type 1 DM. Copyright © 2015 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
Genome-wide association and network analysis of lung function in the Framingham Heart Study.
Liao, Shu-Yi; Lin, Xihong; Christiani, David C
2014-09-01
Single nucleotide polymorphisms have been found to be associated with pulmonary function using genome-wide association studies. However, lung function is a complex trait that is likely to be influenced by multiple gene-gene interactions besides individual genes. Our goal is to build a cellular network to explore the relationship between pulmonary function and genotypes by combining SNP level and network analyses using longitudinal lung function data from the Framingham Heart Study. We analyzed 2,698 genotyped participants from the Offspring cohort that had an average of 3.35 spirometry measurements per person for a mean length of 13 years. Repeated forced expiratory volume in one second (FEV1 ) and the ratio of FEV1 to forced vital capacity (FVC) were used as outcomes. Data were analyzed using linear-mixed models for the association between lung function and alleles by accounting for the correlation among repeated measures over time within the same subject and within-family correlation. Network analyses were performed using dmGWAS and validated with data from the Third Generation cohort. Analyses identified SMAD3, TGFBR2, CD44, CTGF, VCAN, CTNNB1, SCGB1A1, PDE4D, NRG1, EPHB1, and LYN as contributors to pulmonary function. Most of these genes were novel that were not found previously using solely SNP-level analysis. These novel genes are involving the transforming growth factor beta (TGFB)-SMAD pathway, Wnt/beta-catenin pathway, etc. Therefore, combining SNP-level and network analyses using longitudinal lung function data is a useful alternative strategy to identify risk genes. © 2014 WILEY PERIODICALS, INC.
Wei, Qing; Khan, Ishita K; Ding, Ziyun; Yerneni, Satwica; Kihara, Daisuke
2017-03-20
The number of genomics and proteomics experiments is growing rapidly, producing an ever-increasing amount of data that are awaiting functional interpretation. A number of function prediction algorithms were developed and improved to enable fast and automatic function annotation. With the well-defined structure and manual curation, Gene Ontology (GO) is the most frequently used vocabulary for representing gene functions. To understand relationship and similarity between GO annotations of genes, it is important to have a convenient pipeline that quantifies and visualizes the GO function analyses in a systematic fashion. NaviGO is a web-based tool for interactive visualization, retrieval, and computation of functional similarity and associations of GO terms and genes. Similarity of GO terms and gene functions is quantified with six different scores including protein-protein interaction and context based association scores we have developed in our previous works. Interactive navigation of the GO function space provides intuitive and effective real-time visualization of functional groupings of GO terms and genes as well as statistical analysis of enriched functions. We developed NaviGO, which visualizes and analyses functional similarity and associations of GO terms and genes. The NaviGO webserver is freely available at: http://kiharalab.org/web/navigo .
Physcomitrella MADS-box genes regulate water supply and sperm movement for fertilization.
Koshimizu, Shizuka; Kofuji, Rumiko; Sasaki-Sekimoto, Yuko; Kikkawa, Masahide; Shimojima, Mie; Ohta, Hiroyuki; Shigenobu, Shuji; Kabeya, Yukiko; Hiwatashi, Yuji; Tamada, Yosuke; Murata, Takashi; Hasebe, Mitsuyasu
2018-01-01
MIKC classic (MIKC C )-type MADS-box genes encode transcription factors that function in various developmental processes, including angiosperm floral organ identity. Phylogenetic analyses of the MIKC C -type MADS-box family, including genes from non-flowering plants, suggest that the increased numbers of these genes in flowering plants is related to their functional divergence; however, their precise functions in non-flowering plants and their evolution throughout land plant diversification are unknown. Here, we show that MIKC C -type MADS-box genes in the moss Physcomitrella patens function in two ways to enable fertilization. Analyses of protein localization, deletion mutants and overexpression lines of all six genes indicate that three MIKC C -type MADS-box genes redundantly regulate cell division and growth in the stems for appropriate external water conduction, as well as the formation of sperm with motile flagella. The former function appears to be maintained in the flowering plant lineage, while the latter was lost in accordance with the loss of sperm.
Functional and Genomic Features of Human Genes Mutated in Neuropsychiatric Disorders.
Forero, Diego A; Prada, Carlos F; Perry, George
2016-01-01
In recent years, a large number of studies around the world have led to the identification of causal genes for hereditary types of common and rare neurological and psychiatric disorders. To explore the functional and genomic features of known human genes mutated in neuropsychiatric disorders. A systematic search was used to develop a comprehensive catalog of genes mutated in neuropsychiatric disorders (NPD). Functional enrichment and protein-protein interaction analyses were carried out. A false discovery rate approach was used for correction for multiple testing. We found several functional categories that are enriched among NPD genes, such as gene ontologies, protein domains, tissue expression, signaling pathways and regulation by brain-expressed miRNAs and transcription factors. Sixty six of those NPD genes are known to be druggable. Several topographic parameters of protein-protein interaction networks and the degree of conservation between orthologous genes were identified as significant among NPD genes. These results represent one of the first analyses of enrichment of functional categories of genes known to harbor mutations for NPD. These findings could be useful for a future creation of computational tools for prioritization of novel candidate genes for NPD.
Functional and Genomic Features of Human Genes Mutated in Neuropsychiatric Disorders
Forero, Diego A.; Prada, Carlos F.; Perry, George
2016-01-01
Background: In recent years, a large number of studies around the world have led to the identification of causal genes for hereditary types of common and rare neurological and psychiatric disorders. Objective: To explore the functional and genomic features of known human genes mutated in neuropsychiatric disorders. Methods: A systematic search was used to develop a comprehensive catalog of genes mutated in neuropsychiatric disorders (NPD). Functional enrichment and protein-protein interaction analyses were carried out. A false discovery rate approach was used for correction for multiple testing. Results: We found several functional categories that are enriched among NPD genes, such as gene ontologies, protein domains, tissue expression, signaling pathways and regulation by brain-expressed miRNAs and transcription factors. Sixty six of those NPD genes are known to be druggable. Several topographic parameters of protein-protein interaction networks and the degree of conservation between orthologous genes were identified as significant among NPD genes. Conclusion: These results represent one of the first analyses of enrichment of functional categories of genes known to harbor mutations for NPD. These findings could be useful for a future creation of computational tools for prioritization of novel candidate genes for NPD. PMID:27990183
Raychaudhuri, Soumya; Korn, Joshua M.; McCarroll, Steven A.; Altshuler, David; Sklar, Pamela; Purcell, Shaun; Daly, Mark J.
2010-01-01
Investigators have linked rare copy number variation (CNVs) to neuropsychiatric diseases, such as schizophrenia. One hypothesis is that CNV events cause disease by affecting genes with specific brain functions. Under these circumstances, we expect that CNV events in cases should impact brain-function genes more frequently than those events in controls. Previous publications have applied “pathway” analyses to genes within neuropsychiatric case CNVs to show enrichment for brain-functions. While such analyses have been suggestive, they often have not rigorously compared the rates of CNVs impacting genes with brain function in cases to controls, and therefore do not address important confounders such as the large size of brain genes and overall differences in rates and sizes of CNVs. To demonstrate the potential impact of confounders, we genotyped rare CNV events in 2,415 unaffected controls with Affymetrix 6.0; we then applied standard pathway analyses using four sets of brain-function genes and observed an apparently highly significant enrichment for each set. The enrichment is simply driven by the large size of brain-function genes. Instead, we propose a case-control statistical test, cnv-enrichment-test, to compare the rate of CNVs impacting specific gene sets in cases versus controls. With simulations, we demonstrate that cnv-enrichment-test is robust to case-control differences in CNV size, CNV rate, and systematic differences in gene size. Finally, we apply cnv-enrichment-test to rare CNV events published by the International Schizophrenia Consortium (ISC). This approach reveals nominal evidence of case-association in neuronal-activity and the learning gene sets, but not the other two examined gene sets. The neuronal-activity genes have been associated in a separate set of schizophrenia cases and controls; however, testing in independent samples is necessary to definitively confirm this association. Our method is implemented in the PLINK software package. PMID:20838587
Obayashi, Takeshi; Kinoshita, Kengo
2010-05-01
Gene coexpression analyses are a powerful method to predict the function of genes and/or to identify genes that are functionally related to query genes. The basic idea of gene coexpression analyses is that genes with similar functions should have similar expression patterns under many different conditions. This approach is now widely used by many experimental researchers, especially in the field of plant biology. In this review, we will summarize recent successful examples obtained by using our gene coexpression database, ATTED-II. Specifically, the examples will describe the identification of new genes, such as the subunits of a complex protein, the enzymes in a metabolic pathway and transporters. In addition, we will discuss the discovery of a new intercellular signaling factor and new regulatory relationships between transcription factors and their target genes. In ATTED-II, we provide two basic views of gene coexpression, a gene list view and a gene network view, which can be used as guide gene approach and narrow-down approach, respectively. In addition, we will discuss the coexpression effectiveness for various types of gene sets.
Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data.
Tintle, Nathan L; Sitarik, Alexandra; Boerema, Benjamin; Young, Kylie; Best, Aaron A; Dejongh, Matthew
2012-08-08
Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.
bc-GenExMiner 3.0: new mining module computes breast cancer gene expression correlation analyses.
Jézéquel, Pascal; Frénel, Jean-Sébastien; Campion, Loïc; Guérin-Charbonnel, Catherine; Gouraud, Wilfried; Ricolleau, Gabriel; Campone, Mario
2013-01-01
We recently developed a user-friendly web-based application called bc-GenExMiner (http://bcgenex.centregauducheau.fr), which offered the possibility to evaluate prognostic informativity of genes in breast cancer by means of a 'prognostic module'. In this study, we develop a new module called 'correlation module', which includes three kinds of gene expression correlation analyses. The first one computes correlation coefficient between 2 or more (up to 10) chosen genes. The second one produces two lists of genes that are most correlated (positively and negatively) to a 'tested' gene. A gene ontology (GO) mining function is also proposed to explore GO 'biological process', 'molecular function' and 'cellular component' terms enrichment for the output lists of most correlated genes. The third one explores gene expression correlation between the 15 telomeric and 15 centromeric genes surrounding a 'tested' gene. These correlation analyses can be performed in different groups of patients: all patients (without any subtyping), in molecular subtypes (basal-like, HER2+, luminal A and luminal B) and according to oestrogen receptor status. Validation tests based on published data showed that these automatized analyses lead to results consistent with studies' conclusions. In brief, this new module has been developed to help basic researchers explore molecular mechanisms of breast cancer. DATABASE URL: http://bcgenex.centregauducheau.fr
Bioinformatics for spermatogenesis: annotation of male reproduction based on proteomics
Zhou, Tao; Zhou, Zuo-Min; Guo, Xue-Jiang
2013-01-01
Proteomics strategies have been widely used in the field of male reproduction, both in basic and clinical research. Bioinformatics methods are indispensable in proteomics-based studies and are used for data presentation, database construction and functional annotation. In the present review, we focus on the functional annotation of gene lists obtained through qualitative or quantitative methods, summarizing the common and male reproduction specialized proteomics databases. We introduce several integrated tools used to find the hidden biological significance from the data obtained. We further describe in detail the information on male reproduction derived from Gene Ontology analyses, pathway analyses and biomedical analyses. We provide an overview of bioinformatics annotations in spermatogenesis, from gene function to biological function and from biological function to clinical application. On the basis of recently published proteomics studies and associated data, we show that bioinformatics methods help us to discover drug targets for sperm motility and to scan for cancer-testis genes. In addition, we summarize the online resources relevant to male reproduction research for the exploration of the regulation of spermatogenesis. PMID:23852026
Shimada, Norimoto; Sato, Shusei; Akashi, Tomoyoshi; Nakamura, Yasukazu; Tabata, Satoshi; Ayabe, Shin-ichi; Aoki, Toshio
2007-01-01
Abstract A model legume Lotus japonicus (Regel) K. Larsen is one of the subjects of genome sequencing and functional genomics programs. In the course of targeted approaches to the legume genomics, we analyzed the genes encoding enzymes involved in the biosynthesis of the legume-specific 5-deoxyisoflavonoid of L. japonicus, which produces isoflavan phytoalexins on elicitor treatment. The paralogous biosynthetic genes were assigned as comprehensively as possible by biochemical experiments, similarity searches, comparison of the gene structures, and phylogenetic analyses. Among the 10 biosynthetic genes investigated, six comprise multigene families, and in many cases they form gene clusters in the chromosomes. Semi-quantitative reverse transcriptase–PCR analyses showed coordinate up-regulation of most of the genes during phytoalexin induction and complex accumulation patterns of the transcripts in different organs. Some paralogous genes exhibited similar expression specificities, suggesting their genetic redundancy. The molecular evolution of the biosynthetic genes is discussed. The results presented here provide reliable annotations of the genes and genetic markers for comparative and functional genomics of leguminous plants. PMID:17452423
Yang, Shuzhi; Cai, Qunfeng; Bard, Jonathan; Jamison, Jennifer; Wang, Jianmin; Yang, Weiping; Hu, Bo Hua
2015-12-01
Individual variation in the susceptibility of the auditory system to acoustic overstimulation has been well-documented at both the functional and structural levels. However, the molecular mechanism responsible for this variation is unclear. The current investigation was designed to examine the variation patterns of cochlear gene expression using RNA-seq data and to identify the genes with expression variation that increased following acoustic trauma. This study revealed that the constitutive expressions of cochlear genes displayed diverse levels of gene-specific variation. These variation patterns were altered by acoustic trauma; approximately one-third of the examined genes displayed marked increases in their expression variation. Bioinformatics analyses revealed that the genes that exhibited increased variation were functionally related to cell death, biomolecule metabolism, and membrane function. In contrast, the stable genes were primarily related to basic cellular processes, including protein and macromolecular syntheses and transport. There was no functional overlap between the stable and variable genes. Importantly, we demonstrated that glutamate metabolism is related to the variation in the functional response of the cochlea to acoustic overstimulation. Taken together, the results indicate that our analyses of the individual variations in transcriptome changes of cochlear genes provide important information for the identification of genes that potentially contribute to the generation of individual variation in cochlear responses to acoustic overstimulation. Copyright © 2015 Elsevier B.V. All rights reserved.
Functional Abstraction as a Method to Discover Knowledge in Gene Ontologies
Ultsch, Alfred; Lötsch, Jörn
2014-01-01
Computational analyses of functions of gene sets obtained in microarray analyses or by topical database searches are increasingly important in biology. To understand their functions, the sets are usually mapped to Gene Ontology knowledge bases by means of over-representation analysis (ORA). Its result represents the specific knowledge of the functionality of the gene set. However, the specific ontology typically consists of many terms and relationships, hindering the understanding of the ‘main story’. We developed a methodology to identify a comprehensibly small number of GO terms as “headlines” of the specific ontology allowing to understand all central aspects of the roles of the involved genes. The Functional Abstraction method finds a set of headlines that is specific enough to cover all details of a specific ontology and is abstract enough for human comprehension. This method exceeds the classical approaches at ORA abstraction and by focusing on information rather than decorrelation of GO terms, it directly targets human comprehension. Functional abstraction provides, with a maximum of certainty, information value, coverage and conciseness, a representation of the biological functions in a gene set plays a role. This is the necessary means to interpret complex Gene Ontology results thus strengthening the role of functional genomics in biomarker and drug discovery. PMID:24587272
NASA Astrophysics Data System (ADS)
Holtorf, Hauke; Guitton, Marie-Christine; Reski, Ralf
2002-04-01
Functional genome analysis of plants has entered the high-throughput stage. The complete genome information from key species such as Arabidopsis thaliana and rice is now available and will further boost the application of a range of new technologies to functional plant gene analysis. To broadly assign functions to unknown genes, different fast and multiparallel approaches are currently used and developed. These new technologies are based on known methods but are adapted and improved to accommodate for comprehensive, large-scale gene analysis, i.e. such techniques are novel in the sense that their design allows researchers to analyse many genes at the same time and at an unprecedented pace. Such methods allow analysis of the different constituents of the cell that help to deduce gene function, namely the transcripts, proteins and metabolites. Similarly the phenotypic variations of entire mutant collections can now be analysed in a much faster and more efficient way than before. The different methodologies have developed to form their own fields within the functional genomics technological platform and are termed transcriptomics, proteomics, metabolomics and phenomics. Gene function, however, cannot solely be inferred by using only one such approach. Rather, it is only by bringing together all the information collected by different functional genomic tools that one will be able to unequivocally assign functions to unknown plant genes. This review focuses on current technical developments and their impact on the field of plant functional genomics. The lower plant Physcomitrella is introduced as a new model system for gene function analysis, owing to its high rate of homologous recombination.
Campanini, Emeline B.; Vandewege, Michael W.; Pillai, Nisha E.; Tay, Boon-Hui; Jones, Justin L.; Venkatesh, Byrappa; Hoffmann, Federico G.
2015-01-01
Abstract The genes in the Myb superfamily encode for three related transcription factors in most vertebrates, A-, B-, and c-Myb, with functionally distinct roles, whereas most invertebrates have a single Myb. B-Myb plays an essential role in cell division and cell cycle progression, c-Myb is involved in hematopoiesis, and A-Myb is involved in spermatogenesis and regulating expression of pachytene PIWI interacting RNAs, a class of small RNAs involved in posttranscriptional gene regulation and the maintenance of reproductive tissues. Comparisons between teleost fish and tetrapods suggest that the emergence and functional divergence of the Myb genes were linked to the two rounds of whole-genome duplication early in vertebrate evolution. We combined phylogenetic, synteny, structural, and gene expression analyses of the Myb paralogs from elephant shark and lampreys with data from 12 bony vertebrates to reconstruct the early evolution of vertebrate Mybs. Phylogenetic and synteny analyses suggest that the elephant shark and Japanese lamprey have copies of the A-, B-, and c-Myb genes, implying their origin could be traced back to the common ancestor of lampreys and gnathostomes. However, structural and gene expression analyses suggest that their functional roles diverged between gnathostomes and cyclostomes. In particular, we did not detect A-Myb expression in testis suggesting that the involvement of A-Myb in the pachytene PIWI interacting RNA pathway is probably a gnathostome-specific innovation. We speculate that the secondary loss of a central domain in lamprey A-Myb underlies the functional differences between the cyclostome and gnathostome A-Myb proteins. PMID:26475318
The ergot alkaloid gene cluster: functional analyses and evolutionary aspects.
Lorenz, Nicole; Haarmann, Thomas; Pazoutová, Sylvie; Jung, Manfred; Tudzynski, Paul
2009-01-01
Ergot alkaloids and their derivatives have been traditionally used as therapeutic agents in migraine, blood pressure regulation and help in childbirth and abortion. Their production in submerse culture is a long established biotechnological process. Ergot alkaloids are produced mainly by members of the genus Claviceps, with Claviceps purpurea as best investigated species concerning the biochemistry of ergot alkaloid synthesis (EAS). Genes encoding enzymes involved in EAS have been shown to be clustered; functional analyses of EAS cluster genes have allowed to assign specific functions to several gene products. Various Claviceps species differ with respect to their host specificity and their alkaloid content; comparison of the ergot alkaloid clusters in these species (and of clavine alkaloid clusters in other genera) yields interesting insights into the evolution of cluster structure. This review focuses on recently published and also yet unpublished data on the structure and evolution of the EAS gene cluster and on the function and regulation of cluster genes. These analyses have also significant biotechnological implications: the characterization of non-ribosomal peptide synthetases (NRPS) involved in the synthesis of the peptide moiety of ergopeptines opened interesting perspectives for the synthesis of ergot alkaloids; on the other hand, defined mutants could be generated producing interesting intermediates or only single peptide alkaloids (instead of the alkaloid mixtures usually produced by industrial strains).
Tang, Ho Man; Liu, Sanzhen; Hill-Skinner, Sarah; Wu, Wei; Reed, Danielle; Yeh, Cheng-Ting; Nettleton, Dan; Schnable, Patrick S
2014-01-01
The midribs of maize brown midrib (bm) mutants exhibit a reddish-brown color associated with reductions in lignin concentration and alterations in lignin composition. Here, we report the mapping, cloning, and functional and biochemical analyses of the bm2 gene. The bm2 gene was mapped to a small region of chromosome 1 that contains a putative methylenetetrahydrofolate reductase (MTHFR) gene, which is down-regulated in bm2 mutant plants. Analyses of multiple Mu-induced bm2-Mu mutant alleles confirmed that this constitutively expressed gene is bm2. Yeast complementation experiments and a previously published biochemical characterization show that the bm2 gene encodes a functional MTHFR. Quantitative RT-PCR analyses demonstrated that the bm2 mutants accumulate substantially reduced levels of bm2 transcript. Alteration of MTHFR function is expected to influence accumulation of the methyl donor S-adenosyl-l-methionine (SAM). Because SAM is consumed by two methyltransferases in the lignin pathway (Ye et al., 1994), the finding that bm2 encodes a functional MTHFR is consistent with its lignin phenotype. Consistent with this functional assignment of bm2, the expression patterns of genes in a variety of SAM-dependent or -related pathways, including lignin biosynthesis, are altered in the bm2 mutant. Biochemical assays confirmed that bm2 mutants accumulate reduced levels of lignin with altered composition compared to wild-type. Hence, this study demonstrates a role for MTHFR in lignin biosynthesis. PMID:24286468
Two euAGAMOUS Genes Control C-Function in Medicago truncatula
Gómez-Mena, Concepción; Constantin, Gabriela D.; Wen, Jiangqi; Mysore, Kirankumar S.; Lund, Ole S.; Johansen, Elisabeth; Beltrán, José Pío; Cañas, Luis A.
2014-01-01
C-function MADS-box transcription factors belong to the AGAMOUS (AG) lineage and specify both stamen and carpel identity and floral meristem determinacy. In core eudicots, the AG lineage is further divided into two branches, the euAG and PLE lineages. Functional analyses across flowering plants strongly support the idea that duplicated AG lineage genes have different degrees of subfunctionalization of the C-function. The legume Medicago truncatula contains three C-lineage genes in its genome: two euAG genes (MtAGa and MtAGb) and one PLENA-like gene (MtSHP). This species is therefore a good experimental system to study the effects of gene duplication within the AG subfamily. We have studied the respective functions of each euAG genes in M. truncatula employing expression analyses and reverse genetic approaches. Our results show that the M. truncatula euAG- and PLENA-like genes are an example of subfunctionalization as a result of a change in expression pattern. MtAGa and MtAGb are the only genes showing a full C-function activity, concomitant with their ancestral expression profile, early in the floral meristem, and in the third and fourth floral whorls during floral development. In contrast, MtSHP expression appears late during floral development suggesting it does not contribute significantly to the C-function. Furthermore, the redundant MtAGa and MtAGb paralogs have been retained which provides the overall dosage required to specify the C-function in M. truncatula. PMID:25105497
Mukherjee, Shubhabrata; Russell, Joshua C; Carr, Daniel T; Burgess, Jeremy D; Allen, Mariet; Serie, Daniel J; Boehme, Kevin L; Kauwe, John S K; Naj, Adam C; Fardo, David W; Dickson, Dennis W; Montine, Thomas J; Ertekin-Taner, Nilufer; Kaeberlein, Matt R; Crane, Paul K
2017-10-01
We sought to determine whether a systems biology approach may identify novel late-onset Alzheimer's disease (LOAD) loci. We performed gene-wide association analyses and integrated results with human protein-protein interaction data using network analyses. We performed functional validation on novel genes using a transgenic Caenorhabditis elegans Aβ proteotoxicity model and evaluated novel genes using brain expression data from people with LOAD and other neurodegenerative conditions. We identified 13 novel candidate LOAD genes outside chromosome 19. Of those, RNA interference knockdowns of the C. elegans orthologs of UBC, NDUFS3, EGR1, and ATP5H were associated with Aβ toxicity, and NDUFS3, SLC25A11, ATP5H, and APP were differentially expressed in the temporal cortex. Network analyses identified novel LOAD candidate genes. We demonstrated a functional role for four of these in a C. elegans model and found enrichment of differentially expressed genes in the temporal cortex. Copyright © 2017 the Alzheimer's Association. Published by Elsevier Inc. All rights reserved.
Tang, Ho Man; Liu, Sanzhen; Hill-Skinner, Sarah; Wu, Wei; Reed, Danielle; Yeh, Cheng-Ting; Nettleton, Dan; Schnable, Patrick S
2014-02-01
The midribs of maize brown midrib (bm) mutants exhibit a reddish-brown color associated with reductions in lignin concentration and alterations in lignin composition. Here, we report the mapping, cloning, and functional and biochemical analyses of the bm2 gene. The bm2 gene was mapped to a small region of chromosome 1 that contains a putative methylenetetrahydrofolate reductase (MTHFR) gene, which is down-regulated in bm2 mutant plants. Analyses of multiple Mu-induced bm2-Mu mutant alleles confirmed that this constitutively expressed gene is bm2. Yeast complementation experiments and a previously published biochemical characterization show that the bm2 gene encodes a functional MTHFR. Quantitative RT-PCR analyses demonstrated that the bm2 mutants accumulate substantially reduced levels of bm2 transcript. Alteration of MTHFR function is expected to influence accumulation of the methyl donor S-adenosyl-L-methionine (SAM). Because SAM is consumed by two methyltransferases in the lignin pathway (Ye et al., ), the finding that bm2 encodes a functional MTHFR is consistent with its lignin phenotype. Consistent with this functional assignment of bm2, the expression patterns of genes in a variety of SAM-dependent or -related pathways, including lignin biosynthesis, are altered in the bm2 mutant. Biochemical assays confirmed that bm2 mutants accumulate reduced levels of lignin with altered composition compared to wild-type. Hence, this study demonstrates a role for MTHFR in lignin biosynthesis. © 2013 The Authors The Plant Journal © 2013 John Wiley & Sons Ltd.
Shang, Haihong; Li, Wei; Zou, Changsong; Yuan, Youlu
2013-07-01
NAC domain proteins are plant-specific transcription factors known to play diverse roles in various plant developmental processes. In the present study, we performed the first comprehensive study of the NAC gene family in Gossypium raimondii Ulbr., incorporating phylogenetic, chromosomal location, gene structure, conserved motif, and expression profiling analyses. We identified 145 NAC transcription factor (NAC-TF) genes that were phylogenetically clustered into 18 distinct subfamilies. Of these, 127 NAC-TF genes were distributed across the 13 chromosomes, 80 (55%) were preferentially retained duplicates located in both duplicated regions and six were located in triplicated chromosomal regions. The majority of NAC-TF genes showed temporal-, spatial-, and tissue-specific expression patterns based on transcriptomic and qRT-PCR analyses. However, the expression patterns of several duplicate genes were partially redundant, suggesting the occurrence of sub-functionalization during their evolution. Based on their genomic organization, we concluded that genomic duplications contributed significantly to the expansion of the NAC-TF gene family in G. raimondii. Comprehensive analysis of their expression profiles could provide novel insights into the functional divergence among members of the NAC gene family in G. raimondii. © 2013 Institute of Botany, Chinese Academy of Sciences.
Abernathy, Jason; Brezas, Andreas; Snekvik, Kevin R; Hardy, Ronald W; Overturf, Ken
2017-01-01
Finding suitable alternative protein sources for diets of carnivorous fish species remains a major concern for sustainable aquaculture. Through genetic selection, we created a strain of rainbow trout that outperforms parental lines in utilizing an all-plant protein diet and does not develop enteritis in the distal intestine, as is typical with salmonids on long-term plant protein-based feeds. By incorporating this strain into functional analyses, we set out to determine which genes are critical to plant protein utilization in the absence of gut inflammation. After a 12-week feeding trial with our selected strain and a control trout strain fed either a fishmeal-based diet or an all-plant protein diet, high-throughput RNA sequencing was completed on both liver and muscle tissues. Differential gene expression analyses, weighted correlation network analyses and further functional characterization were performed. A strain-by-diet design revealed differential expression ranging from a few dozen to over one thousand genes among the various comparisons and tissues. Major gene ontology groups identified between comparisons included those encompassing central, intermediary and foreign molecule metabolism, associated biosynthetic pathways as well as immunity. A systems approach indicated that genes involved in purine metabolism were highly perturbed. Systems analysis among the tissues tested further suggests the interplay between selection for growth, dietary utilization and protein tolerance may also have implications for nonspecific immunity. By combining data from differential gene expression and co-expression networks using selected trout, along with ontology and pathway analyses, a set of 63 candidate genes for plant diet tolerance was found. Risk loci in human inflammatory bowel diseases were also found in our datasets, indicating rainbow trout selected for plant-diet tolerance may have added utility as a potential biomedical model.
Rice functional genomics research in China.
Han, Bin; Xue, Yongbiao; Li, Jiayang; Deng, Xing-Wang; Zhang, Qifa
2007-06-29
Rice functional genomics is a scientific approach that seeks to identify and define the function of rice genes, and uncover when and how genes work together to produce phenotypic traits. Rapid progress in rice genome sequencing has facilitated research in rice functional genomics in China. The Ministry of Science and Technology of China has funded two major rice functional genomics research programmes for building up the infrastructures of the functional genomics study such as developing rice functional genomics tools and resources. The programmes were also aimed at cloning and functional analyses of a number of genes controlling important agronomic traits from rice. National and international collaborations on rice functional genomics study are accelerating rice gene discovery and application.
“Guilt by Association” Is the Exception Rather Than the Rule in Gene Networks
Gillis, Jesse; Pavlidis, Paul
2012-01-01
Gene networks are commonly interpreted as encoding functional information in their connections. An extensively validated principle called guilt by association states that genes which are associated or interacting are more likely to share function. Guilt by association provides the central top-down principle for analyzing gene networks in functional terms or assessing their quality in encoding functional information. In this work, we show that functional information within gene networks is typically concentrated in only a very few interactions whose properties cannot be reliably related to the rest of the network. In effect, the apparent encoding of function within networks has been largely driven by outliers whose behaviour cannot even be generalized to individual genes, let alone to the network at large. While experimentalist-driven analysis of interactions may use prior expert knowledge to focus on the small fraction of critically important data, large-scale computational analyses have typically assumed that high-performance cross-validation in a network is due to a generalizable encoding of function. Because we find that gene function is not systemically encoded in networks, but dependent on specific and critical interactions, we conclude it is necessary to focus on the details of how networks encode function and what information computational analyses use to extract functional meaning. We explore a number of consequences of this and find that network structure itself provides clues as to which connections are critical and that systemic properties, such as scale-free-like behaviour, do not map onto the functional connectivity within networks. PMID:22479173
From Genomes to Protein Models and Back
NASA Astrophysics Data System (ADS)
Tramontano, Anna; Giorgetti, Alejandro; Orsini, Massimiliano; Raimondo, Domenico
2007-12-01
The alternative splicing mechanism allows genes to generate more than one product. When the splicing events occur within protein coding regions they can modify the biological function of the protein. Alternative splicing has been suggested as one way for explaining the discrepancy between the number of human genes and functional complexity. We analysed the putative structure of the alternatively spliced gene products annotated in the ENCODE pilot project and discovered that many of the potential alternative gene products will be unlikely to produce stable functional proteins.
Integrative and conjugative elements and their hosts: composition, distribution and organization
Touchon, Marie; Rocha, Eduardo P. C.
2017-01-01
Abstract Conjugation of single-stranded DNA drives horizontal gene transfer between bacteria and was widely studied in conjugative plasmids. The organization and function of integrative and conjugative elements (ICE), even if they are more abundant, was only studied in a few model systems. Comparative genomics of ICE has been precluded by the difficulty in finding and delimiting these elements. Here, we present the results of a method that circumvents these problems by requiring only the identification of the conjugation genes and the species’ pan-genome. We delimited 200 ICEs and this allowed the first large-scale characterization of these elements. We quantified the presence in ICEs of a wide set of functions associated with the biology of mobile genetic elements, including some that are typically associated with plasmids, such as partition and replication. Protein sequence similarity networks and phylogenetic analyses revealed that ICEs are structured in functional modules. Integrases and conjugation systems have different evolutionary histories, even if the gene repertoires of ICEs can be grouped in function of conjugation types. Our characterization of the composition and organization of ICEs paves the way for future functional and evolutionary analyses of their cargo genes, composed of a majority of unknown function genes. PMID:28911112
Liu, Hongyun; Qin, Jiajia; Fan, Hui; Cheng, Jinjin; Li, Lin; Liu, Zheng
2017-07-01
As a member of the GRAS gene family, SCARECROW - LIKE ( SCL ) genes encode transcriptional regulators that are involved in plant information transmission and signal transduction. In this study, 44 SCL genes including two SCARECROW genes in millet were identified to be distributed on eight chromosomes, except chromosome 6. All the millet genes contain motifs 6-8, indicating that these motifs are conserved during the evolution. SCL genes of millet were divided into eight groups based on the phylogenetic relationship and classification of Arabidopsis SCL genes. Several putative millet orthologous genes in Arabidopsis , maize and rice were identified. High throughput RNA sequencing revealed that the expressions of millet SCL genes in root, stem, leaf, spica, and along leaf gradient varied greatly. Analyses combining the gene expression patterns, gene structures, motif compositions, promoter cis -elements identification, alternative splicing of transcripts and phylogenetic relationship of SCL genes indicate that the these genes may play diverse functions. Functionally characterized SCL genes in maize, rice and Arabidopsis would provide us some clues for future characterization of their homologues in millet. To the best of our knowledge, this is the first study of millet SCL genes at the genome wide level. Our work provides a useful platform for functional analysis of SCL genes in millet, a model crop for C 4 photosynthesis and bioenergy studies.
Duplicated growth hormone genes in a passerine bird, the jungle crow (Corvus macrorhynchos).
Arai, Natsumi; Iigo, Masayuki
2010-07-02
Molecular cloning, molecular phylogeny, gene structure and expression analyses of growth hormone (GH) were performed in a passerine bird, the jungle crow (Corvus macrorhynchos). Unexpectedly, duplicated GH cDNA and genes were identified and designated as GH1A and GH1B. In silico analyses identified the zebra finch orthologs. Both GH genes encode 217 amino acid residues and consist of five exons and four introns, spanning 5.2 kbp in GH1A and 4.2 kbp in GH1B. Predicted GH proteins of the jungle crow and zebra finch contain four conserved cysteine residues, suggesting duplicated GH genes are functional. Molecular phylogenetic analysis revealed that duplication of GH genes occur after divergence of the passerine lineage from the other avian orders as has been suggested from partial genomic DNA sequences of passerine GH genes. RT-PCR analyses confirmed expression of GH1A and GH1B in the pituitary gland. In addition, GH1A gene is expressed in all the tissues examined. However, expression of GH1B is confined to several brain areas and blood cells. These results indicate that the regulatory mechanisms of duplicated GH genes are different and that duplicated GH genes exert both endocrine and autocrine/paracrine functions. Copyright 2010 Elsevier Inc. All rights reserved.
Saand, Mumtaz Ali; Xu, You-Ping; Munyampundu, Jean-Pierre; Li, Wen; Zhang, Xuan-Rui; Cai, Xin-Zhong
2015-01-01
Cyclic nucleotide-gated ion channels (CNGCs) are calcium-permeable channels that are involved in various biological functions. Nevertheless, phylogeny and function of plant CNGCs are not well understood. In this study, 333 CNGC genes from 15 plant species were identified using comprehensive bioinformatics approaches. Extensive bioinformatics analyses demonstrated that CNGCs of Group IVa were distinct to those of other groups in gene structure and amino acid sequence of cyclic nucleotide-binding domain. A CNGC-specific motif that recognizes all identified plant CNGCs was generated. Phylogenetic analysis indicated that CNGC proteins of flowering plant species formed five groups. However, CNGCs of the non-vascular plant Physcomitrella patens clustered only in two groups (IVa and IVb), while those of the vascular non-flowering plant Selaginella moellendorffii gathered in four (IVa, IVb, I and II). These data suggest that Group IV CNGCs are most ancient and Group III CNGCs are most recently evolved in flowering plants. Furthermore, silencing analyses revealed that a set of CNGC genes might be involved in disease resistance and abiotic stress responses in tomato and function of SlCNGCs does not correlate with the group that they are belonging to. Our results indicate that Group IVa CNGCs are structurally but not functionally unique among plant CNGCs. PMID:26546226
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bock KW; D Honys; JM. Ward
Male fertility depends on the proper development of the male gametophyte, successful pollen germination, tube growth and delivery of the sperm cells to the ovule. Previous studies have shown that nutrients like boron, and ion gradients or currents of Ca2+, H+, and K+ are critical for pollen tube growth. However, the molecular identities of transporters mediating these fluxes are mostly unknown. As a first step to integrate transport with pollen development and function, a genome-wide analysis of transporter genes expressed in the male gametophyte at four developmental stages was conducted. About 1269 genes encoding classified transporters were collected from themore » Arabidopsis thaliana genome. Of 757 transporter genes expressed in pollen, 16% or 124 genes, including AHA6, CNGC18, TIP1.3 and CHX08, are specifically or preferentially expressed relative to sporophytic tissues. Some genes are highly expressed in microspores and bicellular pollen (COPT3, STP2, OPT9); while others are activated only in tricellular or mature pollen (STP11, LHT7). Analyses of entire gene families showed that a subset of genes, including those expressed in sporophytic tissues, were developmentally-regulated during pollen maturation. Early and late expression patterns revealed by transcriptome analysis are supported by promoter::GUS analyses of CHX genes and by other methods. Recent genetic studies based on a few transporters, including plasma membrane H+ pump AHA3, Ca2+ pump ACA9, and K+ channel SPIK, further support the expression patterns and the inferred functions revealed by our analyses. Thus, revealing the distinct expression patterns of specific transporters and unknown polytopic proteins during microgametogenesis provides new insights for strategic mutant analyses necessary to integrate the roles of transporters and potential receptors with male gametophyte development.« less
Bock, Kevin W; Honys, David; Ward, John M; Padmanaban, Senthilkumar; Nawrocki, Eric P; Hirschi, Kendal D; Twell, David; Sze, Heven
2006-04-01
Male fertility depends on the proper development of the male gametophyte, successful pollen germination, tube growth, and delivery of the sperm cells to the ovule. Previous studies have shown that nutrients like boron, and ion gradients or currents of Ca2+, H+, and K+ are critical for pollen tube growth. However, the molecular identities of transporters mediating these fluxes are mostly unknown. As a first step to integrate transport with pollen development and function, a genome-wide analysis of transporter genes expressed in the male gametophyte at four developmental stages was conducted. Approximately 1,269 genes encoding classified transporters were collected from the Arabidopsis (Arabidopsis thaliana) genome. Of 757 transporter genes expressed in pollen, 16% or 124 genes, including AHA6, CNGC18, TIP1.3, and CHX08, are specifically or preferentially expressed relative to sporophytic tissues. Some genes are highly expressed in microspores and bicellular pollen (COPT3, STP2, OPT9), while others are activated only in tricellular or mature pollen (STP11, LHT7). Analyses of entire gene families showed that a subset of genes, including those expressed in sporophytic tissues, was developmentally regulated during pollen maturation. Early and late expression patterns revealed by transcriptome analysis are supported by promoter::beta-glucuronidase analyses of CHX genes and by other methods. Recent genetic studies based on a few transporters, including plasma membrane H+ pump AHA3, Ca2+ pump ACA9, and K+ channel SPIK, further support the expression patterns and the inferred functions revealed by our analyses. Thus, revealing the distinct expression patterns of specific transporters and unknown polytopic proteins during microgametogenesis provides new insights for strategic mutant analyses necessary to integrate the roles of transporters and potential receptors with male gametophyte development.
2013-01-01
Background We describe the genome of the western painted turtle, Chrysemys picta bellii, one of the most widespread, abundant, and well-studied turtles. We place the genome into a comparative evolutionary context, and focus on genomic features associated with tooth loss, immune function, longevity, sex differentiation and determination, and the species' physiological capacities to withstand extreme anoxia and tissue freezing. Results Our phylogenetic analyses confirm that turtles are the sister group to living archosaurs, and demonstrate an extraordinarily slow rate of sequence evolution in the painted turtle. The ability of the painted turtle to withstand complete anoxia and partial freezing appears to be associated with common vertebrate gene networks, and we identify candidate genes for future functional analyses. Tooth loss shares a common pattern of pseudogenization and degradation of tooth-specific genes with birds, although the rate of accumulation of mutations is much slower in the painted turtle. Genes associated with sex differentiation generally reflect phylogeny rather than convergence in sex determination functionality. Among gene families that demonstrate exceptional expansions or show signatures of strong natural selection, immune function and musculoskeletal patterning genes are consistently over-represented. Conclusions Our comparative genomic analyses indicate that common vertebrate regulatory networks, some of which have analogs in human diseases, are often involved in the western painted turtle's extraordinary physiological capacities. As these regulatory pathways are analyzed at the functional level, the painted turtle may offer important insights into the management of a number of human health disorders. PMID:23537068
Analysis of functional redundancies within the Arabidopsis TCP transcription factor family.
Danisman, Selahattin; van Dijk, Aalt D J; Bimbo, Andrea; van der Wal, Froukje; Hennig, Lars; de Folter, Stefan; Angenent, Gerco C; Immink, Richard G H
2013-12-01
Analyses of the functions of TEOSINTE-LIKE1, CYCLOIDEA, and PROLIFERATING CELL FACTOR1 (TCP) transcription factors have been hampered by functional redundancy between its individual members. In general, putative functionally redundant genes are predicted based on sequence similarity and confirmed by genetic analysis. In the TCP family, however, identification is impeded by relatively low overall sequence similarity. In a search for functionally redundant TCP pairs that control Arabidopsis leaf development, this work performed an integrative bioinformatics analysis, combining protein sequence similarities, gene expression data, and results of pair-wise protein-protein interaction studies for the 24 members of the Arabidopsis TCP transcription factor family. For this, the work completed any lacking gene expression and protein-protein interaction data experimentally and then performed a comprehensive prediction of potential functional redundant TCP pairs. Subsequently, redundant functions could be confirmed for selected predicted TCP pairs by genetic and molecular analyses. It is demonstrated that the previously uncharacterized class I TCP19 gene plays a role in the control of leaf senescence in a redundant fashion with TCP20. Altogether, this work shows the power of combining classical genetic and molecular approaches with bioinformatics predictions to unravel functional redundancies in the TCP transcription factor family.
Analysis of functional redundancies within the Arabidopsis TCP transcription factor family
Danisman, Selahattin; de Folter, Stefan; Immink, Richard G. H.
2013-01-01
Analyses of the functions of TEOSINTE-LIKE1, CYCLOIDEA, and PROLIFERATING CELL FACTOR1 (TCP) transcription factors have been hampered by functional redundancy between its individual members. In general, putative functionally redundant genes are predicted based on sequence similarity and confirmed by genetic analysis. In the TCP family, however, identification is impeded by relatively low overall sequence similarity. In a search for functionally redundant TCP pairs that control Arabidopsis leaf development, this work performed an integrative bioinformatics analysis, combining protein sequence similarities, gene expression data, and results of pair-wise protein–protein interaction studies for the 24 members of the Arabidopsis TCP transcription factor family. For this, the work completed any lacking gene expression and protein–protein interaction data experimentally and then performed a comprehensive prediction of potential functional redundant TCP pairs. Subsequently, redundant functions could be confirmed for selected predicted TCP pairs by genetic and molecular analyses. It is demonstrated that the previously uncharacterized class I TCP19 gene plays a role in the control of leaf senescence in a redundant fashion with TCP20. Altogether, this work shows the power of combining classical genetic and molecular approaches with bioinformatics predictions to unravel functional redundancies in the TCP transcription factor family. PMID:24129704
Search for hidden messenger molecules: capa-gene expression in ants
USDA-ARS?s Scientific Manuscript database
Recent genome analyses suggested the absence of a number of neuropeptide genes and corresponding receptor genes in ants. That absence raised questions about compensation of functions of these peptides in hymenopteran insects. One of the missing genes is the capa-gene. CAPA-peptides are known to regu...
Suzuki, Hitoshi; Osaki, Ken; Sano, Kaori; Alam, A H M Khurshid; Nakamura, Yuichiro; Ishigaki, Yasuhito; Kawahara, Kozo; Tsukahara, Toshifumi
2011-02-18
Alternative splicing, which produces multiple mRNAs from a single gene, occurs in most human genes and contributes to protein diversity. Many alternative isoforms are expressed in a spatio-temporal manner, and function in diverse processes, including in the neural system. The purpose of the present study was to comprehensively investigate neural-splicing using P19 cells. GeneChip Exon Array analysis was performed using total RNAs purified from cells during neuronal cell differentiation. To efficiently and readily extract the alternative exon candidates, 9 filtering conditions were prepared, yielding 262 candidate exons (236 genes). Semiquantitative RT-PCR results in 30 randomly selected candidates suggested that 87% of the candidates were differentially alternatively spliced in neuronal cells compared to undifferentiated cells. Gene ontology and pathway analyses suggested that many of the candidate genes were associated with neural events. Together with 66 genes whose functions in neural cells or organs were reported previously, 47 candidate genes were found to be linked to 189 events in the gene-level profile of neural differentiation. By text-mining for the alternative isoform, distinct functions of the isoforms of 9 candidate genes indicated by the result of Exon Array were confirmed. Alternative exons were successfully extracted. Results from the informatics analyses suggested that neural events were primarily governed by genes whose expression was increased and whose transcripts were differentially alternatively spliced in the neuronal cells. In addition to known functions in neural cells or organs, the uninvestigated alternative splicing events of 11 genes among 47 candidate genes suggested that cell cycle events are also potentially important. These genes may help researchers to differentiate the roles of alternative splicing in cell differentiation and cell proliferation.
Serial analysis of gene expression in a rat lung model of asthma.
Yin, Lei-Miao; Jiang, Gong-Hao; Wang, Yu; Wang, Yan; Liu, Yan-Yan; Jin, Wei-Rong; Zhang, Zen; Xu, Yu-Dong; Yang, Yong-Qing
2008-11-01
The pathogenesis and molecular mechanism underlying asthma remain undetermined. The purpose of this study was to identify genes and pathways involved in the early airway response (EAR) phase of asthma by using serial analysis of gene expression (SAGE). Two SAGE tag libraries of lung tissues derived from a rat model of asthma and controls were generated. Bioinformatic analyses were carried out using the Database for Annotation, Visualization and IntegratedDiscovery Functional Annotation Tool, Gene Ontology (GO) TreeMachine and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. A total of 26 552 SAGE tags of asthmatic rat lung were obtained, of which 12 221 were unique tags. Of the unique tags, 55.5% were matched with known genes. By comparison of the two libraries, 186 differentially expressed tags (P < 0.05) were identified, of which 103 were upregulated and 83 were downregulated. Using the bioinformatic tools these genes were classified into 23 functional groups, 15 KEGG pathways and 37 enriched GO categories. The bioinformatic analyses of gene distribution, enriched categories and the involvement of specific pathways in the SAGE libraries have provided information on regulatory networks of the EAR phase of asthma. Analyses of the regulated genes of interest may inform new hypotheses, increase our understanding of the disease and provide a foundation for future research.
Chan, Clara; Itoh, Takashi; Ohkuma, Moriya
2013-01-01
Iron-rich flocs often occur where anoxic water containing ferrous iron encounters oxygenated environments. Culture-independent molecular analyses have revealed the presence of 16S rRNA gene sequences related to diverse bacteria, including autotrophic iron oxidizers and methanotrophs in iron-rich flocs; however, the metabolic functions of the microbial communities remain poorly characterized, particularly regarding carbon cycling. In the present study, we cultivated iron-oxidizing bacteria (FeOB) and performed clone library analyses of functional genes related to carbon fixation and methane oxidization (cbbM and pmoA, respectively), in addition to bacterial and archaeal 16S rRNA genes, in freshwater iron-rich flocs at groundwater discharge points. The analyses of 16S rRNA, cbbM, and pmoA genes strongly suggested the coexistence of autotrophic iron oxidizers and methanotrophs in the flocs. Furthermore, a novel stalk-forming microaerophilic FeOB, strain OYT1, was isolated and characterized phylogenetically and physiologically. The 16S rRNA and cbbM gene sequences of OYT1 are related to those of other microaerophilic FeOB in the family Gallionellaceae, of the Betaproteobacteria, isolated from freshwater environments at circumneutral pH. The physiological characteristics of OYT1 will help elucidate the ecophysiology of microaerophilic FeOB. Overall, this study demonstrates functional roles of microorganisms in iron flocs, suggesting several possible linkages between Fe and C cycling. PMID:23811518
Database of cattle candidate genes and genetic markers for milk production and mastitis
Ogorevc, J; Kunej, T; Razpet, A; Dovc, P
2009-01-01
A cattle database of candidate genes and genetic markers for milk production and mastitis has been developed to provide an integrated research tool incorporating different types of information supporting a genomic approach to study lactation, udder development and health. The database contains 943 genes and genetic markers involved in mammary gland development and function, representing candidates for further functional studies. The candidate loci were drawn on a genetic map to reveal positional overlaps. For identification of candidate loci, data from seven different research approaches were exploited: (i) gene knockouts or transgenes in mice that result in specific phenotypes associated with mammary gland (143 loci); (ii) cattle QTL for milk production (344) and mastitis related traits (71); (iii) loci with sequence variations that show specific allele-phenotype interactions associated with milk production (24) or mastitis (10) in cattle; (iv) genes with expression profiles associated with milk production (207) or mastitis (107) in cattle or mouse; (v) cattle milk protein genes that exist in different genetic variants (9); (vi) miRNAs expressed in bovine mammary gland (32) and (vii) epigenetically regulated cattle genes associated with mammary gland function (1). Fourty-four genes found by multiple independent analyses were suggested as the most promising candidates and were further in silico analysed for expression levels in lactating mammary gland, genetic variability and top biological functions in functional networks. A miRNA target search for mammary gland expressed miRNAs identified 359 putative binding sites in 3′UTRs of candidate genes. PMID:19508288
Molecular genetic analyses of microsporogenesis and microgametogenesis in flowering plants.
Ma, Hong
2005-01-01
In flowering plants, male reproductive development requires the formation of the stamen, including the differentiation of anther tissues. Within the anther, male meiosis produces microspores, which further develop into pollen grains, relying on both sporophytic and gametophytic gene functions. The mature pollen is released when the anther dehisces, allowing pollination to occur. Molecular studies have identified a large number of genes that are expressed during stamen and pollen development. Genetic analyses have demonstrated the function of some of these genes in specifying stamen identity, regulating anther cell division and differentiation, controlling male meiosis, supporting pollen development, and promoting anther dehiscence. These genes encode a variety of proteins, including transcriptional regulators, signal transduction proteins, regulators of protein degradation, and enzymes for the biosynthesis of hormones. Although much has been learned in recent decades, much more awaits to be discovered and understood; the future of the study of plant male reproduction remains bright and exciting with the ever-growing tool kits and rapidly expanding information and resources for gene function studies.
Raherison, Elie S M; Giguère, Isabelle; Caron, Sébastien; Lamara, Mebarek; MacKay, John J
2015-07-01
Transcript profiling has shown the molecular bases of several biological processes in plants but few studies have developed an understanding of overall transcriptome variation. We investigated transcriptome structure in white spruce (Picea glauca), aiming to delineate its modular organization and associated functional and evolutionary attributes. Microarray analyses were used to: identify and functionally characterize groups of co-expressed genes; investigate expressional and functional diversity of vascular tissue preferential genes which were conserved among Picea species, and identify expression networks underlying wood formation. We classified 22 857 genes as variable (79%; 22 coexpression groups) or invariant (21%) by profiling across several vegetative tissues. Modular organization and complex transcriptome restructuring among vascular tissue preferential genes was revealed by their assignment to coexpression groups with partially overlapping profiles and partially distinct functions. Integrated analyses of tissue-based and temporally variable profiles identified secondary xylem gene networks, showed their remodelling over a growing season and identified PgNAC-7 (no apical meristerm (NAM), Arabidopsis transcription activation factor (ATAF) and cup-shaped cotyledon (CUC) transcription factor 007 in Picea glauca) as a major hub gene specific to earlywood formation. Reference profiling identified comprehensive, statistically robust coexpressed groups, revealing that modular organization underpins the evolutionary conservation of the transcriptome structure. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Horizontal functional gene transfer from bacteria to fishes.
Sun, Bao-Fa; Li, Tong; Xiao, Jin-Hua; Jia, Ling-Yi; Liu, Li; Zhang, Peng; Murphy, Robert W; He, Shun-Min; Huang, Da-Wei
2015-12-22
Invertebrates can acquire functional genes via horizontal gene transfer (HGT) from bacteria but fishes are not known to do so. We provide the first reliable evidence of one HGT event from marine bacteria to fishes. The HGT appears to have occurred after emergence of the teleosts. The transferred gene is expressed and regulated developmentally. Its successful integration and expression may change the genetic and metabolic repertoire of fishes. In addition, this gene contains conserved domains and similar tertiary structures in fishes and their putative donor bacteria. Thus, it may function similarly in both groups. Evolutionary analyses indicate that it evolved under purifying selection, further indicating its conserved function. We document the first likely case of HGT of functional gene from prokaryote to fishes. This discovery certifies that HGT can influence vertebrate evolution.
Lu, Chenqi; Liu, Xiaoqin; Wang, Lin; Jiang, Ning; Yu, Jun; Zhao, Xiaobo; Hu, Hairong; Zheng, Saihua; Li, Xuelian; Wang, Guiying
2017-01-10
Due to genetic heterogeneity and variable diagnostic criteria, genetic studies of polycystic ovary syndrome are particularly challenging. Furthermore, lack of sufficiently large cohorts limits the identification of susceptibility genes contributing to polycystic ovary syndrome. Here, we carried out a systematic search of studies deposited in the Gene Expression Omnibus database through August 31, 2016. The present analyses included studies with: 1) patients with polycystic ovary syndrome and normal controls, 2) gene expression profiling of messenger RNA, and 3) sufficient data for our analysis. Ultimately, a total of 9 studies with 13 datasets met the inclusion criteria and were performed for the subsequent integrated analyses. Through comprehensive analyses, there were 13 genetic factors overlapped in all datasets and identified as significant specific genes for polycystic ovary syndrome. After quality control assessment, there were six datasets remained. Further gene ontology enrichment and pathway analyses suggested that differentially expressed genes mainly enriched in oocyte pathways. These findings provide potential molecular markers for diagnosis and prognosis of polycystic ovary syndrome, and need in-depth studies on the exact function and mechanism in polycystic ovary syndrome.
Störmer, Rebecca; Wichels, Antje; Gerdts, Gunnar
2013-12-15
The dumping of dredged sediments represents a major stressor for coastal ecosystems. The impact on the ecosystem function is determined by its complexity not easy to assess. In the present study, we evaluated the potential of bacterial community analyses to act as ecological indicators in environmental monitoring programmes. We investigated the functional structure of bacterial communities, applying functional gene arrays (GeoChip4.2). The relationship between functional genes and environmental factors was analysed using distance-based multivariate multiple regression. Apparently, both the function and structure of the bacterial communities are impacted by dumping activities. The bacterial community at the dumping centre displayed a significant reduction of its entire functional diversity compared with that found at a reference site. DDX compounds separated bacterial communities of the dumping site from those of un-impacted sites. Thus, bacterial community analyses show great potential as ecological indicators in environmental monitoring. Copyright © 2013 Elsevier Ltd. All rights reserved.
Henríquez-Valencia, Carlos; Arenas-M, Anita; Medina, Joaquín; Canales, Javier
2018-01-01
Sulfur is an essential nutrient for plant growth and development. Sulfur is a constituent of proteins, the plasma membrane and cell walls, among other important cellular components. To obtain new insights into the gene regulatory networks underlying the sulfate response, we performed an integrative meta-analysis of transcriptomic data from five different sulfate experiments available in public databases. This bioinformatic approach allowed us to identify a robust set of genes whose expression depends only on sulfate availability, indicating that those genes play an important role in the sulfate response. In relation to sulfate metabolism, the biological function of approximately 45% of these genes is currently unknown. Moreover, we found several consistent Gene Ontology terms related to biological processes that have not been extensively studied in the context of the sulfate response; these processes include cell wall organization, carbohydrate metabolism, nitrogen compound transport, and the regulation of proteolysis. Gene co-expression network analyses revealed relationships between the sulfate-responsive genes that were distributed among seven function-specific co-expression modules. The most connected genes in the sulfate co-expression network belong to a module related to the carbon response, suggesting that this biological function plays an important role in the control of the sulfate response. Temporal analyses of the network suggest that sulfate starvation generates a biphasic response, which involves that major changes in gene expression occur during both the early and late responses. Network analyses predicted that the sulfate response is regulated by a limited number of transcription factors, including MYBs, bZIPs, and NF-YAs. In conclusion, our analysis identified new candidate genes and provided new hypotheses to advance our understanding of the transcriptional regulation of sulfate metabolism in plants. PMID:29692794
Henríquez-Valencia, Carlos; Arenas-M, Anita; Medina, Joaquín; Canales, Javier
2018-01-01
Sulfur is an essential nutrient for plant growth and development. Sulfur is a constituent of proteins, the plasma membrane and cell walls, among other important cellular components. To obtain new insights into the gene regulatory networks underlying the sulfate response, we performed an integrative meta-analysis of transcriptomic data from five different sulfate experiments available in public databases. This bioinformatic approach allowed us to identify a robust set of genes whose expression depends only on sulfate availability, indicating that those genes play an important role in the sulfate response. In relation to sulfate metabolism, the biological function of approximately 45% of these genes is currently unknown. Moreover, we found several consistent Gene Ontology terms related to biological processes that have not been extensively studied in the context of the sulfate response; these processes include cell wall organization, carbohydrate metabolism, nitrogen compound transport, and the regulation of proteolysis. Gene co-expression network analyses revealed relationships between the sulfate-responsive genes that were distributed among seven function-specific co-expression modules. The most connected genes in the sulfate co-expression network belong to a module related to the carbon response, suggesting that this biological function plays an important role in the control of the sulfate response. Temporal analyses of the network suggest that sulfate starvation generates a biphasic response, which involves that major changes in gene expression occur during both the early and late responses. Network analyses predicted that the sulfate response is regulated by a limited number of transcription factors, including MYBs, bZIPs, and NF-YAs. In conclusion, our analysis identified new candidate genes and provided new hypotheses to advance our understanding of the transcriptional regulation of sulfate metabolism in plants.
Trubiroha, A; Gillotay, P; Giusti, N; Gacquer, D; Libert, F; Lefort, A; Haerlingen, B; De Deken, X; Opitz, R; Costagliola, S
2018-04-04
The foregut endoderm gives rise to several organs including liver, pancreas, lung and thyroid with important roles in human physiology. Understanding which genes and signalling pathways regulate their development is crucial for understanding developmental disorders as well as diseases in adulthood. We exploited unique advantages of the zebrafish model to develop a rapid and scalable CRISPR/Cas-based mutagenesis strategy aiming at the identification of genes involved in morphogenesis and function of the thyroid. Core elements of the mutagenesis assay comprise bi-allelic gene invalidation in somatic mutants, a non-invasive monitoring of thyroid development in live transgenic fish, complementary analyses of thyroid function in fixed specimens and quantitative analyses of mutagenesis efficiency by Illumina sequencing of individual fish. We successfully validated our mutagenesis-phenotyping strategy in experiments targeting genes with known functions in early thyroid morphogenesis (pax2a, nkx2.4b) and thyroid functional differentiation (duox, duoxa, tshr). We also demonstrate that duox and duoxa crispants phenocopy thyroid phenotypes previously observed in human patients with bi-allelic DUOX2 and DUOXA2 mutations. The proposed combination of efficient mutagenesis protocols, rapid non-invasive phenotyping and sensitive genotyping holds great potential to systematically characterize the function of larger candidate gene panels during thyroid development and is applicable to other organs and tissues.
GSDM family genes meet autophagy.
Tamura, Masaru; Shiroishi, Toshihiko
2015-07-15
In the previous issue of Biochemical Journal, Shi et al. [(2015) 468, 325-336] report that Gasdermin (Gsdm) family proteins regulate autophagy activity, which is counter-balanced by the opposite functions of well-conserved N- and C-terminal domains of the proteins. The Gsdm family was originally identified as the causative gene of dominant skin mutations exhibiting alopecia. Each member of the Gsdm gene family shows characteristic expression patterns in the epithelium, which is tissue and differentiation stage-specific. Previous phenotype analyses of mutant mice, biochemical analyses of proteins and genome-wide association studies showed that the Gsdm gene family might be involved in epithelial cell development, apoptosis, inflammation, carcinogenesis and immune-related diseases. To date, however, their molecular function(s) remain unclear. Shi et al. found that mutations in the C-terminal domain of Gsdma3, a member of the Gsdm family, induce autophagy. Further studies revealed that the wild-type N-terminal domain has pro-autophagic activity and that the C-terminal domain conversely inhibits this N-terminal function. These opposite functions of the two domains were also observed in other Gsdm family members. Thus, their study provides a new insight into the function of Gsdm genes in epithelial cell lineage, causality of cancers and immune-related diseases including childhood-onset asthma. © 2015 Authors; published by Portland Press Limited.
Integrative and conjugative elements and their hosts: composition, distribution and organization.
Cury, Jean; Touchon, Marie; Rocha, Eduardo P C
2017-09-06
Conjugation of single-stranded DNA drives horizontal gene transfer between bacteria and was widely studied in conjugative plasmids. The organization and function of integrative and conjugative elements (ICE), even if they are more abundant, was only studied in a few model systems. Comparative genomics of ICE has been precluded by the difficulty in finding and delimiting these elements. Here, we present the results of a method that circumvents these problems by requiring only the identification of the conjugation genes and the species' pan-genome. We delimited 200 ICEs and this allowed the first large-scale characterization of these elements. We quantified the presence in ICEs of a wide set of functions associated with the biology of mobile genetic elements, including some that are typically associated with plasmids, such as partition and replication. Protein sequence similarity networks and phylogenetic analyses revealed that ICEs are structured in functional modules. Integrases and conjugation systems have different evolutionary histories, even if the gene repertoires of ICEs can be grouped in function of conjugation types. Our characterization of the composition and organization of ICEs paves the way for future functional and evolutionary analyses of their cargo genes, composed of a majority of unknown function genes. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Brezas, Andreas; Snekvik, Kevin R.; Hardy, Ronald W.; Overturf, Ken
2017-01-01
Finding suitable alternative protein sources for diets of carnivorous fish species remains a major concern for sustainable aquaculture. Through genetic selection, we created a strain of rainbow trout that outperforms parental lines in utilizing an all-plant protein diet and does not develop enteritis in the distal intestine, as is typical with salmonids on long-term plant protein-based feeds. By incorporating this strain into functional analyses, we set out to determine which genes are critical to plant protein utilization in the absence of gut inflammation. After a 12-week feeding trial with our selected strain and a control trout strain fed either a fishmeal-based diet or an all-plant protein diet, high-throughput RNA sequencing was completed on both liver and muscle tissues. Differential gene expression analyses, weighted correlation network analyses and further functional characterization were performed. A strain-by-diet design revealed differential expression ranging from a few dozen to over one thousand genes among the various comparisons and tissues. Major gene ontology groups identified between comparisons included those encompassing central, intermediary and foreign molecule metabolism, associated biosynthetic pathways as well as immunity. A systems approach indicated that genes involved in purine metabolism were highly perturbed. Systems analysis among the tissues tested further suggests the interplay between selection for growth, dietary utilization and protein tolerance may also have implications for nonspecific immunity. By combining data from differential gene expression and co-expression networks using selected trout, along with ontology and pathway analyses, a set of 63 candidate genes for plant diet tolerance was found. Risk loci in human inflammatory bowel diseases were also found in our datasets, indicating rainbow trout selected for plant-diet tolerance may have added utility as a potential biomedical model. PMID:28723948
Bessonov, Kyrylo; Walkey, Christopher J.; Shelp, Barry J.; van Vuuren, Hennie J. J.; Chiu, David; van der Merwe, George
2013-01-01
Analyzing time-course expression data captured in microarray datasets is a complex undertaking as the vast and complex data space is represented by a relatively low number of samples as compared to thousands of available genes. Here, we developed the Interdependent Correlation Clustering (ICC) method to analyze relationships that exist among genes conditioned on the expression of a specific target gene in microarray data. Based on Correlation Clustering, the ICC method analyzes a large set of correlation values related to gene expression profiles extracted from given microarray datasets. ICC can be applied to any microarray dataset and any target gene. We applied this method to microarray data generated from wine fermentations and selected NSF1, which encodes a C2H2 zinc finger-type transcription factor, as the target gene. The validity of the method was verified by accurate identifications of the previously known functional roles of NSF1. In addition, we identified and verified potential new functions for this gene; specifically, NSF1 is a negative regulator for the expression of sulfur metabolism genes, the nuclear localization of Nsf1 protein (Nsf1p) is controlled in a sulfur-dependent manner, and the transcription of NSF1 is regulated by Met4p, an important transcriptional activator of sulfur metabolism genes. The inter-disciplinary approach adopted here highlighted the accuracy and relevancy of the ICC method in mining for novel gene functions using complex microarray datasets with a limited number of samples. PMID:24130853
Tian, Honglai; Guan, Donghui; Li, Jianmin
2018-06-01
Osteosarcoma (OS), the most common malignant bone tumor, accounts for the heavy healthy threat in the period of children and adolescents. OS occurrence usually correlates with early metastasis and high death rate. This study aimed to better understand the mechanism of OS metastasis.Based on Gene Expression Omnibus (GEO) database, we downloaded 4 expression profile data sets associated with OS metastasis, and selected differential expressed genes. Weighted gene co-expression network analysis (WGCNA) approach allowed us to investigate the most OS metastasis-correlated module. Gene Ontology functional and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were used to give annotation of selected OS metastasis-associated genes.We select 897 differential expressed genes from OS metastasis and OS non-metastasis groups. Based on these selected genes, WGCNA further explored 142 genes included in the most OS metastasis-correlated module. Gene Ontology functional and KEGG pathway enrichment analyses showed that significantly OS metastasis-associated genes were involved in pathway correlated with insulin-like growth factor binding.Our research figured out several potential molecules participating in metastasis process and factors acting as biomarker. With this study, we could better explore the mechanism of OS metastasis and further discover more therapy targets.
Liang, Yuting; Van Nostrand, Joy D.; N′Guessan, Lucie A.; Peacock, Aaron D.; Deng, Ye; Long, Philip E.; Resch, C. Tom; Wu, Liyou; He, Zhili; Li, Guanghe; Hazen, Terry C.; Lovley, Derek R.
2012-01-01
To better understand the microbial functional diversity changes with subsurface redox conditions during in situ uranium bioremediation, key functional genes were studied with GeoChip, a comprehensive functional gene microarray, in field experiments at a uranium mill tailings remedial action (UMTRA) site (Rifle, CO). The results indicated that functional microbial communities altered with a shift in the dominant metabolic process, as documented by hierarchical cluster and ordination analyses of all detected functional genes. The abundance of dsrAB genes (dissimilatory sulfite reductase genes) and methane generation-related mcr genes (methyl coenzyme M reductase coding genes) increased when redox conditions shifted from Fe-reducing to sulfate-reducing conditions. The cytochrome genes detected were primarily from Geobacter sp. and decreased with lower subsurface redox conditions. Statistical analysis of environmental parameters and functional genes indicated that acetate, U(VI), and redox potential (Eh) were the most significant geochemical variables linked to microbial functional gene structures, and changes in microbial functional diversity were strongly related to the dominant terminal electron-accepting process following acetate addition. The study indicates that the microbial functional genes clearly reflect the in situ redox conditions and the dominant microbial processes, which in turn influence uranium bioreduction. Microbial functional genes thus could be very useful for tracking microbial community structure and dynamics during bioremediation. PMID:22327592
New genes often acquire male-specific functions but rarely become essential in Drosophila.
Kondo, Shu; Vedanayagam, Jeffrey; Mohammed, Jaaved; Eizadshenass, Sogol; Kan, Lijuan; Pang, Nan; Aradhya, Rajaguru; Siepel, Adam; Steinhauer, Josefa; Lai, Eric C
2017-09-15
Relatively little is known about the in vivo functions of newly emerging genes, especially in metazoans. Although prior RNAi studies reported prevalent lethality among young gene knockdowns, our phylogenomic analyses reveal that young Drosophila genes are frequently restricted to the nonessential male reproductive system. We performed large-scale CRISPR/Cas9 mutagenesis of "conserved, essential" and "young, RNAi-lethal" genes and broadly confirmed the lethality of the former but the viability of the latter. Nevertheless, certain young gene mutants exhibit defective spermatogenesis and/or male sterility. Moreover, we detected widespread signatures of positive selection on young male-biased genes. Thus, young genes have a preferential impact on male reproductive system function. © 2017 Kondo et al.; Published by Cold Spring Harbor Laboratory Press.
Rare copy number variants in patients with congenital conotruncal heart defects.
Xie, Hongbo M; Werner, Petra; Stambolian, Dwight; Bailey-Wilson, Joan E; Hakonarson, Hakon; White, Peter S; Taylor, Deanne M; Goldmuntz, Elizabeth
2017-03-01
Previous studies using different cardiac phenotypes, technologies and designs suggest a burden of large, rare or de novo copy number variants (CNVs) in subjects with congenital heart defects. We sought to identify disease-related CNVs, candidate genes, and functional pathways in a large number of cases with conotruncal and related defects that carried no known genetic syndrome. Cases and control samples were divided into two cohorts and genotyped to assess each subject's CNV content. Analyses were performed to ascertain differences in overall CNV prevalence and to identify enrichment of specific genes and functional pathways in conotruncal cases relative to healthy controls. Only findings present in both cohorts are presented. From 973 total conotruncal cases, a burden of rare CNVs was detected in both cohorts. Candidate genes from rare CNVs found in both cohorts were identified based on their association with cardiac development or disease, and/or their reported disruption in published studies. Functional and pathway analyses revealed significant enrichment of terms involved in either heart or early embryonic development. Our study tested one of the largest cohorts specifically with cardiac conotruncal and related defects. These results confirm and extend previous findings that CNVs contribute to disease risk for congenital heart defects in general and conotruncal defects in particular. As disease heterogeneity renders identification of single recurrent genes or loci difficult, functional pathway and gene regulation network analyses appear to be more informative. Birth Defects Research 109:271-295, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Structure and transcriptional regulation of the major intrinsic protein gene family in grapevine.
Wong, Darren Chern Jan; Zhang, Li; Merlin, Isabelle; Castellarin, Simone D; Gambetta, Gregory A
2018-04-11
The major intrinsic protein (MIP) family is a family of proteins, including aquaporins, which facilitate water and small molecule transport across plasma membranes. In plants, MIPs function in a huge variety of processes including water transport, growth, stress response, and fruit development. In this study, we characterize the structure and transcriptional regulation of the MIP family in grapevine, describing the putative genome duplication events leading to the family structure and characterizing the family's tissue and developmental specific expression patterns across numerous preexisting microarray and RNAseq datasets. Gene co-expression network (GCN) analyses were carried out across these datasets and the promoters of each family member were analyzed for cis-regulatory element structure in order to provide insight into their transcriptional regulation. A total of 29 Vitis vinifera MIP family members (excluding putative pseudogenes) were identified of which all but two were mapped onto Vitis vinifera chromosomes. In this study, segmental duplication events were identified for five plasma membrane intrinsic protein (PIP) and four tonoplast intrinsic protein (TIP) genes, contributing to the expansion of PIPs and TIPs in grapevine. Grapevine MIP family members have distinct tissue and developmental expression patterns and hierarchical clustering revealed two primary groups regardless of the datasets analyzed. Composite microarray and RNA-seq gene co-expression networks (GCNs) highlighted the relationships between MIP genes and functional categories involved in cell wall modification and transport, as well as with other MIPs revealing a strong co-regulation within the family itself. Some duplicated MIP family members have undergone sub-functionalization and exhibit distinct expression patterns and GCNs. Cis-regulatory element (CRE) analyses of the MIP promoters and their associated GCN members revealed enrichment for numerous CREs including AP2/ERFs and NACs. Combining phylogenetic analyses, gene expression profiling, gene co-expression network analyses, and cis-regulatory element enrichment, this study provides a comprehensive overview of the structure and transcriptional regulation of the grapevine MIP family. The study highlights the duplication and sub-functionalization of the family, its strong coordinated expression with genes involved in growth and transport, and the putative classes of TFs responsible for its regulation.
Genomic analysis of expressed sequence tags in American black bear Ursus americanus
2010-01-01
Background Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Results Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. Conclusion We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes. PMID:20338065
Genomic analysis of expressed sequence tags in American black bear Ursus americanus.
Zhao, Sen; Shao, Chunxuan; Goropashnaya, Anna V; Stewart, Nathan C; Xu, Yichi; Tøien, Øivind; Barnes, Brian M; Fedorov, Vadim B; Yan, Jun
2010-03-26
Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes.
Zahn, L M; Leebens-Mack, J; DePamphilis, C W; Ma, H; Theissen, G
2005-01-01
DEFICIENS (DEF) and GLOBOSA (GLO) function in petal and stamen organ identity in Antirrhinum and are orthologs of APETALA3 and PISTILLATA in Arabidopsis. These genes are known as B-function genes for their role in the ABC genetic model of floral organ identity. Phylogenetic analyses show that DEF and GLO are closely related paralogs, having originated from a gene duplication event after the separation of the lineages leading to the extant gymnosperms and the extant angiosperms. Several additional gene duplications followed, providing multiple potential opportunities for functional divergence. In most angiosperms studied to date, genes in the DEF/GLO MADS-box subfamily are expressed in the petals and stamens during flower development. However, in some angiosperms, the expression of DEF and GLO orthologs are occasionally observed in the first and fourth whorls of flowers or in nonfloral organs, where their function is unknown. In this article we review what is known about function, phylogeny, and expression in the DEF/GLO subfamily to examine their evolution in the angiosperms. Our analyses demonstrate that although the primary role of the DEF/GLO subfamily appears to be in specifying the stamens and inner perianth, several examples of potential sub- and neofunctionalization are observed.
Lee, Ann-Ying; Chen, Chun-Yi; Chang, Yao-Chien Alex; Chao, Ya-Ting; Shih, Ming-Che
2013-01-01
Previously we developed genomic resources for orchids, including transcriptomic analyses using next-generation sequencing techniques and construction of a web-based orchid genomic database. Here, we report a modified molecular model of flower development in the Orchidaceae based on functional analysis of gene expression profiles in Phalaenopsis aphrodite (a moth orchid) that revealed novel roles for the transcription factors involved in floral organ pattern formation. Phalaenopsis orchid floral organ-specific genes were identified by microarray analysis. Several critical transcription factors including AP3, PI, AP1 and AGL6, displayed distinct spatial distribution patterns. Phylogenetic analysis of orchid MADS box genes was conducted to infer the evolutionary relationship among floral organ-specific genes. The results suggest that gene duplication MADS box genes in orchid may have resulted in their gaining novel functions during evolution. Based on these analyses, a modified model of orchid flowering was proposed. Comparison of the expression profiles of flowers of a peloric mutant and wild-type Phalaenopsis orchid further identified genes associated with lip morphology and peloric effects. Large scale investigation of gene expression profiles revealed that homeotic genes from the ABCDE model of flower development classes A and B in the Phalaenopsis orchid have novel functions due to evolutionary diversification, and display differential expression patterns. PMID:24265826
DOE Office of Scientific and Technical Information (OSTI.GOV)
Edward DeLong
2011-10-07
Our overarching goals in this project were to: Develop and improve high-throughput sequencing methods and analytical approaches for quantitative analyses of microbial gene expression at the Hawaii Ocean Time Series Station and the Bermuda Atlantic Time Series Station; Conduct field analyses following gene expression patterns in picoplankton microbial communities in general, and Prochlorococcus flow sorted from that community, as they respond to different environmental variables (light, macronutrients, dissolved organic carbon), that are predicted to influence activity, productivity, and carbon cycling; Use the expression analyses of flow sorted Prochlorococcus to identify horizontally transferred genes and gene products, in particular those thatmore » are located in genomic islands and likely to confer habitat-specific fitness advantages; Use the microbial community gene expression data that we generate to gain insights, and test hypotheses, about the variability, genomic context, activity and function of as yet uncharacterized gene products, that appear highly expressed in the environment. We achieved the above goals, and even more over the course of the project. This includes a number of novel methodological developments, as well as the standardization of microbial community gene expression analyses in both field surveys, and experimental modalities. The availability of these methods, tools and approaches is changing current practice in microbial community analyses.« less
Xu, H; Li, C; Zeng, Q; Agrawal, I; Zhu, X; Gong, Z
2016-06-01
In this study, to systematically identify the most stably expressed genes for internal reference in zebrafish Danio rerio investigations, 37 D. rerio transcriptomic datasets (both RNA sequencing and microarray data) were collected from gene expression omnibus (GEO) database and unpublished data, and gene expression variations were analysed under three experimental conditions: tissue types, developmental stages and chemical treatments. Forty-four putative candidate genes were identified with the c.v. <0·2 from all datasets. Following clustering into different functional groups, 21 genes, in addition to four conventional housekeeping genes (eef1a1l1, b2m, hrpt1l and actb1), were selected from different functional groups for further quantitative real-time (qrt-)PCR validation using 25 RNA samples from different adult tissues, developmental stages and chemical treatments. The qrt-PCR data were then analysed using the statistical algorithm refFinder for gene expression stability. Several new candidate genes showed better expression stability than the conventional housekeeping genes in all three categories. It was found that sep15 and metap1 were the top two stable genes for tissue types, ube2a and tmem50a the top two for different developmental stages, and rpl13a and rp1p0 the top two for chemical treatments. Thus, based on the extensive transcriptomic analyses and qrt-PCR validation, these new reference genes are recommended for normalization of D. rerio qrt-PCR data respectively for the three different experimental conditions. © 2016 The Fisheries Society of the British Isles.
Functional Analyses of the Crohn's Disease Risk Gene LACC1.
Assadi, Ghazaleh; Vesterlund, Liselotte; Bonfiglio, Ferdinando; Mazzurana, Luca; Cordeddu, Lina; Schepis, Danika; Mjösberg, Jenny; Ruhrmann, Sabrina; Fabbri, Alessia; Vukojevic, Vladana; Percipalle, Piergiorgio; Salomons, Florian A; Laurencikiene, Jurga; Törkvist, Leif; Halfvarson, Jonas; D'Amato, Mauro
2016-01-01
Genetic variation in the Laccase (multicopper oxidoreductase) domain-containing 1 (LACC1) gene has been shown to affect the risk of Crohn's disease, leprosy and, more recently, ulcerative colitis and juvenile idiopathic arthritis. LACC1 function appears to promote fatty-acid oxidation, with concomitant inflammasome activation, reactive oxygen species production, and anti-bacterial responses in macrophages. We sought to contribute to elucidating LACC1 biological function by extensive characterization of its expression in human tissues and cells, and through preliminary analyses of the regulatory mechanisms driving such expression. We implemented Western blot, quantitative real-time PCR, immunofluorescence microscopy, and flow cytometry analyses to investigate fatty acid metabolism-immune nexus (FAMIN; the LACC1 encoded protein) expression in subcellular compartments, cell lines and relevant human tissues. Gene-set enrichment analyses were performed to initially investigate modulatory mechanisms of LACC1 expression. A small-interference RNA knockdown in vitro model system was used to study the effect of FAMIN depletion on peroxisome function. FAMIN expression was detected in macrophage-differentiated THP-1 cells and several human tissues, being highest in neutrophils, monocytes/macrophages, myeloid and plasmacytoid dendritic cells among peripheral blood cells. Subcellular co-localization was exclusively confined to peroxisomes, with some additional positivity for organelle endomembrane structures. LACC1 co-expression signatures were enriched for genes involved in peroxisome proliferator-activated receptors (PPAR) signaling pathways, and PPAR ligands downregulated FAMIN expression in in vitro model systems. FAMIN is a peroxisome-associated protein with primary role(s) in macrophages and other immune cells, where its metabolic functions may be modulated by PPAR signaling events. However, the precise molecular mechanisms through which FAMIN exerts its biological effects in immune cells remain to be elucidated.
ISAAC - InterSpecies Analysing Application using Containers.
Baier, Herbert; Schultz, Jörg
2014-01-15
Information about genes, transcripts and proteins is spread over a wide variety of databases. Different tools have been developed using these databases to identify biological signals in gene lists from large scale analysis. Mostly, they search for enrichments of specific features. But, these tools do not allow an explorative walk through different views and to change the gene lists according to newly upcoming stories. To fill this niche, we have developed ISAAC, the InterSpecies Analysing Application using Containers. The central idea of this web based tool is to enable the analysis of sets of genes, transcripts and proteins under different biological viewpoints and to interactively modify these sets at any point of the analysis. Detailed history and snapshot information allows tracing each action. Furthermore, one can easily switch back to previous states and perform new analyses. Currently, sets can be viewed in the context of genomes, protein functions, protein interactions, pathways, regulation, diseases and drugs. Additionally, users can switch between species with an automatic, orthology based translation of existing gene sets. As todays research usually is performed in larger teams and consortia, ISAAC provides group based functionalities. Here, sets as well as results of analyses can be exchanged between members of groups. ISAAC fills the gap between primary databases and tools for the analysis of large gene lists. With its highly modular, JavaEE based design, the implementation of new modules is straight forward. Furthermore, ISAAC comes with an extensive web-based administration interface including tools for the integration of third party data. Thus, a local installation is easily feasible. In summary, ISAAC is tailor made for highly explorative interactive analyses of gene, transcript and protein sets in a collaborative environment.
The Chlamydomonas Genome Reveals the Evolution of Key Animal and Plant Functions
Merchant, Sabeeha S.; Prochnik, Simon E.; Vallon, Olivier; Harris, Elizabeth H.; Karpowicz, Steven J.; Witman, George B.; Terry, Astrid; Salamov, Asaf; Fritz-Laylin, Lillian K.; Maréchal-Drouard, Laurence; Marshall, Wallace F.; Qu, Liang-Hu; Nelson, David R.; Sanderfoot, Anton A.; Spalding, Martin H.; Kapitonov, Vladimir V.; Ren, Qinghu; Ferris, Patrick; Lindquist, Erika; Shapiro, Harris; Lucas, Susan M.; Grimwood, Jane; Schmutz, Jeremy; Cardol, Pierre; Cerutti, Heriberto; Chanfreau, Guillaume; Chen, Chun-Long; Cognat, Valérie; Croft, Martin T.; Dent, Rachel; Dutcher, Susan; Fernández, Emilio; Ferris, Patrick; Fukuzawa, Hideya; González-Ballester, David; González-Halphen, Diego; Hallmann, Armin; Hanikenne, Marc; Hippler, Michael; Inwood, William; Jabbari, Kamel; Kalanon, Ming; Kuras, Richard; Lefebvre, Paul A.; Lemaire, Stéphane D.; Lobanov, Alexey V.; Lohr, Martin; Manuell, Andrea; Meier, Iris; Mets, Laurens; Mittag, Maria; Mittelmeier, Telsa; Moroney, James V.; Moseley, Jeffrey; Napoli, Carolyn; Nedelcu, Aurora M.; Niyogi, Krishna; Novoselov, Sergey V.; Paulsen, Ian T.; Pazour, Greg; Purton, Saul; Ral, Jean-Philippe; Riaño-Pachón, Diego Mauricio; Riekhof, Wayne; Rymarquis, Linda; Schroda, Michael; Stern, David; Umen, James; Willows, Robert; Wilson, Nedra; Zimmer, Sara Lana; Allmer, Jens; Balk, Janneke; Bisova, Katerina; Chen, Chong-Jian; Elias, Marek; Gendler, Karla; Hauser, Charles; Lamb, Mary Rose; Ledford, Heidi; Long, Joanne C.; Minagawa, Jun; Page, M. Dudley; Pan, Junmin; Pootakham, Wirulda; Roje, Sanja; Rose, Annkatrin; Stahlberg, Eric; Terauchi, Aimee M.; Yang, Pinfen; Ball, Steven; Bowler, Chris; Dieckmann, Carol L.; Gladyshev, Vadim N.; Green, Pamela; Jorgensen, Richard; Mayfield, Stephen; Mueller-Roeber, Bernd; Rajamani, Sathish; Sayre, Richard T.; Brokstein, Peter; Dubchak, Inna; Goodstein, David; Hornick, Leila; Huang, Y. Wayne; Jhaveri, Jinal; Luo, Yigong; Martínez, Diego; Ngau, Wing Chi Abby; Otillar, Bobby; Poliakov, Alexander; Porter, Aaron; Szajkowski, Lukasz; Werner, Gregory; Zhou, Kemin; Grigoriev, Igor V.; Rokhsar, Daniel S.; Grossman, Arthur R.
2010-01-01
Chlamydomonas reinhardtii is a unicellular green alga whose lineage diverged from land plants over 1 billion years ago. It is a model system for studying chloroplast-based photosynthesis, as well as the structure, assembly, and function of eukaryotic flagella (cilia), which were inherited from the common ancestor of plants and animals, but lost in land plants. We sequenced the ∼120-megabase nuclear genome of Chlamydomonas and performed comparative phylogenomic analyses, identifying genes encoding uncharacterized proteins that are likely associated with the function and biogenesis of chloroplasts or eukaryotic flagella. Analyses of the Chlamydomonas genome advance our understanding of the ancestral eukaryotic cell, reveal previously unknown genes associated with photosynthetic and flagellar functions, and establish links between ciliopathy and the composition and function of flagella. PMID:17932292
Different functional classes of genes are characterized by different compositional properties.
D'Onofrio, Giuseppe; Ghosh, Tapash Chandra; Saccone, Salvatore
2007-12-22
A compositional analysis on a set of human genes classified in several functional classes was performed. We found out that the GC3, i.e. the GC level at the third codon positions, of the genes involved in cellular metabolism was significantly higher than those involved in information storage and processing. Analyses of human/Xenopus ortologous genes showed that: (i) the GC3 increment of the genes involved in cellular metabolism was significantly higher than those involved in information storage and processing; and (ii) a strong correlation between the GC3 and the corresponding GCi, i.e. the GC level of introns, was found in each functional class. The non-randomness of the GC increments favours the selective hypothesis of gene/genome evolution.
González, Carolina; Lazcano, Marcelo; Valdés, Jorge; Holmes, David S.
2016-01-01
Using phylogenomic and gene compositional analyses, five highly conserved gene families have been detected in the core genome of the phylogenetically coherent genus Acidithiobacillus of the class Acidithiobacillia. These core gene families are absent in the closest extant genus Thermithiobacillus tepidarius that subtends the Acidithiobacillus genus and roots the deepest in this class. The predicted proteins encoded by these core gene families are not detected by a BLAST search in the NCBI non-redundant database of more than 90 million proteins using a relaxed cut-off of 1.0e−5. None of the five families has a clear functional prediction. However, bioinformatic scrutiny, using pI prediction, motif/domain searches, cellular location predictions, genomic context analyses, and chromosome topology studies together with previously published transcriptomic and proteomic data, suggests that some may have functions associated with membrane remodeling during cell division perhaps in response to pH stress. Despite the high level of amino acid sequence conservation within each family, there is sufficient nucleotide variation of the respective genes to permit the use of the DNA sequences to distinguish different species of Acidithiobacillus, making them useful additions to the armamentarium of tools for phylogenetic analysis. Since the protein families are unique to the Acidithiobacillus genus, they can also be leveraged as probes to detect the genus in environmental metagenomes and metatranscriptomes, including industrial biomining operations, and acid mine drainage (AMD). PMID:28082953
González, Carolina; Lazcano, Marcelo; Valdés, Jorge; Holmes, David S
2016-01-01
Using phylogenomic and gene compositional analyses, five highly conserved gene families have been detected in the core genome of the phylogenetically coherent genus Acidithiobacillus of the class Acidithiobacillia . These core gene families are absent in the closest extant genus Thermithiobacillus tepidarius that subtends the Acidithiobacillus genus and roots the deepest in this class. The predicted proteins encoded by these core gene families are not detected by a BLAST search in the NCBI non-redundant database of more than 90 million proteins using a relaxed cut-off of 1.0e -5 . None of the five families has a clear functional prediction. However, bioinformatic scrutiny, using pI prediction, motif/domain searches, cellular location predictions, genomic context analyses, and chromosome topology studies together with previously published transcriptomic and proteomic data, suggests that some may have functions associated with membrane remodeling during cell division perhaps in response to pH stress. Despite the high level of amino acid sequence conservation within each family, there is sufficient nucleotide variation of the respective genes to permit the use of the DNA sequences to distinguish different species of Acidithiobacillus , making them useful additions to the armamentarium of tools for phylogenetic analysis. Since the protein families are unique to the Acidithiobacillus genus, they can also be leveraged as probes to detect the genus in environmental metagenomes and metatranscriptomes, including industrial biomining operations, and acid mine drainage (AMD).
Integration of biological networks and gene expression data using Cytoscape
Cline, Melissa S; Smoot, Michael; Cerami, Ethan; Kuchinsky, Allan; Landys, Nerius; Workman, Chris; Christmas, Rowan; Avila-Campilo, Iliana; Creech, Michael; Gross, Benjamin; Hanspers, Kristina; Isserlin, Ruth; Kelley, Ryan; Killcoyne, Sarah; Lotia, Samad; Maere, Steven; Morris, John; Ono, Keiichiro; Pavlovic, Vuk; Pico, Alexander R; Vailaya, Aditya; Wang, Peng-Liang; Adler, Annette; Conklin, Bruce R; Hood, Leroy; Kuiper, Martin; Sander, Chris; Schmulevich, Ilya; Schwikowski, Benno; Warner, Guy J; Ideker, Trey; Bader, Gary D
2013-01-01
Cytoscape is a free software package for visualizing, modeling and analyzing molecular and genetic interaction networks. This protocol explains how to use Cytoscape to analyze the results of mRNA expression profiling, and other functional genomics and proteomics experiments, in the context of an interaction network obtained for genes of interest. Five major steps are described: (i) obtaining a gene or protein network, (ii) displaying the network using layout algorithms, (iii) integrating with gene expression and other functional attributes, (iv) identifying putative complexes and functional modules and (v) identifying enriched Gene Ontology annotations in the network. These steps provide a broad sample of the types of analyses performed by Cytoscape. PMID:17947979
Genome-wide differential gene expression in immortalized DF-1 chicken embryo fibroblast cell line
2011-01-01
Background When compared to primary chicken embryo fibroblast (CEF) cells, the immortal DF-1 CEF line exhibits enhanced growth rates and susceptibility to oxidative stress. Although genes responsible for cell cycle regulation and antioxidant functions have been identified, the genome-wide transcription profile of immortal DF-1 CEF cells has not been previously reported. Global gene expression in primary CEF and DF-1 cells was performed using a 4X44K chicken oligo microarray. Results A total of 3876 differentially expressed genes were identified with a 2 fold level cutoff that included 1706 up-regulated and 2170 down-regulated genes in DF-1 cells. Network and functional analyses using Ingenuity Pathways Analysis (IPA, Ingenuity® Systems, http://www.ingenuity.com) revealed that 902 of 3876 differentially expressed genes were classified into a number of functional groups including cellular growth and proliferation, cell cycle, cellular movement, cancer, genetic disorders, and cell death. Also, the top 5 gene networks with intermolecular connections were identified. Bioinformatic analyses suggested that DF-1 cells were characterized by enhanced molecular mechanisms for cell cycle progression and proliferation, suppressing cell death pathways, altered cellular morphogenesis, and accelerated capacity for molecule transport. Key molecules for these functions include E2F1, BRCA1, SRC, CASP3, and the peroxidases. Conclusions The global gene expression profiles provide insight into the cellular mechanisms that regulate the unique characteristics observed in immortal DF-1 CEF cells. PMID:22111699
Loss of delta catenin function in severe autism
Turner, Tychele N.; Sharma, Kamal; Oh, Edwin C.; Liu, Yangfan P.; Collins, Ryan L.; Sosa, Maria X.; Auer, Dallas R.; Brand, Harrison; Sanders, Stephan J.; Moreno-De-Luca, Daniel; Pihur, Vasyl; Plona, Teri; Pike, Kristen; Soppet, Daniel R.; Smith, Michael W.; Cheung, Sau Wai; Martin, Christa Lese; State, Matthew W.; Talkowski, Michael E.; Cook, Edwin; Huganir, Richard; Katsanis, Nicholas; Chakravarti, Aravinda
2015-01-01
SUMMARY Autism is a multifactorial neurodevelopmental disorder affecting more males than females; consequently, under a multifactorial genetic hypothesis, females are affected only when they cross a higher biological threshold. We hypothesize that deleterious variants at conserved residues are enriched in severely affected patients arising from FEMFs (female-enriched multiplex families) with severe disease, enhancing the detection of key autism genes in modest numbers of cases. We show the utility of this strategy by identifying missense and dosage sequence variants in the gene encoding the adhesive junction-associated delta catenin protein (CTNND2) in FEMFs and demonstrating their loss-of-function effect by functional analyses in zebrafish embryos and cultured hippocampal neurons from wildtype and Ctnnd2 null mouse embryos. Finally, through gene expression and network analyses, we highlight a critical role for CTNND2 in neuronal development and an intimate connection to chromatin biology. Our data contribute to the understanding of the genetic architecture of autism and suggest that genetic analyses of phenotypic extremes, such as FEMFs, are of innate value in multifactorial disorders. PMID:25807484
Genome Editing in the Cricket, Gryllus bimaculatus.
Watanabe, Takahito; Noji, Sumihare; Mito, Taro
2017-01-01
Hemimetabolous, or incompletely metamorphosing, insects are phylogenetically basal and include many beneficial and deleterious species. The cricket, Gryllus bimaculatus, is an emerging model for hemimetabolous insects, based on the success of RNA interference (RNAi)-based gene-functional analyses and transgenic technology. Taking advantage of genome editing technologies in this species would greatly promote functional genomics studies. Genome editing has proven to be an effective method for site-specific genome manipulation in various species. Here, we describe a protocol for genome editing including gene knockout and gene knockin in G. bimaculatus for functional genomics studies.
Liu, Juan; Qi, Zhe-Chen; Zhao, Yun-Peng; Fu, Cheng-Xin; Jenny Xiang, Qiu-Yun
2012-09-01
The complete nucleotide sequence of the chloroplast genome (cpDNA) of Smilax china L. (Smilacaceae) is reported. It is the first complete cp genome sequence in Liliales. Genomic analyses were conducted to examine the rate and pattern of cpDNA genome evolution in Smilax relative to other major lineages of monocots. The cpDNA genomic sequences were combined with those available for Lilium to evaluate the phylogenetic position of Liliales and to investigate the influence of taxon sampling, gene sampling, gene function, natural selection, and substitution rate on phylogenetic inference in monocots. Phylogenetic analyses using sequence data of gene groups partitioned according to gene function, selection force, and total substitution rate demonstrated evident impacts of these factors on phylogenetic inference of monocots and the placement of Liliales, suggesting potential evolutionary convergence or adaptation of some cpDNA genes in monocots. Our study also demonstrated that reduced taxon sampling reduced the bootstrap support for the placement of Liliales in the cpDNA phylogenomic analysis. Analyses of sequences of 77 protein genes with some missing data and sequences of 81 genes (all protein genes plus the rRNA genes) support a sister relationship of Liliales to the commelinids-Asparagales clade, consistent with the APG III system. Analyses of 63 cpDNA protein genes for 32 taxa with few missing data, however, support a sister relationship of Liliales (represented by Smilax and Lilium) to Dioscoreales-Pandanales. Topology tests indicated that these two alignments do not significantly differ given any of these three cpDNA genomic sequence data sets. Furthermore, we found no saturation effect of the data, suggesting that the cpDNA genomic sequence data used in the study are appropriate for monocot phylogenetic study and long-branch attraction is unlikely to be the cause to explain the result of two well-supported, conflict placements of Liliales. Further analyses using sufficient nuclear data remain necessary to evaluate these two phylogenetic hypotheses regarding the position of Liliales and to address the causes of signal conflict among genes and partitions. Copyright © 2012 Elsevier Inc. All rights reserved.
USDA-ARS?s Scientific Manuscript database
The research was to elucidate the function of the ß-glucosidase of Formosan subterranean termites in vitro and in vivo. Quantitative RT-PCR analyses indicated that the gene transcript was relatively more abundant in the foraging worker caste than in other castes and salivary glands were the major ex...
Zhang, Liyuan; Gu, Lingkun; Ringler, Patricia; Smith, Stanley; Rushton, Paul J; Shen, Qingxi J
2015-07-01
Members of the WRKY transcription factor superfamily are essential for the regulation of many plant pathways. Functional redundancy due to duplications of WRKY transcription factors, however, complicates genetic analysis by allowing single-mutant plants to maintain wild-type phenotypes. Our analyses indicate that three group I WRKY genes, OsWRKY24, -53, and -70, act in a partially redundant manner. All three showed characteristics of typical WRKY transcription factors: each localized to nuclei and yeast one-hybrid assays indicated that they all bind to W-boxes, including those present in their own promoters. Quantitative real time-PCR (qRT-PCR) analyses indicated that the expression levels of the three WRKY genes varied in the different tissues tested. Particle bombardment-mediated transient expression analyses indicated that all three genes repress the GA and ABA signaling in a dosage-dependent manner. Combination of all three WRKY genes showed additive antagonism of ABA and GA signaling. These results suggest that these WRKY proteins function as negative transcriptional regulators of GA and ABA signaling. However, different combinations of these WRKY genes can lead to varied strengths in suppression of their targets. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Yin, Wei; Wang, Zong-ji; Li, Qi-ye; Lian, Jin-ming; Zhou, Yang; Lu, Bing-zheng; Jin, Li-jun; Qiu, Peng-xin; Zhang, Pei; Zhu, Wen-bo; Wen, Bo; Huang, Yi-jun; Lin, Zhi-long; Qiu, Bi-tao; Su, Xing-wen; Yang, Huan-ming; Zhang, Guo-jie; Yan, Guang-mei; Zhou, Qi
2016-01-01
Snakes have numerous features distinctive from other tetrapods and a rich history of genome evolution that is still obscure. Here, we report the high-quality genome of the five-pacer viper, Deinagkistrodon acutus, and comparative analyses with other representative snake and lizard genomes. We map the evolutionary trajectories of transposable elements (TEs), developmental genes and sex chromosomes onto the snake phylogeny. TEs exhibit dynamic lineage-specific expansion, and many viper TEs show brain-specific gene expression along with their nearby genes. We detect signatures of adaptive evolution in olfactory, venom and thermal-sensing genes and also functional degeneration of genes associated with vision and hearing. Lineage-specific relaxation of functional constraints on respective Hox and Tbx limb-patterning genes supports fossil evidence for a successive loss of forelimbs then hindlimbs during snake evolution. Finally, we infer that the ZW sex chromosome pair had undergone at least three recombination suppression events in the ancestor of advanced snakes. These results altogether forge a framework for our deep understanding into snakes' history of molecular evolution. PMID:27708285
Carlson, Kimberly A.; Gardner, Kylee; Pashaj, Anjeza; Carlson, Darby J.; Yu, Fang; Eudy, James D.; Zhang, Chi; Harshman, Lawrence G.
2015-01-01
Aging is a complex process characterized by a steady decline in an organism's ability to perform life-sustaining tasks. In the present study, two cages of approximately 12,000 mated Drosophila melanogaster females were used as a source of RNA from individuals sampled frequently as a function of age. A linear model for microarray data method was used for the microarray analysis to adjust for the box effect; it identified 1,581 candidate aging genes. Cluster analyses using a self-organizing map algorithm on the 1,581 significant genes identified gene expression patterns across different ages. Genes involved in immune system function and regulation, chorion assembly and function, and metabolism were all significantly differentially expressed as a function of age. The temporal pattern of data indicated that gene expression related to aging is affected relatively early in life span. In addition, the temporal variance in gene expression in immune function genes was compared to a random set of genes. There was an increase in the variance of gene expression within each cohort, which was not observed in the set of random genes. This observation is compatible with the hypothesis that D. melanogaster immune function genes lose control of gene expression as flies age. PMID:26090231
Rustenholz, Camille; Choulet, Frédéric; Laugier, Christel; Safár, Jan; Simková, Hana; Dolezel, Jaroslav; Magni, Federica; Scalabrin, Simone; Cattonaro, Federica; Vautrin, Sonia; Bellec, Arnaud; Bergès, Hélène; Feuillet, Catherine; Paux, Etienne
2011-12-01
To improve our understanding of the organization and regulation of the wheat (Triticum aestivum) gene space, we established a transcription map of a wheat chromosome (3B) by hybridizing a newly developed wheat expression microarray with bacterial artificial chromosome pools from a new version of the 3B physical map as well as with cDNA probes derived from 15 RNA samples. Mapping data for almost 3,000 genes showed that the gene space spans the whole chromosome 3B with a 2-fold increase of gene density toward the telomeres due to an increase in the number of genes in islands. Comparative analyses with rice (Oryza sativa) and Brachypodium distachyon revealed that these gene islands are composed mainly of genes likely originating from interchromosomal gene duplications. Gene Ontology and expression profile analyses for the 3,000 genes located along the chromosome revealed that the gene islands are enriched significantly in genes sharing the same function or expression profile, thereby suggesting that genes in islands acquired shared regulation during evolution. Only a small fraction of these clusters of cofunctional and coexpressed genes was conserved with rice and B. distachyon, indicating a recent origin. Finally, genes with the same expression profiles in remote islands (coregulation islands) were identified suggesting long-distance regulation of gene expression along the chromosomes in wheat.
Short, Michael D.; Abell, Guy C. J.; Bodrossy, Levente; van den Akker, Ben
2013-01-01
We report on the first study trialling a newly-developed, functional gene microarray (FGA) for characterising bacterial and archaeal ammonia oxidisers in activated sludge. Mixed liquor (ML) and media biofilm samples from a full-scale integrated fixed-film activated sludge (IFAS) plant were analysed with the FGA to profile the diversity and relative abundance of ammonia-oxidising archaea and bacteria (AOA and AOB respectively). FGA analyses of AOA and AOB communities revealed ubiquitous distribution of AOA across all samples – an important finding for these newly-discovered and poorly characterised organisms. Results also revealed striking differences in the functional ecology of attached versus suspended communities within the IFAS reactor. Quantitative assessment of AOB and AOA functional gene abundance revealed a dominance of AOB in the ML and approximately equal distribution of AOA and AOB in the media-attached biofilm. Subsequent correlations of functional gene abundance data with key water quality parameters suggested an important functional role for media-attached AOB in particular for IFAS reactor nitrification performance and indicate possible functional redundancy in some IFAS ammonia oxidiser communities. Results from this investigation demonstrate the capacity of the FGA to resolve subtle ecological shifts in key microbial communities in nitrifying activated sludge and indicate its value as a tool for better understanding the linkages between the ecology and performance of these engineered systems. PMID:24155925
2010-01-01
Background The biological dimensions of genes are manifold. These include genomic properties, (e.g., X/autosomal linkage, recombination) and functional properties (e.g., expression level, tissue specificity). Multiple properties, each generally of subtle influence individually, may affect the evolution of genes or merely be (auto-)correlates. Results of multidimensional analyses may reveal the relative importance of these properties on the evolution of genes, and therefore help evaluate whether these properties should be considered during analyses. While numerous properties are now considered during studies, most work still assumes the stereotypical solitary gene as commonly depicted in textbooks. Here, we investigate the Drosophila melanogaster genome to determine whether deviations from the stereotypical gene architecture correlate with other properties of genes. Results Deviations from the stereotypical gene architecture were classified as the following gene constellations: Overlapping genes were defined as those that overlap in the 5-prime, exonic, or intronic regions. Chromatin co-clustering genes were defined as genes that co-clustered within 20 kb of transcriptional territories. If this scheme is applied the stereotypical gene emerges as a rare occurrence (7.5%), slightly varied schemes yielded between ~1%-50%. Moreover, when following our scheme, paired-overlapping genes and chromatin co-clustering genes accounted for 50.1 and 42.4% of the genes analyzed, respectively. Gene constellation was a correlate of a number of functional and evolutionary properties of genes, but its statistical effect was ~1-2 orders of magnitude lower than the effects of recombination, chromosome linkage and protein function. Analysis of datasets on male reproductive proteins showed these were biased in their representation of gene constellations and evolutionary rate Ka/Ks estimates, but these biases did not overwhelm the biologically meaningful observation of high evolutionary rates of male reproductive genes. Conclusion Given the rarity of the solitary stereotypical gene, and the abundance of gene constellations that deviate from it, the presence of gene constellations, while once thought to be exceptional in large Eukaryote genomes, might have broader relevance to the understanding and study of the genome. However, according to our definition, while gene constellations can be significant correlates of functional properties of genes, they generally are weak correlates of the evolution of genes. Thus, the need for their consideration would depend on the context of studies. PMID:20497561
Xia, Wei; Wu, Jian; Deng, Fei-Yan; Wu, Long-Fei; Zhang, Yong-Hong; Guo, Yu-Fan; Lei, Shu-Feng
2017-02-01
Rheumatoid arthritis (RA) is a systemic autoimmune disease. So far, it is unclear whether there exist common RA-related genes shared in different tissues/cells. In this study, we conducted an integrative analysis on multiple datasets to identify potential shared genes that are significant in multiple tissues/cells for RA. Seven microarray gene expression datasets representing various RA-related tissues/cells were downloaded from the Gene Expression Omnibus (GEO). Statistical analyses, testing both marginal and joint effects, were conducted to identify significant genes shared in various samples. Followed-up analyses were conducted on functional annotation clustering analysis, protein-protein interaction (PPI) analysis, gene-based association analysis, and ELISA validation analysis in in-house samples. We identified 18 shared significant genes, which were mainly involved in the immune response and chemokine signaling pathway. Among the 18 genes, eight genes (PPBP, PF4, HLA-F, S100A8, RNASEH2A, P2RY6, JAG2, and PCBP1) interact with known RA genes. Two genes (HLA-F and PCBP1) are significant in gene-based association analysis (P = 1.03E-31, P = 1.30E-2, respectively). Additionally, PCBP1 also showed differential protein expression levels in in-house case-control plasma samples (P = 2.60E-2). This study represented the first effort to identify shared RA markers from different functional cells or tissues. The results suggested that one of the shared genes, i.e., PCBP1, is a promising biomarker for RA.
Evidence for a large expansion and subfunctionalisation of globin genes in sea anemones.
Smith, Hayden L; Pavasovic, Ana; Surm, Joachim M; Phillips, Matthew J; Prentis, Peter J
2018-06-27
The globin gene superfamily has been well-characterised in vertebrates, however, there has been limited research in early-diverging lineages, such as phylum Cnidaria. This study aimed to identify globin genes in multiple cnidarian lineages, and use bioinformatic approaches to characterise the evolution, structure and expression of these genes. Phylogenetic analyses and in silico protein predictions showed that all cnidarians have undergone an expansion of globin genes, which likely have a hexacoordinate protein structure. Our protein modelling has also revealed the possibility of a single pentacoordinate globin lineage in anthozoan species. Some cnidarian globin genes displayed tissue and development specific expression with very few orthologous genes similarly expressed across species. Our phylogenetic analyses also revealed that eumetazoan globin genes form a polyphyletic relationship with vertebrate globin genes. Overall, our analyses suggest that a Ngb-like and GbX-like gene were most likely present in the globin gene repertoire for the last common ancestor of eumetazoans. The identification of a large-scale expansion and subfunctionalisation of globin genes in actiniarians provides an excellent starting point to further our understanding of the evolution and function of the globin gene superfamily in early-diverging lineages.
Fitzgerald, Timothy L; Powell, Jonathan J; Stiller, Jiri; Weese, Terri L; Abe, Tomoko; Zhao, Guangyao; Jia, Jizeng; McIntyre, C Lynne; Li, Zhongyi; Manners, John M; Kazan, Kemal
2015-01-01
Reverse genetic techniques harnessing mutational approaches are powerful tools that can provide substantial insight into gene function in plants. However, as compared to diploid species, reverse genetic analyses in polyploid plants such as bread wheat can present substantial challenges associated with high levels of sequence and functional similarity amongst homoeologous loci. We previously developed a high-throughput method to identify deletions of genes within a physically mutagenized wheat population. Here we describe our efforts to combine multiple homoeologous deletions of three candidate disease susceptibility genes (TaWRKY11, TaPFT1 and TaPLDß1). We were able to produce lines featuring homozygous deletions at two of the three homoeoloci for all genes, but this was dependent on the individual mutants used in crossing. Intriguingly, despite extensive efforts, viable lines possessing homozygous deletions at all three homoeoloci could not be produced for any of the candidate genes. To investigate deletion size as a possible reason for this phenomenon, we developed an amplicon sequencing approach based on synteny to Brachypodium distachyon to assess the size of the deletions removing one candidate gene (TaPFT1) in our mutants. These analyses revealed that genomic deletions removing the locus are relatively large, resulting in the loss of multiple additional genes. The implications of this work for the use of heavy ion mutagenesis for reverse genetic analyses in wheat are discussed.
Fitzgerald, Timothy L.; Powell, Jonathan J.; Stiller, Jiri; Weese, Terri L.; Abe, Tomoko; Zhao, Guangyao; Jia, Jizeng; McIntyre, C. Lynne; Li, Zhongyi; Manners, John M.; Kazan, Kemal
2015-01-01
Reverse genetic techniques harnessing mutational approaches are powerful tools that can provide substantial insight into gene function in plants. However, as compared to diploid species, reverse genetic analyses in polyploid plants such as bread wheat can present substantial challenges associated with high levels of sequence and functional similarity amongst homoeologous loci. We previously developed a high-throughput method to identify deletions of genes within a physically mutagenized wheat population. Here we describe our efforts to combine multiple homoeologous deletions of three candidate disease susceptibility genes (TaWRKY11, TaPFT1 and TaPLDß1). We were able to produce lines featuring homozygous deletions at two of the three homoeoloci for all genes, but this was dependent on the individual mutants used in crossing. Intriguingly, despite extensive efforts, viable lines possessing homozygous deletions at all three homoeoloci could not be produced for any of the candidate genes. To investigate deletion size as a possible reason for this phenomenon, we developed an amplicon sequencing approach based on synteny to Brachypodium distachyon to assess the size of the deletions removing one candidate gene (TaPFT1) in our mutants. These analyses revealed that genomic deletions removing the locus are relatively large, resulting in the loss of multiple additional genes. The implications of this work for the use of heavy ion mutagenesis for reverse genetic analyses in wheat are discussed. PMID:25719507
CoNekT: an open-source framework for comparative genomic and transcriptomic network analyses.
Proost, Sebastian; Mutwil, Marek
2018-05-01
The recent accumulation of gene expression data in the form of RNA sequencing creates unprecedented opportunities to study gene regulation and function. Furthermore, comparative analysis of the expression data from multiple species can elucidate which functional gene modules are conserved across species, allowing the study of the evolution of these modules. However, performing such comparative analyses on raw data is not feasible for many biologists. Here, we present CoNekT (Co-expression Network Toolkit), an open source web server, that contains user-friendly tools and interactive visualizations for comparative analyses of gene expression data and co-expression networks. These tools allow analysis and cross-species comparison of (i) gene expression profiles; (ii) co-expression networks; (iii) co-expressed clusters involved in specific biological processes; (iv) tissue-specific gene expression; and (v) expression profiles of gene families. To demonstrate these features, we constructed CoNekT-Plants for green alga, seed plants and flowering plants (Picea abies, Chlamydomonas reinhardtii, Vitis vinifera, Arabidopsis thaliana, Oryza sativa, Zea mays and Solanum lycopersicum) and thus provide a web-tool with the broadest available collection of plant phyla. CoNekT-Plants is freely available from http://conekt.plant.tools, while the CoNekT source code and documentation can be found at https://github.molgen.mpg.de/proost/CoNekT/.
Hu, Valerie W.; Addington, Anjene; Hyman, Alexander
2011-01-01
The heterogeneity of symptoms associated with autism spectrum disorders (ASDs) has presented a significant challenge to genetic analyses. Even when associations with genetic variants have been identified, it has been difficult to associate them with a specific trait or characteristic of autism. Here, we report that quantitative trait analyses of ASD symptoms combined with case-control association analyses using distinct ASD subphenotypes identified on the basis of symptomatic profiles result in the identification of highly significant associations with 18 novel single nucleotide polymorphisms (SNPs). The symptom categories included deficits in language usage, non-verbal communication, social development, and play skills, as well as insistence on sameness or ritualistic behaviors. Ten of the trait-associated SNPs, or quantitative trait loci (QTL), were associated with more than one subtype, providing partial replication of the identified QTL. Notably, none of the novel SNPs is located within an exonic region, suggesting that these hereditary components of ASDs are more likely related to gene regulatory processes (or gene expression) than to structural or functional changes in gene products. Seven of the QTL reside within intergenic chromosomal regions associated with rare copy number variants that have been previously reported in autistic samples. Pathway analyses of the genes associated with the QTL identified in this study implicate neurological functions and disorders associated with autism pathophysiology. This study underscores the advantage of incorporating both quantitative traits as well as subphenotypes into large-scale genome-wide analyses of complex disorders. PMID:21556359
Influence of geogenic factors on microbial communities in metallogenic Australian soils
Reith, Frank; Brugger, Joel; Zammit, Carla M; Gregg, Adrienne L; Goldfarb, Katherine C; Andersen, Gary L; DeSantis, Todd Z; Piceno, Yvette M; Brodie, Eoin L; Lu, Zhenmei; He, Zhili; Zhou, Jizhong; Wakelin, Steven A
2012-01-01
Links between microbial community assemblages and geogenic factors were assessed in 187 soil samples collected from four metal-rich provinces across Australia. Field-fresh soils and soils incubated with soluble Au(III) complexes were analysed using three-domain multiplex-terminal restriction fragment length polymorphism, and phylogenetic (PhyloChip) and functional (GeoChip) microarrays. Geogenic factors of soils were determined using lithological-, geomorphological- and soil-mapping combined with analyses of 51 geochemical parameters. Microbial communities differed significantly between landforms, soil horizons, lithologies and also with the occurrence of underlying Au deposits. The strongest responses to these factors, and to amendment with soluble Au(III) complexes, was observed in bacterial communities. PhyloChip analyses revealed a greater abundance and diversity of Alphaproteobacteria (especially Sphingomonas spp.), and Firmicutes (Bacillus spp.) in Au-containing and Au(III)-amended soils. Analyses of potential function (GeoChip) revealed higher abundances of metal-resistance genes in metal-rich soils. For example, genes that hybridised with metal-resistance genes copA, chrA and czcA of a prevalent aurophillic bacterium, Cupriavidus metallidurans CH34, occurred only in auriferous soils. These data help establish key links between geogenic factors and the phylogeny and function within soil microbial communities. In particular, the landform, which is a crucial factor in determining soil geochemistry, strongly affected microbial community structures. PMID:22673626
Influence of geogenic factors on microbial communities in metallogenic Australian soils.
Reith, Frank; Brugger, Joel; Zammit, Carla M; Gregg, Adrienne L; Goldfarb, Katherine C; Andersen, Gary L; DeSantis, Todd Z; Piceno, Yvette M; Brodie, Eoin L; Lu, Zhenmei; He, Zhili; Zhou, Jizhong; Wakelin, Steven A
2012-11-01
Links between microbial community assemblages and geogenic factors were assessed in 187 soil samples collected from four metal-rich provinces across Australia. Field-fresh soils and soils incubated with soluble Au(III) complexes were analysed using three-domain multiplex-terminal restriction fragment length polymorphism, and phylogenetic (PhyloChip) and functional (GeoChip) microarrays. Geogenic factors of soils were determined using lithological-, geomorphological- and soil-mapping combined with analyses of 51 geochemical parameters. Microbial communities differed significantly between landforms, soil horizons, lithologies and also with the occurrence of underlying Au deposits. The strongest responses to these factors, and to amendment with soluble Au(III) complexes, was observed in bacterial communities. PhyloChip analyses revealed a greater abundance and diversity of Alphaproteobacteria (especially Sphingomonas spp.), and Firmicutes (Bacillus spp.) in Au-containing and Au(III)-amended soils. Analyses of potential function (GeoChip) revealed higher abundances of metal-resistance genes in metal-rich soils. For example, genes that hybridised with metal-resistance genes copA, chrA and czcA of a prevalent aurophillic bacterium, Cupriavidus metallidurans CH34, occurred only in auriferous soils. These data help establish key links between geogenic factors and the phylogeny and function within soil microbial communities. In particular, the landform, which is a crucial factor in determining soil geochemistry, strongly affected microbial community structures.
A systematic approach to infer biological relevance and biases of gene network structures.
Antonov, Alexey V; Tetko, Igor V; Mewes, Hans W
2006-01-10
The development of high-throughput technologies has generated the need for bioinformatics approaches to assess the biological relevance of gene networks. Although several tools have been proposed for analysing the enrichment of functional categories in a set of genes, none of them is suitable for evaluating the biological relevance of the gene network. We propose a procedure and develop a web-based resource (BIOREL) to estimate the functional bias (biological relevance) of any given genetic network by integrating different sources of biological information. The weights of the edges in the network may be either binary or continuous. These essential features make our web tool unique among many similar services. BIOREL provides standardized estimations of the network biases extracted from independent data. By the analyses of real data we demonstrate that the potential application of BIOREL ranges from various benchmarking purposes to systematic analysis of the network biology.
Gotoh, Hiroki; Zinna, Robert A; Warren, Ian; DeNieu, Michael; Niimi, Teruyuki; Dworkin, Ian; Emlen, Douglas J; Miura, Toru; Lavine, Laura C
2016-03-22
Genes in the sex determination pathway are important regulators of sexually dimorphic animal traits, including the elaborate and exaggerated male ornaments and weapons of sexual selection. In this study, we identified and functionally analyzed members of the sex determination gene family in the golden metallic stag beetle Cyclommatus metallifer, which exhibits extreme differences in mandible size between males and females. We constructed a C. metallifer transcriptomic database from larval and prepupal developmental stages and tissues of both males and females. Using Roche 454 pyrosequencing, we generated a de novo assembled database from a total of 1,223,516 raw reads, which resulted in 14,565 isotigs (putative transcript isoforms) contained in 10,794 isogroups (putative identified genes). We queried this database for C. metallifer conserved sex determination genes and identified 14 candidate sex determination pathway genes. We then characterized the roles of several of these genes in development of extreme sexual dimorphic traits in this species. We performed molecular expression analyses with RT-PCR and functional analyses using RNAi on three C. metallifer candidate genes--Sex-lethal (CmSxl), transformer-2 (Cmtra2), and intersex (Cmix). No differences in expression pattern were found between the sexes for any of these three genes. In the RNAi gene-knockdown experiments, we found that only the Cmix had any effect on sexually dimorphic morphology, and these mimicked the effects of Cmdsx knockdown in females. Knockdown of CmSxl had no measurable effects on stag beetle phenotype, while knockdown of Cmtra2 resulted in complete lethality at the prepupal period. These results indicate that the roles of CmSxl and Cmtra2 in the sex determination cascade are likely to have diverged in stag beetles when compared to Drosophila. Our results also suggest that Cmix has a conserved role in this pathway. In addition to those three genes, we also performed a more complete functional analysis of the C. metallifer dsx gene (Cmdsx) to identify the isoforms that regulate dimorphism more fully using exon-specific RNAi. We identified a total of 16 alternative splice variants of the Cmdsx gene that code for up to 14 separate exons. Despite the variation in RNA splice products of the Cmdsx gene, only four protein isoforms are predicted. The results of our exon-specific RNAi indicated that the essential CmDsx isoform for postembryonic male differentiation is CmDsxB, whereas postembryonic female specific differentiation is mainly regulated by CmDsxD. Taken together, our results highlight the importance of studying the function of highly conserved sex determination pathways in numerous insect species, especially those with dramatic and exaggerated sexual dimorphism, because conservation in protein structure does not always translate into conservation in downstream function.
Identifying gene networks underlying the neurobiology of ethanol and alcoholism.
Wolen, Aaron R; Miles, Michael F
2012-01-01
For complex disorders such as alcoholism, identifying the genes linked to these diseases and their specific roles is difficult. Traditional genetic approaches, such as genetic association studies (including genome-wide association studies) and analyses of quantitative trait loci (QTLs) in both humans and laboratory animals already have helped identify some candidate genes. However, because of technical obstacles, such as the small impact of any individual gene, these approaches only have limited effectiveness in identifying specific genes that contribute to complex diseases. The emerging field of systems biology, which allows for analyses of entire gene networks, may help researchers better elucidate the genetic basis of alcoholism, both in humans and in animal models. Such networks can be identified using approaches such as high-throughput molecular profiling (e.g., through microarray-based gene expression analyses) or strategies referred to as genetical genomics, such as the mapping of expression QTLs (eQTLs). Characterization of gene networks can shed light on the biological pathways underlying complex traits and provide the functional context for identifying those genes that contribute to disease development.
Alves, Chrystian J.; Dariolli, Rafael; Jorge, Frederico M.; Monteiro, Matheus R.; Maximino, Jessica R.; Martins, Roberto S.; Strauss, Bryan E.; Krieger, José E.; Callegaro, Dagoberto; Chadi, Gerson
2015-01-01
Amyotrophic Lateral Sclerosis (ALS) is a fatal neurodegenerative disease that leads to widespread motor neuron death, general palsy and respiratory failure. The most prevalent sporadic ALS form is not genetically inherited. Attempts to translate therapeutic strategies have failed because the described mechanisms of disease are based on animal models carrying specific gene mutations and thus do not address sporadic ALS. In order to achieve a better approach to study the human disease, human induced pluripotent stem cell (hiPSC)-differentiated motor neurons were obtained from motor nerve fibroblasts of sporadic ALS and non-ALS subjects using the STEMCCA Cre-Excisable Constitutive Polycistronic Lentivirus system and submitted to microarray analyses using a whole human genome platform. DAVID analyses of differentially expressed genes identified molecular function and biological process-related genes through Gene Ontology. REVIGO highlighted the related functions mRNA and DNA binding, GTP binding, transcription (co)-repressor activity, lipoprotein receptor binding, synapse organization, intracellular transport, mitotic cell cycle and cell death. KEGG showed pathways associated with Parkinson's disease and oxidative phosphorylation, highlighting iron homeostasis, neurotrophic functions, endosomal trafficking and ERK signaling. The analysis of most dysregulated genes and those representative of the majority of categorized genes indicates a strong association between mitochondrial function and cellular processes possibly related to motor neuron degeneration. In conclusion, iPSC-derived motor neurons from motor nerve fibroblasts of sporadic ALS patients may recapitulate key mechanisms of neurodegeneration and may offer an opportunity for translational investigation of sporadic ALS. Large gene profiling of differentiated motor neurons from sporadic ALS patients highlights mitochondrial participation in the establishment of autonomous mechanisms associated with sporadic ALS. PMID:26300727
Osato, Naoki
2018-01-19
Transcriptional target genes show functional enrichment of genes. However, how many and how significantly transcriptional target genes include functional enrichments are still unclear. To address these issues, I predicted human transcriptional target genes using open chromatin regions, ChIP-seq data and DNA binding sequences of transcription factors in databases, and examined functional enrichment and gene expression level of putative transcriptional target genes. Gene Ontology annotations showed four times larger numbers of functional enrichments in putative transcriptional target genes than gene expression information alone, independent of transcriptional target genes. To compare the number of functional enrichments of putative transcriptional target genes between cells or search conditions, I normalized the number of functional enrichment by calculating its ratios in the total number of transcriptional target genes. With this analysis, native putative transcriptional target genes showed the largest normalized number of functional enrichments, compared with target genes including 5-60% of randomly selected genes. The normalized number of functional enrichments was changed according to the criteria of enhancer-promoter interactions such as distance from transcriptional start sites and orientation of CTCF-binding sites. Forward-reverse orientation of CTCF-binding sites showed significantly higher normalized number of functional enrichments than the other orientations. Journal papers showed that the top five frequent functional enrichments were related to the cellular functions in the three cell types. The median expression level of transcriptional target genes changed according to the criteria of enhancer-promoter assignments (i.e. interactions) and was correlated with the changes of the normalized number of functional enrichments of transcriptional target genes. Human putative transcriptional target genes showed significant functional enrichments. Functional enrichments were related to the cellular functions. The normalized number of functional enrichments of human putative transcriptional target genes changed according to the criteria of enhancer-promoter assignments and correlated with the median expression level of the target genes. These analyses and characters of human putative transcriptional target genes would be useful to examine the criteria of enhancer-promoter assignments and to predict the novel mechanisms and factors such as DNA binding proteins and DNA sequences of enhancer-promoter interactions.
Chai, J H; Locke, D P; Ohta, T; Greally, J M; Nicholls, R D
2001-11-01
Prader-Willi syndrome (PWS) results from loss of function of a 1.0- to 1.5-Mb domain of imprinted, paternally expressed genes in human Chromosome (Chr) 15q11-q13. The loss of imprinted gene expression in the homologous region in mouse Chr 7C leads to a similar neonatal PWS phenotype. Several protein-coding genes in the human PWS region are intronless, possibly arising by retrotransposition. Here we present evidence for continued acquisition of genes by the mouse PWS region during evolution. Bioinformatic analyses identified a BAC containing four genes, Mkrn3, Magel2, Ndn, Frat3, and the Atp5l-ps1 pseudogene, the latter two genes derived from recent L1-mediated retrotransposition. Analyses of eight overlapping BACs indicate that these genes are clustered within 120 kb in two inbred strains, in the order tel-Atp5l-ps1-Frat3-Mkrn3-Magel2-Ndn-cen. Imprinting analyses show that Frat3 is differentially methylated and expressed solely from the paternal allele in a transgenic mouse model of Angelman syndrome, with no expression from the maternal allele in a mouse model of PWS. Loss of Frat3 expression may, therefore, contribute to the phenotype of mouse models of PWS. The identification of five intronless genes in a small genomic interval suggests that this region is prone to retroposition in germ cells or their zygotic and embryonic cell precursors, and that it allows the subsequent functional expression of these foreign sequences. The recent evolutionary acquisition of genes that adopt the same imprint as older, flanking genes indicates that the newly acquired genes become 'innocent bystanders' of a primary epigenetic signal causing imprinting in the PWS domain.
Separate enrichment analysis of pathways for up- and downregulated genes.
Hong, Guini; Zhang, Wenjing; Li, Hongdong; Shen, Xiaopei; Guo, Zheng
2014-03-06
Two strategies are often adopted for enrichment analysis of pathways: the analysis of all differentially expressed (DE) genes together or the analysis of up- and downregulated genes separately. However, few studies have examined the rationales of these enrichment analysis strategies. Using both microarray and RNA-seq data, we show that gene pairs with functional links in pathways tended to have positively correlated expression levels, which could result in an imbalance between the up- and downregulated genes in particular pathways. We then show that the imbalance could greatly reduce the statistical power for finding disease-associated pathways through the analysis of all-DE genes. Further, using gene expression profiles from five types of tumours, we illustrate that the separate analysis of up- and downregulated genes could identify more pathways that are really pertinent to phenotypic difference. In conclusion, analysing up- and downregulated genes separately is more powerful than analysing all of the DE genes together.
Nakashima, N; Tamura, T
2013-06-01
Here, we report on the construction of doxycycline (tetracycline analogue)-inducible vectors that express antisense RNAs in Escherichia coli. Using these vectors, the expression of genes of interest can be silenced conditionally. The expression of antisense RNAs from the vectors was more tightly regulated than the previously constructed isopropyl-β-D-galactopyranoside-inducible vectors. Furthermore, expression levels of antisense RNAs were enhanced by combining the doxycycline-inducible promoter with the T7 promoter-T7 RNA polymerase system; the T7 RNA polymerase gene, under control of the doxycycline-inducible promoter, was integrated into the lacZ locus of the genome without leaving any antibiotic marker. These vectors are useful for investigating gene functions or altering cell phenotypes for biotechnological and industrial applications. A gene silencing method using antisense RNAs in Escherichia coli is described, which facilitates the investigation of bacterial gene function. In particular, the method is suitable for comprehensive analyses or phenotypic analyses of genes essential for growth. Here, we describe expansion of vector variations for expressing antisense RNAs, allowing choice of a vector appropriate for the target genes or experimental purpose. © 2013 The Society for Applied Microbiology.
Chen, Lin-xing; Hu, Min; Huang, Li-nan; Hua, Zheng-shuang; Kuang, Jia-liang; Li, Sheng-jin; Shu, Wen-sheng
2015-07-01
The microbial communities in acid mine drainage have been extensively studied to reveal their roles in acid generation and adaption to this environment. Lacking, however, are integrated community- and organism-wide comparative gene transcriptional analyses that could reveal the response and adaptation mechanisms of these extraordinary microorganisms to different environmental conditions. In this study, comparative metagenomics and metatranscriptomics were performed on microbial assemblages collected from four geochemically distinct acid mine drainage (AMD) sites. Taxonomic analysis uncovered unexpectedly high microbial biodiversity of these extremely acidophilic communities, and the abundant taxa of Acidithiobacillus, Leptospirillum and Acidiphilium exhibited high transcriptional activities. Community-wide comparative analyses clearly showed that the AMD microorganisms adapted to the different environmental conditions via regulating the expression of genes involved in multiple in situ functional activities, including low-pH adaptation, carbon, nitrogen and phosphate assimilation, energy generation, environmental stress resistance, and other functions. Organism-wide comparative analyses of the active taxa revealed environment-dependent gene transcriptional profiles, especially the distinct strategies used by Acidithiobacillus ferrivorans and Leptospirillum ferrodiazotrophum in nutrients assimilation and energy generation for survival under different conditions. Overall, these findings demonstrate that the gene transcriptional profiles of AMD microorganisms are closely related to the site physiochemical characteristics, providing clues into the microbial response and adaptation mechanisms in the oligotrophic, extremely acidic environments.
Hess, Jonathan L.; Tylee, Daniel S.; Barve, Rahul; de Jong, Simone; Ophoff, Roel A.; Kumarasinghe, Nishantha; Tooney, Paul; Schall, Ulrich; Gardiner, Erin; Beveridge, Natalie Jane; Scott, Rodney J.; Yasawardene, Surangi; Perera, Antionette; Mendis, Jayan; Carr, Vaughan; Kelly, Brian; Cairns, Murray; Tsuang, Ming T.; Glatt, Stephen J.
2016-01-01
The application of microarray technology in schizophrenia research was heralded as paradigm-shifting, as it allowed for high-throughput assessment of cell and tissue function. This technology was widely adopted, initially in studies of postmortem brain tissue, and later in studies of peripheral blood. The collective body of schizophrenia microarray literature contains apparent inconsistencies between studies, with failures to replicate top hits, in part due to small sample sizes, cohort-specific effects, differences in array types, and other confounders. In an attempt to summarize existing studies of schizophrenia cases and non-related comparison subjects, we performed two mega-analyses of a combined set of microarray data from postmortem prefrontal cortices (n = 315) and from ex-vivo blood tissues (n = 578). We adjusted regression models per gene to remove non-significant covariates, providing best-estimates of transcripts dysregulated in schizophrenia. We also examined dysregulation of functionally related gene sets and gene co-expression modules, and assessed enrichment of cell types and genetic risk factors. The identities of the most significantly dysregulated genes were largely distinct for each tissue, but the findings indicated common emergent biological functions (e.g. immunity) and regulatory factors (e.g., predicted targets of transcription factors and miRNA species across tissues). Our network-based analyses converged upon similar patterns of heightened innate immune gene expression in both brain and blood in schizophrenia. We also constructed generalizable machine-learning classifiers using the blood-based microarray data. Our study provides an informative atlas for future pathophysiologic and biomarker studies of schizophrenia. PMID:27450777
Hess, Jonathan L; Tylee, Daniel S; Barve, Rahul; de Jong, Simone; Ophoff, Roel A; Kumarasinghe, Nishantha; Tooney, Paul; Schall, Ulrich; Gardiner, Erin; Beveridge, Natalie Jane; Scott, Rodney J; Yasawardene, Surangi; Perera, Antionette; Mendis, Jayan; Carr, Vaughan; Kelly, Brian; Cairns, Murray; Tsuang, Ming T; Glatt, Stephen J
2016-10-01
The application of microarray technology in schizophrenia research was heralded as paradigm-shifting, as it allowed for high-throughput assessment of cell and tissue function. This technology was widely adopted, initially in studies of postmortem brain tissue, and later in studies of peripheral blood. The collective body of schizophrenia microarray literature contains apparent inconsistencies between studies, with failures to replicate top hits, in part due to small sample sizes, cohort-specific effects, differences in array types, and other confounders. In an attempt to summarize existing studies of schizophrenia cases and non-related comparison subjects, we performed two mega-analyses of a combined set of microarray data from postmortem prefrontal cortices (n=315) and from ex-vivo blood tissues (n=578). We adjusted regression models per gene to remove non-significant covariates, providing best-estimates of transcripts dysregulated in schizophrenia. We also examined dysregulation of functionally related gene sets and gene co-expression modules, and assessed enrichment of cell types and genetic risk factors. The identities of the most significantly dysregulated genes were largely distinct for each tissue, but the findings indicated common emergent biological functions (e.g. immunity) and regulatory factors (e.g., predicted targets of transcription factors and miRNA species across tissues). Our network-based analyses converged upon similar patterns of heightened innate immune gene expression in both brain and blood in schizophrenia. We also constructed generalizable machine-learning classifiers using the blood-based microarray data. Our study provides an informative atlas for future pathophysiologic and biomarker studies of schizophrenia. Published by Elsevier B.V.
Somboonna, Naraporn; Wan, Raymond; Ojcius, David M.; Pettengill, Matthew A.; Joseph, Sandeep J.; Chang, Alexander; Hsu, Ray; Read, Timothy D.; Dean, Deborah
2011-01-01
ABSTRACT Chlamydia trachomatis is an obligate intracellular bacterium that causes a diversity of severe and debilitating diseases worldwide. Sporadic and ongoing outbreaks of lymphogranuloma venereum (LGV) strains among men who have sex with men (MSM) support the need for research on virulence factors associated with these organisms. Previous analyses have been limited to single genes or genomes of laboratory-adapted reference strain L2/434 and outbreak strain L2b/UCH-1/proctitis. We characterized an unusual LGV strain, termed L2c, isolated from an MSM with severe hemorrhagic proctitis. L2c developed nonfusing, grape-like inclusions and a cytotoxic phenotype in culture, unlike the LGV strains described to date. Deep genome sequencing revealed that L2c was a recombinant of L2 and D strains with conserved clustered regions of genetic exchange, including a 78-kb region and a partial, yet functional, toxin gene that was lost with prolonged culture. Indels (insertions/deletions) were discovered in an ftsK gene promoter and in the tarp and hctB genes, which encode key proteins involved in replication, inclusion formation, and histone H1-like protein activity, respectively. Analyses suggest that these indels affect gene and/or protein function, supporting the in vitro and disease phenotypes. While recombination has been known to occur for C. trachomatis based on gene sequence analyses, we provide the first whole-genome evidence for recombination between a virulent, invasive LGV strain and a noninvasive common urogenital strain. Given the lack of a genetic system for producing stable C. trachomatis mutants, identifying naturally occurring recombinants can clarify gene function and provide opportunities for discovering avenues for genomic manipulation. PMID:21540364
Luo, Xiong-Jian; Mattheisen, Manuel; Li, Ming; Huang, Liang; Rietschel, Marcella; Børglum, Anders D.; Als, Thomas D.; van den Oord, Edwin J.; Aberg, Karolina A.; Mors, Ole; Mortensen, Preben Bo; Luo, Zhenwu; Degenhardt, Franziska; Cichon, Sven; Schulze, Thomas G.; Nöthen, Markus M.; Su, Bing; Zhao, Zhongming; Gan, Lin; Yao, Yong-Gang
2015-01-01
Genome-wide association studies have identified multiple risk variants and loci that show robust association with schizophrenia. Nevertheless, it remains unclear how these variants confer risk to schizophrenia. In addition, the driving force that maintains the schizophrenia risk variants in human gene pool is poorly understood. To investigate whether expression-associated genetic variants contribute to schizophrenia susceptibility, we systematically integrated brain expression quantitative trait loci and genome-wide association data of schizophrenia using Sherlock, a Bayesian statistical framework. Our analyses identified ZNF323 as a schizophrenia risk gene (P = 2.22×10–6). Subsequent analyses confirmed the association of the ZNF323 and its expression-associated single nucleotide polymorphism rs1150711 in independent samples (gene-expression: P = 1.40×10–6; single-marker meta-analysis in the combined discovery and replication sample comprising 44123 individuals: P = 6.85×10−10). We found that the ZNF323 was significantly downregulated in hippocampus and frontal cortex of schizophrenia patients (P = .0038 and P = .0233, respectively). Evidence for pleiotropic effects was detected (association of rs1150711 with lung function and gene expression of ZNF323 in lung: P = 6.62×10–5 and P = 9.00×10–5, respectively) with the risk allele (T allele) for schizophrenia acting as protective allele for lung function. Subsequent population genetics analyses suggest that the risk allele (T) of rs1150711 might have undergone recent positive selection in human population. Our findings suggest that the ZNF323 is a schizophrenia susceptibility gene whose expression may influence schizophrenia risk. Our study also illustrates a possible mechanism for maintaining schizophrenia risk variants in the human gene pool. PMID:25759474
Smith-Paine, Julia; Wade, Shari L; Treble-Barna, Amery; Zhang, Nanhua; Zang, Huaiyu; Martin, Lisa J; Yeates, Keith Owen; Taylor, H Gerry; Kurowski, Brad G
2018-05-02
This study examined whether the ankyrin repeat and kinase domain containing 1 gene (ANKK1) C/T single-nucleotide polymorphism (SNP) rs1800497 moderated the association of family environment with long-term executive function (EF) following traumatic injury in early childhood. Caregivers of children with traumatic brain injury (TBI) and children with orthopedic injury (OI) completed the Behavior Rating Inventory of Executive Function (BRIEF) at post injury visits. DNA was collected to identify the rs1800497 genotype in the ANKK1 gene. General linear models examined gene-environment interactions as moderators of the effects of TBI on EF at two times post injury (12 months and 7 years). At 12 months post injury, analyses revealed a significant 3-way interaction of genotype with level of permissive parenting and injury type. Post-hoc analyses showed genetic effects were more pronounced for children with TBI from more positive family environments, such that children with TBI who were carriers of the risk allele (T-allele) had significantly poorer EF compared to non-carriers only when they were from more advantaged environments. At 7 years post injury, analyses revealed a significant 2-way interaction of genotype with level of authoritarian parenting. Post-hoc analyses found that carriers of the risk allele had significantly poorer EF compared to non-carriers only when they were from more advantaged environments. These results suggest a gene-environment interaction involving the ANKK1 gene as a predictor of EF in a pediatric injury population. The findings highlight the importance of considering environmental influences in future genetic studies on recovery following TBI and other traumatic injuries in childhood.
Luo, Dandan; Ge, Weihong; Hu, Xiao; Li, Chen; Lee, Chia-Ming; Zhou, Liqiang; Wu, Zhourui; Yu, Juehua; Lin, Sheng; Yu, Jing; Xu, Wei; Chen, Lei; Zhang, Chong; Jiang, Kun; Zhu, Xingfei; Li, Haotian; Gao, Xinpei; Geng, Yanan; Jing, Bo; Wang, Zhen; Zheng, Changhong; Zhu, Rongrong; Yan, Qiao; Lin, Quan; Ye, Keqiang; Sun, Yi E; Cheng, Liming
2018-06-28
The mammalian central nervous system (CNS) is considered an immune privileged system as it is separated from the periphery by the blood brain barrier (BBB). Yet, immune functions have been postulated to heavily influence the functional state of the CNS, especially after injury or during neurodegeneration. There is controversy regarding whether adaptive immune responses are beneficial or detrimental to CNS injury repair. In this study, we utilized immunocompromised SCID mice and subjected them to spinal cord injury (SCI). We analyzed motor function, electrophysiology, histochemistry, and performed unbiased RNA-sequencing. SCID mice displayed improved CNS functional recovery compared to WT mice after SCI. Weighted gene-coexpression network analysis (WGCNA) of spinal cord transcriptomes revealed that SCID mice had reduced expression of immune function-related genes and heightened expression of neural transmission-related genes after SCI, which was confirmed by immunohistochemical analysis and was consistent with better functional recovery. Transcriptomic analyses also indicated heightened expression of neurotransmission-related genes before injury in SCID mice, suggesting that a steady state of immune-deficiency potentially led to CNS hyper-connectivity. Consequently, SCID mice without injury demonstrated worse performance in Morris water maze test. Taken together, not only reduced inflammation after injury but also dampened steady-state immune function without injury heightened the neurotransmission program, resulting in better or worse behavioral outcomes respectively. This study revealed the intricate relationship between immune and nervous systems, raising the possibility for therapeutic manipulation of neural function via immune modulation.
A molecular characterization of the choroid plexus and stress-induced gene regulation
Sathyanesan, M; Girgenti, M J; Banasr, M; Stone, K; Bruce, C; Guilchicek, E; Wilczak-Havill, K; Nairn, A; Williams, K; Sass, S; Duman, J G; Newton, S S
2012-01-01
The role of the choroid plexus (CP) in brain homeostasis is being increasingly recognized and recent studies suggest that the CP has a more important role in physiological and pathological brain functions than currently appreciated. To obtain additional insight on the CP function, we performed a proteomics and transcriptomics characterization employing a combination of high resolution tandem mass spectrometry and gene expression analyses in normal rodent brain. Using multiple protein fractionation approaches, we identified 1400 CP proteins in adult CP. Microarray-based comparison of CP gene expression with the kidney, cortex and hippocampus showed significant overlap between the CP and the kidney. CP gene profiles were validated by in situ hybridization analysis of several target genes including klotho, CLIC 6, OATP 14 and Ezrin. Immunohistochemical analyses were performed for CP and enpendyma detection of several target proteins including cytokeratin, Rab7, klotho, tissue inhibitor of metalloprotease 1 (TIMP1), MMP9 and glial fibrillary acidic protein (GFAP). The molecular functions associated with various proteins of the CP proteome indicate that it is a blood–cerebrospinal fluid (CSF) barrier that exhibits high levels of metabolic activity. We also analyzed the gene expression changes induced by stress, an exacerbating factor for many illnesses, particularly mood disorders. Chronic stress altered the expression of several genes, downregulating 5HT2C, glucocorticoid receptor and the cilia genes IFT88 and smoothened while upregulating 5HT2A, BDNF, TNFα and IL-1b. The data presented here attach additional significance to the emerging importance of CP function in brain health and CNS disease states. PMID:22781172
Functional Annotations of Paralogs: A Blessing and a Curse
Zallot, Rémi; Harrison, Katherine J.; Kolaczkowski, Bryan; de Crécy-Lagard, Valérie
2016-01-01
Gene duplication followed by mutation is a classic mechanism of neofunctionalization, producing gene families with functional diversity. In some cases, a single point mutation is sufficient to change the substrate specificity and/or the chemistry performed by an enzyme, making it difficult to accurately separate enzymes with identical functions from homologs with different functions. Because sequence similarity is often used as a basis for assigning functional annotations to genes, non-isofunctional gene families pose a great challenge for genome annotation pipelines. Here we describe how integrating evolutionary and functional information such as genome context, phylogeny, metabolic reconstruction and signature motifs may be required to correctly annotate multifunctional families. These integrative analyses can also lead to the discovery of novel gene functions, as hints from specific subgroups can guide the functional characterization of other members of the family. We demonstrate how careful manual curation processes using comparative genomics can disambiguate subgroups within large multifunctional families and discover their functions. We present the COG0720 protein family as a case study. We also discuss strategies to automate this process to improve the accuracy of genome functional annotation pipelines. PMID:27618105
Hughes, S; Woollard, A
2017-01-01
Runx genes have been identified in all metazoans and considerable conservation of function observed across a wide range of phyla. Thus, insight gained from studying simple model organisms is invaluable in understanding RUNX biology in higher animals. Consequently, this chapter will focus on the Runx genes in the diploblasts, which includes sea anemones and sponges, as well as the lower triploblasts, including the sea urchin, nematode, planaria and insect. Due to the high degree of functional redundancy amongst vertebrate Runx genes, simpler model organisms with a solo Runx gene, like C. elegans, are invaluable systems in which to probe the molecular basis of RUNX function within a whole organism. Additionally, comparative analyses of Runx sequence and function allows for the development of novel evolutionary insights. Strikingly, recent data has emerged that reveals the presence of a Runx gene in a protist, demonstrating even more widespread occurrence of Runx genes than was previously thought. This review will summarize recent progress in using invertebrate organisms to investigate RUNX function during development and regeneration, highlighting emerging unifying themes.
Shanley, Thomas P; Cvijanovich, Natalie; Lin, Richard; Allen, Geoffrey L; Thomas, Neal J; Doctor, Allan; Kalyanaraman, Meena; Tofil, Nancy M; Penfil, Scott; Monaco, Marie; Odoms, Kelli; Barnes, Michael; Sakthivel, Bhuvaneswari; Aronow, Bruce J; Wong, Hector R
2007-01-01
We have conducted longitudinal studies focused on the expression profiles of signaling pathways and gene networks in children with septic shock. Genome-level expression profiles were generated from whole blood-derived RNA of children with septic shock (n = 30) corresponding to day one and day three of septic shock, respectively. Based on sequential statistical and expression filters, day one and day three of septic shock were characterized by differential regulation of 2,142 and 2,504 gene probes, respectively, relative to controls (n = 15). Venn analysis demonstrated 239 unique genes in the day one dataset, 598 unique genes in the day three dataset, and 1,906 genes common to both datasets. Functional analyses demonstrated time-dependent, differential regulation of genes involved in multiple signaling pathways and gene networks primarily related to immunity and inflammation. Notably, multiple and distinct gene networks involving T cell- and MHC antigen-related biology were persistently downregulated on both day one and day three. Further analyses demonstrated large scale, persistent downregulation of genes corresponding to functional annotations related to zinc homeostasis. These data represent the largest reported cohort of patients with septic shock subjected to longitudinal genome-level expression profiling. The data further advance our genome-level understanding of pediatric septic shock and support novel hypotheses. PMID:17932561
Investigating Gene Function in Cereal Rust Fungi by Plant-Mediated Virus-Induced Gene Silencing.
Panwar, Vinay; Bakkeren, Guus
2017-01-01
Cereal rust fungi are destructive pathogens, threatening grain production worldwide. Targeted breeding for resistance utilizing host resistance genes has been effective. However, breakdown of resistance occurs frequently and continued efforts are needed to understand how these fungi overcome resistance and to expand the range of available resistance genes. Whole genome sequencing, transcriptomic and proteomic studies followed by genome-wide computational and comparative analyses have identified large repertoire of genes in rust fungi among which are candidates predicted to code for pathogenicity and virulence factors. Some of these genes represent defence triggering avirulence effectors. However, functions of most genes still needs to be assessed to understand the biology of these obligate biotrophic pathogens. Since genetic manipulations such as gene deletion and genetic transformation are not yet feasible in rust fungi, performing functional gene studies is challenging. Recently, Host-induced gene silencing (HIGS) has emerged as a useful tool to characterize gene function in rust fungi while infecting and growing in host plants. We utilized Barley stripe mosaic virus-mediated virus induced gene silencing (BSMV-VIGS) to induce HIGS of candidate rust fungal genes in the wheat host to determine their role in plant-fungal interactions. Here, we describe the methods for using BSMV-VIGS in wheat for functional genomics study in cereal rust fungi.
MUFFINN: cancer gene discovery via network analysis of somatic mutation data.
Cho, Ara; Shim, Jung Eun; Kim, Eiru; Supek, Fran; Lehner, Ben; Lee, Insuk
2016-06-23
A major challenge for distinguishing cancer-causing driver mutations from inconsequential passenger mutations is the long-tail of infrequently mutated genes in cancer genomes. Here, we present and evaluate a method for prioritizing cancer genes accounting not only for mutations in individual genes but also in their neighbors in functional networks, MUFFINN (MUtations For Functional Impact on Network Neighbors). This pathway-centric method shows high sensitivity compared with gene-centric analyses of mutation data. Notably, only a marginal decrease in performance is observed when using 10 % of TCGA patient samples, suggesting the method may potentiate cancer genome projects with small patient populations.
Computational Selection of Transcriptomics Experiments Improves Guilt-by-Association Analyses
Bhat, Prajwal; Yang, Haixuan; Bögre, László; Devoto, Alessandra; Paccanaro, Alberto
2012-01-01
The Guilt-by-Association (GBA) principle, according to which genes with similar expression profiles are functionally associated, is widely applied for functional analyses using large heterogeneous collections of transcriptomics data. However, the use of such large collections could hamper GBA functional analysis for genes whose expression is condition specific. In these cases a smaller set of condition related experiments should instead be used, but identifying such functionally relevant experiments from large collections based on literature knowledge alone is an impractical task. We begin this paper by analyzing, both from a mathematical and a biological point of view, why only condition specific experiments should be used in GBA functional analysis. We are able to show that this phenomenon is independent of the functional categorization scheme and of the organisms being analyzed. We then present a semi-supervised algorithm that can select functionally relevant experiments from large collections of transcriptomics experiments. Our algorithm is able to select experiments relevant to a given GO term, MIPS FunCat term or even KEGG pathways. We extensively test our algorithm on large dataset collections for yeast and Arabidopsis. We demonstrate that: using the selected experiments there is a statistically significant improvement in correlation between genes in the functional category of interest; the selected experiments improve GBA-based gene function prediction; the effectiveness of the selected experiments increases with annotation specificity; our algorithm can be successfully applied to GBA-based pathway reconstruction. Importantly, the set of experiments selected by the algorithm reflects the existing literature knowledge about the experiments. [A MATLAB implementation of the algorithm and all the data used in this paper can be downloaded from the paper website: http://www.paccanarolab.org/papers/CorrGene/]. PMID:22879875
Differential Retention of Gene Functions in a Secondary Metabolite Cluster.
Reynolds, Hannah T; Slot, Jason C; Divon, Hege H; Lysøe, Erik; Proctor, Robert H; Brown, Daren W
2017-08-01
In fungi, distribution of secondary metabolite (SM) gene clusters is often associated with host- or environment-specific benefits provided by SMs. In the plant pathogen Alternaria brassicicola (Dothideomycetes), the DEP cluster confers an ability to synthesize the SM depudecin, a histone deacetylase inhibitor that contributes weakly to virulence. The DEP cluster includes genes encoding enzymes, a transporter, and a transcription regulator. We investigated the distribution and evolution of the DEP cluster in 585 fungal genomes and found a wide but sporadic distribution among Dothideomycetes, Sordariomycetes, and Eurotiomycetes. We confirmed DEP gene expression and depudecin production in one fungus, Fusarium langsethiae. Phylogenetic analyses suggested 6-10 horizontal gene transfers (HGTs) of the cluster, including a transfer that led to the presence of closely related cluster homologs in Alternaria and Fusarium. The analyses also indicated that HGTs were frequently followed by loss/pseudogenization of one or more DEP genes. Independent cluster inactivation was inferred in at least four fungal classes. Analyses of transitions among functional, pseudogenized, and absent states of DEP genes among Fusarium species suggest enzyme-encoding genes are lost at higher rates than the transporter (DEP3) and regulatory (DEP6) genes. The phenotype of an experimentally-induced DEP3 mutant of Fusarium did not support the hypothesis that selective retention of DEP3 and DEP6 protects fungi from exogenous depudecin. Together, the results suggest that HGT and gene loss have contributed significantly to DEP cluster distribution, and that some DEP genes provide a greater fitness benefit possibly due to a differential tendency to form network connections. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution 2017. This work is written by US Government employees and is in the public domain in the US.
Jue, Dengwei; Sang, Xuelian; Lu, Shengqiao; Dong, Chen; Zhao, Qiufang; Chen, Hongliang; Jia, Liqiang
2015-01-01
Ubiquitination is a post-translation modification where ubiquitin is attached to a substrate. Ubiquitin-conjugating enzymes (E2s) play a major role in the ubiquitin transfer pathway, as well as a variety of functions in plant biological processes. To date, no genome-wide characterization of this gene family has been conducted in maize (Zea mays). In the present study, a total of 75 putative ZmUBC genes have been identified and located in the maize genome. Phylogenetic analysis revealed that ZmUBC proteins could be divided into 15 subfamilies, which include 13 ubiquitin-conjugating enzymes (ZmE2s) and two independent ubiquitin-conjugating enzyme variant (UEV) groups. The predicted ZmUBC genes were distributed across 10 chromosomes at different densities. In addition, analysis of exon-intron junctions and sequence motifs in each candidate gene has revealed high levels of conservation within and between phylogenetic groups. Tissue expression analysis indicated that most ZmUBC genes were expressed in at least one of the tissues, indicating that these are involved in various physiological and developmental processes in maize. Moreover, expression profile analyses of ZmUBC genes under different stress treatments (4°C, 20% PEG6000, and 200 mM NaCl) and various expression patterns indicated that these may play crucial roles in the response of plants to stress. Genome-wide identification, chromosome organization, gene structure, evolutionary and expression analyses of ZmUBC genes have facilitated in the characterization of this gene family, as well as determined its potential involvement in growth, development, and stress responses. This study provides valuable information for better understanding the classification and putative functions of the UBC-encoding genes of maize.
Jue, Dengwei; Sang, Xuelian; Lu, Shengqiao; Dong, Chen; Zhao, Qiufang; Chen, Hongliang; Jia, Liqiang
2015-01-01
Background Ubiquitination is a post-translation modification where ubiquitin is attached to a substrate. Ubiquitin-conjugating enzymes (E2s) play a major role in the ubiquitin transfer pathway, as well as a variety of functions in plant biological processes. To date, no genome-wide characterization of this gene family has been conducted in maize (Zea mays). Methodology/Principal Findings In the present study, a total of 75 putative ZmUBC genes have been identified and located in the maize genome. Phylogenetic analysis revealed that ZmUBC proteins could be divided into 15 subfamilies, which include 13 ubiquitin-conjugating enzymes (ZmE2s) and two independent ubiquitin-conjugating enzyme variant (UEV) groups. The predicted ZmUBC genes were distributed across 10 chromosomes at different densities. In addition, analysis of exon-intron junctions and sequence motifs in each candidate gene has revealed high levels of conservation within and between phylogenetic groups. Tissue expression analysis indicated that most ZmUBC genes were expressed in at least one of the tissues, indicating that these are involved in various physiological and developmental processes in maize. Moreover, expression profile analyses of ZmUBC genes under different stress treatments (4°C, 20% PEG6000, and 200 mM NaCl) and various expression patterns indicated that these may play crucial roles in the response of plants to stress. Conclusions Genome-wide identification, chromosome organization, gene structure, evolutionary and expression analyses of ZmUBC genes have facilitated in the characterization of this gene family, as well as determined its potential involvement in growth, development, and stress responses. This study provides valuable information for better understanding the classification and putative functions of the UBC-encoding genes of maize. PMID:26606743
Relationships among msx gene structure and function in zebrafish and other vertebrates.
Ekker, M; Akimenko, M A; Allende, M L; Smith, R; Drouin, G; Langille, R M; Weinberg, E S; Westerfield, M
1997-10-01
The zebrafish genome contains at least five msx homeobox genes, msxA, msxB, msxC, msxD, and the newly isolated msxE. Although these genes share structural features common to all Msx genes, phylogenetic analyses of protein sequences indicate that the msx genes from zebrafish are not orthologous to the Msx1 and Msx2 genes of mammals, birds, and amphibians. The zebrafish msxB and msxC are more closely related to each other and to the mouse Msx3. Similarly, although the combinatorial expression of the zebrafish msx genes in the embryonic dorsal neuroectoderm, visceral arches, fins, and sensory organs suggests functional similarities with the Msx genes of other vertebrates, differences in the expression patterns preclude precise assignment of orthological relationships. Distinct duplication events may have given rise to the msx genes of modern fish and other vertebrate lineages whereas many aspects of msx gene functions during embryonic development have been preserved.
Wang, Ping; Lin, Mingyan; Pedrosa, Erika; Hrabovsky, Anastasia; Zhang, Zheng; Guo, Wenjun; Lachman, Herbert M; Zheng, Deyou
2015-01-01
Disruptive mutation in the CHD8 gene is one of the top genetic risk factors in autism spectrum disorders (ASDs). Previous analyses of genome-wide CHD8 occupancy and reduced expression of CHD8 by shRNA knockdown in committed neural cells showed that CHD8 regulates multiple cell processes critical for neural functions, and its targets are enriched with ASD-associated genes. To further understand the molecular links between CHD8 functions and ASD, we have applied the CRISPR/Cas9 technology to knockout one copy of CHD8 in induced pluripotent stem cells (iPSCs) to better mimic the loss-of-function status that would exist in the developing human embryo prior to neuronal differentiation. We then carried out transcriptomic and bioinformatic analyses of neural progenitors and neurons derived from the CHD8 mutant iPSCs. Transcriptome profiling revealed that CHD8 hemizygosity (CHD8 (+/-)) affected the expression of several thousands of genes in neural progenitors and early differentiating neurons. The differentially expressed genes were enriched for functions of neural development, β-catenin/Wnt signaling, extracellular matrix, and skeletal system development. They also exhibited significant overlap with genes previously associated with autism and schizophrenia, as well as the downstream transcriptional targets of multiple genes implicated in autism. Providing important insight into how CHD8 mutations might give rise to macrocephaly, we found that seven of the twelve genes associated with human brain volume or head size by genome-wide association studies (e.g., HGMA2) were dysregulated in CHD8 (+/-) neural progenitors or neurons. We have established a renewable source of CHD8 (+/-) iPSC lines that would be valuable for investigating the molecular and cellular functions of CHD8. Transcriptomic profiling showed that CHD8 regulates multiple genes implicated in ASD pathogenesis and genes associated with brain volume.
Lamontagne, Maxime; Timens, Wim; Hao, Ke; Bossé, Yohan; Laviolette, Michel; Steiling, Katrina; Campbell, Joshua D; Couture, Christian; Conti, Massimo; Sherwood, Karen; Hogg, James C; Brandsma, Corry-Anke; van den Berge, Maarten; Sandford, Andrew; Lam, Stephen; Lenburg, Marc E; Spira, Avrum; Paré, Peter D; Nickle, David; Sin, Don D; Postma, Dirkje S
2014-11-01
COPD is a complex chronic disease with poorly understood pathogenesis. Integrative genomic approaches have the potential to elucidate the biological networks underlying COPD and lung function. We recently combined genome-wide genotyping and gene expression in 1111 human lung specimens to map expression quantitative trait loci (eQTL). To determine causal associations between COPD and lung function-associated single nucleotide polymorphisms (SNPs) and lung tissue gene expression changes in our lung eQTL dataset. We evaluated causality between SNPs and gene expression for three COPD phenotypes: FEV(1)% predicted, FEV(1)/FVC and COPD as a categorical variable. Different models were assessed in the three cohorts independently and in a meta-analysis. SNPs associated with a COPD phenotype and gene expression were subjected to causal pathway modelling and manual curation. In silico analyses evaluated functional enrichment of biological pathways among newly identified causal genes. Biologically relevant causal genes were validated in two separate gene expression datasets of lung tissues and bronchial airway brushings. High reliability causal relations were found in SNP-mRNA-phenotype triplets for FEV(1)% predicted (n=169) and FEV(1)/FVC (n=80). Several genes of potential biological relevance for COPD were revealed. eQTL-SNPs upregulating cystatin C (CST3) and CD22 were associated with worse lung function. Signalling pathways enriched with causal genes included xenobiotic metabolism, apoptosis, protease-antiprotease and oxidant-antioxidant balance. By using integrative genomics and analysing the relationships of COPD phenotypes with SNPs and gene expression in lung tissue, we identified CST3 and CD22 as potential causal genes for airflow obstruction. This study also augmented the understanding of previously described COPD pathways. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Technological advances and genomics in metazoan parasites.
Knox, D P
2004-02-01
Molecular biology has provided the means to identify parasite proteins, to define their function, patterns of expression and the means to produce them in quantity for subsequent functional analyses. Whole genome and expressed sequence tag programmes, and the parallel development of powerful bioinformatics tools, allow the execution of genome-wide between stage or species comparisons and meaningful gene-expression profiling. The latter can be undertaken with several new technologies such as DNA microarray and serial analysis of gene expression. Proteome analysis has come to the fore in recent years providing a crucial link between the gene and its protein product. RNA interference and ballistic gene transfer are exciting developments which can provide the means to precisely define the function of individual genes and, of importance in devising novel parasite control strategies, the effect that gene knockdown will have on parasite survival.
Analysis of the Prefoldin Gene Family in 14 Plant Species
Cao, Jun
2016-01-01
Prefoldin is a hexameric molecular chaperone complex present in all eukaryotes and archaea. The evolution of this gene family in plants is unknown. Here, I identified 140 prefoldin genes in 14 plant species. These prefoldin proteins were divided into nine groups through phylogenetic analysis. Highly conserved gene organization and motif distribution exist in each prefoldin group, implying their functional conservation. I also observed the segmental duplication of maize prefoldin gene family. Moreover, a few functional divergence sites were identified within each group pairs. Functional network analyses identified 78 co-expressed genes, and most of them were involved in carrying, binding and kinase activity. Divergent expression profiles of the maize prefoldin genes were further investigated in different tissues and development periods and under auxin and some abiotic stresses. I also found a few cis-elements responding to abiotic stress and phytohormone in the upstream sequences of the maize prefoldin genes. The results provided a foundation for exploring the characterization of the prefoldin genes in plants and will offer insights for additional functional studies. PMID:27014333
Kirsten, Holger; Al-Hasani, Hoor; Holdt, Lesca; Gross, Arnd; Beutner, Frank; Krohn, Knut; Horn, Katrin; Ahnert, Peter; Burkhardt, Ralph; Reiche, Kristin; Hackermüller, Jörg; Löffler, Markus; Teupser, Daniel; Thiery, Joachim; Scholz, Markus
2015-01-01
Genetics of gene expression (eQTLs or expression QTLs) has proved an indispensable tool for understanding biological pathways and pathomechanisms of trait-associated SNPs. However, power of most genome-wide eQTL studies is still limited. We performed a large eQTL study in peripheral blood mononuclear cells of 2112 individuals increasing the power to detect trans-effects genome-wide. Going beyond univariate SNP-transcript associations, we analyse relations of eQTLs to biological pathways, polygenetic effects of expression regulation, trans-clusters and enrichment of co-localized functional elements. We found eQTLs for about 85% of analysed genes, and 18% of genes were trans-regulated. Local eSNPs were enriched up to a distance of 5 Mb to the transcript challenging typically implemented ranges of cis-regulations. Pathway enrichment within regulated genes of GWAS-related eSNPs supported functional relevance of identified eQTLs. We demonstrate that nearest genes of GWAS-SNPs might frequently be misleading functional candidates. We identified novel trans-clusters of potential functional relevance for GWAS-SNPs of several phenotypes including obesity-related traits, HDL-cholesterol levels and haematological phenotypes. We used chromatin immunoprecipitation data for demonstrating biological effects. Yet, we show for strongly heritable transcripts that still little trans-chromosomal heritability is explained by all identified trans-eSNPs; however, our data suggest that most cis-heritability of these transcripts seems explained. Dissection of co-localized functional elements indicated a prominent role of SNPs in loci of pseudogenes and non-coding RNAs for the regulation of coding genes. In summary, our study substantially increases the catalogue of human eQTLs and improves our understanding of the complex genetic regulation of gene expression, pathways and disease-related processes. PMID:26019233
Gruel, Jérémy; LeBorgne, Michel; LeMeur, Nolwenn; Théret, Nathalie
2011-09-12
Regulation of gene expression plays a pivotal role in cellular functions. However, understanding the dynamics of transcription remains a challenging task. A host of computational approaches have been developed to identify regulatory motifs, mainly based on the recognition of DNA sequences for transcription factor binding sites. Recent integration of additional data from genomic analyses or phylogenetic footprinting has significantly improved these methods. Here, we propose a different approach based on the compilation of Simple Shared Motifs (SSM), groups of sequences defined by their length and similarity and present in conserved sequences of gene promoters. We developed an original algorithm to search and count SSM in pairs of genes. An exceptional number of SSM is considered as a common regulatory pattern. The SSM approach is applied to a sample set of genes and validated using functional gene-set enrichment analyses. We demonstrate that the SSM approach selects genes that are over-represented in specific biological categories (Ontology and Pathways) and are enriched in co-expressed genes. Finally we show that genes co-expressed in the same tissue or involved in the same biological pathway have increased SSM values. Using unbiased clustering of genes, Simple Shared Motifs analysis constitutes an original contribution to provide a clearer definition of expression networks.
2011-01-01
Background Regulation of gene expression plays a pivotal role in cellular functions. However, understanding the dynamics of transcription remains a challenging task. A host of computational approaches have been developed to identify regulatory motifs, mainly based on the recognition of DNA sequences for transcription factor binding sites. Recent integration of additional data from genomic analyses or phylogenetic footprinting has significantly improved these methods. Results Here, we propose a different approach based on the compilation of Simple Shared Motifs (SSM), groups of sequences defined by their length and similarity and present in conserved sequences of gene promoters. We developed an original algorithm to search and count SSM in pairs of genes. An exceptional number of SSM is considered as a common regulatory pattern. The SSM approach is applied to a sample set of genes and validated using functional gene-set enrichment analyses. We demonstrate that the SSM approach selects genes that are over-represented in specific biological categories (Ontology and Pathways) and are enriched in co-expressed genes. Finally we show that genes co-expressed in the same tissue or involved in the same biological pathway have increased SSM values. Conclusions Using unbiased clustering of genes, Simple Shared Motifs analysis constitutes an original contribution to provide a clearer definition of expression networks. PMID:21910886
Rapid diversification of five Oryza AA genomes associated with rice adaptation.
Zhang, Qun-Jie; Zhu, Ting; Xia, En-Hua; Shi, Chao; Liu, Yun-Long; Zhang, Yun; Liu, Yuan; Jiang, Wen-Kai; Zhao, You-Jie; Mao, Shu-Yan; Zhang, Li-Ping; Huang, Hui; Jiao, Jun-Ying; Xu, Ping-Zhen; Yao, Qiu-Yang; Zeng, Fan-Chun; Yang, Li-Li; Gao, Ju; Tao, Da-Yun; Wang, Yue-Ju; Bennetzen, Jeffrey L; Gao, Li-Zhi
2014-11-18
Comparative genomic analyses among closely related species can greatly enhance our understanding of plant gene and genome evolution. We report de novo-assembled AA-genome sequences for Oryza nivara, Oryza glaberrima, Oryza barthii, Oryza glumaepatula, and Oryza meridionalis. Our analyses reveal massive levels of genomic structural variation, including segmental duplication and rapid gene family turnover, with particularly high instability in defense-related genes. We show, on a genomic scale, how lineage-specific expansion or contraction of gene families has led to their morphological and reproductive diversification, thus enlightening the evolutionary process of speciation and adaptation. Despite strong purifying selective pressures on most Oryza genes, we documented a large number of positively selected genes, especially those genes involved in flower development, reproduction, and resistance-related processes. These diversifying genes are expected to have played key roles in adaptations to their ecological niches in Asia, South America, Africa and Australia. Extensive variation in noncoding RNA gene numbers, function enrichment, and rates of sequence divergence might also help account for the different genetic adaptations of these rice species. Collectively, these resources provide new opportunities for evolutionary genomics, numerous insights into recent speciation, a valuable database of functional variation for crop improvement, and tools for efficient conservation of wild rice germplasm.
Rapid diversification of five Oryza AA genomes associated with rice adaptation
Zhang, Qun-Jie; Zhu, Ting; Xia, En-Hua; Shi, Chao; Liu, Yun-Long; Zhang, Yun; Liu, Yuan; Jiang, Wen-Kai; Zhao, You-Jie; Mao, Shu-Yan; Zhang, Li-Ping; Huang, Hui; Jiao, Jun-Ying; Xu, Ping-Zhen; Yao, Qiu-Yang; Zeng, Fan-Chun; Yang, Li-Li; Gao, Ju; Tao, Da-Yun; Wang, Yue-Ju; Bennetzen, Jeffrey L.; Gao, Li-Zhi
2014-01-01
Comparative genomic analyses among closely related species can greatly enhance our understanding of plant gene and genome evolution. We report de novo-assembled AA-genome sequences for Oryza nivara, Oryza glaberrima, Oryza barthii, Oryza glumaepatula, and Oryza meridionalis. Our analyses reveal massive levels of genomic structural variation, including segmental duplication and rapid gene family turnover, with particularly high instability in defense-related genes. We show, on a genomic scale, how lineage-specific expansion or contraction of gene families has led to their morphological and reproductive diversification, thus enlightening the evolutionary process of speciation and adaptation. Despite strong purifying selective pressures on most Oryza genes, we documented a large number of positively selected genes, especially those genes involved in flower development, reproduction, and resistance-related processes. These diversifying genes are expected to have played key roles in adaptations to their ecological niches in Asia, South America, Africa and Australia. Extensive variation in noncoding RNA gene numbers, function enrichment, and rates of sequence divergence might also help account for the different genetic adaptations of these rice species. Collectively, these resources provide new opportunities for evolutionary genomics, numerous insights into recent speciation, a valuable database of functional variation for crop improvement, and tools for efficient conservation of wild rice germplasm. PMID:25368197
Wen, Feng; Zhu, Hong; Li, Peng; Jiang, Min; Mao, Wenqing; Ong, Chermaine; Chu, Zhaoqing
2014-01-01
Members of plant WRKY gene family are ancient transcription factors that function in plant growth and development and respond to biotic and abiotic stresses. In our present study, we have investigated WRKY family genes in Brachypodium distachyon, a new model plant of family Poaceae. We identified a total of 86 WRKY genes from B. distachyon and explored their chromosomal distribution and evolution, domain alignment, promoter cis-elements, and expression profiles. Combining the analysis of phylogenetic tree of BdWRKY genes and the result of expression profiling, results showed that most of clustered gene pairs had higher similarities in the WRKY domain, suggesting that they might be functionally redundant. Neighbour-joining analysis of 301 WRKY domains from Oryza sativa, Arabidopsis thaliana, and B. distachyon suggested that BdWRKY domains are evolutionarily more closely related to O. sativa WRKY domains than those of A. thaliana. Moreover, tissue-specific expression profile of BdWRKY genes and their responses to phytohormones and several biotic or abiotic stresses were analysed by quantitative real-time PCR. The results showed that the expression of BdWRKY genes was rapidly regulated by stresses and phytohormones, and there was a strong correlation between promoter cis-elements and the phytohormones-induced BdWRKY gene expression. PMID:24453041
Alberti, Adriana; Lodi, Tiziana; Ferrero, Iliana; Donnini, Claudia
2003-10-15
Imp2p (Yil154c) is a transcriptional activator involved in glucose derepression of the maltose, galactose and raffinose utilization pathways and in resistance to thermal, oxidative or osmotic stress. We analysed the role of Imp2 in the regulation of GAL genes. Imp2 was shown to have a positive effect on glucose derepression of Leloir pathway genes and their activator gene GAL4. The effect of Imp2 on galactose metabolism was shown to be partially dependent on Mig1p. The Mig1-independent role depends on Nrg1p. However, disruption of both MIG1 and NRG1 only partially relieves the glucose repression of GAL genes in the Deltaimp2 mutant, indicating that Imp2 must also have other function(s). Moreover, the interaction between IMP2 and GAL6/BLH1, a recently isolated gene involved in the regulation of GAL genes that shares with Imp2 the ability to protect cells from the glycopeptide bleomycin, was also analysed. The results suggest a major role of Imp2 in a GAL6-independent pathway. Copyright 2003 John Wiley & Sons, Ltd.
Characterization of gonadal transcriptomes from the turbot (Scophthalmus maximus).
Hu, Yulong; Huang, Meng; Wang, Weiji; Guan, Jiantao; Kong, Jie
2016-01-01
The mechanisms underlying sexual reproduction and sex ratio determination remains unclear in turbot, a flatfish of great commercial value. And there is limited information in the turbot database regarding genes related to the reproductive system. Here, we conducted high-throughput transcriptome profiling of turbot gonad tissues to better understand their reproductive functions and to supply essential gene sequence information for marker-assisted selection programs in the turbot industry. In this study, two gonad libraries representing sex differences in Scophthalmus maximus yielded 453 818 high-quality reads that were assembled into 24 611 contigs and 33 713 singletons by using 454 pyrosequencing, 13 936 contigs and singletons (CS) of which were annotated using BLASTx. GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway analyses revealed that various biological functions and processes were associated with many of the annotated CS. Expression analyses showed that 510 genes were differentially expressed in males versus females; 80% of these genes were annotated. In addition, 6484 and 6036 single nucleotide polymorphisms (SNPs) were identified in male and female libraries, respectively. This transcriptome resource will serve as the foundation for cDNA or SNP microarray construction, gene expression characterization, and sex-specific linkage mapping in turbot.
Bruse, Shannon; Moreau, Michael; Bromberg, Yana; Jang, Jun-Ho; Wang, Nan; Ha, Hongseok; Picchi, Maria; Lin, Yong; Langley, Raymond J; Qualls, Clifford; Klensney-Tait, Julia; Zabner, Joseph; Leng, Shuguang; Mao, Jenny; Belinsky, Steven A; Xing, Jinchuan; Nyunoya, Toru
2016-01-07
Chronic obstructive pulmonary disease (COPD) is characterized by an irreversible airflow limitation in response to inhalation of noxious stimuli, such as cigarette smoke. However, only 15-20 % smokers manifest COPD, suggesting a role for genetic predisposition. Although genome-wide association studies have identified common genetic variants that are associated with susceptibility to COPD, effect sizes of the identified variants are modest, as is the total heritability accounted for by these variants. In this study, an extreme phenotype exome sequencing study was combined with in vitro modeling to identify COPD candidate genes. We performed whole exome sequencing of 62 highly susceptible smokers and 30 exceptionally resistant smokers to identify rare variants that may contribute to disease risk or resistance to COPD. This was a cross-sectional case-control study without therapeutic intervention or longitudinal follow-up information. We identified candidate genes based on rare variant analyses and evaluated exonic variants to pinpoint individual genes whose function was computationally established to be significantly different between susceptible and resistant smokers. Top scoring candidate genes from these analyses were further filtered by requiring that each gene be expressed in human bronchial epithelial cells (HBECs). A total of 81 candidate genes were thus selected for in vitro functional testing in cigarette smoke extract (CSE)-exposed HBECs. Using small interfering RNA (siRNA)-mediated gene silencing experiments, we showed that silencing of several candidate genes augmented CSE-induced cytotoxicity in vitro. Our integrative analysis through both genetic and functional approaches identified two candidate genes (TACC2 and MYO1E) that augment cigarette smoke (CS)-induced cytotoxicity and, potentially, COPD susceptibility.
Wei, Hengling; Li, Wei; Sun, Xiwei; Zhu, Shuijin; Zhu, Jun
2013-01-01
Plant disease resistance genes are a key component of defending plants from a range of pathogens. The majority of these resistance genes belong to the super-family that harbors a Nucleotide-binding site (NBS). A number of studies have focused on NBS-encoding genes in disease resistant breeding programs for diverse plants. However, little information has been reported with an emphasis on systematic analysis and comparison of NBS-encoding genes in cotton. To fill this gap of knowledge, in this study, we identified and investigated the NBS-encoding resistance genes in cotton using the whole genome sequence information of Gossypium raimondii. Totally, 355 NBS-encoding resistance genes were identified. Analyses of the conserved motifs and structural diversity showed that the most two distinct features for these genes are the high proportion of non-regular NBS genes and the high diversity of N-termini domains. Analyses of the physical locations and duplications of NBS-encoding genes showed that gene duplication of disease resistance genes could play an important role in cotton by leading to an increase in the functional diversity of the cotton NBS-encoding genes. Analyses of phylogenetic comparisons indicated that, in cotton, the NBS-encoding genes with TIR domain not only have their own evolution pattern different from those of genes without TIR domain, but also have their own species-specific pattern that differs from those of TIR genes in other plants. Analyses of the correlation between disease resistance QTL and NBS-encoding resistance genes showed that there could be more than half of the disease resistance QTL associated to the NBS-encoding genes in cotton, which agrees with previous studies establishing that more than half of plant resistance genes are NBS-encoding genes. PMID:23936305
Xing, Mengxin; Hou, Zhanhui; Yuan, Jianbo; Liu, Yuan; Qu, Yanmei; Liu, Bin
2013-12-01
Metagenomics combined with 16S rRNA gene sequence analyses was applied to unveil the taxonomic composition and functional diversity of the farmed adult turbot gastrointestinal (GI) microbiome. Proteobacteria and Firmicutes which existed in both GI content and mucus were dominated in the turbot GI microbiome. 16S rRNA gene sequence analyses also indicated that the turbot GI tract may harbor some bacteria which originated from associated seawater. Functional analyses indicated that the clustering-based subsystem and many metabolic subsystems were dominant in the turbot GI metagenome. Compared with other gut metagenomes, quorum sensing and biofilm formation was overabundant in the turbot GI metagenome. Genes associated with quorum sensing and biofilm formation were found in species within Vibrio, including Vibrio vulnificus, Vibrio cholerae and Vibrio parahaemolyticus. In farmed fish gut metagenomes, the stress response and protein folding subsystems were over-represented and several genes concerning antibiotic and heavy metal resistance were also detected. These data suggested that the turbot GI microbiome may be affected by human factors in aquaculture. Additionally, iron acquisition and the metabolism subsystem were more abundant in the turbot GI metagenome when compared with freshwater fish gut metagenome, suggesting that unique metabolic potential may be observed in marine animal GI microbiomes. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Genome-wide identification and characterization of aquaporin gene family in Beta vulgaris
Kong, Weilong; Yang, Shaozong; Wang, Yulu; Bendahmane, Mohammed
2017-01-01
Aquaporins (AQPs) are essential channel proteins that execute multi-functions throughout plant growth and development, including water transport, uncharged solutes uptake, stress response, and so on. Here, we report the first genome-wide identification and characterization AQP (BvAQP) genes in sugar beet (Beta vulgaris), an important crop widely cultivated for feed, for sugar production and for bioethanol production. Twenty-eight sugar beet AQPs (BvAQPs) were identified and assigned into five subfamilies based on phylogenetic analyses: seven of plasma membrane (PIPs), eight of tonoplast (TIPs), nine of NOD26-like (NIPs), three of small basic (SIPs), and one of x-intrinsic proteins (XIPs). BvAQP genes unevenly mapped on all chromosomes, except on chromosome 4. Gene structure and motifs analyses revealed that BvAQP have conserved exon-intron organization and that they exhibit conserved motifs within each subfamily. Prediction of BvAQPs functions, based on key protein domains conservation, showed a remarkable difference in substrate specificity among the five subfamilies. Analyses of BvAQPs expression, by mean of RNA-seq, in different plant organs and in response to various abiotic stresses revealed that they were ubiquitously expressed and that their expression was induced by heat and salt stresses. These results provide a reference base to address further the function of sugar beet aquaporins and to explore future applications for plants growth and development improvements as well as in response to environmental stresses. PMID:28948097
Diversity and function in microbial mats from the Lucky Strike hydrothermal vent field.
Crépeau, Valentin; Cambon Bonavita, Marie-Anne; Lesongeur, Françoise; Randrianalivelo, Henintsoa; Sarradin, Pierre-Marie; Sarrazin, Jozée; Godfroy, Anne
2011-06-01
Diversity and function in microbial mats from the Lucky Strike hydrothermal vent field (Mid-Atlantic Ridge) were investigated using molecular approaches. DNA and RNA were extracted from mat samples overlaying hydrothermal deposits and Bathymodiolus azoricus mussel assemblages. We constructed and analyzed libraries of 16S rRNA gene sequences and sequences of functional genes involved in autotrophic carbon fixation [forms I and II RuBisCO (cbbL/M), ATP-citrate lyase B (aclB)]; methane oxidation [particulate methane monooxygenase (pmoA)] and sulfur oxidation [adenosine-5'-phosphosulfate reductase (aprA) and soxB]. To gain new insights into the relationships between mats and mussels, we also used new domain-specific 16S rRNA gene primers targeting Bathymodiolus sp. symbionts. All identified archaeal sequences were affiliated with a single group: the marine group 1 Thaumarchaeota. In contrast, analyses of bacterial sequences revealed much higher diversity, although two phyla Proteobacteria and Bacteroidetes were largely dominant. The 16S rRNA gene sequence library revealed that species affiliated to Beggiatoa Gammaproteobacteria were the dominant active population. Analyses of DNA and RNA functional gene libraries revealed a diverse and active chemolithoautotrophic population. Most of these sequences were affiliated with Gammaproteobacteria, including hydrothermal fauna symbionts, Thiotrichales and Methylococcales. PCR and reverse transcription-PCR using 16S rRNA gene primers targeted to Bathymodiolus sp. symbionts revealed sequences affiliated with both methanotrophic and thiotrophic endosymbionts. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
The genetics of feed conversion efficiency traits in a commercial broiler line
Reyer, Henry; Hawken, Rachel; Murani, Eduard; Ponsuksili, Siriluck; Wimmers, Klaus
2015-01-01
Individual feed conversion efficiency (FCE) is a major trait that influences the usage of energy resources and the ecological footprint of livestock production. The underlying biological processes of FCE are complex and are influenced by factors as diverse as climate, feed properties, gut microbiota, and individual genetic predisposition. To gain an insight to the genetic relationships with FCE traits and to contribute to the improvement of FCE in commercial chicken lines, a genome-wide association study was conducted using a commercial broiler population (n = 859) tested for FCE and weight traits during the finisher period from 39 to 46 days of age. Both single-marker (generalized linear model) and multi-marker (Bayesian approach) analyses were applied to the dataset to detect genes associated with the variability in FCE. The separate analyses revealed 22 quantitative trait loci (QTL) regions on 13 different chromosomes; the integration of both approaches resulted in 7 overlapping QTL regions. The analyses pointed to acylglycerol kinase (AGK) and general transcription factor 2-I (GTF2I) as positional and functional candidate genes. Non-synonymous polymorphisms of both candidate genes revealed evidence for a functional importance of these genes by influencing different biological aspects of FCE. PMID:26552583
Origin and functional diversification of an amphibian defense peptide arsenal.
Roelants, Kim; Fry, Bryan G; Ye, Lumeng; Stijlemans, Benoit; Brys, Lea; Kok, Philippe; Clynen, Elke; Schoofs, Liliane; Cornelis, Pierre; Bossuyt, Franky
2013-01-01
The skin secretion of many amphibians contains an arsenal of bioactive molecules, including hormone-like peptides (HLPs) acting as defense toxins against predators, and antimicrobial peptides (AMPs) providing protection against infectious microorganisms. Several amphibian taxa seem to have independently acquired the genes to produce skin-secreted peptide arsenals, but it remains unknown how these originated from a non-defensive ancestral gene and evolved diverse defense functions against predators and pathogens. We conducted transcriptome, genome, peptidome and phylogenetic analyses to chart the full gene repertoire underlying the defense peptide arsenal of the frog Silurana tropicalis and reconstruct its evolutionary history. Our study uncovers a cluster of 13 transcriptionally active genes, together encoding up to 19 peptides, including diverse HLP homologues and AMPs. This gene cluster arose from a duplicated gastrointestinal hormone gene that attained a HLP-like defense function after major remodeling of its promoter region. Instead, new defense functions, including antimicrobial activity, arose by mutation of the precursor proteins, resulting in the proteolytic processing of secondary peptides alongside the original ones. Although gene duplication did not trigger functional innovation, it may have subsequently facilitated the convergent loss of the original function in multiple gene lineages (subfunctionalization), completing their transformation from HLP gene to AMP gene. The processing of multiple peptides from a single precursor entails a mechanism through which peptide-encoding genes may establish new functions without the need for gene duplication to avoid adaptive conflicts with older ones.
Origin and Functional Diversification of an Amphibian Defense Peptide Arsenal
Roelants, Kim; Fry, Bryan G.; Ye, Lumeng; Stijlemans, Benoit; Brys, Lea; Kok, Philippe; Clynen, Elke; Schoofs, Liliane; Cornelis, Pierre; Bossuyt, Franky
2013-01-01
The skin secretion of many amphibians contains an arsenal of bioactive molecules, including hormone-like peptides (HLPs) acting as defense toxins against predators, and antimicrobial peptides (AMPs) providing protection against infectious microorganisms. Several amphibian taxa seem to have independently acquired the genes to produce skin-secreted peptide arsenals, but it remains unknown how these originated from a non-defensive ancestral gene and evolved diverse defense functions against predators and pathogens. We conducted transcriptome, genome, peptidome and phylogenetic analyses to chart the full gene repertoire underlying the defense peptide arsenal of the frog Silurana tropicalis and reconstruct its evolutionary history. Our study uncovers a cluster of 13 transcriptionally active genes, together encoding up to 19 peptides, including diverse HLP homologues and AMPs. This gene cluster arose from a duplicated gastrointestinal hormone gene that attained a HLP-like defense function after major remodeling of its promoter region. Instead, new defense functions, including antimicrobial activity, arose by mutation of the precursor proteins, resulting in the proteolytic processing of secondary peptides alongside the original ones. Although gene duplication did not trigger functional innovation, it may have subsequently facilitated the convergent loss of the original function in multiple gene lineages (subfunctionalization), completing their transformation from HLP gene to AMP gene. The processing of multiple peptides from a single precursor entails a mechanism through which peptide-encoding genes may establish new functions without the need for gene duplication to avoid adaptive conflicts with older ones. PMID:23935531
Hamada, Aska; Miyawaki, Katsuyuki; Honda-sumi, Eri; Tomioka, Kenji; Mito, Taro; Ohuchi, Hideyo; Noji, Sumihare
2009-08-01
In order to explore a possibility that the cricket Gryllus bimaculatus would be a useful model to unveil molecular mechanisms of human diseases, we performed loss-of-function analyses of Gryllus genes homologous to human genes that are responsible for human disorders, fragile X mental retardation 1 (fmr1) and Dopamine receptor (DopR). We cloned cDNAs of their Gryllus homologues, Gb'fmr1, Gb'DopRI, and Gb'DopRII, and analyzed their functions with use of nymphal RNA interference (RNAi). For Gb'fmr1, three major phenotypes were observed: (1) abnormal wing postures, (2) abnormal calling song, and (3) loss of the circadian locomotor rhythm, while for Gb'DopRI, defects of wing posture and morphology were found. These results indicate that the cricket has the potential to become a novel model system to explore human neuronal pathogenic mechanisms and to screen therapeutic drugs by RNAi. Copyright (c) 2009 Wiley-Liss, Inc.
Zebrafish models for the functional genomics of neurogenetic disorders.
Kabashi, Edor; Brustein, Edna; Champagne, Nathalie; Drapeau, Pierre
2011-03-01
In this review, we consider recent work using zebrafish to validate and study the functional consequences of mutations of human genes implicated in a broad range of degenerative and developmental disorders of the brain and spinal cord. Also we present technical considerations for those wishing to study their own genes of interest by taking advantage of this easily manipulated and clinically relevant model organism. Zebrafish permit mutational analyses of genetic function (gain or loss of function) and the rapid validation of human variants as pathological mutations. In particular, neural degeneration can be characterized at genetic, cellular, functional, and behavioral levels. Zebrafish have been used to knock down or express mutations in zebrafish homologs of human genes and to directly express human genes bearing mutations related to neurodegenerative disorders such as spinal muscular atrophy, ataxia, hereditary spastic paraplegia, amyotrophic lateral sclerosis (ALS), epilepsy, Huntington's disease, Parkinson's disease, fronto-temporal dementia, and Alzheimer's disease. More recently, we have been using zebrafish to validate mutations of synaptic genes discovered by large-scale genomic approaches in developmental disorders such as autism, schizophrenia, and non-syndromic mental retardation. Advances in zebrafish genetics such as multigenic analyses and chemical genetics now offer a unique potential for disease research. Thus, zebrafish hold much promise for advancing the functional genomics of human diseases, the understanding of the genetics and cell biology of degenerative and developmental disorders, and the discovery of therapeutics. This article is part of a Special Issue entitled Zebrafish Models of Neurological Diseases. Copyright © 2010 Elsevier B.V. All rights reserved.
Shi, Weiwei; Bugrim, Andrej; Nikolsky, Yuri; Nikolskya, Tatiana; Brennan, Richard J
2008-01-01
ABSTRACT The ideal toxicity biomarker is composed of the properties of prediction (is detected prior to traditional pathological signs of injury), accuracy (high sensitivity and specificity), and mechanistic relationships to the endpoint measured (biological relevance). Gene expression-based toxicity biomarkers ("signatures") have shown good predictive power and accuracy, but are difficult to interpret biologically. We have compared different statistical methods of feature selection with knowledge-based approaches, using GeneGo's database of canonical pathway maps, to generate gene sets for the classification of renal tubule toxicity. The gene set selection algorithms include four univariate analyses: t-statistics, fold-change, B-statistics, and RankProd, and their combination and overlap for the identification of differentially expressed probes. Enrichment analysis following the results of the four univariate analyses, Hotelling T-square test, and, finally out-of-bag selection, a variant of cross-validation, were used to identify canonical pathway maps-sets of genes coordinately involved in key biological processes-with classification power. Differentially expressed genes identified by the different statistical univariate analyses all generated reasonably performing classifiers of tubule toxicity. Maps identified by enrichment analysis or Hotelling T-square had lower classification power, but highlighted perturbed lipid homeostasis as a common discriminator of nephrotoxic treatments. The out-of-bag method yielded the best functionally integrated classifier. The map "ephrins signaling" performed comparably to a classifier derived using sparse linear programming, a machine learning algorithm, and represents a signaling network specifically involved in renal tubule development and integrity. Such functional descriptors of toxicity promise to better integrate predictive toxicogenomics with mechanistic analysis, facilitating the interpretation and risk assessment of predictive genomic investigations.
Allen, Andrew E; Moustafa, Ahmed; Montsant, Anton; Eckert, Angelika; Kroth, Peter G; Bowler, Chris
2012-01-01
Diatoms and other chlorophyll-c containing, or chromalveolate, algae are among the most productive and diverse phytoplankton in the ocean. Evolutionarily, chlorophyll-c algae are linked through common, although not necessarily monophyletic, acquisition of plastid endosymbionts of red as well as most likely green algal origin. There is also strong evidence for a relatively high level of lineage-specific bacterial gene acquisition within chromalveolates. Therefore, analyses of gene content and derivation in chromalveolate taxa have indicated particularly diverse origins of their overall gene repertoire. As a single group of functionally related enzymes spanning two distinct gene families, fructose 1,6-bisphosphate aldolases (FBAs) illustrate the influence on core biochemical pathways of specific evolutionary associations among diatoms and other chromalveolates with various plastid-bearing and bacterial endosymbionts. Protein localization and activity, gene expression, and phylogenetic analyses indicate that the pennate diatom Phaeodactylum tricornutum contains five FBA genes with very little overall functional overlap. Three P. tricornutum FBAs, one class I and two class II, are plastid localized, and each appears to have a distinct evolutionary origin as well as function. Class I plastid FBA appears to have been acquired by chromalveolates from a red algal endosymbiont, whereas one copy of class II plastid FBA is likely to have originated from an ancient green algal endosymbiont. The other copy appears to be the result of a chromalveolate-specific gene duplication. Plastid FBA I and chromalveolate-specific class II plastid FBA are localized in the pyrenoid region of the chloroplast where they are associated with β-carbonic anhydrase, which is known to play a significant role in regulation of the diatom carbon concentrating mechanism. The two pyrenoid-associated FBAs are distinguished by contrasting gene expression profiles under nutrient limiting compared with optimal CO2 fixation conditions, suggestive of a distinct specialized function for each. Cytosolically localized FBAs in P. tricornutum likely play a role in glycolysis and cytoskeleton function and seem to have originated from the stramenopile host cell and from diatom-specific bacterial gene transfer, respectively.
Allen, Andrew E.; Moustafa, Ahmed; Montsant, Anton; Eckert, Angelika; Kroth, Peter G.; Bowler, Chris
2012-01-01
Diatoms and other chlorophyll-c containing, or chromalveolate, algae are among the most productive and diverse phytoplankton in the ocean. Evolutionarily, chlorophyll-c algae are linked through common, although not necessarily monophyletic, acquisition of plastid endosymbionts of red as well as most likely green algal origin. There is also strong evidence for a relatively high level of lineage-specific bacterial gene acquisition within chromalveolates. Therefore, analyses of gene content and derivation in chromalveolate taxa have indicated particularly diverse origins of their overall gene repertoire. As a single group of functionally related enzymes spanning two distinct gene families, fructose 1,6-bisphosphate aldolases (FBAs) illustrate the influence on core biochemical pathways of specific evolutionary associations among diatoms and other chromalveolates with various plastid-bearing and bacterial endosymbionts. Protein localization and activity, gene expression, and phylogenetic analyses indicate that the pennate diatom Phaeodactylum tricornutum contains five FBA genes with very little overall functional overlap. Three P. tricornutum FBAs, one class I and two class II, are plastid localized, and each appears to have a distinct evolutionary origin as well as function. Class I plastid FBA appears to have been acquired by chromalveolates from a red algal endosymbiont, whereas one copy of class II plastid FBA is likely to have originated from an ancient green algal endosymbiont. The other copy appears to be the result of a chromalveolate-specific gene duplication. Plastid FBA I and chromalveolate-specific class II plastid FBA are localized in the pyrenoid region of the chloroplast where they are associated with β-carbonic anhydrase, which is known to play a significant role in regulation of the diatom carbon concentrating mechanism. The two pyrenoid-associated FBAs are distinguished by contrasting gene expression profiles under nutrient limiting compared with optimal CO2 fixation conditions, suggestive of a distinct specialized function for each. Cytosolically localized FBAs in P. tricornutum likely play a role in glycolysis and cytoskeleton function and seem to have originated from the stramenopile host cell and from diatom-specific bacterial gene transfer, respectively. PMID:21903677
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chaiboonchoe, Amphun; Ghamsari, Lila; Dohai, Bushra
Metabolic networks, which are mathematical representations of organismal metabolism, are reconstructed to provide computational platforms to guide metabolic engineering experiments and explore fundamental questions on metabolism. Systems level analyses, such as interrogation of phylogenetic relationships within the network, can provide further guidance on the modification of metabolic circuitries. Chlamydomonas reinhardtii, a biofuel relevant green alga that has retained key genes with plant, animal, and protist affinities, serves as an ideal model organism to investigate the interplay between gene function and phylogenetic affinities at multiple organizational levels. Here, using detailed topological and functional analyses, coupled with transcriptomics studies on a metabolicmore » network that we have reconstructed for C. reinhardtii, we show that network connectivity has a significant concordance with the co-conservation of genes; however, a distinction between topological and functional relationships is observable within the network. Dynamic and static modes of co-conservation were defined and observed in a subset of gene-pairs across the network topologically. In contrast, genes with predicted synthetic interactions, or genes involved in coupled reactions, show significant enrichment for both shorter and longer phylogenetic distances. Based on our results, we propose that the metabolic network of C. reinhardtii is assembled with an architecture to minimize phylogenetic profile distances topologically, while it includes an expansion of such distances for functionally interacting genes. This arrangement may increase the robustness of C. reinhardtii's network in dealing with varied environmental challenges that the species may face. As a result, the defined evolutionary constraints within the network, which identify important pairings of genes in metabolism, may offer guidance on synthetic biology approaches to optimize the production of desirable metabolites.« less
Chaiboonchoe, Amphun; Ghamsari, Lila; Dohai, Bushra; Ng, Patrick; Khraiwesh, Basel; Jaiswal, Ashish; Jijakli, Kenan; Koussa, Joseph; Nelson, David R; Cai, Hong; Yang, Xinping; Chang, Roger L; Papin, Jason; Yu, Haiyuan; Balaji, Santhanam; Salehi-Ashtiani, Kourosh
2016-07-19
Metabolic networks, which are mathematical representations of organismal metabolism, are reconstructed to provide computational platforms to guide metabolic engineering experiments and explore fundamental questions on metabolism. Systems level analyses, such as interrogation of phylogenetic relationships within the network, can provide further guidance on the modification of metabolic circuitries. Chlamydomonas reinhardtii, a biofuel relevant green alga that has retained key genes with plant, animal, and protist affinities, serves as an ideal model organism to investigate the interplay between gene function and phylogenetic affinities at multiple organizational levels. Here, using detailed topological and functional analyses, coupled with transcriptomics studies on a metabolic network that we have reconstructed for C. reinhardtii, we show that network connectivity has a significant concordance with the co-conservation of genes; however, a distinction between topological and functional relationships is observable within the network. Dynamic and static modes of co-conservation were defined and observed in a subset of gene-pairs across the network topologically. In contrast, genes with predicted synthetic interactions, or genes involved in coupled reactions, show significant enrichment for both shorter and longer phylogenetic distances. Based on our results, we propose that the metabolic network of C. reinhardtii is assembled with an architecture to minimize phylogenetic profile distances topologically, while it includes an expansion of such distances for functionally interacting genes. This arrangement may increase the robustness of C. reinhardtii's network in dealing with varied environmental challenges that the species may face. The defined evolutionary constraints within the network, which identify important pairings of genes in metabolism, may offer guidance on synthetic biology approaches to optimize the production of desirable metabolites.
Chaiboonchoe, Amphun; Ghamsari, Lila; Dohai, Bushra; ...
2016-06-14
Metabolic networks, which are mathematical representations of organismal metabolism, are reconstructed to provide computational platforms to guide metabolic engineering experiments and explore fundamental questions on metabolism. Systems level analyses, such as interrogation of phylogenetic relationships within the network, can provide further guidance on the modification of metabolic circuitries. Chlamydomonas reinhardtii, a biofuel relevant green alga that has retained key genes with plant, animal, and protist affinities, serves as an ideal model organism to investigate the interplay between gene function and phylogenetic affinities at multiple organizational levels. Here, using detailed topological and functional analyses, coupled with transcriptomics studies on a metabolicmore » network that we have reconstructed for C. reinhardtii, we show that network connectivity has a significant concordance with the co-conservation of genes; however, a distinction between topological and functional relationships is observable within the network. Dynamic and static modes of co-conservation were defined and observed in a subset of gene-pairs across the network topologically. In contrast, genes with predicted synthetic interactions, or genes involved in coupled reactions, show significant enrichment for both shorter and longer phylogenetic distances. Based on our results, we propose that the metabolic network of C. reinhardtii is assembled with an architecture to minimize phylogenetic profile distances topologically, while it includes an expansion of such distances for functionally interacting genes. This arrangement may increase the robustness of C. reinhardtii's network in dealing with varied environmental challenges that the species may face. As a result, the defined evolutionary constraints within the network, which identify important pairings of genes in metabolism, may offer guidance on synthetic biology approaches to optimize the production of desirable metabolites.« less
Cross-organism learning method to discover new gene functionalities.
Domeniconi, Giacomo; Masseroli, Marco; Moro, Gianluca; Pinoli, Pietro
2016-04-01
Knowledge of gene and protein functions is paramount for the understanding of physiological and pathological biological processes, as well as in the development of new drugs and therapies. Analyses for biomedical knowledge discovery greatly benefit from the availability of gene and protein functional feature descriptions expressed through controlled terminologies and ontologies, i.e., of gene and protein biomedical controlled annotations. In the last years, several databases of such annotations have become available; yet, these valuable annotations are incomplete, include errors and only some of them represent highly reliable human curated information. Computational techniques able to reliably predict new gene or protein annotations with an associated likelihood value are thus paramount. Here, we propose a novel cross-organisms learning approach to reliably predict new functionalities for the genes of an organism based on the known controlled annotations of the genes of another, evolutionarily related and better studied, organism. We leverage a new representation of the annotation discovery problem and a random perturbation of the available controlled annotations to allow the application of supervised algorithms to predict with good accuracy unknown gene annotations. Taking advantage of the numerous gene annotations available for a well-studied organism, our cross-organisms learning method creates and trains better prediction models, which can then be applied to predict new gene annotations of a target organism. We tested and compared our method with the equivalent single organism approach on different gene annotation datasets of five evolutionarily related organisms (Homo sapiens, Mus musculus, Bos taurus, Gallus gallus and Dictyostelium discoideum). Results show both the usefulness of the perturbation method of available annotations for better prediction model training and a great improvement of the cross-organism models with respect to the single-organism ones, without influence of the evolutionary distance between the considered organisms. The generated ranked lists of reliably predicted annotations, which describe novel gene functionalities and have an associated likelihood value, are very valuable both to complement available annotations, for better coverage in biomedical knowledge discovery analyses, and to quicken the annotation curation process, by focusing it on the prioritized novel annotations predicted. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Homologues of CsLOB1 in citrus function as disease susceptibility genes in citrus canker.
Zhang, Junli; Huguet-Tapia, Jose Carlos; Hu, Yang; Jones, Jeffrey; Wang, Nian; Liu, Sanzhen; White, Frank F
2017-08-01
The lateral organ boundary domain (LBD) genes encode a group of plant-specific proteins that function as transcription factors in the regulation of plant growth and development. Citrus sinensis lateral organ boundary 1 (CsLOB1) is a member of the LBD family and functions as a disease susceptibility gene in citrus bacterial canker (CBC). Thirty-four LBD members have been identified from the Citrus sinensis genome. We assessed the potential for additional members of LBD genes in citrus to function as surrogates for CsLOB1 in CBC, and compared host gene expression on induction of different LBD genes. Using custom-designed transcription activator-like (TAL) effectors, two members of the same clade as CsLOB1, named CsLOB2 and CsLOB3, were found to be capable of functioning similarly to CsLOB1 in CBC. RNA sequencing and quantitative reverse transcription-polymerase chain reaction analyses revealed a set of cell wall metabolic genes that are associated with CsLOB1, CsLOB2 and CsLOB3 expression and may represent downstream genes involved in CBC. © 2016 BSPP AND JOHN WILEY & SONS LTD.
Sibout, Richard; Proost, Sebastian; Hansen, Bjoern Oest; Vaid, Neha; Giorgi, Federico M; Ho-Yue-Kuang, Severine; Legée, Frédéric; Cézart, Laurent; Bouchabké-Coussa, Oumaya; Soulhat, Camille; Provart, Nicholas; Pasha, Asher; Le Bris, Philippe; Roujol, David; Hofte, Herman; Jamet, Elisabeth; Lapierre, Catherine; Persson, Staffan; Mutwil, Marek
2017-08-01
While Brachypodium distachyon (Brachypodium) is an emerging model for grasses, no expression atlas or gene coexpression network is available. Such tools are of high importance to provide insights into the function of Brachypodium genes. We present a detailed Brachypodium expression atlas, capturing gene expression in its major organs at different developmental stages. The data were integrated into a large-scale coexpression database ( www.gene2function.de), enabling identification of duplicated pathways and conserved processes across 10 plant species, thus allowing genome-wide inference of gene function. We highlight the importance of the atlas and the platform through the identification of duplicated cell wall modules, and show that a lignin biosynthesis module is conserved across angiosperms. We identified and functionally characterised a putative ferulate 5-hydroxylase gene through overexpression of it in Brachypodium, which resulted in an increase in lignin syringyl units and reduced lignin content of mature stems, and led to improved saccharification of the stem biomass. Our Brachypodium expression atlas thus provides a powerful resource to reveal functionally related genes, which may advance our understanding of important biological processes in grasses. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Zhang, Hailing; Cao, Yingping; Shang, Chen; Li, Jikai; Wang, Jianli; Wu, Zhenying; Ma, Lichao; Qi, Tianxiong; Fu, Chunxiang; Hu, Baozhong
2017-01-01
The GRAS gene family is a large plant-specific family of transcription factors that are involved in diverse processes during plant development. Medicago truncatula is an ideal model plant for genetic research in legumes, and specifically for studying nodulation, which is crucial for nitrogen fixation. In this study, 59 MtGRAS genes were identified and classified into eight distinct subgroups based on phylogenetic relationships. Motifs located in the C-termini were conserved across the subgroups, while motifs in the N-termini were subfamily specific. Gene duplication was the main evolutionary force for MtGRAS expansion, especially proliferation of the LISCL subgroup. Seventeen duplicated genes showed strong effects of purifying selection and diverse expression patterns, highlighting their functional importance and diversification after duplication. Thirty MtGRAS genes, including NSP1 and NSP2, were preferentially expressed in nodules, indicating possible roles in the process of nodulation. A transcriptome study, combined with gene expression analysis under different stress conditions, suggested potential functions of MtGRAS genes in various biological pathways and stress responses. Taken together, these comprehensive analyses provide basic information for understanding the potential functions of GRAS genes, and will facilitate further discovery of MtGRAS gene functions. PMID:28945786
Piyatrakul, Piyanuch; Yang, Meng; Putranto, Riza-Arief; Pirrello, Julien; Dessailly, Florence; Hu, Songnian; Summo, Marilyne; Theeravatanasuk, Kannikar; Leclercq, Julie; Kuswanhadi; Montoro, Pascal
2014-01-01
The AP2/ERF superfamily encodes transcription factors that play a key role in plant development and responses to abiotic and biotic stress. In Hevea brasiliensis, ERF genes have been identified by RNA sequencing. This study set out to validate the number of HbERF genes, and identify ERF genes involved in the regulation of latex cell metabolism. A comprehensive Hevea transcriptome was improved using additional RNA reads from reproductive tissues. Newly assembled contigs were annotated in the Gene Ontology database and were assigned to 3 main categories. The AP2/ERF superfamily is the third most represented compared with other transcription factor families. A comparison with genomic scaffolds led to an estimation of 114 AP2/ERF genes and 1 soloist in Hevea brasiliensis. Based on a phylogenetic analysis, functions were predicted for 26 HbERF genes. A relative transcript abundance analysis was performed by real-time RT-PCR in various tissues. Transcripts of ERFs from group I and VIII were very abundant in all tissues while those of group VII were highly accumulated in latex cells. Seven of the thirty-five ERF expression marker genes were highly expressed in latex. Subcellular localization and transactivation analyses suggested that HbERF-VII candidate genes encoded functional transcription factors. PMID:24971876
Piyatrakul, Piyanuch; Yang, Meng; Putranto, Riza-Arief; Pirrello, Julien; Dessailly, Florence; Hu, Songnian; Summo, Marilyne; Theeravatanasuk, Kannikar; Leclercq, Julie; Kuswanhadi; Montoro, Pascal
2014-01-01
The AP2/ERF superfamily encodes transcription factors that play a key role in plant development and responses to abiotic and biotic stress. In Hevea brasiliensis, ERF genes have been identified by RNA sequencing. This study set out to validate the number of HbERF genes, and identify ERF genes involved in the regulation of latex cell metabolism. A comprehensive Hevea transcriptome was improved using additional RNA reads from reproductive tissues. Newly assembled contigs were annotated in the Gene Ontology database and were assigned to 3 main categories. The AP2/ERF superfamily is the third most represented compared with other transcription factor families. A comparison with genomic scaffolds led to an estimation of 114 AP2/ERF genes and 1 soloist in Hevea brasiliensis. Based on a phylogenetic analysis, functions were predicted for 26 HbERF genes. A relative transcript abundance analysis was performed by real-time RT-PCR in various tissues. Transcripts of ERFs from group I and VIII were very abundant in all tissues while those of group VII were highly accumulated in latex cells. Seven of the thirty-five ERF expression marker genes were highly expressed in latex. Subcellular localization and transactivation analyses suggested that HbERF-VII candidate genes encoded functional transcription factors.
Kaltenegger, Elisabeth; Eich, Eckart; Ober, Dietrich
2013-01-01
Homospermidine synthase (HSS), the first pathway-specific enzyme of pyrrolizidine alkaloid biosynthesis, is known to have its origin in the duplication of a gene encoding deoxyhypusine synthase. To study the processes that followed this gene duplication event and gave rise to HSS, we identified sequences encoding HSS and deoxyhypusine synthase from various species of the Convolvulaceae. We show that HSS evolved only once in this lineage. This duplication event was followed by several losses of a functional gene copy attributable to gene loss or pseudogenization. Statistical analyses of sequence data suggest that, in those lineages in which the gene copy was successfully recruited as HSS, the gene duplication event was followed by phases of various selection pressures, including purifying selection, relaxed functional constraints, and possibly positive Darwinian selection. Site-specific mutagenesis experiments have confirmed that the substitution of sites predicted to be under positive Darwinian selection is sufficient to convert a deoxyhypusine synthase into a HSS. In addition, analyses of transcript levels have shown that HSS and deoxyhypusine synthase have also diverged with respect to their regulation. The impact of protein–protein interaction on the evolution of HSS is discussed with respect to current models of enzyme evolution. PMID:23572540
MANTIS: a phylogenetic framework for multi-species genome comparisons.
Tzika, Athanasia C; Helaers, Raphaël; Van de Peer, Yves; Milinkovitch, Michel C
2008-01-15
Practitioners of comparative genomics face huge analytical challenges as whole genome sequences and functional/expression data accumulate. Furthermore, the field would greatly benefit from a better integration of this wealth of data with evolutionary concepts. Here, we present MANTIS, a relational database for the analysis of (i) gains and losses of genes on specific branches of the metazoan phylogeny, (ii) reconstructed genome content of ancestral species and (iii) over- or under-representation of functions/processes and tissue specificity of gained, duplicated and lost genes. MANTIS estimates the most likely positions of gene losses on the true phylogeny using a maximum-likelihood function. A user-friendly interface and an extensive query system allow to investigate questions pertaining to gene identity, phylogenetic mapping and function/expression parameters. MANTIS is freely available at http://www.mantisdb.org and constitutes the missing link between multi-species genome comparisons and functional analyses.
Genetic resources for maize cell wall biology.
Penning, Bryan W; Hunter, Charles T; Tayengwa, Reuben; Eveland, Andrea L; Dugard, Christopher K; Olek, Anna T; Vermerris, Wilfred; Koch, Karen E; McCarty, Donald R; Davis, Mark F; Thomas, Steven R; McCann, Maureen C; Carpita, Nicholas C
2009-12-01
Grass species represent a major source of food, feed, and fiber crops and potential feedstocks for biofuel production. Most of the biomass is contributed by cell walls that are distinct in composition from all other flowering plants. Identifying cell wall-related genes and their functions underpins a fundamental understanding of growth and development in these species. Toward this goal, we are building a knowledge base of the maize (Zea mays) genes involved in cell wall biology, their expression profiles, and the phenotypic consequences of mutation. Over 750 maize genes were annotated and assembled into gene families predicted to function in cell wall biogenesis. Comparative genomics of maize, rice (Oryza sativa), and Arabidopsis (Arabidopsis thaliana) sequences reveal differences in gene family structure between grass species and a reference eudicot species. Analysis of transcript profile data for cell wall genes in developing maize ovaries revealed that expression within families differed by up to 100-fold. When transcriptional analyses of developing ovaries before pollination from Arabidopsis, rice, and maize were contrasted, distinct sets of cell wall genes were expressed in grasses. These differences in gene family structure and expression between Arabidopsis and the grasses underscore the requirement for a grass-specific genetic model for functional analyses. A UniformMu population proved to be an important resource in both forward- and reverse-genetics approaches to identify hundreds of mutants in cell wall genes. A forward screen of field-grown lines by near-infrared spectroscopic screen of mature leaves yielded several dozen lines with heritable spectroscopic phenotypes. Pyrolysis-molecular beam mass spectrometry confirmed that several nir mutants had altered carbohydrate-lignin compositions.
Soybean kinome: functional classification and gene expression patterns
Liu, Jinyi; Chen, Nana; Grant, Joshua N.; Cheng, Zong-Ming (Max); Stewart, C. Neal; Hewezi, Tarek
2015-01-01
The protein kinase (PK) gene family is one of the largest and most highly conserved gene families in plants and plays a role in nearly all biological functions. While a large number of genes have been predicted to encode PKs in soybean, a comprehensive functional classification and global analysis of expression patterns of this large gene family is lacking. In this study, we identified the entire soybean PK repertoire or kinome, which comprised 2166 putative PK genes, representing 4.67% of all soybean protein-coding genes. The soybean kinome was classified into 19 groups, 81 families, and 122 subfamilies. The receptor-like kinase (RLK) group was remarkably large, containing 1418 genes. Collinearity analysis indicated that whole-genome segmental duplication events may have played a key role in the expansion of the soybean kinome, whereas tandem duplications might have contributed to the expansion of specific subfamilies. Gene structure, subcellular localization prediction, and gene expression patterns indicated extensive functional divergence of PK subfamilies. Global gene expression analysis of soybean PK subfamilies revealed tissue- and stress-specific expression patterns, implying regulatory functions over a wide range of developmental and physiological processes. In addition, tissue and stress co-expression network analysis uncovered specific subfamilies with narrow or wide interconnected relationships, indicative of their association with particular or broad signalling pathways, respectively. Taken together, our analyses provide a foundation for further functional studies to reveal the biological and molecular functions of PKs in soybean. PMID:25614662
Rioualen, Claire; Da Costa, Quentin; Chetrit, Bernard; Charafe-Jauffret, Emmanuelle; Ginestier, Christophe
2017-01-01
High-throughput RNAi screenings (HTS) allow quantifying the impact of the deletion of each gene in any particular function, from virus-host interactions to cell differentiation. However, there has been less development for functional analysis tools dedicated to RNAi analyses. HTS-Net, a network-based analysis program, was developed to identify gene regulatory modules impacted in high-throughput screenings, by integrating transcription factors-target genes interaction data (regulome) and protein-protein interaction networks (interactome) on top of screening z-scores. HTS-Net produces exhaustive HTML reports for results navigation and exploration. HTS-Net is a new pipeline for RNA interference screening analyses that proves better performance than simple gene rankings by z-scores, by re-prioritizing genes and replacing them in their biological context, as shown by the three studies that we reanalyzed. Formatted input data for the three studied datasets, source code and web site for testing the system are available from the companion web site at http://htsnet.marseille.inserm.fr/. We also compared our program with existing algorithms (CARD and hotnet2). PMID:28949986
Schmitz, Judith; Lor, Stephanie; Klose, Rena; Güntürkün, Onur; Ocklenburg, Sebastian
2017-01-01
Handedness and language lateralization are partially determined by genetic influences. It has been estimated that at least 40 (and potentially more) possibly interacting genes may influence the ontogenesis of hemispheric asymmetries. Recently, it has been suggested that analyzing the genetics of hemispheric asymmetries on the level of gene ontology sets, rather than at the level of individual genes, might be more informative for understanding the underlying functional cascades. Here, we performed gene ontology, pathway and disease association analyses on genes that have previously been associated with handedness and language lateralization. Significant gene ontology sets for handedness were anatomical structure development, pattern specification (especially asymmetry formation) and biological regulation. Pathway analysis highlighted the importance of the TGF-beta signaling pathway for handedness ontogenesis. Significant gene ontology sets for language lateralization were responses to different stimuli, nervous system development, transport, signaling, and biological regulation. Despite the fact that some authors assume that handedness and language lateralization share a common ontogenetic basis, gene ontology sets barely overlap between phenotypes. Compared to genes involved in handedness, which mostly contribute to structural development, genes involved in language lateralization rather contribute to activity-dependent cognitive processes. Disease association analysis revealed associations of genes involved in handedness with diseases affecting the whole body, while genes involved in language lateralization were specifically engaged in mental and neurological diseases. These findings further support the idea that handedness and language lateralization are ontogenetically independent, complex phenotypes.
Schmitz, Judith; Lor, Stephanie; Klose, Rena; Güntürkün, Onur; Ocklenburg, Sebastian
2017-01-01
Handedness and language lateralization are partially determined by genetic influences. It has been estimated that at least 40 (and potentially more) possibly interacting genes may influence the ontogenesis of hemispheric asymmetries. Recently, it has been suggested that analyzing the genetics of hemispheric asymmetries on the level of gene ontology sets, rather than at the level of individual genes, might be more informative for understanding the underlying functional cascades. Here, we performed gene ontology, pathway and disease association analyses on genes that have previously been associated with handedness and language lateralization. Significant gene ontology sets for handedness were anatomical structure development, pattern specification (especially asymmetry formation) and biological regulation. Pathway analysis highlighted the importance of the TGF-beta signaling pathway for handedness ontogenesis. Significant gene ontology sets for language lateralization were responses to different stimuli, nervous system development, transport, signaling, and biological regulation. Despite the fact that some authors assume that handedness and language lateralization share a common ontogenetic basis, gene ontology sets barely overlap between phenotypes. Compared to genes involved in handedness, which mostly contribute to structural development, genes involved in language lateralization rather contribute to activity-dependent cognitive processes. Disease association analysis revealed associations of genes involved in handedness with diseases affecting the whole body, while genes involved in language lateralization were specifically engaged in mental and neurological diseases. These findings further support the idea that handedness and language lateralization are ontogenetically independent, complex phenotypes. PMID:28729848
da Silva, Danielle Costenaro; da Silveira Falavigna, Vítor; Fasoli, Marianna; Buffon, Vanessa; Porto, Diogo Denardi; Pappas, Georgios Joannis; Pezzotti, Mario; Pasquali, Giancarlo; Revers, Luís Fernando
2016-01-01
The Dof (DNA-binding with one finger) protein family spans a group of plant transcription factors involved in the regulation of several functions, such as plant responses to stress, hormones and light, phytochrome signaling and seed germination. Here we describe the Dof-like gene family in grapevine (Vitis vinifera L.), which consists of 25 genes coding for Dof. An extensive in silico characterization of the VviDofL gene family was performed. Additionally, the expression of the entire gene family was assessed in 54 grapevine tissues and organs using an integrated approach with microarray (cv Corvina) and real-time PCR (cv Pinot Noir) analyses. The phylogenetic analysis comparing grapevine sequences with those of Arabidopsis, tomato, poplar and already described Dof genes in other species allowed us to identify several duplicated genes. The diversification of grapevine DofL genes during evolution likely resulted in a broader range of biological roles. Furthermore, distinct expression patterns were identified between samples analyzed, corroborating such hypothesis. Our expression results indicate that several VviDofL genes perform their functional roles mainly during flower, berry and seed development, highlighting their importance for grapevine growth and production. The identification of similar expression profiles between both approaches strongly suggests that these genes have important regulatory roles that are evolutionally conserved between grapevine cvs Corvina and Pinot Noir.
da Silva, Danielle Costenaro; da Silveira Falavigna, Vítor; Fasoli, Marianna; Buffon, Vanessa; Porto, Diogo Denardi; Pappas, Georgios Joannis; Pezzotti, Mario; Pasquali, Giancarlo; Revers, Luís Fernando
2016-01-01
The Dof (DNA-binding with one finger) protein family spans a group of plant transcription factors involved in the regulation of several functions, such as plant responses to stress, hormones and light, phytochrome signaling and seed germination. Here we describe the Dof-like gene family in grapevine (Vitis vinifera L.), which consists of 25 genes coding for Dof. An extensive in silico characterization of the VviDofL gene family was performed. Additionally, the expression of the entire gene family was assessed in 54 grapevine tissues and organs using an integrated approach with microarray (cv Corvina) and real-time PCR (cv Pinot Noir) analyses. The phylogenetic analysis comparing grapevine sequences with those of Arabidopsis, tomato, poplar and already described Dof genes in other species allowed us to identify several duplicated genes. The diversification of grapevine DofL genes during evolution likely resulted in a broader range of biological roles. Furthermore, distinct expression patterns were identified between samples analyzed, corroborating such hypothesis. Our expression results indicate that several VviDofL genes perform their functional roles mainly during flower, berry and seed development, highlighting their importance for grapevine growth and production. The identification of similar expression profiles between both approaches strongly suggests that these genes have important regulatory roles that are evolutionally conserved between grapevine cvs Corvina and Pinot Noir. PMID:27610237
Gudhka, Reema K; Neilan, Brett A; Burns, Brendan P
2015-01-01
Halococcus hamelinensis was the first archaeon isolated from stromatolites. These geomicrobial ecosystems are thought to be some of the earliest known on Earth, yet, despite their evolutionary significance, the role of Archaea in these systems is still not well understood. Detailed here is the genome sequencing and analysis of an archaeon isolated from stromatolites. The genome of H. hamelinensis consisted of 3,133,046 base pairs with an average G+C content of 60.08% and contained 3,150 predicted coding sequences or ORFs, 2,196 (68.67%) of which were protein-coding genes with functional assignments and 954 (29.83%) of which were of unknown function. Codon usage of the H. hamelinensis genome was consistent with a highly acidic proteome, a major adaptive mechanism towards high salinity. Amino acid transport and metabolism, inorganic ion transport and metabolism, energy production and conversion, ribosomal structure, and unknown function COG genes were overrepresented. The genome of H. hamelinensis also revealed characteristics reflecting its survival in its extreme environment, including putative genes/pathways involved in osmoprotection, oxidative stress response, and UV damage repair. Finally, genome analyses indicated the presence of putative transposases as well as positive matches of genes of H. hamelinensis against various genomes of Bacteria, Archaea, and viruses, suggesting the potential for horizontal gene transfer.
Evolution of fruit development genes in flowering plants
Pabón-Mora, Natalia; Wong, Gane Ka-Shu; Ambrose, Barbara A.
2014-01-01
The genetic mechanisms regulating dry fruit development and opercular dehiscence have been identified in Arabidopsis thaliana. In the bicarpellate silique, valve elongation and differentiation is controlled by FRUITFULL (FUL) that antagonizes SHATTERPROOF1-2 (SHP1/SHP2) and INDEHISCENT (IND) at the dehiscence zone where they control normal lignification. SHP1/2 are also repressed by REPLUMLESS (RPL), responsible for replum formation. Similarly, FUL indirectly controls two other factors ALCATRAZ (ALC) and SPATULA (SPT) that function in the proper formation of the separation layer. FUL and SHP1/2 belong to the MADS-box family, IND and ALC belong to the bHLH family and RPL belongs to the homeodomain family, all of which are large transcription factor families. These families have undergone numerous duplications and losses in plants, likely accompanied by functional changes. Functional analyses of homologous genes suggest that this network is fairly conserved in Brassicaceae and less conserved in other core eudicots. Only the MADS box genes have been functionally characterized in basal eudicots and suggest partial conservation of the functions recorded for Brassicaceae. Here we do a comprehensive search of SHP, IND, ALC, SPT, and RPL homologs across core-eudicots, basal eudicots, monocots and basal angiosperms. Based on gene-tree analyses we hypothesize what parts of the network for fruit development in Brassicaceae, in particular regarding direct and indirect targets of FUL, might be conserved across angiosperms. PMID:25018763
Xia, Yu; Hu, Man; Wen, Xianghua; Wang, Xiaohui; Yang, Yunfeng; Zhou, Jizhong
2016-01-08
The effect of environmental conditions on the diversity and interactions of microbial communities has caused tremendous interest in microbial ecology. Here, we found that with identical influents but differing operational parameters (mainly mixed liquor suspended solid (MLSS) concentrations, solid retention time (SRT) and dissolved oxygen (DO) concentrations), two full-scale municipal wastewater treatment systems applying oxidation ditch (OD) and membrane bioreactor (MBR) processes harbored a majority of shared genes (87.2%) but had different overall functional gene structures as revealed by two datasets of 12-day time-series generated by a functional gene array-GeoChip 4.2. Association networks of core carbon, nitrogen and phosphorus cycling genes in each system based on random matrix theory (RMT) showed different topological properties and the MBR nodes showed an indication of higher connectivity. MLSS and DO were shown to be effective in shaping functional gene structures of the systems by statistical analyses. Higher MLSS concentrations resulting in decreased resource availability of the MBR system were thought to promote positive interactions of important functional genes. Together, these findings show the differences of functional potentials of some bioprocesses caused by differing environmental conditions and suggest that higher stress of resource limitation increased positive gene interactions in the MBR system.
NASA Astrophysics Data System (ADS)
Xia, Yu; Hu, Man; Wen, Xianghua; Wang, Xiaohui; Yang, Yunfeng; Zhou, Jizhong
2016-01-01
The effect of environmental conditions on the diversity and interactions of microbial communities has caused tremendous interest in microbial ecology. Here, we found that with identical influents but differing operational parameters (mainly mixed liquor suspended solid (MLSS) concentrations, solid retention time (SRT) and dissolved oxygen (DO) concentrations), two full-scale municipal wastewater treatment systems applying oxidation ditch (OD) and membrane bioreactor (MBR) processes harbored a majority of shared genes (87.2%) but had different overall functional gene structures as revealed by two datasets of 12-day time-series generated by a functional gene array-GeoChip 4.2. Association networks of core carbon, nitrogen and phosphorus cycling genes in each system based on random matrix theory (RMT) showed different topological properties and the MBR nodes showed an indication of higher connectivity. MLSS and DO were shown to be effective in shaping functional gene structures of the systems by statistical analyses. Higher MLSS concentrations resulting in decreased resource availability of the MBR system were thought to promote positive interactions of important functional genes. Together, these findings show the differences of functional potentials of some bioprocesses caused by differing environmental conditions and suggest that higher stress of resource limitation increased positive gene interactions in the MBR system.
Xia, Yu; Hu, Man; Wen, Xianghua; Wang, Xiaohui; Yang, Yunfeng; Zhou, Jizhong
2016-01-01
The effect of environmental conditions on the diversity and interactions of microbial communities has caused tremendous interest in microbial ecology. Here, we found that with identical influents but differing operational parameters (mainly mixed liquor suspended solid (MLSS) concentrations, solid retention time (SRT) and dissolved oxygen (DO) concentrations), two full-scale municipal wastewater treatment systems applying oxidation ditch (OD) and membrane bioreactor (MBR) processes harbored a majority of shared genes (87.2%) but had different overall functional gene structures as revealed by two datasets of 12-day time-series generated by a functional gene array-GeoChip 4.2. Association networks of core carbon, nitrogen and phosphorus cycling genes in each system based on random matrix theory (RMT) showed different topological properties and the MBR nodes showed an indication of higher connectivity. MLSS and DO were shown to be effective in shaping functional gene structures of the systems by statistical analyses. Higher MLSS concentrations resulting in decreased resource availability of the MBR system were thought to promote positive interactions of important functional genes. Together, these findings show the differences of functional potentials of some bioprocesses caused by differing environmental conditions and suggest that higher stress of resource limitation increased positive gene interactions in the MBR system. PMID:26743465
An Updated Collection of Sequence Barcoded Temperature-Sensitive Alleles of Yeast Essential Genes
Kofoed, Megan; Milbury, Karissa L.; Chiang, Jennifer H.; Sinha, Sunita; Ben-Aroya, Shay; Giaever, Guri; Nislow, Corey; Hieter, Philip; Stirling, Peter C.
2015-01-01
Systematic analyses of essential gene function using mutant collections in Saccharomyces cerevisiae have been conducted using collections of heterozygous diploids, promoter shut-off alleles, through alleles with destabilized mRNA, destabilized protein, or bearing mutations that lead to a temperature-sensitive (ts) phenotype. We previously described a method for construction of barcoded ts alleles in a systematic fashion. Here we report the completion of this collection of alleles covering 600 essential yeast genes. This resource covers a larger gene repertoire than previous collections and provides a complementary set of strains suitable for single gene and genomic analyses. We use deep sequencing to characterize the amino acid changes leading to the ts phenotype in half of the alleles. We also use high-throughput approaches to describe the relative ts behavior of the alleles. Finally, we demonstrate the experimental usefulness of the collection in a high-content, functional genomic screen for ts alleles that increase spontaneous P-body formation. By increasing the number of alleles and improving the annotation, this ts collection will serve as a community resource for probing new aspects of biology for essential yeast genes. PMID:26175450
An Updated Collection of Sequence Barcoded Temperature-Sensitive Alleles of Yeast Essential Genes.
Kofoed, Megan; Milbury, Karissa L; Chiang, Jennifer H; Sinha, Sunita; Ben-Aroya, Shay; Giaever, Guri; Nislow, Corey; Hieter, Philip; Stirling, Peter C
2015-07-14
Systematic analyses of essential gene function using mutant collections in Saccharomyces cerevisiae have been conducted using collections of heterozygous diploids, promoter shut-off alleles, through alleles with destabilized mRNA, destabilized protein, or bearing mutations that lead to a temperature-sensitive (ts) phenotype. We previously described a method for construction of barcoded ts alleles in a systematic fashion. Here we report the completion of this collection of alleles covering 600 essential yeast genes. This resource covers a larger gene repertoire than previous collections and provides a complementary set of strains suitable for single gene and genomic analyses. We use deep sequencing to characterize the amino acid changes leading to the ts phenotype in half of the alleles. We also use high-throughput approaches to describe the relative ts behavior of the alleles. Finally, we demonstrate the experimental usefulness of the collection in a high-content, functional genomic screen for ts alleles that increase spontaneous P-body formation. By increasing the number of alleles and improving the annotation, this ts collection will serve as a community resource for probing new aspects of biology for essential yeast genes. Copyright © 2015 Kofoed et al.
Single Cell Gene Expression Profiling of Skeletal Muscle-Derived Cells.
Gatto, Sole; Puri, Pier Lorenzo; Malecova, Barbora
2017-01-01
Single cell gene expression profiling is a fundamental tool for studying the heterogeneity of a cell population by addressing the phenotypic and functional characteristics of each cell. Technological advances that have coupled microfluidic technologies with high-throughput quantitative RT-PCR analyses have enabled detailed analyses of single cells in various biological contexts. In this chapter, we describe the procedure for isolating the skeletal muscle interstitial cells termed Fibro-Adipogenic Progenitors (FAPs ) and their gene expression profiling at the single cell level. Moreover, we accompany our bench protocol with bioinformatics analysis designed to process raw data as well as to visualize single cell gene expression data. Single cell gene expression profiling is therefore a useful tool in the investigation of FAPs heterogeneity and their contribution to muscle homeostasis.
Ecological adaptation determines functional mammalian olfactory subgenomes
Hayden, Sara; Bekaert, Michaël; Crider, Tess A.; Mariani, Stefano; Murphy, William J.; Teeling, Emma C.
2010-01-01
The ability to smell is governed by the largest gene family in mammalian genomes, the olfactory receptor (OR) genes. Although these genes are well annotated in the finished human and mouse genomes, we still do not understand which receptors bind specific odorants or how they fully function. Previous comparative studies have been taxonomically limited and mostly focused on the percentage of OR pseudogenes within species. No study has investigated the adaptive changes of functional OR gene families across phylogenetically and ecologically diverse mammals. To determine the extent to which OR gene repertoires have been influenced by habitat, sensory specialization, and other ecological traits, to better understand the functional importance of specific OR gene families and thus the odorants they bind, we compared the functional OR gene repertoires from 50 mammalian genomes. We amplified more than 2000 OR genes in aquatic, semi-aquatic, and flying mammals and coupled these data with 48,000 OR genes from mostly terrestrial mammals, extracted from genomic projects. Phylogenomic, Bayesian assignment, and principle component analyses partitioned species by ecotype (aquatic, semi-aquatic, terrestrial, flying) rather than phylogenetic relatedness, and identified OR families important for each habitat. Functional OR gene repertoires were reduced independently in the multiple origins of aquatic mammals and were significantly divergent in bats. We reject recent neutralist views of olfactory subgenome evolution and correlate specific OR gene families with physiological requirements, a preliminary step toward unraveling the relationship between specific odors and respective OR gene families. PMID:19952139
Wang, Pengfei; Su, Ling; Gao, Huanhuan; Jiang, Xilong; Wu, Xinying; Li, Yi; Zhang, Qianqian; Wang, Yongmei; Ren, Fengshan
2018-01-01
Basic helix-loop-helix (bHLH) transcription factors are involved in many abiotic stress responses as well as flavonol and anthocyanin biosynthesis. In grapes (Vitis vinifera L.), flavonols including anthocyanins and condensed tannins are most abundant in the skins of the berries. Flavonols are important phytochemicals for viticulture and enology, but grape bHLH genes have rarely been examined. We identified 94 grape bHLH genes in a genome-wide analysis and performed Nr and GO function analyses for these genes. Phylogenetic analyses placed the genes into 15 clades, with some remaining orphans. 41 duplicate gene pairs were found in the grape bHLH gene family, and all of these duplicate gene pairs underwent purifying selection. Nine triplicate gene groups were found in the grape bHLH gene family and all of these triplicate gene groups underwent purifying selection. Twenty-two grape bHLH genes could be induced by PEG treatment and 17 grape bHLH genes could be induced by cold stress treatment including a homologous form of MYC2, VvbHLH007. Based on the GO or Nr function annotations, we found three other genes that are potentially related to anthocyanin or flavonol biosynthesis: VvbHLH003, VvbHLH007, and VvbHLH010. We also performed a cis-acting regulatory element analysis on some genes involved in flavonoid or anthocyanin biosynthesis and our results showed that most of these gene promoters contained G-box or E-box elements that could be recognized by bHLH family members. PMID:29449854
Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines
Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J
2016-01-01
Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours’ biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription–quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes. PMID:29263807
Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines.
Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J
2016-01-01
Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours' biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription-quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes.
Wang, Quan; Jia, Peilin; Cuenco, Karen T.; Feingold, Eleanor; Marazita, Mary L.; Wang, Lily; Zhao, Zhongming
2013-01-01
A number of genetic studies have suggested numerous susceptibility genes for dental caries over the past decade with few definite conclusions. The rapid accumulation of relevant information, along with the complex architecture of the disease, provides a challenging but also unique opportunity to review and integrate the heterogeneous data for follow-up validation and exploration. In this study, we collected and curated candidate genes from four major categories: association studies, linkage scans, gene expression analyses, and literature mining. Candidate genes were prioritized according to the magnitude of evidence related to dental caries. We then searched for dense modules enriched with the prioritized candidate genes through their protein-protein interactions (PPIs). We identified 23 modules comprising of 53 genes. Functional analyses of these 53 genes revealed three major clusters: cytokine network relevant genes, matrix metalloproteinases (MMPs) family, and transforming growth factor-beta (TGF-β) family, all of which have been previously implicated to play important roles in tooth development and carious lesions. Through our extensive data collection and an integrative application of gene prioritization and PPI network analyses, we built a dental caries-specific sub-network for the first time. Our study provided insights into the molecular mechanisms underlying dental caries. The framework we proposed in this work can be applied to other complex diseases. PMID:24146904
Sysol, Justin R.; Abbasi, Taimur; Patel, Amit R.; Lang, Roberto M.; Gupta, Akash; Garcia, Joe G. N.; Gordeuk, Victor R.; Machado, Roberto F.
2016-01-01
Background Diastolic dysfunction is common in sickle cell disease (SCD), and is associated with an increased risk of mortality. However, the molecular pathogenesis underlying this development is poorly understood. The aim of this study was to identify a gene expression profile that is associated with diastolic function in SCD, potentially elucidating molecular mechanisms behind diastolic dysfunction development. Methods Diastolic function was measured via echocardiography in 65 patients with SCD from two independent study populations. Gene expression microarray data was compared with diastolic function in both study cohorts. Candidate genes that associated in both analyses were tested for validation in a murine SCD model. Lastly, genotyping array data from the replication cohort was used to derive cis-expression quantitative trait loci (cis-eQTLs) and genetic associations within the candidate gene regions. Results Transcriptome data from both patient cohorts implicated 7 genes associated with diastolic function, and mouse SCD myocardial expression validated 3 of these genes. Genetic associations and eQTLs were detected in 2 of the 3 genes, FUCA2 and IL18. Conclusions FUCA2 and IL18 are associated with diastolic function in SCD patients, and may be involved in the pathogenesis of the disease. Genetic polymorphisms within the FUCA2 and IL18 gene regions are also associated with diastolic function in SCD, likely by affecting expression levels of the genes. PMID:27636371
Lin, Zhe; Lin, Yongsheng
2017-09-05
The aim of this study was to explore potential crucial genes associated with the steroid-induced necrosis of femoral head (SINFH) and to provide valid biological information for further investigation of SINFH. Gene expression profile of GSE26316, generated from 3 SINFH rat samples and 3 normal rat samples were downloaded from Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) were identified using LIMMA package. After functional enrichment analyses of DEGs, protein-protein interaction (PPI) network and sub-PPI network analyses were conducted based on the STRING database and cytoscape. In total, 59 up-regulated DEGs and 156 downregulated DEGs were identified. The up-regulated DEGs were mainly involved in functions about immunity (e.g. Fcer1A and Il7R), and the downregulated DEGs were mainly enriched in muscle system process (e.g. Tnni2, Mylpf and Myl1). The PPI network of DEGs consisted of 123 nodes and 300 interactions. Tnni2, Mylpf, and Myl1 were the top 3 outstanding genes based on both subgraph centrality and degree centrality evaluation. These three genes interacted with each other in the network. Furthermore, the significant network module was composed of 22 downregulated genes (e.g. Tnni2, Mylpf and Myl1). These genes were mainly enriched in functions like muscle system process. The DEGs related to the regulation of immune system process (e.g. Fcer1A and Il7R), and DEGs correlated with muscle system process (e.g. Tnni2, Mylpf and Myl1) may be closely associated with the progress of SINFH, which is still needed to be confirmed by experiments. Copyright © 2017 Elsevier B.V. All rights reserved.
Genomic features separating ten strains of Neorhizobium galegae with different symbiotic phenotypes.
Österman, Janina; Mousavi, Seyed Abdollah; Koskinen, Patrik; Paulin, Lars; Lindström, Kristina
2015-05-02
The symbiotic phenotype of Neorhizobium galegae, with strains specifically fixing nitrogen with either Galega orientalis or G. officinalis, has made it a target in research on determinants of host specificity in nitrogen fixation. The genomic differences between representative strains of the two symbiovars are, however, relatively small. This introduced a need for a dataset representing a larger bacterial population in order to make better conclusions on characteristics typical for a subset of the species. In this study, we produced draft genomes of eight strains of N. galegae having different symbiotic phenotypes, both with regard to host specificity and nitrogen fixation efficiency. These genomes were analysed together with the previously published complete genomes of N. galegae strains HAMBI 540T and HAMBI 1141. The results showed that the presence of an additional rpoN sigma factor gene in the symbiosis gene region is a characteristic specific to symbiovar orientalis, required for nitrogen fixation. Also the nifQ gene was shown to be crucial for functional symbiosis in both symbiovars. Genome-wide analyses identified additional genes characteristic of strains of the same symbiovar and of strains having similar plant growth promoting properties on Galega orientalis. Many of these genes are involved in transcriptional regulation or in metabolic functions. The results of this study confirm that the only symbiosis-related gene that is present in one symbiovar of N. galegae but not in the other is an rpoN gene. The specific function of this gene remains to be determined, however. New genes that were identified as specific for strains of one symbiovar may be involved in determining host specificity, while others are defined as potential determinant genes for differences in efficiency of nitrogen fixation.
Opazo, Juan C.; Toloza-Villalobos, Jessica; Burmester, Thorsten; Venkatesh, Byrappa; Storz, Jay F.
2015-01-01
Comparative analyses of vertebrate genomes continue to uncover a surprising diversity of genes in the globin gene superfamily, some of which have very restricted phyletic distributions despite their antiquity. Genomic analysis of the globin gene repertoire of cartilaginous fish (Chondrichthyes) should be especially informative about the duplicative origins and ancestral functions of vertebrate globins, as divergence between Chondrichthyes and bony vertebrates represents the most basal split within the jawed vertebrates. Here, we report a comparative genomic analysis of the vertebrate globin gene family that includes the complete globin gene repertoire of the elephant shark (Callorhinchus milii). Using genomic sequence data from representatives of all major vertebrate classes, integrated analyses of conserved synteny and phylogenetic relationships revealed that the last common ancestor of vertebrates possessed a repertoire of at least seven globin genes: single copies of androglobin and neuroglobin, four paralogous copies of globin X, and the single-copy progenitor of the entire set of vertebrate-specific globins. Combined with expression data, the genomic inventory of elephant shark globins yielded four especially surprising findings: 1) there is no trace of the neuroglobin gene (a highly conserved gene that is present in all other jawed vertebrates that have been examined to date), 2) myoglobin is highly expressed in heart, but not in skeletal muscle (reflecting a possible ancestral condition in vertebrates with single-circuit circulatory systems), 3) elephant shark possesses two highly divergent globin X paralogs, one of which is preferentially expressed in gonads, and 4) elephant shark possesses two structurally distinct α-globin paralogs, one of which is preferentially expressed in the brain. Expression profiles of elephant shark globin genes reveal distinct specializations of function relative to orthologs in bony vertebrates and suggest hypotheses about ancestral functions of vertebrate globins. PMID:25743544
Luo, Yonglun; Blechingberg, Jenny; Fernandes, Ana Miguel; Li, Shengting; Fryland, Tue; Børglum, Anders D; Bolund, Lars; Nielsen, Anders Lade
2015-11-14
FUS (TLS) and EWS (EWSR1) belong to the FET-protein family of RNA and DNA binding proteins. FUS and EWS are structurally and functionally related and participate in transcriptional regulation and RNA processing. FUS and EWS are identified in translocation generated cancer fusion proteins and involved in the human neurological diseases amyotrophic lateral sclerosis and fronto-temporal lobar degeneration. To determine the gene regulatory functions of FUS and EWS at the level of chromatin, we have performed chromatin immunoprecipitation followed by next generation sequencing (ChIP-seq). Our results show that FUS and EWS bind to a subset of actively transcribed genes, that binding often is downstream the poly(A)-signal, and that binding overlaps with RNA polymerase II. Functional examinations of selected target genes identified that FUS and EWS can regulate gene expression at different levels. Gene Ontology analyses showed that FUS and EWS target genes preferentially encode proteins involved in regulatory processes at the RNA level. The presented results yield new insights into gene interactions of EWS and FUS and have identified a set of FUS and EWS target genes involved in pathways at the RNA regulatory level with potential to mediate normal and disease-associated functions of the FUS and EWS proteins.
Metagenome Analyses of Corroded Concrete Wastewater Pipe Biofilms Reveals a Complex Microbial System
Analysis of whole-metagenome pyrosequencing data and 16S rRNA gene clone libraries was used to determine microbial composition and functional genes associated with biomass harvested from crown (top) and invert (bottom) sections of a corroded wastewater pipe. Taxonomic and functio...
Integrative analysis of micro-RNA, gene expression, and survival of glioblastoma multiforme.
Huang, Yen-Tsung; Hsu, Thomas; Kelsey, Karl T; Lin, Chien-Ling
2015-02-01
Glioblastoma multiforme (GBM), the most common type of malignant brain tumor, is highly fatal. Limited understanding of its rapid progression necessitates additional approaches that integrate what is known about the genomics of this cancer. Using a discovery set (n = 348) and a validation set (n = 174) of GBM patients, we performed genome-wide analyses that integrated mRNA and micro-RNA expression data from GBM as well as associated survival information, assessing coordinated variability in each as this reflects their known mechanistic functions. Cox proportional hazards models were used for the survival analyses, and nonparametric permutation tests were performed for the micro-RNAs to investigate the association between the number of associated genes and its prognostication. We also utilized mediation analyses for micro-RNA-gene pairs to identify their mediation effects. Genome-wide analyses revealed a novel pattern: micro-RNAs related to more gene expressions are more likely to be associated with GBM survival (P = 4.8 × 10(-5)). Genome-wide mediation analyses for the 32,660 micro-RNA-gene pairs with strong association (false discovery rate [FDR] < 0.01%) identified 51 validated pairs with significant mediation effect. Of the 51 pairs, miR-223 had 16 mediation genes. These 16 mediation genes of miR-223 were also highly associated with various other micro-RNAs and mediated their prognostic effects as well. We further constructed a gene signature using the 16 genes, which was highly associated with GBM survival in both the discovery and validation sets (P = 9.8 × 10(-6)). This comprehensive study discovered mediation effects of micro-RNA to gene expression and GBM survival and provided a new analytic framework for integrative genomics. © 2014 WILEY PERIODICALS, INC.
DaGO-Fun: tool for Gene Ontology-based functional analysis using term information content measures.
Mazandu, Gaston K; Mulder, Nicola J
2013-09-25
The use of Gene Ontology (GO) data in protein analyses have largely contributed to the improved outcomes of these analyses. Several GO semantic similarity measures have been proposed in recent years and provide tools that allow the integration of biological knowledge embedded in the GO structure into different biological analyses. There is a need for a unified tool that provides the scientific community with the opportunity to explore these different GO similarity measure approaches and their biological applications. We have developed DaGO-Fun, an online tool available at http://web.cbio.uct.ac.za/ITGOM, which incorporates many different GO similarity measures for exploring, analyzing and comparing GO terms and proteins within the context of GO. It uses GO data and UniProt proteins with their GO annotations as provided by the Gene Ontology Annotation (GOA) project to precompute GO term information content (IC), enabling rapid response to user queries. The DaGO-Fun online tool presents the advantage of integrating all the relevant IC-based GO similarity measures, including topology- and annotation-based approaches to facilitate effective exploration of these measures, thus enabling users to choose the most relevant approach for their application. Furthermore, this tool includes several biological applications related to GO semantic similarity scores, including the retrieval of genes based on their GO annotations, the clustering of functionally related genes within a set, and term enrichment analysis.
SZDB: A Database for Schizophrenia Genetic Research
Wu, Yong; Yao, Yong-Gang
2017-01-01
Abstract Schizophrenia (SZ) is a debilitating brain disorder with a complex genetic architecture. Genetic studies, especially recent genome-wide association studies (GWAS), have identified multiple variants (loci) conferring risk to SZ. However, how to efficiently extract meaningful biological information from bulk genetic findings of SZ remains a major challenge. There is a pressing need to integrate multiple layers of data from various sources, eg, genetic findings from GWAS, copy number variations (CNVs), association and linkage studies, gene expression, protein–protein interaction (PPI), co-expression, expression quantitative trait loci (eQTL), and Encyclopedia of DNA Elements (ENCODE) data, to provide a comprehensive resource to facilitate the translation of genetic findings into SZ molecular diagnosis and mechanism study. Here we developed the SZDB database (http://www.szdb.org/), a comprehensive resource for SZ research. SZ genetic data, gene expression data, network-based data, brain eQTL data, and SNP function annotation information were systematically extracted, curated and deposited in SZDB. In-depth analyses and systematic integration were performed to identify top prioritized SZ genes and enriched pathways. Multiple types of data from various layers of SZ research were systematically integrated and deposited in SZDB. In-depth data analyses and integration identified top prioritized SZ genes and enriched pathways. We further showed that genes implicated in SZ are highly co-expressed in human brain and proteins encoded by the prioritized SZ risk genes are significantly interacted. The user-friendly SZDB provides high-confidence candidate variants and genes for further functional characterization. More important, SZDB provides convenient online tools for data search and browse, data integration, and customized data analyses. PMID:27451428
Keller, J.; Rousseau-Gueutin, M.; Martin, G.E.; Morice, J.; Boutte, J.; Coissac, E.; Ourari, M.; Aïnouche, M.; Salmon, A.; Cabello-Hurtado, F.
2017-01-01
Abstract The Fabaceae family is considered as a model system for understanding chloroplast genome evolution due to the presence of extensive structural rearrangements, gene losses and localized hypermutable regions. Here, we provide sequences of four chloroplast genomes from the Lupinus genus, belonging to the underinvestigated Genistoid clade. Notably, we found in Lupinus species the functional loss of the essential rps16 gene, which was most likely replaced by the nuclear rps16 gene that encodes chloroplast and mitochondrion targeted RPS16 proteins. To study the evolutionary fate of the rps16 gene, we explored all available plant chloroplast, mitochondrial and nuclear genomes. Whereas no plant mitochondrial genomes carry an rps16 gene, many plants still have a functional nuclear and chloroplast rps16 gene. Ka/Ks ratios revealed that both chloroplast and nuclear rps16 copies were under purifying selection. However, due to the dual targeting of the nuclear rps16 gene product and the absence of a mitochondrial copy, the chloroplast gene may be lost. We also performed comparative analyses of lupine plastomes (SNPs, indels and repeat elements), identified the most variable regions and examined their phylogenetic utility. The markers identified here will help to reveal the evolutionary history of lupines, Genistoids and closely related clades. PMID:28338826
Wen, Feng; Zhu, Hong; Li, Peng; Jiang, Min; Mao, Wenqing; Ong, Chermaine; Chu, Zhaoqing
2014-06-01
Members of plant WRKY gene family are ancient transcription factors that function in plant growth and development and respond to biotic and abiotic stresses. In our present study, we have investigated WRKY family genes in Brachypodium distachyon, a new model plant of family Poaceae. We identified a total of 86 WRKY genes from B. distachyon and explored their chromosomal distribution and evolution, domain alignment, promoter cis-elements, and expression profiles. Combining the analysis of phylogenetic tree of BdWRKY genes and the result of expression profiling, results showed that most of clustered gene pairs had higher similarities in the WRKY domain, suggesting that they might be functionally redundant. Neighbour-joining analysis of 301 WRKY domains from Oryza sativa, Arabidopsis thaliana, and B. distachyon suggested that BdWRKY domains are evolutionarily more closely related to O. sativa WRKY domains than those of A. thaliana. Moreover, tissue-specific expression profile of BdWRKY genes and their responses to phytohormones and several biotic or abiotic stresses were analysed by quantitative real-time PCR. The results showed that the expression of BdWRKY genes was rapidly regulated by stresses and phytohormones, and there was a strong correlation between promoter cis-elements and the phytohormones-induced BdWRKY gene expression. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Kwon, Jun Tae; Ham, Sera; Jeon, Suyeon; Kim, Youil; Oh, Seungmin; Cho, Chunghee
2017-01-01
The identification and characterization of germ cell-specific genes are essential if we hope to comprehensively understand the mechanisms of spermatogenesis and fertilization. Here, we searched the mouse UniGene databases and identified 13 novel genes as being putatively testis-specific or -predominant. Our in silico and in vitro analyses revealed that the expressions of these genes are testis- and germ cell-specific, and that they are regulated in a stage-specific manner during spermatogenesis. We generated antibodies against the proteins encoded by seven of the genes to facilitate their characterization in male germ cells. Immunoblotting and immunofluorescence analyses revealed that one of these proteins was expressed only in testicular germ cells, three were expressed in both testicular germ cells and testicular sperm, and the remaining three were expressed in sperm of the testicular stages and in mature sperm from the epididymis. Further analysis of the latter three proteins showed that they were all associated with cytoskeletal structures in the sperm flagellum. Among them, MORN5, which is predicted to contain three MORN motifs, is conserved between mouse and human sperm. In conclusion, we herein identify 13 authentic genes with male germ cell-specific expression, and provide comprehensive information about these genes and their encoded products. Our finding will facilitate future investigations into the functional roles of these novel genes in spermatogenesis and sperm functions.
D'Addabbo, Annarita; Palmieri, Orazio; Maglietta, Rosalia; Latiano, Anna; Mukherjee, Sayan; Annese, Vito; Ancona, Nicola
2011-08-01
A meta-analysis has re-analysed previous genome-wide association scanning definitively confirming eleven genes and further identifying 21 new loci. However, the identified genes/loci still explain only the minority of genetic predisposition of Crohn's disease. To identify genes weakly involved in disease predisposition by analysing chromosomal regions enriched of single nucleotide polymorphisms with modest statistical association. We utilized the WTCCC data set evaluating 1748 CD and 2938 controls. The identification of candidate genes/loci was performed by a two-step procedure: first of all chromosomal regions enriched of weak association signals were localized; subsequently, weak signals clustered in gene regions were identified. The statistical significance was assessed by non parametric permutation tests. The cytoband enrichment analysis highlighted 44 regions (P≤0.05) enriched with single nucleotide polymorphisms significantly associated with the trait including 23 out of 31 previously confirmed and replicated genes. Importantly, we highlight further 20 novel chromosomal regions carrying approximately one hundred genes/loci with modest association. Amongst these we find compelling functional candidate genes such as MAPT, GRB2 and CREM, LCT, and IL12RB2. Our study suggests a different statistical perspective to discover genes weakly associated with a given trait, although further confirmatory functional studies are needed. Copyright © 2011 Editrice Gastroenterologica Italiana S.r.l. All rights reserved.
Gil-Serna, Jessica; Vázquez, Covadonga; González-Jaén, María Teresa; Patiño, Belén
2015-12-02
Aspergillus steynii is probably the most relevant species of section Circumdati producing ochratoxin A (OTA). This mycotoxin contaminates a wide number of commodities and it is highly toxic for humans and animals. Little is known on the biosynthetic genes and their regulation in Aspergillus species. In this work, we identified and analysed three contiguous genes in A. steynii using 5'-RACE and genome walking approaches which predicted a cytochrome P450 monooxygenase (p450ste), a non-ribosomal peptide synthetase (nrpsste) and a polyketide synthase (pksste). These three genes were contiguous within a 20742 bp long genomic DNA fragment. Their corresponding cDNA were sequenced and their expression was analysed in three A. steynii strains using real time RT-PCR specific assays in permissive conditions in in vitro cultures. OTA was also analysed in these cultures. Comparative analyses of predicted genomic, cDNA and amino acid sequences were performed with sequences of similar gene functions. All the results obtained in these analyses were consistent and point out the involvement of these three genes in OTA biosynthesis by A. steynii and showed a co-ordinated expression pattern. This is the first time that a clustered organization OTA biosynthetic genes has been reported in Aspergillus genus. The results also suggested that this situation might be common in Aspergillus OTA-producing species and distinct to the one described for Penicillium species. Copyright © 2015 Elsevier B.V. All rights reserved.
snpGeneSets: An R Package for Genome-Wide Study Annotation
Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian
2016-01-01
Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/. PMID:27807048
Alterations in the developing testis transcriptome following embryonic vinclozolin exposure.
Clement, Tracy M; Savenkova, Marina I; Settles, Matthew; Anway, Matthew D; Skinner, Michael K
2010-11-01
The current study investigates the direct effects of in utero vinclozolin exposure on the developing F1 generation rat testis transcriptome. Previous studies have demonstrated that exposure to vinclozolin during embryonic gonadal sex determination induces epigenetic modifications of the germ line and transgenerational adult onset disease states. Microarray analyses were performed to compare control and vinclozolin treated testis transcriptomes at embryonic days 13, 14 and 16. A total of 576 differentially expressed genes were identified and the major cellular functions and pathways associated with these altered transcripts were examined. The sets of regulated genes at the different development periods were found to be transiently altered and distinct. Categorization by major known functions of altered genes was performed. Specific cellular process and pathway analyses suggest the involvement of Wnt and calcium signaling, vascular development and epigenetic mechanisms as potential mediators of the direct F1 generation actions of vinclozolin. Copyright © 2010 Elsevier Inc. All rights reserved.
ALTERATIONS IN THE DEVELOPING TESTIS TRANSCRIPTOME FOLLOWING EMBRYONIC VINCLOZOLIN EXPOSURE
Clement, Tracy M.; Savenkova, Marina I.; Settles, Matthew; Anway, Matthew D.; Skinner, Michael K.
2010-01-01
The current study investigates the direct effects of in utero vinclozolin exposure on the developing F1 generation rat testis transcriptome. Previous studies have demonstrated that exposure to vinclozolin during embryonic gonadal sex determination induces epigenetic modifications of the germ line and transgenerational adult onset disease states. Microarray analyses were performed to compare control and vinclozolin treated testis transcriptomes at embryonic day 13, 14 and 16. A total of 576 differentially expressed genes were identified and the major cellular functions and pathways associated with these altered transcripts were examined. The sets of regulated genes at the different development periods were found to be transiently altered and distinct. Categorization by major known functions of altered genes was performed. Specific cellular process and pathway analyses suggest the involvement of Wnt and calcium signaling, vascular development and epigenetic mechanisms as potential mediators of the direct F1 generation actions of vinclozolin. PMID:20566332
Wei, Kai-Fa; Chen, Juan; Chen, Yan-Feng; Wu, Ling-Juan; Xie, Dao-Xin
2012-01-01
The WRKY transcription factors function in plant growth and development, and response to the biotic and abiotic stresses. Although many studies have focused on the functional identification of the WRKY transcription factors, much less is known about molecular phylogenetic and global expression analysis of the complete WRKY family in maize. In this study, we identified 136 WRKY proteins coded by 119 genes in the B73 inbred line from the complete genome and named them in an orderly manner. Then, a comprehensive phylogenetic analysis of five species was performed to explore the origin and evolutionary patterns of these WRKY genes, and the result showed that gene duplication is the major driving force for the origin of new groups and subgroups and functional divergence during evolution. Chromosomal location analysis of maize WRKY genes indicated that 20 gene clusters are distributed unevenly in the genome. Microarray-based expression analysis has revealed that 131 WRKY transcripts encoded by 116 genes may participate in the regulation of maize growth and development. Among them, 102 transcripts are stably expressed with a coefficient of variation (CV) value of <15%. The remaining 29 transcripts produced by 25 WRKY genes with the CV value of >15% are further analysed to discover new organ- or tissue-specific genes. In addition, microarray analyses of transcriptional responses to drought stress and fungal infection showed that maize WRKY proteins are involved in stress responses. All these results contribute to a deep probing into the roles of WRKY transcription factors in maize growth and development and stress tolerance. PMID:22279089
Wei, Kai-Fa; Chen, Juan; Chen, Yan-Feng; Wu, Ling-Juan; Xie, Dao-Xin
2012-04-01
The WRKY transcription factors function in plant growth and development, and response to the biotic and abiotic stresses. Although many studies have focused on the functional identification of the WRKY transcription factors, much less is known about molecular phylogenetic and global expression analysis of the complete WRKY family in maize. In this study, we identified 136 WRKY proteins coded by 119 genes in the B73 inbred line from the complete genome and named them in an orderly manner. Then, a comprehensive phylogenetic analysis of five species was performed to explore the origin and evolutionary patterns of these WRKY genes, and the result showed that gene duplication is the major driving force for the origin of new groups and subgroups and functional divergence during evolution. Chromosomal location analysis of maize WRKY genes indicated that 20 gene clusters are distributed unevenly in the genome. Microarray-based expression analysis has revealed that 131 WRKY transcripts encoded by 116 genes may participate in the regulation of maize growth and development. Among them, 102 transcripts are stably expressed with a coefficient of variation (CV) value of <15%. The remaining 29 transcripts produced by 25 WRKY genes with the CV value of >15% are further analysed to discover new organ- or tissue-specific genes. In addition, microarray analyses of transcriptional responses to drought stress and fungal infection showed that maize WRKY proteins are involved in stress responses. All these results contribute to a deep probing into the roles of WRKY transcription factors in maize growth and development and stress tolerance.
Microarray analysis of gene expression profiles in ripening pineapple fruits.
Koia, Jonni H; Moyle, Richard L; Botella, Jose R
2012-12-18
Pineapple (Ananas comosus) is a tropical fruit crop of significant commercial importance. Although the physiological changes that occur during pineapple fruit development have been well characterized, little is known about the molecular events that occur during the fruit ripening process. Understanding the molecular basis of pineapple fruit ripening will aid the development of new varieties via molecular breeding or genetic modification. In this study we developed a 9277 element pineapple microarray and used it to profile gene expression changes that occur during pineapple fruit ripening. Microarray analyses identified 271 unique cDNAs differentially expressed at least 1.5-fold between the mature green and mature yellow stages of pineapple fruit ripening. Among these 271 sequences, 184 share significant homology with genes encoding proteins of known function, 53 share homology with genes encoding proteins of unknown function and 34 share no significant homology with any database accession. Of the 237 pineapple sequences with homologs, 160 were up-regulated and 77 were down-regulated during pineapple fruit ripening. DAVID Functional Annotation Cluster (FAC) analysis of all 237 sequences with homologs revealed confident enrichment scores for redox activity, organic acid metabolism, metalloenzyme activity, glycolysis, vitamin C biosynthesis, antioxidant activity and cysteine peptidase activity, indicating the functional significance and importance of these processes and pathways during pineapple fruit development. Quantitative real-time PCR analysis validated the microarray expression results for nine out of ten genes tested. This is the first report of a microarray based gene expression study undertaken in pineapple. Our bioinformatic analyses of the transcript profiles have identified a number of genes, processes and pathways with putative involvement in the pineapple fruit ripening process. This study extends our knowledge of the molecular basis of pineapple fruit ripening and non-climacteric fruit ripening in general.
Microarray analysis of gene expression profiles in ripening pineapple fruits
2012-01-01
Background Pineapple (Ananas comosus) is a tropical fruit crop of significant commercial importance. Although the physiological changes that occur during pineapple fruit development have been well characterized, little is known about the molecular events that occur during the fruit ripening process. Understanding the molecular basis of pineapple fruit ripening will aid the development of new varieties via molecular breeding or genetic modification. In this study we developed a 9277 element pineapple microarray and used it to profile gene expression changes that occur during pineapple fruit ripening. Results Microarray analyses identified 271 unique cDNAs differentially expressed at least 1.5-fold between the mature green and mature yellow stages of pineapple fruit ripening. Among these 271 sequences, 184 share significant homology with genes encoding proteins of known function, 53 share homology with genes encoding proteins of unknown function and 34 share no significant homology with any database accession. Of the 237 pineapple sequences with homologs, 160 were up-regulated and 77 were down-regulated during pineapple fruit ripening. DAVID Functional Annotation Cluster (FAC) analysis of all 237 sequences with homologs revealed confident enrichment scores for redox activity, organic acid metabolism, metalloenzyme activity, glycolysis, vitamin C biosynthesis, antioxidant activity and cysteine peptidase activity, indicating the functional significance and importance of these processes and pathways during pineapple fruit development. Quantitative real-time PCR analysis validated the microarray expression results for nine out of ten genes tested. Conclusions This is the first report of a microarray based gene expression study undertaken in pineapple. Our bioinformatic analyses of the transcript profiles have identified a number of genes, processes and pathways with putative involvement in the pineapple fruit ripening process. This study extends our knowledge of the molecular basis of pineapple fruit ripening and non-climacteric fruit ripening in general. PMID:23245313
2012-01-01
Background In spite of its high clinical relevance, the relationship between disc degeneration and low back pain is still not well understood. Recent studies have shown that genome-wide gene expression studies utilizing ontology searches provide an efficient and valuable methodology for identification of clinically relevant genes. Here we use this approach in analysis of pain-, nerve-, and neurotrophin-related gene expression patterns in specimens of human disc tissue. Control, non-herniated clinical, and herniated clinical specimens of human annulus tissue were studied following Institutional Review Board approval. Results Analyses were performed on more generated (Thompson grade IV and V) discs vs. less degenerated discs (grades I-III), on surgically operated discs vs. control discs, and on herniated vs. control discs. Analyses of more degenerated vs. less degenerated discs identified significant upregulation of well-recognized pain-related genes (bradykinin receptor B1, calcitonin gene-related peptide and catechol-0-methyltransferase). Nerve growth factor was significantly upregulated in surgical vs. control and in herniated vs. control discs. All three analyses also found significant changes in numerous proinflammatory cytokine- and chemokine-related genes. Nerve, neurotrophin and pain-ontology searches identified many matrix, signaling and functional genes which have known importance in the disc. Immunohistochemistry was utilized to confirm the presence of calcitonin gene-related peptide, catechol-0-methyltransferase and bradykinin receptor B1 at the protein level in the human annulus. Conclusions Findings point to the utility of microarray analyses in identification of pain-, neurotrophin and nerve-related genes in the disc, and point to the importance of future work exploring functional interactions between nerve and disc cells in vitro and in vivo. Nerve, pain and neurotrophin ontology searches identified numerous changes in proinflammatory cytokines and chemokines which also have significant relevance to disc biology. Since the degenerating human disc is primarily an avascular tissue site into which disc cells have contributed high levels of proinflammatory cytokines, these substances are not cleared from the tissue and remain there over time. We hypothesize that as nerves grow into the human annulus, they encounter a proinflammatory cytokine-rich milieu which may sensitize nociceptors and exacerbate pain production. PMID:22963171
NASA Astrophysics Data System (ADS)
Liu, Xiaoming; Yang, Jiasheng; Zhang, Yi; Fang, Yun; Wang, Fayou; Wang, Jun; Zheng, Xiaoqi; Yang, Jialiang
2016-03-01
We have studied drug-response associated (DRA) gene expressions by applying a systems biology framework to the Cancer Cell Line Encyclopedia data. More than 4,000 genes are inferred to be DRA for at least one drug, while the number of DRA genes for each drug varies dramatically from almost 0 to 1,226. Functional enrichment analysis shows that the DRA genes are significantly enriched in genes associated with cell cycle and plasma membrane. Moreover, there might be two patterns of DRA genes between genders. There are significantly shared DRA genes between male and female for most drugs, while very little DRA genes tend to be shared between the two genders for a few drugs targeting sex-specific cancers (e.g., PD-0332991 for breast cancer and ovarian cancer). Our analyses also show substantial difference for DRA genes between young and old samples, suggesting the necessity of considering the age effects for personalized medicine in cancers. Lastly, differential module and key driver analyses confirm cell cycle related modules as top differential ones for drug sensitivity. The analyses also reveal the role of TSPO, TP53, and many other immune or cell cycle related genes as important key drivers for DRA network modules. These key drivers provide new drug targets to improve the sensitivity of cancer therapy.
Luo, Xiong-Jian; Mattheisen, Manuel; Li, Ming; Huang, Liang; Rietschel, Marcella; Børglum, Anders D; Als, Thomas D; van den Oord, Edwin J; Aberg, Karolina A; Mors, Ole; Mortensen, Preben Bo; Luo, Zhenwu; Degenhardt, Franziska; Cichon, Sven; Schulze, Thomas G; Nöthen, Markus M; Su, Bing; Zhao, Zhongming; Gan, Lin; Yao, Yong-Gang
2015-11-01
Genome-wide association studies have identified multiple risk variants and loci that show robust association with schizophrenia. Nevertheless, it remains unclear how these variants confer risk to schizophrenia. In addition, the driving force that maintains the schizophrenia risk variants in human gene pool is poorly understood. To investigate whether expression-associated genetic variants contribute to schizophrenia susceptibility, we systematically integrated brain expression quantitative trait loci and genome-wide association data of schizophrenia using Sherlock, a Bayesian statistical framework. Our analyses identified ZNF323 as a schizophrenia risk gene (P = 2.22×10(-6)). Subsequent analyses confirmed the association of the ZNF323 and its expression-associated single nucleotide polymorphism rs1150711 in independent samples (gene-expression: P = 1.40×10(-6); single-marker meta-analysis in the combined discovery and replication sample comprising 44123 individuals: P = 6.85×10(-10)). We found that the ZNF323 was significantly downregulated in hippocampus and frontal cortex of schizophrenia patients (P = .0038 and P = .0233, respectively). Evidence for pleiotropic effects was detected (association of rs1150711 with lung function and gene expression of ZNF323 in lung: P = 6.62×10(-5) and P = 9.00×10(-5), respectively) with the risk allele (T allele) for schizophrenia acting as protective allele for lung function. Subsequent population genetics analyses suggest that the risk allele (T) of rs1150711 might have undergone recent positive selection in human population. Our findings suggest that the ZNF323 is a schizophrenia susceptibility gene whose expression may influence schizophrenia risk. Our study also illustrates a possible mechanism for maintaining schizophrenia risk variants in the human gene pool. © The Author 2015. Published by Oxford University Press on behalf of the Maryland Psychiatric Research Center. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Circadian Enhancers Coordinate Multiple Phases of Rhythmic Gene Transcription In Vivo
Fang, Bin; Everett, Logan J.; Jager, Jennifer; Briggs, Erika; Armour, Sean M.; Feng, Dan; Roy, Ankur; Gerhart-Hines, Zachary; Sun, Zheng; Lazar, Mitchell A.
2014-01-01
SUMMARY Mammalian transcriptomes display complex circadian rhythms with multiple phases of gene expression that cannot be accounted for by current models of the molecular clock. We have determined the underlying mechanisms by measuring nascent RNA transcription around the clock in mouse liver. Unbiased examination of eRNAs that cluster in specific circadian phases identified functional enhancers driven by distinct transcription factors (TFs). We further identify on a global scale the components of the TF cistromes that function to orchestrate circadian gene expression. Integrated genomic analyses also revealed novel mechanisms by which a single circadian factor controls opposing transcriptional phases. These findings shed new light on the diversity and specificity of TF function in the generation of multiple phases of circadian gene transcription in a mammalian organ. PMID:25416951
Circadian enhancers coordinate multiple phases of rhythmic gene transcription in vivo.
Fang, Bin; Everett, Logan J; Jager, Jennifer; Briggs, Erika; Armour, Sean M; Feng, Dan; Roy, Ankur; Gerhart-Hines, Zachary; Sun, Zheng; Lazar, Mitchell A
2014-11-20
Mammalian transcriptomes display complex circadian rhythms with multiple phases of gene expression that cannot be accounted for by current models of the molecular clock. We have determined the underlying mechanisms by measuring nascent RNA transcription around the clock in mouse liver. Unbiased examination of enhancer RNAs (eRNAs) that cluster in specific circadian phases identified functional enhancers driven by distinct transcription factors (TFs). We further identify on a global scale the components of the TF cistromes that function to orchestrate circadian gene expression. Integrated genomic analyses also revealed mechanisms by which a single circadian factor controls opposing transcriptional phases. These findings shed light on the diversity and specificity of TF function in the generation of multiple phases of circadian gene transcription in a mammalian organ.
PanFP: Pangenome-based functional profiles for microbial communities
Jun, Se -Ran; Hauser, Loren John; Schadt, Christopher Warren; ...
2015-09-26
For decades there has been increasing interest in understanding the relationships between microbial communities and ecosystem functions. Current DNA sequencing technologies allows for the exploration of microbial communities in two principle ways: targeted rRNA gene surveys and shotgun metagenomics. For large study designs, it is often still prohibitively expensive to sequence metagenomes at both the breadth and depth necessary to statistically capture the true functional diversity of a community. Although rRNA gene surveys provide no direct evidence of function, they do provide a reasonable estimation of microbial diversity, while being a very cost effective way to screen samples of interestmore » for later shotgun metagenomic analyses. However, there is a great deal of 16S rRNA gene survey data currently available from diverse environments, and thus a need for tools to infer functional composition of environmental samples based on 16S rRNA gene survey data. As a result, we present a computational method called pangenome based functional profiles (PanFP), which infers functional profiles of microbial communities from 16S rRNA gene survey data for Bacteria and Archaea. PanFP is based on pangenome reconstruction of a 16S rRNA gene operational taxonomic unit (OTU) from known genes and genomes pooled from the OTU s taxonomic lineage. From this lineage, we derive an OTU functional profile by weighting a pangenome s functional profile with the OTUs abundance observed in a given sample. We validated our method by comparing PanFP to the functional profiles obtained from the direct shotgun metagenomic measurement of 65 diverse communities via Spearman correlation coefficients. These correlations improved with increasing sequencing depth, within the range of 0.8 0.9 for the most deeply sequenced Human Microbiome Project mock community samples. PanFP is very similar in performance to another recently released tool, PICRUSt, for almost all of survey data analysed here. But, our method is unique in that any OTU building method can be used, as opposed to being limited to closed reference OTU picking strategies against specific reference sequence databases. In conclusion, we developed an automated computational method, which derives an inferred functional profile based on the 16S rRNA gene surveys of microbial communities. The inferred functional profile provides a cost effective way to study complex ecosystems through predicted comparative functional metagenomes and metadata analysis. All PanFP source code and additional documentation are freely available online at GitHub.« less
PanFP: pangenome-based functional profiles for microbial communities.
Jun, Se-Ran; Robeson, Michael S; Hauser, Loren J; Schadt, Christopher W; Gorin, Andrey A
2015-09-26
For decades there has been increasing interest in understanding the relationships between microbial communities and ecosystem functions. Current DNA sequencing technologies allows for the exploration of microbial communities in two principle ways: targeted rRNA gene surveys and shotgun metagenomics. For large study designs, it is often still prohibitively expensive to sequence metagenomes at both the breadth and depth necessary to statistically capture the true functional diversity of a community. Although rRNA gene surveys provide no direct evidence of function, they do provide a reasonable estimation of microbial diversity, while being a very cost-effective way to screen samples of interest for later shotgun metagenomic analyses. However, there is a great deal of 16S rRNA gene survey data currently available from diverse environments, and thus a need for tools to infer functional composition of environmental samples based on 16S rRNA gene survey data. We present a computational method called pangenome-based functional profiles (PanFP), which infers functional profiles of microbial communities from 16S rRNA gene survey data for Bacteria and Archaea. PanFP is based on pangenome reconstruction of a 16S rRNA gene operational taxonomic unit (OTU) from known genes and genomes pooled from the OTU's taxonomic lineage. From this lineage, we derive an OTU functional profile by weighting a pangenome's functional profile with the OTUs abundance observed in a given sample. We validated our method by comparing PanFP to the functional profiles obtained from the direct shotgun metagenomic measurement of 65 diverse communities via Spearman correlation coefficients. These correlations improved with increasing sequencing depth, within the range of 0.8-0.9 for the most deeply sequenced Human Microbiome Project mock community samples. PanFP is very similar in performance to another recently released tool, PICRUSt, for almost all of survey data analysed here. But, our method is unique in that any OTU building method can be used, as opposed to being limited to closed-reference OTU picking strategies against specific reference sequence databases. We developed an automated computational method, which derives an inferred functional profile based on the 16S rRNA gene surveys of microbial communities. The inferred functional profile provides a cost effective way to study complex ecosystems through predicted comparative functional metagenomes and metadata analysis. All PanFP source code and additional documentation are freely available online at GitHub ( https://github.com/srjun/PanFP ).
Estimation of gene induction enables a relevance-based ranking of gene sets.
Bartholomé, Kilian; Kreutz, Clemens; Timmer, Jens
2009-07-01
In order to handle and interpret the vast amounts of data produced by microarray experiments, the analysis of sets of genes with a common biological functionality has been shown to be advantageous compared to single gene analyses. Some statistical methods have been proposed to analyse the differential gene expression of gene sets in microarray experiments. However, most of these methods either require threshhold values to be chosen for the analysis, or they need some reference set for the determination of significance. We present a method that estimates the number of differentially expressed genes in a gene set without requiring a threshold value for significance of genes. The method is self-contained (i.e., it does not require a reference set for comparison). In contrast to other methods which are focused on significance, our approach emphasizes the relevance of the regulation of gene sets. The presented method measures the degree of regulation of a gene set and is a useful tool to compare the induction of different gene sets and place the results of microarray experiments into the biological context. An R-package is available.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Jing; Ma, Zihao; Carr, Steven A.
Coexpression of mRNAs under multiple conditions is commonly used to infer cofunctionality of their gene products despite well-known limitations of this “guilt-by-association” (GBA) approach. Recent advancements in mass spectrometry-based proteomic technologies have enabled global expression profiling at the protein level; however, whether proteome profiling data can outperform transcriptome profiling data for coexpression based gene function prediction has not been systematically investigated. Here, we address this question by constructing and analyzing mRNA and protein coexpression networks for three cancer types with matched mRNA and protein profiling data from The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC).more » Our analyses revealed a marked difference in wiring between the mRNA and protein coexpression networks. Whereas protein coexpression was driven primarily by functional similarity between coexpressed genes, mRNA coexpression was driven by both cofunction and chromosomal colocalization of the genes. Functionally coherent mRNA modules were more likely to have their edges preserved in corresponding protein networks than functionally incoherent mRNA modules. Proteomic data strengthened the link between gene expression and function for at least 75% of Gene Ontology (GO) biological processes and 90% of KEGG pathways. A web application Gene2Net (http://cptac.gene2net.org) developed based on the three protein coexpression networks revealed novel gene-function relationships, such as linking ERBB2 (HER2) to lipid biosynthetic process in breast cancer, identifying PLG as a new gene involved in complement activation, and identifying AEBP1 as a new epithelial-mesenchymal transition (EMT) marker. Our results demonstrate that proteome profiling outperforms transcriptome profiling for coexpression based gene function prediction. Proteomics should be integrated if not preferred in gene function and human disease studies. Molecular & Cellular Proteomics 16: 10.1074/mcp.M116.060301, 121–134, 2017.« less
More than just orphans: are taxonomically-restricted genes important in evolution?
Khalturin, Konstantin; Hemmrich, Georg; Fraune, Sebastian; Augustin, René; Bosch, Thomas C G
2009-09-01
Comparative genome analyses indicate that every taxonomic group so far studied contains 10-20% of genes that lack recognizable homologs in other species. Do such 'orphan' or 'taxonomically-restricted' genes comprise spurious, non-functional ORFs, or does their presence reflect important evolutionary processes? Recent studies in basal metazoans such as Nematostella, Acropora and Hydra have shed light on the function of these genes, and now indicate that they are involved in important species-specific adaptive processes. Here we focus on evidence from Hydra suggesting that taxonomically-restricted genes play a role in the creation of phylum-specific novelties such as cnidocytes, in the generation of morphological diversity, and in the innate defence system. We propose that taxon-specific genes drive morphological specification, enabling organisms to adapt to changing conditions.
USDA-ARS?s Scientific Manuscript database
Although well-accepted as the ultimate method for cotton functional genomics, Agrobacterium tumefaciens-mediated cotton transformation is not widely used for functional analyses of cotton genes and their promoters since regeneration of cotton in tissue culture is lengthy and labor intensive. In cer...
Chandran, Anil Kumar Nalini; Yoo, Yo-Han; Cao, Peijian; Sharma, Rita; Sharma, Manoj; Dardick, Christopher; Ronald, Pamela C; Jung, Ki-Hong
2016-12-01
Protein kinases catalyze the transfer of a phosphate moiety from a phosphate donor to the substrate molecule, thus playing critical roles in cell signaling and metabolism. Although plant genomes contain more than 1000 genes that encode kinases, knowledge is limited about the function of each of these kinases. A major obstacle that hinders progress towards kinase characterization is functional redundancy. To address this challenge, we previously developed the rice kinase database (RKD) that integrated omics-scale data within a phylogenetics context. An updated version of rice kinase database (RKD) that contains metadata derived from NCBI GEO expression datasets has been developed. RKD 2.0 facilitates in-depth transcriptomic analyses of kinase-encoding genes in diverse rice tissues and in response to biotic and abiotic stresses and hormone treatments. We identified 261 kinases specifically expressed in particular tissues, 130 that are significantly up- regulated in response to biotic stress, 296 in response to abiotic stress, and 260 in response to hormones. Based on this update and Pearson correlation coefficient (PCC) analysis, we estimated that 19 out of 26 genes characterized through loss-of-function studies confer dominant functions. These were selected because they either had paralogous members with PCC values of <0.5 or had no paralog. Compared with the previous version of RKD, RKD 2.0 enables more effective estimations of functional redundancy or dominance because it uses comprehensive expression profiles rather than individual profiles. The integrated analysis of RKD with PCC establishes a single platform for researchers to select rice kinases for functional analyses.
Evolutionary analysis of the jacalin-related lectin family genes in 11 fishes.
Cao, Jun; Lv, Yueqing
2016-09-01
Jacalin-related lectins are a type of carbohydrate-binding proteins, which are distributed across a wide variety of organisms and involved in some important biological processes. The evolution of this gene family in fishes is unknown. Here, 47 putative jacalin genes in 11 fish species were identified and divided into 4 groups through phylogenetic analysis. Conserved gene organization and motif distribution existed in each group, suggesting their functional conservation. Some fishes have eleven jacalin genes, while others have only one or zero gene in their genomes, suggesting dynamic changes in the number of jacalin genes during the evolution of fishes. Intragenic recombination played a key role in the evolution of jacalin genes. Synteny analyses of jacalin genes in some fishes implied conserved and dynamic evolution characteristics of this gene family and related genome segments. Moreover, a few functional divergence sites were identified within each group pairs. Divergent expression profiles of the zebra fish jacalin genes were further investigated in different stresses. The results provided a foundation for exploring the characterization of the jacalin genes in fishes and will offer insights for additional functional studies. Copyright © 2016 Elsevier Ltd. All rights reserved.
The behavioral genetics of nonhuman primates: Status and prospects.
Rogers, Jeffrey
2018-01-01
The complexity and diversity of primate behavior have long attracted the attention of ethologists, psychologists, behavioral ecologists, and neuroscientists. Recent studies have advanced our understanding of the nature of genetic influences on differences in behavior among individuals within species. A number of analyses have focused on the genetic analysis of behavioral reactions to specific experimental tests, providing estimates of the degree of genetic control over reactivity, and beginning to identify the genes involved. Substantial progress is also being made in identifying genetic factors that influence the structure and function of the primate brain. Most of the published studies on these topics have examined either cercopithecines or chimpanzees, though a few studies have addressed these questions in other primate species. One potentially important line of research is beginning to identify the epigenetic processes that influence primate behavior, thus revealing specific cellular and molecular mechanisms by which environmental experiences can influence gene expression or gene function relevant to behavior. This review summarizes many of these studies of non-human primate behavioral genetics. The primary focus is on analyses that address the nature of the genes and genetic processes that affect differences in behavior among individuals within non-human primate species. Analyses of between species differences and potential avenues for future research are also discussed. © 2018 American Association of Physical Anthropologists.
Li, Sheng; Wang, Chengzhong; Wang, Weikai; Liu, Weidong; Zhang, Guiqin
2018-05-01
This study aimed to explore the underlying mechanism of relapsed acute lymphoblastic leukemia (ALL).Datasets of GSE28460 and GSE18497 were downloaded from Gene Expression Omnibus (GEO). Differentially expressed genes (DEGs) between diagnostic and relapsed ALL samples were identified using Limma package in R, and a Venn diagram was drawn. Next, functional enrichment analyses of co-regulated DEGs were performed. Based on the String database, protein-protein interaction network and module analyses were also conducted. Moreover, transcription factors and miRNAs targeting co-regulated DEGs were predicted using the WebGestalt online tool.A total of 71 co-regulated DEGs were identified, including 56 co-upregulated genes and 15 co-downregulated genes. Functional enrichment analyses showed that upregulated DEGs were significantly enriched in the cell cycle, and DNA replication, and repair related pathways. POLD1, MCM2, and PLK4 were hub proteins in both protein-protein interaction network and module, and might be potential targets of E2F. Additionally, POLD1 and MCM2 were found to be regulated by miR-520H via E2F1.High expression of POLD1, MCM2, and PLK4 might play positive roles in the recurrence of ALL, and could serve as potential therapeutic targets for the treatment of relapsed ALL.
Functional Analysis of the Arabidopsis TETRASPANIN Gene Family in Plant Growth and Development.
Wang, Feng; Muto, Antonella; Van de Velde, Jan; Neyt, Pia; Himanen, Kristiina; Vandepoele, Klaas; Van Lijsebettens, Mieke
2015-11-01
TETRASPANIN (TET) genes encode conserved integral membrane proteins that are known in animals to function in cellular communication during gamete fusion, immunity reaction, and pathogen recognition. In plants, functional information is limited to one of the 17 members of the Arabidopsis (Arabidopsis thaliana) TET gene family and to expression data in reproductive stages. Here, the promoter activity of all 17 Arabidopsis TET genes was investigated by pAtTET::NUCLEAR LOCALIZATION SIGNAL-GREEN FLUORESCENT PROTEIN/β-GLUCURONIDASE reporter lines throughout the life cycle, which predicted functional divergence in the paralogous genes per clade. However, partial overlap was observed for many TET genes across the clades, correlating with few phenotypes in single mutants and, therefore, requiring double mutant combinations for functional investigation. Mutational analysis showed a role for TET13 in primary root growth and lateral root development and redundant roles for TET5 and TET6 in leaf and root growth through negative regulation of cell proliferation. Strikingly, a number of TET genes were expressed in embryonic and seedling progenitor cells and remained expressed until the differentiation state in the mature plant, suggesting a dynamic function over developmental stages. The cis-regulatory elements together with transcription factor-binding data provided molecular insight into the sites, conditions, and perturbations that affect TET gene expression and positioned the TET genes in different molecular pathways; the data represent a hypothesis-generating resource for further functional analyses. © 2015 American Society of Plant Biologists. All Rights Reserved.
Functional Analysis of the Arabidopsis TETRASPANIN Gene Family in Plant Growth and Development1[OPEN
Wang, Feng; Muto, Antonella; Van de Velde, Jan; Neyt, Pia; Himanen, Kristiina; Vandepoele, Klaas; Van Lijsebettens, Mieke
2015-01-01
TETRASPANIN (TET) genes encode conserved integral membrane proteins that are known in animals to function in cellular communication during gamete fusion, immunity reaction, and pathogen recognition. In plants, functional information is limited to one of the 17 members of the Arabidopsis (Arabidopsis thaliana) TET gene family and to expression data in reproductive stages. Here, the promoter activity of all 17 Arabidopsis TET genes was investigated by pAtTET::NUCLEAR LOCALIZATION SIGNAL-GREEN FLUORESCENT PROTEIN/β-GLUCURONIDASE reporter lines throughout the life cycle, which predicted functional divergence in the paralogous genes per clade. However, partial overlap was observed for many TET genes across the clades, correlating with few phenotypes in single mutants and, therefore, requiring double mutant combinations for functional investigation. Mutational analysis showed a role for TET13 in primary root growth and lateral root development and redundant roles for TET5 and TET6 in leaf and root growth through negative regulation of cell proliferation. Strikingly, a number of TET genes were expressed in embryonic and seedling progenitor cells and remained expressed until the differentiation state in the mature plant, suggesting a dynamic function over developmental stages. The cis-regulatory elements together with transcription factor-binding data provided molecular insight into the sites, conditions, and perturbations that affect TET gene expression and positioned the TET genes in different molecular pathways; the data represent a hypothesis-generating resource for further functional analyses. PMID:26417009
Gene expression links functional networks across cortex and striatum.
Anderson, Kevin M; Krienen, Fenna M; Choi, Eun Young; Reinen, Jenna M; Yeo, B T Thomas; Holmes, Avram J
2018-04-12
The human brain is comprised of a complex web of functional networks that link anatomically distinct regions. However, the biological mechanisms supporting network organization remain elusive, particularly across cortical and subcortical territories with vastly divergent cellular and molecular properties. Here, using human and primate brain transcriptional atlases, we demonstrate that spatial patterns of gene expression show strong correspondence with limbic and somato/motor cortico-striatal functional networks. Network-associated expression is consistent across independent human datasets and evolutionarily conserved in non-human primates. Genes preferentially expressed within the limbic network (encompassing nucleus accumbens, orbital/ventromedial prefrontal cortex, and temporal pole) relate to risk for psychiatric illness, chloride channel complexes, and markers of somatostatin neurons. Somato/motor associated genes are enriched for oligodendrocytes and markers of parvalbumin neurons. These analyses indicate that parallel cortico-striatal processing channels possess dissociable genetic signatures that recapitulate distributed functional networks, and nominate molecular mechanisms supporting cortico-striatal circuitry in health and disease.
Genetic Resources for Maize Cell Wall Biology1[C][W][OA
Penning, Bryan W.; Hunter, Charles T.; Tayengwa, Reuben; Eveland, Andrea L.; Dugard, Christopher K.; Olek, Anna T.; Vermerris, Wilfred; Koch, Karen E.; McCarty, Donald R.; Davis, Mark F.; Thomas, Steven R.; McCann, Maureen C.; Carpita, Nicholas C.
2009-01-01
Grass species represent a major source of food, feed, and fiber crops and potential feedstocks for biofuel production. Most of the biomass is contributed by cell walls that are distinct in composition from all other flowering plants. Identifying cell wall-related genes and their functions underpins a fundamental understanding of growth and development in these species. Toward this goal, we are building a knowledge base of the maize (Zea mays) genes involved in cell wall biology, their expression profiles, and the phenotypic consequences of mutation. Over 750 maize genes were annotated and assembled into gene families predicted to function in cell wall biogenesis. Comparative genomics of maize, rice (Oryza sativa), and Arabidopsis (Arabidopsis thaliana) sequences reveal differences in gene family structure between grass species and a reference eudicot species. Analysis of transcript profile data for cell wall genes in developing maize ovaries revealed that expression within families differed by up to 100-fold. When transcriptional analyses of developing ovaries before pollination from Arabidopsis, rice, and maize were contrasted, distinct sets of cell wall genes were expressed in grasses. These differences in gene family structure and expression between Arabidopsis and the grasses underscore the requirement for a grass-specific genetic model for functional analyses. A UniformMu population proved to be an important resource in both forward- and reverse-genetics approaches to identify hundreds of mutants in cell wall genes. A forward screen of field-grown lines by near-infrared spectroscopic screen of mature leaves yielded several dozen lines with heritable spectroscopic phenotypes. Pyrolysis-molecular beam mass spectrometry confirmed that several nir mutants had altered carbohydrate-lignin compositions. PMID:19926802
Plant Ion Channels: Gene Families, Physiology, and Functional Genomics Analyses
Ward, John M.; Mäser, Pascal; Schroeder, Julian I.
2016-01-01
Distinct potassium, anion, and calcium channels in the plasma membrane and vacuolar membrane of plant cells have been identified and characterized by patch clamping. Primarily owing to advances in Arabidopsis genetics and genomics, and yeast functional complementation, many of the corresponding genes have been identified. Recent advances in our understanding of ion channel genes that mediate signal transduction and ion transport are discussed here. Some plant ion channels, for example, ALMT and SLAC anion channel subunits, are unique. The majority of plant ion channel families exhibit homology to animal genes; such families include both hyperpolarization-and depolarization-activated Shaker-type potassium channels, CLC chloride transporters/channels, cyclic nucleotide–gated channels, and ionotropic glutamate receptor homologs. These plant ion channels offer unique opportunities to analyze the structural mechanisms and functions of ion channels. Here we review gene families of selected plant ion channel classes and discuss unique structure-function aspects and their physiological roles in plant cell signaling and transport. PMID:18842100
Plant ion channels: gene families, physiology, and functional genomics analyses.
Ward, John M; Mäser, Pascal; Schroeder, Julian I
2009-01-01
Distinct potassium, anion, and calcium channels in the plasma membrane and vacuolar membrane of plant cells have been identified and characterized by patch clamping. Primarily owing to advances in Arabidopsis genetics and genomics, and yeast functional complementation, many of the corresponding genes have been identified. Recent advances in our understanding of ion channel genes that mediate signal transduction and ion transport are discussed here. Some plant ion channels, for example, ALMT and SLAC anion channel subunits, are unique. The majority of plant ion channel families exhibit homology to animal genes; such families include both hyperpolarization- and depolarization-activated Shaker-type potassium channels, CLC chloride transporters/channels, cyclic nucleotide-gated channels, and ionotropic glutamate receptor homologs. These plant ion channels offer unique opportunities to analyze the structural mechanisms and functions of ion channels. Here we review gene families of selected plant ion channel classes and discuss unique structure-function aspects and their physiological roles in plant cell signaling and transport.
Disease Model Discovery from 3,328 Gene Knockouts by The International Mouse Phenotyping Consortium
Meehan, Terrence F.; Conte, Nathalie; West, David B.; Jacobsen, Julius O.; Mason, Jeremy; Warren, Jonathan; Chen, Chao-Kung; Tudose, Ilinca; Relac, Mike; Matthews, Peter; Karp, Natasha; Santos, Luis; Fiegel, Tanja; Ring, Natalie; Westerberg, Henrik; Greenaway, Simon; Sneddon, Duncan; Morgan, Hugh; Codner, Gemma F; Stewart, Michelle E; Brown, James; Horner, Neil; Haendel, Melissa; Washington, Nicole; Mungall, Christopher J.; Reynolds, Corey L; Gallegos, Juan; Gailus-Durner, Valerie; Sorg, Tania; Pavlovic, Guillaume; Bower, Lynette R; Moore, Mark; Morse, Iva; Gao, Xiang; Tocchini-Valentini, Glauco P; Obata, Yuichi; Cho, Soo Young; Seong, Je Kyung; Seavitt, John; Beaudet, Arthur L.; Dickinson, Mary E.; Herault, Yann; Wurst, Wolfgang; de Angelis, Martin Hrabe; Lloyd, K.C. Kent; Flenniken, Ann M; Nutter, Lauryl MJ; Newbigging, Susan; McKerlie, Colin; Justice, Monica J.; Murray, Stephen A.; Svenson, Karen L.; Braun, Robert E.; White, Jacqueline K.; Bradley, Allan; Flicek, Paul; Wells, Sara; Skarnes, William C.; Adams, David J.; Parkinson, Helen; Mallon, Ann-Marie; Brown, Steve D.M.; Smedley, Damian
2017-01-01
Although next generation sequencing has revolutionised the ability to associate variants with human diseases, diagnostic rates and development of new therapies are still limited by our lack of knowledge of function and pathobiological mechanism for most genes. To address this challenge, the International Mouse Phenotyping Consortium (IMPC) is creating a genome- and phenome-wide catalogue of gene function by characterizing new knockout mouse strains across diverse biological systems through a broad set of standardised phenotyping tests, with all mice made readily available to the biomedical community. Analysing the first 3328 genes reveals models for 360 diseases including the first for type C Bernard-Soulier, Bardet-Biedl-5 and Gordon Holmes syndromes. 90% of our phenotype annotations are novel, providing the first functional evidence for 1092 genes and candidates in unsolved diseases such as Arrhythmogenic Right Ventricular Dysplasia 3. Finally, we describe our role in variant functional validation with the 100,000 Genomes and other projects. PMID:28650483
C-RAF function at the genome-wide transcriptome level: A systematic view.
Huang, Ying; Zhang, Xin-Yu; An, Su; Yang, Yang; Liu, Ying; Hao, Qian; Guo, Xiao-Xi; Xu, Tian-Rui
2018-05-20
C-RAF was the first member of the RAF kinase family to be discovered. Since its discovery, C-RAF has been found to regulate many fundamental cell processes, such as cell proliferation, cell death, and metabolism. However, the majority of these functions are achieved through interactions with different proteins; the genes regulated by C-RAF in its active or inactive state remain unclear. In the work, we used RNA-seq analysis to study the global transcriptomes of C-RAF bearing or C-RAF knockout cells in quiescent or EGF activated states. We identified 3353 genes that are promoted or suppressed by C-RAF. Gene ontology and Kyoto Encyclopedia of Genes and Genomes analyses revealed that these genes are involved in drug addiction, cardiomyopathy, autoimmunity, and regulation of cell metabolism. Our results provide a panoramic view of C-RAF function, including known and novel functions, and have revealed potential targets for elucidating the role of C-RAF. Copyright © 2018 Elsevier B.V. All rights reserved.
Jung, Jaejoon; Philippot, Laurent; Park, Woojun
2016-03-14
The relationship between microbial biodiversity and soil function is an important issue in ecology, yet most studies have been performed in pristine ecosystems. Here, we assess the role of microbial diversity in ecological function and remediation strategies in diesel-contaminated soils. Soil microbial diversity was manipulated using a removal by dilution approach and microbial functions were determined using both metagenomic analyses and enzymatic assays. A shift from Proteobacteria- to Actinobacteria-dominant communities was observed when species diversity was reduced. Metagenomic analysis showed that a large proportion of functional gene categories were significantly altered by the reduction in biodiversity. The abundance of genes related to the nitrogen cycle was significantly reduced in the low-diversity community, impairing denitrification. In contrast, the efficiency of diesel biodegradation was increased in the low-diversity community and was further enhanced by addition of red clay as a stimulating agent. Our results suggest that the relationship between microbial diversity and ecological function involves trade-offs among ecological processes, and should not be generalized as a positive, neutral, or negative relationship.
Jung, Jaejoon; Philippot, Laurent; Park, Woojun
2016-01-01
The relationship between microbial biodiversity and soil function is an important issue in ecology, yet most studies have been performed in pristine ecosystems. Here, we assess the role of microbial diversity in ecological function and remediation strategies in diesel-contaminated soils. Soil microbial diversity was manipulated using a removal by dilution approach and microbial functions were determined using both metagenomic analyses and enzymatic assays. A shift from Proteobacteria- to Actinobacteria-dominant communities was observed when species diversity was reduced. Metagenomic analysis showed that a large proportion of functional gene categories were significantly altered by the reduction in biodiversity. The abundance of genes related to the nitrogen cycle was significantly reduced in the low-diversity community, impairing denitrification. In contrast, the efficiency of diesel biodegradation was increased in the low-diversity community and was further enhanced by addition of red clay as a stimulating agent. Our results suggest that the relationship between microbial diversity and ecological function involves trade-offs among ecological processes, and should not be generalized as a positive, neutral, or negative relationship. PMID:26972977
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jun, Se -Ran; Hauser, Loren John; Schadt, Christopher Warren
For decades there has been increasing interest in understanding the relationships between microbial communities and ecosystem functions. Current DNA sequencing technologies allows for the exploration of microbial communities in two principle ways: targeted rRNA gene surveys and shotgun metagenomics. For large study designs, it is often still prohibitively expensive to sequence metagenomes at both the breadth and depth necessary to statistically capture the true functional diversity of a community. Although rRNA gene surveys provide no direct evidence of function, they do provide a reasonable estimation of microbial diversity, while being a very cost effective way to screen samples of interestmore » for later shotgun metagenomic analyses. However, there is a great deal of 16S rRNA gene survey data currently available from diverse environments, and thus a need for tools to infer functional composition of environmental samples based on 16S rRNA gene survey data. As a result, we present a computational method called pangenome based functional profiles (PanFP), which infers functional profiles of microbial communities from 16S rRNA gene survey data for Bacteria and Archaea. PanFP is based on pangenome reconstruction of a 16S rRNA gene operational taxonomic unit (OTU) from known genes and genomes pooled from the OTU s taxonomic lineage. From this lineage, we derive an OTU functional profile by weighting a pangenome s functional profile with the OTUs abundance observed in a given sample. We validated our method by comparing PanFP to the functional profiles obtained from the direct shotgun metagenomic measurement of 65 diverse communities via Spearman correlation coefficients. These correlations improved with increasing sequencing depth, within the range of 0.8 0.9 for the most deeply sequenced Human Microbiome Project mock community samples. PanFP is very similar in performance to another recently released tool, PICRUSt, for almost all of survey data analysed here. But, our method is unique in that any OTU building method can be used, as opposed to being limited to closed reference OTU picking strategies against specific reference sequence databases. In conclusion, we developed an automated computational method, which derives an inferred functional profile based on the 16S rRNA gene surveys of microbial communities. The inferred functional profile provides a cost effective way to study complex ecosystems through predicted comparative functional metagenomes and metadata analysis. All PanFP source code and additional documentation are freely available online at GitHub.« less
Mining functionally relevant gene sets for analyzing physiologically novel clinical expression data.
Turcan, Sevin; Vetter, Douglas E; Maron, Jill L; Wei, Xintao; Slonim, Donna K
2011-01-01
Gene set analyses have become a standard approach for increasing the sensitivity of transcriptomic studies. However, analytical methods incorporating gene sets require the availability of pre-defined gene sets relevant to the underlying physiology being studied. For novel physiological problems, relevant gene sets may be unavailable or existing gene set databases may bias the results towards only the best-studied of the relevant biological processes. We describe a successful attempt to mine novel functional gene sets for translational projects where the underlying physiology is not necessarily well characterized in existing annotation databases. We choose targeted training data from public expression data repositories and define new criteria for selecting biclusters to serve as candidate gene sets. Many of the discovered gene sets show little or no enrichment for informative Gene Ontology terms or other functional annotation. However, we observe that such gene sets show coherent differential expression in new clinical test data sets, even if derived from different species, tissues, and disease states. We demonstrate the efficacy of this method on a human metabolic data set, where we discover novel, uncharacterized gene sets that are diagnostic of diabetes, and on additional data sets related to neuronal processes and human development. Our results suggest that our approach may be an efficient way to generate a collection of gene sets relevant to the analysis of data for novel clinical applications where existing functional annotation is relatively incomplete.
Pousada, Guillermo; Lago-Docampo, Mauro; Baloira, Adolfo; Valverde, Diana
2018-03-08
Pulmonary arterial hypertension associated with systemic lupus erythematosus (PAH-SLE) is a rare disease with a low incidence rate. In this study, PAH related genes and genetic modifiers were characterised molecularly in patients with PAH-SLE. Three patients diagnosed with PAH-SLE and 100 control individuals were analysed after signing an informed consent. Two out of the three analysed patients with PAH-SLE were carriers of pathogenic mutations in the genes BMPR2 and ENG. After an in silico analysis, pathogenic mutations were searched for in control individuals and different databases, with negative results, and they were thus functionally analysed. The third patients only showed polymorphisms in the genes BMPR2, ACVRL1 and ENG. Several genetic variants and genetic modifiers were identified in the three analysed patients. These modifiers, along with the pathogenic mutations, could lead to a more severe clinical course in patients with PAH. We present, for the first time, patients with PAH-SLE carrying pathogenic mutations in the main genes related to PAH and alterations in the genetic modifiers. Copyright © 2018 Elsevier España, S.L.U. All rights reserved.
Chuartzman, Silvia G; Schuldiner, Maya
2018-03-25
In the last decade several collections of Saccharomyces cerevisiae yeast strains have been created. In these collections every gene is modified in a similar manner such as by a deletion or the addition of a protein tag. Such libraries have enabled a diversity of systematic screens, giving rise to large amounts of information regarding gene functions. However, often papers describing such screens focus on a single gene or a small set of genes and all other loci affecting the phenotype of choice ('hits') are only mentioned in tables that are provided as supplementary material and are often hard to retrieve or search. To help unify and make such data accessible, we have created a Database of High Throughput Screening Hits (dHITS). The dHITS database enables information to be obtained about screens in which genes of interest were found as well as the other genes that came up in that screen - all in a readily accessible and downloadable format. The ability to query large lists of genes at the same time provides a platform to easily analyse hits obtained from transcriptional analyses or other screens. We hope that this platform will serve as a tool to facilitate investigation of protein functions to the yeast community. © 2018 The Authors Yeast Published by John Wiley & Sons Ltd.
Sun, Tao; Wang, Yan; Wang, Meng; Li, Tingting; Zhou, Yi; Wang, Xiatian; Wei, Shuya; He, Guangyuan; Yang, Guangxiao
2015-11-04
Calcineurin B-like (CBL) proteins belong to a unique group of calcium sensors in plant that decode the Ca(2+) signature by interacting with CBL-interacting protein kinases (CIPKs). Although CBL-CIPK complexes have been shown to play important roles in the responses to various stresses in plants, little is known about their functions in wheat. A total of seven TaCBL and 20 TaCIPK genes were amplified from bread wheat, Triticum aestivum cv. Chinese Spring. Reverse-transcriptase-polymerase chain reaction (RT-PCR) and in silico expression analyses showed that TaCBL and TaCIPK genes were expressed at different levels in different tissues, or maintained at nearly constant expression levels during the whole life cycle of the wheat plant. Some TaCBL and TaCIPK genes showed up- or down-regulated expressions during seed germination. Preferential interactions between TaCBLs and TaCIPKs were observed in yeast two-hybrid and bimolecular fluorescence complementation experiments. Analyses of a deletion series of TaCIPK proteins with amino acid variations at the C-terminus provided new insights into the specificity of the interactions between TaCIPKs and TaCBLs, and indicated that the TaCBL-TaCIPK signaling pathway is very complex in wheat because of its hexaploid genome. The expressions of many TaCBLs and TaCIPKs were responsive to abiotic stresses (salt, cold, and simulated drought) and abscisic acid treatment. Transgenic Arabidopsis plants overexpressing TaCIPK24 exhibited improved salt tolerance through increased Na(+) efflux and an enhanced reactive oxygen species scavenging capacity. These results contribute to our understanding of the functions of CBL-CIPK complexes and provide the basis for selecting appropriate genes for in-depth functional studies of CBL-CIPK in wheat.
Chen, Hongfei; Zuo, Xiya; Shao, Hongxia; Fan, Sheng; Ma, Juanjuan; Zhang, Dong; Zhao, Caiping; Yan, Xiangyan; Liu, Xiaojie; Han, Mingyu
2018-02-01
Carotenoid cleavage oxygenases (CCOs) are able to cleave carotenoids to produce apocarotenoids and their derivatives, which are important for plant growth and development. In this study, 21 apple CCO genes were identified and divided into six groups based on their phylogenetic relationships. We further characterized the apple CCO genes in terms of chromosomal distribution, structure and the presence of cis-elements in the promoter. We also predicted the cellular localization of the encoded proteins. An analysis of the synteny within the apple genome revealed that tandem, segmental, and whole-genome duplication events likely contributed to the expansion of the apple carotenoid oxygenase gene family. An additional integrated synteny analysis identified orthologous carotenoid oxygenase genes between apple and Arabidopsis thaliana, which served as references for the functional analysis of the apple CCO genes. The net photosynthetic rate, transpiration rate, and stomatal conductance of leaves decreased, while leaf stomatal density increased under drought and saline conditions. Tissue-specific gene expression analyses revealed diverse spatiotemporal expression patterns. Finally, hormone and abiotic stress treatments indicated that many apple CCO genes are responsive to various phytohormones as well as drought and salinity stresses. The genome-wide identification of apple CCO genes and the analyses of their expression patterns described herein may provide a solid foundation for future studies examining the regulation and functions of this gene family. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Gong, Cuihua; Sun, Shangtong; Liu, Bing; Wang, Jing; Chen, Xiaodong
2017-06-01
The study aimed to identify the potential target genes and key miRNAs as well as to explore the underlying mechanisms in the pathogenesis of oral lichen planus (OLP) by bioinformatics analysis. The microarray data of GSE38617 were downloaded from Gene Expression Omnibus (GEO) database. A total of 7 OLP and 7 normal samples were used to identify the differentially expressed genes (DEGs) and miRNAs. The DEGs were then performed functional enrichment analyses. Furthermore, DEG-miRNA network and miRNA-function network were constructed by Cytoscape software. Total 1758 DEGs (598 up- and 1160 down-regulated genes) and 40 miRNAs (17 up- and 23 down-regulated miRNAs) were selected. The up-regulated genes were related to nuclear factor-Kappa B (NF-κB) signaling pathway, while down-regulated genes were mainly enriched in the function of ribosome. Tumor necrosis factor (TNF), caspase recruitment domain family, member 11 (CARD11) and mitochondrial ribosomal protein (MRP) genes were identified in these functions. In addition, miR-302 was a hub node in DEG-miRNA network and regulated cyclin D1 (CCND1). MiR-548a-2 was the key miRNA in miRNA-function network by regulating multiple functions including ribosomal function. The NF-κB signaling pathway and ribosome function may be the pathogenic mechanisms of OLP. The genes such as TNF, CARD11, MRP genes and CCND1 may be potential therapeutic target genes in OLP. MiR-548a-2 and miR-302 may play important roles in OLP development. Copyright © 2017 Elsevier Ltd. All rights reserved.
Guo, Yong; Qiu, Li-Juan
2013-01-01
The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max). In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs) were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.
Transcriptomic analysis of Arabidopsis developing stems: a close-up on cell wall genes
Minic, Zoran; Jamet, Elisabeth; San-Clemente, Hélène; Pelletier, Sandra; Renou, Jean-Pierre; Rihouey, Christophe; Okinyo, Denis PO; Proux, Caroline; Lerouge, Patrice; Jouanin, Lise
2009-01-01
Background Different strategies (genetics, biochemistry, and proteomics) can be used to study proteins involved in cell biogenesis. The availability of the complete sequences of several plant genomes allowed the development of transcriptomic studies. Although the expression patterns of some Arabidopsis thaliana genes involved in cell wall biogenesis were identified at different physiological stages, detailed microarray analysis of plant cell wall genes has not been performed on any plant tissues. Using transcriptomic and bioinformatic tools, we studied the regulation of cell wall genes in Arabidopsis stems, i.e. genes encoding proteins involved in cell wall biogenesis and genes encoding secreted proteins. Results Transcriptomic analyses of stems were performed at three different developmental stages, i.e., young stems, intermediate stage, and mature stems. Many genes involved in the synthesis of cell wall components such as polysaccharides and monolignols were identified. A total of 345 genes encoding predicted secreted proteins with moderate or high level of transcripts were analyzed in details. The encoded proteins were distributed into 8 classes, based on the presence of predicted functional domains. Proteins acting on carbohydrates and proteins of unknown function constituted the two most abundant classes. Other proteins were proteases, oxido-reductases, proteins with interacting domains, proteins involved in signalling, and structural proteins. Particularly high levels of expression were established for genes encoding pectin methylesterases, germin-like proteins, arabinogalactan proteins, fasciclin-like arabinogalactan proteins, and structural proteins. Finally, the results of this transcriptomic analyses were compared with those obtained through a cell wall proteomic analysis from the same material. Only a small proportion of genes identified by previous proteomic analyses were identified by transcriptomics. Conversely, only a few proteins encoded by genes having moderate or high level of transcripts were identified by proteomics. Conclusion Analysis of the genes predicted to encode cell wall proteins revealed that about 345 genes had moderate or high levels of transcripts. Among them, we identified many new genes possibly involved in cell wall biogenesis. The discrepancies observed between results of this transcriptomic study and a previous proteomic study on the same material revealed post-transcriptional mechanisms of regulation of expression of genes encoding cell wall proteins. PMID:19149885
DOSim: an R package for similarity between diseases based on Disease Ontology.
Li, Jiang; Gong, Binsheng; Chen, Xi; Liu, Tao; Wu, Chao; Zhang, Fan; Li, Chunquan; Li, Xiang; Rao, Shaoqi; Li, Xia
2011-06-29
The construction of the Disease Ontology (DO) has helped promote the investigation of diseases and disease risk factors. DO enables researchers to analyse disease similarity by adopting semantic similarity measures, and has expanded our understanding of the relationships between different diseases and to classify them. Simultaneously, similarities between genes can also be analysed by their associations with similar diseases. As a result, disease heterogeneity is better understood and insights into the molecular pathogenesis of similar diseases have been gained. However, bioinformatics tools that provide easy and straight forward ways to use DO to study disease and gene similarity simultaneously are required. We have developed an R-based software package (DOSim) to compute the similarity between diseases and to measure the similarity between human genes in terms of diseases. DOSim incorporates a DO-based enrichment analysis function that can be used to explore the disease feature of an independent gene set. A multilayered enrichment analysis (GO and KEGG annotation) annotation function that helps users explore the biological meaning implied in a newly detected gene module is also part of the DOSim package. We used the disease similarity application to demonstrate the relationship between 128 different DO cancer terms. The hierarchical clustering of these 128 different cancers showed modular characteristics. In another case study, we used the gene similarity application on 361 obesity-related genes. The results revealed the complex pathogenesis of obesity. In addition, the gene module detection and gene module multilayered annotation functions in DOSim when applied on these 361 obesity-related genes helped extend our understanding of the complex pathogenesis of obesity risk phenotypes and the heterogeneity of obesity-related diseases. DOSim can be used to detect disease-driven gene modules, and to annotate the modules for functions and pathways. The DOSim package can also be used to visualise DO structure. DOSim can reflect the modular characteristic of disease related genes and promote our understanding of the complex pathogenesis of diseases. DOSim is available on the Comprehensive R Archive Network (CRAN) or http://bioinfo.hrbmu.edu.cn/dosim.
Genetic analysis of Ikaros target genes and tumor suppressor function in BCR-ABL1+ pre–B ALL
Aghajanirefah, Ali; McLaughlin, Jami; Cheng, Donghui; Geng, Huimin; Eggesbø, Linn M.; Smale, Stephen T.; Müschen, Markus
2017-01-01
Inactivation of the tumor suppressor gene encoding the transcriptional regulator Ikaros (IKZF1) is a hallmark of BCR-ABL1+ precursor B cell acute lymphoblastic leukemia (pre–B ALL). However, the mechanisms by which Ikaros functions as a tumor suppressor in pre–B ALL remain poorly understood. Here, we analyzed a mouse model of BCR-ABL1+ pre–B ALL together with a new model of inducible expression of wild-type Ikaros in IKZF1 mutant human BCR-ABL1+ pre–B ALL. We performed integrated genome-wide chromatin and expression analyses and identified Ikaros target genes in mouse and human BCR-ABL1+ pre–B ALL, revealing novel conserved gene pathways associated with Ikaros tumor suppressor function. Notably, genetic depletion of different Ikaros targets, including CTNND1 and the early hematopoietic cell surface marker CD34, resulted in reduced leukemic growth. Our results suggest that Ikaros mediates tumor suppressor function by enforcing proper developmental stage–specific expression of multiple genes through chromatin compaction at its target genes. PMID:28190001
Sokhi, Upneet K.; Bacolod, Manny D.; Dasgupta, Santanu; Emdad, Luni; Das, Swadesh K.; Dumur, Catherine I.; Miles, Michael F.; Sarkar, Devanand; Fisher, Paul B.
2013-01-01
Human Polynucleotide Phosphorylase (hPNPaseold-35 or PNPT1) is an evolutionarily conserved 3′→5′ exoribonuclease implicated in the regulation of numerous physiological processes including maintenance of mitochondrial homeostasis, mtRNA import and aging-associated inflammation. From an RNase perspective, little is known about the RNA or miRNA species it targets for degradation or whose expression it regulates; except for c-myc and miR-221. To further elucidate the functional implications of hPNPaseold-35 in cellular physiology, we knocked-down and overexpressed hPNPaseold-35 in human melanoma cells and performed gene expression analyses to identify differentially expressed transcripts. Ingenuity Pathway Analysis indicated that knockdown of hPNPaseold-35 resulted in significant gene expression changes associated with mitochondrial dysfunction and cholesterol biosynthesis; whereas overexpression of hPNPaseold-35 caused global changes in cell-cycle related functions. Additionally, comparative gene expression analyses between our hPNPaseold-35 knockdown and overexpression datasets allowed us to identify 77 potential “direct” and 61 potential “indirect” targets of hPNPaseold-35 which formed correlated networks enriched for cell-cycle and wound healing functional association, respectively. These results provide a comprehensive database of genes responsive to hPNPaseold-35 expression levels; along with the identification new potential candidate genes offering fresh insight into cellular pathways regulated by PNPT1 and which may be used in the future for possible therapeutic intervention in mitochondrial- or inflammation-associated disease phenotypes. PMID:24143183
High degree of genetic differentiation in marine three-spined sticklebacks (Gasterosteus aculeatus).
Defaveri, Jacquelin; Shikano, Takahito; Shimada, Yukinori; Merilä, Juha
2013-09-01
Populations of widespread marine organisms are typically characterized by a low degree of genetic differentiation in neutral genetic markers, but much less is known about differentiation in genes whose functional roles are associated with specific selection regimes. To uncover possible adaptive population divergence and heterogeneous genomic differentiation in marine three-spined sticklebacks (Gasterosteus aculeatus), we used a candidate gene-based genome-scan approach to analyse variability in 138 microsatellite loci located within/close to (<6 kb) functionally important genes in samples collected from ten geographic locations. The degree of genetic differentiation in markers classified as neutral or under balancing selection-as determined with several outlier detection methods-was low (F(ST) = 0.033 or 0.011, respectively), whereas average FST for directionally selected markers was significantly higher (F(ST) = 0.097). Clustering analyses provided support for genomic and geographic heterogeneity in selection: six genetic clusters were identified based on allele frequency differences in the directionally selected loci, whereas four were identified with the neutral loci. Allelic variation in several loci exhibited significant associations with environmental variables, supporting the conjecture that temperature and salinity, but not optic conditions, are important drivers of adaptive divergence among populations. In general, these results suggest that in spite of the high degree of physical connectivity and gene flow as inferred from neutral marker genes, marine stickleback populations are strongly genetically structured in loci associated with functionally relevant genes. © 2013 John Wiley & Sons Ltd.
Kirsten, Holger; Al-Hasani, Hoor; Holdt, Lesca; Gross, Arnd; Beutner, Frank; Krohn, Knut; Horn, Katrin; Ahnert, Peter; Burkhardt, Ralph; Reiche, Kristin; Hackermüller, Jörg; Löffler, Markus; Teupser, Daniel; Thiery, Joachim; Scholz, Markus
2015-08-15
Genetics of gene expression (eQTLs or expression QTLs) has proved an indispensable tool for understanding biological pathways and pathomechanisms of trait-associated SNPs. However, power of most genome-wide eQTL studies is still limited. We performed a large eQTL study in peripheral blood mononuclear cells of 2112 individuals increasing the power to detect trans-effects genome-wide. Going beyond univariate SNP-transcript associations, we analyse relations of eQTLs to biological pathways, polygenetic effects of expression regulation, trans-clusters and enrichment of co-localized functional elements. We found eQTLs for about 85% of analysed genes, and 18% of genes were trans-regulated. Local eSNPs were enriched up to a distance of 5 Mb to the transcript challenging typically implemented ranges of cis-regulations. Pathway enrichment within regulated genes of GWAS-related eSNPs supported functional relevance of identified eQTLs. We demonstrate that nearest genes of GWAS-SNPs might frequently be misleading functional candidates. We identified novel trans-clusters of potential functional relevance for GWAS-SNPs of several phenotypes including obesity-related traits, HDL-cholesterol levels and haematological phenotypes. We used chromatin immunoprecipitation data for demonstrating biological effects. Yet, we show for strongly heritable transcripts that still little trans-chromosomal heritability is explained by all identified trans-eSNPs; however, our data suggest that most cis-heritability of these transcripts seems explained. Dissection of co-localized functional elements indicated a prominent role of SNPs in loci of pseudogenes and non-coding RNAs for the regulation of coding genes. In summary, our study substantially increases the catalogue of human eQTLs and improves our understanding of the complex genetic regulation of gene expression, pathways and disease-related processes. © The Author 2015. Published by Oxford University Press.
Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Khan, Yusuf; Parida, Swarup Kumar; Prasad, Manoj
2014-01-01
WD40 proteins play a crucial role in diverse protein-protein interactions by acting as scaffolding molecules and thus assisting in the proper activity of proteins. Hence, systematic characterization and expression profiling of these WD40 genes in foxtail millet would enable us to understand the networks of WD40 proteins and their biological processes and gene functions. In the present study, a genome-wide survey was conducted and 225 potential WD40 genes were identified. Phylogenetic analysis categorized the WD40 proteins into 5 distinct sub-families (I-V). Gene Ontology annotation revealed the biological roles of the WD40 proteins along with its cellular components and molecular functions. In silico comparative mapping with sorghum, maize and rice demonstrated the orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of WD40 genes. Estimation of synonymous and non-synonymous substitution rates revealed its evolutionary significance in terms of gene-duplication and divergence. Expression profiling against abiotic stresses provided novel insights into specific and/or overlapping expression patterns of SiWD40 genes. Homology modeling enabled three-dimensional structure prediction was performed to understand the molecular functions of WD40 proteins. Although, recent findings had shown the importance of WD40 domains in acting as hubs for cellular networks during many biological processes, it has invited a lesser research attention unlike other common domains. Being a most promiscuous interactors, WD40 domains are versatile in mediating critical cellular functions and hence this genome-wide study especially in the model crop foxtail millet would serve as a blue-print for functional characterization of WD40s in millets and bioenergy grass species. In addition, the present analyses would also assist the research community in choosing the candidate WD40s for comprehensive studies towards crop improvement of millets and biofuel grasses.
Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Khan, Yusuf; Parida, Swarup Kumar; Prasad, Manoj
2014-01-01
WD40 proteins play a crucial role in diverse protein-protein interactions by acting as scaffolding molecules and thus assisting in the proper activity of proteins. Hence, systematic characterization and expression profiling of these WD40 genes in foxtail millet would enable us to understand the networks of WD40 proteins and their biological processes and gene functions. In the present study, a genome-wide survey was conducted and 225 potential WD40 genes were identified. Phylogenetic analysis categorized the WD40 proteins into 5 distinct sub-families (I–V). Gene Ontology annotation revealed the biological roles of the WD40 proteins along with its cellular components and molecular functions. In silico comparative mapping with sorghum, maize and rice demonstrated the orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of WD40 genes. Estimation of synonymous and non-synonymous substitution rates revealed its evolutionary significance in terms of gene-duplication and divergence. Expression profiling against abiotic stresses provided novel insights into specific and/or overlapping expression patterns of SiWD40 genes. Homology modeling enabled three-dimensional structure prediction was performed to understand the molecular functions of WD40 proteins. Although, recent findings had shown the importance of WD40 domains in acting as hubs for cellular networks during many biological processes, it has invited a lesser research attention unlike other common domains. Being a most promiscuous interactors, WD40 domains are versatile in mediating critical cellular functions and hence this genome-wide study especially in the model crop foxtail millet would serve as a blue-print for functional characterization of WD40s in millets and bioenergy grass species. In addition, the present analyses would also assist the research community in choosing the candidate WD40s for comprehensive studies towards crop improvement of millets and biofuel grasses. PMID:24466268
Hata, Junya; Satoh, Yuichi; Akaihata, Hidenori; Hiraki, Hiroyuki; Ogawa, Soichiro; Haga, Nobuhiro; Ishibashi, Kei; Aikawa, Ken; Kojima, Yoshiyuki
2016-07-01
To characterize the molecular features of benign prostatic hyperplasia by carrying out a gene expression profiling analysis in a rat model. Fetal urogenital sinus isolated from 20-day-old male rat embryo was implanted into a pubertal male rat ventral prostate. The implanted urogenital sinus grew time-dependently, and the pathological findings at 3 weeks after implantation showed epithelial hyperplasia as well as stromal hyperplasia. Whole-genome oligonucleotide microarray analysis utilizing approximately 30 000 oligonucleotide probes was carried out using prostate specimens during the prostate growth process (3 weeks after implantation). Microarray analyses showed 926 upregulated (>2-fold change, P < 0.01) and 3217 downregulated genes (<0.5-fold change, P < 0.01) in benign prostatic hyperplasia specimens compared with normal prostate. Gene ontology analyses of upregulated genes showed predominant genetic themes of involvement in development (162 genes, P = 2.01 × 10(-4) ), response to stimulus (163 genes, P = 7.37 × 10(-13) ) and growth (32 genes, P = 1.93 × 10(-5) ). When we used both normal prostate and non-transplanted urogenital sinuses as controls to identify benign prostatic hyperplasia-specific genes, 507 and 406 genes were upregulated and downregulated, respectively. Functional network and pathway analyses showed that genes associated with apoptosis modulation by heat shock protein 70, interleukin-1, interleukin-2 and interleukin-5 signaling pathways, KIT signaling pathway, and secretin-like G-protein-coupled receptors, class B, were relatively activated during the growth process in the benign prostatic hyperplasia specimens. In contrast, genes associated with cholesterol biosynthesis were relatively inactivated. Our microarray analyses of the benign prostatic hyperplasia model rat might aid in clarifying the molecular mechanism of benign prostatic hyperplasia progression, and identifying molecular targets for benign prostatic hyperplasia treatment. © 2016 The Japanese Urological Association.
Warming Alters Expressions of Microbial Functional Genes Important to Ecosystem Functioning
Xue, Kai; Xie, Jianping; Zhou, Aifen; ...
2016-05-06
Soil microbial communities play critical roles in ecosystem functioning and are likely altered by climate warming. However, so far, little is known about effects of warming on microbial functional gene expressions. Here, we applied functional gene array (GeoChip 3.0) to analyze cDNA reversely transcribed from total RNA to assess expressed functional genes in active soil microbial communities after nine years of experimental warming in a tallgrass prairie. Our results showed that warming significantly altered the community wide gene expressions. Specifically, expressed genes for degrading more recalcitrant carbon were stimulated by warming, likely linked to the plant community shift toward moremore » C 4 species under warming and to decrease the long-term soil carbon stability. In addition, warming changed expressed genes in labile C degradation and N cycling in different directions (increase and decrease), possibly reflecting the dynamics of labile C and available N pools during sampling. However, the average abundances of expressed genes in phosphorus and sulfur cycling were all increased by warming, implying a stable trend of accelerated P and S processes which might be a mechanism to sustain higher plant growth. Furthermore, the expressed gene composition was closely related to both dynamic (e.g., soil moisture) and stable environmental attributes (e.g., C 4 leaf C or N content), indicating that RNA analyses could also capture certain stable trends in the long-term treatment. Overall, this study revealed the importance of elucidating functional gene expressions of soil microbial community in enhancing our understanding of ecosystem responses to warming.« less
Warming Alters Expressions of Microbial Functional Genes Important to Ecosystem Functioning
Xue, Kai; Xie, Jianping; Zhou, Aifen; Liu, Feifei; Li, Dejun; Wu, Liyou; Deng, Ye; He, Zhili; Van Nostrand, Joy D.; Luo, Yiqi; Zhou, Jizhong
2016-01-01
Soil microbial communities play critical roles in ecosystem functioning and are likely altered by climate warming. However, so far, little is known about effects of warming on microbial functional gene expressions. Here, we applied functional gene array (GeoChip 3.0) to analyze cDNA reversely transcribed from total RNA to assess expressed functional genes in active soil microbial communities after nine years of experimental warming in a tallgrass prairie. Our results showed that warming significantly altered the community wide gene expressions. Specifically, expressed genes for degrading more recalcitrant carbon were stimulated by warming, likely linked to the plant community shift toward more C4 species under warming and to decrease the long-term soil carbon stability. In addition, warming changed expressed genes in labile C degradation and N cycling in different directions (increase and decrease), possibly reflecting the dynamics of labile C and available N pools during sampling. However, the average abundances of expressed genes in phosphorus and sulfur cycling were all increased by warming, implying a stable trend of accelerated P and S processes which might be a mechanism to sustain higher plant growth. Furthermore, the expressed gene composition was closely related to both dynamic (e.g., soil moisture) and stable environmental attributes (e.g., C4 leaf C or N content), indicating that RNA analyses could also capture certain stable trends in the long-term treatment. Overall, this study revealed the importance of elucidating functional gene expressions of soil microbial community in enhancing our understanding of ecosystem responses to warming. PMID:27199978
Warming Alters Expressions of Microbial Functional Genes Important to Ecosystem Functioning
DOE Office of Scientific and Technical Information (OSTI.GOV)
Xue, Kai; Xie, Jianping; Zhou, Aifen
Soil microbial communities play critical roles in ecosystem functioning and are likely altered by climate warming. However, so far, little is known about effects of warming on microbial functional gene expressions. Here, we applied functional gene array (GeoChip 3.0) to analyze cDNA reversely transcribed from total RNA to assess expressed functional genes in active soil microbial communities after nine years of experimental warming in a tallgrass prairie. Our results showed that warming significantly altered the community wide gene expressions. Specifically, expressed genes for degrading more recalcitrant carbon were stimulated by warming, likely linked to the plant community shift toward moremore » C 4 species under warming and to decrease the long-term soil carbon stability. In addition, warming changed expressed genes in labile C degradation and N cycling in different directions (increase and decrease), possibly reflecting the dynamics of labile C and available N pools during sampling. However, the average abundances of expressed genes in phosphorus and sulfur cycling were all increased by warming, implying a stable trend of accelerated P and S processes which might be a mechanism to sustain higher plant growth. Furthermore, the expressed gene composition was closely related to both dynamic (e.g., soil moisture) and stable environmental attributes (e.g., C 4 leaf C or N content), indicating that RNA analyses could also capture certain stable trends in the long-term treatment. Overall, this study revealed the importance of elucidating functional gene expressions of soil microbial community in enhancing our understanding of ecosystem responses to warming.« less
Lacruz, Rodrigo S; Smith, Charles E; Bringas, Pablo; Chen, Yi-Bu; Smith, Susan M; Snead, Malcolm L; Kurtz, Ira; Hacia, Joseph G; Hubbard, Michael J; Paine, Michael L
2012-05-01
The gene repertoire regulating vertebrate biomineralization is poorly understood. Dental enamel, the most highly mineralized tissue in mammals, differs from other calcifying systems in that the formative cells (ameloblasts) lack remodeling activity and largely degrade and resorb the initial extracellular matrix. Enamel mineralization requires that ameloblasts undergo a profound functional switch from matrix-secreting to maturational (calcium transport, protein resorption) roles as mineralization progresses. During the maturation stage, extracellular pH decreases markedly, placing high demands on ameloblasts to regulate acidic environments present around the growing hydroxyapatite crystals. To identify the genetic events driving enamel mineralization, we conducted genome-wide transcript profiling of the developing enamel organ from rat incisors and highlight over 300 genes differentially expressed during maturation. Using multiple bioinformatics analyses, we identified groups of maturation-associated genes whose functions are linked to key mineralization processes including pH regulation, calcium handling, and matrix turnover. Subsequent qPCR and Western blot analyses revealed that a number of solute carrier (SLC) gene family members were up-regulated during maturation, including the novel protein Slc24a4 involved in calcium handling as well as other proteins of similar function (Stim1). By providing the first global overview of the cellular machinery required for enamel maturation, this study provide a strong foundation for improving basic understanding of biomineralization and its practical applications in healthcare. Copyright © 2011 Wiley Periodicals, Inc.
Martínez-Castilla, León Patricio; Alvarez-Buylla, Elena R.
2003-01-01
Gene duplication is a substrate of evolution. However, the relative importance of positive selection versus relaxation of constraints in the functional divergence of gene copies is still under debate. Plant MADS-box genes encode transcriptional regulators key in various aspects of development and have undergone extensive duplications to form a large family. We recovered 104 MADS sequences from the Arabidopsis genome. Bayesian phylogenetic trees recover type II lineage as a monophyletic group and resolve a branching sequence of monophyletic groups within this lineage. The type I lineage is comprised of several divergent groups. However, contrasting gene structure and patterns of chromosomal distribution between type I and II sequences suggest that they had different evolutionary histories and support the placement of the root of the gene family between these two groups. Site-specific and site-branch analyses of positive Darwinian selection (PDS) suggest that different selection regimes could have affected the evolution of these lineages. We found evidence for PDS along the branch leading to flowering time genes that have a direct impact on plant fitness. Sites with high probabilities of having been under PDS were found in the MADS and K domains, suggesting that these played important roles in the acquisition of novel functions during MADS-box diversification. Detected sites are targets for further experimental analyses. We argue that adaptive changes in MADS-domain protein sequences have been important for their functional divergence, suggesting that changes within coding regions of transcriptional regulators have influenced phenotypic evolution of plants. PMID:14597714
Hitomi, Yuki; Tokunaga, Katsushi
2017-01-01
Human genome variation may cause differences in traits and disease risks. Disease-causal/susceptible genes and variants for both common and rare diseases can be detected by comprehensive whole-genome analyses, such as whole-genome sequencing (WGS), using next-generation sequencing (NGS) technology and genome-wide association studies (GWAS). Here, in addition to the application of an NGS as a whole-genome analysis method, we summarize approaches for the identification of functional disease-causal/susceptible variants from abundant genetic variants in the human genome and methods for evaluating their functional effects in human diseases, using an NGS and in silico and in vitro functional analyses. We also discuss the clinical applications of the functional disease causal/susceptible variants to personalized medicine.
2012-01-01
Background GDSL esterases/lipases are a newly discovered subclass of lipolytic enzymes that are very important and attractive research subjects because of their multifunctional properties, such as broad substrate specificity and regiospecificity. Compared with the current knowledge regarding these enzymes in bacteria, our understanding of the plant GDSL enzymes is very limited, although the GDSL gene family in plant species include numerous members in many fully sequenced plant genomes. Only two genes from a large rice GDSL esterase/lipase gene family were previously characterised, and the majority of the members remain unknown. In the present study, we describe the rice OsGELP (Oryza sativa GDSL esterase/lipase protein) gene family at the genomic and proteomic levels, and use this knowledge to provide insights into the multifunctionality of the rice OsGELP enzymes. Results In this study, an extensive bioinformatics analysis identified 114 genes in the rice OsGELP gene family. A complete overview of this family in rice is presented, including the chromosome locations, gene structures, phylogeny, and protein motifs. Among the OsGELPs and the plant GDSL esterase/lipase proteins of known functions, 41 motifs were found that represent the core secondary structure elements or appear specifically in different phylogenetic subclades. The specification and distribution of identified putative conserved clade-common and -specific peptide motifs, and their location on the predicted protein three dimensional structure may possibly signify their functional roles. Potentially important regions for substrate specificity are highlighted, in accordance with protein three-dimensional model and location of the phylogenetic specific conserved motifs. The differential expression of some representative genes were confirmed by quantitative real-time PCR. The phylogenetic analysis, together with protein motif architectures, and the expression profiling were analysed to predict the possible biological functions of the rice OsGELP genes. Conclusions Our current genomic analysis, for the first time, presents fundamental information on the organization of the rice OsGELP gene family. With combination of the genomic, phylogenetic, microarray expression, protein motif distribution, and protein structure analyses, we were able to create supported basis for the functional prediction of many members in the rice GDSL esterase/lipase family. The present study provides a platform for the selection of candidate genes for further detailed functional study. PMID:22793791
ExAtlas: An interactive online tool for meta-analysis of gene expression data.
Sharov, Alexei A; Schlessinger, David; Ko, Minoru S H
2015-12-01
We have developed ExAtlas, an on-line software tool for meta-analysis and visualization of gene expression data. In contrast to existing software tools, ExAtlas compares multi-component data sets and generates results for all combinations (e.g. all gene expression profiles versus all Gene Ontology annotations). ExAtlas handles both users' own data and data extracted semi-automatically from the public repository (GEO/NCBI database). ExAtlas provides a variety of tools for meta-analyses: (1) standard meta-analysis (fixed effects, random effects, z-score, and Fisher's methods); (2) analyses of global correlations between gene expression data sets; (3) gene set enrichment; (4) gene set overlap; (5) gene association by expression profile; (6) gene specificity; and (7) statistical analysis (ANOVA, pairwise comparison, and PCA). ExAtlas produces graphical outputs, including heatmaps, scatter-plots, bar-charts, and three-dimensional images. Some of the most widely used public data sets (e.g. GNF/BioGPS, Gene Ontology, KEGG, GAD phenotypes, BrainScan, ENCODE ChIP-seq, and protein-protein interaction) are pre-loaded and can be used for functional annotations.
Zhu, Qiyun; Kosoy, Michael; Olival, Kevin J.; Dittmar, Katharina
2014-01-01
Bartonellae are mammalian pathogens vectored by blood-feeding arthropods. Although of increasing medical importance, little is known about their ecological past, and host associations are underexplored. Previous studies suggest an influence of horizontal gene transfers in ecological niche colonization by acquisition of host pathogenicity genes. We here expand these analyses to metabolic pathways of 28 Bartonella genomes, and experimentally explore the distribution of bartonellae in 21 species of blood-feeding arthropods. Across genomes, repeated gene losses and horizontal gains in the phospholipid pathway were found. The evolutionary timing of these patterns suggests functional consequences likely leading to an early intracellular lifestyle for stem bartonellae. Comparative phylogenomic analyses discover three independent lineage-specific reacquisitions of a core metabolic gene—NAD(P)H-dependent glycerol-3-phosphate dehydrogenase (gpsA)—from Gammaproteobacteria and Epsilonproteobacteria. Transferred genes are significantly closely related to invertebrate Arsenophonus-, and Serratia-like endosymbionts, and mammalian Helicobacter-like pathogens, supporting a cellular association with arthropods and mammals at the base of extant Bartonella spp. Our studies suggest that the horizontal reacquisitions had a key impact on bartonellae lineage specific ecological and functional evolution. PMID:25106622
Ferrari, Raffaele; Forabosco, Paola; Vandrovcova, Jana; Botía, Juan A; Guelfi, Sebastian; Warren, Jason D; Momeni, Parastoo; Weale, Michael E; Ryten, Mina; Hardy, John
2016-02-24
In frontotemporal dementia (FTD) there is a critical lack in the understanding of biological and molecular mechanisms involved in disease pathogenesis. The heterogeneous genetic features associated with FTD suggest that multiple disease-mechanisms are likely to contribute to the development of this neurodegenerative condition. We here present a systems biology approach with the scope of i) shedding light on the biological processes potentially implicated in the pathogenesis of FTD and ii) identifying novel potential risk factors for FTD. We performed a gene co-expression network analysis of microarray expression data from 101 individuals without neurodegenerative diseases to explore regional-specific co-expression patterns in the frontal and temporal cortices for 12 genes (MAPT, GRN, CHMP2B, CTSC, HLA-DRA, TMEM106B, C9orf72, VCP, UBQLN2, OPTN, TARDBP and FUS) associated with FTD and we then carried out gene set enrichment and pathway analyses, and investigated known protein-protein interactors (PPIs) of FTD-genes products. Gene co-expression networks revealed that several FTD-genes (such as MAPT and GRN, CTSC and HLA-DRA, TMEM106B, and C9orf72, VCP, UBQLN2 and OPTN) were clustering in modules of relevance in the frontal and temporal cortices. Functional annotation and pathway analyses of such modules indicated enrichment for: i) DNA metabolism, i.e. transcription regulation, DNA protection and chromatin remodelling (MAPT and GRN modules); ii) immune and lysosomal processes (CTSC and HLA-DRA modules), and; iii) protein meta/catabolism (C9orf72, VCP, UBQLN2 and OPTN, and TMEM106B modules). PPI analysis supported the results of the functional annotation and pathway analyses. This work further characterizes known FTD-genes and elaborates on their biological relevance to disease: not only do we indicate likely impacted regional-specific biological processes driven by FTD-genes containing modules, but also do we suggest novel potential risk factors among the FTD-genes interactors as targets for further mechanistic characterization in hypothesis driven cell biology work.
Interactions between genetic background, insulin resistance and β-cell function.
Kahn, S E; Suvag, S; Wright, L A; Utzschneider, K M
2012-10-01
An interaction between genes and the environment is a critical component underlying the pathogenesis of the hyperglycaemia of type 2 diabetes. The development of more sophisticated techniques for studying gene variants and for analysing genetic data has led to the discovery of some 40 genes associated with type 2 diabetes. Most of these genes are related to changes in β-cell function, with a few associated with decreased insulin sensitivity and obesity. Interestingly, using quantitative traits based on continuous measures rather than dichotomous ones, it has become evident that not all genes associated with changes in fasting or post-prandial glucose are also associated with a diagnosis of type 2 diabetes. Identification of these gene variants has provided novel insights into the physiology and pathophysiology of the β-cell, including the identification of molecules involved in β-cell function that were not previously recognized as playing a role in this critical cell. Published 2012. This article is a U.S. Government work and is in the public domain in the USA.
WRKY transcription factor genes in wild rice Oryza nivara
Xu, Hengjian; Watanabe, Kenneth A.; Zhang, Liyuan; Shen, Qingxi J.
2016-01-01
The WRKY transcription factor family is one of the largest gene families involved in plant development and stress response. Although many WRKY genes have been studied in cultivated rice (Oryza sativa), the WRKY genes in the wild rice species Oryza nivara, the direct progenitor of O. sativa, have not been studied. O. nivara shows abundant genetic diversity and elite drought and disease resistance features. Herein, a total of 97 O. nivara WRKY (OnWRKY) genes were identified. RNA-sequencing demonstrates that OnWRKY genes were generally expressed at higher levels in the roots of 30-day-old plants. Bioinformatic analyses suggest that most of OnWRKY genes could be induced by salicylic acid, abscisic acid, and drought. Abundant potential MAPK phosphorylation sites in OnWRKYs suggest that activities of most OnWRKYs can be regulated by phosphorylation. Phylogenetic analyses of OnWRKYs support a novel hypothesis that ancient group IIc OnWRKYs were the original ancestors of only some group IIc and group III WRKYs. The analyses also offer strong support that group IIc OnWRKYs containing the HVE sequence in their zinc finger motifs were derived from group Ia WRKYs. This study provides a solid foundation for the study of the evolution and functions of WRKY genes in O. nivara. PMID:27345721
Microarray Analysis of Differential Gene Expression Profile Between Human Fetal and Adult Heart.
Geng, Zhimin; Wang, Jue; Pan, Lulu; Li, Ming; Zhang, Jitai; Cai, Xueli; Chu, Maoping
2017-04-01
Although many changes have been discovered during heart maturation, the genetic mechanisms involved in the changes between immature and mature myocardium have only been partially elucidated. Here, gene expression profile changed between the human fetal and adult heart was characterized. A human microarray was applied to define the gene expression signatures of the fetal (13-17 weeks of gestation, n = 4) and adult hearts (30-40 years old, n = 4). Gene ontology analyses, pathway analyses, gene set enrichment analyses, and signal transduction network were performed to predict the function of the differentially expressed genes. Ten mRNAs were confirmed by quantificational real-time polymerase chain reaction. 5547 mRNAs were found to be significantly differentially expressed. "Cell cycle" was the most enriched pathway in the down-regulated genes. EFGR, IGF1R, and ITGB1 play a central role in the regulation of heart development. EGFR, IGF1R, and FGFR2 were the core genes regulating cardiac cell proliferation. The quantificational real-time polymerase chain reaction results were concordant with the microarray data. Our data identified the transcriptional regulation of heart development in the second trimester and the potential regulators that play a prominent role in the regulation of heart development and cardiac cells proliferation.
USDA-ARS?s Scientific Manuscript database
The secreted proteins encoded by “parasitism genes” expressed within the esophageal glands cells of cyst nematodes play important roles in plant parasitism. Homologous transcripts and encoded proteins of the Heterodera glycines pioneer parasitism genes Hgsyv46, Hg4e02 and Hg5d08 were identified and ...
Awazu, Akinori; Tanabe, Takahiro; Kamitani, Mari; Tezuka, Ayumi; Nagano, Atsushi J
2018-05-29
Gene expression levels exhibit stochastic variations among genetically identical organisms under the same environmental conditions. In many recent transcriptome analyses based on RNA sequencing (RNA-seq), variations in gene expression levels among replicates were assumed to follow a negative binomial distribution, although the physiological basis of this assumption remains unclear. In this study, RNA-seq data were obtained from Arabidopsis thaliana under eight conditions (21-27 replicates), and the characteristics of gene-dependent empirical probability density function (ePDF) profiles of gene expression levels were analyzed. For A. thaliana and Saccharomyces cerevisiae, various types of ePDF of gene expression levels were obtained that were classified as Gaussian, power law-like containing a long tail, or intermediate. These ePDF profiles were well fitted with a Gauss-power mixing distribution function derived from a simple model of a stochastic transcriptional network containing a feedback loop. The fitting function suggested that gene expression levels with long-tailed ePDFs would be strongly influenced by feedback regulation. Furthermore, the features of gene expression levels are correlated with their functions, with the levels of essential genes tending to follow a Gaussian-like ePDF while those of genes encoding nucleic acid-binding proteins and transcription factors exhibit long-tailed ePDF.
Discovering Functions of Unannotated Genes from a Transcriptome Survey of Wild Fungal Isolates
Ellison, Christopher E.; Kowbel, David; Glass, N. Louise; Taylor, John W.
2014-01-01
ABSTRACT Most fungal genomes are poorly annotated, and many fungal traits of industrial and biomedical relevance are not well suited to classical genetic screens. Assigning genes to phenotypes on a genomic scale thus remains an urgent need in the field. We developed an approach to infer gene function from expression profiles of wild fungal isolates, and we applied our strategy to the filamentous fungus Neurospora crassa. Using transcriptome measurements in 70 strains from two well-defined clades of this microbe, we first identified 2,247 cases in which the expression of an unannotated gene rose and fell across N. crassa strains in parallel with the expression of well-characterized genes. We then used image analysis of hyphal morphologies, quantitative growth assays, and expression profiling to test the functions of four genes predicted from our population analyses. The results revealed two factors that influenced regulation of metabolism of nonpreferred carbon and nitrogen sources, a gene that governed hyphal architecture, and a gene that mediated amino acid starvation resistance. These findings validate the power of our population-transcriptomic approach for inference of novel gene function, and we suggest that this strategy will be of broad utility for genome-scale annotation in many fungal systems. PMID:24692637
Prasopdee, Sattrachai; Sotillo, Javier; Tesana, Smarn; Laha, Thewarach; Kulsantiwong, Jutharat; Nolan, Matthew J.
2014-01-01
Background Bithynia siamensis goniomphalos is the snail intermediate host of the liver fluke, Opisthorchis viverrini, the leading cause of cholangiocarcinoma (CCA) in the Greater Mekong sub-region of Thailand. Despite the severe public health impact of Opisthorchis-induced CCA, knowledge of the molecular interactions occurring between the parasite and its snail intermediate host is scant. The examination of differences in gene expression profiling between uninfected and O. viverrini-infected B. siamensis goniomphalos could provide clues on fundamental pathways involved in the regulation of snail-parasite interplay. Methodology/Principal Findings Using high-throughput (Illumina) sequencing and extensive bioinformatic analyses, we characterized the transcriptomes of uninfected and O. viverrini-infected B. siamensis goniomphalos. Comparative analyses of gene expression profiling allowed the identification of 7,655 differentially expressed genes (DEGs), associated to 43 distinct biological pathways, including pathways associated with immune defense mechanisms against parasites. Amongst the DEGs with immune functions, transcripts encoding distinct proteases displayed the highest down-regulation in Bithynia specimens infected by O. viverrini; conversely, transcription of genes encoding heat-shock proteins and actins was significantly up-regulated in parasite-infected snails when compared to the uninfected counterparts. Conclusions/Significance The present study lays the foundation for functional studies of genes and gene products potentially involved in immune-molecular mechanisms implicated in the ability of the parasite to successfully colonize its snail intermediate host. The annotated dataset provided herein represents a ready-to-use molecular resource for the discovery of molecular pathways underlying susceptibility and resistance mechanisms of B. siamensis goniomphalos to O. viverrini and for comparative analyses with pulmonate snail intermediate hosts of other platyhelminths including schistosomes. PMID:24676090
2012-01-01
Background Huntington’s disease (HD) is a fatal progressive neurodegenerative disorder caused by the expansion of the polyglutamine repeat region in the huntingtin gene. Although the disease is triggered by the mutation of a single gene, intensive research has linked numerous other genes to its pathogenesis. To obtain a systematic overview of these genes, which may serve as therapeutic targets, CHDI Foundation has recently established the HD Research Crossroads database. With currently over 800 cataloged genes, this web-based resource constitutes the most extensive curation of genes relevant to HD. It provides us with an unprecedented opportunity to survey molecular mechanisms involved in HD in a holistic manner. Methods To gain a synoptic view of therapeutic targets for HD, we have carried out a variety of bioinformatical and statistical analyses to scrutinize the functional association of genes curated in the HD Research Crossroads database. In particular, enrichment analyses were performed with respect to Gene Ontology categories, KEGG signaling pathways, and Pfam protein families. For selected processes, we also analyzed differential expression, using published microarray data. Additionally, we generated a candidate set of novel genetic modifiers of HD by combining information from the HD Research Crossroads database with previous genome-wide linkage studies. Results Our analyses led to a comprehensive identification of molecular mechanisms associated with HD. Remarkably, we not only recovered processes and pathways, which have frequently been linked to HD (such as cytotoxicity, apoptosis, and calcium signaling), but also found strong indications for other potentially disease-relevant mechanisms that have been less intensively studied in the context of HD (such as the cell cycle and RNA splicing, as well as Wnt and ErbB signaling). For follow-up studies, we provide a regularly updated compendium of molecular mechanism, that are associated with HD, at http://hdtt.sysbiolab.eu Additionally, we derived a candidate set of 24 novel genetic modifiers, including histone deacetylase 3 (HDAC3), metabotropic glutamate receptor 1 (GRM1), CDK5 regulatory subunit 2 (CDK5R2), and coactivator 1ß of the peroxisome proliferator-activated receptor gamma (PPARGC1B). Conclusions The results of our study give us an intriguing picture of the molecular complexity of HD. Our analyses can be seen as a first step towards a comprehensive list of biological processes, molecular functions, and pathways involved in HD, and may provide a basis for the development of more holistic disease models and new therapeutics. PMID:22741533
Jin, Yulan; Sharma, Ashok; Bai, Shan; Davis, Colleen; Liu, Haitao; Hopkins, Diane; Barriga, Kathy; Rewers, Marian; She, Jin-Xiong
2014-07-01
There is tremendous scientific and clinical value to further improving the predictive power of autoantibodies because autoantibody-positive (AbP) children have heterogeneous rates of progression to clinical diabetes. This study explored the potential of gene expression profiles as biomarkers for risk stratification among 104 AbP subjects from the Diabetes Autoimmunity Study in the Young (DAISY) using a discovery data set based on microarray and a validation data set based on real-time RT-PCR. The microarray data identified 454 candidate genes with expression levels associated with various type 1 diabetes (T1D) progression rates. RT-PCR analyses of the top-27 candidate genes confirmed 5 genes (BACH2, IGLL3, EIF3A, CDC20, and TXNDC5) associated with differential progression and implicated in lymphocyte activation and function. Multivariate analyses of these five genes in the discovery and validation data sets identified and confirmed four multigene models (BI, ICE, BICE, and BITE, with each letter representing a gene) that consistently stratify high- and low-risk subsets of AbP subjects with hazard ratios >6 (P < 0.01). The results suggest that these genes may be involved in T1D pathogenesis and potentially serve as excellent gene expression biomarkers to predict the risk of progression to clinical diabetes for AbP subjects. © 2014 by the American Diabetes Association.
Keller, J; Rousseau-Gueutin, M; Martin, G E; Morice, J; Boutte, J; Coissac, E; Ourari, M; Aïnouche, M; Salmon, A; Cabello-Hurtado, F; Aïnouche, A
2017-08-01
The Fabaceae family is considered as a model system for understanding chloroplast genome evolution due to the presence of extensive structural rearrangements, gene losses and localized hypermutable regions. Here, we provide sequences of four chloroplast genomes from the Lupinus genus, belonging to the underinvestigated Genistoid clade. Notably, we found in Lupinus species the functional loss of the essential rps16 gene, which was most likely replaced by the nuclear rps16 gene that encodes chloroplast and mitochondrion targeted RPS16 proteins. To study the evolutionary fate of the rps16 gene, we explored all available plant chloroplast, mitochondrial and nuclear genomes. Whereas no plant mitochondrial genomes carry an rps16 gene, many plants still have a functional nuclear and chloroplast rps16 gene. Ka/Ks ratios revealed that both chloroplast and nuclear rps16 copies were under purifying selection. However, due to the dual targeting of the nuclear rps16 gene product and the absence of a mitochondrial copy, the chloroplast gene may be lost. We also performed comparative analyses of lupine plastomes (SNPs, indels and repeat elements), identified the most variable regions and examined their phylogenetic utility. The markers identified here will help to reveal the evolutionary history of lupines, Genistoids and closely related clades. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Shima, Jun; Ando, Akira; Takagi, Hiroshi
2008-03-01
Yeasts used in bread making are exposed to air-drying stress during dried yeast production processes. To clarify the genes required for air-drying tolerance, we performed genome-wide screening using the complete deletion strain collection of diploid Saccharomyces cerevisiae. The screening identified 278 gene deletions responsible for air-drying sensitivity. These genes were classified based on their cellular function and on the localization of their gene products. The results showed that the genes required for air-drying tolerance were frequently involved in mitochondrial functions and in connection with vacuolar H(+)-ATPase, which plays a role in vacuolar acidification. To determine the role of vacuolar acidification in air-drying stress tolerance, we monitored intracellular pH. The results showed that intracellular acidification was induced during air-drying and that this acidification was amplified in a deletion mutant of the VMA2 gene encoding a component of vacuolar H(+)-ATPase, suggesting that vacuolar H(+)-ATPase helps maintain intracellular pH homeostasis, which is affected by air-drying stress. To determine the effects of air-drying stress on mitochondria, we analysed the mitochondrial membrane potential under air-drying stress conditions using MitoTracker. The results showed that mitochondria were extremely sensitive to air-drying stress, suggesting that a mitochondrial function is required for tolerance to air-drying stress. We also analysed the correlation between oxidative-stress sensitivity and air-drying-stress sensitivity. The results suggested that oxidative stress is a critical determinant of sensitivity to air-drying stress, although ROS-scavenging systems are not necessary for air-drying stress tolerance. (c) 2008 John Wiley & Sons, Ltd.
An efficient transgenic system by TA cloning vectors and RNAi for C. elegans
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gengyo-Ando, Keiko; CREST, JST, 4-1-8 Hon-cho, Kawaguchi, Saitama 332-0012; Yoshina, Sawako
2006-11-03
In the nematode, transgenic analyses have been performed by microinjection of DNA from various sources into the syncytium gonad. To expedite these transgenic analyses, we solved two potential problems in this work. First, we constructed an efficient TA-cloning vector system which is useful for any promoter. By amplifying the genomic DNA fragments which contain regulatory sequences with or without the coding region, we could easily construct plasmids expressing fluorescent protein fusion without considering restriction sites. We could dissect motor neurons with three colors in a single animal. Second, we used feeding RNAi to isolate transgenic strains which express lag-2::venus fusionmore » gene. We found that the fusion protein is toxic when ectopically expressed in embryos but is functional to rescue a loss of function mutant in the lag-2 gene. Thus, the transgenic system described here should be useful to examine the protein function in the nematode.« less
DaGO-Fun: tool for Gene Ontology-based functional analysis using term information content measures
2013-01-01
Background The use of Gene Ontology (GO) data in protein analyses have largely contributed to the improved outcomes of these analyses. Several GO semantic similarity measures have been proposed in recent years and provide tools that allow the integration of biological knowledge embedded in the GO structure into different biological analyses. There is a need for a unified tool that provides the scientific community with the opportunity to explore these different GO similarity measure approaches and their biological applications. Results We have developed DaGO-Fun, an online tool available at http://web.cbio.uct.ac.za/ITGOM, which incorporates many different GO similarity measures for exploring, analyzing and comparing GO terms and proteins within the context of GO. It uses GO data and UniProt proteins with their GO annotations as provided by the Gene Ontology Annotation (GOA) project to precompute GO term information content (IC), enabling rapid response to user queries. Conclusions The DaGO-Fun online tool presents the advantage of integrating all the relevant IC-based GO similarity measures, including topology- and annotation-based approaches to facilitate effective exploration of these measures, thus enabling users to choose the most relevant approach for their application. Furthermore, this tool includes several biological applications related to GO semantic similarity scores, including the retrieval of genes based on their GO annotations, the clustering of functionally related genes within a set, and term enrichment analysis. PMID:24067102
Eleftherohorinou, Hariklia; Hoggart, Clive J; Wright, Victoria J; Levin, Michael; Coin, Lachlan J M
2011-09-01
Rheumatoid arthritis (RA) is the commonest chronic, systemic, inflammatory disorder affecting ∼1% of the world population. It has a strong genetic component and a growing number of associated genes have been discovered in genome-wide association studies (GWAS), which nevertheless only account for 23% of the total genetic risk. We aimed to identify additional susceptibility loci through the analysis of GWAS in the context of biological function. We bridge the gap between pathway and gene-oriented analyses of GWAS, by introducing a pathway-driven gene stability-selection methodology that identifies potential causal genes in the top-associated disease pathways that may be driving the pathway association signals. We analysed the WTCCC and the NARAC studies of ∼5000 and ∼2000 subjects, respectively. We examined 700 pathways comprising ∼8000 genes. Ranking pathways by significance revealed that the NARAC top-ranked ∼6% laid within the top 10% of WTCCC. Gene selection on those pathways identified 58 genes in WTCCC and 61 in NARAC; 21 of those were common (P(overlap)< 10(-21)), of which 16 were novel discoveries. Among the identified genes, we validated 10 known RA associations in WTCCC and 13 in NARAC, not discovered using single-SNP approaches on the same data. Gene ontology functional enrichment analysis on the identified genes showed significant over-representation of signalling activity (P< 10(-29)) in both studies. Our findings suggest a novel model of RA genetic predisposition, which involves cell-membrane receptors and genes in second messenger signalling systems, in addition to genes that regulate immune responses, which have been the focus of interest previously.
Floral gene resources from basal angiosperms for comparative genomics research
Albert, Victor A; Soltis, Douglas E; Carlson, John E; Farmerie, William G; Wall, P Kerr; Ilut, Daniel C; Solow, Teri M; Mueller, Lukas A; Landherr, Lena L; Hu, Yi; Buzgo, Matyas; Kim, Sangtae; Yoo, Mi-Jeong; Frohlich, Michael W; Perl-Treves, Rafael; Schlarbaum, Scott E; Bliss, Barbara J; Zhang, Xiaohong; Tanksley, Steven D; Oppenheimer, David G; Soltis, Pamela S; Ma, Hong; dePamphilis, Claude W; Leebens-Mack, James H
2005-01-01
Background The Floral Genome Project was initiated to bridge the genomic gap between the most broadly studied plant model systems. Arabidopsis and rice, although now completely sequenced and under intensive comparative genomic investigation, are separated by at least 125 million years of evolutionary time, and cannot in isolation provide a comprehensive perspective on structural and functional aspects of flowering plant genome dynamics. Here we discuss new genomic resources available to the scientific community, comprising cDNA libraries and Expressed Sequence Tag (EST) sequences for a suite of phylogenetically basal angiosperms specifically selected to bridge the evolutionary gaps between model plants and provide insights into gene content and genome structure in the earliest flowering plants. Results Random sequencing of cDNAs from representatives of phylogenetically important eudicot, non-grass monocot, and gymnosperm lineages has so far (as of 12/1/04) generated 70,514 ESTs and 48,170 assembled unigenes. Efficient sorting of EST sequences into putative gene families based on whole Arabidopsis/rice proteome comparison has permitted ready identification of cDNA clones for finished sequencing. Preliminarily, (i) proportions of functional categories among sequenced floral genes seem representative of the entire Arabidopsis transcriptome, (ii) many known floral gene homologues have been captured, and (iii) phylogenetic analyses of ESTs are providing new insights into the process of gene family evolution in relation to the origin and diversification of the angiosperms. Conclusion Initial comparisons illustrate the utility of the EST data sets toward discovery of the basic floral transcriptome. These first findings also afford the opportunity to address a number of conspicuous evolutionary genomic questions, including reproductive organ transcriptome overlap between angiosperms and gymnosperms, genome-wide duplication history, lineage-specific gene duplication and functional divergence, and analyses of adaptive molecular evolution. Since not all genes in the floral transcriptome will be associated with flowering, these EST resources will also be of interest to plant scientists working on other functions, such as photosynthesis, signal transduction, and metabolic pathways. PMID:15799777
Differential expression pattern of UBX family genes in Caenorhabditis elegans
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yamauchi, Seiji; Sasagawa, Yohei; Ogura, Teru
2007-06-29
UBX (ubiquitin regulatory X)-containing proteins belong to an evolutionary conserved protein family and determine the specificity of p97/VCP/Cdc48p function by binding as its adaptors. Caenorhabditis elegans was found to possess six UBX-containing proteins, named UBXN-1 to -6. However, no general or specific function of them has been revealed. During the course of understanding not only their function but also specified function of p97, we investigated spatial and temporal expression patterns of six ubxn genes in this study. Transcript analyses showed that the expression pattern of each ubxn gene was different throughout worm's development and may show potential developmental dynamics inmore » their function, especially ubxn-5 was expressed specifically in the spermatogenic germline, suggesting a crucial role in spermatogenesis. In addition, as ubxn-4 expression was induced by ER stress, it would function as an ERAD factor in C. elegans. In vivo expression analysis by using GFP translational fusion constructs revealed that six ubxn genes show distinct expression patterns. These results altogether demonstrate that the expression of all six ubxn genes of C. elegans is differently regulated.« less
2013-01-01
Background Understanding the function of a particular gene under various stresses is important for engineering plants for broad-spectrum stress tolerance. Although virus-induced gene silencing (VIGS) has been used to characterize genes involved in abiotic stress tolerance, currently available gene silencing and stress imposition methodology at the whole plant level is not suitable for high-throughput functional analyses of genes. This demands a robust and reliable methodology for characterizing genes involved in abiotic and multi-stress tolerance. Results Our methodology employs VIGS-based gene silencing in leaf disks combined with simple stress imposition and effect quantification methodologies for easy and faster characterization of genes involved in abiotic and multi-stress tolerance. By subjecting leaf disks from gene-silenced plants to various abiotic stresses and inoculating silenced plants with various pathogens, we show the involvement of several genes for multi-stress tolerance. In addition, we demonstrate that VIGS can be used to characterize genes involved in thermotolerance. Our results also showed the functional relevance of NtEDS1 in abiotic stress, NbRBX1 and NbCTR1 in oxidative stress; NtRAR1 and NtNPR1 in salinity stress; NbSOS1 and NbHSP101 in biotic stress; and NtEDS1, NbETR1, NbWRKY2 and NbMYC2 in thermotolerance. Conclusions In addition to widening the application of VIGS, we developed a robust, easy and high-throughput methodology for functional characterization of genes involved in multi-stress tolerance. PMID:24289810
Gardiner, Donald M.; McDonald, Megan C.; Covarelli, Lorenzo; Solomon, Peter S.; Rusu, Anca G.; Marshall, Mhairi; Kazan, Kemal; Chakraborty, Sukumar; McDonald, Bruce A.; Manners, John M.
2012-01-01
Comparative analyses of pathogen genomes provide new insights into how pathogens have evolved common and divergent virulence strategies to invade related plant species. Fusarium crown and root rots are important diseases of wheat and barley world-wide. In Australia, these diseases are primarily caused by the fungal pathogen Fusarium pseudograminearum. Comparative genomic analyses showed that the F. pseudograminearum genome encodes proteins that are present in other fungal pathogens of cereals but absent in non-cereal pathogens. In some cases, these cereal pathogen specific genes were also found in bacteria associated with plants. Phylogenetic analysis of selected F. pseudograminearum genes supported the hypothesis of horizontal gene transfer into diverse cereal pathogens. Two horizontally acquired genes with no previously known role in fungal pathogenesis were studied functionally via gene knockout methods and shown to significantly affect virulence of F. pseudograminearum on the cereal hosts wheat and barley. Our results indicate using comparative genomics to identify genes specific to pathogens of related hosts reveals novel virulence genes and illustrates the importance of horizontal gene transfer in the evolution of plant infecting fungal pathogens. PMID:23028337
Conditioned taste aversion dependent regulation of amygdala gene expression.
Panguluri, Siva K; Kuwabara, Nobuyuki; Kang, Yi; Cooper, Nigel; Lundy, Robert F
2012-02-28
The present experiments investigated gene expression in the amygdala following contingent taste/LiCl treatment that supports development of conditioned taste aversion (CTA). The use of whole genome chips and stringent data set filtering led to the identification of 168 genes regulated by CTA compared to non-contingent LiCl treatment that does not support CTA learning. Seventy-six of these genes were eligible for network analysis. Such analysis identified "behavior" as the top biological function, which was represented by 15 of the 76 genes. These genes included several neuropeptides, G protein-coupled receptors, ion channels, kinases, and phosphatases. Subsequent qRT-PCR analyses confirmed changes in mRNA expression for 5 of 7 selected genes. We were able to demonstrate directionally consistent changes in protein level for 3 of these genes; insulin 1, oxytocin, and major histocompatibility complex class I-C. Behavioral analyses demonstrated that blockade of central insulin receptors produced a weaker CTA that was less resistant to extinction. Together, these results support the notion that we have identified downstream genes in the amygdala that contribute to CTA learning. Copyright © 2011 Elsevier Inc. All rights reserved.
Pappa, Irene; Szekely, Eszter; Mileva-Seitz, Viara R; Luijk, Maartje P C M; Bakermans-Kranenburg, Marian J; van IJzendoorn, Marinus H; Tiemeier, Henning
2015-01-01
Although the environmental influences on infant attachment disorganization and security are well-studied, little is known about their heritability. Candidate gene studies have shown small, often non-replicable effects. In this study, we gathered the largest sample (N = 657) of ethnically homogenous, 14-month-old children with both observed attachment and genome-wide data. First, we used a Genome-Wide Association Study (GWAS) approach to identify single nucleotide polymorphisms (SNPs) associated with attachment disorganization and security. Second, we annotated them into genes (Versatile Gene-based Association Study) and functional pathways. Our analyses provide evidence of novel genes (HDAC1, ZNF675, BSCD1) and pathways (synaptic transmission, cation transport) associated with attachment disorganization. Similar analyses identified a novel gene (BECN1) but no distinct pathways associated with attachment security. The results of this first extensive, exploratory study on the molecular-genetic basis of infant attachment await replication in large, independent samples.
Wermter, Anne-Kathrin; Kamp-Becker, Inge; Hesse, Philipp; Schulte-Körne, Gerd; Strauch, Konstantin; Remschmidt, Helmut
2010-03-05
An increasing number of animal studies advert to a substantial role of the neuropeptide oxytocin in the regulation of social attachment and affiliation. Furthermore, animal studies showed anxiety and stress-reduced effects of oxytocin. First human studies confirm these findings in animal studies and implicate a crucial role of oxytocin in human social attachment behavior and in social interactions. Thus, the oxytocin system might be involved in the impairment of social interaction and attachment in autism spectrum disorders (ASD). The human oxytocin receptor gene (OXTR) represents a plausible candidate gene for the etiology of ASD. To analyze whether genetic variants in the OXTR gene are associated with ASD we performed family-based single-marker and haplotype association analyses with 22 single nucleotide polymorphisms (SNPs) in the OXTR and its 5' region in 100 families with autistic disorders on high-functioning level (Asperger syndrome (AS), high-functioning autism (HFA), and atypical autism (AA)). Single-marker and haplotype association analyses revealed nominally significant associations of one single SNP and one haplotype with autism, respectively. Furthermore, employing a "reverse phenotyping" approach, patients carrying the haplotype associated with autism showed nominally significant impairments in comparison to noncarriers of the haplotype in items of the Autism Diagnostic Interview-Revised algorithm describing aspects of social interaction and communication. In conclusion, our results implicate that genetic variation in the OXTR gene might be relevant in the etiology of autism on high-functioning level. (c) 2009 Wiley-Liss, Inc.
Kou, Xiaobing; Qi, Kaijie; Qiao, Xin; Yin, Hao; Liu, Xing; Zhang, Shaoling; Wu, Juyou
2017-07-01
The Catharanthus roseus RLK1-like kinase (CrRLK1L) family is involved in multiple processes during plant growth. However, little is known about CrRLK1L in the wood of the pear fruit tree Pyrus bretchneideri. In this study, 26 CrRLK1L gene members were identified in pear and were grouped into six subfamilies according to phylogenetic analyses. Evolutionary analysis indicated that recent whole genome duplication (WGD) and dispersed gene duplications may contribute to the expansion of the CrRLK1L gene family in pear. Moreover, tissue-specific expression analyses suggested that CrRLK1Ls are involved in the development of various pear tissues. Subsequent qRT-PCR analyses indicated that CrRLK1Ls might play important roles in pollen tube growth. Finally, experiments with antisense oligonucleotides (ASO) demonstrated that PbrCrRLK1L26 have functions in pollen tube elongation and that PbrCrRLK1L3 regulates pollen tube rupture. These results will be useful for elaborating the biological roles of CrRLK1Ls in pear growth and development. Copyright © 2017. Published by Elsevier Inc.
Hu, Valerie W.; Sarachana, Tewarit; Kim, Kyung Soon; Nguyen, AnhThu; Kulkarni, Shreya; Steinberg, Mara E.; Luu, Truong; Lai, Yinglei; Lee, Norman H.
2009-01-01
Autism spectrum disorders (ASD) are neurodevelopmental disorders characterized by delayed/abnormal language development, deficits in social interaction, repetitive behaviors and restricted interests. The heterogeneity in clinical presentation of ASD, likely due to different etiologies, complicates genetic/biological analyses of these disorders. DNA microarray analyses were conducted on 116 lymphoblastoid cell lines (LCL) from individuals with idiopathic autism who are divided into three phenotypic subgroups according to severity scores from the commonly used Autism Diagnostic Interview-Revised questionnaire and age-matched, nonautistic controls. Statistical analyses of gene expression data from control LCL against that of LCL from ASD probands identify genes for which expression levels are either quantitatively or qualitatively associated with phenotypic severity. Comparison of the significant differentially expressed genes from each subgroup relative to the control group reveals differentially expressed genes unique to each subgroup as well as genes in common across subgroups. Among the findings unique to the most severely affected ASD group are 15 genes that regulate circadian rhythm, which has been shown to have multiple effects on neurological as well as metabolic functions commonly dysregulated in autism. Among the genes common to all three subgroups of ASD are 20 novel genes mostly in putative noncoding regions, which appear to associate with androgen sensitivity and which may underlie the strong 4:1 bias toward affected males. PMID:19418574
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tschaplinski, Timothy J; Tsai, Chung-Jui; Harding, Scott A
Salicin-based phenolic glycosides, hydroxycinnamate derivatives and flavonoid-derived condensed tannins comprise up to one-third of Populus leaf dry mass. Genes regulating the abundance and chemical diversity of these substances have not been comprehensively analysed in tree species exhibiting this metabolically demanding level of phenolic metabolism. Here, shikimate-phenylpropanoid pathway genes thought to give rise to these phenolic products were annotated from the Populus genome, their expression assessed by semiquantitative or quantitative reverse transcription polymerase chain reaction (PCR), and metabolic evidence for function presented. Unlike Arabidopsis, Populus leaves accumulate an array of hydroxycinnamoyl-quinate esters, which is consistent with broadened function of the expandedmore » hydroxycinnamoyl-CoA transferase gene family. Greater flavonoid pathway diversity is also represented, and flavonoid gene families are larger. Consistent with expanded pathway function, most of these genes were upregulated during wound-stimulated condensed tannin synthesis in leaves. The suite of Populus genes regulating phenylpropanoid product accumulation should have important application in managing phenolic carbon pools in relation to climate change and global carbon cycling.« less
Xiong, Jinbo; Wu, Liyou; Tu, Shuxin; Van Nostrand, Joy D.; He, Zhili; Zhou, Jizhong; Wang, Gejiao
2010-01-01
To understand how microbial communities and functional genes respond to arsenic contamination in the rhizosphere of Pteris vittata, five soil samples with different arsenic contamination levels were collected from the rhizosphere of P. vittata and nonrhizosphere areas and investigated by Biolog, geochemical, and functional gene microarray (GeoChip 3.0) analyses. Biolog analysis revealed that the uncontaminated soil harbored the greatest diversity of sole-carbon utilization abilities and that arsenic contamination decreased the metabolic diversity, while rhizosphere soils had higher metabolic diversities than did the nonrhizosphere soils. GeoChip 3.0 analysis showed low proportions of overlapping genes across the five soil samples (16.52% to 45.75%). The uncontaminated soil had a higher heterogeneity and more unique genes (48.09%) than did the arsenic-contaminated soils. Arsenic resistance, sulfur reduction, phosphorus utilization, and denitrification genes were remarkably distinct between P. vittata rhizosphere and nonrhizosphere soils, which provides evidence for a strong linkage among the level of arsenic contamination, the rhizosphere, and the functional gene distribution. Canonical correspondence analysis (CCA) revealed that arsenic is the main driver in reducing the soil functional gene diversity; however, organic matter and phosphorus also have significant effects on the soil microbial community structure. The results implied that rhizobacteria play an important role during soil arsenic uptake and hyperaccumulation processes of P. vittata. PMID:20833780
A survey of disease connections for CD4+ T cell master genes and their directly linked genes.
Li, Wentian; Espinal-Enríquez, Jesús; Simpfendorfer, Kim R; Hernández-Lemus, Enrique
2015-12-01
Genome-wide association studies and other genetic analyses have identified a large number of genes and variants implicating a variety of disease etiological mechanisms. It is imperative for the study of human diseases to put these genetic findings into a coherent functional context. Here we use system biology tools to examine disease connections of five master genes for CD4+ T cell subtypes (TBX21, GATA3, RORC, BCL6, and FOXP3). We compiled a list of genes functionally interacting (protein-protein interaction, or by acting in the same pathway) with the master genes, then we surveyed the disease connections, either by experimental evidence or by genetic association. Embryonic lethal genes (also known as essential genes) are over-represented in master genes and their interacting genes (55% versus 40% in other genes). Transcription factors are significantly enriched among genes interacting with the master genes (63% versus 10% in other genes). Predicted haploinsufficiency is a feature of most these genes. Disease-connected genes are enriched in this list of genes: 42% of these genes have a disease connection according to Online Mendelian Inheritance in Man (OMIM) (versus 23% in other genes), and 74% are associated with some diseases or phenotype in a Genome Wide Association Study (GWAS) (versus 43% in other genes). Seemingly, not all of the diseases connected to genes surveyed were immune related, which may indicate pleiotropic functions of the master regulator genes and associated genes. Copyright © 2015 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ginns, E.I.; Winfield, S.; Sidransky, E.
1994-09-01
The human GC locus on chromosome 1q21 encompasses a 7 kb functional gene encoding the enzyme deficient in Gaucher disease, and a highly homologous sequence 16 Kb downstream that has the properties of a pseudogene. A novel gene, gene X, spanning the 6 kb region between the pseudogene and TSP3 has been identified and characterized in the mouse, and appears to be critical for normal embryonic development. As in the mouse, the human gene X is located 5{prime} to the TSP3 gene and two genes are transcribed divergently from a bidirectional promoter; the direction of transcription of gene X andmore » GC is convergent. However, in the human, gene X and GC are separated by gene X and GC pseudogenes that are the consequence of a gene duplication. The gene X pseudogene lacks the first exon and part of the second exon of the functional gene and may not be transcribed. Northern blot analyses indicate that gene X is transcribed in both normal individuals and in patients with Gaucher disease, but the function of this gene is still unknown. The possibility that mutations in gene X could account for some of the diversity of symptoms encountered in individuals with the more atypical presentations of Gaucher disease is under investigation.« less
From Genome to Function: Systematic Analysis of the Soil Bacterium Bacillus Subtilis
Crawshaw, Samuel G.; Wipat, Anil
2001-01-01
Bacillus subtilis is a sporulating Gram-positive bacterium that lives primarily in the soil and associated water sources. Whilst this bacterium has been studied extensively in the laboratory, relatively few studies have been undertaken to study its activity in natural environments. The publication of the B. subtilis genome sequence and subsequent systematic functional analysis programme have provided an opportunity to develop tools for analysing the role and expression of Bacillus genes in situ. In this paper we discuss analytical approaches that are being developed to relate genes to function in environments such as the rhizosphere. PMID:18628943
Gene set analysis using variance component tests.
Huang, Yen-Tsung; Lin, Xihong
2013-06-28
Gene set analyses have become increasingly important in genomic research, as many complex diseases are contributed jointly by alterations of numerous genes. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). We develop a gene set analyses method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and global test in both simulation and a diabetes microarray data.
Trade-off between taxon diversity and functional diversity in European lake ecosystems.
Grossmann, Lars; Beisser, Daniela; Bock, Christina; Chatzinotas, Antonis; Jensen, Manfred; Preisfeld, Angelika; Psenner, Roland; Rahmann, Sven; Wodniok, Sabina; Boenigk, Jens
2016-12-01
Inferring ecosystem functioning and ecosystem services through inspections of the species inventory is a major aspect of ecological field studies. Ecosystem functions are often stable despite considerable species turnover. Using metatranscriptome analyses, we analyse a thus-far unparalleled freshwater data set which comprises 21 mainland European freshwater lakes from the Sierra Nevada (Spain) to the Carpathian Mountains (Romania) and from northern Germany to the Apennines (Italy) and covers an altitudinal range from 38 m above sea level (a.s.l) to 3110 m a.s.l. The dominant taxa were Chlorophyta and streptophytic algae, Ciliophora, Bacillariophyta and Chrysophyta. Metatranscriptomics provided insights into differences in community composition and into functional diversity via the relative share of taxa to the overall read abundance of distinct functional genes on the ecosystem level. The dominant metabolic pathways in terms of the fraction of expressed sequences in the cDNA libraries were affiliated with primary metabolism, specifically oxidative phosphorylation, photosynthesis and the TCA cycle. Our analyses indicate that community composition is a good first proxy for the analysis of ecosystem functions. However, differential gene regulation modifies the relative importance of taxa in distinct pathways. Whereas taxon composition varies considerably between lakes, the relative importance of distinct metabolic pathways is much more stable, indicating that ecosystem functioning is buffered against shifts in community composition through a functional redundancy of taxa. © 2016 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.
Structural and functional annotation of the porcine immunome
2013-01-01
Background The domestic pig is known as an excellent model for human immunology and the two species share many pathogens. Susceptibility to infectious disease is one of the major constraints on swine performance, yet the structure and function of genes comprising the pig immunome are not well-characterized. The completion of the pig genome provides the opportunity to annotate the pig immunome, and compare and contrast pig and human immune systems. Results The Immune Response Annotation Group (IRAG) used computational curation and manual annotation of the swine genome assembly 10.2 (Sscrofa10.2) to refine the currently available automated annotation of 1,369 immunity-related genes through sequence-based comparison to genes in other species. Within these genes, we annotated 3,472 transcripts. Annotation provided evidence for gene expansions in several immune response families, and identified artiodactyl-specific expansions in the cathelicidin and type 1 Interferon families. We found gene duplications for 18 genes, including 13 immune response genes and five non-immune response genes discovered in the annotation process. Manual annotation provided evidence for many new alternative splice variants and 8 gene duplications. Over 1,100 transcripts without porcine sequence evidence were detected using cross-species annotation. We used a functional approach to discover and accurately annotate porcine immune response genes. A co-expression clustering analysis of transcriptomic data from selected experimental infections or immune stimulations of blood, macrophages or lymph nodes identified a large cluster of genes that exhibited a correlated positive response upon infection across multiple pathogens or immune stimuli. Interestingly, this gene cluster (cluster 4) is enriched for known general human immune response genes, yet contains many un-annotated porcine genes. A phylogenetic analysis of the encoded proteins of cluster 4 genes showed that 15% exhibited an accelerated evolution as compared to 4.1% across the entire genome. Conclusions This extensive annotation dramatically extends the genome-based knowledge of the molecular genetics and structure of a major portion of the porcine immunome. Our complementary functional approach using co-expression during immune response has provided new putative immune response annotation for over 500 porcine genes. Our phylogenetic analysis of this core immunome cluster confirms rapid evolutionary change in this set of genes, and that, as in other species, such genes are important components of the pig’s adaptation to pathogen challenge over evolutionary time. These comprehensive and integrated analyses increase the value of the porcine genome sequence and provide important tools for global analyses and data-mining of the porcine immune response. PMID:23676093
Convergent evolution of heat-inducibility during subfunctionalization of the Hsp70 gene family
2013-01-01
Background Heat-shock proteins of the 70 kDa family (Hsp70s) are essential chaperones required for key cellular functions. In eukaryotes, four subfamilies can be distinguished according to their function and localisation in different cellular compartments: cytosol, endoplasmic reticulum, mitochondria and chloroplasts. Generally, multiple cytosol-type Hsp70s can be found in metazoans that show either constitutive expression and/or stress-inducibility, arguing for the evolution of different tasks and functions. Information about the hsp70 copy number and diversity in microbial eukaryotes is, however, scarce, and detailed knowledge about the differential gene expression in most protists is lacking. Therefore, we have characterised the Hsp70 gene family of Paramecium caudatum to gain insight into the evolution and differential heat stress response of the distinct family members in protists and to investigate the diversification of eukaryotic hsp70s focusing on the evolution of heat-inducibility. Results Eleven putative hsp70 genes could be detected in P. caudatum comprising homologs of three major Hsp70-subfamilies. Phylogenetic analyses revealed five evolutionarily distinct Hsp70-groups, each with a closer relationship to orthologous sequences of Paramecium tetraurelia than to another P. caudatum Hsp70-group. These highly diverse, paralogous groups resulted from duplications preceding Paramecium speciation, underwent divergent evolution and were subject to purifying selection. Heat-shock treatments were performed to test for differential expression patterns among the five Hsp70-groups as well as for a functional conservation within Paramecium. These treatments induced exceptionally high mRNA up-regulations in one cytosolic group with a low basal expression, indicative for the major heat inducible hsp70s. All other groups showed comparatively high basal expression levels and moderate heat-inducibility, signifying constitutively expressed genes. Comparative EST analyses for P. tetraurelia hsp70s unveiled a corresponding expression pattern, which supports a functionally conserved evolution of the Hsp70 gene family in Paramecium. Conclusions Our analyses suggest an independent evolution of the heat-inducible cytosol-type hsp70s in Paramecium and in its close relative Tetrahymena, as well as within higher eukaryotes. This result indicates convergent evolution during hsp70 subfunctionalization and implies that heat-inducibility evolved several times during the course of eukaryotic evolution. PMID:23433225
Plant responses to environmental stress: regulation and functions of the Arabidopsis TCH genes
NASA Technical Reports Server (NTRS)
Braam, J.; Sistrunk, M. L.; Polisensky, D. H.; Xu, W.; Purugganan, M. M.; Antosiewicz, D. M.; Campbell, P.; Johnson, K. A.; McIntire, L. V. (Principal Investigator)
1997-01-01
Expression of the Arabidopsis TCH genes is markedly upregulated in response to a variety of environmental stimuli including the seemingly innocuous stimulus of touch. Understanding the mechanism(s) and factors that control TCH gene regulation will shed light on the signaling pathways that enable plants to respond to environmental conditions. The TCH proteins include calmodulin, calmodulin-related proteins and a xyloglucan endotransglycosylase. Expression analyses and localization of protein accumulation indicates that the potential sites of TCH protein function include expanding cells and tissues under mechanical strain. We hypothesize that at least a subset of the TCH proteins may collaborate in cell wall biogenesis.
Clinical and Functional Analyses of p73R1 Mutations in Prostate Cancer
2005-02-01
mutations in several genes (BRCA 1, BRCA2, and CHEK2) whose products are involved in this pathway have been associated with increased risk for this...screened this gene for mutations in prostate cancer. Two germline truncating mutations were identified. Genotyping of 403 men with sporadic prostate...based on mutation screening of candidate genes involved in the DNA damage- signaling pathway. Genomic instability is a common feature of all human
Blevins, Tana; Aliev, Fazil; Adkins, Amy; Hack, Laura; Bigdeli, Tim; D. van der Vaart, Andrew; Web, Bradley Todd; Bacanu, Silviu-Alin; Kalsi, Gursharan; Kendler, Kenneth S.; Miles, Michael F.; Dick, Danielle; Riley, Brien P.; Dumur, Catherine; Vladimirov, Vladimir I.
2015-01-01
Alcohol consumption is known to lead to gene expression changes in the brain. After performing weighted gene co-expression network analyses (WGCNA) on genome-wide mRNA and microRNA (miRNA) expression in Nucleus Accumbens (NAc) of subjects with alcohol dependence (AD; N = 18) and of matched controls (N = 18), six mRNA and three miRNA modules significantly correlated with AD were identified (Bonferoni-adj. p≤ 0.05). Cell-type-specific transcriptome analyses revealed two of the mRNA modules to be enriched for neuronal specific marker genes and downregulated in AD, whereas the remaining four mRNA modules were enriched for astrocyte and microglial specific marker genes and upregulated in AD. Gene set enrichment analysis demonstrated that neuronal specific modules were enriched for genes involved in oxidative phosphorylation, mitochondrial dysfunction and MAPK signaling. Glial-specific modules were predominantly enriched for genes involved in processes related to immune functions, i.e. cytokine signaling (all adj. p≤ 0.05). In mRNA and miRNA modules, 461 and 25 candidate hub genes were identified, respectively. In contrast to the expected biological functions of miRNAs, correlation analyses between mRNA and miRNA hub genes revealed a higher number of positive than negative correlations (χ2 test p≤ 0.0001). Integration of hub gene expression with genome-wide genotypic data resulted in 591 mRNA cis-eQTLs and 62 miRNA cis-eQTLs. mRNA cis-eQTLs were significantly enriched for AD diagnosis and AD symptom counts (adj. p = 0.014 and p = 0.024, respectively) in AD GWAS signals in a large, independent genetic sample from the Collaborative Study on Genetics of Alcohol (COGA). In conclusion, our study identified putative gene network hubs coordinating mRNA and miRNA co-expression changes in the NAc of AD subjects, and our genetic (cis-eQTL) analysis provides novel insights into the etiological mechanisms of AD. PMID:26381263
Kumar, Hirdesh; Frischknecht, Friedrich; Mair, Gunnar R; Gomes, James
2015-12-01
Genetically attenuated parasites (GAPs) that lack genes essential for the liver stage of the malaria parasite, and therefore cause developmental arrest, have been developed as live vaccines in rodent malaria models and recently been tested in humans. The genes targeted for deletion were often identified by trial and error. Here we present a systematic gene - protein and transcript - expression analyses of several Plasmodium species with the aim to identify candidate genes for the generation of novel GAPs. With a lack of liver stage expression data for human malaria parasites, we used data available for liver stage development of Plasmodium yoelii, a rodent malaria model, to identify proteins expressed in the liver stage but absent from blood stage parasites. An orthology-based search was then employed to identify orthologous proteins in the human malaria parasite Plasmodium falciparum resulting in a total of 310 genes expressed in the liver stage but lacking evidence of protein expression in blood stage parasites. Among these 310 possible GAP candidates, we further studied Plasmodium liver stage proteins by phyletic distribution and functional domain analyses and shortlisted twenty GAP-candidates; these are: fabB/F, fabI, arp, 3 genes encoding subunits of the PDH complex, dnaJ, urm1, rS5, ancp, mcp, arh, gk, lisp2, valS, palm, and four conserved Plasmodium proteins of unknown function. Parasites lacking one or several of these genes might yield new attenuated malaria parasites for experimental vaccination studies. Copyright © 2015 Elsevier B.V. All rights reserved.
Scott, Barry; Young, Carolyn A.; Saikia, Sanjay; McMillan, Lisa K.; Monahan, Brendon J.; Koulman, Albert; Astin, Jonathan; Eaton, Carla J.; Bryant, Andrea; Wrenn, Ruth E.; Finch, Sarah C.; Tapper, Brian A.; Parker, Emily J.; Jameson, Geoffrey B.
2013-01-01
The indole-diterpene paxilline is an abundant secondary metabolite synthesized by Penicillium paxilli. In total, 21 genes have been identified at the PAX locus of which six have been previously confirmed to have a functional role in paxilline biosynthesis. A combination of bioinformatics, gene expression and targeted gene replacement analyses were used to define the boundaries of the PAX gene cluster. Targeted gene replacement identified seven genes, paxG, paxA, paxM, paxB, paxC, paxP and paxQ that were all required for paxilline production, with one additional gene, paxD, required for regular prenylation of the indole ring post paxilline synthesis. The two putative transcription factors, PP104 and PP105, were not co-regulated with the pax genes and based on targeted gene replacement, including the double knockout, did not have a role in paxilline production. The relationship of indole dimethylallyl transferases involved in prenylation of indole-diterpenes such as paxilline or lolitrem B, can be found as two disparate clades, not supported by prenylation type (e.g., regular or reverse). This paper provides insight into the P. paxilli indole-diterpene locus and reviews the recent advances identified in paxilline biosynthesis. PMID:23949005
Opazo, Juan C; Lee, Alison P; Hoffmann, Federico G; Toloza-Villalobos, Jessica; Burmester, Thorsten; Venkatesh, Byrappa; Storz, Jay F
2015-07-01
Comparative analyses of vertebrate genomes continue to uncover a surprising diversity of genes in the globin gene superfamily, some of which have very restricted phyletic distributions despite their antiquity. Genomic analysis of the globin gene repertoire of cartilaginous fish (Chondrichthyes) should be especially informative about the duplicative origins and ancestral functions of vertebrate globins, as divergence between Chondrichthyes and bony vertebrates represents the most basal split within the jawed vertebrates. Here, we report a comparative genomic analysis of the vertebrate globin gene family that includes the complete globin gene repertoire of the elephant shark (Callorhinchus milii). Using genomic sequence data from representatives of all major vertebrate classes, integrated analyses of conserved synteny and phylogenetic relationships revealed that the last common ancestor of vertebrates possessed a repertoire of at least seven globin genes: single copies of androglobin and neuroglobin, four paralogous copies of globin X, and the single-copy progenitor of the entire set of vertebrate-specific globins. Combined with expression data, the genomic inventory of elephant shark globins yielded four especially surprising findings: 1) there is no trace of the neuroglobin gene (a highly conserved gene that is present in all other jawed vertebrates that have been examined to date), 2) myoglobin is highly expressed in heart, but not in skeletal muscle (reflecting a possible ancestral condition in vertebrates with single-circuit circulatory systems), 3) elephant shark possesses two highly divergent globin X paralogs, one of which is preferentially expressed in gonads, and 4) elephant shark possesses two structurally distinct α-globin paralogs, one of which is preferentially expressed in the brain. Expression profiles of elephant shark globin genes reveal distinct specializations of function relative to orthologs in bony vertebrates and suggest hypotheses about ancestral functions of vertebrate globins. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Caenorhabditis elegans ABCRNAi transporters interact genetically with rde-2 and mut-7.
Sundaram, Prema; Han, Wang; Cohen, Nancy; Echalier, Benjamin; Albin, John; Timmons, Lisa
2008-02-01
RNA interference (RNAi) mechanisms are conserved and consist of an interrelated network of activities that not only respond to exogenous dsRNA, but also perform endogenous functions required in the fine tuning of gene expression and in maintaining genome integrity. Not surprisingly, RNAi functions have widespread influences on cellular function and organismal development. Previously, we observed a reduced capacity to mount an RNAi response in nine Caenorhabditis elegans mutants that are defective in ABC transporter genes (ABC(RNAi) mutants). Here, we report an exhaustive study of mutants, collectively defective in 49 different ABC transporter genes, that allowed for the categorization of one additional transporter into the ABC(RNAi) gene class. Genetic complementation tests reveal functions for ABC(RNAi) transporters in the mut-7/rde-2 branch of the RNAi pathway. These second-site noncomplementation interactions suggest that ABC(RNAi) proteins and MUT-7/RDE-2 function together in parallel pathways and/or as multiprotein complexes. Like mut-7 and rde-2, some ABC(RNAi) mutants display transposon silencing defects. Finally, our analyses reveal a genetic interaction network of ABC(RNAi) gene function with respect to this part of the RNAi pathway. From our results, we speculate that the coordinated activities of ABC(RNAi) transporters, through their effects on endogenous RNAi-related mechanisms, ultimately affect chromosome function and integrity.
Caenorhabditis elegans ABCRNAi Transporters Interact Genetically With rde-2 and mut-7
Sundaram, Prema; Han, Wang; Cohen, Nancy; Echalier, Benjamin; Albin, John; Timmons, Lisa
2008-01-01
RNA interference (RNAi) mechanisms are conserved and consist of an interrelated network of activities that not only respond to exogenous dsRNA, but also perform endogenous functions required in the fine tuning of gene expression and in maintaining genome integrity. Not surprisingly, RNAi functions have widespread influences on cellular function and organismal development. Previously, we observed a reduced capacity to mount an RNAi response in nine Caenorhabditis elegans mutants that are defective in ABC transporter genes (ABCRNAi mutants). Here, we report an exhaustive study of mutants, collectively defective in 49 different ABC transporter genes, that allowed for the categorization of one additional transporter into the ABCRNAi gene class. Genetic complementation tests reveal functions for ABCRNAi transporters in the mut-7/rde-2 branch of the RNAi pathway. These second-site noncomplementation interactions suggest that ABCRNAi proteins and MUT-7/RDE-2 function together in parallel pathways and/or as multiprotein complexes. Like mut-7 and rde-2, some ABCRNAi mutants display transposon silencing defects. Finally, our analyses reveal a genetic interaction network of ABCRNAi gene function with respect to this part of the RNAi pathway. From our results, we speculate that the coordinated activities of ABCRNAi transporters, through their effects on endogenous RNAi-related mechanisms, ultimately affect chromosome function and integrity. PMID:18245353
GIANT 2.0: genome-scale integrated analysis of gene networks in tissues.
Wong, Aaron K; Krishnan, Arjun; Troyanskaya, Olga G
2018-05-25
GIANT2 (Genome-wide Integrated Analysis of gene Networks in Tissues) is an interactive web server that enables biomedical researchers to analyze their proteins and pathways of interest and generate hypotheses in the context of genome-scale functional maps of human tissues. The precise actions of genes are frequently dependent on their tissue context, yet direct assay of tissue-specific protein function and interactions remains infeasible in many normal human tissues and cell-types. With GIANT2, researchers can explore predicted tissue-specific functional roles of genes and reveal changes in those roles across tissues, all through interactive multi-network visualizations and analyses. Additionally, the NetWAS approach available through the server uses tissue-specific/cell-type networks predicted by GIANT2 to re-prioritize statistical associations from GWAS studies and identify disease-associated genes. GIANT2 predicts tissue-specific interactions by integrating diverse functional genomics data from now over 61 400 experiments for 283 diverse tissues and cell-types. GIANT2 does not require any registration or installation and is freely available for use at http://giant-v2.princeton.edu.
Cho, Young-Hee; Hong, Jung-Woo; Kim, Eun-Chul; Yoo, Sang-Dong
2012-04-01
Sucrose-nonfermentation1-related protein kinase1 (SnRK1) is an evolutionarily conserved energy sensor protein that regulates gene expression in response to energy depletion in plants. Efforts to elucidate the functions and mechanisms of this protein kinase are hampered, however, by inherent growth defects of snrk1-null mutant plants. To overcome these limitations and study SnRK1 functions in vivo, we applied a method combining transient expression in leaf mesophyll protoplasts and stable expression in transgenic plants. We found that both rice (Oryza sativa) and Arabidopsis (Arabidopsis thaliana) SnRK1 activities critically influence stress-inducible gene expression and the induction of stress tolerance. Genetic, molecular, and chromatin immunoprecipitation analyses further revealed that the nuclear SnRK1 modulated target gene transcription in a submergence-dependent manner. From early seedling development through late senescence, SnRK1 activities appeared to modulate developmental processes in the plants. Our findings offer insight into the regulatory functions of plant SnRK1 in stress-responsive gene regulation and in plant growth and development throughout the life cycle.
Ecological transcriptomics of lake-type and riverine sockeye salmon (Oncorhynchus nerka)
2011-01-01
Background There are a growing number of genomes sequenced with tentative functions assigned to a large proportion of the individual genes. Model organisms in laboratory settings form the basis for the assignment of gene function, and the ecological context of gene function is lacking. This work addresses this shortcoming by investigating expressed genes of sockeye salmon (Oncorhynchus nerka) muscle tissue. We compared morphology and gene expression in natural juvenile sockeye populations related to river and lake habitats. Based on previously documented divergent morphology, feeding strategy, and predation in association with these distinct environments, we expect that burst swimming is favored in riverine population and continuous swimming is favored in lake-type population. In turn we predict that morphology and expressed genes promote burst swimming in riverine sockeye and continuous swimming in lake-type sockeye. Results We found the riverine sockeye population had deep, robust bodies and lake-type had shallow, streamlined bodies. Gene expression patterns were measured using a 16K microarray, discovering 141 genes with significant differential expression. Overall, the identity and function of these genes was consistent with our hypothesis. In addition, Gene Ontology (GO) enrichment analyses with a larger set of differentially expressed genes found the "biosynthesis" category enriched for the riverine population and the "metabolism" category enriched for the lake-type population. Conclusions This study provides a framework for understanding sockeye life history from a transcriptomic perspective and a starting point for more extensive, targeted studies determining the ecological context of genes. PMID:22136247
Ecological transcriptomics of lake-type and riverine sockeye salmon (Oncorhynchus nerka).
Pavey, Scott A; Sutherland, Ben J G; Leong, Jong; Robb, Adrienne; von Schalburg, Kris; Hamon, Troy R; Koop, Ben F; Nielsen, Jennifer L
2011-12-02
There are a growing number of genomes sequenced with tentative functions assigned to a large proportion of the individual genes. Model organisms in laboratory settings form the basis for the assignment of gene function, and the ecological context of gene function is lacking. This work addresses this shortcoming by investigating expressed genes of sockeye salmon (Oncorhynchus nerka) muscle tissue. We compared morphology and gene expression in natural juvenile sockeye populations related to river and lake habitats. Based on previously documented divergent morphology, feeding strategy, and predation in association with these distinct environments, we expect that burst swimming is favored in riverine population and continuous swimming is favored in lake-type population. In turn we predict that morphology and expressed genes promote burst swimming in riverine sockeye and continuous swimming in lake-type sockeye. We found the riverine sockeye population had deep, robust bodies and lake-type had shallow, streamlined bodies. Gene expression patterns were measured using a 16 k microarray, discovering 141 genes with significant differential expression. Overall, the identity and function of these genes was consistent with our hypothesis. In addition, Gene Ontology (GO) enrichment analyses with a larger set of differentially expressed genes found the "biosynthesis" category enriched for the riverine population and the "metabolism" category enriched for the lake-type population. This study provides a framework for understanding sockeye life history from a transcriptomic perspective and a starting point for more extensive, targeted studies determining the ecological context of genes.
Rue-Albrecht, Kévin; McGettigan, Paul A; Hernández, Belinda; Nalpas, Nicolas C; Magee, David A; Parnell, Andrew C; Gordon, Stephen V; MacHugh, David E
2016-03-11
Identification of gene expression profiles that differentiate experimental groups is critical for discovery and analysis of key molecular pathways and also for selection of robust diagnostic or prognostic biomarkers. While integration of differential expression statistics has been used to refine gene set enrichment analyses, such approaches are typically limited to single gene lists resulting from simple two-group comparisons or time-series analyses. In contrast, functional class scoring and machine learning approaches provide powerful alternative methods to leverage molecular measurements for pathway analyses, and to compare continuous and multi-level categorical factors. We introduce GOexpress, a software package for scoring and summarising the capacity of gene ontology features to simultaneously classify samples from multiple experimental groups. GOexpress integrates normalised gene expression data (e.g., from microarray and RNA-seq experiments) and phenotypic information of individual samples with gene ontology annotations to derive a ranking of genes and gene ontology terms using a supervised learning approach. The default random forest algorithm allows interactions between all experimental factors, and competitive scoring of expressed genes to evaluate their relative importance in classifying predefined groups of samples. GOexpress enables rapid identification and visualisation of ontology-related gene panels that robustly classify groups of samples and supports both categorical (e.g., infection status, treatment) and continuous (e.g., time-series, drug concentrations) experimental factors. The use of standard Bioconductor extension packages and publicly available gene ontology annotations facilitates straightforward integration of GOexpress within existing computational biology pipelines.
Molecular characterization of the apical organ of the anthozoan Nematostella vectensis
Sinigaglia, Chiara; Busengdal, Henriette; Lerner, Avi; Oliveri, Paola; Rentzsch, Fabian
2015-01-01
Apical organs are sensory structures present in many marine invertebrate larvae where they are considered to be involved in their settlement, metamorphosis and locomotion. In bilaterians they are characterised by a tuft of long cilia and receptor cells and they are associated with groups of neurons, but their relatively low morphological complexity and dispersed phylogenetic distribution have left their evolutionary relationship unresolved. Moreover, since apical organs are not present in the standard model organisms, their development and function are not well understood. To provide a foundation for a better understanding of this structure we have characterised the molecular composition of the apical organ of the sea anemone Nematostella vectensis. In a microarray-based comparison of the gene expression profiles of planulae with either a wildtype or an experimentally expanded apical organ, we identified 78 evolutionarily conserved genes, which are predominantly or specifically expressed in the apical organ of Nematostella. This gene set comprises signalling molecules, transcription factors, structural and metabolic genes. The majority of these genes, including several conserved, but previously uncharacterized ones, are potentially involved in different aspects of the development or function of the long cilia of the apical organ. To demonstrate the utility of this gene set for comparative analyses, we further analysed the expression of a subset of previously uncharacterized putative orthologs in sea urchin larvae and detected expression for twelve out of eighteen of them in the apical domain. Our study provides a molecular characterization of the apical organ of Nematostella and represents an informative tool for future studies addressing the development, function and evolutionary history of apical organ cells. PMID:25478911
Franke, Lude; Bakel, Harm van; Fokkens, Like; de Jong, Edwin D.; Egmont-Petersen, Michael; Wijmenga, Cisca
2006-01-01
Most common genetic disorders have a complex inheritance and may result from variants in many genes, each contributing only weak effects to the disease. Pinpointing these disease genes within the myriad of susceptibility loci identified in linkage studies is difficult because these loci may contain hundreds of genes. However, in any disorder, most of the disease genes will be involved in only a few different molecular pathways. If we know something about the relationships between the genes, we can assess whether some genes (which may reside in different loci) functionally interact with each other, indicating a joint basis for the disease etiology. There are various repositories of information on pathway relationships. To consolidate this information, we developed a functional human gene network that integrates information on genes and the functional relationships between genes, based on data from the Kyoto Encyclopedia of Genes and Genomes, the Biomolecular Interaction Network Database, Reactome, the Human Protein Reference Database, the Gene Ontology database, predicted protein-protein interactions, human yeast two-hybrid interactions, and microarray coexpressions. We applied this network to interrelate positional candidate genes from different disease loci and then tested 96 heritable disorders for which the Online Mendelian Inheritance in Man database reported at least three disease genes. Artificial susceptibility loci, each containing 100 genes, were constructed around each disease gene, and we used the network to rank these genes on the basis of their functional interactions. By following up the top five genes per artificial locus, we were able to detect at least one known disease gene in 54% of the loci studied, representing a 2.8-fold increase over random selection. This suggests that our method can significantly reduce the cost and effort of pinpointing true disease genes in analyses of disorders for which numerous loci have been reported but for which most of the genes are unknown. PMID:16685651
Gupta, Gagan D.; Howes, Mark T.; Chandran, Ruma; Das, Anupam; Menon, Sindhu; Parton, Robert G.; Sowdhamini, R.; Thattai, Mukund; Mayor, Satyajit
2014-01-01
Single-cell-resolved measurements reveal heterogeneous distributions of clathrin-dependent (CD) and -independent (CLIC/GEEC: CG) endocytic activity in Drosophila cell populations. dsRNA-mediated knockdown of core versus peripheral endocytic machinery induces strong changes in the mean, or subtle changes in the shapes of these distributions, respectively. By quantifying these subtle shape changes for 27 single-cell features which report on endocytic activity and cell morphology, we organize 1072 Drosophila genes into a tree-like hierarchy. We find that tree nodes contain gene sets enriched in functional classes and protein complexes, providing a portrait of core and peripheral control of CD and CG endocytosis. For 470 genes we obtain additional features from separate assays and classify them into early- or late-acting genes of the endocytic pathways. Detailed analyses of specific genes at intermediate levels of the tree suggest that Vacuolar ATPase and lysosomal genes involved in vacuolar biogenesis play an evolutionarily conserved role in CG endocytosis. PMID:24971745
Genomic survey, expression profile and co-expression network analysis of OsWD40 family in rice
2012-01-01
Background WD40 proteins represent a large family in eukaryotes, which have been involved in a broad spectrum of crucial functions. Systematic characterization and co-expression analysis of OsWD40 genes enable us to understand the networks of the WD40 proteins and their biological processes and gene functions in rice. Results In this study, we identify and analyze 200 potential OsWD40 genes in rice, describing their gene structures, genome localizations, and evolutionary relationship of each member. Expression profiles covering the whole life cycle in rice has revealed that transcripts of OsWD40 were accumulated differentially during vegetative and reproductive development and preferentially up or down-regulated in different tissues. Under phytohormone treatments, 25 OsWD40 genes were differentially expressed with treatments of one or more of the phytohormone NAA, KT, or GA3 in rice seedlings. We also used a combined analysis of expression correlation and Gene Ontology annotation to infer the biological role of the OsWD40 genes in rice. The results suggested that OsWD40 genes may perform their diverse functions by complex network, thus were predictive for understanding their biological pathways. The analysis also revealed that OsWD40 genes might interact with each other to take part in metabolic pathways, suggesting a more complex feedback network. Conclusions All of these analyses suggest that the functions of OsWD40 genes are diversified, which provide useful references for selecting candidate genes for further functional studies. PMID:22429805
Goedbloed, D J; Czypionka, T; Altmüller, J; Rodriguez, A; Küpfer, E; Segev, O; Blaustein, L; Templeton, A R; Nolte, A W; Steinfartz, S
2017-12-01
The utilization of similar habitats by different species provides an ideal opportunity to identify genes underlying adaptation and acclimatization. Here, we analysed the gene expression of two closely related salamander species: Salamandra salamandra in Central Europe and Salamandra infraimmaculata in the Near East. These species inhabit similar habitat types: 'temporary ponds' and 'permanent streams' during larval development. We developed two species-specific gene expression microarrays, each targeting over 12 000 transcripts, including an overlapping subset of 8331 orthologues. Gene expression was examined for systematic differences between temporary ponds and permanent streams in larvae from both salamander species to establish gene sets and functions associated with these two habitat types. Only 20 orthologues were associated with a habitat in both species, but these orthologues did not show parallel expression patterns across species more than expected by chance. Functional annotation of a set of 106 genes with the highest effect size for a habitat suggested four putative gene function categories associated with a habitat in both species: cell proliferation, neural development, oxygen responses and muscle capacity. Among these high effect size genes was a single orthologue (14-3-3 protein zeta/YWHAZ) that was downregulated in temporary ponds in both species. The emergence of four gene function categories combined with a lack of parallel expression of orthologues (except 14-3-3 protein zeta) suggests that parallel habitat adaptation or acclimatization by larvae from S. salamandra and S. infraimmaculata to temporary ponds and permanent streams is mainly realized by different genes with a converging functionality.
Emdin, Connor A; Khera, Amit V; Chaffin, Mark; Klarin, Derek; Natarajan, Pradeep; Aragam, Krishna; Haas, Mary; Bick, Alexander; Zekavat, Seyedeh M; Nomura, Akihiro; Ardissino, Diego; Wilson, James G; Schunkert, Heribert; McPherson, Ruth; Watkins, Hugh; Elosua, Roberto; Bown, Matthew J; Samani, Nilesh J; Baber, Usman; Erdmann, Jeanette; Gupta, Namrata; Danesh, John; Chasman, Daniel; Ridker, Paul; Denny, Joshua; Bastarache, Lisa; Lichtman, Judith H; D'Onofrio, Gail; Mattera, Jennifer; Spertus, John A; Sheu, Wayne H-H; Taylor, Kent D; Psaty, Bruce M; Rich, Stephen S; Post, Wendy; Rotter, Jerome I; Chen, Yii-Der Ida; Krumholz, Harlan; Saleheen, Danish; Gabriel, Stacey; Kathiresan, Sekar
2018-04-24
Less than 3% of protein-coding genetic variants are predicted to result in loss of protein function through the introduction of a stop codon, frameshift, or the disruption of an essential splice site; however, such predicted loss-of-function (pLOF) variants provide insight into effector transcript and direction of biological effect. In >400,000 UK Biobank participants, we conduct association analyses of 3759 pLOF variants with six metabolic traits, six cardiometabolic diseases, and twelve additional diseases. We identified 18 new low-frequency or rare (allele frequency < 5%) pLOF variant-phenotype associations. pLOF variants in the gene GPR151 protect against obesity and type 2 diabetes, in the gene IL33 against asthma and allergic disease, and in the gene IFIH1 against hypothyroidism. In the gene PDE3B, pLOF variants associate with elevated height, improved body fat distribution and protection from coronary artery disease. Our findings prioritize genes for which pharmacologic mimics of pLOF variants may lower risk for disease.
Mensa-Vilaro, Anna; Teresa Bosque, María; Magri, Giuliana; Honda, Yoshitaka; Martínez-Banaclocha, Helios; Casorran-Berges, Marta; Sintes, Jordi; González-Roca, Eva; Ruiz-Ortiz, Estibaliz; Heike, Toshio; Martínez-Garcia, Juan J; Baroja-Mazo, Alberto; Cerutti, Andrea; Nishikomori, Ryuta; Yagüe, Jordi; Pelegrín, Pablo; Delgado-Beltran, Concha; Aróstegui, Juan I
2016-12-01
Gain-of-function NLRP3 mutations cause cryopyrin-associated periodic syndrome (CAPS), with gene mosaicism playing a relevant role in the pathogenesis. This study was undertaken to characterize the genetic cause underlying late-onset but otherwise typical CAPS. We studied a 64-year-old patient who presented with recurrent episodes of urticaria-like rash, fever, conjunctivitis, and oligoarthritis at age 56 years. DNA was extracted from both unfractionated blood and isolated leukocyte and CD34+ subpopulations. Genetic studies were performed using both the Sanger method of DNA sequencing and next-generation sequencing (NGS) methods. In vitro and ex vivo analyses were performed to determine the consequences that the presence of the variant have in the normal structure or function of the protein of the detected variant. NGS analyses revealed the novel p.Gln636Glu NLRP3 variant in unfractionated blood, with an allele frequency (18.4%) compatible with gene mosaicism. Sanger sequence chromatograms revealed a small peak corresponding to the variant allele. Amplicon-based deep sequencing revealed somatic NLRP3 mosaicism restricted to myeloid cells (31.8% in monocytes, 24.6% in neutrophils, and 11.2% in circulating CD34+ common myeloid progenitor cells) and its complete absence in lymphoid cells. Functional analyses confirmed the gain-of-function behavior of the gene variant and hyperactivity of the NLRP3 inflammasome in the patient. Treatment with anakinra resulted in good control of the disease. We identified the novel gain-of-function p.Gln636Glu NLRP3 mutation, which was detected as a somatic mutation restricted to myeloid cells, as the cause of late-onset but otherwise typical CAPS. Our results expand the diversity of CAPS toward milder phenotypes than previously reported, including those starting during adulthood. © 2016, American College of Rheumatology.
Genome-wide analysis of TCP family in tobacco.
Chen, L; Chen, Y Q; Ding, A M; Chen, H; Xia, F; Wang, W F; Sun, Y H
2016-05-23
The TCP family is a transcription factor family, members of which are extensively involved in plant growth and development as well as in signal transduction in the response against many physiological and biochemical stimuli. In the present study, 61 TCP genes were identified in tobacco (Nicotiana tabacum) genome. Bioinformatic methods were employed for predicting and analyzing the gene structure, gene expression, phylogenetic analysis, and conserved domains of TCP proteins in tobacco. The 61 NtTCP genes were divided into three diverse groups, based on the division of TCP genes in tomato and Arabidopsis, and the results of the conserved domain and sequence analyses further confirmed the classification of the NtTCP genes. The expression pattern of NtTCP also demonstrated that majority of these genes play important roles in all the tissues, while some special genes exercise their functions only in specific tissues. In brief, the comprehensive and thorough study of the TCP family in other plants provides sufficient resources for studying the structure and functions of TCPs in tobacco.
Insights into rubber biosynthesis from transcriptome analysis of Hevea brasiliensis latex.
Chow, Keng-See; Wan, Kiew-Lian; Isa, Mohd Noor Mat; Bahari, Azlina; Tan, Siang-Hee; Harikrishna, K; Yeang, Hoong-Yeet
2007-01-01
Hevea brasiliensis is the most widely cultivated species for commercial production of natural rubber (cis-polyisoprene). In this study, 10,040 expressed sequence tags (ESTs) were generated from the latex of the rubber tree, which represents the cytoplasmic content of a single cell type, in order to analyse the latex transcription profile with emphasis on rubber biosynthesis-related genes. A total of 3,441 unique transcripts (UTs) were obtained after quality editing and assembly of EST sequences. Functional classification of UTs according to the Gene Ontology convention showed that 73.8% were related to genes of unknown function. Among highly expressed ESTs, a significant proportion encoded proteins related to rubber biosynthesis and stress or defence responses. Sequences encoding rubber particle membrane proteins (RPMPs) belonging to three protein families accounted for 12% of the ESTs. Characterization of these ESTs revealed nine RPMP variants (7.9-27 kDa) including the 14 kDa REF (rubber elongation factor) and 22 kDa SRPP (small rubber particle protein). The expression of multiple RPMP isoforms in latex was shown using antibodies against REF and SRPP. Both EST and quantitative reverse transcription-PCR (QRT-PCR) analyses demonstrated REF and SRPP to be the most abundant transcripts in latex. Besides rubber biosynthesis, comparative sequence analysis showed that the RPMPs are highly similar to sequences in the plant kingdom having stress-related functions. Implications of the RPMP function in cis-polyisoprene biosynthesis in the context of transcript abundance and differential gene expression are discussed.
WRKY transcription factor genes in wild rice Oryza nivara.
Xu, Hengjian; Watanabe, Kenneth A; Zhang, Liyuan; Shen, Qingxi J
2016-08-01
The WRKY transcription factor family is one of the largest gene families involved in plant development and stress response. Although many WRKY genes have been studied in cultivated rice (Oryza sativa), the WRKY genes in the wild rice species Oryza nivara, the direct progenitor of O. sativa, have not been studied. O. nivara shows abundant genetic diversity and elite drought and disease resistance features. Herein, a total of 97 O. nivara WRKY (OnWRKY) genes were identified. RNA-sequencing demonstrates that OnWRKY genes were generally expressed at higher levels in the roots of 30-day-old plants. Bioinformatic analyses suggest that most of OnWRKY genes could be induced by salicylic acid, abscisic acid, and drought. Abundant potential MAPK phosphorylation sites in OnWRKYs suggest that activities of most OnWRKYs can be regulated by phosphorylation. Phylogenetic analyses of OnWRKYs support a novel hypothesis that ancient group IIc OnWRKYs were the original ancestors of only some group IIc and group III WRKYs. The analyses also offer strong support that group IIc OnWRKYs containing the HVE sequence in their zinc finger motifs were derived from group Ia WRKYs. This study provides a solid foundation for the study of the evolution and functions of WRKY genes in O. nivara. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
The PHF21B gene is associated with major depression and modulates the stress response.
Wong, M-L; Arcos-Burgos, M; Liu, S; Vélez, J I; Yu, C; Baune, B T; Jawahar, M C; Arolt, V; Dannlowski, U; Chuah, A; Huttley, G A; Fogarty, R; Lewis, M D; Bornstein, S R; Licinio, J
2017-07-01
Major depressive disorder (MDD) affects around 350 million people worldwide; however, the underlying genetic basis remains largely unknown. In this study, we took into account that MDD is a gene-environment disorder, in which stress is a critical component, and used whole-genome screening of functional variants to investigate the 'missing heritability' in MDD. Genome-wide association studies (GWAS) using single- and multi-locus linear mixed-effect models were performed in a Los Angeles Mexican-American cohort (196 controls, 203 MDD) and in a replication European-ancestry cohort (499 controls, 473 MDD). Our analyses took into consideration the stress levels in the control populations. The Mexican-American controls, comprised primarily of recent immigrants, had high levels of stress due to acculturation issues and the European-ancestry controls with high stress levels were given higher weights in our analysis. We identified 44 common and rare functional variants associated with mild to moderate MDD in the Mexican-American cohort (genome-wide false discovery rate, FDR, <0.05), and their pathway analysis revealed that the three top overrepresented Gene Ontology (GO) processes were innate immune response, glutamate receptor signaling and detection of chemical stimulus in smell sensory perception. Rare variant analysis replicated the association of the PHF21B gene in the ethnically unrelated European-ancestry cohort. The TRPM2 gene, previously implicated in mood disorders, may also be considered replicated by our analyses. Whole-genome sequencing analyses of a subset of the cohorts revealed that European-ancestry individuals have a significantly reduced (50%) number of single nucleotide variants compared with Mexican-American individuals, and for this reason the role of rare variants may vary across populations. PHF21b variants contribute significantly to differences in the levels of expression of this gene in several brain areas, including the hippocampus. Furthermore, using an animal model of stress, we found that Phf21b hippocampal gene expression is significantly decreased in animals resilient to chronic restraint stress when compared with non-chronically stressed animals. Together, our results reveal that including stress level data enables the identification of novel rare functional variants associated with MDD.
NEK1 variants confer susceptibility to amyotrophic lateral sclerosis
Kenna, Kevin P; van Doormaal, Perry T C; Dekker, Annelot M; Ticozzi, Nicola; Kenna, Brendan J; Diekstra, Frank P; van Rheenen, Wouter; van Eijk, Kristel R; Jones, Ashley R; Keagle, Pamela; Shatunov, Aleksey; Sproviero, William; Smith, Bradley N; van Es, Michael A; Topp, Simon D; Kenna, Aoife; Miller, Jack W; Fallini, Claudia; Tiloca, Cinzia; McLaughlin, Russell L; Vance, Caroline; Troakes, Claire; Colombrita, Claudia; Mora, Gabriele; Calvo, Andrea; Verde, Federico; Al-Sarraj, Safa; King, Andrew; Calini, Daniela; de Belleroche, Jacqueline; Baas, Frank; van der Kooi, Anneke J; de Visser, Marianne; Asbroek, Anneloor L M A ten; Sapp, Peter C; McKenna-Yasek, Diane; Polak, Meraida; Asress, Seneshaw; Muñoz-Blanco, José Luis; Strom, Tim M; Meitinger, Thomas; Morrison, Karen E; Lauria, Giuseppe; Williams, Kelly L; Leigh, P Nigel; Nicholson, Garth A; Blair, Ian P; Leblond, Claire S; Dion, Patrick A; Rouleau, Guy A; Pall, Hardev; Shaw, Pamela J; Turner, Martin R; Talbot, Kevin; Taroni, Franco; Boylan, Kevin B; Van Blitterswijk, Marka; Rademakers, Rosa; Esteban-Pérez, Jesús; García-Redondo, Alberto; Van Damme, Phillip; Robberecht, Wim; Chio, Adriano; Gellera, Cinzia; Drepper, Carsten; Sendtner, Michael; Ratti, Antonia; Glass, Jonathan D; Mora, Jesús S; Basak, Nazli A; Hardiman, Orla; Ludolph, Albert C; Andersen, Peter M; Weishaupt, Jochen H; Brown, Robert H; Al-Chalabi, Ammar; Silani, Vincenzo; Shaw, Christopher E; van den Berg, Leonard H; Veldink, Jan H; Landers, John E
2017-01-01
To identify genetic factors contributing to amyotrophic lateral sclerosis (ALS), we conducted whole-exome analyses of 1,022 index familial ALS (FALS) cases and 7,315 controls. In a new screening strategy, we performed gene-burden analyses trained with established ALS genes and identified a significant association between loss-of-function (LOF) NEK1 variants and FALS risk. Independently, autozygosity mapping for an isolated community in the Netherlands identified a NEK1 p.Arg261His variant as a candidate risk factor. Replication analyses of sporadic ALS (SALS) cases and independent control cohorts confirmed significant disease association for both p.Arg261His (10,589 samples analyzed) and NEK1 LOF variants (3,362 samples analyzed). In total, we observed NEK1 risk variants in nearly 3% of ALS cases. NEK1 has been linked to several cellular functions, including cilia formation, DNA-damage response, microtubule stability, neuronal morphology and axonal polarity. Our results provide new and important insights into ALS etiopathogenesis and genetic etiology. PMID:27455347
A putative regulatory genetic locus modulates virulence in the pathogen Leptospira interrogans.
Eshghi, Azad; Becam, Jérôme; Lambert, Ambroise; Sismeiro, Odile; Dillies, Marie-Agnès; Jagla, Bernd; Wunder, Elsio A; Ko, Albert I; Coppee, Jean-Yves; Goarant, Cyrille; Picardeau, Mathieu
2014-06-01
Limited research has been conducted on the role of transcriptional regulators in relation to virulence in Leptospira interrogans, the etiological agent of leptospirosis. Here, we identify an L. interrogans locus that encodes a sensor protein, an anti-sigma factor antagonist, and two genes encoding proteins of unknown function. Transposon insertion into the gene encoding the sensor protein led to dampened transcription of the other 3 genes in this locus. This lb139 insertion mutant (the lb139(-) mutant) displayed attenuated virulence in the hamster model of infection and reduced motility in vitro. Whole-transcriptome analyses using RNA sequencing revealed the downregulation of 115 genes and the upregulation of 28 genes, with an overrepresentation of gene products functioning in motility and signal transduction and numerous gene products with unknown functions, predicted to be localized to the extracellular space. Another significant finding encompassed suppressed expression of the majority of the genes previously demonstrated to be upregulated at physiological osmolarity, including the sphingomyelinase C precursor Sph2 and LigB. We provide insight into a possible requirement for transcriptional regulation as it relates to leptospiral virulence and suggest various biological processes that are affected due to the loss of native expression of this genetic locus.
Comparative whole genome transcriptome and metabolome analyses of five Klebsiella pneumonia strains.
Lee, Soojin; Kim, Borim; Yang, Jeongmo; Jeong, Daun; Park, Soohyun; Shin, Sang Heum; Kook, Jun Ho; Yang, Kap-Seok; Lee, Jinwon
2015-11-01
The integration of transcriptomics and metabolomics can provide precise information on gene-to-metabolite networks for identifying the function of novel genes. The goal of this study was to identify novel gene functions involved in 2,3-butanediol (2,3-BDO) biosynthesis by a comprehensive analysis of the transcriptome and metabolome of five mutated Klebsiella pneumonia strains (∆wabG = SGSB100, ∆wabG∆budA = SGSB106, ∆wabG∆budB = SGSB107, ∆wabG∆budC = SGSB108, ∆wabG∆budABC = SGSB109). First, the transcriptomes of all five mutants were analyzed and the genes exhibiting reproducible changes in expression were determined. The transcriptome was well conserved among the five strains, and differences in gene expression occurred mainly in genes coding for 2,3-BDO biosynthesis (budA, budB, and budC) and the genes involved in the degradation of reactive oxygen, biosynthesis and transport of arginine, cysteine biosynthesis, sulfur metabolism, oxidoreductase reaction, and formate dehydrogenase reaction. Second, differences in the metabolome (estimated by carbon distribution, CO2 emission, and redox balance) among the five mutant strains due to gene alteration of the 2,3-BDO operon were detected. The functional genomics approach integrating metabolomics and transcriptomics in K. Pneumonia presented here provides an innovative means of identifying novel gene functions involved in 2,3-BDO biosynthesis metabolism and whole cell metabolism.
2012-01-01
Background Natrialba magadii is an aerobic chemoorganotrophic member of the Euryarchaeota and is a dual extremophile requiring alkaline conditions and hypersalinity for optimal growth. The genome sequence of Nab. magadii type strain ATCC 43099 was deciphered to obtain a comprehensive insight into the genetic content of this haloarchaeon and to understand the basis of some of the cellular functions necessary for its survival. Results The genome of Nab. magadii consists of four replicons with a total sequence of 4,443,643 bp and encodes 4,212 putative proteins, some of which contain peptide repeats of various lengths. Comparative genome analyses facilitated the identification of genes encoding putative proteins involved in adaptation to hypersalinity, stress response, glycosylation, and polysaccharide biosynthesis. A proton-driven ATP synthase and a variety of putative cytochromes and other proteins supporting aerobic respiration and electron transfer were encoded by one or more of Nab. magadii replicons. The genome encodes a number of putative proteases/peptidases as well as protein secretion functions. Genes encoding putative transcriptional regulators, basal transcription factors, signal perception/transduction proteins, and chemotaxis/phototaxis proteins were abundant in the genome. Pathways for the biosynthesis of thiamine, riboflavin, heme, cobalamin, coenzyme F420 and other essential co-factors were deduced by in depth sequence analyses. However, approximately 36% of Nab. magadii protein coding genes could not be assigned a function based on Blast analysis and have been annotated as encoding hypothetical or conserved hypothetical proteins. Furthermore, despite extensive comparative genomic analyses, genes necessary for survival in alkaline conditions could not be identified in Nab. magadii. Conclusions Based on genomic analyses, Nab. magadii is predicted to be metabolically versatile and it could use different carbon and energy sources to sustain growth. Nab. magadii has the genetic potential to adapt to its milieu by intracellular accumulation of inorganic cations and/or neutral organic compounds. The identification of Nab. magadii genes involved in coenzyme biosynthesis is a necessary step toward further reconstruction of the metabolic pathways in halophilic archaea and other extremophiles. The knowledge gained from the genome sequence of this haloalkaliphilic archaeon is highly valuable in advancing the applications of extremophiles and their enzymes. PMID:22559199
Annotation of gene function in citrus using gene expression information and co-expression networks
2014-01-01
Background The genus Citrus encompasses major cultivated plants such as sweet orange, mandarin, lemon and grapefruit, among the world’s most economically important fruit crops. With increasing volumes of transcriptomics data available for these species, Gene Co-expression Network (GCN) analysis is a viable option for predicting gene function at a genome-wide scale. GCN analysis is based on a “guilt-by-association” principle whereby genes encoding proteins involved in similar and/or related biological processes may exhibit similar expression patterns across diverse sets of experimental conditions. While bioinformatics resources such as GCN analysis are widely available for efficient gene function prediction in model plant species including Arabidopsis, soybean and rice, in citrus these tools are not yet developed. Results We have constructed a comprehensive GCN for citrus inferred from 297 publicly available Affymetrix Genechip Citrus Genome microarray datasets, providing gene co-expression relationships at a genome-wide scale (33,000 transcripts). The comprehensive citrus GCN consists of a global GCN (condition-independent) and four condition-dependent GCNs that survey the sweet orange species only, all citrus fruit tissues, all citrus leaf tissues, or stress-exposed plants. All of these GCNs are clustered using genome-wide, gene-centric (guide) and graph clustering algorithms for flexibility of gene function prediction. For each putative cluster, gene ontology (GO) enrichment and gene expression specificity analyses were performed to enhance gene function, expression and regulation pattern prediction. The guide-gene approach was used to infer novel roles of genes involved in disease susceptibility and vitamin C metabolism, and graph-clustering approaches were used to investigate isoprenoid/phenylpropanoid metabolism in citrus peel, and citric acid catabolism via the GABA shunt in citrus fruit. Conclusions Integration of citrus gene co-expression networks, functional enrichment analysis and gene expression information provide opportunities to infer gene function in citrus. We present a publicly accessible tool, Network Inference for Citrus Co-Expression (NICCE, http://citrus.adelaide.edu.au/nicce/home.aspx), for the gene co-expression analysis in citrus. PMID:25023870
Xu, Aishi; Li, Guang; Yang, Dong; Wu, Songfeng; Ouyang, Hongsheng; Xu, Ping; He, Fuchu
2015-12-04
Although the "missing protein" is a temporary concept in C-HPP, the biological information for their "missing" could be an important clue in evolutionary studies. Here we classified missing-protein-encoding genes into two groups, the genes encoding PE2 proteins (with transcript evidence) and the genes encoding PE3/4 proteins (with no transcript evidence). These missing-protein-encoding genes distribute unevenly among different chromosomes, chromosomal regions, or gene clusters. In the view of evolutionary features, PE3/4 genes tend to be young, spreading at the nonhomology chromosomal regions and evolving at higher rates. Interestingly, there is a higher proportion of singletons in PE3/4 genes than the proportion of singletons in all genes (background) and OTCSGs (organ, tissue, cell type-specific genes). More importantly, most of the paralogous PE3/4 genes belong to the newly duplicated members of the paralogous gene groups, which mainly contribute to special biological functions, such as "smell perception". These functions are heavily restricted into specific type of cells, tissues, or specific developmental stages, acting as the new functional requirements that facilitated the emergence of the missing-protein-encoding genes during evolution. In addition, the criteria for the extremely special physical-chemical proteins were first set up based on the properties of PE2 proteins, and the evolutionary characteristics of those proteins were explored. Overall, the evolutionary analyses of missing-protein-encoding genes are expected to be highly instructive for proteomics and functional studies in the future.
Dynamic changes in gene expression during human trophoblast differentiation.
Handwerger, Stuart; Aronow, Bruce
2003-01-01
The genetic program that directs human placental differentiation is poorly understood. In a recent study, we used DNA microarray analyses to determine genes that are dynamically regulated during human placental development in an in vitro model system in which highly purified cytotrophoblast cells aggregate spontaneously and fuse to form a multinucleated syncytium that expresses placental lactogen, human chorionic gonadotropin, and other proteins normally expressed by fully differentiated syncytiotrophoblast cells. Of the 6918 genes present on the Incyte Human GEM V microarray that we analyzed over a 9-day period, 141 were induced and 256 were downregulated by more than 2-fold. The dynamically regulated genes fell into nine distinct kinetic patterns of induction or repression, as detected by the K-means algorithm. Classifying the genes according to functional characteristics, the regulated genes could be divided into six overall categories: cell and tissue structural dynamics, cell cycle and apoptosis, intercellular communication, metabolism, regulation of gene expression, and expressed sequence tags and function unknown. Gene expression changes within key functional categories were tightly coupled to the morphological changes that occurred during trophoblast differentiation. Within several key gene categories (e.g., cell and tissue structure), many genes were strongly activated, while others with related function were strongly repressed. These findings suggest that trophoblast differentiation is augmented by "categorical reprogramming" in which the ability of induced genes to function is enhanced by diminished synthesis of other genes within the same category. We also observed categorical reprogramming in human decidual fibroblasts decidualized in vitro in response to progesterone, estradiol, and cyclic AMP. While there was little overlap between genes that are dynamically regulated during trophoblast differentiation versus decidualization, many of the categories in which genes were strongly activated also contained genes whose expression was strongly diminished. Taken together, these findings point to a fundamental role for simultaneous induction and repression of mRNAs that encode functionally related proteins during the differentiation process.
Moon, Sunok; Oo, Moe Moe; Kim, Backki; Koh, Hee-Jong; Oh, Sung Aeong; Yi, Gihwan; An, Gynheung; Park, Soon Ki; Jung, Ki-Hong
2018-04-23
Understanding late pollen development, including the maturation and pollination process, is a key component in maintaining crop yields. Transcriptome data obtained through microarray or RNA-seq technologies can provide useful insight into those developmental processes. Six series of microarray data from a public transcriptome database, the Gene Expression Omnibus of the National Center for Biotechnology Information, are related to anther and pollen development. We performed a systematic and functional study across the rice genome of genes that are preferentially expressed in the late stages of pollen development, including maturation and germination. By comparing the transcriptomes of sporophytes and male gametes over time, we identified 627 late pollen-preferred genes that are conserved among japonica and indica rice cultivars. Functional classification analysis with a MapMan tool kit revealed a significant association between cell wall organization/metabolism and mature pollen grains. Comparative analysis of rice and Arabidopsis demonstrated that genes involved in cell wall modifications and the metabolism of major carbohydrates are unique to rice. We used the GUS reporter system to monitor the expression of eight of those genes. In addition, we evaluated the significance of our candidate genes, using T-DNA insertional mutant population and the CRISPR/Cas9 system. Mutants from T-DNA insertion and CRISPR/Cas9 systems of a rice gene encoding glycerophosphoryl diester phosphodiesterase are defective in their male gamete transfer. Through the global analyses of the late pollen-preferred genes from rice, we found several biological features of these genes. First, biological process related to cell wall organization and modification is over-represented in these genes to support rapid tube growth. Second, comparative analysis of late pollen preferred genes between rice and Arabidopsis provide a significant insight on the evolutional disparateness in cell wall biogenesis and storage reserves of pollen. In addition, these candidates might be useful targets for future examinations of late pollen development, and will be a valuable resource for accelerating the understanding of molecular mechanisms for pollen maturation and germination processes in rice.
Yan, Bo; Neilson, Karen M.; Ranganathan, Ramya; Maynard, Thomas; Streit, Andrea; Moody, Sally A.
2014-01-01
Background Six1 plays an important role in the development of several vertebrate organs, including cranial sensory placodes, somites and kidney. Although Six1 mutations cause one form of Branchio-Otic Syndrome (BOS), the responsible gene in many patients has not been identified; genes that act downstream of Six1 are potential BOS candidates. Results We sought to identify novel genes expressed during placode, somite and kidney development by comparing gene expression between control and Six1-expressing ectodermal explants. The expression patterns of 19 of the significantly up-regulated and 11 of the significantly down-regulated genes were assayed from cleavage to larval stages. 28/30 genes are expressed in the otocyst, a structure that is functionally disrupted in BOS, and 26/30 genes are expressed in the nephric mesoderm, a structure that is functionally disrupted in the related Branchio-Otic-Renal (BOR) syndrome. We also identified the chick homologues of 5 genes and show that they have conserved expression patterns. Conclusions Of the 30 genes selected for expression analyses, all are expressed at many of the developmental times and appropriate tissues to be regulated by Six1. Many have the potential to play a role in the disruption of hearing and kidney function seen in BOS/BOR patients. PMID:25403746
Redefining C and D in the petunia ABC.
Heijmans, Klaas; Ament, Kai; Rijpkema, Anneke S; Zethof, Jan; Wolters-Arts, Mieke; Gerats, Tom; Vandenbussche, Michiel
2012-06-01
According to the ABC(DE) model for flower development, C-genes are required for stamen and carpel development and floral determinacy, and D-genes were proposed to play a unique role in ovule development. Both C- and D-genes belong to the AGAMOUS (AG) subfamily of MADS box transcription factors. We show that the petunia (Petunia hybrida) C-clade genes PETUNIA MADS BOX GENE3 and FLORAL BINDING PROTEIN6 (FBP6) largely overlap in function, both in floral organ identity specification and floral determinacy, unlike the pronounced subfunctionalization observed in Arabidopsis thaliana and snapdragon (Antirrhinum majus). Some specialization has also evolved, since FBP6 plays a unique role in the development of the style and stigma. Furthermore, we show that the D-genes FBP7 and FBP11 are not essential to confer ovule identity. Instead, this function is redundantly shared among all AG members. In turn, the D-genes also participate in floral determinacy. Gain-of-function analyses suggest the presence of a posttranscriptional C-repression mechanism in petunia, most likely not existing in Arabidopsis. Finally, we show that expression maintenance of the paleoAPETALA3-type B-gene TOMATO MADS BOX GENE6 depends on the activity of C-genes. Taken together, this demonstrates considerable variation in the molecular control of floral development between eudicot species.
Redefining C and D in the Petunia ABC[W
Heijmans, Klaas; Ament, Kai; Rijpkema, Anneke S.; Zethof, Jan; Wolters-Arts, Mieke; Gerats, Tom; Vandenbussche, Michiel
2012-01-01
According to the ABC(DE) model for flower development, C-genes are required for stamen and carpel development and floral determinacy, and D-genes were proposed to play a unique role in ovule development. Both C- and D-genes belong to the AGAMOUS (AG) subfamily of MADS box transcription factors. We show that the petunia (Petunia hybrida) C-clade genes PETUNIA MADS BOX GENE3 and FLORAL BINDING PROTEIN6 (FBP6) largely overlap in function, both in floral organ identity specification and floral determinacy, unlike the pronounced subfunctionalization observed in Arabidopsis thaliana and snapdragon (Antirrhinum majus). Some specialization has also evolved, since FBP6 plays a unique role in the development of the style and stigma. Furthermore, we show that the D-genes FBP7 and FBP11 are not essential to confer ovule identity. Instead, this function is redundantly shared among all AG members. In turn, the D-genes also participate in floral determinacy. Gain-of-function analyses suggest the presence of a posttranscriptional C-repression mechanism in petunia, most likely not existing in Arabidopsis. Finally, we show that expression maintenance of the paleoAPETALA3-type B-gene TOMATO MADS BOX GENE6 depends on the activity of C-genes. Taken together, this demonstrates considerable variation in the molecular control of floral development between eudicot species. PMID:22706285
Functional genomics indicate that schizophrenia may be an adult vascular-ischemic disorder
Moises, H W; Wollschläger, D; Binder, H
2015-01-01
In search for the elusive schizophrenia pathway, candidate genes for the disorder from a discovery sample were localized within the energy-delivering and ischemia protection pathway. To test the adult vascular-ischemic (AVIH) and the competing neurodevelopmental hypothesis (NDH), functional genomic analyses of practically all available schizophrenia-associated genes from candidate gene, genome-wide association and postmortem expression studies were performed. Our results indicate a significant overrepresentation of genes involved in vascular function (P<0.001), vasoregulation (that is, perivascular (P<0.001) and shear stress (P<0.01), cerebral ischemia (P<0.001), neurodevelopment (P<0.001) and postischemic repair (P<0.001) among schizophrenia-associated genes from genetic association studies. These findings support both the NDH and the AVIH. The genes from postmortem studies showed an upregulation of vascular-ischemic genes (P=0.020) combined with downregulated synaptic (P=0.005) genes, and ND/repair (P=0.003) genes. Evidence for the AVIH and the NDH is critically discussed. We conclude that schizophrenia is probably a mild adult vascular-ischemic and postischemic repair disorder. Adult postischemic repair involves ND genes for adult neurogenesis, synaptic plasticity, glutamate and increased long-term potentiation of excitatory neurotransmission (i-LTP). Schizophrenia might be caused by the cerebral analog of microvascular angina. PMID:26261884
Functional genomics indicate that schizophrenia may be an adult vascular-ischemic disorder.
Moises, H W; Wollschläger, D; Binder, H
2015-08-11
In search for the elusive schizophrenia pathway, candidate genes for the disorder from a discovery sample were localized within the energy-delivering and ischemia protection pathway. To test the adult vascular-ischemic (AVIH) and the competing neurodevelopmental hypothesis (NDH), functional genomic analyses of practically all available schizophrenia-associated genes from candidate gene, genome-wide association and postmortem expression studies were performed. Our results indicate a significant overrepresentation of genes involved in vascular function (P < 0.001), vasoregulation (that is, perivascular (P < 0.001) and shear stress (P < 0.01), cerebral ischemia (P < 0.001), neurodevelopment (P < 0.001) and postischemic repair (P < 0.001) among schizophrenia-associated genes from genetic association studies. These findings support both the NDH and the AVIH. The genes from postmortem studies showed an upregulation of vascular-ischemic genes (P = 0.020) combined with downregulated synaptic (P = 0.005) genes, and ND/repair (P = 0.003) genes. Evidence for the AVIH and the NDH is critically discussed. We conclude that schizophrenia is probably a mild adult vascular-ischemic and postischemic repair disorder. Adult postischemic repair involves ND genes for adult neurogenesis, synaptic plasticity, glutamate and increased long-term potentiation of excitatory neurotransmission (i-LTP). Schizophrenia might be caused by the cerebral analog of microvascular angina.
Suo, Chen; Hrydziuszko, Olga; Lee, Donghwan; Pramana, Setia; Saputra, Dhany; Joshi, Himanshu; Calza, Stefano; Pawitan, Yudi
2015-08-15
Genome and transcriptome analyses can be used to explore cancers comprehensively, and it is increasingly common to have multiple omics data measured from each individual. Furthermore, there are rich functional data such as predicted impact of mutations on protein coding and gene/protein networks. However, integration of the complex information across the different omics and functional data is still challenging. Clinical validation, particularly based on patient outcomes such as survival, is important for assessing the relevance of the integrated information and for comparing different procedures. An analysis pipeline is built for integrating genomic and transcriptomic alterations from whole-exome and RNA sequence data and functional data from protein function prediction and gene interaction networks. The method accumulates evidence for the functional implications of mutated potential driver genes found within and across patients. A driver-gene score (DGscore) is developed to capture the cumulative effect of such genes. To contribute to the score, a gene has to be frequently mutated, with high or moderate mutational impact at protein level, exhibiting an extreme expression and functionally linked to many differentially expressed neighbors in the functional gene network. The pipeline is applied to 60 matched tumor and normal samples of the same patient from The Cancer Genome Atlas breast-cancer project. In clinical validation, patients with high DGscores have worse survival than those with low scores (P = 0.001). Furthermore, the DGscore outperforms the established expression-based signatures MammaPrint and PAM50 in predicting patient survival. In conclusion, integration of mutation, expression and functional data allows identification of clinically relevant potential driver genes in cancer. The documented pipeline including annotated sample scripts can be found in http://fafner.meb.ki.se/biostatwiki/driver-genes/. yudi.pawitan@ki.se Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Structural and functional partitioning of bread wheat chromosome 3B.
Choulet, Frédéric; Alberti, Adriana; Theil, Sébastien; Glover, Natasha; Barbe, Valérie; Daron, Josquin; Pingault, Lise; Sourdille, Pierre; Couloux, Arnaud; Paux, Etienne; Leroy, Philippe; Mangenot, Sophie; Guilhot, Nicolas; Le Gouis, Jacques; Balfourier, Francois; Alaux, Michael; Jamilloux, Véronique; Poulain, Julie; Durand, Céline; Bellec, Arnaud; Gaspin, Christine; Safar, Jan; Dolezel, Jaroslav; Rogers, Jane; Vandepoele, Klaas; Aury, Jean-Marc; Mayer, Klaus; Berges, Hélène; Quesneville, Hadi; Wincker, Patrick; Feuillet, Catherine
2014-07-18
We produced a reference sequence of the 1-gigabase chromosome 3B of hexaploid bread wheat. By sequencing 8452 bacterial artificial chromosomes in pools, we assembled a sequence of 774 megabases carrying 5326 protein-coding genes, 1938 pseudogenes, and 85% of transposable elements. The distribution of structural and functional features along the chromosome revealed partitioning correlated with meiotic recombination. Comparative analyses indicated high wheat-specific inter- and intrachromosomal gene duplication activities that are potential sources of variability for adaption. In addition to providing a better understanding of the organization, function, and evolution of a large and polyploid genome, the availability of a high-quality sequence anchored to genetic maps will accelerate the identification of genes underlying important agronomic traits. Copyright © 2014, American Association for the Advancement of Science.
Conceptualizing adverse outcome pathways for ...
Cyclooxygenase (COX) inhibition is of concern in fish because COX inhibitors (e.g., ibuprofen) are ubiquitous in aquatic systems/fish tissues, and can disrupt synthesis of prostaglandins that modulate a variety of essential biological functions (e.g., reproduction). This study utilized newly generated high content (transcriptomic and metabolomic) empirical data in combination with existing high throughput (ACTOR, epa.gov) toxicity data to facilitate development of adverse outcome pathways (AOPs) for molecular initiating event (MIE) of COX inhibition. We examined effects of a waterborne, 96h exposure to three COX inhibitors (indomethacin (IN; 100 µg/L), ibuprofen (IB; 200 µg/L) and celecoxib (CX; 20 µg/L) on the liver metabolome and ovarian gene expression (using oligonucleotide microarray 4 x15K platform) in sexually mature fathead minnows (n=8). Differentially expressed genes were identified (t-test, p < 0.01), and functional analyses performed to determine enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways (p < 0.05). Principal component analysis indicated that liver metabolomics profiles of IN, IB and CX were not significantly different from control or one another. When compared to control, exposure to IB and CX resulted in differential expression of comparable numbers of genes (IB = 433, CX= 545). In contrast, 2558 genes were differentially expressed in IN-treated fish. KEGG pathway analyses show that IN had extensive effects on oocyte meios
Construction and Analysis of Functional Networks in the Gut Microbiome of Type 2 Diabetes Patients.
Li, Lianshuo; Wang, Zicheng; He, Peng; Ma, Shining; Du, Jie; Jiang, Rui
2016-10-01
Although networks of microbial species have been widely used in the analysis of 16S rRNA sequencing data of a microbiome, the construction and analysis of a complete microbial gene network are in general problematic because of the large number of microbial genes in metagenomics studies. To overcome this limitation, we propose to map microbial genes to functional units, including KEGG orthologous groups and the evolutionary genealogy of genes: Non-supervised Orthologous Groups (eggNOG) orthologous groups, to enable the construction and analysis of a microbial functional network. We devised two statistical methods to infer pairwise relationships between microbial functional units based on a deep sequencing dataset of gut microbiome from type 2 diabetes (T2D) patients as well as healthy controls. Networks containing such functional units and their significant interactions were constructed subsequently. We conducted a variety of analyses of global properties, local properties, and functional modules in the resulting functional networks. Our data indicate that besides the observations consistent with the current knowledge, this study provides novel biological insights into the gut microbiome associated with T2D. Copyright © 2016. Production and hosting by Elsevier Ltd.
Li, Shicheng; Sun, Xiao; Miao, Shuncheng; Liu, Jia; Jiao, Wenjie
2017-11-01
Cigarette smoking is one of the greatest preventable risk factors for developing cancer, and most cases of lung squamous cell carcinoma (lung SCC) are associated with smoking. The pathogenesis mechanism of tumor progress is unclear. This study aimed to identify biomarkers in smoking-related lung cancer, including protein-coding gene, long noncoding RNA, and transcription factors. We selected and obtained messenger RNA microarray datasets and clinical data from the Gene Expression Omnibus database to identify gene expression altered by cigarette smoking. Integrated bioinformatic analysis was used to clarify biological functions of the identified genes, including Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway, the construction of a protein-protein interaction network, transcription factor, and statistical analyses. Subsequent quantitative real-time PCR was utilized to verify these bioinformatic analyses. Five hundred and ninety-eight differentially expressed genes and 21 long noncoding RNA were identified in smoking-related lung SCC. GO and KEGG pathway analysis showed that identified genes were enriched in the cancer-related functions and pathways. The protein-protein interaction network revealed seven hub genes identified in lung SCC. Several transcription factors and their binding sites were predicted. The results of real-time quantitative PCR revealed that AURKA and BIRC5 were significantly upregulated and LINC00094 was downregulated in the tumor tissues of smoking patients. Further statistical analysis indicated that dysregulation of AURKA, BIRC5, and LINC00094 indicated poor prognosis in lung SCC. Protein-coding genes AURKA, BIRC5, and LINC00094 could be biomarkers or therapeutic targets for smoking-related lung SCC. © 2017 The Authors. Thoracic Cancer published by China Lung Oncology Group and John Wiley & Sons Australia, Ltd.
Integrated Analyses of Gene Expression Profiles Digs out Common Markers for Rheumatic Diseases
Wang, Lan; Wu, Long-Fei; Lu, Xin; Mo, Xing-Bo; Tang, Zai-Xiang; Lei, Shu-Feng; Deng, Fei-Yan
2015-01-01
Objective Rheumatic diseases have some common symptoms. Extensive gene expression studies, accumulated thus far, have successfully identified signature molecules for each rheumatic disease, individually. However, whether there exist shared factors across rheumatic diseases has yet to be tested. Methods We collected and utilized 6 public microarray datasets covering 4 types of representative rheumatic diseases including rheumatoid arthritis, systemic lupus erythematosus, ankylosing spondylitis, and osteoarthritis. Then we detected overlaps of differentially expressed genes across datasets and performed a meta-analysis aiming at identifying common differentially expressed genes that discriminate between pathological cases and normal controls. To further gain insights into the functions of the identified common differentially expressed genes, we conducted gene ontology enrichment analysis and protein-protein interaction analysis. Results We identified a total of eight differentially expressed genes (TNFSF10, CX3CR1, LY96, TLR5, TXN, TIA1, PRKCH, PRF1), each associated with at least 3 of the 4 studied rheumatic diseases. Meta-analysis warranted the significance of the eight genes and highlighted the general significance of four genes (CX3CR1, LY96, TLR5, and PRF1). Protein-protein interaction and gene ontology enrichment analyses indicated that the eight genes interact with each other to exert functions related to immune response and immune regulation. Conclusion The findings support that there exist common factors underlying rheumatic diseases. For rheumatoid arthritis, systemic lupus erythematosus, ankylosing spondylitis and osteoarthritis diseases, those common factors include TNFSF10, CX3CR1, LY96, TLR5, TXN, TIA1, PRKCH, and PRF1. In-depth studies on these common factors may provide keys to understanding the pathogenesis and developing intervention strategies for rheumatic diseases. PMID:26352601
2010-01-01
Background Terpenoids are among the most important constituents of grape flavour and wine bouquet, and serve as useful metabolite markers in viticulture and enology. Based on the initial 8-fold sequencing of a nearly homozygous Pinot noir inbred line, 89 putative terpenoid synthase genes (VvTPS) were predicted by in silico analysis of the grapevine (Vitis vinifera) genome assembly [1]. The finding of this very large VvTPS family, combined with the importance of terpenoid metabolism for the organoleptic properties of grapevine berries and finished wines, prompted a detailed examination of this gene family at the genomic level as well as an investigation into VvTPS biochemical functions. Results We present findings from the analysis of the up-dated 12-fold sequencing and assembly of the grapevine genome that place the number of predicted VvTPS genes at 69 putatively functional VvTPS, 20 partial VvTPS, and 63 VvTPS probable pseudogenes. Gene discovery and annotation included information about gene architecture and chromosomal location. A dense cluster of 45 VvTPS is localized on chromosome 18. Extensive FLcDNA cloning, gene synthesis, and protein expression enabled functional characterization of 39 VvTPS; this is the largest number of functionally characterized TPS for any species reported to date. Of these enzymes, 23 have unique functions and/or phylogenetic locations within the plant TPS gene family. Phylogenetic analyses of the TPS gene family showed that while most VvTPS form species-specific gene clusters, there are several examples of gene orthology with TPS of other plant species, representing perhaps more ancient VvTPS, which have maintained functions independent of speciation. Conclusions The highly expanded VvTPS gene family underpins the prominence of terpenoid metabolism in grapevine. We provide a detailed experimental functional annotation of 39 members of this important gene family in grapevine and comprehensive information about gene structure and phylogeny for the entire currently known VvTPS gene family. PMID:20964856
Drews, Anna; Strandh, Maria; Råberg, Lars; Westerdahl, Helena
2017-06-26
The Major Histocompatibility Complex (MHC) plays a central role in immunity and has been given considerable attention by evolutionary ecologists due to its associations with fitness-related traits. Songbirds have unusually high numbers of MHC class I (MHC-I) genes, but it is not known whether all are expressed and equally important for immune function. Classical MHC-I genes are highly expressed, polymorphic and present peptides to T-cells whereas non-classical MHC-I genes have lower expression, are more monomorphic and do not present peptides to T-cells. To get a better understanding of the highly duplicated MHC genes in songbirds, we studied gene expression in a phylogenetic framework in three species of sparrows (house sparrow, tree sparrow and Spanish sparrow), using high-throughput sequencing. We hypothesize that sparrows could have classical and non-classical genes, as previously indicated though never tested using gene expression. The phylogenetic analyses reveal two distinct types of MHC-I alleles among the three sparrow species, one with high and one with low level of polymorphism, thus resembling classical and non-classical genes, respectively. All individuals had both types of alleles, but there was copy number variation both within and among the sparrow species. However, the number of highly polymorphic alleles that were expressed did not vary between species, suggesting that the structural genomic variation is counterbalanced by conserved gene expression. Overall, 50% of the MHC-I alleles were expressed in sparrows. Expression of the highly polymorphic alleles was very variable, whereas the alleles with low polymorphism had uniformly low expression. Interestingly, within an individual only one or two alleles from the polymorphic genes were highly expressed, indicating that only a single copy of these is highly expressed. Taken together, the phylogenetic reconstruction and the analyses of expression suggest that sparrows have both classical and non-classical MHC-I genes, and that the evolutionary origin of these genes predate the split of the three investigated sparrow species 7 million years ago. Because only the classical MHC-I genes are involved in antigen presentation, the function of different MHC-I genes should be considered in future ecological and evolutionary studies of MHC-I in sparrows and other songbirds.
2011-01-01
Background The aryl hydrocarbon receptor (AhR) is a ligand-activated transcription factor (TF) that mediates responses to 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). Integration of TCDD-induced genome-wide AhR enrichment, differential gene expression and computational dioxin response element (DRE) analyses further elucidate the hepatic AhR regulatory network. Results Global ChIP-chip and gene expression analyses were performed on hepatic tissue from immature ovariectomized mice orally gavaged with 30 μg/kg TCDD. ChIP-chip analysis identified 14,446 and 974 AhR enriched regions (1% false discovery rate) at 2 and 24 hrs, respectively. Enrichment density was greatest in the proximal promoter, and more specifically, within ± 1.5 kb of a transcriptional start site (TSS). AhR enrichment also occurred distal to a TSS (e.g. intergenic DNA and 3' UTR), extending the potential gene expression regulatory roles of the AhR. Although TF binding site analyses identified over-represented DRE sequences within enriched regions, approximately 50% of all AhR enriched regions lacked a DRE core (5'-GCGTG-3'). Microarray analysis identified 1,896 number of TCDD-responsive genes (|fold change| ≥ 1.5, P1(t) > 0.999). Integrating this gene expression data with our ChIP-chip and DRE analyses only identified 625 differentially expressed genes that involved an AhR interaction at a DRE. Functional annotation analysis of differentially regulated genes associated with AhR enrichment identified overrepresented processes related to fatty acid and lipid metabolism and transport, and xenobiotic metabolism, which are consistent with TCDD-elicited steatosis in the mouse liver. Conclusions Details of the AhR regulatory network have been expanded to include AhR-DNA interactions within intragenic and intergenic genomic regions. Moreover, the AhR can interact with DNA independent of a DRE core suggesting there are alternative mechanisms of AhR-mediated gene regulation. PMID:21762485
Kehrmann, Jan; Tatura, Roman; Zeschnigk, Michael; Probst-Kepper, Michael; Geffers, Robert; Steinmann, Joerg; Buer, Jan
2014-07-01
The epigenetic regulation of transcription factor genes is critical for T-cell lineage specification. A specific methylation pattern within a conserved region of the lineage specifying transcription factor gene FOXP3, the Treg-specific demethylated region (TSDR), is restricted to regulatory T (Treg) cells and is required for stable expression of FOXP3 and suppressive function. We analysed the impact of hypomethylating agents 5-aza-2'-deoxycytidine and epigallocatechin-3-gallate on human CD4(+) CD25(-) T cells for generating demethylation within FOXP3-TSDR and inducing functional Treg cells. Gene expression, including lineage-specifying transcription factors of the major T-cell lineages and their leading cytokines, functional properties and global transcriptome changes were analysed. The FOXP3-TSDR methylation pattern was determined by using deep amplicon bisulphite sequencing. 5-aza-2'-deoxycytidine induced FOXP3-TSDR hypomethylation and expression of the Treg-cell-specific genes FOXP3 and LRRC32. Proliferation of 5-aza-2'-deoxycytidine-treated cells was reduced, but the cells did not show suppressive function. Hypomethylation was not restricted to FOXP3-TSDR and expression of master transcription factors and leading cytokines of T helper type 1 and type 17 cells were induced. Epigallocatechin-3-gallate induced global DNA hypomethylation to a lesser extent than 5-aza-2'-deoxycitidine, but no relevant hypomethylation within FOXP3-TSDR or expression of Treg-cell-specific genes. Neither of the DNA methyltransferase inhibitors induced fully functional human Treg cells. 5-aza-2'-deoxycitidine-treated cells resembled Treg cells, but they did not suppress proliferation of responder cells, which is an essential capability to be used for Treg cell transfer therapy. Using a recently developed targeted demethylation technology might be a more promising approach for the generation of functional Treg cells. © 2014 John Wiley & Sons Ltd.
Gene expression profiling of pre-eclamptic placentae by RNA sequencing.
Kaartokallio, Tea; Cervera, Alejandra; Kyllönen, Anjuska; Laivuori, Krista; Kere, Juha; Laivuori, Hannele
2015-09-21
Pre-eclampsia is a common and complex pregnancy disorder that often involves impaired placental development. In order to identify altered gene expression in pre-eclamptic placenta, we sequenced placental transcriptomes of nine pre-eclamptic and nine healthy pregnant women in pools of three. The differential gene expression was tested both by including all the pools in the analysis and by excluding some of the pools based on phenotypic characteristics. From these analyses, we identified altogether 53 differently expressed genes, a subset of which was validated by qPCR in 20 cases and 19 controls. Furthermore, we conducted pathway and functional analyses which revealed disturbed vascular function and immunological balance in pre-eclamptic placenta. Some of the genes identified in our study have been reported by numerous microarray studies (BHLHE40, FSTL3, HK2, HTRA4, LEP, PVRL4, SASH1, SIGLEC6), but many have been implicated in only few studies or have not previously been linked to pre-eclampsia (ARMS2, BTNL9, CCSAP, DIO2, FER1L4, HPSE, LOC100129345, LYN, MYO7B, NCMAP, NDRG1, NRIP1, PLIN2, SBSPON, SERPINB9, SH3BP5, TET3, TPBG, ZNF175). Several of the molecules produced by these genes may have a role in the pathogenesis of pre-eclampsia, and some could qualify as biomarkers for prediction or detection of this pregnancy complication.
Gene expression profiling of pre-eclamptic placentae by RNA sequencing
Kaartokallio, Tea; Cervera, Alejandra; Kyllönen, Anjuska; Laivuori, Krista; Laivuori, Hannele; Heinonen, Seppo; Kajantie, Eero; Kere, Juha; Kivinen, Katja; Pouta, Anneli
2015-01-01
Pre-eclampsia is a common and complex pregnancy disorder that often involves impaired placental development. In order to identify altered gene expression in pre-eclamptic placenta, we sequenced placental transcriptomes of nine pre-eclamptic and nine healthy pregnant women in pools of three. The differential gene expression was tested both by including all the pools in the analysis and by excluding some of the pools based on phenotypic characteristics. From these analyses, we identified altogether 53 differently expressed genes, a subset of which was validated by qPCR in 20 cases and 19 controls. Furthermore, we conducted pathway and functional analyses which revealed disturbed vascular function and immunological balance in pre-eclamptic placenta. Some of the genes identified in our study have been reported by numerous microarray studies (BHLHE40, FSTL3, HK2, HTRA4, LEP, PVRL4, SASH1, SIGLEC6), but many have been implicated in only few studies or have not previously been linked to pre-eclampsia (ARMS2, BTNL9, CCSAP, DIO2, FER1L4, HPSE, LOC100129345, LYN, MYO7B, NCMAP, NDRG1, NRIP1, PLIN2, SBSPON, SERPINB9, SH3BP5, TET3, TPBG, ZNF175). Several of the molecules produced by these genes may have a role in the pathogenesis of pre-eclampsia, and some could qualify as biomarkers for prediction or detection of this pregnancy complication. PMID:26388242
Shi, Pibiao; Guy, Kateta Malangisha; Wu, Weifang; Fang, Bingsheng; Yang, Jinghua; Zhang, Mingfang; Hu, Zhongyuan
2016-04-12
The plant-specific TCP transcription factor family, which is involved in the regulation of cell growth and proliferation, performs diverse functions in multiple aspects of plant growth and development. However, no comprehensive analysis of the TCP family in watermelon (Citrullus lanatus) has been undertaken previously. A total of 27 watermelon TCP encoding genes distributed on nine chromosomes were identified. Phylogenetic analysis clustered the genes into 11 distinct subgroups. Furthermore, phylogenetic and structural analyses distinguished two homology classes within the ClTCP family, designated Class I and Class II. The Class II genes were differentiated into two subclasses, the CIN subclass and the CYC/TB1 subclass. The expression patterns of all members were determined by semi-quantitative PCR. The functions of two ClTCP genes, ClTCP14a and ClTCP15, in regulating plant height were confirmed by ectopic expression in Arabidopsis wild-type and ortholog mutants. This study represents the first genome-wide analysis of the watermelon TCP gene family, which provides valuable information for understanding the classification and functions of the TCP genes in watermelon.
Gaji, Rajshekhar Y; Howe, Daniel K
2009-07-01
The apicomplexan parasite Sarcocystis neurona undergoes a complex process of intracellular development, during which many genes are temporally regulated. The described study was undertaken to begin identifying the basic promoter elements that control gene expression in S. neurona. Sequence analysis of the 5'-flanking region of five S. neurona genes revealed a conserved heptanucleotide motif GAGACGC that is similar to the WGAGACG motif described upstream of multiple genes in Toxoplasma gondii. The promoter region for the major surface antigen gene SnSAG1, which contains three heptanucleotide motifs within 135 bases of the transcription start site, was dissected by functional analysis using a dual luciferase reporter assay. These analyses revealed that a minimal promoter fragment containing all three motifs was sufficient to drive reporter molecule expression, with the presence and orientation of the 5'-most heptanucleotide motif being absolutely critical for promoter function. Further studies should help to identify additional sequence elements important for promoter function and for controlling gene expression during intracellular development by this apicomplexan pathogen.
SorghumFDB: sorghum functional genomics database with multidimensional network analysis.
Tian, Tian; You, Qi; Zhang, Liwei; Yi, Xin; Yan, Hengyu; Xu, Wenying; Su, Zhen
2016-01-01
Sorghum (Sorghum bicolor [L.] Moench) has excellent agronomic traits and biological properties, such as heat and drought-tolerance. It is a C4 grass and potential bioenergy-producing plant, which makes it an important crop worldwide. With the sorghum genome sequence released, it is essential to establish a sorghum functional genomics data mining platform. We collected genomic data and some functional annotations to construct a sorghum functional genomics database (SorghumFDB). SorghumFDB integrated knowledge of sorghum gene family classifications (transcription regulators/factors, carbohydrate-active enzymes, protein kinases, ubiquitins, cytochrome P450, monolignol biosynthesis related enzymes, R-genes and organelle-genes), detailed gene annotations, miRNA and target gene information, orthologous pairs in the model plants Arabidopsis, rice and maize, gene loci conversions and a genome browser. We further constructed a dynamic network of multidimensional biological relationships, comprised of the co-expression data, protein-protein interactions and miRNA-target pairs. We took effective measures to combine the network, gene set enrichment and motif analyses to determine the key regulators that participate in related metabolic pathways, such as the lignin pathway, which is a major biological process in bioenergy-producing plants.Database URL: http://structuralbiology.cau.edu.cn/sorghum/index.html. © The Author(s) 2016. Published by Oxford University Press.
An RNA-Seq-based reference transcriptome for Citrus.
Terol, Javier; Tadeo, Francisco; Ventimilla, Daniel; Talon, Manuel
2016-03-01
Previous RNA-Seq studies in citrus have been focused on physiological processes relevant to fruit quality and productivity of the major species, especially sweet orange. Less attention has been paid to vegetative or reproductive tissues, while most Citrus species have never been analysed. In this work, we characterized the transcriptome of vegetative and reproductive tissues from 12 Citrus species from all main phylogenetic groups. Our aims were to acquire a complete view of the citrus transcriptome landscape, to improve previous functional annotations and to obtain genetic markers associated with genes of agronomic interest. 28 samples were used for RNA-Seq analysis, obtained from 12 Citrus species: C. medica, C. aurantifolia, C. limon, C. bergamia, C. clementina, C. deliciosa, C. reshni, C. maxima, C. paradisi, C. aurantium, C. sinensis and Poncirus trifoliata. Four different organs were analysed: root, phloem, leaf and flower. A total of 3421 million Illumina reads were produced and mapped against the reference C. clementina genome sequence. Transcript discovery pipeline revealed 3326 new genes, the number of genes with alternative splicing was increased to 19,739, and a total of 73,797 transcripts were identified. Differential expression studies between the four tissues showed that gene expression is overall related to the physiological function of the specific organs above any other variable. Variants discovery analysis revealed the presence of indels and SNPs in genes associated with fruit quality and productivity. Pivotal pathways in citrus such as those of flavonoids, flavonols, ethylene and auxin were also analysed in detail. © 2015 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.
Using the TIGR gene index databases for biological discovery.
Lee, Yuandan; Quackenbush, John
2003-11-01
The TIGR Gene Index web pages provide access to analyses of ESTs and gene sequences for nearly 60 species, as well as a number of resources derived from these. Each species-specific database is presented using a common format with a homepage. A variety of methods exist that allow users to search each species-specific database. Methods implemented currently include nucleotide or protein sequence queries using WU-BLAST, text-based searches using various sequence identifiers, searches by gene, tissue and library name, and searches using functional classes through Gene Ontology assignments. This protocol provides guidance for using the Gene Index Databases to extract information.
dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts
Vincent, Jonathan; Dai, Zhanwu; Ravel, Catherine; Choulet, Frédéric; Mouzeyar, Said; Bouzidi, M. Fouad; Agier, Marie; Martre, Pierre
2013-01-01
The functional annotation of genes based on sequence homology with genes from model species genomes is time-consuming because it is necessary to mine several unrelated databases. The aim of the present work was to develop a functional annotation database for common wheat Triticum aestivum (L.). The database, named dbWFA, is based on the reference NCBI UniGene set, an expressed gene catalogue built by expressed sequence tag clustering, and on full-length coding sequences retrieved from the TriFLDB database. Information from good-quality heterogeneous sources, including annotations for model plant species Arabidopsis thaliana (L.) Heynh. and Oryza sativa L., was gathered and linked to T. aestivum sequences through BLAST-based homology searches. Even though the complexity of the transcriptome cannot yet be fully appreciated, we developed a tool to easily and promptly obtain information from multiple functional annotation systems (Gene Ontology, MapMan bin codes, MIPS Functional Categories, PlantCyc pathway reactions and TAIR gene families). The use of dbWFA is illustrated here with several query examples. We were able to assign a putative function to 45% of the UniGenes and 81% of the full-length coding sequences from TriFLDB. Moreover, comparison of the annotation of the whole T. aestivum UniGene set along with curated annotations of the two model species assessed the accuracy of the annotation provided by dbWFA. To further illustrate the use of dbWFA, genes specifically expressed during the early cell division or late storage polymer accumulation phases of T. aestivum grain development were identified using a clustering analysis and then annotated using dbWFA. The annotation of these two sets of genes was consistent with previous analyses of T. aestivum grain transcriptomes and proteomes. Database URL: urgi.versailles.inra.fr/dbWFA/ PMID:23660284
Co-option of bacteriophage lysozyme genes by bivalve genomes.
Ren, Qian; Wang, Chunyang; Jin, Min; Lan, Jiangfeng; Ye, Ting; Hui, Kaimin; Tan, Jingmin; Wang, Zheng; Wyckoff, Gerald J; Wang, Wen; Han, Guan-Zhu
2017-01-01
Eukaryotes have occasionally acquired genetic material through horizontal gene transfer (HGT). However, little is known about the evolutionary and functional significance of such acquisitions. Lysozymes are ubiquitous enzymes that degrade bacterial cell walls. Here, we provide evidence that two subclasses of bivalves (Heterodonta and Palaeoheterodonta) acquired a lysozyme gene via HGT, building on earlier findings. Phylogenetic analyses place the bivalve lysozyme genes within the clade of bacteriophage lysozyme genes, indicating that the bivalves acquired the phage-type lysozyme genes from bacteriophages, either directly or through intermediate hosts. These bivalve lysozyme genes underwent dramatic structural changes after their co-option, including intron gain and fusion with other genes. Moreover, evidence suggests that recurrent gene duplication occurred in the bivalve lysozyme genes. Finally, we show the co-opted lysozymes exhibit a capacity for antibacterial action, potentially augmenting the immune function of related bivalves. This represents an intriguing evolutionary strategy in the eukaryote-microbe arms race, in which the genetic materials of bacteriophages are co-opted by eukaryotes, and then used by eukaryotes to combat bacteria, using a shared weapon against a common enemy. © 2017 The Authors.
FARVATX: FAmily-based Rare Variant Association Test for X-linked genes
Choi, Sungkyoung; Lee, Sungyoung; Qiao, Dandi; Hardin, Megan; Cho, Michael H.; Silverman, Edwin K; Park, Taesung; Won, Sungho
2016-01-01
Although the X chromosome has many genes that are functionally related to human diseases, the complicated biological properties of the X chromosome have prevented efficient genetic association analyses, and only a few significantly associated X-linked variants have been reported for complex traits. For instance, dosage compensation of X-linked genes is often achieved via the inactivation of one allele in each X-linked variant in females; however, some X-linked variants can escape this X chromosome inactivation. Efficient genetic analyses cannot be conducted without prior knowledge about the gene expression process of X-linked variants, and misspecified information can lead to power loss. In this report, we propose new statistical methods for rare X-linked variant genetic association analysis of dichotomous phenotypes with family-based samples. The proposed methods are computationally efficient and can complete X-linked analyses within a few hours. Simulation studies demonstrate the statistical efficiency of the proposed methods, which were then applied to rare-variant association analysis of the X chromosome in chronic obstructive pulmonary disease (COPD). Some promising significant X-linked genes were identified, illustrating the practical importance of the proposed methods. PMID:27325607
FARVATX: Family-Based Rare Variant Association Test for X-Linked Genes.
Choi, Sungkyoung; Lee, Sungyoung; Qiao, Dandi; Hardin, Megan; Cho, Michael H; Silverman, Edwin K; Park, Taesung; Won, Sungho
2016-09-01
Although the X chromosome has many genes that are functionally related to human diseases, the complicated biological properties of the X chromosome have prevented efficient genetic association analyses, and only a few significantly associated X-linked variants have been reported for complex traits. For instance, dosage compensation of X-linked genes is often achieved via the inactivation of one allele in each X-linked variant in females; however, some X-linked variants can escape this X chromosome inactivation. Efficient genetic analyses cannot be conducted without prior knowledge about the gene expression process of X-linked variants, and misspecified information can lead to power loss. In this report, we propose new statistical methods for rare X-linked variant genetic association analysis of dichotomous phenotypes with family-based samples. The proposed methods are computationally efficient and can complete X-linked analyses within a few hours. Simulation studies demonstrate the statistical efficiency of the proposed methods, which were then applied to rare-variant association analysis of the X chromosome in chronic obstructive pulmonary disease. Some promising significant X-linked genes were identified, illustrating the practical importance of the proposed methods. © 2016 WILEY PERIODICALS, INC.
Child, Christopher J; Blum, Werner F; Deal, Cheri; Zimmermann, Alan G; Quigley, Charmian A; Drop, Stenvert L S; Cutler, Gordon B; Rosenfeld, Ron G
2016-05-01
To determine characteristics of children initially diagnosed with isolated growth hormone deficiency (IGHD) of organic aetiology, who later developed multiple pituitary hormone deficiencies (MPHD). Data were analysed for 716 growth hormone-treated children with organic IGHD, who were growth hormone-naïve at baseline in the multinational, observational Genetics and Neuroendocrinology of Short Stature International Study. Development of MPHD was ascertained from investigator-provided diagnoses, adverse events and concomitant medications. Analyses were performed for all patients and separately for those who developed MPHD within 4.5 years or had >3.5 years follow-up and continued to have IGHD (4-year cohort). MPHD developed in 71/716 (9.9%) children overall, and in 60/290 (20.7%) in the 4-year cohort. The most frequent additional deficiencies were thyroid-stimulating hormone (47 patients) and gonadotropins (23 patients). Compared with those who remained with IGHD, children who developed MPHD had more severe GHD at study entry, significantly lower baseline insulin-like growth factor1, peak stimulated growth hormone, and more frequent diagnosis of intracranial tumour or mutation of gene(s) controlling hypothalamic-pituitary development and/or function. Multivariate logistic regression analyses identified female gender, longer follow-up, higher baseline age and lower peak stimulated growth hormone as predictors of MPHD development. MPHD is more likely to develop in patients with severe organic IGHD, especially those with history of intracranial tumour or mutation of gene(s) controlling hypothalamic-pituitary development and/or function. Older baseline age, female gender and longer follow-up duration were also associated with higher incidence of MPHD. Long-term monitoring of pituitary function is recommended, irrespective of the aetiology of GHD. © 2016 European Society of Endocrinology.
Schweizer, Rena M; Robinson, Jacqueline; Harrigan, Ryan; Silva, Pedro; Galverni, Marco; Musiani, Marco; Green, Richard E; Novembre, John; Wayne, Robert K
2016-01-01
In an era of ever-increasing amounts of whole-genome sequence data for individuals and populations, the utility of traditional single nucleotide polymorphisms (SNPs) array-based genome scans is uncertain. We previously performed a SNP array-based genome scan to identify candidate genes under selection in six distinct grey wolf (Canis lupus) ecotypes. Using this information, we designed a targeted capture array for 1040 genes, including all exons and flanking regions, as well as 5000 1-kb nongenic neutral regions, and resequenced these regions in 107 wolves. Selection tests revealed striking patterns of variation within candidate genes relative to noncandidate regions and identified potentially functional variants related to local adaptation. We found 27% and 47% of candidate genes from the previous SNP array study had functional changes that were outliers in sweed and bayenv analyses, respectively. This result verifies the use of genomewide SNP surveys to tag genes that contain functional variants between populations. We highlight nonsynonymous variants in APOB, LIPG and USH2A that occur in functional domains of these proteins, and that demonstrate high correlation with precipitation seasonality and vegetation. We find Arctic and High Arctic wolf ecotypes have higher numbers of genes under selection, which highlight their conservation value and heightened threat due to climate change. This study demonstrates that combining genomewide genotyping arrays with large-scale resequencing and environmental data provides a powerful approach to discern candidate functional variants in natural populations. © 2015 John Wiley & Sons Ltd.
USDA-ARS?s Scientific Manuscript database
Coding/functional SNPs change the biological function of a gene and, therefore, could serve as “large-effect” genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body weight, mus...
Savage, Jeanne E; Jansen, Philip R; Stringer, Sven; Watanabe, Kyoko; Bryois, Julien; de Leeuw, Christiaan A; Nagel, Mats; Awasthi, Swapnil; Barr, Peter B; Coleman, Jonathan R I; Grasby, Katrina L; Hammerschlag, Anke R; Kaminski, Jakob A; Karlsson, Robert; Krapohl, Eva; Lam, Max; Nygaard, Marianne; Reynolds, Chandra A; Trampush, Joey W; Young, Hannah; Zabaneh, Delilah; Hägg, Sara; Hansell, Narelle K; Karlsson, Ida K; Linnarsson, Sten; Montgomery, Grant W; Muñoz-Manchado, Ana B; Quinlan, Erin B; Schumann, Gunter; Skene, Nathan G; Webb, Bradley T; White, Tonya; Arking, Dan E; Avramopoulos, Dimitrios; Bilder, Robert M; Bitsios, Panos; Burdick, Katherine E; Cannon, Tyrone D; Chiba-Falek, Ornit; Christoforou, Andrea; Cirulli, Elizabeth T; Congdon, Eliza; Corvin, Aiden; Davies, Gail; Deary, Ian J; DeRosse, Pamela; Dickinson, Dwight; Djurovic, Srdjan; Donohoe, Gary; Conley, Emily Drabant; Eriksson, Johan G; Espeseth, Thomas; Freimer, Nelson A; Giakoumaki, Stella; Giegling, Ina; Gill, Michael; Glahn, David C; Hariri, Ahmad R; Hatzimanolis, Alex; Keller, Matthew C; Knowles, Emma; Koltai, Deborah; Konte, Bettina; Lahti, Jari; Le Hellard, Stephanie; Lencz, Todd; Liewald, David C; London, Edythe; Lundervold, Astri J; Malhotra, Anil K; Melle, Ingrid; Morris, Derek; Need, Anna C; Ollier, William; Palotie, Aarno; Payton, Antony; Pendleton, Neil; Poldrack, Russell A; Räikkönen, Katri; Reinvang, Ivar; Roussos, Panos; Rujescu, Dan; Sabb, Fred W; Scult, Matthew A; Smeland, Olav B; Smyrnis, Nikolaos; Starr, John M; Steen, Vidar M; Stefanis, Nikos C; Straub, Richard E; Sundet, Kjetil; Tiemeier, Henning; Voineskos, Aristotle N; Weinberger, Daniel R; Widen, Elisabeth; Yu, Jin; Abecasis, Goncalo; Andreassen, Ole A; Breen, Gerome; Christiansen, Lene; Debrabant, Birgit; Dick, Danielle M; Heinz, Andreas; Hjerling-Leffler, Jens; Ikram, M Arfan; Kendler, Kenneth S; Martin, Nicholas G; Medland, Sarah E; Pedersen, Nancy L; Plomin, Robert; Polderman, Tinca J C; Ripke, Stephan; van der Sluis, Sophie; Sullivan, Patrick F; Vrieze, Scott I; Wright, Margaret J; Posthuma, Danielle
2018-06-25
Intelligence is highly heritable 1 and a major determinant of human health and well-being 2 . Recent genome-wide meta-analyses have identified 24 genomic loci linked to variation in intelligence 3-7 , but much about its genetic underpinnings remains to be discovered. Here, we present a large-scale genetic association study of intelligence (n = 269,867), identifying 205 associated genomic loci (190 new) and 1,016 genes (939 new) via positional mapping, expression quantitative trait locus (eQTL) mapping, chromatin interaction mapping, and gene-based association analysis. We find enrichment of genetic effects in conserved and coding regions and associations with 146 nonsynonymous exonic variants. Associated genes are strongly expressed in the brain, specifically in striatal medium spiny neurons and hippocampal pyramidal neurons. Gene set analyses implicate pathways related to nervous system development and synaptic structure. We confirm previous strong genetic correlations with multiple health-related outcomes, and Mendelian randomization analysis results suggest protective effects of intelligence for Alzheimer's disease and ADHD and bidirectional causation with pleiotropic effects for schizophrenia. These results are a major step forward in understanding the neurobiology of cognitive function as well as genetically related neurological and psychiatric disorders.
Vu Manh, Thien-Phong; Elhmouzi-Younes, Jamila; Urien, Céline; Ruscanu, Suzana; Jouneau, Luc; Bourge, Mickaël; Moroldo, Marco; Foucras, Gilles; Salmon, Henri; Marty, Hélène; Quéré, Pascale; Bertho, Nicolas; Boudinot, Pierre; Dalod, Marc; Schwartz-Cornil, Isabelle
2015-01-01
Mononuclear phagocytes are organized in a complex system of ontogenetically and functionally distinct subsets, that has been best described in mouse and to some extent in human. Identification of homologous mononuclear phagocyte subsets in other vertebrate species of biomedical, economic, and environmental interest is needed to improve our knowledge in physiologic and physio-pathologic processes, and to design intervention strategies against a variety of diseases, including zoonotic infections. We developed a streamlined approach combining refined cell sorting and integrated comparative transcriptomics analyses which revealed conservation of the mononuclear phagocyte organization across human, mouse, sheep, pigs and, in some respect, chicken. This strategy should help democratizing the use of omics analyses for the identification and study of cell types across tissues and species. Moreover, we identified conserved gene signatures that enable robust identification and universal definition of these cell types. We identified new evolutionarily conserved gene candidates and gene interaction networks for the molecular regulation of the development or functions of these cell types, as well as conserved surface candidates for refined subset phenotyping throughout species. A phylogenetic analysis revealed that orthologous genes of the conserved signatures exist in teleost fishes and apparently not in Lamprey. PMID:26150816
Neo-Darwinism, the Modern Synthesis and selfish genes: are they of use in physiology?
Noble, Denis
2011-01-01
This article argues that the gene-centric interpretations of evolution, and more particularly the selfish gene expression of those interpretations, form barriers to the integration of physiological science with evolutionary theory. A gene-centred approach analyses the relationships between genotypes and phenotypes in terms of differences (change the genotype and observe changes in phenotype). We now know that, most frequently, this does not correctly reveal the relationships because of extensive buffering by robust networks of interactions. By contrast, understanding biological function through physiological analysis requires an integrative approach in which the activity of the proteins and RNAs formed from each DNA template is analysed in networks of interactions. These networks also include components that are not specified by nuclear DNA. Inheritance is not through DNA sequences alone. The selfish gene idea is not useful in the physiological sciences, since selfishness cannot be defined as an intrinsic property of nucleotide sequences independently of gene frequency, i.e. the ‘success’ in the gene pool that is supposed to be attributable to the ‘selfish’ property. It is not a physiologically testable hypothesis. PMID:21135048
Neo-Darwinism, the modern synthesis and selfish genes: are they of use in physiology?
Noble, Denis
2011-03-01
This article argues that the gene-centric interpretations of evolution, and more particularly the selfish gene expression of those interpretations, form barriers to the integration of physiological science with evolutionary theory. A gene-centred approach analyses the relationships between genotypes and phenotypes in terms of differences (change the genotype and observe changes in phenotype). We now know that, most frequently, this does not correctly reveal the relationships because of extensive buffering by robust networks of interactions. By contrast, understanding biological function through physiological analysis requires an integrative approach in which the activity of the proteins and RNAs formed from each DNA template is analysed in networks of interactions. These networks also include components that are not specified by nuclear DNA. Inheritance is not through DNA sequences alone. The selfish gene idea is not useful in the physiological sciences, since selfishness cannot be defined as an intrinsic property of nucleotide sequences independently of gene frequency, i.e. the 'success' in the gene pool that is supposed to be attributable to the 'selfish' property. It is not a physiologically testable hypothesis.
Platt, James L.; Rogers, Benjamin J.; Rogers, Kelley C.; Harwood, Adrian J.; Kimmel, Alan R.
2013-01-01
Control of chromatin structure is crucial for multicellular development and regulation of cell differentiation. The CHD (chromodomain-helicase-DNA binding) protein family is one of the major ATP-dependent, chromatin remodeling factors that regulate nucleosome positioning and access of transcription factors and RNA polymerase to the eukaryotic genome. There are three mammalian CHD subfamilies and their impaired functions are associated with several human diseases. Here, we identify three CHD orthologs (ChdA, ChdB and ChdC) in Dictyostelium discoideum. These CHDs are expressed throughout development, but with unique patterns. Null mutants lacking each CHD have distinct phenotypes that reflect their expression patterns and suggest functional specificity. Accordingly, using genome-wide (RNA-seq) transcriptome profiling for each null strain, we show that the different CHDs regulate distinct gene sets during both growth and development. ChdC is an apparent ortholog of the mammalian Class III CHD group that is associated with the human CHARGE syndrome, and GO analyses of aberrant gene expression in chdC nulls suggest defects in both cell-autonomous and non-autonomous signaling, which have been confirmed through analyses of chdC nulls developed in pure populations or with low levels of wild-type cells. This study provides novel insight into the broad function of CHDs in the regulation development and disease, through chromatin-mediated changes in directed gene expression. PMID:24301467
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hayakawa, Kazuo; Department of Cell Growth and Differentiation, Center for iPS Cell Research and Application, Kyoto University, Kyoto; Department of Orthopaedic Surgery, Graduate School of Medical Sciences, Nagoya City University, Nagoya
2013-03-22
Highlights: ► We tried to identify targets of synovial sarcoma (SS)-associated SYT–SSX fusion gene. ► We established pluripotent stem cell (PSC) lines with inducible SYT–SSX gene. ► SYT–SSX responsive genes were identified by the induction of SYT–SSX in PSC. ► SS-related genes were selected from database by in silico analyses. ► 51 genes were finally identified among SS-related genes as targets of SYT–SSX in PSC. -- Abstract: Synovial sarcoma (SS) is a malignant soft tissue tumor harboring chromosomal translocation t(X; 18)(p11.2; q11.2), which produces SS-specific fusion gene, SYT–SSX. Although precise function of SYT–SSX remains to be investigated, accumulating evidences suggestmore » its role in gene regulation via epigenetic mechanisms, and the product of SYT–SSX target genes may serve as biomarkers of SS. Lack of knowledge about the cell-of-origin of SS, however, has placed obstacle in the way of target identification. Here we report a novel approach to identify SYT–SSX2 target genes using human pluripotent stem cells (hPSCs) containing a doxycycline-inducible SYT–SSX2 gene. SYT–SSX2 was efficiently induced both at mRNA and protein levels within three hours after doxycycline administration, while no morphological change of hPSCs was observed until 24 h. Serial microarray analyses identified genes of which the expression level changed more than twofold within 24 h. Surprisingly, the majority (297/312, 95.2%) were up-regulated genes and a result inconsistent with the current concept of SYT–SSX as a transcriptional repressor. Comparing these genes with SS-related genes which were selected by a series of in silico analyses, 49 and 2 genes were finally identified as candidates of up- and down-regulated target of SYT–SSX, respectively. Association of these genes with SYT–SSX in SS cells was confirmed by knockdown experiments. Expression profiles of SS-related genes in hPSCs and human mesenchymal stem cells (hMSCs) were strikingly different in response to the induction of SYT–SSX, and more than half of SYT–SSX target genes in hPSCs were not induced in hMSCs. These results suggest the importance of cellular context for correct understanding of SYT–SSX function, and demonstrated how our new system will help to overcome this issue.« less
Jin, Xiaoli; Ren, Jing; Nevo, Eviatar; Yin, Xuegui; Sun, Dongfa; Peng, Junhua
2017-01-01
NAC (NAM/ATAF/CUC) proteins constitute one of the biggest plant-specific transcription factor (TF) families and have crucial roles in diverse developmental programs during plant growth. Phylogenetic analyses have revealed both conserved and lineage-specific NAC subfamilies, among which various origins and distinct features were observed. It is reasonable to hypothesize that there should be divergent evolutionary patterns of NAC TFs both between dicots and monocots, and among NAC subfamilies. In this study, we compared the gene duplication and loss, evolutionary rate, and selective pattern among non-lineage specific NAC subfamilies, as well as those between dicots and monocots, through genome-wide analyses of sequence and functional data in six dicot and five grass lineages. The number of genes gained in the dicot lineages was much larger than that in the grass lineages, while fewer gene losses were observed in the grass than that in the dicots. We revealed (1) uneven constitution of Clusters of Orthologous Groups (COGs) and contrasting birth/death rates among subfamilies, and (2) two distinct evolutionary scenarios of NAC TFs between dicots and grasses. Our results demonstrated that relaxed selection, resulting from concerted gene duplications, may have permitted substitutions responsible for functional divergence of NAC genes into new lineages. The underlying mechanism of distinct evolutionary fates of NAC TFs shed lights on how evolutionary divergence contributes to differences in establishing NAC gene subfamilies and thus impacts the distinct features between dicots and grasses. PMID:28713414
Li, Ming-Rui; Shi, Feng-Xue; Li, Ya-Ling; Jiang, Peng; Jiao, Lili
2017-01-01
Abstract Chinese ginseng (Panax ginseng Meyer) is a medicinally important herb and plays crucial roles in traditional Chinese medicine. Pharmacological analyses identified diverse bioactive components from Chinese ginseng. However, basic biological attributes including domestication and selection of the ginseng plant remain under-investigated. Here, we presented a genome-wide view of the domestication and selection of cultivated ginseng based on the whole genome data. A total of 8,660 protein-coding genes were selected for genome-wide scanning of the 30 wild and cultivated ginseng accessions. In complement, the 45s rDNA, chloroplast and mitochondrial genomes were included to perform phylogenetic and population genetic analyses. The observed spatial genetic structure between northern cultivated ginseng (NCG) and southern cultivated ginseng (SCG) accessions suggested multiple independent origins of cultivated ginseng. Genome-wide scanning further demonstrated that NCG and SCG have undergone distinct selection pressures during the domestication process, with more genes identified in the NCG (97 genes) than in the SCG group (5 genes). Functional analyses revealed that these genes are involved in diverse pathways, including DNA methylation, lignin biosynthesis, and cell differentiation. These findings suggested that the SCG and NCG groups have distinct demographic histories. Candidate genes identified are useful for future molecular breeding of cultivated ginseng. PMID:28922794
2013-01-01
Background Nucleoside phosphorylases (NPs) have been extensively investigated in human and bacterial systems for their role in metabolic nucleotide salvaging and links to oncogenesis. In plants, NP-like proteins have not been comprehensively studied, likely because there is no evidence of a metabolic function in nucleoside salvage. However, in the forest trees genus Populus a family of NP-like proteins function as an important ecophysiological adaptation for inter- and intra-seasonal nitrogen storage and cycling. Results We conducted phylogenetic analyses to determine the distribution and evolution of NP-like proteins in plants. These analyses revealed two major clusters of NP-like proteins in plants. Group I proteins were encoded by genes across a wide range of plant taxa while proteins encoded by Group II genes were dominated by species belonging to the order Malpighiales and included the Populus Bark Storage Protein (BSP) and WIN4-like proteins. Additionally, we evaluated the NP-like genes in Populus by examining the transcript abundance of the 13 NP-like genes found in the Populus genome in various tissues of plants exposed to long-day (LD) and short-day (SD) photoperiods. We found that all 13 of the Populus NP-like genes belonging to either Group I or II are expressed in various tissues in both LD and SD conditions. Tests of natural selection and expression evolution analysis of the Populus genes suggests that divergence in gene expression may have occurred recently during the evolution of Populus, which supports the adaptive maintenance models. Lastly, in silico analysis of cis-regulatory elements in the promoters of the 13 NP-like genes in Populus revealed common regulatory elements known to be involved in light regulation, stress/pathogenesis and phytohormone responses. Conclusion In Populus, the evolution of the NP-like protein and gene family has been shaped by duplication events and natural selection. Expression data suggest that previously uncharacterized NP-like proteins may function in nutrient sensing and/or signaling. These proteins are members of Group I NP-like proteins, which are widely distributed in many plant taxa. We conclude that NP-like proteins may function in plants, although this function is undefined. PMID:23957885
Samanta, Brajogopal; Bhadury, Punyasloke
2016-01-01
Marine chromophytes are taxonomically diverse group of algae and contribute approximately half of the total oceanic primary production. To understand the global patterns of functional diversity of chromophytic phytoplankton, robust bioinformatics and statistical analyses including deep phylogeny based on 2476 form ID rbcL gene sequences representing seven ecologically significant oceanographic ecoregions were undertaken. In addition, 12 form ID rbcL clone libraries were generated and analyzed (148 sequences) from Sundarbans Biosphere Reserve representing the world’s largest mangrove ecosystem as part of this study. Global phylogenetic analyses recovered 11 major clades of chromophytic phytoplankton in varying proportions with several novel rbcL sequences in each of the seven targeted ecoregions. Majority of OTUs was found to be exclusive to each ecoregion, whereas some were shared by two or more ecoregions based on beta-diversity analysis. Present phylogenetic and bioinformatics analyses provide a strong statistical support for the hypothesis that different oceanographic regimes harbor distinct and coherent groups of chromophytic phytoplankton. It has been also shown as part of this study that varying natural selection pressure on form ID rbcL gene under different environmental conditions could lead to functional differences and overall fitness of chromophytic phytoplankton populations. PMID:26861415
Assessment of Alzheimer’s disease case–control associations using family-based methods
Schjeide, Brit-Maren M.; McQueen, Matthew B.; Mullin, Kristina; DiVito, Jason; Hogan, Meghan F.; Parkinson, Michele; Hooli, Basavaraj; Lange, Christoph; Blacker, Deborah; Tanzi, Rudolph E.
2009-01-01
The genetics of Alzheimer’s disease (AD) is heterogeneous and remains only ill-defined. We have recently created a freely available and continuously updated online database (AlzGene; http://www.alzgene.org) for which we collect all published genetic association studies in AD and perform systematic meta-analyses on all polymorphisms with sufficient genotype data. In this study, we tested 27 genes (ACE, BDNF, CH25H, CHRNB2, CST3, CTSD, DAPK1, GALP, hCG2039140, IL1B, LMNA, LOC439999, LOC651924, MAPT, MTHFR, MYH13, PCK1, PGBD1, PRNP, PSEN1, SORCS1, SORL1, TF, TFAM, TNK1, GWA_14q32.13, and GWA_7p15.2), all showing significant association with AD risk in the AlzGene meta-analyses, in a large collection of family-based samples comprised of 4,180 subjects from over 1,300 pedigrees. Overall, we observe significant association with risk for AD and polymorphisms in ACE, CHRNB2, TF, and an as yet uncharacterized locus on chromosome 7p15.2 [rs1859849]. For all four loci, the association was observed with the same alleles as in the AlzGene meta-analyses. The convergence of case–control and family-based findings suggests that these loci currently represent the most promising AD gene candidates. Further fine-mapping and functional analyses are warranted to elucidate the potential biochemical mechanisms and epidemiological relevance of these genes. PMID:18830724
Genomewide analysis of TCP transcription factor gene family in Malus domestica.
Xu, Ruirui; Sun, Peng; Jia, Fengjuan; Lu, Longtao; Li, Yuanyuan; Zhang, Shizhong; Huang, Jinguang
2014-12-01
Teosinte branched 1/cycloidea/proliferating cell factor 1 (TCP) proteins are a large family of transcriptional regulators in angiosperms. They are involved in various biological processes, including development and plant metabolism pathways. In this study, a total of 52 TCP genes were identified in apple (Malus domestica) genome. Bioinformatic methods were employed to predicate and analyse their relevant gene classification, gene structure, chromosome location, sequence alignment and conserved domains of MdTCP proteins. Expression analysis from microarray data showed that the expression levels of 28 and 51 MdTCP genes changed during the ripening and rootstock-scion interaction processes, respectively. The expression patterns of 12 selected MdTCP genes were analysed in different tissues and in response to abiotic stresses. All of the selected genes were detected in at least one of the tissues tested, and most of them were modulated by adverse treatments indicating that the MdTCPs were involved in various developmental and physiological processes. To the best of our knowledge, this is the first study of a genomewide analysis of apple TCP gene family. These results provide valuable information for studies on functions of the TCP transcription factor genes in apple.
The evolution of duplicate gene expression in mammalian organs
Guschanski, Katerina; Warnefors, Maria; Kaessmann, Henrik
2017-01-01
Gene duplications generate genomic raw material that allows the emergence of novel functions, likely facilitating adaptive evolutionary innovations. However, global assessments of the functional and evolutionary relevance of duplicate genes in mammals were until recently limited by the lack of appropriate comparative data. Here, we report a large-scale study of the expression evolution of DNA-based functional gene duplicates in three major mammalian lineages (placental mammals, marsupials, egg-laying monotremes) and birds, on the basis of RNA sequencing (RNA-seq) data from nine species and eight organs. We observe dynamic changes in tissue expression preference of paralogs with different duplication ages, suggesting differential contribution of paralogs to specific organ functions during vertebrate evolution. Specifically, we show that paralogs that emerged in the common ancestor of bony vertebrates are enriched for genes with brain-specific expression and provide evidence for differential forces underlying the preferential emergence of young testis- and liver-specific expressed genes. Further analyses uncovered that the overall spatial expression profiles of gene families tend to be conserved, with several exceptions of pronounced tissue specificity shifts among lineage-specific gene family expansions. Finally, we trace new lineage-specific genes that may have contributed to the specific biology of mammalian organs, including the little-studied placenta. Overall, our study provides novel and taxonomically broad evidence for the differential contribution of duplicate genes to tissue-specific transcriptomes and for their importance for the phenotypic evolution of vertebrates. PMID:28743766
Hu, Wei; Yan, Yan; Shi, Haitao; Liu, Juhua; Miao, Hongxia; Tie, Weiwei; Ding, Zehong; Ding, XuPo; Wu, Chunlai; Liu, Yang; Wang, Jiashui; Xu, Biyu; Jin, Zhiqiang
2017-08-29
Abscisic acid (ABA) signaling plays a crucial role in developmental and environmental adaptation processes of plants. However, the PYL-PP2C-SnRK2 families that function as the core components of ABA signaling are not well understood in banana. In the present study, 24 PYL, 87 PP2C, and 11 SnRK2 genes were identified from banana, which was further supported by evolutionary relationships, conserved motif and gene structure analyses. The comprehensive transcriptomic analyses showed that banana PYL-PP2C-SnRK2 genes are involved in tissue development, fruit development and ripening, and response to abiotic stress in two cultivated varieties. Moreover, comparative expression analyses of PYL-PP2C-SnRK2 genes between BaXi Jiao (BX) and Fen Jiao (FJ) revealed that PYL-PP2C-SnRK2-mediated ABA signaling might positively regulate banana fruit ripening and tolerance to cold, salt, and osmotic stresses. Finally, interaction networks and co-expression assays demonstrated that the core components of ABA signaling were more active in FJ than in BX in response to abiotic stress, further supporting the crucial role of the genes in tolerance to abiotic stress in banana. This study provides new insights into the complicated transcriptional control of PYL-PP2C-SnRK2 genes, improves the understanding of PYL-PP2C-SnRK2-mediated ABA signaling in the regulation of fruit development, ripening, and response to abiotic stress, and identifies some candidate genes for genetic improvement of banana.
Homeobox genes in the rodent pineal gland: roles in development and phenotype maintenance.
Rath, Martin F; Rohde, Kristian; Klein, David C; Møller, Morten
2013-06-01
The pineal gland is a neuroendocrine gland responsible for nocturnal synthesis of melatonin. During early development of the rodent pineal gland from the roof of the diencephalon, homeobox genes of the orthodenticle homeobox (Otx)- and paired box (Pax)-families are expressed and are essential for normal pineal development consistent with the well-established role that homeobox genes play in developmental processes. However, the pineal gland appears to be unusual because strong homeobox gene expression persists in the pineal gland of the adult brain. Accordingly, in addition to developmental functions, homeobox genes appear to be key regulators in postnatal phenotype maintenance in this tissue. In this paper, we review ontogenetic and phylogenetic aspects of pineal development and recent progress in understanding the involvement of homebox genes in rodent pineal development and adult function. A working model is proposed for understanding the sequential action of homeobox genes in controlling development and mature circadian function of the mammalian pinealocyte based on knowledge from detailed developmental and daily gene expression analyses in rats, the pineal phenotypes of homebox gene-deficient mice and studies on development of the retinal photoreceptor; the pinealocyte and retinal photoreceptor share features not seen in other tissues and are likely to have evolved from the same ancestral photodetector cell.
Homeobox genes in the rodent pineal gland: roles in development and phenotype maintenance
Rath, Martin F.; Rohde, Kristian; Klein, David C.; Møller, Morten
2012-01-01
The pineal gland is a neuroendocrine gland responsible for nocturnal synthesis of melatonin. During early development of the rodent pineal gland from the roof of the diencephalon, homeobox genes of the orthodenticle homeobox (Otx)- and paired box (Pax)-families are expressed and are essential for normal pineal development consistent with the well-established role that homeobox genes play in developmental processes. However, the pineal gland appears to be unusual because strong homeobox gene expression persists in the pineal gland of the adult brain. Accordingly, in addition to developmental functions, homeobox genes appear to be key regulators in postnatal phenotype maintenance in this tissue. In this paper, we review ontogenetic and phylogenetic aspects of pineal development and recent progress in understanding the involvement of homebox genes in rodent pineal development and adult function. A working model is proposed for understanding the sequential action of homeobox genes in controlling development and mature circadian function of the mammalian pinealocyte based on knowledge from detailed developmental and daily gene expression analyses in rats, the pineal phenotypes of homebox gene-deficient mice and studies on development of the retinal photoreceptor; the pinealocyte and retinal photoreceptor share features not seen in other tissues and are likely to have evolved from the same ancestral photodetector cell. PMID:23076630
The Genetic Basis for Variation in Sensitivity to Lead Toxicity in Drosophila melanogaster.
Zhou, Shanshan; Morozova, Tatiana V; Hussain, Yasmeen N; Luoma, Sarah E; McCoy, Lenovia; Yamamoto, Akihiko; Mackay, Trudy F C; Anholt, Robert R H
2016-07-01
Lead toxicity presents a worldwide health problem, especially due to its adverse effects on cognitive development in children. However, identifying genes that give rise to individual variation in susceptibility to lead toxicity is challenging in human populations. Our goal was to use Drosophila melanogaster to identify evolutionarily conserved candidate genes associated with individual variation in susceptibility to lead exposure. To identify candidate genes associated with variation in susceptibility to lead toxicity, we measured effects of lead exposure on development time, viability and adult activity in the Drosophila melanogaster Genetic Reference Panel (DGRP) and performed genome-wide association analyses to identify candidate genes. We used mutants to assess functional causality of candidate genes and constructed a genetic network associated with variation in sensitivity to lead exposure, on which we could superimpose human orthologs. We found substantial heritabilities for all three traits and identified candidate genes associated with variation in susceptibility to lead exposure for each phenotype. The genetic architectures that determine variation in sensitivity to lead exposure are highly polygenic. Gene ontology and network analyses showed enrichment of genes associated with early development and function of the nervous system. Drosophila melanogaster presents an advantageous model to study the genetic underpinnings of variation in susceptibility to lead toxicity. Evolutionary conservation of cellular pathways that respond to toxic exposure allows predictions regarding orthologous genes and pathways across phyla. Thus, studies in the D. melanogaster model system can identify candidate susceptibility genes to guide subsequent studies in human populations. Zhou S, Morozova TV, Hussain YN, Luoma SE, McCoy L, Yamamoto A, Mackay TF, Anholt RR. 2016. The genetic basis for variation in sensitivity to lead toxicity in Drosophila melanogaster. Environ Health Perspect 124:1062-1070; http://dx.doi.org/10.1289/ehp.1510513.
Identification and characterization of NF-YB family genes in tung tree.
Yang, Susu; Wang, Yangdong; Yin, Hengfu; Guo, Haobo; Gao, Ming; Zhu, Huiping; Chen, Yicun
2015-12-01
The NF-YB transcription factor gene family encodes a subunit of the CCAAT box-binding factor (CBF), a highly conserved trimeric activator that strongly binds to the CCAAT box promoter element. Studies on model plants have shown that NF-YB proteins participate in important developmental and physiological processes, but little is known about NF-YB proteins in trees. Here, we identified seven NF-YB transcription factor-encoding genes in Vernicia fordii, an important oilseed tree in China. A phylogenetic analysis separated the genes into two groups; non-LEC1 type (VfNF-YB1, 5, 7, 9, 11, 13) and LEC1-type (VfNF-YB 14). A gene structure analysis showed that VfNF-YB 5 has three introns and the other genes have no introns. The seven VfNF-YB sequences contain highly conserved domains, a disordered region at the N terminus, and two long helix structures at the C terminus. Phylogenetic analyses showed that VfNF-YB family genes are highly homologous to GmNF-YB genes, and many of them are closely related to functionally characterized NF-YBs. In expression analyses of various tissues (root, stem, leaf, and kernel) and the root during pathogen infection, VfNF-YB1, 5, and 11 were dominantly expressed in kernels, and VfNF-YB7 and 9 were expressed only in the root. Different VfNF-YB family genes showed different responses to pathogen infection, suggesting that they play different roles in the pathogen response. Together, these findings represent the first extensive evaluation of the NF-YB family in tung tree and provide a foundation for dissecting the functions of VfNF-YB genes in seed development, stress adaption, fatty acid synthesis, and pathogen response.
Adaptive evolution of the myo6 gene in old world fruit bats (family: pteropodidae).
Shen, Bin; Han, Xiuqun; Jones, Gareth; Rossiter, Stephen J; Zhang, Shuyi
2013-01-01
Myosin VI (encoded by the Myo6 gene) is highly expressed in the inner and outer hair cells of the ear, retina, and polarized epithelial cells such as kidney proximal tubule cells and intestinal enterocytes. The Myo6 gene is thought to be involved in a wide range of physiological functions such as hearing, vision, and clathrin-mediated endocytosis. Bats (Chiroptera) represent one of the most fascinating mammal groups for molecular evolutionary studies of the Myo6 gene. A diversity of specialized adaptations occur among different bat lineages, such as echolocation and associated high-frequency hearing in laryngeal echolocating bats, large eyes and a strong dependence on vision in Old World fruit bats (Pteropodidae), and specialized high-carbohydrate but low-nitrogen diets in both Old World and New World fruit bats (Phyllostomidae). To investigate what role(s) the Myo6 gene might fulfill in bats, we sequenced the coding region of the Myo6 gene in 15 bat species and used molecular evolutionary analyses to detect evidence of positive selection in different bat lineages. We also conducted real-time PCR assays to explore the expression levels of Myo6 in a range of tissues from three representative bat species. Molecular evolutionary analyses revealed that the Myo6 gene, which was widely considered as a hearing gene, has undergone adaptive evolution in the Old World fruit bats which lack laryngeal echolocation and associated high-frequency hearing. Real-time PCR showed the highest expression level of the Myo6 gene in the kidney among ten tissues examined in three bat species, indicating an important role for this gene in kidney function. We suggest that Myo6 has undergone adaptive evolution in Old World fruit bats in relation to receptor-mediated endocytosis for the preservation of protein and essential nutrients.
Genomic analysis reveals extensive gene duplication within the bovine TRB locus
Connelley, Timothy; Aerts, Jan; Law, Andy; Morrison, W Ivan
2009-01-01
Background Diverse TR and IG repertoires are generated by V(D)J somatic recombination. Genomic studies have been pivotal in cataloguing the V, D, J and C genes present in the various TR/IG loci and describing how duplication events have expanded the number of these genes. Such studies have also provided insights into the evolution of these loci and the complex mechanisms that regulate TR/IG expression. In this study we analyze the sequence of the third bovine genome assembly to characterize the germline repertoire of bovine TRB genes and compare the organization, evolution and regulatory structure of the bovine TRB locus with that of humans and mice. Results The TRB locus in the third bovine genome assembly is distributed over 5 scaffolds, extending to ~730 Kb. The available sequence contains 134 TRBV genes, assigned to 24 subgroups, and 3 clusters of DJC genes, each comprising a single TRBD gene, 5–7 TRBJ genes and a single TRBC gene. Seventy-nine of the TRBV genes are predicted to be functional. Comparison with the human and murine TRB loci shows that the gene order, as well as the sequences of non-coding elements that regulate TRB expression, are highly conserved in the bovine. Dot-plot analyses demonstrate that expansion of the genomic TRBV repertoire has occurred via a complex and extensive series of duplications, predominantly involving DNA blocks containing multiple genes. These duplication events have resulted in massive expansion of several TRBV subgroups, most notably TRBV6, 9 and 21 which contain 40, 35 and 16 members respectively. Similarly, duplication has lead to the generation of a third DJC cluster. Analyses of cDNA data confirms the diversity of the TRBV genes and, in addition, identifies a substantial number of TRBV genes, predominantly from the larger subgroups, which are still absent from the genome assembly. The observed gene duplication within the bovine TRB locus has created a repertoire of phylogenetically diverse functional TRBV genes, which is substantially larger than that described for humans and mice. Conclusion The analyses completed in this study reveal that, although the gene content and organization of the bovine TRB locus are broadly similar to that of humans and mice, multiple duplication events have led to a marked expansion in the number of TRB genes. Similar expansions in other ruminant TR loci suggest strong evolutionary pressures in this lineage have selected for the development of enlarged sets of TR genes that can contribute to diverse TR repertoires. PMID:19393068
Zhang, Bing; Schmoyer, Denise; Kirov, Stefan; Snoddy, Jay
2004-01-01
Background Microarray and other high-throughput technologies are producing large sets of interesting genes that are difficult to analyze directly. Bioinformatics tools are needed to interpret the functional information in the gene sets. Results We have created a web-based tool for data analysis and data visualization for sets of genes called GOTree Machine (GOTM). This tool was originally intended to analyze sets of co-regulated genes identified from microarray analysis but is adaptable for use with other gene sets from other high-throughput analyses. GOTree Machine generates a GOTree, a tree-like structure to navigate the Gene Ontology Directed Acyclic Graph for input gene sets. This system provides user friendly data navigation and visualization. Statistical analysis helps users to identify the most important Gene Ontology categories for the input gene sets and suggests biological areas that warrant further study. GOTree Machine is available online at . Conclusion GOTree Machine has a broad application in functional genomic, proteomic and other high-throughput methods that generate large sets of interesting genes; its primary purpose is to help users sort for interesting patterns in gene sets. PMID:14975175
Ying, Mengchao; Kidou, Shin-Ichiro
2017-07-01
To adapt to cold conditions, barley plants rely on specific mechanisms, which have not been fully understood. In this study, we characterized a novel barley cold-induced gene identified using a PCR-based high coverage gene expression profiling method. The identified gene encodes a small protein that we named CISP1 (Cold-induced Small Protein 1). Homology searches of sequence databases revealed that CISP1 homologs (CISP2 and CISP3) exist in barley genome. Further database analyses showed that the CISP1 homologs were widely distributed in cold-tolerant plants such as wheat and rye. Quantitative reverse transcription PCR analyses indicated that the expression of barley CISP genes was markedly increased in roots exposed to cold conditions. In situ hybridization analyses showed that the CISP1 transcripts were localized in the root tip and lateral root primordium. We also demonstrated that the CISP1 protein bound to RNA. Taken together, these findings indicate that CISP1 and its homologs encoding small RNA-binding proteins may serve as RNA chaperones playing a vital role in the cold adaptation of barley root. This is the first report describing the likely close relationship between root-specific genes and the cold adaptation process, as well as the potential function of the identified genes. Copyright © 2017 Elsevier B.V. All rights reserved.
Maternal Pre-Pregnancy Obesity Is Associated with Altered Placental Transcriptome.
Altmäe, Signe; Segura, Maria Teresa; Esteban, Francisco J; Bartel, Sabine; Brandi, Pilar; Irmler, Martin; Beckers, Johannes; Demmelmair, Hans; López-Sabater, Carmen; Koletzko, Berthold; Krauss-Etschmann, Susanne; Campoy, Cristina
2017-01-01
Maternal obesity has a major impact on pregnancy outcomes. There is growing evidence that maternal obesity has a negative influence on placental development and function, thereby adversely influencing offspring programming and health outcomes. However, the molecular mechanisms underlying these processes are poorly understood. We analysed ten term placenta's whole transcriptomes in obese (n = 5) and normal weight women (n = 5), using the Affymetrix microarray platform. Analyses of expression data were carried out using non-parametric methods. Hierarchical clustering and principal component analysis showed a clear distinction in placental transcriptome between obese and normal weight women. We identified 72 differentially regulated genes, with most being down-regulated in obesity (n = 61). Functional analyses of the targets using DAVID and IPA confirm the dysregulation of previously identified processes and pathways in the placenta from obese women, including inflammation and immune responses, lipid metabolism, cancer pathways, and angiogenesis. In addition, we detected new molecular aspects of obesity-derived effects on the placenta, involving the glucocorticoid receptor signalling pathway and dysregulation of several genes including CCL2, FSTL3, IGFBP1, MMP12, PRG2, PRL, QSOX1, SERPINE2 and TAC3. Our global gene expression profiling approach demonstrates that maternal obesity creates a unique in utero environment that impairs the placental transcriptome.
htsint: a Python library for sequencing pipelines that combines data through gene set generation.
Richards, Adam J; Herrel, Anthony; Bonneaud, Camille
2015-09-24
Sequencing technologies provide a wealth of details in terms of genes, expression, splice variants, polymorphisms, and other features. A standard for sequencing analysis pipelines is to put genomic or transcriptomic features into a context of known functional information, but the relationships between ontology terms are often ignored. For RNA-Seq, considering genes and their genetic variants at the group level enables a convenient way to both integrate annotation data and detect small coordinated changes between experimental conditions, a known caveat of gene level analyses. We introduce the high throughput data integration tool, htsint, as an extension to the commonly used gene set enrichment frameworks. The central aim of htsint is to compile annotation information from one or more taxa in order to calculate functional distances among all genes in a specified gene space. Spectral clustering is then used to partition the genes, thereby generating functional modules. The gene space can range from a targeted list of genes, like a specific pathway, all the way to an ensemble of genomes. Given a collection of gene sets and a count matrix of transcriptomic features (e.g. expression, polymorphisms), the gene sets produced by htsint can be tested for 'enrichment' or conditional differences using one of a number of commonly available packages. The database and bundled tools to generate functional modules were designed with sequencing pipelines in mind, but the toolkit nature of htsint allows it to also be used in other areas of genomics. The software is freely available as a Python library through GitHub at https://github.com/ajrichards/htsint.
Snejdrlova, Michaela; Kalvach, Zdenek; Topinkova, Eva; Vrablik, Michal; Prochazkova, Renata; Kvasilova, Marie; Lanska, Vera; Zlatohlavek, Lukas; Prusikova, Martina; Ceska, Richard
2011-01-01
Life expectancy is determined by a combination of genetic predisposition (~25%) and environmental influences (~75%). Nevertheless a stronger genetic influence is anticipated in long-living individuals. Apolipoprotein E (APOE) gene belongs among the most studied candidate genes of longevity. We evaluated the relation of APOE polymorphism and fitness status in the elderly. We examined a total number of 128 subjects, over 80 years of age. Using a battery of functional tests their fitness status was assessed and the subjects were stratified into 5 functional categories according to Spirduso´s classification. Biochemistry analysis was performed by enzymatic method using automated analyzers. APOE gene polymorphism was analysed performed using PCR-RFLP. APOE4 allele carriers had significantly worse fitness status compared to non-carriers (p=0.025). Multiple logistic regression analysis showed the APOE4 carriers had higher risk (p=0.05) of functional unfitness compared to APOE2/E3 individuals. APOE gene polymorphism seems be an important genetic contributor to frailty development in the elderly. While APOE2 carriers tend to remain functionally fit till higher age, the functional status of APOE4 carriers deteriorates more rapidly. © 2011 Neuroendocrinology Letters
Martínez-del Campo, Ana; Bodea, Smaranda; Hamer, Hilary A; Marks, Jonathan A; Haiser, Henry J; Turnbaugh, Peter J; Balskus, Emily P
2015-04-14
Elucidation of the molecular mechanisms underlying the human gut microbiota's effects on health and disease has been complicated by difficulties in linking metabolic functions associated with the gut community as a whole to individual microorganisms and activities. Anaerobic microbial choline metabolism, a disease-associated metabolic pathway, exemplifies this challenge, as the specific human gut microorganisms responsible for this transformation have not yet been clearly identified. In this study, we established the link between a bacterial gene cluster, the choline utilization (cut) cluster, and anaerobic choline metabolism in human gut isolates by combining transcriptional, biochemical, bioinformatic, and cultivation-based approaches. Quantitative reverse transcription-PCR analysis and in vitro biochemical characterization of two cut gene products linked the entire cluster to growth on choline and supported a model for this pathway. Analyses of sequenced bacterial genomes revealed that the cut cluster is present in many human gut bacteria, is predictive of choline utilization in sequenced isolates, and is widely but discontinuously distributed across multiple bacterial phyla. Given that bacterial phylogeny is a poor marker for choline utilization, we were prompted to develop a degenerate PCR-based method for detecting the key functional gene choline TMA-lyase (cutC) in genomic and metagenomic DNA. Using this tool, we found that new choline-metabolizing gut isolates universally possessed cutC. We also demonstrated that this gene is widespread in stool metagenomic data sets. Overall, this work represents a crucial step toward understanding anaerobic choline metabolism in the human gut microbiota and underscores the importance of examining this microbial community from a function-oriented perspective. Anaerobic choline utilization is a bacterial metabolic activity that occurs in the human gut and is linked to multiple diseases. While bacterial genes responsible for choline fermentation (the cut gene cluster) have been recently identified, there has been no characterization of these genes in human gut isolates and microbial communities. In this work, we use multiple approaches to demonstrate that the pathway encoded by the cut genes is present and functional in a diverse range of human gut bacteria and is also widespread in stool metagenomes. We also developed a PCR-based strategy to detect a key functional gene (cutC) involved in this pathway and applied it to characterize newly isolated choline-utilizing strains. Both our analyses of the cut gene cluster and this molecular tool will aid efforts to further understand the role of choline metabolism in the human gut microbiota and its link to disease. Copyright © 2015 Martínez-del Campo et al.
Lu, Jianguo; Peatman, Eric; Tang, Haibao; Lewis, Joshua; Liu, Zhanjiang
2012-06-15
Gene duplication has had a major impact on genome evolution. Localized (or tandem) duplication resulting from unequal crossing over and whole genome duplication are believed to be the two dominant mechanisms contributing to vertebrate genome evolution. While much scrutiny has been directed toward discerning patterns indicative of whole-genome duplication events in teleost species, less attention has been paid to the continuous nature of gene duplications and their impact on the size, gene content, functional diversity, and overall architecture of teleost genomes. Here, using a Markov clustering algorithm directed approach we catalogue and analyze patterns of gene duplication in the four model teleost species with chromosomal coordinates: zebrafish, medaka, stickleback, and Tetraodon. Our analyses based on set size, duplication type, synonymous substitution rate (Ks), and gene ontology emphasize shared and lineage-specific patterns of genome evolution via gene duplication. Most strikingly, our analyses highlight the extraordinary duplication and retention rate of recent duplicates in zebrafish and their likely role in the structural and functional expansion of the zebrafish genome. We find that the zebrafish genome is remarkable in its large number of duplicated genes, small duplicate set size, biased Ks distribution toward minimal mutational divergence, and proportion of tandem and intra-chromosomal duplicates when compared with the other teleost model genomes. The observed gene duplication patterns have played significant roles in shaping the architecture of teleost genomes and appear to have contributed to the recent functional diversification and divergence of important physiological processes in zebrafish. We have analyzed gene duplication patterns and duplication types among the available teleost genomes and found that a large number of genes were tandemly and intrachromosomally duplicated, suggesting their origin of independent and continuous duplication. This is particularly true for the zebrafish genome. Further analysis of the duplicated gene sets indicated that a significant portion of duplicated genes in the zebrafish genome were of recent, lineage-specific duplication events. Most strikingly, a subset of duplicated genes is enriched among the recently duplicated genes involved in immune or sensory response pathways. Such findings demonstrated the significance of continuous gene duplication as well as that of whole genome duplication in the course of genome evolution.
Discovering functions of unannotated genes from a transcriptome survey of wild fungal isolates.
Ellison, Christopher E; Kowbel, David; Glass, N Louise; Taylor, John W; Brem, Rachel B
2014-04-01
Most fungal genomes are poorly annotated, and many fungal traits of industrial and biomedical relevance are not well suited to classical genetic screens. Assigning genes to phenotypes on a genomic scale thus remains an urgent need in the field. We developed an approach to infer gene function from expression profiles of wild fungal isolates, and we applied our strategy to the filamentous fungus Neurospora crassa. Using transcriptome measurements in 70 strains from two well-defined clades of this microbe, we first identified 2,247 cases in which the expression of an unannotated gene rose and fell across N. crassa strains in parallel with the expression of well-characterized genes. We then used image analysis of hyphal morphologies, quantitative growth assays, and expression profiling to test the functions of four genes predicted from our population analyses. The results revealed two factors that influenced regulation of metabolism of nonpreferred carbon and nitrogen sources, a gene that governed hyphal architecture, and a gene that mediated amino acid starvation resistance. These findings validate the power of our population-transcriptomic approach for inference of novel gene function, and we suggest that this strategy will be of broad utility for genome-scale annotation in many fungal systems. IMPORTANCE Some fungal species cause deadly infections in humans or crop plants, and other fungi are workhorses of industrial chemistry, including the production of biofuels. Advances in medical and industrial mycology require an understanding of the genes that control fungal traits. We developed a method to infer functions of uncharacterized genes by observing correlated expression of their mRNAs with those of known genes across wild fungal isolates. We applied this strategy to a filamentous fungus and predicted functions for thousands of unknown genes. In four cases, we experimentally validated the predictions from our method, discovering novel genes involved in the metabolism of nutrient sources relevant for biofuel production, as well as colony morphology and starvation resistance. Our strategy is straightforward, inexpensive, and applicable for predicting gene function in many fungal species.
Genomic and transcriptomic approaches to study immunology in cyprinids: What is next?
Petit, Jules; David, Lior; Dirks, Ron; Wiegertjes, Geert F
2017-10-01
Accelerated by the introduction of Next-Generation Sequencing (NGS), a number of genomes of cyprinid fish species have been drafted, leading to a highly valuable collective resource of comparative genome information on cyprinids (Cyprinidae). In addition, NGS-based transcriptome analyses of different developmental stages, organs, or cell types, increasingly contribute to the understanding of complex physiological processes, including immune responses. Cyprinids are a highly interesting family because they comprise one of the most-diversified families of teleosts and because of their variation in ploidy level, with diploid, triploid, tetraploid, hexaploid and sometimes even octoploid species. The wealth of data obtained from NGS technologies provides both challenges and opportunities for immunological research, which will be discussed here. Correct interpretation of ploidy effects on immune responses requires knowledge of the degree of functional divergence between duplicated genes, which can differ even between closely-related cyprinid fish species. We summarize NGS-based progress in analysing immune responses and discuss the importance of respecting the presence of (multiple) duplicated gene sequences when performing transcriptome analyses for detailed understanding of complex physiological processes. Progressively, advances in NGS technology are providing workable methods to further elucidate the implications of gene duplication events and functional divergence of duplicates genes and proteins involved in immune responses in cyprinids. We conclude with discussing how future applications of NGS technologies and analysis methods could enhance immunological research and understanding. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
Soler, Marçal; Camargo, Eduardo Leal Oliveira; Carocha, Victor; Cassan-Wang, Hua; San Clemente, Hélène; Savelli, Bruno; Hefer, Charles A; Paiva, Jorge A Pinto; Myburg, Alexander A; Grima-Pettenati, Jacqueline
2015-06-01
The R2R3-MYB family, one of the largest transcription factor families in higher plants, controls a wide variety of plant-specific processes including, notably, phenylpropanoid metabolism and secondary cell wall formation. We performed a genome-wide analysis of this superfamily in Eucalyptus, one of the most planted hardwood trees world-wide. A total of 141 predicted R2R3-MYB sequences identified in the Eucalyptus grandis genome sequence were subjected to comparative phylogenetic analyses with Arabidopsis thaliana, Oryza sativa, Populus trichocarpa and Vitis vinifera. We analysed features such as gene structure, conserved motifs and genome location. Transcript abundance patterns were assessed by RNAseq and validated by high-throughput quantitative PCR. We found some R2R3-MYB subgroups with expanded membership in E. grandis, V. vinifera and P. trichocarpa, and others preferentially found in woody species, suggesting diversification of specific functions in woody plants. By contrast, subgroups containing key genes regulating lignin biosynthesis and secondary cell wall formation are more conserved across all of the species analysed. In Eucalyptus, R2R3-MYB tandem gene duplications seem to disproportionately affect woody-preferential and woody-expanded subgroups. Interestingly, some of the genes belonging to woody-preferential subgroups show higher expression in the cambial region, suggesting a putative role in the regulation of secondary growth. © 2014 The Authors New Phytologist © 2014 New Phytologist Trust.
The Cryptochrome/Photolyase Family in aquatic organisms.
Oliveri, Paola; Fortunato, Antonio E; Petrone, Libero; Ishikawa-Fujiwara, Tomoko; Kobayashi, Yuri; Todo, Takeshi; Antonova, Olga; Arboleda, Enrique; Zantke, Juliane; Tessmar-Raible, Kristin; Falciatore, Angela
2014-04-01
The Cryptochrome/Photolyase Family (CPF) represents an ancient group of widely distributed UV-A/blue-light sensitive proteins sharing common structures and chromophores. During the course of evolution, different CPFs acquired distinct functions in DNA repair, light perception and circadian clock regulation. Previous phylogenetic analyses of the CPF have allowed reconstruction of the evolution and distribution of the different CPF super-classes in the tree of life. However, so far only limited information is available from the CPF orthologs in aquatic organisms that evolved in environments harboring great diversity of life forms and showing peculiar light distribution and rhythms. To gain new insights into the evolutionary and functional relationships within the CPF family, we performed a detailed study of CPF members from marine (diatoms, sea urchin and annelid) and freshwater organisms (teleost) that populate diverse habitats and exhibit different life strategies. In particular, we first extended the CPF family phylogeny by including genes from aquatic organisms representative of several branches of the tree of life. Our analysis identifies four major super-classes of CPF proteins and importantly singles out the presence of a plant-like CRY in diatoms and in metazoans. Moreover, we show a dynamic evolution of Cpf genes in eukaryotes with various events of gene duplication coupled to functional diversification and gene loss, which have shaped the complex array of Cpf genes in extant aquatic organisms. Second, we uncover clear rhythmic diurnal expression patterns and light-dependent regulation for the majority of the analyzed Cpf genes in our reference species. Our analyses reconstruct the molecular evolution of the CPF family in eukaryotes and provide a solid foundation for a systematic characterization of novel light activated proteins in aquatic environments. Copyright © 2014. Published by Elsevier B.V.
Dixit, Shalabh; Kumar Biswal, Akshaya; Min, Aye; Henry, Amelia; Oane, Rowena H.; Raorane, Manish L.; Longkumer, Toshisangba; Pabuayon, Isaiah M.; Mutte, Sumanth K.; Vardarajan, Adithi R.; Miro, Berta; Govindan, Ganesan; Albano-Enriquez, Blesilda; Pueffeld, Mandy; Sreenivasulu, Nese; Slamet-Loedin, Inez; Sundarvelpandian, Kalaipandian; Tsai, Yuan-Ching; Raghuvanshi, Saurabh; Hsing, Yue-Ie C.; Kumar, Arvind; Kohli, Ajay
2015-01-01
Sub-QTLs and multiple intra-QTL genes are hypothesized to underpin large-effect QTLs. Known QTLs over gene families, biosynthetic pathways or certain traits represent functional gene-clusters of genes of the same gene ontology (GO). Gene-clusters containing genes of different GO have not been elaborated, except in silico as coexpressed genes within QTLs. Here we demonstrate the requirement of multiple intra-QTL genes for the full impact of QTL qDTY12.1 on rice yield under drought. Multiple evidences are presented for the need of the transcription factor ‘no apical meristem’ (OsNAM12.1) and its co-localized target genes of separate GO categories for qDTY12.1 function, raising a regulon-like model of genetic architecture. The molecular underpinnings of qDTY12.1 support its effectiveness in further improving a drought tolerant genotype and for its validity in multiple genotypes/ecosystems/environments. Resolving the combinatorial value of OsNAM12.1 with individual intra-QTL genes notwithstanding, identification and analyses of qDTY12.1has fast-tracked rice improvement towards food security. PMID:26507552
Ikram, Sobia; Durandet, Monique; Vesa, Simona; Pereira, Serge; Guerche, Philippe; Bonhomme, Sandrine
2014-06-01
F-box protein genes family is one of the largest gene families in plants, with almost 700 predicted genes in the model plant Arabidopsis. F-box proteins are key components of the ubiquitin proteasome system that allows targeted protein degradation. Transcriptome analyses indicate that half of these F-box protein genes are found expressed in microspore and/or pollen, i.e., during male gametogenesis. To assess the role of F-box protein genes during this crucial developmental step, we selected 34 F-box protein genes recorded as highly and specifically expressed in pollen and isolated corresponding insertion mutants. We checked the expression level of each selected gene by RT-PCR and confirmed pollen expression for 25 genes, but specific expression for only 10 of the 34 F-box protein genes. In addition, we tested the expression level of selected F-box protein genes in 24 mutant lines and showed that 11 of them were null mutants. Transmission analysis of the mutations to the progeny showed that none of the single mutations was gametophytic lethal. These unaffected transmission efficiencies suggested leaky mutations or functional redundancy among F-box protein genes. Cytological observation of the gametophytes in the mutants confirmed these results. Combinations of mutations in F-box protein genes from the same subfamily did not lead to transmission defect either, further highlighting functional redundancy and/or a high proportion of pseudogenes among these F-box protein genes.
Genomics of the Effect of Spinal Cord Stimulation on an Animal Model of Neuropathic Pain.
Vallejo, Ricardo; Tilley, Dana M; Cedeño, David L; Kelley, Courtney A; DeMaegd, Margaret; Benyamin, Ramsin
2016-08-01
Few studies have evaluated single-gene changes modulated by spinal cord stimulation (SCS), providing a narrow understanding of molecular changes. Genomics allows for a robust analysis of holistic gene changes in response to stimulation. Rats were randomized into six groups to determine the effect of continuous SCS in uninjured and spared-nerve injury (SNI) animals. After behavioral assessment, tissues from the dorsal quadrant of the spinal cord (SC) and dorsal root ganglion (DRG) underwent full-genome microarray analyses. Weighted Gene Correlation Network Analysis (WGCNA), and Gene Ontology (GO) analysis identified similar expression patterns, molecular functions and biological processes for significant genes. Microarray analyses reported 20,985 gene probes in SC and 19,104 in DRG. WGCNA sorted 7449 SC and 4275 DRG gene probes into 29 and 9 modules, respectively. WGCNA provided significant modules from paired comparisons of experimental groups. GO analyses reported significant biological processes influenced by injury, as well as the presence of an electric field. The genes Tlr2, Cxcl16, and Cd68 were used to further validate the microarray based on significant response to SCS in SNI animals. They were up-regulated in the SC while both Tlr2 and Cd68 were up-regulated in the DRG. The process described provides highly significant interconnected genes and pathways responsive to injury and/or electric field in the SC and DRG. Genes in the SC respond significantly to the SCS in both injured and uninjured animals, while those in the DRG significantly responded to injury, and SCS in injured animals. © 2016 International Neuromodulation Society.
2013-01-01
Background Aspartic proteases (APs) are a large family of proteolytic enzymes found in almost all organisms. In plants, they are involved in many biological processes, such as senescence, stress responses, programmed cell death, and reproduction. Prior to the present study, no grape AP gene(s) had been reported, and their research on woody species was very limited. Results In this study, a total of 50 AP genes (VvAP) were identified in the grape genome, among which 30 contained the complete ASP domain. Synteny analysis within grape indicated that segmental and tandem duplication events contributed to the expansion of the grape AP family. Additional analysis between grape and Arabidopsis demonstrated that several grape AP genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grape and Arabidopsis. Phylogenetic relationships of the 30 VvAPs with the complete ASP domain and their Arabidopsis orthologs, as well as their gene and protein features were analyzed and their cellular localization was predicted. Moreover, expression profiles of VvAP genes in six different tissues were determined, and their transcript abundance under various stresses and hormone treatments were measured. Twenty-seven VvAP genes were expressed in at least one of the six tissues examined; nineteen VvAPs responded to at least one abiotic stress, 12 VvAPs responded to powdery mildew infection, and most of the VvAPs responded to SA and ABA treatments. Furthermore, integrated synteny and phylogenetic analysis identified orthologous AP genes between grape and Arabidopsis, providing a unique starting point for investigating the function of grape AP genes. Conclusions The genome-wide identification, evolutionary and expression analyses of grape AP genes provide a framework for future analysis of AP genes in defining their roles during stress response. Integrated synteny and phylogenetic analyses provide novel insight into the functions of less well-studied genes using information from their better understood orthologs. PMID:23945092
Impact of ontology evolution on functional analyses.
Groß, Anika; Hartung, Michael; Prüfer, Kay; Kelso, Janet; Rahm, Erhard
2012-10-15
Ontologies are used in the annotation and analysis of biological data. As knowledge accumulates, ontologies and annotation undergo constant modifications to reflect this new knowledge. These modifications may influence the results of statistical applications such as functional enrichment analyses that describe experimental data in terms of ontological groupings. Here, we investigate to what degree modifications of the Gene Ontology (GO) impact these statistical analyses for both experimental and simulated data. The analysis is based on new measures for the stability of result sets and considers different ontology and annotation changes. Our results show that past changes in the GO are non-uniformly distributed over different branches of the ontology. Considering the semantic relatedness of significant categories in analysis results allows a more realistic stability assessment for functional enrichment studies. We observe that the results of term-enrichment analyses tend to be surprisingly stable despite changes in ontology and annotation.
Rojas, Daniel; Rager, Julia E; Smeester, Lisa; Bailey, Kathryn A; Drobná, Zuzana; Rubio-Andrade, Marisela; Stýblo, Miroslav; García-Vargas, Gonzalo; Fry, Rebecca C
2015-01-01
Prenatal exposure to inorganic arsenic (iAs) is detrimental to the health of newborns and increases the risk of disease development later in life. Here we examined a subset of newborn cord blood leukocyte samples collected from subjects enrolled in the Biomarkers of Exposure to ARsenic (BEAR) pregnancy cohort in Gómez Palacio, Mexico, who were exposed to a range of drinking water arsenic concentrations (0.456-236 µg/l). Changes in iAs-associated DNA 5-methylcytosine methylation were assessed across 424,935 CpG sites representing 18,761 genes and compared with corresponding mRNA expression levels and birth outcomes. In the context of arsenic exposure, a total of 2919 genes were identified with iAs-associated differences in DNA methylation. Site-specific analyses identified DNA methylation changes that were most predictive of gene expression levels where CpG methylation within CpG islands positioned within the first exon, the 5' untranslated region and 200 bp upstream of the transcription start site yielded the most significant association with gene expression levels. A set of 16 genes was identified with correlated iAs-associated changes in DNA methylation and mRNA expression and all were highly enriched for binding sites of the early growth response (EGR) and CCCTC-binding factor (CTCF) transcription factors. Furthermore, DNA methylation levels of 7 of these genes were associated with differences in birth outcomes including gestational age and head circumference.These data highlight the complex interplay between DNA methylation, functional changes in gene expression and health outcomes and underscore the need for functional analyses coupled to epigenetic assessments. © The Author 2014. Published by Oxford University Press on behalf of the Society of Toxicology. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Retinoid-Related Orphan Receptor β and Transcriptional Control of Neuronal Differentiation.
Liu, Hong; Aramaki, Michihiko; Fu, Yulong; Forrest, Douglas
2017-01-01
The ability to generate neuronal diversity is central to the function of the nervous system. Here we discuss the key neurodevelopmental roles of retinoid-related orphan receptor β (RORβ) encoded by the Rorb (Nr1f2) gene. Recent studies have reported loss of function of the human RORB gene in cases of familial epilepsy and intellectual disability. Principal sites of expression of the Rorb gene in model species include sensory organs, the spinal cord, and brain regions that process sensory and circadian information. Genetic analyses in mice have indicated functions in circadian behavior, vision, and, at the cellular level, the differentiation of specific neuronal cell types. Studies in the retina and sensory areas of the cerebral cortex suggest that this orphan nuclear receptor acts at decisive steps in transcriptional hierarchies that determine neuronal diversity. 2017 Published by Elsevier Inc.
2009-01-01
Background A central task in contemporary biosciences is the identification of biological processes showing response in genome-wide differential gene expression experiments. Two types of analysis are common. Either, one generates an ordered list based on the differential expression values of the probed genes and examines the tail areas of the list for over-representation of various functional classes. Alternatively, one monitors the average differential expression level of genes belonging to a given functional class. So far these two types of method have not been combined. Results We introduce a scoring function, Gene Set Z-score (GSZ), for the analysis of functional class over-representation that combines two previous analysis methods. GSZ encompasses popular functions such as correlation, hypergeometric test, Max-Mean and Random Sets as limiting cases. GSZ is stable against changes in class size as well as across different positions of the analysed gene list in tests with randomized data. GSZ shows the best overall performance in a detailed comparison to popular functions using artificial data. Likewise, GSZ stands out in a cross-validation of methods using split real data. A comparison of empirical p-values further shows a strong difference in favour of GSZ, which clearly reports better p-values for top classes than the other methods. Furthermore, GSZ detects relevant biological themes that are missed by the other methods. These observations also hold when comparing GSZ with popular program packages. Conclusion GSZ and improved versions of earlier methods are a useful contribution to the analysis of differential gene expression. The methods and supplementary material are available from the website http://ekhidna.biocenter.helsinki.fi/users/petri/public/GSZ/GSZscore.html. PMID:19775443
Identification of potential target genes of ROR-alpha in THP1 and HUVEC cell lines.
Gulec, Cagri; Coban, Neslihan; Ozsait-Selcuk, Bilge; Sirma-Ekmekci, Sema; Yildirim, Ozlem; Erginel-Unaltuna, Nihan
2017-04-01
ROR-alpha is a nuclear receptor, activity of which can be modulated by natural or synthetic ligands. Due to its possible involvement in, and potential therapeutic target for atherosclerosis, we aimed to identify ROR-alpha target genes in monocytic and endothelial cell lines. We performed chromatin immunoprecipitation (ChIP) followed by tiling array (ChIP-on-chip) for ROR-alpha in monocytic cell line THP1 and endothelial cell line HUVEC. Following bioinformatic analysis of the array data, we tested four candidate genes in terms of dependence of their expression level on ligand-mediated ROR-alpha activity, and two of them in terms of promoter occupancy by ROR-alpha. Bioinformatic analyses of ChIP-on-chip data suggested that ROR-alpha binds to genomic regions near the transcription start site (TSS) of more than 3000 genes in THP1 and HUVEC. Potential ROR-alpha target genes in both cell types seem to be involved mainly in membrane receptor activity, signal transduction and ion transport. While SPP1 and IKBKA were shown to be direct target genes of ROR-alpha in THP1 monocytes, inflammation related gene HMOX1 and heat shock protein gene HSPA8 were shown to be potential target genes of ROR-alpha. Our results suggest that ROR-alpha may regulate signaling receptor activity, and transmembrane transport activity through its potential target genes. ROR-alpha seems also to play role in cellular sensitivity to environmental substances like arsenite and chloroprene. Although, the expression analyses have shown that synthetic ROR-alpha ligands can modulate some of potential ROR-alpha target genes, functional significance of ligand-dependent modulation of gene expression needs to be confirmed with further analyses. Copyright © 2017 Elsevier Inc. All rights reserved.
The double-stranded transcriptome of Escherichia coli.
Lybecker, Meghan; Zimmermann, Bob; Bilusic, Ivana; Tukhtubaeva, Nadezda; Schroeder, Renée
2014-02-25
Advances in high-throughput transcriptome analyses have revealed hundreds of antisense RNAs (asRNAs) for many bacteria, although few have been characterized, and the number of functional asRNAs remains unknown. We have developed a genome-wide high-throughput method to identify functional asRNAs in vivo. Most mechanisms of gene regulation via asRNAs require an RNA-RNA interaction with its target RNA, and we hypothesized that a functional asRNA would be found in a double strand (dsRNA), duplexed with its cognate RNA in a single cell. We developed a method of isolating dsRNAs from total RNA by immunoprecipitation with a ds-RNA specific antibody. Total RNA and immunoprecipitated dsRNA from Escherichia coli RNase III WT and mutant strains were deep-sequenced. A statistical model was applied to filter for biologically relevant dsRNA regions, which were subsequently categorized by location relative to annotated genes. A total of 316 potentially functional asRNAs were identified in the RNase III mutant strain and are encoded primarily opposite to the 5' ends of transcripts, but are also found opposite ncRNAs, gene junctions, and the 3' ends. A total of 21 sense/antisense RNA pairs identified in dsRNAs were confirmed by Northern blot analyses. Most of the RNA steady-state levels were higher or detectable only in the RNase III mutant strain. Taken together, our data indicate that a significant amount of dsRNA is formed in the cell, that RNase III degrades or processes these dsRNAs, and that dsRNA plays a major role in gene regulation in E. coli.
USDA-ARS?s Scientific Manuscript database
In soybean, variegated flowers can be caused by somatic excision of the CACTA-type transposable element Tgm9 from intron 2 of the DFR2 gene encoding dihydroflavonol-4-reductase in the anthocyanin pigment biosynthetic pathway. DFR2 has been mapped to the W4 locus where the allele containing the elem...
Zhang, Tingting; Hu, Shuhao; Yan, Caixia; Li, Chunjuan; Zhao, Xiaobo; Wan, Shubo; Shan, Shihua
2017-02-01
In the present investigation, a total of 60 conserved peanut (Arachis hypogaea L.) microRNA (miRNA) sequences, belonging to 16 families, were identified using bioinformatics methods. There were 392 target gene sequences, identified from 58 miRNAs with Target-align software and BLASTx analyses. Gene Ontology (GO) functional analysis suggested that these target genes were involved in mediating peanut growth and development, signal transduction and stress resistance. There were 55 miRNA sequences, verified employing a poly (A) tailing test, with a success rate of up to 91.67%. Twenty peanut target gene sequences were randomly selected, and the 5' rapid amplification of the cDNA ends (5'-RACE) method were used to validate the cleavage sites of these target genes. Of these, 14 (70%) peanut miRNA targets were verified by means of gel electrophoresis, cloning and sequencing. Furthermore, functional analysis and homologous sequence retrieval were conducted for target gene sequences, and 26 target genes were chosen as the objects for stress resistance experimental study. Real-time fluorescence quantitative PCR (qRT-PCR) technology was applied to measure the expression level of resistance-associated miRNAs and their target genes in peanut exposed to Aspergillus flavus (A. flavus) infection and drought stress, respectively. In consequence, 5 groups of miRNAs & targets were found accorded with the mode of miRNA negatively controlling the expression of target genes. This study, preliminarily determined the biological functions of some resistance-associated miRNAs and their target genes in peanut. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Zhao, Lihua; He, Jiangman; Cai, Hanyang; Lin, Haiyan; Li, Yanqiang; Liu, Renyi; Yang, Zhenbiao; Qin, Yuan
2014-11-01
Megasporogenesis is essential for female fertility, and requires the accomplishment of meiosis and the formation of functional megaspores. The inaccessibility and low abundance of female meiocytes make it particularly difficult to elucidate the molecular basis underlying megasporogenesis. We used high-throughput tag-sequencing analysis to identify genes expressed in female meiocytes (FMs) by comparing gene expression profiles from wild-type ovules undergoing megasporogenesis with those from the spl mutant ovules, which lack megasporogenesis. A total of 862 genes were identified as FMs, with levels that are consistently reduced in spl ovules in two biological replicates. Fluorescence-assisted cell sorting followed by RNA-seq analysis of DMC1:GFP-labeled female meiocytes confirmed that 90% of the FMs are indeed detected in the female meiocyte protoplast profiling. We performed reverse genetic analysis of 120 candidate genes and identified four FM genes with a function in female meiosis progression in Arabidopsis. We further revealed that KLU, a putative cytochrome P450 monooxygenase, is involved in chromosome pairing during female meiosis, most likely by affecting the normal expression pattern of DMC1 in ovules during female meiosis. Our studies provide valuable information for functional genomic analyses of plant germline development as well as insights into meiosis. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.
Zhou, Chuanen; Han, Lu; Li, Guifen; Chai, Maofeng; Fu, Chunxiang; Cheng, Xiaofei; Wen, Jiangqi; Tang, Yuhong; Wang, Zeng-Yu
2014-01-01
Class I KNOTTED-like homeobox (KNOXI) genes are critical for the maintenance of the shoot apical meristem. The expression domain of KNOXI is regulated by ASYMMETRIC LEAVES1/ROUGHSHEATH2/PHANTASTICA (ARP) genes, which are associated with leaf morphology. In the inverted repeat-lacking clade (IRLC) of Fabaceae, the orthologs of LEAFY (LFY) function in place of KNOXI to regulate compound leaf development. Here, we characterized loss-of-function mutants of ARP (PHAN) and SHOOTMERISTEMLESS (STM)- and BREVIPEDICELLUS (BP)-like KNOXI in the model IRLC legume species Medicago truncatula. The function of ARP genes is species specific. The repression of STM/BP-like KNOXI genes in leaves is not mediated by PHAN, and no suppression of PHAN by STM/BP-like KNOXI genes was observed either, indicating that STM/BP-like KNOXI genes are uncoupled from PHAN in M. truncatula. Furthermore, comparative analyses of phenotypic output in response to ectopic expression of KNOXI and the M. truncatula LFY ortholog, SINGLE LEAFLET1 (SGL1), reveal that KNOXI and SGL1 regulate parallel pathways in leaf development. We propose that SGL1 probably functions in a stage-specific manner in the regulation of the indeterminate state of developing leaves in M. truncatula. PMID:24781113
Diepeveen, Eveline T; Kim, Fabienne D; Salzburger, Walter
2013-07-17
Gen(om)e duplication events are hypothesized as key mechanisms underlying the origin of phenotypic diversity and evolutionary innovation. The diverse and species-rich lineage of teleost fishes is a renowned example of this scenario, because of the fish-specific genome duplication. Gene families, generated by this and other gene duplication events, have been previously found to play a role in the evolution and development of innovations in cichlid fishes - a prime model system to study the genetic basis of rapid speciation, adaptation and evolutionary innovation. The distal-less homeobox genes are particularly interesting candidate genes for evolutionary novelties, such as the pharyngeal jaw apparatus and the anal fin egg-spots. Here we study the dlx repertoire in 23 East African cichlid fishes to determine the rate of evolution and the signatures of selection pressure. Four intact dlx clusters were retrieved from cichlid draft genomes. Phylogenetic analyses of these eight dlx loci in ten teleost species, followed by an in-depth analysis of 23 East African cichlid species, show that there is disparity in the rates of evolution of the dlx paralogs. Dlx3a and dlx4b are the fastest evolving dlx genes, while dlx1a and dlx6a evolved more slowly. Subsequent analyses of the nonsynonymous-synonymous substitution rate ratios indicate that dlx3b, dlx4a and dlx5a evolved under purifying selection, while signs of positive selection were found for dlx1a, dlx2a, dlx3a and dlx4b. Our results indicate that the dlx repertoire of teleost fishes and cichlid fishes in particular, is shaped by differential selection pressures and rates of evolution after gene duplication. Although the divergence of the dlx paralogs are putative signs of new or altered functions, comparisons with available expression patterns indicate that the three dlx loci under strong purifying selection, dlx3b, dlx4a and dlx5a, are transcribed at high levels in the cichlids' pharyngeal jaw and anal fin. The dlx paralogs emerge as excellent candidate genes for the development of evolutionary innovations in cichlids, although further functional analyses are necessary to elucidate their respective contribution.
Grim, Christopher J.; Kozlova, Elena V.; Sha, Jian; Fitts, Eric C.; van Lier, Christina J.; Kirtley, Michelle L.; Joseph, Sandeep J.; Read, Timothy D.; Burd, Eileen M.; Tall, Ben D.; Joseph, Sam W.; Horneman, Amy J.; Chopra, Ashok K.; Shak, Joshua R.
2013-01-01
ABSTRACT Aeromonas hydrophila has increasingly been implicated as a virulent and antibiotic-resistant etiologic agent in various human diseases. In a previously published case report, we described a subject with a polymicrobial wound infection that included a persistent and aggressive strain of A. hydrophila (E1), as well as a more antibiotic-resistant strain of A. hydrophila (E2). To better understand the differences between pathogenic and environmental strains of A. hydrophila, we conducted comparative genomic and functional analyses of virulence-associated genes of these two wound isolates (E1 and E2), the environmental type strain A. hydrophila ATCC 7966T, and four other isolates belonging to A. aquariorum, A. veronii, A. salmonicida, and A. caviae. Full-genome sequencing of strains E1 and E2 revealed extensive differences between the two and strain ATCC 7966T. The more persistent wound infection strain, E1, harbored coding sequences for a cytotoxic enterotoxin (Act), a type 3 secretion system (T3SS), flagella, hemolysins, and a homolog of exotoxin A found in Pseudomonas aeruginosa. Corresponding phenotypic analyses with A. hydrophila ATCC 7966T and SSU as reference strains demonstrated the functionality of these virulence genes, with strain E1 displaying enhanced swimming and swarming motility, lateral flagella on electron microscopy, the presence of T3SS effector AexU, and enhanced lethality in a mouse model of Aeromonas infection. By combining sequence-based analysis and functional assays, we characterized an A. hydrophila pathotype, exemplified by strain E1, that exhibited increased virulence in a mouse model of infection, likely because of encapsulation, enhanced motility, toxin secretion, and cellular toxicity. PMID:23611906
Jung, Kwang-Woo; Yang, Dong-Hoon; Kim, Min-Kyu; Seo, Ho Seong
2016-01-01
ABSTRACT The basidiomycetous fungus Cryptococcus neoformans has been known to be highly radiation resistant and has been found in fatal radioactive environments such as the damaged nuclear reactor at Chernobyl. To elucidate the mechanisms underlying the radiation resistance phenotype of C. neoformans, we identified genes affected by gamma radiation through genome-wide transcriptome analysis and characterized their functions. We found that genes involved in DNA damage repair systems were upregulated in response to gamma radiation. Particularly, deletion of recombinase RAD51 and two DNA-dependent ATPase genes, RAD54 and RDH54, increased cellular susceptibility to both gamma radiation and DNA-damaging agents. A variety of oxidative stress response genes were also upregulated. Among them, sulfiredoxin contributed to gamma radiation resistance in a peroxiredoxin/thioredoxin-independent manner. Furthermore, we found that genes involved in molecular chaperone expression, ubiquitination systems, and autophagy were induced, whereas genes involved in the biosynthesis of proteins and fatty acids/sterols were downregulated. Most importantly, we discovered a number of novel C. neoformans genes, the expression of which was modulated by gamma radiation exposure, and their deletion rendered cells susceptible to gamma radiation exposure, as well as DNA damage insults. Among these genes, we found that a unique transcription factor containing the basic leucine zipper domain, named Bdr1, served as a regulator of the gamma radiation resistance of C. neoformans by controlling expression of DNA repair genes, and its expression was regulated by the evolutionarily conserved DNA damage response protein kinase Rad53. Taken together, the current transcriptome and functional analyses contribute to the understanding of the unique molecular mechanism of the radiation-resistant fungus C. neoformans. PMID:27899501
Lin, Choun-Sea; Chen, Jeremy J W; Chiu, Chi-Chou; Hsiao, Han C W; Yang, Chen-Jui; Jin, Xiao-Hua; Leebens-Mack, James; de Pamphilis, Claude W; Huang, Yao-Ting; Yang, Ling-Hung; Chang, Wan-Jung; Kui, Ling; Wong, Gane Ka-Shu; Hu, Jer-Ming; Wang, Wen; Shih, Ming-Che
2017-06-01
The chloroplast NAD(P)H dehydrogenase-like (NDH) complex consists of about 30 subunits from both the nuclear and chloroplast genomes and is ubiquitous across most land plants. In some orchids, such as Phalaenopsis equestris, Dendrobium officinale and Dendrobium catenatum, most of the 11 chloroplast genome-encoded ndh genes (cp-ndh) have been lost. Here we investigated whether functional cp-ndh genes have been completely lost in these orchids or whether they have been transferred and retained in the nuclear genome. Further, we assessed whether both cp-ndh genes and nucleus-encoded NDH-related genes can be lost, resulting in the absence of the NDH complex. Comparative analyses of the genome of Apostasia odorata, an orchid species with a complete complement of cp-ndh genes which represents the sister lineage to all other orchids, and three published orchid genome sequences for P. equestris, D. officinale and D. catenatum, which are all missing cp-ndh genes, indicated that copies of cp-ndh genes are not present in any of these four nuclear genomes. This observation suggests that the NDH complex is not necessary for some plants. Comparative genomic/transcriptomic analyses of currently available plastid genome sequences and nuclear transcriptome data showed that 47 out of 660 photoautotrophic plants and all the heterotrophic plants are missing plastid-encoded cp-ndh genes and exhibit no evidence for maintenance of a functional NDH complex. Our data indicate that the NDH complex can be lost in photoautotrophic plant species. Further, the loss of the NDH complex may increase the probability of transition from a photoautotrophic to a heterotrophic life history. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
Global functional analyses of cellular responses to pore-forming toxins.
Kao, Cheng-Yuan; Los, Ferdinand C O; Huffman, Danielle L; Wachi, Shinichiro; Kloft, Nicole; Husmann, Matthias; Karabrahimi, Valbona; Schwartz, Jean-Louis; Bellier, Audrey; Ha, Christine; Sagong, Youn; Fan, Hui; Ghosh, Partho; Hsieh, Mindy; Hsu, Chih-Shen; Chen, Li; Aroian, Raffi V
2011-03-01
Here we present the first global functional analysis of cellular responses to pore-forming toxins (PFTs). PFTs are uniquely important bacterial virulence factors, comprising the single largest class of bacterial protein toxins and being important for the pathogenesis in humans of many Gram positive and Gram negative bacteria. Their mode of action is deceptively simple, poking holes in the plasma membrane of cells. The scattered studies to date of PFT-host cell interactions indicate a handful of genes are involved in cellular defenses to PFTs. How many genes are involved in cellular defenses against PFTs and how cellular defenses are coordinated are unknown. To address these questions, we performed the first genome-wide RNA interference (RNAi) screen for genes that, when knocked down, result in hypersensitivity to a PFT. This screen identifies 106 genes (∼0.5% of genome) in seven functional groups that protect Caenorhabditis elegans from PFT attack. Interactome analyses of these 106 genes suggest that two previously identified mitogen-activated protein kinase (MAPK) pathways, one (p38) studied in detail and the other (JNK) not, form a core PFT defense network. Additional microarray, real-time PCR, and functional studies reveal that the JNK MAPK pathway, but not the p38 MAPK pathway, is a key central regulator of PFT-induced transcriptional and functional responses. We find C. elegans activator protein 1 (AP-1; c-jun, c-fos) is a downstream target of the JNK-mediated PFT protection pathway, protects C. elegans against both small-pore and large-pore PFTs and protects human cells against a large-pore PFT. This in vivo RNAi genomic study of PFT responses proves that cellular commitment to PFT defenses is enormous, demonstrates the JNK MAPK pathway as a key regulator of transcriptionally-induced PFT defenses, and identifies AP-1 as the first cellular component broadly important for defense against large- and small-pore PFTs.
Exercise training improves obesity‐related lymphatic dysfunction
Hespe, Geoffrey E.; Kataru, Raghu P.; Savetsky, Ira L.; García Nores, Gabriela D.; Torrisi, Jeremy S.; Nitti, Matthew D.; Gardenier, Jason C.; Zhou, Jie; Yu, Jessie Z.; Jones, Lee W.
2016-01-01
Key points Obesity results in perilymphatic inflammation and lymphatic dysfunction.Lymphatic dysfunction in obesity is characterized by decreased lymphatic vessel density, decreased collecting lymphatic vessel pumping frequency, decreased lymphatic trafficking of immune cells, increased lymphatic vessel leakiness and changes in the gene expression patterns of lymphatic endothelial cells.Aerobic exercise, independent of weight loss, decreases perilymphatic inflammatory cell accumulation, improves lymphatic function and reverses pathological changes in gene expression in lymphatic endothelial cells. Abstract Although previous studies have shown that obesity markedly decreases lymphatic function, the cellular mechanisms that regulate this response remain unknown. In addition, it is unclear whether the pathological effects of obesity on the lymphatic system are reversible with behavioural modifications. The purpose of this study, therefore, was to analyse lymphatic vascular changes in obese mice and to determine whether these pathological effects are reversible with aerobic exercise. We randomized obese mice to either aerobic exercise (treadmill running for 30 min per day, 5 days a week, for 6 weeks) or a sedentary group that was not exercised and analysed lymphatic function using a variety of outcomes. We found that sedentary obese mice had markedly decreased collecting lymphatic vessel pumping capacity, decreased lymphatic vessel density, decreased lymphatic migration of immune cells, increased lymphatic vessel leakiness and decreased expression of lymphatic specific markers compared with lean mice (all P < 0.01). Aerobic exercise did not cause weight loss but markedly improved lymphatic function compared with sedentary obese mice. Exercise had a significant anti‐inflammatory effect, resulting in decreased perilymphatic accumulation of inflammatory cells and inducible nitric oxide synthase expression. In addition, exercise normalized isolated lymphatic endothelial cell gene expression of lymphatic specific genes, including VEGFR‐3 and Prox1. Taken together, our findings suggest that obesity impairs lymphatic function via multiple mechanisms and that these pathological changes can be reversed, in part, with aerobic exercise, independent of weight loss. In addition, our study shows that obesity‐induced lymphatic endothelial cell gene expression changes are reversible with behavioural modifications. PMID:26931178
Liu, Y T; Li, S R; Wang, Z; Xiao, J Z
2016-09-13
Objective: To profile the gene expression changes associated with endoplasmic reticulum stress in INS-1-3 cells induced by thapsigargin (TG) and tunicamycin (TM). Methods: Normal cultured INS-1-3 cells were used as a control. TG and TM were used to induce endoplasmic reticulum stress in INS-1-3 cells. Digital gene expression profiling technique was used to detect differentially expressed gene. The changes of gene expression were detected by expression pattern clustering analysis, gene ontology (GO) function and pathway enrichment analysis. Real time polymerase chain reaction (RT-PCR) was used to verify the key changes of gene expression. Results: Compared with the control group, there were 57 (45 up-regulated, 12 down-regulated) and 135 (99 up-regulated, 36 down-regulated) differentially expressed genes in TG and TM group, respectively. GO function enrichment analyses indicated that the main enrichment was in the endoplasmic reticulum. In signaling pathway analysis, the identified pathways were related with endoplasmic reticulum stress, antigen processing and presentation, protein export, and most of all, the maturity onset diabetes of the young (MODY) pathway. Conclusion: Under the condition of endoplasmic reticulum stress, the related expression changes of transcriptional factors in MODY signaling pathway may be related with the impaired function in islet beta cells.
Liu, Shiming; Su, Zhaobing; Tan, Sainan; Ni, Bin; Pan, Hong; Liu, Beihong; Wang, Jing; Xiao, Jianmin; Chen, Qiuhong
2017-08-01
CITED2 gene is an important cardiac transcription factor that plays a fundamental role in the formation and development of embryonic cardiovascular. Previous studies have showed that knock-out of CITED2 in mice might result in various cardiac malformations. However, the mechanisms of CITED2 mutation on congenital heart disease (CHD) in Chinese Tibetan population are still poorly understood. In the present study, 187 unrelated Tibetan patients with CHD and 200 unrelated Tibetan healthy controls were screened for variants in the CITED2 gene; we subsequently identified one potential disease-causing mutation p.G143A in a 6-year-old girl with PDA and functional analyses of the mutation were carried out. Our study showed that the novel mutation of CITED2 significantly enhanced the expression activity of vascular endothelial growth factor (VEGF) under the role of co-receptor hypoxia inducible factor 1-aipha (HIF-1A), which is closely related with embryonic cardiac development. As a result, CITED2 gene mutation may play a significant role in the development of pediatric congenital heart disease.
MAISTAS: a tool for automatic structural evaluation of alternative splicing products.
Floris, Matteo; Raimondo, Domenico; Leoni, Guido; Orsini, Massimiliano; Marcatili, Paolo; Tramontano, Anna
2011-06-15
Analysis of the human genome revealed that the amount of transcribed sequence is an order of magnitude greater than the number of predicted and well-characterized genes. A sizeable fraction of these transcripts is related to alternatively spliced forms of known protein coding genes. Inspection of the alternatively spliced transcripts identified in the pilot phase of the ENCODE project has clearly shown that often their structure might substantially differ from that of other isoforms of the same gene, and therefore that they might perform unrelated functions, or that they might even not correspond to a functional protein. Identifying these cases is obviously relevant for the functional assignment of gene products and for the interpretation of the effect of variations in the corresponding proteins. Here we describe a publicly available tool that, given a gene or a protein, retrieves and analyses all its annotated isoforms, provides users with three-dimensional models of the isoform(s) of his/her interest whenever possible and automatically assesses whether homology derived structural models correspond to plausible structures. This information is clearly relevant. When the homology model of some isoforms of a gene does not seem structurally plausible, the implications are that either they assume a structure unrelated to that of the other isoforms of the same gene with presumably significant functional differences, or do not correspond to functional products. We provide indications that the second hypothesis is likely to be true for a substantial fraction of the cases. http://maistas.bioinformatica.crs4.it/.
Kim, Unkyu; Siegel, Rachael; Ren, Xiaodi; Gunther, Cary S; Gaasterland, Terry; Roeder, Robert G
2003-07-22
The tissue-specific transcriptional coactivator OCA-B is required for antigen-dependent B cell differentiation events, including germinal center formation. However, the identity of OCA-B target genes involved in this process is unknown. This study has used large-scale cDNA arrays to monitor changes in gene expression patterns that accompany mature B cell differentiation. B cell receptor ligation alone induces many genes involved in B cell expansion, whereas B cell receptor and helper T cell costimulation induce genes associated with B cell effector function. OCA-B expression is induced by both B cell receptor ligation alone and helper T cell costimulation, suggesting that OCA-B is involved in B cell expansion as well as B cell function. Accordingly, several genes involved in cell proliferation and signaling, such as Lck, Kcnn4, Cdc37, cyclin D3, B4galt1, and Ms4a11, have been identified as OCA-B-dependent genes. Further studies on the roles played by these genes in B cells will contribute to an understanding of B cell differentiation.
Genomics screens for metastasis genes
Yan, Jinchun; Huang, Qihong
2014-01-01
Metastasis is responsible for most cancer mortality. The process of metastasis is complex, requiring the coordinated expression and fine regulation of many genes in multiple pathways in both the tumor and host tissues. Identification and characterization of the genetic programs that regulate metastasis is critical to understanding the metastatic process and discovering molecular targets for the prevention and treatment of metastasis. Genomic approaches and functional genomic analyses can systemically discover metastasis genes. In this review, we summarize the genetic tools and methods that have been used to identify and characterize the genes that play critical roles in metastasis. PMID:22684367
Evolution of Prdm Genes in Animals: Insights from Comparative Genomics
Vervoort, Michel; Meulemeester, David; Béhague, Julien; Kerner, Pierre
2016-01-01
Prdm genes encode transcription factors with a subtype of SET domain known as the PRDF1-RIZ (PR) homology domain and a variable number of zinc finger motifs. These genes are involved in a wide variety of functions during animal development. As most Prdm genes have been studied in vertebrates, especially in mice, little is known about the evolution of this gene family. We searched for Prdm genes in the fully sequenced genomes of 93 different species representative of all the main metazoan lineages. A total of 976 Prdm genes were identified in these species. The number of Prdm genes per species ranges from 2 to 19. To better understand how the Prdm gene family has evolved in metazoans, we performed phylogenetic analyses using this large set of identified Prdm genes. These analyses allowed us to define 14 different subfamilies of Prdm genes and to establish, through ancestral state reconstruction, that 11 of them are ancestral to bilaterian animals. Three additional subfamilies were acquired during early vertebrate evolution (Prdm5, Prdm11, and Prdm17). Several gene duplication and gene loss events were identified and mapped onto the metazoan phylogenetic tree. By studying a large number of nonmetazoan genomes, we confirmed that Prdm genes likely constitute a metazoan-specific gene family. Our data also suggest that Prdm genes originated before the diversification of animals through the association of a single ancestral SET domain encoding gene with one or several zinc finger encoding genes. PMID:26560352
Lira-Albarrán, Saúl; Durand, Marta; Barrera, David; Vega, Claudia; Becerra, Rocio García; Díaz, Lorenza; García-Quiroz, Janice; Rangel, Claudia; Larrea, Fernando
2018-04-27
In order to get further information on the effects of ulipristal acetate (UPA) upon the process of decidualization of endometrium, a functional analysis of the differentially expressed genes in endometrium (DEG) from UPA treated-versus control-cycles of normal ovulatory women was performed. A list of 1183 endometrial DEG, from a previously published study by our group, was submitted to gene ontology, gene enrichment and ingenuity pathway analyses (IPA). This functional analysis showed that decidualization was a biological process overrepresented. Gene set enrichment analysis identified LIF, PRL, IL15 and STAT3 among the most down-regulated genes within the JAK STAT canonical pathway. IPA showed that decidualization of uterus was a bio-function predicted as inhibited by UPA. The results demonstrated that this selective progesterone receptor modulator, when administered during the periovulatory phase of the menstrual cycle, may affect the molecular mechanisms leading to endometrial decidualization in response to progesterone during the period of maximum embryo receptivity. Copyright © 2018 Elsevier B.V. All rights reserved.
Extracting Fitness Relationships and Oncogenic Patterns among Driver Genes in Cancer.
Zhang, Xindong; Gao, Lin; Jia, Songwei
2017-12-25
Driver mutation provides fitness advantage to cancer cells, the accumulation of which increases the fitness of cancer cells and accelerates cancer progression. This work seeks to extract patterns accumulated by driver genes ("fitness relationships") in tumorigenesis. We introduce a network-based method for extracting the fitness relationships of driver genes by modeling the network properties of the "fitness" of cancer cells. Colon adenocarcinoma (COAD) and skin cutaneous malignant melanoma (SKCM) are employed as case studies. Consistent results derived from different background networks suggest the reliability of the identified fitness relationships. Additionally co-occurrence analysis and pathway analysis reveal the functional significance of the fitness relationships with signaling transduction. In addition, a subset of driver genes called the "fitness core" is recognized for each case. Further analyses indicate the functional importance of the fitness core in carcinogenesis, and provide potential therapeutic opportunities in medicinal intervention. Fitness relationships characterize the functional continuity among driver genes in carcinogenesis, and suggest new insights in understanding the oncogenic mechanisms of cancers, as well as providing guiding information for medicinal intervention.
Guselnikov, S.V.; Grayfer, L.; De Jesús Andino, F.; Rogozin, I.B.; Robert, J.; Taranin, A.V.
2015-01-01
The ITAM-bearing transmembrane signaling subunits (TSS) are indispensable components of activating leukocyte receptor complexes. The TSS-encoding genes map to paralogous chromosomal regions, which are thought to arise from ancient genome tetraploidization(s). To assess a possible role of tetraploidization in the TSS evolution, we studied TSS and other functionally linked genes in the amphibian species Xenopus laevis whose genome was duplicated about 40 MYR ago. We found that X. laevis has retained a duplicated set of sixteen TSS genes, all except one being transcribed. Furthermore, duplicated TCRα loci and genes encoding TSS-coupling protein kinases have also been retained. No clear evidence for functional divergence of the TSS paralogs was obtained from gene expression and sequence analyses. We suggest that the main factor of maintenance of duplicated TSS genes in X. laevis was a protein dosage effect and that this effect might have facilitated the TSS set expansion in early vertebrates. PMID:26170006
Orthologs, paralogs and genome comparisons
NASA Technical Reports Server (NTRS)
Gogarten, J. P.; Olendzenski, L.
1999-01-01
During the past decade, ancient gene duplications were recognized as one of the main forces in the generation of diverse gene families and the creation of new functional capabilities. New tools developed to search data banks for homologous sequences, and an increased availability of reliable three-dimensional structural information led to the recognition that proteins with diverse functions can belong to the same superfamily. Analyses of the evolution of these superfamilies promises to provide insights into early evolution but are complicated by several important evolutionary processes. Horizontal transfer of genes can lead to a vertical spread of innovations among organisms, therefore finding a certain property in some descendants of an ancestor does not guarantee that it was present in that ancestor. Complete or partial gene conversion between duplicated genes can yield phylogenetic trees with several, apparently independent gene duplications, suggesting an often surprising parallelism in the evolution of independent lineages. Additionally, the breakup of domains within a protein and the fusion of domains into multifunctional proteins makes the delineation of superfamilies a task that remains difficult to automate.
Evolution of substrate specificity in a retained enzyme driven by gene loss
Juárez-Vázquez, Ana Lilia; Edirisinghe, Janaka N; Verduzco-Castro, Ernesto A; Michalska, Karolina; Wu, Chenggang; Noda-García, Lianet; Babnigg, Gyorgy; Endres, Michael; Medina-Ruíz, Sofía; Santoyo-Flores, Julián; Carrillo-Tripp, Mauricio; Ton-That, Hung; Joachimiak, Andrzej; Henry, Christopher S; Barona-Gómez, Francisco
2017-01-01
The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. We apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence of trp and his genes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to a monofunctional, yet not necessarily specialized, inefficient form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. Our results show how gene loss can drive the evolution of substrate specificity from retained enzymes. DOI: http://dx.doi.org/10.7554/eLife.22679.001 PMID:28362260
Outbred genome sequencing and CRISPR/Cas9 gene editing in butterflies
Li, Xueyan; Fan, Dingding; Zhang, Wei; Liu, Guichun; Zhang, Lu; Zhao, Li; Fang, Xiaodong; Chen, Lei; Dong, Yang; Chen, Yuan; Ding, Yun; Zhao, Ruoping; Feng, Mingji; Zhu, Yabing; Feng, Yue; Jiang, Xuanting; Zhu, Deying; Xiang, Hui; Feng, Xikan; Li, Shuaicheng; Wang, Jun; Zhang, Guojie; Kronforst, Marcus R.; Wang, Wen
2015-01-01
Butterflies are exceptionally diverse but their potential as an experimental system has been limited by the difficulty of deciphering heterozygous genomes and a lack of genetic manipulation technology. Here we use a hybrid assembly approach to construct high-quality reference genomes for Papilio xuthus (contig and scaffold N50: 492 kb, 3.4 Mb) and Papilio machaon (contig and scaffold N50: 81 kb, 1.15 Mb), highly heterozygous species that differ in host plant affiliations, and adult and larval colour patterns. Integrating comparative genomics and analyses of gene expression yields multiple insights into butterfly evolution, including potential roles of specific genes in recent diversification. To functionally test gene function, we develop an efficient (up to 92.5%) CRISPR/Cas9 gene editing method that yields obvious phenotypes with three genes, Abdominal-B, ebony and frizzled. Our results provide valuable genomic and technological resources for butterflies and unlock their potential as a genetic model system. PMID:26354079
Evolution of Substrate Specificity in A Retained Enzyme Driven by Gene Loss
Juarez-Vazquez, Ana L.; Edirisinghe, Janaka N.; Verduzco-Castro, Ernesto A.; ...
2017-03-31
The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. Here, we apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We also observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence of trp and his genes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to amore » monofunctional, yet not necessarily specialized, inefficient form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. These results show how gene loss can drive the evolution of substrate specificity from retained enzymes.« less
Evolution of Substrate Specificity in A Retained Enzyme Driven by Gene Loss
DOE Office of Scientific and Technical Information (OSTI.GOV)
Juarez-Vazquez, Ana L.; Edirisinghe, Janaka N.; Verduzco-Castro, Ernesto A.
The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. Here, we apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We also observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence of trp and his genes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to amore » monofunctional, yet not necessarily specialized, inefficient form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. These results show how gene loss can drive the evolution of substrate specificity from retained enzymes.« less
Liu, Guofeng; Bao, Manzhu
2013-01-01
The identification of mutants in model plant species has led to the isolation of the floral homeotic function genes that play crucial roles in flower organ specification. However, floral homeotic C-function genes are rarely studied in basal eudicots. Here, we report the isolation and characterization of the AGAMOUS (AG) orthologous gene (PaAG) from a basal eudicot London plane tree (Platanus acerifolia Willd). Phylogenetic analysis showed that PaAG belongs to the C- clade AG group of genes. PaAG was found to be expressed predominantly in the later developmental stages of male and female inflorescences. Ectopic expression of PaAG-1 in tobacco (Nicotiana tabacum) resulted in morphological alterations of the outer two flower whorls, as well as some defects in vegetative growth. Scanning electron micrographs (SEMs) confirmed homeotic sepal-to-carpel transformation in the transgenic plants. Protein interaction assays in yeast cells indicated that PaAG could interact directly with PaAP3 (a B-class MADS-box protein in P. acerifolia), and also PaSEP1 and PaSEP3 (E-class MADS-box proteins in P. acerifolia). This study performed the functional analysis of AG orthologous genes outside core eudicots and monocots. Our findings demonstrate a conserved functional role of AG homolog in London plane tree, which also represent a contribution towards understanding the molecular mechanisms of flower development in this monoecious tree species. PMID:23691041
Kirst, Henning; Garcia-Cerdan, Jose Gines; Zurbriggen, Andreas; Ruehle, Thilo; Melis, Anastasios
2012-01-01
The truncated light-harvesting antenna size3 (tla3) DNA insertional transformant of Chlamydomonas reinhardtii is a chlorophyll-deficient mutant with a lighter green phenotype, a lower chlorophyll (Chl) per cell content, and higher Chl a/b ratio than corresponding wild-type strains. Functional analyses revealed a higher intensity for the saturation of photosynthesis and greater light-saturated photosynthetic activity in the tla3 mutant than in the wild type and a Chl antenna size of the photosystems that was only about 40% of that in the wild type. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis and western-blot analyses showed that the tla3 strain was deficient in the Chl a/b light-harvesting complex. Molecular and genetic analyses revealed a single plasmid insertion in chromosome 4 of the tla3 nuclear genome, causing deletion of predicted gene g5047 and plasmid insertion within the fourth intron of downstream-predicted gene g5046. Complementation studies defined that gene g5047 alone was necessary and sufficient to rescue the tla3 mutation. Gene g5047 encodes a C. reinhardtii homolog of the chloroplast-localized SRP43 signal recognition particle, whose occurrence and function in green microalgae has not hitherto been investigated. Biochemical analysis showed that the nucleus-encoded and chloroplast-localized CrCpSRP43 protein specifically operates in the assembly of the peripheral components of the Chl a/b light-harvesting antenna. This work demonstrates that cpsrp43 deletion in green microalgae can be employed to generate tla mutants with a substantially diminished Chl antenna size. The latter exhibit improved solar energy conversion efficiency and photosynthetic productivity under mass culture and bright sunlight conditions. PMID:23043081
Distribution of mutations in the PEX gene in families with X-linked hypophosphataemic rickets (HYP).
Rowe, P S; Oudet, C L; Francis, F; Sinding, C; Pannetier, S; Econs, M J; Strom, T M; Meitinger, T; Garabedian, M; David, A; Macher, M A; Questiaux, E; Popowska, E; Pronicka, E; Read, A P; Mokrzycki, A; Glorieux, F H; Drezner, M K; Hanauer, A; Lehrach, H; Goulding, J N; O'Riordan, J L
1997-04-01
Mutations in the PEX gene at Xp22.1 (phosphate-regulating gene with homologies to endopeptidases, on the X-chromosome), are responsible for X-linked hypophosphataemic rickets (HYP). Homology of PEX to the M13 family of Zn2+ metallopeptidases which include neprilysin (NEP) as prototype, has raised important questions regarding PEX function at the molecular level. The aim of this study was to analyse 99 HYP families for PEX gene mutations, and to correlate predicted changes in the protein structure with Zn2+ metallopeptidase gene function. Primers flanking 22 characterised exons were used to amplify DNA by PCR, and SSCP was then used to screen for mutations. Deletions, insertions, nonsense mutations, stop codons and splice mutations occurred in 83% of families screened for in all 22 exons, and 51% of a separate set of families screened in 17 PEX gene exons. Missense mutations in four regions of the gene were informative regarding function, with one mutation in the Zn2+-binding site predicted to alter substrate enzyme interaction and catalysis. Computer analysis of the remaining mutations predicted changes in secondary structure, N-glycosylation, protein phosphorylation and catalytic site molecular structure. The wide range of mutations that align with regions required for protease activity in NEP suggests that PEX also functions as a protease, and may act by processing factor(s) involved in bone mineral metabolism.
Arachchi, H S Jayasinghe; Kalra, Vijay; Lal, Banwari; Bhatia, Vikram; Baba, C S; Chakravarthy, S; Rohatgi, S; Sarma, Priyangshu M; Mishra, V; Das, Bimal; Ahuja, Vineet
2007-12-01
The duodenal ulcer (DU)-promoting gene (dupA) of Helicobacter pylori has been identified as a novel virulent marker associated with an increased risk for DU. The presence or absence of dupA gene of H. pylori present in patients with DU and functional dyspepsia in North Indian population was studied by polymerase chain reaction (PCR) and hybridization analysis. One hundred and sixty-six patients (96 DU and 70 functional dyspepsia) were included in this study. In addition, sequence diversity of dupA gene of H. pylori found in these patients was analyzed by sequencing the PCR products jhp0917 and jhp0918 on both strands with appropriate primers. PCR and hybridization analyses indicated that dupA gene was present in 37.5% (36/96) of H. pylori strains isolated from DU patients and 22.86% (16/70) of functional dyspepsia patients (p < or = .05). Of these, 35 patients with DU (97.2%) and 14 patients with functional dyspepsia (81.25%) were infected by H. pylori positive for cagA genotype. Furthermore, the presence of dupA was significantly associated with the cagA-positive genotype (p < or = .02). Results of our study have shown that significant association of dupA gene with DU in this population. The dupA gene can be considered as a novel virulent marker for DU in this population.
Chang, Yao-Ming; Liu, Wen-Yu; Shih, Arthur Chun-Chieh; Shen, Meng-Ni; Lu, Chen-Hua; Lu, Mei-Yeh Jade; Yang, Hui-Wen; Wang, Tzi-Yuan; Chen, Sean C-C; Chen, Stella Maris; Li, Wen-Hsiung; Ku, Maurice S B
2012-09-01
To study the regulatory and functional differentiation between the mesophyll (M) and bundle sheath (BS) cells of maize (Zea mays), we isolated large quantities of highly homogeneous M and BS cells from newly matured second leaves for transcriptome profiling by RNA sequencing. A total of 52,421 annotated genes with at least one read were found in the two transcriptomes. Defining a gene with more than one read per kilobase per million mapped reads as expressed, we identified 18,482 expressed genes; 14,972 were expressed in M cells, including 53 M-enriched transcription factor (TF) genes, whereas 17,269 were expressed in BS cells, including 214 BS-enriched TF genes. Interestingly, many TF gene families show a conspicuous BS preference in expression. Pathway analyses reveal differentiation between the two cell types in various functional categories, with the M cells playing more important roles in light reaction, protein synthesis and folding, tetrapyrrole synthesis, and RNA binding, while the BS cells specialize in transport, signaling, protein degradation and posttranslational modification, major carbon, hydrogen, and oxygen metabolism, cell division and organization, and development. Genes coding for several transporters involved in the shuttle of C(4) metabolites and BS cell wall development have been identified, to our knowledge, for the first time. This comprehensive data set will be useful for studying M/BS differentiation in regulation and function.
Wang, Guo-Dong; Zhang, Bao-Lin; Zhou, Wei-Wei; Li, Yong-Xin; Jin, Jie-Qiong; Shao, Yong; Yang, He-Chuan; Liu, Yan-Hu; Yan, Fang; Chen, Hong-Man; Jin, Li; Gao, Feng; Zhang, Yaoguang; Li, Haipeng; Mao, Bingyu; Murphy, Robert W; Wake, David B; Zhang, Ya-Ping; Che, Jing
2018-05-29
Tibetan frogs, Nanorana parkeri , are differentiated genetically but not morphologically along geographical and elevational gradients in a challenging environment, presenting a unique opportunity to investigate processes leading to speciation. Analyses of whole genomes of 63 frogs reveal population structuring and historical demography, characterized by highly restricted gene flow in a narrow geographic zone lying between matrilines West (W) and East (E). A population found only along a single tributary of the Yalu Zangbu River has the mitogenome only of E, whereas nuclear genes of W comprise 89-95% of the nuclear genome. Selection accounts for 579 broadly scattered, highly divergent regions (HDRs) of the genome, which involve 365 genes. These genes fall into 51 gene ontology (GO) functional classes, 14 of which are likely to be important in driving reproductive isolation. GO enrichment analyses of E reveal many overrepresented functional categories associated with adaptation to high elevations, including blood circulation, response to hypoxia, and UV radiation. Four genes, including DNAJC8 in the brain, TNNC1 and ADORA1 in the heart, and LAMB3 in the lung, differ in levels of expression between low- and high-elevation populations. High-altitude adaptation plays an important role in maintaining and driving continuing divergence and reproductive isolation. Use of total genomes enabled recognition of selection and adaptation in and between populations, as well as documentation of evolution along a stepped cline toward speciation. Copyright © 2018 the Author(s). Published by PNAS.
Chen, Jianqing; Yin, Hao; Gu, Jinping; Li, Leiting; Liu, Zhe; Jiang, Xueting; Zhou, Hongsheng; Wei, Shuwei; Zhang, Shaoling; Wu, Juyou
2015-01-01
The cyclic nucleotide-gated channel (CNGC) family is involved in the uptake of various cations, such as Ca(2+), to regulate plant growth and respond to biotic and abiotic stresses. However, there is far less information about this family in woody plants such as pear. Here, we provided a genome-wide identification and analysis of the CNGC gene family in pear. Phylogenetic analysis showed that the 21 pear CNGC genes could be divided into five groups (I, II, III, IVA and IVB). The majority of gene duplications in pear appeared to have been caused by segmental duplication and occurred 32.94-39.14 million years ago. Evolutionary analysis showed that positive selection had driven the evolution of pear CNGCs. Motif analyses showed that Group I CNGCs generally contained 26 motifs, which was the greatest number of motifs in all CNGC groups. Among these, eight motifs were shared by each group, suggesting that these domains play a conservative role in CNGC activity. Tissue-specific expression analysis indicated that functional diversification of the duplicated CNGC genes was a major feature of long-term evolution. Our results also suggested that the P-S6 and PBC & hinge domains had co-evolved during the evolution. These results provide valuable information to increase our understanding of the function, evolution and expression analyses of the CNGC gene family in higher plants. Copyright © 2014 Elsevier Inc. All rights reserved.
GoGene: gene annotation in the fast lane.
Plake, Conrad; Royer, Loic; Winnenburg, Rainer; Hakenberg, Jörg; Schroeder, Michael
2009-07-01
High-throughput screens such as microarrays and RNAi screens produce huge amounts of data. They typically result in hundreds of genes, which are often further explored and clustered via enriched GeneOntology terms. The strength of such analyses is that they build on high-quality manual annotations provided with the GeneOntology. However, the weakness is that annotations are restricted to process, function and location and that they do not cover all known genes in model organisms. GoGene addresses this weakness by complementing high-quality manual annotation with high-throughput text mining extracting co-occurrences of genes and ontology terms from literature. GoGene contains over 4,000,000 associations between genes and gene-related terms for 10 model organisms extracted from more than 18,000,000 PubMed entries. It does not cover only process, function and location of genes, but also biomedical categories such as diseases, compounds, techniques and mutations. By bringing it all together, GoGene provides the most recent and most complete facts about genes and can rank them according to novelty and importance. GoGene accepts keywords, gene lists, gene sequences and protein sequences as input and supports search for genes in PubMed, EntrezGene and via BLAST. Since all associations of genes to terms are supported by evidence in the literature, the results are transparent and can be verified by the user. GoGene is available at http://gopubmed.org/gogene.
Effects of seawater acidification on gene expression: resolving broader-scale trends in sea urchins.
Evans, Tyler G; Watson-Wynn, Priscilla
2014-06-01
Sea urchins are ecologically and economically important calcifying organisms threatened by acidification of the global ocean caused by anthropogenic CO2 emissions. Propelled by the sequencing of the purple sea urchin (Strongylocentrotus purpuratus) genome, profiling changes in gene expression during exposure to high pCO2 seawater has emerged as a powerful and increasingly common method to infer the response of urchins to ocean change. However, analyses of gene expression are sensitive to experimental methodology, and comparisons between studies of genes regulated by ocean acidification are most often made in the context of major caveats. Here we perform meta-analyses as a means of minimizing experimental discrepancies and resolving broader-scale trends regarding the effects of ocean acidification on gene expression in urchins. Analyses across eight studies and four urchin species largely support prevailing hypotheses about the impact of ocean acidification on marine calcifiers. The predominant expression pattern involved the down-regulation of genes within energy-producing pathways, a clear indication of metabolic depression. Genes with functions in ion transport were significantly over-represented and are most plausibly contributing to intracellular pH regulation. Expression profiles provided extensive evidence for an impact on biomineralization, epitomized by the down-regulation of seven spicule matrix proteins. In contrast, expression profiles provided limited evidence for CO2-mediated developmental delay or induction of a cellular stress response. Congruence between studies of gene expression and the ocean acidification literature in general validates the accuracy of gene expression in predicting the consequences of ocean change and justifies its continued use in future studies. © 2014 Marine Biological Laboratory.
Prostate cancer-associated gene expression alterations determined from needle biopsies.
Qian, David Z; Huang, Chung-Ying; O'Brien, Catherine A; Coleman, Ilsa M; Garzotto, Mark; True, Lawrence D; Higano, Celestia S; Vessella, Robert; Lange, Paul H; Nelson, Peter S; Beer, Tomasz M
2009-05-01
To accurately identify gene expression alterations that differentiate neoplastic from normal prostate epithelium using an approach that avoids contamination by unwanted cellular components and is not compromised by acute gene expression changes associated with tumor devascularization and resulting ischemia. Approximately 3,000 neoplastic and benign prostate epithelial cells were isolated using laser capture microdissection from snap-frozen prostate biopsy specimens provided by 31 patients who subsequently participated in a clinical trial of preoperative chemotherapy. cDNA synthesized from amplified total RNA was hybridized to custom-made microarrays composed of 6,200 clones derived from the Prostate Expression Database. Expression differences for selected genes were verified using quantitative reverse transcription-PCR. Comparative analyses identified 954 transcript alterations associated with cancer (q < 0.01%), including 149 differentially expressed genes with no known functional roles. Gene expression changes associated with ischemia and surgical removal of the prostate gland were absent. Genes up-regulated in prostate cancer were statistically enriched in categories related to cellular metabolism, energy use, signal transduction, and molecular transport. Genes down-regulated in prostate cancers were enriched in categories related to immune response, cellular responses to pathogens, and apoptosis. A heterogeneous pattern of androgen receptor expression changes was noted. In exploratory analyses, androgen receptor down-regulation was associated with a lower probability of cancer relapse after neoadjuvant chemotherapy followed by radical prostatectomy. Assessments of tumor phenotypes based on gene expression for treatment stratification and drug targeting of oncogenic alterations may best be ascertained using biopsy-based analyses where the effects of ischemia do not complicate interpretation.
Prostate Cancer-Associated Gene Expression Alterations Determined from Needle Biopsies
Qian, David Z.; Huang, Chung-Ying; O'Brien, Catherine A.; Coleman, Ilsa M.; Garzotto, Mark; True, Lawrence D.; Higano, Celestia S.; Vessella, Robert; Lange, Paul H.; Nelson, Peter S.; Beer, Tomasz M.
2010-01-01
Purpose To accurately identify gene expression alterations that differentiate neoplastic from normal prostate epithelium using an approach that avoids contamination by unwanted cellular components and is not compromised by acute gene expression changes associated with tumor devascularization and resulting ischemia. Experimental Design Approximately 3,000 neoplastic and benign prostate epithelial cells were isolated using laser capture microdissection from snap-frozen prostate biopsy specimens provided by 31 patients who subsequently participated in a clinical trial of preoperative chemotherapy. cDNA synthesized from amplified total RNA was hybridized to custom-made microarrays comprised of 6200 clones derived from the Prostate Expression Database. Expression differences for selected genes were verified using quantitative RT-PCR. Results Comparative analyses identified 954 transcript alterations associated with cancer (q value <0.01%) including 149 differentially expressed genes with no known functional roles. Gene expression changes associated with ischemia and surgical removal of the prostate gland were absent. Genes up-regulated in prostate cancer were statistically enriched in categories related to cellular metabolism, energy utilization, signal transduction, and molecular transport. Genes down-regulated in prostate cancers were enriched in categories related to immune response, cellular responses to pathogens, and apoptosis. A heterogeneous pattern of AR expression changes was noted. In exploratory analyses, AR down regulation was associated with a lower probability of cancer relapse after neoadjuvant chemotherapy followed by radical prostatectomy. Conclusions Assessments of tumor phenotypes based on gene expression for treatment stratification and drug targeting of oncogenic alterations may best be ascertained using biopsy-based analyses where the effects of ischemia do not complicate interpretation. PMID:19366833
Zhao, Min; Li, Zhe; Qu, Hong
2015-01-01
Metastasis suppressor genes (MS genes) are genes that play important roles in inhibiting the process of cancer metastasis without preventing growth of the primary tumor. Identification of these genes and understanding their functions are critical for investigation of cancer metastasis. Recent studies on cancer metastasis have identified many new susceptibility MS genes. However, the comprehensive illustration of diverse cellular processes regulated by metastasis suppressors during the metastasis cascade is lacking. Thus, the relationship between MS genes and cancer risk is still unclear. To unveil the cellular complexity of MS genes, we have constructed MSGene (http://MSGene.bioinfo-minzhao.org/), the first literature-based gene resource for exploring human MS genes. In total, we manually curated 194 experimentally verified MS genes and mapped to 1448 homologous genes from 17 model species. Follow-up functional analyses associated 194 human MS genes with epithelium/tissue morphogenesis and epithelia cell proliferation. In addition, pathway analysis highlights the prominent role of MS genes in activation of platelets and coagulation system in tumor metastatic cascade. Moreover, global mutation pattern of MS genes across multiple cancers may reveal common cancer metastasis mechanisms. All these results illustrate the importance of MSGene to our understanding on cell development and cancer metastasis. PMID:26486520
CNL Disease Resistance Genes in Soybean and Their Evolutionary Divergence
Nepal, Madhav P; Benson, Benjamin V
2015-01-01
Disease resistance genes (R-genes) encode proteins involved in detecting pathogen attack and activating downstream defense molecules. Recent availability of soybean genome sequences makes it possible to examine the diversity of gene families including disease-resistant genes. The objectives of this study were to identify coiled-coil NBS-LRR (= CNL) R-genes in soybean, infer their evolutionary relationships, and assess structural as well as functional divergence of the R-genes. Profile hidden Markov models were used for sequence identification and model-based maximum likelihood was used for phylogenetic analysis, and variation in chromosomal positioning, gene clustering, and functional divergence were assessed. We identified 188 soybean CNL genes nested into four clades consistent to their orthologs in Arabidopsis. Gene clustering analysis revealed the presence of 41 gene clusters located on 13 different chromosomes. Analyses of the Ks-values and chromosomal positioning suggest duplication events occurring at varying timescales, and an extrapericentromeric positioning may have facilitated their rapid evolution. Each of the four CNL clades exhibited distinct patterns of gene expression. Phylogenetic analysis further supported the extrapericentromeric positioning effect on the divergence and retention of the CNL genes. The results are important for understanding the diversity and divergence of CNL genes in soybean, which would have implication in soybean crop improvement in future. PMID:25922568
CNL Disease Resistance Genes in Soybean and Their Evolutionary Divergence.
Nepal, Madhav P; Benson, Benjamin V
2015-01-01
Disease resistance genes (R-genes) encode proteins involved in detecting pathogen attack and activating downstream defense molecules. Recent availability of soybean genome sequences makes it possible to examine the diversity of gene families including disease-resistant genes. The objectives of this study were to identify coiled-coil NBS-LRR (= CNL) R-genes in soybean, infer their evolutionary relationships, and assess structural as well as functional divergence of the R-genes. Profile hidden Markov models were used for sequence identification and model-based maximum likelihood was used for phylogenetic analysis, and variation in chromosomal positioning, gene clustering, and functional divergence were assessed. We identified 188 soybean CNL genes nested into four clades consistent to their orthologs in Arabidopsis. Gene clustering analysis revealed the presence of 41 gene clusters located on 13 different chromosomes. Analyses of the K s-values and chromosomal positioning suggest duplication events occurring at varying timescales, and an extrapericentromeric positioning may have facilitated their rapid evolution. Each of the four CNL clades exhibited distinct patterns of gene expression. Phylogenetic analysis further supported the extrapericentromeric positioning effect on the divergence and retention of the CNL genes. The results are important for understanding the diversity and divergence of CNL genes in soybean, which would have implication in soybean crop improvement in future.
The gene and the genon concept: a functional and information-theoretic analysis
Scherrer, Klaus; Jost, Jürgen
2007-01-01
‘Gene' has become a vague and ill-defined concept. To set the stage for mathematical analysis of gene storage and expression, we return to the original concept of the gene as a function encoded in the genome, basis of genetic analysis, that is a polypeptide or other functional product. The additional information needed to express a gene is contained within each mRNA as an ensemble of signals, added to or superimposed onto the coding sequence. To designate this programme, we introduce the term ‘genon'. Individual genons are contained in the pre-mRNA forming a pre-genon. A genomic domain contains a proto-genon, with the signals of transcription activation in addition to the pre-genon in the transcripts. Some contain several mRNAs and hence genons, to be singled out by RNA processing and differential splicing. The programme in the genon in cis is implemented by corresponding factors of protein or RNA nature contained in the transgenon of the cell or organism. The gene, the cis programme contained in the individual domain and transcript, and the trans programme of factors, can be analysed by information theory. PMID:17353929
Microarray gene expression profiling analysis combined with bioinformatics in multiple sclerosis.
Liu, Mingyuan; Hou, Xiaojun; Zhang, Ping; Hao, Yong; Yang, Yiting; Wu, Xiongfeng; Zhu, Desheng; Guan, Yangtai
2013-05-01
Multiple sclerosis (MS) is the most prevalent demyelinating disease and the principal cause of neurological disability in young adults. Recent microarray gene expression profiling studies have identified several genetic variants contributing to the complex pathogenesis of MS, however, expressional and functional studies are still required to further understand its molecular mechanism. The present study aimed to analyze the molecular mechanism of MS using microarray analysis combined with bioinformatics techniques. We downloaded the gene expression profile of MS from Gene Expression Omnibus (GEO) and analysed the microarray data using the differentially coexpressed genes (DCGs) and links package in R and Database for Annotation, Visualization and Integrated Discovery. The regulatory impact factor (RIF) algorithm was used to measure the impact factor of transcription factor. A total of 1,297 DCGs between MS patients and healthy controls were identified. Functional annotation indicated that these DCGs were associated with immune and neurological functions. Furthermore, the RIF result suggested that IKZF1, BACH1, CEBPB, EGR1, FOS may play central regulatory roles in controlling gene expression in the pathogenesis of MS. Our findings confirm the presence of multiple molecular alterations in MS and indicate the possibility for identifying prognostic factors associated with MS pathogenesis.
Ai, Ye; Zhang, Chunling; Sun, Yalin; Wang, Weining; He, Yanhong; Bao, Manzhu
2017-01-01
According to the floral organ development ABC model, B class genes specify petal and stamen identification. In order to study the function of B class genes in flower development of Tagetes erecta, five MADS-box B class genes were identified and their expression and putative functions were studied. Sequence comparisons and phylogenetic analyses indicated that there were one PI-like gene-TePI, two euAP3-like genes-TeAP3-1 and TeAP3-2, and two TM6-like genes-TeTM6-1 and TeTM6-2 in T. erecta. Strong expression levels of these genes were detected in stamens of the disk florets, but little or no expression was detected in bracts, receptacles or vegetative organs. Yeast hybrid experiments of the B class proteins showed that TePI protein could form a homodimer and heterodimers with all the other four B class proteins TeAP3-1, TeAP3-2, TeTM6-1 and TeTM6-2. No homodimer or interaction was observed between the euAP3 and TM6 clade members. Over-expression of five B class genes of T. erecta in Nicotiana rotundifolia showed that only the transgenic plants of 35S::TePI showed altered floral morphology compared with the non-transgenic line. This study could contribute to the understanding of the function of B class genes in flower development of T. erecta, and provide a theoretical basis for further research to change floral organ structures and create new materials for plant breeding.
Plant uncoupling mitochondrial proteins.
Vercesi, Aníbal Eugênio; Borecký, Jiri; Maia, Ivan de Godoy; Arruda, Paulo; Cuccovia, Iolanda Midea; Chaimovich, Hernan
2006-01-01
Uncoupling proteins (UCPs) are membrane proteins that mediate purine nucleotide-sensitive free fatty acid-activated H(+) flux through the inner mitochondrial membrane. After the discovery of UCP in higher plants in 1995, it was acknowledged that these proteins are widely distributed in eukaryotic organisms. The widespread presence of UCPs in eukaryotes implies that these proteins may have functions other than thermogenesis. In this review, we describe the current knowledge of plant UCPs, including their discovery, biochemical properties, distribution, gene family, gene expression profiles, regulation of gene expression, and evolutionary aspects. Expression analyses and functional studies on the plant UCPs under normal and stressful conditions suggest that UCPs regulate energy metabolism in the cellular responses to stress through regulation of the electrochemical proton potential (Deltamu(H)+) and production of reactive oxygen species.
Analysis tools for the interplay between genome layout and regulation.
Bouyioukos, Costas; Elati, Mohamed; Képès, François
2016-06-06
Genome layout and gene regulation appear to be interdependent. Understanding this interdependence is key to exploring the dynamic nature of chromosome conformation and to engineering functional genomes. Evidence for non-random genome layout, defined as the relative positioning of either co-functional or co-regulated genes, stems from two main approaches. Firstly, the analysis of contiguous genome segments across species, has highlighted the conservation of gene arrangement (synteny) along chromosomal regions. Secondly, the study of long-range interactions along a chromosome has emphasised regularities in the positioning of microbial genes that are co-regulated, co-expressed or evolutionarily correlated. While one-dimensional pattern analysis is a mature field, it is often powerless on biological datasets which tend to be incomplete, and partly incorrect. Moreover, there is a lack of comprehensive, user-friendly tools to systematically analyse, visualise, integrate and exploit regularities along genomes. Here we present the Genome REgulatory and Architecture Tools SCAN (GREAT:SCAN) software for the systematic study of the interplay between genome layout and gene expression regulation. SCAN is a collection of related and interconnected applications currently able to perform systematic analyses of genome regularities as well as to improve transcription factor binding sites (TFBS) and gene regulatory network predictions based on gene positional information. We demonstrate the capabilities of these tools by studying on one hand the regular patterns of genome layout in the major regulons of the bacterium Escherichia coli. On the other hand, we demonstrate the capabilities to improve TFBS prediction in microbes. Finally, we highlight, by visualisation of multivariate techniques, the interplay between position and sequence information for effective transcription regulation.
Whole blood genome-wide expression profiling and network analysis suggest MELAS master regulators.
Mende, Susanne; Royer, Loic; Herr, Alexander; Schmiedel, Janet; Deschauer, Marcus; Klopstock, Thomas; Kostic, Vladimir S; Schroeder, Michael; Reichmann, Heinz; Storch, Alexander
2011-07-01
The heteroplasmic mitochondrial DNA (mtDNA) mutation A3243G causes the mitochondrial encephalomyopathy, lactic acidosis, and stroke-like episodes (MELAS) syndrome as one of the most frequent mitochondrial diseases. The process of reconfiguration of nuclear gene expression profile to accommodate cellular processes to the functional status of mitochondria might be a key to MELAS disease manifestation and could contribute to its diverse phenotypic presentation. To determine master regulatory protein networks and disease-modifying genes in MELAS syndrome. Analyses of whole blood transcriptomes from 10 MELAS patients using a novel strategy by combining classic Affymetrix oligonucleotide microarray profiling with regulatory and protein interaction network analyses. Hierarchical cluster analysis elucidated that the relative abundance of mutant mtDNA molecules is decisive for the nuclear gene expression response. Further analyses confirmed not only transcription factors already known to be involved in mitochondrial diseases (such as TFAM), but also detected the hypoxia-inducible factor 1 complex, nuclear factor Y and cAMP responsive element-binding protein-related transcription factors as novel master regulators for reconfiguration of nuclear gene expression in response to the MELAS mutation. Correlation analyses of gene alterations and clinico-genetic data detected significant correlations between A3243G-induced nuclear gene expression changes and mutant mtDNA load as well as disease characteristics. These potential disease-modifying genes influencing the expression of the MELAS phenotype are mainly related to clusters primarily unrelated to cellular energy metabolism, but important for nucleic acid and protein metabolism, and signal transduction. Our data thus provide a framework to search for new pathogenetic concepts and potential therapeutic approaches to treat the MELAS syndrome.
Genome-Wide Analyses of the Soybean F-Box Gene Family in Response to Salt Stress
Jia, Qi; Xiao, Zhi-Xia; Wong, Fuk-Ling; Sun, Song; Liang, Kang-Jing; Lam, Hon-Ming
2017-01-01
The F-box family is one of the largest gene families in plants that regulate diverse life processes, including salt responses. However, the knowledge of the soybean F-box genes and their roles in salt tolerance remains limited. Here, we conducted a genome-wide survey of the soybean F-box family, and their expression analysis in response to salinity via in silico analysis of online RNA-sequencing (RNA-seq) data and quantitative reverse-transcription polymerase chain reaction (qRT-PCR) to predict their potential functions. A total of 725 potential F-box proteins encoded by 509 genes were identified and classified into 9 subfamilies. The gene structures, conserved domains and chromosomal distributions were characterized. There are 76 pairs of duplicate genes identified, including genome-wide segmental and tandem duplication events, which lead to the expansion of the number of F-box genes. The in silico expression analysis showed that these genes would be involved in diverse developmental functions and play an important role in salt response. Our qRT-PCR analysis confirmed 12 salt-responding F-box genes. Overall, our results provide useful information on soybean F-box genes, especially their potential roles in salt tolerance. PMID:28417911
Genome-Wide Analyses of the Soybean F-Box Gene Family in Response to Salt Stress.
Jia, Qi; Xiao, Zhi-Xia; Wong, Fuk-Ling; Sun, Song; Liang, Kang-Jing; Lam, Hon-Ming
2017-04-12
The F-box family is one of the largest gene families in plants that regulate diverse life processes, including salt responses. However, the knowledge of the soybean F-box genes and their roles in salt tolerance remains limited. Here, we conducted a genome-wide survey of the soybean F-box family, and their expression analysis in response to salinity via in silico analysis of online RNA-sequencing (RNA-seq) data and quantitative reverse-transcription polymerase chain reaction (qRT-PCR) to predict their potential functions. A total of 725 potential F-box proteins encoded by 509 genes were identified and classified into 9 subfamilies. The gene structures, conserved domains and chromosomal distributions were characterized. There are 76 pairs of duplicate genes identified, including genome-wide segmental and tandem duplication events, which lead to the expansion of the number of F-box genes. The in silico expression analysis showed that these genes would be involved in diverse developmental functions and play an important role in salt response. Our qRT-PCR analysis confirmed 12 salt-responding F-box genes. Overall, our results provide useful information on soybean F-box genes, especially their potential roles in salt tolerance.
Identifying a gene expression signature of cluster headache in blood
Eising, Else; Pelzer, Nadine; Vijfhuizen, Lisanne S.; Vries, Boukje de; Ferrari, Michel D.; ‘t Hoen, Peter A. C.; Terwindt, Gisela M.; van den Maagdenberg, Arn M. J. M.
2017-01-01
Cluster headache is a relatively rare headache disorder, typically characterized by multiple daily, short-lasting attacks of excruciating, unilateral (peri-)orbital or temporal pain associated with autonomic symptoms and restlessness. To better understand the pathophysiology of cluster headache, we used RNA sequencing to identify differentially expressed genes and pathways in whole blood of patients with episodic (n = 19) or chronic (n = 20) cluster headache in comparison with headache-free controls (n = 20). Gene expression data were analysed by gene and by module of co-expressed genes with particular attention to previously implicated disease pathways including hypocretin dysregulation. Only moderate gene expression differences were identified and no associations were found with previously reported pathogenic mechanisms. At the level of functional gene sets, associations were observed for genes involved in several brain-related mechanisms such as GABA receptor function and voltage-gated channels. In addition, genes and modules of co-expressed genes showed a role for intracellular signalling cascades, mitochondria and inflammation. Although larger study samples may be required to identify the full range of involved pathways, these results indicate a role for mitochondria, intracellular signalling and inflammation in cluster headache. PMID:28074859
Computation and application of tissue-specific gene set weights.
Frost, H Robert
2018-04-06
Gene set testing, or pathway analysis, has become a critical tool for the analysis of highdimensional genomic data. Although the function and activity of many genes and higher-level processes is tissue-specific, gene set testing is typically performed in a tissue agnostic fashion, which impacts statistical power and the interpretation and replication of results. To address this challenge, we have developed a bioinformatics approach to compute tissuespecific weights for individual gene sets using information on tissue-specific gene activity from the Human Protein Atlas (HPA). We used this approach to create a public repository of tissue-specific gene set weights for 37 different human tissue types from the HPA and all collections in the Molecular Signatures Database (MSigDB). To demonstrate the validity and utility of these weights, we explored three different applications: the functional characterization of human tissues, multi-tissue analysis for systemic diseases and tissue-specific gene set testing. All data used in the reported analyses is publicly available. An R implementation of the method and tissue-specific weights for MSigDB gene set collections can be downloaded at http://www.dartmouth.edu/∼hrfrost/TissueSpecificGeneSets. rob.frost@dartmouth.edu.
A premeiotic function for boule in the planarian Schmidtea mediterranea.
Iyer, Harini; Issigonis, Melanie; Sharma, Prashant P; Extavour, Cassandra G; Newmark, Phillip A
2016-06-21
Mutations in Deleted in Azoospermia (DAZ), a Y chromosome gene, are an important cause of human male infertility. DAZ is found exclusively in primates, limiting functional studies of this gene to its homologs: boule, required for meiotic progression of germ cells in invertebrate model systems, and Daz-like (Dazl), required for early germ cell maintenance in vertebrates. Dazl is believed to have acquired its premeiotic role in a vertebrate ancestor following the duplication and functional divergence of the single-copy gene boule. However, multiple homologs of boule have been identified in some invertebrates, raising the possibility that some of these genes may play other roles, including a premeiotic function. Here we identify two boule paralogs in the freshwater planarian Schmidtea mediterranea Smed-boule1 is necessary for meiotic progression of male germ cells, similar to the known function of boule in invertebrates. By contrast, Smed-boule2 is required for the maintenance of early male germ cells, similar to vertebrate Dazl To examine if Boule2 may be functionally similar to vertebrate Dazl, we identify and functionally characterize planarian homologs of human DAZL/DAZ-interacting partners and DAZ family mRNA targets. Finally, our phylogenetic analyses indicate that premeiotic functions of planarian boule2 and vertebrate Dazl evolved independently. Our study uncovers a premeiotic role for an invertebrate boule homolog and offers a tractable invertebrate model system for studying the premeiotic functions of the DAZ protein family.
NASA Astrophysics Data System (ADS)
Mittal, Shikha; Banduni, Pooja; Mallikarjuna, Mallana G.; Rao, Atmakuri R.; Jain, Prashant A.; Dash, Prasanta K.; Thirunavukkarasu, Nepolean
2018-05-01
Drought is one of the major threats to maize production. In order to improve the production and to breed tolerant hybrids, understanding the genes and regulatory mechanisms during drought stress is important. Transcription factors (TFs) play a major role in gene regulation and many TFs have been identified in response to drought stress. In our experiment, a set of 15 major TF families comprising 1436 genes was structurally and functionally characterized using in-silico tools and a gene expression assay. All 1436 genes were mapped on 10 chromosome of maize. The functional annotation indicated the involvement of these genes in ABA signaling, ROS scavenging, photosynthesis, stomatal regulation, and sucrose metabolism. Duplication was identified as the primary force in divergence and expansion of TF families. Phylogenetic relationship was developed individually for each TF family as well as combined TF families. Phylogenetic analysis grouped the TF family of genes into TF-specific and mixed groups. Phylogenetic analysis of genes belonging to various TF families suggested that the origin of TFs occurred in the lineage of maize evolution. Gene structure analysis revealed that more number of genes were intron-rich as compared to intronless genes. Drought-responsive CRE’s such as ABREA, ABREB, DRE1 and DRECRTCOREAT have been identified. Expression and interaction analyses identified leaf-specific bZIP TF, GRMZM2G140355, as a potential contributor toward drought tolerance in maize. We also analyzed protein-protein interaction network of 269 drought-responsive genes belonging to different drought-related TFs. The information generated on structural and functional characteristics, expression and interaction of the drought-related TF families will be useful to decipher the drought tolerance mechanisms and to derive drought-tolerant genotypes in maize.
Li, Ming-Rui; Shi, Feng-Xue; Li, Ya-Ling; Jiang, Peng; Jiao, Lili; Liu, Bao; Li, Lin-Feng
2017-09-01
Chinese ginseng (Panax ginseng Meyer) is a medicinally important herb and plays crucial roles in traditional Chinese medicine. Pharmacological analyses identified diverse bioactive components from Chinese ginseng. However, basic biological attributes including domestication and selection of the ginseng plant remain under-investigated. Here, we presented a genome-wide view of the domestication and selection of cultivated ginseng based on the whole genome data. A total of 8,660 protein-coding genes were selected for genome-wide scanning of the 30 wild and cultivated ginseng accessions. In complement, the 45s rDNA, chloroplast and mitochondrial genomes were included to perform phylogenetic and population genetic analyses. The observed spatial genetic structure between northern cultivated ginseng (NCG) and southern cultivated ginseng (SCG) accessions suggested multiple independent origins of cultivated ginseng. Genome-wide scanning further demonstrated that NCG and SCG have undergone distinct selection pressures during the domestication process, with more genes identified in the NCG (97 genes) than in the SCG group (5 genes). Functional analyses revealed that these genes are involved in diverse pathways, including DNA methylation, lignin biosynthesis, and cell differentiation. These findings suggested that the SCG and NCG groups have distinct demographic histories. Candidate genes identified are useful for future molecular breeding of cultivated ginseng. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Carlsbecker, Annelie; Sundström, Jens F; Englund, Marie; Uddenberg, Daniel; Izquierdo, Liz; Kvarnheden, Anders; Vergara-Silva, Francisco; Engström, Peter
2013-10-01
Reproductive organs in seed plants are morphologically divergent and their evolutionary history is often unclear. The mechanisms controlling their development have been extensively studied in angiosperms but are poorly understood in conifers and other gymnosperms. Here, we address the molecular control of seed cone development in Norway spruce, Picea abies. We present expression analyses of five novel MADS-box genes in comparison with previously identified MADS and LEAFY genes at distinct developmental stages. In addition, we have characterized the homeotic transformation from vegetative shoot to female cone and associated changes in regulatory gene expression patterns occurring in the acrocona mutant. The analyses identified genes active at the onset of ovuliferous and ovule development and identified expression patterns marking distinct domains of the ovuliferous scale. The reproductive transformation in acrocona involves the activation of all tested genes normally active in early cone development, except for an AGAMOUS-LIKE6/SEPALLATA (AGL6/SEP) homologue. This absence may be functionally associated with the nondeterminate development of the acrocona ovule-bearing scales. Our morphological and gene expression analyses give support to the hypothesis that the modern cone is a complex structure, and the ovuliferous scale the result of reductions and compactions of an ovule-bearing axillary short shoot in cones of Paleozoic conifers. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.
Genome-Wide Identification and Analysis of the TIFY Gene Family in Grape
Zhang, Yucheng; Gao, Min; Singer, Stacy D.; Fei, Zhangjun; Wang, Hua; Wang, Xiping
2012-01-01
Background The TIFY gene family constitutes a plant-specific group of genes with a broad range of functions. This family encodes four subfamilies of proteins, including ZML, TIFY, PPD and JASMONATE ZIM-Domain (JAZ) proteins. JAZ proteins are targets of the SCFCOI1 complex, and function as negative regulators in the JA signaling pathway. Recently, it has been reported in both Arabidopsis and rice that TIFY genes, and especially JAZ genes, may be involved in plant defense against insect feeding, wounding, pathogens and abiotic stresses. Nonetheless, knowledge concerning the specific expression patterns and evolutionary history of plant TIFY family members is limited, especially in a woody species such as grape. Methodology/Principal Findings A total of two TIFY, four ZML, two PPD and 11 JAZ genes were identified in the Vitis vinifera genome. Phylogenetic analysis of TIFY protein sequences from grape, Arabidopsis and rice indicated that the grape TIFY proteins are more closely related to those of Arabidopsis than those of rice. Both segmental and tandem duplication events have been major contributors to the expansion of the grape TIFY family. In addition, synteny analysis between grape and Arabidopsis demonstrated that homologues of several grape TIFY genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of lineages that led to grape and Arabidopsis. Analyses of microarray and quantitative real-time RT-PCR expression data revealed that grape TIFY genes are not a major player in the defense against biotrophic pathogens or viruses. However, many of these genes were responsive to JA and ABA, but not SA or ET. Conclusion The genome-wide identification, evolutionary and expression analyses of grape TIFY genes should facilitate further research of this gene family and provide new insights regarding their evolutionary history and regulatory control. PMID:22984514
TARGET researchers use various sequencing and array-based methods to examine the genomes, transcriptomes, and for some diseases epigenomes of select childhood cancers. This “multi-omic” approach generates a comprehensive profile of molecular alterations for each cancer type. Alterations are changes in DNA or RNA, such as rearrangements in chromosome structure or variations in gene expression, respectively. Through computational analyses and assays to validate biological function, TARGET researchers predict which alterations disrupt the function of a gene or pathway and promote cancer growth, progression, and/or survival. Researchers identify candidate therapeutic targets and/or prognostic markers from the cancer-associated alterations.
Identification of Crowding Stress Tolerance Co-Expression Networks Involved in Sweet Corn Yield
Choe, Eunsoo; Drnevich, Jenny; Williams, Martin M.
2016-01-01
Tolerance to crowding stress has played a crucial role in improving agronomic productivity in field corn; however, commercial sweet corn hybrids vary greatly in crowding stress tolerance. The objectives were to 1) explore transcriptional changes among sweet corn hybrids with differential yield under crowding stress, 2) identify relationships between phenotypic responses and gene expression patterns, and 3) identify groups of genes associated with yield and crowding stress tolerance. Under conditions of crowding stress, three high-yielding and three low-yielding sweet corn hybrids were grouped for transcriptional and phenotypic analyses. Transcriptional analyses identified from 372 to 859 common differentially expressed genes (DEGs) for each hybrid. Large gene expression pattern variation among hybrids and only 26 common DEGs across all hybrid comparisons were identified, suggesting each hybrid has a unique response to crowding stress. Over-represented biological functions of DEGs also differed among hybrids. Strong correlation was observed between: 1) modules with up-regulation in high-yielding hybrids and yield traits, and 2) modules with up-regulation in low-yielding hybrids and plant/ear traits. Modules linked with yield traits may be important crowding stress response mechanisms influencing crop yield. Functional analysis of the modules and common DEGs identified candidate crowding stress tolerant processes in photosynthesis, glycolysis, cell wall, carbohydrate/nitrogen metabolic process, chromatin, and transcription regulation. Moreover, these biological functions were greatly inter-connected, indicating the importance of improving the mechanisms as a network. PMID:26796516
Hori, Motohide; Nakamachi, Tomoya; Shibato, Junko; Rakwal, Randeep; Tsuchida, Masachi; Shioda, Seiji; Numazawa, Satoshi
2014-01-01
Pituitary adenylate-cyclase activating polypeptide (PACAP) has neuroprotective and axonal guidance functions, but the mechanisms behind such actions remain unclear. Previously we examined effects of PACAP (PACAP38, 1 pmol) injection intracerebroventrically in a mouse model of permanent middle cerebral artery occlusion (PMCAO) along with control saline (0.9% NaCl) injection. Transcriptomic and proteomic approaches using ischemic (ipsilateral) brain hemisphere revealed differentially regulated genes and proteins by PACAP38 at 6 and 24 h post-treatment. However, as the ischemic hemisphere consisted of infarct core, penumbra, and non-ischemic regions, specificity of expression and localization of these identified molecular factors remained incomplete. This led us to devise a new experimental strategy wherein, ischemic core and penumbra were carefully sampled and compared to the corresponding contralateral (healthy) core and penumbra regions at 6 and 24 h post PACAP38 or saline injections. Both reverse transcription-polymerase chain reaction (RT-PCR) and Western blotting were used to examine targeted gene expressions and the collapsin response mediator protein 2 (CRMP2) protein profiles, respectively. Clear differences in expression of genes and CRMP2 protein abundance and degradation product/short isoform was observed between ischemic core and penumbra and also compared to the contralateral healthy tissues after PACAP38 or saline treatment. Results indicate the importance of region-specific analyses to further identify, localize and functionally analyse target molecular factors for clarifying the neuroprotective function of PACAP38. PMID:25257527
Unifying measures of gene function and evolution.
Wolf, Yuri I; Carmel, Liran; Koonin, Eugene V
2006-06-22
Recent genome analyses revealed intriguing correlations between variables characterizing the functioning of a gene, such as expression level (EL), connectivity of genetic and protein-protein interaction networks, and knockout effect, and variables describing gene evolution, such as sequence evolution rate (ER) and propensity for gene loss. Typically, variables within each of these classes are positively correlated, e.g. products of highly expressed genes also have a propensity to be involved in many protein-protein interactions, whereas variables between classes are negatively correlated, e.g. highly expressed genes, on average, evolve slower than weakly expressed genes. Here, we describe principal component (PC) analysis of seven genome-related variables and propose biological interpretations for the first three PCs. The first PC reflects a gene's 'importance', or the 'status' of a gene in the genomic community, with positive contributions from knockout lethality, EL, number of protein-protein interaction partners and the number of paralogues, and negative contributions from sequence ER and gene loss propensity. The next two PCs define a plane that seems to reflect the functional and evolutionary plasticity of a gene. Specifically, PC2 can be interpreted as a gene's 'adaptability' whereby genes with high adaptability readily duplicate, have many genetic interaction partners and tend to be non-essential. PC3 also might reflect the role of a gene in organismal adaptation albeit with a negative rather than a positive contribution of genetic interactions; we provisionally designate this PC 'reactivity'. The interpretation of PC2 and PC3 as measures of a gene's plasticity is compatible with the observation that genes with high values of these PCs tend to be expressed in a condition- or tissue-specific manner. Functional classes of genes substantially vary in status, adaptability and reactivity, with the highest status characteristic of the translation system and cytoskeletal proteins, highest adaptability seen in cellular processes and signalling genes, and top reactivity characteristic of metabolic enzymes.
Divergence and adaptive evolution of the gibberellin oxidase genes in plants.
Huang, Yuan; Wang, Xi; Ge, Song; Rao, Guang-Yuan
2015-09-29
The important phytohormone gibberellins (GAs) play key roles in various developmental processes. GA oxidases (GAoxs) are critical enzymes in GA synthesis pathway, but their classification, evolutionary history and the forces driving the evolution of plant GAox genes remain poorly understood. This study provides the first large-scale evolutionary analysis of GAox genes in plants by using an extensive whole-genome dataset of 41 species, representing green algae, bryophytes, pteridophyte, and seed plants. We defined eight subfamilies under the GAox family, namely C19-GA2ox, C20-GA2ox, GA20ox,GA3ox, GAox-A, GAox-B, GAox-C and GAox-D. Of these, subfamilies GAox-A, GAox-B, GAox-C and GAox-D are described for the first time. On the basis of phylogenetic analyses and characteristic motifs of GAox genes, we demonstrated a rapid expansion and functional divergence of the GAox genes during the diversification of land plants. We also detected the subfamily-specific motifs and potential sites of some GAox genes, which might have evolved under positive selection. GAox genes originated very early-before the divergence of bryophytes and the vascular plants and the diversification of GAox genes is associated with the functional divergence and could be driven by positive selection. Our study not only provides information on the classification of GAox genes, but also facilitates the further functional characterization and analysis of GA oxidases.
Inoue, Kimiko; Oikawa, Mami; Kamimura, Satoshi; Ogonuki, Narumi; Nakamura, Toshinobu; Nakano, Toru; Abe, Kuniya; Ogura, Atsuo
2015-01-01
Although mammalian cloning by somatic cell nuclear transfer (SCNT) has been established in various species, the low developmental efficiency has hampered its practical applications. Treatment of SCNT-derived embryos with histone deacetylase (HDAC) inhibitors can improve their development, but the underlying mechanism is still unclear. To address this question, we analysed gene expression profiles of SCNT-derived 2-cell mouse embryos treated with trichostatin A (TSA), a potent HDAC inhibitor that is best used for mouse cloning. Unexpectedly, TSA had no effect on the numbers of aberrantly expressed genes or the overall gene expression pattern in the embryos. However, in-depth investigation by gene ontology and functional analyses revealed that TSA treatment specifically improved the expression of a small subset of genes encoding transcription factors and their regulatory factors, suggesting their positive involvement in de novo RNA synthesis. Indeed, introduction of one of such transcription factors, Spi-C, into the embryos at least partially mimicked the TSA-induced improvement in embryonic development by activating gene networks associated with transcriptional regulation. Thus, the effects of TSA treatment on embryonic gene expression did not seem to be stochastic, but more specific than expected, targeting genes that direct development and trigger zygotic genome activation at the 2-cell stage. PMID:25974394
Pomerantz, Aaron F; Hoy, Marjorie A; Kawahara, Akito Y
2015-01-01
Little is known about the process of sex determination at the molecular level in species belonging to the subclass Acari, a taxon of arachnids that contains mites and ticks. The recent sequencing of the transcriptome and genome of the western orchard predatory mite Metaseiulus occidentalis allows investigation of molecular mechanisms underlying the biological processes of sex determination in this predator of phytophagous pest mites. We identified four doublesex-and-mab-3-related transcription factor (dmrt) genes, one transformer-2 gene, one intersex gene, and two fruitless-like genes in M. occidentalis. Phylogenetic analyses were conducted to infer the molecular relationships to sequences from species of arthropods, including insects, crustaceans, acarines, and a centipede, using available genomic data. Comparative analyses revealed high sequence identity within functional domains and confirmed that the architecture for certain sex-determination genes is conserved in arthropods. This study provides a framework for identifying potential target genes that could be implicated in the process of sex determination in M. occidentalis and provides insight into the conservation and change of the molecular components of sex determination in arthropods.
2015-01-01
Phytopathogenic fungi form intimate associations with host plant species and cause disease. To be successful, fungal pathogens communicate with a susceptible host through the secretion of proteinaceous effectors, hydrolytic enzymes and metabolites. Sclerotinia sclerotiorum and Botrytis cinerea are economically important necrotrophic fungal pathogens that cause disease on numerous crop species. Here, a powerful bioinformatics pipeline was used to predict the refined S. sclerotiorum and B. cinerea secretomes, identifying 432 and 499 proteins respectively. Analyses focusing on S. sclerotiorum revealed that 16% of the secretome encoding genes resided in small, sequence heterogeneous, gene clusters that were distributed over 13 of the 16 predicted chromosomes. Functional analyses highlighted the importance of plant cell hydrolysis, oxidation-reduction processes and the redox state to the S. sclerotiorum and B. cinerea secretomes and potentially host infection. Only 8% of the predicted proteins were distinct between the two secretomes. In contrast to S. sclerotiorum, the B. cinerea secretome lacked CFEM- or LysM-containing proteins. The 115 fungal and oomycete genome comparison identified 30 proteins specific to S. sclerotiorum and B. cinerea, plus 11 proteins specific to S. sclerotiorum and 32 proteins specific to B. cinerea. Expressed sequence tag (EST) and proteomic analyses showed that 246 S. sclerotiorum secretome encoding genes had EST support, including 101 which were only expressed in vitro and 49 which were only expressed in planta, whilst 42 predicted proteins were experimentally proven to be secreted. These detailed in silico analyses of two important necrotrophic pathogens will permit informed choices to be made when candidate effector proteins are selected for function analyses in planta. PMID:26107498
Reiner, Gerald; Dreher, Felix; Drungowski, Mario; Hoeltig, Doris; Bertsch, Natalie; Selke, Martin; Willems, Hermann; Gerlach, Gerald Friedrich; Probst, Inga; Tuemmler, Burkhardt; Waldmann, Karl-Heinz; Herwig, Ralf
2014-12-01
Actinobacillus (A.) pleuropneumoniae is among the most important pathogens in pig. The agent causes severe economic losses due to decreased performance, the occurrence of acute or chronic pleuropneumonia, and an increase in death incidence. Since therapeutics cannot be used in a sustainable manner, and vaccination is not always available, new prophylactic measures are urgently needed. Recent research has provided evidence for a genetic predisposition in susceptibility to A. pleuropneumoniae in a Hampshire × German Landrace F2 family with 170 animals. The aim of the present study is to characterize the expression response in this family in order to unravel resistance and susceptibility mechanisms and to prioritize candidate genes for future fine mapping approaches. F2 pigs differed distinctly in clinical, pathological, and microbiological parameters after challenge with A. pleuropneumoniae. We monitored genome-wide gene expression from the 50 most and 50 least susceptible F2 pigs and identified 171 genes differentially expressed between these extreme phenotypes. We combined expression QTL analyses with network analyses and functional characterization using gene set enrichment analysis and identified a functional hotspot on SSC13, including 55 eQTL. The integration of the different results provides a resource for candidate prioritization for fine mapping strategies, such as TF, TFRC, RUNX1, TCN1, HP, CD14, among others.
Zheng, Xing-Wu; Kudaravalli, Rama; Russell, Theresa T; DiMichele, Donna M; Gibb, Constance; Russell, J Eric; Margaritis, Paris; Pollak, Eleanor S
2011-10-01
Severe coagulant factor VII (FVII) deficiency in postpubertal dizygotic twin males results from two point mutations in the FVII gene, a promoter region T→C transition at -60 and a His-to-Arg substitution at amino acid 348; both mutations prevent persistence of plasma functional FVII. This report documents longitudinal laboratory measurements from infancy to adulthood of FVII coagulant activity (FVII:C) in the twin FVII-deficient patients; it also details specific biochemical analyses of the -60 T→C mutation. The results revealed FVII:C levels of less than 1% in infancy that remain severely decreased through puberty and into adulthood. In-vitro analyses utilizing hepatocyte nuclear factor 4α (HNF4α) co-transfection and a chromatin immunoprecipitation assay indicate that the -60 T→C mutation severely diminishes functional interaction between the FVII promoter and transcription factor HNF4α. The importance of interaction between the FVII gene and HNF4α in normal FVII expression provides an in-vivo illustration of the regulated expression of an autosomal gene encoding a coagulation protein. The constancy of FVII:C and peripubertal patient symptomatology reported here illustrates androgen-independent expression in contrast to expression with an analogous mutation in the promoter region of the gene encoding coagulation FIX.
MVisAGe Identifies Concordant and Discordant Genomic Alterations of Driver Genes in Squamous Tumors.
Walter, Vonn; Du, Ying; Danilova, Ludmila; Hayward, Michele C; Hayes, D Neil
2018-06-15
Integrated analyses of multiple genomic datatypes are now common in cancer profiling studies. Such data present opportunities for numerous computational experiments, yet analytic pipelines are limited. Tools such as the cBioPortal and Regulome Explorer, although useful, are not easy to access programmatically or to implement locally. Here, we introduce the MVisAGe R package, which allows users to quantify gene-level associations between two genomic datatypes to investigate the effect of genomic alterations (e.g., DNA copy number changes on gene expression). Visualizing Pearson/Spearman correlation coefficients according to the genomic positions of the underlying genes provides a powerful yet novel tool for conducting exploratory analyses. We demonstrate its utility by analyzing three publicly available cancer datasets. Our approach highlights canonical oncogenes in chr11q13 that displayed the strongest associations between expression and copy number, including CCND1 and CTTN , genes not identified by copy number analysis in the primary reports. We demonstrate highly concordant usage of shared oncogenes on chr3q, yet strikingly diverse oncogene usage on chr11q as a function of HPV infection status. Regions of chr19 that display remarkable associations between methylation and gene expression were identified, as were previously unreported miRNA-gene expression associations that may contribute to the epithelial-to-mesenchymal transition. Significance: This study presents an important bioinformatics tool that will enable integrated analyses of multiple genomic datatypes. Cancer Res; 78(12); 3375-85. ©2018 AACR . ©2018 American Association for Cancer Research.
Xu, Yungang; Guo, Maozu; Zou, Quan; Liu, Xiaoyan; Wang, Chunyu; Liu, Yang
2014-01-01
Cellular interactome, in which genes and/or their products interact on several levels, forming transcriptional regulatory-, protein interaction-, metabolic-, signal transduction networks, etc., has attracted decades of research focuses. However, such a specific type of network alone can hardly explain the various interactive activities among genes. These networks characterize different interaction relationships, implying their unique intrinsic properties and defects, and covering different slices of biological information. Functional gene network (FGN), a consolidated interaction network that models fuzzy and more generalized notion of gene-gene relations, have been proposed to combine heterogeneous networks with the goal of identifying functional modules supported by multiple interaction types. There are yet no successful precedents of FGNs on sparsely studied non-model organisms, such as soybean (Glycine max), due to the absence of sufficient heterogeneous interaction data. We present an alternative solution for inferring the FGNs of soybean (SoyFGNs), in a pioneering study on the soybean interactome, which is also applicable to other organisms. SoyFGNs exhibit the typical characteristics of biological networks: scale-free, small-world architecture and modularization. Verified by co-expression and KEGG pathways, SoyFGNs are more extensive and accurate than an orthology network derived from Arabidopsis. As a case study, network-guided disease-resistance gene discovery indicates that SoyFGNs can provide system-level studies on gene functions and interactions. This work suggests that inferring and modelling the interactome of a non-model plant are feasible. It will speed up the discovery and definition of the functions and interactions of other genes that control important functions, such as nitrogen fixation and protein or lipid synthesis. The efforts of the study are the basis of our further comprehensive studies on the soybean functional interactome at the genome and microRNome levels. Additionally, a web tool for information retrieval and analysis of SoyFGNs can be accessed at SoyFN: http://nclab.hit.edu.cn/SoyFN.
Xu, Yungang; Guo, Maozu; Zou, Quan; Liu, Xiaoyan; Wang, Chunyu; Liu, Yang
2014-01-01
Cellular interactome, in which genes and/or their products interact on several levels, forming transcriptional regulatory-, protein interaction-, metabolic-, signal transduction networks, etc., has attracted decades of research focuses. However, such a specific type of network alone can hardly explain the various interactive activities among genes. These networks characterize different interaction relationships, implying their unique intrinsic properties and defects, and covering different slices of biological information. Functional gene network (FGN), a consolidated interaction network that models fuzzy and more generalized notion of gene-gene relations, have been proposed to combine heterogeneous networks with the goal of identifying functional modules supported by multiple interaction types. There are yet no successful precedents of FGNs on sparsely studied non-model organisms, such as soybean (Glycine max), due to the absence of sufficient heterogeneous interaction data. We present an alternative solution for inferring the FGNs of soybean (SoyFGNs), in a pioneering study on the soybean interactome, which is also applicable to other organisms. SoyFGNs exhibit the typical characteristics of biological networks: scale-free, small-world architecture and modularization. Verified by co-expression and KEGG pathways, SoyFGNs are more extensive and accurate than an orthology network derived from Arabidopsis. As a case study, network-guided disease-resistance gene discovery indicates that SoyFGNs can provide system-level studies on gene functions and interactions. This work suggests that inferring and modelling the interactome of a non-model plant are feasible. It will speed up the discovery and definition of the functions and interactions of other genes that control important functions, such as nitrogen fixation and protein or lipid synthesis. The efforts of the study are the basis of our further comprehensive studies on the soybean functional interactome at the genome and microRNome levels. Additionally, a web tool for information retrieval and analysis of SoyFGNs can be accessed at SoyFN: http://nclab.hit.edu.cn/SoyFN. PMID:25423109
Whittington, Emma; Zhao, Qian; Borziak, Kirill; Walters, James R; Dorus, Steve
2015-07-01
The application of mass spectrometry based proteomics to sperm biology has greatly accelerated progress in understanding the molecular composition and function of spermatozoa. To date, these approaches have been largely restricted to model organisms, all of which produce a single sperm morph capable of oocyte fertilisation. Here we apply high-throughput mass spectrometry proteomic analysis to characterise sperm composition in Manduca sexta, the tobacco hornworm moth, which produce heteromorphic sperm, including one fertilisation competent (eupyrene) and one incompetent (apyrene) sperm type. This resulted in the high confidence identification of 896 proteins from a co-mixed sample of both sperm types, of which 167 are encoded by genes with strict one-to-one orthology in Drosophila melanogaster. Importantly, over half (55.1%) of these orthologous proteins have previously been identified in the D. melanogaster sperm proteome and exhibit significant conservation in quantitative protein abundance in sperm between the two species. Despite the complex nature of gene expression across spermatogenic stages, a significant correlation was also observed between sperm protein abundance and testis gene expression. Lepidopteran-specific sperm proteins (e.g., proteins with no homology to proteins in non-Lepidopteran taxa) were present in significantly greater abundance on average than those with homology outside the Lepidoptera. Given the disproportionate production of apyrene sperm (96% of all mature sperm in Manduca) relative to eupyrene sperm, these evolutionarily novel and highly abundant proteins are candidates for possessing apyrene-specific functions. Lastly, comparative genomic analyses of testis-expressed, ovary-expressed and sperm genes identified a concentration of novel sperm proteins shared amongst Lepidoptera of potential relevance to the evolutionary origin of heteromorphic spermatogenesis. As the first published Lepidopteran sperm proteome, this whole-cell proteomic characterisation will facilitate future evolutionary genetic and developmental studies of heteromorphic sperm production and parasperm function. Furthermore, the analyses presented here provide useful annotation information regarding sex-biased gene expression, novel Lepidopteran genes and gene function in the male gamete to complement the newly sequenced and annotated Manduca genome. Copyright © 2015 Elsevier Ltd. All rights reserved.
Roquigny, Roxane; Novinscak, Amy; Arseneault, Tanya; Joly, David L; Filion, Martin
2018-06-19
Phytophthora infestans is responsible for late blight, one of the most important potato diseases. Phenazine-1-carboxylic acid (PCA)-producing Pseudomonas fluorescens strain LBUM223 isolated in our laboratory shows biocontrol potential against various plant pathogens. To characterize the effect of LBUM223 on the transcriptome of P. infestans, we conducted an in vitro time-course study. Confrontational assay was performed using P. infestans inoculated alone (control) or with LBUM223, its phzC- isogenic mutant (not producing PCA), or exogenically applied PCA. Destructive sampling was performed at 6, 9 and 12 days and the transcriptome of P. infestans was analysed using RNA-Seq. The expression of a subset of differentially expressed genes was validated by RT-qPCR. Both LBUM223 and exogenically applied PCA significantly repressed P. infestans' growth at all times. Compared to the control treatment, transcriptomic analyses showed that the percentages of all P. infestans' genes significantly altered by LBUM223 and exogenically applied PCA increased as time progressed, from 50 to 61% and from to 32 to 46%, respectively. When applying an absolute cut-off value of 3 fold change or more for all three harvesting times, 207 genes were found significantly differentially expressed by PCA, either produced by LBUM223 or exogenically applied. Gene ontology analysis revealed that both treatments altered the expression of key functional genes involved in major functions like phosphorylation mechanisms, transmembrane transport and oxidoreduction activities. Interestingly, even though no host plant tissue was present in the in vitro system, PCA also led to the overexpression of several genes encoding effectors. The mutant only slightly repressed P. infestans' growth and barely altered its transcriptome. Our study suggests that PCA is involved in P. infestans' growth repression and led to important transcriptomic changes by both up- and down-regulating gene expression in P. infestans over time. Different metabolic functions were altered and many effectors were found to be upregulated, suggesting their implication in biocontrol.
Windhorst, Dafna A; Mileva-Seitz, Viara R; Rippe, Ralph C A; Tiemeier, Henning; Jaddoe, Vincent W V; Verhulst, Frank C; van IJzendoorn, Marinus H; Bakermans-Kranenburg, Marian J
2016-08-01
In a longitudinal cohort study, we investigated the interplay of harsh parenting and genetic variation across a set of functionally related dopamine genes, in association with children's externalizing behavior. This is one of the first studies to employ gene-based and gene-set approaches in tests of Gene by Environment (G × E) effects on complex behavior. This approach can offer an important alternative or complement to candidate gene and genome-wide environmental interaction (GWEI) studies in the search for genetic variation underlying individual differences in behavior. Genetic variants in 12 autosomal dopaminergic genes were available in an ethnically homogenous part of a population-based cohort. Harsh parenting was assessed with maternal (n = 1881) and paternal (n = 1710) reports at age 3. Externalizing behavior was assessed with the Child Behavior Checklist (CBCL) at age 5 (71 ± 3.7 months). We conducted gene-set analyses of the association between variation in dopaminergic genes and externalizing behavior, stratified for harsh parenting. The association was statistically significant or approached significance for children without harsh parenting experiences, but was absent in the group with harsh parenting. Similarly, significant associations between single genes and externalizing behavior were only found in the group without harsh parenting. Effect sizes in the groups with and without harsh parenting did not differ significantly. Gene-environment interaction tests were conducted for individual genetic variants, resulting in two significant interaction effects (rs1497023 and rs4922132) after correction for multiple testing. Our findings are suggestive of G × E interplay, with associations between dopamine genes and externalizing behavior present in children without harsh parenting, but not in children with harsh parenting experiences. Harsh parenting may overrule the role of genetic factors in externalizing behavior. Gene-based and gene-set analyses offer promising new alternatives to analyses focusing on single candidate polymorphisms when examining the interplay between genetic and environmental factors.
Cha, Kihoon; Hwang, Taeho; Oh, Kimin; Yi, Gwan-Su
2015-01-01
It has been reported that several brain diseases can be treated as transnosological manner implicating possible common molecular basis under those diseases. However, molecular level commonality among those brain diseases has been largely unexplored. Gene expression analyses of human brain have been used to find genes associated with brain diseases but most of those studies were restricted either to an individual disease or to a couple of diseases. In addition, identifying significant genes in such brain diseases mostly failed when it used typical methods depending on differentially expressed genes. In this study, we used a correlation-based biclustering approach to find coexpressed gene sets in five neurodegenerative diseases and three psychiatric disorders. By using biclustering analysis, we could efficiently and fairly identified various gene sets expressed specifically in both single and multiple brain diseases. We could find 4,307 gene sets correlatively expressed in multiple brain diseases and 3,409 gene sets exclusively specified in individual brain diseases. The function enrichment analysis of those gene sets showed many new possible functional bases as well as neurological processes that are common or specific for those eight diseases. This study introduces possible common molecular bases for several brain diseases, which open the opportunity to clarify the transnosological perspective assumed in brain diseases. It also showed the advantages of correlation-based biclustering analysis and accompanying function enrichment analysis for gene expression data in this type of investigation.
2015-01-01
Background It has been reported that several brain diseases can be treated as transnosological manner implicating possible common molecular basis under those diseases. However, molecular level commonality among those brain diseases has been largely unexplored. Gene expression analyses of human brain have been used to find genes associated with brain diseases but most of those studies were restricted either to an individual disease or to a couple of diseases. In addition, identifying significant genes in such brain diseases mostly failed when it used typical methods depending on differentially expressed genes. Results In this study, we used a correlation-based biclustering approach to find coexpressed gene sets in five neurodegenerative diseases and three psychiatric disorders. By using biclustering analysis, we could efficiently and fairly identified various gene sets expressed specifically in both single and multiple brain diseases. We could find 4,307 gene sets correlatively expressed in multiple brain diseases and 3,409 gene sets exclusively specified in individual brain diseases. The function enrichment analysis of those gene sets showed many new possible functional bases as well as neurological processes that are common or specific for those eight diseases. Conclusions This study introduces possible common molecular bases for several brain diseases, which open the opportunity to clarify the transnosological perspective assumed in brain diseases. It also showed the advantages of correlation-based biclustering analysis and accompanying function enrichment analysis for gene expression data in this type of investigation. PMID:26043779
Leiter, Éva; Bálint, Mihály; Miskei, Márton; Orosz, Erzsébet; Szabó, Zsuzsa; Pócsi, István
2016-07-01
A group of menadione stress-responsive function-unkown genes of Aspergillus nidulans (Locus IDs ANID_03987.1, ANID_06058.1, ANID_10219.1, and ANID_10260.1) was deleted and phenotypically characterized. Importantly, comparative and phylogenetic analyses of the tested A. nidulans genes and their orthologs shed light only on the presence of a TANGO2 domain with NRDE protein motif in the translated ANID_06058.1 gene but did not reveal any recognizable protein-encoding domains in other protein sequences. The gene deletion strains were subjected to oxidative, osmotic, and metal ion stress and, surprisingly, only the ΔANID_10219.1 mutant showed an increased sensitivity to 0.12 mmol l(-1) menadione sodium bisulfite. The gene deletions affected the stress sensitivities (tolerances) irregularly, for example, some strains grew more slowly when exposed to various oxidants and/or osmotic stress generating agents, meanwhile the ΔANID_10260.1 mutant possessed a wild-type tolerance to all stressors tested. Our results are in line with earlier studies demonstrating that the deletions of stress-responsive genes do not confer necessarily any stress-sensitivity phenotypes, which can be attributed to compensatory mechanisms based on other elements of the stress response system with overlapping functions. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Kamfwa, Kelvin; Zhao, Dongyan; Kelly, James D.
2017-01-01
Common bean (Phaseolus vulgaris L.) fixes atmospheric nitrogen (N2) through symbiotic nitrogen fixation (SNF) at levels lower than other grain legume crops. An understanding of the genes and molecular mechanisms underlying SNF will enable more effective strategies for the genetic improvement of SNF traits in common bean. In this study, transcriptome profiling was used to identify genes and molecular mechanisms underlying SNF differences between two common bean recombinant inbred lines that differed in their N-fixing abilities. Differential gene expression and functional enrichment analyses were performed on leaves, nodules and roots of the two lines when grown under N-fixing and non-fixing conditions. Receptor kinases, transmembrane transporters, and transcription factors were among the differentially expressed genes identified under N-fixing conditions, but not under non-fixing conditions. Genes up-regulated in the stronger nitrogen fixer, SA36, included those involved in molecular functions such as purine nucleoside binding, oxidoreductase and transmembrane receptor activities in nodules, and transport activity in roots. Transcription factors identified in this study are candidates for future work aimed at understanding the functional role of these genes in SNF. Information generated in this study will support the development of gene-based markers to accelerate genetic improvement of SNF in common bean. PMID:28192540
Gu, Lijiao; Li, Libei; Wei, Hengling; Wang, Hantao; Su, Junji; Guo, Yaning; Yu, Shuxun
2018-01-01
WRKY transcription factors play important roles in plant defense, stress response, leaf senescence, and plant growth and development. Previous studies have revealed the important roles of the group IIa GhWRKY genes in cotton. To comprehensively analyze the group IIa GhWRKY genes in upland cotton, we identified 15 candidate group IIa GhWRKY genes in the Gossypium hirsutum genome. The phylogenetic tree, intron-exon structure, motif prediction and Ka/Ks analyses indicated that most group IIa GhWRKY genes shared high similarity and conservation and underwent purifying selection during evolution. In addition, we detected the expression patterns of several group IIa GhWRKY genes in individual tissues as well as during leaf senescence using public RNA sequencing data and real-time quantitative PCR. To better understand the functions of group IIa GhWRKYs in cotton, GhWRKY17 (KF669857) was isolated from upland cotton, and its sequence alignment, promoter cis-acting elements and subcellular localization were characterized. Moreover, the over-expression of GhWRKY17 in Arabidopsis up-regulated the senescence-associated genes AtWRKY53, AtSAG12 and AtSAG13, enhancing the plant's susceptibility to leaf senescence. These findings lay the foundation for further analysis and study of the functions of WRKY genes in cotton.
Regulatory states in the developmental control of gene expression.
Peter, Isabelle S
2017-09-01
A growing body of evidence shows that gene expression in multicellular organisms is controlled by the combinatorial function of multiple transcription factors. This indicates that not the individual transcription factors or signaling molecules, but the combination of expressed regulatory molecules, the regulatory state, should be viewed as the functional unit in gene regulation. Here, I discuss the concept of the regulatory state and its proposed role in the genome-wide control of gene expression. Recent analyses of regulatory gene expression in sea urchin embryos have been instrumental for solving the genomic control of cell fate specification in this system. Some of the approaches that were used to determine the expression of regulatory states during sea urchin embryogenesis are reviewed. Significant developmental changes in regulatory state expression leading to the distinct specification of cell fates are regulated by gene regulatory network circuits. How these regulatory state transitions are encoded in the genome is illuminated using the sea urchin endoderm-mesoderms cell fate decision circuit as an example. These observations highlight the importance of considering developmental gene regulation, and the function of individual transcription factors, in the context of regulatory states. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Characterizing visible and invisible cell wall mutant phenotypes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Carpita, Nicholas C.; McCann, Maureen C.
2015-04-06
About 10% of a plant's genome is devoted to generating the protein machinery to synthesize, remodel, and deconstruct the cell wall. High-throughput genome sequencing technologies have enabled a reasonably complete inventory of wall-related genes that can be assembled into families of common evolutionary origin. Assigning function to each gene family member has been aided immensely by identification of mutants with visible phenotypes or by chemical and spectroscopic analysis of mutants with ‘invisible’ phenotypes of modified cell wall composition and architecture that do not otherwise affect plant growth or development. This review connects the inference of gene function on the basismore » of deviation from the wild type in genetic functional analyses to insights provided by modern analytical techniques that have brought us ever closer to elucidating the sequence structures of the major polysaccharide components of the plant cell wall.« less
RNA editing in Drosophila melanogaster: new targets and functionalconsequences
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stapleton, Mark; Carlson, Joseph W.; Celniker, Susan E.
2006-09-05
Adenosine deaminases that act on RNA (ADARs) catalyze the site-specific conversion of adenosine to inosine in primary mRNA transcripts. These re-coding events affect coding potential, splice-sites, and stability of mature mRNAs. ADAR is an essential gene and studies in mouse, C. elegans, and Drosophila suggest its primary function is to modify adult behavior by altering signaling components in the nervous system. By comparing the sequence of isogenic cDNAs to genomic DNA, we have identified and experimentally verified 27 new targets of Drosophila ADAR. Our analyses lead us to identify new classes of genes whose transcripts are targets of ADAR includingmore » components of the actin cytoskeleton, and genes involved in ion homeostasis and signal transduction. Our results indicate that editing in Drosophila increases the diversity of the proteome, and does so in a manner that has direct functional consequences on protein function.« less
Liu, Ping-Li; Du, Liang; Huang, Yuan; Gao, Shu-Min; Yu, Meng
2017-02-07
Leucine-rich repeat receptor-like protein kinases (LRR-RLKs) are the largest group of receptor-like kinases in plants and play crucial roles in development and stress responses. The evolutionary relationships among LRR-RLK genes have been investigated in flowering plants; however, no comprehensive studies have been performed for these genes in more ancestral groups. The subfamily classification of LRR-RLK genes in plants, the evolutionary history and driving force for the evolution of each LRR-RLK subfamily remain to be understood. We identified 119 LRR-RLK genes in the Physcomitrella patens moss genome, 67 LRR-RLK genes in the Selaginella moellendorffii lycophyte genome, and no LRR-RLK genes in five green algae genomes. Furthermore, these LRR-RLK sequences, along with previously reported LRR-RLK sequences from Arabidopsis thaliana and Oryza sativa, were subjected to evolutionary analyses. Phylogenetic analyses revealed that plant LRR-RLKs belong to 19 subfamilies, eighteen of which were established in early land plants, and one of which evolved in flowering plants. More importantly, we found that the basic structures of LRR-RLK genes for most subfamilies are established in early land plants and conserved within subfamilies and across different plant lineages, but divergent among subfamilies. In addition, most members of the same subfamily had common protein motif compositions, whereas members of different subfamilies showed variations in protein motif compositions. The unique gene structure and protein motif compositions of each subfamily differentiate the subfamily classifications and, more importantly, provide evidence for functional divergence among LRR-RLK subfamilies. Maximum likelihood analyses showed that some sites within four subfamilies were under positive selection. Much of the diversity of plant LRR-RLK genes was established in early land plants. Positive selection contributed to the evolution of a few LRR-RLK subfamilies.
Digital gene expression analysis of the zebra finch genome
2010-01-01
Background In order to understand patterns of adaptation and molecular evolution it is important to quantify both variation in gene expression and nucleotide sequence divergence. Gene expression profiling in non-model organisms has recently been facilitated by the advent of massively parallel sequencing technology. Here we investigate tissue specific gene expression patterns in the zebra finch (Taeniopygia guttata) with special emphasis on the genes of the major histocompatibility complex (MHC). Results Almost 2 million 454-sequencing reads from cDNA of six different tissues were assembled and analysed. A total of 11,793 zebra finch transcripts were represented in this EST data, indicating a transcriptome coverage of about 65%. There was a positive correlation between the tissue specificity of gene expression and non-synonymous to synonymous nucleotide substitution ratio of genes, suggesting that genes with a specialised function are evolving at a higher rate (or with less constraint) than genes with a more general function. In line with this, there was also a negative correlation between overall expression levels and expression specificity of contigs. We found evidence for expression of 10 different genes related to the MHC. MHC genes showed relatively tissue specific expression levels and were in general primarily expressed in spleen. Several MHC genes, including MHC class I also showed expression in brain. Furthermore, for all genes with highest levels of expression in spleen there was an overrepresentation of several gene ontology terms related to immune function. Conclusions Our study highlights the usefulness of next-generation sequence data for quantifying gene expression in the genome as a whole as well as in specific candidate genes. Overall, the data show predicted patterns of gene expression profiles and molecular evolution in the zebra finch genome. Expression of MHC genes in particular, corresponds well with expression patterns in other vertebrates. PMID:20359325
Zhang, Wei-Dong; Zhao, Yong; Zhang, Hong-Fu; Wang, Shu-Kun; Hao, Zhi-Hui; Liu, Jing; Yuan, Yu-Qing; Zhang, Peng-Fei; Yang, Hong-Di; Shen, Wei; Li, Lan
2016-08-01
Granulosa cells (GCs) are those somatic cells closest to the female germ cell. GCs play a vital role in oocyte growth and development, and the oocyte is necessary for multiplication of a species. Zinc oxide (ZnO) nanoparticles (NPs) readily cross biologic barriers to be absorbed into biologic systems that make them promising candidates as food additives. The objective of the present investigation was to explore the impact of intact NPs on gene expression and the functional classification of altered genes in hen GCs in vivo, to compare the data from in vivo and in vitro studies, and finally to point out the adverse effects of ZnO NPs on the reproductive system. After a 24-week treatment, hen GCs were isolated and gene expression was quantified. Intact NPs were found in the ovary and other organs. Zn levels were similar in ZnO-NP-100 mg/kg- and ZnSO4-100 mg/kg-treated hen ovaries. ZnO-NP-100 mg/kg and ZnSO4-100 mg/kg regulated the expression of the same sets of genes, and they also altered the expression of different sets of genes individually. The number of genes altered by the ZnO-NP-100 mg/kg and ZnSO4-100 mg/kg treatments was different. Gene Ontology (GO) functional analysis reported that different results for the two treatments and, in Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment, 12 pathways (out of the top 20 pathways) in each treatment were different. These results suggested that intact NPs and Zn(2+) had different effects on gene expression in GCs in vivo. In our recent publication, we noted that intact NPs and Zn(2+) differentially altered gene expression in GCs in vitro. However, GO functional classification and KEGG pathway enrichment analyses revealed close similarities for the changed genes in vivo and in vitro after ZnO NP treatment. Furthermore, close similarities were observed for the changed genes after ZnSO4 treatments in vivo and in vitro by GO functional classification and KEGG pathway enrichment analyses. Therefore, the effects of ZnO NPs on gene expression in vitro might represent their effects on gene expression in vivo. The results from this study and our earlier studies support previous findings indicating ZnO NPs promote adverse effects on organisms. Therefore, precautions should be taken when ZnO NPs are used as diet additives for hens because they might cause reproductive issues. Copyright © 2016 Elsevier Inc. All rights reserved.
Ali, Zulfiqar; Zhang, Da Yong; Xu, Zhao Long; Xu, Ling; Yi, Jin Xin; He, Xiao Lan; Huang, Yi Hong; Liu, Xiao Qing; Khan, Asif Ali; Trethowan, Richard M.; Ma, Hong Xiang
2012-01-01
Soil salinity has very adverse effects on growth and yield of crop plants. Several salt tolerant wild accessions and cultivars are reported in soybean. Functional genomes of salt tolerant Glycine soja and a salt sensitive genotype of Glycine max were investigated to understand the mechanism of salt tolerance in soybean. For this purpose, four libraries were constructed for Tag sequencing on Illumina platform. We identify around 490 salt responsive genes which included a number of transcription factors, signaling proteins, translation factors and structural genes like transporters, multidrug resistance proteins, antiporters, chaperons, aquaporins etc. The gene expression levels and ratio of up/down-regulated genes was greater in tolerant plants. Translation related genes remained stable or showed slightly higher expression in tolerant plants under salinity stress. Further analyses of sequenced data and the annotations for gene ontology and pathways indicated that soybean adapts to salt stress through ABA biosynthesis and regulation of translation and signal transduction of structural genes. Manipulation of these pathways may mitigate the effect of salt stress thus enhancing salt tolerance. PMID:23209559
Naville, M; Warren, I A; Haftek-Terreau, Z; Chalopin, D; Brunet, F; Levin, P; Galiana, D; Volff, J-N
2016-04-01
Viruses and transposable elements, once considered as purely junk and selfish sequences, have repeatedly been used as a source of novel protein-coding genes during the evolution of most eukaryotic lineages, a phenomenon called 'molecular domestication'. This is exemplified perfectly in mammals and other vertebrates, where many genes derived from long terminal repeat (LTR) retroelements (retroviruses and LTR retrotransposons) have been identified through comparative genomics and functional analyses. In particular, genes derived from gag structural protein and envelope (env) genes, as well as from the integrase-coding and protease-coding sequences, have been identified in humans and other vertebrates. Retroelement-derived genes are involved in many important biological processes including placenta formation, cognitive functions in the brain and immunity against retroelements, as well as in cell proliferation, apoptosis and cancer. These observations support an important role of retroelement-derived genes in the evolution and diversification of the vertebrate lineage. Copyright © 2016 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
DNA methylation and exposure to ambient air pollution in two prospective cohorts.
Plusquin, Michelle; Guida, Florence; Polidoro, Silvia; Vermeulen, Roel; Raaschou-Nielsen, Ole; Campanella, Gianluca; Hoek, Gerard; Kyrtopoulos, Soterios A; Georgiadis, Panagiotis; Naccarati, Alessio; Sacerdote, Carlotta; Krogh, Vittorio; Bas Bueno-de-Mesquita, H; Monique Verschuren, W M; Sayols-Baixeras, Sergi; Panni, Tommaso; Peters, Annette; Hebels, Dennie G A J; Kleinjans, Jos; Vineis, Paolo; Chadeau-Hyam, Marc
2017-11-01
Long-term exposure to air pollution has been associated with several adverse health effects including cardiovascular, respiratory diseases and cancers. However, underlying molecular alterations remain to be further investigated. The aim of this study is to investigate the effects of long-term exposure to air pollutants on (a) average DNA methylation at functional regions and, (b) individual differentially methylated CpG sites. An assumption is that omic measurements, including the methylome, are more sensitive to low doses than hard health outcomes. This study included blood-derived DNA methylation (Illumina-HM450 methylation) for 454 Italian and 159 Dutch participants from the European Prospective Investigation into Cancer and Nutrition (EPIC). Long-term air pollution exposure levels, including NO 2 , NO x , PM 2.5 , PM coarse , PM 10 , PM 2.5 absorbance (soot) were estimated using models developed within the ESCAPE project, and back-extrapolated to the time of sampling when possible. We meta-analysed the associations between the air pollutants and global DNA methylation, methylation in functional regions and epigenome-wide methylation. CpG sites found differentially methylated with air pollution were further investigated for functional interpretation in an independent population (EnviroGenoMarkers project), where (N=613) participants had both methylation and gene expression data available. Exposure to NO 2 was associated with a significant global somatic hypomethylation (p-value=0.014). Hypomethylation of CpG island's shores and shelves and gene bodies was significantly associated with higher exposures to NO 2 and NO x . Meta-analysing the epigenome-wide findings of the 2 cohorts did not show genome-wide significant associations at single CpG site level. However, several significant CpG were found if the analyses were separated by countries. By regressing gene expression levels against methylation levels of the exposure-related CpG sites, we identified several significant CpG-transcript pairs and highlighted 5 enriched pathways for NO 2 and 9 for NO x mainly related to the immune system and its regulation. Our findings support results on global hypomethylation associated with air pollution, and suggest that the shores and shelves of CpG islands and gene bodies are mostly affected by higher exposure to NO 2 and NO x . Functional differences in the immune system were suggested by transcriptome analyses. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
Marra, Nicholas J; Richards, Vincent P; Early, Angela; Bogdanowicz, Steve M; Pavinski Bitar, Paulina D; Stanhope, Michael J; Shivji, Mahmood S
2017-01-30
Comparative genomic and/or transcriptomic analyses involving elasmobranchs remain limited, with genome level comparisons of the elasmobranch immune system to that of higher vertebrates, non-existent. This paper reports a comparative RNA-seq analysis of heart tissue from seven species, including four elasmobranchs and three teleosts, focusing on immunity, but concomitantly seeking to identify genetic similarities shared by the two lamnid sharks and the single billfish in our study, which could be linked to convergent evolution of regional endothermy. Across seven species, we identified an average of 10,877 Swiss-Prot annotated genes from an average of 32,474 open reading frames within each species' heart transcriptome. About half of these genes were shared between all species while the remainder included functional differences between our groups of interest (elasmobranch vs. teleost and endotherms vs. ectotherms) as revealed by Gene Ontology (GO) and selection analyses. A repeatedly represented functional category, in both the uniquely expressed elasmobranch genes (total of 259) and the elasmobranch GO enrichment results, involved antibody-mediated immunity, either in the recruitment of immune cells (Fc receptors) or in antigen presentation, including such terms as "antigen processing and presentation of exogenous peptide antigen via MHC class II", and such genes as MHC class II, HLA-DPB1. Molecular adaptation analyses identified three genes in elasmobranchs with a history of positive selection, including legumain (LGMN), a gene with roles in both innate and adaptive immunity including producing antigens for presentation by MHC class II. Comparisons between the endothermic and ectothermic species revealed an enrichment of GO terms associated with cardiac muscle contraction in endotherms, with 19 genes expressed solely in endotherms, several of which have significant roles in lipid and fat metabolism. This collective comparative evidence provides the first multi-taxa transcriptomic-based perspective on differences between elasmobranchs and teleosts, and suggests various unique features associated with the adaptive immune system of elasmobranchs, pointing in particular to the potential importance of MHC Class II. This in turn suggests that expanded comparative work involving additional tissues, as well as genome sequencing of multiple elasmobranch species would be productive in elucidating the regulatory and genome architectural hallmarks of elasmobranchs.
Rund, Samuel S C; Yoo, Boyoung; Alam, Camille; Green, Taryn; Stephens, Melissa T; Zeng, Erliang; George, Gary F; Sheppard, Aaron D; Duffield, Giles E; Milenković, Tijana; Pfrender, Michael E
2016-08-18
Marine and freshwater zooplankton exhibit daily rhythmic patterns of behavior and physiology which may be regulated directly by the light:dark (LD) cycle and/or a molecular circadian clock. One of the best-studied zooplankton taxa, the freshwater crustacean Daphnia, has a 24 h diel vertical migration (DVM) behavior whereby the organism travels up and down through the water column daily. DVM plays a critical role in resource tracking and the behavioral avoidance of predators and damaging ultraviolet radiation. However, there is little information at the transcriptional level linking the expression patterns of genes to the rhythmic physiology/behavior of Daphnia. Here we analyzed genome-wide temporal transcriptional patterns from Daphnia pulex collected over a 44 h time period under a 12:12 LD cycle (diel) conditions using a cosine-fitting algorithm. We used a comprehensive network modeling and analysis approach to identify novel co-regulated rhythmic genes that have similar network topological properties and functional annotations as rhythmic genes identified by the cosine-fitting analyses. Furthermore, we used the network approach to predict with high accuracy novel gene-function associations, thus enhancing current functional annotations available for genes in this ecologically relevant model species. Our results reveal that genes in many functional groupings exhibit 24 h rhythms in their expression patterns under diel conditions. We highlight the rhythmic expression of immunity, oxidative detoxification, and sensory process genes. We discuss differences in the chronobiology of D. pulex from other well-characterized terrestrial arthropods. This research adds to a growing body of literature suggesting the genetic mechanisms governing rhythmicity in crustaceans may be divergent from other arthropod lineages including insects. Lastly, these results highlight the power of using a network analysis approach to identify differential gene expression and provide novel functional annotation.
Ancient gene transfer from algae to animals: Mechanisms and evolutionary significance
2012-01-01
Background Horizontal gene transfer (HGT) is traditionally considered to be rare in multicellular eukaryotes such as animals. Recently, many genes of miscellaneous algal origins were discovered in choanoflagellates. Considering that choanoflagellates are the existing closest relatives of animals, we speculated that ancient HGT might have occurred in the unicellular ancestor of animals and affected the long-term evolution of animals. Results Through genome screening, phylogenetic and domain analyses, we identified 14 gene families, including 92 genes, in the tunicate Ciona intestinalis that are likely derived from miscellaneous photosynthetic eukaryotes. Almost all of these gene families are distributed in diverse animals, suggesting that they were mostly acquired by the common ancestor of animals. Their miscellaneous origins also suggest that these genes are not derived from a particular algal endosymbiont. In addition, most genes identified in our analyses are functionally related to molecule transport, cellular regulation and methylation signaling, suggesting that the acquisition of these genes might have facilitated the intercellular communication in the ancestral animal. Conclusions Our findings provide additional evidence that algal genes in aplastidic eukaryotes are not exclusively derived from historical plastids and thus important for interpreting the evolution of eukaryotic photosynthesis. Most importantly, our data represent the first evidence that more anciently acquired genes might exist in animals and that ancient HGT events have played an important role in animal evolution. PMID:22690978
Liu, Su; Rao, Xiang-Jun; Li, Mao-Ye; Feng, Ming-Feng; He, Meng-Zhu; Li, Shi-Guang
2015-03-01
We present the first antennal transcriptome sequencing information for the yellow mealworm beetle, Tenebrio molitor (Coleoptera: Tenebrionidae). Analysis of the transcriptome dataset obtained 52,216,616 clean reads, from which 35,363 unigenes were assembled. Of these, 18,820 unigenes showed significant similarity (E-value <10(-5)) to known proteins in the NCBI non-redundant protein database. Gene ontology (GO) and Cluster of Orthologous Groups (COG) analyses were used for functional classification of these unigenes. We identified 19 putative odorant-binding protein (OBP) genes, 12 chemosensory protein (CSP) genes, 20 olfactory receptor (OR) genes, 6 ionotropic receptor (IR) genes and 2 sensory neuron membrane protein (SNMP) genes. BLASTX best hit results indicated that these chemosensory genes were most identical to their respective orthologs from Tribolium castaneum. Phylogenetic analyses also revealed that the T. molitor OBPs and CSPs are closely related to those of T. castaneum. Real-time quantitative PCR assays showed that eight TmolOBP genes were antennae-specific. Of these, TmolOBP5, TmolOBP7 and TmolOBP16 were found to be predominantly expressed in male antennae, while TmolOBP17 was expressed mainly in the legs of males. Several other genes were identified that were neither tissue-specific nor sex-specific. These results establish a firm foundation for future studies of the chemosensory genes in T. molitor. Copyright © 2015 Elsevier Inc. All rights reserved.
Lineage-specific expansion of IFIT gene family: an insight into coevolution with IFN gene family.
Liu, Ying; Zhang, Yi-Bing; Liu, Ting-Kai; Gui, Jian-Fang
2013-01-01
In mammals, IFIT (Interferon [IFN]-induced proteins with Tetratricopeptide Repeat [TPR] motifs) family genes are involved in many cellular and viral processes, which are tightly related to mammalian IFN response. However, little is known about non-mammalian IFIT genes. In the present study, IFIT genes are identified in the genome databases from the jawed vertebrates including the cartilaginous elephant shark but not from non-vertebrates such as lancelet, sea squirt and acorn worm, suggesting that IFIT gene family originates from a vertebrate ancestor about 450 million years ago. IFIT family genes show conserved gene structure and gene arrangements. Phylogenetic analyses reveal that this gene family has expanded through lineage-specific and species-specific gene duplication. Interestingly, IFN gene family seem to share a common ancestor and a similar evolutionary mechanism; the function link of IFIT genes to IFN response is present early since the origin of both gene families, as evidenced by the finding that zebrafish IFIT genes are upregulated by fish IFNs, poly(I:C) and two transcription factors IRF3/IRF7, likely via the IFN-stimulated response elements (ISRE) within the promoters of vertebrate IFIT family genes. These coevolution features creates functional association of both family genes to fulfill a common biological process, which is likely selected by viral infection during evolution of vertebrates. Our results are helpful for understanding of evolution of vertebrate IFN system.
Pandey, Shashank K; Nookaraju, Akula; Fujino, Takeshi; Pattathil, Sivakumar; Joshi, Chandrashekhar P
2016-11-01
Functional characterization of two tobacco genes, one involved in xylan synthesis and the other, a positive regulator of secondary cell wall formation, is reported. Lignocellulosic secondary cell walls (SCW) provide essential plant materials for the production of second-generation bioethanol. Therefore, thorough understanding of the process of SCW formation in plants is beneficial for efficient bioethanol production. Recently, we provided the first proof-of-concept for using virus-induced gene silencing (VIGS) approach for rapid functional characterization of nine genes involved in cellulose, hemicellulose and lignin synthesis during SCW formation. Here, we report VIGS-mediated functional characterization of two tobacco genes involved in SCW formation. Stems of VIGS plants silenced for both selected genes showed increased amount of xylem formation but thinner cell walls than controls. These results were further confirmed by production of stable transgenic tobacco plants manipulated in expression of these genes. Stems of stable transgenic tobacco plants silenced for these two genes showed increased xylem proliferation with thinner walls, whereas transgenic tobacco plants overexpressing these two genes showed increased fiber cell wall thickness but no change in xylem proliferation. These two selected genes were later identified as possible members of DUF579 family involved in xylan synthesis and KNAT7 transcription factor family involved in positive regulation of SCW formation, respectively. Glycome analyses of cell walls showed increased polysaccharide extractability in 1 M KOH extracts of both VIGS-NbDUF579 and VIGS-NbKNAT7 lines suggestive of cell wall loosening. Also, VIGS-NbDUF579 and VIGS-NbKNAT7 lines showed increased saccharification rates (74.5 and 40 % higher than controls, respectively). All these properties are highly desirable for producing higher quantities of bioethanol from lignocellulosic materials of bioenergy plants.
From the ultrasonic to the infrared: molecular evolution and the sensory biology of bats
Jones, Gareth; Teeling, Emma C.; Rossiter, Stephen J.
2013-01-01
Great advances have been made recently in understanding the genetic basis of the sensory biology of bats. Research has focused on the molecular evolution of candidate sensory genes, genes with known functions [e.g., olfactory receptor (OR) genes] and genes identified from mutations associated with sensory deficits (e.g., blindness and deafness). For example, the FoxP2 gene, underpinning vocal behavior and sensorimotor coordination, has undergone diversification in bats, while several genes associated with audition show parallel amino acid substitutions in unrelated lineages of echolocating bats and, in some cases, in echolocating dolphins, representing a classic case of convergent molecular evolution. Vision genes encoding the photopigments rhodopsin and the long-wave sensitive opsin are functional in bats, while that encoding the short-wave sensitive opsin has lost functionality in rhinolophoid bats using high-duty cycle laryngeal echolocation, suggesting a sensory trade-off between investment in vision and echolocation. In terms of olfaction, bats appear to have a distinctive OR repertoire compared with other mammals, and a gene involved in signal transduction in the vomeronasal system has become non-functional in most bat species. Bitter taste receptors appear to have undergone a “birth-and death” evolution involving extensive gene duplication and loss, unlike genes coding for sweet and umami tastes that show conservation across most lineages but loss in vampire bats. Common vampire bats have also undergone adaptations for thermoperception, via alternative splicing resulting in the evolution of a novel heat-sensitive channel. The future for understanding the molecular basis of sensory biology is promising, with great potential for comparative genomic analyses, studies on gene regulation and expression, exploration of the role of alternative splicing in the generation of proteomic diversity, and linking genetic mechanisms to behavioral consequences. PMID:23755015
Genome-wide analysis of the TPX2 family proteins in Eucalyptus grandis.
Du, Pingzhou; Kumar, Manoj; Yao, Yuan; Xie, Qiaoli; Wang, Jinyan; Zhang, Baolong; Gan, Siming; Wang, Yuqi; Wu, Ai-Min
2016-11-24
The Xklp2 (TPX2) proteins belong to the microtubule-associated (MAP) family of proteins. All members of the family contain the conserved TPX2 motif, which can interact with microtubules, regulate microtubule dynamics or assist with different microtubule functions, for example, maintenance of cell morphology or regulation of cell growth and development. However, the role of members of the TPX family have not been studied in the model tree species Eucalyptus to date. Here, we report the identification of the members of the TPX2 family in Eucalyptus grandis (Eg) and analyse the expression patterns and functions of these genes. In present study, a comprehensive analysis of the plant TPX2 family proteins was performed. Phylogenetic analyses indicated that the genes can be classified into 6 distinct subfamilies. A genome-wide survey identified 12 members of the TPX2 family in the sequenced genome of Eucalyptus grandis. The basic genetic properties of the TPX2 family in Eucalyptus were analysed. Our results suggest that the TPX2 family proteins within different sub-groups are relatively conserved but there are important differences between groups. Quantitative real-time PCR (qRT-PCR) was performed to confirm the expression levels of the genes in different tissues. The results showed that in the whole plant, the levels of EgWDL5 transcript are the highest, followed by those of EgWDL4. Compared with other tissues, the level of the EgMAP20 transcript is the highest in the root. Over-expression of EgMAP20 in Arabidopsis resulted in organ twisting. The cotyledon petioles showed left-handed twisting while the hypocotyl epidermal cells produced right-handed helical twisting. Finally, EgMAP20, EgWDL3 and EgWDL3L were all able to decorate microtubules. Plant TPX2 family proteins were systematically analysed using bioinformatics methods. There are 12 TPX2 family proteins in Eucalyptus. We have performed an initial characterization of the functions of several members of the TPX2 family. We found that the gene products are localized to the microtubule cytoskeleton. Our results lay the foundation for future efforts to reveal the biological significance of TPX2 family proteins in Eucalyptus.
Liu, X Z; Sang, M; Zhang, X A; Zhang, T K; Zhang, H Y; He, X; Li, S X; Sun, X D; Zhang, Z M
2017-05-01
Saccharomyces uvarum is a good wine yeast species that may have great potential for the future. However, sulfur tolerance of most S. uvarum strains is very poor. In addition there is still little information about the SSU1 gene of S. uvarum, which encodes a putative transporter conferring sulfite tolerance. In order to analyze the function of the SSU1 gene, two expression vectors that contained different SSU1 genes were constructed and transferred into a sulfite-tolerant S. uvarum strain, A9. Then sulfite tolerance, SO2 production, and PCR, sequencing, RT-qPCR and transcriptome analyses were used to access the function of the S. uvarum SSU1 gene. Our results illustrated that enhancing expression of the SSU1 gene can promote sulfite resistance in S. uvarum, and an insertion fragment ahead of the additional SSU1 gene, as seen in some alleles, could affect the expression of other genes and the sulfite tolerance level of S. uvarum. This is the first report on enhancing the expression of the SSU1 gene of S. uvarum. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Gene duplications in prokaryotes can be associated with environmental adaptation
2010-01-01
Background Gene duplication is a normal evolutionary process. If there is no selective advantage in keeping the duplicated gene, it is usually reduced to a pseudogene and disappears from the genome. However, some paralogs are retained. These gene products are likely to be beneficial to the organism, e.g. in adaptation to new environmental conditions. The aim of our analysis is to investigate the properties of paralog-forming genes in prokaryotes, and to analyse the role of these retained paralogs by relating gene properties to life style of the corresponding prokaryotes. Results Paralogs were identified in a number of prokaryotes, and these paralogs were compared to singletons of persistent orthologs based on functional classification. This showed that the paralogs were associated with for example energy production, cell motility, ion transport, and defence mechanisms. A statistical overrepresentation analysis of gene and protein annotations was based on paralogs of the 200 prokaryotes with the highest fraction of paralog-forming genes. Biclustering of overrepresented gene ontology terms versus species was used to identify clusters of properties associated with clusters of species. The clusters were classified using similarity scores on properties and species to identify interesting clusters, and a subset of clusters were analysed by comparison to literature data. This analysis showed that paralogs often are associated with properties that are important for survival and proliferation of the specific organisms. This includes processes like ion transport, locomotion, chemotaxis and photosynthesis. However, the analysis also showed that the gene ontology terms sometimes were too general, imprecise or even misleading for automatic analysis. Conclusions Properties described by gene ontology terms identified in the overrepresentation analysis are often consistent with individual prokaryote lifestyles and are likely to give a competitive advantage to the organism. Paralogs and singletons dominate different categories of functional classification, where paralogs in particular seem to be associated with processes involving interaction with the environment. PMID:20961426
Gene duplications in prokaryotes can be associated with environmental adaptation.
Bratlie, Marit S; Johansen, Jostein; Sherman, Brad T; Huang, Da Wei; Lempicki, Richard A; Drabløs, Finn
2010-10-20
Gene duplication is a normal evolutionary process. If there is no selective advantage in keeping the duplicated gene, it is usually reduced to a pseudogene and disappears from the genome. However, some paralogs are retained. These gene products are likely to be beneficial to the organism, e.g. in adaptation to new environmental conditions. The aim of our analysis is to investigate the properties of paralog-forming genes in prokaryotes, and to analyse the role of these retained paralogs by relating gene properties to life style of the corresponding prokaryotes. Paralogs were identified in a number of prokaryotes, and these paralogs were compared to singletons of persistent orthologs based on functional classification. This showed that the paralogs were associated with for example energy production, cell motility, ion transport, and defence mechanisms. A statistical overrepresentation analysis of gene and protein annotations was based on paralogs of the 200 prokaryotes with the highest fraction of paralog-forming genes. Biclustering of overrepresented gene ontology terms versus species was used to identify clusters of properties associated with clusters of species. The clusters were classified using similarity scores on properties and species to identify interesting clusters, and a subset of clusters were analysed by comparison to literature data. This analysis showed that paralogs often are associated with properties that are important for survival and proliferation of the specific organisms. This includes processes like ion transport, locomotion, chemotaxis and photosynthesis. However, the analysis also showed that the gene ontology terms sometimes were too general, imprecise or even misleading for automatic analysis. Properties described by gene ontology terms identified in the overrepresentation analysis are often consistent with individual prokaryote lifestyles and are likely to give a competitive advantage to the organism. Paralogs and singletons dominate different categories of functional classification, where paralogs in particular seem to be associated with processes involving interaction with the environment.
Mansour, Hader A; Wood, Joel; Chowdari, Kodavali V; Tumuluru, Divya; Bamne, Mikhil; Monk, Timothy H; Hall, Martica H; Buysse, Daniel J; Nimgaonkar, Vishwajit L
2017-01-01
A variable number tandem repeat polymorphism (VNTR) in the period 3 (PER3) gene has been associated with heritable sleep and circadian variables, including self-rated chronotypes, polysomnographic (PSG) variables, insomnia and circadian sleep-wake disorders. This report describes novel molecular and clinical analyses of PER3 VNTR polymorphisms to better define their functional consequences. As the PER3 VNTR is located in the exonic (protein coding) region of PER3, we initially investigated whether both alleles (variants) are transcribed into messenger RNA in human fibroblasts. The VNTR showed bi-allelic gene expression. We next investigated genetic associations in relation to clinical variables in 274 older adult Caucasian individuals. Independent variables included genotypes for the PER3 VNTR as well as a representative set of single nucleotide polymorphisms (SNPs) that tag common variants at the PER3 locus (linkage disequilibrium (LD) between genetic variants < 0.5). In order to comprehensively evaluate variables analyzed individually in prior analyses, dependent measures included PSG total sleep time and sleep latency, self-rated chronotype, estimated with the Composite Scale (CS), and lifestyle regularity, estimated using the social rhythm metric (SRM). Initially, genetic polymorphisms were individually analyzed in relation to each outcome variable using analysis of variance (ANOVA). Nominally significant associations were further tested using regression analyses that incorporated individual ANOVA-associated DNA variants as potential predictors and each of the selected sleep/circadian variables as outcomes. The covariates included age, gender, body mass index and an index of medical co-morbidity. Significant genetic associations with the VNTR were not detected with the sleep or circadian variables. Nominally significant associations were detected between SNP rs1012477 and CS scores (p = 0.003) and between rs10462021 and SRM (p = 0.047); rs11579477 and average delta power (p = 0.043) (analyses uncorrected for multiple comparisons). In conclusion, alleles of the VNTR are expressed at the transcript level and may have a functional effect in cells expressing the PER3 gene. PER3 polymorphisms had a modest impact on selected sleep/circadian variables in our sample, suggesting that PER3 is associated with sleep and circadian function beyond VNTR polymorphisms. Further replicate analyses in larger, independent samples are recommended.
NASA Astrophysics Data System (ADS)
Xue, Zhuang; Li, Hui; Liu, Yang; Zhou, Wei; Sun, Jing; Wang, Xiuli
2017-12-01
As a `living fossil' of species origin and `rich treasure' of food and nutrition development, sea cucumber has received a lot of attentions from researchers. The cDNA library construction and EST sequencing of blood had been conducted previously in our lab. The bioinformatic analysis provided a gene fragment which is highly homologous with the genes of lectin family, named AjL ( Apostichopus japonicus lectin). To characterize and determine the phylogeny of AjL genes in early evolution, we isolated a full-length cDNA of lectin gene from the body wall of A. japonicus. The open reading frame of this gene contained 489 bp and encoded a 163 amino acids secretory protein being homologous to lectins of mammals and aquatic organisms. The deduced protein included a lectin-like domain. SDS-PAGE analysis showed that AjL migrated as a specific band (about 36.09 kDa under reducing), and agglutinated against rabbit red blood cells. AjL was similar to chain A of CEL-IV in space structure. We predicted that AjL may play the same role of CEL-IV. Our results suggested that more than one lectin gene functioned in sea cucumber and most of other species, which was fused by uncertain sequences during the evolution and encoded different proteins with diverse functions. Our findings provided the insights into the function and characteristics of lectin genes invertebrates. The results will also be helpful for the identification and structural, functional, and evolutionary analyses of lectin genes.
Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation
Faria, José P.; Davis, James J.; Edirisinghe, Janaka N.; Taylor, Ronald C.; Weisenhorn, Pamela; Olson, Robert D.; Stevens, Rick L.; Rocha, Miguel; Rocha, Isabel; Best, Aaron A.; DeJongh, Matthew; Tintle, Nathan L.; Parrello, Bruce; Overbeek, Ross; Henry, Christopher S.
2016-01-01
Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. An important step toward meeting the challenge of understanding gene function and regulation is the identification of sets of genes that are always co-expressed. These gene sets, Atomic Regulons (ARs), represent fundamental units of function within a cell and could be used to associate genes of unknown function with cellular processes and to enable rational genetic engineering of cellular systems. Here, we describe an approach for inferring ARs that leverages large-scale expression data sets, gene context, and functional relationships among genes. We computed ARs for Escherichia coli based on 907 gene expression experiments and compared our results with gene clusters produced by two prevalent data-driven methods: Hierarchical clustering and k-means clustering. We compared ARs and purely data-driven gene clusters to the curated set of regulatory interactions for E. coli found in RegulonDB, showing that ARs are more consistent with gold standard regulons than are data-driven gene clusters. We further examined the consistency of ARs and data-driven gene clusters in the context of gene interactions predicted by Context Likelihood of Relatedness (CLR) analysis, finding that the ARs show better agreement with CLR predicted interactions. We determined the impact of increasing amounts of expression data on AR construction and find that while more data improve ARs, it is not necessary to use the full set of gene expression experiments available for E. coli to produce high quality ARs. In order to explore the conservation of co-regulated gene sets across different organisms, we computed ARs for Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus, each of which represents increasing degrees of phylogenetic distance from E. coli. Comparison of the organism-specific ARs showed that the consistency of AR gene membership correlates with phylogenetic distance, but there is clear variability in the regulatory networks of closely related organisms. As large scale expression data sets become increasingly common for model and non-model organisms, comparative analyses of atomic regulons will provide valuable insights into fundamental regulatory modules used across the bacterial domain. PMID:27933038
Wang, James K. T.; Langfelder, Peter; Horvath, Steve; Palazzolo, Michael J.
2017-01-01
Huntington's disease (HD) is a progressive and autosomal dominant neurodegeneration caused by CAG expansion in the huntingtin gene (HTT), but the pathophysiological mechanism of mutant HTT (mHTT) remains unclear. To study HD using systems biological methodologies on all published data, we undertook the first comprehensive curation of two key PubMed HD datasets: perturbation genes that impact mHTT-driven endpoints and therefore are putatively linked causally to pathogenic mechanisms, and the protein interactome of HTT that reflects its biology. We perused PubMed articles containing co-citation of gene IDs and MeSH terms of interest to generate mechanistic gene sets for iterative enrichment analyses and rank ordering. The HD Perturbation database of 1,218 genes highly overlaps the HTT Interactome of 1,619 genes, suggesting links between normal HTT biology and mHTT pathology. These two HD datasets are enriched for protein networks of key genes underlying two mechanisms not previously implicated in HD nor in each other: exosome synaptic functions and homeostatic synaptic plasticity. Moreover, proteins, possibly including HTT, and miRNA detected in exosomes from a wide variety of sources also highly overlap the HD datasets, suggesting both mechanistic and biomarker links. Finally, the HTT Interactome highly intersects protein networks of pathogenic genes underlying Parkinson's, Alzheimer's and eight non-HD polyglutamine diseases, ALS, and spinal muscular atrophy. These protein networks in turn highly overlap the exosome and homeostatic synaptic plasticity gene sets. Thus, we hypothesize that HTT and other neurodegeneration pathogenic genes form a large interlocking protein network involved in exosome and homeostatic synaptic functions, particularly where the two mechanisms intersect. Mutant pathogenic proteins cause dysfunctions at distinct points in this network, each altering the two mechanisms in specific fashion that contributes to distinct disease pathologies, depending on the gene mutation and the cellular and biological context. This protein network is rich with drug targets, and exosomes may provide disease biomarkers, thus enabling drug discovery. All the curated datasets are made available for other investigators. Elucidating the roles of pathogenic neurodegeneration genes in exosome and homeostatic synaptic functions may provide a unifying framework for the age-dependent, progressive and tissue selective nature of multiple neurodegenerative diseases. PMID:28611571
Wang, James K T; Langfelder, Peter; Horvath, Steve; Palazzolo, Michael J
2017-01-01
Huntington's disease (HD) is a progressive and autosomal dominant neurodegeneration caused by CAG expansion in the huntingtin gene ( HTT ), but the pathophysiological mechanism of mutant HTT (mHTT) remains unclear. To study HD using systems biological methodologies on all published data, we undertook the first comprehensive curation of two key PubMed HD datasets: perturbation genes that impact mHTT-driven endpoints and therefore are putatively linked causally to pathogenic mechanisms, and the protein interactome of HTT that reflects its biology. We perused PubMed articles containing co-citation of gene IDs and MeSH terms of interest to generate mechanistic gene sets for iterative enrichment analyses and rank ordering. The HD Perturbation database of 1,218 genes highly overlaps the HTT Interactome of 1,619 genes, suggesting links between normal HTT biology and mHTT pathology. These two HD datasets are enriched for protein networks of key genes underlying two mechanisms not previously implicated in HD nor in each other: exosome synaptic functions and homeostatic synaptic plasticity. Moreover, proteins, possibly including HTT, and miRNA detected in exosomes from a wide variety of sources also highly overlap the HD datasets, suggesting both mechanistic and biomarker links. Finally, the HTT Interactome highly intersects protein networks of pathogenic genes underlying Parkinson's, Alzheimer's and eight non-HD polyglutamine diseases, ALS, and spinal muscular atrophy. These protein networks in turn highly overlap the exosome and homeostatic synaptic plasticity gene sets. Thus, we hypothesize that HTT and other neurodegeneration pathogenic genes form a large interlocking protein network involved in exosome and homeostatic synaptic functions, particularly where the two mechanisms intersect. Mutant pathogenic proteins cause dysfunctions at distinct points in this network, each altering the two mechanisms in specific fashion that contributes to distinct disease pathologies, depending on the gene mutation and the cellular and biological context. This protein network is rich with drug targets, and exosomes may provide disease biomarkers, thus enabling drug discovery. All the curated datasets are made available for other investigators. Elucidating the roles of pathogenic neurodegeneration genes in exosome and homeostatic synaptic functions may provide a unifying framework for the age-dependent, progressive and tissue selective nature of multiple neurodegenerative diseases.
Jones, Melissa K; Lu, Bin; Saghizadeh, Mehrnoosh; Wang, Shaomei
2016-01-01
Retinal degenerative diseases (RDDs) affect millions of people and are the leading cause of vision loss. Although treatment options for RDDs are limited, stem and progenitor cell-based therapies have great potential to halt or slow the progression of vision loss. Our previous studies have shown that a single subretinal injection of human forebrain derived neural progenitor cells (hNPCs) into the Royal College of Surgeons (RCS) retinal degenerate rat offers long-term preservation of photoreceptors and visual function. Furthermore, neural progenitor cells are currently in clinical trials for treating age-related macular degeneration; however, the molecular mechanisms of stem cell-based therapies are largely unknown. This is the first study to analyze gene expression changes in the retina of RCS rats following subretinal injection of hNPCs using high-throughput sequencing. RNA-seq data of retinas from RCS rats injected with hNPCs (RCS(hNPCs)) were compared to sham surgery in RCS (RCS(sham)) and wild-type Long Evans (LE(sham)) rats. Differential gene expression patterns were determined with in silico analysis and confirmed with qRT-PCR. Function, biologic, cellular component, and pathway analyses were performed on differentially expressed genes and investigated with immunofluorescent staining experiments. Analysis of the gene expression data sets identified 1,215 genes that were differentially expressed between RCS(sham) and LE(sham) samples. Additionally, 283 genes were differentially expressed between the RCS(hNPCs) and RCS(sham) samples. Comparison of these two gene sets identified 68 genes with inverse expression (termed rescue genes), including Pdc, Rp1, and Cdc42ep5. Functional, biologic, and cellular component analyses indicate that the immune response is enhanced in RCS(sham). Pathway analysis of the differential expression gene sets identified three affected pathways in RCS(hNPCs), which all play roles in phagocytosis signaling. Immunofluorescent staining detected the increased presence of macrophages and microglia in RCS(sham) retinas, which decreased in RCS(hNPCs) retinas similar to the patterns detected in LE(sham). The results from this study provide evidence of the gene expression changes that occur following treatment with hNPCs in the degenerating retina. This information can be used in future studies to potentially enhance or predict responses to hNPC and other stem cell therapies for retinal degenerative diseases.
Nigam, Deepti; Sawant, Samir V
2013-01-01
Technological development led to an increased interest in systems biological approaches in plants to characterize developmental mechanism and candidate genes relevant to specific tissue or cell morphology. AUX-IAA proteins are important plant-specific putative transcription factors. There are several reports on physiological response of this family in Arabidopsis but in cotton fiber the transcriptional network through which AUX-IAA regulated its target genes is still unknown. in-silico modelling of cotton fiber development specific gene expression data (108 microarrays and 22,737 genes) using Algorithm for the Reconstruction of Accurate Cellular Networks (ARACNe) reveals 3690 putative AUX-IAA target genes of which 139 genes were known to be AUX-IAA co-regulated within Arabidopsis. Further AUX-IAA targeted gene regulatory network (GRN) had substantial impact on the transcriptional dynamics of cotton fiber, as showed by, altered TF networks, and Gene Ontology (GO) biological processes and metabolic pathway associated with its target genes. Analysis of the AUX-IAA-correlated gene network reveals multiple functions for AUX-IAA target genes such as unidimensional cell growth, cellular nitrogen compound metabolic process, nucleosome organization, DNA-protein complex and process related to cell wall. These candidate networks/pathways have a variety of profound impacts on such cellular functions as stress response, cell proliferation, and cell differentiation. While these functions are fairly broad, their underlying TF networks may provide a global view of AUX-IAA regulated gene expression and a GRN that guides future studies in understanding role of AUX-IAA box protein and its targets regulating fiber development. PMID:24497725
Singh, Vikash K.; Jain, Mukesh; Garg, Rohini
2014-01-01
Growth hormone auxin regulates various cellular processes by altering the expression of diverse genes in plants. Among various auxin-responsive genes, GH3 genes maintain endogenous auxin homeostasis by conjugating excess of auxin with amino acids. GH3 genes have been characterized in many plant species, but not in legumes. In the present work, we identified members of GH3 gene family and analyzed their chromosomal distribution, gene structure, gene duplication and phylogenetic analysis in different legumes, including chickpea, soybean, Medicago, and Lotus. A comprehensive expression analysis in different vegetative and reproductive tissues/stages revealed that many of GH3 genes were expressed in a tissue-specific manner. Notably, chickpea CaGH3-3, soybean GmGH3-8 and -25, and Lotus LjGH3-4, -5, -9 and -18 genes were up-regulated in root, indicating their putative role in root development. In addition, chickpea CaGH3-1 and -7, and Medicago MtGH3-7, -8, and -9 were found to be highly induced under drought and/or salt stresses, suggesting their role in abiotic stress responses. We also observed the examples of differential expression pattern of duplicated GH3 genes in soybean, indicating their functional diversification. Furthermore, analyses of three-dimensional structures, active site residues and ligand preferences provided molecular insights into function of GH3 genes in legumes. The analysis presented here would help in investigation of precise function of GH3 genes in legumes during development and stress conditions. PMID:25642236
Romero, Ibeth; Soares, Maurilio José; Romanha, Alvaro José
2017-01-01
ABSTRACT Leishmaniasis is a neglected tropical disease that affects millions of people worldwide and represents a major public health problem. Information on protein expression patterns and functional roles within the context of Leishmania-infected human monocyte-derived macrophages (MDMs) under drug treatment conditions is essential for understanding the role of these cells in leishmaniasis treatment. We analyzed functional changes in the expression of human MDM genes and proteins during in vitro infection by Leishmania braziliensis and treatment with Glucantime (SbV), using quantitative PCR (qPCR) arrays, Western blotting, confocal microscopy, and small interfering RNA (siRNA) human gene inhibition assays. Comparison of the results from gene transcription and protein expression analyses revealed that glutathione S-transferase π1 (GSTP1), glutamate-cysteine ligase modifier subunit (GCLM), glutathione reductase (GSR), glutathione synthetase (GSS), thioredoxin (TRX), and ATP-binding cassette, subfamily B, member 5 (ABCB5), were strongly upregulated at both the mRNA and protein levels in human MDMs that were infected and treated, compared to the control group. Subcellular localization studies showed a primarily phagolysosomal location for the ABCB5 transporter, indicating that this protein may be involved in the transport of SbV. By inducing a decrease in L. braziliensis intracellular survival in THP-1 macrophages, siRNA silencing of GSTP1, GSS, and ABCB5 resulted in an increased leishmanicidal effect of SbV exposure in vitro. Our results suggest that human MDMs infected with L. braziliensis and treated with SbV express increased levels of genes participating in antioxidant defense, whereas our functional analyses provide evidence for the involvement of human MDMs in drug detoxification. Therefore, we conclude that GSS, GSTP1, and ABCB5 proteins represent potential targets for enhancing the leishmanicidal activity of Glucantime. PMID:28461312
Genomic and genetic analyses of diversity and plant interactions of Pseudomonas fluorescens
Silby, Mark W; Cerdeño-Tárraga, Ana M; Vernikos, Georgios S; Giddens, Stephen R; Jackson, Robert W; Preston, Gail M; Zhang, Xue-Xian; Moon, Christina D; Gehrig, Stefanie M; Godfrey, Scott AC; Knight, Christopher G; Malone, Jacob G; Robinson, Zena; Spiers, Andrew J; Harris, Simon; Challis, Gregory L; Yaxley, Alice M; Harris, David; Seeger, Kathy; Murphy, Lee; Rutter, Simon; Squares, Rob; Quail, Michael A; Saunders, Elizabeth; Mavromatis, Konstantinos; Brettin, Thomas S; Bentley, Stephen D; Hothersall, Joanne; Stephens, Elton; Thomas, Christopher M; Parkhill, Julian; Levy, Stuart B; Rainey, Paul B; Thomson, Nicholas R
2009-01-01
Background Pseudomonas fluorescens are common soil bacteria that can improve plant health through nutrient cycling, pathogen antagonism and induction of plant defenses. The genome sequences of strains SBW25 and Pf0-1 were determined and compared to each other and with P. fluorescens Pf-5. A functional genomic in vivo expression technology (IVET) screen provided insight into genes used by P. fluorescens in its natural environment and an improved understanding of the ecological significance of diversity within this species. Results Comparisons of three P. fluorescens genomes (SBW25, Pf0-1, Pf-5) revealed considerable divergence: 61% of genes are shared, the majority located near the replication origin. Phylogenetic and average amino acid identity analyses showed a low overall relationship. A functional screen of SBW25 defined 125 plant-induced genes including a range of functions specific to the plant environment. Orthologues of 83 of these exist in Pf0-1 and Pf-5, with 73 shared by both strains. The P. fluorescens genomes carry numerous complex repetitive DNA sequences, some resembling Miniature Inverted-repeat Transposable Elements (MITEs). In SBW25, repeat density and distribution revealed 'repeat deserts' lacking repeats, covering approximately 40% of the genome. Conclusions P. fluorescens genomes are highly diverse. Strain-specific regions around the replication terminus suggest genome compartmentalization. The genomic heterogeneity among the three strains is reminiscent of a species complex rather than a single species. That 42% of plant-inducible genes were not shared by all strains reinforces this conclusion and shows that ecological success requires specialized and core functions. The diversity also indicates the significant size of genetic information within the Pseudomonas pan genome. PMID:19432983
Partnering for functional genomics research conference: Abstracts of poster presentations
DOE Office of Scientific and Technical Information (OSTI.GOV)
NONE
1998-06-01
This reports contains abstracts of poster presentations presented at the Functional Genomics Research Conference held April 16--17, 1998 in Oak Ridge, Tennessee. Attention is focused on the following areas: mouse mutagenesis and genomics; phenotype screening; gene expression analysis; DNA analysis technology development; bioinformatics; comparative analyses of mouse, human, and yeast sequences; and pilot projects to evaluate methodologies.
Balhana, Ricardo J C; Singla, Ashima; Sikder, Mahmudul Hasan; Withers, Mike; Kendall, Sharon L
2015-06-27
Mycobacteria inhabit diverse niches and display high metabolic versatility. They can colonise both humans and animals and are also able to survive in the environment. In order to succeed, response to environmental cues via transcriptional regulation is required. In this study we focused on the TetR family of transcriptional regulators (TFTRs) in mycobacteria. We used InterPro to classify the entire complement of transcriptional regulators in 10 mycobacterial species and these analyses showed that TFTRs are the most abundant family of regulators in all species. We identified those TFTRs that are conserved across all species analysed and those that are unique to the pathogens included in the analysis. We examined genomic contexts of 663 of the conserved TFTRs and observed that the majority of TFTRs are separated by 200 bp or less from divergently oriented genes. Analyses of divergent genes indicated that the TFTRs control diverse biochemical functions not limited to efflux pumps. TFTRs typically bind to palindromic motifs and we identified 11 highly significant novel motifs in the upstream regions of divergently oriented TFTRs. The C-terminal ligand binding domain from the TFTR complement in M. tuberculosis showed great diversity in amino acid sequence but with an overall architecture common to other TFTRs. This study suggests that mycobacteria depend on TFTRs for the transcriptional control of a number of metabolic functions yet the physiological role of the majority of these regulators remain unknown.
History of a prolific family: the Hes/Hey-related genes of the annelid Platynereis.
Gazave, Eve; Guillou, Aurélien; Balavoine, Guillaume
2014-01-01
The Hes superfamily or Hes/Hey-related genes encompass a variety of metazoan-specific bHLH genes, with somewhat fuzzy phylogenetic relationships. Hes superfamily members are involved in a variety of major developmental mechanisms in metazoans, notably in neurogenesis and segmentation processes, in which they often act as direct effector genes of the Notch signaling pathway. We have investigated the molecular and functional evolution of the Hes superfamily in metazoans using the lophotrochozoan Platynereis dumerilii as model. Our phylogenetic analyses of more than 200 Metazoan Hes/Hey-related genes revealed the presence of five families, three of them (Hes, Hey and Helt) being pan-metazoan. Those families were likely composed of a unique representative in the last common metazoan ancestor. The evolution of the Hes family was shaped by many independent lineage specific tandem duplication events. The expression patterns of 13 of the 15 Hes/Hey-related genes in Platynereis indicate a broad functional diversification. Nevertheless, a majority of these genes are involved in two crucial developmental processes in annelids: neurogenesis and segmentation, resembling functions highlighted in other animal models. Combining phylogenetic and expression data, our study suggests an unusual evolutionary history for the Hes superfamily. An ancestral multifunctional annelid Hes gene may have undergone multiples rounds of duplication-degeneration-complementation processes in the lineage leading to Platynereis, each gene copies ensuring their maintenance in the genome by subfunctionalisation. Similar but independent waves of duplications are at the origin of the multiplicity of Hes genes in other metazoan lineages.
Itoh, Takeshi; Tanaka, Tsuyoshi; Barrero, Roberto A.; Yamasaki, Chisato; Fujii, Yasuyuki; Hilton, Phillip B.; Antonio, Baltazar A.; Aono, Hideo; Apweiler, Rolf; Bruskiewich, Richard; Bureau, Thomas; Burr, Frances; Costa de Oliveira, Antonio; Fuks, Galina; Habara, Takuya; Haberer, Georg; Han, Bin; Harada, Erimi; Hiraki, Aiko T.; Hirochika, Hirohiko; Hoen, Douglas; Hokari, Hiroki; Hosokawa, Satomi; Hsing, Yue; Ikawa, Hiroshi; Ikeo, Kazuho; Imanishi, Tadashi; Ito, Yukiyo; Jaiswal, Pankaj; Kanno, Masako; Kawahara, Yoshihiro; Kawamura, Toshiyuki; Kawashima, Hiroaki; Khurana, Jitendra P.; Kikuchi, Shoshi; Komatsu, Setsuko; Koyanagi, Kanako O.; Kubooka, Hiromi; Lieberherr, Damien; Lin, Yao-Cheng; Lonsdale, David; Matsumoto, Takashi; Matsuya, Akihiro; McCombie, W. Richard; Messing, Joachim; Miyao, Akio; Mulder, Nicola; Nagamura, Yoshiaki; Nam, Jongmin; Namiki, Nobukazu; Numa, Hisataka; Nurimoto, Shin; O’Donovan, Claire; Ohyanagi, Hajime; Okido, Toshihisa; OOta, Satoshi; Osato, Naoki; Palmer, Lance E.; Quetier, Francis; Raghuvanshi, Saurabh; Saichi, Naomi; Sakai, Hiroaki; Sakai, Yasumichi; Sakata, Katsumi; Sakurai, Tetsuya; Sato, Fumihiko; Sato, Yoshiharu; Schoof, Heiko; Seki, Motoaki; Shibata, Michie; Shimizu, Yuji; Shinozaki, Kazuo; Shinso, Yuji; Singh, Nagendra K.; Smith-White, Brian; Takeda, Jun-ichi; Tanino, Motohiko; Tatusova, Tatiana; Thongjuea, Supat; Todokoro, Fusano; Tsugane, Mika; Tyagi, Akhilesh K.; Vanavichit, Apichart; Wang, Aihui; Wing, Rod A.; Yamaguchi, Kaori; Yamamoto, Mayu; Yamamoto, Naoyuki; Yu, Yeisoo; Zhang, Hao; Zhao, Qiang; Higo, Kenichi; Burr, Benjamin; Gojobori, Takashi; Sasaki, Takuji
2007-01-01
We present here the annotation of the complete genome of rice Oryza sativa L. ssp. japonica cultivar Nipponbare. All functional annotations for proteins and non-protein-coding RNA (npRNA) candidates were manually curated. Functions were identified or inferred in 19,969 (70%) of the proteins, and 131 possible npRNAs (including 58 antisense transcripts) were found. Almost 5000 annotated protein-coding genes were found to be disrupted in insertional mutant lines, which will accelerate future experimental validation of the annotations. The rice loci were determined by using cDNA sequences obtained from rice and other representative cereals. Our conservative estimate based on these loci and an extrapolation suggested that the gene number of rice is ∼32,000, which is smaller than previous estimates. We conducted comparative analyses between rice and Arabidopsis thaliana and found that both genomes possessed several lineage-specific genes, which might account for the observed differences between these species, while they had similar sets of predicted functional domains among the protein sequences. A system to control translational efficiency seems to be conserved across large evolutionary distances. Moreover, the evolutionary process of protein-coding genes was examined. Our results suggest that natural selection may have played a role for duplicated genes in both species, so that duplication was suppressed or favored in a manner that depended on the function of a gene. PMID:17210932
Waschburger, Edgar; Kulcheski, Franceli Rodrigues; Veto, Nicole Moreira; Margis, Rogerio; Margis-Pinheiro, Marcia; Turchetto-Zolet, Andreia Carina
2018-01-01
Abstract sn-Glycerol-3-phosphate 1-O-acyltransferase (GPAT) is an important enzyme that catalyzes the transfer of an acyl group from acyl-CoA or acyl-ACP to the sn-1 or sn-2 position of sn-glycerol-3-phosphate (G3P) to generate lysophosphatidic acids (LPAs). The functional studies of GPAT in plants demonstrated its importance in controlling storage and membrane lipid. Identifying genes encoding GPAT in a variety of plant species is crucial to understand their involvement in different metabolic pathways and physiological functions. Here, we performed genome-wide and evolutionary analyses of GPATs in plants. GPAT genes were identified in all algae and plants studied. The phylogenetic analysis showed that these genes group into three main clades. While clades I (GPAT9) and II (soluble GPAT) include GPATs from algae and plants, clade III (GPAT1-8) includes GPATs specific from plants that are involved in the biosynthesis of cutin or suberin. Gene organization and the expression pattern of GPATs in plants corroborate with clade formation in the phylogeny, suggesting that the evolutionary patterns is reflected in their functionality. Overall, our results provide important insights into the evolution of the plant GPATs and allowed us to explore the evolutionary mechanism underlying the functional diversification among these genes. PMID:29583156
Key role of dual specificity kinase TTK in proliferation and survival of pancreatic cancer cells
Kaistha, B P; Honstein, T; Müller, V; Bielak, S; Sauer, M; Kreider, R; Fassan, M; Scarpa, A; Schmees, C; Volkmer, H; Gress, T M; Buchholz, M
2014-01-01
Background: Pancreatic ductal adenocarcinoma (PDAC) is among the most aggressive human malignancies with an overall 5-year survival rate of <5%. Despite significant advances in treatment of the disease during the past decade, the median survival rate (∼6 months) has hardly improved, warranting the need to identify novel targets for therapeutic approaches. Methods: Quantitative real time PCR, western blot analyses and immunohistochemical staining of tissue microarrays were used to analyse the expression of TTK gene in primary PDAC tissues and cell lines. To inhibit TTK kinase expression in a variety of pancreatic cancer cell lines, RNA interference was used. Functional roles of this kinase in the context of PDAC were studied using cell proliferation, viability and anchorage-independent growth assays. Western blotting, fluorescence-activated cell sorting analyses and fluorescence microscopy were used to gain mechanistic insight into the functional effects. Conclusions: We show that the dual specificity kinase TTK (also known as Mps1), is strongly overexpressed in human PDAC. Functionally, cell proliferation was significantly attenuated following TTK knockdown, whereas apoptosis and necrosis rates were significantly increased. In addition, anchorage-independent growth, a hallmark of malignant transformation and metastatic potential, was strongly impaired in the absence of TTK gene function. Interestingly, immortalised normal pancreatic hTERT-HPNE cells were not affected by loss of TTK function. Mechanistically, these effects in cancer cells were associated with increased formation of micronuclei, suggesting that loss of TTK function in pancreatic cancer cells results in chromosomal instability and mitotic catastrophe. Taken together, our data show that TTK function is critical for growth and proliferation of pancreatic cancer cells, thus establishing this kinase as an interesting new target for novel therapeutic approaches in combating this malignancy. PMID:25137017
Polymorphisms in the AOX2 gene are associated with the rooting ability of olive cuttings.
Hedayati, Vahideh; Mousavi, Amir; Razavi, Khadijeh; Cultrera, Nicolò; Alagna, Fiammetta; Mariotti, Roberto; Hosseini-Mazinani, Mehdi; Baldoni, Luciana
2015-07-01
Different rooting ability candidate genes were tested on an olive cross progeny. Our results demonstrated that only the AOX2 gene was strongly induced. OeAOX2 was fully characterised and correlated to phenotypical traits. The formation of adventitious roots is a key step in the vegetative propagation of trees crop species, and this ability is under strict genetic control. While numerous studies have been carried out to identify genes controlling adventitious root formation, only a few loci have been characterised. In this work, candidate genes that were putatively involved in rooting ability were identified in olive (Olea europaea L.) by similarity with orthologs identified in other plant species. The mRNA levels of these genes were analysed by real-time PCR during root induction in high- (HR) and low-rooting (LR) individuals. Interestingly, alternative oxidase 2 (AOX2), which was previously reported to be a functional marker for rooting in olive cuttings, showed a strong induction in HR individuals. From the OeAOX2 full-length gene, alleles and effective polymorphisms were distinguished and analysed in the cross progeny, which were segregated based on rooting. The results revealed a possible correlation between two single nucleotide polymorphisms of OeAOX2 gene and rooting ability.
Biase, Fernando H; Kimble, Katelyn M
2018-05-10
The maturation and successful acquisition of developmental competence by an oocyte, the female gamete, during folliculogenesis is highly dependent on molecular interactions with somatic cells. Most of the cellular interactions identified, thus far, are modulated by growth factors, ions or metabolites. We hypothesized that this interaction is also modulated at the transcriptional level, which leads to the formation of gene regulatory networks between the oocyte and cumulus cells. We tested this hypothesis by analyzing transcriptome data from single oocytes and the surrounding cumulus cells collected from antral follicles employing an analytical framework to determine interdependencies at the transcript level. We overlapped our transcriptome data with putative protein-protein interactions and identified hundreds of ligand-receptor pairs that can transduce paracrine signaling between an oocyte and cumulus cells. We determined that 499 ligand-encoding genes expressed in oocytes and cumulus cells are functionally associated with transcription regulation (FDR < 0.05). Ligand-encoding genes with specific expression in oocytes or cumulus cells were enriched for biological functions that are likely associated with the coordinated formation of transzonal projections from cumulus cells that reach the oocyte's membrane. Thousands of gene pairs exhibit significant linear co-expression (absolute correlation > 0.85, FDR < 1.8 × 10 - 5 ) patterns between oocytes and cumulus cells. Hundreds of co-expressing genes showed clustering patterns associated with biological functions (FDR < 0.5) necessary for a coordinated function between the oocyte and cumulus cells during folliculogenesis (i.e. regulation of transcription, translation, apoptosis, cell differentiation and transport). Our analyses revealed a complex and functional gene regulatory circuit between the oocyte and surrounding cumulus cells. The regulatory profile of each cumulus-oocyte complex is likely associated with the oocytes' developmental potential to derive an embryo.
Romand, Raymond; Ripp, Raymond; Poidevin, Laetitia; Boeglin, Marcel; Geffers, Lars; Dollé, Pascal; Poch, Olivier
2015-01-01
An in situ hybridization (ISH) study was performed on 2000 murine genes representing around 10% of the protein-coding genes present in the mouse genome using data generated by the EURExpress consortium. This study was carried out in 25 tissues of late gestation embryos (E14.5), with a special emphasis on the developing ear and on five distinct developing sensory organs, including the cochlea, the vestibular receptors, the sensory retina, the olfactory organ, and the vibrissae follicles. The results obtained from an analysis of more than 11,000 micrographs have been integrated in a newly developed knowledgebase, called ImAnno. In addition to managing the multilevel micrograph annotations performed by human experts, ImAnno provides public access to various integrated databases and tools. Thus, it facilitates the analysis of complex ISH gene expression patterns, as well as functional annotation and interaction of gene sets. It also provides direct links to human pathways and diseases. Hierarchical clustering of expression patterns in the 25 tissues revealed three main branches corresponding to tissues with common functions and/or embryonic origins. To illustrate the integrative power of ImAnno, we explored the expression, function and disease traits of the sensory epithelia of the five presumptive sensory organs. The study identified 623 genes (out of 2000) concomitantly expressed in the five embryonic epithelia, among which many (∼12%) were involved in human disorders. Finally, various multilevel interaction networks were characterized, highlighting differential functional enrichments of directly or indirectly interacting genes. These analyses exemplify an under-represention of "sensory" functions in the sensory gene set suggests that E14.5 is a pivotal stage between the developmental stage and the functional phase that will be fully reached only after birth.
Plasticity of genetic interactions in metabolic networks of yeast.
Harrison, Richard; Papp, Balázs; Pál, Csaba; Oliver, Stephen G; Delneri, Daniela
2007-02-13
Why are most genes dispensable? The impact of gene deletions may depend on the environment (plasticity), the presence of compensatory mechanisms (mutational robustness), or both. Here, we analyze the interaction between these two forces by exploring the condition-dependence of synthetic genetic interactions that define redundant functions and alternative pathways. We performed systems-level flux balance analysis of the yeast (Saccharomyces cerevisiae) metabolic network to identify genetic interactions and then tested the model's predictions with in vivo gene-deletion studies. We found that the majority of synthetic genetic interactions are restricted to certain environmental conditions, partly because of the lack of compensation under some (but not all) nutrient conditions. Moreover, the phylogenetic cooccurrence of synthetically interacting pairs is not significantly different from random expectation. These findings suggest that these gene pairs have at least partially independent functions, and, hence, compensation is only a byproduct of their evolutionary history. Experimental analyses that used multiple gene deletion strains not only confirmed predictions of the model but also showed that investigation of false predictions may both improve functional annotation within the model and also lead to the discovery of higher-order genetic interactions. Our work supports the view that functional redundancy may be more apparent than real, and it offers a unified framework for the evolution of environmental adaptation and mutational robustness.
Comparative Metagenomics Revealed Commonly Enriched Gene Sets in Human Gut Microbiomes
Kurokawa, Ken; Itoh, Takehiko; Kuwahara, Tomomi; Oshima, Kenshiro; Toh, Hidehiro; Toyoda, Atsushi; Takami, Hideto; Morita, Hidetoshi; Sharma, Vineet K.; Srivastava, Tulika P.; Taylor, Todd D.; Noguchi, Hideki; Mori, Hiroshi; Ogura, Yoshitoshi; Ehrlich, Dusko S.; Itoh, Kikuji; Takagi, Toshihisa; Sakaki, Yoshiyuki; Hayashi, Tetsuya; Hattori, Masahira
2007-01-01
Numerous microbes inhabit the human intestine, many of which are uncharacterized or uncultivable. They form a complex microbial community that deeply affects human physiology. To identify the genomic features common to all human gut microbiomes as well as those variable among them, we performed a large-scale comparative metagenomic analysis of fecal samples from 13 healthy individuals of various ages, including unweaned infants. We found that, while the gut microbiota from unweaned infants were simple and showed a high inter-individual variation in taxonomic and gene composition, those from adults and weaned children were more complex but showed a high functional uniformity regardless of age or sex. In searching for the genes over-represented in gut microbiomes, we identified 237 gene families commonly enriched in adult-type and 136 families in infant-type microbiomes, with a small overlap. An analysis of their predicted functions revealed various strategies employed by each type of microbiota to adapt to its intestinal environment, suggesting that these gene sets encode the core functions of adult and infant-type gut microbiota. By analysing the orphan genes, 647 new gene families were identified to be exclusively present in human intestinal microbiomes. In addition, we discovered a conjugative transposon family explosively amplified in human gut microbiomes, which strongly suggests that the intestine is a ‘hot spot’ for horizontal gene transfer between microbes. PMID:17916580
Li, Jin; Zheng, Le; Uchiyama, Akihiko; Bin, Lianghua; Mauro, Theodora M; Elias, Peter M; Pawelczyk, Tadeusz; Sakowicz-Burkiewicz, Monika; Trzeciak, Magdalena; Leung, Donald Y M; Morasso, Maria I; Yu, Peng
2018-06-13
A large volume of biological data is being generated for studying mechanisms of various biological processes. These precious data enable large-scale computational analyses to gain biological insights. However, it remains a challenge to mine the data efficiently for knowledge discovery. The heterogeneity of these data makes it difficult to consistently integrate them, slowing down the process of biological discovery. We introduce a data processing paradigm to identify key factors in biological processes via systematic collection of gene expression datasets, primary analysis of data, and evaluation of consistent signals. To demonstrate its effectiveness, our paradigm was applied to epidermal development and identified many genes that play a potential role in this process. Besides the known epidermal development genes, a substantial proportion of the identified genes are still not supported by gain- or loss-of-function studies, yielding many novel genes for future studies. Among them, we selected a top gene for loss-of-function experimental validation and confirmed its function in epidermal differentiation, proving the ability of this paradigm to identify new factors in biological processes. In addition, this paradigm revealed many key genes in cold-induced thermogenesis using data from cold-challenged tissues, demonstrating its generalizability. This paradigm can lead to fruitful results for studying molecular mechanisms in an era of explosive accumulation of publicly available biological data.
Chemical-genetic profile analysis of five inhibitory compounds in yeast.
Alamgir, Md; Erukova, Veronika; Jessulat, Matthew; Azizi, Ali; Golshani, Ashkan
2010-08-06
Chemical-genetic profiling of inhibitory compounds can lead to identification of their modes of action. These profiles can help elucidate the complex interactions between small bioactive compounds and the cell machinery, and explain putative gene function(s). Colony size reduction was used to investigate the chemical-genetic profile of cycloheximide, 3-amino-1,2,4-triazole, paromomycin, streptomycin and neomycin in the yeast Saccharomyces cerevisiae. These compounds target the process of protein biosynthesis. More than 70,000 strains were analyzed from the array of gene deletion mutant yeast strains. As expected, the overall profiles of the tested compounds were similar, with deletions for genes involved in protein biosynthesis being the major category followed by metabolism. This implies that novel genes involved in protein biosynthesis could be identified from these profiles. Further investigations were carried out to assess the activity of three profiled genes in the process of protein biosynthesis using relative fitness of double mutants and other genetic assays. Chemical-genetic profiles provide insight into the molecular mechanism(s) of the examined compounds by elucidating their potential primary and secondary cellular target sites. Our follow-up investigations into the activity of three profiled genes in the process of protein biosynthesis provided further evidence concerning the usefulness of chemical-genetic analyses for annotating gene functions. We termed these genes TAE2, TAE3 and TAE4 for translation associated elements 2-4.