explore gene function: Topics by Science.gov

Sample records for explore gene function

Partitioning of functional gene expression data using principal points.

PubMed

Kim, Jaehee; Kim, Haseong

2017-10-12

DNA microarrays offer motivation and hope for the simultaneous study of variations in multiple genes. Gene expression is a temporal process that allows variations in expression levels with a characterized gene function over a period of time. Temporal gene expression curves can be treated as functional data since they are considered as independent realizations of a stochastic process. This process requires appropriate models to identify patterns of gene functions. The partitioning of the functional data can find homogeneous subgroups of entities for the massive genes within the inherent biological networks. Therefore it can be a useful technique for the analysis of time-course gene expression data. We propose a new self-consistent partitioning method of functional coefficients for individual expression profiles based on the orthonormal basis system. A principal points based functional partitioning method is proposed for time-course gene expression data. The method explores the relationship between genes using Legendre coefficients as principal points to extract the features of gene functions. Our proposed method provides high connectivity in connectedness after clustering for simulated data and finds significant subsets of genes with increased connectivity. Our approach has the comparative advantages that fewer coefficients are used from the functional data and that the principal points used for partitioning are self-consistent. As real data applications, we are able to find partitioned genes through the gene expressions found in budding yeast data and Escherichia coli data. The proposed method benefits from the use of principal points, dimension reduction, and the choice of orthogonal basis system, and it provides appropriately connected genes in the resulting subsets. We illustrate our method by applying it to each set of cell-cycle-regulated time-course yeast genes and E. coli genes. The proposed method is able to identify highly connected genes and to explore the complex dynamics of biological systems in functional genomics.
Exploring the Yeast Acetylome Using Functional Genomics

PubMed Central

Duffy, Supipi Kaluarachchi; Friesen, Helena; Baryshnikova, Anastasia; Lambert, Jean-Philippe; Chong, Yolanda T.; Figeys, Daniel; Andrews, Brenda

2014-01-01

SUMMARY Lysine acetylation is a dynamic posttranslational modification with a well-defined role in regulating histones. The impact of acetylation on other cellular functions remains relatively uncharacterized. We explored the budding yeast acetylome with a functional genomics approach, assessing the effects of gene overexpression in the absence of lysine deacetylases (KDACs). We generated a network of 463 synthetic dosage lethal (SDL) interactions involving class I and II KDACs, revealing many cellular pathways regulated by different KDACs. A biochemical survey of genes interacting with the KDAC RPD3 identified 72 proteins acetylated in vivo. In-depth analysis of one of these proteins, Swi4, revealed a role for acetylation in G1-specific gene expression. Acetylation of Swi4 regulates interaction with its partner Swi6, both components of the SBF transcription factor. This study expands our view of the yeast acetylome, demonstrates the utility of functional genomic screens for exploring enzymatic pathways, and provides functional information that can be mined for future studies. PMID:22579291
Mechanistic Explanations for Restricted Evolutionary Paths That Emerge from Gene Regulatory Networks

PubMed Central

Cotterell, James; Sharpe, James

2013-01-01

The extent and the nature of the constraints to evolutionary trajectories are central issues in biology. Constraints can be the result of systems dynamics causing a non-linear mapping between genotype and phenotype. How prevalent are these developmental constraints and what is their mechanistic basis? Although this has been extensively explored at the level of epistatic interactions between nucleotides within a gene, or amino acids within a protein, selection acts at the level of the whole organism, and therefore epistasis between disparate genes in the genome is expected due to their functional interactions within gene regulatory networks (GRNs) which are responsible for many aspects of organismal phenotype. Here we explore epistasis within GRNs capable of performing a common developmental function – converting a continuous morphogen input into discrete spatial domains. By exploring the full complement of GRN wiring designs that are able to perform this function, we analyzed all possible mutational routes between functional GRNs. Through this study we demonstrate that mechanistic constraints are common for GRNs that perform even a simple function. We demonstrate a common mechanistic cause for such a constraint involving complementation between counter-balanced gene-gene interactions. Furthermore we show how such constraints can be bypassed by means of “permissive” mutations that buffer changes in a direct route between two GRN topologies that would normally be unviable. We show that such bypasses are common and thus we suggest that unlike what was observed in protein sequence-function relationships, the “tape of life” is less reproducible when one considers higher levels of biological organization. PMID:23613807
bc-GenExMiner 3.0: new mining module computes breast cancer gene expression correlation analyses.

PubMed

Jézéquel, Pascal; Frénel, Jean-Sébastien; Campion, Loïc; Guérin-Charbonnel, Catherine; Gouraud, Wilfried; Ricolleau, Gabriel; Campone, Mario

2013-01-01

We recently developed a user-friendly web-based application called bc-GenExMiner (http://bcgenex.centregauducheau.fr), which offered the possibility to evaluate prognostic informativity of genes in breast cancer by means of a 'prognostic module'. In this study, we develop a new module called 'correlation module', which includes three kinds of gene expression correlation analyses. The first one computes correlation coefficient between 2 or more (up to 10) chosen genes. The second one produces two lists of genes that are most correlated (positively and negatively) to a 'tested' gene. A gene ontology (GO) mining function is also proposed to explore GO 'biological process', 'molecular function' and 'cellular component' terms enrichment for the output lists of most correlated genes. The third one explores gene expression correlation between the 15 telomeric and 15 centromeric genes surrounding a 'tested' gene. These correlation analyses can be performed in different groups of patients: all patients (without any subtyping), in molecular subtypes (basal-like, HER2+, luminal A and luminal B) and according to oestrogen receptor status. Validation tests based on published data showed that these automatized analyses lead to results consistent with studies' conclusions. In brief, this new module has been developed to help basic researchers explore molecular mechanisms of breast cancer. DATABASE URL: http://bcgenex.centregauducheau.fr
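The second analysis described above, listing the genes most positively and most negatively correlated with a 'tested' gene, can be sketched as follows. This is a hypothetical illustration and not bc-GenExMiner's code: the function name, the genes-by-patients matrix layout, and the use of Pearson correlation are all assumptions.

```python
import numpy as np


def most_correlated(expr, gene_names, target, top_n=3):
    """Return (positive, negative) lists of (gene, r) pairs for the target gene.

    expr is assumed to be a genes x patients array; gene_names lists the row labels.
    """
    corr = np.corrcoef(expr)  # Pearson correlation between every pair of rows
    idx = gene_names.index(target)
    others = [(gene_names[j], corr[idx, j])
              for j in range(len(gene_names)) if j != idx]
    ranked = sorted(others, key=lambda pair: pair[1], reverse=True)
    positive = ranked[:top_n]        # highest r first
    negative = ranked[::-1][:top_n]  # lowest (most negative) r first
    return positive, negative
```

Restricting the columns of `expr` to one patient group (for example a single molecular subtype) before calling the function would mimic the subgroup analyses the abstract mentions.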
Contrasting microbial functional genes in two distinct saline-alkali and slightly acidic oil-contaminated sites.

PubMed

Liang, Yuting; Zhao, Huihui; Zhang, Xu; Zhou, Jizhong; Li, Guanghe

2014-07-15

To compare the functional gene structure and diversity of microbial communities in saline-alkali and slightly acidic oil-contaminated sites, 40 soil samples were collected from two typical oil exploration sites in North and South China and analyzed with a comprehensive functional gene array (GeoChip 3.0). The overall microbial pattern was significantly different between the two sites, and a more divergent pattern was observed in slightly acidic soils. Response ratio was calculated to compare the microbial functional genes involved in organic contaminant degradation and carbon, nitrogen, phosphorus, and sulfur cycling. The results indicated a significantly low abundance of most genes involved in organic contaminant degradation and in the cycling of nitrogen and phosphorus in saline-alkali soils. By contrast, most carbon degradation genes and all carbon fixation genes had similar abundance at both sites. Based on the relationship between the environmental variables and microbial functional structure, pH was the major factor influencing the microbial distribution pattern in the two sites. This study demonstrated that microbial functional diversity and heterogeneity in oil-contaminated environments can vary significantly in relation to local environmental conditions. The limitation of nitrogen and phosphorus and the low degradation capacity of organic contaminant should be carefully considered, particularly in most oil-exploration sites with saline-alkali soils. Copyright © 2014 Elsevier B.V. All rights reserved.
Ensemble gene function prediction database reveals genes important for complex I formation in Arabidopsis thaliana.

PubMed

Hansen, Bjoern Oest; Meyer, Etienne H; Ferrari, Camilla; Vaid, Neha; Movahedi, Sara; Vandepoele, Klaas; Nikoloski, Zoran; Mutwil, Marek

2018-03-01

Recent advances in gene function prediction rely on ensemble approaches that integrate results from multiple inference methods to produce superior predictions. Yet, these developments remain largely unexplored in plants. We have explored and compared two methods to integrate 10 gene co-function networks for Arabidopsis thaliana and demonstrate how the integration of these networks produces more accurate gene function predictions for a larger fraction of genes with unknown function. These predictions were used to identify genes involved in mitochondrial complex I formation, and for five of them, we confirmed the predictions experimentally. The ensemble predictions are provided as a user-friendly online database, EnsembleNet. The methods presented here demonstrate that ensemble gene function prediction is a powerful method to boost prediction performance, whereas the EnsembleNet database provides a cutting-edge community tool to guide experimentalists. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
BIOLOGICAL NETWORK EXPLORATION WITH CYTOSCAPE 3

PubMed Central

Su, Gang; Morris, John H.; Demchak, Barry; Bader, Gary D.

2014-01-01

Cytoscape is one of the most popular open-source software tools for the visual exploration of biomedical networks composed of protein, gene and other types of interactions. It offers researchers a versatile and interactive visualization interface for exploring complex biological interconnections supported by diverse annotation and experimental data, thereby facilitating research tasks such as predicting gene function and pathway construction. Cytoscape provides core functionality to load, visualize, search, filter and save networks, and hundreds of Apps extend this functionality to address specific research needs. The latest generation of Cytoscape (version 3.0 and later) has substantial improvements in function, user interface and performance relative to previous versions. This protocol aims to jump-start new users with specific protocols for basic Cytoscape functions, such as installing Cytoscape and Cytoscape Apps, loading data, visualizing and navigating the network, visualizing network associated data (attributes) and identifying clusters. It also highlights new features that benefit experienced users. PMID:25199793
FamNet: A Framework to Identify Multiplied Modules Driving Pathway Expansion in Plants

PubMed Central

Tohge, Takayuki; Klie, Sebastian; Fernie, Alisdair R.

2016-01-01

Gene duplications generate new genes that can acquire similar but often diversified functions. Recent studies of gene coexpression networks have indicated that, not only genes, but also pathways can be multiplied and diversified to perform related functions in different parts of an organism. Identification of such diversified pathways, or modules, is needed to expand our knowledge of biological processes in plants and to understand how biological functions evolve. However, systematic explorations of modules remain scarce, and no user-friendly platform to identify them exists. We have established a statistical framework to identify modules and show that approximately one-third of the genes of a plant’s genome participate in hundreds of multiplied modules. Using this framework as a basis, we implemented a platform that can explore and visualize multiplied modules in coexpression networks of eight plant species. To validate the usefulness of the platform, we identified and functionally characterized pollen- and root-specific cell wall modules that multiplied to confer tip growth in pollen tubes and root hairs, respectively. Furthermore, we identified multiplied modules involved in secondary metabolite synthesis and corroborated them by metabolite profiling of tobacco (Nicotiana tabacum) tissues. The interactive platform, referred to as FamNet, is available at http://www.gene2function.de/famnet.html. PMID:26754669
Solution Hybrid Selection Capture for the Recovery of Functional Full-Length Eukaryotic cDNAs From Complex Environmental Samples

PubMed Central

Bragalini, Claudia; Ribière, Céline; Parisot, Nicolas; Vallon, Laurent; Prudent, Elsa; Peyretaillade, Eric; Girlanda, Mariangela; Peyret, Pierre; Marmeisse, Roland; Luis, Patricia

2014-01-01

Eukaryotic microbial communities play key functional roles in soil biology and potentially represent a rich source of natural products including biocatalysts. Culture-independent molecular methods are powerful tools to isolate functional genes from uncultured microorganisms. However, none of the methods used in environmental genomics allow for a rapid isolation of numerous functional genes from eukaryotic microbial communities. We developed an original adaptation of the solution hybrid selection (SHS) for an efficient recovery of functional complementary DNAs (cDNAs) synthesized from soil-extracted polyadenylated mRNAs. This protocol was tested on the Glycoside Hydrolase 11 gene family encoding endo-xylanases for which we designed 35 explorative 31-mer capture probes. SHS was implemented on four soil eukaryotic cDNA pools. After two successive rounds of capture, >90% of the resulting cDNAs were GH11 sequences, of which 70% (38 among 53 sequenced genes) were full length. Between 1.5 and 25% of the cloned captured sequences were expressed in Saccharomyces cerevisiae. Sequencing of polymerase chain reaction-amplified GH11 gene fragments from the captured sequences highlighted hundreds of phylogenetically diverse sequences that were not yet described in public databases. This protocol offers the possibility of performing exhaustive exploration of eukaryotic gene families within microbial communities thriving in any type of environment. PMID:25281543
Pairwise gene GO-based measures for biclustering of high-dimensional expression data.

PubMed

Nepomuceno, Juan A; Troncoso, Alicia; Nepomuceno-Chamorro, Isabel A; Aguilar-Ruiz, Jesús S

2018-01-01

Biclustering algorithms search for groups of genes that share the same behavior under a subset of samples in gene expression data. Nowadays, the biological knowledge available in public repositories can be used to drive these algorithms to find biclusters composed of functionally coherent groups of genes. On the other hand, a distance among genes can be defined according to their information stored in Gene Ontology (GO). Gene pairwise GO semantic similarity measures report a value for each pair of genes which establishes their functional similarity. A scatter search-based algorithm that optimizes a merit function that integrates GO information is studied in this paper. This merit function uses a term that addresses the information through a GO measure. The effect of two possible different gene pairwise GO measures on the performance of the algorithm is analyzed. Firstly, three well-known yeast datasets with approximately one thousand genes are studied. Secondly, a group of human datasets related to clinical data of cancer is also explored by the algorithm. Most of these data are high-dimensional datasets composed of a huge number of genes. The resultant biclusters reveal groups of genes linked by the same functionality when the search procedure is driven by one of the proposed GO measures. Furthermore, a qualitative biological study of a group of biclusters shows their relevance from a cancer disease perspective. It can be concluded that the integration of biological information improves the performance of the biclustering process. The two different GO measures studied show an improvement in the results obtained for the yeast dataset. However, if datasets are composed of a huge number of genes, only one of them really improves the algorithm performance. This second case constitutes a clear option to explore interesting datasets from a clinical point of view.
New genes contribute to genetic and phenotypic novelties in human evolution

PubMed Central

Zhang, Yong E.; Long, Manyuan

2014-01-01

New genes in human genomes have been found relevant to the evolution and biology of humans. It was conservatively estimated that the human genome encodes more than 300 human-specific genes and 1,000 primate-specific genes. These new arrivals appear to be implicated in brain function and male reproduction. Surprisingly, increasing evidence indicates that they may also bring negative pleiotropic effects, while assuming various possible biological functions as sources of phenotypic novelties, suggesting a non-progressive route for functional evolution. Similar to these fixed new genes, polymorphic new genes were found to contribute to functional evolution within species, e.g. with respect to digestion or disease resistance, revealing that new genes can acquire new or diverged functions in their initial stage as prototypic genes. This progress has provided new opportunities to explore the genetic basis of human biology and human evolutionary history in a new dimension. PMID:25218862
Global analysis of human duplicated genes reveals the relative importance of whole-genome duplicates originated in the early vertebrate evolution.

PubMed

Acharya, Debarun; Ghosh, Tapash C

2016-01-22

Gene duplication is a genetic mutation that creates functionally redundant gene copies that are initially relieved from selective pressures and may adapt themselves to new functions with time. The levels of gene duplication may vary from small-scale duplication (SSD) to whole genome duplication (WGD). Studies with yeast revealed ample differences between these duplicates: Yeast WGD pairs were functionally more similar, less divergent in subcellular localization and contained a lesser proportion of essential genes. In this study, we explored the differences in evolutionary genomic properties of human SSD and WGD genes, with the identifiable human duplicates coming from the two rounds of whole genome duplication that occurred early in vertebrate evolution. We observed that these two groups of duplicates were also dissimilar in terms of their evolutionary and genomic properties. Interestingly, however, the pattern differs from that observed in yeast. The human WGDs were found to be functionally less similar, to diverge more at the subcellular level and to contain a higher proportion of essential genes than the SSDs, all of which are opposite to yeast. Additionally, we found that human WGDs were more divergent in their gene expression profiles, had higher multifunctionality, were more often associated with disease, and were evolutionarily more conserved than human SSDs. Our study suggests that human WGD duplicates are more divergent, which entails the adaptation of WGDs to novel and important functions that consequently lead to their evolutionary conservation.
Identifying osteosarcoma metastasis associated genes by weighted gene co-expression network analysis (WGCNA).

PubMed

Tian, Honglai; Guan, Donghui; Li, Jianmin

2018-06-01

Osteosarcoma (OS), the most common malignant bone tumor, poses a heavy health threat to children and adolescents. OS occurrence usually correlates with early metastasis and a high death rate. This study aimed to better understand the mechanism of OS metastasis. From the Gene Expression Omnibus (GEO) database, we downloaded 4 expression profile data sets associated with OS metastasis and selected differentially expressed genes. The weighted gene co-expression network analysis (WGCNA) approach allowed us to investigate the most OS metastasis-correlated module. Gene Ontology functional and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were used to annotate the selected OS metastasis-associated genes. We selected 897 differentially expressed genes from the OS metastasis and OS non-metastasis groups. Based on these selected genes, WGCNA further identified 142 genes included in the most OS metastasis-correlated module. Gene Ontology functional and KEGG pathway enrichment analyses showed that significantly OS metastasis-associated genes were involved in pathways correlated with insulin-like growth factor binding. Our research identified several potential molecules participating in the metastasis process and factors acting as biomarkers. With this study, we could better explore the mechanism of OS metastasis and further discover more therapy targets.
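The core of a weighted co-expression network is a soft-thresholded adjacency: for an unsigned network, the absolute correlation between two genes is raised to a power beta, which keeps strong correlations and pushes weak ones toward zero. The sketch below shows only that step and the resulting per-gene connectivity. It is not the WGCNA R package and not this study's pipeline; the function names and the default beta = 6 are assumptions for illustration.

```python
import numpy as np


def soft_threshold_adjacency(expr, beta=6):
    """Unsigned weighted adjacency a_ij = |cor(x_i, x_j)| ** beta.

    expr is assumed to be a genes x samples array.
    """
    corr = np.corrcoef(expr)
    adjacency = np.abs(corr) ** beta
    np.fill_diagonal(adjacency, 0.0)  # ignore self-connections
    return adjacency


def connectivity(adjacency):
    """Sum of connection weights for each gene."""
    return adjacency.sum(axis=1)
```

Module detection would then cluster genes on a dissimilarity derived from this adjacency; that step is omitted here.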
Binding and condensation of plasmid DNA onto functionalized carbon nanotubes: toward the construction of nanotube-based gene delivery vectors.

PubMed

Singh, Ravi; Pantarotto, Davide; McCarthy, David; Chaloin, Olivier; Hoebeke, Johan; Partidos, Charalambos D; Briand, Jean-Paul; Prato, Maurizio; Bianco, Alberto; Kostarelos, Kostas

2005-03-30

Carbon nanotubes (CNTs) constitute a class of nanomaterials that possess characteristics suitable for a variety of possible applications. Their compatibility with aqueous environments has been made possible by the chemical functionalization of their surface, allowing for exploration of their interactions with biological components including mammalian cells. Functionalized CNTs (f-CNTs) are being intensively explored in advanced biotechnological applications ranging from molecular biosensors to cellular growth substrates. We have been exploring the potential of f-CNTs as delivery vehicles of biologically active molecules in view of possible biomedical applications, including vaccination and gene delivery. Recently we reported the capability of ammonium-functionalized single-walled CNTs to penetrate human and murine cells and facilitate the delivery of plasmid DNA leading to expression of marker genes. To optimize f-CNTs as gene delivery vehicles, it is essential to characterize their interactions with DNA. In the present report, we study the interactions of three types of f-CNTs, ammonium-functionalized single-walled and multiwalled carbon nanotubes (SWNT-NH3+; MWNT-NH3+), and lysine-functionalized single-walled carbon nanotubes (SWNT-Lys-NH3+), with plasmid DNA. Nanotube-DNA complexes were analyzed by scanning electron microscopy, surface plasmon resonance, PicoGreen dye exclusion, and agarose gel shift assay. The results indicate that all three types of cationic carbon nanotubes are able to condense DNA to varying degrees, indicating that both nanotube surface area and charge density are critical parameters that determine the interaction and electrostatic complex formation between f-CNTs and DNA. All three different f-CNT types in this study exhibited upregulation of marker gene expression over naked DNA using a mammalian (human) cell line. Differences in the levels of gene expression were correlated with the structural and biophysical data obtained for the f-CNT:DNA complexes to suggest that large surface area leading to very efficient DNA condensation is not necessary for effective gene transfer. However, it will require further investigation to determine whether the degree of binding and tight association between DNA and nanotubes is a desirable trait to increase gene expression efficiency in vitro or in vivo. This study constitutes the first thorough investigation into the physicochemical interactions between cationic functionalized carbon nanotubes and DNA toward construction of carbon nanotube-based gene transfer vector systems.
Conceptual Variation in the Depiction of Gene Function in Upper Secondary School Textbooks

ERIC Educational Resources Information Center

Gericke, Niklas Markus; Hagberg, Mariana

2010-01-01

This paper explores conceptual variation in the depiction of gene function in upper secondary school textbooks. Historically, concepts in genetics have developed in various scientific frameworks, which has led to a level of incommensurability as concepts have changed over time within their respective frameworks. Since students may have…
pico-PLAZA, a genome database of microbial photosynthetic eukaryotes.

PubMed

Vandepoele, Klaas; Van Bel, Michiel; Richard, Guilhem; Van Landeghem, Sofie; Verhelst, Bram; Moreau, Hervé; Van de Peer, Yves; Grimsley, Nigel; Piganeau, Gwenael

2013-08-01

With the advent of next generation genome sequencing, the number of sequenced algal genomes and transcriptomes is rapidly growing. Although a few genome portals exist to browse individual genome sequences, exploring complete genome information from multiple species for the analysis of user-defined sequences or gene lists remains a major challenge. pico-PLAZA is a web-based resource (http://bioinformatics.psb.ugent.be/pico-plaza/) for algal genomics that combines different data types with intuitive tools to explore genomic diversity, perform integrative evolutionary sequence analysis and study gene functions. Apart from homologous gene families, multiple sequence alignments, phylogenetic trees, Gene Ontology, InterPro and text-mining functional annotations, different interactive viewers are available to study genome organization using gene collinearity and synteny information. Different search functions, documentation pages, export functions and an extensive glossary are available to guide non-expert scientists. To illustrate the versatility of the platform, different case studies are presented demonstrating how pico-PLAZA can be used to functionally characterize large-scale EST/RNA-Seq data sets and to perform environmental genomics. Functional enrichments analysis of 16 Phaeodactylum tricornutum transcriptome libraries offers a molecular view on diatom adaptation to different environments of ecological relevance. Furthermore, we show how complementary genomic data sources can easily be combined to identify marker genes to study the diversity and distribution of algal species, for example in metagenomes, or to quantify intraspecific diversity from environmental strains. © 2013 John Wiley & Sons Ltd and Society for Applied Microbiology.
Gene context analysis in the Integrated Microbial Genomes (IMG) data management system.

PubMed

Mavromatis, Konstantinos; Chu, Ken; Ivanova, Natalia; Hooper, Sean D; Markowitz, Victor M; Kyrpides, Nikos C

2009-11-24

Computational methods for determining the function of genes in newly sequenced genomes have been traditionally based on sequence similarity to genes whose function has been identified experimentally. Function prediction methods can be extended using gene context analysis approaches such as examining the conservation of chromosomal gene clusters, gene fusion events and co-occurrence profiles across genomes. Context analysis is based on the observation that functionally related genes often have similar gene context and relies on the identification of such events across a phylogenetically diverse collection of genomes. We have used the data management system of the Integrated Microbial Genomes (IMG) as the framework to implement and explore the power of gene context analysis methods because it provides one of the largest available genome integrations. Visualization and search tools to facilitate gene context analysis have been developed and applied across all publicly available archaeal and bacterial genomes in IMG. These computations are now maintained as part of IMG's regular genome content update cycle. IMG is available at: http://img.jgi.doe.gov.
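One of the context signals named above, co-occurrence profiles across genomes, can be illustrated with a small sketch: each gene family is represented by the set of genomes in which it occurs, and two families with highly overlapping sets are candidates for a functional link. Jaccard similarity is used here purely as an assumed example measure; the abstract does not say how IMG scores co-occurrence, and the function name and genome identifiers are hypothetical.

```python
def profile_similarity(profile_a, profile_b):
    """Jaccard similarity between two presence profiles (sets of genome IDs)."""
    union = profile_a | profile_b
    if not union:
        return 0.0  # two empty profiles carry no co-occurrence signal
    return len(profile_a & profile_b) / len(union)


# Hypothetical genome IDs: the two families co-occur in two of four genomes.
family_x = {"genome1", "genome2", "genome3"}
family_y = {"genome2", "genome3", "genome4"}
score = profile_similarity(family_x, family_y)
```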
What's that gene (or protein)? Online resources for exploring functions of genes, transcripts, and proteins

PubMed Central

Hutchins, James R. A.

2014-01-01

The genomic era has enabled research projects that use approaches including genome-scale screens, microarray analysis, next-generation sequencing, and mass spectrometry–based proteomics to discover genes and proteins involved in biological processes. Such methods generate data sets of gene, transcript, or protein hits that researchers wish to explore to understand their properties and functions and thus their possible roles in biological systems of interest. Recent years have seen a profusion of Internet-based resources to aid this process. This review takes the viewpoint of the curious biologist wishing to explore the properties of protein-coding genes and their products, identified using genome-based technologies. Ten key questions are asked about each hit, addressing functions, phenotypes, expression, evolutionary conservation, disease association, protein structure, interactors, posttranslational modifications, and inhibitors. Answers are provided by presenting the latest publicly available resources, together with methods for hit-specific and data set–wide information retrieval, suited to any genome-based analytical technique and experimental species. The utility of these resources is demonstrated for 20 factors regulating cell proliferation. Results obtained using some of these are discussed in more depth using the p53 tumor suppressor as an example. This flexible and universally applicable approach for characterizing experimental hits helps researchers to maximize the potential of their projects for biological discovery. PMID:24723265
Using the Saccharomyces Genome Database (SGD) for analysis of genomic information

PubMed Central

Skrzypek, Marek S.; Hirschman, Jodi

2011-01-01

Analysis of genomic data requires access to software tools that place the sequence-derived information in the context of biology. The Saccharomyces Genome Database (SGD) integrates functional information about budding yeast genes and their products with a set of analysis tools that facilitate exploring their biological details. This unit describes how the various types of functional data available at SGD can be searched, retrieved, and analyzed. Starting with the guided tour of the SGD Home page and Locus Summary page, this unit highlights how to retrieve data using YeastMine, how to visualize genomic information with GBrowse, how to explore gene expression patterns with SPELL, and how to use Gene Ontology tools to characterize large-scale datasets. PMID:21901739
Function does not follow form in gene regulatory circuits.

PubMed

Payne, Joshua L; Wagner, Andreas

2015-08-20

Gene regulatory circuits are to the cell what arithmetic logic units are to the chip: fundamental components of information processing that map an input onto an output. Gene regulatory circuits come in many different forms, distinct structural configurations that determine who regulates whom. Studies that have focused on the gene expression patterns (functions) of circuits with a given structure (form) have examined just a few structures or gene expression patterns. Here, we use a computational model to exhaustively characterize the gene expression patterns of nearly 17 million three-gene circuits in order to systematically explore the relationship between circuit form and function. Three main conclusions emerge. First, function does not follow form. A circuit of any one structure can have between twelve and nearly thirty thousand distinct gene expression patterns. Second, and conversely, form does not follow function. Most gene expression patterns can be realized by more than one circuit structure. And third, multifunctionality severely constrains circuit form. The number of circuit structures able to drive multiple gene expression patterns decreases rapidly with the number of these patterns. These results indicate that it is generally not possible to infer circuit function from circuit form, or vice versa.

Utility and Limitations of Using Gene Expression Data to Identify Functional Associations

PubMed Central

Peng, Cheng; Shiu, Shin-Han

2016-01-01

Gene co-expression has been widely used to hypothesize gene function through guilt-by-association. However, it is not clear to what degree co-expression is informative, whether it can be applied to genes involved in different biological processes, and how the type of dataset impacts inferences about gene functions. Here our goal is to assess the utility and limitations of using co-expression as a criterion to recover functional associations between genes. Using Arabidopsis thaliana as an example, we determined the percentage of gene pairs in a metabolic pathway with significant expression correlation. We found that many genes in the same pathway do not have similar transcript profiles, and that the choice of dataset, annotation quality, gene function, expression similarity measure, and clustering approach significantly impact the ability to recover functional associations between genes. Some datasets are more informative in capturing coordinated expression profiles, and larger datasets are not always better. In addition, to recover the maximum number of known pathways and identify candidate genes with similar functions, it is important to explore rather exhaustively multiple dataset combinations, similarity measures, clustering algorithms and parameters. Finally, we validated the biological relevance of co-expression cluster memberships with an independent phenomics dataset and found that genes that consistently cluster with leucine degradation genes tend to have similar leucine levels in mutants. This study provides a framework for obtaining gene functional associations by maximizing the information that can be obtained from gene expression datasets. PMID:27935950
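
The pathway-level measure described in this abstract (the share of gene pairs in a pathway with correlated expression) can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the fixed correlation cutoff stands in for their significance test, and the gene names, profiles, and function names are hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # a flat profile is treated as uncorrelated
    return cov / sqrt(vx * vy)


def coexpressed_fraction(profiles, threshold=0.8):
    """Fraction of gene pairs whose correlation is at least `threshold`.

    `threshold` is an assumed stand-in for a proper significance test.
    """
    pairs = list(combinations(sorted(profiles), 2))
    if not pairs:
        return 0.0
    hits = sum(
        1 for g, h in pairs if pearson(profiles[g], profiles[h]) >= threshold
    )
    return hits / len(pairs)


# Hypothetical toy pathway: three genes measured in four conditions.
toy_pathway = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.0],  # rises in step with geneA
    "geneC": [4.0, 1.0, 3.0, 2.0],  # weakly anti-correlated with geneA
}

fraction = coexpressed_fraction(toy_pathway)
```

In the toy pathway only one of the three gene pairs passes the cutoff, which mirrors the abstract's point that membership in the same pathway does not guarantee similar transcript profiles.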
Pattern Genes Suggest Functional Connectivity of Organs

NASA Astrophysics Data System (ADS)

Qin, Yangmei; Pan, Jianbo; Cai, Meichun; Yao, Lixia; Ji, Zhiliang

2016-05-01

The human organ, as the basic structural and functional unit of the human body, is made up of a large community of different cell types that are organically bound together. Each organ usually exerts a highly specific physiological function, while several related organs work together to perform complicated body functions. In this study, we present a computational effort to understand the roles of genes in building functional connections between organs. More specifically, we mined multiple transcriptome datasets sampled from 36 human organs and tissues, and quantitatively identified 3,149 genes whose expression showed consensus modular patterns: specific to one organ/tissue, selectively expressed in several functionally related tissues, or ubiquitously expressed. These pattern genes imply intrinsic connections between organs. According to the expression abundance of the 766 selective genes, we consistently clustered the 36 human organs/tissues into seven functional groups: adipose & gland, brain, muscle, immune, metabolism, mucoid and nerve conduction. The organs and tissues in each group either work together to form organ systems or coordinate to perform particular body functions. The particular roles of specific genes and selective genes suggest that they could not only be used to mechanistically explore organ functions, but also be developed as selective biomarkers and therapeutic targets.
Functional and Genomic Features of Human Genes Mutated in Neuropsychiatric Disorders.

PubMed

Forero, Diego A; Prada, Carlos F; Perry, George

2016-01-01

In recent years, a large number of studies around the world have led to the identification of causal genes for hereditary types of common and rare neurological and psychiatric disorders. The objective of this study was to explore the functional and genomic features of known human genes mutated in neuropsychiatric disorders. A systematic search was used to develop a comprehensive catalog of genes mutated in neuropsychiatric disorders (NPD). Functional enrichment and protein-protein interaction analyses were carried out. A false discovery rate approach was used for correction for multiple testing. We found several functional categories that are enriched among NPD genes, such as gene ontologies, protein domains, tissue expression, signaling pathways and regulation by brain-expressed miRNAs and transcription factors. Sixty-six of those NPD genes are known to be druggable. Several topographic parameters of protein-protein interaction networks and the degree of conservation between orthologous genes were identified as significant among NPD genes. These results represent one of the first analyses of enrichment of functional categories of genes known to harbor mutations for NPD. These findings could be useful for the future creation of computational tools for prioritization of novel candidate genes for NPD.
Functional and Genomic Features of Human Genes Mutated in Neuropsychiatric Disorders

PubMed Central

Forero, Diego A.; Prada, Carlos F.; Perry, George

2016-01-01

Background: In recent years, a large number of studies around the world have led to the identification of causal genes for hereditary types of common and rare neurological and psychiatric disorders. Objective: To explore the functional and genomic features of known human genes mutated in neuropsychiatric disorders. Methods: A systematic search was used to develop a comprehensive catalog of genes mutated in neuropsychiatric disorders (NPD). Functional enrichment and protein-protein interaction analyses were carried out. A false discovery rate approach was used for correction for multiple testing. Results: We found several functional categories that are enriched among NPD genes, such as gene ontologies, protein domains, tissue expression, signaling pathways and regulation by brain-expressed miRNAs and transcription factors. Sixty-six of those NPD genes are known to be druggable. Several topographic parameters of protein-protein interaction networks and the degree of conservation between orthologous genes were identified as significant among NPD genes. Conclusion: These results represent one of the first analyses of enrichment of functional categories of genes known to harbor mutations for NPD. These findings could be useful for the future creation of computational tools for prioritization of novel candidate genes for NPD. PMID:27990183
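
The abstract does not say which false discovery rate procedure was used. As an illustration only, the sketch below assumes the widely used Benjamini-Hochberg step-up procedure; the function name and the p-values are hypothetical.

```python
def benjamini_hochberg(pvalues, alpha=0.05):
    """Return one boolean per p-value: True if rejected at FDR level `alpha`.

    Assumes the Benjamini-Hochberg step-up rule: find the largest rank k
    (1-based, in ascending order of p-value) with p_(k) <= (k / m) * alpha,
    then reject the k smallest p-values.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= rank / m * alpha:
            cutoff_rank = rank
    rejected = [False] * m
    for idx in order[:cutoff_rank]:
        rejected[idx] = True
    return rejected


# Hypothetical enrichment p-values for five functional categories.
toy_pvalues = [0.20, 0.001, 0.041, 0.008, 0.039]
toy_rejected = benjamini_hochberg(toy_pvalues)
```

With five tests at alpha = 0.05, the two smallest p-values are kept, while 0.039 and 0.041 are not, even though both fall below the uncorrected 0.05 cutoff.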
Integrative View of α2,3-Sialyltransferases (ST3Gal) Molecular and Functional Evolution in Deuterostomes: Significance of Lineage-Specific Losses

PubMed Central

Petit, Daniel; Teppa, Elin; Mir, Anne-Marie; Vicogne, Dorothée; Thisse, Christine; Thisse, Bernard; Filloux, Cyril; Harduin-Lepers, Anne

2015-01-01

Sialyltransferases are responsible for the synthesis of a diverse range of sialoglycoconjugates predicted to be pivotal to deuterostomes’ evolution. In this work, we reconstructed the evolutionary history of the metazoan α2,3-sialyltransferases family (ST3Gal), a subset of sialyltransferases encompassing six subfamilies (ST3Gal I–ST3Gal VI) functionally characterized in mammals. Exploration of genomic and expressed sequence tag databases and searches for conserved sialylmotifs led to the identification of a large data set of st3gal-related gene sequences. Molecular phylogeny and large scale sequence similarity network analysis identified four new vertebrate subfamilies called ST3Gal III-r, ST3Gal VII, ST3Gal VIII, and ST3Gal IX. To address the issue of the origin and evolutionary relationships of the st3gal-related genes, we performed comparative syntenic mapping of st3gal gene loci combined with ancestral genome reconstruction. The ten vertebrate ST3Gal subfamilies originated from genome duplication events at the base of vertebrates and are organized in three distinct and ancient groups of genes predating the early deuterostomes. Inferring the st3gal gene family history also identified several lineage-specific gene losses, the significance of which was explored in a functional context. Toward this aim, the spatiotemporal distribution of st3gal genes was analyzed in zebrafish and bovine tissues. In addition, molecular evolutionary analyses using specificity determining position and coevolved amino acid predictions led to the identification of amino acid residues with potential implication in functional divergence of vertebrate ST3Gal. We propose a detailed scenario of the evolutionary relationships of st3gal genes coupled to a conceptual framework of the evolution of ST3Gal functions. PMID:25534026
Functional Genomic Analysis of Cotton Genes with Agrobacterium-Mediated Virus-Induced Gene Silencing

PubMed Central

Gao, Xiquan; Shan, Libo

2015-01-01

Cotton (Gossypium spp.) is one of the most agronomically important crops worldwide, valued for its unique textile fiber production and for serving as food and feedstock. Molecular breeding and genetic engineering of useful genes into cotton have emerged as advanced approaches to improve cotton yield, fiber quality, and resistance to various stresses. However, understanding of gene function and regulation in cotton is largely hindered by the limited molecular and biochemical tools available. Here, we describe an Agrobacterium infiltration-based virus-induced gene silencing (VIGS) assay to transiently silence endogenous genes in cotton at the 2-week-old seedling stage. The genes of interest can be readily silenced with a consistently high efficiency. To monitor gene silencing efficiency, we cloned cotton GrCla1 from G. raimondii, a homolog of Arabidopsis Cloroplastos alterados 1 (AtCla1) involved in chloroplast development, and inserted it into the tobacco rattle virus (TRV) binary vector pYL156. Silencing of GrCla1 results in an albino phenotype on the newly emerging leaves, serving as a visual marker for silencing efficiency. To further explore the possibility of using the VIGS assay to reveal the essential genes mediating disease resistance to Verticillium dahliae, a fungal pathogen causing severe Verticillium wilt in cotton, we developed a seedling infection assay to inoculate cotton seedlings when the genes of interest are silenced by VIGS. The method we describe here could be further explored for functional genomic analysis of cotton genes involved in development and various biotic and abiotic stresses. PMID:23386302
Functional genomic analysis of cotton genes with agrobacterium-mediated virus-induced gene silencing.

PubMed

Gao, Xiquan; Shan, Libo

2013-01-01

Cotton (Gossypium spp.) is one of the most agronomically important crops worldwide, valued for its unique textile fiber production and for serving as food and feedstock. Molecular breeding and genetic engineering of useful genes into cotton have emerged as advanced approaches to improve cotton yield, fiber quality, and resistance to various stresses. However, understanding of gene function and regulation in cotton is largely hindered by the limited molecular and biochemical tools available. Here, we describe an Agrobacterium infiltration-based virus-induced gene silencing (VIGS) assay to transiently silence endogenous genes in cotton at the 2-week-old seedling stage. The genes of interest can be readily silenced with a consistently high efficiency. To monitor gene silencing efficiency, we cloned cotton GrCla1 from G. raimondii, a homolog of Arabidopsis Cloroplastos alterados 1 (AtCla1) involved in chloroplast development, and inserted it into the tobacco rattle virus (TRV) binary vector pYL156. Silencing of GrCla1 results in an albino phenotype on the newly emerging leaves, serving as a visual marker for silencing efficiency. To further explore the possibility of using the VIGS assay to reveal the essential genes mediating disease resistance to Verticillium dahliae, a fungal pathogen causing severe Verticillium wilt in cotton, we developed a seedling infection assay to inoculate cotton seedlings when the genes of interest are silenced by VIGS. The method we describe here could be further explored for functional genomic analysis of cotton genes involved in development and various biotic and abiotic stresses.
funRiceGenes dataset for comprehensive understanding and application of rice functional genes.

PubMed

Yao, Wen; Li, Guangwei; Yu, Yiming; Ouyang, Yidan

2018-01-01

As a main staple food, rice is also a model plant for functional genomic studies of monocots. Decoding of every DNA element of the rice genome is essential for genetic improvement to address increasing food demands. The past 15 years have witnessed extraordinary advances in rice functional genomics. Systematic characterization and proper deposition of every rice gene are vital for both functional studies and crop genetic improvement. We built a comprehensive and accurate dataset of ∼2800 functionally characterized rice genes and ∼5000 members of different gene families by integrating data from available databases and reviewing every publication on rice functional genomic studies. The dataset accounts for 19.2% of the 39 045 annotated protein-coding rice genes, which provides the most exhaustive archive for investigating the functions of rice genes. We also constructed 214 gene interaction networks based on 1841 connections between 1310 genes. The largest network with 762 genes indicated that pleiotropic genes linked different biological pathways. Increasing degree of conservation of the flowering pathway was observed among more closely related plants, implying substantial value of rice genes for future dissection of flowering regulation in other crops. All data are deposited in the funRiceGenes database (https://funricegenes.github.io/). Functionality for advanced search and continuous updating of the database are provided by a Shiny application (http://funricegenes.ncpgr.cn/). The funRiceGenes dataset would enable further exploring of the crosslink between gene functions and natural variations in rice, which can also facilitate breeding design to improve target agronomic traits of rice. © The Authors 2017. Published by Oxford University Press.
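
The abstract reports 214 gene interaction networks built from 1841 connections between 1310 genes, but it does not say how the networks were delimited. One plausible reading, assumed here, is that each network is a connected component of the gene-gene connection graph. The sketch below shows that grouping step on hypothetical gene identifiers.

```python
from collections import defaultdict


def connected_components(edges):
    """Group genes into networks; genes joined by any chain of connections
    end up in the same network. Returns components sorted largest first."""
    neighbors = defaultdict(set)
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)

    seen, components = set(), []
    for start in neighbors:
        if start in seen:
            continue
        # Depth-first traversal from an unvisited gene collects one network.
        stack, component = [start], set()
        seen.add(start)
        while stack:
            node = stack.pop()
            component.add(node)
            for nxt in neighbors[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        components.append(component)
    return sorted(components, key=len, reverse=True)


# Hypothetical connections between placeholder gene identifiers.
toy_edges = [("g1", "g2"), ("g2", "g3"), ("g4", "g5")]
toy_networks = connected_components(toy_edges)
```

Under this assumption, the largest component plays the role of the 762-gene network mentioned in the abstract, where pleiotropic genes bridge otherwise separate pathways.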
A Driving Bioinformatics Approach to Explore Co-regulation of AOX Gene Family Members During Growth and Development.

PubMed

Costa, José Hélio; Arnholdt-Schmitt, Birgit

2017-01-01

The alternative oxidase (AOX) gene family is a hot candidate for functional marker development that could help plant breeding on yield stability through more robust plants based on multi-stress tolerance. However, knowledge is lacking on the interplay between gene family members that might interfere with the efficiency of marker development. It is a common view that AOX1 and AOX2 have different physiological roles. Nevertheless, both family member groups act in terms of molecular-biochemical function as "typical" alternative oxidases, and co-regulation of AOX1 and AOX2 has been reported. Although conserved sequence differences have been identified, the basis for differential effects on physiology regulation is not sufficiently explored. This protocol gives instructions for a bioinformatics approach that supports discovering potential interaction of AOX family members in regulating growth and development. It further provides a strategy to elucidate the relevance of gene sequence diversity and copy number variation for final functionality in target tissues and finally the whole plant. Thus, overall this protocol provides the means for efficiently identifying plant AOX variants as functional marker candidates related to growth and development.
Genetic basis of interindividual susceptibility to cancer cachexia: selection of potential candidate gene polymorphisms for association studies.

PubMed

Johns, N; Tan, B H; MacMillan, M; Solheim, T S; Ross, J A; Baracos, V E; Damaraju, S; Fearon, K C H

2014-12-01

Cancer cachexia is a complex and multifactorial disease. Evolving definitions highlight the fact that a diverse range of biological processes contribute to cancer cachexia. Part of the variation in who will and who will not develop cancer cachexia may be genetically determined. As new definitions, classifications and biological targets continue to evolve, there is a need for reappraisal of the literature for future candidate association studies. This review summarizes genes identified or implicated as well as putative candidate genes contributing to cachexia, identified through diverse technology platforms and model systems to further guide association studies. A systematic search covering 1986-2012 was performed for potential candidate genes / genetic polymorphisms relating to cancer cachexia. All candidate genes were reviewed for functional polymorphisms or clinically significant polymorphisms associated with cachexia using the OMIM and GeneRIF databases. Pathway analysis software was used to reveal possible network associations between genes. Functionality of SNPs/genes was explored based on published literature, algorithms for detecting putative deleterious SNPs and interrogating the database for expression of quantitative trait loci (eQTLs). A total of 154 genes associated with cancer cachexia were identified and explored for functional polymorphisms. Of these 154 genes, 119 had a combined total of 281 polymorphisms with functional and/or clinical significance in terms of cachexia associated with them. Of these, 80 polymorphisms (in 51 genes) were replicated in more than one study with 24 polymorphisms found to influence two or more hallmarks of cachexia (i.e., inflammation, loss of fat mass and/or lean mass and reduced survival). Selection of candidate genes and polymorphisms is a key element of multigene study design. 
The present study provides a contemporary basis to select genes and/or polymorphisms for further association studies in cancer cachexia, and to develop their potential as susceptibility biomarkers of cachexia.
Exploration of the Esophageal Mucosal Barrier in Non-Erosive Reflux Disease

PubMed Central

Rinsma, Nicolaas F.; Farré, Ricard; Troost, Fred J.; Elizalde, Montserrat; Keszthelyi, Daniel; Helyes, Zsuzsanna; Masclee, Ad A.; Conchillo, José M.

2017-01-01

In the absence of visible mucosal damage, it is hypothesized that the esophageal mucosal barrier is functionally impaired in patients with non-erosive reflux disease (NERD). The aim of the present study was to perform an exploratory analysis of the mucosal barrier in NERD compared to erosive esophagitis (EE) and controls. A second aim was to explore TRPV1 gene transcription in relation to the mucosal barrier function and heartburn symptoms. In this prospective study, 10 NERD patients, 11 patients with active erosive esophagitis and 10 healthy volunteers were included. Biopsies from non-eroded mucosa were obtained for (1) ex vivo analyses (Ussing chamber) of transepithelial electrical resistance (TEER) and permeability and (2) gene transcription of tight-junction proteins and transient receptor potential vanilloid subfamily member 1 (TRPV1). No differences in TEER or permeability were found between NERD and healthy volunteers, whereas TEER was lower in patients with erosive esophagitis. TRPV1 gene transcription was not significantly different between EE, NERD and controls. Conclusions: esophageal mucosal barrier function and TRPV1 transcription are not significantly altered in NERD patients. Future research is needed to explore other potential mechanisms that may account for the high symptom burden in these patients. PMID:28534850
Analysis of the Prefoldin Gene Family in 14 Plant Species

PubMed Central

Cao, Jun

2016-01-01

Prefoldin is a hexameric molecular chaperone complex present in all eukaryotes and archaea. The evolution of this gene family in plants is unknown. Here, I identified 140 prefoldin genes in 14 plant species. These prefoldin proteins were divided into nine groups through phylogenetic analysis. Highly conserved gene organization and motif distribution exist in each prefoldin group, implying their functional conservation. I also observed segmental duplication in the maize prefoldin gene family. Moreover, a few functional divergence sites were identified within each pair of groups. Functional network analyses identified 78 co-expressed genes, and most of them were involved in carrying, binding and kinase activity. Divergent expression profiles of the maize prefoldin genes were further investigated in different tissues and development periods and under auxin treatment and some abiotic stresses. I also found a few cis-elements responding to abiotic stress and phytohormones in the upstream sequences of the maize prefoldin genes. The results provide a foundation for exploring the characterization of the prefoldin genes in plants and will offer insights for additional functional studies. PMID:27014333
Using scale and feather traits for module construction provides a functional approach to chicken epidermal development.

PubMed

Bao, Weier; Greenwold, Matthew J; Sawyer, Roger H

2017-11-01

Gene co-expression network analysis has been a research method widely used in systematically exploring gene function and interaction. Using the Weighted Gene Co-expression Network Analysis (WGCNA) approach to construct a gene co-expression network using data from a customized 44K microarray transcriptome of chicken epidermal embryogenesis, we have identified two distinct modules that are highly correlated with scale or feather development traits. Signaling pathways related to feather development were enriched in the traditional KEGG pathway analysis and functional terms relating specifically to embryonic epidermal development were also enriched in the Gene Ontology analysis. Significant enrichment annotations were discovered from customized enrichment tools such as Modular Single-Set Enrichment Test (MSET) and Medical Subject Headings (MeSH). Hub genes in both trait-correlated modules showed strong specific functional enrichment toward epidermal development. Also, regulatory elements, such as transcription factors and miRNAs, were targeted in the significant enrichment result. This work highlights the advantage of this methodology for functional prediction of genes not previously associated with scale- and feather trait-related modules.
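
For readers unfamiliar with WGCNA, its core step for an unsigned network is to raise the absolute pairwise correlation to a soft-thresholding power, and hub genes are then those with high summed connection weight. The sketch below illustrates only that step in plain Python. It is not the WGCNA R package, the power of 6 is an illustrative choice (the abstract does not report the value used), and the gene names and profiles are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # a flat profile is treated as uncorrelated
    return cov / sqrt(vx * vy)


def soft_threshold_adjacency(profiles, beta=6):
    """Unsigned weighted adjacency: a_ij = |cor(x_i, x_j)| ** beta for i != j."""
    genes = sorted(profiles)
    return {
        (g, h): abs(pearson(profiles[g], profiles[h])) ** beta
        for g in genes
        for h in genes
        if g != h
    }


def connectivity(adjacency):
    """Sum of connection weights per gene; high values mark candidate hubs."""
    totals = {}
    for (g, _h), weight in adjacency.items():
        totals[g] = totals.get(g, 0.0) + weight
    return totals


# Hypothetical expression profiles across four samples.
toy_profiles = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.0],
    "geneC": [4.0, 1.0, 3.0, 2.0],
}
toy_adjacency = soft_threshold_adjacency(toy_profiles)
toy_connectivity = connectivity(toy_adjacency)
```

Raising the correlation to a power suppresses weak links without a hard cutoff: the |r| = 0.4 pairs in the toy data shrink to a weight of about 0.004, while the perfectly correlated pair keeps weight 1.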
Exploration of the Anti-Inflammatory Drug Space Through Network Pharmacology: Applications for Drug Repurposing

PubMed Central

de Anda-Jáuregui, Guillermo; Guo, Kai; McGregor, Brett A.; Hur, Junguk

2018-01-01

The quintessential biological response to disease is inflammation. It is a driver and an important element in a wide range of pathological states. Pharmacological management of inflammation is therefore central in the clinical setting. Anti-inflammatory drugs modulate specific molecules involved in the inflammatory response; these drugs are traditionally classified as steroidal and non-steroidal drugs. However, the effects of these drugs are rarely limited to their canonical targets, affecting other molecules and altering biological functions with system-wide effects that can lead to the emergence of secondary therapeutic applications or adverse drug reactions (ADRs). In this study, relationships among anti-inflammatory drugs, functional pathways, and ADRs were explored through network models. We integrated structural drug information, experimental anti-inflammatory drug perturbation gene expression profiles obtained from the Connectivity Map and Library of Integrated Network-Based Cellular Signatures, functional pathways in the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Reactome databases, as well as adverse reaction information from the U.S. Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS). The network models comprise nodes representing anti-inflammatory drugs, functional pathways, and adverse effects. We identified structural and gene perturbation similarities linking anti-inflammatory drugs. Functional pathways were connected to drugs by implementing Gene Set Enrichment Analysis (GSEA). Drugs and adverse effects were connected based on the proportional reporting ratio (PRR) of an adverse effect in response to a given drug. Through these network models, relationships among anti-inflammatory drugs, their functional effects at the pathway level, and their adverse effects were explored. These networks comprise 70 different anti-inflammatory drugs, 462 functional pathways, and 1,175 ADRs. 
Network-based properties, such as degree, clustering coefficient, and node strength, were used to identify new therapeutic applications within and beyond the anti-inflammatory context, as well as ADR risk for these drugs, helping to select better repurposing candidates. Based on these parameters, we identified naproxen, meloxicam, etodolac, tenoxicam, flufenamic acid, fenoprofen, and nabumetone as candidates for drug repurposing with lower ADR risk. This network-based analysis pipeline provides a novel way to explore the effects of drugs in a therapeutic space. PMID:29545755
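
The proportional reporting ratio (PRR) used above to link drugs and adverse effects is conventionally computed from a 2x2 table of report counts. The sketch below uses that standard definition; the counts are made up for illustration and the function name is hypothetical, not taken from the study's pipeline.

```python
def proportional_reporting_ratio(a, b, c, d):
    """PRR from a 2x2 table of spontaneous report counts.

    a: reports of the event of interest for the drug of interest
    b: reports of all other events for the drug of interest
    c: reports of the event of interest for all other drugs
    d: reports of all other events for all other drugs

    PRR = [a / (a + b)] / [c / (c + d)]
    """
    if a + b == 0 or c + d == 0 or c == 0:
        raise ValueError("PRR is undefined for an empty row or zero comparator count")
    return (a / (a + b)) / (c / (c + d))


# Made-up counts: the event makes up 20% of the drug's reports
# but only 10% of reports for all other drugs.
toy_prr = proportional_reporting_ratio(a=20, b=80, c=100, d=900)
```

A PRR above 1 means the event is reported proportionally more often for the drug than for the comparator set; the toy counts give a ratio of 2.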
Exploration of the Anti-Inflammatory Drug Space Through Network Pharmacology: Applications for Drug Repurposing.

PubMed

de Anda-Jáuregui, Guillermo; Guo, Kai; McGregor, Brett A; Hur, Junguk

2018-01-01

The quintessential biological response to disease is inflammation. It is a driver and an important element in a wide range of pathological states. Pharmacological management of inflammation is therefore central in the clinical setting. Anti-inflammatory drugs modulate specific molecules involved in the inflammatory response; these drugs are traditionally classified as steroidal and non-steroidal drugs. However, the effects of these drugs are rarely limited to their canonical targets, affecting other molecules and altering biological functions with system-wide effects that can lead to the emergence of secondary therapeutic applications or adverse drug reactions (ADRs). In this study, relationships among anti-inflammatory drugs, functional pathways, and ADRs were explored through network models. We integrated structural drug information, experimental anti-inflammatory drug perturbation gene expression profiles obtained from the Connectivity Map and Library of Integrated Network-Based Cellular Signatures, functional pathways in the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Reactome databases, as well as adverse reaction information from the U.S. Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS). The network models comprise nodes representing anti-inflammatory drugs, functional pathways, and adverse effects. We identified structural and gene perturbation similarities linking anti-inflammatory drugs. Functional pathways were connected to drugs by implementing Gene Set Enrichment Analysis (GSEA). Drugs and adverse effects were connected based on the proportional reporting ratio (PRR) of an adverse effect in response to a given drug. Through these network models, relationships among anti-inflammatory drugs, their functional effects at the pathway level, and their adverse effects were explored. These networks comprise 70 different anti-inflammatory drugs, 462 functional pathways, and 1,175 ADRs. 
Network-based properties, such as degree, clustering coefficient, and node strength, were used to identify new therapeutic applications within and beyond the anti-inflammatory context, as well as ADR risk for these drugs, helping to select better repurposing candidates. Based on these parameters, we identified naproxen, meloxicam, etodolac, tenoxicam, flufenamic acid, fenoprofen, and nabumetone as candidates for drug repurposing with lower ADR risk. This network-based analysis pipeline provides a novel way to explore the effects of drugs in a therapeutic space.
Loss-of-function analyses of the fragile X-related and dopamine receptor genes by RNA interference in the cricket Gryllus bimaculatus.

PubMed

Hamada, Aska; Miyawaki, Katsuyuki; Honda-sumi, Eri; Tomioka, Kenji; Mito, Taro; Ohuchi, Hideyo; Noji, Sumihare

2009-08-01

In order to explore a possibility that the cricket Gryllus bimaculatus would be a useful model to unveil molecular mechanisms of human diseases, we performed loss-of-function analyses of Gryllus genes homologous to human genes that are responsible for human disorders, fragile X mental retardation 1 (fmr1) and Dopamine receptor (DopR). We cloned cDNAs of their Gryllus homologues, Gb'fmr1, Gb'DopRI, and Gb'DopRII, and analyzed their functions with use of nymphal RNA interference (RNAi). For Gb'fmr1, three major phenotypes were observed: (1) abnormal wing postures, (2) abnormal calling song, and (3) loss of the circadian locomotor rhythm, while for Gb'DopRI, defects of wing posture and morphology were found. These results indicate that the cricket has the potential to become a novel model system to explore human neuronal pathogenic mechanisms and to screen therapeutic drugs by RNAi. Copyright (c) 2009 Wiley-Liss, Inc.
Down-Regulation of Gene Expression by RNA-Induced Gene Silencing

NASA Astrophysics Data System (ADS)

Travella, Silvia; Keller, Beat

Down-regulation of endogenous genes via post-transcriptional gene silencing (PTGS) is a key to the characterization of gene function in plants. Many RNA-based silencing mechanisms such as post-transcriptional gene silencing, co-suppression, quelling, and RNA interference (RNAi) have been discovered among species of different kingdoms (plants, fungi, and animals). One of the most interesting discoveries was RNAi, a sequence-specific gene-silencing mechanism initiated by the introduction of double-stranded RNA (dsRNA), homologous in sequence to the silenced gene, which triggers degradation of mRNA. Infection of plants with modified viruses can also induce RNA silencing and is referred to as virus-induced gene silencing (VIGS). In contrast to insertional mutagenesis, these emerging new reverse genetic approaches represent a powerful tool for exploring gene function and for manipulating gene expression experimentally in cereal species such as barley and wheat. We examined how RNAi and VIGS have been used to assess gene function in barley and wheat, including molecular mechanisms involved in the process and available methodological elements, such as vectors, inoculation procedures, and analysis of silenced phenotypes.
[Gene deletion and functional analysis of the heptyl glycosyltransferase (waaF) gene in Vibrio parahemolyticus O-antigen cluster].

PubMed

Zhao, Feng; Meng, Songsong; Zhou, Deqing

2016-02-04

We aimed to construct a deletion mutant of the heptyl glycosyltransferase II (waaF) gene of Vibrio parahaemolyticus and to explore the function of the waaF gene in this species. The waaF gene deletion mutant was constructed from clinical isolates by chitin-based transformation, and its growth rate, morphology and serotype were then characterized. Complementation strains carrying waaF genes from different sources (O3, O5 and O10) were constructed by conjugative transfer from E. coli S17λpir strains into Vibrio parahaemolyticus, and the function of the waaF gene was further verified by serotyping. The waaF gene deletion mutant strain was successfully constructed and grew normally. The growth rate and morphology of the mutant were similar to those of the wild-type (WT) strains, but the mutant did not show an agglutination reaction with O antisera. Strains complemented with waaF genes from the O3 and O5 sources agglutinated with O antisera, but the strain complemented with the waaF gene from the O10 source did not. The waaF gene is related to O-antigen synthesis and is a key gene of the O-antigen synthesis pathway in Vibrio parahaemolyticus. The functions of waaF genes from different sources are not the same.
The evolutionary landscape of intergenic trans-splicing events in insects

PubMed Central

Kong, Yimeng; Zhou, Hongxia; Yu, Yao; Chen, Longxian; Hao, Pei; Li, Xuan

2015-01-01

To explore the landscape of intergenic trans-splicing events and characterize their functions and evolutionary dynamics, we conduct a mega-data study of a phylogeny containing eight species across five orders of class Insecta, a model system spanning 400 million years of evolution. A total of 1,627 trans-splicing events involving 2,199 genes are identified, accounting for 1.58% of the total genes. Homology analysis reveals that mod(mdg4)-like trans-splicing is the only conserved event that is consistently observed in multiple species across two orders, which represents a unique case of functional diversification involving trans-splicing. Thus, evolutionarily its potential for generating proteins with novel function is not broadly utilized by insects. Furthermore, 146 non-mod trans-spliced transcripts are found to resemble canonical genes from different species. Trans-splicing preserving the function of ‘breakup' genes may serve as a general mechanism for relaxing the constraints on gene structure, with profound implications for the evolution of genes and genomes. PMID:26521696
Neuron-specific feeding RNAi in C. elegans and its use in a screen for essential genes required for GABA neuron function.

PubMed

Firnhaber, Christopher; Hammarlund, Marc

2013-11-01

Forward genetic screens are important tools for exploring the genetic requirements for neuronal function. However, conventional forward screens often have difficulty identifying genes whose relevant functions are masked by pleiotropy. In particular, if loss of gene function results in sterility, lethality, or other severe pleiotropy, neuronal-specific functions cannot be readily analyzed. Here we describe a method in C. elegans for generating cell-specific knockdown in neurons using feeding RNAi and its application in a screen for the role of essential genes in GABAergic neurons. We combine manipulations that increase the sensitivity of select neurons to RNAi with manipulations that block RNAi in other cells. We produce animal strains in which feeding RNAi results in restricted gene knockdown in either GABA-, acetylcholine-, dopamine-, or glutamate-releasing neurons. In these strains, we observe neuron cell-type specific behavioral changes when we knock down genes required for these neurons to function, including genes encoding the basal neurotransmission machinery. These reagents enable high-throughput, cell-specific knockdown in the nervous system, facilitating rapid dissection of the site of gene action and screening for neuronal functions of essential genes. Using the GABA-specific RNAi strain, we screened 1,320 RNAi clones targeting essential genes on chromosomes I, II, and III for their effect on GABA neuron function. We identified 48 genes whose GABA cell-specific knockdown resulted in reduced GABA motor output. This screen extends our understanding of the genetic requirements for continued neuronal function in a mature organism.

The origin and functional transition of P34.

PubMed

Li, Q-G; Zhang, Y-M

2013-03-01

P34, a storage protein and major soybean allergen, has undergone a functional transition from a cysteine peptidase to a syringolide receptor. An exploration of the evolutionary mechanism of this functional transition is made. To identify homologous genes of P34, a syntenic network was constructed using syntenic relationships from the Plant Genome Duplication Database. The collected homologous genes, along with SPE31, a highly homologous protein to P34 from the seeds of Pachyrhizus erosus, were used to construct a phylogenetic tree. The results show that multiple gene duplications, exon shuffling with subsequent granulin domain loss, and some critical point mutations are associated with the functional transition. Although some tests suggested the existence of positive selection, the possibility that random fixation under relaxation of purifying selection results in the functional transition is also supported. In addition, the genes Glyma08g12340 and Medtr8g086470 may belong to a new group within the papain family.
The origin and functional transition of P34

PubMed Central

Li, Q-G; Zhang, Y-M

2013-01-01

P34, a storage protein and major soybean allergen, has undergone a functional transition from a cysteine peptidase to a syringolide receptor. An exploration of the evolutionary mechanism of this functional transition is made. To identify homologous genes of P34, a syntenic network was constructed using syntenic relationships from the Plant Genome Duplication Database. The collected homologous genes, along with SPE31, a highly homologous protein to P34 from the seeds of Pachyrhizus erosus, were used to construct a phylogenetic tree. The results show that multiple gene duplications, exon shuffling with subsequent granulin domain loss, and some critical point mutations are associated with the functional transition. Although some tests suggested the existence of positive selection, the possibility that random fixation under relaxation of purifying selection results in the functional transition is also supported. In addition, the genes Glyma08g12340 and Medtr8g086470 may belong to a new group within the papain family. PMID:23211789
Nitrogen Cycle Evaluation (NiCE) Chip for the Simultaneous Analysis of Multiple N-Cycle Associated Genes.

PubMed

Oshiki, Mamoru; Segawa, Takahiro; Ishii, Satoshi

2018-02-02

Various microorganisms play key roles in the nitrogen (N) cycle. Quantitative PCR (qPCR) and PCR-amplicon sequencing of the N cycle functional genes allow us to analyze the abundance and diversity of microbes responsible for the N-transforming reactions in various environmental samples. However, analysis of multiple target genes can be cumbersome and expensive. PCR-independent analysis, such as metagenomics and metatranscriptomics, is useful but expensive, especially when we analyze multiple samples and try to detect N cycle functional genes present at relatively low abundance. Here, we present the application of microfluidic qPCR chip technology to simultaneously quantify and prepare amplicon sequence libraries for multiple N cycle functional genes as well as taxon-specific 16S rRNA gene markers for many samples. This approach, named the N cycle evaluation (NiCE) chip, was evaluated by using DNA from pure and artificially mixed bacterial cultures and by comparing the results with those obtained by conventional qPCR and amplicon sequencing methods. Quantitative results obtained by the NiCE chip were comparable to those obtained by conventional qPCR. In addition, the NiCE chip was successfully applied to examine the abundance and diversity of N cycle functional genes in wastewater samples. Although non-specific amplification was detected on the NiCE chip, this could be overcome by optimizing the primer sequences in the future. As the NiCE chip can provide a high-throughput format to quantify and prepare sequence libraries for multiple N cycle functional genes, this tool should advance our ability to explore N cycling in various samples. Importance. We report a novel approach, namely the Nitrogen Cycle Evaluation (NiCE) chip, which uses microfluidic qPCR chip technology. By sequencing the amplicons recovered from the NiCE chip, we can assess the diversity of the N cycle functional genes.
The NiCE chip technology is applicable to analyzing the temporal dynamics of N cycle gene transcription in wastewater treatment bioreactors. The NiCE chip can provide a high-throughput format to quantify and prepare sequence libraries for multiple N cycle functional genes. While there is room for future improvement, this tool should significantly advance our ability to explore the N cycle in various environmental samples. Copyright © 2018 American Society for Microbiology.
Transcriptome profile and unique genetic evolution of positively selected genes in yak lungs.

PubMed

Lan, DaoLiang; Xiong, XianRong; Ji, WenHui; Li, Jian; Mipam, Tserang-Donko; Ai, Yi; Chai, ZhiXin

2018-04-01

The yak (Bos grunniens), which is a unique bovine breed that is distributed mainly in the Qinghai-Tibetan Plateau, is considered a good model for studying plateau adaptability in mammals. The lungs are important functional organs that enable animals to adapt to their external environment. However, the genetic mechanism underlying the adaptability of yak lungs to harsh plateau environments remains unknown. To explore the unique evolutionary process and genetic mechanism of yak adaptation to plateau environments, we performed transcriptome sequencing of yak and cattle (Bos taurus) lungs using RNA-Seq technology and a subsequent comparison analysis to identify the positively selected genes in the yak. After deep sequencing, a normal transcriptome profile of yak lung containing a total of 16,815 expressed genes was obtained, and the characteristics of the yak lung transcriptome were described by functional analysis. Furthermore, Ka/Ks comparison statistics showed that 39 strongly positively selected genes were identified in yak lungs. Further GO and KEGG analysis was conducted for the functional annotation of these genes. The results of this study provide valuable data for further explorations of the unique evolutionary process of high-altitude hypoxia adaptation in yaks in the Tibetan Plateau and the genetic mechanism at the molecular level.
[FANCA gene mutation analysis in Fanconi anemia patients].

PubMed

Chen, Fei; Peng, Guang-Jie; Zhang, Kejian; Hu, Qun; Zhang, Liu-Qing; Liu, Ai-Guo

2005-10-01

To screen for FANCA gene mutations and explore FANCA protein function in Fanconi anemia (FA) patients. FANCA protein expression and its interaction with FANCF were analyzed using Western blot and immunoprecipitation in 3 cases of FA-A. Genomic DNA was used for MLPA analysis followed by sequencing. FANCA protein was undetectable and the interaction between FANCA and FANCF proteins was impaired in these 3 cases of FA-A. Each case of FA-A contained biallelic pathogenic mutations in the FANCA gene. No functional FANCA protein was found in these 3 cases of FA-A, and intragenic deletion, frameshift and splice site mutations were the major pathogenic mutations found in the FANCA gene.
Long-Term Oil Contamination Alters the Molecular Ecological Networks of Soil Microbial Functional Genes

PubMed Central

Liang, Yuting; Zhao, Huihui; Deng, Ye; Zhou, Jizhong; Li, Guanghe; Sun, Bo

2016-01-01

With knowledge on microbial composition and diversity, investigation of within-community interactions is a further step to elucidate microbial ecological functions, such as the biodegradation of hazardous contaminants. In this work, microbial functional molecular ecological networks were studied in both contaminated and uncontaminated soils to determine the possible influences of oil contamination on microbial interactions and potential functions. Soil samples were obtained from an oil-exploring site located in South China, and the microbial functional genes were analyzed with GeoChip, a high-throughput functional microarray. By building random networks based on null model, we demonstrated that overall network structures and properties were significantly different between contaminated and uncontaminated soils (P < 0.001). Network connectivity, module numbers, and modularity were all reduced with contamination. Moreover, the topological roles of the genes (module hub and connectors) were altered with oil contamination. Subnetworks of genes involved in alkane and polycyclic aromatic hydrocarbon degradation were also constructed. Negative co-occurrence patterns prevailed among functional genes, thereby indicating probable competition relationships. The potential “keystone” genes, defined as either “hubs” or genes with highest connectivities in the network, were further identified. The network constructed in this study predicted the potential effects of anthropogenic contamination on microbial community co-occurrence interactions. PMID:26870020
DaGO-Fun: tool for Gene Ontology-based functional analysis using term information content measures.

PubMed

Mazandu, Gaston K; Mulder, Nicola J

2013-09-25

The use of Gene Ontology (GO) data in protein analyses has largely contributed to the improved outcomes of these analyses. Several GO semantic similarity measures have been proposed in recent years and provide tools that allow the integration of biological knowledge embedded in the GO structure into different biological analyses. There is a need for a unified tool that provides the scientific community with the opportunity to explore these different GO similarity measure approaches and their biological applications. We have developed DaGO-Fun, an online tool available at http://web.cbio.uct.ac.za/ITGOM, which incorporates many different GO similarity measures for exploring, analyzing and comparing GO terms and proteins within the context of GO. It uses GO data and UniProt proteins with their GO annotations as provided by the Gene Ontology Annotation (GOA) project to precompute GO term information content (IC), enabling rapid response to user queries. The DaGO-Fun online tool presents the advantage of integrating all the relevant IC-based GO similarity measures, including topology- and annotation-based approaches, to facilitate effective exploration of these measures, thus enabling users to choose the most relevant approach for their application. Furthermore, this tool includes several biological applications related to GO semantic similarity scores, including the retrieval of genes based on their GO annotations, the clustering of functionally related genes within a set, and term enrichment analysis.
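As an illustration of the annotation-based information content (IC) idea mentioned in the abstract above, the following is a minimal sketch, not DaGO-Fun's own code. It assumes the common definition IC(t) = -log p(t), where p(t) is the fraction of annotations that fall on term t or any of its descendants, and it uses Resnik similarity (the IC of the most informative common ancestor) as one example of an IC-based measure. The toy ontology, the term names and the annotation counts are all hypothetical.

```python
import math

# Hypothetical toy ontology: each term maps to its direct parents.
PARENTS = {
    "root": set(),
    "A": {"root"},
    "B": {"root"},
    "A1": {"A"},
    "A2": {"A"},
}

# Hypothetical number of proteins annotated directly to each term.
DIRECT_COUNTS = {"root": 0, "A": 2, "B": 4, "A1": 1, "A2": 1}


def ancestors(term):
    """Return the term itself plus all of its ancestors."""
    seen = {term}
    stack = [term]
    while stack:
        for parent in PARENTS[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def information_content():
    """IC(t) = -log p(t); p(t) counts annotations to t or any descendant."""
    cumulative = {term: 0 for term in PARENTS}
    for term, count in DIRECT_COUNTS.items():
        # A direct annotation also counts for every ancestor of the term.
        for anc in ancestors(term):
            cumulative[anc] += count
    total = cumulative["root"]
    return {t: -math.log(c / total) for t, c in cumulative.items() if c > 0}


def resnik(term1, term2, ic):
    """Resnik similarity: IC of the most informative common ancestor."""
    common = ancestors(term1) & ancestors(term2)
    return max(ic[t] for t in common)


ic = information_content()
print(resnik("A1", "A2", ic))  # the siblings share the informative ancestor "A"
print(resnik("A1", "B", ic))   # only the root is shared, so the score is zero
```

Precomputing the IC table once, as the abstract describes, means that each similarity query reduces to a set intersection and a lookup.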
Identification and expression analysis of the SQUAMOSA promoter-binding protein (SBP)-box gene family in Prunus mume.

PubMed

Xu, Zongda; Sun, Lidan; Zhou, Yuzhen; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

2015-10-01

SQUAMOSA promoter-binding protein (SBP)-box family genes encode plant-specific transcription factors that play crucial roles in plant development, especially flower and fruit development. However, little information on this gene family is available for Prunus mume, an ornamental and fruit tree widely cultivated in East Asia. To explore the evolution of SBP-box genes in Prunus and explore their functions in flower and fruit development, we performed a genome-wide analysis of the SBP-box gene family in P. mume. Fifteen SBP-box genes were identified, and 11 of them contained an miR156 target site. Phylogenetic and comprehensive bioinformatics analyses revealed that different groups of SBP-box genes have undergone different evolutionary processes and varied in their length, structure, and motif composition. Purifying selection has been the main selective constraint on both paralogous and orthologous SBP-box genes. In addition, the sequences of orthologous SBP-box genes did not diverge widely after the split of P. mume and Prunus persica. Expression analysis of P. mume SBP-box genes revealed their diverse spatiotemporal expression patterns. Three duplicated SBP-box genes may have undergone subfunctionalization in Prunus. Most of the SBP-box genes showed high transcript levels in flower buds and young fruit. The four miR156-nontargeted genes were upregulated during fruit ripening. Together, these results provide information about the evolution of SBP-box genes in Prunus. The expression analysis lays the foundation for further research on the functions of SBP-box genes in P. mume and other Prunus species, especially during flower and fruit development.
Genome-Wide Screening and Characterization of the Dof Gene Family in Physic Nut (Jatropha curcas L.).

PubMed

Wang, Peipei; Li, Jing; Gao, Xiaoyang; Zhang, Di; Li, Anlin; Liu, Changning

2018-05-29

Physic nut (Jatropha curcas L.) is a species of flowering plant with great potential for biofuel production and as an emerging model organism for functional genomic analysis, particularly in the Euphorbiaceae family. DNA binding with one finger (Dof) transcription factors play critical roles in numerous biological processes in plants. Nevertheless, knowledge about the members and the evolutionary and functional characteristics of the Dof gene family in physic nut is insufficient. Therefore, we performed a genome-wide screening and characterization of the Dof gene family within the physic nut draft genome. In total, 24 JcDof genes (encoding 33 JcDof proteins) were identified. All the JcDof genes were divided into three major groups based on phylogenetic inference, which was further validated by the subsequent gene structure and motif analysis. Genome comparison revealed that segmental duplication may have played crucial roles in the expansion of the JcDof gene family, and gene expansion was mainly subject to positive selection. The expression profile demonstrated the broad involvement of JcDof genes in response to various abiotic stresses, hormonal treatments and functional divergence. This study provides valuable information for better understanding the evolution of JcDof genes, and lays a foundation for future functional exploration of JcDof genes.
Evolutionary analysis of the jacalin-related lectin family genes in 11 fishes.

PubMed

Cao, Jun; Lv, Yueqing

2016-09-01

Jacalin-related lectins are a type of carbohydrate-binding protein that is distributed across a wide variety of organisms and involved in some important biological processes. The evolution of this gene family in fishes is unknown. Here, 47 putative jacalin genes in 11 fish species were identified and divided into 4 groups through phylogenetic analysis. Conserved gene organization and motif distribution existed in each group, suggesting their functional conservation. Some fishes have eleven jacalin genes, while others have only one or none in their genomes, suggesting dynamic changes in the number of jacalin genes during the evolution of fishes. Intragenic recombination played a key role in the evolution of jacalin genes. Synteny analyses of jacalin genes in some fishes implied conserved and dynamic evolution characteristics of this gene family and related genome segments. Moreover, a few functional divergence sites were identified within each group pair. Divergent expression profiles of the zebrafish jacalin genes were further investigated under different stresses. The results provide a foundation for exploring the characterization of the jacalin genes in fishes and will offer insights for additional functional studies. Copyright © 2016 Elsevier Ltd. All rights reserved.
MaGnET: Malaria Genome Exploration Tool.

PubMed

Sharman, Joanna L; Gerloff, Dietlind L

2013-09-15

The Malaria Genome Exploration Tool (MaGnET) is a software tool enabling intuitive 'exploration-style' visualization of functional genomics data relating to the malaria parasite, Plasmodium falciparum. MaGnET provides innovative integrated graphic displays for different datasets, including genomic location of genes, mRNA expression data, protein-protein interactions and more. Any selection of genes to explore made by the user is easily carried over between the different viewers for different datasets, and can be changed interactively at any point (without returning to a search). Availability: free online use (Java Web Start) or download (Java application archive and MySQL database; requires local MySQL installation) at http://malariagenomeexplorer.org. Contact: joanna.sharman@ed.ac.uk or dgerloff@ffame.org. Supplementary data are available at Bioinformatics online.
Comparative analysis of gene expression profiles of OPN signaling pathway in four kinds of liver diseases.

PubMed

Wang, Gaiping; Chen, Shasha; Zhao, Congcong; Li, Xiaofang; Zhao, Weiming; Yang, Jing; Chang, Cuifang; Xu, Cunshuan

2016-09-01

To explore the relevance of the OPN signalling pathway to the occurrence and development of nonalcoholic fatty liver disease (NAFLD), liver cirrhosis (LC), hepatic cancer (HC) and acute hepatic failure (AHF) at the transcriptional level, the Rat Genome 230 2.0 Array was used to detect expression profiles of OPN signalling pathway-related genes in the four kinds of liver diseases. The results showed that 23, 33, 59 and 74 genes were significantly changed in the above four kinds of liver diseases, respectively. H-clustering analysis showed that the expression profiles of OPN signalling-related genes were notably different among the four kinds of liver diseases. Subsequently, the 147 genes mentioned above were categorized into four clusters by k-means according to the similarity of gene expression, and expression analysis systematic explorer (EASE) functional enrichment analysis revealed that OPN signalling pathway-related genes were involved in cell adhesion and migration, cell proliferation, apoptosis, stress and inflammatory reaction, etc. Finally, ingenuity pathway analysis (IPA) software was used to predict the functions of OPN signalling-related genes, and the results indicated that the activities of ROS production, cell adhesion and migration, and cell proliferation were remarkably increased, while those of apoptosis, stress and inflammatory reaction were reduced in the four kinds of liver diseases. In summary, the above physiological activities changed more obviously in LC, HC and AHF than in NAFLD.
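The k-means step mentioned in the abstract above can be sketched as follows. This is a minimal illustration only: the abstract does not state which software, distance measure or initialisation was used, so the squared Euclidean distance, the "first k profiles" initialisation, the function name and the toy expression values are all assumptions.

```python
def kmeans(profiles, k, iterations=20):
    """Cluster expression profiles with plain k-means (Lloyd's algorithm).

    Assumption: centroids start at the first k profiles so that the
    result is deterministic; real analyses usually use random restarts.
    """
    centroids = [list(p) for p in profiles[:k]]
    labels = [0] * len(profiles)
    for _ in range(iterations):
        # Assignment step: attach each profile to its nearest centroid.
        for idx, profile in enumerate(profiles):
            labels[idx] = min(
                range(k),
                key=lambda c: sum(
                    (a - b) ** 2 for a, b in zip(profile, centroids[c])
                ),
            )
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(profiles, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical expression values for four genes under two conditions.
toy_profiles = [[1.0, 1.1], [5.0, 5.2], [0.9, 1.0], [5.1, 4.9]]
labels, centroids = kmeans(toy_profiles, k=2)
print(labels)
```

Genes with similar expression profiles end up with the same label, and each group can then be passed to a functional enrichment step.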
Role of G-protein-coupled receptor-related genes in insecticide resistance of the mosquito, Culex quinquefasciatus.

PubMed

Li, Ting; Liu, Lena; Zhang, Lee; Liu, Nannan

2014-09-29

G-protein-coupled receptors regulate signal transduction pathways and play diverse and pivotal roles in the physiology of insects; however, the precise function of GPCRs in insecticide resistance remains unclear. Using quantitative RT-PCR and functional genomic methods, we, for the first time, explored the function of GPCRs and GPCR-related genes in insecticide resistance of mosquitoes, Culex quinquefasciatus. A comparison of the expression of 115 GPCR-related genes at a whole genome level between resistant and susceptible Culex mosquitoes identified one and three GPCR-related genes that were up-regulated in highly resistant Culex mosquito strains, HAmCq(G8) and MAmCq(G6), respectively. To characterize the function of these up-regulated GPCR-related genes in resistance, the genes were knocked down in HAmCq(G8) and MAmCq(G6) using the RNAi technique. Knockdown of these four GPCR-related genes not only decreased resistance of the mosquitoes to permethrin but also repressed the expression of four insecticide resistance-related P450 genes, suggesting that the role of GPCR-related genes in resistance involves the regulation of resistance P450 gene expression. These results help in understanding the molecular regulation of resistance development in Cx. quinquefasciatus.
DOSim: an R package for similarity between diseases based on Disease Ontology.

PubMed

Li, Jiang; Gong, Binsheng; Chen, Xi; Liu, Tao; Wu, Chao; Zhang, Fan; Li, Chunquan; Li, Xiang; Rao, Shaoqi; Li, Xia

2011-06-29

The construction of the Disease Ontology (DO) has helped promote the investigation of diseases and disease risk factors. DO enables researchers to analyse disease similarity by adopting semantic similarity measures, and has expanded our understanding of the relationships between different diseases and our ability to classify them. Simultaneously, similarities between genes can also be analysed by their associations with similar diseases. As a result, disease heterogeneity is better understood and insights into the molecular pathogenesis of similar diseases have been gained. However, bioinformatics tools that provide easy and straightforward ways to use DO to study disease and gene similarity simultaneously are required. We have developed an R-based software package (DOSim) to compute the similarity between diseases and to measure the similarity between human genes in terms of diseases. DOSim incorporates a DO-based enrichment analysis function that can be used to explore the disease feature of an independent gene set. A multilayered enrichment analysis (GO and KEGG annotation) function that helps users explore the biological meaning implied in a newly detected gene module is also part of the DOSim package. We used the disease similarity application to demonstrate the relationship between 128 different DO cancer terms. The hierarchical clustering of these 128 different cancers showed modular characteristics. In another case study, we used the gene similarity application on 361 obesity-related genes. The results revealed the complex pathogenesis of obesity. In addition, the gene module detection and gene module multilayered annotation functions in DOSim, when applied to these 361 obesity-related genes, helped extend our understanding of the complex pathogenesis of obesity risk phenotypes and the heterogeneity of obesity-related diseases. DOSim can be used to detect disease-driven gene modules, and to annotate the modules for functions and pathways.
The DOSim package can also be used to visualise DO structure. DOSim can reflect the modular characteristic of disease-related genes and promote our understanding of the complex pathogenesis of diseases. DOSim is available on the Comprehensive R Archive Network (CRAN) or http://bioinfo.hrbmu.edu.cn/dosim.
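To make the idea of term enrichment analysis concrete, the sketch below computes a one-sided hypergeometric p-value, a test commonly used for this purpose. The abstract does not say which statistic DOSim uses, so the choice of test is an assumption, and the function name and the example numbers are hypothetical. Python is used here for illustration only; DOSim itself is an R package.

```python
from math import comb


def enrichment_pvalue(population, annotated, sample, overlap):
    """One-sided hypergeometric p-value P(X >= overlap).

    population: number of genes in the background set
    annotated:  background genes annotated to the term of interest
    sample:     size of the gene set being tested
    overlap:    genes in the tested set that carry the annotation
    """
    upper = min(annotated, sample)
    total = comb(population, sample)
    tail = sum(
        comb(annotated, i) * comb(population - annotated, sample - i)
        for i in range(overlap, upper + 1)
    )
    return tail / total


# Hypothetical example: 10 background genes, 4 annotated to a disease term,
# and a tested gene set of 3 genes of which 2 carry the annotation.
print(enrichment_pvalue(population=10, annotated=4, sample=3, overlap=2))
```

In practice one p-value is computed per term and the resulting list is corrected for multiple testing before any term is called enriched.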
Identification of potential therapeutic target genes, key miRNAs and mechanisms in oral lichen planus by bioinformatics analysis.

PubMed

Gong, Cuihua; Sun, Shangtong; Liu, Bing; Wang, Jing; Chen, Xiaodong

2017-06-01

The study aimed to identify the potential target genes and key miRNAs as well as to explore the underlying mechanisms in the pathogenesis of oral lichen planus (OLP) by bioinformatics analysis. The microarray data of GSE38617 were downloaded from the Gene Expression Omnibus (GEO) database. A total of 7 OLP and 7 normal samples were used to identify the differentially expressed genes (DEGs) and miRNAs. Functional enrichment analyses were then performed on the DEGs. Furthermore, a DEG-miRNA network and a miRNA-function network were constructed with Cytoscape software. A total of 1758 DEGs (598 up- and 1160 down-regulated genes) and 40 miRNAs (17 up- and 23 down-regulated miRNAs) were selected. The up-regulated genes were related to the nuclear factor-Kappa B (NF-κB) signaling pathway, while the down-regulated genes were mainly enriched in the function of the ribosome. Tumor necrosis factor (TNF), caspase recruitment domain family, member 11 (CARD11) and mitochondrial ribosomal protein (MRP) genes were identified in these functions. In addition, miR-302 was a hub node in the DEG-miRNA network and regulated cyclin D1 (CCND1). MiR-548a-2 was the key miRNA in the miRNA-function network, regulating multiple functions including ribosomal function. The NF-κB signaling pathway and ribosome function may be the pathogenic mechanisms of OLP. Genes such as TNF, CARD11, MRP genes and CCND1 may be potential therapeutic target genes in OLP. MiR-548a-2 and miR-302 may play important roles in OLP development. Copyright © 2017 Elsevier Ltd. All rights reserved.
Comparative genome analysis in the integrated microbial genomes (IMG) system.

PubMed

Markowitz, Victor M; Kyrpides, Nikos C

2007-01-01

Comparative genome analysis is critical for the effective exploration of a rapidly growing number of complete and draft sequences for microbial genomes. The Integrated Microbial Genomes (IMG) system (img.jgi.doe.gov) has been developed as a community resource that provides support for comparative analysis of microbial genomes in an integrated context. IMG allows users to navigate the multidimensional microbial genome data space and focus their analysis on a subset of genes, genomes, and functions of interest. IMG provides graphical viewers, summaries, and occurrence profile tools for comparing genes, pathways, and functions (terms) across specific genomes. Genes can be further examined using gene neighborhoods and compared with sequence alignment tools.
TCGA4U: A Web-Based Genomic Analysis Platform To Explore And Mine TCGA Genomic Data For Translational Research.

PubMed

Huang, Zhenzhen; Duan, Huilong; Li, Haomin

2015-01-01

Large-scale human cancer genomics projects, such as TCGA, have generated large amounts of genomic data for further study. Exploring and mining these data to obtain meaningful analysis results can help researchers find potential genomic alterations that intervene in the development and metastasis of tumors. We developed a web-based gene analysis platform, named TCGA4U, which uses statistical methods and models to help translational investigators explore, mine and visualize human cancer genomic characteristic information from the TCGA datasets. Furthermore, through Gene Ontology (GO) annotation and clinical data integration, the genomic data were transformed into biological process, molecular function, cellular component and survival curves to help researchers identify potential driver genes. Clinical researchers without expertise in data analysis will benefit from such a user-friendly genomic analysis platform.
Analytical workflow profiling gene expression in murine macrophages

PubMed Central

Nixon, Scott E.; González-Peña, Dianelys; Lawson, Marcus A.; McCusker, Robert H.; Hernandez, Alvaro G.; O’Connor, Jason C.; Dantzer, Robert; Kelley, Keith W.

2015-01-01

Comprehensive and simultaneous analysis of all genes in a biological sample is a capability of RNA-Seq technology. Analysis of the entire transcriptome benefits from summarization of genes at the functional level. As a cellular response of interest not previously explored with RNA-Seq, peritoneal macrophages from mice under two conditions (control and immunologically challenged) were analyzed for gene expression differences. Quantification of individual transcripts modeled RNA-Seq read distribution and uncertainty (using a Beta Negative Binomial distribution), then tested for differential transcript expression (False Discovery Rate-adjusted p-value < 0.05). Enrichment of functional categories utilized the list of differentially expressed genes. A total of 2079 differentially expressed transcripts representing 1884 genes were detected. The 92 enriched categories from Gene Ontology Biological Processes and Molecular Functions and from KEGG pathways were grouped into 6 clusters. Clusters included defense and inflammatory response (Enrichment Score = 11.24) and ribosomal activity (Enrichment Score = 17.89). Our work provides a context to the fine detail of individual gene expression differences in murine peritoneal macrophages during immunological challenge with high throughput RNA-Seq. PMID:25708305
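The "False Discovery Rate-adjusted p-value < 0.05" criterion in the abstract above can be illustrated with the Benjamini-Hochberg procedure. The abstract does not name the adjustment method, so Benjamini-Hochberg is an assumption here, chosen because it is a widely used FDR procedure; the function name and the example p-values are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so that adjusted values stay
    # monotone: each one is the minimum of p * m / rank seen so far.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical raw p-values for four transcripts.
raw = [0.01, 0.04, 0.03, 0.20]
adjusted = benjamini_hochberg(raw)
significant = [i for i, q in enumerate(adjusted) if q < 0.05]
print(adjusted, significant)
```

Note that two of the raw p-values are below 0.05 but no longer pass after adjustment, which is the effect the FDR threshold is meant to have when many transcripts are tested at once.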
Exploring the Midgut Transcriptome and Brush Border Membrane Vesicle Proteome of the Rice Stem Borer, Chilo suppressalis (Walker)

PubMed Central

Peng, Chuanhua; Wang, Xiaoping; Li, Fei; Lin, Yongjun

2012-01-01

The rice stem borer, Chilo suppressalis (Walker) (Lepidoptera: Pyralidae), is one of the most detrimental pests affecting rice crops. The use of Bacillus thuringiensis (Bt) toxins has been explored as a means to control this pest, but the potential for C. suppressalis to develop resistance to Bt toxins makes this approach problematic. Few C. suppressalis gene sequences are known, which makes in-depth study of gene function difficult. Herein, we sequenced the midgut transcriptome of the rice stem borer. In total, 37,040 contigs were obtained, with a mean size of 497 bp. As expected, the transcripts of C. suppressalis shared high similarity with arthropod genes. Gene ontology and KEGG analysis were used to classify the gene functions in C. suppressalis. Using the midgut transcriptome data, we conducted a proteome analysis to identify proteins expressed abundantly in the brush border membrane vesicles (BBMV). Of the 100 most abundant proteins that were excised and subjected to mass spectrometry analysis, 74 share high similarity with known proteins. Among these proteins, Western blot analysis showed that Aminopeptidase N and EH domain-containing protein bind the Bt toxin Cry1Ac. These data provide invaluable information about the gene sequences of C. suppressalis and the proteins that bind Cry1Ac. PMID:22666467
NetGen: a novel network-based probabilistic generative model for gene set functional enrichment analysis.

PubMed

Sun, Duanchen; Liu, Yinliang; Zhang, Xiang-Sun; Wu, Ling-Yun

2017-09-21

High-throughput experimental techniques have been dramatically improved and widely applied in the past decades. However, biological interpretation of high-throughput experimental results, such as differentially expressed gene sets derived from microarray or RNA-seq experiments, is still a challenging task. Gene Ontology (GO) is commonly used in functional enrichment studies. The GO terms identified via current functional enrichment analysis tools often contain direct parent or descendant terms in the GO hierarchical structure. Highly redundant terms make it difficult for users to analyze the underlying biological processes. In this paper, a novel network-based probabilistic generative model, NetGen, was proposed to perform the functional enrichment analysis. An additional protein-protein interaction (PPI) network was explicitly used to assist the identification of significantly enriched GO terms. NetGen outperformed the existing methods in the simulation studies. The effectiveness of NetGen was explored further on four real datasets. Notably, several GO terms which were not directly linked with the active gene list for each disease were identified. These terms were closely related to the corresponding diseases when checked against the curated literature. NetGen has been implemented in the R package CopTea, publicly available at GitHub ( http://github.com/wulingyun/CopTea/ ). Our procedure leads to a more reasonable and interpretable result of the functional enrichment analysis. As a novel term combination-based functional enrichment analysis method, NetGen is complementary to current individual term-based methods, and can help to explore the underlying pathogenesis of complex diseases.

DaGO-Fun: tool for Gene Ontology-based functional analysis using term information content measures

PubMed Central

2013-01-01

Background The use of Gene Ontology (GO) data in protein analyses has largely contributed to the improved outcomes of these analyses. Several GO semantic similarity measures have been proposed in recent years and provide tools that allow the integration of biological knowledge embedded in the GO structure into different biological analyses. There is a need for a unified tool that provides the scientific community with the opportunity to explore these different GO similarity measure approaches and their biological applications. Results We have developed DaGO-Fun, an online tool available at http://web.cbio.uct.ac.za/ITGOM, which incorporates many different GO similarity measures for exploring, analyzing and comparing GO terms and proteins within the context of GO. It uses GO data and UniProt proteins with their GO annotations as provided by the Gene Ontology Annotation (GOA) project to precompute GO term information content (IC), enabling rapid response to user queries. Conclusions The DaGO-Fun online tool has the advantage of integrating all the relevant IC-based GO similarity measures, including topology- and annotation-based approaches, to facilitate effective exploration of these measures, thus enabling users to choose the most relevant approach for their application. Furthermore, this tool includes several biological applications related to GO semantic similarity scores, including the retrieval of genes based on their GO annotations, the clustering of functionally related genes within a set, and term enrichment analysis. PMID:24067102
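
To make the idea of precomputed information content concrete, here is a minimal sketch of one annotation-based flavor: IC(t) = -log p(t), where p(t) is the share of annotations falling on term t or its descendants, together with a Resnik-style similarity (the IC of the most informative common ancestor). The toy ontology, the counts, and all function names are invented for illustration and assume each annotation is counted once; this is not DaGO-Fun's code or data.

```python
import math

# Hypothetical toy ontology: term -> set of direct parents.
PARENTS = {
    "root": set(),
    "A": {"root"},
    "B": {"root"},
    "A1": {"A"},
    "A2": {"A"},
}

# Hypothetical number of annotations made directly to each term.
DIRECT_COUNTS = {"root": 0, "A": 2, "B": 4, "A1": 1, "A2": 1}


def ancestors(term):
    """Return the term itself together with all of its ancestors."""
    seen = {term}
    stack = [term]
    while stack:
        for parent in PARENTS[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def information_content():
    """Propagate counts up the hierarchy and return IC(t) = log(N / n_t)."""
    totals = {term: 0 for term in PARENTS}
    for term, count in DIRECT_COUNTS.items():
        for anc in ancestors(term):
            totals[anc] += count
    n_all = totals["root"]
    return {t: math.log(n_all / n) for t, n in totals.items() if n > 0}


def resnik(term1, term2, ic):
    """IC of the most informative ancestor shared by the two terms."""
    common = ancestors(term1) & ancestors(term2)
    return max(ic[t] for t in common)


if __name__ == "__main__":
    ic = information_content()
    print(ic)
    print(resnik("A1", "A2", ic), resnik("A1", "B", ic))
```

Computing the IC table once and reusing it for every pairwise query is what makes the precomputation mentioned in the abstract pay off.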
dbCPG: A web resource for cancer predisposition genes.

PubMed

Wei, Ran; Yao, Yao; Yang, Wu; Zheng, Chun-Hou; Zhao, Min; Xia, Junfeng

2016-06-21

Cancer predisposition genes (CPGs) are genes in which inherited mutations confer highly or moderately increased risks of developing cancer. Identification of these genes and understanding the biological mechanisms that underlie them is crucial for the prevention, early diagnosis, and optimized management of cancer. Over the past decades, great efforts have been made to identify CPGs through multiple strategies. However, information on these CPGs and their molecular functions is scattered. To address this issue and provide a comprehensive resource for researchers, we developed the Cancer Predisposition Gene Database (dbCPG, Database URL: http://bioinfo.ahu.edu.cn:8080/dbCPG/index.jsp), the first literature-based gene resource for exploring human CPGs. It contains 827 human (724 protein-coding, 23 non-coding, and 80 unknown type genes), 637 rat, and 658 mouse CPGs. Furthermore, data mining was performed to gain insight into the CPG data, including functional annotation, gene prioritization, network analysis of prioritized genes and overlap analysis across multiple cancer types. A user-friendly web interface with multiple browse, search, and upload functions was also developed to facilitate access to the latest information on CPGs. Taken together, the dbCPG database provides a comprehensive data resource for further studies of cancer predisposition genes.
In search of functional association from time-series microarray data based on the change trend and level of gene expression

PubMed Central

He, Feng; Zeng, An-Ping

2006-01-01

Background The increasing availability of time-series expression data opens up new possibilities to study functional linkages of genes. Present methods used to infer functional linkages between genes from expression data are mainly based on a point-to-point comparison. Change trends between consecutive time points in time-series data have so far not been well explored. Results In this work we present a new method based on extracting the main features of the change trend and level of gene expression between consecutive time points. The method, termed trend correlation (TC), includes two major steps: (1) calculating a maximal local alignment of change trend score by dynamic programming and a change trend correlation coefficient between the maximal matched change levels of each gene pair; (2) inferring relationships of gene pairs based on two statistical extraction procedures. The new method considers time shifts and inverted relationships in a similar way to the local clustering (LC) method, but the latter is merely based on a point-to-point comparison. The TC method is demonstrated with data from the yeast cell cycle and compared with the LC method and the widely used Pearson correlation coefficient (PCC) based clustering method. The biological significance of the gene pairs is examined with several large-scale yeast databases. Although the TC method predicts an overall lower number of gene pairs than the other two methods at the same p-value threshold, the additional number of gene pairs inferred by the TC method is considerable: e.g. 20.5% compared with the LC method and 49.6% with the PCC method for a p-value threshold of 2.7E-3. Moreover, the percentage of the inferred gene pairs consistent with databases by our method is generally higher than the LC method and similar to the PCC method. 
A significant number of the gene pairs inferred only by the TC method are process-identity or function-similarity pairs or have well-documented biological interactions, including 443 known protein interactions and some known cell cycle related regulatory interactions. It should be emphasized that the overlap of gene pairs detected by the three methods is normally not very high, indicating the necessity of combining the different methods in search of functional association of genes from time-series data. For a p-value threshold of 1E-5 the percentage of process-identity and function-similarity gene pairs among the shared part of the three methods reaches 60.2% and 55.6% respectively, building a good basis for further experimental and functional study. Furthermore, the combined use of methods is important to infer more complete regulatory circuits and networks, as exemplified in this study. Conclusion The TC method can significantly augment the current major methods to infer functional linkages and biological networks and is well suited for exploring temporal relationships of gene expression in time-series data. PMID:16478547
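
The idea of comparing change trends rather than raw values, while allowing time shifts and inverted relationships, can be illustrated with a deliberately simplified sketch. It encodes each step between consecutive time points as up, down, or flat and then scans a small window of shifts for the best agreement. It does not reproduce the dynamic-programming local alignment, the change-level correlation coefficient, or the statistical extraction procedures of the TC method; the function names, the `eps` and `max_shift` parameters, and the example series are all hypothetical.

```python
def trend(series, eps=0.0):
    """Encode each change between consecutive points as +1 (up), -1 (down) or 0 (flat)."""
    codes = []
    for previous, current in zip(series, series[1:]):
        delta = current - previous
        codes.append(1 if delta > eps else -1 if delta < -eps else 0)
    return codes


def best_trend_match(x, y, max_shift=2):
    """Return (score, shift, inverted) for the best trend agreement between x and y.

    score counts agreeing steps minus disagreeing steps; shift is how many
    steps y lags behind x; inverted is True when y moves opposite to x.
    """
    tx, ty = trend(x), trend(y)
    best = (0, 0, False)
    for shift in range(-max_shift, max_shift + 1):
        # Pair step i of x with step i + shift of y wherever both exist.
        agreement = sum(
            tx[i] * ty[i + shift]
            for i in range(len(tx))
            if 0 <= i + shift < len(ty)
        )
        for score, inverted in ((agreement, False), (-agreement, True)):
            if score > best[0]:
                best = (score, shift, inverted)
    return best


if __name__ == "__main__":
    gene_a = [0, 1, 3, 2, 2, 5, 4]
    gene_b = [9, 0, 1, 3, 2, 2, 5]   # gene_a delayed by one time point
    print(best_trend_match(gene_a, gene_b))
```

A point-to-point comparison at shift 0 would miss the delayed pair in this example, which is the motivation the abstract gives for considering time shifts.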
BioGraph: unsupervised biomedical knowledge discovery via automated hypothesis generation

PubMed Central

2011-01-01

We present BioGraph, a data integration and data mining platform for the exploration and discovery of biomedical information. The platform offers prioritizations of putative disease genes, supported by functional hypotheses. We show that BioGraph can retrospectively confirm recently discovered disease genes and identify potential susceptibility genes, outperforming existing technologies, without requiring prior domain knowledge. Additionally, BioGraph allows for generic biomedical applications beyond gene discovery. BioGraph is accessible at http://www.biograph.be. PMID:21696594
“Guilt by Association” Is the Exception Rather Than the Rule in Gene Networks

PubMed Central

Gillis, Jesse; Pavlidis, Paul

2012-01-01

Gene networks are commonly interpreted as encoding functional information in their connections. An extensively validated principle called guilt by association states that genes which are associated or interacting are more likely to share function. Guilt by association provides the central top-down principle for analyzing gene networks in functional terms or assessing their quality in encoding functional information. In this work, we show that functional information within gene networks is typically concentrated in only a very few interactions whose properties cannot be reliably related to the rest of the network. In effect, the apparent encoding of function within networks has been largely driven by outliers whose behaviour cannot even be generalized to individual genes, let alone to the network at large. While experimentalist-driven analysis of interactions may use prior expert knowledge to focus on the small fraction of critically important data, large-scale computational analyses have typically assumed that high-performance cross-validation in a network is due to a generalizable encoding of function. Because we find that gene function is not systemically encoded in networks, but dependent on specific and critical interactions, we conclude it is necessary to focus on the details of how networks encode function and what information computational analyses use to extract functional meaning. We explore a number of consequences of this and find that network structure itself provides clues as to which connections are critical and that systemic properties, such as scale-free-like behaviour, do not map onto the functional connectivity within networks. PMID:22479173
The Evolutionary Fate of the Genes Encoding the Purine Catabolic Enzymes in Hominoids, Birds, and Reptiles

PubMed Central

Keebaugh, Alaine C.; Thomas, James W.

2010-01-01

Gene loss has been proposed to play a major role in adaptive evolution, and recent studies are beginning to reveal its importance in human evolution. However, the potential consequence of a single gene-loss event upon the fates of functionally interrelated genes is poorly understood. Here, we use the purine metabolic pathway as a model system in which to explore this important question. The loss of urate oxidase (UOX) activity, a necessary step in this pathway, has occurred independently in the hominoid and bird/reptile lineages. Because the loss of UOX would have removed the functional constraint upon downstream genes in this pathway, these downstream genes are generally assumed to have subsequently deteriorated. In this study, we used a comparative genomics approach to empirically determine the fate of UOX itself and the downstream genes in five hominoids, two birds, and a reptile. Although we found that the loss of UOX likely triggered the genetic deterioration of the immediate downstream genes in the hominoids, surprisingly, in the birds and reptiles, the UOX locus itself and some of the downstream genes were present in the genome and predicted to encode proteins. To account for the variable pattern of gene retention and loss after the inactivation of UOX, we hypothesize that although gene loss is a common fate for genes that have been rendered obsolete due to the upstream loss of an enzyme in a metabolic pathway, it is also possible that the same lack of constraint will foster the evolution of new functions or allow the optimization of preexisting alternative functions in the downstream genes, thereby resulting in gene retention. Thus, adaptive single-gene losses have the potential to influence the long-term evolutionary fate of functionally interrelated genes. PMID:20106906
The evolutionary fate of the genes encoding the purine catabolic enzymes in hominoids, birds, and reptiles.

PubMed

Keebaugh, Alaine C; Thomas, James W

2010-06-01

Gene loss has been proposed to play a major role in adaptive evolution, and recent studies are beginning to reveal its importance in human evolution. However, the potential consequence of a single gene-loss event upon the fates of functionally interrelated genes is poorly understood. Here, we use the purine metabolic pathway as a model system in which to explore this important question. The loss of urate oxidase (UOX) activity, a necessary step in this pathway, has occurred independently in the hominoid and bird/reptile lineages. Because the loss of UOX would have removed the functional constraint upon downstream genes in this pathway, these downstream genes are generally assumed to have subsequently deteriorated. In this study, we used a comparative genomics approach to empirically determine the fate of UOX itself and the downstream genes in five hominoids, two birds, and a reptile. Although we found that the loss of UOX likely triggered the genetic deterioration of the immediate downstream genes in the hominoids, surprisingly, in the birds and reptiles, the UOX locus itself and some of the downstream genes were present in the genome and predicted to encode proteins. To account for the variable pattern of gene retention and loss after the inactivation of UOX, we hypothesize that although gene loss is a common fate for genes that have been rendered obsolete due to the upstream loss of an enzyme in a metabolic pathway, it is also possible that the same lack of constraint will foster the evolution of new functions or allow the optimization of preexisting alternative functions in the downstream genes, thereby resulting in gene retention. Thus, adaptive single-gene losses have the potential to influence the long-term evolutionary fate of functionally interrelated genes.
Exploring the role of peptides in polymer-based gene delivery.

PubMed

Sun, Yanping; Yang, Zhen; Wang, Chunxi; Yang, Tianzhi; Cai, Cuifang; Zhao, Xiaoyun; Yang, Li; Ding, Pingtian

2017-09-15

Polymers are widely studied as non-viral gene vectors because of their strong DNA binding ability, capacity to carry large payloads, flexibility of chemical modification, low immunogenicity, and facile manufacturing processes. However, high cytotoxicity and low transfection efficiency substantially restrict their application in clinical trials. Incorporating functional peptides is a promising approach to address these issues. Peptides demonstrate various functions in polymer-based gene delivery systems, such as targeting to specific cells, breaching membrane barriers, facilitating DNA condensation and release, and lowering cytotoxicity. In this review, we systematically summarize the role of peptides in polymer-based gene delivery, and elaborate on how to rationally design polymer-peptide based gene delivery vectors. Polymers are widely studied as non-viral gene vectors, but suffer from high cytotoxicity and low transfection efficiency. Incorporating short, bioactive peptides into polymer-based gene delivery systems can address this issue. Peptides demonstrate various functions in polymer-based gene delivery systems, such as targeting to specific cells, breaching membrane barriers, facilitating DNA condensation and release, and lowering cytotoxicity. In this review, we highlight the peptides' roles in polymer-based gene delivery, and elaborate on how to utilize various functional peptides to enhance the transfection efficiency of polymers. The optimized peptide-polymer vectors should be able to alter their structures and functions according to biological microenvironments and utilize inherent intracellular pathways of cells, and consequently overcome the barriers during gene delivery to enhance transfection efficiency. Copyright © 2017 Acta Materialia Inc. Published by Elsevier Ltd. All rights reserved.
GEsture: an online hand-drawing tool for gene expression pattern search.

PubMed

Wang, Chunyan; Xu, Yiqing; Wang, Xuelin; Zhang, Li; Wei, Suyun; Ye, Qiaolin; Zhu, Youxiang; Yin, Hengfu; Nainwal, Manoj; Tanon-Reyes, Luis; Cheng, Feng; Yin, Tongming; Ye, Ning

2018-01-01

Gene expression profiling data provide useful information for the investigation of biological function and process. However, identifying a specific expression pattern from extensive time series gene expression data is not an easy task. Clustering, a popular method, is often used to group genes with similar expression; however, genes with a 'desirable' or 'user-defined' pattern cannot be efficiently detected by clustering methods. To address these limitations, we developed an online tool called GEsture. Users can draw or graph a curve using a mouse instead of inputting abstract parameters of clustering methods. Taking a gene expression curve as input, GEsture searches time series datasets for genes showing similar, opposite and time-delay expression patterns. We present three examples that illustrate the capacity of GEsture in gene hunting while following users' requirements. GEsture also provides visualization tools (such as expression pattern figure, heat map and correlation network) to display the search results. The output may provide useful information for researchers to understand the targets, function and biological processes of the involved genes.
Identification and Characterization of the MADS-Box Genes and Their Contribution to Flower Organ in Carnation (Dianthus caryophyllus L.)

PubMed Central

Zhang, Xiaoni; Wang, Qijian; Yang, Shaozong; Lin, Shengnan; Bao, Manzhu; Wu, Quanshu; Wang, Caiyun; Fu, Xiaopeng

2018-01-01

Dianthus is a large genus containing many species with high ornamental economic value. Extensive breeding strategies have permitted exploration of improvements in the quality of cultivated carnation, particularly in flowers. However, little is known about the molecular mechanisms of flower development in carnation. Here, we report the identification and description of MADS-box genes in carnation (DcaMADS) with a focus on those involved in flower development and organ identity determination. In this study, 39 MADS-box genes were identified from the carnation genome and transcriptome by phylogenetic analysis. These genes were categorized into four subgroups (30 MIKCc, two MIKC*, two Mα, and five Mγ). The MADS-box domain, gene structure, and conserved motif compositions of the carnation MADS genes were analysed. Meanwhile, the expression of DcaMADS genes was significantly different in stems, leaves, and flower buds. Further studies were carried out to explore the expression of DcaMADS genes in individual flower organs, and some crucial DcaMADS genes correlated with their putative function were validated. Finally, a new expression pattern of DcaMADS genes in flower organs of carnation was provided: sepal (three class E genes and two class A genes), petal (two class B genes, two class E genes, and one SHORT VEGETATIVE PHASE (SVP)), stamen (two class B genes, two class E genes, and two class C), styles (two class E genes and two class C), and ovary (two class E genes, two class C, one AGAMOUS-LIKE 6 (AGL6), one SEEDSTICK (STK), one B sister, one SVP, and one Mα). This result proposes a model of floral organ identity in carnation, and it may be helpful to further explore the molecular mechanism of flower organ identity in carnation. PMID:29617274
Identification and Characterization of the MADS-Box Genes and Their Contribution to Flower Organ in Carnation (Dianthus caryophyllus L.).

PubMed

Zhang, Xiaoni; Wang, Qijian; Yang, Shaozong; Lin, Shengnan; Bao, Manzhu; Bendahmane, Mohammed; Wu, Quanshu; Wang, Caiyun; Fu, Xiaopeng

2018-04-04

Dianthus is a large genus containing many species with high ornamental economic value. Extensive breeding strategies have permitted exploration of improvements in the quality of cultivated carnation, particularly in flowers. However, little is known about the molecular mechanisms of flower development in carnation. Here, we report the identification and description of MADS-box genes in carnation (DcaMADS) with a focus on those involved in flower development and organ identity determination. In this study, 39 MADS-box genes were identified from the carnation genome and transcriptome by phylogenetic analysis. These genes were categorized into four subgroups (30 MIKCc, two MIKC*, two Mα, and five Mγ). The MADS-box domain, gene structure, and conserved motif compositions of the carnation MADS genes were analysed. Meanwhile, the expression of DcaMADS genes was significantly different in stems, leaves, and flower buds. Further studies were carried out to explore the expression of DcaMADS genes in individual flower organs, and some crucial DcaMADS genes correlated with their putative function were validated. Finally, a new expression pattern of DcaMADS genes in flower organs of carnation was provided: sepal (three class E genes and two class A genes), petal (two class B genes, two class E genes, and one SHORT VEGETATIVE PHASE (SVP)), stamen (two class B genes, two class E genes, and two class C), styles (two class E genes and two class C), and ovary (two class E genes, two class C, one AGAMOUS-LIKE 6 (AGL6), one SEEDSTICK (STK), one B sister, one SVP, and one Mα). This result proposes a model of floral organ identity in carnation, and it may be helpful to further explore the molecular mechanism of flower organ identity in carnation.
Properties of genes essential for mouse development

PubMed Central

Kabir, Mitra; Barradas, Ana; Tzotzos, George T.; Hentges, Kathryn E.

2017-01-01

Essential genes are those that are critical for life. In the specific case of the mouse, they are the set of genes whose deletion means that a mouse is unable to survive after birth. As such, they are the key minimal set of genes needed for all the steps of development to produce an organism capable of life ex utero. We explored a wide range of sequence and functional features to characterise essential (lethal) and non-essential (viable) genes in mice. Manually curated experimental data identified 1301 essential genes and 3451 viable genes. Many sequence features show highly significant differences between essential and viable mouse genes. Essential genes generally encode complex proteins, with multiple domains and many introns. These genes tend to be long, highly expressed, old and evolutionarily conserved. They tend to encode ligases, transferases, phosphorylated proteins, intracellular proteins, nuclear proteins, and hubs in protein-protein interaction networks. They are involved with regulating protein-protein interactions, gene expression and metabolic processes, cell morphogenesis, cell division, cell proliferation, DNA replication, cell differentiation, DNA repair and transcription, and embryonic development. Viable genes tend to encode membrane proteins or secreted proteins, and are associated with functions such as cellular communication, apoptosis, behaviour and immune response, as well as housekeeping and tissue specific functions. Viable genes are linked to transport, ion channels, signal transduction, calcium binding and lipid binding, consistent with their location in membranes and involvement with cell-cell communication. From the analysis of the composite features of essential and viable genes, we conclude that essential genes tend to be required for intracellular functions, and viable genes tend to be involved with extracellular functions and cell-cell communication. 
Knowledge of the features that are over-represented in essential genes allows for a deeper understanding of the functions and processes implemented during mammalian development. PMID:28562614
Discovering Protein-Coding Genes from the Environment: Time for the Eukaryotes?

PubMed

Marmeisse, Roland; Kellner, Harald; Fraissinet-Tachet, Laurence; Luis, Patricia

2017-09-01

Eukaryotic microorganisms from diverse environments encompass a large number of taxa, many of them still unknown to science. One strategy to mine these organisms for genes of biotechnological relevance is to use a pool of eukaryotic mRNA directly extracted from environmental samples. Recent reports demonstrate that the resulting metatranscriptomic cDNA libraries can be screened by expression in yeast for a wide range of genes and functions from many of the different eukaryotic taxa. In combination with novel emerging high-throughput technologies, we anticipate that this approach should contribute to exploring the functional diversity of the eukaryotic microbiota. Copyright © 2017 Elsevier Ltd. All rights reserved.
Homogeneous versus heterogeneous probes for microbial ecological microarrays.

PubMed

Bae, Jin-Woo; Park, Yong-Ha

2006-07-01

Microbial ecological microarrays have been developed for investigating the composition and functions of microorganism communities in environmental niches. These arrays include microbial identification microarrays, which use oligonucleotides, gene fragments or microbial genomes as probes. In this article, the advantages and disadvantages of each type of probe are reviewed. Oligonucleotide probes are currently useful for probing uncultivated bacteria that are not amenable to gene fragment probing, whereas the functional gene fragments amplified randomly from microbial genomes require phylogenetic and hierarchical categorization before use as microbial identification probes, despite their high resolution for both specificity and sensitivity. Until more bacteria are sequenced and gene fragment probes are thoroughly validated, heterogeneous bacterial genome probes will provide a simple, sensitive and quantitative tool for exploring the ecosystem structure.
Functional relevance for type 1 diabetes mellitus-associated genetic variants by using integrative analyses.

PubMed

Qiu, Ying-Hua; Deng, Fei-Yan; Tang, Zai-Xiang; Jiang, Zhen-Huan; Lei, Shu-Feng

2015-10-01

Type 1 diabetes mellitus (type 1 DM) is an autoimmune disease. Although genome-wide association studies (GWAS) and meta-analyses have successfully identified numerous type 1 DM-associated susceptibility loci, the underlying mechanisms for these susceptibility loci are currently largely unclear. Based on publicly available datasets, we performed integrative analyses (i.e., integrated gene relationships among implicated loci, differential gene expression analysis, functional prediction and functional annotation clustering analysis) and combined them with expression quantitative trait loci (eQTL) results to further explore the functional mechanisms underlying the associations between genetic variants and type 1 DM. Among a total of 183 type 1 DM-associated SNPs, eQTL analysis showed that 17 SNPs had cis-regulated eQTL effects on 9 genes. All 9 eQTL genes were enriched in immune-related pathways or Gene Ontology (GO) terms. Functional prediction analysis identified 5 SNPs located in transcription factor (TF) binding sites. Of the 9 eQTL genes, 6 (TAP2, HLA-DOB, HLA-DQB1, HLA-DQA1, HLA-DRB5 and CTSH) were differentially expressed in type 1 DM-related cells. In particular, rs3825932 in CTSH has integrative functional evidence supporting the association with type 1 DM. These findings indicate that integrative analyses can yield important functional information to link genetic variants and type 1 DM. Copyright © 2015 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
GIANT 2.0: genome-scale integrated analysis of gene networks in tissues.

PubMed

Wong, Aaron K; Krishnan, Arjun; Troyanskaya, Olga G

2018-05-25

GIANT2 (Genome-wide Integrated Analysis of gene Networks in Tissues) is an interactive web server that enables biomedical researchers to analyze their proteins and pathways of interest and generate hypotheses in the context of genome-scale functional maps of human tissues. The precise actions of genes are frequently dependent on their tissue context, yet direct assay of tissue-specific protein function and interactions remains infeasible in many normal human tissues and cell-types. With GIANT2, researchers can explore predicted tissue-specific functional roles of genes and reveal changes in those roles across tissues, all through interactive multi-network visualizations and analyses. Additionally, the NetWAS approach available through the server uses tissue-specific/cell-type networks predicted by GIANT2 to re-prioritize statistical associations from GWAS studies and identify disease-associated genes. GIANT2 predicts tissue-specific interactions by integrating diverse functional genomics data from over 61,400 experiments for 283 diverse tissues and cell-types. GIANT2 does not require any registration or installation and is freely available for use at http://giant-v2.princeton.edu.
Effects of GSTM1/GSTT1 gene polymorphism and fruit & vegetable consumption on antioxidant biomarkers and cognitive function in the elderly: a community based cross-sectional study.

PubMed

Yuan, Linhong; Ma, Weiwei; Liu, Jinmeng; Meng, Liping; Liu, Jixia; Li, Shuang; Han, Jing; Liu, Quanri; Feng, Lingli; Wang, Chao; Xiao, Rong

2014-01-01

It has been reported that glutathione S-transferase (GST) gene polymorphism and fruit and vegetable (FV) intake are associated with body antioxidant capacity. Oxidative/anti-oxidative imbalance plays an important role in the pathogenesis of Alzheimer's disease (AD). However, the association of GST genotype and dietary FV consumption with body antioxidant biomarkers and cognitive function in the elderly is not clear. The aim of the present study was to determine the association of GST genotype and dietary FV intake with antioxidant biomarkers and cognitive function in the elderly. A food frequency questionnaire was used to collect data on dietary FV intake in 504 community-dwelling elderly people aged 55 to 75 years. GSTM1 and GSTT1 genotypes were determined by using a multiple-PCR method. Plasma and erythrocyte antioxidant biomarkers were measured. Cognitive function was measured by using the Montreal Cognitive Assessment. Statistical analysis was applied to explore the association of GST genotype and FV intake with antioxidant biomarker levels and cognitive function in the elderly. Individual GSTM1 or GSTT1 gene deletion affects body antioxidant biomarker levels, including erythrocyte GST activity, plasma total antioxidant capacity, and glutathione levels. GSTM1 and/or GSTT1 gene deletion has no effect on cognitive function in the surveyed participants. The effect of GST genotype on antioxidant biomarkers is FV intake dependent. There is an interaction between FV intake and GST genotype on cognitive function in the elderly. GST genotype or daily FV consumption impacts body antioxidant biomarkers, but not cognitive function, in the elderly. There were combined effects of GST genotype and FV consumption on cognitive function in the elderly population. A large-scale prospective population study is required to explore the association of GST genetic polymorphism, FV consumption and antioxidant biomarkers with cognitive function in the elderly.
Human microbiomes and their roles in dysbiosis, common diseases, and novel therapeutic approaches.

PubMed

Belizário, José E; Napolitano, Mauro

2015-01-01

The human body is the residence of a large number of commensal (non-pathogenic) and pathogenic microbial species that have co-evolved with the human genome, adaptive immune system, and diet. With recent advances in DNA-based technologies, we initiated the exploration of bacterial gene functions and their role in human health. The main goal of the human microbiome project is to characterize the abundance, diversity and functionality of the genes present in all microorganisms that permanently live in different sites of the human body. The gut microbiota expresses over 3.3 million bacterial genes, while the human genome expresses only 20 thousand genes. Microbe gene-products exert pivotal functions via the regulation of food digestion and immune system development. Studies are confirming that manipulation of non-pathogenic bacterial strains in the host can stimulate the recovery of the immune response to pathogenic bacteria causing diseases. Different approaches, including the use of nutraceutics (prebiotics and probiotics) as well as phages engineered with CRISPR/Cas systems and quorum sensing systems have been developed as new therapies for controlling dysbiosis (alterations in microbial community) and common diseases (e.g., diabetes and obesity). The designing and production of pharmaceuticals based on our own body's microbiome is an emerging field and is rapidly growing to be fully explored in the near future. This review provides an outlook on recent findings on the human microbiomes, their impact on health and diseases, and on the development of targeted therapies.
Human microbiomes and their roles in dysbiosis, common diseases, and novel therapeutic approaches

PubMed Central

Belizário, José E.; Napolitano, Mauro

2015-01-01

The human body is the residence of a large number of commensal (non-pathogenic) and pathogenic microbial species that have co-evolved with the human genome, adaptive immune system, and diet. With recent advances in DNA-based technologies, we initiated the exploration of bacterial gene functions and their role in human health. The main goal of the human microbiome project is to characterize the abundance, diversity and functionality of the genes present in all microorganisms that permanently live in different sites of the human body. The gut microbiota expresses over 3.3 million bacterial genes, while the human genome expresses only 20 thousand genes. Microbe gene-products exert pivotal functions via the regulation of food digestion and immune system development. Studies are confirming that manipulation of non-pathogenic bacterial strains in the host can stimulate the recovery of the immune response to pathogenic bacteria causing diseases. Different approaches, including the use of nutraceutics (prebiotics and probiotics) as well as phages engineered with CRISPR/Cas systems and quorum sensing systems have been developed as new therapies for controlling dysbiosis (alterations in microbial community) and common diseases (e.g., diabetes and obesity). The designing and production of pharmaceuticals based on our own body’s microbiome is an emerging field and is rapidly growing to be fully explored in the near future. This review provides an outlook on recent findings on the human microbiomes, their impact on health and diseases, and on the development of targeted therapies. PMID:26500616
Transformation of the US bread wheat Butte 86 and silencing of omega-5 gliadin genes

USDA-ARS's Scientific Manuscript database

Complex groups of proteins determine the unique functional properties of wheat flour and are sometimes responsible for food intolerances and allergies in individuals that consume wheat products. Transgenic approaches can be used to explore the functions of different flour proteins, but are limited t...

GreenPhylDB v2.0: comparative and functional genomics in plants.

PubMed

Rouard, Mathieu; Guignon, Valentin; Aluome, Christelle; Laporte, Marie-Angélique; Droc, Gaëtan; Walde, Christian; Zmasek, Christian M; Périn, Christophe; Conte, Matthieu G

2011-01-01

GreenPhylDB is a database designed for comparative and functional genomics based on complete genomes. Version 2 now contains sixteen full genomes of members of the plantae kingdom, ranging from algae to angiosperms, automatically clustered into gene families. Gene families are manually annotated and then analyzed phylogenetically in order to elucidate orthologous and paralogous relationships. The database offers various lists of gene families including plant, phylum and species specific gene families. For each gene cluster or gene family, easy access to gene composition, protein domains, publications, external links and orthologous gene predictions is provided. Web interfaces have been further developed to improve the navigation through information related to gene families. New analysis tools are also available, such as a gene family ontology browser that facilitates exploration. GreenPhylDB is a component of the South Green Bioinformatics Platform (http://southgreen.cirad.fr/) and is accessible at http://greenphyl.cirad.fr. It enables comparative genomics in a broad taxonomy context to enhance the understanding of evolutionary processes and thus tends to speed up gene discovery.
Identifying ultrasensitive HGF dose-response functions in a 3D mammalian system for synthetic morphogenesis.

PubMed

Senthivel, Vivek Raj; Sturrock, Marc; Piedrafita, Gabriel; Isalan, Mark

2016-12-16

Nonlinear responses to signals are widespread natural phenomena that affect various cellular processes. Nonlinearity can be a desirable characteristic for engineering living organisms because it can lead to more switch-like responses, similar to those underlying the wiring in electronics. Steeper functions are described as ultrasensitive, and can be applied in synthetic biology by using various techniques including receptor decoys, multiple co-operative binding sites, and sequential positive feedbacks. Here, we explore the inherent non-linearity of a biological signaling system to identify functions that can potentially be exploited using cell genome engineering. For this, we performed genome-wide transcription profiling to identify genes with ultrasensitive response functions to Hepatocyte Growth Factor (HGF). We identified 3,527 genes that react to increasing concentrations of HGF, in Madin-Darby canine kidney (MDCK) cells, grown as cysts in 3D collagen cell culture. By fitting a generic Hill function to the dose-responses of these genes we obtained a measure of the ultrasensitivity of HGF-responsive genes, identifying a subset with higher apparent Hill coefficients (e.g. MMP1, TIMP1, SNORD75, SNORD86 and ERRFI1). The regulatory regions of these genes are potential candidates for future engineering of synthetic mammalian gene circuits requiring nonlinear responses to HGF signalling.
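The Hill-function fit mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' fitting procedure: it assumes the plateau response (`top`) is already known and estimates the Hill coefficient by least squares on the linearized Hill plot. The function names, the dose values and the parameters are hypothetical and chosen only for illustration.

```python
import math


def hill(dose, top, k_half, n):
    """Generic Hill function: response at a given dose."""
    return top * dose**n / (k_half**n + dose**n)


def fit_hill_coefficient(doses, responses, top):
    """Estimate (n, k_half) from the linearized Hill plot.

    Uses log(y / (top - y)) = n * log(x) - n * log(k_half),
    so an ordinary least-squares line through the transformed
    points has slope n. Only points with 0 < y < top are usable.
    """
    xs, ys = [], []
    for dose, resp in zip(doses, responses):
        if dose > 0 and 0 < resp < top:
            xs.append(math.log(dose))
            ys.append(math.log(resp / (top - resp)))
    if len(xs) < 2:
        raise ValueError("need at least two usable points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    n = sxy / sxx                      # slope = Hill coefficient
    intercept = mean_y - n * mean_x    # intercept = -n * log(k_half)
    k_half = math.exp(-intercept / n)
    return n, k_half


# Synthetic, noise-free dose-response curve (made-up values).
doses = [0.5, 1.0, 2.0, 4.0, 8.0, 16.0]
responses = [hill(d, top=1.0, k_half=4.0, n=3.0) for d in doses]
n_est, k_est = fit_hill_coefficient(doses, responses, top=1.0)
print(f"Hill coefficient ~ {n_est:.2f}, half-maximal dose ~ {k_est:.2f}")
```

A coefficient well above 1 indicates a steeper, more switch-like response. With noisy measurements a nonlinear least-squares fit of all parameters would usually be preferable to this linearization.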
IMG-ABC: new features for bacterial secondary metabolism analysis and targeted biosynthetic gene cluster discovery in thousands of microbial genomes.

PubMed

Hadjithomas, Michalis; Chen, I-Min A; Chu, Ken; Huang, Jinghua; Ratner, Anna; Palaniappan, Krishna; Andersen, Evan; Markowitz, Victor; Kyrpides, Nikos C; Ivanova, Natalia N

2017-01-04

Secondary metabolites produced by microbes have diverse biological functions, which makes them a great potential source of biotechnologically relevant compounds with antimicrobial, anti-cancer and other activities. The proteins needed to synthesize these natural products are often encoded by clusters of co-located genes called biosynthetic gene clusters (BCs). In order to advance the exploration of microbial secondary metabolism, we developed the largest publically available database of experimentally verified and predicted BCs, the Integrated Microbial Genomes Atlas of Biosynthetic gene Clusters (IMG-ABC) (https://img.jgi.doe.gov/abc/). Here, we describe an update of IMG-ABC, which includes ClusterScout, a tool for targeted identification of custom biosynthetic gene clusters across 40 000 isolate microbial genomes, and a new search capability to query more than 700 000 BCs from isolate genomes for clusters with similar Pfam composition. Additional features enable fast exploration and analysis of BCs through two new interactive visualization features, a BC function heatmap and a BC similarity network graph. These new tools and features add to the value of IMG-ABC's vast body of BC data, facilitating their in-depth analysis and accelerating secondary metabolite discovery. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Still acting green: continued expression of photosynthetic genes in the heterotrophic Dinoflagellate Pfiesteria piscicida (Peridiniales, Alveolata).

PubMed

Kim, Gwang Hoon; Jeong, Hae Jin; Yoo, Yeong Du; Kim, Sunju; Han, Ji Hee; Han, Jong Won; Zuccarello, Giuseppe C

2013-01-01

The loss of photosynthetic function should lead to the cessation of expression and finally the loss of photosynthetic genes in the new heterotroph. Dinoflagellates are known to have lost their photosynthetic ability several times. Dinoflagellates have also acquired photosynthesis from other organisms multiple times, either on a long-term basis or as "kleptoplastids". The fate of photosynthetic gene expression in heterotrophs can inform our understanding of the evolution of gene expression patterns after functional loss, and of the dinoflagellates' ability to acquire new photosynthetic function through additional endosymbiosis. To explore this we analyzed a large-scale EST database consisting of 151,091 unique sequences (29,170 contigs, 120,921 singletons) obtained from 454 pyrosequencing of the heterotrophic dinoflagellate Pfiesteria piscicida. About 597 contigs from P. piscicida showed significant homology (E-value
Genome Data Mining and Soil Survey for the Novel Group 5 [NiFe]-Hydrogenase To Explore the Diversity and Ecological Importance of Presumptive High-Affinity H2-Oxidizing Bacteria

PubMed Central

Constant, Philippe; Chowdhury, Soumitra Paul; Hesse, Laura; Pratscher, Jennifer; Conrad, Ralf

2011-01-01

Streptomyces soil isolates exhibiting the unique ability to oxidize atmospheric H2 possess genes specifying a putative high-affinity [NiFe]-hydrogenase. This study was undertaken to explore the taxonomic diversity and the ecological importance of this novel functional group. We propose to designate the genes encoding the small and large subunits of the putative high-affinity hydrogenase hhyS and hhyL, respectively. Genome data mining revealed that the hhyL gene is unevenly distributed in the phyla Actinobacteria, Proteobacteria, Chloroflexi, and Acidobacteria. The hhyL gene sequences comprised a phylogenetically distinct group, namely, the group 5 [NiFe]-hydrogenase genes. The presumptive high-affinity H2-oxidizing bacteria constituting group 5 were shown to possess a hydrogenase gene cluster, including the genes encoding auxiliary and structural components of the enzyme and four additional open reading frames (ORFs) of unknown function. A soil survey confirmed that both high-affinity H2 oxidation activity and the hhyL gene are ubiquitous. A quantitative PCR assay revealed that soil contained 10^6 to 10^8 hhyL gene copies per gram (dry weight). Assuming one hhyL gene copy per genome, the abundance of presumptive high-affinity H2-oxidizing bacteria was higher than the maximal population size for which maintenance energy requirements would be fully supplied through the H2 oxidation activity measured in soil. Our data indicate that the abundance of the hhyL gene should not be taken as a reliable proxy for the uptake of atmospheric H2 by soil, because high-affinity H2 oxidation is a facultatively mixotrophic metabolism, and microorganisms harboring a nonfunctional group 5 [NiFe]-hydrogenase may occur. PMID:21742924
The Mouse Genome Database (MGD): facilitating mouse as a model for human biology and disease.

PubMed

Eppig, Janan T; Blake, Judith A; Bult, Carol J; Kadin, James A; Richardson, Joel E

2015-01-01

The Mouse Genome Database (MGD, http://www.informatics.jax.org) serves the international biomedical research community as the central resource for integrated genomic, genetic and biological data on the laboratory mouse. To facilitate use of mouse as a model in translational studies, MGD maintains a core of high-quality curated data and integrates experimentally and computationally generated data sets. MGD maintains a unified catalog of genes and genome features, including functional RNAs, QTL and phenotypic loci. MGD curates and provides functional and phenotype annotations for mouse genes using the Gene Ontology and Mammalian Phenotype Ontology. MGD integrates phenotype data and associates mouse genotypes to human diseases, providing critical mouse-human relationships and access to repositories holding mouse models. MGD is the authoritative source of nomenclature for genes, genome features, alleles and strains following guidelines of the International Committee on Standardized Genetic Nomenclature for Mice. A new addition to MGD, the Human-Mouse: Disease Connection, allows users to explore gene-phenotype-disease relationships between human and mouse. MGD has also updated search paradigms for phenotypic allele attributes, incorporated incidental mutation data, added a module for display and exploration of genes and microRNA interactions and adopted the JBrowse genome browser. MGD resources are freely available to the scientific community. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Single-nucleotide polymorphisms and haplotypes of non-coding area in the CP gene are correlated with Parkinson's disease.

PubMed

Zhao, Na; Xiao, Jianqiu; Zheng, Zhiyong; Fei, Guoqiang; Zhang, Feng; Jin, Lirong; Zhong, Chunjiu

2015-04-01

Our previous studies have demonstrated that ceruloplasmin (CP) dysmetabolism is correlated with Parkinson's disease (PD). However, the causes of decreased serum CP levels in PD patients remain to be clarified. This study aimed to explore the potential association between genetic variants of the CP gene and PD. Clinical features, serum CP levels, and the CP gene (both promoter and coding regions) were analyzed in 60 PD patients and 50 controls. A luciferase reporter system was used to investigate the function of promoter single-nucleotide polymorphisms (SNPs). High-density comparative genomic hybridization microarrays were also used to detect large-scale copy-number variations in CP and an additional 47 genes involved in PD and/or copper/iron metabolism. The frequencies of eight SNPs (one intronic SNP and seven promoter SNPs of the CP gene) and their haplotypes were significantly different between PD patients, especially those with lowered serum CP levels, and controls. However, the luciferase reporter system revealed no significant effect of the risk haplotype on promoter activity of the CP gene. Neither these SNPs nor their haplotypes were correlated with the Hoehn and Yahr staging of PD. The results of this study suggest that common genetic variants of CP are associated with PD and further investigation is needed to explore their functions in PD.
Evolutionary Characteristics of Missing Proteins: Insights into the Evolution of Human Chromosomes Related to Missing-Protein-Encoding Genes.

PubMed

Xu, Aishi; Li, Guang; Yang, Dong; Wu, Songfeng; Ouyang, Hongsheng; Xu, Ping; He, Fuchu

2015-12-04

Although the "missing protein" is a temporary concept in C-HPP, the biological information for their "missing" could be an important clue in evolutionary studies. Here we classified missing-protein-encoding genes into two groups, the genes encoding PE2 proteins (with transcript evidence) and the genes encoding PE3/4 proteins (with no transcript evidence). These missing-protein-encoding genes distribute unevenly among different chromosomes, chromosomal regions, or gene clusters. In the view of evolutionary features, PE3/4 genes tend to be young, spreading at the nonhomology chromosomal regions and evolving at higher rates. Interestingly, there is a higher proportion of singletons in PE3/4 genes than the proportion of singletons in all genes (background) and OTCSGs (organ, tissue, cell type-specific genes). More importantly, most of the paralogous PE3/4 genes belong to the newly duplicated members of the paralogous gene groups, which mainly contribute to special biological functions, such as "smell perception". These functions are heavily restricted into specific type of cells, tissues, or specific developmental stages, acting as the new functional requirements that facilitated the emergence of the missing-protein-encoding genes during evolution. In addition, the criteria for the extremely special physical-chemical proteins were first set up based on the properties of PE2 proteins, and the evolutionary characteristics of those proteins were explored. Overall, the evolutionary analyses of missing-protein-encoding genes are expected to be highly instructive for proteomics and functional studies in the future.
dbCPG: A web resource for cancer predisposition genes

PubMed Central

Wei, Ran; Yao, Yao; Yang, Wu; Zheng, Chun-Hou; Zhao, Min; Xia, Junfeng

2016-01-01

Cancer predisposition genes (CPGs) are genes in which inherited mutations confer highly or moderately increased risks of developing cancer. Identification of these genes and understanding the biological mechanisms that underlie them is crucial for the prevention, early diagnosis, and optimized management of cancer. Over the past decades, great efforts have been made to identify CPGs through multiple strategies. However, information on these CPGs and their molecular functions is scattered. To address this issue and provide a comprehensive resource for researchers, we developed the Cancer Predisposition Gene Database (dbCPG, Database URL: http://bioinfo.ahu.edu.cn:8080/dbCPG/index.jsp), the first literature-based gene resource for exploring human CPGs. It contains 827 human (724 protein-coding, 23 non-coding, and 80 unknown type genes), 637 rat, and 658 mouse CPGs. Furthermore, data mining was performed to gain insights into the CPG data, including functional annotation, gene prioritization, network analysis of prioritized genes and overlap analysis across multiple cancer types. A user-friendly web interface with multiple browse, search, and upload functions was also developed to facilitate access to the latest information on CPGs. Taken together, the dbCPG database provides a comprehensive data resource for further studies of cancer predisposition genes. PMID:27192119
MaGnET: Malaria Genome Exploration Tool

PubMed Central

Sharman, Joanna L.; Gerloff, Dietlind L.

2013-01-01

Summary: The Malaria Genome Exploration Tool (MaGnET) is a software tool enabling intuitive ‘exploration-style’ visualization of functional genomics data relating to the malaria parasite, Plasmodium falciparum. MaGnET provides innovative integrated graphic displays for different datasets, including genomic location of genes, mRNA expression data, protein–protein interactions and more. Any selection of genes to explore made by the user is easily carried over between the different viewers for different datasets, and can be changed interactively at any point (without returning to a search). Availability and Implementation: Free online use (Java Web Start) or download (Java application archive and MySQL database; requires local MySQL installation) at http://malariagenomeexplorer.org Contact: joanna.sharman@ed.ac.uk or dgerloff@ffame.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23894142
Integration of somatic mutation, expression and functional data reveals potential driver genes predictive of breast cancer survival.

PubMed

Suo, Chen; Hrydziuszko, Olga; Lee, Donghwan; Pramana, Setia; Saputra, Dhany; Joshi, Himanshu; Calza, Stefano; Pawitan, Yudi

2015-08-15

Genome and transcriptome analyses can be used to explore cancers comprehensively, and it is increasingly common to have multiple omics data measured from each individual. Furthermore, there are rich functional data such as predicted impact of mutations on protein coding and gene/protein networks. However, integration of the complex information across the different omics and functional data is still challenging. Clinical validation, particularly based on patient outcomes such as survival, is important for assessing the relevance of the integrated information and for comparing different procedures. An analysis pipeline is built for integrating genomic and transcriptomic alterations from whole-exome and RNA sequence data and functional data from protein function prediction and gene interaction networks. The method accumulates evidence for the functional implications of mutated potential driver genes found within and across patients. A driver-gene score (DGscore) is developed to capture the cumulative effect of such genes. To contribute to the score, a gene has to be frequently mutated, with high or moderate mutational impact at protein level, exhibiting an extreme expression and functionally linked to many differentially expressed neighbors in the functional gene network. The pipeline is applied to 60 matched tumor and normal samples of the same patient from The Cancer Genome Atlas breast-cancer project. In clinical validation, patients with high DGscores have worse survival than those with low scores (P = 0.001). Furthermore, the DGscore outperforms the established expression-based signatures MammaPrint and PAM50 in predicting patient survival. In conclusion, integration of mutation, expression and functional data allows identification of clinically relevant potential driver genes in cancer. The documented pipeline including annotated sample scripts can be found in http://fafner.meb.ki.se/biostatwiki/driver-genes/. 
Contact: yudi.pawitan@ki.se. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
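The four conditions a gene must meet to contribute to the driver-gene score can be sketched as a simple per-patient count. This is only an illustration under stated assumptions, not the published DGscore pipeline: the record fields, the thresholds (`MIN_FREQUENCY`, `MIN_ABS_Z`, `MIN_DE_NEIGHBORS`) and the gene names are all hypothetical, and the actual scoring in the paper may weight evidence differently.

```python
from dataclasses import dataclass


@dataclass
class GeneEvidence:
    gene: str
    mutation_frequency: float  # fraction of cohort patients with this gene mutated
    impact: str                # predicted protein-level impact: "high", "moderate" or "low"
    expression_z: float        # expression z-score of the gene in this patient
    de_neighbors: int          # differentially expressed neighbours in the gene network


# Hypothetical cut-offs, chosen only for this sketch.
MIN_FREQUENCY = 0.05
MIN_ABS_Z = 2.0
MIN_DE_NEIGHBORS = 5


def is_potential_driver(ev):
    """True if the gene meets all four criteria named in the abstract."""
    return (
        ev.mutation_frequency >= MIN_FREQUENCY       # frequently mutated
        and ev.impact in {"high", "moderate"}        # high or moderate impact
        and abs(ev.expression_z) >= MIN_ABS_Z        # extreme expression
        and ev.de_neighbors >= MIN_DE_NEIGHBORS      # many DE neighbours
    )


def driver_gene_score(mutated_genes):
    """Count the mutated genes of one patient that qualify as potential drivers."""
    return sum(1 for ev in mutated_genes if is_potential_driver(ev))


# Made-up evidence for one patient.
patient = [
    GeneEvidence("GENE_A", 0.12, "high", 2.8, 9),
    GeneEvidence("GENE_B", 0.01, "high", 3.1, 12),      # fails: rarely mutated
    GeneEvidence("GENE_C", 0.20, "low", -2.5, 7),       # fails: low impact
    GeneEvidence("GENE_D", 0.08, "moderate", -2.2, 6),
]
score = driver_gene_score(patient)
print(score)
```

Patients could then be split into high-score and low-score groups for a survival comparison, which is the kind of clinical validation the abstract describes.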
Bioinformatics for spermatogenesis: annotation of male reproduction based on proteomics

PubMed Central

Zhou, Tao; Zhou, Zuo-Min; Guo, Xue-Jiang

2013-01-01

Proteomics strategies have been widely used in the field of male reproduction, both in basic and clinical research. Bioinformatics methods are indispensable in proteomics-based studies and are used for data presentation, database construction and functional annotation. In the present review, we focus on the functional annotation of gene lists obtained through qualitative or quantitative methods, summarizing the common and male reproduction specialized proteomics databases. We introduce several integrated tools used to find the hidden biological significance from the data obtained. We further describe in detail the information on male reproduction derived from Gene Ontology analyses, pathway analyses and biomedical analyses. We provide an overview of bioinformatics annotations in spermatogenesis, from gene function to biological function and from biological function to clinical application. On the basis of recently published proteomics studies and associated data, we show that bioinformatics methods help us to discover drug targets for sperm motility and to scan for cancer-testis genes. In addition, we summarize the online resources relevant to male reproduction research for the exploration of the regulation of spermatogenesis. PMID:23852026
Evaluating Functional Annotations of Enzymes Using the Gene Ontology.

PubMed

Holliday, Gemma L; Davidson, Rebecca; Akiva, Eyal; Babbitt, Patricia C

2017-01-01

The Gene Ontology (GO) (Ashburner et al., Nat Genet 25(1):25-29, 2000) is a powerful tool in the informatics arsenal of methods for evaluating annotations in a protein dataset. Identifying the nearest well-annotated homologue of a protein of interest, predicting where misannotation has occurred, and knowing how confident you can be in the annotations assigned to those proteins are all critical. In this chapter we explore what makes an enzyme unique and how we can use GO to infer aspects of protein function based on sequence similarity. These can range from identification of misannotation or other errors in a predicted function to accurate function prediction for an enzyme of entirely unknown function. Although GO annotation applies to any gene product, we focus here on describing our approach for hierarchical classification of enzymes in the Structure-Function Linkage Database (SFLD) (Akiva et al., Nucleic Acids Res 42(Database issue):D521-530, 2014) as a guide for informed utilisation of annotation transfer based on GO terms.
IMG-ABC: new features for bacterial secondary metabolism analysis and targeted biosynthetic gene cluster discovery in thousands of microbial genomes

DOE PAGES

Hadjithomas, Michalis; Chen, I-Min A.; Chu, Ken; ...

2016-11-29

Secondary metabolites produced by microbes have diverse biological functions, which makes them a great potential source of biotechnologically relevant compounds with antimicrobial, anti-cancer and other activities. The proteins needed to synthesize these natural products are often encoded by clusters of co-located genes called biosynthetic gene clusters (BCs). In order to advance the exploration of microbial secondary metabolism, we developed the largest publically available database of experimentally verified and predicted BCs, the Integrated Microbial Genomes Atlas of Biosynthetic gene Clusters (IMG-ABC) (https://img.jgi.doe.gov/abc/). Here, we describe an update of IMG-ABC, which includes ClusterScout, a tool for targeted identification of custom biosynthetic gene clusters across 40 000 isolate microbial genomes, and a new search capability to query more than 700 000 BCs from isolate genomes for clusters with similar Pfam composition. Additional features enable fast exploration and analysis of BCs through two new interactive visualization features, a BC function heatmap and a BC similarity network graph. These new tools and features add to the value of IMG-ABC's vast body of BC data, facilitating their in-depth analysis and accelerating secondary metabolite discovery.
Genes determining the severity of cerebral palsy: the role of single nucleotide polymorphisms on the amount and structure of apolipoprotein E

PubMed Central

Lien, Espen; Andersen, Guro; Bao, Yongde; Gordish-Dressman, Heather; Skranes, Jon S.; Blackman, James A.; Vik, Torstein

2015-01-01

Aim: Apolipoprotein E (apoE) influences repair and other processes in the brain and the apoE4 variant is a risk factor for Alzheimer's disease and for prolonged recovery following traumatic brain injury. We previously reported that specific single nucleotide polymorphisms in the APOE or TOMM40 genes affecting the structure and production of apoE were associated with epilepsy, more impaired hand function and gastrostomy tube feeding in children with cerebral palsy (CP). This study explored how various combinations of the same polymorphisms may affect these clinical manifestations. Methods: Successful DNA analyses of APOE and TOMM40 were carried out on 227 children. The CP Register of Norway provided details of gross and fine motor function, epilepsy and gastrostomy tube feeding. Possible associations between these clinical manifestations and various combinations of the APOEε2, ε3 or ε4 alleles and of the rs59007384 polymorphism in the TOMM40 gene were explored. Results: Epilepsy, impaired fine motor function and gastrostomy tube feeding were less common in children carrying the combination of rs59007384 GG and APOEε2 or ε3 than in children with other combinations. Conclusion: Our findings suggest that specific combinations of genes influence the structure and production of apoE differently and affect the clinical manifestations of CP. PMID:25703783
Genotet: An Interactive Web-based Visual Exploration Framework to Support Validation of Gene Regulatory Networks.

PubMed

Yu, Bowen; Doraiswamy, Harish; Chen, Xi; Miraldi, Emily; Arrieta-Ortiz, Mario Luis; Hafemeister, Christoph; Madar, Aviv; Bonneau, Richard; Silva, Cláudio T

2014-12-01

Elucidation of transcriptional regulatory networks (TRNs) is a fundamental goal in biology, and transcription factors (TFs), proteins that specifically bind to gene promoter and enhancer regions to alter target gene expression patterns, are among the most important components of TRNs. Advances in genomic technologies and in computational biology have led to multiple large regulatory network models (directed networks), each with a large corpus of supporting data and gene annotation. There are multiple possible biological motivations for exploring large regulatory network models, including validating TF-target gene relationships, identifying co-regulation patterns, and exploring the coordination of cell processes in response to changes in cell state or environment. Here we focus on queries aimed at validating regulatory network models, and on coordinating visualization of primary data and directed weighted gene regulatory networks. The large size of both the network models and the primary data can make such coordinated queries cumbersome with existing tools and, in particular, inhibits the sharing of results between collaborators. In this work, we develop and demonstrate a web-based framework for coordinating visualization and exploration of expression data (RNA-seq, microarray), network models and gene-binding data (ChIP-seq). Using specialized data structures and multiple coordinated views, we design an efficient querying model to support interactive analysis of the data. Finally, we show the effectiveness of our framework through case studies for the mouse immune system (a dataset focused on a subset of key cellular functions) and a model bacterium (a small genome with high data-completeness).
MARQ: an online tool to mine GEO for experiments with similar or opposite gene expression signatures.

PubMed

Vazquez, Miguel; Nogales-Cadenas, Ruben; Arroyo, Javier; Botías, Pedro; García, Raul; Carazo, Jose M; Tirado, Francisco; Pascual-Montano, Alberto; Carmona-Saez, Pedro

2010-07-01

The enormous amount of data available in public gene expression repositories such as Gene Expression Omnibus (GEO) offers an inestimable resource to explore gene expression programs across several organisms and conditions. This information can be used to discover experiments that induce similar or opposite gene expression patterns to a given query, which in turn may lead to the discovery of new relationships among diseases, drugs or pathways, as well as the generation of new hypotheses. In this work, we present MARQ, a web-based application that allows researchers to compare a query set of genes, e.g. a set of over- and under-expressed genes, against a signature database built from GEO datasets for different organisms and platforms. MARQ offers an easy-to-use and integrated environment to mine GEO, in order to identify conditions that induce similar or opposite gene expression patterns to a given experimental condition. MARQ also includes additional functionalities for the exploration of the results, including a meta-analysis pipeline to find genes that are differentially expressed across different experiments. The application is freely available at http://marq.dacya.ucm.es.
GDRMS: a system for automatic extraction of the disease-centre relation

NASA Astrophysics Data System (ADS)

Yang, Ronggen; Zhang, Yue; Gong, Lejun

2012-01-01

With the rapid growth of the biomedical literature, the deluge of new articles is leading to information overload. Extracting the available knowledge from this huge body of literature has become a major challenge. GDRMS is a tool that uses text mining to extract disease-gene and gene-gene relationships from the biomedical literature. It is a rule-based system that also provides disease-centre network visualization, constructs a disease-gene database, and offers a gene engine for understanding the function of a gene. The main aim of GDRMS is to give the research community studying disease etiology a way to explore the relationship between disease and gene.
Ossification of the posterior longitudinal ligament related genes identification using microarray gene expression profiling and bioinformatics analysis.

PubMed

He, Hailong; Mao, Lingzhou; Xu, Peng; Xi, Yanhai; Xu, Ning; Xue, Mingtao; Yu, Jiangming; Ye, Xiaojian

2014-01-10

Ossification of the posterior longitudinal ligament (OPLL) is a disease associated with physical disability and neurological disorders. The objective of this study was to explore the differentially expressed genes (DEGs) in OPLL patient ligament cells and to identify target sites for the clinical prevention and treatment of OPLL. The gene expression dataset GSE5464 was downloaded from the Gene Expression Omnibus; DEGs were then screened with the limma package in R, and the functions and pathways altered in OPLL cells relative to normal cells were identified with DAVID (The Database for Annotation, Visualization and Integrated Discovery); finally, an interaction network of DEGs was constructed with STRING. A total of 1536 DEGs were screened, with 31 down-regulated and 1505 up-regulated genes. The response to wounding function and the Toll-like receptor signaling pathway may be involved in the development of OPLL. Genes such as PDGFB and PRDX2 may be involved in OPLL through the response to wounding function. Genes enriched in the Toll-like receptor signaling pathway, such as TLR1, TLR5, and TLR7, may be involved in spinal cord injury in OPLL. PIK3R1 was the hub gene in the DEG network, with the highest degree; INSR was one of the genes most closely related to it. OPLL-related genes screened by microarray gene expression profiling and bioinformatics analysis may be helpful for elucidating the mechanism of OPLL. © 2013.

Microbial population index and community structure in saline-alkaline soil using gene targeted metagenomics.

PubMed

Keshri, Jitendra; Mishra, Avinash; Jha, Bhavanath

2013-03-30

Population indices of bacteria and archaea were investigated from saline-alkaline soil and a possible microbe-environment pattern was established using gene targeted metagenomics. Clone libraries were constructed using 16S rRNA and functional gene(s) involved in carbon fixation (cbbL), nitrogen fixation (nifH), ammonia oxidation (amoA) and sulfur metabolism (apsA). Molecular phylogeny revealed the dominance of Actinobacteria, Firmicutes and Proteobacteria along with archaeal members of Halobacteraceae. The library consisted of novel bacterial (20%) and archaeal (38%) genera showing ≤95% similarity to previously retrieved sequences. Phylogenetic analysis indicated the ability of the inhabitants to survive under stress conditions. The 16S rRNA gene libraries contained novel gene sequences and were distantly homologous with cultured bacteria. Functional gene libraries were found to be unique, and most of the clones were distantly related to Proteobacteria, while clones of the nifH gene library also showed homology with Cyanobacteria and Firmicutes. Quantitative real-time PCR showed that bacterial abundance was two orders of magnitude higher than archaeal abundance. Gene quantification indicated the size of the functional guilds harboring the relevant key genes. The study provides insights into the microbial ecology and the different metabolic interactions occurring in saline-alkaline soil, which possesses phylogenetically diverse groups of bacteria and archaea that may be explored further for gene cataloging and metabolic profiling. Copyright © 2012 Elsevier GmbH. All rights reserved.
A Systematic Analysis of Candidate Genes Associated with Nicotine Addiction

PubMed Central

Liu, Meng; Li, Xia; Fan, Rui; Liu, Xinhua; Wang, Ju

2015-01-01

Nicotine, as the major psychoactive component of tobacco, has broad physiological effects within the central nervous system, but our understanding of the molecular mechanism underlying its neuronal effects remains incomplete. In this study, we performed a systematic analysis of a set of nicotine addiction-related genes (NAGenes) to explore their characteristics at the network level. We found that NAGenes tended to have a more moderate degree and a weaker clustering coefficient, and to be less central in the network, compared to alcohol addiction-related genes or cancer genes. Further, clustering of these genes resulted in six clusters with themes in synaptic transmission, signal transduction, metabolic process, and apoptosis, which provided an intuitive view of the major molecular functions of the genes. Moreover, functional enrichment analysis revealed that biological processes related to neurodevelopment, neurotransmission activity, and metabolism were involved in nicotine addiction. In summary, by analyzing the overall characteristics of the nicotine addiction-related genes, this study provided valuable information for understanding the molecular mechanisms underlying nicotine addiction. PMID:26097843
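The two network measures named in the abstract above, node degree and clustering coefficient, can be illustrated with a minimal sketch. It assumes an undirected, unweighted interaction network given as an edge list; the gene names G1 to G5 and the edges are hypothetical and are not taken from the study, and the study's own network and software are not described in this record.

```python
from itertools import combinations


def build_graph(edges):
    """Build an undirected adjacency map {node: set(neighbors)} from an edge list."""
    graph = {}
    for a, b in edges:
        if a == b:
            continue  # ignore self-loops
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def degree(graph, node):
    """Number of direct interaction partners of a node."""
    return len(graph.get(node, ()))


def clustering_coefficient(graph, node):
    """Fraction of a node's neighbor pairs that are themselves connected."""
    neighbors = graph.get(node, set())
    k = len(neighbors)
    if k < 2:
        return 0.0  # undefined for fewer than two neighbors; reported as 0 here
    links = sum(1 for u, v in combinations(neighbors, 2) if v in graph[u])
    return 2.0 * links / (k * (k - 1))


# Hypothetical toy network, for illustration only.
toy_edges = [("G1", "G2"), ("G1", "G3"), ("G2", "G3"), ("G3", "G4"), ("G4", "G5")]
toy_graph = build_graph(toy_edges)

for gene in sorted(toy_graph):
    print(gene, degree(toy_graph, gene), round(clustering_coefficient(toy_graph, gene), 3))
```

Averaging these per-gene values over a gene set and over a comparison set would give the kind of group-level contrast the abstract reports, although the statistical test used there is not stated in this record.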
Applying gene regulatory network logic to the evolution of social behavior.

PubMed

Baran, Nicole M; McGrath, Patrick T; Streelman, J Todd

2017-06-06

Animal behavior is ultimately the product of gene regulatory networks (GRNs) for brain development and neural networks for brain function. The GRN approach has advanced the fields of genomics and development, and we identify organizational similarities between networks of genes that build the brain and networks of neurons that encode brain function. In this perspective, we engage the analogy between developmental networks and neural networks, exploring the advantages of using GRN logic to study behavior. Applying the GRN approach to the brain and behavior provides a quantitative and manipulative framework for discovery. We illustrate features of this framework using the example of social behavior and the neural circuitry of aggression.
Analysis of the Human Prostate-Specific Proteome Defined by Transcriptomics and Antibody-Based Profiling Identifies TMEM79 and ACOXL as Two Putative, Diagnostic Markers in Prostate Cancer

PubMed Central

O'Hurley, Gillian; Busch, Christer; Fagerberg, Linn; Hallström, Björn M.; Stadler, Charlotte; Tolf, Anna; Lundberg, Emma; Schwenk, Jochen M.; Jirström, Karin; Bjartell, Anders; Gallagher, William M.; Uhlén, Mathias; Pontén, Fredrik

2015-01-01

To better understand prostate function and disease, it is important to define and explore the molecular constituents that signify the prostate gland. The aim of this study was to define the prostate specific transcriptome and proteome, in comparison to 26 other human tissues. Deep sequencing of mRNA (RNA-seq) and immunohistochemistry-based protein profiling were combined to identify prostate specific gene expression patterns and to explore tissue biomarkers for potential clinical use in prostate cancer diagnostics. We identified 203 genes with elevated expression in the prostate, 22 of which showed more than five-fold higher expression levels compared to all other tissue types. In addition to previously well-known proteins we identified two poorly characterized proteins, TMEM79 and ACOXL, with potential to differentiate between benign and cancerous prostatic glands in tissue biopsies. In conclusion, we have applied a genome-wide analysis to identify the prostate specific proteome using transcriptomics and antibody-based protein profiling to identify genes with elevated expression in the prostate. Our data provides a starting point for further functional studies to explore the molecular repertoire of normal and diseased prostate including potential prostate cancer markers such as TMEM79 and ACOXL. PMID:26237329
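The "more than five-fold higher expression compared to all other tissue types" criterion in the abstract above can be sketched as a simple filter. This is only an illustration under stated assumptions: expression is given as one value per gene per tissue, the function name `tissue_elevated_genes` is hypothetical, and the gene names and values are invented toy data rather than results from the study.

```python
def tissue_elevated_genes(expression, target, fold=5.0):
    """Return genes whose level in `target` exceeds `fold` times the
    highest level observed in any other tissue.

    expression: {gene: {tissue: expression level}}
    """
    hits = []
    for gene, by_tissue in expression.items():
        target_level = by_tissue.get(target, 0.0)
        others = [level for tissue, level in by_tissue.items() if tissue != target]
        if not others or target_level <= 0:
            continue  # nothing to compare against, or not expressed in target
        if target_level > fold * max(others):
            hits.append(gene)
    return sorted(hits)


# Hypothetical toy values, for illustration only.
toy_expression = {
    "GENE_A": {"prostate": 60.0, "liver": 10.0, "lung": 5.0},
    "GENE_B": {"prostate": 40.0, "liver": 10.0, "lung": 2.0},
    "GENE_C": {"prostate": 3.0, "liver": 30.0, "lung": 1.0},
}

print(tissue_elevated_genes(toy_expression, "prostate"))
```

Comparing against the maximum of the other tissues, rather than their mean, matches the wording "compared to all other tissue types"; a mean-based comparison would be a looser criterion.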
PanACEA: a bioinformatics tool for the exploration and visualization of bacterial pan-chromosomes.

PubMed

Clarke, Thomas H; Brinkac, Lauren M; Inman, Jason M; Sutton, Granger; Fouts, Derrick E

2018-06-27

Bacterial pan-genomes, comprised of conserved and variable genes across multiple sequenced bacterial genomes, allow for identification of genomic regions that are phylogenetically discriminating or functionally important. Pan-genomes consist of large amounts of data, which can restrict researchers' ability to locate and analyze these regions. Multiple software packages are available to visualize pan-genomes, but currently their ability to address these concerns is limited by using only pre-computed data sets, prioritizing core over variable gene clusters, or not accounting for pan-chromosome positioning in the viewer. We introduce PanACEA (Pan-genome Atlas with Chromosome Explorer and Analyzer), which utilizes locally-computed interactive web-pages to view ordered pan-genome data. It consists of multi-tiered, hierarchical display pages that extend from pan-chromosomes to both core and variable regions to single genes. Regions and genes are functionally annotated to allow for rapid searching and visual identification of regions of interest, with the option that user-supplied genomic phylogenies and metadata can be incorporated. PanACEA's memory and time requirements are within the capacities of standard laptops. The capability of PanACEA as a research tool is demonstrated by highlighting a variable region important in differentiating strains of Enterobacter hormaechei. PanACEA can rapidly translate the results of pan-chromosome programs into an intuitive and interactive visual representation. It will empower researchers to visually explore and identify regions of the pan-chromosome that are most biologically interesting, and to obtain publication quality images of these regions.
Phylogeny and phylogeography of functional genes shared among seven terrestrial subsurface metagenomes reveal N-cycling and microbial evolutionary relationships

PubMed Central

Lau, Maggie C. Y.; Cameron, Connor; Magnabosco, Cara; Brown, C. Titus; Schilkey, Faye; Grim, Sharon; Hendrickson, Sarah; Pullin, Michael; Sherwood Lollar, Barbara; van Heerden, Esta; Kieft, Thomas L.; Onstott, Tullis C.

2014-01-01

Comparative studies on community phylogenetics and phylogeography of microorganisms living in extreme environments are rare. Terrestrial subsurface habitats are valuable for studying microbial biogeographical patterns due to their isolation and the restricted dispersal mechanisms. Since the taxonomic identity of a microorganism does not always correspond well with its functional role in a particular community, the use of taxonomic assignments or patterns may give limited inference on how microbial functions are affected by historical, geographical and environmental factors. With seven metagenomic libraries generated from fracture water samples collected from five South African mines, this study was carried out to (1) screen for ubiquitous functions or pathways of biogeochemical cycling of CH4, S, and N; (2) to characterize the biodiversity represented by the common functional genes; (3) to investigate the subsurface biogeography as revealed by this subset of genes; and (4) to explore the possibility of using metagenomic data for evolutionary study. The ubiquitous functional genes are NarV, NPD, PAPS reductase, NifH, NifD, NifK, NifE, and NifN genes. Although these eight common functional genes were taxonomically and phylogenetically diverse and distinct from each other, the dissimilarity between samples did not correlate strongly with geographical or environmental parameters or residence time of the water. Por genes homologous to those of Thermodesulfovibrio yellowstonii detected in all metagenomes were deep lineages of Nitrospirae, suggesting that subsurface habitats have preserved ancestral genetic signatures that inform the study of the origin and evolution of prokaryotes. PMID:25400621
Reverse genetics: Its origins and prospects

DOE Office of Scientific and Technical Information (OSTI.GOV)

Berg, P.

1991-04-01

The nucleotide sequence of a gene and its flanking segments alone will not tell us how its expression is regulated during development and differentiation, or in response to environmental changes. To comprehend the physiological significance of the molecular details requires biological analysis. Recombinant DNA techniques provide a powerful experimental approach. A strategy termed 'reverse genetics' utilizes the analysis of the activities of mutant and normal genes and experimentally constructed mutants to explore the relationship between gene structure and function, thereby helping elucidate the relationship between genotype and phenotype.
Dynamic gene expression analysis in a H1N1 influenza virus mouse pneumonia model.

PubMed

Bao, Yanyan; Gao, Yingjie; Shi, Yujing; Cui, Xiaolan

2017-06-01

H1N1, a major pathogenic subtype of influenza A virus, causes a respiratory infection in humans and livestock that can range from a mild infection to more severe pneumonia associated with acute respiratory distress syndrome. Understanding the dynamic changes in the genome and the related functional changes induced by H1N1 influenza virus infection is essential to elucidating the pathogenesis of this virus and thereby determining strategies to prevent future outbreaks. In this study, we filtered the significantly expressed genes in mouse pneumonia using mRNA microarray analysis. Using STC analysis, seven significant gene clusters were revealed, and using STC-GO analysis, we explored the significant functions of these seven gene clusters. The results revealed GOs related to H1N1 virus-induced inflammatory and immune functions, including innate immune response, inflammatory response, specific immune response, and cellular response to interferon-beta. Furthermore, the dynamic regulation relationships of the key genes in mouse pneumonia were revealed by dynamic gene network analysis, and the most important genes were filtered, including Dhx58, Cxcl10, Cxcl11, Zbp1, Ifit1, Ifih1, Trim25, Mx2, Oas2, Cd274, Irgm1, and Irf7. These results suggested that during mouse pneumonia, changes in the expression of gene clusters and the complex interactions among genes lead to significant changes in function. Dynamic gene expression analysis revealed key genes that performed important functions. These results are a prelude to advancements in mouse H1N1 influenza virus infection biology, as well as the use of mice as a model organism for human H1N1 influenza virus infection studies.
Diseases and Molecular Diagnostics: A Step Closer to Precision Medicine.

PubMed

Dwivedi, Shailendra; Purohit, Purvi; Misra, Radhieka; Pareek, Puneet; Goel, Apul; Khattri, Sanjay; Pant, Kamlesh Kumar; Misra, Sanjeev; Sharma, Praveen

2017-10-01

The advent of molecular technologies, together with a multidisciplinary interplay of several fields, led to the development of genomics, which concentrates on the detection of pathogenic events at the genome level. Structural and functional genomics approaches have now pinpointed the technical challenges in exploring disease-related genes, recognizing their structural alterations, and elucidating gene function. Various promising technologies and diagnostic applications of structural genomics are currently building a large database of disease genes, genetic alterations, etc., by mutation scanning and DNA chip technology. Functional genomics is also exploring expression genetics (hybridization-, PCR- and sequence-based technologies), two-hybrid technology, and next generation sequencing with bioinformatics and computational biology. Advances in microarray "chip" technology have allowed the parallel analysis of gene expression patterns of thousands of genes simultaneously. Sequence information collected from the genomes of many individuals is leading to the rapid discovery of single nucleotide polymorphisms (SNPs). Further advances in genetic engineering have also revolutionized immunoassay biotechnology via engineering of antibody-encoding genes and phage display technology. Biotechnology plays an important role in the development of diagnostic assays in response to an outbreak or critical disease response need. However, there is also a need to pinpoint various obstacles and issues related to the commercialization and widespread dispersal of genetic knowledge derived from the exploitation of the biotechnology industry and the development and marketing of diagnostic services. Implementation of genetic criteria for patient selection and individual assessment of the risks and benefits of treatment emerges as a major challenge to the pharmaceutical industry. Thus, this field is revolutionizing the current era and may open new vistas in disease management.
Identification of a deep intronic mutation in the COL6A2 gene by a novel custom oligonucleotide CGH array designed to explore allelic and genetic heterogeneity in collagen VI-related myopathies

PubMed Central

2010-01-01

Background: Molecular characterization of collagen-VI related myopathies currently relies on standard sequencing, which yields a detection rate approximating 75-79% in Ullrich congenital muscular dystrophy (UCMD) and 60-65% in Bethlem myopathy (BM) patients, as PCR-based techniques tend to miss gross genomic rearrangements as well as copy number variations (CNVs) in both the coding sequence and intronic regions. Methods: We have designed a custom oligonucleotide CGH array in order to investigate the presence of CNVs in the coding and non-coding regions of COL6A1, A2, A3, A5 and A6 genes and a group of genes functionally related to collagen VI. A cohort of 12 patients with UCMD/BM negative at sequencing analysis and 2 subjects carrying a single COL6 mutation whose clinical phenotype was not explicable by inheritance were selected and the occurrence of allelic and genetic heterogeneity explored. Results: A deletion within intron 1A of the COL6A2 gene, occurring in compound heterozygosity with a small deletion in exon 28, previously detected by routine sequencing, was identified in a BM patient. RNA studies showed monoallelic transcription of the COL6A2 gene, thus elucidating the functional effect of the intronic deletion. No pathogenic mutations were identified in the remaining analyzed patients, either within COL6A genes, or in genes functionally related to collagen VI. Conclusions: Our custom CGH array may represent a useful complementary diagnostic tool, especially in recessive forms of the disease, when only one mutant allele is detected by standard sequencing. The intronic deletion we identified represents the first example of a pure intronic mutation in COL6A genes. PMID:20302629
Illuminating a plant’s tissue-specific metabolic diversity using computational metabolomics and information theory

PubMed Central

Li, Dapeng; Heiling, Sven; Baldwin, Ian T.

2016-01-01

Secondary metabolite diversity is considered an important fitness determinant for plants’ biotic and abiotic interactions in nature. This diversity can be examined in two dimensions. The first one considers metabolite diversity across plant species. A second way of looking at this diversity is by considering the tissue-specific localization of pathways underlying secondary metabolism within a plant. Although these cross-tissue metabolite variations are increasingly regarded as important readouts of tissue-level gene function and regulatory processes, they have rarely been comprehensively explored by nontargeted metabolomics. As such, important questions have remained superficially addressed. For instance, which tissues exhibit prevalent signatures of metabolic specialization? Reciprocally, which metabolites contribute most to this tissue specialization in contrast to those metabolites exhibiting housekeeping characteristics? Here, we explore tissue-level metabolic specialization in Nicotiana attenuata, an ecological model with rich secondary metabolism, by combining tissue-wide nontargeted mass spectral data acquisition, information theory analysis, and tandem MS (MS/MS) molecular networks. This analysis was conducted for two different methanolic extracts of 14 tissues and deconvoluted 895 nonredundant MS/MS spectra. Using information theory analysis, anthers were found to harbor the most specialized metabolome, and most unique metabolites of anthers and other tissues were annotated through MS/MS molecular networks. Tissue–metabolite association maps were used to predict tissue-specific gene functions. Predictions for the function of two UDP-glycosyltransferases in flavonoid metabolism were confirmed by virus-induced gene silencing. The present workflow allows biologists to amortize the vast amount of data produced by modern MS instrumentation in their quest to understand gene function. PMID:27821729
Natural parameter values for generalized gene adjacency.

PubMed

Yang, Zhenyu; Sankoff, David

2010-09-01

Given the gene orders in two modern genomes, it may be difficult to decide if some genes are close enough in both genomes to infer some ancestral proximity or some functional relationship. Current methods all depend on arbitrary parameters. We explore a class of gene proximity criteria and find two kinds of natural values for their parameters. One kind has to do with the parameter value where the expected information contained in two genomes about each other is maximized. The other kind of natural value has to do with parameter values beyond which all genes are clustered. We analyze these using combinatorial and probabilistic arguments as well as simulations.
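The idea of a proximity parameter "beyond which all genes are clustered" can be made concrete with a small sketch. The criterion used here is an assumption chosen for illustration and is not claimed to be the authors' exact definition: two genes are linked when their positions differ by at most `theta` in both gene orders, and clusters are the connected components of the resulting graph. The function name `proximity_clusters` and the toy gene orders are hypothetical.

```python
def proximity_clusters(order_a, order_b, theta):
    """Cluster genes that are within `theta` positions of each other in
    BOTH gene orders (assumed criterion), using union-find to collect
    the connected components."""
    pos_a = {gene: i for i, gene in enumerate(order_a)}
    pos_b = {gene: i for i, gene in enumerate(order_b)}
    genes = [g for g in order_a if g in pos_b]  # genes shared by both genomes
    parent = {g: g for g in genes}

    def find(g):
        # Follow parent links to the component root, compressing the path.
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    for i, g in enumerate(genes):
        for h in genes[i + 1:]:
            close_in_a = abs(pos_a[g] - pos_a[h]) <= theta
            close_in_b = abs(pos_b[g] - pos_b[h]) <= theta
            if close_in_a and close_in_b:
                parent[find(g)] = find(h)  # merge the two components

    clusters = {}
    for g in genes:
        clusters.setdefault(find(g), []).append(g)
    return sorted(sorted(members) for members in clusters.values())


# Hypothetical gene orders, for illustration only.
genome_a = ["a", "b", "c", "d", "e"]
genome_b = ["b", "a", "e", "d", "c"]

for theta in range(0, 5):
    print(theta, proximity_clusters(genome_a, genome_b, theta))
```

Sweeping `theta` upward shows the behavior the abstract alludes to: small values leave every gene isolated, intermediate values give several clusters, and past some threshold all genes merge into a single cluster.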
Transcriptional profiling by DDRT-PCR analysis reveals gene expression during seed development in Carya cathayensis Sarg.

PubMed

Huang, You-Jun; Zhou, Qin; Huang, Jian-Qin; Zeng, Yan-Ru; Wang, Zheng-Jia; Zhang, Qi-Xiang; Zhu, Yi-Hang; Shen, Chen; Zheng, Bing-Song

2015-06-01

Hickory (Carya cathayensis Sarg.) seed has one of the highest oil contents and is rich in polyunsaturated fatty acids (PUFAs); its kernel is beneficial to human health, particularly to brain function. A better understanding of the lipid accumulation mechanism would help to improve hickory production and seed quality. DDRT-PCR analysis was used to examine gene expression in hickory at thirteen time points during seed development. A total of 67 unique genes involved in seed development were obtained, and their expression patterns were further confirmed by semi-quantitative RT-PCR and real-time RT-PCR analysis. Of these, the genes with known functions were involved in signal transduction, amino acid metabolism, nuclear metabolism, fatty acid metabolism, protein metabolism, carbon metabolism, secondary metabolism, oxidation of fatty acids and stress response, suggesting that hickory undergoes a complex metabolic process during seed development. Furthermore, 6 genes related to fatty acid synthesis were explored, and their functions in seed development were further discussed. The data obtained here provide the first clues to guide further functional studies of fatty acid synthesis in hickory. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
Genome-wide association and network analysis of lung function in the Framingham Heart Study.

PubMed

Liao, Shu-Yi; Lin, Xihong; Christiani, David C

2014-09-01

Single nucleotide polymorphisms have been found to be associated with pulmonary function using genome-wide association studies. However, lung function is a complex trait that is likely to be influenced by multiple gene-gene interactions in addition to individual genes. Our goal is to build a cellular network to explore the relationship between pulmonary function and genotypes by combining SNP-level and network analyses using longitudinal lung function data from the Framingham Heart Study. We analyzed 2,698 genotyped participants from the Offspring cohort who had an average of 3.35 spirometry measurements per person over a mean length of 13 years. Repeated forced expiratory volume in one second (FEV1) and the ratio of FEV1 to forced vital capacity (FVC) were used as outcomes. Data were analyzed using linear mixed models for the association between lung function and alleles, accounting for the correlation among repeated measures over time within the same subject and for within-family correlation. Network analyses were performed using dmGWAS and validated with data from the Third Generation cohort. Analyses identified SMAD3, TGFBR2, CD44, CTGF, VCAN, CTNNB1, SCGB1A1, PDE4D, NRG1, EPHB1, and LYN as contributors to pulmonary function. Most of these genes were novel and had not been found previously using SNP-level analysis alone. These novel genes are involved in the transforming growth factor beta (TGFB)-SMAD pathway, the Wnt/beta-catenin pathway, and others. Therefore, combining SNP-level and network analyses using longitudinal lung function data is a useful alternative strategy to identify risk genes. © 2014 WILEY PERIODICALS, INC.
Clinical and multiple gene expression variables in survival analysis of breast cancer: Analysis with the hypertabastic survival model

PubMed Central

2012-01-01

Background: We explore the benefits of applying a new proportional hazard model to analyze survival of breast cancer patients. As a parametric model, the hypertabastic survival model offers a closer fit to experimental data than Cox regression, and furthermore provides explicit survival and hazard functions which can be used as additional tools in the survival analysis. In addition, one of our main concerns is utilization of multiple gene expression variables. Our analysis treats the important issue of interaction of different gene signatures in the survival analysis. Methods: The hypertabastic proportional hazards model was applied in survival analysis of breast cancer patients. This model was compared, using statistical measures of goodness of fit, with models based on the semi-parametric Cox proportional hazards model and the parametric log-logistic and Weibull models. The explicit functions for hazard and survival were then used to analyze the dynamic behavior of hazard and survival functions. Results: The hypertabastic model provided the best fit among all the models considered. Use of multiple gene expression variables also provided a considerable improvement in the goodness of fit of the model, as compared to use of only one. By utilizing the explicit survival and hazard functions provided by the model, we were able to determine the magnitude of the maximum rate of increase in hazard, and the maximum rate of decrease in survival, as well as the times when these occurred. We explore the influence of each gene expression variable on these extrema. Furthermore, in the cases of continuous gene expression variables, represented by a measure of correlation, we were able to investigate the dynamics with respect to changes in gene expression. Conclusions: We observed that use of three different gene signatures in the model provided a greater combined effect and allowed us to assess the relative importance of each in determination of outcome in this data set. These results point to the potential to combine gene signatures to a greater effect in cases where each gene signature represents some distinct aspect of the cancer biology. Furthermore, we conclude that the hypertabastic survival models can be an effective survival analysis tool for breast cancer patients. PMID:23241496
Characterization of a Crabs Claw Gene in basal eudicot species Epimedium sagittatum (Berberidaceae).

PubMed

Sun, Wei; Huang, Wenjun; Li, Zhineng; Lv, Haiyan; Huang, Hongwen; Wang, Ying

2013-01-08

The Crabs Claw (CRC) YABBY gene is required for regulating carpel development in angiosperms and has played an important role in nectary evolution during core eudicot speciation. The function or expression of CRC-like genes has been explored in two basal eudicots, Eschscholzia californica and Aquilegia formosa. To further investigate the function of CRC orthologous genes related to evolution of carpel and nectary development in basal eudicots, a CRC ortholog, EsCRC, was isolated and characterized from Epimedium sagittatum (Sieb. and Zucc.) Maxim. A phylogenetic analysis of EsCRC and previously identified CRC-like genes placed EsCRC within the basal eudicot lineage. Gene expression results suggest that EsCRC is involved in the development of sepals and carpels, but not nectaries. Phenotypic complementation of the Arabidopsis mutant crc-1 was achieved by constitutive expression of EsCRC. In addition, over-expression of EsCRC in Arabidopsis and tobacco gave rise to abaxially curled leaves. Transgenic results together with the gene expression analysis suggest that EsCRC may maintain a conserved function in carpel development and also play a novel role related to sepal formation. Absence of EsCRC and ElCRC expression in nectaries further indicates that nectary development in non-core eudicots is unrelated to expression of CRC-like genes.
Characterization of a Crabs Claw Gene in Basal Eudicot Species Epimedium sagittatum (Berberidaceae)

PubMed Central

Sun, Wei; Huang, Wenjun; Li, Zhineng; Lv, Haiyan; Huang, Hongwen; Wang, Ying

2013-01-01

The Crabs Claw (CRC) YABBY gene is required for regulating carpel development in angiosperms and has played an important role in nectary evolution during core eudicot speciation. The function or expression of CRC-like genes has been explored in two basal eudicots, Eschscholzia californica and Aquilegia formosa. To further investigate the function of CRC orthologous genes related to evolution of carpel and nectary development in basal eudicots, a CRC ortholog, EsCRC, was isolated and characterized from Epimedium sagittatum (Sieb. and Zucc.) Maxim. A phylogenetic analysis of EsCRC and previously identified CRC-like genes placed EsCRC within the basal eudicot lineage. Gene expression results suggest that EsCRC is involved in the development of sepals and carpels, but not nectaries. Phenotypic complementation of the Arabidopsis mutant crc-1 was achieved by constitutive expression of EsCRC. In addition, over-expression of EsCRC in Arabidopsis and tobacco gave rise to abaxially curled leaves. Transgenic results together with the gene expression analysis suggest that EsCRC may maintain a conserved function in carpel development and also play a novel role related to sepal formation. Absence of EsCRC and ElCRC expression in nectaries further indicates that nectary development in non-core eudicots is unrelated to expression of CRC-like genes. PMID:23299438
Genome-wide analysis of Glycine soja ubiquitin (UBQ) genes and functional analysis of GsUBQ10 in response to alkaline stress.

PubMed

Chen, Chao; Chen, Ranran; Wu, Shengyang; Zhu, Dan; Sun, Xiaoli; Liu, Beidong; Li, Qiang; Zhu, Yanming

2018-03-26

Ubiquitin is a highly conserved protein with multiple essential regulatory functions through the ubiquitin-proteasome system. Even though its functions in the ubiquitin-mediated protein degradation pathway are very well characterized, the functions of ubiquitin genes in regulating the alkaline stress response are not fully established. In this study, we identified 12 potential UBQ genes in the Glycine soja genome, and analyzed their evolutionary relationships, conserved domains and promoter cis-elements. We also explored the expression profiles of G. soja UBQ genes under alkaline stress, based on transcriptome sequencing. We found that the expression of GsUBQ10 was significantly induced by alkaline stress, and the function of GsUBQ10 was characterized using overexpression in transgenic alfalfa (Medicago sativa). Our results suggested that GsUBQ10 transgenic lines showed significantly improved alkaline tolerance in alfalfa. The GsUBQ10 transgenic lines showed lower relative membrane permeability, lower malondialdehyde content and higher catalase activity than the wild-type plants. This indicates that GsUBQ10 is involved in regulating reactive oxygen species accumulation under alkaline stress. Taken together, we identified a ubiquitin gene, GsUBQ10, from G. soja, which plays a positive role in responses to alkaline stress in alfalfa. This article is protected by copyright. All rights reserved.
Identifying arsenic trioxide (ATO) functions in leukemia cells by using time series gene expression profiles.

PubMed

Yang, Hong; Lin, Shan; Cui, Jingru

2014-02-10

Arsenic trioxide (ATO) is presently the most active single agent in the treatment of acute promyelocytic leukemia (APL). To explore the molecular mechanism of ATO in leukemia cells over time, we adopted a bioinformatics strategy to analyze the expression change patterns and the changes in transcription regulation modules of time series genes filtered from the Gene Expression Omnibus database (GSE24946). In total, we screened out 1847 time series genes for subsequent analysis. KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway enrichment analysis of these genes showed that oxidative phosphorylation and ribosome were the top 2 significantly enriched pathways. STEM software was employed to compare the changing patterns of gene expression with 50 assigned expression patterns. We screened out 7 significantly enriched patterns and 4 tendency charts of time series genes. The Gene Ontology results showed that the functions of time series genes were mainly distributed in profiles 41, 40, 39 and 38. Seven genes with a positive regulation of cell adhesion function were enriched in profile 40, and showed the same increase-then-decrease pattern as profile 40. The transcription module analysis showed that they were mainly involved in the oxidative phosphorylation and ribosome pathways. Overall, our data summarized the gene expression changes over time in ATO-treated K562-r cell lines and suggested that time series genes mainly regulated cell adhesion. Furthermore, our results may provide a theoretical molecular biology basis for treating acute promyelocytic leukemia. Copyright © 2013 Elsevier B.V. All rights reserved.
Core Promoter Functions in the Regulation of Gene Expression of Drosophila Dorsal Target Genes*

PubMed Central

Zehavi, Yonathan; Kuznetsov, Olga; Ovadia-Shochat, Avital; Juven-Gershon, Tamar

2014-01-01

Developmental processes are highly dependent on transcriptional regulation by RNA polymerase II. The RNA polymerase II core promoter is the ultimate target of a multitude of transcription factors that control transcription initiation. Core promoters consist of core promoter motifs, e.g. the initiator, TATA box, and the downstream core promoter element (DPE), which confer specific properties to the core promoter. Here, we explored the importance of core promoter functions in the dorsal-ventral developmental gene regulatory network. This network includes multiple genes that are activated by different nuclear concentrations of Dorsal, an NFκB homolog transcription factor, along the dorsal-ventral axis. We show that over two-thirds of Dorsal target genes contain DPE sequence motifs, which is significantly higher than the proportion of DPE-containing promoters in Drosophila genes. We demonstrate that multiple Dorsal target genes are evolutionarily conserved and functionally dependent on the DPE. Furthermore, we have analyzed the activation of key Dorsal target genes by Dorsal, as well as by another Rel family transcription factor, Relish, and the dependence of their activation on the DPE motif. Using hybrid enhancer-promoter constructs in Drosophila cells and embryo extracts, we have demonstrated that the core promoter composition is an important determinant of transcriptional activity of Dorsal target genes. Taken together, our results provide evidence for the importance of core promoter composition in the regulation of Dorsal target genes. PMID:24634215

BCOR regulates myeloid cell proliferation and differentiation

PubMed Central

Cao, Qi; Gearhart, Micah D.; Gery, Sigal; Shojaee, Seyedmehdi; Yang, Henry; Sun, Haibo; Lin, De-chen; Bai, Jing-wen; Mead, Monica; Zhao, Zhiqiang; Chen, Qi; Chien, Wen-wen; Alkan, Serhan; Alpermann, Tamara; Haferlach, Torsten; Müschen, Markus; Bardwell, Vivian J.; Koeffler, H. Phillip

2016-01-01

BCOR is a component of a variant Polycomb group repressive complex 1 (PRC1). Recently, we and others reported recurrent somatic BCOR loss-of-function mutations in myelodysplastic syndrome and acute myelogenous leukaemia (AML). However, the role of BCOR in normal hematopoiesis is largely unknown. Here, we explored the function of BCOR in myeloid cells using myeloid murine models with Bcor conditional loss-of-function or overexpression alleles. Bcor mutant bone marrow cells showed significantly higher proliferation and differentiation rates with upregulated expression of Hox genes. Mutation of Bcor reduced protein levels of RING1B, an H2A ubiquitin ligase subunit of PRC1 family complexes and reduced H2AK119ub upstream of upregulated HoxA genes. Global RNA expression profiling in murine cells and AML patient samples with BCOR loss-of-function mutation suggested that loss of BCOR expression is associated with enhanced cell proliferation and myeloid differentiation. Our results strongly suggest that BCOR plays an indispensable role in hematopoiesis by inhibiting myeloid cell proliferation and differentiation and offer a mechanistic explanation for how BCOR regulates gene expression such as Hox genes. PMID:26847029
Combining Zebrafish and Mouse Models to Test the Function of Deubiquitinating Enzyme (Dubs) Genes in Development: Role of USP45 in the Retina.

PubMed

Toulis, Vasileios; Garanto, Alejandro; Marfany, Gemma

2016-01-01

Ubiquitination is a dynamic and reversible posttranslational modification. Much effort has been devoted to characterizing the function of ubiquitin pathway genes in the cell context, but much less is known about their functional role in the development and maintenance of organs and tissues in the organism. In fact, several ubiquitin ligases and deubiquitinating enzymes (DUBs) are implicated in human pathological disorders, from cancer to neurodegeneration. The aim of our work is to explore the relevance of DUBs in retinal function in health and disease, particularly since some genes related to the ubiquitin or SUMO pathways cause retinal dystrophies, a group of rare diseases that affect 1:3000 individuals worldwide. We propose zebrafish as an extremely useful and informative genetic model to characterize the function of any particular gene in the retina, and thus complement the expression data from mouse. A preliminary characterization of gene expression in mouse retinas (RT-PCR and in situ hybridization) was performed to select particularly interesting genes, and we later replicated the experiments in zebrafish. As a proof of concept, we selected usp45 to be knocked down by morpholino injection in zebrafish embryos. Morphant phenotypic analysis showed moderate to severe eye morphological defects, with defective formation of the retinal structures, therefore supporting the relevance of DUBs in the formation and differentiation of the vertebrate retina, and suggesting that genes encoding ubiquitin pathway enzymes are good candidates for causing hereditary retinal dystrophies.
[Analysis of tissue-specific differentially methylated genes with differential gene expression in non-small cell lung cancer].

PubMed

Yin, L G; Zou, Z Q; Zhao, H Y; Zhang, C L; Shen, J G; Qi, L; Qi, M; Xue, Z Q

2014-01-01

Adenocarcinoma (ADC) and squamous cell carcinoma (SCC) are two subtypes of non-small cell lung carcinomas which are regarded as the leading cause of cancer-related malignancy worldwide. The aim of this study is to detect the differentially methylated loci (DMLs) and differentially methylated genes (DMGs) of these two tumor sets, and then to illustrate the different expression levels of specific methylated genes. Using the TCGA database and Illumina HumanMethylation 27 arrays, we first screened the DMGs and DMLs in tumor samples. Then, we explored the Biological Process terms of hypermethylated and hypomethylated genes using functional Gene Ontology (GO) catalogues. Hypermethylation occurred predominantly in CpG islands, whereas hypomethylation was located in non-CpG-island regions. Most SCC and ADC hypermethylated genes were involved in the GO function of DNA-dependent regulation of transcription, and hypomethylated genes were mainly enriched in the term of immune responses. Additionally, the expression level of specific differentially methylated genes is distinct between ADC and SCC. It is concluded that ADC and SCC have different methylation statuses that might play an important role in carcinogenesis.
Gene replacement in Penicillium roqueforti.

PubMed

Goarin, Anne; Silar, Philippe; Malagnac, Fabienne

2015-05-01

Most cheese-making filamentous fungi lack suitable molecular tools to improve their biotechnology potential. Penicillium roqueforti, a species of high industrial importance, would benefit from functional data yielded by molecular genetic approaches. This work provides the first example of gene replacement by homologous recombination in P. roqueforti, demonstrating that knockout experiments can be performed in this fungus. To do so, we improved the existing transformation method to integrate transgenes into P. roqueforti genome. In the meantime, we cloned the PrNiaD gene, which encodes a NADPH-dependent nitrate reductase that reduces nitrate to nitrite. Then, we performed a deletion of the PrNiaD gene from P. roqueforti strain AGO. The ΔPrNiaD mutant strain is more resistant to chlorate-containing medium than the wild-type strain, but did not grow on nitrate-containing medium. Because genomic data are now available, we believe that generating selective deletions of candidate genes will be a key step to open the way for a comprehensive exploration of gene function in P. roqueforti.
Systematic exploration of essential yeast gene function with temperature-sensitive mutants

PubMed Central

Li, Zhijian; Vizeacoumar, Franco J; Bahr, Sondra; Li, Jingjing; Warringer, Jonas; Vizeacoumar, Frederick S; Min, Renqiang; VanderSluis, Benjamin; Bellay, Jeremy; DeVit, Michael; Fleming, James A; Stephens, Andrew; Haase, Julian; Lin, Zhen-Yuan; Baryshnikova, Anastasia; Lu, Hong; Yan, Zhun; Jin, Ke; Barker, Sarah; Datti, Alessandro; Giaever, Guri; Nislow, Corey; Bulawa, Chris; Myers, Chad L; Costanzo, Michael; Gingras, Anne-Claude; Zhang, Zhaolei; Blomberg, Anders; Bloom, Kerry; Andrews, Brenda; Boone, Charles

2012-01-01

Conditional temperature-sensitive (ts) mutations are valuable reagents for studying essential genes in the yeast Saccharomyces cerevisiae. We constructed 787 ts strains, covering 497 (~45%) of the 1,101 essential yeast genes, with ~30% of the genes represented by multiple alleles. All of the alleles are integrated into their native genomic locus in the S288C common reference strain and are linked to a kanMX selectable marker, allowing further genetic manipulation by synthetic genetic array (SGA)–based, high-throughput methods. We show two such manipulations: barcoding of 440 strains, which enables chemical-genetic suppression analysis, and the construction of arrays of strains carrying different fluorescent markers of subcellular structure, which enables quantitative analysis of phenotypes using high-content screening. Quantitative analysis of a GFP-tubulin marker identified roles for cohesin and condensin genes in spindle disassembly. This mutant collection should facilitate a wide range of systematic studies aimed at understanding the functions of essential genes. PMID:21441928
Discovery of SCORs: Anciently derived, highly conserved gene-associated repeats in stony corals.

PubMed

Qiu, Huan; Zelzion, Ehud; Putnam, Hollie M; Gates, Ruth D; Wagner, Nicole E; Adams, Diane K; Bhattacharya, Debashish

2017-10-01

Stony coral (Scleractinia) genomes are still poorly explored and many questions remain about their evolution and contribution to the success and longevity of reefs. We analyzed transcriptome and genome data from Montipora capitata, Acropora digitifera, and transcriptome data from 20 other coral species. To our surprise, we found highly conserved, anciently derived, Scleractinia COral-specific Repeat families (SCORs) that are abundant in all the studied lineages. SCORs form complex secondary structures and are located in untranslated regions and introns, but most abundant in intergenic DNA. These repeat families have undergone frequent duplication and degradation, suggesting a 'boom and bust' cycle of invasion and loss. We speculate that due to their surprisingly high sequence identities across deeply diverged corals, physical association with genes, and dynamic evolution, SCORs might have adaptive functions in corals that need to be explored using population genomic and function-based approaches. Copyright © 2017 Elsevier Inc. All rights reserved.
Gene Editing and Human Pluripotent Stem Cells: Tools for Advancing Diabetes Disease Modeling and Beta-Cell Development.

PubMed

Millette, Katelyn; Georgia, Senta

2017-10-05

This review will focus on the multiple approaches to gene editing and address the potential use of genetically modified human pluripotent stem cell-derived beta cells (SC-β) as a tool to study human beta-cell development and model their function in diabetes. We will explore how new variations of CRISPR/Cas9 gene editing may accelerate our understanding of beta-cell developmental biology, elucidate novel mechanisms that establish and regulate beta-cell function, and assist in pioneering new therapeutic modalities for treating diabetes. Improvements in CRISPR/Cas9 target specificity and homology-directed recombination continue to advance its use in engineering stem cells to model and potentially treat disease. We will review how CRISPR/Cas9 gene editing is informing our understanding of beta-cell development and expanding the therapeutic possibilities for treating diabetes and other diseases. Here we focus on the emerging use of gene editing technology, specifically CRISPR/Cas9, as a means of manipulating human gene expression to gain novel insights into the roles of key factors in beta-cell development and function. Taken together, the combined use of SC-β cells and CRISPR/Cas9 gene editing will shed new light on human beta-cell development and function and accelerate our progress towards developing new therapies for patients with diabetes.
Mapping genomic features to functional traits through microbial whole genome sequences.

PubMed

Zhang, Wei; Zeng, Erliang; Liu, Dan; Jones, Stuart E; Emrich, Scott

2014-01-01

Recently, the utility of trait-based approaches for microbial communities has been identified. The increasing availability of whole genome sequences provides the opportunity to explore the genetic foundations of a variety of functional traits. We proposed a machine learning framework to quantitatively link genomic features with functional traits. Genes from bacterial genomes belonging to different functional traits were grouped into Clusters of Orthologous Groups (COGs), which were used as features. Then, the TF-IDF technique from the text mining domain was applied to transform the data to account for the abundance and importance of each COG. After TF-IDF processing, COGs were ranked using feature selection methods to identify their relevance to the functional trait of interest. Extensive experimental results demonstrated that functional trait-related genes can be detected using our method. Further, the method has the potential to provide novel biological insights.
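The TF-IDF step described in the abstract above can be illustrated with a minimal sketch. It treats each genome as a "document" and each COG as a "term". The abstract does not give the exact weighting or the feature selection method, so the formula (relative frequency times log inverse genome frequency), the mean-difference ranking, and all names and toy counts below are assumptions made for illustration only.

```python
import math


def tf_idf(counts):
    """counts: genome -> {COG id -> gene count}. Returns TF-IDF weights in the same shape."""
    n_genomes = len(counts)
    # Document frequency: number of genomes in which each COG occurs.
    df = {}
    for cogs in counts.values():
        for cog, c in cogs.items():
            if c > 0:
                df[cog] = df.get(cog, 0) + 1
    weights = {}
    for genome, cogs in counts.items():
        total = sum(cogs.values())
        weights[genome] = {
            cog: (c / total) * math.log(n_genomes / df[cog])
            for cog, c in cogs.items()
            if c > 0
        }
    return weights


def rank_cogs(weights, has_trait):
    """Rank COGs by mean TF-IDF in trait-positive minus trait-negative genomes.

    This simple score stands in for the unspecified feature selection methods.
    """
    pos = [g for g in weights if has_trait[g]]
    neg = [g for g in weights if not has_trait[g]]
    cogs = {c for w in weights.values() for c in w}

    def mean(group, cog):
        return sum(weights[g].get(cog, 0.0) for g in group) / len(group)

    scores = {c: mean(pos, c) - mean(neg, c) for c in cogs}
    return sorted(scores, key=lambda c: scores[c], reverse=True)


# Hypothetical toy data: four genomes, two of which carry the trait of interest.
counts = {
    "genome_A": {"COG0001": 2, "COG0002": 1, "COG0003": 1},
    "genome_B": {"COG0001": 1, "COG0002": 1},
    "genome_C": {"COG0002": 3, "COG0004": 1},
    "genome_D": {"COG0002": 2, "COG0004": 2},
}
has_trait = {"genome_A": True, "genome_B": True, "genome_C": False, "genome_D": False}

weights = tf_idf(counts)
ranking = rank_cogs(weights, has_trait)
print(ranking)
```

In this toy case a COG present in every genome receives a weight of zero, which is the property that makes TF-IDF attractive here: ubiquitous housekeeping COGs are down-weighted relative to COGs confined to a subset of genomes.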
Integrating microRNA and mRNA expression profiles of acute promyelocytic leukemia cells to explore the occurrence mechanisms of differentiation syndrome

PubMed Central

Ge, Fei; Cao, Fenglin; Li, Haitao; Wang, Ping; Xu, Mengyuan; Song, Peng; Li, Xiaoxia; Wang, Shuye; Li, Jinmei; Han, Xueying; Zhao, Yanhong; Su, Yanhua; Li, Yinghua; Fan, Shengjin; Li, Limin; Zhou, Jin

2016-01-01

The pathogenesis of therapy-induced differentiation syndrome (DS) in patients with acute promyelocytic leukemia (APL) remains unclear. In this study, mRNA and microRNA (miRNA) expression profiling of peripheral blood APL cells from patients complicated with vs. without DS were integratively analyzed to explore the mechanisms underlying arsenic trioxide treatment-associated DS. By integrating the differentially expressed data with the data of differentially expressed microRNAs and their computationally predicted target genes, as well as the data of transcription factors and differentially expressed target microRNAs obtained from a literature search, a DS-related genetic regulatory network was constructed. Then using an EAGLE algorithm in clusterViz, the network was subdivided into 10 modules. Using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database the modules were annotated functionally, and three functionally active modules were recognized. The further in-depth analyses on the annotated functions of the three modules and the expression and roles of the related genes revealed that proliferation, differentiation, apoptosis and infiltration capability of APL cells might play important roles in the DS pathogenesis. The results could improve our understanding of DS pathogenesis from a more overall perspective, and could provide new clues for future research. PMID:27634874
GoGene: gene annotation in the fast lane.

PubMed

Plake, Conrad; Royer, Loic; Winnenburg, Rainer; Hakenberg, Jörg; Schroeder, Michael

2009-07-01

High-throughput screens such as microarrays and RNAi screens produce huge amounts of data. They typically result in hundreds of genes, which are often further explored and clustered via enriched GeneOntology terms. The strength of such analyses is that they build on high-quality manual annotations provided with the GeneOntology. However, the weakness is that annotations are restricted to process, function and location and that they do not cover all known genes in model organisms. GoGene addresses this weakness by complementing high-quality manual annotation with high-throughput text mining extracting co-occurrences of genes and ontology terms from literature. GoGene contains over 4,000,000 associations between genes and gene-related terms for 10 model organisms extracted from more than 18,000,000 PubMed entries. It does not cover only process, function and location of genes, but also biomedical categories such as diseases, compounds, techniques and mutations. By bringing it all together, GoGene provides the most recent and most complete facts about genes and can rank them according to novelty and importance. GoGene accepts keywords, gene lists, gene sequences and protein sequences as input and supports search for genes in PubMed, EntrezGene and via BLAST. Since all associations of genes to terms are supported by evidence in the literature, the results are transparent and can be verified by the user. GoGene is available at http://gopubmed.org/gogene.
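The co-occurrence extraction mentioned in the abstract above can be sketched roughly as follows. The abstract does not say at what granularity co-occurrence is counted or how names are matched, so sentence-level counting, the simple string matching, and the gene names, terms and sentences used here are all hypothetical choices for illustration, not a description of GoGene itself.

```python
import re
from collections import Counter


def cooccurrences(abstracts, gene_names, terms):
    """Count how often a gene name and an ontology term appear in the same sentence."""
    counts = Counter()
    for text in abstracts:
        # Naive sentence split on ., ! or ? followed by whitespace.
        for sentence in re.split(r"(?<=[.!?])\s+", text):
            lowered = sentence.lower()
            genes = [
                g for g in gene_names
                if re.search(r"\b" + re.escape(g.lower()) + r"\b", lowered)
            ]
            hits = [t for t in terms if t.lower() in lowered]
            for g in genes:
                for t in hits:
                    counts[(g, t)] += 1
    return counts


# Made-up example sentences and names, used only to exercise the function.
abstracts = [
    "ABC1 regulates cell adhesion. XYZ2 is required for spindle disassembly.",
    "Loss of ABC1 alters cell adhesion and cell proliferation.",
]
counts = cooccurrences(
    abstracts,
    gene_names=["ABC1", "XYZ2"],
    terms=["cell adhesion", "spindle disassembly", "cell proliferation"],
)
for (gene, term), n in sorted(counts.items()):
    print(gene, term, n)
```

Because every count traces back to specific sentences, associations produced this way can be checked against the literature, which is the transparency property the abstract emphasizes.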
Exploring Plant Co-Expression and Gene-Gene Interactions with CORNET 3.0.

PubMed

Van Bel, Michiel; Coppens, Frederik

2017-01-01

Selecting and filtering a reference expression and interaction dataset when studying specific pathways and regulatory interactions can be a very time-consuming and error-prone task. In order to reduce the duplicated efforts required to amass such datasets, we have created the CORNET (CORrelation NETworks) platform which allows for easy access to a wide variety of data types: coexpression data, protein-protein interactions, regulatory interactions, and functional annotations. The CORNET platform outputs its results in either text format or through the Cytoscape framework, which is automatically launched by the CORNET website. CORNET 3.0 is the third iteration of the web platform designed for the user exploration of the coexpression space of plant genomes, with a focus on the model species Arabidopsis thaliana. Here we describe the platform: the tools, data, and best practices when using the platform. We indicate how the platform can be used to infer networks from a set of input genes, such as upregulated genes from an expression experiment. By exploring the network, new target and regulator genes can be discovered, allowing for follow-up experiments and more in-depth study. We also indicate how to avoid common pitfalls when evaluating the networks and how to avoid overinterpretation of the results. All CORNET versions are available at http://bioinformatics.psb.ugent.be/cornet/.
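To make the idea of a correlation network concrete, here is a minimal sketch of how a coexpression network might be inferred from a set of input genes. It is not CORNET code: the use of Pearson correlation, the 0.8 cutoff, the function names and the toy expression values are all assumptions chosen for illustration.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0  # A constant profile carries no correlation information.
    return cov / (sx * sy)


def coexpression_edges(expr, threshold=0.8):
    """Return (gene1, gene2, r) for every gene pair with |r| >= threshold."""
    edges = []
    for g1, g2 in combinations(sorted(expr), 2):
        r = pearson(expr[g1], expr[g2])
        if abs(r) >= threshold:
            edges.append((g1, g2, round(r, 3)))
    return edges


# Hypothetical expression values for four genes across four conditions.
expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.1, 5.9, 8.0],
    "geneC": [4.0, 3.0, 2.0, 1.0],
    "geneD": [1.0, 3.0, 1.0, 3.0],
}
edges = coexpression_edges(expr)
for edge in edges:
    print(edge)
```

The choice of threshold strongly shapes the resulting network, which is one reason the abstract warns against overinterpretation: an edge indicates correlated expression in the chosen dataset, not a demonstrated regulatory interaction.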
In silico analysis of a novel MKRN3 missense mutation in familial central precocious puberty.

PubMed

Neocleous, Vassos; Shammas, Christos; Phelan, Marie M; Nicolaou, Stella; Phylactou, Leonidas A; Skordis, Nicos

2016-01-01

The onset of puberty is influenced by the interplay of stimulating and restraining factors, many of which have a genetic origin. Premature activation of GnRH secretion in central precocious puberty (CPP) may arise either from gain-of-function mutations of the KISS1 and KISS1R genes or from loss-of-function mutations of the MKRN3 gene leading to MKRN3 deficiency. To explore the genetic causes responsible for CPP and the potential role of the RING finger protein 3 (MKRN3) gene, we investigated potential sequence variations in the intronless MKRN3 gene by Sanger sequencing of the entire 507 amino acid coding region of exon 1 in a family with two affected girls who presented with CPP at the ages of 6 and 5.7 years, respectively. A novel heterozygous g.Gly312Asp missense mutation in the MKRN3 gene was identified in these siblings. The MKRN3 missense mutation was also identified in the unaffected father and, as expected, followed an imprinted mode of inheritance. In silico analysis of the missense variant using the computational algorithms Polyphen2, SIFT and Mutation Taster predicted a damaging and pathogenic alteration causing CPP. The pathogenicity of the alteration at the protein level was also explored via an in silico structural model. A novel mutation in the MKRN3 gene in two sisters with CPP was identified, supporting the fundamental role of this gene in the suppression of the hypothalamic GnRH neurons. © 2015 John Wiley & Sons Ltd.
Gene Function Hypotheses for the Campylobacter jejuni Glycome Generated by a Logic-Based Approach

PubMed Central

Sternberg, Michael J.E.; Tamaddoni-Nezhad, Alireza; Lesk, Victor I.; Kay, Emily; Hitchen, Paul G.; Cootes, Adrian; van Alphen, Lieke B.; Lamoureux, Marc P.; Jarrell, Harold C.; Rawlings, Christopher J.; Soo, Evelyn C.; Szymanski, Christine M.; Dell, Anne; Wren, Brendan W.; Muggleton, Stephen H.

2013-01-01

Increasingly, experimental data on biological systems are obtained from several sources and computational approaches are required to integrate this information and derive models for the function of the system. Here, we demonstrate the power of a logic-based machine learning approach to propose hypotheses for gene function integrating information from two diverse experimental approaches. Specifically, we use inductive logic programming that automatically proposes hypotheses explaining the empirical data with respect to logically encoded background knowledge. We study the capsular polysaccharide biosynthetic pathway of the major human gastrointestinal pathogen Campylobacter jejuni. We consider several key steps in the formation of capsular polysaccharide consisting of 15 genes of which 8 have assigned function, and we explore the extent to which functions can be hypothesised for the remaining 7. Two sources of experimental data provide the information for learning—the results of knockout experiments on the genes involved in capsule formation and the absence/presence of capsule genes in a multitude of strains of different serotypes. The machine learning uses the pathway structure as background knowledge. We propose assignments of specific genes to five previously unassigned reaction steps. For four of these steps, there was an unambiguous optimal assignment of gene to reaction, and to the fifth, there were three candidate genes. Several of these assignments were consistent with additional experimental results. We therefore show that the logic-based methodology provides a robust strategy to integrate results from different experimental approaches and propose hypotheses for the behaviour of a biological system. PMID:23103756
Gene function hypotheses for the Campylobacter jejuni glycome generated by a logic-based approach.

PubMed

Sternberg, Michael J E; Tamaddoni-Nezhad, Alireza; Lesk, Victor I; Kay, Emily; Hitchen, Paul G; Cootes, Adrian; van Alphen, Lieke B; Lamoureux, Marc P; Jarrell, Harold C; Rawlings, Christopher J; Soo, Evelyn C; Szymanski, Christine M; Dell, Anne; Wren, Brendan W; Muggleton, Stephen H

2013-01-09

Increasingly, experimental data on biological systems are obtained from several sources and computational approaches are required to integrate this information and derive models for the function of the system. Here, we demonstrate the power of a logic-based machine learning approach to propose hypotheses for gene function integrating information from two diverse experimental approaches. Specifically, we use inductive logic programming that automatically proposes hypotheses explaining the empirical data with respect to logically encoded background knowledge. We study the capsular polysaccharide biosynthetic pathway of the major human gastrointestinal pathogen Campylobacter jejuni. We consider several key steps in the formation of capsular polysaccharide consisting of 15 genes of which 8 have assigned function, and we explore the extent to which functions can be hypothesised for the remaining 7. Two sources of experimental data provide the information for learning: the results of knockout experiments on the genes involved in capsule formation and the absence/presence of capsule genes in a multitude of strains of different serotypes. The machine learning uses the pathway structure as background knowledge. We propose assignments of specific genes to five previously unassigned reaction steps. For four of these steps, there was an unambiguous optimal assignment of gene to reaction, and to the fifth, there were three candidate genes. Several of these assignments were consistent with additional experimental results. We therefore show that the logic-based methodology provides a robust strategy to integrate results from different experimental approaches and propose hypotheses for the behaviour of a biological system. Copyright © 2012 Elsevier Ltd. All rights reserved.
An evidence-based knowledgebase of metastasis suppressors to identify key pathways relevant to cancer metastasis

PubMed Central

Zhao, Min; Li, Zhe; Qu, Hong

2015-01-01

Metastasis suppressor genes (MS genes) are genes that play important roles in inhibiting the process of cancer metastasis without preventing growth of the primary tumor. Identification of these genes and understanding their functions are critical for investigation of cancer metastasis. Recent studies on cancer metastasis have identified many new susceptibility MS genes. However, a comprehensive illustration of the diverse cellular processes regulated by metastasis suppressors during the metastasis cascade is lacking. Thus, the relationship between MS genes and cancer risk is still unclear. To unveil the cellular complexity of MS genes, we have constructed MSGene (http://MSGene.bioinfo-minzhao.org/), the first literature-based gene resource for exploring human MS genes. In total, we manually curated 194 experimentally verified MS genes and mapped them to 1448 homologous genes from 17 model species. Follow-up functional analyses associated the 194 human MS genes with epithelium/tissue morphogenesis and epithelial cell proliferation. In addition, pathway analysis highlights the prominent role of MS genes in activation of platelets and the coagulation system in the tumor metastatic cascade. Moreover, the global mutation pattern of MS genes across multiple cancers may reveal common cancer metastasis mechanisms. All these results illustrate the importance of MSGene to our understanding of cell development and cancer metastasis. PMID:26486520
Computation and application of tissue-specific gene set weights.

PubMed

Frost, H Robert

2018-04-06

Gene set testing, or pathway analysis, has become a critical tool for the analysis of high-dimensional genomic data. Although the function and activity of many genes and higher-level processes are tissue-specific, gene set testing is typically performed in a tissue-agnostic fashion, which impacts statistical power and the interpretation and replication of results. To address this challenge, we have developed a bioinformatics approach to compute tissue-specific weights for individual gene sets using information on tissue-specific gene activity from the Human Protein Atlas (HPA). We used this approach to create a public repository of tissue-specific gene set weights for 37 different human tissue types from the HPA and all collections in the Molecular Signatures Database (MSigDB). To demonstrate the validity and utility of these weights, we explored three different applications: the functional characterization of human tissues, multi-tissue analysis for systemic diseases and tissue-specific gene set testing. All data used in the reported analyses are publicly available. An R implementation of the method and tissue-specific weights for MSigDB gene set collections can be downloaded at http://www.dartmouth.edu/~hrfrost/TissueSpecificGeneSets. Contact: rob.frost@dartmouth.edu.
Protein-protein interaction network of gene expression in the hydrocortisone-treated keloid.

PubMed

Chen, Rui; Zhang, Zhiliang; Xue, Zhujia; Wang, Lin; Fu, Mingang; Lu, Yi; Bai, Ling; Zhang, Ping; Fan, Zhihong

2015-01-01

In order to explore the molecular mechanism of hydrocortisone in keloid tissue, the gene expression profiles of keloid samples treated with hydrocortisone were subjected to bioinformatics analysis. Firstly, the gene expression profiles (GSE7890) of five samples of keloid treated with hydrocortisone and five untreated keloid samples were downloaded from the Gene Expression Omnibus (GEO) database. Secondly, data were preprocessed using packages in R language and differentially expressed genes (DEGs) were screened using a significance analysis of microarrays (SAM) protocol. Thirdly, the DEGs were subjected to gene ontology (GO) function and KEGG pathway enrichment analysis. Finally, the interactions of DEGs in samples of keloid treated with hydrocortisone were explored in a human protein-protein interaction (PPI) network, and sub-modules of the DEGs interaction network were analyzed using Cytoscape software. Based on the analysis, 572 DEGs in the hydrocortisone-treated samples were screened; most of these were involved in the signal transduction and cell cycle. Furthermore, three critical genes in the module, including COL1A1, NID1, and PRELP, were screened in the PPI network analysis. These findings enhance understanding of the pathogenesis of the keloid and provide references for keloid therapy. © 2015 The International Society of Dermatology.
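The GO and KEGG enrichment step in the workflow above is commonly carried out with a one-sided hypergeometric (Fisher-type) test, although the abstract does not state which test or tool was used. The sketch below shows that calculation under this assumption; the function name and the toy counts are hypothetical and are not taken from the study.

```python
from math import comb


def hypergeom_enrichment_p(n_background, n_in_term, n_selected, n_overlap):
    """P(X >= n_overlap) when n_selected genes are drawn without replacement
    from n_background genes, n_in_term of which are annotated to the term."""
    upper = min(n_in_term, n_selected)
    total = comb(n_background, n_selected)
    tail = sum(
        comb(n_in_term, k) * comb(n_background - n_in_term, n_selected - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total


# Toy numbers: 20 background genes, 5 annotated to a pathway,
# 4 differentially expressed genes, 3 of which fall in the pathway.
p = hypergeom_enrichment_p(n_background=20, n_in_term=5, n_selected=4, n_overlap=3)
print(p)
```

In a real analysis the p-values for all tested terms would additionally need correction for multiple testing, and the choice of background gene set (all genes on the array versus all annotated genes) can change the result noticeably.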
CRISPR-mediated genotypic and phenotypic correction of a chronic granulomatous disease mutation in human iPS cells

PubMed Central

Flynn, Rowan; Grundmann, Alexander; Renz, Peter; Hänseler, Walther; James, William S.; Cowley, Sally A.; Moore, Michael D.

2015-01-01

Chronic granulomatous disease (CGD) is a rare genetic disease characterized by severe and persistent childhood infections. It is caused by the lack of an antipathogen oxidative burst, normally performed by phagocytic cells to contain and clear bacterial and fungal growth. Restoration of immune function can be achieved with heterologous bone marrow transplantation; however, autologous bone marrow transplantation would be a preferable option. Thus, a method is required to recapitulate the function of the diseased gene within the patient's own cells. Gene therapy approaches for CGD have employed randomly integrating viruses with concomitant issues of insertional mutagenesis, inaccurate gene dosage, and gene silencing. Here, we explore the potential of the recently described clustered regularly interspaced short palindromic repeat (CRISPR)-Cas9 site-specific nuclease system to encourage repair of the endogenous gene by enhancing the levels of homologous recombination. Using induced pluripotent stem cells derived from a CGD patient containing a single intronic mutation in the CYBB gene, we show that footprintless gene editing is a viable option to correct disease mutations. Gene correction results in restoration of oxidative burst function in iPS-derived phagocytes by reintroduction of a previously skipped exon in the cytochrome b-245 heavy chain (CYBB) protein. This study provides proof-of-principle for a gene therapy approach to CGD treatment using CRISPR-Cas9. PMID:26101162
Thyroid hormone activation of retinoic acid synthesis in hypothalamic tanycytes.

PubMed

Stoney, Patrick N; Helfer, Gisela; Rodrigues, Diana; Morgan, Peter J; McCaffery, Peter

2016-03-01

Thyroid hormone (TH) is essential for adult brain function and its actions include several key roles in the hypothalamus. Although TH controls gene expression via specific TH receptors of the nuclear receptor class, surprisingly few genes have been demonstrated to be directly regulated by TH in the hypothalamus, or the adult brain as a whole. This study explored the rapid induction by TH of retinaldehyde dehydrogenase 1 (Raldh1), encoding a retinoic acid (RA)-synthesizing enzyme, as a gene specifically expressed in hypothalamic tanycytes, cells that mediate a number of actions of TH in the hypothalamus. The resulting increase in RA may then regulate gene expression via the RA receptors, also of the nuclear receptor class. In vivo exposure of the rat to TH led to a significant and rapid increase in hypothalamic Raldh1 within 4 hours. That this may lead to an in vivo increase in RA is suggested by the later induction by TH of the RA-responsive gene Cyp26b1. To explore the actions of RA in the hypothalamus as a potential mediator of TH control of gene regulation, an ex vivo hypothalamic rat slice culture method was developed in which the Raldh1-expressing tanycytes were maintained. These slice cultures confirmed that TH did not act on genes regulating energy balance but could induce Raldh1. RA has the potential to upregulate expression of genes involved in growth and appetite, Ghrh and Agrp. This regulation is acutely sensitive to epigenetic changes, as has been shown for TH action in vivo. These results indicate that sequential triggering of two nuclear receptor signalling systems has the capability to mediate some of the functions of TH in the hypothalamus. © 2015 Wiley Periodicals, Inc.
Effects of 4-chlorophenol wastewater treatment on sludge acute toxicity, microbial diversity and functional genes expression in an activated sludge process.

PubMed

Zhao, Jianguo; Li, Yahe; Li, Yu; Yu, Zeya; Chen, Xiurong

2018-05-31

In this study, the effects of 4-chlorophenol (4-CP) wastewater treatment on sludge acute toxicity of luminescent bacteria, microbial diversity and functional genes expression of Pseudomonas were explored. Results showed that in the entire operational process, the sludge acute toxicity acclimated by 4-CP in a sequencing batch bioreactor (SBR) was significantly higher than the control SBR without 4-CP. The dominant phyla in acclimated SBR were Proteobacteria and Firmicutes, which also existed in control SBR. Some identified genera in acclimated SBR were responsible for 4-CP degradation. At the stable operational stages, the functional genes expression of Pseudomonas in acclimated SBR was down-regulated at the end of SBR cycle, and their expression mechanisms needed further research. This study provides a theoretical support to comprehensively understand the sludge performance in industrial wastewater treatment. Copyright © 2018 Elsevier Ltd. All rights reserved.

Memory functions reveal structural properties of gene regulatory networks

PubMed Central

Perez-Carrasco, Ruben

2018-01-01

Gene regulatory networks (GRNs) control cellular function and decision making during tissue development and homeostasis. Mathematical tools based on dynamical systems theory are often used to model these networks, but the size and complexity of these models mean that their behaviour is not always intuitive and the underlying mechanisms can be difficult to decipher. For this reason, methods that simplify and aid exploration of complex networks are necessary. To this end we develop a broadly applicable form of the Zwanzig-Mori projection. By first converting a thermodynamic state ensemble model of gene regulation into mass action reactions we derive a general method that produces a set of time evolution equations for a subset of components of a network. The influence of the rest of the network, the bulk, is captured by memory functions that describe how the subnetwork reacts to its own past state via components in the bulk. These memory functions provide probes of near-steady state dynamics, revealing information not easily accessible otherwise. We illustrate the method on a simple cross-repressive transcriptional motif to show that memory functions not only simplify the analysis of the subnetwork but also have a natural interpretation. We then apply the approach to a GRN from the vertebrate neural tube, a well characterised developmental transcriptional network composed of four interacting transcription factors. The memory functions reveal the function of specific links within the neural tube network and identify features of the regulatory structure that specifically increase the robustness of the network to initial conditions. Taken together, the study provides evidence that Zwanzig-Mori projections offer powerful and effective tools for simplifying and exploring the behaviour of GRNs. PMID:29470492
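To make the idea of a memory function concrete, the following is a minimal sketch for the linearised case only (dynamics close to a steady state). The symbols are assumptions introduced here, not the paper's notation: x_s is the subnetwork, x_b the bulk, and A is the Jacobian partitioned into the blocks A_ss, A_sb, A_bs and A_bb. Solving the bulk equation and substituting it into the subnetwork equation gives a closed equation for x_s with a memory kernel M.

```latex
\begin{aligned}
\dot{x}_s &= A_{ss}\,x_s + A_{sb}\,x_b, \qquad
\dot{x}_b = A_{bs}\,x_s + A_{bb}\,x_b \\
x_b(t) &= e^{A_{bb} t}\,x_b(0) + \int_0^t e^{A_{bb}(t-t')}\,A_{bs}\,x_s(t')\,dt' \\
\dot{x}_s(t) &= A_{ss}\,x_s(t) + \int_0^t M(t-t')\,x_s(t')\,dt' + A_{sb}\,e^{A_{bb} t}\,x_b(0),
\qquad M(\tau) = A_{sb}\,e^{A_{bb}\tau}\,A_{bs}
\end{aligned}
```

In this linear sketch the kernel M(τ) describes how the subnetwork's past state feeds back on it through the bulk, and the last term carries the dependence on the bulk's initial condition. The method in the abstract is more general than this linear special case.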
Alternative splicing and the evolution of phenotypic novelty.

PubMed

Bush, Stephen J; Chen, Lu; Tovar-Corona, Jaime M; Urrutia, Araxi O

2017-02-05

Alternative splicing, a mechanism of post-transcriptional RNA processing whereby a single gene can encode multiple distinct transcripts, has been proposed to underlie morphological innovations in multicellular organisms. Genes with developmental functions are enriched for alternative splicing events, suggestive of a contribution of alternative splicing to developmental programmes. The role of alternative splicing as a source of transcript diversification has previously been compared to that of gene duplication, with the relationship between the two extensively explored. Alternative splicing is reduced following gene duplication with the retention of duplicate copies higher for genes which were alternatively spliced prior to duplication. Furthermore, and unlike the case for overall gene number, the proportion of alternatively spliced genes has also increased in line with the evolutionary diversification of cell types, suggesting alternative splicing may contribute to the complexity of developmental programmes. Together these observations suggest a prominent role for alternative splicing as a source of functional innovation. However, it is unknown whether the proliferation of alternative splicing events indeed reflects a functional expansion of the transcriptome or instead results from weaker selection acting on larger species, which tend to have a higher number of cell types and lower population sizes. This article is part of the themed issue 'Evo-devo in the genomics era, and the origins of morphological diversity'. © 2016 The Author(s).
Alternative splicing and the evolution of phenotypic novelty

PubMed Central

Bush, Stephen J.; Chen, Lu; Tovar-Corona, Jaime M.

2017-01-01

Alternative splicing, a mechanism of post-transcriptional RNA processing whereby a single gene can encode multiple distinct transcripts, has been proposed to underlie morphological innovations in multicellular organisms. Genes with developmental functions are enriched for alternative splicing events, suggestive of a contribution of alternative splicing to developmental programmes. The role of alternative splicing as a source of transcript diversification has previously been compared to that of gene duplication, with the relationship between the two extensively explored. Alternative splicing is reduced following gene duplication with the retention of duplicate copies higher for genes which were alternatively spliced prior to duplication. Furthermore, and unlike the case for overall gene number, the proportion of alternatively spliced genes has also increased in line with the evolutionary diversification of cell types, suggesting alternative splicing may contribute to the complexity of developmental programmes. Together these observations suggest a prominent role for alternative splicing as a source of functional innovation. However, it is unknown whether the proliferation of alternative splicing events indeed reflects a functional expansion of the transcriptome or instead results from weaker selection acting on larger species, which tend to have a higher number of cell types and lower population sizes. This article is part of the themed issue ‘Evo-devo in the genomics era, and the origins of morphological diversity’. PMID:27994117
New Statistics for Testing Differential Expression of Pathways from Microarray Data

NASA Astrophysics Data System (ADS)

Siu, Hoicheong; Dong, Hua; Jin, Li; Xiong, Momiao

Exploring biological meaning from microarray data is very important but remains a great challenge. Here, we developed three new statistics: linear combination test, quadratic test and de-correlation test to identify differentially expressed pathways from gene expression profile. We apply our statistics to two rheumatoid arthritis datasets. Notably, our results reveal three significant pathways and 275 genes in common in two datasets. The pathways we found are meaningful to uncover the disease mechanisms of rheumatoid arthritis, which implies that our statistics are a powerful tool in functional analysis of gene expression data.
'RetinoGenetics': a comprehensive mutation database for genes related to inherited retinal degeneration.

PubMed

Ran, Xia; Cai, Wei-Jun; Huang, Xiu-Feng; Liu, Qi; Lu, Fan; Qu, Jia; Wu, Jinyu; Jin, Zi-Bing

2014-01-01

Inherited retinal degeneration (IRD), a leading cause of human blindness worldwide, is exceptionally heterogeneous, both clinically and genetically. During the past decades, tremendous efforts have been made to explore this complex heterogeneity, and a large number of mutations have been identified in different genes underlying IRD thanks to significant advances in sequencing technology. In this study, we developed a comprehensive database, 'RetinoGenetics', which contains informative knowledge about all known IRD-related genes and mutations for IRD. 'RetinoGenetics' currently contains 4270 mutations in 186 genes, with detailed information associated with 164 phenotypes from 934 publications and various types of functional annotations. Extensive annotations were then performed for each gene using various resources, including Gene Ontology, KEGG pathways, protein-protein interaction, mutational annotations and gene-disease network. Furthermore, by using the search functions, convenient browsing options and intuitive graphical displays, 'RetinoGenetics' could serve as a valuable resource for unveiling the genetic basis of IRD. Taken together, 'RetinoGenetics' is an integrative, informative and updatable resource for IRD-related genetic predispositions. Database URL: http://www.retinogenetics.org/. © The Author(s) 2014. Published by Oxford University Press.
Dynamic sporulation gene co-expression networks for Bacillus subtilis 168 and the food-borne isolate Bacillus amyloliquefaciens: a transcriptomic model

PubMed Central

Omony, Jimmy; de Jong, Anne; Krawczyk, Antonina O.; Eijlander, Robyn T.; Kuipers, Oscar P.

2018-01-01

Sporulation is a survival strategy, adapted by bacterial cells in response to harsh environmental adversities. The adaptation potential differs between strains and the variations may arise from differences in gene regulation. Gene networks are a valuable way of studying such regulation processes and establishing associations between genes. We reconstructed and compared sporulation gene co-expression networks (GCNs) of the model laboratory strain Bacillus subtilis 168 and the food-borne industrial isolate Bacillus amyloliquefaciens. Transcriptome data obtained from samples of six stages during the sporulation process were used for network inference. Subsequently, a gene set enrichment analysis was performed to compare the reconstructed GCNs of B. subtilis 168 and B. amyloliquefaciens with respect to biological functions, which showed the enriched modules with coherent functional groups associated with sporulation. On the basis of the GCNs and time-evolution of differentially expressed genes, we could identify novel candidate genes strongly associated with sporulation in B. subtilis 168 and B. amyloliquefaciens. The GCNs offer a framework for exploring transcription factors, their targets, and co-expressed genes during sporulation. Furthermore, the methodology described here can conveniently be applied to other species or biological processes. PMID:29424683
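As a rough illustration of how a gene co-expression network can be inferred from time-course data, the sketch below links two genes when the absolute Pearson correlation of their profiles exceeds a threshold. This is a generic approach and not necessarily the inference method used in the study; the gene names, expression values and the 0.9 threshold are all hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (non-zero variance assumed)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def coexpression_edges(profiles, threshold=0.9):
    """Return (gene1, gene2, r) for every pair with |r| >= threshold."""
    edges = []
    for g1, g2 in combinations(sorted(profiles), 2):
        r = pearson(profiles[g1], profiles[g2])
        if abs(r) >= threshold:
            edges.append((g1, g2, r))
    return edges


# Hypothetical expression values at six sporulation stages.
profiles = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    "geneB": [2.1, 3.9, 6.2, 8.0, 9.9, 12.1],
    "geneC": [5.0, 1.0, 4.0, 2.0, 6.0, 3.0],
}
edges = coexpression_edges(profiles)
```

With these toy values only geneA and geneB are linked, because their profiles rise together across the six stages while geneC fluctuates independently.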
Dynamic sporulation gene co-expression networks for Bacillus subtilis 168 and the food-borne isolate Bacillus amyloliquefaciens: a transcriptomic model.

PubMed

Omony, Jimmy; de Jong, Anne; Krawczyk, Antonina O; Eijlander, Robyn T; Kuipers, Oscar P

2018-02-09

Sporulation is a survival strategy, adapted by bacterial cells in response to harsh environmental adversities. The adaptation potential differs between strains and the variations may arise from differences in gene regulation. Gene networks are a valuable way of studying such regulation processes and establishing associations between genes. We reconstructed and compared sporulation gene co-expression networks (GCNs) of the model laboratory strain Bacillus subtilis 168 and the food-borne industrial isolate Bacillus amyloliquefaciens. Transcriptome data obtained from samples of six stages during the sporulation process were used for network inference. Subsequently, a gene set enrichment analysis was performed to compare the reconstructed GCNs of B. subtilis 168 and B. amyloliquefaciens with respect to biological functions, which showed the enriched modules with coherent functional groups associated with sporulation. On the basis of the GCNs and time-evolution of differentially expressed genes, we could identify novel candidate genes strongly associated with sporulation in B. subtilis 168 and B. amyloliquefaciens. The GCNs offer a framework for exploring transcription factors, their targets, and co-expressed genes during sporulation. Furthermore, the methodology described here can conveniently be applied to other species or biological processes.
From the ultrasonic to the infrared: molecular evolution and the sensory biology of bats

PubMed Central

Jones, Gareth; Teeling, Emma C.; Rossiter, Stephen J.

2013-01-01

Great advances have been made recently in understanding the genetic basis of the sensory biology of bats. Research has focused on the molecular evolution of candidate sensory genes, genes with known functions [e.g., olfactory receptor (OR) genes] and genes identified from mutations associated with sensory deficits (e.g., blindness and deafness). For example, the FoxP2 gene, underpinning vocal behavior and sensorimotor coordination, has undergone diversification in bats, while several genes associated with audition show parallel amino acid substitutions in unrelated lineages of echolocating bats and, in some cases, in echolocating dolphins, representing a classic case of convergent molecular evolution. Vision genes encoding the photopigments rhodopsin and the long-wave sensitive opsin are functional in bats, while that encoding the short-wave sensitive opsin has lost functionality in rhinolophoid bats using high-duty cycle laryngeal echolocation, suggesting a sensory trade-off between investment in vision and echolocation. In terms of olfaction, bats appear to have a distinctive OR repertoire compared with other mammals, and a gene involved in signal transduction in the vomeronasal system has become non-functional in most bat species. Bitter taste receptors appear to have undergone a “birth-and death” evolution involving extensive gene duplication and loss, unlike genes coding for sweet and umami tastes that show conservation across most lineages but loss in vampire bats. Common vampire bats have also undergone adaptations for thermoperception, via alternative splicing resulting in the evolution of a novel heat-sensitive channel. The future for understanding the molecular basis of sensory biology is promising, with great potential for comparative genomic analyses, studies on gene regulation and expression, exploration of the role of alternative splicing in the generation of proteomic diversity, and linking genetic mechanisms to behavioral consequences. PMID:23755015
Differential Effect of Active Smoking on Gene Expression in Male and Female Smokers

PubMed Central

Paul, Sunirmal; Amundson, Sally A

2015-01-01

Smoking is the second leading cause of preventable death in the United States. Cohort epidemiological studies have demonstrated that women are more vulnerable to cigarette-smoking induced diseases than their male counterparts, however, the molecular basis of these differences has remained unknown. In this study, we explored if there were differences in the gene expression patterns between male and female smokers, and how these patterns might reflect different sex-specific responses to the stress of smoking. Using whole genome microarray gene expression profiling, we found that a substantial number of oxidant related genes were expressed in both male and female smokers, however, smoking-responsive genes did indeed differ greatly between male and female smokers. Gene set enrichment analysis (GSEA) against reference oncogenic signature gene sets identified a large number of oncogenic pathway gene-sets that were significantly altered in female smokers compared to male smokers. In addition, functional annotation with Ingenuity Pathway Analysis (IPA) identified smoking-correlated genes associated with biological functions in male and female smokers that are directly relevant to well-known smoking related pathologies. However, these relevant biological functions were strikingly overrepresented in female smokers compared to male smokers. IPA network analysis with the functional categories of immune and inflammatory response gene products suggested potential interactions between smoking response and female hormones. Our results demonstrate a striking dichotomy between male and female gene expression responses to smoking. This is the first genome-wide expression study to compare the sex-specific impacts of smoking at a molecular level and suggests a novel potential connection between sex hormone signaling and smoking-induced diseases in female smokers. PMID:25621181
[Construction of rAAV2-GPIIb/IIIa vector and test of its expression and function in vitro].

PubMed

Wang, Kai; Peng, Jian-Qiang; Chen, Fang-Ping; Wu, Xiao-Bin

2006-04-01

This study aimed to explore the possibility of rAAV2 vector-mediated gene therapy for Glanzmann's thrombasthenia. The rAAV2-GPIIb/IIIa vector was constructed. GPIIb/IIIa gene expression in mammalian cells was examined by several methods: mRNA expression in BHK-21 cells 24 hours after infection (MOI = 1 x 10^5 v.g./cell) was detected by RT-PCR; the relation between MOI and the level of GPIIb/IIIa gene expression was measured by FACS 48 hours after infection; GPIIb/IIIa protein expression in BHK-21 cells 48 hours after infection (MOI = 10^5 v.g./cell) was assayed by Western blot; GPIIb/IIIa protein expression on the cell surface was detected by immunofluorescence; and the biological function of the expressed product was determined by PAC-1 binding experiments. The results showed that GPIIb/IIIa expression at the mRNA level could be detected in BHK-21 cells 24 hours after infection at MOI = 1 x 10^5 v.g./cell, and that expression at the protein level could be detected in BHK-21 cells 48 hours after infection at MOI = 1 x 10^5 v.g./cell. Within a certain range, the level of GPIIb/IIIa gene expression increased with MOI, but an excessive MOI decreased it. The activated product of GPIIb/IIIa gene expression could bind PAC-1 and possessed normal biological function. In conclusion, the rAAV2 vector can effectively mediate GPIIb and GPIIIa gene expression in mammalian cells, and the products of these genes exhibit biological function. This result may provide a basis for the future application of rAAV2 vectors in gene therapy for Glanzmann's thrombasthenia.
Discovering functional modules by topic modeling RNA-Seq based toxicogenomic data.

PubMed

Yu, Ke; Gong, Binsheng; Lee, Mikyung; Liu, Zhichao; Xu, Joshua; Perkins, Roger; Tong, Weida

2014-09-15

Toxicogenomics (TGx) endeavors to elucidate the underlying molecular mechanisms through exploring gene expression profiles in response to toxic substances. RNA-Seq is increasingly regarded as a more powerful alternative to microarrays in TGx studies. However, realizing RNA-Seq's full potential requires novel approaches to extracting information from the complex TGx data. Considering read counts as the number of times a word occurs in a document, gene expression profiles from RNA-Seq are analogous to a word-by-document matrix used in text mining. Topic modeling, which aims to discover the latent structures in text corpora, would therefore be helpful for exploring RNA-Seq based TGx data. In this study, topic modeling was applied to a typical RNA-Seq based TGx data set to discover hidden functional modules. The RNA-Seq based gene expression profiles were transformed into "documents", on which latent Dirichlet allocation (LDA) was used to build a topic model. We found that samples treated by compounds with the same modes of action (MoAs) could be clustered based on topic similarities. The topic most relevant to each cluster was identified as a "marker" topic, which was interpreted by gene enrichment analysis with MoAs and then confirmed by compound and pathway associations mined from the literature. To further validate the "marker" topics, we tested topic transferability from RNA-Seq to microarrays. The RNA-Seq based gene expression profile of a topic specifically associated with the peroxisome proliferator-activated receptors (PPAR) signaling pathway was used to query samples with similar expression profiles in two different microarray data sets, yielding an accuracy of about 85%. This proof-of-concept study demonstrates the applicability of topic modeling to discover functional modules in RNA-Seq data and suggests a valuable computational tool for leveraging information within TGx data in the RNA-Seq era.
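The analogy between read counts and word counts described above can be sketched as follows. This only illustrates the "sample as document" transformation and stops before LDA; the abstract does not describe the authors' actual preprocessing, and the sample names, gene names and counts are hypothetical.

```python
from collections import Counter


def sample_to_document(read_counts):
    """Expand per-gene read counts into a bag of 'words' (gene identifiers)."""
    document = []
    for gene, count in read_counts.items():
        document.extend([gene] * count)
    return document


def word_by_document_matrix(samples):
    """Build a genes x samples count matrix, as used in text mining."""
    genes = sorted({gene for counts in samples.values() for gene in counts})
    names = sorted(samples)
    matrix = [[samples[name].get(gene, 0) for name in names] for gene in genes]
    return genes, names, matrix


# Hypothetical read counts for two treated samples.
samples = {
    "sample1": {"geneA": 3, "geneB": 2, "geneC": 0},
    "sample2": {"geneA": 0, "geneB": 1, "geneC": 4},
}
doc1 = sample_to_document(samples["sample1"])
genes, names, matrix = word_by_document_matrix(samples)
```

Each column of the matrix is one sample's "document"; a topic model such as LDA would then be fitted to this matrix, with topics playing the role of candidate functional modules.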
Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation

PubMed Central

Faria, José P.; Davis, James J.; Edirisinghe, Janaka N.; Taylor, Ronald C.; Weisenhorn, Pamela; Olson, Robert D.; Stevens, Rick L.; Rocha, Miguel; Rocha, Isabel; Best, Aaron A.; DeJongh, Matthew; Tintle, Nathan L.; Parrello, Bruce; Overbeek, Ross; Henry, Christopher S.

2016-01-01

Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. An important step toward meeting the challenge of understanding gene function and regulation is the identification of sets of genes that are always co-expressed. These gene sets, Atomic Regulons (ARs), represent fundamental units of function within a cell and could be used to associate genes of unknown function with cellular processes and to enable rational genetic engineering of cellular systems. Here, we describe an approach for inferring ARs that leverages large-scale expression data sets, gene context, and functional relationships among genes. We computed ARs for Escherichia coli based on 907 gene expression experiments and compared our results with gene clusters produced by two prevalent data-driven methods: Hierarchical clustering and k-means clustering. We compared ARs and purely data-driven gene clusters to the curated set of regulatory interactions for E. coli found in RegulonDB, showing that ARs are more consistent with gold standard regulons than are data-driven gene clusters. We further examined the consistency of ARs and data-driven gene clusters in the context of gene interactions predicted by Context Likelihood of Relatedness (CLR) analysis, finding that the ARs show better agreement with CLR predicted interactions. We determined the impact of increasing amounts of expression data on AR construction and find that while more data improve ARs, it is not necessary to use the full set of gene expression experiments available for E. coli to produce high quality ARs. In order to explore the conservation of co-regulated gene sets across different organisms, we computed ARs for Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus, each of which represents increasing degrees of phylogenetic distance from E. coli. Comparison of the organism-specific ARs showed that the consistency of AR gene membership correlates with phylogenetic distance, but there is clear variability in the regulatory networks of closely related organisms. As large scale expression data sets become increasingly common for model and non-model organisms, comparative analyses of atomic regulons will provide valuable insights into fundamental regulatory modules used across the bacterial domain. PMID:27933038
Evolutionary Trails of Plant Group II Pyridoxal Phosphate-Dependent Decarboxylase Genes.

PubMed

Kumar, Rahul

2016-01-01

Type II pyridoxal phosphate-dependent decarboxylase (PLP_deC) enzymes play important metabolic roles during nitrogen metabolism. Recent evolutionary profiling of these genes revealed a sharp expansion of histidine decarboxylase genes in the members of the Solanaceae family. In spite of the high sequence homology shared by PLP_deC orthologs, these enzymes display remarkable differences in their substrate specificities. Currently, limited information is available on the gene repertoires and substrate specificities of PLP_deCs, which renders their precise annotation challenging and poses technical challenges in the immediate identification and biochemical characterization of their full gene complements in plants. Herein, we explored their evolutionary trails in a comprehensive manner by taking advantage of high-throughput data accessibility and computational approaches. We discussed the premise that has enabled an improved reconstruction of their evolutionary lineage and evaluated the factors that have so far constrained their rapid functional characterization. We envisage that the information synthesized herein will act as a catalyst for the rapid exploration of their biochemical specificity and physiological roles in more plant species.
MutationAligner: a resource of recurrent mutation hotspots in protein domains in cancer

PubMed Central

Gauthier, Nicholas Paul; Reznik, Ed; Gao, Jianjiong; Sumer, Selcuk Onur; Schultz, Nikolaus; Sander, Chris; Miller, Martin L.

2016-01-01

The MutationAligner web resource, available at http://www.mutationaligner.org, enables discovery and exploration of somatic mutation hotspots identified in protein domains in currently (mid-2015) more than 5000 cancer patient samples across 22 different tumor types. Using multiple sequence alignments of protein domains in the human genome, we extend the principle of recurrence analysis by aggregating mutations in homologous positions across sets of paralogous genes. Protein domain analysis enhances the statistical power to detect cancer-relevant mutations and links mutations to the specific biological functions encoded in domains. We illustrate how the MutationAligner database and interactive web tool can be used to explore, visualize and analyze mutation hotspots in protein domains across genes and tumor types. We believe that MutationAligner will be an important resource for the cancer research community by providing detailed clues for the functional importance of particular mutations, as well as for the design of functional genomics experiments and for decision support in precision medicine. MutationAligner is slated to be periodically updated to incorporate additional analyses and new data from cancer genomics projects. PMID:26590264
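The idea of aggregating mutations at homologous positions across paralogous genes can be sketched as below: residue positions in each gene are mapped to columns of a multiple sequence alignment, and mutation counts are summed per column. This is a toy illustration and not MutationAligner's implementation; the gene names, aligned sequences and mutation positions are hypothetical.

```python
from collections import Counter


def residue_to_column(aligned_seq):
    """Map 1-based residue positions to 0-based alignment columns, skipping gaps."""
    mapping = {}
    residue = 0
    for column, char in enumerate(aligned_seq):
        if char != "-":
            residue += 1
            mapping[residue] = column
    return mapping


def aggregate_mutations(alignment, mutations):
    """Count mutations per alignment column across all genes sharing a domain."""
    counts = Counter()
    for gene, positions in mutations.items():
        column_of = residue_to_column(alignment[gene])
        for position in positions:
            counts[column_of[position]] += 1
    return counts


# Hypothetical aligned domain sequences and per-gene mutated residue positions.
alignment = {
    "GENE1": "MK-LVA",
    "GENE2": "MKTLV-",
    "GENE3": "-KSLVA",
}
mutations = {"GENE1": [3], "GENE2": [4, 4], "GENE3": [3]}
column_counts = aggregate_mutations(alignment, mutations)
```

No single gene here carries more than two mutations at the site, yet the shared alignment column accumulates four, which is how pooling across a domain family can raise the power to detect a hotspot.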
DNA Microarray Analysis of the Expression Profile of Escherichia coli in Response to Treatment with 4,5-Dihydroxy-2-Cyclopenten-1-One

PubMed Central

Phadtare, Sangita; Kato, Ikunoshin; Inouye, Masayori

2002-01-01

We carried out DNA microarray-based global transcript profiling of Escherichia coli in response to 4,5-dihydroxy-2-cyclopenten-1-one to explore the manifestation of its antibacterial activity. We show that it has widespread effects in E. coli affecting genes encoding proteins involved in cell metabolism and membrane synthesis and functions. Genes belonging to the regulon involved in synthesis of Cys are upregulated. In addition, rpoS and RpoS-regulated genes responding to various stresses and a number of genes responding to oxidative stress are upregulated. PMID:12426362
Genome-wide analysis of the Glycerol-3-Phosphate Acyltransferase (GPAT) gene family reveals the evolution and diversification of plant GPATs

PubMed Central

Waschburger, Edgar; Kulcheski, Franceli Rodrigues; Veto, Nicole Moreira; Margis, Rogerio; Margis-Pinheiro, Marcia; Turchetto-Zolet, Andreia Carina

2018-01-01

sn-Glycerol-3-phosphate 1-O-acyltransferase (GPAT) is an important enzyme that catalyzes the transfer of an acyl group from acyl-CoA or acyl-ACP to the sn-1 or sn-2 position of sn-glycerol-3-phosphate (G3P) to generate lysophosphatidic acids (LPAs). Functional studies of GPAT in plants have demonstrated its importance in controlling storage and membrane lipids. Identifying genes encoding GPAT in a variety of plant species is crucial to understand their involvement in different metabolic pathways and physiological functions. Here, we performed genome-wide and evolutionary analyses of GPATs in plants. GPAT genes were identified in all algae and plants studied. The phylogenetic analysis showed that these genes group into three main clades. While clades I (GPAT9) and II (soluble GPAT) include GPATs from algae and plants, clade III (GPAT1-8) includes plant-specific GPATs that are involved in the biosynthesis of cutin or suberin. Gene organization and the expression pattern of GPATs in plants corroborate the clade formation in the phylogeny, suggesting that the evolutionary pattern is reflected in their functionality. Overall, our results provide important insights into the evolution of the plant GPATs and allow us to explore the evolutionary mechanism underlying the functional diversification among these genes. PMID:29583156
The miR-29 family: genomics, cell biology, and relevance to renal and cardiovascular injury.

PubMed

Kriegel, Alison J; Liu, Yong; Fang, Yi; Ding, Xiaoqiang; Liang, Mingyu

2012-02-27

The human miR-29 family of microRNAs has three mature members, miR-29a, miR-29b, and miR-29c. miR-29s are encoded by two gene clusters. Binding sites for several transcriptional factors have been identified in the promoter regions of miR-29 genes. The miR-29 family members share a common seed region sequence and are predicted to target largely overlapping sets of genes. However, the miR-29 family members exhibit differential regulation in several cases and different subcellular distribution, suggesting their functional relevance may not be identical. miR-29s directly target at least 16 extracellular matrix genes, providing a dramatic example of a single microRNA targeting a large group of functionally related genes. Strong antifibrotic effects of miR-29s have been demonstrated in heart, kidney, and other organs. miR-29s have also been shown to be proapoptotic and involved in the regulation of cell differentiation. It remains to be explored how various cellular effects of miR-29s determine functional relevance of miR-29s to specific diseases and how the miR-29 family members may function cooperatively or separately.
Plasticity of genetic interactions in metabolic networks of yeast.

PubMed

Harrison, Richard; Papp, Balázs; Pál, Csaba; Oliver, Stephen G; Delneri, Daniela

2007-02-13

Why are most genes dispensable? The impact of gene deletions may depend on the environment (plasticity), the presence of compensatory mechanisms (mutational robustness), or both. Here, we analyze the interaction between these two forces by exploring the condition-dependence of synthetic genetic interactions that define redundant functions and alternative pathways. We performed systems-level flux balance analysis of the yeast (Saccharomyces cerevisiae) metabolic network to identify genetic interactions and then tested the model's predictions with in vivo gene-deletion studies. We found that the majority of synthetic genetic interactions are restricted to certain environmental conditions, partly because of the lack of compensation under some (but not all) nutrient conditions. Moreover, the phylogenetic cooccurrence of synthetically interacting pairs is not significantly different from random expectation. These findings suggest that these gene pairs have at least partially independent functions, and, hence, compensation is only a byproduct of their evolutionary history. Experimental analyses that used multiple gene deletion strains not only confirmed predictions of the model but also showed that investigation of false predictions may both improve functional annotation within the model and also lead to the discovery of higher-order genetic interactions. Our work supports the view that functional redundancy may be more apparent than real, and it offers a unified framework for the evolution of environmental adaptation and mutational robustness.
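The notion of a condition-dependent synthetic interaction can be illustrated with a small sketch: a gene pair is called synthetic lethal in a condition when each single deletion is viable but the double deletion is not. The viability calls would come from flux balance analysis or from experiments; here they are simply hypothetical inputs, as are the condition names and the function name.

```python
def synthetic_lethal_conditions(viability):
    """Return the conditions in which a gene pair (a, b) is synthetic lethal.

    viability[condition] maps 'a', 'b' and 'ab' to True (viable) or False
    for the two single deletions and the double deletion, respectively.
    """
    return [
        condition
        for condition, calls in viability.items()
        if calls["a"] and calls["b"] and not calls["ab"]
    ]


# Hypothetical viability calls for one gene pair under three nutrient conditions.
viability = {
    "glucose": {"a": True, "b": True, "ab": False},
    "ethanol": {"a": True, "b": True, "ab": True},
    "glycerol": {"a": False, "b": True, "ab": False},
}
conditions = synthetic_lethal_conditions(viability)
```

In this toy case the pair interacts only on glucose: on ethanol the double mutant survives, and on glycerol the loss of viability is already explained by a single deletion.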
Robustness, evolvability, and the logic of genetic regulation.

PubMed

Payne, Joshua L; Moore, Jason H; Wagner, Andreas

2014-01-01

In gene regulatory circuits, the expression of individual genes is commonly modulated by a set of regulating gene products, which bind to a gene's cis-regulatory region. This region encodes an input-output function, referred to as signal-integration logic, that maps a specific combination of regulatory signals (inputs) to a particular expression state (output) of a gene. The space of all possible signal-integration functions is vast and the mapping from input to output is many-to-one: For the same set of inputs, many functions (genotypes) yield the same expression output (phenotype). Here, we exhaustively enumerate the set of signal-integration functions that yield identical gene expression patterns within a computational model of gene regulatory circuits. Our goal is to characterize the relationship between robustness and evolvability in the signal-integration space of regulatory circuits, and to understand how these properties vary between the genotypic and phenotypic scales. Among other results, we find that the distributions of genotypic robustness are skewed, so that the majority of signal-integration functions are robust to perturbation. We show that the connected set of genotypes that make up a given phenotype are constrained to specific regions of the space of all possible signal-integration functions, but that as the distance between genotypes increases, so does their capacity for unique innovations. In addition, we find that robust phenotypes are (i) evolvable, (ii) easily identified by random mutation, and (iii) mutationally biased toward other robust phenotypes. We explore the implications of these latter observations for mutation-based evolution by conducting random walks between randomly chosen source and target phenotypes. We demonstrate that the time required to identify the target phenotype is independent of the properties of the source phenotype.
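The many-to-one mapping from signal-integration functions to expression outputs can be illustrated with a toy enumeration. The sketch below is not the authors' circuit model: it assumes, purely for illustration, that a genotype is the truth table of a Boolean function of k inputs, that the phenotype is the output on a hypothetical subset of input combinations that actually occur (`observed`), and that robustness is the fraction of single-entry flips that leave the phenotype unchanged.

```python
from itertools import product

def enumerate_functions(k):
    """Yield every Boolean function of k inputs as a truth-table tuple of length 2**k."""
    return product((0, 1), repeat=2 ** k)

def phenotype(table, observed_rows):
    """Output of the function restricted to the input rows that actually occur."""
    return tuple(table[r] for r in observed_rows)

def robustness(table, observed_rows):
    """Fraction of single-entry flips of the truth table that keep the phenotype."""
    base = phenotype(table, observed_rows)
    neutral = 0
    for i in range(len(table)):
        mutant = table[:i] + (1 - table[i],) + table[i + 1:]
        if phenotype(mutant, observed_rows) == base:
            neutral += 1
    return neutral / len(table)

k = 2
observed = (0, 3)  # assumption: only input combinations 00 and 11 ever occur

# Group genotypes (truth tables) by the phenotype they produce.
groups = {}
for table in enumerate_functions(k):
    groups.setdefault(phenotype(table, observed), []).append(table)

for pheno, genotypes in sorted(groups.items()):
    mean_rob = sum(robustness(g, observed) for g in genotypes) / len(genotypes)
    print(pheno, len(genotypes), mean_rob)
```

In this toy setting every phenotype is produced by several genotypes, and flips in the unobserved rows are neutral; the published model is considerably richer.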
Integrated annotation and analysis of in situ hybridization images using the ImAnno system: application to the ear and sensory organs of the fetal mouse.

PubMed

Romand, Raymond; Ripp, Raymond; Poidevin, Laetitia; Boeglin, Marcel; Geffers, Lars; Dollé, Pascal; Poch, Olivier

2015-01-01

An in situ hybridization (ISH) study was performed on 2000 murine genes representing around 10% of the protein-coding genes present in the mouse genome using data generated by the EURExpress consortium. This study was carried out in 25 tissues of late gestation embryos (E14.5), with a special emphasis on the developing ear and on five distinct developing sensory organs, including the cochlea, the vestibular receptors, the sensory retina, the olfactory organ, and the vibrissae follicles. The results obtained from an analysis of more than 11,000 micrographs have been integrated in a newly developed knowledgebase, called ImAnno. In addition to managing the multilevel micrograph annotations performed by human experts, ImAnno provides public access to various integrated databases and tools. Thus, it facilitates the analysis of complex ISH gene expression patterns, as well as functional annotation and interaction of gene sets. It also provides direct links to human pathways and diseases. Hierarchical clustering of expression patterns in the 25 tissues revealed three main branches corresponding to tissues with common functions and/or embryonic origins. To illustrate the integrative power of ImAnno, we explored the expression, function and disease traits of the sensory epithelia of the five presumptive sensory organs. The study identified 623 genes (out of 2000) concomitantly expressed in the five embryonic epithelia, among which many (∼12%) were involved in human disorders. Finally, various multilevel interaction networks were characterized, highlighting differential functional enrichments of directly or indirectly interacting genes. These analyses reveal an under-representation of "sensory" functions in the sensory gene set, suggesting that E14.5 is a pivotal stage between the developmental stage and the functional phase that will be fully reached only after birth.

The Reconstruction and Analysis of Gene Regulatory Networks.

PubMed

Zheng, Guangyong; Huang, Tao

2018-01-01

In the post-genomic era, an important task is to explore the function of individual biological molecules (i.e., genes, noncoding RNAs, proteins, metabolites) and their organization in living cells. To this end, gene regulatory networks (GRNs) are constructed to show the relationships between biological molecules, in which the vertices of the network denote biological molecules and the edges represent connections between them (Strogatz, Nature 410:268-276, 2001; Bray, Science 301:1864-1865, 2003). By interpreting GRNs, biologists can understand not only the function of biological molecules but also the organization of the components of living cells, since a gene regulatory network is a comprehensive physiological map of living cells and reflects the influence of genetic and epigenetic factors (Strogatz, Nature 410:268-276, 2001; Bray, Science 301:1864-1865, 2003). In this paper, we review the inference methods for GRN reconstruction and the approaches for analyzing network structure. Because the network method is a powerful tool for studying complex diseases and biological processes, its applications in pathway analysis and disease gene identification are also introduced.
Text mining and network analysis to find functional associations of genes in high altitude diseases.

PubMed

Bhasuran, Balu; Subramanian, Devika; Natarajan, Jeyakumar

2018-05-02

Travel to elevations above 2500 m is associated with the risk of developing one or more forms of acute altitude illness such as acute mountain sickness (AMS), high altitude cerebral edema (HACE) or high altitude pulmonary edema (HAPE). Our work aims to identify the functional association of genes involved in high altitude diseases. In this work we identified the gene networks responsible for high altitude diseases by using the principle of gene co-occurrence statistics from literature and network analysis. First, we mined the literature data from PubMed on high-altitude diseases, and extracted the co-occurring gene pairs. Next, gene pairs were ranked by their co-occurrence frequency. Finally, a gene association network was created using statistical measures to explore potential relationships. Network analysis results revealed that EPO, ACE, IL6 and TNF are among the top five genes that were found to co-occur with 20 or more genes, while the association between the EPAS1 and EGLN1 genes is strongly substantiated. The network constructed in this study proposes a large number of genes that work in toto in high altitude conditions. Overall, the result provides a good reference for further study of the genetic relationships in high altitude diseases. Copyright © 2018 Elsevier Ltd. All rights reserved.
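The mining steps described above (extract co-occurring gene pairs, rank them by frequency, then look at how many partners each gene has) can be sketched in a few lines. This is a minimal illustration, not the authors' pipeline: the four "documents" are invented toy gene sets, and the function names `cooccurrence_counts` and `partner_counts` are hypothetical.

```python
from collections import Counter
from itertools import combinations

def cooccurrence_counts(gene_sets):
    """Count, for every unordered gene pair, the documents mentioning both genes."""
    counts = Counter()
    for genes in gene_sets:
        for a, b in combinations(sorted(set(genes)), 2):
            counts[(a, b)] += 1
    return counts

def partner_counts(counts, min_count=1):
    """Number of distinct co-occurring partners per gene (the node degree)."""
    partners = {}
    for (a, b), n in counts.items():
        if n >= min_count:
            partners.setdefault(a, set()).add(b)
            partners.setdefault(b, set()).add(a)
    return {gene: len(p) for gene, p in partners.items()}

# Toy input: each set stands for the genes mentioned in one abstract (invented).
docs = [
    {"EPAS1", "EGLN1"},
    {"EPAS1", "EGLN1", "EPO"},
    {"EPO", "ACE"},
    {"IL6", "TNF", "EPO"},
]

counts = cooccurrence_counts(docs)
ranked = counts.most_common()      # gene pairs ranked by co-occurrence frequency
degrees = partner_counts(counts)   # how many genes each gene co-occurs with
print(ranked[0], degrees["EPO"])
```

A real analysis would also need gene-name recognition in the text and a statistical measure of association rather than raw counts, neither of which is shown here.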
Ortholog-based screening and identification of genes related to intracellular survival.

PubMed

Yang, Xiaowen; Wang, Jiawei; Bing, Guoxia; Bie, Pengfei; De, Yanyan; Lyu, Yanli; Wu, Qingmin

2018-04-20

Bioinformatics and comparative genomics analysis methods were used to predict unknown pathogen genes based on homology with identified or functionally clustered genes. In this study, the genes of common pathogens were analyzed to screen and identify genes associated with intracellular survival through sequence similarity, phylogenetic tree analysis and the λ-Red recombination system test method. The total 38,952 protein-coding genes of common pathogens were divided into 19,775 clusters. As demonstrated through a COG analysis, information storage and processing genes might play an important role in intracellular survival. Only 19 clusters were present in facultative intracellular pathogens, and not all were present in extracellular pathogens. Construction of a phylogenetic tree selected 18 of these 19 clusters. Comparisons with the DEG database and previous research revealed that seven other clusters are considered essential gene clusters and that seven other clusters are associated with intracellular survival. Moreover, this study confirmed that clusters screened by orthologs with similar function could be replaced with an approved uvrY gene and its orthologs, and the results revealed that the usg gene is associated with intracellular survival. The study improves the current understanding of the characteristics of intracellular pathogens and allows further exploration of the intracellular survival-related gene modules in these pathogens. Copyright © 2018. Published by Elsevier B.V.
Analyzing gene expression data in mice with the Neuro Behavior Ontology.

PubMed

Hoehndorf, Robert; Hancock, John M; Hardy, Nigel W; Mallon, Ann-Marie; Schofield, Paul N; Gkoutos, Georgios V

2014-02-01

We have applied the Neuro Behavior Ontology (NBO), an ontology for the annotation of behavioral gene functions and behavioral phenotypes, to the annotation of more than 1,000 genes in the mouse that are known to play a role in behavior. These annotations can be explored by researchers interested in genes involved in particular behaviors and used computationally to provide insights into the behavioral phenotypes resulting from differences in gene expression. We developed the OntoFUNC tool and have applied it to enrichment analyses over the NBO to provide high-level behavioral interpretations of gene expression datasets. The resulting increase in the number of gene annotations facilitates the identification of behavioral or neurologic processes by assisting the formulation of hypotheses about the relationships between genes, processes, and phenotypic manifestations resulting from behavioral observations.
Genome-wide genetic variation and comparison of fruit-associated traits between kumquat (Citrus japonica) and Clementine mandarin (Citrus clementina).

PubMed

Liu, Tian-Jia; Li, Yong-Ping; Zhou, Jing-Jing; Hu, Chun-Gen; Zhang, Jin-Zhi

2018-03-01

The comprehensive genetic variation of two citrus species was analyzed at the genome and transcriptome levels. A total of 1090 differentially expressed genes were found during fruit development by RNA-sequencing. Fruit size (fruit equatorial diameter) and weight (fresh weight) are the two most important components determining yield and consumer acceptability for many horticultural crops. However, little is known about the genetic control of these traits. Here, we performed whole-genome resequencing to reveal the comprehensive genetic variation of fruit development between kumquat (Citrus japonica) and Clementine mandarin (Citrus clementina). In total, 5,865,235 single-nucleotide polymorphisms (SNPs) and 414,447 insertions/deletions (InDels) were identified in the two citrus species. Based on integrative analysis of the genome and transcriptome of fruit, 640,801 SNPs and 20,733 InDels were identified. The features, genomic distribution, functional effect, and other characteristics of these genetic variations were explored. RNA-sequencing identified 1090 differentially expressed genes (DEGs) during fruit development of kumquat and Clementine mandarin. Gene Ontology analysis revealed that these genes were involved in various molecular functions and biological processes. In addition, the genetic variation of 939 DEGs and 74 multiple fruit development pathway genes from previous reports was also identified. A global survey identified 24,237 specific alternative splicing events in the two citrus species and showed that intron retention is the most prevalent pattern of alternative splicing. These genome variation data provide a foundation for further exploration of citrus diversity and gene-phenotype relationships and for future research on molecular breeding to improve kumquat, Clementine mandarin and related species.
ChIPBase v2.0: decoding transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data.

PubMed

Zhou, Ke-Ren; Liu, Shun; Sun, Wen-Ju; Zheng, Ling-Ling; Zhou, Hui; Yang, Jian-Hua; Qu, Liang-Hu

2017-01-04

The abnormal transcriptional regulation of non-coding RNAs (ncRNAs) and protein-coding genes (PCGs) contributes to various biological processes and is linked with human diseases, but the underlying mechanisms remain elusive. In this study, we developed ChIPBase v2.0 (http://rna.sysu.edu.cn/chipbase/) to explore the transcriptional regulatory networks of ncRNAs and PCGs. ChIPBase v2.0 has been expanded with ∼10 200 curated ChIP-seq datasets, which represents about a 20-fold expansion compared with the previously released version. We identified thousands of binding motif matrices and their binding sites from ChIP-seq data of DNA-binding proteins and predicted millions of transcriptional regulatory relationships between transcription factors (TFs) and genes. We constructed the 'Regulator' module to predict hundreds of TFs and histone modifications that were involved in or affected transcription of ncRNAs and PCGs. Moreover, we built a web-based tool, Co-Expression, to explore the co-expression patterns between DNA-binding proteins and various types of genes by integrating the gene expression profiles of ∼10 000 tumor samples and ∼9100 normal tissues and cell lines. ChIPBase also provides a ChIP-Function tool and a genome browser to predict functions of diverse genes and visualize various ChIP-seq data. This study will greatly expand our understanding of the transcriptional regulation of ncRNAs and PCGs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Computational genetic neuroanatomy of the developing mouse brain: dimensionality reduction, visualization, and clustering.

PubMed

Ji, Shuiwang

2013-07-11

The structured organization of cells in the brain plays a key role in its functional efficiency. This delicate organization is the consequence of unique molecular identity of each cell gradually established by precise spatiotemporal gene expression control during development. Currently, studies on the molecular-structural association are beginning to reveal how the spatiotemporal gene expression patterns are related to cellular differentiation and structural development. In this article, we aim at a global, data-driven study of the relationship between gene expressions and neuroanatomy in the developing mouse brain. To enable visual explorations of the high-dimensional data, we map the in situ hybridization gene expression data to a two-dimensional space by preserving both the global and the local structures. Our results show that the developing brain anatomy is largely preserved in the reduced gene expression space. To provide a quantitative analysis, we cluster the reduced data into groups and measure the consistency with neuroanatomy at multiple levels. Our results show that the clusters in the low-dimensional space are more consistent with neuroanatomy than those in the original space. Gene expression patterns and developing brain anatomy are closely related. Dimensionality reduction and visual exploration facilitate the study of this relationship.
Trehalose metabolism genes render rice white tip nematode Aphelenchoides besseyi (Nematoda: Aphelenchoididae) resistant to an anaerobic environment

PubMed Central

Chen, Qiaoli; Zhang, Ruizhi; Ling, Yaming

2018-01-01

After experiencing anaerobic environments, Aphelenchoides besseyi will enter a state of suspended animation known as anoxybiosis, during which it may use trehalose as an energy supply to survive. To explore the function of trehalose metabolism, two trehalose-6-phosphate synthase (TPS) genes (Ab-tps1 and Ab-tps2) encoding enzymes catalysing trehalose synthesis, and three trehalase (TRE) genes (Ab-ntre1, Ab-ntre2 and Ab-atre) encoding enzymes catalysing the hydrolysis of trehalose, were identified and investigated. Ab-tps1 and Ab-tps2 were active during certain periods of anoxybiosis for A. besseyi, and Ab-tps2, Ab-ntre1, Ab-ntre2 and Ab-atre were active during certain periods of recovery. The results of RNA interference experiments suggested that TRE genes regulated each other and both TPS genes, while a single TPS gene only regulated the other TPS gene. However, two TPS genes together could regulate TRE genes, which indicated a feedback mechanism between these genes. All these genes also positively regulated the survival and resumption of active metabolism of the nematode. Genes functioning at re-aeration have a greater impact on nematode survival, suggesting that these genes could play roles in anoxybiosis regulation, but may function within restricted time frames. Changes in trehalose levels matched changes in TRE activity during the anoxybiosis–re-aeration process, suggesting that trehalose may act as an energy supply source. The observation of up-regulation of TPS genes during anoxybiosis suggested a possible signal role of trehalose. Trehalose metabolism genes could also work together to control trehalose levels at a certain level when the nematode is under anaerobic conditions. PMID:29158222
ISAAC - InterSpecies Analysing Application using Containers.

PubMed

Baier, Herbert; Schultz, Jörg

2014-01-15

Information about genes, transcripts and proteins is spread over a wide variety of databases. Different tools have been developed using these databases to identify biological signals in gene lists from large scale analysis. Mostly, they search for enrichments of specific features. However, these tools do not allow an explorative walk through different views, nor do they allow the gene lists to be changed as new stories emerge. To fill this niche, we have developed ISAAC, the InterSpecies Analysing Application using Containers. The central idea of this web based tool is to enable the analysis of sets of genes, transcripts and proteins under different biological viewpoints and to interactively modify these sets at any point of the analysis. Detailed history and snapshot information allows tracing each action. Furthermore, one can easily switch back to previous states and perform new analyses. Currently, sets can be viewed in the context of genomes, protein functions, protein interactions, pathways, regulation, diseases and drugs. Additionally, users can switch between species with an automatic, orthology based translation of existing gene sets. As today's research is usually performed in larger teams and consortia, ISAAC provides group based functionalities. Here, sets as well as results of analyses can be exchanged between members of groups. ISAAC fills the gap between primary databases and tools for the analysis of large gene lists. With its highly modular, JavaEE based design, the implementation of new modules is straightforward. Furthermore, ISAAC comes with an extensive web-based administration interface including tools for the integration of third party data. Thus, a local installation is easily feasible. In summary, ISAAC is tailor-made for highly explorative interactive analyses of gene, transcript and protein sets in a collaborative environment.
Extending bicluster analysis to annotate unclassified ORFs and predict novel functional modules using expression data

PubMed Central

Bryan, Kenneth; Cunningham, Pádraig

2008-01-01

Background: Microarrays have the capacity to measure the expression of thousands of genes in parallel over many experimental samples. The unsupervised classification technique of bicluster analysis has been employed previously to uncover gene expression correlations over subsets of samples with the aim of providing a more accurate model of the natural gene functional classes. This approach also has the potential to aid functional annotation of unclassified open reading frames (ORFs). Until now this aspect of biclustering has been under-explored. In this work we illustrate how bicluster analysis may be extended into a 'semi-supervised' ORF annotation approach referred to as BALBOA. Results: The efficacy of the BALBOA ORF classification technique is first assessed via cross validation and compared to a multi-class k-Nearest Neighbour (kNN) benchmark across three independent gene expression datasets. BALBOA is then used to assign putative functional annotations to unclassified yeast ORFs. These predictions are evaluated using existing experimental and protein sequence information. Lastly, we employ a related semi-supervised method to predict the presence of novel functional modules within yeast. Conclusion: In this paper we demonstrate how unsupervised classification methods, such as bicluster analysis, may be extended using available annotations to form semi-supervised approaches within the gene expression analysis domain. We show that such methods have the potential to improve upon supervised approaches and shed new light on the functions of unclassified ORFs and their co-regulation. PMID:18831786
Comprehensive analysis of coding-lncRNA gene co-expression network uncovers conserved functional lncRNAs in zebrafish.

PubMed

Chen, Wen; Zhang, Xuan; Li, Jing; Huang, Shulan; Xiang, Shuanglin; Hu, Xiang; Liu, Changning

2018-05-09

Zebrafish is a fully developed model system for studying developmental processes and human disease. Recent deep-sequencing studies have discovered a large number of long non-coding RNAs (lncRNAs) in zebrafish. However, only a few of them have been functionally characterized. Therefore, how to take advantage of the mature zebrafish system to investigate lncRNA function and conservation in depth is an intriguing question. We systematically collected and analyzed a series of zebrafish RNA-seq data, then combined them with resources from known databases and the literature. As a result, we obtained by far the most complete dataset of zebrafish lncRNAs, containing 13,604 lncRNA genes (21,128 transcripts) in total. Based on that, a co-expression network of zebrafish coding and lncRNA genes was constructed and analyzed, and used to predict the Gene Ontology (GO) and KEGG annotations of lncRNAs. Meanwhile, we performed a conservation analysis of zebrafish lncRNAs, identifying 1828 conserved zebrafish lncRNA genes (1890 transcripts) that have putative mammalian orthologs. We also found that zebrafish lncRNAs play important roles in regulating the development and function of the nervous system; these conserved lncRNAs show significant sequence and functional conservation with their mammalian counterparts. Through integrative data analysis and construction of the coding-lncRNA gene co-expression network, we obtained the most comprehensive dataset of zebrafish lncRNAs to date, together with their systematic annotations and comprehensive analyses of function and conservation. Our study provides a reliable zebrafish-based platform to explore lncRNA function and mechanism in depth, as well as the lncRNA commonality between zebrafish and human.
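The idea of predicting lncRNA annotations from a coding-lncRNA co-expression network can be sketched as "guilt by association": link genes whose expression profiles correlate strongly, then let an unannotated lncRNA inherit the annotations of its coding neighbours. The sketch below is an assumption-laden toy, not the study's method: the Pearson threshold of 0.9, the gene names, the expression values and the annotation label are all invented for illustration.

```python
from collections import Counter
from itertools import combinations
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def coexpression_edges(profiles, threshold=0.9):
    """Return gene pairs whose absolute correlation reaches the threshold."""
    edges = {}
    for g1, g2 in combinations(sorted(profiles), 2):
        r = pearson(profiles[g1], profiles[g2])
        if abs(r) >= threshold:
            edges[(g1, g2)] = r
    return edges

def predict_annotations(gene, edges, annotations):
    """Tally the annotations carried by the annotated neighbours of a gene."""
    votes = Counter()
    for g1, g2 in edges:
        if gene in (g1, g2):
            neighbour = g2 if g1 == gene else g1
            votes.update(annotations.get(neighbour, ()))
    return votes

# Invented expression profiles over four samples.
profiles = {
    "coding1": [1.0, 2.0, 3.0, 4.0],
    "coding2": [4.0, 1.0, 3.0, 2.0],
    "lnc1": [2.0, 4.0, 6.0, 8.0],
}
annotations = {"coding1": {"hypothetical_term"}}  # placeholder label

edges = coexpression_edges(profiles)
votes = predict_annotations("lnc1", edges, annotations)
print(edges, votes)
```

Here only `coding1` and `lnc1` are linked, so `lnc1` receives one vote for the placeholder term; a real analysis would add significance testing and an enrichment step.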
Effective Feature Selection for Classification of Promoter Sequences.

PubMed

K, Kouser; P G, Lavanya; Rangarajan, Lalitha; K, Acharya Kshitish

2016-01-01

Exploring novel computational methods for making sense of biological data has been not only a necessity, but also productive. A part of this trend is the search for more efficient in silico methods/tools for analysis of promoters, which are parts of DNA sequences that are involved in regulation of expression of genes into other functional molecules. Promoter regions vary greatly in their function based on the sequence of nucleotides and the arrangement of short protein-binding regions called motifs. In fact, the regulatory nature of the promoters seems to be largely driven by the selective presence and/or the arrangement of these motifs. Here, we explore computational classification of promoter sequences based on the pattern of motif distributions, as such classification can pave a new way for functional analysis of promoters and for discovering the functionally crucial motifs. We make use of Position Specific Motif Matrix (PSMM) features for exploring the possibility of accurately classifying promoter sequences using some of the popular classification techniques. The classification results on the complete feature set are low, perhaps due to the huge number of features. We propose two ways of reducing features. Our test results show improvement in the classification output after the reduction of features. The results also show that decision trees outperform SVM (Support Vector Machine), KNN (K Nearest Neighbor) and the ensemble classifier LibD3C, particularly with reduced features. The proposed feature selection methods outperform some of the popular feature transformation methods such as PCA and SVD. Also, the methods proposed are as accurate as MRMR (a feature selection method) but much faster than MRMR. Such methods could be useful to categorize new promoters and explore regulatory mechanisms of gene expression in complex eukaryotic species.
Metatranscriptomics of Soil Eukaryotic Communities.

PubMed

Yadav, Rajiv K; Bragalini, Claudia; Fraissinet-Tachet, Laurence; Marmeisse, Roland; Luis, Patricia

2016-01-01

Functions expressed by eukaryotic organisms in soil can be specifically studied by analyzing the pool of eukaryotic-specific polyadenylated mRNA directly extracted from environmental samples. In this chapter, we describe two alternative protocols for the extraction of high-quality RNA from soil samples. Total soil RNA or mRNA can be converted to cDNA for direct high-throughput sequencing. Polyadenylated mRNA-derived full-length cDNAs can also be cloned in expression plasmid vectors to constitute soil cDNA libraries, which can be subsequently screened for functional gene categories. Alternatively, the diversity of specific gene families can also be explored following cDNA sequence capture using exploratory oligonucleotide probes.
Voltage-gated Na+ Channel Activity Increases Colon Cancer Transcriptional Activity and Invasion Via Persistent MAPK Signaling

NASA Astrophysics Data System (ADS)

House, Carrie D.; Wang, Bi-Dar; Ceniccola, Kristin; Williams, Russell; Simaan, May; Olender, Jacqueline; Patel, Vyomesh; Baptista-Hon, Daniel T.; Annunziata, Christina M.; Silvio Gutkind, J.; Hales, Tim G.; Lee, Norman H.

2015-06-01

Functional expression of voltage-gated Na+ channels (VGSCs) has been demonstrated in multiple cancer cell types where channel activity induces invasive activity. The signaling mechanisms by which VGSCs promote oncogenesis remain poorly understood. We explored the signal transduction process critical to VGSC-mediated invasion on the basis of reports linking channel activity to gene expression changes in excitable cells. Coincidentally, many genes transcriptionally regulated by the SCN5A isoform in colon cancer have an over-representation of cis-acting sites for transcription factors phosphorylated by ERK1/2 MAPK. We hypothesized that VGSC activity promotes MAPK activation to induce transcriptional changes in invasion-related genes. Using pharmacological inhibitors/activators and siRNA-mediated gene knockdowns, we correlated channel activity with Rap1-dependent persistent MAPK activation in the SW620 human colon cancer cell line. We further demonstrated that VGSC activity induces downstream changes in invasion-related gene expression via a PKA/ERK/c-JUN/ELK-1/ETS-1 transcriptional pathway. This is the first study illustrating a molecular mechanism linking functional activity of VGSCs to transcriptional activation of invasion-related genes.
Genetic investigation of 100 heart genes in sudden unexplained death victims in a forensic setting

PubMed Central

Christiansen, Sofie Lindgren; Hertz, Christin Løth; Ferrero-Miliani, Laura; Dahl, Morten; Weeke, Peter Ejvin; LuCamp; Ottesen, Gyda Lolk; Frank-Hansen, Rune; Bundgaard, Henning; Morling, Niels

2016-01-01

In forensic medicine, one-third of the sudden deaths remain unexplained after medico-legal autopsy. A major proportion of these sudden unexplained deaths (SUD) are considered to be caused by inherited cardiac diseases. Sudden cardiac death (SCD) may be the first manifestation of these diseases. The purpose of this study was to explore the yield of next-generation sequencing of genes associated with SCD in a cohort of SUD victims. We investigated 100 genes associated with cardiac diseases in 61 young (1–50 years) SUD cases. DNA was captured with the Haloplex target enrichment system and sequenced using an Illumina MiSeq. The identified genetic variants were evaluated and classified as likely, unknown or unlikely to have a functional effect. The criteria for this classification were based on the literature, databases, conservation and prediction of the effect of the variant. We found that 21 (34%) individuals carried variants with a likely functional effect. Ten (40%) of these variants were located in genes associated with cardiomyopathies and 15 (60%) of the variants in genes associated with cardiac channelopathies. Nineteen individuals carried variants with unknown functional effect. Our findings indicate that broad genetic investigation of SUD victims increases the diagnostic outcome, and the investigation should comprise genes involved in both cardiomyopathies and cardiac channelopathies. PMID:27650965
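The percentages reported above can be checked with a few lines of arithmetic. The counts are taken from the abstract; the reading that the 40% and 60% figures are shares of the 10 + 15 = 25 likely-functional variants (rather than of the 21 individuals) is an interpretation, and the variable names are illustrative.

```python
# Counts as stated in the abstract.
cases_total = 61               # young SUD cases investigated
cases_with_likely_variant = 21
variants_cardiomyopathy = 10
variants_channelopathy = 15

# Share of individuals carrying a variant with likely functional effect.
share_cases = 100 * cases_with_likely_variant / cases_total

# Shares of the likely-functional variants by disease group (interpretation).
variants_total = variants_cardiomyopathy + variants_channelopathy
share_cardiomyopathy = 100 * variants_cardiomyopathy / variants_total
share_channelopathy = 100 * variants_channelopathy / variants_total

print(round(share_cases), variants_total, share_cardiomyopathy, share_channelopathy)
```

Under this reading the figures are internally consistent: 21/61 rounds to 34%, and 10 and 15 out of 25 variants give 40% and 60%.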
Pan- and core- network analysis of co-expression genes in a model plant

DOE PAGES

He, Fei; Maslov, Sergei

2016-12-16

Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ and ‘core-network’ representing union and intersection between a sizeable fraction of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.
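One way to picture the pan-network and core-network is as relaxed union and intersection operations over the edge sets of the individual networks. The sketch below assumes, for illustration only, that an edge joins the pan-network when it appears in at least a small fraction of networks and the core-network when it appears in at least a large fraction; the thresholds (0.25 and 0.75), the gene labels and the function name `pan_and_core` are hypothetical, not taken from the paper.

```python
from collections import Counter

def edge(a, b):
    """Undirected edge between two genes."""
    return frozenset((a, b))

def pan_and_core(networks, pan_fraction, core_fraction):
    """Split edges by the fraction of networks in which they occur."""
    support = Counter()
    for net in networks:
        support.update(set(net))  # count each edge once per network
    n = len(networks)
    pan = {e for e, c in support.items() if c / n >= pan_fraction}
    core = {e for e, c in support.items() if c / n >= core_fraction}
    return pan, core

# Four toy coexpression networks, each given as a set of edges.
networks = [
    {edge("A", "B"), edge("B", "C")},
    {edge("A", "B"), edge("C", "D")},
    {edge("A", "B"), edge("B", "C")},
    {edge("A", "B")},
]

pan, core = pan_and_core(networks, pan_fraction=0.25, core_fraction=0.75)
print(len(pan), len(core))
```

With these toy inputs the pan-network keeps all three edges while the core-network keeps only the A-B edge, which is present in every network.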
Pan- and core- network analysis of co-expression genes in a model plant

DOE Office of Scientific and Technical Information (OSTI.GOV)

He, Fei; Maslov, Sergei

Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ and ‘core-network’ representing union and intersection between a sizeable fraction of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.
Genetic investigation of 100 heart genes in sudden unexplained death victims in a forensic setting.

PubMed

Christiansen, Sofie Lindgren; Hertz, Christin Løth; Ferrero-Miliani, Laura; Dahl, Morten; Weeke, Peter Ejvin; LuCamp; Ottesen, Gyda Lolk; Frank-Hansen, Rune; Bundgaard, Henning; Morling, Niels

2016-12-01

In forensic medicine, one-third of the sudden deaths remain unexplained after medico-legal autopsy. A major proportion of these sudden unexplained deaths (SUD) are considered to be caused by inherited cardiac diseases. Sudden cardiac death (SCD) may be the first manifestation of these diseases. The purpose of this study was to explore the yield of next-generation sequencing of genes associated with SCD in a cohort of SUD victims. We investigated 100 genes associated with cardiac diseases in 61 young (1-50 years) SUD cases. DNA was captured with the Haloplex target enrichment system and sequenced using an Illumina MiSeq. The identified genetic variants were evaluated and classified as likely, unknown or unlikely to have a functional effect. The criteria for this classification were based on the literature, databases, conservation and prediction of the effect of the variant. We found that 21 (34%) individuals carried variants with a likely functional effect. Ten (40%) of these variants were located in genes associated with cardiomyopathies and 15 (60%) of the variants in genes associated with cardiac channelopathies. Nineteen individuals carried variants with unknown functional effect. Our findings indicate that broad genetic investigation of SUD victims increases the diagnostic outcome, and the investigation should comprise genes involved in both cardiomyopathies and cardiac channelopathies.
CRISPR-mediated genotypic and phenotypic correction of a chronic granulomatous disease mutation in human iPS cells.

PubMed

Flynn, Rowan; Grundmann, Alexander; Renz, Peter; Hänseler, Walther; James, William S; Cowley, Sally A; Moore, Michael D

2015-10-01

Chronic granulomatous disease (CGD) is a rare genetic disease characterized by severe and persistent childhood infections. It is caused by the lack of an antipathogen oxidative burst, normally performed by phagocytic cells to contain and clear bacterial and fungal growth. Restoration of immune function can be achieved with heterologous bone marrow transplantation; however, autologous bone marrow transplantation would be a preferable option. Thus, a method is required to recapitulate the function of the diseased gene within the patient's own cells. Gene therapy approaches for CGD have employed randomly integrating viruses with concomitant issues of insertional mutagenesis, inaccurate gene dosage, and gene silencing. Here, we explore the potential of the recently described clustered regularly interspaced short palindromic repeat (CRISPR)-Cas9 site-specific nuclease system to encourage repair of the endogenous gene by enhancing the levels of homologous recombination. Using induced pluripotent stem cells derived from a CGD patient containing a single intronic mutation in the CYBB gene, we show that footprintless gene editing is a viable option to correct disease mutations. Gene correction results in restoration of oxidative burst function in iPS-derived phagocytes by reintroduction of a previously skipped exon in the cytochrome b-245 heavy chain (CYBB) protein. This study provides proof-of-principle for a gene therapy approach to CGD treatment using CRISPR-Cas9. Copyright © 2015 ISEH - International Society for Experimental Hematology. Published by Elsevier Inc. All rights reserved.
Perspectives: Gene Expression in Fisheries Management

USGS Publications Warehouse

Nielsen, Jennifer L.; Pavey, Scott A.

2010-01-01

Functional genes and gene expression have been connected to physiological traits linked to effective production and broodstock selection in aquaculture, selective implications of commercial fish harvest, and adaptive changes reflected in non-commercial fish populations subject to human disturbance and climate change. Gene mapping using single nucleotide polymorphisms (SNPs) to identify functional genes, gene expression (analogue microarrays and real-time PCR), and digital sequencing technologies looking at RNA transcripts present new concepts and opportunities in support of effective and sustainable fisheries. Genomic tools have been rapidly growing in aquaculture research addressing aspects of fish health, toxicology, and early development. Genomic technologies linking effects in functional genes involved in growth, maturation and life history development have been tied to selection resulting from harvest practices. Incorporating new and ever-increasing knowledge of fish genomes is opening a different perspective on local adaptation that will prove invaluable in wild fish conservation and management. Conservation of fish stocks is rapidly incorporating research on critical adaptive responses directed at the effects of human disturbance and climate change through gene expression studies. Genomic studies of fish populations can be generally grouped into three broad categories: 1) evolutionary genomics and biodiversity; 2) adaptive physiological responses to a changing environment; and 3) adaptive behavioral genomics and life history diversity. We review current genomic research in fisheries focusing on those that use microarrays to explore differences in gene expression among phenotypes and within or across populations, information that is critically important to the conservation of fish and their relationship to humans.

Functional organization of the transcriptome in human brain

PubMed Central

Oldham, Michael C; Konopka, Genevieve; Iwamoto, Kazuya; Langfelder, Peter; Kato, Tadafumi; Horvath, Steve; Geschwind, Daniel H

2009-01-01

The enormous complexity of the human brain ultimately derives from a finite set of molecular instructions encoded in the human genome. These instructions can be directly studied by exploring the organization of the brain’s transcriptome through systematic analysis of gene coexpression relationships. We analyzed gene coexpression relationships in microarray data generated from specific human brain regions and identified modules of coexpressed genes that correspond to neurons, oligodendrocytes, astrocytes and microglia. These modules provide an initial description of the transcriptional programs that distinguish the major cell classes of the human brain and indicate that cell type–specific information can be obtained from whole brain tissue without isolating homogeneous populations of cells. Other modules corresponded to additional cell types, organelles, synaptic function, gender differences and the subventricular neurogenic niche. We found that subventricular zone astrocytes, which are thought to function as neural stem cells in adults, have a distinct gene expression pattern relative to protoplasmic astrocytes. Our findings provide a new foundation for neurogenetic inquiries by revealing a robust and previously unrecognized organization to the human brain transcriptome. PMID:18849986
Thrips developmental stage-specific transcriptome response to tomato spotted wilt virus during the virus infection cycle in Frankliniella occidentalis, the primary vector.

PubMed

Schneweis, Derek J; Whitfield, Anna E; Rotenberg, Dorith

2017-01-01

Tomato spotted wilt virus (TSWV) is transmitted by Frankliniella occidentalis in a circulative-propagative manner. Little is known about thrips vector response to TSWV during the infection process from larval acquisition to adult inoculation of plants. Whole-body transcriptome response to virus infection was determined for first-instar larval, pre-pupal and adult thrips using RNA-Seq. TSWV responsive genes were identified using preliminary sequence of a draft genome of F. occidentalis as a reference and three developmental-stage transcriptomes were assembled. Processes and functions associated with host defense, insect cuticle structure and development, metabolism and transport were perturbed by TSWV infection as inferred by ontologies of responsive genes. The repertoire of genes responsive to TSWV varied between developmental stages, possibly reflecting the link between thrips development and the virus dissemination route in the vector. This study provides the foundation for exploration of tissue-specific expression in response to TSWV and functional analysis of thrips gene function. Copyright © 2016 Elsevier Inc. All rights reserved.
Exploring the genetic basis of adaptation to high elevations in reptiles: a comparative transcriptome analysis of two toad-headed agamas (genus Phrynocephalus).

PubMed

Yang, Weizhao; Qi, Yin; Fu, Jinzhong

2014-01-01

High elevation adaptation offers an excellent study system to understand the genetic basis of adaptive evolution. We acquired transcriptome sequences of two closely related lizards, Phrynocephalus przewalskii from low elevations and P. vlangalii from high elevations. Within a phylogenetic framework, we compared their genomic data along with green anole, chicken and Chinese softshell turtle, and identified candidate genes and functional categories that are potentially linked to adaptation to high elevation environments. More than 100 million sequence reads were generated for each species via Illumina sequencing. A de novo assembly produced 70,919 and 62,118 transcripts for P. przewalskii and P. vlangalii, respectively. Based on a well-established reptile phylogeny, we detected 143 positively selected genes (PSGs) along the P. vlangalii lineage from the 7,012 putative orthologs using a branch-site model. Furthermore, ten GO categories and one KEGG pathway that are over-represented by PSGs were recognized. In addition, 58 GO categories were revealed to have elevated evolutionary rates along the P. vlangalii lineage relative to P. przewalskii. These functional analyses further filter out PSGs that are most likely involved in the adaptation process to high elevations. Among them, ADAM17, MD, and HSP90B1 likely contributed to response to hypoxia, and POLK likely contributed to DNA repair. Many other candidate genes involved in gene expression and metabolism were also identified. Genome-wide scan for candidate genes may serve as the first step to explore the genetic basis of high elevation adaptation. Detailed comparative study and functional verification are needed to solidify any conclusions. 
High elevation adaptation requires coordinated changes in multiple genes that involve various physiological and biochemical pathways; we hope that our genetic studies will provide useful directions for future physiological or molecular studies in reptiles as well as other poikilothermic species.
Exploring the Genetic Basis of Adaptation to High Elevations in Reptiles: A Comparative Transcriptome Analysis of Two Toad-Headed Agamas (Genus Phrynocephalus)

PubMed Central

Yang, Weizhao; Qi, Yin; Fu, Jinzhong

2014-01-01

High elevation adaptation offers an excellent study system to understand the genetic basis of adaptive evolution. We acquired transcriptome sequences of two closely related lizards, Phrynocephalus przewalskii from low elevations and P. vlangalii from high elevations. Within a phylogenetic framework, we compared their genomic data along with green anole, chicken and Chinese softshell turtle, and identified candidate genes and functional categories that are potentially linked to adaptation to high elevation environments. More than 100 million sequence reads were generated for each species via Illumina sequencing. A de novo assembly produced 70,919 and 62,118 transcripts for P. przewalskii and P. vlangalii, respectively. Based on a well-established reptile phylogeny, we detected 143 positively selected genes (PSGs) along the P. vlangalii lineage from the 7,012 putative orthologs using a branch-site model. Furthermore, ten GO categories and one KEGG pathway that are over-represented by PSGs were recognized. In addition, 58 GO categories were revealed to have elevated evolutionary rates along the P. vlangalii lineage relative to P. przewalskii. These functional analyses further filter out PSGs that are most likely involved in the adaptation process to high elevations. Among them, ADAM17, MD, and HSP90B1 likely contributed to response to hypoxia, and POLK likely contributed to DNA repair. Many other candidate genes involved in gene expression and metabolism were also identified. Genome-wide scan for candidate genes may serve as the first step to explore the genetic basis of high elevation adaptation. Detailed comparative study and functional verification are needed to solidify any conclusions. 
High elevation adaptation requires coordinated changes in multiple genes that involve various physiological and biochemical pathways; we hope that our genetic studies will provide useful directions for future physiological or molecular studies in reptiles as well as other poikilothermic species. PMID:25386640
Functional diversity and redundancy across fish gut, sediment and water bacterial communities.

PubMed

Escalas, Arthur; Troussellier, Marc; Yuan, Tong; Bouvier, Thierry; Bouvier, Corinne; Mouchet, Maud A; Flores Hernandez, Domingo; Ramos Miranda, Julia; Zhou, Jizhong; Mouillot, David

2017-08-01

This article explores the functional diversity and redundancy in a bacterial metacommunity constituted of three habitats (sediment, water column and fish gut) in a coastal lagoon under anthropogenic pressure. Comprehensive functional gene arrays covering a wide range of ecological processes and stress resistance genes were used to estimate the functional potential of bacterial communities. Then, diversity partitioning was used to characterize functional diversity and redundancy within (α), between (β) and across (γ) habitats. It was shown that all local communities exhibit a highly diversified potential for the realization of key ecological processes and resistance to various environmental conditions, supporting the growing evidence that macro-organism microbiomes harbour a high functional potential and are integral components of functional gene dynamics in aquatic bacterial metacommunities. Several levels of functional redundancy at different scales of the bacterial metacommunity were observed (within local communities, within habitats and at the metacommunity level). The results suggested a high potential for the realization of spatial ecological insurance within this ecosystem, that is, the functional compensation among microorganisms for the realization and maintenance of key ecological processes, within and across habitats. Finally, the role of macro-organisms as dispersal vectors of microbes and their potential influence on marine metacommunity dynamics were discussed. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.
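The α, β and γ components mentioned in the abstract above can be illustrated with a minimal sketch. It assumes the simplest possible index, gene richness, and Whittaker's multiplicative relation γ = mean α × β. The abstract does not state which diversity index or partitioning scheme the authors used, so the function name, the index and the toy gene labels are all assumptions.

```python
def partition_richness(communities):
    """Partition functional gene richness into alpha, beta and gamma.

    communities: dict mapping a habitat name to the set of functional
    genes detected there (hypothetical input layout).
    """
    # Alpha: mean number of genes detected within a single habitat.
    alpha = sum(len(genes) for genes in communities.values()) / len(communities)
    # Gamma: number of distinct genes detected across all habitats.
    gamma = len(set().union(*communities.values()))
    # Beta: multiplicative turnover between habitats (assumed scheme).
    beta = gamma / alpha
    return alpha, beta, gamma


communities = {
    "sediment": {"geneA", "geneB", "geneC"},
    "water": {"geneB", "geneC", "geneD"},
    "fish_gut": {"geneC", "geneD", "geneE"},
}
alpha, beta, gamma = partition_richness(communities)
```

Under this scheme a β close to 1 means the habitats carry nearly the same functional genes, which is one simple way to read functional redundancy across habitats.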
Functional Regulation of an Autographa californica Nucleopolyhedrovirus-Encoded MicroRNA, AcMNPV-miR-1, in Baculovirus Replication

PubMed Central

Zhu, Mengxiao; Deng, Riqiang

2016-01-01

ABSTRACT An Autographa californica nucleopolyhedrovirus-encoded microRNA (miRNA), AcMNPV-miR-1, downregulates the ac94 gene, reducing the production of infectious budded virions and accelerating the formation of occlusion-derived virions. In the current study, four viruses that constitutively overexpress AcMNPV-miR-1 were constructed to further explore the function of the miRNA. In addition to the ac94 gene, two new viral gene targets (ac18 and ac95) of AcMNPV-miR-1 were identified, and the possible interacting proteins were verified and tested. In the context of AcMNPV-miR-1 overexpression, ac18 was slightly upregulated, and ac95 was downregulated. Several interacting proteins were identified, and a functional pathway for AcMNPV-miR-1 was deduced. AcMNPV-miR-1 overexpression decreased budded virus infectivity, reduced viral DNA replication, accelerated polyhedron formation, and promoted viral infection efficiency in Trichoplusia ni larvae, suggesting that AcMNPV-miR-1 restrains virus infection of cells but facilitates virus infection of larvae. IMPORTANCE Recently, microRNAs (miRNAs) have been widely reported as moderators or regulators of mammalian cellular processes, especially disease-related pathways in humans. However, the roles played by miRNAs encoded by baculoviruses, which infect numerous beneficial insects and agricultural pests, have rarely been described. To explore the actions of virus-encoded miRNAs, we investigated an miRNA encoded by Autographa californica nucleopolyhedrovirus (AcMNPV-miR-1). We previously identified this miRNA through the exogenous addition of AcMNPV-miR-1 mimics. In the current study, we constitutively overexpressed AcMNPV-miR-1 and analyzed the resultant effects to more comprehensively assess what is indeed the function of this miRNA during viral infection. 
In addition, we widely explored the target genes for the miRNA in the viral and host genomes and proposed a possible functional network for AcMNPV-miR-1, which provides a better general understanding of virus-encoded miRNAs. In brief, our study implied that AcMNPV-miR-1 constrains viral replication and cellular infection but enhances larval infection. PMID:27147751
A Network of Genes Antagonistic to the LIN-35 Retinoblastoma Protein of Caenorhabditis elegans

PubMed Central

Polley, Stanley R. G.; Fay, David S.

2012-01-01

The Caenorhabditis elegans pRb ortholog, LIN-35, functions in a wide range of cellular and developmental processes. This includes a role of LIN-35 in nutrient utilization by the intestine, which it carries out redundantly with SLR-2, a zinc-finger protein. This and other redundant functions of LIN-35 were identified in genetic screens for mutations that display synthetic phenotypes in conjunction with loss of lin-35. To explore the intestinal role of LIN-35, we conducted a genome-wide RNA-interference-feeding screen for suppressors of lin-35; slr-2 early larval arrest. Of the 26 suppressors identified, 17 fall into three functional classes: (1) ribosome biogenesis genes, (2) mitochondrial prohibitins, and (3) chromatin regulators. Further characterization indicates that different categories of suppressors act through distinct molecular mechanisms. We also tested lin-35; slr-2 suppressors, as well as suppressors of the synthetic multivulval phenotype, to determine the spectrum of lin-35-synthetic phenotypes that could be suppressed following inhibition of these genes. We identified 19 genes, most of which are evolutionarily conserved, that can suppress multiple unrelated lin-35-synthetic phenotypes. Our study reveals a network of genes broadly antagonistic to LIN-35 as well as genes specific to the role of LIN-35 in intestinal and vulval development. Suppressors of multiple lin-35 phenotypes may be candidate targets for anticancer therapies. Moreover, screening for suppressors of phenotypically distinct synthetic interactions, which share a common altered gene, may prove to be a novel and effective approach for identifying genes whose activities are most directly relevant to the core functions of the shared gene. PMID:22542970
Pleurochrysome: A Web Database of Pleurochrysis Transcripts and Orthologs Among Heterogeneous Algae

PubMed Central

Fujiwara, Shoko; Takatsuka, Yukiko; Hirokawa, Yasutaka; Tsuzuki, Mikio; Takano, Tomoyuki; Kobayashi, Masaaki; Suda, Kunihiro; Asamizu, Erika; Yokoyama, Koji; Shibata, Daisuke; Tabata, Satoshi; Yano, Kentaro

2016-01-01

Pleurochrysis is a coccolithophorid genus, which belongs to the Coccolithales in the Haptophyta. The genus has been used extensively for biological research, together with Emiliania in the Isochrysidales, to understand distinctive features between the two coccolithophorid-including orders. However, molecular biological research on Pleurochrysis such as elucidation of the molecular mechanism behind coccolith formation has not made great progress at least in part because of lack of comprehensive gene information. To provide such information to the research community, we built an open web database, the Pleurochrysome (http://bioinf.mind.meiji.ac.jp/phapt/), which currently stores 9,023 unique gene sequences (designated as UNIGENEs) assembled from expressed sequence tag sequences of P. haptonemofera as core information. The UNIGENEs were annotated with gene sequences sharing significant homology, conserved domains, Gene Ontology, KEGG Orthology, predicted subcellular localization, open reading frames and orthologous relationship with genes of 10 other algal species, a cyanobacterium and the yeast Saccharomyces cerevisiae. This sequence and annotation information can be easily accessed via several search functions. Besides fundamental functions such as BLAST and keyword searches, this database also offers search functions to explore orthologous genes in the 12 organisms and to seek novel genes. The Pleurochrysome will promote molecular biological and phylogenetic research on coccolithophorids and other haptophytes by helping scientists mine data from the primary transcriptome of P. haptonemofera. PMID:26746174
Exploring Fusarium head blight disease control by RNA interference

USDA-ARS's Scientific Manuscript database

RNA interference (RNAi) technology provides a novel tool to study gene function and plant protection strategies. Fusarium graminearum is the causal agent of Fusarium head blight (FHB), which reduces crop yield and quality by producing trichothecene mycotoxins including 3-acetyl deoxynivalenol (3-ADO...
Lessons learned from whole exome sequencing in multiplex families affected by a complex genetic disorder, intracranial aneurysm.

PubMed

Farlow, Janice L; Lin, Hai; Sauerbeck, Laura; Lai, Dongbing; Koller, Daniel L; Pugh, Elizabeth; Hetrick, Kurt; Ling, Hua; Kleinloog, Rachel; van der Vlies, Pieter; Deelen, Patrick; Swertz, Morris A; Verweij, Bon H; Regli, Luca; Rinkel, Gabriel J E; Ruigrok, Ynte M; Doheny, Kimberly; Liu, Yunlong; Broderick, Joseph; Foroud, Tatiana

2015-01-01

Genetic risk factors for intracranial aneurysm (IA) are not yet fully understood. Genomewide association studies have been successful at identifying common variants; however, the role of rare variation in IA susceptibility has not been fully explored. In this study, we report the use of whole exome sequencing (WES) in seven densely-affected families (45 individuals) recruited as part of the Familial Intracranial Aneurysm study. WES variants were prioritized by functional prediction, frequency, predicted pathogenicity, and segregation within families. Using these criteria, 68 variants in 68 genes were prioritized across the seven families. Of the genes that were expressed in IA tissue, one gene (TMEM132B) was differentially expressed in aneurysmal samples (n=44) as compared to control samples (n=16) (false discovery rate adjusted p-value=0.023). We demonstrate that sequencing of densely affected families permits exploration of the role of rare variants in a relatively common disease such as IA, although there are important study design considerations for applying sequencing to complex disorders. In this study, we explore methods of WES variant prioritization, including the incorporation of unaffected individuals, multipoint linkage analysis, biological pathway information, and transcriptome profiling. Further studies are needed to validate and characterize the set of variants and genes identified in this study.
Solexa-Sequencing Based Transcriptome Study of Plaice Skin Phenotype in Rex Rabbits (Oryctolagus cuniculus)

PubMed Central

Pan, Lei; Liu, Yan; Wei, Qiang; Xiao, Chenwen; Ji, Quanan; Bao, Guolian; Wu, Xinsheng

2015-01-01

Background Fur is an important genetically-determined characteristic of domestic rabbits; rabbit furs are of great economic value. We used the Solexa sequencing technology to assess gene expression in skin tissues from full-sib Rex rabbits of different phenotypes in order to explore the molecular mechanisms associated with fur determination. Methodology/Principal Findings Transcriptome analysis included de novo assembly, gene function identification, and gene function classification and enrichment. We obtained 74,032,912 and 71,126,891 short reads of 100 nt, which were assembled into 377,618 unique sequences by Trinity strategy (N50=680 nt). Based on BLAST results with known proteins, 50,228 sequences were identified at a cut-off E-value ≥ 10⁻⁵. Using Blast to Gene Ontology (GO), Clusters of Orthologous Groups (KOG) and Kyoto Encyclopedia of Genes and Genomes (KEGG), we obtained several genes with important protein functions. A total of 308 differentially expressed genes were obtained by transcriptome analysis of plaice and un-plaice phenotype animals; 209 additional differentially expressed genes were not found in any database. These genes included 49 that were only expressed in plaice skin rabbits. The novel genes may play important roles during skin growth and development. In addition, 99 known differentially expressed genes were assigned to PI3K-Akt signaling, focal adhesion, and ECM-receptor interaction, among others. Growth factors play a role in skin growth and development by regulating these signaling pathways. We confirmed the altered expression levels of seven target genes by qRT-PCR. We also chose a key gene for SNP analysis to identify differences between plaice and un-plaice phenotype rabbits. Conclusions/Significance The rabbit transcriptome profiling data provide new insights in understanding the molecular mechanisms underlying rabbit skin growth and development. PMID:25955442
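The abstract above reports an assembly N50 of 680 nt. As a reminder of what that statistic measures, here is a minimal sketch of the usual N50 definition: the length of the shortest sequence among the longest sequences that together cover at least half of the total assembled length. The function name and the toy lengths are illustrative only and are not data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length that pushes the running sum to half is the N50.
        if running >= half_total:
            return length
    raise ValueError("lengths must not be empty")


# Toy example: total length 1500, so the threshold is 750.
example_n50 = n50([100, 200, 300, 400, 500])
```

In the toy example the two longest sequences (500 and 400) already sum to 900, which exceeds 750, so the N50 is 400.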
PTGBase: an integrated database to study tandem duplicated genes in plants.

PubMed

Yu, Jingyin; Ke, Tao; Tehrim, Sadia; Sun, Fengming; Liao, Boshou; Hua, Wei

2015-01-01

Tandem duplication is a wide-spread phenomenon in plant genomes and plays significant roles in evolution and adaptation to changing environments. Tandem duplicated genes related to certain functions will lead to the expansion of gene families and bring increase of gene dosage in the form of gene cluster arrays. Many tandem duplication events have been studied in plant genomes; yet, there is a surprising shortage of efforts to systematically present the integration of large amounts of information about publicly deposited tandem duplicated gene data across the plant kingdom. To address this shortcoming, we developed the first plant tandem duplicated genes database, PTGBase. It delivers the most comprehensive resource available to date, spanning 39 plant genomes, including model species and newly sequenced species alike. Across these genomes, 54 130 tandem duplicated gene clusters (129 652 genes) are presented in the database. Each tandem array, as well as its member genes, is characterized in complete detail. Tandem duplicated genes in PTGBase can be explored through browsing or searching by identifiers or keywords of functional annotation and sequence similarity. Users can download tandem duplicated gene arrays easily to any scale, up to the complete annotation data set for an entire plant genome. PTGBase will be updated regularly with newly sequenced plant species as they become available. © The Author(s) 2015. Published by Oxford University Press.
Exploring Genetic, Genomic, and Phenotypic Data at the Rat Genome Database

PubMed Central

Laulederkind, Stanley J. F.; Hayman, G. Thomas; Wang, Shur-Jen; Lowry, Timothy F.; Nigam, Rajni; Petri, Victoria; Smith, Jennifer R.; Dwinell, Melinda R.; Jacob, Howard J.; Shimoyama, Mary

2013-01-01

The laboratory rat, Rattus norvegicus, is an important model of human health and disease, and experimental findings in the rat have relevance to human physiology and disease. The Rat Genome Database (RGD, http://rgd.mcw.edu) is a model organism database that provides access to a wide variety of curated rat data including disease associations, phenotypes, pathways, molecular functions, biological processes and cellular components for genes, quantitative trait loci, and strains. We present an overview of the database followed by specific examples that can be used to gain experience in employing RGD to explore the wealth of functional data available for the rat. PMID:23255149
Gene editing tools: state-of-the-art and the road ahead for the model and non-model fishes.

PubMed

Barman, Hirak Kumar; Rasal, Kiran Dashrath; Chakrapani, Vemulawada; Ninawe, A S; Vengayil, Doyil T; Asrafuzzaman, Syed; Sundaray, Jitendra K; Jayasankar, Pallipuram

2017-10-01

Advancements in DNA sequencing technologies and computational biology have revolutionized genome/transcriptome sequencing of non-model fishes at an affordable cost. This has led to a paradigm shift with regard to our heightened understanding of structure-function relationships of genes at a global level, from model animals/fishes to non-model large animals/fishes. Whole genome/transcriptome sequencing technologies were supplemented with a series of discoveries in gene editing tools, which are being used to modify genes at pre-determined positions using programmable nucleases to explore their respective in vivo functions. For a long time, targeted gene disruption experiments were mostly restricted to embryonic stem cells; advances in gene editing technologies such as zinc finger nucleases, transcription activator-like effector nucleases and CRISPR (clustered regularly interspaced short palindromic repeats)/CRISPR-associated nucleases have facilitated targeted genetic modifications beyond stem cells to a wide range of somatic cell lines across species from laboratory animals to farmed animals/fishes. In this review, we discuss the use of different gene editing tools and the strategic implications in fish species for basic and applied biology research.
Expression level, cellular compartment and metabolic network position all influence the average selective constraint on mammalian enzymes

PubMed Central

2011-01-01

Background A gene's position in regulatory, protein interaction or metabolic networks can be predictive of the strength of purifying selection acting on it, but these relationships are neither universal nor invariably strong. Following work in bacteria, fungi and invertebrate animals, we explore the relationship between selective constraint and metabolic function in mammals. Results We measure the association between selective constraint, estimated by the ratio of nonsynonymous (Ka) to synonymous (Ks) substitutions, and several, primarily metabolic, measures of gene function. We find significant differences between the selective constraints acting on enzyme-coding genes from different cellular compartments, with the nucleus showing higher constraint than genes from either the cytoplasm or the mitochondria. Among metabolic genes, the centrality of an enzyme in the metabolic network is significantly correlated with Ka/Ks. In contrast to yeasts, gene expression magnitude does not appear to be the primary predictor of selective constraint in these organisms. Conclusions Our results imply that the relationship between selective constraint and enzyme centrality is complex: the strength of selective constraint acting on mammalian genes is quite variable and does not appear to exclusively follow patterns seen in other organisms. PMID:21470417
Characterization of potential driver mutations involved in human breast cancer by computational approaches

PubMed Central

Rajendran, Barani Kumar; Deng, Chu-Xia

2017-01-01

Breast cancer is the second most frequently occurring form of cancer and is also the second most lethal cancer in women worldwide. Genetic mutation is one of the key factors that alter multiple cellular regulatory pathways and drive breast cancer initiation and progression, yet the nature of these cancer drivers remains elusive. In this article, we have reviewed various computational perspectives and algorithms for exploring breast cancer driver mutation genes. Using both frequency based and mutational exclusivity based approaches, we identified 195 driver genes and shortlisted 63 of them as candidate drivers for breast cancer using various computational approaches. Finally, we conducted network and pathway analysis to explore their functions in breast tumorigenesis including tumor initiation, progression, and metastasis. PMID:28477017
PdSlt2 Penicillium digitatum mitogen-activated-protein kinase controls sporulation and virulence during citrus fruit infection.

PubMed

de Ramón-Carbonell, Marta; Sánchez-Torres, Paloma

2017-12-01

The Slt2 mitogen-activated protein (MAP) kinase homologue of Penicillium digitatum, the most relevant pathogen causing citrus green mould decay during postharvest, was identified and explored. The P. digitatum Slt2-MAPK coding gene (PdSlt2) was functionally characterized by homologous gene elimination and transcriptomic evaluation. The absence of the PdSlt2 gene resulted in significantly reduced virulence during citrus infection. The ΔPdSlt2 mutants were also defective in asexual reproduction, showing impairment of sporulation during citrus infection. Gene expression analysis revealed that PdSlt2 was highly induced during citrus fruit infection at early stages (1 dpi). Moreover, PdSlt2 deletion altered gene expression profiles. The relative gene expression (RGE) of fungicide resistance- and fungal virulence-related genes showed that PdSlt2 acts as a negative regulator of several transporter-encoding genes (ABC and MFS transporters) and a positive regulator of two sterol demethylases. This study indicates that PdSlt2 MAPK is functionally preserved in P. digitatum and highlights the relevant role of the PdSlt2 MAP kinase-mediated signalling pathway in regulating diverse genes crucial for infection and asexual reproduction. Copyright © 2017 British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Transcriptomic Profiling Analysis of Arabidopsis thaliana Treated with Exogenous Myo-Inositol

PubMed Central

Ye, Wenxing; Ren, Weibo; Kong, Lingqi; Zhang, Wanjun; Wang, Tao

2016-01-01

Myo-inositol (MI) is a crucial substance in the growth and developmental processes of plants. It is commonly added to the culture medium to promote adventitious shoot development. In our previous work, MI was found to influence Agrobacterium-mediated transformation. In this report, a high-throughput RNA sequencing technique (RNA-Seq) was used to investigate differentially expressed genes in one-month-old Arabidopsis seedlings grown on MI-free or MI-supplemented culture medium. The results showed that 21,288 and 21,299 genes were detected with and without MI treatment, respectively. The detected genes included 184 new genes that were not annotated in the Arabidopsis thaliana reference genome. Additionally, 183 differentially expressed genes (DEGs; FDR ≤ 0.05, log2 FC ≥ 1) were identified, including 93 up-regulated genes and 90 down-regulated genes. The DEGs were involved in multiple pathways, such as cell wall biosynthesis, biotic and abiotic stress response, chromosome modification, and substrate transportation. Some significantly differentially expressed genes provided us with valuable information for exploring the functions of exogenous MI. RNA-Seq results showed that exogenous MI could alter gene expression and signal transduction in plant cells. These results provided a systematic understanding of the functions of exogenous MI in detail and provided a foundation for future studies. PMID:27603208
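The DEG thresholds quoted above (FDR ≤ 0.05, log2 FC ≥ 1) can be applied as in the following minimal sketch. Because the abstract reports both up- and down-regulated genes, the sketch assumes the fold-change cutoff applies to the absolute value of log2 FC; that reading, the `GeneResult` record, and the example values are assumptions made for illustration, not details taken from the paper.

```python
from typing import NamedTuple


class GeneResult(NamedTuple):
    gene_id: str    # hypothetical identifier
    log2_fc: float  # log2 fold change, treated vs. control
    fdr: float      # false discovery rate (adjusted p-value)


def call_degs(results, fdr_max=0.05, min_abs_log2_fc=1.0):
    """Split genes passing both thresholds into up- and down-regulated lists."""
    up, down = [], []
    for r in results:
        # Assumption: the fold-change cutoff is applied to |log2 FC|.
        if r.fdr <= fdr_max and abs(r.log2_fc) >= min_abs_log2_fc:
            (up if r.log2_fc > 0 else down).append(r.gene_id)
    return up, down


# Made-up results for four genes.
results = [
    GeneResult("g1", 2.3, 0.001),   # passes, up-regulated
    GeneResult("g2", -1.4, 0.03),   # passes, down-regulated
    GeneResult("g3", 0.6, 0.0001),  # fold change too small
    GeneResult("g4", 3.0, 0.20),    # FDR too high
]
up, down = call_degs(results)
print("up:", up, "down:", down)
```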
Genome-Wide Evolutionary Characterization and Expression Analyses of WRKY Family Genes in Brachypodium distachyon

PubMed Central

Wen, Feng; Zhu, Hong; Li, Peng; Jiang, Min; Mao, Wenqing; Ong, Chermaine; Chu, Zhaoqing

2014-01-01

Members of the plant WRKY gene family are ancient transcription factors that function in plant growth and development and respond to biotic and abiotic stresses. In our present study, we have investigated WRKY family genes in Brachypodium distachyon, a new model plant of the family Poaceae. We identified a total of 86 WRKY genes from B. distachyon and explored their chromosomal distribution and evolution, domain alignment, promoter cis-elements, and expression profiles. Combining the phylogenetic analysis of BdWRKY genes with the results of expression profiling showed that most clustered gene pairs had higher similarities in the WRKY domain, suggesting that they might be functionally redundant. Neighbour-joining analysis of 301 WRKY domains from Oryza sativa, Arabidopsis thaliana, and B. distachyon suggested that BdWRKY domains are evolutionarily more closely related to O. sativa WRKY domains than to those of A. thaliana. Moreover, tissue-specific expression profiles of BdWRKY genes and their responses to phytohormones and several biotic or abiotic stresses were analysed by quantitative real-time PCR. The results showed that the expression of BdWRKY genes was rapidly regulated by stresses and phytohormones, and there was a strong correlation between promoter cis-elements and the phytohormone-induced BdWRKY gene expression. PMID:24453041
Identification and Functional Analysis of Healing Regulators in Drosophila

PubMed Central

Álvarez-Fernández, Carmen; Tamirisa, Srividya; Prada, Federico; Chernomoretz, Ariel; Podhajcer, Osvaldo; Blanco, Enrique; Martín-Blanco, Enrique

2015-01-01

Wound healing is an essential homeostatic mechanism that maintains the epithelial barrier integrity after tissue damage. Although we know the overall steps in wound healing, many of the underlying molecular mechanisms remain unclear. Genetically amenable systems, such as wound healing in Drosophila imaginal discs, do not model all aspects of the repair process. However, they do allow the less understood aspects of the healing response to be explored, e.g., which signal(s) are responsible for initiating tissue remodeling? How is sealing of the epithelia achieved? Or, what inhibitory cues cancel the healing machinery upon completion? Answering these and other questions first requires the identification and functional analysis of wound-specific genes. A variety of different microarray analyses in mice and humans have identified characteristic profiles of gene expression at the wound site; however, very few functional studies in healing regulation have been carried out. We developed an experimentally controlled method that is healing-permissive and that allows live imaging and biochemical analysis of cultured imaginal discs. We performed comparative genome-wide profiling between Drosophila imaginal cells actively involved in healing versus their non-engaged siblings. Sets of potential wound-specific genes were subsequently identified. Importantly, besides identifying and categorizing new genes, we functionally tested many of their gene products by genetic interference and overexpression in healing assays. This non-saturated analysis defines a relevant set of genes whose changes in expression level are functionally significant for proper tissue repair. Amongst these we identified the TCP1 chaperonin complex as a key regulator of the actin cytoskeleton essential for the wound healing response. There is promise that our newly identified wound-healing genes will guide future work in the more complex mammalian wound healing response. PMID:25647511

The Comparative Toxicogenomics Database (CTD): A Resource for Comparative Toxicological Studies

PubMed Central

Mattingly, CJ; Rosenstein, MC; Colby, GT; Forrest, JN; Boyer, JL

2006-01-01

The etiology of most chronic diseases involves interactions between environmental factors and genes that modulate important biological processes (Olden and Wilson, 2000). We are developing the publicly available Comparative Toxicogenomics Database (CTD) to promote understanding about the effects of environmental chemicals on human health. CTD identifies interactions between chemicals and genes and facilitates cross-species comparative studies of these genes. The use of diverse animal models and cross-species comparative sequence studies has been critical for understanding basic physiological mechanisms and gene and protein functions. Similarly, these approaches will be valuable for exploring the molecular mechanisms of action of environmental chemicals and the genetic basis of differential susceptibility. PMID:16902965
Analysis of the APETALA3- and PISTILLATA-like genes in Hedyosmum orientale (Chloranthaceae) provides insight into the evolution of the floral homeotic B-function in angiosperms

PubMed Central

Liu, Shujun; Sun, Yonghua; Du, Xiaoqiu; Xu, Qijiang; Wu, Feng; Meng, Zheng

2013-01-01

Background and Aims According to the floral ABC model, B-function genes appear to play a key role in the origin and diversification of the perianth during the evolution of angiosperms. The basal angiosperm Hedyosmum orientale (Chloranthaceae) has unisexual inflorescences associated with a seemingly primitive reproductive morphology and a reduced perianth structure in female flowers. The aim of this study was to investigate the nature of the perianth and the evolutionary state of the B-function programme in this species. Methods A series of experiments were conducted to characterize B-gene homologues isolated from H. orientale, including scanning electron microscopy to observe the development of floral organs, phylogenetic analysis to reconstruct gene evolutionary history, reverse transcription–PCR, quantitative real-time PCR and in situ hybridization to identify gene expression patterns, the yeast two-hybrid assay to explore protein dimerization affinities, and transgenic analyses in Arabidopsis thaliana to determine activities of the encoded proteins. Key Results The expression of HoAP3 genes was restricted to stamens, whereas HoPI genes were broadly expressed in all floral organs. HoAP3 was able to partially restore the stamen but not petal identity in Arabidopsis ap3-3 mutants. In contrast, HoPI could rescue aspects of both stamen and petal development in Arabidopsis pi-1 mutants. When the complete C-terminal sequence of HoPI was deleted, however, no or weak transgenic phenotypes were observed and homodimerization capability was completely abolished. Conclusions The results suggest that Hedyosmum AP3-like genes have an ancestral function in specifying male reproductive organs, and that the activity of the encoded PI-like proteins is highly conserved between Hedyosmum and Arabidopsis. Moreover, there is evidence that the C-terminal region is important for the function of HoPI. 
Our findings indicate that the development of the proposed perianth in Hedyosmum does not rely on the B homeotic function. PMID:23956161
Analyzing the genes related to Alzheimer's disease via a network and pathway-based approach.

PubMed

Hu, Yan-Shi; Xin, Juncai; Hu, Ying; Zhang, Lei; Wang, Ju

2017-04-27

Our understanding of the molecular mechanisms underlying Alzheimer's disease (AD) remains incomplete. Previous studies have revealed that genetic factors provide a significant contribution to the pathogenesis and development of AD. In recent years, numerous genes implicated in this disease have been identified via genetic association studies on candidate genes or at the genome-wide level. However, in many cases, the roles of these genes and their interactions in AD are still unclear. A comprehensive and systematic analysis focusing on the biological function and interactions of these genes in the context of AD will therefore provide valuable insights to understand the molecular features of the disease. In this study, we collected genes potentially associated with AD by screening publications on genetic association studies deposited in PubMed. The major biological themes linked with these genes were then revealed by function and biochemical pathway enrichment analysis, and the relation between the pathways was explored by pathway crosstalk analysis. Furthermore, the network features of these AD-related genes were analyzed in the context of the human interactome, and an AD-specific network was inferred using the Steiner minimal tree algorithm. We compiled 430 human genes reported to be associated with AD from 823 publications. Biological theme analysis indicated that the biological processes and biochemical pathways related to neurodevelopment, metabolism, cell growth and/or survival, and immunology were enriched in these genes. Pathway crosstalk analysis then revealed that the significantly enriched pathways could be grouped into three interlinked modules (a neuronal and metabolic module, a cell growth/survival and neuroendocrine pathway module, and an immune response-related module), indicating an AD-specific immune-endocrine-neuronal regulatory network. Furthermore, an AD-specific protein network was inferred and novel genes potentially associated with AD were identified. 
By means of network and pathway-based methodology, we explored the pathogenetic mechanism underlying AD at a systems biology level. Results from our work could provide valuable clues for understanding the molecular mechanism underlying AD. In addition, the framework proposed in this study could be used to investigate the pathological molecular network and genes relevant to other complex diseases or phenotypes.
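The pathway enrichment step mentioned in this abstract can be sketched with a one-sided hypergeometric test, which is a common choice for over-representation analysis. The abstract does not say which statistic the authors used, so the test itself, the function name, and all numbers other than the 430-gene list size are assumptions made for illustration.

```python
from math import comb


def hypergeom_enrichment_p(population, pathway_size, sample_size, overlap):
    """P(X >= overlap) when drawing sample_size genes without replacement
    from a population in which pathway_size genes belong to the pathway."""
    upper = min(pathway_size, sample_size)
    total = comb(population, sample_size)
    tail = sum(
        comb(pathway_size, k) * comb(population - pathway_size, sample_size - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total


# Hypothetical setting: 20,000 background genes, a 150-gene pathway,
# a 430-gene disease list, 12 of which fall in the pathway.
p_value = hypergeom_enrichment_p(20000, 150, 430, 12)
print(f"enrichment p-value: {p_value:.3g}")
```

In practice such p-values would be computed for many pathways and corrected for multiple testing before any pathway is called enriched.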
A functional genomics screen in planarians reveals regulators of whole-brain regeneration.

PubMed

Roberts-Galbraith, Rachel H; Brubacher, John L; Newmark, Phillip A

2016-09-09

Planarians regenerate all body parts after injury, including the central nervous system (CNS). We capitalized on this distinctive trait and completed a gene expression-guided functional screen to identify factors that regulate diverse aspects of neural regeneration in Schmidtea mediterranea. Our screen revealed molecules that influence neural cell fates, support the formation of a major connective hub, and promote reestablishment of chemosensory behavior. We also identified genes that encode signaling molecules with roles in head regeneration, including some that are produced in a previously uncharacterized parenchymal population of cells. Finally, we explored genes downregulated during planarian regeneration and characterized, for the first time, glial cells in the planarian CNS that respond to injury by repressing several transcripts. Collectively, our studies revealed diverse molecules and cell types that underlie an animal's ability to regenerate its brain.
Loss-of-function mutations and inducible RNAi suppression of Arabidopsis LCB2 genes reveal the critical role of sphingolipids in gametophytic and sporophytic cell viability.

PubMed

Dietrich, Charles R; Han, Gongshe; Chen, Ming; Berg, R Howard; Dunn, Teresa M; Cahoon, Edgar B

2008-04-01

Serine palmitoyltransferase (SPT) catalyzes the first step in sphingolipid biosynthesis, and downregulation of this enzyme provides a means for exploring sphingolipid function in cells. We have previously demonstrated that Arabidopsis SPT requires LCB1 and LCB2 subunits for activity, as is the case in other eukaryotes. In this study, we show that Arabidopsis has two genes (AtLCB2a and AtLCB2b) that encode functional isoforms of the LCB2 subunit. No alterations in sphingolipid content or growth were observed in T-DNA mutants for either gene, but homozygous double mutants were not recoverable, suggesting that these genes are functionally redundant. Reciprocal crosses conducted with Atlcb2a and Atlcb2b mutants indicated that lethality is associated primarily with the inability to transmit the lcb2 null genotype through the haploid pollen. Consistent with this, approximately 50% of the pollen obtained from plants homozygous for a mutation in one gene and heterozygous for a mutation in the second gene arrested during transition from uni-nucleate microspore to bicellular pollen. Ultrastructural analyses revealed that these pollen grains contained aberrant endomembranes and lacked an intine layer. To examine sphingolipid function in sporophytic cells, Arabidopsis lines were generated that allowed inducible RNAi silencing of AtLCB2b in an Atlcb2a mutant background. Studies conducted with these lines demonstrated that sphingolipids are essential throughout plant development, and that lethality resulting from LCB2 silencing in seedlings could be partially rescued by supplying exogenous long-chain bases. Overall, these studies provide insights into the genetic and biochemical properties of SPT and sphingolipid function in Arabidopsis.
Robustness, Evolvability, and the Logic of Genetic Regulation

PubMed Central

Moore, Jason H.; Wagner, Andreas

2014-01-01

In gene regulatory circuits, the expression of individual genes is commonly modulated by a set of regulating gene products, which bind to a gene’s cis-regulatory region. This region encodes an input-output function, referred to as signal-integration logic, that maps a specific combination of regulatory signals (inputs) to a particular expression state (output) of a gene. The space of all possible signal-integration functions is vast and the mapping from input to output is many-to-one: for the same set of inputs, many functions (genotypes) yield the same expression output (phenotype). Here, we exhaustively enumerate the set of signal-integration functions that yield identical gene expression patterns within a computational model of gene regulatory circuits. Our goal is to characterize the relationship between robustness and evolvability in the signal-integration space of regulatory circuits, and to understand how these properties vary between the genotypic and phenotypic scales. Among other results, we find that the distributions of genotypic robustness are skewed, such that the majority of signal-integration functions are robust to perturbation. We show that the connected set of genotypes that make up a given phenotype are constrained to specific regions of the space of all possible signal-integration functions, but that as the distance between genotypes increases, so does their capacity for unique innovations. In addition, we find that robust phenotypes are (i) evolvable, (ii) easily identified by random mutation, and (iii) mutationally biased toward other robust phenotypes. We explore the implications of these latter observations for mutation-based evolution by conducting random walks between randomly chosen source and target phenotypes. We demonstrate that the time required to identify the target phenotype is independent of the properties of the source phenotype. PMID:23373974
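The many-to-one mapping described above can be made concrete with a toy sketch. It treats a single gene with k Boolean inputs, represents each signal-integration function as a truth table, and defines robustness as the fraction of single-entry truth-table mutations that leave the output unchanged for one fixed input combination. This is a deliberately simplified illustration and not the authors' circuit model; the function names and the choice of k and input state are assumptions.

```python
from itertools import product


def all_functions(k):
    """Every Boolean function of k inputs, as a truth table of length 2**k."""
    return product((0, 1), repeat=2 ** k)


def output(table, inputs):
    """Expression state (0 or 1) for one combination of input signals."""
    index = int("".join(map(str, inputs)), 2)
    return table[index]


def robustness(table, inputs):
    """Fraction of single-entry mutations that preserve the output."""
    original = output(table, inputs)
    unchanged = 0
    for i in range(len(table)):
        mutant = table[:i] + (1 - table[i],) + table[i + 1:]
        if output(mutant, inputs) == original:
            unchanged += 1
    return unchanged / len(table)


k = 2
inputs = (1, 0)  # one fixed combination of regulatory signals
tables = list(all_functions(k))
on = [t for t in tables if output(t, inputs) == 1]
print(len(tables), "functions;", len(on), "give output 1 for inputs", inputs)
print("robustness of one such function:", robustness(on[0], inputs))
```

Even in this small case, half of the 16 possible functions produce the same output for the chosen inputs, which is the sense in which many genotypes map to one phenotype.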
Identification of Functional Differences in Metabolic Networks Using Comparative Genomics and Constraint-Based Models

PubMed Central

Hamilton, Joshua J.; Reed, Jennifer L.

2012-01-01

Genome-scale network reconstructions are useful tools for understanding cellular metabolism, and comparisons of such reconstructions can provide insight into metabolic differences between organisms. Recent efforts toward comparing genome-scale models have focused primarily on aligning metabolic networks at the reaction level and then looking at differences and similarities in reaction and gene content. However, these reaction comparison approaches are time-consuming and do not identify the effect network differences have on the functional states of the network. We have developed a bilevel mixed-integer programming approach, CONGA, to identify functional differences between metabolic networks by comparing network reconstructions aligned at the gene level. We first identify orthologous genes across two reconstructions and then use CONGA to identify conditions under which differences in gene content give rise to differences in metabolic capabilities. By seeking genes whose deletion in one or both models disproportionately changes flux through a selected reaction (e.g., growth or by-product secretion) in one model over another, we are able to identify structural metabolic network differences enabling unique metabolic capabilities. Using CONGA, we explore functional differences between two metabolic reconstructions of Escherichia coli and identify a set of reactions responsible for chemical production differences between the two models. We also use this approach to aid in the development of a genome-scale model of Synechococcus sp. PCC 7002. Finally, we propose potential antimicrobial targets in Mycobacterium tuberculosis and Staphylococcus aureus based on differences in their metabolic capabilities. Through these examples, we demonstrate that a gene-centric approach to comparing metabolic networks allows for a rapid comparison of metabolic models at a functional level. 
Using CONGA, we can identify differences in reaction and gene content which give rise to different functional predictions. Because CONGA provides a general framework, it can be applied to find functional differences across models and biological systems beyond those presented here. PMID:22666308
A Functional and Regulatory Network Associated with PIP Expression in Human Breast Cancer

PubMed Central

Debily, Marie-Anne; Marhomy, Sandrine El; Boulanger, Virginie; Eveno, Eric; Mariage-Samson, Régine; Camarca, Alessandra; Auffray, Charles; Piatier-Tonneau, Dominique; Imbeaud, Sandrine

2009-01-01

Background The PIP (prolactin-inducible protein) gene has been shown to be expressed in breast cancers, with contradictory results concerning its implication. As both the physiological role and the molecular pathways in which PIP is involved are poorly understood, we conducted combined gene expression profiling and network analysis studies on selected breast cancer cell lines presenting distinct PIP expression levels and hormonal receptor status, to explore the functional and regulatory network of PIP co-modulated genes. Principal Findings Microarray analysis allowed identification of genes co-modulated with PIP independently of modulations resulting from hormonal treatment or cell line heterogeneity. Relevant clusters of genes that can discriminate between [PIP+] and [PIP−] cells were identified. Functional and regulatory network analyses based on a knowledge database revealed a master network of PIP co-modulated genes, including many interconnecting oncogenes and tumor suppressor genes, half of which were detected as differentially expressed through high-precision measurements. The network identified appears associated with an inhibition of proliferation coupled with an increase of apoptosis and an enhancement of cell adhesion in breast cancer cell lines, and contains many genes with a STAT5 regulatory motif in their promoters. Conclusions Our global exploratory approach identified biological pathways modulated along with PIP expression, providing further support for its good prognostic value of disease-free survival in breast cancer. Moreover, our data pointed to the importance of a regulatory subnetwork associated with PIP expression in which STAT5 appears as a potential transcriptional regulator. PMID:19262752
Comparative symbiotic plasmid analysis indicates that symbiosis gene ancestor type affects plasmid genetic evolution.

PubMed

Wang, X; Zhao, L; Zhang, L; Wu, Y; Chou, M; Wei, G

2018-07-01

Rhizobial symbiotic plasmids play vital roles in mutualistic symbiosis with legume plants by executing the functions of nodulation and nitrogen fixation. To explore the gene composition and genetic constitution of rhizobial symbiotic plasmids, comparative analyses of 24 rhizobial symbiotic plasmids derived from four rhizobial genera were carried out. Results illustrated that rhizobial symbiotic plasmids had a higher proportion of functional genes participating in amino acid transport and metabolism; replication, recombination and repair; carbohydrate transport and metabolism; energy production and conversion; and transcription. The Mesorhizobium amorphae CCNWGS0123 symbiotic plasmid pM0123d had a gene composition similar to that of pR899b and pSNGR234a. All symbiotic plasmids shared 13 orthologous genes, including five nod and eight nif/fix genes which participate in the rhizobia-legume symbiosis process. These plasmids contained nod genes from four ancestors and fix genes from six ancestors. The ancestral type of pM0123d nod genes was similar to that of Rhizobium etli plasmids, while the ancestral type of pM0123d fix genes was the same as that of pM7653Rb. The phylogenetic trees constructed based on nodCIJ and fixABC displayed different topological structures, mainly due to nodCIJ and fixABC ancestral type discordance. The study presents valuable insights into mosaic structures and the evolution of rhizobial symbiotic plasmids. This study compared 24 rhizobial symbiotic plasmids that included four genera and 11 species, illuminating the functional gene composition and symbiosis gene ancestor types of symbiotic plasmids from higher taxonomy. It provides valuable insights into mosaic structures and the evolution of symbiotic plasmids. © 2018 The Society for Applied Microbiology.
Expression of M6 and M7 lysin in Mytilus edulis is not restricted to sperm, but occurs also in oocytes and somatic tissue of males and females.

PubMed

Heß, Anne-Katrin; Bartel, Manuela; Roth, Karina; Messerschmidt, Katrin; Heilmann, Katja; Kenchington, Ellen; Micheel, Burkhard; Stuckas, Heiko

2012-08-01

Sperm proteins of marine sessile invertebrates have been extensively studied to understand the molecular basis of reproductive isolation. Apart from molecules such as bindin of sea urchins or lysin of abalone species, the acrosomal protein M7 lysin of Mytilus edulis has been analyzed. M7 lysin was found to be under positive selection, but mechanisms driving the evolution of this protein are not fully understood. To explore functional aspects, this study investigated the protein expression pattern of M7 and M6 lysin in gametes and somatic tissue of male and female M. edulis. The study employs a previously published monoclonal antibody (G26-AG8) to investigate M6 and M7 lysin protein expression, and explores expression of both genes. It is shown that these proteins and their encoding genes are expressed in gametes and somatic tissue of both sexes. This is in contrast to sea urchin bindin and abalone lysin, in which gene expression is strictly limited to males. Although future studies need to clarify the functional importance of both acrosomal proteins in male and female somatic tissue, new insights into the evolution of sperm proteins in marine sessile invertebrates are possible. This is because proteins with male-specific expression (bindin, lysin) might evolve differently than proteins with expression in both sexes (M6/M7 lysin), and the putative function of both proteins in females opens the possibility that the evolution of M6/M7 lysin is under sexually antagonistic selection, for example, through mutations beneficial to the acrosomal function that are less beneficial to the function in somatic tissue of females. Copyright © 2012 Wiley Periodicals, Inc.
Chamber Specific Gene Expression Landscape of the Zebrafish Heart

PubMed Central

Singh, Angom Ramcharan; Sivadas, Ambily; Sabharwal, Ankit; Vellarikal, Shamsudheen Karuthedath; Jayarajan, Rijith; Verma, Ankit; Kapoor, Shruti; Joshi, Adita; Scaria, Vinod; Sivasubbu, Sridhar

2016-01-01

The organization of structure and function of cardiac chambers in vertebrates is defined by chamber-specific distinct gene expression. This peculiarity and uniqueness of the genetic signatures demonstrates functional resolution attributed to the different chambers of the heart. Altered expression of the cardiac chamber genes can lead to individual chamber-related dysfunctions and disease pathophysiologies. Information on the transcriptional repertoire of cardiac compartments is important to understand the spectrum of chamber-specific anomalies. We have carried out a genome-wide transcriptome profiling study of the three cardiac chambers in the zebrafish heart using RNA sequencing. We have captured the gene expression patterns of 13,396 protein-coding genes in the three cardiac chambers: atrium, ventricle and bulbus arteriosus. Of these, 7,260 known protein-coding genes are highly expressed (≥10 FPKM) in the zebrafish heart. Thus, this study provides nearly all-inclusive information on the zebrafish cardiac transcriptome. In this study, a total of 96 differentially expressed genes across the three cardiac chambers in zebrafish were identified. The atrium, ventricle and bulbus arteriosus displayed 20, 32 and 44 uniquely expressed genes, respectively. We validated the expression of predicted chamber-restricted genes using independent semi-quantitative and qualitative experimental techniques. In addition, we identified 23 putative novel protein-coding genes that are specifically restricted to the ventricle and not expressed in the atrium or bulbus arteriosus. To our knowledge, these 23 novel genes have either not been investigated in detail or are sparsely studied. The transcriptome identified in this study includes 68 differentially expressed zebrafish cardiac chamber genes that have a human ortholog. 
We also carried out spatiotemporal gene expression profiling of the 96 differentially expressed genes throughout the three cardiac chambers in 11 developmental stages and 6 tissue types of zebrafish. We hypothesize that clustering the differentially expressed genes with both known and unknown functions will deliver detailed insights into fundamental gene networks that are important for the development and specification of the cardiac chambers. It is also postulated that this transcriptome atlas will help make better use of zebrafish as a model for studying cardiac development and for exploring the functional role of gene networks in cardiac disease pathogenesis. PMID:26815362
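The "highly expressed (≥10 FPKM)" cutoff in this abstract relies on the standard FPKM normalization: fragments per kilobase of transcript per million mapped fragments. A minimal sketch of the calculation and the threshold filter follows; the gene names, counts, and lengths are invented, and only the 10 FPKM cutoff comes from the abstract.

```python
def fpkm(fragment_count, gene_length_bp, total_mapped_fragments):
    """Fragments per kilobase of transcript per million mapped fragments."""
    if gene_length_bp <= 0 or total_mapped_fragments <= 0:
        raise ValueError("length and library size must be positive")
    return fragment_count * 1e9 / (gene_length_bp * total_mapped_fragments)


# Hypothetical library of 20 million mapped fragments.
library_size = 20_000_000
# gene -> (fragment count, transcript length in bp); made-up values
counts = {"geneA": (500, 2000), "geneB": (100, 1000)}

values = {g: fpkm(n, length, library_size) for g, (n, length) in counts.items()}
highly_expressed = [g for g, v in values.items() if v >= 10]
print(values, highly_expressed)
```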
Comprehensive Analysis of Interaction Networks of Telomerase Reverse Transcriptase with Multiple Bioinformatic Approaches: Deep Mining the Potential Functions of Telomere and Telomerase.

PubMed

Hou, Chunyu; Wang, Fei; Liu, Xuewen; Chang, Guangming; Wang, Feng; Geng, Xin

2017-08-01

Telomerase reverse transcriptase (TERT) is the protein component of the telomerase complex. Evidence has accumulated showing that the nontelomeric functions of TERT are independent of telomere elongation. However, the mechanisms governing the interaction between TERT and its target genes are not clearly revealed. The biological functions of TERT are not fully elucidated and have thus far been underestimated. To further explore these functions, we investigated TERT interaction networks using multiple bioinformatic databases, including BioGRID, STRING, DAVID, GeneCards, GeneMANIA, PANTHER, miRWalk, mirTarBase, miRNet, miRDB, and TargetScan. In addition, network diagrams were built using Cytoscape software. As competing endogenous RNAs (ceRNAs) are endogenous transcripts that compete for the binding of microRNAs (miRNAs) by using shared miRNA recognition elements, they are involved in creating widespread regulatory networks. Therefore, the ceRNA regulatory networks of TERT were also investigated in this study. Interestingly, we found that the three genes PABPC1, SLC7A11, and TP53 were present in both the TERT interaction networks and the ceRNA target genes. It was predicted that TERT might play nontelomeric roles in the generation or development of some rare diseases, such as Rift Valley fever and dyscalculia. Thus, our data will help to decipher the interaction networks of TERT and reveal the unknown functions of telomerase in cancer and aging-related diseases.
De novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries.

PubMed

Cho, Namjin; Hwang, Byungjin; Yoon, Jung-ki; Park, Sangun; Lee, Joongoo; Seo, Han Na; Lee, Jeewon; Huh, Sunghoon; Chung, Jinsoo; Bang, Duhee

2015-09-21

Interpreting epistatic interactions is crucial for understanding evolutionary dynamics of complex genetic systems and unveiling structure and function of genetic pathways. Although high resolution mapping of en masse variant libraries enables molecular biologists to address genotype-phenotype relationships, long-read sequencing technology remains indispensable to assess functional relationships between mutations that lie far apart. Here, we introduce JigsawSeq for multiplexed sequence identification of pooled gene variant libraries by combining a codon-based molecular barcoding strategy and de novo assembly of short-read data. We first validate JigsawSeq on small sub-pools and observe high precision and recall at various experimental settings. With extensive simulations, we then apply JigsawSeq to large-scale gene variant libraries to show that our method can be reliably scaled using next-generation sequencing. JigsawSeq may serve as a rapid screening tool for functional genomics and offer the opportunity to explore evolutionary trajectories of protein variants.
Cross-Study Comparison Reveals Common Genomic, Network, and Functional Signatures of Desiccation Resistance in Drosophila melanogaster

PubMed Central

Telonis-Scott, Marina; Sgrò, Carla M.; Hoffmann, Ary A.; Griffin, Philippa C.

2016-01-01

Repeated attempts to map the genomic basis of complex traits often yield different outcomes because of the influence of genetic background, gene-by-environment interactions, and/or statistical limitations. However, where repeatability is low at the level of individual genes, overlap often occurs in gene ontology categories, genetic pathways, and interaction networks. Here we report on the genomic overlap for natural desiccation resistance from a Pool-genome-wide association study experiment and a selection experiment in flies collected from the same region in southeastern Australia in different years. We identified over 600 single nucleotide polymorphisms associated with desiccation resistance in flies derived from almost 1,000 wild-caught genotypes, a similar number of loci to that observed in our previous genomic study of selected lines, demonstrating the genetic complexity of this ecologically important trait. By harnessing the power of cross-study comparison, we narrowed the candidates from almost 400 genes in each study to a core set of 45 genes, enriched for stimulus, stress, and defense responses. In addition to gene-level overlap, there was higher order congruence at the network and functional levels, suggesting genetic redundancy in key stress sensing, stress response, immunity, signaling, and gene expression pathways. We also identified variants linked to different molecular aspects of desiccation physiology previously verified from functional experiments. Our approach provides insight into the genomic basis of a complex and ecologically important trait and predicts candidate genetic pathways to explore in multiple genetic backgrounds and related species within a functional framework. PMID:26733490
Genome-Wide Analysis of NBS-LRR Genes in Sorghum Genome Revealed Several Events Contributing to NBS-LRR Gene Evolution in Grass Species

PubMed Central

Yang, Xiping; Wang, Jianping

2016-01-01

The nucleotide-binding site (NBS)–leucine-rich repeat (LRR) gene family is crucially important for offering resistance to pathogens. To explore evolutionary conservation and variability of NBS-LRR genes across grass species, we identified 88, 107, 24, and 44 full-length NBS-LRR genes in sorghum, rice, maize, and Brachypodium, respectively. A comprehensive analysis was performed on classification, genome organization, evolution, expression, and regulation of these NBS-LRR genes using sorghum as a representative of grass species. In general, the full-length NBS-LRR genes are highly clustered and duplicated in the sorghum genome, mainly due to local duplications. NBS-LRR genes have basal expression levels and are very likely targeted by miRNA. The number of NBS-LRR genes in the four grass species is positively correlated with the gene clustering rate. The results provided a valuable genomic resource and insights for functional and evolutionary studies of NBS-LRR genes in grass species. PMID:26792976
Negative selection in tumor genome evolution acts on essential cellular functions and the immunopeptidome.

PubMed

Zapata, Luis; Pich, Oriol; Serrano, Luis; Kondrashov, Fyodor A; Ossowski, Stephan; Schaefer, Martin H

2018-05-31

Natural selection shapes cancer genomes. Previous studies used signatures of positive selection to identify genes driving malignant transformation. However, the contribution of negative selection against somatic mutations that affect essential tumor functions or specific domains remains a controversial topic. Here, we analyze 7546 individual exomes from 26 tumor types from TCGA data to explore the portion of the cancer exome under negative selection. Although we find most of the genes neutrally evolving in a pan-cancer framework, we identify essential cancer genes and immune-exposed protein regions under significant negative selection. Moreover, our simulations suggest that the amount of negative selection is underestimated. We therefore choose an empirical approach to identify genes, functions, and protein regions under negative selection. We find that expression and mutation status of negatively selected genes is indicative of patient survival. Processes that are most strongly conserved are those that play fundamental cellular roles such as protein synthesis, glucose metabolism, and molecular transport. Intriguingly, we observe strong signals of selection in the immunopeptidome and proteins controlling peptide exposition, highlighting the importance of immune surveillance evasion. Additionally, tumor type-specific immune activity correlates with the strength of negative selection on human epitopes. In summary, our results show that negative selection is a hallmark of cell essentiality and immune response in cancer. The functional domains identified could be exploited therapeutically, ultimately allowing for the development of novel cancer treatments.
Analysis of functional polymorphisms in three synaptic plasticity-related genes (BDNF, COMT AND UCHL1) in Alzheimer's disease in Colombia.

PubMed

Forero, Diego A; Benítez, Bruno; Arboleda, Gonzalo; Yunis, Juan J; Pardo, Rodrigo; Arboleda, Humberto

2006-07-01

In recent years, it has been proposed that synaptic dysfunction may be an important etiological factor for Alzheimer's disease (AD). This hypothesis has important implications for the analysis of AD genetic risk in case-control studies. In the present work, we analyzed common functional polymorphisms in three synaptic plasticity-related genes (brain-derived neurotrophic factor, BDNF Val66Met; catechol-O-methyl transferase, COMT Val158Met; ubiquitin carboxyl-terminal hydrolase, UCHL1 S18Y) in a sample of 102 AD cases and 168 age- and sex-matched controls living in Bogotá, Colombia. There was no association between the UCHL1 polymorphism and AD in our sample. We found an initial association with the BDNF polymorphism in familial cases and with the COMT polymorphism in male and sporadic patients. These initial associations were lost after Bonferroni correction for multiple testing. Unadjusted results may be compatible with the expected functional effect of variations in these genes on pathological memory and cognitive dysfunction, as has been implicated in animal and cell models and also from neuropsychological analysis of normal subjects who carry the AD-associated genotypes. An exploration of functional variants in these and in other synaptic plasticity-related genes (a synaptogenomics approach) in independent larger samples will be important to discover new genes associated with AD.
An Exploration of the Serotonin System in Antisocial Boys with High Levels of Callous-Unemotional Traits

PubMed Central

Moul, Caroline; Dobson-Stone, Carol; Brennan, John; Hawes, David; Dadds, Mark

2013-01-01

Background: The serotonin system is thought to play a role in the aetiology of antisocial and aggressive behaviour in both adults and children; however, previous findings have been inconsistent. Recently, research has suggested that the function of the serotonin system may be specifically altered in a sub-set of antisocial populations – those with psychopathic (callous-unemotional) personality traits. We explored the relationships between callous-unemotional traits and functional polymorphisms of selected serotonin-system genes, and tested the association between callous-unemotional traits and serum serotonin levels independently of antisocial and aggressive behaviour. Method: Participants were boys with antisocial behaviour problems aged 3–16 years referred to University of New South Wales Child Behaviour Research Clinics. Participants volunteered either a blood or saliva sample from which levels of serum serotonin (N = 66) and/or serotonin-system single nucleotide polymorphisms (N = 157) were assayed. Results: Functional single nucleotide polymorphisms from the serotonin 1b receptor gene (HTR1B) and 2a receptor gene (HTR2A) were found to be associated with callous-unemotional traits. Serum serotonin level was a significant predictor of callous-unemotional traits; levels were significantly lower in boys with high callous-unemotional traits than in boys with low callous-unemotional traits. Conclusion: Results provide support to the emerging literature that argues for a genetically-driven system-wide alteration in serotonin function in the aetiology of callous-unemotional traits. The findings should be interpreted as preliminary, and future research that aims to replicate and further investigate these results is required. PMID:23457595
An exploration of the serotonin system in antisocial boys with high levels of callous-unemotional traits.

PubMed

Moul, Caroline; Dobson-Stone, Carol; Brennan, John; Hawes, David; Dadds, Mark

2013-01-01

The serotonin system is thought to play a role in the aetiology of antisocial and aggressive behaviour in both adults and children; however, previous findings have been inconsistent. Recently, research has suggested that the function of the serotonin system may be specifically altered in a sub-set of antisocial populations - those with psychopathic (callous-unemotional) personality traits. We explored the relationships between callous-unemotional traits and functional polymorphisms of selected serotonin-system genes, and tested the association between callous-unemotional traits and serum serotonin levels independently of antisocial and aggressive behaviour. Participants were boys with antisocial behaviour problems aged 3-16 years referred to University of New South Wales Child Behaviour Research Clinics. Participants volunteered either a blood or saliva sample from which levels of serum serotonin (N = 66) and/or serotonin-system single nucleotide polymorphisms (N = 157) were assayed. Functional single nucleotide polymorphisms from the serotonin 1b receptor gene (HTR1B) and 2a receptor gene (HTR2A) were found to be associated with callous-unemotional traits. Serum serotonin level was a significant predictor of callous-unemotional traits; levels were significantly lower in boys with high callous-unemotional traits than in boys with low callous-unemotional traits. Results provide support to the emerging literature that argues for a genetically-driven system-wide alteration in serotonin function in the aetiology of callous-unemotional traits. The findings should be interpreted as preliminary, and future research that aims to replicate and further investigate these results is required.

The Ketogenic Diet and Potassium Channel Function

DTIC Science & Technology

2014-10-01

1 AWARD NUMBER: W81XWH-13-1-0463 TITLE: The Ketogenic Diet and Potassium Channel Function...Unlimited 13. SUPPLEMENTARY NOTES 14. ABSTRACT The overall objective of this Discovery Award is to explore the hypothesis the ketogenic diet ...have examining the impact of the ketogenic diet on mice in which the gene that encodes Kvβ2 has been deleted (Kvβ2 KO mice) using an in vitro model of
Exploration of structural stability in deleterious nsSNPs of the XPA gene: A molecular dynamics approach.

PubMed

Nagasundaram, N; Priya Doss, C George

2011-01-01

Distinguishing the deleterious from the massive number of non-functional nsSNPs that occur within a single genome is a considerable challenge in mutation research. In this approach, we have used the existing in silico methods to explore the mutation-structure-function relationship in the XPA gene. We used the Sorting Intolerant From Tolerant (SIFT), Polymorphism Phenotyping (PolyPhen), I-Mutant 2.0, and the Protein Analysis THrough Evolutionary Relationships methods to predict the effects of deleterious nsSNPs on protein function and evaluated the impact of mutation on protein stability by Molecular Dynamics simulations. By comparing the scores of all the four in silico methods, nsSNP with an ID rs104894131 at position C108F was predicted to be highly deleterious. We extended our Molecular dynamics approach to gain insight into the impact of this non-synonymous polymorphism on structural changes that may affect the activity of the XPA gene. Based on the in silico methods score, potential energy, root-mean-square deviation, and root-mean-square fluctuation, we predict that deleterious nsSNP at position C108F would play a significant role in causing disease by the XPA gene. Our approach would present the application of in silico tools in understanding the functional variation from the perspective of structure, evolution, and phenotype.
Nuclear Imaging for Assessment of Prostate Cancer Gene Therapy

DTIC Science & Technology

2007-03-01

thymidine kinase transfected EL4 cells. Further exploration of Tc-99m conjugated potential HSV1-TK substrates is still undergoing in our laboratory...prostate cancer cells, has been demonstrated the utility for tissue-specific toxic gene therapy for prostate cancer[10, 11]. Therefore, an adenovirus...BJ5183 together with pAdeasy-1, the viral DNA plasmid. The pAdeasy-1 is E1 and E3 deleted, its E1 function can be complemented in 293A cells. The
Muscle-specific gene expression is underscored by differential stressor responses and coexpression changes.

PubMed

Moreno-Sánchez, Natalia; Rueda, Julia; Reverter, Antonio; Carabaño, María Jesús; Díaz, Clara

2012-03-01

Variations on the transcriptome from one skeletal muscle type to another still remain unknown. The reliable identification of stable gene coexpression networks is essential to unravel gene functions and define biological processes. The differential expression of two distinct muscles, M. flexor digitorum (FD) and M. psoas major (PM), was studied using microarrays in cattle to illustrate muscle-specific transcription patterns and to quantify changes in connectivity regarding the expected gene coexpression pattern. A total of 206 genes were differentially expressed (DE), 94 upregulated in PM and 112 in FD. The distribution of DE genes in pathways and biological functions was explored in the context of system biology. Global interactomes for genes of interest were predicted. Fast/slow twitch genes, genes coding for extracellular matrix, ribosomal and heat shock proteins, and fatty acid uptake centred the specific gene expression patterns per muscle. Genes involved in repairing mechanisms, such as ribosomal and heat shock proteins, suggested a differential ability of muscles to react to similar stressing factors, acting preferentially in slow twitch muscles. Muscle attributes do not seem to be completely explained by the muscle fibre composition. Changes in connectivity accounted for 24% of significant correlations between DE genes. Genes changing their connectivity mostly seem to contribute to the main differential attributes that characterize each specific muscle type. These results underscore the unique flexibility of skeletal muscle where a substantial set of genes are able to change their behavior depending on the circumstances.
Cross-biome metagenomic analyses of soil microbial communities and their functional attributes.

PubMed

Fierer, Noah; Leff, Jonathan W; Adams, Byron J; Nielsen, Uffe N; Bates, Scott Thomas; Lauber, Christian L; Owens, Sarah; Gilbert, Jack A; Wall, Diana H; Caporaso, J Gregory

2012-12-26

For centuries ecologists have studied how the diversity and functional traits of plant and animal communities vary across biomes. In contrast, we have only just begun exploring similar questions for soil microbial communities despite soil microbes being the dominant engines of biogeochemical cycles and a major pool of living biomass in terrestrial ecosystems. We used metagenomic sequencing to compare the composition and functional attributes of 16 soil microbial communities collected from cold deserts, hot deserts, forests, grasslands, and tundra. Those communities found in plant-free cold desert soils typically had the lowest levels of functional diversity (diversity of protein-coding gene categories) and the lowest levels of phylogenetic and taxonomic diversity. Across all soils, functional beta diversity was strongly correlated with taxonomic and phylogenetic beta diversity; the desert microbial communities were clearly distinct from the nondesert communities regardless of the metric used. The desert communities had higher relative abundances of genes associated with osmoregulation and dormancy, but lower relative abundances of genes associated with nutrient cycling and the catabolism of plant-derived organic compounds. Antibiotic resistance genes were consistently threefold less abundant in the desert soils than in the nondesert soils, suggesting that abiotic conditions, not competitive interactions, are more important in shaping the desert microbial communities. As the most comprehensive survey of soil taxonomic, phylogenetic, and functional diversity to date, this study demonstrates that metagenomic approaches can be used to build a predictive understanding of how microbial diversity and function vary across terrestrial biomes.
Systems level analysis of the Chlamydomonas reinhardtii metabolic network reveals variability in evolutionary co-conservation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chaiboonchoe, Amphun; Ghamsari, Lila; Dohai, Bushra

Metabolic networks, which are mathematical representations of organismal metabolism, are reconstructed to provide computational platforms to guide metabolic engineering experiments and explore fundamental questions on metabolism. Systems level analyses, such as interrogation of phylogenetic relationships within the network, can provide further guidance on the modification of metabolic circuitries. Chlamydomonas reinhardtii, a biofuel relevant green alga that has retained key genes with plant, animal, and protist affinities, serves as an ideal model organism to investigate the interplay between gene function and phylogenetic affinities at multiple organizational levels. Here, using detailed topological and functional analyses, coupled with transcriptomics studies on a metabolic network that we have reconstructed for C. reinhardtii, we show that network connectivity has a significant concordance with the co-conservation of genes; however, a distinction between topological and functional relationships is observable within the network. Dynamic and static modes of co-conservation were defined and observed in a subset of gene-pairs across the network topologically. In contrast, genes with predicted synthetic interactions, or genes involved in coupled reactions, show significant enrichment for both shorter and longer phylogenetic distances. Based on our results, we propose that the metabolic network of C. reinhardtii is assembled with an architecture to minimize phylogenetic profile distances topologically, while it includes an expansion of such distances for functionally interacting genes. This arrangement may increase the robustness of C. reinhardtii's network in dealing with varied environmental challenges that the species may face. As a result, the defined evolutionary constraints within the network, which identify important pairings of genes in metabolism, may offer guidance on synthetic biology approaches to optimize the production of desirable metabolites.
Systems level analysis of the Chlamydomonas reinhardtii metabolic network reveals variability in evolutionary co-conservation.

PubMed

Chaiboonchoe, Amphun; Ghamsari, Lila; Dohai, Bushra; Ng, Patrick; Khraiwesh, Basel; Jaiswal, Ashish; Jijakli, Kenan; Koussa, Joseph; Nelson, David R; Cai, Hong; Yang, Xinping; Chang, Roger L; Papin, Jason; Yu, Haiyuan; Balaji, Santhanam; Salehi-Ashtiani, Kourosh

2016-07-19

Metabolic networks, which are mathematical representations of organismal metabolism, are reconstructed to provide computational platforms to guide metabolic engineering experiments and explore fundamental questions on metabolism. Systems level analyses, such as interrogation of phylogenetic relationships within the network, can provide further guidance on the modification of metabolic circuitries. Chlamydomonas reinhardtii, a biofuel relevant green alga that has retained key genes with plant, animal, and protist affinities, serves as an ideal model organism to investigate the interplay between gene function and phylogenetic affinities at multiple organizational levels. Here, using detailed topological and functional analyses, coupled with transcriptomics studies on a metabolic network that we have reconstructed for C. reinhardtii, we show that network connectivity has a significant concordance with the co-conservation of genes; however, a distinction between topological and functional relationships is observable within the network. Dynamic and static modes of co-conservation were defined and observed in a subset of gene-pairs across the network topologically. In contrast, genes with predicted synthetic interactions, or genes involved in coupled reactions, show significant enrichment for both shorter and longer phylogenetic distances. Based on our results, we propose that the metabolic network of C. reinhardtii is assembled with an architecture to minimize phylogenetic profile distances topologically, while it includes an expansion of such distances for functionally interacting genes. This arrangement may increase the robustness of C. reinhardtii's network in dealing with varied environmental challenges that the species may face. The defined evolutionary constraints within the network, which identify important pairings of genes in metabolism, may offer guidance on synthetic biology approaches to optimize the production of desirable metabolites.
Systems level analysis of the Chlamydomonas reinhardtii metabolic network reveals variability in evolutionary co-conservation

DOE PAGES

Chaiboonchoe, Amphun; Ghamsari, Lila; Dohai, Bushra; ...

2016-06-14

Metabolic networks, which are mathematical representations of organismal metabolism, are reconstructed to provide computational platforms to guide metabolic engineering experiments and explore fundamental questions on metabolism. Systems level analyses, such as interrogation of phylogenetic relationships within the network, can provide further guidance on the modification of metabolic circuitries. Chlamydomonas reinhardtii, a biofuel relevant green alga that has retained key genes with plant, animal, and protist affinities, serves as an ideal model organism to investigate the interplay between gene function and phylogenetic affinities at multiple organizational levels. Here, using detailed topological and functional analyses, coupled with transcriptomics studies on a metabolic network that we have reconstructed for C. reinhardtii, we show that network connectivity has a significant concordance with the co-conservation of genes; however, a distinction between topological and functional relationships is observable within the network. Dynamic and static modes of co-conservation were defined and observed in a subset of gene-pairs across the network topologically. In contrast, genes with predicted synthetic interactions, or genes involved in coupled reactions, show significant enrichment for both shorter and longer phylogenetic distances. Based on our results, we propose that the metabolic network of C. reinhardtii is assembled with an architecture to minimize phylogenetic profile distances topologically, while it includes an expansion of such distances for functionally interacting genes. This arrangement may increase the robustness of C. reinhardtii's network in dealing with varied environmental challenges that the species may face. As a result, the defined evolutionary constraints within the network, which identify important pairings of genes in metabolism, may offer guidance on synthetic biology approaches to optimize the production of desirable metabolites.
Eucalyptus hairy roots, a fast, efficient and versatile tool to explore function and expression of genes involved in wood formation.

PubMed

Plasencia, Anna; Soler, Marçal; Dupas, Annabelle; Ladouce, Nathalie; Silva-Martins, Guilherme; Martinez, Yves; Lapierre, Catherine; Franche, Claudine; Truchet, Isabelle; Grima-Pettenati, Jacqueline

2016-06-01

Eucalyptus are of tremendous economic importance being the most planted hardwoods worldwide for pulp and paper, timber and bioenergy. The recent release of the Eucalyptus grandis genome sequence pointed out many new candidate genes potentially involved in secondary growth, wood formation or lineage-specific biosynthetic pathways. Their functional characterization is, however, hindered by the tedious, time-consuming and inefficient transformation systems available hitherto for eucalypts. To overcome this limitation, we developed a fast, reliable and efficient protocol to obtain and easily detect co-transformed E. grandis hairy roots using fluorescent markers, with an average efficiency of 62%. We set up conditions both to cultivate excised roots in vitro and to harden composite plants and verified that hairy root morphology and vascular system anatomy were similar to wild-type ones. We further demonstrated that co-transformed hairy roots are suitable for medium-throughput functional studies enabling, for instance, protein subcellular localization, gene expression patterns through RT-qPCR and promoter expression, as well as the modulation of endogenous gene expression. Down-regulation of the Eucalyptus cinnamoyl-CoA reductase1 (EgCCR1) gene, encoding a key enzyme in lignin biosynthesis, led to transgenic roots with reduced lignin levels and thinner cell walls. This gene was used as a proof of concept to demonstrate that the function of genes involved in secondary cell wall biosynthesis and wood formation can be elucidated in transgenic hairy roots using histochemical, transcriptomic and biochemical approaches. The method described here is timely because it will accelerate gene mining of the genome for both basic research and industry purposes. © 2015 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.
Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones

PubMed Central

Imanishi, Tadashi; Itoh, Takeshi; Suzuki, Yutaka; O'Donovan, Claire; Fukuchi, Satoshi; Koyanagi, Kanako O; Barrero, Roberto A; Tamura, Takuro; Yamaguchi-Kabata, Yumi; Tanino, Motohiko; Yura, Kei; Miyazaki, Satoru; Ikeo, Kazuho; Homma, Keiichi; Kasprzyk, Arek; Nishikawa, Tetsuo; Hirakawa, Mika; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Ashurst, Jennifer; Jia, Libin; Nakao, Mitsuteru; Thomas, Michael A; Mulder, Nicola; Karavidopoulou, Youla; Jin, Lihua; Kim, Sangsoo; Yasuda, Tomohiro; Lenhard, Boris; Eveno, Eric; Suzuki, Yoshiyuki; Yamasaki, Chisato; Takeda, Jun-ichi; Gough, Craig; Hilton, Phillip; Fujii, Yasuyuki; Sakai, Hiroaki; Tanaka, Susumu; Amid, Clara; Bellgard, Matthew; Bonaldo, Maria de Fatima; Bono, Hidemasa; Bromberg, Susan K; Brookes, Anthony J; Bruford, Elspeth; Carninci, Piero; Chelala, Claude; Couillault, Christine; de Souza, Sandro J.; Debily, Marie-Anne; Devignes, Marie-Dominique; Dubchak, Inna; Endo, Toshinori; Estreicher, Anne; Eyras, Eduardo; Fukami-Kobayashi, Kaoru; R. 
Gopinath, Gopal; Graudens, Esther; Hahn, Yoonsoo; Han, Michael; Han, Ze-Guang; Hanada, Kousuke; Hanaoka, Hideki; Harada, Erimi; Hashimoto, Katsuyuki; Hinz, Ursula; Hirai, Momoki; Hishiki, Teruyoshi; Hopkinson, Ian; Imbeaud, Sandrine; Inoko, Hidetoshi; Kanapin, Alexander; Kaneko, Yayoi; Kasukawa, Takeya; Kelso, Janet; Kersey, Paul; Kikuno, Reiko; Kimura, Kouichi; Korn, Bernhard; Kuryshev, Vladimir; Makalowska, Izabela; Makino, Takashi; Mano, Shuhei; Mariage-Samson, Regine; Mashima, Jun; Matsuda, Hideo; Mewes, Hans-Werner; Minoshima, Shinsei; Nagai, Keiichi; Nagasaki, Hideki; Nagata, Naoki; Nigam, Rajni; Ogasawara, Osamu; Ohara, Osamu; Ohtsubo, Masafumi; Okada, Norihiro; Okido, Toshihisa; Oota, Satoshi; Ota, Motonori; Ota, Toshio; Otsuki, Tetsuji; Piatier-Tonneau, Dominique; Poustka, Annemarie; Ren, Shuang-Xi; Saitou, Naruya; Sakai, Katsunaga; Sakamoto, Shigetaka; Sakate, Ryuichi; Schupp, Ingo; Servant, Florence; Sherry, Stephen; Shiba, Rie; Shimizu, Nobuyoshi; Shimoyama, Mary; Simpson, Andrew J; Soares, Bento; Steward, Charles; Suwa, Makiko; Suzuki, Mami; Takahashi, Aiko; Tamiya, Gen; Tanaka, Hiroshi; Taylor, Todd; Terwilliger, Joseph D; Unneberg, Per; Veeramachaneni, Vamsi; Watanabe, Shinya; Wilming, Laurens; Yasuda, Norikazu; Yoo, Hyang-Sook; Stodolsky, Marvin; Makalowski, Wojciech; Go, Mitiko; Nakai, Kenta; Takagi, Toshihisa; Kanehisa, Minoru; Sakaki, Yoshiyuki; Quackenbush, John; Okazaki, Yasushi; Hayashizaki, Yoshihide; Hide, Winston; Chakraborty, Ranajit; Nishikawa, Ken; Sugawara, Hideaki; Tateno, Yoshio; Chen, Zhu; Oishi, Michio; Tonellato, Peter; Apweiler, Rolf; Okubo, Kousaku; Wagner, Lukas; Wiemann, Stefan; Strausberg, Robert L; Isogai, Takao; Auffray, Charles; Nomura, Nobuo; Sugano, Sumio

2004-01-01

The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. 
In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology. PMID:15103394
The Role of Leptin, Melanocortin, and Neurotrophin System Genes on Body Weight in Anorexia Nervosa and Bulimia Nervosa

PubMed Central

Yilmaz, Zeynep; Kaplan, Allan S.; Tiwari, Arun K.; Levitan, Robert D.; Piran, Sara; Bergen, Andrew W.; Kaye, Walter H.; Hakonarson, Hakon; Wang, Kai; Berrettini, Wade H.; Brandt, Harry A.; Bulik, Cynthia M.; Crawford, Steve; Crow, Scott; Fichter, Manfred M.; Halmi, Katherine A.; Johnson, Craig L.; Keel, Pamela K.; Klump, Kelly L.; Magistretti, Pierre; Mitchell, James E.; Strober, Michael; Thornton, Laura M.; Treasure, Janet; Woodside, D. Blake; Knight, Joanne; Kennedy, James L.

2014-01-01

Objective Although low weight is a key factor contributing to the high mortality in anorexia nervosa (AN), it is unclear how AN patients sustain low weight compared with bulimia nervosa (BN) patients with similar psychopathology. Studies of genes involved in appetite and weight regulation in eating disorders have yielded variable findings in part due to small sample size and clinical heterogeneity. This study: (1) assessed the role of leptin, melanocortin, and neurotrophin genetic variants in conferring risk for AN and BN and (2) explored the involvement of these genes in body mass index (BMI) variations within AN and BN. Method Our sample consisted of 745 individuals with AN without a history of BN, 245 with BN without a history of AN, and 321 controls. We genotyped 20 markers with known or putative function among genes selected from leptin, melanocortin, and neurotrophin systems. Results There were no significant differences in allele frequencies among individuals with AN, BN, and controls. AGRP rs13338499 polymorphism was associated with lowest illness-related BMI in those with AN (p=0.0013), and NTRK2 rs1042571 was associated with highest BMI in those with BN (p=0.0018). Discussion To our knowledge, this is the first study to address the issue of clinical heterogeneity in eating disorder genetics and to explore the role of known or putatively functional markers in genes regulating appetite and weight in individuals with AN and BN. If replicated, our results may serve as an important first step toward gaining a better understanding of weight regulation in eating disorders. PMID:24831852
MutationAligner: a resource of recurrent mutation hotspots in protein domains in cancer.

PubMed

Gauthier, Nicholas Paul; Reznik, Ed; Gao, Jianjiong; Sumer, Selcuk Onur; Schultz, Nikolaus; Sander, Chris; Miller, Martin L

2016-01-04

The MutationAligner web resource, available at http://www.mutationaligner.org, enables discovery and exploration of somatic mutation hotspots identified in protein domains in more than 5000 cancer patient samples across 22 different tumor types (as of mid-2015). Using multiple sequence alignments of protein domains in the human genome, we extend the principle of recurrence analysis by aggregating mutations in homologous positions across sets of paralogous genes. Protein domain analysis enhances the statistical power to detect cancer-relevant mutations and links mutations to the specific biological functions encoded in domains. We illustrate how the MutationAligner database and interactive web tool can be used to explore, visualize and analyze mutation hotspots in protein domains across genes and tumor types. We believe that MutationAligner will be an important resource for the cancer research community by providing detailed clues for the functional importance of particular mutations, as well as for the design of functional genomics experiments and for decision support in precision medicine. MutationAligner is slated to be periodically updated to incorporate additional analyses and new data from cancer genomics projects. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
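The idea of aggregating mutations at homologous positions can be illustrated with a small sketch. This is not MutationAligner's code: the function names, the toy alignment, and the mutation counts are all hypothetical. The sketch maps each gene's residue positions onto the columns of a shared domain alignment (skipping gaps) and sums mutation counts per column, so that a position mutated a few times in each of several paralogs stands out.

```python
from collections import Counter


def residue_to_column(aligned_seq):
    """Map 1-based ungapped residue positions to 0-based alignment columns."""
    mapping = {}
    residue = 0
    for col, char in enumerate(aligned_seq):
        if char != "-":
            residue += 1
            mapping[residue] = col
    return mapping


def aggregate_hotspots(alignment, mutations):
    """Sum mutation counts per alignment column across all genes.

    alignment: gene -> gapped domain sequence (all of equal length)
    mutations: gene -> {residue position within the domain: mutation count}
    """
    column_counts = Counter()
    for gene, aligned_seq in alignment.items():
        mapping = residue_to_column(aligned_seq)
        for residue, count in mutations.get(gene, {}).items():
            column_counts[mapping[residue]] += count
    return column_counts


# Toy data (hypothetical): three paralogs sharing one aligned domain.
alignment = {
    "GENE_A": "MK-LRV",
    "GENE_B": "MKALR-",
    "GENE_C": "-KALRV",
}
mutations = {
    "GENE_A": {4: 3},        # residue 4 (R) lies in alignment column 4
    "GENE_B": {5: 2},        # residue 5 (R) lies in alignment column 4
    "GENE_C": {4: 1, 1: 1},  # residue 4 (R) -> column 4, residue 1 (K) -> column 1
}
hotspots = aggregate_hotspots(alignment, mutations)
```

In this toy case no single gene has more than three mutations at the arginine, yet the aligned column accumulates six, which is the kind of gain in signal the abstract attributes to domain-level analysis.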
Computational genetic neuroanatomy of the developing mouse brain: dimensionality reduction, visualization, and clustering

PubMed Central

2013-01-01

Background The structured organization of cells in the brain plays a key role in its functional efficiency. This delicate organization is the consequence of unique molecular identity of each cell gradually established by precise spatiotemporal gene expression control during development. Currently, studies on the molecular-structural association are beginning to reveal how the spatiotemporal gene expression patterns are related to cellular differentiation and structural development. Results In this article, we aim at a global, data-driven study of the relationship between gene expressions and neuroanatomy in the developing mouse brain. To enable visual explorations of the high-dimensional data, we map the in situ hybridization gene expression data to a two-dimensional space by preserving both the global and the local structures. Our results show that the developing brain anatomy is largely preserved in the reduced gene expression space. To provide a quantitative analysis, we cluster the reduced data into groups and measure the consistency with neuroanatomy at multiple levels. Our results show that the clusters in the low-dimensional space are more consistent with neuroanatomy than those in the original space. Conclusions Gene expression patterns and developing brain anatomy are closely related. Dimensionality reduction and visual exploration facilitate the study of this relationship. PMID:23845024
Augmenting the Genetic Toolbox for Sulfolobus islandicus with a Stringent Positive Selectable Marker for Agmatine Prototrophy

PubMed Central

Cooper, Tara E.; Krause, David J.

2013-01-01

Sulfolobus species have become the model organisms for studying the unique biology of the crenarchaeal division of the archaeal domain. In particular, Sulfolobus islandicus provides a powerful opportunity to explore natural variation via experimental functional genomics. To support these efforts, we further expanded genetic tools for S. islandicus by developing a stringent positive selection for agmatine prototrophs in strains in which the argD gene, encoding arginine decarboxylase, has been deleted. Strains with deletions in argD were shown to be auxotrophic for agmatine even in nutrient-rich medium, but growth could be restored by either supplementation of exogenous agmatine or reintroduction of a functional copy of the argD gene from S. solfataricus P2 into the ΔargD host. Using this stringent selection, a robust targeted gene knockout system was established via an improved next generation of the MID (marker insertion and unmarked target gene deletion) method. Application of this novel system was validated by targeted knockout of the upsEF genes involved in UV-inducible cell aggregation formation. PMID:23835176
Update of the Diatom EST Database: a new tool for digital transcriptomics

PubMed Central

Maheswari, Uma; Mock, Thomas; Armbrust, E. Virginia; Bowler, Chris

2009-01-01

The Diatom Expressed Sequence Tag (EST) Database was constructed to provide integral access to ESTs from these ecologically and evolutionarily interesting microalgae. It has now been updated with 130 000 Phaeodactylum tricornutum ESTs from 16 cDNA libraries and 77 000 Thalassiosira pseudonana ESTs from seven libraries, derived from cells grown in different nutrient and stress regimes. The updated relational database incorporates results from statistical analyses such as log-likelihood ratios and hierarchical clustering, which help to identify differentially expressed genes under different conditions, and allow similarities in gene expression in different libraries to be investigated in a functional context. The database also incorporates links to the recently sequenced genomes of P. tricornutum and T. pseudonana, enabling an easy cross-talk between the expression pattern of diatom orthologs and the genome browsers. These improvements will facilitate exploration of diatom responses to conditions of ecological relevance and will aid gene function identification of diatom-specific genes and in silico gene prediction in this largely unexplored class of eukaryotes. The updated Diatom EST Database is available at http://www.biologie.ens.fr/diatomics/EST3. PMID:19029140
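As an illustration of how a log-likelihood ratio can flag differentially expressed genes from EST counts, the following sketch computes a Poisson log-likelihood ratio statistic for one gene across several libraries. It is a generic formulation and may differ from the statistic the database actually uses; the function name and the example counts are hypothetical.

```python
import math


def log_likelihood_ratio(counts, library_sizes):
    """Poisson log-likelihood ratio statistic for one gene's EST counts.

    Compares the observed count in each library with the count expected
    if the gene were expressed at the same frequency in every library.
    Larger values indicate stronger evidence of differential expression.
    """
    total_count = sum(counts)
    total_size = sum(library_sizes)
    statistic = 0.0
    for observed, size in zip(counts, library_sizes):
        expected = total_count * size / total_size
        if observed > 0:  # a zero count contributes nothing to the sum
            statistic += observed * math.log(observed / expected)
    return 2.0 * statistic


# Hypothetical example: 30 ESTs in one library, none in another of equal size.
skewed = log_likelihood_ratio([30, 0], [1000, 1000])
# Hypothetical example: counts exactly proportional to library size.
uniform = log_likelihood_ratio([10, 20], [1000, 2000])
```

A gene whose counts track library size scores zero, while a gene confined to one library scores high; ranking genes by this value gives a simple first pass before clustering.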
HTS-Net: An integrated regulome-interactome approach for establishing network regulation models in high-throughput screenings

PubMed Central

Rioualen, Claire; Da Costa, Quentin; Chetrit, Bernard; Charafe-Jauffret, Emmanuelle; Ginestier, Christophe

2017-01-01

High-throughput RNAi screenings (HTS) allow quantifying the impact of the deletion of each gene in any particular function, from virus-host interactions to cell differentiation. However, there has been less development of functional analysis tools dedicated to RNAi analyses. HTS-Net, a network-based analysis program, was developed to identify gene regulatory modules impacted in high-throughput screenings, by integrating transcription factor-target gene interaction data (regulome) and protein-protein interaction networks (interactome) on top of screening z-scores. HTS-Net produces exhaustive HTML reports for results navigation and exploration. HTS-Net is a new pipeline for RNA interference screening analyses that achieves better performance than simple gene rankings by z-scores, by re-prioritizing genes and replacing them in their biological context, as shown by the three studies that we reanalyzed. Formatted input data for the three studied datasets, source code and web site for testing the system are available from the companion web site at http://htsnet.marseille.inserm.fr/. We also compared our program with existing algorithms (CARD and hotnet2). PMID:28949986
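To make the contrast between per-gene z-score ranking and module-level scoring concrete, here is a minimal sketch. It does not reproduce the HTS-Net algorithm: the Stouffer-style combination, the function names, and the toy readouts are assumptions chosen for illustration only.

```python
import math
from statistics import mean, pstdev


def z_scores(readouts):
    """Standardize raw screening readouts (gene -> value) to z-scores."""
    values = list(readouts.values())
    mu, sigma = mean(values), pstdev(values)
    return {gene: (value - mu) / sigma for gene, value in readouts.items()}


def module_score(module_genes, z):
    """Stouffer-style combined score for the screened genes of one module."""
    scored = [z[gene] for gene in module_genes if gene in z]
    return sum(scored) / math.sqrt(len(scored)) if scored else 0.0


def rank_modules(regulome, z):
    """Rank modules (a transcription factor plus its targets) by |score|."""
    scores = {
        tf: module_score([tf, *targets], z) for tf, targets in regulome.items()
    }
    return sorted(scores.items(), key=lambda item: abs(item[1]), reverse=True)


# Toy screen (hypothetical values): TF1 and its targets all score high.
readouts = {
    "TF1": 4.0, "G1": 5.0, "G2": 4.5,
    "TF2": 0.2, "G3": -0.1, "G4": 0.1, "G5": 0.0, "G6": -0.2,
}
regulome = {"TF1": ["G1", "G2"], "TF2": ["G3", "G4"]}  # hypothetical TF -> targets

z = z_scores(readouts)
ranked = rank_modules(regulome, z)
```

Scoring a regulator together with its targets rewards coherent groups of moderate hits, which a flat ranking of individual z-scores would not surface as a unit.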
Computational Methods to Work as First-Pass Filter in Deleterious SNP Analysis of Alkaptonuria

PubMed Central

Magesh, R.; George Priya Doss, C.

2012-01-01

A major challenge in the analysis of human genetic variation is to distinguish functional from nonfunctional SNPs. Discovering these functional SNPs is one of the main goals of modern genetics and genomics studies. There is a need to effectively and efficiently identify functionally important nsSNPs which may be deleterious or disease causing and to identify their molecular effects. The prediction of the phenotype of nsSNPs by computational analysis may provide a good way to explore the function of nsSNPs and their relationship with susceptibility to disease. In this context, we surveyed and compared variation databases along with in silico prediction programs to assess the effects of deleterious functional variants on protein functions. In addition, we used these methods as a first-pass filter to identify the deleterious substitutions worth pursuing in further experimental research. In this analysis, we used the existing computational methods to explore the mutation-structure-function relationship in the HGD gene underlying alkaptonuria. PMID:22606059
University of Texas MD Anderson Cancer Center: Characterization of PIK3R1 Neomorphic Mutations | Office of Cancer Genomics

Cancer.gov

The goal of this project was to functionally characterize the most frequent mutation of the PIK3R1 gene and to explore potential therapeutic approaches to target the aberration.
Breath Biomarkers in Environmental Health Science: Exploring Patterns in the Human Exposome

EPA Science Inventory

The human genome is the counterpart to the human exposome with respect to the gene × environment interaction that describes health state and outcome. The genome has already been sequenced and is in the process of being assessed for specific functionality; to similarly decode the ...
University of Texas MD Anderson Cancer Center (UT-MDACC): Characterization of PIK3R1 Neomorphic Mutations | Office of Cancer Genomics

Cancer.gov

The goal of this project was to functionally characterize the most frequent mutation of the PIK3R1 gene and to explore potential therapeutic approaches to target the aberration.

Self-Immolative Polycations as Gene Delivery Vectors and Prodrugs Targeting Polyamine Metabolism in Cancer

PubMed Central

2015-01-01

Polycations are explored as carriers to deliver therapeutic nucleic acids. Polycations are conventionally pharmacologically inert, with the sole function of delivering therapeutic cargo. This study reports synthesis of a self-immolative polycation (DSS-BEN) based on a polyamine analogue drug N1,N11-bisethylnorspermine (BENSpm). The polycation was designed to function dually as a gene delivery carrier and a prodrug targeting dysregulated polyamine metabolism in cancer. Using a combination of NMR and HPLC, we confirm that the self-immolative polycation undergoes intracellular degradation into the parent drug BENSpm. The released BENSpm depletes cellular levels of spermidine and spermine and upregulates polyamine catabolic enzymes spermine/spermidine N1-acetyltransferase (SSAT) and spermine oxidase (SMO). The synthesized polycations form polyplexes with DNA and facilitate efficient transfection. Taking advantage of the ability of BENSpm to sensitize cancer cells to TNFα-induced apoptosis, we show that DSS-BEN enhances the cell killing activity of TNFα gene therapy. The reported findings validate DSS-BEN as a dual-function delivery system that can deliver a therapeutic gene and improve the outcome of gene therapy as a result of the intracellular degradation of DSS-BEN to BENSpm and the subsequent beneficial effect of BENSpm on dysregulated polyamine metabolism in cancer. PMID:25153488
Toll pathway is required for wound-induced expression of barrier repair genes in the Drosophila epidermis

PubMed Central

Capilla, Amalia; Karachentsev, Dmitry; Patterson, Rachel A.; Hermann, Anita; Juarez, Michelle T.; McGinnis, William

2017-01-01

The epidermis serves as a protective barrier in animals. After epidermal injury, barrier repair requires activation of many wound response genes in epidermal cells surrounding wound sites. Two such genes in Drosophila encode the enzymes dopa decarboxylase (Ddc) and tyrosine hydroxylase (ple). In this paper we explore the involvement of the Toll/NF-κB pathway in the localized activation of wound repair genes around epidermal breaks. Robust activation of wound-induced transcription from ple and Ddc requires Toll pathway components ranging from the extracellular ligand Spätzle to the Dif transcription factor. Epistasis experiments indicate a requirement for Spätzle ligand downstream of hydrogen peroxide and protease function, both of which are known activators of wound-induced transcription. The localized activation of Toll a few cell diameters from wound edges is reminiscent of local activation of Toll in early embryonic ventral hypoderm, consistent with the hypothesis that the dorsal–ventral patterning function of Toll arose from the evolutionary cooption of a morphogen-responsive function in wound repair. Furthermore, the combinatorial activity of Toll and other signaling pathways in activating epidermal barrier repair genes can help explain why developmental activation of the Toll, ERK, or JNK pathways alone fails to activate wound repair loci. PMID:28289197
Conserved noncoding sequences conserve biological networks and influence genome evolution.

PubMed

Xie, Jianbo; Qian, Kecheng; Si, Jingna; Xiao, Liang; Ci, Dong; Zhang, Deqiang

2018-05-01

Comparative genomics approaches have identified numerous conserved cis-regulatory sequences near genes in plant genomes. Despite the identification of these conserved noncoding sequences (CNSs), our knowledge of their functional importance and selection remains limited. Here, we used a combination of DNA methylome analysis, microarray expression analyses, and functional annotation to study these sequences in the model tree Populus trichocarpa. Methylation in CG contexts and non-CG contexts was lower in CNSs, particularly CNSs in the 5'-upstream regions of genes, compared with other sites in the genome. We observed that CNSs are enriched in genes with transcription and binding functions, and that this enrichment is also associated with syntenic genes and those from whole-genome duplications, suggesting that cis-regulatory sequences play a key role in genome evolution. We detected a significant positive correlation between CNS number and protein interactions, suggesting that CNSs may have roles in the evolution and maintenance of biological networks. The divergence of CNSs indicates that duplication-degeneration-complementation drives the subfunctionalization of a proportion of duplicated genes from whole-genome duplication. Furthermore, population genomics confirmed that most CNSs are under strong purifying selection and only a small subset of CNSs shows evidence of adaptive evolution. These findings provide a foundation for future studies exploring these key genomic features in the maintenance of biological networks, local adaptation, and transcription.
Computational Identification and Functional Predictions of Long Noncoding RNA in Zea mays

PubMed Central

Boerner, Susan; McGinnis, Karen M.

2012-01-01

Background Computational analysis of cDNA sequences from multiple organisms suggests that a large portion of transcribed DNA does not code for a functional protein. In mammals, noncoding transcription is abundant, and often results in functional RNA molecules that do not appear to encode proteins. Many long noncoding RNAs (lncRNAs) appear to have epigenetic regulatory function in humans, including HOTAIR and XIST. While epigenetic gene regulation is clearly an essential mechanism in plants, relatively little is known about the presence or function of lncRNAs in plants. Methodology/Principal Findings To explore the connection between lncRNA and epigenetic regulation of gene expression in plants, a computational pipeline using the programming language Python has been developed and applied to maize full length cDNA sequences to identify, classify, and localize potential lncRNAs. The pipeline was used in parallel with an SVM tool for identifying ncRNAs to identify the maximal number of ncRNAs in the dataset. Although the available library of sequences was small and potentially biased toward protein coding transcripts, 15% of the sequences were predicted to be noncoding. Approximately 60% of these sequences appear to act as precursors for small RNA molecules and may function to regulate gene expression via a small RNA dependent mechanism. ncRNAs were predicted to originate from both genic and intergenic loci. Of the lncRNAs that originated from genic loci, ∼20% were antisense to the host gene loci. Conclusions/Significance Consistent with similar studies in other organisms, noncoding transcription appears to be widespread in the maize genome. Computational predictions indicate that maize lncRNAs may function to regulate expression of other genes through multiple RNA mediated mechanisms. PMID:22916204
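The abstract does not describe the pipeline's internals, so the following is only a sketch of one common first step in such pipelines: flagging a transcript as a putative lncRNA when it is long but lacks a substantial open reading frame. The thresholds (at least 200 nt, longest ORF under 100 codons) and all function names are assumptions, not details taken from the study.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def longest_orf_codons(seq):
    """Longest ATG-initiated ORF, in codons (stop excluded), on either strand.

    An ORF that runs off the end of the sequence without a stop codon is
    still counted, which keeps the noncoding call conservative.
    """
    seq = seq.upper()
    best = 0
    for strand in (seq, seq.translate(COMPLEMENT)[::-1]):
        for frame in range(3):
            current = 0  # codons in the ORF currently open; 0 means none
            for i in range(frame, len(strand) - 2, 3):
                codon = strand[i:i + 3]
                if current == 0:
                    if codon == "ATG":
                        current = 1
                elif codon in STOP_CODONS:
                    best = max(best, current)
                    current = 0
                else:
                    current += 1
            best = max(best, current)
    return best


def is_putative_lncrna(seq, min_length=200, max_orf_codons=100):
    """Assumed rule: long transcript whose longest ORF is below the cutoff."""
    return len(seq) >= min_length and longest_orf_codons(seq) < max_orf_codons


# Hypothetical test sequences.
coding_like = "ATG" + "GCT" * 120 + "TAA"   # one 121-codon ORF
noncoding_like = "ATGTAA" * 40              # 240 nt, only 1-codon ORFs
```

A real pipeline would add further evidence, such as homology searches or the SVM classifier mentioned in the abstract, because ORF length alone misclassifies short protein-coding genes.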
Variable sexually dimorphic gene expression in laboratory strains of Drosophila melanogaster.

PubMed

Baker, Dean A; Meadows, Lisa A; Wang, Jing; Dow, Julian At; Russell, Steven

2007-12-10

Wild-type laboratory strains of model organisms are typically kept in isolation for many years, with the action of genetic drift and selection on mutational variation causing lineages to diverge with time. Natural populations from which such strains are established show that gender-specific interactions in particular drive many aspects of sequence level and transcriptional level variation. Here, our goal was to identify genes that display transcriptional variation between laboratory strains of Drosophila melanogaster, and to explore evidence of gender-biased interactions underlying that variability. Transcriptional variation among the laboratory genotypes studied occurs more frequently in males than in females. Qualitative differences are also apparent, suggesting that genes within particular functional classes disproportionately display variation in gene expression. Our analysis indicates that genes with reproductive functions are most often divergent between genotypes in both sexes; however, a large proportion of female variation can also be attributed to genes without expression in the ovaries. The present study clearly shows that transcriptional variation between common laboratory strains of Drosophila can differ dramatically due to sexual dimorphism. Much of this variation reflects sex-specific challenges associated with divergent physiological trade-offs, morphology and regulatory pathways operating within males and females.
Genomic evidence of gene duplication and adaptive evolution of Toll like receptors (TLR2 and TLR4) in reptiles.

PubMed

Shang, Shuai; Zhong, Huaming; Wu, Xiaoyang; Wei, Qinguo; Zhang, Huanxin; Chen, Jun; Chen, Yao; Tang, Xuexi; Zhang, Honghai

2018-04-01

Toll-like receptors (TLRs) encoded by the TLR multigene family play an important role in initial pathogen recognition in vertebrates. Among the TLRs, TLR2 and TLR4 may be of particular importance to reptiles. In order to study the evolutionary patterns and structural characteristics of TLRs, we explored the available genomes of several representative members of reptiles. Twenty-five TLR2 genes and 19 TLR4 genes from reptiles were obtained in this study. Phylogenetic results showed that TLR2 gene duplication occurred in several species. Evolutionary analysis by at least two methods identified 30 and 13 common positively selected codons in TLR2 and TLR4, respectively. Most positively selected sites of TLR2 and TLR4 were located in the leucine-rich repeats (LRRs). Branch model analysis showed that TLR2 genes were under different evolutionary forces in reptiles, while the TLR4 genes showed no significant selection pressure. The different evolutionary adaptation of TLR2 and TLR4 among the reptiles might be due to their different functions in recognizing bacteria. Overall, we explored the structure and evolution of TLR2 and TLR4 genes in reptiles for the first time. Our study revealed valuable information regarding TLR2 and TLR4 in reptiles, and provided novel insights into the conservation concern of natural populations. Copyright © 2017 Elsevier B.V. All rights reserved.
RNA-Seq and Gene Network Analysis Uncover Activation of an ABA-Dependent Signalosome During the Cork Oak Root Response to Drought

PubMed Central

Magalhães, Alexandre P.; Verde, Nuno; Reis, Francisca; Martins, Inês; Costa, Daniela; Lino-Neto, Teresa; Castro, Pedro H.; Tavares, Rui M.; Azevedo, Herlânder

2016-01-01

Quercus suber (cork oak) is a West Mediterranean species of key economic interest, being extensively explored for its ability to generate cork. Like other Mediterranean plants, Q. suber is significantly threatened by climatic changes, imposing the need to quickly understand its physiological and molecular adaptability to drought stress imposition. In the present report, we uncovered the differential transcriptome of Q. suber roots exposed to long-term drought, using an RNA-Seq approach. 454-sequencing reads were used to de novo assemble a reference transcriptome, and mapping of reads allowed the identification of 546 differentially expressed unigenes. These were enriched in both effector genes (e.g., LEA, chaperones, transporters) as well as regulatory genes, including transcription factors (TFs) belonging to various different classes, and genes associated with protein turnover. To further extend functional characterization, we identified the orthologs of differentially expressed unigenes in the model species Arabidopsis thaliana, which then allowed us to perform in silico functional inference, including gene network analysis for protein function, protein subcellular localization and gene co-expression, and in silico enrichment analysis for TFs and cis-elements. Results indicated the existence of extensive transcriptional regulatory events, including activation of ABA-responsive genes and ABF-dependent signaling. We were then able to establish that a core ABA-signaling pathway involving PP2C-SnRK2-ABF components was induced in stressed Q. suber roots, identifying a key mechanism in this species’ response to drought. PMID:26793200
Transcription regulation by the Mediator complex.

PubMed

Soutourina, Julie

2018-04-01

Alterations in the regulation of gene expression are frequently associated with developmental diseases or cancer. Transcription activation is a key phenomenon in the regulation of gene expression. In all eukaryotes, mediator of RNA polymerase II transcription (Mediator), a large complex with modular organization, is generally required for transcription by RNA polymerase II, and it regulates various steps of this process. The main function of Mediator is to transduce signals from the transcription activators bound to enhancer regions to the transcription machinery, which is assembled at promoters as the preinitiation complex (PIC) to control transcription initiation. Recent functional studies of Mediator with the use of structural biology approaches and functional genomics have revealed new insights into Mediator activity and its regulation during transcription initiation, including how Mediator is recruited to transcription regulatory regions and how it interacts and cooperates with PIC components to assist in PIC assembly. Novel roles of Mediator in the control of gene expression have also been revealed by showing its connection to the nuclear pore and linking Mediator to the regulation of gene positioning in the nuclear space. Clear links between Mediator subunits and disease have also encouraged studies to explore targeting of this complex as a potential therapeutic approach in cancer and fungal infections.
Hairy Root Transformation Using Agrobacterium rhizogenes as a Tool for Exploring Cell Type-Specific Gene Expression and Function Using Tomato as a Model

PubMed Central

Ron, Mily; Kajala, Kaisa; Pauluzzi, Germain; Wang, Dongxue; Reynoso, Mauricio A.; Zumstein, Kristina; Garcha, Jasmine; Winte, Sonja; Masson, Helen; Inagaki, Soichi; Federici, Fernán; Sinha, Neelima; Deal, Roger B.; Bailey-Serres, Julia; Brady, Siobhan M.

2014-01-01

Agrobacterium rhizogenes (or Rhizobium rhizogenes) is able to transform plant genomes and induce the production of hairy roots. We describe the use of A. rhizogenes in tomato (Solanum spp.) to rapidly assess gene expression and function. Gene expression of reporters is indistinguishable in plants transformed by Agrobacterium tumefaciens as compared with A. rhizogenes. A root cell type- and tissue-specific promoter resource has been generated for domesticated and wild tomato (Solanum lycopersicum and Solanum pennellii, respectively) using these approaches. Imaging of tomato roots using A. rhizogenes coupled with laser scanning confocal microscopy is facilitated by the use of a membrane-tagged protein fused to a red fluorescent protein marker present in binary vectors. Tomato-optimized isolation of nuclei tagged in specific cell types and translating ribosome affinity purification binary vectors were generated and used to monitor associated messenger RNA abundance or chromatin modification. Finally, transcriptional reporters, translational reporters, and clustered regularly interspaced short palindromic repeats-associated nuclease9 genome editing demonstrate that SHORT-ROOT and SCARECROW gene function is conserved between Arabidopsis (Arabidopsis thaliana) and tomato. PMID:24868032
Functional autonomy of distant-acting human enhancers

DOE Office of Scientific and Technical Information (OSTI.GOV)

Visel, Axel; Akiyama, Jennifer A.; Shoukry, Malak

2009-02-19

Many human genes are associated with dispersed arrays of transcriptional enhancers that regulate their expression in time and space. Studies in invertebrate model systems have suggested that these elements function as discrete and independent regulatory units, but the in vivo combinatorial properties of vertebrate enhancers remain poorly understood. To explore the modularity and regulatory autonomy of human developmental enhancers, we experimentally concatenated up to four enhancers from different genes and used a transgenic mouse assay to compare the in vivo activity of these compound elements with that of the single modules. In all of the six different combinations of elements tested, the reporter gene activity patterns were additive without signs of interference between the individual modules, indicating that regulatory specificity was maintained despite the presence of closely-positioned heterologous enhancers. Even in cases where two elements drove expression in close anatomical proximity, such as within neighboring subregions of the developing limb bud, the compound patterns did not show signs of cross-inhibition between individual elements or novel expression sites. These data indicate that human developmental enhancers are highly modular and functionally autonomous and suggest that genomic enhancer shuffling may have contributed to the evolution of complex gene expression patterns in vertebrates.
Evidence for a Saponin Biosynthesis Pathway in the Body Wall of the Commercially Significant Sea Cucumber Holothuria scabra.

PubMed

Mitu, Shahida Akter; Bose, Utpal; Suwansa-Ard, Saowaros; Turner, Luke H; Zhao, Min; Elizur, Abigail; Ogbourne, Steven M; Shaw, Paul Nicholas; Cummins, Scott F

2017-11-07

The sea cucumber (phylum Echinodermata) body wall is the first line of defense and is well known for its production of secondary metabolites, including vitamins and triterpenoid glycoside saponins that have important ecological functions and potential benefits to human health. The genes involved in the various biosynthetic pathways are unknown. To gain insight into these pathways in an echinoderm, we performed a comparative transcriptome analysis and functional annotation of the body wall and the radial nerve of the sea cucumber Holothuria scabra, to define genes associated with body wall metabolic functioning and secondary metabolite biosynthesis. We show that genes related to signal transduction mechanisms were more highly represented in the H. scabra body wall, including genes encoding enzymes involved in energy production. Eight of the core triterpenoid biosynthesis enzymes were found; however, the identity of the saponin-specific biosynthetic pathway enzymes remains unknown. We confirm the body wall release of at least three different triterpenoid saponins using solid phase extraction followed by ultra-high-pressure liquid chromatography-quadrupole time of flight-mass spectrometry. The resource we have established will help to guide future research to explore secondary metabolite biosynthesis in the sea cucumber.
Genome Engineering of the 2,3-Butanediol Biosynthetic Pathway for Tight Regulation in Cyanobacteria.

PubMed

Nozzi, Nicole E; Atsumi, Shota

2015-11-20

Cyanobacteria have gained popularity among the metabolic engineering community as a tractable photosynthetic host for renewable chemical production. However, though a number of successfully engineered production systems have been reported, long-term genetic stability remains an issue for cyanobacterial systems. The genetic engineering toolbox for cyanobacteria is largely lacking inducible systems for expression control. The characterization of tight regulation systems for use in cyanobacteria may help to alleviate this problem. In this work we explore the function of the IPTG inducible promoter P(L)lacO1 in the model cyanobacterium Synechococcus elongatus PCC 7942 as well as the effect of gene order within an operon on pathway expression. According to our experiments, P(L)lacO1 functions well as an inducible promoter in S. elongatus. Additionally, we found that gene order within an operon can strongly influence control of expression of each gene.
Genetic and Functional Dissection of HTRA1 and LOC387715 in Age-Related Macular Degeneration

PubMed Central

Zeng, Jiexi; Lu, Fang; Sun, Xufang; Zhao, Chao; Wang, Kevin; Davey, Lisa; Chen, Haoyu; London, Nyall; Muramatsu, Daisuke; Salasar, Francesca; Carmona, Ruben; Kasuga, Daniel; Wang, Xiaolei; Bedell, Matthew; Dixie, Manjuxia; Zhao, Peiquan; Yang, Ruifu; Gibbs, Daniel; Liu, Xiaoqi; Li, Yan; Li, Cai; Li, Yuanfeng; Campochiaro, Betsy; Constantine, Ryan; Zack, Donald J.; Campochiaro, Peter; Fu, Yinbin; Li, Dean Y.; Katsanis, Nicholas; Zhang, Kang

2010-01-01

A common haplotype on 10q26 influences the risk of age-related macular degeneration (AMD) and encompasses two genes, LOC387715 and HTRA1. Recent data have suggested that loss of LOC387715, mediated by an insertion/deletion (in/del) that destabilizes its message, is causally related with the disorder. Here we show that loss of LOC387715 is insufficient to explain AMD susceptibility, since a nonsense mutation (R38X) in this gene that leads to loss of its message resides in a protective haplotype. At the same time, the common disease haplotype tagged by the in/del and rs11200638 has an effect on the transcriptional upregulation of the adjacent gene, HTRA1. These data implicate increased HTRA1 expression in the pathogenesis of AMD and highlight the importance of exploring multiple functional consequences of alleles in haplotypes that confer susceptibility to complex traits. PMID:20140183
A functional genomics screen in planarians reveals regulators of whole-brain regeneration

PubMed Central

Roberts-Galbraith, Rachel H; Brubacher, John L; Newmark, Phillip A

2016-01-01

Planarians regenerate all body parts after injury, including the central nervous system (CNS). We capitalized on this distinctive trait and completed a gene expression-guided functional screen to identify factors that regulate diverse aspects of neural regeneration in Schmidtea mediterranea. Our screen revealed molecules that influence neural cell fates, support the formation of a major connective hub, and promote reestablishment of chemosensory behavior. We also identified genes that encode signaling molecules with roles in head regeneration, including some that are produced in a previously uncharacterized parenchymal population of cells. Finally, we explored genes downregulated during planarian regeneration and characterized, for the first time, glial cells in the planarian CNS that respond to injury by repressing several transcripts. Collectively, our studies revealed diverse molecules and cell types that underlie an animal’s ability to regenerate its brain. DOI: http://dx.doi.org/10.7554/eLife.17002.001 PMID:27612384
Avian genomics lends insights into endocrine function in birds.

PubMed

Mello, C V; Lovell, P V

2018-01-15

The genomics era has brought along the completed sequencing of a large number of bird genomes that cover a broad range of the avian phylogenetic tree (>30 orders), leading to major novel insights into avian biology and evolution. Among recent findings, the discovery that birds lack a large number of protein coding genes that are organized in highly conserved syntenic clusters in other vertebrates is very intriguing, given the physiological importance of many of these genes. A considerable number of them play prominent endocrine roles, suggesting that birds evolved compensatory genetic or physiological mechanisms that allowed them to survive and thrive in spite of these losses. While further studies are needed to establish the exact extent of avian gene losses, these findings point to birds as potentially highly relevant model organisms for exploring the genetic basis and possible therapeutic approaches for a wide range of endocrine functions and disorders. Copyright © 2017 Elsevier Inc. All rights reserved.
The search for evolutionary developmental origins of aging in zebrafish: a novel intersection of developmental and senescence biology in the zebrafish model system.

PubMed

Kishi, Shuji

2011-09-01

Senescence may be considered the antithesis of early development, yet there may be factors and mechanisms in common between these two phenomena during the process of aging. We investigated whether any relationship exists between the regulatory mechanisms that function in early development and in senescence using the zebrafish (Danio rerio), a small freshwater fish and a useful model animal for genetic studies. We conducted experiments to isolate zebrafish mutants expressing an apparent senescence phenotype during embryogenesis (embryonic senescence). Some of the genes we thereby identified had already been associated with cellular senescence and chronological aging in other organisms, but many had not yet been linked to these processes. Complete loss-of-function of developmentally essential genes induces embryonic (or larval) lethality, whereas their partial loss-of-function (i.e., decrease-of-function by heterozygote or hypomorphic mutations) seems to remain sufficient to go through the early developmental process because of its adaptive plasticity or rather heterozygote advantage. However, in some cases, such partial loss-of-function of genes compromises normal homeostasis due to haploinsufficiency later in adult life, which brings many environmental stress challenges. By contrast, any heterozygote-advantageous genes might gain a certain benefit(s) (much more fitness) by such partial loss-of-function later in life. Physiological senescence may evolutionarily arise from both genetic and epigenetic drifts as well as from losing adaptive developmental plasticity in the face of stress signals from the external environment that interacts with functions of multiple genes rather than effects of only a single gene mutation or defect. Previously uncharacterized developmental genes may thus mediate the aging process and play a pivotal role in senescence. Moreover, unexpected senescence-related genes might also be involved in the early developmental process and regulation. We wish to ascertain whether we can identify such genes promptly in a comprehensive manner. The ease of manipulation using the zebrafish system allows us to conduct an exhaustive exploration of novel genes and small molecular compounds that can be linked to the senescence phenotype and thereby facilitates searching for the evolutionary and developmental origins of aging in vertebrates. Copyright © 2011 Wiley-Liss, Inc.
The Pathway Coexpression Network: Revealing pathway relationships

PubMed Central

Tanzi, Rudolph E.

2018-01-01

A goal of genomics is to understand the relationships between biological processes. Pathways contribute to functional interplay within biological processes through complex but poorly understood interactions. However, limited functional references for global pathway relationships exist. Pathways from databases such as KEGG and Reactome provide discrete annotations of biological processes. Their relationships are currently either inferred from gene set enrichment within specific experiments, or by simple overlap, linking pathway annotations that have genes in common. Here, we provide a unifying interpretation of functional interaction between pathways by systematically quantifying coexpression between 1,330 canonical pathways from the Molecular Signatures Database (MSigDB) to establish the Pathway Coexpression Network (PCxN). We estimated the correlation between canonical pathways valid in a broad context using a curated collection of 3,207 microarrays from 72 normal human tissues. PCxN accounts for shared genes between annotations to estimate significant correlations between pathways with related functions rather than with similar annotations. We demonstrate that PCxN provides novel insight into mechanisms of complex diseases using an Alzheimer’s Disease (AD) case study. PCxN retrieved pathways significantly correlated with an expert curated AD gene list. These pathways have known associations with AD and were significantly enriched for genes independently associated with AD. As a further step, we show how PCxN complements the results of gene set enrichment methods by revealing relationships between enriched pathways, and by identifying additional highly correlated pathways. PCxN revealed that correlated pathways from an AD expression profiling study include functional clusters involved in cell adhesion and oxidative stress. PCxN provides expanded connections to pathways from the extracellular matrix. 
PCxN provides a powerful new framework for interrogation of global pathway relationships. Comprehensive exploration of PCxN can be performed at http://pcxn.org/. PMID:29554099
Identification of potential crucial genes associated with steroid-induced necrosis of femoral head based on gene expression profile.

PubMed

Lin, Zhe; Lin, Yongsheng

2017-09-05

The aim of this study was to explore potential crucial genes associated with steroid-induced necrosis of the femoral head (SINFH) and to provide valid biological information for further investigation of SINFH. The gene expression profile GSE26316, generated from 3 SINFH rat samples and 3 normal rat samples, was downloaded from the Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) were identified using the LIMMA package. After functional enrichment analyses of DEGs, protein-protein interaction (PPI) network and sub-PPI network analyses were conducted based on the STRING database and Cytoscape. In total, 59 up-regulated DEGs and 156 downregulated DEGs were identified. The up-regulated DEGs were mainly involved in functions related to immunity (e.g. Fcer1A and Il7R), and the downregulated DEGs were mainly enriched in muscle system process (e.g. Tnni2, Mylpf and Myl1). The PPI network of DEGs consisted of 123 nodes and 300 interactions. Tnni2, Mylpf, and Myl1 were the top 3 genes based on both subgraph centrality and degree centrality evaluation. These three genes interacted with each other in the network. Furthermore, the significant network module was composed of 22 downregulated genes (e.g. Tnni2, Mylpf and Myl1). These genes were mainly enriched in functions such as muscle system process. The DEGs related to the regulation of immune system process (e.g. Fcer1A and Il7R) and the DEGs correlated with muscle system process (e.g. Tnni2, Mylpf and Myl1) may be closely associated with the progress of SINFH, which still needs to be confirmed by experiments. Copyright © 2017 Elsevier B.V. All rights reserved.
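The degree-centrality ranking mentioned in the record above can be illustrated with a minimal sketch. It assumes the PPI network is available as a plain list of undirected edges; the function names `degree_centrality` and `top_nodes` and the toy node labels are hypothetical and are not taken from the study, STRING, or Cytoscape. Subgraph centrality is not covered here.

```python
from collections import defaultdict


def degree_centrality(edges):
    """Return degree / (n - 1) for every node of an undirected network."""
    neighbours = defaultdict(set)
    for a, b in edges:
        if a == b:
            continue  # ignore self-interactions
        neighbours[a].add(b)
        neighbours[b].add(a)
    n = len(neighbours)
    if n < 2:
        return {node: 0.0 for node in neighbours}
    return {node: len(adj) / (n - 1) for node, adj in neighbours.items()}


def top_nodes(edges, k=3):
    """Return the k nodes with the highest degree centrality (ties by name)."""
    scores = degree_centrality(edges)
    return sorted(scores, key=lambda node: (-scores[node], node))[:k]


# Toy, made-up network used only to exercise the functions.
toy_edges = [("g1", "g2"), ("g1", "g3"), ("g2", "g3"), ("g3", "g4")]
print(top_nodes(toy_edges, k=2))
```

Normalising by n - 1 keeps scores comparable between networks of different size; the ranking itself depends only on the raw degree.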
Use of transcriptome sequencing to understand the pistillate flowering in hickory (Carya cathayensis Sarg.).

PubMed

Huang, You-Jun; Liu, Li-Li; Huang, Jian-Qin; Wang, Zheng-Jia; Chen, Fang-Fang; Zhang, Qi-Xiang; Zheng, Bing-Song; Chen, Ming

2013-10-10

Unlike herbaceous plants, woody plants undergo a long vegetative stage to achieve floral transition. They then turn into seasonal plants, flowering annually. In this study, a preliminary model of gene regulation for seasonal pistillate flowering in hickory (Carya cathayensis) was proposed. The genome-wide dynamic transcriptome was characterized via the joint approach of RNA sequencing and microarray analysis. Differential transcript abundance analysis uncovered the dynamic transcript abundance patterns of flowering-correlated genes and their major functions based on Gene Ontology (GO) analysis. To explore the pistillate flowering mechanism in hickory, a comprehensive flowering gene regulatory network based on Arabidopsis thaliana was constructed by additional literature mining. A total of 114 putative flowering or floral genes including 31 with differential transcript abundance were identified in hickory. The locations, functions and dynamic transcript abundances were analyzed in the gene regulatory networks. A genome-wide co-expression network for the putative flowering or floral genes shows three flowering regulatory modules corresponding to response to light abiotic stimulus, cold stress, and reproductive development process, respectively. In total, 27 potential flowering or floral genes were recruited, which are meaningful for better understanding the hickory-specific seasonal flowering mechanism. The flowering event of the pistillate flower bud in hickory is triggered by several pathways synchronously, including the photoperiod, autonomous, vernalization, gibberellin, and sucrose pathways. In total, 27 potential flowering or floral genes were recruited from the genome-wide co-expression network function module analysis. Moreover, the analysis provides a potential FLC-like gene based vernalization pathway and an 'AC' model for pistillate flower development in hickory. This work provides an available framework for pistillate flower development in hickory, which is significant for insight into regulation of flowering and floral development of woody plants.
Use of transcriptome sequencing to understand the pistillate flowering in hickory (Carya cathayensis Sarg.)

PubMed Central

2013-01-01

Background: Unlike herbaceous plants, woody plants undergo a long vegetative stage to achieve floral transition. They then turn into seasonal plants, flowering annually. In this study, a preliminary model of gene regulation for seasonal pistillate flowering in hickory (Carya cathayensis) was proposed. The genome-wide dynamic transcriptome was characterized via the joint approach of RNA sequencing and microarray analysis. Results: Differential transcript abundance analysis uncovered the dynamic transcript abundance patterns of flowering-correlated genes and their major functions based on Gene Ontology (GO) analysis. To explore the pistillate flowering mechanism in hickory, a comprehensive flowering gene regulatory network based on Arabidopsis thaliana was constructed by additional literature mining. A total of 114 putative flowering or floral genes including 31 with differential transcript abundance were identified in hickory. The locations, functions and dynamic transcript abundances were analyzed in the gene regulatory networks. A genome-wide co-expression network for the putative flowering or floral genes shows three flowering regulatory modules corresponding to response to light abiotic stimulus, cold stress, and reproductive development process, respectively. In total, 27 potential flowering or floral genes were recruited, which are meaningful for better understanding the hickory-specific seasonal flowering mechanism. Conclusions: The flowering event of the pistillate flower bud in hickory is triggered by several pathways synchronously, including the photoperiod, autonomous, vernalization, gibberellin, and sucrose pathways. In total, 27 potential flowering or floral genes were recruited from the genome-wide co-expression network function module analysis. Moreover, the analysis provides a potential FLC-like gene based vernalization pathway and an 'AC' model for pistillate flower development in hickory. This work provides an available framework for pistillate flower development in hickory, which is significant for insight into regulation of flowering and floral development of woody plants. PMID:24106755

Identification of differentially expressed genes from Trichoderma harzianum during growth on cell wall of Fusarium solani as a tool for biotechnological application

PubMed Central

2013-01-01

Background: The species of T. harzianum are well known for their biocontrol activity against many plant pathogens. However, there is a lack of studies concerning its use as a biological control agent against F. solani, a pathogen involved in several crop diseases. In this study, we have used subtractive library hybridization (SSH) and quantitative real-time PCR (RT-qPCR) techniques in order to explore changes in T. harzianum gene expression during growth on cell wall of F. solani (FSCW) or glucose. RT-qPCR was also used to examine the regulation of 18 genes, potentially involved in biocontrol, during confrontation between T. harzianum and F. solani. Results: Data obtained from two subtractive libraries were compared after annotation using the Blast2GO suite. A total of 417 and 78 readable EST sequences were annotated in the FSCW and glucose libraries, respectively. Functional annotation of these genes identified diverse biological processes and molecular functions required during T. harzianum growth on FSCW or glucose. We identified various genes of biotechnological value encoding proteins with functions such as transport, hydrolytic activity, adherence, appressorium development and pathogenesis. Fifteen genes were up-regulated and sixteen were down-regulated at at least one time point during growth of T. harzianum in FSCW. During the confrontation assay most of the genes were up-regulated, mainly after contact, when the interaction had been established. Conclusions: This study demonstrates that T. harzianum expressed different genes when grown on FSCW compared to glucose. It provides insights into the mechanisms of gene expression involved in mycoparasitism of T. harzianum against F. solani. The identification and evaluation of these genes may contribute to the development of an efficient biological control agent. PMID:23497274
Identification of differentially expressed genes from Trichoderma harzianum during growth on cell wall of Fusarium solani as a tool for biotechnological application.

PubMed

Vieira, Pabline Marinho; Coelho, Alexandre Siqueira Guedes; Steindorff, Andrei Stecca; de Siqueira, Saulo José Linhares; Silva, Roberto do Nascimento; Ulhoa, Cirano José

2013-03-15

The species of T. harzianum are well known for their biocontrol activity against many plant pathogens. However, there is a lack of studies concerning its use as a biological control agent against F. solani, a pathogen involved in several crop diseases. In this study, we have used subtractive library hybridization (SSH) and quantitative real-time PCR (RT-qPCR) techniques in order to explore changes in T. harzianum gene expression during growth on cell wall of F. solani (FSCW) or glucose. RT-qPCR was also used to examine the regulation of 18 genes, potentially involved in biocontrol, during confrontation between T. harzianum and F. solani. Data obtained from two subtractive libraries were compared after annotation using the Blast2GO suite. A total of 417 and 78 readable EST sequences were annotated in the FSCW and glucose libraries, respectively. Functional annotation of these genes identified diverse biological processes and molecular functions required during T. harzianum growth on FSCW or glucose. We identified various genes of biotechnological value encoding proteins with functions such as transport, hydrolytic activity, adherence, appressorium development and pathogenesis. Fifteen genes were up-regulated and sixteen were down-regulated at at least one time point during growth of T. harzianum in FSCW. During the confrontation assay most of the genes were up-regulated, mainly after contact, when the interaction had been established. This study demonstrates that T. harzianum expressed different genes when grown on FSCW compared to glucose. It provides insights into the mechanisms of gene expression involved in mycoparasitism of T. harzianum against F. solani. The identification and evaluation of these genes may contribute to the development of an efficient biological control agent.
Materials Informatics: The Materials "Gene" and Big Data

NASA Astrophysics Data System (ADS)

Rajan, Krishna

2015-07-01

Materials informatics provides the foundations for a new paradigm of materials discovery. It shifts our emphasis from one of solely searching among large volumes of data that may be generated by experiment or computation to one of targeted materials discovery via high-throughput identification of the key factors (i.e., “genes”) and via showing how these factors can be quantitatively integrated by statistical learning methods into design rules (i.e., “gene sequencing”) governing targeted materials functionality. However, a critical challenge in discovering these materials genes is the difficulty in unraveling the complexity of the data associated with numerous factors including noise, uncertainty, and the complex diversity of data that one needs to consider (i.e., Big Data). In this article, we explore one aspect of materials informatics, namely how one can efficiently explore for new knowledge in regimes of structure-property space, especially when no reasonable selection pathways based on theory or clear trends in observations exist among an almost infinite set of possibilities.
A comprehensive and quantitative exploration of thousands of viral genomes

PubMed Central

Mahmoudabadi, Gita

2018-01-01

The complete assembly of viral genomes from metagenomic datasets (short genomic sequences gathered from environmental samples) has proven to be challenging, so there are significant blind spots when we view viral genomes through the lens of metagenomics. One approach to overcoming this problem is to leverage the thousands of complete viral genomes that are publicly available. Here we describe our efforts to assemble a comprehensive resource that provides a quantitative snapshot of viral genomic trends – such as gene density, noncoding percentage, and abundances of functional gene categories – across thousands of viral genomes. We have also developed a coarse-grained method for visualizing viral genome organization for hundreds of genomes at once, and have explored the extent of the overlap between bacterial and bacteriophage gene pools. Existing viral classification systems were developed prior to the sequencing era, so we present our analysis in a way that allows us to assess the utility of the different classification systems for capturing genomic trends. PMID:29624169
A comprehensive and quantitative exploration of thousands of viral genomes.

PubMed

Mahmoudabadi, Gita; Phillips, Rob

2018-04-19

The complete assembly of viral genomes from metagenomic datasets (short genomic sequences gathered from environmental samples) has proven to be challenging, so there are significant blind spots when we view viral genomes through the lens of metagenomics. One approach to overcoming this problem is to leverage the thousands of complete viral genomes that are publicly available. Here we describe our efforts to assemble a comprehensive resource that provides a quantitative snapshot of viral genomic trends - such as gene density, noncoding percentage, and abundances of functional gene categories - across thousands of viral genomes. We have also developed a coarse-grained method for visualizing viral genome organization for hundreds of genomes at once, and have explored the extent of the overlap between bacterial and bacteriophage gene pools. Existing viral classification systems were developed prior to the sequencing era, so we present our analysis in a way that allows us to assess the utility of the different classification systems for capturing genomic trends. © 2018, Mahmoudabadi et al.
PanFP: Pangenome-based functional profiles for microbial communities

DOE PAGES

Jun, Se-Ran; Hauser, Loren John; Schadt, Christopher Warren; ...

2015-09-26

For decades there has been increasing interest in understanding the relationships between microbial communities and ecosystem functions. Current DNA sequencing technologies allow for the exploration of microbial communities in two principal ways: targeted rRNA gene surveys and shotgun metagenomics. For large study designs, it is often still prohibitively expensive to sequence metagenomes at both the breadth and depth necessary to statistically capture the true functional diversity of a community. Although rRNA gene surveys provide no direct evidence of function, they do provide a reasonable estimation of microbial diversity, while being a very cost-effective way to screen samples of interest for later shotgun metagenomic analyses. However, there is a great deal of 16S rRNA gene survey data currently available from diverse environments, and thus a need for tools to infer functional composition of environmental samples based on 16S rRNA gene survey data. As a result, we present a computational method called pangenome-based functional profiles (PanFP), which infers functional profiles of microbial communities from 16S rRNA gene survey data for Bacteria and Archaea. PanFP is based on pangenome reconstruction of a 16S rRNA gene operational taxonomic unit (OTU) from known genes and genomes pooled from the OTU's taxonomic lineage. From this lineage, we derive an OTU functional profile by weighting a pangenome's functional profile with the OTU's abundance observed in a given sample. We validated our method by comparing PanFP to the functional profiles obtained from the direct shotgun metagenomic measurement of 65 diverse communities via Spearman correlation coefficients. These correlations improved with increasing sequencing depth, within the range of 0.8-0.9 for the most deeply sequenced Human Microbiome Project mock community samples. PanFP is very similar in performance to another recently released tool, PICRUSt, for almost all of the survey data analysed here. However, our method is unique in that any OTU building method can be used, as opposed to being limited to closed-reference OTU picking strategies against specific reference sequence databases. In conclusion, we developed an automated computational method, which derives an inferred functional profile based on the 16S rRNA gene surveys of microbial communities. The inferred functional profile provides a cost-effective way to study complex ecosystems through predicted comparative functional metagenomes and metadata analysis. All PanFP source code and additional documentation are freely available online at GitHub.
PanFP: pangenome-based functional profiles for microbial communities.

PubMed

Jun, Se-Ran; Robeson, Michael S; Hauser, Loren J; Schadt, Christopher W; Gorin, Andrey A

2015-09-26

For decades there has been increasing interest in understanding the relationships between microbial communities and ecosystem functions. Current DNA sequencing technologies allow for the exploration of microbial communities in two principal ways: targeted rRNA gene surveys and shotgun metagenomics. For large study designs, it is often still prohibitively expensive to sequence metagenomes at both the breadth and depth necessary to statistically capture the true functional diversity of a community. Although rRNA gene surveys provide no direct evidence of function, they do provide a reasonable estimation of microbial diversity, while being a very cost-effective way to screen samples of interest for later shotgun metagenomic analyses. However, there is a great deal of 16S rRNA gene survey data currently available from diverse environments, and thus a need for tools to infer functional composition of environmental samples based on 16S rRNA gene survey data. We present a computational method called pangenome-based functional profiles (PanFP), which infers functional profiles of microbial communities from 16S rRNA gene survey data for Bacteria and Archaea. PanFP is based on pangenome reconstruction of a 16S rRNA gene operational taxonomic unit (OTU) from known genes and genomes pooled from the OTU's taxonomic lineage. From this lineage, we derive an OTU functional profile by weighting a pangenome's functional profile with the OTU's abundance observed in a given sample. We validated our method by comparing PanFP to the functional profiles obtained from the direct shotgun metagenomic measurement of 65 diverse communities via Spearman correlation coefficients. These correlations improved with increasing sequencing depth, within the range of 0.8-0.9 for the most deeply sequenced Human Microbiome Project mock community samples. PanFP is very similar in performance to another recently released tool, PICRUSt, for almost all of the survey data analysed here. However, our method is unique in that any OTU building method can be used, as opposed to being limited to closed-reference OTU picking strategies against specific reference sequence databases. We developed an automated computational method, which derives an inferred functional profile based on the 16S rRNA gene surveys of microbial communities. The inferred functional profile provides a cost-effective way to study complex ecosystems through predicted comparative functional metagenomes and metadata analysis. All PanFP source code and additional documentation are freely available online at GitHub ( https://github.com/srjun/PanFP ).
Functions of MicroRNAs in Cardiovascular Biology and Disease

PubMed Central

Hata, Akiko

2015-01-01

In 1993, lin-4 was discovered as a critical modulator of temporal development in Caenorhabditis elegans and, most notably, as the first in the class of small, single-stranded noncoding RNAs now defined as microRNAs (miRNAs). Another eight years elapsed before miRNA expression was detected in mammalian cells. Since then, explosive advancements in the field of miRNA biology have elucidated the basic mechanism of miRNA biogenesis, regulation, and gene-regulatory function. The discovery of this new class of small RNAs has augmented the complexity of gene-regulatory programs as well as the understanding of developmental and pathological processes in the cardiovascular system. Indeed, the contributions of miRNAs in cardiovascular development and function have been widely explored, revealing the extensive role of these small regulatory RNAs in cardiovascular physiology. PMID:23157557
The evolutionary fate of the chloroplast and nuclear rps16 genes as revealed through the sequencing and comparative analyses of four novel legume chloroplast genomes from Lupinus

PubMed Central

Keller, J.; Rousseau-Gueutin, M.; Martin, G.E.; Morice, J.; Boutte, J.; Coissac, E.; Ourari, M.; Aïnouche, M.; Salmon, A.; Cabello-Hurtado, F.

2017-01-01

The Fabaceae family is considered as a model system for understanding chloroplast genome evolution due to the presence of extensive structural rearrangements, gene losses and localized hypermutable regions. Here, we provide sequences of four chloroplast genomes from the Lupinus genus, belonging to the underinvestigated Genistoid clade. Notably, we found in Lupinus species the functional loss of the essential rps16 gene, which was most likely replaced by the nuclear rps16 gene that encodes chloroplast and mitochondrion targeted RPS16 proteins. To study the evolutionary fate of the rps16 gene, we explored all available plant chloroplast, mitochondrial and nuclear genomes. Whereas no plant mitochondrial genomes carry an rps16 gene, many plants still have a functional nuclear and chloroplast rps16 gene. Ka/Ks ratios revealed that both chloroplast and nuclear rps16 copies were under purifying selection. However, due to the dual targeting of the nuclear rps16 gene product and the absence of a mitochondrial copy, the chloroplast gene may be lost. We also performed comparative analyses of lupine plastomes (SNPs, indels and repeat elements), identified the most variable regions and examined their phylogenetic utility. The markers identified here will help to reveal the evolutionary history of lupines, Genistoids and closely related clades. PMID:28338826
Genome-wide evolutionary characterization and expression analyses of WRKY family genes in Brachypodium distachyon.

PubMed

Wen, Feng; Zhu, Hong; Li, Peng; Jiang, Min; Mao, Wenqing; Ong, Chermaine; Chu, Zhaoqing

2014-06-01

Members of plant WRKY gene family are ancient transcription factors that function in plant growth and development and respond to biotic and abiotic stresses. In our present study, we have investigated WRKY family genes in Brachypodium distachyon, a new model plant of family Poaceae. We identified a total of 86 WRKY genes from B. distachyon and explored their chromosomal distribution and evolution, domain alignment, promoter cis-elements, and expression profiles. Combining the analysis of phylogenetic tree of BdWRKY genes and the result of expression profiling, results showed that most of clustered gene pairs had higher similarities in the WRKY domain, suggesting that they might be functionally redundant. Neighbour-joining analysis of 301 WRKY domains from Oryza sativa, Arabidopsis thaliana, and B. distachyon suggested that BdWRKY domains are evolutionarily more closely related to O. sativa WRKY domains than those of A. thaliana. Moreover, tissue-specific expression profile of BdWRKY genes and their responses to phytohormones and several biotic or abiotic stresses were analysed by quantitative real-time PCR. The results showed that the expression of BdWRKY genes was rapidly regulated by stresses and phytohormones, and there was a strong correlation between promoter cis-elements and the phytohormones-induced BdWRKY gene expression. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Somatic Mutation Patterns in Hemizygous Genomic Regions Unveil Purifying Selection during Tumor Evolution

PubMed Central

Basu, Swaraj; Larsson, Erik

2016-01-01

Identification of cancer driver genes using somatic mutation patterns indicative of positive selection has become a major goal in cancer genomics. However, cancer cells additionally depend on a large number of genes involved in basic cellular processes. While such genes should in theory be subject to strong purifying (negative) selection against damaging somatic mutations, these patterns have been elusive and purifying selection remains inadequately explored in cancer. Here, we hypothesized that purifying selection should be evident in hemizygous genomic regions, where damaging mutations cannot be compensated for by healthy alleles. Using a 7,781-sample pan-cancer dataset, we first confirmed this in POLR2A, an essential gene where hemizygous deletions are known to confer elevated sensitivity to pharmacological suppression. We next used this principle to identify several genes and pathways that show patterns indicative of purifying selection to avoid deleterious mutations. These include the POLR2A interacting protein INTS10 as well as genes involved in mRNA splicing, nonsense-mediated mRNA decay and other RNA processing pathways. Many of these genes belong to large protein complexes, and strong overlaps were observed with recent functional screens for gene essentiality in human cells. Our analysis supports that purifying selection acts to preserve the remaining function of many hemizygously deleted essential genes in tumors, indicating vulnerabilities that might be exploited by future therapeutic strategies. PMID:28027311
Discovering genetic variants in Crohn's disease by exploring genomic regions enriched of weak association signals.

PubMed

D'Addabbo, Annarita; Palmieri, Orazio; Maglietta, Rosalia; Latiano, Anna; Mukherjee, Sayan; Annese, Vito; Ancona, Nicola

2011-08-01

A meta-analysis has re-analysed previous genome-wide association scans, definitively confirming eleven genes and further identifying 21 new loci. However, the identified genes/loci still explain only a minority of the genetic predisposition to Crohn's disease. We aimed to identify genes weakly involved in disease predisposition by analysing chromosomal regions enriched in single nucleotide polymorphisms with modest statistical association. We utilized the WTCCC data set, evaluating 1748 CD cases and 2938 controls. The identification of candidate genes/loci was performed by a two-step procedure: first, chromosomal regions enriched in weak association signals were localized; subsequently, weak signals clustered in gene regions were identified. Statistical significance was assessed by nonparametric permutation tests. The cytoband enrichment analysis highlighted 44 regions (P≤0.05) enriched in single nucleotide polymorphisms significantly associated with the trait, including 23 out of 31 previously confirmed and replicated genes. Importantly, we highlight a further 20 novel chromosomal regions carrying approximately one hundred genes/loci with modest association. Amongst these we find compelling functional candidate genes such as MAPT, GRB2 and CREM, LCT, and IL12RB2. Our study suggests a different statistical perspective to discover genes weakly associated with a given trait, although further confirmatory functional studies are needed. Copyright © 2011 Editrice Gastroenterologica Italiana S.r.l. All rights reserved.
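The permutation-based significance assessment mentioned in this abstract can be illustrated with a small sketch. This is not the authors' pipeline: the function name, the 0.05 cut-off for a "weak signal", and the choice to permute SNP sets (rather than case/control labels) are all assumptions made for illustration.

```python
import random

def region_enrichment_pvalue(snp_pvalues, region_idx, weak_threshold=0.05,
                             n_perm=2000, seed=0):
    """Empirical p-value for the number of weak signals inside one region.

    snp_pvalues: per-SNP association p-values (hypothetical input).
    region_idx:  indices of the SNPs that fall in the region of interest.
    """
    rng = random.Random(seed)
    weak = [p <= weak_threshold for p in snp_pvalues]
    observed = sum(weak[i] for i in region_idx)
    k, n = len(region_idx), len(snp_pvalues)
    hits = 0
    for _ in range(n_perm):
        # Null model (assumption): a random SNP set of the same size.
        sample = rng.sample(range(n), k)
        if sum(weak[i] for i in sample) >= observed:
            hits += 1
    # Add-one correction keeps the estimate away from exactly zero.
    return (hits + 1) / (n_perm + 1)
```

A region is called enriched when the returned value falls below a chosen level such as 0.05; a real analysis would also need to account for linkage disequilibrium, which this sketch ignores.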
VH Replacement Footprint Analyzer-I, a Java-Based Computer Program for Analyses of Immunoglobulin Heavy Chain Genes and Potential VH Replacement Products in Human and Mouse

PubMed Central

Huang, Lin; Lange, Miles D.; Zhang, Zhixin

2014-01-01

VH replacement occurs through RAG-mediated secondary recombination between a rearranged VH gene and an upstream unrearranged VH gene. Due to the location of the cryptic recombination signal sequence (cRSS, TACTGTG) at the 3′ end of VH gene coding region, a short stretch of nucleotides from the previous rearranged VH gene can be retained in the newly formed VH–DH junction as a “footprint” of VH replacement. Such footprints can be used as markers to identify Ig heavy chain (IgH) genes potentially generated through VH replacement. To explore the contribution of VH replacement products to the antibody repertoire, we developed a Java-based computer program, VH replacement footprint analyzer-I (VHRFA-I), to analyze published or newly obtained IgH genes from human or mouse. The VHRFA-I program has multiple functional modules: it first uses the service provided by the IMGT/V-QUEST program to assign potential VH, DH, and JH germline genes; then, it searches for VH replacement footprint motifs within the VH–DH junction (N1) regions of IgH gene sequences to identify potential VH replacement products; it can also analyze the frequencies of VH replacement products in correlation with publications, keywords, or VH, DH, and JH gene usages, and mutation status; it can further analyze the amino acid usages encoded by the identified VH replacement footprints. In summary, this program provides a useful computational tool for exploring the biological significance of VH replacement products in human and mouse. PMID:24575092
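The footprint search step described here can be sketched in a few lines. This is only an illustration of the idea, not the VHRFA-I implementation (which is a Java program): the function names, the minimum motif length of 3, and the toy sequences used to exercise it are assumptions.

```python
CRSS = "TACTGTG"  # cryptic RSS heptamer quoted in the abstract

def footprint_motifs(vh_gene, min_len=3):
    """Candidate footprint motifs from one germline VH sequence.

    Takes the stretch downstream of the last cRSS and returns every
    substring of at least min_len nucleotides (min_len is an assumption).
    """
    pos = vh_gene.rfind(CRSS)
    if pos == -1:
        return set()
    tail = vh_gene[pos + len(CRSS):]
    return {tail[i:j]
            for i in range(len(tail))
            for j in range(i + min_len, len(tail) + 1)}

def find_footprints(n1_region, motifs):
    """Return sorted (offset, motif) pairs for motifs found in an N1 region."""
    hits = []
    for motif in motifs:
        start = n1_region.find(motif)
        while start != -1:
            hits.append((start, motif))
            start = n1_region.find(motif, start + 1)
    return sorted(hits)
```

In practice the motif set would be pooled over all germline VH genes of the species, and short motifs would be expected to match by chance, so hit frequencies need a background comparison before any biological reading.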
Molecular Phylogenetic and Expression Analysis of the Complete WRKY Transcription Factor Family in Maize

PubMed Central

Wei, Kai-Fa; Chen, Juan; Chen, Yan-Feng; Wu, Ling-Juan; Xie, Dao-Xin

2012-01-01

The WRKY transcription factors function in plant growth and development, and response to the biotic and abiotic stresses. Although many studies have focused on the functional identification of the WRKY transcription factors, much less is known about molecular phylogenetic and global expression analysis of the complete WRKY family in maize. In this study, we identified 136 WRKY proteins coded by 119 genes in the B73 inbred line from the complete genome and named them in an orderly manner. Then, a comprehensive phylogenetic analysis of five species was performed to explore the origin and evolutionary patterns of these WRKY genes, and the result showed that gene duplication is the major driving force for the origin of new groups and subgroups and functional divergence during evolution. Chromosomal location analysis of maize WRKY genes indicated that 20 gene clusters are distributed unevenly in the genome. Microarray-based expression analysis has revealed that 131 WRKY transcripts encoded by 116 genes may participate in the regulation of maize growth and development. Among them, 102 transcripts are stably expressed with a coefficient of variation (CV) value of <15%. The remaining 29 transcripts produced by 25 WRKY genes with the CV value of >15% are further analysed to discover new organ- or tissue-specific genes. In addition, microarray analyses of transcriptional responses to drought stress and fungal infection showed that maize WRKY proteins are involved in stress responses. All these results contribute to a deep probing into the roles of WRKY transcription factors in maize growth and development and stress tolerance. PMID:22279089
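The stable versus variable split in this abstract rests on the coefficient of variation, CV = 100 × standard deviation / mean. A minimal sketch follows; the function names are hypothetical, the sample standard deviation is an assumption (the abstract does not say which estimator was used), and a CV of exactly 15% is treated as variable because the abstract leaves that case open.

```python
from statistics import mean, stdev

def coefficient_of_variation(expression):
    """CV in percent across tissues or conditions for one transcript."""
    return 100.0 * stdev(expression) / mean(expression)

def classify_transcript(expression, threshold=15.0):
    """Label a transcript 'stable' (CV < threshold) or 'variable'."""
    cv = coefficient_of_variation(expression)
    return "stable" if cv < threshold else "variable"
```

Transcripts labelled variable under this rule are the ones the abstract carries forward as candidates for organ- or tissue-specific expression.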
Molecular phylogenetic and expression analysis of the complete WRKY transcription factor family in maize.

PubMed

Wei, Kai-Fa; Chen, Juan; Chen, Yan-Feng; Wu, Ling-Juan; Xie, Dao-Xin

2012-04-01

The WRKY transcription factors function in plant growth and development, and response to the biotic and abiotic stresses. Although many studies have focused on the functional identification of the WRKY transcription factors, much less is known about molecular phylogenetic and global expression analysis of the complete WRKY family in maize. In this study, we identified 136 WRKY proteins coded by 119 genes in the B73 inbred line from the complete genome and named them in an orderly manner. Then, a comprehensive phylogenetic analysis of five species was performed to explore the origin and evolutionary patterns of these WRKY genes, and the result showed that gene duplication is the major driving force for the origin of new groups and subgroups and functional divergence during evolution. Chromosomal location analysis of maize WRKY genes indicated that 20 gene clusters are distributed unevenly in the genome. Microarray-based expression analysis has revealed that 131 WRKY transcripts encoded by 116 genes may participate in the regulation of maize growth and development. Among them, 102 transcripts are stably expressed with a coefficient of variation (CV) value of <15%. The remaining 29 transcripts produced by 25 WRKY genes with the CV value of >15% are further analysed to discover new organ- or tissue-specific genes. In addition, microarray analyses of transcriptional responses to drought stress and fungal infection showed that maize WRKY proteins are involved in stress responses. All these results contribute to a deep probing into the roles of WRKY transcription factors in maize growth and development and stress tolerance.
Novel candidate genes of the PARK7 interactome as mediators of apoptosis and acetylation in multiple sclerosis: An in silico analysis.

PubMed

Vavougios, George D; Zarogiannis, Sotirios G; Krogfelt, Karen Angeliki; Gourgoulianis, Konstantinos; Mitsikostas, Dimos Dimitrios; Hadjigeorgiou, Georgios

2018-01-01

Currently, only 4 studies have explored the potential role of PARK7's dysregulation in MS pathophysiology, and no study has evaluated the potential role of the PARK7 interactome in MS. The aim of our study was to assess the differential expression of PARK7 mRNA in peripheral blood mononuclear cells (PBMCs) donated from MS patients versus healthy controls using data mining techniques. The PARK7 interactome data from the GDS3920 profile were scrutinized for differentially expressed genes (DEGs); Gene Enrichment Analysis (GEA) was used to detect significantly enriched biological functions. 27 differentially expressed genes in the MS dataset were detected; 12 of these (NDUFA4, UBA2, TDP2, NPM1, NDUFS3, SUMO1, PIAS2, KIAA0101, RBBP4, NONO, RBBP7 and HSPA4) are reported for the first time in MS. Stepwise Linear Discriminant Function Analysis constructed a predictive model (Wilks' λ = 0.176, χ² = 45.204, p = 1.5275e-10) with 2 variables (TDP2, RBBP4) that achieved 96.6% accuracy when discriminating between patients and controls. Gene Enrichment Analysis revealed that induction and regulation of programmed / intrinsic cell death represented the most salient Gene Ontology annotations. Cross-validation on systemic lupus erythematosus and ischemic stroke datasets revealed that these functions are unique to the MS dataset. Based on our results, novel potential target genes are revealed; these differentially expressed genes regulate epigenetic and apoptotic pathways that may further elucidate underlying mechanisms of autoreactivity in MS. Copyright © 2017 Elsevier B.V. All rights reserved.
ePlant and the 3D data display initiative: integrative systems biology on the world wide web.

PubMed

Fucile, Geoffrey; Di Biase, David; Nahal, Hardeep; La, Garon; Khodabandeh, Shokoufeh; Chen, Yani; Easley, Kante; Christendat, Dinesh; Kelley, Lawrence; Provart, Nicholas J

2011-01-10

Visualization tools for biological data are often limited in their ability to interactively integrate data at multiple scales. These computational tools are also typically limited by two-dimensional displays and programmatic implementations that require separate configurations for each of the user's computing devices and recompilation for functional expansion. Towards overcoming these limitations we have developed "ePlant" (http://bar.utoronto.ca/eplant) - a suite of open-source world wide web-based tools for the visualization of large-scale data sets from the model organism Arabidopsis thaliana. These tools display data spanning multiple biological scales on interactive three-dimensional models. Currently, ePlant consists of the following modules: a sequence conservation explorer that includes homology relationships and single nucleotide polymorphism data, a protein structure model explorer, a molecular interaction network explorer, a gene product subcellular localization explorer, and a gene expression pattern explorer. The ePlant's protein structure explorer module represents experimentally determined and theoretical structures covering >70% of the Arabidopsis proteome. The ePlant framework is accessed entirely through a web browser, and is therefore platform-independent. It can be applied to any model organism. To facilitate the development of three-dimensional displays of biological data on the world wide web we have established the "3D Data Display Initiative" (http://3ddi.org).
A Network of Chromatin Factors Is Regulating the Transition to Postembryonic Development in Caenorhabditis elegans

PubMed Central

Erdelyi, Peter; Wang, Xing; Suleski, Marina; Wicky, Chantal

2016-01-01

Mi2 proteins are evolutionarily conserved, ATP-dependent chromatin remodelers of the CHD family that play key roles in stem cell differentiation and reprogramming. In Caenorhabditis elegans, the let-418 gene encodes one of the two Mi2 homologs, which is part of at least two chromatin complexes, namely the Nucleosome Remodeling and histone Deacetylase (NuRD) complex and the MEC complex, and functions in larval development, vulval morphogenesis, lifespan regulation, and cell fate determination. To explore the mechanisms involved in the action of LET-418/Mi2, we performed a genome-wide RNA interference (RNAi) screen for suppressors of early larval arrest associated with let-418 mutations. We identified 29 suppressor genes, of which 24 encode chromatin regulators, mostly orthologs of proteins present in transcriptional activator complexes. The remaining five genes vary broadly in their predicted functions. All suppressor genes could suppress multiple aspects of the let-418 phenotype, including developmental arrest and ectopic expression of germline genes in the soma. Analysis of available transcriptomic data and quantitative PCR revealed that LET-418 and the suppressors of early larval arrest are regulating common target genes. These suppressors might represent direct competitors of LET-418 complexes for chromatin regulation of crucial genes involved in the transition to postembryonic development. PMID:28007841
A Network of Chromatin Factors Is Regulating the Transition to Postembryonic Development in Caenorhabditis elegans.

PubMed

Erdelyi, Peter; Wang, Xing; Suleski, Marina; Wicky, Chantal

2017-02-09

Mi2 proteins are evolutionarily conserved, ATP-dependent chromatin remodelers of the CHD family that play key roles in stem cell differentiation and reprogramming. In Caenorhabditis elegans, the let-418 gene encodes one of the two Mi2 homologs, which is part of at least two chromatin complexes, namely the Nucleosome Remodeling and histone Deacetylase (NuRD) complex and the MEC complex, and functions in larval development, vulval morphogenesis, lifespan regulation, and cell fate determination. To explore the mechanisms involved in the action of LET-418/Mi2, we performed a genome-wide RNA interference (RNAi) screen for suppressors of early larval arrest associated with let-418 mutations. We identified 29 suppressor genes, of which 24 encode chromatin regulators, mostly orthologs of proteins present in transcriptional activator complexes. The remaining five genes vary broadly in their predicted functions. All suppressor genes could suppress multiple aspects of the let-418 phenotype, including developmental arrest and ectopic expression of germline genes in the soma. Analysis of available transcriptomic data and quantitative PCR revealed that LET-418 and the suppressors of early larval arrest are regulating common target genes. These suppressors might represent direct competitors of LET-418 complexes for chromatin regulation of crucial genes involved in the transition to postembryonic development. Copyright © 2017 Erdelyi et al.
RNA-Seq reveals seven promising candidate genes affecting the proportion of thick egg albumen in layer-type chickens.

PubMed

Wan, Yi; Jin, Sihua; Ma, Chendong; Wang, Zhicheng; Fang, Qi; Jiang, Runshen

2017-12-22

Eggs with a much higher proportion of thick albumen are preferred in the layer industry, as they are favoured by consumers. However, the genetic factors affecting the thick egg albumen trait have not been elucidated. Using RNA sequencing, we explored the magnum transcriptome in 9 Rhode Island white layers: four layers with phenotypes of extremely high ratios of thick to thin albumen (high thick albumen, HTA) and five with extremely low ratios (low thick albumen, LTA). A total of 220 genes were differentially expressed, among which 150 genes were up-regulated and 70 were down-regulated in the HTA group compared with the LTA group. Gene Ontology (GO) analysis revealed that the up-regulated genes in HTA were mainly involved in a wide range of regulatory functions. In addition, a large number of these genes were related to glycosphingolipid biosynthesis, focal adhesion, ECM-receptor interactions and cytokine-cytokine receptor interactions. Based on functional analysis, ST3GAL4, FUT4, ITGA2, SDC3, PRLR, CDH4 and GALNT9 were identified as promising candidate genes for thick albumen synthesis and metabolism during egg formation. These results provide new insights into the molecular mechanisms of egg albumen traits and may contribute to future breeding strategies that optimise the proportion of thick egg albumen.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jun, Se -Ran; Hauser, Loren John; Schadt, Christopher Warren

For decades there has been increasing interest in understanding the relationships between microbial communities and ecosystem functions. Current DNA sequencing technologies allow for the exploration of microbial communities in two principal ways: targeted rRNA gene surveys and shotgun metagenomics. For large study designs, it is often still prohibitively expensive to sequence metagenomes at both the breadth and depth necessary to statistically capture the true functional diversity of a community. Although rRNA gene surveys provide no direct evidence of function, they do provide a reasonable estimation of microbial diversity, while being a very cost-effective way to screen samples of interest for later shotgun metagenomic analyses. However, there is a great deal of 16S rRNA gene survey data currently available from diverse environments, and thus a need for tools to infer functional composition of environmental samples based on 16S rRNA gene survey data. As a result, we present a computational method called pangenome-based functional profiles (PanFP), which infers functional profiles of microbial communities from 16S rRNA gene survey data for Bacteria and Archaea. PanFP is based on pangenome reconstruction of a 16S rRNA gene operational taxonomic unit (OTU) from known genes and genomes pooled from the OTU's taxonomic lineage. From this lineage, we derive an OTU functional profile by weighting a pangenome's functional profile with the OTU's abundance observed in a given sample. We validated our method by comparing PanFP to the functional profiles obtained from the direct shotgun metagenomic measurement of 65 diverse communities via Spearman correlation coefficients. These correlations improved with increasing sequencing depth, within the range of 0.8-0.9 for the most deeply sequenced Human Microbiome Project mock community samples. PanFP is very similar in performance to another recently released tool, PICRUSt, for almost all of the survey data analysed here. However, our method is unique in that any OTU building method can be used, as opposed to being limited to closed reference OTU picking strategies against specific reference sequence databases. In conclusion, we developed an automated computational method, which derives an inferred functional profile based on the 16S rRNA gene surveys of microbial communities. The inferred functional profile provides a cost-effective way to study complex ecosystems through predicted comparative functional metagenomes and metadata analysis. All PanFP source code and additional documentation are freely available online at GitHub.
GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge

PubMed Central

Wagner, Florian

2015-01-01

Method: Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. Results: I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets. PMID:26575370
GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge.

PubMed

Wagner, Florian

2015-01-01

Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets.
Feline immunodeficiency virus OrfA alters gene expression of splicing factors and proteasome-ubiquitination proteins

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sundstrom, Magnus; Chatterji, Udayan; Schaffer, Lana

2008-02-20

Expression of the feline immunodeficiency virus (FIV) accessory protein OrfA (or Orf2) is critical for efficient viral replication in lymphocytes, both in vitro and in vivo. OrfA has been reported to exhibit functions in common with the human immunodeficiency virus (HIV) and simian immunodeficiency virus (SIV) accessory proteins Vpr and Tat, although the function of OrfA has not been fully explained. Here, we use microarray analysis to characterize how OrfA modulates the gene expression profile of T-lymphocytes. The primary IL-2-dependent T-cell line 104-C1 was transduced to express OrfA. Functional expression of OrfA was demonstrated by trans complementation of the OrfA-defective clone, FIV-34TF10. OrfA-expressing cells had a slightly reduced cell proliferation rate but did not exhibit any significant alteration in cell cycle distribution. Reverse-transcribed RNA from cells expressing green fluorescent protein (GFP) or GFP + OrfA were hybridized to Affymetrix HU133 Plus 2.0 microarray chips representing more than 47,000 genome-wide transcripts. By using two statistical approaches, 461 (Rank Products) and 277 (ANOVA) genes were identified as modulated by OrfA expression. The functional relevance of the differentially expressed genes was explored by Ingenuity Pathway Analysis. The analyses revealed alterations in genes critical for RNA post-transcriptional modifications and protein ubiquitination as the two most significant functional outcomes of OrfA expression. In these two groups, several subunits of the spliceosome, cellular splicing factors and family members of the proteasome-ubiquitination system were identified. These findings provide novel information on the versatile function of OrfA during FIV infection and indicate a fine-tuning mechanism of the cellular environment by OrfA to facilitate efficient FIV replication.
Increased microbial functional diversity under long-term organic and integrated fertilization in a paddy soil.

PubMed

Ding, Long-Jun; Su, Jian-Qiang; Sun, Guo-Xin; Wu, Jin-Shui; Wei, Wen-Xue

2018-02-01

Microbes play key roles in diverse biogeochemical processes including nutrient cycling. However, responses of soil microbial community and functional genes to long-term integrated fertilization (chemical combined with organic fertilization) remain unclear. Here, we used pyrosequencing and a microarray-based GeoChip to explore the shifts of microbial community and functional genes in a paddy soil which received over 21-year fertilization with various regimes, including control (no fertilizer), rice straw (R), rice straw plus chemical fertilizer nitrogen (NR), N and phosphorus (NPR), NP and potassium (NPKR), and reduced rice straw plus reduced NPK (L-NPKR). Significant shifts of the overall soil bacterial composition only occurred in the NPKR and L-NPKR treatments, with enrichment of certain groups including Bradyrhizobiaceae and Rhodospirillaceae families that benefit higher productivity. All fertilization treatments significantly altered the soil microbial functional structure with increased diversity and abundances of genes for carbon and nitrogen cycling, in which NPKR and L-NPKR exhibited the strongest effect, while R exhibited the least. Functional gene structure and abundance were significantly correlated with corresponding soil enzymatic activities and rice yield, respectively, suggesting that the structural shift of the microbial functional community under fertilization might promote soil nutrient turnover and thereby affect yield. Overall, this study indicates that the combined application of rice straw and balanced chemical fertilizers was more pronounced in shifting the bacterial composition and improving the functional diversity toward higher productivity, providing a microbial point of view on applying a cost-effective integrated fertilization regime with rice straw plus reduced chemical fertilizers for sustainable nutrient management.
CRISPR and piRNAs: Fundamental Mechanisms and Key Applications of the Next Generation of Molecular Technologies in the Field of Toxicology

EPA Science Inventory

Exploration into the roles of genes, the proteins that they encode, and the functions that they carry out within the cell is a founding pillar in the field of toxicology. Recent breakthroughs in clustered, regularly interspaced, short palindromic repeat (CRISPR) technology...
Identification of transcript polymorphisms for seed quality improvement by exploring soybean genetic diversity

USDA-ARS's Scientific Manuscript database

The difference in seed oil composition and content among soybean genotypes could be mostly attributed to transcript sequence and/or expression variations of oil-related genes that lead to changes in the functions of the proteins that they encode and/or their accumulation in seeds. We sequenced ...
Strategies to explore functional genomics data sets in NCBI's GEO database.

PubMed

Wilhite, Stephen E; Barrett, Tanya

2012-01-01

The Gene Expression Omnibus (GEO) database is a major repository that stores high-throughput functional genomics data sets that are generated using both microarray-based and sequence-based technologies. Data sets are submitted to GEO primarily by researchers who are publishing their results in journals that require original data to be made freely available for review and analysis. In addition to serving as a public archive for these data, GEO has a suite of tools that allow users to identify, analyze, and visualize data relevant to their specific interests. These tools include sample comparison applications, gene expression profile charts, data set clusters, genome browser tracks, and a powerful search engine that enables users to construct complex queries.
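The kind of complex query mentioned here can also be issued programmatically through NCBI's E-utilities, which expose GEO DataSets as the `gds` database. The sketch below only builds an ESearch URL and sends nothing over the network; the helper name and the example search terms are illustrative assumptions, and the field qualifiers should be checked against the current GEO documentation before use.

```python
from urllib.parse import urlencode

ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"

def geo_search_url(terms, retmax=20):
    """Build an ESearch URL against GEO DataSets (db=gds).

    terms: query fragments, each optionally carrying a field qualifier
           such as [Organism]; fragments are combined with AND.
    """
    query = " AND ".join(f"({t})" for t in terms)
    params = {"db": "gds", "term": query, "retmax": retmax}
    return f"{ESEARCH}?{urlencode(params)}"

# Example: human series records mentioning glioblastoma (illustrative terms).
url = geo_search_url(["Homo sapiens[Organism]", "gse[Entry Type]", "glioblastoma"])
```

The resulting URL can be opened in a browser or fetched with any HTTP client; the response lists record identifiers that can then be passed to other E-utilities calls.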
Strategies to Explore Functional Genomics Data Sets in NCBI’s GEO Database

PubMed Central

Wilhite, Stephen E.; Barrett, Tanya

2012-01-01

The Gene Expression Omnibus (GEO) database is a major repository that stores high-throughput functional genomics data sets that are generated using both microarray-based and sequence-based technologies. Data sets are submitted to GEO primarily by researchers who are publishing their results in journals that require original data to be made freely available for review and analysis. In addition to serving as a public archive for these data, GEO has a suite of tools that allow users to identify, analyze and visualize data relevant to their specific interests. These tools include sample comparison applications, gene expression profile charts, data set clusters, genome browser tracks, and a powerful search engine that enables users to construct complex queries. PMID:22130872
Microbial ecology of soda lakes: investigating sulfur and nitrogen cycling at Mono Lake, CA, USA

NASA Astrophysics Data System (ADS)

Fairbanks, D.; Phillips, A. A.; Wells, M.; Bao, R.; Fullerton, K. M.; Stamps, B. W.; Speth, D. R.; Johnson, H.; Sessions, A. L.

2017-12-01

Soda lakes represent unique ecosystems characterized by extremes of pH, salinity and distinct geochemical cycling. Despite these extreme conditions, soda lakes are important repositories of biological adaptation and have a highly functional microbial system. We investigated the biogeochemical cycling of sulfur and nitrogen compounds in Mono Lake, California, located east of the Sierra Nevada mountains. Mono Lake is hyperalkaline and hypersaline, has high sulfate concentrations, and can enter prolonged periods of meromixis due to freshwater inflow. Typically, the microbial sulfur cycle is highly active in soda lakes with both oxidation and reduction of sulfur compounds. However, the biological sulfur cycle is connected to many other main elemental cycles such as carbon, nitrogen and metals. Here we investigated the interaction between sulfur and nitrogen cycling in Mono Lake using a combination of molecular, isotopic, and geochemical observations to explore the links between microbial phylogenetic composition and functionality. Metagenomic and 16S rRNA gene amplicon sequencing were performed on samples from two locations and five depths in May 2017. 16S rRNA gene amplicon sequencing analysis revealed organisms capable of both sulfur and nitrogen cycling. The relative abundance and distribution of functional genes (dsrA, soxAB, nifH, etc.) were also determined. These genetic markers indicate the potential in situ relevance of specific carbon, nitrogen, and sulfur pathways in the water column prior to the transition to meromictic stratification. However, genes for sulfide oxidation, denitrification, and ammonification were present. Genome binning guided by the most abundant dsrA sequences, GC content, and abundance with depth identified a Thioalkalivibrio paradoxus bin containing genes for sulfur oxidation, denitrification, and nitrate reduction. The presence of a large number of sulfur and nitrogen cycling genes associated with Thioalkalivibrio paradoxus suggests thiosulfate oxidation may be coupled to nitrate reduction despite the extremely low level of nitrate in Mono Lake. Our results illustrate the centrality of living organisms in both shaping and responding to geochemical cycles, as well as future directions for exploring coupled biogeochemical cycles in Mono Lake.
Exploring the effect of the apolipoprotein E (APOE) gene on executive function, working memory, and processing speed during the early recovery period following traumatic brain injury.

PubMed

Padgett, Christine R; Summers, Mathew J; Vickers, James C; McCormack, Graeme H; Skilbeck, Clive E

2016-01-01

There is evidence that the e4 allele of the apolipoprotein E (APOE) gene is detrimental to cognitive function, but results from traumatic brain injury (TBI) populations are mixed. A possible explanation is that APOEe2 carriers have routinely been incorporated into APOEe4 and non-e4 groups, despite APOEe2 being proposed to have an ameliorative effect. Our primary aim was to investigate the influence of APOEe4 on cognitive impairment during early recovery following TBI, excluding the potential confound of APOEe2 possession. A secondary objective was to explore whether APOEe4 displays more pronounced effects in moderate to severe TBI and to consider the potential postinjury protective influence of the APOEe2 allele. Participants who recently sustained a TBI (posttraumatic amnesia > 5 minutes) were assessed on measures of information processing speed, executive function, and working memory upon remission of posttraumatic amnesia. APOE genotype was determined by buccal saliva DNA extraction (APOEe4 n = 37, APOEe3 n = 92, APOEe2 n = 13). Stepwise multiple regressions were performed to compare APOEe4 carriers to APOEe3 homozygotes, with injury severity, age, and estimated premorbid IQ included in the first step. This model was found to significantly predict performance on all tasks, accounting for 17.3-24.3% of the variance. When APOEe4 status was added for the second step, there were no significant changes on any tasks (additional variance <1%). The effect of APOEe4 in moderate to severe TBI and the effect of APOEe2 were explored by analysis of covariance (ANCOVA), with no significant effects revealed. It is unlikely that APOE genotype influences cognitive function in the initial recovery period following TBI, regardless of injury severity. However, a more nuanced and long-term exploration of the effect of APOE genotype in the TBI population is warranted.
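The two-step regression described in this abstract can be illustrated with a minimal sketch. It fits covariates first, then adds carrier status and reports the change in R². Everything below is an assumption made for illustration: the data are synthetic (only the group sizes 37 and 92 come from the abstract), the effect sizes are arbitrary, and the helper names `solve` and `r_squared` are hypothetical. It does not reproduce the authors' analysis.

```python
import random

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def r_squared(rows, y):
    """R^2 of an ordinary least-squares fit of y on rows (intercept added)."""
    X = [[1.0] + list(r) for r in rows]
    p = len(X[0])
    XtX = [[sum(x[i] * x[j] for x in X) for j in range(p)] for i in range(p)]
    Xty = [sum(x[i] * yi for x, yi in zip(X, y)) for i in range(p)]
    beta = solve(XtX, Xty)
    fitted = [sum(b * v for b, v in zip(beta, x)) for x in X]
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot

random.seed(0)
n = 37 + 92  # e4 carriers plus e3 homozygotes, group sizes from the abstract
# Synthetic stand-ins for injury severity, age, and premorbid IQ.
covariates = [[random.gauss(0, 1) for _ in range(3)] for _ in range(n)]
e4 = [1.0 if i < 37 else 0.0 for i in range(n)]
# Synthetic outcome with no e4 effect built in (arbitrary coefficients).
score = [0.4 * c[0] - 0.3 * c[1] + 0.3 * c[2] + random.gauss(0, 1)
         for c in covariates]

r2_step1 = r_squared(covariates, score)                       # step 1
r2_step2 = r_squared([c + [g] for c, g in zip(covariates, e4)], score)  # step 2
delta_r2 = r2_step2 - r2_step1
print(f"step 1 R^2 = {r2_step1:.3f}, change after adding e4 = {delta_r2:.4f}")
```

Because the step 1 model is nested in the step 2 model, the change in R² cannot be negative; the question such an analysis asks is whether that change is large enough to matter.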
Functional genomic responses to cystic fibrosis transmembrane conductance regulator (CFTR) and CFTR(delta508) in the lung.

PubMed

Xu, Yan; Liu, Cong; Clark, Jean C; Whitsett, Jeffrey A

2006-04-21

Cystic fibrosis (CF), a common lethal pulmonary disorder in Caucasians, is caused by mutations in the cystic fibrosis transmembrane conductance regulator gene (CFTR) that disturbs fluid homeostasis and host defense in target organs. The effects of CFTR and delta508-CFTR were assessed in transgenic mice that 1) lack CFTR expression (Cftr-/-); 2) express the human delta508 CFTR (CFTR(delta508)); 3) overexpress the normal human CFTR (CFTR(tg)) in respiratory epithelial cells. Genes were selected from Affymetrix Murine Gene-Chips analysis and subjected to functional classification, k-means clustering, promoter cis-elements/modules searching, literature mining, and pathway exploring. Genomic responses to Cftr-/- were not corrected by expression of CFTR(delta508). Genes regulating host defense, inflammation, fluid and electrolyte transport were similarly altered in Cftr-/- and CFTR(delta508) mice. CFTR(delta508) induced a primary disturbance in expression of genes regulating redox and antioxidant systems. Genomic responses to CFTR(tg) were modest and were not associated with lung pathology. CFTR(tg) and CFTR(delta508) induced genes encoding heat shock proteins and other chaperones but did not activate the endoplasmic reticulum-associated degradation pathway. RNAs encoding proteins that directly interact with CFTR were identified in each of the CFTR mouse models, supporting the hypothesis that CFTR functions within a multiprotein complex whose members interact at the level of protein-protein interactions and gene expression. Promoters of genes influenced by CFTR shared common regulatory elements, suggesting that their co-expression may be mediated by shared regulatory mechanisms. Genes and pathways involved in the response to CFTR may be of interest as modifiers of CF.
Genome-Wide Identification, Expression, and Functional Analysis of the Sugar Transporter Gene Family in Cassava (Manihot esculenta).

PubMed

Liu, Qin; Dang, Huijie; Chen, Zhijian; Wu, Junzheng; Chen, Yinhua; Chen, Songbi; Luo, Lijuan

2018-03-26

The sugar transporter ( STP ) gene family encodes monosaccharide transporters that contain 12 transmembrane domains and belong to the major facilitator superfamily. STP genes play critical roles in monosaccharide distribution and participate in diverse plant metabolic processes. To investigate the potential roles of STPs in cassava ( Manihot esculenta ) tuber root growth, genome-wide identification and expression and functional analyses of the STP gene family were performed in this study. A total of 20 MeSTP genes ( MeSTP1 - 20 ) containing the Sugar_tr conserved motifs were identified from the cassava genome, which could be further classified into four distinct groups in the phylogenetic tree. The expression profiles of the MeSTP genes explored using RNA-seq data showed that most of the MeSTP genes exhibited tissue-specific expression, and 15 out of 20 MeSTP genes were mainly expressed in the early storage root of cassava. qRT-PCR analysis further confirmed that most of the MeSTPs displayed higher expression in roots after 30 and 40 days of growth, suggesting that these genes may be involved in the early growth of tuber roots. Although all the MeSTP proteins exhibited plasma membrane localization, variations in monosaccharide transport activity were found through a complementation analysis in a yeast ( Saccharomyces cerevisiae ) mutant, defective in monosaccharide uptake. Among them, MeSTP2, MeSTP15, and MeSTP19 were able to efficiently complement the uptake of five monosaccharides in the yeast mutant, while MeSTP3 and MeSTP16 only grew on medium containing galactose, suggesting that these two MeSTP proteins are transporters specific for galactose. This study provides significant insights into the potential functions of MeSTPs in early tuber root growth, which possibly involves the regulation of monosaccharide distribution.
[Progress in porky genes and transcriptome and discussion of relative issues].

PubMed

Zhu, Meng-Jin; Liu, Bang; Li, Kui

2005-01-01

To date, research on the molecular basis of porky development has mainly concerned muscle growth and meat quality. Some functional genes, including the Hal gene and the RN gene, and some QTLs controlling or associated with porky growth and quality have been detected through the candidate gene approach and genome-wide scanning. The transcriptome of porcine muscle and adipose tissue has also come under study. At the same time, this research has shown some shortcomings. Research in molecular quantitative genetics has overemphasized single genes and ignored the co-expression patterns of multiple genes. Research applying transcriptome analysis tools has also met two limitations: one is the reliance on a single type of molecular experimental technique, and the other is that genes of muscle and adipose have been artificially divided into two unconnected parts. Thus, exploring porky genes by parallel genetics based on systemic views and techniques, specifically to reveal the interaction mechanisms of the genes controlling muscle and adipose respectively, will be an important issue for gene and genome research on porky development in the near future.
Genome-wide protein-protein interactions and protein function exploration in cyanobacteria

PubMed Central

Lv, Qi; Ma, Weimin; Liu, Hui; Li, Jiang; Wang, Huan; Lu, Fang; Zhao, Chen; Shi, Tieliu

2015-01-01

Genome-wide network analysis is widely used to study proteins of unknown function. Here, we explored protein functions and biological mechanisms based on an inferred high-confidence protein-protein interaction (PPI) network in cyanobacteria. We integrated data from seven different sources and predicted 1,997 PPIs, which were evaluated by experiments on molecular mechanism, text mining of the literature for direct/indirect evidence, and “interologs” for conservation. Combining the predicted PPIs with known PPIs, we obtained 4,715 non-redundant PPIs (involving 3,231 proteins covering over 90% of the genome) to generate the PPI network. Based on the PPI network, Gene Ontology (GO) terms were assigned to proteins of unknown function. Functional modules were identified by dissecting the PPI network into sub-networks and analyzing pathway enrichment, with which we investigated novel functions of the underlying proteins in protein complexes and pathways. Examples from photosynthesis and DNA repair indicate that the network approach is a powerful tool in protein function analysis. Overall, this systems biology approach provides new insight into posterior functional analysis of PPIs in cyanobacteria. PMID:26490033
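The abstract does not state which rule was used to transfer GO terms across the PPI network. One common and simple heuristic is neighbor voting, sketched below under that assumption. The function name `predict_terms`, the `min_support` threshold, and the toy edges and labels are all hypothetical and chosen only to echo the photosynthesis and DNA repair examples.

```python
from collections import Counter

def predict_terms(ppi_edges, known_terms, min_support=2):
    """Assign to each unannotated protein the terms carried by at least
    `min_support` of its annotated interaction partners."""
    neighbors = {}
    for a, b in ppi_edges:
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)
    predictions = {}
    for protein, partners in neighbors.items():
        if protein in known_terms:
            continue  # already annotated, nothing to predict
        votes = Counter(t for p in partners for t in known_terms.get(p, ()))
        predictions[protein] = sorted(t for t, n in votes.items()
                                      if n >= min_support)
    return predictions

# Toy network: u1 and u2 stand for proteins of unknown function (hypothetical).
edges = [("u1", "psbA"), ("u1", "psbD"), ("u1", "recA"), ("u2", "recA")]
known = {
    "psbA": {"photosynthesis"},
    "psbD": {"photosynthesis"},
    "recA": {"DNA repair"},
}
predictions = predict_terms(edges, known)
print(predictions)
```

In this toy case u1 receives a term because two annotated partners agree, while u2 has only one supporting partner and is left unannotated. A real pipeline would typically weight votes by edge confidence and test enrichment statistically.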
A limited role for gene duplications in the evolution of platypus venom.

PubMed

Wong, Emily S W; Papenfuss, Anthony T; Whittington, Camilla M; Warren, Wesley C; Belov, Katherine

2012-01-01

Gene duplication followed by adaptive selection is believed to be the primary driver of venom evolution. However, to date, no studies have evaluated the importance of gene duplications for venom evolution using a genomic approach. The availability of a sequenced genome and a venom gland transcriptome for the enigmatic platypus provides a unique opportunity to explore the role that gene duplication plays in venom evolution. Here, we identify gene duplication events and correlate them with expressed transcripts in an in-season venom gland. Gene duplicates (1,508) were identified. These duplicated pairs (421), including genes that have undergone multiple rounds of gene duplications, were expressed in the venom gland. The majority of these genes are involved in metabolism and protein synthesis not toxin functions. Twelve secretory genes including serine proteases, metalloproteinases, and protease inhibitors likely to produce symptoms of envenomation such as vasodilation and pain were detected. Only 16 of 107 platypus genes with high similarity to known toxins evolved through gene duplication. Platypus venom C-type natriuretic peptides and nerve growth factor do not possess lineage-specific gene duplicates. Extensive duplications, believed to increase the potency of toxic content and promote toxin diversification, were not found. This is the first study to take a genome-wide approach in order to examine the impact of gene duplication on venom evolution. Our findings support the idea that adaptive selection acts on gene duplicates to drive the independent evolution and functional diversification of similar venom genes in venomous species. However, gene duplications alone do not explain the "venome" of the platypus. Other mechanisms, such as alternative splicing and mutation, may be important in venom innovation.
A Limited Role for Gene Duplications in the Evolution of Platypus Venom

PubMed Central

Wong, Emily S. W.; Papenfuss, Anthony T.; Whittington, Camilla M.; Warren, Wesley C.; Belov, Katherine

2012-01-01

Gene duplication followed by adaptive selection is believed to be the primary driver of venom evolution. However, to date, no studies have evaluated the importance of gene duplications for venom evolution using a genomic approach. The availability of a sequenced genome and a venom gland transcriptome for the enigmatic platypus provides a unique opportunity to explore the role that gene duplication plays in venom evolution. Here, we identify gene duplication events and correlate them with expressed transcripts in an in-season venom gland. Gene duplicates (1,508) were identified. These duplicated pairs (421), including genes that have undergone multiple rounds of gene duplications, were expressed in the venom gland. The majority of these genes are involved in metabolism and protein synthesis not toxin functions. Twelve secretory genes including serine proteases, metalloproteinases, and protease inhibitors likely to produce symptoms of envenomation such as vasodilation and pain were detected. Only 16 of 107 platypus genes with high similarity to known toxins evolved through gene duplication. Platypus venom C-type natriuretic peptides and nerve growth factor do not possess lineage-specific gene duplicates. Extensive duplications, believed to increase the potency of toxic content and promote toxin diversification, were not found. This is the first study to take a genome-wide approach in order to examine the impact of gene duplication on venom evolution. Our findings support the idea that adaptive selection acts on gene duplicates to drive the independent evolution and functional diversification of similar venom genes in venomous species. However, gene duplications alone do not explain the “venome” of the platypus. Other mechanisms, such as alternative splicing and mutation, may be important in venom innovation. PMID:21816864
New Funding Opportunity - Illuminating the Druggable Genome | Office of Cancer Clinical Proteomics Research

Cancer.gov

The National Institutes of Health Common Fund announces two new Funding Opportunity Announcements with a focus on the Illuminating the Druggable Genome (IDG). These funding opportunities are designed to foster the development of technologies and information management to facilitate the unveiling of the functions of the poorly characterized and/or un-annotated members in four protein classes of the Druggable Genome. The IDG project is predicated on the need to fully explore the underlying biology and role in disease of genes linked to already drugged genes within the Druggable Genome.
An interactional network of genes involved in chitin synthesis in Saccharomyces cerevisiae.

PubMed

Lesage, Guillaume; Shapiro, Jesse; Specht, Charles A; Sdicu, Anne-Marie; Ménard, Patrice; Hussein, Shamiza; Tong, Amy Hin Yan; Boone, Charles; Bussey, Howard

2005-02-16

In S. cerevisiae the beta-1,4-linked N-acetylglucosamine polymer, chitin, is synthesized by a family of 3 specialized but interacting chitin synthases encoded by CHS1, CHS2 and CHS3. Chs2p makes chitin in the primary septum, while Chs3p makes chitin in the lateral cell wall and in the bud neck, and can partially compensate for the lack of Chs2p. Chs3p requires a pathway of Bni4p, Chs4p, Chs5p, Chs6p and Chs7p for its localization and activity. Chs1p is thought to have a septum repair function after cell separation. To further explore interactions in the chitin synthase family and to find processes buffering chitin synthesis, we compiled a genetic interaction network of genes showing synthetic interactions with CHS1, CHS3 and genes involved in Chs3p localization and function and made a phenotypic analysis of their mutants. Using deletion mutants in CHS1, CHS3, CHS4, CHS5, CHS6, CHS7 and BNI4 in a synthetic genetic array analysis we assembled a network of 316 interactions among 163 genes. The interaction network with CHS3, CHS4, CHS5, CHS6, CHS7 or BNI4 forms a dense neighborhood, with many genes functioning in cell wall assembly or polarized secretion. Chitin levels were altered in 54 of the mutants in individually deleted genes, indicating a functional relationship between them and chitin synthesis. 32 of these mutants triggered the chitin stress response, with elevated chitin levels and a dependence on CHS3. A large fraction of the CHS1-interaction set was distinct from that of the CHS3 network, indicating broad roles for Chs1p in buffering both Chs2p function and more global cell wall robustness. Based on their interaction patterns and chitin levels we group interacting mutants into functional categories. Genes interacting with CHS3 are involved in the amelioration of cell wall defects and in septum or bud neck chitin synthesis, and we newly assign a number of genes to these functions. Our genetic analysis of genes not interacting with CHS3 indicates expanded roles for Chs4p, Chs5p and Chs6p in secretory protein trafficking and of Bni4p in bud neck organization.
The role of leptin, melanocortin, and neurotrophin system genes on body weight in anorexia nervosa and bulimia nervosa.

PubMed

Yilmaz, Zeynep; Kaplan, Allan S; Tiwari, Arun K; Levitan, Robert D; Piran, Sara; Bergen, Andrew W; Kaye, Walter H; Hakonarson, Hakon; Wang, Kai; Berrettini, Wade H; Brandt, Harry A; Bulik, Cynthia M; Crawford, Steven; Crow, Scott; Fichter, Manfred M; Halmi, Katherine A; Johnson, Craig L; Keel, Pamela K; Klump, Kelly L; Magistretti, Pierre; Mitchell, James E; Strober, Michael; Thornton, Laura M; Treasure, Janet; Woodside, D Blake; Knight, Joanne; Kennedy, James L

2014-08-01

Although low weight is a key factor contributing to the high mortality in anorexia nervosa (AN), it is unclear how AN patients sustain low weight compared with bulimia nervosa (BN) patients with similar psychopathology. Studies of genes involved in appetite and weight regulation in eating disorders have yielded variable findings, in part due to small sample size and clinical heterogeneity. This study: (1) assessed the role of leptin, melanocortin, and neurotrophin genetic variants in conferring risk for AN and BN; and (2) explored the involvement of these genes in body mass index (BMI) variations within AN and BN. Our sample consisted of 745 individuals with AN without a history of BN, 245 individuals with BN without a history of AN, and 321 controls. We genotyped 20 markers with known or putative function among genes selected from leptin, melanocortin, and neurotrophin systems. There were no significant differences in allele frequencies among individuals with AN, BN, and controls. AGRP rs13338499 polymorphism was associated with lowest illness-related BMI in those with AN (p = 0.0013), and NTRK2 rs1042571 was associated with highest BMI in those with BN (p = 0.0018). To our knowledge, this is the first study to address the issue of clinical heterogeneity in eating disorder genetic research and to explore the role of known or putatively functional markers in genes regulating appetite and weight in individuals with AN and BN. If replicated, our results may serve as an important first step toward gaining a better understanding of weight regulation in eating disorders. Copyright © 2014 Elsevier Ltd. All rights reserved.

PLAZA 3.0: an access point for plant comparative genomics

PubMed Central

Proost, Sebastian; Van Bel, Michiel; Vaneechoutte, Dries; Van de Peer, Yves; Inzé, Dirk; Mueller-Roeber, Bernd; Vandepoele, Klaas

2015-01-01

Comparative sequence analysis has significantly altered our view on the complexity of genome organization and gene functions in different kingdoms. PLAZA 3.0 is designed to make comparative genomics data for plants available through a user-friendly web interface. Structural and functional annotation, gene families, protein domains, phylogenetic trees and detailed information about genome organization can easily be queried and visualized. Compared with the first version released in 2009, which featured nine organisms, the number of integrated genomes is more than four times higher, and now covers 37 plant species. The new species provide a wider phylogenetic range as well as a more in-depth sampling of specific clades, and genomes of additional crop species are present. The functional annotation has been expanded and now comprises data from Gene Ontology, MapMan, UniProtKB/Swiss-Prot, PlnTFDB and PlantTFDB. Furthermore, we improved the algorithms to transfer functional annotation from well-characterized plant genomes to other species. The additional data and new features make PLAZA 3.0 (http://bioinformatics.psb.ugent.be/plaza/) a versatile and comprehensible resource for users wanting to explore genome information to study different aspects of plant biology, both in model and non-model organisms. PMID:25324309
Exploration of structural stability in deleterious nsSNPs of the XPA gene: A molecular dynamics approach

PubMed Central

NagaSundaram, N; Priya Doss, C George

2011-01-01

Background: Distinguishing the deleterious from the massive number of non-functional nsSNPs that occur within a single genome is a considerable challenge in mutation research. In this approach, we have used the existing in silico methods to explore the mutation-structure-function relationship in the XPA gene. Materials and Methods: We used the Sorting Intolerant From Tolerant (SIFT), Polymorphism Phenotyping (PolyPhen), I-Mutant 2.0, and the Protein Analysis THrough Evolutionary Relationships methods to predict the effects of deleterious nsSNPs on protein function and evaluated the impact of mutation on protein stability by Molecular Dynamics simulations. Results: By comparing the scores of all the four in silico methods, nsSNP with an ID rs104894131 at position C108F was predicted to be highly deleterious. We extended our Molecular dynamics approach to gain insight into the impact of this non-synonymous polymorphism on structural changes that may affect the activity of the XPA gene. Conclusion: Based on the in silico methods score, potential energy, root-mean-square deviation, and root-mean-square fluctuation, we predict that deleterious nsSNP at position C108F would play a significant role in causing disease by the XPA gene. Our approach would present the application of in silico tools in understanding the functional variation from the perspective of structure, evolution, and phenotype. PMID:22190868
The structure and function of the dopamine transporter and its role in CNS diseases.

PubMed

McHugh, Patrick C; Buckley, David A

2015-01-01

In this chapter, we explore the basic science of the dopamine transporter (DAT), an integral component of a system that regulates dopamine homeostasis. Dopamine is a key neurotransmitter for several brain functions including locomotor control and reward systems. The transporter structure, function, mechanism of action, localization, and distribution, in addition to gene regulation, are discussed. Over many years, a wealth of information concerning the DAT has been accrued and has led to increased interest in the role of the DAT in a plethora of central nervous system diseases. These DAT characteristics are explored in relation to a range of neurological and neuropsychiatric diseases, with a particular focus on the genetics of the DAT. In addition, we discuss the pharmacology of the DAT and how this relates to disease and addiction. © 2015 Elsevier Inc. All rights reserved.
MinGenome: An In Silico Top-Down Approach for the Synthesis of Minimized Genomes.

PubMed

Wang, Lin; Maranas, Costas D

2018-02-16

Genome minimized strains offer advantages as production chassis by reducing transcriptional cost, eliminating competing functions and limiting unwanted regulatory interactions. Existing approaches for identifying stretches of DNA to remove are largely ad hoc based on information on presumably dispensable regions through experimentally determined nonessential genes and comparative genomics. Here we introduce a versatile genome reduction algorithm MinGenome that implements a mixed-integer linear programming (MILP) algorithm to identify in size descending order all dispensable contiguous sequences without affecting the organism's growth or other desirable traits. Known essential genes or genes that cause significant fitness or performance loss can be flagged and their deletion can be prohibited. MinGenome also preserves needed transcription factors and promoter regions ensuring that retained genes will be properly transcribed while also avoiding the simultaneous deletion of synthetic lethal pairs. The potential benefit of removing even larger contiguous stretches of DNA if only one or two essential genes (to be reinserted elsewhere) are within the deleted sequence is explored. We applied the algorithm to design a minimized E. coli strain and found that we were able to recapitulate the long deletions identified in previous experimental studies and discover alternative combinations of deletions that have not yet been explored in vivo.
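The idea of listing dispensable contiguous stretches in descending size order can be sketched with a plain linear scan. This is explicitly not the MILP formulation the abstract describes: it ignores growth constraints, promoter and transcription factor retention, and synthetic lethal pairs, and it only splits the gene order at genes flagged as protected. The function name `dispensable_runs` and the toy genes and lengths are hypothetical.

```python
def dispensable_runs(genes, protected):
    """Return maximal runs of consecutive unprotected genes as
    (total_length_bp, [gene names]) tuples, longest first.

    genes: list of (name, length_bp) in chromosomal order.
    protected: set of gene names whose deletion is prohibited.
    """
    runs, current = [], []
    for name, length in genes:
        if name in protected:
            if current:
                runs.append(current)  # a protected gene ends the current run
            current = []
        else:
            current.append((name, length))
    if current:
        runs.append(current)
    scored = [(sum(length for _, length in run), [name for name, _ in run])
              for run in runs]
    return sorted(scored, key=lambda item: item[0], reverse=True)

# Toy chromosome with one protected (for example, essential) gene.
toy_genes = [("g1", 900), ("g2", 1200), ("g3", 600),
             ("g4", 1500), ("g5", 300), ("g6", 800)]
runs = dispensable_runs(toy_genes, protected={"g3"})
for total, names in runs:
    print(total, names)
```

An optimization-based method differs from this scan mainly in that it can trade off deletions against one another under model constraints, for instance rejecting a run whose removal would break growth even though no single gene in it is essential.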
GEM2Net: from gene expression modeling to -omics networks, a new CATdb module to investigate Arabidopsis thaliana genes involved in stress response.

PubMed

Zaag, Rim; Tamby, Jean Philippe; Guichard, Cécile; Tariq, Zakia; Rigaill, Guillem; Delannoy, Etienne; Renou, Jean-Pierre; Balzergue, Sandrine; Mary-Huard, Tristan; Aubourg, Sébastien; Martin-Magniette, Marie-Laure; Brunaud, Véronique

2015-01-01

CATdb (http://urgv.evry.inra.fr/CATdb) is a database providing public access to a large collection of transcriptomic data, mainly for Arabidopsis but also for other plants. This resource has the rare advantage of containing several thousand microarray experiments obtained with the same technical protocol and analyzed by the same statistical pipelines. In this paper, we present GEM2Net, a new module of CATdb that takes advantage of this homogeneous dataset to mine co-expression units and decipher Arabidopsis gene functions. GEM2Net explores 387 stress conditions organized into 18 biotic and abiotic stress categories. For each one, a model-based clustering is applied on expression differences to identify clusters of co-expressed genes. To characterize functions associated with these clusters, various resources are analyzed and integrated: Gene Ontology, subcellular localization of proteins, Hormone Families, Transcription Factor Families and a refined stress-related gene list associated with publications. Exploiting protein-protein interactions and transcription factor-target interactions makes it possible to display gene networks. GEM2Net presents the analysis of the 18 stress categories, in which 17,264 genes are involved and organized within 681 co-expression clusters. The meta-data analyses were stored and organized to compose a dynamic Web resource. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Systematic Analysis of the 4-Coumarate:Coenzyme A Ligase (4CL) Related Genes and Expression Profiling during Fruit Development in the Chinese Pear

PubMed Central

Cao, Yunpeng; Han, Yahui; Li, Dahui; Lin, Yi; Cai, Yongping

2016-01-01

In plants, 4-coumarate:coenzyme A ligases (4CLs), comprising some of the adenylate-forming enzymes, are key enzymes involved in regulating lignin metabolism and the biosynthesis of flavonoids and other secondary metabolites. Although several 4CL-related proteins were shown to play roles in secondary metabolism, no comprehensive study on 4CL-related genes in the pear and other Rosaceae species has been reported. In this study, we identified 4CL-related genes in the apple, peach, yangmei, and pear genomes using DNATOOLS software and inferred their evolutionary relationships using phylogenetic analysis, collinearity analysis, conserved motif analysis, and structure analysis. A total of 149 4CL-related genes in four Rosaceous species (pear, apple, peach, and yangmei) were identified, with 30 members in the pear. We explored the functions of several 4CL and acyl-coenzyme A synthetase (ACS) genes during the development of pear fruit by quantitative real-time PCR (qRT-PCR). We found that duplication events had occurred in the 30 4CL-related genes in the pear. These duplicated 4CL-related genes are distributed unevenly across all pear chromosomes except chromosomes 4, 8, 11, and 12. The results of this study provide a basis for further investigation of both the functions and evolutionary history of 4CL-related genes. PMID:27775579
Systematic Analysis of the 4-Coumarate:Coenzyme A Ligase (4CL) Related Genes and Expression Profiling during Fruit Development in the Chinese Pear.

PubMed

Cao, Yunpeng; Han, Yahui; Li, Dahui; Lin, Yi; Cai, Yongping

2016-10-19

In plants, 4-coumarate:coenzyme A ligases (4CLs), comprising some of the adenylate-forming enzymes, are key enzymes involved in regulating lignin metabolism and the biosynthesis of flavonoids and other secondary metabolites. Although several 4CL-related proteins were shown to play roles in secondary metabolism, no comprehensive study on 4CL-related genes in the pear and other Rosaceae species has been reported. In this study, we identified 4CL-related genes in the apple, peach, yangmei, and pear genomes using DNATOOLS software and inferred their evolutionary relationships using phylogenetic analysis, collinearity analysis, conserved motif analysis, and structure analysis. A total of 149 4CL-related genes in four Rosaceous species (pear, apple, peach, and yangmei) were identified, with 30 members in the pear. We explored the functions of several 4CL and acyl-coenzyme A synthetase (ACS) genes during the development of pear fruit by quantitative real-time PCR (qRT-PCR). We found that duplication events had occurred in the 30 4CL-related genes in the pear. These duplicated 4CL-related genes are distributed unevenly across all pear chromosomes except chromosomes 4, 8, 11, and 12. The results of this study provide a basis for further investigation of both the functions and evolutionary history of 4CL-related genes.
Differential mantle transcriptomics and characterization of growth-related genes in the diploid and triploid pearl oyster Pinctada fucata.

PubMed

Guan, Yunyan; He, Maoxian; Wu, Houbo

2017-06-01

To explore the molecular mechanism of triploidy effect in the pearl oyster Pinctada fucata, two RNA-seq libraries were constructed from the mantle tissue of diploids and triploids by Roche-454 massive parallel pyrosequencing. The identification of differentially expressed genes (DEGs) between diploid and triploid may reveal the molecular mechanism of triploidy effect. In this study, 230 down-regulated and 259 up-regulated DEGs were obtained by comparison between diploid and triploid libraries. The gene ontology and KEGG pathway analysis revealed more functional activation in triploids, and this may be due to the duplicated gene expression at the transcriptional level during whole genome duplication (WGD). To confirm the sequencing data, a set of 11 up-regulated genes related to growth and development control and regulation were analyzed by RT-qPCR in an independent experiment. According to the validation and annotation of these genes, it is hypothesized that the set of up-regulated expressed genes had the correlated expression pattern involved in shell building or other interactive probable functions during triploidization. The up-regulation of growth-related genes may support the classic hypotheses of 'energy redistribution' from early research. The results provide valuable resources to understand the molecular mechanism of triploidy effect in both shell building and producing high-quality seawater pearls. Copyright © 2017 Elsevier B.V. All rights reserved.
Integrated Analysis of Alzheimer's Disease and Schizophrenia Dataset Revealed Different Expression Pattern in Learning and Memory.

PubMed

Li, Wen-Xing; Dai, Shao-Xing; Liu, Jia-Qian; Wang, Qian; Li, Gong-Hua; Huang, Jing-Fei

2016-01-01

Alzheimer's disease (AD) and schizophrenia (SZ) are both accompanied by impaired learning and memory functions. This study aims to explore the expression profiles of learning or memory genes between AD and SZ. We downloaded 10 AD and 10 SZ datasets from GEO-NCBI for integrated analysis. These datasets were processed using RMA algorithm and a global renormalization for all studies. Then Empirical Bayes algorithm was used to find the differentially expressed genes between patients and controls. The results showed that most of the differentially expressed genes were related to AD whereas the gene expression profile was little affected in the SZ. Furthermore, in the aspects of the number of differentially expressed genes, the fold change and the brain region, there was a great difference in the expression of learning or memory related genes between AD and SZ. In AD, the CALB1, GABRA5, and TAC1 were significantly downregulated in whole brain, frontal lobe, temporal lobe, and hippocampus. However, in SZ, only two genes CRHBP and CX3CR1 were downregulated in hippocampus, and other brain regions were not affected. The effect of these genes on learning or memory impairment has been widely studied. It was suggested that these genes may play a crucial role in AD or SZ pathogenesis. The different gene expression patterns between AD and SZ on learning and memory functions in different brain regions revealed in our study may help to understand the different mechanism between two diseases.
Interactions between Snow Chemistry, Mercury Inputs and Microbial Population Dynamics in an Arctic Snowpack

PubMed Central

Larose, Catherine; Prestat, Emmanuel; Cecillon, Sébastien; Berger, Sibel; Malandain, Cédric; Lyon, Delina; Ferrari, Christophe; Schneider, Dominique; Dommergue, Aurélien; Vogel, Timothy M.

2013-01-01

We investigated the interactions between snowpack chemistry, mercury (Hg) contamination and microbial community structure and function in Arctic snow. Snowpack chemistry (inorganic and organic ions) including mercury (Hg) speciation was studied in samples collected during a two-month field study in a high Arctic site, Svalbard, Norway (79°N). Shifts in microbial community structure were determined by using a 16S rRNA gene phylogenetic microarray. We linked snowpack and meltwater chemistry to changes in microbial community structure by using co-inertia analyses (CIA) and explored changes in community function due to Hg contamination by q-PCR quantification of Hg-resistance genes in metagenomic samples. Based on the CIA, chemical and microbial data were linked (p = 0.006) with bioavailable Hg (BioHg) and methylmercury (MeHg) contributing significantly to the ordination of samples. Mercury was shown to influence community function with increases in merA gene copy numbers at low BioHg levels. Our results show that snowpacks can be considered as dynamic habitats with microbial and chemical components responding rapidly to environmental changes. PMID:24282515
Targeted deletion of p97 (VCP/CDC48) in mouse results in early embryonic lethality.

PubMed

Müller, J M M; Deinhardt, K; Rosewell, I; Warren, G; Shima, D T

2007-03-09

The highly conserved AAA ATPase p97 (VCP/CDC48) has well-established roles in cell cycle progression, proteasome degradation and membrane dynamics. Gene disruption in Saccharomyces cerevisiae, Drosophila melanogaster and Trypanosoma brucei demonstrated that p97 is essential in unicellular and multicellular organisms. To explore the requirement for p97 in mammalian cell function and embryogenesis, we disrupted the p97 locus by gene targeting. Heterozygous p97+/- mice were indistinguishable from their wild-type littermates, whereas homozygous mutants did not survive to birth and died at a peri-implantation stage. These results show that p97 is an essential gene for early mouse development.
The evolution of microRNAs in plants

PubMed Central

Cui, Jie; You, Chenjiang; Chen, Xuemei

2016-01-01

MicroRNAs (miRNAs) are a central player in post-transcriptional regulation of gene expression and are involved in numerous biological processes in eukaryotes. Knowledge of the origins and divergence of miRNAs paves the way for a better understanding of the complexity of the regulatory networks that they participate in. The biogenesis, degradation, and regulatory activities of miRNAs are relatively better understood, but the evolutionary history of miRNAs still needs more exploration. Inverted duplication of target genes, random hairpin sequences and small transposable elements constitute three main models that explain the origination of miRNA genes (MIR). Both inter- and intra-species divergence of miRNAs exhibits functional adaptation and adaptation to changing environments in evolution. Here we summarize recent progress in studies on the evolution of MIR and related genes. PMID:27886593
Inflammatory Bowel Diseases: the genetic revolution.

PubMed

Jung, C; Hugot, J-P

2009-06-01

The genetic component of Inflammatory Bowel Diseases is among the best known for complex genetic disorders. Although the functional candidate gene approach was rarely fruitful in the past, genome-wide scans have identified several susceptibility genes for Crohn's disease, including NOD2, IL23R, ATG16L1, IRGM, TNFSF15, a region close to PTGER4, PTPN2, PTPN22, NKX2-3 and many others. Only one gene, ECM1, has been reported for ulcerative colitis alone. We now need to explore these new genes further before we can understand their biological role. However, they clearly demonstrate the importance of innate immunity and autophagy for Crohn's disease and of TH-17 differentiation for ulcerative colitis, Crohn's disease and other inflammatory disorders. Copyright 2009 Elsevier Masson SAS. All rights reserved.
Network-assisted investigation of virulence and antibiotic-resistance systems in Pseudomonas aeruginosa

NASA Astrophysics Data System (ADS)

Hwang, Sohyun; Kim, Chan Yeong; Ji, Sun-Gou; Go, Junhyeok; Kim, Hanhae; Yang, Sunmo; Kim, Hye Jin; Cho, Ara; Yoon, Sang Sun; Lee, Insuk

2016-05-01

Pseudomonas aeruginosa is a Gram-negative bacterium of clinical significance. Although the genome of PAO1, a prototype strain of P. aeruginosa, has been extensively studied, approximately one-third of the functional genome remains unknown. With the emergence of antibiotic-resistant strains of P. aeruginosa, there is an urgent need to develop novel antibiotic and anti-virulence strategies, which may be facilitated by an approach that explores P. aeruginosa gene function in systems-level models. Here, we present a genome-wide functional network of P. aeruginosa genes, PseudomonasNet, which covers 98% of the coding genome, and a companion web server to generate functional hypotheses using various network-search algorithms. We demonstrate that PseudomonasNet-assisted predictions can effectively identify novel genes involved in virulence and antibiotic resistance. Moreover, an antibiotic-resistance network based on PseudomonasNet reveals that P. aeruginosa has common modular genetic organisations that confer increased or decreased resistance to diverse antibiotics, which accounts for the pervasiveness of cross-resistance across multiple drugs. The same network also suggests that P. aeruginosa has developed a mechanism of trade-off in resistance across drugs by altering genetic interactions. Taken together, these results clearly demonstrate the usefulness of a genome-scale functional network to investigate pathogenic systems in P. aeruginosa.
Mutations and polymorphisms in FSH receptor: functional implications in human reproduction.

PubMed

Desai, Swapna S; Roy, Binita Sur; Mahale, Smita D

2013-12-01

FSH brings about its physiological actions by activating a specific receptor located on target cells. Normal functioning of the FSH receptor (FSHR) is crucial for follicular development and estradiol production in females and for the regulation of Sertoli cell function and spermatogenesis in males. In the last two decades, a number of inactivating and activating mutations, single nucleotide polymorphisms, and spliced variants of the FSHR gene have been identified in selected infertile cases. Information on genotype-phenotype correlation and in vitro functional characterization of the mutants has helped in understanding the possible genetic cause for female infertility in affected individuals. The information is also being used to dissect various extracellular and intracellular events involved in hormone-receptor interaction by studying the differences in the properties of the mutant receptor when compared with the WT receptor. Studies on polymorphisms in the FSHR gene have shown variability in clinical outcome among women treated with FSH. These observations are being explored to develop molecular markers to predict the optimum dose of FSH required for controlled ovarian hyperstimulation. Pharmacogenetics is an emerging field in this area that aims at designing individual treatment protocols for reproductive abnormalities based on FSHR gene polymorphisms. The present review discusses the current knowledge of various genetic alterations in FSHR and their impact on receptor function in the female reproductive system.
Oil palm phenolics confer neuroprotective effects involving cognitive and motor functions in mice

PubMed Central

Leow, Soon-Sen; Sekaran, Shamala Devi; Tan, YewAi; Sundram, Kalyana; Sambanthamurthi, Ravigadevi

2013-01-01

Objectives: Phenolics are important phytochemicals which have positive effects on chronic diseases, including neurodegenerative ailments. The oil palm (Elaeis guineensis) is a rich source of water-soluble phenolics. This study was carried out to discover the effects of administering oil palm phenolics (OPP) to mice, with the aim of identifying whether these compounds possess significant neuroprotective properties. Methods: OPP was given to BALB/c mice on a normal diet as fluids for 6 weeks while the controls were given distilled water. These animals were tested in a water maze and on a rotarod weekly to assess the effects of OPP on cognitive and motor functions, respectively. Using Illumina microarrays, we further explored the brain gene expression changes caused by OPP in order to determine the molecular mechanisms involved. Real-time quantitative reverse transcription-polymerase chain reaction experiments were then carried out to validate the microarray data. Results: We found that mice given OPP showed better cognitive function and spatial learning when tested in a water maze, and their performance also improved when tested on a rotarod, possibly due to better motor function and balance. Microarray gene expression analysis showed that these compounds up-regulated genes involved in brain development and activity, such as those under the regulation of the brain-derived neurotrophic factor. OPP also down-regulated genes involved in inflammation. Discussion: These results suggest that the improvement of mouse cognitive and motor functions by OPP is caused by the neuroprotective and anti-inflammatory effects of the extract. PMID:23433062
Assessing the genetic diversity of Cu resistance in mine tailings through high-throughput recovery of full-length copA genes

PubMed Central

Li, Xiaofang; Zhu, Yong-Guan; Shaban, Babak; Bruxner, Timothy J. C.; Bond, Philip L.; Huang, Longbin

2015-01-01

Characterizing the genetic diversity of microbial copper (Cu) resistance at the community level remains challenging, mainly due to the polymorphism of the core functional gene copA. In this study, a local BLASTN method using a copA database built in this study was developed to recover full-length putative copA sequences from an assembled tailings metagenome; these sequences were then screened for potentially functioning CopA using conserved metal-binding motifs, inferred by evolutionary trace analysis of CopA sequences from known Cu resistant microorganisms. In total, 99 putative copA sequences were recovered from the tailings metagenome, out of which 70 were found with high potential to be functioning in Cu resistance. Phylogenetic analysis of selected copA sequences detected in the tailings metagenome showed that topology of the copA phylogeny is largely congruent with that of the 16S-based phylogeny of the tailings microbial community obtained in our previous study, indicating that the development of copA diversity in the tailings might be mainly through vertical descent with few lateral gene transfer events. The method established here can be used to explore copA (and potentially other metal resistance genes) diversity in any metagenome and has the potential to exhaust the full-length gene sequences for downstream analyses. PMID:26286020
Differential maturation of rhythmic clock gene expression during early development in medaka (Oryzias latipes).

PubMed

Cuesta, Ines H; Lahiri, Kajori; Lopez-Olmeda, Jose Fernando; Loosli, Felix; Foulkes, Nicholas S; Vallone, Daniela

2014-05-01

One key challenge for the field of chronobiology is to identify how circadian clock function emerges during early embryonic development. Teleosts such as the zebrafish are ideal models for studying circadian clock ontogeny since the entire process of development occurs ex utero in an optically transparent chorion. Medaka (Oryzias latipes) represents another powerful fish model for exploring early clock function with, like the zebrafish, many tools available for detailed genetic analysis. However, to date there have been no reports documenting circadian clock gene expression during medaka development. Here we have characterized the expression of key clock genes in various developmental stages and in adult tissues of medaka. As previously reported for other fish, light dark cycles are required for the emergence of clock gene expression rhythms in this species. While rhythmic expression of per and cry genes is detected very early during development and seems to be light driven, rhythmic clock and bmal expression appears much later around hatching time. Furthermore, the maturation of clock function seems to correlate with the appearance of rhythmic expression of these positive elements of the clock feedback loop. By accelerating development through elevated temperatures or by artificially removing the chorion, we show an earlier onset of rhythmicity in clock and bmal expression. Thus, differential maturation of key elements of the medaka clock mechanism depends on the developmental stage and the presence of the chorion.
A functional polymorphism in the promoter region of MAOA gene is associated with daytime sleepiness in healthy subjects.

PubMed

Ojeda, Diego A; Niño, Carmen L; López-León, Sandra; Camargo, Andrés; Adan, Ana; Forero, Diego A

2014-02-15

Excessive daytime sleepiness (EDS) is one of the main causes of car and industrial accidents and it is associated with increased morbidity and alterations in quality of life. Prevalence of EDS in the general population around the world ranges from 6.2 to 32.4%, with a heritability of 38-40%. However, few studies have explored the role of candidate genes in EDS. The monoamine oxidase A (MAOA) gene has an important role in the regulation of neurotransmitter levels and a large number of human behaviors. We hypothesized that a functional VNTR in the promoter region of the MAOA gene might be associated with daytime sleepiness in healthy individuals. The Epworth sleepiness scale (ESS) was applied to 210 healthy Colombian subjects (university students), who were genotyped for MAOA-uVNTR. MAOA-uVNTR showed a significant association with ESS scores (p = 0.01): 3/3 genotype carriers had the lowest scores. These results were supported by differences in MAOA-uVNTR frequencies between diurnal somnolence categories (p = 0.03). Our finding provides evidence for the first time that MAOA-uVNTR has a significant association with EDS in healthy subjects. Finally, these data suggest that functional variations in the MAOA gene could have a role in other phenotypes of neuropsychiatric relevance. Copyright © 2013 Elsevier B.V. All rights reserved.
Fanconi anemia: causes and consequences of genetic instability.

PubMed

Kalb, R; Neveling, K; Nanda, I; Schindler, D; Hoehn, H

2006-01-01

Fanconi anemia (FA) is a rare recessive disease that reflects the cellular and phenotypic consequences of genetic instability: growth retardation, congenital malformations, bone marrow failure, high risk of neoplasia, and premature aging. At the cellular level, manifestations of genetic instability include chromosomal breakage, cell cycle disturbance, and increased somatic mutation rates. FA cells are exquisitely sensitive towards oxygen and alkylating drugs such as mitomycin C or diepoxybutane, pointing to a function of FA genes in the defense against reactive oxygen species and other DNA damaging agents. FA is caused by biallelic mutations in at least 12 different genes which appear to function in the maintenance of genomic stability. Eight of the FA proteins form a nuclear core complex with a catalytic function involving ubiquitination of the central FANCD2 protein. The posttranslational modification of FANCD2 promotes its accumulation in nuclear foci, together with known DNA maintenance proteins such as BRCA1, BRCA2, and the RAD51 recombinase. Biallelic mutations in BRCA2 cause a severe FA-like phenotype, as do biallelic mutations in FANCD2. In fact, only leaky or hypomorphic mutations in this central group of FA genes appear to be compatible with live birth and survival. The newly discovered FANCJ (= BRIP1) and FANCM (= Hef) genes correspond to known DNA-maintenance genes (a helicase and a helicase-associated endonuclease for fork-structured DNA, respectively). These genes provide the most convincing evidence to date of a direct involvement of FA genes in DNA repair functions associated with the resolution of DNA crosslinks and stalled replication forks. Even though genetic instability caused by mutational inactivation of the FANC genes has detrimental effects for the majority of FA patients, around 20% of patients appear to benefit from genetic instability since genetic instability also increases the chance of somatic reversion of their constitutional mutations. Intragenic crossover, gene conversion, back mutation and compensating mutations in cis have all been observed in revertant, and, consequently, mosaic FA-patients, leading to improved bone marrow function. There probably is no other experiment of nature in our species in which causes and consequences of genetic instability, including the role of reactive oxygen species, can be better documented and explored than in FA.

Executive control in schizophrenia: a preliminary study on the moderating role of COMT Val158Met for comorbid alcohol and substance use disorders.

PubMed

Carrà, Giuseppe; Nicolini, Gabriella; Crocamo, Cristina; Lax, Annamaria; Amidani, Francesca; Bartoli, Francesco; Castellano, Filippo; Chiorazzi, Alessia; Gamba, Giulia; Papagno, Costanza; Clerici, Massimo

2017-07-01

A functional polymorphism in the catechol-O-methyltransferase (COMT) gene (Val158Met) appears to influence cognition in people with alcohol/substance use disorders (AUD/SUD) and in those with psychosis. To explore the potential moderating effect of these factors, a cross-sectional study was conducted, randomly recruiting subjects with a DSM-IV diagnosis of schizophrenia. AUD/SUD was rigorously assessed, as well as COMT Val158Met polymorphism. Executive control functioning was measured using the Intra-Extra Dimensional Set Shift (IED). The effect of a possible interaction between comorbid AUD/SUD and COMT Val158Met polymorphism on IED scores was explored. Subjects with schizophrenia and comorbid AUD/SUD who were MetMet carriers for SNP rs4680 of the COMT gene showed worse performance on IED completed stages scores than individuals with the ValVal genotype. However, among subjects without AUD/SUD, those with the MetMet variant performed better than people carrying the ValVal genotype. This study is the first to date examining the impact of COMT on cognition in a highly representative sample of people with schizophrenia and comorbid AUD/SUD. Differential moderating effects of COMT Val/Met genotype variations may similarly influence executive functions in people with schizophrenia and comorbid AUD/SUD.
Horizontal Transfers and Gene Losses in the Phospholipid Pathway of Bartonella Reveal Clues about Early Ecological Niches

PubMed Central

Zhu, Qiyun; Kosoy, Michael; Olival, Kevin J.; Dittmar, Katharina

2014-01-01

Bartonellae are mammalian pathogens vectored by blood-feeding arthropods. Although of increasing medical importance, little is known about their ecological past, and host associations are underexplored. Previous studies suggest an influence of horizontal gene transfers in ecological niche colonization by acquisition of host pathogenicity genes. We here expand these analyses to metabolic pathways of 28 Bartonella genomes, and experimentally explore the distribution of bartonellae in 21 species of blood-feeding arthropods. Across genomes, repeated gene losses and horizontal gains in the phospholipid pathway were found. The evolutionary timing of these patterns suggests functional consequences likely leading to an early intracellular lifestyle for stem bartonellae. Comparative phylogenomic analyses discover three independent lineage-specific reacquisitions of a core metabolic gene—NAD(P)H-dependent glycerol-3-phosphate dehydrogenase (gpsA)—from Gammaproteobacteria and Epsilonproteobacteria. Transferred genes are significantly closely related to invertebrate Arsenophonus-, and Serratia-like endosymbionts, and mammalian Helicobacter-like pathogens, supporting a cellular association with arthropods and mammals at the base of extant Bartonella spp. Our studies suggest that the horizontal reacquisitions had a key impact on bartonellae lineage specific ecological and functional evolution. PMID:25106622
Updated regulation curation model at the Saccharomyces Genome Database

PubMed Central

Engel, Stacia R; Skrzypek, Marek S; Hellerstedt, Sage T; Wong, Edith D; Nash, Robert S; Weng, Shuai; Binkley, Gail; Sheppard, Travis K; Karra, Kalpana; Cherry, J Michael

2018-01-01

The Saccharomyces Genome Database (SGD) provides comprehensive, integrated biological information for the budding yeast Saccharomyces cerevisiae, along with search and analysis tools to explore these data, enabling the discovery of functional relationships between sequence and gene products in fungi and higher organisms. We have recently expanded our data model for regulation curation to address regulation at the protein level in addition to transcription, and are presenting the expanded data on the ‘Regulation’ pages at SGD. These pages include a summary describing the context under which the regulator acts, manually curated and high-throughput annotations showing the regulatory relationships for that gene and a graphical visualization of its regulatory network and connected networks. For genes whose products regulate other genes or proteins, the Regulation page includes Gene Ontology enrichment analysis of the biological processes in which those targets participate. For DNA-binding transcription factors, we also provide other information relevant to their regulatory function, such as DNA binding site motifs and protein domains. As with other data types at SGD, all regulatory relationships and accompanying data are available through YeastMine, SGD’s data warehouse based on InterMine. Database URL: http://www.yeastgenome.org PMID:29688362
An effector peptide family required for Drosophila Toll-mediated immunity.

PubMed

Clemmons, Alexa W; Lindsay, Scott A; Wasserman, Steven A

2015-04-01

In Drosophila melanogaster, recognition of an invading pathogen activates the Toll or Imd signaling pathway, triggering robust upregulation of innate immune effectors. Although the mechanisms of pathogen recognition and signaling are now well understood, the functions of the immune-induced transcriptome and proteome remain much less well characterized. Through bioinformatic analysis of effector gene sequences, we have defined a family of twelve genes - the Bomanins (Boms) - that are specifically induced by Toll and that encode small, secreted peptides of unknown biochemical activity. Using targeted genome engineering, we have deleted ten of the twelve Bom genes. Remarkably, inactivating these ten genes decreases survival upon microbial infection to the same extent, and with the same specificity, as does eliminating Toll pathway function. Toll signaling, however, appears unaffected. Assaying bacterial load post-infection in wild-type and mutant flies, we provide evidence that the Boms are required for resistance to, rather than tolerance of, infection. In addition, by generating and assaying a deletion of a smaller subset of the Bom genes, we find that there is overlap in Bom activity toward particular pathogens. Together, these studies deepen our understanding of Toll-mediated immunity and provide a new in vivo model for exploration of the innate immune effector repertoire.
Violent suicidal behaviour in bipolar disorder is associated with nitric oxide synthase 3 gene polymorphism.

PubMed

Oliveira, J; Debnath, M; Etain, B; Bennabi, M; Hamdani, N; Lajnef, M; Bengoufa, D; Fortier, C; Boukouaci, W; Bellivier, F; Kahn, J-P; Henry, C; Charron, D; Krishnamoorthy, R; Leboyer, M; Tamouza, R

2015-09-01

Given the importance of the nitric oxide system in oxidative stress, inflammation, neurotransmission and cerebrovascular tone regulation, we postulated its potential dysfunction in bipolar disorder (BD) and suicide. By simultaneously analysing variants of three isoforms of nitric oxide synthase (NOS) genes, we explored interindividual genetic liability to suicidal behaviour in BD. A total of 536 patients with BD (DSM-IV) and 160 healthy controls were genotyped for functionally relevant NOS1, NOS2 and NOS3 polymorphisms. History of suicidal behaviour and violent suicide attempt was documented for 511 patients with BD. The chi-squared test was used to perform genetic association analyses and logistic regression to test for gene-gene interactions. The NOS3 rs1799983 T homozygous state was associated with violent suicide attempts (26.4% vs. 10.8%, in patients and controls, P = 0.002, corrected P (Pc) = 0.004, OR: 2.96, 95% CI: 1.33-6.34), and this association was restricted to the early-onset BD subgroup (37.9% vs. 10.8%, in early-onset BD and controls, P = 0.0003, Pc = 0.0006, OR: 5.05, 95% CI: 1.95-12.45), while we found no association with BD per se and no gene-gene interactions. Our results bring further evidence for the potential involvement of endothelial NOS gene variants in susceptibility to suicidal behaviour. Future exploration of this pathway in a larger cohort with suicidal behaviour is warranted. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
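As a rough consistency check on the figures in the abstract above, the odds ratios can be approximately recovered from the reported genotype frequencies alone. The sketch below is illustrative only: the function name `odds_ratio` is hypothetical, the inputs are the rounded percentages quoted in the abstract rather than the underlying genotype counts (which the abstract does not give), and it does not reproduce the authors' chi-squared test or confidence intervals.

```python
def odds_ratio(p_cases: float, p_controls: float) -> float:
    """Odds ratio computed from two proportions (fractions between 0 and 1)."""
    odds_cases = p_cases / (1.0 - p_cases)
    odds_controls = p_controls / (1.0 - p_controls)
    return odds_cases / odds_controls


# Frequencies of the NOS3 rs1799983 T homozygous state as quoted in the
# abstract (rounded to one decimal place, so small deviations are expected).
or_violent = odds_ratio(0.264, 0.108)      # violent suicide attempts vs. controls
or_early_onset = odds_ratio(0.379, 0.108)  # early-onset BD subgroup vs. controls

print(f"violent suicide attempts vs. controls: OR ~ {or_violent:.2f}")
print(f"early-onset BD subgroup vs. controls:  OR ~ {or_early_onset:.2f}")
```

Both values land close to the reported odds ratios of 2.96 and 5.05; the small difference in the second is presumably due to the rounding of the published percentages.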
Functional preservation and variation in the cone opsin genes of nocturnal tarsiers

PubMed Central

Ong, Perry S.; Perry, George H.

2017-01-01

The short-wavelength sensitive (S-) opsin gene OPN1SW is pseudogenized in some nocturnal primates and retained in others, enabling dichromatic colour vision. Debate on the functional significance of this variation has focused on dark conditions, yet many nocturnal species initiate activity under dim (mesopic) light levels that can support colour vision. Tarsiers are nocturnal, twilight-active primates and exemplary visual predators; they also express different colour vision phenotypes, raising the possibility of discrete adaptations to mesopic conditions. To explore this premise, we conducted a field study in two stages. First, to estimate the level of functional constraint on colour vision, we sequenced OPN1SW in 12 wild-caught Philippine tarsiers (Tarsius syrichta). Second, to explore whether the dichromatic visual systems of Philippine and Bornean (Tarsius bancanus) tarsiers—which express alternate versions of the medium/long-wavelength sensitive (M/L-) opsin gene OPN1MW/OPN1LW—confer differential advantages specific to their respective habitats, we used twilight and moonlight conditions to model the visual contrasts of invertebrate prey. We detected a signature of purifying selection for OPN1SW, indicating that colour vision confers an adaptive advantage to tarsiers. However, this advantage extends to a relatively small proportion of prey–background contrasts, and mostly brown arthropod prey amid leaf litter. We also found that the colour vision of T. bancanus is advantageous for discriminating prey under twilight that is enriched in shorter (bluer) wavelengths, a plausible idiosyncrasy of understorey habitats in Borneo. This article is part of the themed issue ‘Vision in dim light’. PMID:28193820
[Change of chart genes expression in small intestines of mouse induced by electromagnetic pulse irradiation].

PubMed

Ren, Dongqing; Jin, Juan; Li, Xiaojuan; Zeng, Guiying

2008-01-01

To explore the bio-effects of electromagnetic pulse (EMP) irradiation on the mouse small intestine by means of a gene chip, twelve BALB/c mice were randomly assigned to a normal control group and an EMP group, with 6 in each group. The EMP group was irradiated with 200 pulses of EMP at 200 kV/m. Eighteen hours after the irradiation, the mice were sacrificed and the jejunum of the small intestine was removed. Fluorescent cDNA probes labeled with Cy3 and Cy5 were prepared from RNA extracted from the intestines of the two groups. Probes of the two groups were then hybridized against a cDNA gene chip, the fluorescent signals were scanned with a scanner and the results were analyzed by computer. Compared with the control, 56 genes in the gene expression profile were altered. The expression levels of 37 genes were distinctly up-regulated, while 19 genes were significantly down-regulated. Among the 56 genes, 19 had known or inferred functions: 12 up-regulated genes were catenin alpha 1 (alpha-catenin), ly-6 alloantigen (Ly-6E), fructose-6-phosphate transaminase (GF6P), ribosomal protein S17 (rpS17), small proline-rich protein 2A (Sprr2a), glandular kallikrein 27 (GK27), lipoxygenase-3, aldo-keto reductase (Akr1c12), GSG1, amylase 2 (Amy2), elastase 2 and the p6-5 gene, and 7 down-regulated genes were junctional adhesion molecule (Jam), protein arginine methyltransferase (Carm1), NNP-1, 2-5 A synthetase L2, the Mlark gene, ATP synthase alpha subunit and the uncoupling protein-2 (Ucp2) gene; the other 37 had unknown functions. EMP irradiation could induce specific expression of some genes in the mouse small intestine, and most of these genes were up-regulated.
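The gene counts quoted in the abstract above can be tallied to confirm that they are internally consistent. This is only a bookkeeping sketch: the list and variable names are hypothetical, and the gene names are copied from the abstract as written rather than mapped to current official symbols.

```python
# Genes with known or inferred functions, as listed in the abstract.
up_known = [
    "catenin alpha 1", "Ly-6E", "GF6P", "rpS17", "Sprr2a", "GK27",
    "lipoxygenase-3", "Akr1c12", "GSG1", "Amy2", "elastase 2", "p6-5",
]
down_known = [
    "Jam", "Carm1", "NNP-1", "2-5 A synthetase L2", "Mlark",
    "ATP synthase alpha subunit", "Ucp2",
]

# Totals reported in the abstract.
total_altered = 56
up_total = 37
down_total = 19

known_total = len(up_known) + len(down_known)
unknown_total = total_altered - known_total

print(f"up + down = {up_total + down_total} (reported total: {total_altered})")
print(f"known function: {known_total}, unknown function: {unknown_total}")
```

The tally matches the abstract: 37 up-regulated plus 19 down-regulated genes give the 56 altered genes, and the 12 plus 7 named genes leave 37 with unknown functions.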
Information Propagation in Developmental Enhancers

NASA Astrophysics Data System (ADS)

Jena, Siddhartha; Levine, Michael

Rather than encoding information about protein sequence, certain lengths of noncoding DNA, called enhancers, interact with protein machinery such as transcription factors to precisely regulate gene expression. Enhancers have been studied extensively in the fruit fly Drosophila melanogaster, where they regulate the expression of developmental genes that establish the blueprint of the adult fly. It has been suggested that enhancer sequences possess a specific but unknown syntax with regards to the placement and strength of transcription factor binding sites. Moreover, studies in divergent fly species have shown that compensatory evolution allows for maintenance of enhancer functionality despite considerable variation in primary DNA sequence. Here, the possible role of enhancers as signal processing modules is studied as a way of explaining these two findings. We first demonstrate how this framework can be used to explain the fine-tuned spatiotemporal dynamics of gene expression. We then explore the evolutionary pressure on enhancer sequences and the resulting emergence of enhancers that are linked by compensatory mutations. This study provides a possible mechanism for the function of multiple enhancers linked to a single gene.
A Rare SNP Identified a TCP Transcription Factor Essential for Tendril Development in Cucumber.

PubMed

Wang, Shenhao; Yang, Xueyong; Xu, Mengnan; Lin, Xingzhong; Lin, Tao; Qi, Jianjian; Shao, Guangjin; Tian, Nana; Yang, Qing; Zhang, Zhonghua; Huang, Sanwen

2015-12-07

Rare genetic variants are abundant in genomes but less tractable in genome-wide association studies. Here we exploit a strategy of rare variation mapping to discover a gene essential for tendril development in cucumber (Cucumis sativus L.). In a collection of >3000 lines, we discovered a unique tendril-less line that forms branches instead of tendrils and, therefore, loses its climbing ability. We hypothesized that this unusual phenotype was caused by a rare variation and subsequently identified the causative single nucleotide polymorphism. The affected gene TEN encodes a TCP transcription factor conserved within the cucurbits and is expressed specifically in tendrils, representing a new organ identity gene. The variation occurs within a protein motif unique to the cucurbits and impairs its function as a transcriptional activator. Analyses of transcriptomes from near-isogenic lines identified downstream genes required for the tendril's capability to sense and climb a support. This study provides an example of how rare functional variants in plant genomes can be explored. Copyright © 2015 The Author. Published by Elsevier Inc. All rights reserved.
Current Understanding of Usher Syndrome Type II

PubMed Central

Yang, Jun; Wang, Le; Song, Hongman; Sokolov, Maxim

2012-01-01

Usher syndrome is the most common deafness-blindness caused by genetic mutations. To date, three genes have been identified underlying the most prevalent form of Usher syndrome, the type II form (USH2). The proteins encoded by these genes are demonstrated to form a complex in vivo. This complex is localized mainly at the periciliary membrane complex in photoreceptors and the ankle-link of the stereocilia in hair cells. Many proteins have been found to interact with USH2 proteins in vitro, suggesting that they are potential additional components of this USH2 complex and that the genes encoding these proteins may be the candidate USH2 genes. However, further investigations are critical to establish their existence in the USH2 complex in vivo. Based on the predicted functional domains in USH2 proteins, their cellular localizations in photoreceptors and hair cells, the observed phenotypes in USH2 mutant mice, and the known knowledge about diseases similar to USH2, putative biological functions of the USH2 complex have been proposed. Finally, therapeutic approaches for this group of diseases are now being actively explored. PMID:22201796
Hierarchy within the mammary STAT5-driven Wap super-enhancer.

PubMed

Shin, Ha Youn; Willi, Michaela; HyunYoo, Kyung; Zeng, Xianke; Wang, Chaochen; Metser, Gil; Hennighausen, Lothar

2016-08-01

Super-enhancers comprise dense transcription factor platforms highly enriched for active chromatin marks. A paucity of functional data led us to investigate the role of super-enhancers in the mammary gland, an organ characterized by exceptional gene regulatory dynamics during pregnancy. ChIP-seq analysis for the master regulator STAT5A, the glucocorticoid receptor, H3K27ac and MED1 identified 440 mammary-specific super-enhancers, half of which were associated with genes activated during pregnancy. We interrogated the Wap super-enhancer, generating mice carrying mutations in STAT5-binding sites within its constituent enhancers. Individually, the most distal site displayed the greatest enhancer activity. However, combinatorial mutation analysis showed that the 1,000-fold induction in gene expression during pregnancy relied on all enhancers. Disabling the binding sites of STAT5, NFIB and ELF5 in the proximal enhancer incapacitated the entire super-enhancer. Altogether, these data suggest a temporal and functional enhancer hierarchy. The identification of mammary-specific super-enhancers and the mechanistic exploration of the Wap locus provide insights into the regulation of cell-type-specific expression of hormone-sensing genes.
Enriching regulatory networks by bootstrap learning using optimised GO-based gene similarity and gene links mined from PubMed abstracts

DOE Office of Scientific and Technical Information (OSTI.GOV)

Taylor, Ronald C.; Sanfilippo, Antonio P.; McDermott, Jason E.

2011-02-18

Transcriptional regulatory networks are being determined using “reverse engineering” methods that infer connections based on correlations in gene state. Corroboration of such networks through independent means such as evidence from the biomedical literature is desirable. Here, we explore a novel approach, a bootstrapping version of our previous Cross-Ontological Analytic method (XOA) that can be used for semi-automated annotation and verification of inferred regulatory connections, as well as for discovery of additional functional relationships between the genes. First, we use our annotation and network expansion method on a biological network learned entirely from the literature. We show how new relevant links between genes can be iteratively derived using a gene similarity measure based on the Gene Ontology that is optimized on the input network at each iteration. Second, we apply our method to annotation, verification, and expansion of a set of regulatory connections found by the Context Likelihood of Relatedness algorithm.
A pathway-based network analysis of hypertension-related genes

NASA Astrophysics Data System (ADS)

Wang, Huan; Hu, Jing-Bo; Xu, Chuan-Yun; Zhang, De-Hai; Yan, Qian; Xu, Ming; Cao, Ke-Fei; Zhang, Xu-Sheng

2016-02-01

The complex network approach has become an effective way to describe interrelationships among large amounts of biological data, and is especially useful in finding core functions and global behavior of biological systems. Hypertension is a complex disease with many causes, including genetic, physiological, psychological and even social factors. In this paper, based on the information of biological pathways, we construct a network model of hypertension-related genes of the salt-sensitive rat to explore the interrelationship between genes. Statistical and topological characteristics show that the network has the small-world but not the scale-free property, and exhibits a modular structure, revealing compact and complex connections among these genes. Using a threshold of integrated centrality larger than 0.71, seven key hub genes are found: Jun, Rps6kb1, Cycs, Creb312, Cdk4, Actg1 and RT1-Da. These genes should play an important role in hypertension, suggesting that the treatment of hypertension should focus on the combination of drugs on multiple genes.
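The hub-selection step described above (keep genes whose centrality exceeds a cutoff) can be illustrated with a minimal sketch. The abstract does not define its "integrated centrality", so the code below substitutes plain degree centrality, normalized by the largest degree, purely as a stand-in; the edge list and node names are hypothetical and are not data from the study.

```python
def degree_centrality(edges):
    """Degree of each node divided by the largest degree in the network.

    Stand-in for the abstract's undefined "integrated centrality".
    Assumes each undirected edge appears once in `edges`.
    """
    degree = {}
    for a, b in edges:
        degree[a] = degree.get(a, 0) + 1
        degree[b] = degree.get(b, 0) + 1
    top = max(degree.values())
    return {node: d / top for node, d in degree.items()}


def hub_genes(centrality, cutoff=0.71):
    """Return, in sorted order, the nodes whose centrality exceeds the cutoff."""
    return sorted(node for node, c in centrality.items() if c > cutoff)


# Hypothetical toy network: node names are placeholders, not genes from the paper.
toy_edges = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C"), ("D", "E")]
toy_hubs = hub_genes(degree_centrality(toy_edges))
```

Only the cutoff value of 0.71 is taken from the abstract; any real analysis would need the authors' own centrality definition.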
Functional Characteristics of the Flying Squirrel's Cecal Microbiota under a Leaf-Based Diet, Based on Multiple Meta-Omic Profiling

PubMed Central

Lu, Hsiao-Pei; Liu, Po-Yu; Wang, Yu-bin; Hsieh, Ji-Fan; Ho, Han-Chen; Huang, Shiao-Wei; Lin, Chung-Yen; Hsieh, Chih-hao; Yu, Hon-Tsen

2018-01-01

Mammalian herbivores rely on microbial activities in an expanded gut chamber to convert plant biomass into absorbable nutrients. Distinct from ruminants, small herbivores typically have a simple stomach but an enlarged cecum to harbor symbiotic microbes; however, knowledge of this specialized gut structure and characteristics of its microbial contents is limited. Here, we used leaf-eating flying squirrels as a model to explore functional characteristics of the cecal microbiota adapted to a high-fiber, toxin-rich diet. Specifically, environmental conditions across gut regions were evaluated by measuring mass, pH, feed particle size, and metabolomes. Then, parallel metagenomes and metatranscriptomes were used to detect microbial functions corresponding to the cecal environment. Based on metabolomic profiles, >600 phytochemical compounds were detected, although many were present only in the foregut and probably degraded or transformed by gut microbes in the hindgut. Based on metagenomic (DNA) and metatranscriptomic (RNA) profiles, taxonomic compositions of the cecal microbiota were dominated by bacteria of the Firmicutes taxa; they contained major gene functions related to degradation and fermentation of leaf-derived compounds. Based on functional compositions, genes related to multidrug exporters were rich in microbial genomes, whereas genes involved in nutrient importers were rich in microbial transcriptomes. In addition, genes encoding chemotaxis-associated components and glycoside hydrolases specific for plant beta-glycosidic linkages were abundant in both DNA and RNA. This exploratory study provides findings which may help to form molecular-based hypotheses regarding functional contributions of symbiotic gut microbiota in small herbivores with folivorous dietary habits. PMID:29354108
Multi-breed and multi-trait co-association analysis of meat tenderness and other meat quality traits in three French beef cattle breeds.

PubMed

Ramayo-Caldas, Yuliaxis; Renand, Gilles; Ballester, Maria; Saintilan, Romain; Rocha, Dominique

2016-04-23

Studies to identify markers associated with beef tenderness have focused on Warner-Bratzler shear force (WBSF) but the interplay between the genes associated with WBSF has not been explored. We used the association weight matrix (AWM), a systems biology approach, to identify a set of interacting genes that are co-associated with tenderness and other meat quality traits, and shared across the Charolaise, Limousine and Blonde d'Aquitaine beef cattle breeds. Genome-wide association studies were performed using ~500K single nucleotide polymorphisms (SNPs) and 17 phenotypes measured on more than 1000 animals for each breed. First, this multi-trait approach was applied separately for each breed across 17 phenotypes and second, between- and across-breed comparisons at the AWM and functional levels were performed. Genetic heterogeneity was observed, and most of the variants that were associated with WBSF segregated within rather than across breeds. We identified 206 common candidate genes associated with WBSF across the three breeds. SNPs in these common genes explained between 28 and 30 % of the phenotypic variance for WBSF. A reduced number of common SNPs mapping to the 206 common genes were identified, suggesting that different mutations may target the same genes in a breed-specific manner. Therefore, it is likely that, depending on allele frequencies and linkage disequilibrium patterns, a SNP that is identified for one breed may not be informative for another unrelated breed. Well-known candidate genes affecting beef tenderness were identified. In addition, some of the 206 common genes are located within previously reported quantitative trait loci for WBSF in several cattle breeds. Moreover, the multi-breed co-association analysis detected new candidate genes, regulators and metabolic pathways that are likely involved in the determination of meat tenderness and other meat quality traits in beef cattle. 
Our results suggest that systems biology approaches that explore associations of correlated traits increase statistical power to identify candidate genes beyond the one-dimensional approach. Further studies on the 206 common genes, their pathways, regulators and interactions will expand our knowledge on the molecular basis of meat tenderness and could lead to the discovery of functional mutations useful for genomic selection in a multi-breed beef cattle context.
MyGeneFriends: A Social Network Linking Genes, Genetic Diseases, and Researchers

PubMed Central

Allot, Alexis; Chennen, Kirsley; Nevers, Yannis; Poidevin, Laetitia; Kress, Arnaud; Ripp, Raymond; Thompson, Julie Dawn; Poch, Olivier

2017-01-01

Background The constant and massive increase of biological data offers unprecedented opportunities to decipher the function and evolution of genes and their roles in human diseases. However, the multiplicity of sources and flow of data mean that efficient access to useful information and knowledge production has become a major challenge. This challenge can be addressed by taking inspiration from Web 2.0 and particularly social networks, which are at the forefront of big data exploration and human-data interaction. Objective MyGeneFriends is a Web platform inspired by social networks, devoted to genetic disease analysis, and organized around three types of proactive agents: genes, humans, and genetic diseases. The aim of this study was to improve exploration and exploitation of biological, postgenomic era big data. Methods MyGeneFriends leverages conventions popularized by top social networks (Facebook, LinkedIn, etc), such as networks of friends, profile pages, friendship recommendations, affinity scores, news feeds, content recommendation, and data visualization. Results MyGeneFriends provides simple and intuitive interactions with data through evaluation and visualization of connections (friendships) between genes, humans, and diseases. The platform suggests new friends and publications and allows agents to follow the activity of their friends. It dynamically personalizes information depending on the user’s specific interests and provides an efficient way to share information with collaborators. Furthermore, the user’s behavior itself generates new information that constitutes an added value integrated in the network, which can be used to discover new connections between biological agents. Conclusions We have developed MyGeneFriends, a Web platform leveraging conventions from popular social networks to redefine the relationship between humans and biological big data and improve human processing of biomedical data. 
MyGeneFriends is available at lbgi.fr/mygenefriends. PMID:28623182
Evolution of SUMO Function and Chain Formation in Insects.

PubMed

Ureña, Enric; Pirone, Lucia; Chafino, Silvia; Pérez, Coralia; Sutherland, James D; Lang, Valérie; Rodriguez, Manuel S; Lopitz-Otsoa, Fernando; Blanco, Francisco J; Barrio, Rosa; Martín, David

2016-02-01

SUMOylation, the covalent binding of Small Ubiquitin-like Modifier (SUMO) to target proteins, is a posttranslational modification that regulates critical cellular processes in eukaryotes. In insects, SUMOylation has been studied in holometabolous species, particularly in the dipteran Drosophila melanogaster, which contains a single SUMO gene (smt3). This has led to the assumption that insects contain a single SUMO gene. However, the analysis of insect genomes shows that basal insects contain two SUMO genes, orthologous to vertebrate SUMO1 and SUMO2/3. Our phylogenetic analysis reveals that the SUMO gene was duplicated early in Metazoan evolution, giving rise to the SUMO1 and SUMO2/3 families, and that later in insect evolution the SUMO1 gene was lost after the Hymenoptera divergence. To explore the consequences of this loss, we have examined the characteristics and different biological functions of the two SUMO genes (SUMO1 and SUMO3) in the hemimetabolous cockroach Blattella germanica and compared them with those of Drosophila Smt3. Here, we show that the metamorphic role of the SUMO genes is evolutionarily conserved in insects, although there has been a regulatory switch from SUMO1 in basal insects to SUMO3 in more derived ones. We also show that, unlike vertebrates, insect SUMO3 proteins cannot form polySUMO chains due to the loss of critical lysine residues within the N-terminal part of the protein. Furthermore, the formation of polySUMO chains by expression of ectopic human SUMO3 has a deleterious effect in Drosophila. These findings contribute to the understanding of the functional consequences of the evolution of SUMO genes. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Screening the molecular targets of ovarian cancer based on bioinformatics analysis.

PubMed

Du, Lei; Qian, Xiaolei; Dai, Chenyang; Wang, Lihua; Huang, Ding; Wang, Shuying; Shen, Xiaowei

2015-01-01

Ovarian cancer (OC) is the most lethal gynecologic malignancy. This study aims to explore the molecular mechanisms of OC and identify potential molecular targets for OC treatment. Microarray gene expression data (GSE14407) including 12 normal ovarian surface epithelia samples and 12 OC epithelia samples were downloaded from Gene Expression Omnibus database. Differentially expressed genes (DEGs) between 2 kinds of ovarian tissue were identified by using limma package in R language (|log2 fold change| >1 and false discovery rate [FDR] <0.05). Protein-protein interactions (PPIs) and known OC-related genes were screened from COXPRESdb and GenBank database, respectively. Furthermore, PPI network of top 10 upregulated DEGs and top 10 downregulated DEGs was constructed and visualized through Cytoscape software. Finally, for the genes involved in PPI network, functional enrichment analysis was performed by using DAVID (FDR <0.05). In total, 1136 DEGs were identified, including 544 downregulated and 592 upregulated DEGs. Then, PPI network was constructed, and DEGs CDKN2A, MUC1, OGN, ZIC1, SOX17, and TFAP2A interacted with known OC-related genes CDK4, EGFR/JUN, SRC, CLI1, CTNNB1, and TP53, respectively. Moreover, functions about oxygen transport and embryonic development were enriched by the genes involved in the network of downregulated DEGs. We propose that 4 DEGs (OGN, ZIC1, SOX17, and TFAP2A) and 2 functions (oxygen transport and embryonic development) might play a role in the development of OC. These 4 DEGs and known OC-related genes might serve as therapeutic targets for OC. Further studies are required to validate these predictions.
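The DEG selection rule quoted above (absolute log2 fold change above 1 and FDR below 0.05) can be sketched as follows. This is only an illustration of the thresholding step: it does not reproduce limma's moderated statistics, it assumes the FDR is obtained by Benjamini-Hochberg adjustment (the abstract does not say which correction was used), and the gene labels and numbers are hypothetical.

```python
from typing import NamedTuple


class GeneStat(NamedTuple):
    gene: str
    log2_fold_change: float
    p_value: float


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    n = len(p_values)
    order = sorted(range(n), key=lambda i: p_values[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that adjusted
    # values never increase as the raw p-value decreases.
    for rank in range(n, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * n / rank)
        adjusted[i] = running_min
    return adjusted


def select_degs(stats, lfc_cutoff=1.0, fdr_cutoff=0.05):
    """Split genes passing both cutoffs into (upregulated, downregulated)."""
    fdr = benjamini_hochberg([s.p_value for s in stats])
    up, down = [], []
    for s, q in zip(stats, fdr):
        if q < fdr_cutoff and abs(s.log2_fold_change) > lfc_cutoff:
            (up if s.log2_fold_change > 0 else down).append(s.gene)
    return up, down


# Hypothetical toy input; these are not values from GSE14407.
toy_stats = [
    GeneStat("g1", 2.0, 0.001),
    GeneStat("g2", -1.5, 0.002),
    GeneStat("g3", 3.0, 0.5),
    GeneStat("g4", 0.5, 0.04),
]
toy_up, toy_down = select_degs(toy_stats)
```

In the toy input, g3 fails the FDR cutoff and g4 fails the fold-change cutoff, so only g1 (up) and g2 (down) are retained.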
Revealing Alzheimer's disease genes spectrum in the whole-genome by machine learning.

PubMed

Huang, Xiaoyan; Liu, Hankui; Li, Xinming; Guan, Liping; Li, Jiankang; Tellier, Laurent Christian Asker M; Yang, Huanming; Wang, Jian; Zhang, Jianguo

2018-01-10

Alzheimer's disease (AD) is an important, progressive neurodegenerative disease with a complex genetic architecture. A key goal of biomedical research is to seek out disease risk genes and to elucidate the function of these risk genes in the development of disease. For this purpose, expanding the AD-associated gene set is necessary. In past research, prediction methods for AD-related genes have been limited in their exploration of the target genome regions. We here present a genome-wide method for AD candidate gene prediction. We present a machine learning approach (SVM), based upon integrating gene expression data with human brain-specific gene network data, to discover the full spectrum of AD genes across the whole genome. We classified AD candidate genes with an accuracy of 84.56% and an area under the receiver operating characteristic (ROC) curve of 94%. Our approach provides a supplement for the spectrum of AD-associated genes extracted from more than 20,000 genes on a genome-wide scale. In this study, we have elucidated the whole-genome spectrum of AD using a machine learning approach. Through this method, we expect the candidate gene catalogue to provide a more comprehensive annotation of AD for researchers.
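The two figures reported above, accuracy and area under the ROC curve, measure different things: accuracy depends on a decision threshold, while AUC depends only on how well the scores rank positives above negatives. A minimal sketch of both metrics follows, assuming a classifier that emits a real-valued decision score per gene (as an SVM does); the labels and scores are hypothetical and unrelated to the study's data.

```python
def accuracy(labels, scores, threshold=0.0):
    """Fraction of items whose thresholded score matches the 0/1 label."""
    predictions = [1 if s > threshold else 0 for s in scores]
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def roc_auc(labels, scores):
    """AUC as the probability that a random positive outscores a random
    negative, counting ties as one half (the Mann-Whitney formulation)."""
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    wins = 0.0
    for p in positives:
        for q in negatives:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical labels (1 = disease gene) and decision scores.
toy_labels = [1, 1, 0, 0]
toy_scores = [0.9, -0.2, 0.1, -0.8]
toy_accuracy = accuracy(toy_labels, toy_scores)
toy_auc = roc_auc(toy_labels, toy_scores)
```

The pairwise AUC loop is quadratic in the number of items, which is fine for illustration; a rank-based computation would be preferable for genome-scale gene lists.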
Regulation of P450-mediated permethrin resistance in Culex quinquefasciatus by the GPCR/Gαs/AC/cAMP/PKA signaling cascade.

PubMed

Li, Ting; Liu, Nannan

2017-12-01

This study explores the role of G-protein-coupled receptor (GPCR) intracellular signaling in the development of P450-mediated insecticide resistance in the mosquito Culex quinquefasciatus, focusing on the essential function of the GPCRs and their downstream effectors, Gs alpha subunit protein (Gαs) and adenylyl cyclases (ACs), in P450-mediated insecticide resistance of Culex mosquitoes. Our RNAi-mediated functional study showed that knockdown of Gαs decreased the expression of the downstream effectors ACs and PKAs in the GPCR signaling pathway and of resistance P450 genes, whereas knockdown of ACs decreased the expression of PKAs and resistance P450 genes. Knockdown of either Gαs or ACs resulted in an increased susceptibility of mosquitoes to permethrin. These results add significantly to our understanding of the molecular basis of resistance P450 gene regulation through GPCR/Gαs/AC/cAMP-PKA signaling pathways in the insecticide resistance of mosquitoes. The temporal and spatial dynamic analyses of GPCRs, Gαs, ACs, PKAs, and P450s in two insecticide resistant mosquito strains revealed that all the GPCR signaling pathway components tested, namely GPCRs, Gαs, ACs and PKAs, were most highly expressed in the brain for both resistant strains, suggesting the role played by these genes in signal transduction and regulation. The resistance P450 genes were mainly expressed in the brain, midgut and Malpighian tubules (MTs), suggesting their critical function in the central nervous system and importance for detoxification. The temporal dynamics analysis for the gene expression showed a diverse expression profile during mosquito development, indicating their initially functional importance in response to exposure to insecticides during their life stages.

Role of extracytoplasmic function sigma factor PG1660 (RpoE) in the oxidative stress resistance regulatory network of Porphyromonas gingivalis

PubMed Central

Dou, Y.; Rutanhira, H.; Chen, X.; Mishra, A.; Wang, C.; Fletcher, H.M.

2018-01-01

Summary In Porphyromonas gingivalis, the protein PG1660, composed of 174 amino acids, is annotated as an extracytoplasmic function (ECF) sigma factor (RpoE homologue-σ24). Because PG1660 can modulate several virulence factors and responds to environmental signals in P. gingivalis, its genetic properties were evaluated. PG1660 is co-transcribed with its downstream gene PG1659, and the transcription start site was identified as adenine residue 54-nucleotides upstream of the ATG translation start codon. In addition to binding its own promoter, using the purified rPG1660 and RNAP core enzyme from Escherichia coli with the PG1660 promoter DNA as template, the function of PG1660 as a sigma factor was demonstrated in an in vitro transcription assay. Transcriptome analyses of a P. gingivalis PG1660-defective isogenic mutant revealed that under oxidative stress conditions 176 genes including genes involved in secondary metabolism were downregulated more than two-fold compared with the parental strain. The rPG1660 protein also showed the ability to bind to the promoters of the highly downregulated genes in the PG1660-deficient mutant. As the ECF sigma factor PG0162 has a 29% identity with PG1660 and can modulate its expression, the cross-talk between their regulatory networks was explored. The expression profile of the PG0162PG1660-deficient mutant (P. gingivalis FLL356) revealed that the type IX secretion system genes and several virulence genes were downregulated under hydrogen peroxide stress conditions. Taken together, we have confirmed that PG1660 can function as a sigma factor, and plays an important regulatory role in the oxidative stress and virulence regulatory network of P. gingivalis. PMID:29059500
A novel essential domain perspective for exploring gene essentiality.

PubMed

Lu, Yao; Lu, Yulan; Deng, Jingyuan; Peng, Hai; Lu, Hui; Lu, Long Jason

2015-09-15

Genes with indispensable functions are identified as essential; however, the traditional gene-level studies of essentiality have several limitations. In this study, we characterized gene essentiality from a new perspective of protein domains, the independent structural or functional units of a polypeptide chain. To identify such essential domains, we have developed an Expectation-Maximization (EM) algorithm-based Essential Domain Prediction (EDP) Model. With simulated datasets, the model provided convergent results given different initial values and offered accurate predictions even with noise. We then applied the EDP model to six microbial species and predicted 1879 domains to be essential in at least one species, ranging from 10% to 23% in each species. The predicted essential domains were more conserved than either non-essential domains or essential genes. Comparing essential domains in prokaryotes and eukaryotes revealed an evolutionary distance consistent with that inferred from ribosomal RNA. When utilizing these essential domains to reproduce the annotation of essential genes, we obtained accurate results that suggest protein domains are more basic units for the essentiality of genes. Furthermore, we presented several examples to illustrate how the combination of essential and non-essential domains can lead to genes with divergent essentiality. In summary, we have described the first systematic analysis of gene essentiality at the level of domains. Contact: huilu.bioinfo@gmail.com or Long.Lu@cchmc.org. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Risk of type 1 diabetes progression in islet autoantibody-positive children can be further stratified using expression patterns of multiple genes implicated in peripheral blood lymphocyte activation and function.

PubMed

Jin, Yulan; Sharma, Ashok; Bai, Shan; Davis, Colleen; Liu, Haitao; Hopkins, Diane; Barriga, Kathy; Rewers, Marian; She, Jin-Xiong

2014-07-01

There is tremendous scientific and clinical value to further improving the predictive power of autoantibodies because autoantibody-positive (AbP) children have heterogeneous rates of progression to clinical diabetes. This study explored the potential of gene expression profiles as biomarkers for risk stratification among 104 AbP subjects from the Diabetes Autoimmunity Study in the Young (DAISY) using a discovery data set based on microarray and a validation data set based on real-time RT-PCR. The microarray data identified 454 candidate genes with expression levels associated with various type 1 diabetes (T1D) progression rates. RT-PCR analyses of the top-27 candidate genes confirmed 5 genes (BACH2, IGLL3, EIF3A, CDC20, and TXNDC5) associated with differential progression and implicated in lymphocyte activation and function. Multivariate analyses of these five genes in the discovery and validation data sets identified and confirmed four multigene models (BI, ICE, BICE, and BITE, with each letter representing a gene) that consistently stratify high- and low-risk subsets of AbP subjects with hazard ratios >6 (P < 0.01). The results suggest that these genes may be involved in T1D pathogenesis and potentially serve as excellent gene expression biomarkers to predict the risk of progression to clinical diabetes for AbP subjects. © 2014 by the American Diabetes Association.
The evolutionary fate of the chloroplast and nuclear rps16 genes as revealed through the sequencing and comparative analyses of four novel legume chloroplast genomes from Lupinus.

PubMed

Keller, J; Rousseau-Gueutin, M; Martin, G E; Morice, J; Boutte, J; Coissac, E; Ourari, M; Aïnouche, M; Salmon, A; Cabello-Hurtado, F; Aïnouche, A

2017-08-01

The Fabaceae family is considered as a model system for understanding chloroplast genome evolution due to the presence of extensive structural rearrangements, gene losses and localized hypermutable regions. Here, we provide sequences of four chloroplast genomes from the Lupinus genus, belonging to the underinvestigated Genistoid clade. Notably, we found in Lupinus species the functional loss of the essential rps16 gene, which was most likely replaced by the nuclear rps16 gene that encodes chloroplast and mitochondrion targeted RPS16 proteins. To study the evolutionary fate of the rps16 gene, we explored all available plant chloroplast, mitochondrial and nuclear genomes. Whereas no plant mitochondrial genomes carry an rps16 gene, many plants still have a functional nuclear and chloroplast rps16 gene. Ka/Ks ratios revealed that both chloroplast and nuclear rps16 copies were under purifying selection. However, due to the dual targeting of the nuclear rps16 gene product and the absence of a mitochondrial copy, the chloroplast gene may be lost. We also performed comparative analyses of lupine plastomes (SNPs, indels and repeat elements), identified the most variable regions and examined their phylogenetic utility. The markers identified here will help to reveal the evolutionary history of lupines, Genistoids and closely related clades. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
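The Ka/Ks reasoning in the abstract above rests on a standard reading of the ratio: values well below 1 indicate purifying selection, values near 1 are consistent with neutral evolution, and values above 1 suggest positive selection. A minimal sketch of that interpretation step is given here; the tolerance band around 1 is an arbitrary illustrative choice, and the example rates are hypothetical rather than values from the Lupinus analysis.

```python
def selection_regime(ka, ks, tolerance=0.05):
    """Classify a gene by its Ka/Ks ratio.

    ka: nonsynonymous substitution rate; ks: synonymous substitution rate.
    `tolerance` is an assumed half-width of the band treated as neutral.
    """
    if ks <= 0:
        raise ValueError("Ks must be positive for the ratio to be defined")
    omega = ka / ks
    if omega < 1 - tolerance:
        return "purifying"
    if omega > 1 + tolerance:
        return "positive"
    return "neutral"


# Hypothetical rates for illustration only.
toy_regime = selection_regime(ka=0.02, ks=0.20)
```

A real analysis would estimate Ka and Ks from codon alignments and test the ratio statistically rather than apply a fixed band.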
Genome-wide analysis of tandem repeats in plants and green algae

Treesearch

Zhixin Zhao; Cheng Guo; Sreeskandarajan Sutharzan; Pei Li; Craig Echt; Jie Zhang; Chun Liang

2014-01-01

Tandem repeats (TRs) are widespread in the genomes of prokaryotes and eukaryotes. Based on the sequenced genomes and gene annotations of 31 plant and algal species in Phytozome version 8.0 (http://www.phytozome.net/), we examined TRs on a genome-wide scale, characterized their distributions and motif features, and explored their putative biological functions. Among...
60 years ago, Francis Crick changed the logic of biology

PubMed Central

2017-01-01

In September 1957, Francis Crick gave a lecture in which he outlined key ideas about gene function, in particular what he called the central dogma. These ideas still frame how we understand life. This essay explores the concepts he developed in this influential lecture, including his prediction that we would study evolution by comparing sequences. PMID:28922352
Functional Interplay between Small Non-Coding RNAs and RNA Modification in the Brain.

PubMed

Leighton, Laura J; Bredy, Timothy W

2018-06-07

Small non-coding RNAs are essential for transcription, translation and gene regulation in all cell types, but are particularly important in neurons, with known roles in neurodevelopment, neuroplasticity and neurological disease. Many small non-coding RNAs are directly involved in the post-transcriptional modification of other RNA species, while others are themselves substrates for modification, or are functionally modulated by modification of their target RNAs. In this review, we explore the known and potential functions of several distinct classes of small non-coding RNAs in the mammalian brain, focusing on the newly recognised interplay between the epitranscriptome and the activity of small RNAs. We discuss the potential for this relationship to influence the spatial and temporal dynamics of gene activation in the brain, and predict that further research in the field of epitranscriptomics will identify interactions between small RNAs and RNA modifications which are essential for higher order brain functions such as learning and memory.
A transcriptome-based examination of blood group expression

PubMed Central

Noh, S.-J.; Lee, Y.T.; Byrnes, C.; Miller, J.L.

2011-01-01

Over the last two decades, red cell biologists witnessed a vast expansion of genetic-based information pertaining to blood group antigens and their carrier molecules. Genetic progress has led to a better comprehension of the associated antigens. To assist with studies concerning the integrated regulation and function of blood groups, transcript levels for each of the 36 associated genes were studied. Profiles using mRNA from directly sampled reticulocytes and cultured primary erythroblasts are summarized in this report. Transcriptome profiles suggest a highly regulated pattern of blood group gene expression during erythroid differentiation and ontogeny. Approximately one-third of the blood group carrier genes are transcribed in an erythroid-specific fashion. Low-level and indistinct expression was noted for most of the carbohydrate-associated genes. Methods are now being developed to further explore and manipulate expression of the blood group genes at all stages of human erythropoiesis. PMID:20685146
Investigation of a thiolated polymer in gene delivery

NASA Astrophysics Data System (ADS)

Bacalocostantis, Irene

Thiol-containing bioreducible polymers show significant potential as delivery vectors in gene therapy, a rapidly growing field which seeks to treat genetic-based disorders by delivering functional synthetic genes to diseased cells. Studies have shown that thiolated polymers exhibit improved biodegradability and prolonged in vivo circulation times over non-thiolated polymers. However, the extent to which thiol concentrations impact the carrier's delivery potential has not been well explored. The aim of this dissertation is to investigate how relative concentrations of free thiols and disulfide crosslinks impact a polymeric carrier's delivery performance with respect to DNA packaging, complex stability, cargo protection, gene release, internalization efficiency and cytotoxicity. To accomplish this goal, several fluorescent polymers containing varying concentrations of thiol groups were synthesized by conjugating thiol-pendant chains onto the primary amines of cationic poly(allylamine). In vitro delivery assays and characterization techniques were employed to assess the effect of thiols in gene delivery.
Chromatin looping and eRNA transcription precede the transcriptional activation of gene in the β-globin locus

PubMed Central

Kim, Yea Woon; Lee, Sungkung; Yun, Jangmi; Kim, AeRi

2015-01-01

Enhancers are brought into close proximity with actively transcribed target genes by chromatin looping. Non-coding RNAs, referred to as eRNAs (enhancer RNAs), are often transcribed on active enhancers. To explore the kinetics of enhancer–promoter looping and eRNA transcription during transcriptional activation, we induced the β-globin locus by chemical treatment and analysed the cross-linking frequency between the β-globin gene and the locus control region (LCR) and the amount of eRNAs transcribed on the LCR in a time-course manner. The cross-linking frequency increased after chemical induction but before the transcriptional activation of the gene in the β-globin locus. Transcription of eRNAs increased concomitantly with the increase in cross-linking frequency. These results show that chromatin looping and eRNA transcription precede the transcriptional activation of the gene. The concomitant occurrence of the two events suggests a functional relationship between them. PMID:25588787
Monitoring of oil hydrocarbons pollution in the Sea of Japan, based on detection of marker genes in microbial communities

NASA Astrophysics Data System (ADS)

Kim, A. V.; Buzoleva, L. S.; Bogatyrenko, E. A.; Zemskaya, T. I.; Mamaeva, E. V.

2018-01-01

Molecular biology techniques were used to explore the metabolic potential of microbial communities in inshore waters of the Sea of Japan that are subject to varying anthropogenic load. The presence of functional genes responsible for the degradation of oil hydrocarbons in these communities was investigated for the first time. In total microbial DNA from the water mass of regions with chronic anthropogenic pollution, genes responsible for the oxidation of a broad range of n-alkanes and polycyclic aromatic hydrocarbons were found. Detection of marker genes in the background water area (Vostok Bay) indicated ecological deterioration within this territory. Molecular genetic methods aimed at detecting marker genes in total bacterial DNA from environmental samples thus proved more effective for identifying oil hydrocarbon pollution of water than conventional culture-based methods.
Beyond the usual suspects: a multidimensional genetic exploration of infant attachment disorganization and security.

PubMed

Pappa, Irene; Szekely, Eszter; Mileva-Seitz, Viara R; Luijk, Maartje P C M; Bakermans-Kranenburg, Marian J; van IJzendoorn, Marinus H; Tiemeier, Henning

2015-01-01

Although the environmental influences on infant attachment disorganization and security are well-studied, little is known about their heritability. Candidate gene studies have shown small, often non-replicable effects. In this study, we gathered the largest sample (N = 657) of ethnically homogenous, 14-month-old children with both observed attachment and genome-wide data. First, we used a Genome-Wide Association Study (GWAS) approach to identify single nucleotide polymorphisms (SNPs) associated with attachment disorganization and security. Second, we annotated them into genes (Versatile Gene-based Association Study) and functional pathways. Our analyses provide evidence of novel genes (HDAC1, ZNF675, BSCD1) and pathways (synaptic transmission, cation transport) associated with attachment disorganization. Similar analyses identified a novel gene (BECN1) but no distinct pathways associated with attachment security. The results of this first extensive, exploratory study on the molecular-genetic basis of infant attachment await replication in large, independent samples.
Model-driven discovery of underground metabolic functions in Escherichia coli.

PubMed

Guzmán, Gabriela I; Utrilla, José; Nurk, Sergey; Brunk, Elizabeth; Monk, Jonathan M; Ebrahim, Ali; Palsson, Bernhard O; Feist, Adam M

2015-01-20

Enzyme promiscuity toward substrates has been discussed in evolutionary terms as providing the flexibility to adapt to novel environments. In the present work, we describe an approach toward exploring such enzyme promiscuity in the space of a metabolic network. This approach leverages genome-scale models, which have been widely used for predicting growth phenotypes in various environments or following a genetic perturbation; however, these predictions occasionally fail. Failed predictions of gene essentiality offer an opportunity for targeting biological discovery, suggesting the presence of unknown underground pathways stemming from enzymatic cross-reactivity. We demonstrate a workflow that couples constraint-based modeling and bioinformatic tools with KO strain analysis and adaptive laboratory evolution for the purpose of predicting promiscuity at the genome scale. Three cases of genes that are incorrectly predicted as essential in Escherichia coli--aspC, argD, and gltA--are examined, and isozyme functions are uncovered for each to a different extent. Seven isozyme functions based on genetic and transcriptional evidence are suggested between the genes aspC and tyrB, argD and astC, gabT and puuE, and gltA and prpC. This study demonstrates how a targeted model-driven approach to discovery can systematically fill knowledge gaps, characterize underground metabolism, and elucidate regulatory mechanisms of adaptation in response to gene KO perturbations.
Molecular and functional characterization of an invertase secreted by Ashbya gossypii.

PubMed

Aguiar, Tatiana Q; Dinis, Cláudia; Magalhães, Frederico; Oliveira, Carla; Wiebe, Marilyn G; Penttilä, Merja; Domingues, Lucília

2014-06-01

The repertoire of hydrolytic enzymes natively secreted by the filamentous fungus Ashbya (Eremothecium) gossypii has been poorly explored. Here, an invertase secreted by this flavinogenic fungus was for the first time molecularly and functionally characterized. Invertase activity was detected in A. gossypii culture supernatants and cell-associated fractions. Extracellular invertase migrated in a native polyacrylamide gel as diffuse protein bands, indicating the occurrence of at least two invertase isoforms. Hydrolytic activity toward sucrose was approximately 10 times higher than toward raffinose. Inulin and levan were not hydrolyzed. Production of invertase by A. gossypii was repressed by the presence of glucose in the culture medium. The A. gossypii invertase was demonstrated to be encoded by the AFR529W (AgSUC2) gene, which is highly homologous to the Saccharomyces cerevisiae SUC2 (ScSUC2) gene. Agsuc2 null mutants were unable to hydrolyze sucrose, proving that invertase is encoded by a single gene in A. gossypii. This mutation was functionally complemented by the ScSUC2 and AgSUC2 genes, when expressed from a 2-μm-plasmid. The signal sequences of both AgSuc2p and ScSuc2p were able to direct the secretion of invertase into the culture medium in A. gossypii.
Genome-Wide Identification, Characterization and Expression Analysis of the TCP Gene Family in Prunus mume

PubMed Central

Zhou, Yuzhen; Xu, Zongda; Zhao, Kai; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

2016-01-01

TCP proteins, which belong to a plant-specific transcription factor family, are known to play important roles in plant development, especially flower and leaf development. However, there is little information about this gene family in Prunus mume, which is widely cultivated in China as an ornamental and fruit tree. Here a genome-wide analysis of TCP genes was performed to explore their evolution in P. mume. Nineteen PmTCPs were identified and three of them contained putative miR319 target sites. Phylogenetic and comprehensive bioinformatics analyses of these genes revealed that different types of TCP genes had undergone different evolutionary processes and that genes in the same clade had similar chromosomal location, gene structure, and conserved domains. Expression analysis of these PmTCPs indicated that there were diverse expression patterns among different clades. Most TCP genes were predominantly expressed in flower, leaf, and stem, and showed high expression levels in the different stages of flower bud differentiation, especially in the petal formation stage and gametophyte development. Genes in the TCP-P subfamily had main roles in both flower development and gametophyte development. The CIN genes in double petal cultivars might have key roles in petal formation, while they were correlated with gametophyte development in the single petal cultivar. The CYC/TB1 type genes were highly expressed during the formation of petal and pistil. The less-complex flower types of P. mume might result from the fact that only two CYC type genes are present in P. mume and that CYC2 genes, which control the identity of flower types, are lacking. These results lay the foundation for further study on the functions of TCP genes during flower development. PMID:27630648
GENEASE: Real time bioinformatics tool for multi-omics and disease ontology exploration, analysis and visualization.

PubMed

Ghandikota, Sudhir; Hershey, Gurjit K Khurana; Mersha, Tesfaye B

2018-03-24

Advances in high-throughput sequencing technologies have made it possible to generate multiple omics data at an unprecedented rate and scale. The accumulation of these omics data far outpaces the rate at which biologists can mine them and generate new hypotheses to test experimentally. There is an urgent need to develop powerful tools to efficiently and effectively search and filter these resources to address specific post-GWAS functional genomics questions. However, to date, these resources are scattered across several databases and often lack a unified portal for data annotation and analytics. In addition, existing tools to analyze and visualize these databases are highly fragmented, forcing researchers to access multiple applications and to intervene manually for each gene or variant in an ad hoc fashion until all the questions are answered. In this study, we present GENEASE, a web-based one-stop bioinformatics tool designed not only to query and explore multi-omics and phenotype databases (e.g., GTEx, ClinVar, dbGaP, GWAS Catalog, ENCODE, Roadmap Epigenomics, KEGG, Reactome, Gene and Phenotype Ontology) in a single web interface but also to perform seamless post genome-wide association downstream functional and overlap analysis for non-coding regulatory variants. GENEASE accesses over 50 different databases in the public domain, including model organism-specific databases, to facilitate gene/variant and disease exploration, enrichment and overlap analysis in real time. It is a user-friendly tool with a point-and-click interface containing links to support information, including a user manual and examples. GENEASE can be accessed freely at http://research.cchmc.org/mershalab/genease_new/login.html. Contact: Tesfaye.Mersha@cchmc.org, Sudhir.Ghandikota@cchmc.org. Supplementary data are available at Bioinformatics online.
Synchronized dynamics of bacterial niche-specific functions during biofilm development in a cold seep brine pool.

PubMed

Zhang, Weipeng; Wang, Yong; Bougouffa, Salim; Tian, Renmao; Cao, Huiluo; Li, Yongxin; Cai, Lin; Wong, Yue Him; Zhang, Gen; Zhou, Guowei; Zhang, Xixiang; Bajic, Vladimir B; Al-Suwailem, Abdulaziz; Qian, Pei-Yuan

2015-10-01

The biology of biofilms in deep-sea environments has barely been explored. Here, biofilms were developed at the brine pool (characterized by limited carbon sources) and in the normal bottom water adjacent to the Thuwal cold seeps. Comparative metagenomics based on 50 Gb datasets identified polysaccharide degradation, nitrate reduction and proteolysis as enriched functional categories for brine biofilms. The genomes of two dominant species in the brine biofilms, a novel Deltaproteobacterium and a novel Epsilonproteobacterium, were reconstructed. Despite rather small genome sizes, the Deltaproteobacterium possessed enhanced polysaccharide fermentation pathways, whereas the Epsilonproteobacterium was a versatile nitrogen reactor possessing nar, nap and nif gene clusters. These metabolic functions, together with specific regulatory and hypersaline-tolerant genes, made the two bacteria unique compared with their close relatives, including those from hydrothermal vents. Moreover, these functions were regulated by biofilm development, as both the abundance and the expression level of key functional genes were higher in later stage biofilms, and co-occurrences between the two dominant bacteria were demonstrated. Collectively, unique mechanisms were revealed: (i) polysaccharide fermentation and proteolysis interacted with nitrogen cycling to form a complex chain for energy generation, and (ii) exploiting and organizing niche-specific functions would be an important strategy for biofilm-dependent adaptation to the extreme conditions. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
Phylogenetically informed logic relationships improve detection of biological network organization

PubMed Central

2011-01-01

Background: A "phylogenetic profile" refers to the presence or absence of a gene across a set of organisms, and it has proven valuable for understanding gene functional relationships and network organization. Despite this success, few studies have attempted to search beyond pairwise relationships among genes. Here we search for logic relationships involving three genes, and explore their potential application in gene network analyses. Results: Taking advantage of a phylogenetic matrix constructed from the large ortholog database Roundup, we devised a method to create balanced profiles for individual triplets of genes that guarantee equal weight on the different phylogenetic scenarios of coevolution between genes. When we applied this idea to LAPP, the method used to search for logic triplets of genes, the balanced profiles resulted in a significant performance improvement and the discovery of hundreds of thousands more putative triplets than unadjusted profiles. We found that logic triplets detected biological network organization and identified key proteins and their functions, ranging from neighbouring proteins in local pathways, to well separated proteins in the whole pathway, and to the interactions among different pathways at the system level. Finally, our case study suggested that the directionality in a logic relationship and the profile of a triplet could disclose the connectivity between the triplet and surrounding networks. Conclusion: Balanced profiles are superior to the raw profiles employed by traditional methods of phylogenetic profiling in searching for high order gene sets. Gene triplets can provide valuable information for detecting biological network organization and identifying key genes at different levels of cellular interaction. PMID:22172058
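To make the idea of a logic relationship among three phylogenetic profiles concrete, the following Python sketch scores how well one gene's presence/absence profile is explained by a logic function of two other genes. It is a minimal illustration only: the toy profiles, the function names and the simple agreement score are assumptions, and neither the LAPP scoring nor the balanced-profile construction described in the abstract is reproduced here.

```python
# Two-input logic functions to test (a small subset, chosen for illustration).
LOGIC_FUNCTIONS = {
    "AND": lambda a, b: a and b,
    "OR": lambda a, b: a or b,
    "XOR": lambda a, b: a != b,
}


def agreement(profile_c, profile_a, profile_b, func):
    """Fraction of organisms in which c equals func(a, b)."""
    hits = sum(
        bool(c) == bool(func(bool(a), bool(b)))
        for a, b, c in zip(profile_a, profile_b, profile_c)
    )
    return hits / len(profile_c)


def best_logic(profile_c, profile_a, profile_b):
    """Return the name and score of the best-fitting logic function."""
    scores = {
        name: agreement(profile_c, profile_a, profile_b, func)
        for name, func in LOGIC_FUNCTIONS.items()
    }
    best = max(scores, key=scores.get)
    return best, scores[best]


# Hypothetical presence (1) / absence (0) profiles across eight organisms.
gene_a = [1, 1, 0, 0, 1, 0, 1, 0]
gene_b = [1, 0, 1, 0, 1, 1, 0, 0]
gene_c = [1, 0, 0, 0, 1, 0, 0, 0]  # present only where both a and b are present

name, score = best_logic(gene_c, gene_a, gene_b)
print(name, score)
```

In this toy case gene_c matches gene_a in only six of eight organisms, so the pairwise view understates a relationship that the three-gene view captures fully.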
NetMiner-an ensemble pipeline for building genome-wide and high-quality gene co-expression network using massive-scale RNA-seq samples.

PubMed

Yu, Hua; Jiao, Bingke; Lu, Lu; Wang, Pengfei; Chen, Shuangcheng; Liang, Chengzhi; Liu, Wei

2018-01-01

Accurately reconstructing gene co-expression networks is of great importance for uncovering the genetic architecture underlying complex and varied phenotypes. The recent availability of high-throughput RNA-seq sequencing has made genome-wide detection and quantification of novel, rare and low-abundance transcripts practical. However, its potential merits in reconstructing gene co-expression networks have still not been well explored. Using massive-scale RNA-seq samples, we have designed an ensemble pipeline, called NetMiner, for building genome-scale and high-quality Gene Co-expression Networks (GCNs) by integrating three frequently used inference algorithms. We constructed an RNA-seq-based GCN in rice, a monocot species. The quality of the network obtained by our method was verified and evaluated using curated gene functional association data sets; the ensemble network clearly outperformed each single method. In addition, the powerful capability of the network for associating genes with functions and agronomic traits was shown by enrichment analysis and case studies. In particular, we demonstrated the potential value of our proposed method for predicting the biological roles of unknown protein-coding genes, long non-coding RNA (lncRNA) genes and circular RNA (circRNA) genes. Our results provide a valuable and highly reliable data source for selecting key candidate genes for subsequent experimental validation. To facilitate identification of novel genes regulating important biological processes and phenotypes in other plants or animals, we have published the source code of NetMiner, making it freely available at https://github.com/czllab/NetMiner.
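The ensemble idea can be sketched in a few lines of Python. The abstract does not name the three inference algorithms NetMiner integrates, so this sketch substitutes two generic association measures (Pearson and Spearman correlation) and combines them by averaging edge ranks; the combination rule, the function names and the toy expression values are all assumptions for illustration, not NetMiner's implementation.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def ranks(values):
    """Rank positions (1 = smallest). Ties are not averaged; the toy data has none."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for position, index in enumerate(order, start=1):
        result[index] = position
    return result


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def ensemble_edges(expr, top_k):
    """Rank gene pairs by each method, average the ranks, keep the top_k pairs."""
    pairs = list(combinations(sorted(expr), 2))
    rank_lists = []
    for method in (pearson, spearman):
        scores = [abs(method(expr[g1], expr[g2])) for g1, g2 in pairs]
        order = sorted(range(len(pairs)), key=lambda i: -scores[i])
        edge_rank = [0] * len(pairs)
        for position, index in enumerate(order, start=1):
            edge_rank[index] = position  # 1 = strongest edge for this method
        rank_lists.append(edge_rank)
    mean_rank = [sum(r[i] for r in rank_lists) / len(rank_lists)
                 for i in range(len(pairs))]
    best = sorted(range(len(pairs)), key=lambda i: mean_rank[i])[:top_k]
    return [pairs[i] for i in best]


# Hypothetical expression values for three genes across five samples.
expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0],
    "geneB": [2.1, 3.9, 6.2, 8.1, 9.8],
    "geneC": [5.0, 1.0, 4.0, 2.0, 3.0],
}
top_edges = ensemble_edges(expr, top_k=1)
print(top_edges)
```

Averaging ranks rather than raw scores keeps methods with different score scales comparable, which is one common reason to build an ensemble this way.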

Roles for text mining in protein function prediction.

PubMed

Verspoor, Karin M

2014-01-01

The Human Genome Project has provided science with a hugely valuable resource: the blueprints for life; the specification of all of the genes that make up a human. While the genes have all been identified and deciphered, it is proteins that are the workhorses of the human body: they are essential to virtually all cell functions and are the primary mechanism through which biological function is carried out. Hence in order to fully understand what happens at a molecular level in biological organisms, and eventually to enable development of treatments for diseases where some aspect of a biological system goes awry, we must understand the functions of proteins. However, experimental characterization of protein function cannot scale to the vast amount of DNA sequence data now available. Computational protein function prediction has therefore emerged as a problem at the forefront of modern biology (Radivojac et al., Nat Methods 10(13):221-227, 2013). Within the varied approaches to computational protein function prediction that have been explored, there are several that make use of biomedical literature mining. These methods take advantage of information in the published literature to associate specific proteins with specific protein functions. In this chapter, we introduce two main strategies for doing this: association of function terms, represented as Gene Ontology terms (Ashburner et al., Nat Genet 25(1):25-29, 2000), to proteins based on information in published articles, and a paradigm called LEAP-FS (Literature-Enhanced Automated Prediction of Functional Sites) in which literature mining is used to validate the predictions of an orthogonal computational protein function prediction method.
GEO2Enrichr: browser extension and server app to extract gene sets from GEO and analyze them for biological functions.

PubMed

Gundersen, Gregory W; Jones, Matthew R; Rouillard, Andrew D; Kou, Yan; Monteiro, Caroline D; Feldmann, Axel S; Hu, Kevin S; Ma'ayan, Avi

2015-09-15

Identification of differentially expressed genes is an important step in extracting knowledge from gene expression profiling studies. The raw expression data from microarray and other high-throughput technologies is deposited into the Gene Expression Omnibus (GEO) and served as Simple Omnibus Format in Text (SOFT) files. However, extracting and analyzing differentially expressed genes from GEO requires significant computational skills. Here we introduce GEO2Enrichr, a browser extension for extracting differentially expressed gene sets from GEO and analyzing those sets with Enrichr, an independent gene set enrichment analysis tool containing over 70 000 annotated gene sets organized into 75 gene-set libraries. GEO2Enrichr adds JavaScript code to GEO web-pages; this code scrapes user selected accession numbers and metadata, and then, with one click, users can submit this information to a web-server application that downloads the SOFT files, parses, cleans and normalizes the data, identifies the differentially expressed genes, and then pipes the resulting gene lists to Enrichr for downstream functional analysis. GEO2Enrichr opens a new avenue for adding functionality to major bioinformatics resources such as GEO by integrating tools and resources without the need for a plug-in architecture. Importantly, GEO2Enrichr helps researchers to quickly explore hypotheses with little technical overhead, lowering the barrier of entry for biologists by automating data processing steps needed for knowledge extraction from the major repository GEO. GEO2Enrichr is an open source tool, freely available for installation as browser extensions at the Chrome Web Store and Firefox Add-ons. Documentation and a browser independent web application can be found at http://amp.pharm.mssm.edu/g2e/. Contact: avi.maayan@mssm.edu. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
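The "identify differentially expressed genes, then pass gene lists on" step of such a pipeline can be illustrated with a short Python sketch. The abstract does not say which differential expression method GEO2Enrichr applies, so the sketch below assumes a simple mean log2 fold-change threshold; the function name, the threshold and the expression values are hypothetical.

```python
from math import log2
from statistics import mean


def differential_genes(control, treated, min_abs_log2fc=1.0):
    """Split genes into up- and down-regulated lists by mean log2 fold change.

    control and treated map gene symbols to lists of expression values.
    A pseudocount of 1 avoids taking the logarithm of zero.
    """
    up, down = [], []
    for gene in sorted(control):
        control_level = mean(log2(value + 1) for value in control[gene])
        treated_level = mean(log2(value + 1) for value in treated[gene])
        log2fc = treated_level - control_level
        if log2fc >= min_abs_log2fc:
            up.append(gene)
        elif log2fc <= -min_abs_log2fc:
            down.append(gene)
    return up, down


# Hypothetical expression values for three genes, three replicates per condition.
control = {"GENE1": [7, 7, 7], "GENE2": [63, 63, 63], "GENE3": [15, 15, 15]}
treated = {"GENE1": [31, 31, 31], "GENE2": [7, 7, 7], "GENE3": [15, 15, 15]}

up_genes, down_genes = differential_genes(control, treated)
print("up:", up_genes, "down:", down_genes)
```

The two resulting lists are the kind of input an enrichment tool takes; a real analysis would also use a statistical test and multiple-testing correction rather than a fold-change cutoff alone.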
3′ Untranslated Regions Mediate Transcriptional Interference between Convergent Genes Both Locally and Ectopically in Saccharomyces cerevisiae

PubMed Central

Wang, Luwen; Jiang, Ning; Wang, Lin; Fang, Ou; Leach, Lindsey J.; Hu, Xiaohua; Luo, Zewei

2014-01-01

Paired sense and antisense (S/AS) genes located in cis represent a structural feature common to the genomes of both prokaryotes and eukaryotes, and produce partially complementary transcripts. We used published genome and transcriptome sequence data and found that over 20% of genes (645 pairs) in the budding yeast Saccharomyces cerevisiae genome are arranged in convergent pairs with overlapping 3′-UTRs. Using published microarray transcriptome data from the standard laboratory strain of S. cerevisiae, our analysis revealed that expression levels of convergent pairs are significantly negatively correlated across a broad range of environments. This implies an important role for convergent genes in the regulation of gene expression, which may compensate for the absence of RNA-dependent mechanisms such as micro RNAs in budding yeast. We selected four representative convergent gene pairs and used expression assays in wild type yeast and its genetically modified strains to explore the underlying patterns of gene expression. Results showed that convergent genes are reciprocally regulated in yeast populations and in single cells, whereby an increase in expression of one gene produces a decrease in the expression of the other, and vice-versa. Time course analysis of the cell cycle illustrated the functional significance of this relationship for the three pairs with relevant functional roles. Furthermore, a series of genetic modifications revealed that the 3′-UTR sequence plays an essential causal role in mediating transcriptional interference, which requires neither the sequence of the open reading frame nor the translation of fully functional proteins. 
More importantly, transcriptional interference persisted even when one of the convergent genes was expressed ectopically (in trans) and therefore does not depend on the cis arrangement of convergent genes; we conclude that the mechanism of transcriptional interference cannot be explained by the transcriptional collision model, which postulates a clash between simultaneous transcriptional processes occurring on opposite DNA strands. PMID:24465217
Transcriptomic imprints of adaptation to fresh water: parallel evolution of osmoregulatory gene expression in the Alewife

USGS Publications Warehouse

Velotta, Jonathan P.; Wegrzyn, Jill L.; Ginzburg, Samuel; Kang, Lin; Czesny, Sergiusz J.; O'Neill, Rachel J.; McCormick, Stephen; Michalak, Pawel; Schultz, Eric T.

2017-01-01

Comparative approaches in physiological genomics offer an opportunity to understand the functional importance of genes involved in niche exploitation. We used populations of Alewife (Alosa pseudoharengus) to explore the transcriptional mechanisms that underlie adaptation to fresh water. Ancestrally anadromous Alewives have recently formed multiple, independently derived, landlocked populations, which exhibit reduced tolerance of saltwater and enhanced tolerance of fresh water. Using RNA-seq, we compared transcriptional responses of an anadromous Alewife population to two landlocked populations after acclimation to fresh (0 ppt) and saltwater (35 ppt). Our results suggest that the gill transcriptome has evolved in primarily discordant ways between independent landlocked populations and their anadromous ancestor. By contrast, evolved shifts in the transcription of a small suite of well-characterized osmoregulatory genes exhibited a strong degree of parallelism. In particular, transcription of genes that regulate gill ion exchange has diverged in accordance with functional predictions: freshwater ion-uptake genes (most notably, the ‘freshwater paralog’ of Na+/K+-ATPase α-subunit) were more highly expressed in landlocked forms, whereas genes that regulate saltwater ion secretion (e.g. the ‘saltwater paralog’ of NKAα) exhibited a blunted response to saltwater. Parallel divergence of ion transport gene expression is associated with shifts in salinity tolerance limits among landlocked forms, suggesting that changes to the gill's transcriptional response to salinity facilitate freshwater adaptation.
PANTHER version 11: expanded annotation data from Gene Ontology and Reactome pathways, and data analysis tool enhancements.

PubMed

Mi, Huaiyu; Huang, Xiaosong; Muruganujan, Anushya; Tang, Haiming; Mills, Caitlin; Kang, Diane; Thomas, Paul D

2017-01-04

The PANTHER database (Protein ANalysis THrough Evolutionary Relationships, http://pantherdb.org) contains comprehensive information on the evolution and function of protein-coding genes from 104 completely sequenced genomes. PANTHER software tools allow users to classify new protein sequences, and to analyze gene lists obtained from large-scale genomics experiments. In the past year, major improvements include a large expansion of classification information available in PANTHER, as well as significant enhancements to the analysis tools. Protein subfamily functional classifications have more than doubled due to progress of the Gene Ontology Phylogenetic Annotation Project. For human genes (as well as a few other organisms), PANTHER now also supports enrichment analysis using pathway classifications from the Reactome resource. The gene list enrichment tools include a new 'hierarchical view' of results, enabling users to leverage the structure of the classifications/ontologies; the tools also allow users to upload genetic variant data directly, rather than requiring prior conversion to a gene list. The updated coding single-nucleotide polymorphisms (SNP) scoring tool uses an improved algorithm. The hidden Markov model (HMM) search tools now use HMMER3, dramatically reducing search times and improving accuracy of E-value statistics. Finally, the PANTHER Tree-Attribute Viewer has been implemented in JavaScript, with new views for exploring protein sequence evolution. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
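Gene list enrichment of the kind mentioned above is commonly framed as an over-representation test. The Python sketch below computes a one-sided hypergeometric p-value for the overlap between a gene list and an annotation category; it is a generic illustration under assumed numbers, and the abstract does not state which statistical test the PANTHER tools apply.

```python
from math import comb


def enrichment_p_value(population, annotated, sample, overlap):
    """One-sided hypergeometric p-value P(X >= overlap).

    population: number of genes in the reference set
    annotated:  reference genes carrying the annotation
    sample:     number of genes in the uploaded list
    overlap:    uploaded genes carrying the annotation
    """
    upper = min(annotated, sample)
    total = comb(population, sample)
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total


# Hypothetical numbers: 20 reference genes, 5 annotated to a pathway,
# and a list of 5 genes that are all annotated to that pathway.
p_value = enrichment_p_value(population=20, annotated=5, sample=5, overlap=5)
print(p_value)
```

When many categories are tested at once, as in an ontology-wide analysis, the resulting p-values would additionally need a multiple-testing correction.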
Design and interpretation of microRNA-reporter gene activity.

PubMed

Carroll, Adam P; Tooney, Paul A; Cairns, Murray J

2013-06-15

MicroRNAs (miRNAs) are small noncoding RNA molecules that act as sequence specificity guides to direct post-transcriptional gene silencing. In doing so, miRNAs regulate many critical developmental processes, including cellular proliferation, differentiation, migration, and apoptosis, as well as more specialized biological functions such as dendritic spine development and synaptogenesis. Interactions between miRNAs and their miRNA recognition elements occur via partial complementarity, rendering tremendous redundancy in targeting such that miRNAs are predicted to regulate 60% of the genome, with each miRNA estimated to regulate more than 200 genes. Because these predictions are prone to false positives and false negatives, there is an ever present need to provide material support to these assertions to firmly establish the biological function of specific miRNAs in both normal and pathophysiological contexts. Using schizophrenia-associated miR-181b as an example, we present detailed guidelines and novel insights for the rapid establishment of a streamlined miRNA-reporter gene assay and explore various design concepts for miRNA-reporter gene applications, including bidirectional miRNA modulation. In exemplifying this approach, we report seven novel miR-181b target sites for five schizophrenia candidate genes (DISC1, BDNF, ENKUR, GRIA1, and GRIK1) and dissect a number of vital concepts regarding future developments for miRNA-reporter gene assays and the interpretation of their results. Copyright © 2013 Elsevier Inc. All rights reserved.
The association of telomere length and genetic variation in telomere biology genes.

PubMed

Mirabello, Lisa; Yu, Kai; Kraft, Peter; De Vivo, Immaculata; Hunter, David J; Prescott, Jennifer; Wong, Jason Y Y; Chatterjee, Nilanjan; Hayes, Richard B; Savage, Sharon A

2010-09-01

Telomeres cap chromosome ends and are critical for genomic stability. Many telomere-associated proteins are important for telomere length maintenance. Recent genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) in genes encoding telomere-associated proteins (RTEL1 and TERT-CLPTM1L) as markers of cancer risk. We conducted an association study of telomere length and 743 SNPs in 43 telomere biology genes. Telomere length in peripheral blood DNA was determined by Q-PCR in 3,646 participants from the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial and Nurses' Health Study. We investigated associations by SNP, gene, and pathway (functional group). We found no associations between telomere length and SNPs in TERT-CLPTM1L or RTEL1. Telomere length was not significantly associated with specific functional groups. Thirteen SNPs from four genes (MEN1, MRE11A, RECQL5, and TNKS) were significantly associated with telomere length. The strongest findings were in MEN1 (gene-based P=0.006), which encodes menin, a protein that associates with the telomerase promoter and may negatively regulate telomerase. This large association study did not find strong associations with telomere length. The combination of limited diversity and evolutionary conservation suggests that these genes may be under selective pressure. More work is needed to explore the role of genetic variants in telomere length regulation. Published 2010 Wiley-Liss, Inc.
IMG-ABC: A Knowledge Base To Fuel Discovery of Biosynthetic Gene Clusters and Novel Secondary Metabolites.

PubMed

Hadjithomas, Michalis; Chen, I-Min Amy; Chu, Ken; Ratner, Anna; Palaniappan, Krishna; Szeto, Ernest; Huang, Jinghua; Reddy, T B K; Cimermančič, Peter; Fischbach, Michael A; Ivanova, Natalia N; Markowitz, Victor M; Kyrpides, Nikos C; Pati, Amrita

2015-07-14

In the discovery of secondary metabolites, analysis of sequence data is a promising exploration path that remains largely underutilized due to the lack of computational platforms that enable such a systematic approach on a large scale. In this work, we present IMG-ABC (https://img.jgi.doe.gov/abc), an atlas of biosynthetic gene clusters within the Integrated Microbial Genomes (IMG) system, which is aimed at harnessing the power of "big" genomic data for discovering small molecules. IMG-ABC relies on IMG's comprehensive integrated structural and functional genomic data for the analysis of biosynthetic gene clusters (BCs) and associated secondary metabolites (SMs). SMs and BCs serve as the two main classes of objects in IMG-ABC, each with a rich collection of attributes. A unique feature of IMG-ABC is the incorporation of both experimentally validated and computationally predicted BCs in genomes as well as metagenomes, thus identifying BCs in uncultured populations and rare taxa. We demonstrate the strength of IMG-ABC's focused integrated analysis tools in enabling the exploration of microbial secondary metabolism on a global scale, through the discovery of phenazine-producing clusters for the first time in Alphaproteobacteria. IMG-ABC strives to fill the long-existent void of resources for computational exploration of the secondary metabolism universe; its underlying scalable framework enables traversal of uncovered phylogenetic and chemical structure space, serving as a doorway to a new era in the discovery of novel molecules. IMG-ABC is the largest publicly available database of predicted and experimental biosynthetic gene clusters and the secondary metabolites they produce. The system also includes powerful search and analysis tools that are integrated with IMG's extensive genomic/metagenomic data and analysis tool kits. 
As new research on biosynthetic gene clusters and secondary metabolites is published and more genomes are sequenced, IMG-ABC will continue to expand, with the goal of becoming an essential component of any bioinformatic exploration of the secondary metabolism world. Copyright © 2015 Hadjithomas et al.
BURRITO: An Interactive Multi-Omic Tool for Visualizing Taxa–Function Relationships in Microbiome Data

PubMed Central

McNally, Colin P.; Eng, Alexander; Noecker, Cecilia; Gagne-Maynard, William C.; Borenstein, Elhanan

2018-01-01

The abundance of both taxonomic groups and gene categories in microbiome samples can now be easily assayed via various sequencing technologies, and visualized using a variety of software tools. However, the assemblage of taxa in the microbiome and its gene content are clearly linked, and tools for visualizing the relationship between these two facets of microbiome composition and for facilitating exploratory analysis of their co-variation are lacking. Here we introduce BURRITO, a web tool for interactive visualization of microbiome multi-omic data with paired taxonomic and functional information. BURRITO simultaneously visualizes the taxonomic and functional compositions of multiple samples and dynamically highlights relationships between taxa and functions to capture the underlying structure of these data. Users can browse for taxa and functions of interest and interactively explore the share of each function attributed to each taxon across samples. BURRITO supports multiple input formats for taxonomic and metagenomic data, allows adjustment of data granularity, and can export generated visualizations as static publication-ready formatted figures. In this paper, we describe the functionality of BURRITO, and provide illustrative examples of its utility for visualizing various trends in the relationship between the composition of taxa and functions in complex microbiomes. PMID:29545787
SOX1 suppresses cell growth and invasion in cervical cancer.

PubMed

Lin, Ya-Wen; Tsao, Chun-Ming; Yu, Pei-Ning; Shih, Yu-Lueng; Lin, Chia-Hsin; Yan, Ming-De

2013-10-01

Abnormal activation of the Wnt/β-catenin signaling pathway is common in human cancers, including cervical cancer. Many papers have shown that SRY (sex-determining region Y)-box (SOX) family genes serve as either tumor suppressor genes (TSGs) or oncogenes by regulating the Wnt signaling pathway in different cancers. We have demonstrated recently that epigenetic silencing of SOX1 gene occurs frequently in cervical cancer. However, the possible role of SOX1 in cervical cancer remains unclear. This study aimed to explore whether SOX1 functions as a TSG in cervical cancer. We established a constitutive and an inducible system that overexpressed SOX1 and monitored its function by in vitro experiments. To confirm SOX1 function, we manipulated SOX1 using an inducible expression approach in cell lines. The effect of SOX1 on tumorigenesis was also analyzed in animal models. Overexpression of SOX1 inhibited cell proliferation, anchorage independency, and invasion in vitro. SOX1 suppressed tumor growth in nonobese diabetic/severe combined immunodeficiency mice. After induction of SOX1 by doxycycline (DOX), SOX1 inhibited cell growth and invasion in the inducible system. Repression of SOX1 by withdrawal of DOX partially reversed the malignant phenotype in cervical cells. SOX1 inhibited TCF-dependent transcriptional activity and the Wnt target genes. SOX1 also repressed the invasive phenotype by regulating the expression of invasion-related genes. Taken together, these data suggest that SOX1 can function as a tumor suppressor partly by interfering with Wnt/β-catenin signaling in cervical cancer. © 2013.
Topographical mapping of α- and β-keratins on developing chicken skin integuments: Functional interaction and evolutionary perspectives

PubMed Central

Wu, Ping; Ng, Chen Siang; Yan, Jie; Lai, Yung-Chih; Chen, Chih-Kuan; Lai, Yu-Ting; Wu, Siao-Man; Chen, Jiun-Jie; Luo, Weiqi; Widelitz, Randall B.; Li, Wen-Hsiung; Chuong, Cheng-Ming

2015-01-01

Avian integumentary organs include feathers, scales, claws, and beaks. They cover the body surface and play various functions to help adapt birds to diverse environments. These keratinized structures are mainly composed of corneous materials made of α-keratins, which exist in all vertebrates, and β-keratins, which only exist in birds and reptiles. Here, members of the keratin gene families were used to study how gene family evolution contributes to novelty and adaptation, focusing on tissue morphogenesis. Using chicken as a model, we applied RNA-seq and in situ hybridization to map α- and β-keratin genes in various skin appendages at embryonic developmental stages. The data demonstrate that temporal and spatial α- and β-keratin expression is involved in establishing the diversity of skin appendage phenotypes. Embryonic feathers express a higher proportion of β-keratin genes than other skin regions. In feather filament morphogenesis, β-keratins show intricate complexity in diverse substructures of feather branches. To explore functional interactions, we used a retrovirus transgenic system to ectopically express mutant α- or antisense β-keratin forms. α- and β-keratins show mutual dependence and mutations in either keratin type results in disrupted keratin networks and failure to form proper feather branches. Our data suggest that combinations of α- and β-keratin genes contribute to the morphological and structural diversity of different avian skin appendages, with feather-β-keratins conferring more possible composites in building intrafeather architecture complexity, setting up a platform of morphological evolution of functional forms in feathers. PMID:26598683
Identification of pathogenic genes related to rheumatoid arthritis through integrated analysis of DNA methylation and gene expression profiling.

PubMed

Zhang, Lei; Ma, Shiyun; Wang, Huailiang; Su, Hang; Su, Ke; Li, Longjie

2017-11-15

The purpose of our study was to identify new pathogenic genes used for exploring the pathogenesis of rheumatoid arthritis (RA). To screen pathogenic genes of RA, an integrated analysis was performed by using the microarray datasets in RA derived from the Gene Expression Omnibus (GEO) database. The functional annotation and potential pathways of differentially expressed genes (DEGs) were further discovered by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis. Afterwards, the integrated analysis of DNA methylation and gene expression profiling was used to screen crucial genes. In addition, we used RT-PCR and MSP to verify the expression levels and methylation status of these crucial genes in 20 synovial biopsy samples obtained from 10 RA model mice and 10 normal mice. BCL11B, CCDC88C, FCRLA and APOL6 were both up-regulated and hypomethylated in RA according to integrated analysis, RT-PCR and MSP verification. Four crucial genes (BCL11B, CCDC88C, FCRLA and APOL6) identified and analyzed in this study might be closely connected with the pathogenesis of RA. Copyright © 2017. Published by Elsevier B.V.
Molecular Evolution of piRNA and Transposon Control Pathways in Drosophila

PubMed Central

Malone, C.D.; Hannon, G.J.

2011-01-01

The mere prevalence and potential mobilization of transposable elements in eukaryotic genomes present challenges at both the organismal and population levels. Not only is transposition able to alter gene function and chromosomal structure, but loss of control over even a single active element in the germline can create an evolutionary dead end. Despite the dangers of coexistence, transposons and their activity have been shown to drive the evolution of gene function, chromosomal organization, and even population dynamics (Kazazian 2004). This implies that organisms have adopted elaborate means to balance both the positive and detrimental consequences of transposon activity. In this chapter, we focus on the fruit fly to explore some of the molecular clues into the long- and short-term adaptation to transposon colonization and persistence within eukaryotic genomes. PMID:20453205
Profiling Hyporheic Microbial Community Nitrogen Cycle and Carbohydrate Active Enzyme Gene Abundances across Seasons

NASA Astrophysics Data System (ADS)

Nelson, W. C.; Graham, E.; Stegen, J.

2016-12-01

The hyporheic zone (HZ) is the permanently inundated sediment layer between a surface channel and adjacent groundwater-saturated sediments. It has been hypothesized to play a major role in macronutrient (C, N, P) cycling in rivers. The correlation between community taxonomic composition dynamics and functional gene representation is poorly understood for hyporheic communities. To explore how microbial communities respond to temporal changes in environmental conditions, metagenomes were derived from communities captured in sterile sandpacks deployed within the HZ of the Columbia River. HMM databases were used to enumerate protein families present. Functional classification of reads allowed a general assessment of community function over time, while targeted assembly of specific genes enabled investigation of the diversity of organisms encoding these functions. Preliminary analysis of nitrogen cycle pathways shows most gene families examined to have quite steady representation across seasons, with most observed changes being less than an order of magnitude. Analysis of ammonia oxidation genes showed bacterial ammonia oxidizers (AOB) to be stably present across the year, while the archaeal amoA gene increased in late summer, peaking sharply in November, mirroring results from 16S rRNA amplicon analysis which showed an increase in Thaumarchaeal OTUs during that same period. Most glycosyl hydrolase (GH) families had low representation. Highly abundant classes of GH included the GH94 (beta-glucosidase), GH95 (1-2-alpha-L-fucosidase) and GH103 (lytic transglycosylase) families, suggesting activity on plant, fungus and insect polysaccharides and peptidoglycans. Further work is investigating the taxonomy of the sequences identified, to determine how changes in the community composition contribute to the stable gene family profiles observed. 
These results are intended to work towards a greater understanding of the role of species diversity and functional redundancy in the dynamics of community composition in response to changes in environmental conditions and stochastic processes. In addition, they will serve as a foundation for modeling generalized microbial function in the hyporheic zone, improving our ability to predict fluxes of carbon and nitrogen through riverine systems.
Network-based analysis of differentially expressed genes in cerebrospinal fluid (CSF) and blood reveals new candidate genes for multiple sclerosis

PubMed Central

Safari-Alighiarloo, Nahid; Taghizadeh, Mohammad; Tabatabaei, Seyyed Mohammad; Namaki, Saeed

2016-01-01

Background: The involvement of multiple genes and missing heritability, which are dominant in complex diseases such as multiple sclerosis (MS), calls for network biology to better elucidate their molecular basis and genetic factors. We therefore aimed to integrate interactome (protein–protein interaction (PPI)) and transcriptome data to construct and analyze PPI networks for MS. Methods: Gene expression profiles in paired cerebrospinal fluid (CSF) and peripheral blood mononuclear cell (PBMC) samples from MS patients, sampled in relapse or remission, and from controls were analyzed. Differentially expressed genes that were identified only in CSF (MS vs. control) and in PBMCs (relapse vs. remission) were separately integrated with PPI data to construct the Query-Query PPI (QQPPI) networks. The networks were further analyzed to investigate more central genes, functional modules and complexes involved in MS progression. Results: The networks were analyzed and high centrality genes were identified. Exploration of functional modules and complexes showed that the majority of high centrality genes were incorporated in biological pathways driving MS pathogenesis. Proteasome and spliceosome were also noticeable among the enriched pathways in PBMCs (relapse vs. remission), which were identified by both modularity and clique analyses. Finally, the STK4, RB1, CDKN1A, CDK1, RAC1, EZH2 and SDCBP genes in CSF (MS vs. control) and the CDC37, MAP3K3 and MYC genes in PBMCs (relapse vs. remission) were identified as potential candidate genes for MS; these were the more central genes involved in biological pathways. Discussion: This study showed that network-based analysis could explicate the complex interplay between biological processes underlying MS. Furthermore, experimental validation of the candidate genes can lead to identification of potential therapeutic targets. PMID:28028462
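The abstract does not state which centrality measures were used. As a minimal sketch of the general idea, the snippet below computes normalized degree centrality on an undirected interaction network and ranks the nodes. Degree centrality is an assumed stand-in, and the node names and edges are placeholders invented for the example, not interactions from the study.

```python
from collections import defaultdict


def degree_centrality(edges):
    """Return normalized degree centrality for an undirected edge list.

    Each node's score is its number of distinct neighbors divided by the
    maximum possible number of neighbors (n - 1).
    """
    neighbors = defaultdict(set)
    for a, b in edges:
        if a == b:
            continue  # ignore self-interactions
        neighbors[a].add(b)
        neighbors[b].add(a)
    n = len(neighbors)
    if n < 2:
        return {node: 0.0 for node in neighbors}
    return {node: len(nbrs) / (n - 1) for node, nbrs in neighbors.items()}


# Placeholder network: node names and edges are hypothetical.
toy_edges = [
    ("g1", "g2"),
    ("g1", "g3"),
    ("g1", "g4"),
    ("g3", "g2"),
    ("g5", "g6"),
]
centrality = degree_centrality(toy_edges)
ranked = sorted(centrality, key=centrality.get, reverse=True)
```

In this toy network `g1` has three of five possible neighbors and ranks first. Other measures such as betweenness or closeness would require shortest-path computations and may rank nodes differently.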
Computational Identification of the Paralogs and Orthologs of Human Cytochrome P450 Superfamily and the Implication in Drug Discovery

PubMed Central

Pan, Shu-Ting; Xue, Danfeng; Li, Zhi-Ling; Zhou, Zhi-Wei; He, Zhi-Xu; Yang, Yinxue; Yang, Tianxin; Qiu, Jia-Xuan; Zhou, Shu-Feng

2016-01-01

The human cytochrome P450 (CYP) superfamily consisting of 57 functional genes is the most important group of Phase I drug metabolizing enzymes that oxidize a large number of xenobiotics and endogenous compounds, including therapeutic drugs and environmental toxicants. The CYP superfamily has been shown to expand itself through gene duplication, and some of them become pseudogenes due to gene mutations. Orthologs and paralogs are homologous genes resulting from speciation or duplication, respectively. To explore the evolutionary and functional relationships of human CYPs, we conducted this bioinformatic study to identify their corresponding paralogs, homologs, and orthologs. The functional implications and implications in drug discovery and evolutionary biology were then discussed. GeneCards and Ensembl were used to identify the paralogs of human CYPs. We have used a panel of online databases to identify the orthologs of human CYP genes: NCBI, Ensembl Compara, GeneCards, OMA (“Orthologous MAtrix”) Browser, PANTHER, TreeFam, EggNOG, and Roundup. The results show that each human CYP has various numbers of paralogs and orthologs using GeneCards and Ensembl. For example, the paralogs of CYP2A6 include CYP2A7, 2A13, 2B6, 2C8, 2C9, 2C18, 2C19, 2D6, 2E1, 2F1, 2J2, 2R1, 2S1, 2U1, and 2W1; CYP11A1 has 6 paralogs including CYP11B1, 11B2, 24A1, 27A1, 27B1, and 27C1; CYP51A1 has only three paralogs: CYP26A1, 26B1, and 26C1; while CYP20A1 has no paralog. The majority of human CYPs are well conserved from plants, amphibians, fishes, or mammals to humans due to their important functions in physiology and xenobiotic disposition. The data from different approaches are also cross-validated and validated when experimental data are available. These findings facilitate our understanding of the evolutionary relationships and functional implications of the human CYP superfamily in drug discovery. PMID:27367670
NCBI GEO: archive for high-throughput functional genomic data.

PubMed

Barrett, Tanya; Troup, Dennis B; Wilhite, Stephen E; Ledoux, Pierre; Rudnev, Dmitry; Evangelista, Carlos; Kim, Irene F; Soboleva, Alexandra; Tomashevsky, Maxim; Marshall, Kimberly A; Phillippy, Katherine H; Sherman, Patti M; Muertter, Rolf N; Edgar, Ron

2009-01-01

The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest public repository for high-throughput gene expression data. Additionally, GEO hosts other categories of high-throughput functional genomic data, including those that examine genome copy number variations, chromatin structure, methylation status and transcription factor binding. These data are generated by the research community using high-throughput technologies like microarrays and, more recently, next-generation sequencing. The database has a flexible infrastructure that can capture fully annotated raw and processed data, enabling compliance with major community-derived scientific reporting standards such as 'Minimum Information About a Microarray Experiment' (MIAME). In addition to serving as a centralized data storage hub, GEO offers many tools and features that allow users to effectively explore, analyze and download expression data from both gene-centric and experiment-centric perspectives. This article summarizes the GEO repository structure, content and operating procedures, as well as recently introduced data mining features. GEO is freely accessible at http://www.ncbi.nlm.nih.gov/geo/.
Genome sequence analysis of a flocculant-producing bacterium, Paenibacillus shenyangensis.

PubMed

Fu, Lili; Jiang, Binhui; Liu, Jinliang; Zhao, Xin; Liu, Qian; Hu, Xiaomin

2016-03-01

This study explores the metabolic processes of Paenibacillus shenyangensis, an efficient bioflocculant-producing bacterium. The biosynthesis mechanism of bioflocculation was used to enrich the genome of Paenibacillus shenyangensis and provide a basis for molecular genetics and functional genomics analyses. According to the analysis of de novo assembly, a total of 5,501,467 bp clean reads were generated and assembled into 92 contigs. A total of 4800 unigenes were predicted, of which 4393 were annotated with a specific gene function in the NCBI-Nr database, and 3423 genes were found in the Clusters of Orthologous Groups database. Among the 168 Kyoto Encyclopedia of Genes and Genomes database, cell growth and metabolism were the main biological processes, and a potential metabolic pathway was predicted from glucose to exopolysaccharide within the starch and sucrose metabolism pathway. By using high-throughput sequencing technology, we provide a genome analysis of Paenibacillus shenyangensis that predicts the main metabolic processes and a potential pathway of exopolysaccharide biosynthesis.
Insights into structural variations and genome rearrangements in prokaryotic genomes.

PubMed

Periwal, Vinita; Scaria, Vinod

2015-01-01

Structural variations (SVs) are genomic rearrangements that affect fairly large fragments of DNA. Most of the SVs such as inversions, deletions and translocations have been largely studied in context of genetic diseases in eukaryotes. However, recent studies demonstrate that genome rearrangements can also have profound impact on prokaryotic genomes, leading to altered cell phenotype. In contrast to single-nucleotide variations, SVs provide a much deeper insight into organization of bacterial genomes at a much better resolution. SVs can confer change in gene copy number, creation of new genes, altered gene expression and many other functional consequences. High-throughput technologies have now made it possible to explore SVs at a much refined resolution in bacterial genomes. Through this review, we aim to highlight the importance of the less explored field of SVs in prokaryotic genomes and their impact. We also discuss its potential applicability in the emerging fields of synthetic biology and genome engineering where targeted SVs could serve to create sophisticated and accurate genome editing. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Patterns and architecture of genomic islands in marine bacteria

PubMed Central

2012-01-01

Background: Genomic Islands (GIs) have key roles since they modulate the structure and size of bacterial genomes displaying a diverse set of laterally transferred genes. Despite their importance, GIs in marine bacterial genomes have not been explored systematically to uncover possible trends and to analyze their putative ecological significance. Results: We carried out a comprehensive analysis of GIs in 70 selected marine bacterial genomes detected with IslandViewer to explore the distribution, patterns and functional gene content in these genomic regions. We detected 438 GIs containing a total of 8152 genes. GI number per genome was strongly and positively correlated with the total GI size. In 50% of the genomes analyzed the GIs accounted for approximately 3% of the genome length, with a maximum of 12%. Interestingly, we found transposases particularly enriched within Alphaproteobacteria GIs, and site-specific recombinases in Gammaproteobacteria GIs. We described specific Homologous Recombination GIs (HR-GIs) in several genera of marine Bacteroidetes and in Shewanella strains among others. In these HR-GIs, we recurrently found conserved genes such as the β-subunit of DNA-directed RNA polymerase, regulatory sigma factors, the elongation factor Tu and ribosomal protein genes typically associated with the core genome. Conclusions: Our results indicate that horizontal gene transfer mediated by phages, plasmids and other mobile genetic elements, and HR by site-specific recombinases play important roles in the mobility of clusters of genes between taxa and within closely related genomes, modulating the flexible pool of the genome. Our findings suggest that GIs may increase bacterial fitness under changing environmental conditions by acquiring novel foreign genes and/or modifying gene transcription and/or transduction. PMID:22839777
Exploring of the molecular mechanism of rhinitis via bioinformatics methods

PubMed Central

Song, Yufen; Yan, Zhaohui

2018-01-01

The aim of this study was to analyze gene expression profiles to explore the function and regulatory network of differentially expressed genes (DEGs) in the pathogenesis of rhinitis using bioinformatics methods. The gene expression profile of GSE43523 was downloaded from the Gene Expression Omnibus database. The dataset contained 7 seasonal allergic rhinitis samples and 5 non-allergic normal samples. DEGs between rhinitis samples and normal samples were identified via the limma package of R. The WebGestalt database was used to identify enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways of the DEGs. The differentially co-expressed pairs of the DEGs were identified via the DCGL package in R, and the differential co-expression network was constructed based on these pairs. A protein-protein interaction (PPI) network of the DEGs was constructed based on the Search Tool for the Retrieval of Interacting Genes database. A total of 263 DEGs were identified in rhinitis samples compared with normal samples, including 125 downregulated ones and 138 upregulated ones. The DEGs were enriched in 7 KEGG pathways. 308 differential co-expression gene pairs were obtained. A differential co-expression network was constructed, containing 212 nodes. In total, 148 PPI pairs of the DEGs were identified, and a PPI network was constructed based on these pairs. Bioinformatics methods could help us identify significant genes and pathways related to the pathogenesis of rhinitis. Steroid biosynthesis pathway and metabolic pathways might play important roles in the development of allergic rhinitis (AR). Genes such as CDC42 effector protein 5, solute carrier family 39 member A11 and PR/SET domain 10 might also be associated with the pathogenesis of AR, which provides references for the molecular mechanisms of AR. PMID:29257233
Adeno-Associated Virus–Mediated Gene Therapy for Metabolic Myopathy

PubMed Central

Mah, Cathryn S.; Soustek, Meghan S.; Todd, A. Gary; McCall, Angela; Smith, Barbara K.; Corti, Manuela; Falk, Darin J.

2013-01-01

Metabolic myopathies are a diverse group of rare diseases in which impaired breakdown of stored energy leads to profound muscle dysfunction ranging from exercise intolerance to severe muscle wasting. Metabolic myopathies are largely caused by functional deficiency of a single gene and are generally subcategorized into three major types of metabolic disease: mitochondrial, lipid, or glycogen. Treatment varies greatly depending on the biochemical nature of the disease, and unfortunately no definitive treatments exist for metabolic myopathy. Since this group of diseases is inherited, gene therapy is being explored as an approach to personalized medical treatment. Adeno-associated virus–based vectors in particular have been shown to be promising in the treatment of several forms of metabolic myopathy. This review will discuss the most recent advances in gene therapy efforts for the treatment of metabolic myopathies. PMID:24164240
Reduce, reuse, and recycle: developmental evolution of trait diversification.

PubMed

Preston, Jill C; Hileman, Lena C; Cubas, Pilar

2011-03-01

A major focus of evolutionary developmental (evo-devo) studies is to determine the genetic basis of variation in organismal form and function, both of which are fundamental to biological diversification. Pioneering work on metazoan and flowering plant systems has revealed conserved sets of genes that underlie the bauplan of organisms derived from a common ancestor. However, the extent to which variation in the developmental genetic toolkit mirrors variation at the phenotypic level is an active area of research. Here we explore evidence from the angiosperm evo-devo literature supporting the frugal use of genes and genetic pathways in the evolution of developmental patterning. In particular, these examples highlight the importance of genetic pleiotropy in different developmental modules, thus reducing the number of genes required in growth and development, and the reuse of particular genes in the parallel evolution of ecologically important traits.
Identification of gene expression profiles and key genes in subchondral bone of osteoarthritis using weighted gene coexpression network analysis.

PubMed

Guo, Sheng-Min; Wang, Jian-Xiong; Li, Jin; Xu, Fang-Yuan; Wei, Quan; Wang, Hai-Ming; Huang, Hou-Qiang; Zheng, Si-Lin; Xie, Yu-Jie; Zhang, Chi

2018-06-15

Osteoarthritis (OA) significantly influences the quality of life of people around the world. It is urgent to find an effective way to understand the genetic etiology of OA. We used weighted gene coexpression network analysis (WGCNA) to explore the key genes involved in the subchondral bone pathological process of OA. Fifty gene expression profiles of GSE51588 were downloaded from the Gene Expression Omnibus database. The OA-associated genes and gene ontologies were acquired from JuniorDoc. Weighted gene coexpression network analysis was used to find disease-related networks based on 21756 gene expression correlation coefficients, hub-genes with the highest connectivity in each module were selected, and the correlation between module eigengene and clinical traits was calculated. The genes in the traits-related gene coexpression modules were subject to functional annotation and pathway enrichment analysis using ClusterProfiler. A total of 73 gene modules were identified, of which 12 modules were found with high connectivity with clinical traits. Five modules were found with enriched OA-associated genes. Moreover, 310 OA-associated genes were found, and 34 of them were among hub-genes in each module. Consequently, enrichment results indicated some key metabolic pathways, such as extracellular matrix (ECM)-receptor interaction (hsa04512), focal adhesion (hsa04510), the phosphatidylinositol 3'-kinase (PI3K)-Akt signaling pathway (PI3K-AKT) (hsa04151), transforming growth factor beta pathway, and Wnt pathway. We intended to identify some core genes, collagen (COL)6A3, COL6A1, ITGA11, BAMBI, and HCK, which could influence downstream signaling pathways once they were activated. In this study, we identified important genes within key coexpression modules, which associate with a pathological process of subchondral bone in OA. Functional analysis results could provide important information to understand the mechanism of OA. © 2018 Wiley Periodicals, Inc.
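To make the notion of "connectivity" in a weighted coexpression network concrete, here is a small sketch in plain Python. It assumes an unsigned network in which the adjacency between two genes is the absolute Pearson correlation raised to a soft-thresholding power, and a gene's connectivity is the sum of its adjacencies to all other genes. The power of 6, the gene names, and the expression values are illustrative assumptions; the abstract does not report the settings actually used, and this is not the WGCNA R package itself.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def connectivity(expr, beta=6):
    """Sum of soft-thresholded adjacencies |cor| ** beta to all other genes.

    beta is an assumed soft-thresholding power for an unsigned network.
    """
    genes = list(expr)
    k = {gene: 0.0 for gene in genes}
    for i, gene_i in enumerate(genes):
        for gene_j in genes[i + 1:]:
            adjacency = abs(pearson(expr[gene_i], expr[gene_j])) ** beta
            k[gene_i] += adjacency
            k[gene_j] += adjacency
    return k


# Hypothetical expression profiles across four samples.
toy_expr = {
    "gene_a": [1.0, 2.0, 3.0, 4.0],
    "gene_b": [2.0, 4.0, 6.0, 8.0],    # perfectly correlated with gene_a
    "gene_c": [4.0, 3.0, 2.0, 1.0],    # perfectly anti-correlated with gene_a
    "gene_d": [1.0, -1.0, -1.0, 1.0],  # uncorrelated with the other three
}
toy_k = connectivity(toy_expr)
```

Because the network is unsigned, the anti-correlated `gene_c` counts as strongly connected, while `gene_d` contributes nothing. In a full analysis, connectivity would be computed within each module and the top-ranked genes per module treated as hub candidates.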
CuGene as a tool to view and explore genomic data

NASA Astrophysics Data System (ADS)

Haponiuk, Michał; Pawełkowicz, Magdalena; Przybecki, Zbigniew; Nowak, Robert M.

2017-08-01

Integrated CuGene is an easy-to-use, open-source, on-line tool that can be used to browse, analyze, and query genomic data and annotations. It places annotation tracks beneath genome coordinate positions, allowing rapid visual correlation of different types of information. It also allows users to upload and display their own experimental results or annotation sets. An important feature of the application is the ability to find similarity between sequences using four algorithms of differing accuracy. The presented tool was tested on real genomic data and is extensively used by the Polish Consortium of Cucumber Genome Sequencing.
FunCoup 3.0: database of genome-wide functional coupling networks

PubMed Central

Schmitt, Thomas; Ogris, Christoph; Sonnhammer, Erik L. L.

2014-01-01

We present an update of the FunCoup database (http://FunCoup.sbc.su.se) of functional couplings, or functional associations, between genes and gene products. Identifying these functional couplings is an important step in the understanding of higher level mechanisms performed by complex cellular processes. FunCoup distinguishes between four classes of couplings: participation in the same signaling cascade, participation in the same metabolic process, co-membership in a protein complex and physical interaction. For each of these four classes, several types of experimental and statistical evidence are combined by Bayesian integration to predict genome-wide functional coupling networks. The FunCoup framework has been completely re-implemented to allow for more frequent future updates. It contains many improvements, such as a regularization procedure to automatically downweight redundant evidences and a novel method to incorporate phylogenetic profile similarity. Several datasets have been updated and new data have been added in FunCoup 3.0. Furthermore, we have developed a new Web site, which provides powerful tools to explore the predicted networks and to retrieve detailed information about the data underlying each prediction. PMID:24185702
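The idea of combining several evidence types by Bayesian integration, with redundant evidence downweighted, can be sketched as a weighted naive Bayes sum of log-likelihood ratios. This is an illustrative assumption, not the FunCoup implementation: the function name, the prior, and the weights below are hypothetical.

```python
import math

def coupling_posterior(llrs, prior=0.001, weights=None):
    """Combine per-evidence log-likelihood ratios into a posterior probability.

    llrs    : natural-log likelihood ratios, one per evidence type
    prior   : assumed prior probability that a gene pair is coupled
    weights : optional factors in [0, 1] that downweight redundant evidence
    """
    if weights is None:
        weights = [1.0] * len(llrs)
    evidence = sum(w * llr for w, llr in zip(weights, llrs))
    log_odds = math.log(prior / (1.0 - prior)) + evidence
    return 1.0 / (1.0 + math.exp(-log_odds))
```

For example, with an even prior, a single evidence source with a likelihood ratio of 9 gives `coupling_posterior([math.log(9)], prior=0.5)`, which is 0.9 up to floating-point rounding; halving the weight of a second, redundant source lowers the combined posterior relative to counting it in full.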
FunCoup 3.0: database of genome-wide functional coupling networks.

PubMed

Schmitt, Thomas; Ogris, Christoph; Sonnhammer, Erik L L

2014-01-01

We present an update of the FunCoup database (http://FunCoup.sbc.su.se) of functional couplings, or functional associations, between genes and gene products. Identifying these functional couplings is an important step in the understanding of higher level mechanisms performed by complex cellular processes. FunCoup distinguishes between four classes of couplings: participation in the same signaling cascade, participation in the same metabolic process, co-membership in a protein complex and physical interaction. For each of these four classes, several types of experimental and statistical evidence are combined by Bayesian integration to predict genome-wide functional coupling networks. The FunCoup framework has been completely re-implemented to allow for more frequent future updates. It contains many improvements, such as a regularization procedure to automatically downweight redundant evidences and a novel method to incorporate phylogenetic profile similarity. Several datasets have been updated and new data have been added in FunCoup 3.0. Furthermore, we have developed a new Web site, which provides powerful tools to explore the predicted networks and to retrieve detailed information about the data underlying each prediction.
[Effect of methyl tertiary butyl ether on the expression of proto-oncogenes and function genes].

PubMed

Zhou, W; Huang, G; Zhang, H

1999-05-30

Methyl tertiary butyl ether (MTBE) is a new gasoline additive, which is used to increase the combustion of gasoline and to reduce the emission of harmful exhaust from automobiles. The mechanism for the carcinogenesis of MTBE in animals is not clear. An immunohistochemistry method was used to detect the effect of MTBE on the expression of c-myc and p21 proteins in NIH3T3 cells. A dot hybridization method was used to explore the expression of the c-myc gene and the GST-P (glutathione S-transferase-P) gene in MTBE-treated rats. The results showed that MTBE could enhance the expression of c-myc protein, but had no effect on p21 protein. MTBE could induce high expression of the c-myc gene, and had no effect on the expression of the GST-P gene. These results suggest that the high expression of the c-myc gene induced by MTBE might be one of the mechanisms of its carcinogenicity in animals.
In vivo functional analysis of L-rhamnose metabolic pathway in Aspergillus niger: a tool to identify the potential inducer of RhaR.

PubMed

Khosravi, Claire; Kun, Roland Sándor; Visser, Jaap; Aguilar-Pontes, María Victoria; de Vries, Ronald P; Battaglia, Evy

2017-11-06

The genes of the non-phosphorylative L-rhamnose catabolic pathway have been identified for several yeast species. In Scheffersomyces stipitis, all L-rhamnose pathway genes are organized in a cluster, which is conserved in Aspergillus niger, except for the lra-4 ortholog (lraD). The A. niger cluster also contains the gene encoding the L-rhamnose responsive transcription factor (RhaR) that has been shown to control the expression of genes involved in L-rhamnose release and catabolism. In this paper, we confirmed the function of the first three putative L-rhamnose utilisation genes from A. niger through gene deletion. We explored the identity of the inducer of the pathway regulator (RhaR) through expression analysis of the deletion mutants grown in transfer experiments to L-rhamnose and L-rhamnonate. Reduced expression of L-rhamnose-induced genes on L-rhamnose in lraA and lraB deletion strains, but not on L-rhamnonate (the product of LraB), demonstrates that the inducer of the pathway is L-rhamnonate or a compound downstream of it. Reduced expression of these genes in the lraC deletion strain on L-rhamnonate shows that it is in fact a downstream product of L-rhamnonate. This work showed that the inducer of RhaR is beyond L-rhamnonate dehydratase (LraC) and is likely to be the 2-keto-3-L-deoxyrhamnonate.
A unified design space of synthetic stripe-forming networks

PubMed Central

Schaerli, Yolanda; Munteanu, Andreea; Gili, Magüi; Cotterell, James; Sharpe, James; Isalan, Mark

2014-01-01

Synthetic biology is a promising tool to study the function and properties of gene regulatory networks. Gene circuits with predefined behaviours have been successfully built and modelled, but largely on a case-by-case basis. Here we go beyond individual networks and explore both computationally and synthetically the design space of possible dynamical mechanisms for 3-node stripe-forming networks. First, we computationally test every possible 3-node network for stripe formation in a morphogen gradient. We discover four different dynamical mechanisms to form a stripe and identify the minimal network of each group. Next, with the help of newly established engineering criteria we build these four networks synthetically and show that they indeed operate with four fundamentally distinct mechanisms. Finally, this close match between theory and experiment allows us to infer and subsequently build a 2-node network that represents the archetype of the explored design space. PMID:25247316
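Testing "every possible 3-node network" implies an exhaustive enumeration of wirings. A minimal Python sketch of such an enumeration follows, under assumptions that are not stated in the abstract: each of the nine directed links (self-regulation included) is taken to be activating (+1), repressing (-1), or absent (0), and the node names A, B, C are hypothetical. The dynamical simulation that decides whether a wiring forms a stripe is not reproduced here.

```python
from itertools import product

NODES = ("A", "B", "C")                                  # hypothetical node names
EDGES = [(src, dst) for src in NODES for dst in NODES]   # 9 links, self-loops included
SIGNS = (-1, 0, 1)                                       # repression, absent, activation

def all_topologies():
    """Yield every signed wiring as a dict mapping (source, target) -> sign."""
    for signs in product(SIGNS, repeat=len(EDGES)):
        yield dict(zip(EDGES, signs))

def edge_count(topology):
    """Number of links that are actually present in a wiring."""
    return sum(1 for sign in topology.values() if sign != 0)

# An incoherent feed-forward wiring written in the same representation:
# A activates B and C, while B represses C.
incoherent_ffl = {edge: 0 for edge in EDGES}
incoherent_ffl[("A", "B")] = 1
incoherent_ffl[("A", "C")] = 1
incoherent_ffl[("B", "C")] = -1
```

Under these assumptions the search space has 3**9 = 19683 wirings before any symmetry reduction. The incoherent feed-forward wiring is included only as a familiar example of the representation; the abstract does not say whether it matches one of the four mechanisms found.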
RNA Interference (RNAi) Induced Gene Silencing: A Promising Approach of Hi-Tech Plant Breeding.

PubMed

Younis, Adnan; Siddique, Muhammad Irfan; Kim, Chang-Kil; Lim, Ki-Byung

2014-01-01

RNA interference (RNAi) is a promising gene regulatory approach in functional genomics that has a significant impact on crop improvement, permitting down-regulation of gene expression with greater precision without affecting the expression of other genes. The RNAi mechanism is expedited by small molecules of interfering RNA that effectively suppress a gene of interest. RNAi has also been exploited in plants for resistance against pathogens, insect pests, nematodes, and viruses that cause significant economic losses. Besides their significance in the maintenance of genome integrity as well as in growth and development, RNAi induced gene syntheses are vital in plant stress management. Modifying the genes by the interference of small RNAs is one of the ways through which plants react to environmental stresses. Hence, investigating the role of small RNAs in regulating gene expression helps researchers to explore the potential of small RNAs in abiotic and biotic stress management. This novel approach opens new avenues for crop improvement by developing disease resistant, abiotic or biotic stress tolerant, and high yielding elite varieties.
RNA Interference (RNAi) Induced Gene Silencing: A Promising Approach of Hi-Tech Plant Breeding

PubMed Central

Younis, Adnan; Siddique, Muhammad Irfan; Kim, Chang-Kil; Lim, Ki-Byung

2014-01-01

RNA interference (RNAi) is a promising gene regulatory approach in functional genomics that has a significant impact on crop improvement, permitting down-regulation of gene expression with greater precision without affecting the expression of other genes. The RNAi mechanism is expedited by small molecules of interfering RNA that effectively suppress a gene of interest. RNAi has also been exploited in plants for resistance against pathogens, insect pests, nematodes, and viruses that cause significant economic losses. Besides their significance in the maintenance of genome integrity as well as in growth and development, RNAi induced gene syntheses are vital in plant stress management. Modifying the genes by the interference of small RNAs is one of the ways through which plants react to environmental stresses. Hence, investigating the role of small RNAs in regulating gene expression helps researchers to explore the potential of small RNAs in abiotic and biotic stress management. This novel approach opens new avenues for crop improvement by developing disease resistant, abiotic or biotic stress tolerant, and high yielding elite varieties. PMID:25332689
PRISM offers a comprehensive genomic approach to transcription factor function prediction

PubMed Central

Wenger, Aaron M.; Clarke, Shoa L.; Guturu, Harendra; Chen, Jenny; Schaar, Bruce T.; McLean, Cory Y.; Bejerano, Gill

2013-01-01

The human genome encodes 1500–2000 different transcription factors (TFs). ChIP-seq is revealing the global binding profiles of a fraction of TFs in a fraction of their biological contexts. These data show that the majority of TFs bind directly next to a large number of context-relevant target genes, that most binding is distal, and that binding is context specific. Because of the effort and cost involved, ChIP-seq is seldom used in search of novel TF function. Such exploration is instead done using expression perturbation and genetic screens. Here we propose a comprehensive computational framework for transcription factor function prediction. We curate 332 high-quality nonredundant TF binding motifs that represent all major DNA binding domains, and improve cross-species conserved binding site prediction to obtain 3.3 million conserved, mostly distal, binding site predictions. We combine these with 2.4 million facts about all human and mouse gene functions, in a novel statistical framework, in search of enrichments of particular motifs next to groups of target genes of particular functions. Rigorous parameter tuning and a harsh null are used to minimize false positives. Our novel PRISM (predicting regulatory information from single motifs) approach obtains 2543 TF function predictions in a large variety of contexts, at a false discovery rate of 16%. The predictions are highly enriched for validated TF roles, and 45 of 67 (67%) tested binding site regions in five different contexts act as enhancers in functionally matched cells. PMID:23382538
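Searching for motifs enriched next to groups of genes sharing a function can be illustrated with a generic over-representation test. The abstract describes a novel statistical framework without specifying it, so the sketch below swaps in a plain hypergeometric tail test as a stand-in; it is not the PRISM statistic, and the function and parameter names are hypothetical.

```python
from math import comb

def enrichment_pvalue(n_genome, n_annotated, n_targets, n_overlap):
    """Hypergeometric tail probability P(X >= n_overlap).

    n_genome    : total number of genes considered
    n_annotated : genes carrying the functional annotation of interest
    n_targets   : genes with a predicted binding site for the motif
    n_overlap   : target genes that also carry the annotation
    """
    upper = min(n_annotated, n_targets)
    total = comb(n_genome, n_targets)
    tail = sum(
        comb(n_annotated, k) * comb(n_genome - n_annotated, n_targets - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total
```

A small p-value from such a test says only that the overlap is larger than expected by chance under random sampling of genes; controlling a false discovery rate across many motif and function pairs, as the abstract reports, would need an additional multiple-testing step that is not shown.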
Think like a sponge: The genetic signal of sensory cells in sponges.

PubMed

Mah, Jasmine L; Leys, Sally P

2017-11-01

A complex genetic repertoire underlies the apparently simple body plan of sponges. Among the genes present in poriferans are those fundamental to the sensory and nervous systems of other animals. Sponges are dynamic and sensitive animals and it is intuitive to link these genes to behaviour. The proposal that ctenophores are the earliest diverging metazoan has led to the question of whether sponges possess a 'pre-nervous' system or have undergone nervous system loss. Both lines of thought generally assume that the last common ancestor of sponges and eumetazoans possessed the genetic modules that underlie sensory abilities. By corollary extant sponges may possess a sensory cell homologous to one present in the last common ancestor, a hypothesis that has been studied by gene expression. We have performed a meta-analysis of all gene expression studies published to date to explore whether gene expression is indicative of a feature's sensory function. In sponges we find that eumetazoan sensory-neural markers are not particularly expressed in structures with known sensory functions. Instead it is common for these genes to be expressed in cells with no known or uncharacterized sensory function. Indeed, many sensory-neural markers so far studied are expressed during development, perhaps because many are transcription factors. This suggests that the genetic signal of a sponge sensory cell is dissimilar enough to be unrecognizable when compared to a bilaterian sensory or neural cell. It is possible that sensory-neural markers have as yet unknown functions in sponge cells, such as assembling an immunological synapse in the larval globular cell. Furthermore, the expression of sensory-neural markers in non-sensory cells, such as adult and larval epithelial cells, suggest that these cells may have uncharacterized sensory functions. 
While this does not rule out the co-option of ancestral sensory modules in later evolving groups, a distinct genetic foundation may underlie the sponge sensory system. Copyright © 2017 Elsevier Inc. All rights reserved.
Single-Cell RNA-Seq Reveals the Transcriptional Landscape and Heterogeneity of Aortic Macrophages in Murine Atherosclerosis.

PubMed

Cochain, Clément; Vafadarnejad, Ehsan; Arampatzi, Panagiota; Jaroslav, Pelisek; Winkels, Holger; Ley, Klaus; Wolf, Dennis; Saliba, Antoine-Emmanuel; Zernecke, Alma

2018-03-15

Rationale: It is assumed that atherosclerotic arteries contain several macrophage subsets endowed with specific functions. The precise identity of these subsets is poorly characterized as they have been defined by the expression of a restricted number of markers. Objective: We have applied single-cell RNA-seq as an unbiased profiling strategy to interrogate and classify aortic macrophage heterogeneity at the single-cell level in atherosclerosis. Methods and Results: We performed single-cell RNA sequencing of total aortic CD45+ cells extracted from the non-diseased (chow fed) and atherosclerotic (11 weeks of high fat diet) aorta of Ldlr-/- mice. Unsupervised clustering singled out 13 distinct aortic cell clusters. Among the myeloid cell populations, Resident-like macrophages with a gene expression profile similar to aortic resident macrophages were found in healthy and diseased aortae, whereas monocytes, monocyte-derived dendritic cells (MoDC), and two populations of macrophages were almost exclusively detectable in atherosclerotic aortae, comprising Inflammatory macrophages showing enrichment in Il1b, and previously undescribed TREM2hi macrophages. Differential gene expression and gene ontology enrichment analyses revealed specific gene expression patterns distinguishing these three macrophage subsets and MoDC, and uncovered putative functions of each cell type. Notably, TREM2hi macrophages appeared to be endowed with specialized functions in lipid metabolism and catabolism, and presented a gene expression signature reminiscent of osteoclasts, suggesting a role in lesion calcification. TREM2 expression was moreover detected in human lesional macrophages. Importantly, these macrophage populations were present also in advanced atherosclerosis and in Apoe-/- aortae, indicating relevance of our findings in different stages of atherosclerosis and mouse models.
Conclusions: These data unprecedentedly uncovered the transcriptional landscape and phenotypic heterogeneity of aortic macrophages and MoDCs in atherosclerotic and identified previously unrecognized macrophage populations and their gene expression signature, suggesting specialized functions. Our findings will open up novel opportunities to explore distinct myeloid cell populations and their functions in atherosclerosis.
Growth Trade-Offs Accompany the Emergence of Glycolytic Metabolism in Shewanella oneidensis MR-1

DOE PAGES

Chubiz, Lon M.; Marx, Christopher J.

2017-03-13

Bacteria increase their metabolic capacity via the acquisition of genetic material or by the mutation of genes already present in the genome. Here, we explore the mechanisms and trade-offs involved when Shewanella oneidensis, a bacterium that typically consumes small organic and amino acids, rapidly evolves to expand its metabolic capacity to catabolize glucose after a short period of adaptation to a glucose-rich environment. Using whole-genome sequencing and genetic approaches, we discovered that deletions in a region including the transcriptional repressor (nagR) that regulates the expression of genes associated with catabolism of N-acetylglucosamine are the common basis for evolved glucose metabolism across populations. The loss of nagR results in the constitutive expression of genes for an N-acetylglucosamine permease (nagP) and kinase (nagK). We demonstrate that promiscuous activities of both NagP and NagK toward glucose allow for the transport and phosphorylation of glucose to glucose-6-phosphate, the initial events of glycolysis otherwise thought to be absent in S. oneidensis. 13C-based metabolic flux analysis uncovered that subsequent utilization was mediated by the Entner-Doudoroff pathway. This is an example whereby gene loss and preexisting enzymatic promiscuity, and not gain-of-function mutations, were the drivers of increased metabolic capacity. However, we observed a significant decrease in the growth rate on lactate after adaptation to glucose catabolism, suggesting that trade-offs may explain why glycolytic function may not be readily observed in S. oneidensis in natural environments despite it being readily accessible through just a single mutational event. Gains in metabolic capacity are frequently associated with the acquisition of novel genetic material via natural or engineered horizontal gene transfer events.
Here, we explored how a bacterium that typically consumes small organic acids and amino acids expands its metabolic capacity to include glucose via a loss of genetic material, a process frequently associated with a deterioration of metabolic function. Our findings highlight how the natural promiscuity of transporters and enzymes can be a key driver in expanding metabolic diversity and that many bacteria may possess a latent metabolic capacity accessible through one or a few mutations that remove regulatory functions. Our discovery of trade-offs between growth on lactate and on glucose suggests why this easily gained trait is not observed in nature.
Growth Trade-Offs Accompany the Emergence of Glycolytic Metabolism in Shewanella oneidensis MR-1

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chubiz, Lon M.; Marx, Christopher J.

Bacteria increase their metabolic capacity via the acquisition of genetic material or by the mutation of genes already present in the genome. Here, we explore the mechanisms and trade-offs involved when Shewanella oneidensis, a bacterium that typically consumes small organic and amino acids, rapidly evolves to expand its metabolic capacity to catabolize glucose after a short period of adaptation to a glucose-rich environment. Using whole-genome sequencing and genetic approaches, we discovered that deletions in a region including the transcriptional repressor (nagR) that regulates the expression of genes associated with catabolism of N-acetylglucosamine are the common basis for evolved glucose metabolism across populations. The loss of nagR results in the constitutive expression of genes for an N-acetylglucosamine permease (nagP) and kinase (nagK). We demonstrate that promiscuous activities of both NagP and NagK toward glucose allow for the transport and phosphorylation of glucose to glucose-6-phosphate, the initial events of glycolysis otherwise thought to be absent in S. oneidensis. 13C-based metabolic flux analysis uncovered that subsequent utilization was mediated by the Entner-Doudoroff pathway. This is an example whereby gene loss and preexisting enzymatic promiscuity, and not gain-of-function mutations, were the drivers of increased metabolic capacity. However, we observed a significant decrease in the growth rate on lactate after adaptation to glucose catabolism, suggesting that trade-offs may explain why glycolytic function may not be readily observed in S. oneidensis in natural environments despite it being readily accessible through just a single mutational event. Gains in metabolic capacity are frequently associated with the acquisition of novel genetic material via natural or engineered horizontal gene transfer events.
Here, we explored how a bacterium that typically consumes small organic acids and amino acids expands its metabolic capacity to include glucose via a loss of genetic material, a process frequently associated with a deterioration of metabolic function. Our findings highlight how the natural promiscuity of transporters and enzymes can be a key driver in expanding metabolic diversity and that many bacteria may possess a latent metabolic capacity accessible through one or a few mutations that remove regulatory functions. Our discovery of trade-offs between growth on lactate and on glucose suggests why this easily gained trait is not observed in nature.
PROS-1/Prospero Is a Major Regulator of the Glia-Specific Secretome Controlling Sensory-Neuron Shape and Function in C. elegans.

PubMed

Wallace, Sean W; Singhvi, Aakanksha; Liang, Yupu; Lu, Yun; Shaham, Shai

2016-04-19

Sensory neurons are an animal's gateway to the world, and their receptive endings, the sites of sensory signal transduction, are often associated with glia. Although glia are known to promote sensory-neuron functions, the molecular bases of these interactions are poorly explored. Here, we describe a post-developmental glial role for the PROS-1/Prospero/PROX1 homeodomain protein in sensory-neuron function in C. elegans. Using glia expression profiling, we demonstrate that, unlike previously characterized cell fate roles, PROS-1 functions post-embryonically to control sense-organ glia-specific secretome expression. PROS-1 functions cell autonomously to regulate glial secretion and membrane structure, and non-cell autonomously to control the shape and function of the receptive endings of sensory neurons. Known glial genes controlling sensory-neuron function are PROS-1 targets, and we identify additional PROS-1-dependent genes required for neuron attributes. Drosophila Prospero and vertebrate PROX1 are expressed in post-mitotic sense-organ glia and astrocytes, suggesting conserved roles for this class of transcription factors. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Genome-guided exploration of metabolic features of Streptomyces peucetius ATCC 27952: past, current, and prospect.

PubMed

Thuan, Nguyen Huy; Dhakal, Dipesh; Pokhrel, Anaya Raj; Chu, Luan Luong; Van Pham, Thi Thuy; Shrestha, Anil; Sohng, Jae Kyung

2018-05-01

Streptomyces peucetius ATCC 27952 produces two major anthracyclines, doxorubicin (DXR) and daunorubicin (DNR), which are potent chemotherapeutic agents for the treatment of several cancers. In order to gain detailed insight on genetics and biochemistry of the strain, the complete genome was determined and analyzed. The result showed that its complete sequence contains 7187 protein coding genes in a total of 8,023,114 bp, whereas 87% of the genome contributed to the protein coding region. The genomic sequence included 18 rRNA, 66 tRNAs, and 3 non-coding RNAs. In silico studies predicted ~ 68 biosynthetic gene clusters (BCGs) encoding diverse classes of secondary metabolites, including non-ribosomal polyketide synthase (NRPS), polyketide synthase (PKS I, II, and III), terpenes, and others. Detailed analysis of the genome sequence revealed versatile biocatalytic enzymes such as cytochrome P450 (CYP), electron transfer systems (ETS) genes, methyltransferase (MT), glycosyltransferase (GT). In addition, numerous functional genes (transporter gene, SOD, etc.) and regulatory genes (afsR-sp, metK-sp, etc.) involved in the regulation of secondary metabolites were found. This minireview summarizes the genome-based genome mining (GM) of diverse BCGs and genome exploration (GE) of versatile biocatalytic enzymes, and other enzymes involved in maintenance and regulation of metabolism of S. peucetius. The detailed analysis of genome sequence provides critically important knowledge useful in the bioengineering of the strain or harboring catalytically efficient enzymes for biotechnological applications.

Exploring the complexity of intellectual disability in fetal alcohol spectrum disorders.

PubMed

Chokroborty-Hoque, Aniruddho; Alberry, Bonnie; Singh, Shiva M

2014-01-01

Brain development in mammals is long lasting. It begins early during embryonic growth and is finalized in early adulthood. This progression represents a delicate choreography of molecular, cellular, and physiological processes initiated and directed by the fetal genotype in close interaction with environment. Not surprisingly, most aberrations in brain functioning including intellectual disability (ID) are attributed to either gene(s), or environment or the interaction of the two. The ensuing complexity has made the assessment of this choreography, ever challenging. A model to assess this complexity has used a mouse model (C57BL/6J or B6) that is subjected to prenatal alcohol exposure. The resulting pups show learning and memory deficits similar to patients with fetal alcohol spectrum disorder (FASD), which is associated with life-long changes in gene expression. Interestingly, this change in gene expression underlies epigenetic processes including DNA methylation and miRNAs. This paradigm is applicable to ethanol exposure at different developmental times (binge at trimesters 1, 2, and 3 as well as continuous preference drinking (70%) of 10% alcohol by B6 females during pregnancy). The exposure leads to life-long changes in neural epigenetic marks, gene expression, and a variety of defects in neurodevelopment and CNS function. We argue that this cascade may be reversed postnatally via drugs, chemicals, and environment including maternal care. Such conclusions are supported by two sets of results. First, antipsychotic drugs that are used to treat ID including psychosis function via changes in DNA methylation, a major epigenetic mark. Second, post-natal environment may improve (with enriched environments) or worsen (with negative and maternal separation stress) the cognitive ability of pups that were prenatally exposed to ethanol as well as their matched controls. 
In this review, we will discuss operational epigenetic mechanisms involved in the development of intellectual ability/disability in response to alcohol during prenatal or post-natal development. In doing so, we will explore the potential of epigenetic manipulation in the treatment of FASD and related disorders implicated in ID.
Exploring the Complexity of Intellectual Disability in Fetal Alcohol Spectrum Disorders

PubMed Central

Chokroborty-Hoque, Aniruddho; Alberry, Bonnie; Singh, Shiva M.

2014-01-01

Brain development in mammals is long lasting. It begins early during embryonic growth and is finalized in early adulthood. This progression represents a delicate choreography of molecular, cellular, and physiological processes initiated and directed by the fetal genotype in close interaction with environment. Not surprisingly, most aberrations in brain functioning including intellectual disability (ID) are attributed to either gene(s), or environment or the interaction of the two. The ensuing complexity has made the assessment of this choreography, ever challenging. A model to assess this complexity has used a mouse model (C57BL/6J or B6) that is subjected to prenatal alcohol exposure. The resulting pups show learning and memory deficits similar to patients with fetal alcohol spectrum disorder (FASD), which is associated with life-long changes in gene expression. Interestingly, this change in gene expression underlies epigenetic processes including DNA methylation and miRNAs. This paradigm is applicable to ethanol exposure at different developmental times (binge at trimesters 1, 2, and 3 as well as continuous preference drinking (70%) of 10% alcohol by B6 females during pregnancy). The exposure leads to life-long changes in neural epigenetic marks, gene expression, and a variety of defects in neurodevelopment and CNS function. We argue that this cascade may be reversed postnatally via drugs, chemicals, and environment including maternal care. Such conclusions are supported by two sets of results. First, antipsychotic drugs that are used to treat ID including psychosis function via changes in DNA methylation, a major epigenetic mark. Second, post-natal environment may improve (with enriched environments) or worsen (with negative and maternal separation stress) the cognitive ability of pups that were prenatally exposed to ethanol as well as their matched controls. 
In this review, we will discuss operational epigenetic mechanisms involved in the development of intellectual ability/disability in response to alcohol during prenatal or post-natal development. In doing so, we will explore the potential of epigenetic manipulation in the treatment of FASD and related disorders implicated in ID. PMID:25207264
Whole genome expression and biochemical correlates of extreme constitutional types defined in Ayurveda.

PubMed

Prasher, Bhavana; Negi, Sapna; Aggarwal, Shilpi; Mandal, Amit K; Sethi, Tav P; Deshmukh, Shailaja R; Purohit, Sudha G; Sengupta, Shantanu; Khanna, Sangeeta; Mohammad, Farhan; Garg, Gaurav; Brahmachari, Samir K; Mukerji, Mitali

2008-09-09

Ayurveda is an ancient system of personalized medicine documented and practiced in India since 1500 B.C. According to this system an individual's basic constitution to a large extent determines predisposition and prognosis to diseases as well as therapy and life-style regime. Ayurveda describes seven broad constitution types (Prakritis) each with a varying degree of predisposition to different diseases. Amongst these, three most contrasting types, Vata, Pitta, Kapha, are the most vulnerable to diseases. In the realm of modern predictive medicine, efforts are being directed towards capturing disease phenotypes with greater precision for successful identification of markers for prospective disease conditions. In this study, we explore whether the different constitution types as described in Ayurveda have molecular correlates. Normal individuals of the three most contrasting constitutional types were identified following phenotyping criteria described in Ayurveda in an Indian population of Indo-European origin. The peripheral blood samples of these individuals were analysed for genome wide expression levels, biochemical and hematological parameters. Gene Ontology (GO) and pathway based analysis was carried out on differentially expressed genes to explore if there were significant enrichments of functional categories among Prakriti types. Individuals from the three most contrasting constitutional types exhibit striking differences with respect to biochemical and hematological parameters and at genome wide expression levels. Biochemical profiles like liver function tests, lipid profiles, and hematological parameters like haemoglobin exhibited differences between Prakriti types. Functional categories of genes showing differential expression among Prakriti types were significantly enriched in core biological processes like transport, regulation of cyclin dependent protein kinase activity, immune response and regulation of blood coagulation.
A significant enrichment of housekeeping, disease related and hub genes was observed in these extreme constitution types. Ayurveda based method of phenotypic classification of extreme constitutional types allows us to uncover genes that may contribute to system level differences in normal individuals which could lead to differential disease predisposition. This is a first attempt towards unraveling the clinical phenotyping principle of a traditional system of medicine in terms of modern biology. An integration of Ayurveda with genomics holds potential and promise for future predictive medicine.
Whole genome expression and biochemical correlates of extreme constitutional types defined in Ayurveda

PubMed Central

Prasher, Bhavana; Negi, Sapna; Aggarwal, Shilpi; Mandal, Amit K; Sethi, Tav P; Deshmukh, Shailaja R; Purohit, Sudha G; Sengupta, Shantanu; Khanna, Sangeeta; Mohammad, Farhan; Garg, Gaurav; Brahmachari, Samir K; Mukerji, Mitali

2008-01-01

Background: Ayurveda is an ancient system of personalized medicine documented and practiced in India since 1500 B.C. According to this system, an individual's basic constitution to a large extent determines predisposition and prognosis to diseases as well as therapy and life-style regime. Ayurveda describes seven broad constitution types (Prakritis), each with a varying degree of predisposition to different diseases. Amongst these, the three most contrasting types, Vata, Pitta, and Kapha, are the most vulnerable to diseases. In the realm of modern predictive medicine, efforts are being directed towards capturing disease phenotypes with greater precision for successful identification of markers for prospective disease conditions. In this study, we explore whether the different constitution types as described in Ayurveda have molecular correlates. Methods: Normal individuals of the three most contrasting constitutional types were identified following phenotyping criteria described in Ayurveda in an Indian population of Indo-European origin. The peripheral blood samples of these individuals were analysed for genome-wide expression levels and biochemical and hematological parameters. Gene Ontology (GO) and pathway-based analysis was carried out on differentially expressed genes to explore whether there were significant enrichments of functional categories among Prakriti types. Results: Individuals from the three most contrasting constitutional types exhibit striking differences with respect to biochemical and hematological parameters and at genome-wide expression levels. Biochemical profiles like liver function tests, lipid profiles, and hematological parameters like haemoglobin exhibited differences between Prakriti types. Functional categories of genes showing differential expression among Prakriti types were significantly enriched in core biological processes like transport, regulation of cyclin dependent protein kinase activity, immune response and regulation of blood coagulation. 
A significant enrichment of housekeeping, disease-related and hub genes was observed in these extreme constitution types. Conclusion: The Ayurveda-based method of phenotypic classification of extreme constitutional types allows us to uncover genes that may contribute to system-level differences in normal individuals, which could lead to differential disease predisposition. This is a first attempt towards unraveling the clinical phenotyping principle of a traditional system of medicine in terms of modern biology. An integration of Ayurveda with genomics holds potential and promise for future predictive medicine. PMID:18782426
Gastrointestinal Fibroblasts Have Specialized, Diverse Transcriptional Phenotypes: A Comprehensive Gene Expression Analysis of Human Fibroblasts

PubMed Central

Ishii, Genichiro; Aoyagi, Kazuhiko; Sasaki, Hiroki; Ochiai, Atsushi

2015-01-01

Background: Fibroblasts are the principal stromal cells that exist in whole organs and play vital roles in many biological processes. Although the functional diversity of fibroblasts has been estimated, a comprehensive analysis of fibroblasts from the whole body has not been performed and their transcriptional diversity has not been sufficiently explored. The aim of this study was to elucidate the transcriptional diversity of human fibroblasts within the whole body. Methods: Global gene expression analysis was performed on 63 human primary fibroblasts from 13 organs. Of these, 32 fibroblasts from gastrointestinal organs (gastrointestinal fibroblasts: GIFs) were obtained from a pair of 2 anatomical sites: the submucosal layer (submucosal fibroblasts: SMFs) and the subperitoneal layer (subperitoneal fibroblasts: SPFs). Using hierarchical clustering analysis, we elucidated identifiable subgroups of fibroblasts and analyzed the transcriptional character of each subgroup. Results: In unsupervised clustering, 2 major clusters that separate GIFs and non-GIFs were observed. Organ- and anatomical site-dependent clusters within GIFs were also observed. The signature genes that discriminated GIFs from non-GIFs, SMFs from SPFs, and the fibroblasts of one organ from another organ consisted of genes associated with transcriptional regulation, signaling ligands, and extracellular matrix remodeling. Conclusions: GIFs are characteristic fibroblasts with specific expression of genes related to transcriptional regulation, signaling ligands, and extracellular matrix remodeling. In addition, the anatomical site- and organ-dependent diversity of GIFs was also discovered. These features of GIFs contribute to their specific physiological function and homeostatic maintenance, and create a functional diversity of the gastrointestinal tract. PMID:26046848
Characterisation of Four LIM Protein-Encoding Genes Involved in Infection-Related Development and Pathogenicity by the Rice Blast Fungus Magnaporthe oryzae

PubMed Central

Li, Ya; Yue, Xiaofeng; Que, Yawei; Yan, Xia; Ma, Zhonghua; Talbot, Nicholas J.; Wang, Zhengyi

2014-01-01

LIM domain proteins contain contiguous double-zinc finger domains and play important roles in cytoskeletal re-organisation and organ development in multi-cellular eukaryotes. Here, we report the characterization of four genes encoding LIM proteins in the rice blast fungus Magnaporthe oryzae. Targeted gene replacement of either the paxillin-encoding gene, PAX1, or LRG1 resulted in a significant reduction in hyphal growth and loss of pathogenicity, while deletion of RGA1 caused defects in conidiogenesis and appressorium development. A fourth LIM domain gene, LDP1, was not required for infection-associated development by M. oryzae. Live cell imaging revealed that Lrg1-GFP and Rga1-GFP both localize to septal pores, while Pax1-GFP is present in the cytoplasm. To explore the function of individual LIM domains, we carried out systematic deletion of each LIM domain, which revealed the importance of the Lrg1-LIM2 and Lrg1-RhoGAP domains for Lrg1 function and overlapping functions of the three LIM domains of Pax1. Interestingly, deletion of either PAX1 or LRG1 led to decreased sensitivity to cell wall-perturbing agents, such as Congo Red and SDS (sodium dodecyl sulfate). qRT-PCR analysis demonstrated the importance of both Lrg1 and Pax1 to regulation of genes associated with cell wall biogenesis. When considered together, our results indicate that LIM domain proteins are key regulators of infection-associated morphogenesis by the rice blast fungus. PMID:24505448
Functional Conservation of Coenzyme Q Biosynthetic Genes among Yeasts, Plants, and Humans

PubMed Central

Hayashi, Kazuhiro; Ogiyama, Yuki; Yokomi, Kazumasa; Nakagawa, Tsuyoshi; Kaino, Tomohiro; Kawamukai, Makoto

2014-01-01

Coenzyme Q (CoQ) is an essential factor for aerobic growth and oxidative phosphorylation in the electron transport system. The biosynthetic pathway for CoQ has been proposed mainly from biochemical and genetic analyses of Escherichia coli and Saccharomyces cerevisiae; however, the biosynthetic pathway in higher eukaryotes has been explored in only a limited number of studies. We previously reported the roles of several genes involved in CoQ synthesis in the fission yeast Schizosaccharomyces pombe. Here, we expand these findings by identifying ten genes (dps1, dlp1, ppt1, and coq3–9) that are required for CoQ synthesis. CoQ10-deficient S. pombe coq deletion strains were generated and characterized. All mutant fission yeast strains were sensitive to oxidative stress, produced a large amount of sulfide, required an antioxidant to grow on minimal medium, and did not survive at the stationary phase. To compare the biosynthetic pathway of CoQ in fission yeast with that in higher eukaryotes, the ability of CoQ biosynthetic genes from humans and plants (Arabidopsis thaliana) to functionally complement the S. pombe coq deletion strains was determined. With the exception of COQ9, expression of all other human and plant COQ genes recovered CoQ10 production by the fission yeast coq deletion strains, although the addition of a mitochondrial targeting sequence was required for human COQ3 and COQ7, as well as A. thaliana COQ6. In summary, this study describes the functional conservation of CoQ biosynthetic genes between yeasts, humans, and plants. PMID:24911838
Adaptive genomic divergence under high gene flow between freshwater and brackish-water ecotypes of prickly sculpin (Cottus asper) revealed by Pool-Seq.

PubMed

Dennenmoser, Stefan; Vamosi, Steven M; Nolte, Arne W; Rogers, Sean M

2017-01-01

Understanding the genomic basis of adaptive divergence in the presence of gene flow remains a major challenge in evolutionary biology. In prickly sculpin (Cottus asper), an abundant euryhaline fish in northwestern North America, high genetic connectivity among brackish-water (estuarine) and freshwater (tributary) habitats of coastal rivers does not preclude the build-up of neutral genetic differentiation and emergence of different life history strategies. Because these two habitats present different osmotic niches, we predicted high genetic differentiation at known teleost candidate genes underlying salinity tolerance and osmoregulation. We applied whole-genome sequencing of pooled DNA samples (Pool-Seq) to explore adaptive divergence between two estuarine and two tributary habitats. Paired-end sequence reads were mapped against genomic contigs of European Cottus, and the gene content of candidate regions was explored based on comparisons with the threespine stickleback genome. Genes showing signals of repeated differentiation among brackish-water and freshwater habitats included functions such as ion transport and structural permeability in freshwater gills, which suggests that local adaptation to different osmotic niches might contribute to genomic divergence among habitats. Overall, the presence of both repeated and unique signatures of differentiation across many loci scattered throughout the genome is consistent with polygenic adaptation from standing genetic variation and locally variable selection pressures in the early stages of life history divergence. © 2016 John Wiley & Sons Ltd.
Characterization of hairless (Hr) and FGF5 genes provides insights into the molecular basis of hair loss in cetaceans

PubMed Central

2013-01-01

Background: Hair is one of the main distinguishing characteristics of mammals and it has many important biological functions. Cetaceans originated from terrestrial mammals and they have evolved a series of adaptations to aquatic environments, which are of evolutionary significance. However, the molecular mechanisms underlying their aquatic adaptations have not been well explored. This study provided insights into the evolution of hair loss during the transition from land to water by investigating and comparing two essential regulators of hair follicle development and hair follicle cycling, i.e., the Hairless (Hr) and FGF5 genes, in representative cetaceans and their terrestrial relatives. Results: The full open reading frame sequences of the Hr and FGF5 genes were characterized in seven cetaceans. The sequence characteristics and evolutionary analyses suggested the functional loss of the Hr gene in cetaceans, which supports the loss of hair during their full adaptation to aquatic habitats. By contrast, positive selection for the FGF5 gene was found in cetaceans where a series of positively selected amino acid residues were identified. Conclusions: This is the first study to investigate the molecular basis of hair loss in cetaceans. Our investigation of Hr and FGF5, two indispensable regulators of the hair cycle, provides some new insights into the molecular basis of hair loss in cetaceans. The results suggest that positive selection for the FGF5 gene might have promoted the termination of hair growth and early entry into the catagen stage of hair follicle cycling. Consequently, the hair follicle cycle was disrupted and the hair was lost completely due to the loss of Hr gene function in cetaceans. This suggests that cetaceans have evolved an effective and complex mechanism for hair loss. PMID:23394579
An interactional network of genes involved in chitin synthesis in Saccharomyces cerevisiae

PubMed Central

Lesage, Guillaume; Shapiro, Jesse; Specht, Charles A; Sdicu, Anne-Marie; Ménard, Patrice; Hussein, Shamiza; Tong, Amy Hin Yan; Boone, Charles; Bussey, Howard

2005-01-01

Background: In S. cerevisiae the β-1,4-linked N-acetylglucosamine polymer, chitin, is synthesized by a family of 3 specialized but interacting chitin synthases encoded by CHS1, CHS2 and CHS3. Chs2p makes chitin in the primary septum, while Chs3p makes chitin in the lateral cell wall and in the bud neck, and can partially compensate for the lack of Chs2p. Chs3p requires a pathway of Bni4p, Chs4p, Chs5p, Chs6p and Chs7p for its localization and activity. Chs1p is thought to have a septum repair function after cell separation. To further explore interactions in the chitin synthase family and to find processes buffering chitin synthesis, we compiled a genetic interaction network of genes showing synthetic interactions with CHS1, CHS3 and genes involved in Chs3p localization and function and made a phenotypic analysis of their mutants. Results: Using deletion mutants in CHS1, CHS3, CHS4, CHS5, CHS6, CHS7 and BNI4 in a synthetic genetic array analysis we assembled a network of 316 interactions among 163 genes. The interaction network with CHS3, CHS4, CHS5, CHS6, CHS7 or BNI4 forms a dense neighborhood, with many genes functioning in cell wall assembly or polarized secretion. Chitin levels were altered in 54 of the mutants in individually deleted genes, indicating a functional relationship between them and chitin synthesis. Thirty-two of these mutants triggered the chitin stress response, with elevated chitin levels and a dependence on CHS3. A large fraction of the CHS1-interaction set was distinct from that of the CHS3 network, indicating broad roles for Chs1p in buffering both Chs2p function and more global cell wall robustness. Conclusion: Based on their interaction patterns and chitin levels we group interacting mutants into functional categories. Genes interacting with CHS3 are involved in the amelioration of cell wall defects and in septum or bud neck chitin synthesis, and we newly assign a number of genes to these functions. 
Our genetic analysis of genes not interacting with CHS3 indicates expanded roles for Chs4p, Chs5p and Chs6p in secretory protein trafficking and of Bni4p in bud neck organization. PMID:15715908
PLAZA 3.0: an access point for plant comparative genomics.

PubMed

Proost, Sebastian; Van Bel, Michiel; Vaneechoutte, Dries; Van de Peer, Yves; Inzé, Dirk; Mueller-Roeber, Bernd; Vandepoele, Klaas

2015-01-01

Comparative sequence analysis has significantly altered our view on the complexity of genome organization and gene functions in different kingdoms. PLAZA 3.0 is designed to make comparative genomics data for plants available through a user-friendly web interface. Structural and functional annotation, gene families, protein domains, phylogenetic trees and detailed information about genome organization can easily be queried and visualized. Compared with the first version released in 2009, which featured nine organisms, the number of integrated genomes is more than four times higher, and now covers 37 plant species. The new species provide a wider phylogenetic range as well as a more in-depth sampling of specific clades, and genomes of additional crop species are present. The functional annotation has been expanded and now comprises data from Gene Ontology, MapMan, UniProtKB/Swiss-Prot, PlnTFDB and PlantTFDB. Furthermore, we improved the algorithms to transfer functional annotation from well-characterized plant genomes to other species. The additional data and new features make PLAZA 3.0 (http://bioinformatics.psb.ugent.be/plaza/) a versatile and comprehensible resource for users wanting to explore genome information to study different aspects of plant biology, both in model and non-model organisms. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Cellular function reinstitution of offspring red blood cells cloned from the sickle cell disease patient blood post CRISPR genome editing.

PubMed

Wen, Jianguo; Tao, Wenjing; Hao, Suyang; Zu, Youli

2017-06-13

Sickle cell disease (SCD) is a disorder of red blood cells (RBCs) expressing abnormal hemoglobin-S (HbS) due to genetic inheritance of homologous HbS gene. However, people with the sickle cell trait (SCT) carry a single allele of HbS and do not usually suffer from SCD symptoms, thus providing a rationale to treat SCD. To validate gene therapy potential, hematopoietic stem cells were isolated from SCD patient blood and treated with a CRISPR/Cas9 approach. To precisely dissect genome-editing effects, erythroid progenitor cells were cloned from single colonies of CRISPR-treated cells and then expanded for simultaneous gene, protein, and cellular function studies. Genotyping and sequencing analysis revealed that the genome-edited erythroid progenitor colonies were converted from the SCD genotype to the SCT genotype. HPLC protein assays confirmed reinstallation of normal hemoglobin at a level similar to that of HbS in the cloned genome-edited erythroid progenitor cells. For cell function evaluation, in vitro RBC differentiation of the cloned erythroid progenitor cells was induced. As expected, cell sickling assays indicated function reinstitution of the genome-edited offspring SCD RBCs, which became more resistant to sickling under hypoxic conditions. This study is an exploration of genome editing of SCD HSPCs.
Exploring the Adult Life of Men and Women with Fragile X Syndrome: Results from a National Survey

ERIC Educational Resources Information Center

Hartley, Sigan L.; Seltzer, Marsha Mailick; Raspa, Melissa; Olmstead, Murrey; Bishop, Ellen; Bailey, Donald B., Jr.

2011-01-01

Using data from a national family survey, the authors describe the adult lives (i.e., residence, employment, level of assistance needed with everyday life, friendships, and leisure activities) of 328 adults with the full mutation of the FMR1 gene and identify characteristics related to independence in these domains. Level of functional skills was…
Chronic Enhancement of CREB Activity in the Hippocampus Interferes with the Retrieval of Spatial Information

ERIC Educational Resources Information Center

Viosca, Jose; Malleret, Gael; Bourtchouladze, Rusiko; Benito, Eva; Vronskaya, Svetlana; Kandel, Eric R.; Barco, Angel

2009-01-01

The activation of cAMP-responsive element-binding protein (CREB)-dependent gene expression is thought to be critical for the formation of different types of long-term memory. To explore the consequences of chronic enhancement of CREB function on spatial memory in mammals, we examined spatial navigation in bitransgenic mice that express in a…
Genome of the Cyanobacterium Microcoleus vaginatus FGP-2, a Photosynthetic Ecosystem Engineer of Arid Land Soil Biocrusts Worldwide

PubMed Central

Starkenburg, Shawn R.; Reitenga, Krista G.; Freitas, Tracey; Johnson, Shannon; Chain, Patrick S. G.; Garcia-Pichel, Ferran; Kuske, Cheryl R.

2011-01-01

The filamentous cyanobacterium Microcoleus vaginatus is found in arid land soils worldwide. The genome of M. vaginatus strain FGP-2 allows exploration of genes involved in photosynthesis, desiccation tolerance, alkane production, and other features contributing to this organism's ability to function as a major component of biological soil crusts in arid lands. PMID:21705610
Deletion of the nicotinic acetylcholine receptor subunit gene Dα1 confers insecticide resistance, but at what cost?

PubMed

Somers, Jason; Luong, Hang Ngoc Bao; Batterham, Philip; Perry, Trent

2018-01-02

Nicotinic acetylcholine receptors (nAChRs) have vital functions in processes of neurotransmission that underpin key behaviors. These pentameric ligand-gated ion channels have been used as targets for insecticides that constitutively activate them, causing the death of insect pests. In examining a knockout of the Dα1 nAChR subunit gene, our study linked this one subunit with multiple traits. We were able to confirm previous work that had identified Dα1 as a target of the neonicotinoid class of insecticides. Further, we uncovered roles for the gene in influencing mating behavior and patterns of sleep. The knockout mutant was also observed to have a significant reduction in longevity. This study highlighted the severe fitness costs that appear to be associated with the loss of function of this gene in natural populations in the absence of insecticides targeting the Dα1 subunit. Such a fitness cost could explain why target site resistances to neonicotinoids in pest insect populations have been associated with specific amino acid replacement mutations in nAChR subunits, rather than loss of function. That mutant phenotypes were observed for the two behaviors examined indicates that the functions of Dα1, and other nAChR subunits, need to be explored more broadly. It also remains to be established whether these phenotypes were due to loss of the Dα1 receptor and/or to compensatory changes in the expression levels of other nAChR subunits.
Circadian clock regulation of the cell cycle in the zebrafish intestine.

PubMed

Peyric, Elodie; Moore, Helen A; Whitmore, David

2013-01-01

The circadian clock controls cell proliferation in a number of healthy tissues where cell renewal and regeneration are critical for normal physiological function. The intestine is an organ that typically undergoes regular cycles of cell division, differentiation and apoptosis as part of its role in digestion and nutrient absorption. The aim of this study was to explore circadian clock regulation of cell proliferation and cell cycle gene expression in the zebrafish intestine. Here we show that the zebrafish gut contains a directly light-entrainable circadian pacemaker, which regulates the daily timing of mitosis. Furthermore, this intestinal clock controls the expression of key cell cycle regulators, such as cdc2, wee1, p21, PCNA and cdk2, but only weakly influences cyclin B1, cyclin B2 and cyclin E1 expression. Interestingly, food deprivation has little impact on circadian clock function in the gut, but dramatically reduces cell proliferation, as well as cell cycle gene expression in this tissue. Timed feeding under constant dark conditions is able to drive rhythmic expression not only of circadian clock genes, but also of several cell cycle genes, suggesting that food can entrain the clock, as well as the cell cycle in the intestine. Rather surprisingly, we found that timed feeding is critical for high amplitude rhythms in cell cycle gene expression, even when zebrafish are maintained on a light-dark cycle. Together these results suggest that the intestinal clock integrates multiple rhythmic cues, including light and food, to function optimally.
Circadian Clock Regulation of the Cell Cycle in the Zebrafish Intestine

PubMed Central

Peyric, Elodie; Moore, Helen A.; Whitmore, David

2013-01-01

The circadian clock controls cell proliferation in a number of healthy tissues where cell renewal and regeneration are critical for normal physiological function. The intestine is an organ that typically undergoes regular cycles of cell division, differentiation and apoptosis as part of its role in digestion and nutrient absorption. The aim of this study was to explore circadian clock regulation of cell proliferation and cell cycle gene expression in the zebrafish intestine. Here we show that the zebrafish gut contains a directly light-entrainable circadian pacemaker, which regulates the daily timing of mitosis. Furthermore, this intestinal clock controls the expression of key cell cycle regulators, such as cdc2, wee1, p21, PCNA and cdk2, but only weakly influences cyclin B1, cyclin B2 and cyclin E1 expression. Interestingly, food deprivation has little impact on circadian clock function in the gut, but dramatically reduces cell proliferation, as well as cell cycle gene expression in this tissue. Timed feeding under constant dark conditions is able to drive rhythmic expression not only of circadian clock genes, but also of several cell cycle genes, suggesting that food can entrain the clock, as well as the cell cycle in the intestine. Rather surprisingly, we found that timed feeding is critical for high amplitude rhythms in cell cycle gene expression, even when zebrafish are maintained on a light-dark cycle. Together these results suggest that the intestinal clock integrates multiple rhythmic cues, including light and food, to function optimally. PMID:24013905
RNA-Sequencing Analysis Reveals a Regulatory Role for Transcription Factor Fezf2 in the Mature Motor Cortex

PubMed Central

Clare, Alison J.; Wicky, Hollie E.; Empson, Ruth M.; Hughes, Stephanie M.

2017-01-01

Forebrain embryonic zinc finger (Fezf2) encodes a transcription factor essential for the specification of layer 5 projection neurons (PNs) in the developing cerebral cortex. As with many developmental transcription factors, Fezf2 continues to be expressed into adulthood, suggesting it remains crucial to the maintenance of neuronal phenotypes. Despite the continued expression, a function has yet to be explored for Fezf2 in the PNs of the developed cortex. Here, we investigated the role of Fezf2 in mature neurons, using lentiviral-mediated delivery of a shRNA to conditionally knock down the expression of Fezf2 in the mouse primary motor cortex (M1). RNA-sequencing analysis of Fezf2-reduced M1 revealed significant changes to the transcriptome, identifying a regulatory role for Fezf2 in the mature M1. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses of Fezf2-regulated genes indicated a role in neuronal signaling and plasticity, with significant enrichment of neuroactive ligand-receptor interaction, cell adhesion molecules and calcium signaling pathways. Gene Ontology analysis supported a functional role for Fezf2-regulated genes in neuronal transmission and additionally indicated an importance in the regulation of behavior. Using the mammalian phenotype ontology database, we identified a significant overrepresentation of Fezf2-regulated genes associated with specific behavior phenotypes, including associative learning, social interaction, locomotor activation and hyperactivity. These roles were distinct from that of Fezf2-regulated genes identified in development, indicating a dynamic transition in Fezf2 function. Together our findings demonstrate a regulatory role for Fezf2 in the mature brain, with Fezf2-regulated genes having functional roles in sustaining normal neuronal and behavioral phenotypes. 
These results support the hypothesis that developmental transcription factors are important for maintaining neuron transcriptomes and that disruption of their expression could contribute to the progression of disease phenotypes. PMID:28936162
PIGD: a database for intronless genes in the Poaceae.

PubMed

Yan, Hanwei; Jiang, Cuiping; Li, Xiaoyu; Sheng, Lei; Dong, Qing; Peng, Xiaojian; Li, Qian; Zhao, Yang; Jiang, Haiyang; Cheng, Beijiu

2014-10-01

Intronless genes are a feature of prokaryotes; however, they are widespread and unequally distributed among eukaryotes and represent an important resource to study the evolution of gene architecture. Although many databases on exons and introns exist, there is currently no single cohesive database that collects intronless genes in plants. In this study, we present the Poaceae Intronless Genes Database (PIGD), a user-friendly web interface to explore information on intronless genes from different plants. Five Poaceae species, Sorghum bicolor, Zea mays, Setaria italica, Panicum virgatum and Brachypodium distachyon, are included in the current release of PIGD. Gene annotations and sequence data were collected and integrated from different databases. The primary focus of this study was to provide gene descriptions and gene product records. In addition, functional annotations, subcellular localization prediction and taxonomic distribution are reported. PIGD allows users to readily browse, search and download data. BLAST and comparative analyses are also provided through this online database, which is available at http://pigd.ahau.edu.cn/. PIGD provides a solid platform for the collection, integration and analysis of intronless genes in the Poaceae. As such, this database will be useful for subsequent bio-computational analysis in comparative genomics and evolutionary studies.

Analysis and functional characterization of sequence variations in ligand binding domain of thyroid hormone receptors in autism spectrum disorder (ASD) patients.

PubMed

Kalikiri, Mahesh Kumar; Mamidala, Madhu Poornima; Rao, Ananth N; Rajesh, Vidya

2017-12-01

Autism spectrum disorder (ASD) is a neurodevelopmental disorder reported to be on the rise over the past two decades. Thyroid hormone T3 plays an important role in early embryonic and central nervous system development. T3 mediates its function by binding to thyroid hormone receptors, TRα and TRβ. Alterations in T3 levels and thyroid receptor mutations have earlier been implicated in neuropsychiatric disorders and have been linked to environmental toxins. Limited reports from earlier studies have shown promising results for T3 treatment in children with ASD, and that thyroid hormone levels in these children were also normal. This makes it necessary to explore the genetic variations in the components of the thyroid hormone pathway in ASD children. To achieve this objective, we performed genetic analysis of the ligand binding domain of the THRA and THRB receptor genes in 30 ASD subjects and in age-matched controls from India. Our study reports, for the first time, novel single nucleotide polymorphisms in the THRA and THRB receptor genes of ASD individuals. Autism Res 2017, 10: 1919-1928. © 2017 International Society for Autism Research, Wiley Periodicals, Inc. Thyroid hormone (T3) and thyroid receptors (TRα and TRβ) are the major components of the thyroid hormone pathway. The link between the thyroid pathway and neuronal development is proven in clinical medicine. Since thyroid hormone levels in autistic children are normal, variations in their receptors need to be explored. To achieve this objective, changes in the THRA and THRB receptor genes were studied in 30 ASD and normal children from India. The impact of some of these mutations on receptor function was also studied.
Twin Attributes of Tyrosyl-tRNA Synthetase of Leishmania donovani: A HOUSEKEEPING PROTEIN TRANSLATION ENZYME AND A MIMIC OF HOST CHEMOKINE.

PubMed

Anand, Sneha; Madhubala, Rentala

2016-08-19

Aminoacyl-tRNA synthetases (aaRSs) are housekeeping enzymes essential for protein synthesis. Apart from their parent aminoacylation activity, several aaRSs perform non-canonical functions in diverse biological processes. The present study explores the twin attributes of Leishmania tyrosyl-tRNA synthetase (LdTyrRS), namely aminoacylation and mimicry of a host CXC chemokine. Leishmania donovani is a protozoan parasite. Its genome encodes a single copy of tyrosyl-tRNA synthetase. We first tested the canonical aminoacylation role of LdTyrRS. The recombinant protein was expressed, and its kinetic parameters were determined by aminoacylation assay. To study the physiological role of LdTyrRS in Leishmania, gene deletion mutations were attempted via targeted gene replacement. The heterozygous mutants showed slower growth kinetics and exhibited attenuated virulence. LdTyrRS appears to be an essential gene, as the chromosomal null mutants did not survive. Our data also highlight the non-canonical function of L. donovani tyrosyl-tRNA synthetase. We show that LdTyrRS protein is present in the cytoplasm and exits from the parasite cytoplasm into the extracellular medium. The released LdTyrRS functions as a neutrophil chemoattractant. We further show that LdTyrRS specifically binds to host macrophages with its ELR (Glu-Leu-Arg) peptide motif. The ELR-CXCR2 receptor interaction mediates this binding. This interaction triggers enhanced secretion of the proinflammatory cytokines TNF-α and IL-6 by host macrophages. Our data indicate a possible immunomodulating role of LdTyrRS in Leishmania infection. This study provides a platform to explore LdTyrRS as a potential target for drug development. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
The small RNA complement of adult Schistosoma haematobium.

PubMed

Stroehlein, Andreas J; Young, Neil D; Korhonen, Pasi K; Hall, Ross S; Jex, Aaron R; Webster, Bonnie L; Rollinson, David; Brindley, Paul J; Gasser, Robin B

2018-05-01

Blood flukes of the genus Schistosoma cause schistosomiasis, a neglected tropical disease (NTD) that affects more than 200 million people worldwide. Studies of schistosome genomes have improved our understanding of the molecular biology of flatworms, but most of them have focused largely on protein-coding genes. Small non-coding RNAs (sncRNAs) have been explored in selected schistosome species and are suggested to play essential roles in the post-transcriptional regulation of genes and in modulating flatworm-host interactions. However, genome-wide small RNA data are currently lacking for key schistosomes, including Schistosoma haematobium, the causative agent of urogenital schistosomiasis of humans. MicroRNAs (miRNAs) and other sncRNAs of male and female adults of S. haematobium, and small RNA transcription levels, were explored by deep sequencing, genome mapping and detailed bioinformatic analyses. In total, 89 transcribed miRNAs were identified in S. haematobium, a complement similar to those reported for the congeners S. mansoni and S. japonicum. Of these miRNAs, 34 were novel, with no homologs in other schistosomes. Most miRNAs (n = 64) exhibited sex-biased transcription, suggestive of roles in sexual differentiation, pairing of adult worms and reproductive processes. Of the sncRNAs that were not miRNAs, some related to the spliceosome (n = 21), biogenesis of other RNAs (n = 3) or ribozyme functions (n = 16), whereas most others (n = 3798) were novel ('orphans') with unknown functions. This study provides the first genome-wide sncRNA resource for S. haematobium, extending earlier studies of schistosomes. The present work should facilitate the future curation and experimental validation of sncRNA functions in schistosomes to enhance our understanding of post-transcriptional gene regulation and of the roles that sncRNAs play in schistosome reproduction, development and parasite-host cross-talk.
Metatranscriptomics of the marine sponge Geodia barretti: tackling phylogeny and function of its microbial community.

PubMed

Radax, Regina; Rattei, Thomas; Lanzen, Anders; Bayer, Christoph; Rapp, Hans Tore; Urich, Tim; Schleper, Christa

2012-05-01

Geodia barretti is a marine cold-water sponge harbouring high numbers of microorganisms. Significant rates of nitrification have been observed in this sponge, indicating a substantial contribution to nitrogen turnover in marine environments with high sponge cover. To gain closer insight into the phylogeny and function of the active microbial community and its interaction with the host G. barretti, a metatranscriptomic approach was employed, using the simultaneous analysis of rRNA and mRNA. Of the 262 298 RNA-tags obtained by pyrosequencing, 92% were assigned to ribosomal RNA (ribo-tags). A total of 109 325 SSU rRNA ribo-tags revealed a detailed picture of the community, dominated by group SAR202 of Chloroflexi, candidate phylum Poribacteria and Acidobacteria, which differed in composition from that obtained in clone libraries prepared from the same samples. Optimized assembly strategies allowed the reconstruction of full-length rRNA sequences from the short ribo-tags for more detailed phylogenetic studies of the dominant taxa. Cells of several phyla were visualized by FISH analyses for confirmation. Of the remaining 21 325 RNA-tags, 10 023 were assigned to mRNA-tags, based on similarities to genes in the databases. A wide range of putative functional gene transcripts from over 10 different phyla were identified among the bacterial mRNA-tags. The most abundant mRNAs were those encoding key metabolic enzymes of nitrification from ammonia-oxidizing archaea as well as candidate genes involved in related processes. Our analysis demonstrates the potential and limits of using a combined rRNA and mRNA approach to explore the microbial community profile, phylogenetic assignments and metabolic activities of a complex, but little explored microbial community. © 2012 Society for Applied Microbiology and Blackwell Publishing Ltd.
Systematic analysis of gene expression pattern in has-miR-197 over-expressed human uterine leiomyoma cells.

PubMed

Ling, Jing; Wu, Xiaoli; Fu, Ziyi; Tan, Jie; Xu, Qing

2015-10-01

Our previous study showed that the expression of miR-197 in leiomyoma was down-regulated compared with myometrium. Further, miR-197 has been found to affect uterine leiomyoma cell proliferation, apoptosis, and metastatic ability, though the responsible molecular mechanism has not been well elucidated. In this study, we sought to determine the expression patterns of miR-197-targeted genes and to explore their potential functions, the pathways in which they participate, and the networks involved in the biological behavior of human uterine leiomyoma. After transfection of human uterine leiomyoma cells with miR-197, we confirmed the expression level of miR-197 using quantitative real-time PCR (qRT-PCR), and we determined the gene expression profiles after miR-197 over-expression through DNA microarray analysis. Further, we performed GO and pathway analysis. The dominantly dys-regulated genes, which were up- or down-regulated by more than 10-fold compared with parental cells, were confirmed using qRT-PCR. Compared with the control group, miR-197 was up-regulated by 30-fold after miR-197 lentiviral transfection. The microarray data showed that 872 genes were dys-regulated by more than 2-fold in human uterine leiomyoma cells after miR-197 overexpression, including 537 up-regulated and 335 down-regulated genes. The GO analysis indicated that the dys-regulated genes were primarily involved in response to stimuli, multicellular organ processes, and the signaling of biological progression. Further, pathway analysis showed that these genes participated in regulating several signaling pathways, including the JAK/STAT signaling pathway, the Toll-like receptor signaling pathway, and cytokine-cytokine receptor interaction. 
The qRT-PCR results confirmed that 17 of the 66 selected genes, which were up- or down-regulated more than 10-fold by miR-197, were consistent with the microarray results, including tumorigenesis-related genes, such as DRT7, SLC549, SFMBT2, FLJ37956, FBLN2, C10orf35, HOXD12, CACNG7, and LOC100134279. Our study explored gene expression patterns after miR-197 overexpression and confirmed 17 dominantly dys-regulated genes, which could expand the insights into the function of miR-197 and the molecular mechanisms during the development and progression of uterine leiomyomas. This study might afford new clues for understanding the pathogenesis of uterine leiomyomas, and it could likely provide a unique method for diagnosing or predicting prognosis in the clinical treatment of leiomyoma. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
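The fold-change thresholds quoted in the abstract above (more than 2-fold for the microarray screen, more than 10-fold for the qRT-PCR candidates) can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function name `split_by_fold_change`, the gene labels and the ratios are all hypothetical, and the sketch assumes expression is summarized as a simple treated/control ratio.

```python
def split_by_fold_change(fold_changes, threshold=2.0):
    """Split genes into up- and down-regulated sets.

    fold_changes maps a gene label to an expression ratio (treated / control).
    A ratio above `threshold` counts as up-regulated, a ratio below
    1 / threshold counts as down-regulated, and anything in between is
    left unclassified.
    """
    up = {g for g, fc in fold_changes.items() if fc > threshold}
    down = {g for g, fc in fold_changes.items() if fc < 1.0 / threshold}
    return up, down


# Hypothetical ratios for made-up gene labels, for illustration only.
toy = {"geneA": 12.0, "geneB": 2.5, "geneC": 1.1, "geneD": 0.4, "geneE": 0.05}

up2, down2 = split_by_fold_change(toy, threshold=2.0)     # broad screen
up10, down10 = split_by_fold_change(toy, threshold=10.0)  # strict candidates
```

Raising the threshold from 2 to 10 shrinks both sets, which is the same narrowing step the abstract describes when moving from 872 dys-regulated genes to the 66 selected for confirmation.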
A multitasking Argonaute: exploring the many facets of C. elegans CSR-1.

PubMed

Wedeles, Christopher J; Wu, Monica Z; Claycomb, Julie M

2013-12-01

While initial studies of small RNA-mediated gene regulatory pathways focused on the cytoplasmic functions of such pathways, identifying roles for Argonaute/small RNA pathways in modulating chromatin and organizing the genome has become a topic of intense research in recent years. Nuclear regulatory mechanisms for Argonaute/small RNA pathways appear to be widespread, in organisms ranging from plants to fission yeast, Caenorhabditis elegans to humans. As the effectors of small RNA-mediated gene regulatory pathways, Argonaute proteins guide the chromatin-directed activities of these pathways. Of particular interest is the C. elegans Argonaute, chromosome segregation and RNAi deficient (CSR-1), which has been implicated in such diverse functions as organizing the holocentromeres of worm chromosomes, modulating germline chromatin, protecting the genome from foreign nucleic acid, regulating histone levels, executing RNAi, and inhibiting translation in conjunction with Pumilio proteins. CSR-1 interacts with small RNAs known as 22G-RNAs, which have complementarity to 25% of the protein-coding genes. This peculiar Argonaute is the only essential C. elegans Argonaute out of 24 family members in total. Here, we summarize the current understanding of CSR-1 functions in the worm, with emphasis on the chromatin-directed activities of this ever-intriguing Argonaute.
Bioinformatic detection of E47, E2F1 and SREBP1 transcription factors as potential regulators of genes associated to acquisition of endometrial receptivity

PubMed Central

2011-01-01

Background: The endometrium is a dynamic tissue whose changes are driven by the ovarian steroidal hormones. Its main function is to provide an adequate substrate for embryo implantation. Using microarray technology, several reports have provided the gene expression patterns of human endometrial tissue during the window of implantation. However, biological connections must be made across these genomic datasets to take full advantage of them. The objective of this work was to perform a research synthesis of available gene expression profiles related to acquisition of endometrial receptivity for embryo implantation, in order to gain insights into its molecular basis and regulation. Methods: Gene expression datasets were intersected to determine a consensus endometrial receptivity transcript list (CERTL). For this cluster of genes, we determined functional annotations using available web-based databases. In addition, promoter sequences were analyzed with bioinformatics tools to identify putative transcription factor binding sites and to determine over-represented features. Results: We found 40 up- and 21 down-regulated transcripts in the CERTL. Those more consistently increased were C4BPA, SPP1, APOD, CD55, CFD, CLDN4, DKK1, ID4, IL15 and MAP3K5, whereas those more consistently decreased were OLFM1, CCNB1, CRABP2, EDN3, FGFR1, MSX1 and MSX2. Functional annotation of the CERTL showed that it was enriched with transcripts related to the immune response, complement activation and cell cycle regulation. Promoter sequence analysis revealed that DNA binding sites for the E47, E2F1 and SREBP1 transcription factors were the most consistently over-represented, in both up- and down-regulated genes, during the window of implantation. Conclusions: Our research synthesis allowed organizing and mining high-throughput data to explore endometrial receptivity and focus future research efforts on specific genes and pathways. 
The discovery of possible new transcription factors orchestrating the CERTL opens new alternatives for understanding gene expression regulation in uterine function. PMID:21272326
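As a rough illustration of the dataset-intersection step described in the Methods above, the following Python sketch counts how many datasets report each gene and keeps those above a support threshold. This is an assumption-laden sketch, not the published CERTL procedure: the function `consensus_genes`, the `min_support` parameter, and the placeholder gene labels and dataset memberships are all hypothetical.

```python
from collections import Counter


def consensus_genes(datasets, min_support):
    """Return genes reported by at least `min_support` datasets.

    `datasets` is a list of collections of gene labels, one per study.
    The result maps each retained gene to the number of datasets
    that reported it.
    """
    # set(genes) guards against a gene being listed twice in one dataset.
    counts = Counter(g for genes in datasets for g in set(genes))
    return {g: n for g, n in counts.items() if n >= min_support}


# Placeholder labels and memberships, invented for illustration only.
toy_datasets = [
    {"gene1", "gene2", "gene3", "gene4"},
    {"gene1", "gene2", "gene5"},
    {"gene1", "gene3", "gene4"},
]

consensus = consensus_genes(toy_datasets, min_support=2)
strict = consensus_genes(toy_datasets, min_support=len(toy_datasets))
```

Setting `min_support` to the number of datasets gives a strict intersection, while a lower value yields a ranking by consistency, in the spirit of the "more consistently increased" and "more consistently decreased" transcripts listed in the Results.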
Genome-wide characterization of differential transcript usage in Arabidopsis thaliana.

PubMed

Vaneechoutte, Dries; Estrada, April R; Lin, Ying-Chen; Loraine, Ann E; Vandepoele, Klaas

2017-12-01

Alternative splicing and the usage of alternate transcription start or stop sites allow a single gene to produce multiple transcript isoforms. Most plant genes express certain isoforms at a significantly higher level than others, but under specific conditions this expression dominance can change, resulting in a different set of dominant isoforms. These events of differential transcript usage (DTU) have been observed for thousands of Arabidopsis thaliana, Zea mays and Vitis vinifera genes, and have been linked to development and stress response. However, neither the characteristics of these genes nor the implications of DTU for their protein-coding sequences or functions are currently well understood. Here we present a dataset of isoform dominance and DTU for all genes in the AtRTD2 reference transcriptome, based on a protocol that was benchmarked on simulated data and validated through comparison with a published reverse transcriptase-polymerase chain reaction panel. We report DTU events for 8148 genes across 206 public RNA-Seq samples, and find that protein sequences are affected in 22% of the cases. The observed DTU events show high consistency across replicates, and reveal reproducible patterns in response to treatment and development. We also demonstrate that genes with different evolutionary ages, expression breadths and functions show large differences in the frequency at which they undergo DTU, and in the effect that these events have on their protein sequences. Finally, we showcase how the generated dataset can be used to explore DTU events for genes of interest or to find genes with specific DTU in samples of interest. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
Generation and Analysis of Expressed Sequence Tags (ESTs) from Halophyte Atriplex canescens to Explore Salt-Responsive Related Genes

PubMed Central

Li, Jingtao; Sun, Xinhua; Yu, Gang; Jia, Chengguo; Liu, Jinliang; Pan, Hongyu

2014-01-01

Little information is available on gene expression profiling of the halophyte Atriplex canescens. To elucidate the molecular mechanism of stress tolerance in A. canescens, a full-length complementary DNA (cDNA) library was generated from A. canescens exposed to 400 mM NaCl, yielding 343 high-quality ESTs. From the 343 valid EST sequences in the cDNA library, 197 unigenes were assembled, among which 190 unigenes (83.1% of ESTs) were identified according to their significant similarities with proteins of known function. All 343 EST sequences have been deposited in the GenBank dbEST database under accession numbers JZ535802 to JZ536144. According to Arabidopsis MIPS functional category and GO classifications, we identified 193 unigenes among the 311 annotated ESTs, representing 72 non-redundant unigenes sharing similarities with genes related to the defense response. The sets of ESTs obtained provide a rich genetic resource, and 17 up-regulated genes related to salt stress resistance were identified by qRT-PCR. Six of these genes may contribute crucially to earlier- and later-stage salt stress resistance. Additionally, among the 343 unigene sequences, 22 simple sequence repeats (SSRs) were identified, contributing to the study of A. canescens resources. PMID:24960361
The number of genes encoding repeat domain-containing proteins positively correlates with genome size in amoebal giant viruses

PubMed Central

Shukla, Avi; Chatterjee, Anirvan

2018-01-01

Curiously, in viruses, the virion volume appears to be predominantly driven by genome length rather than by the number of proteins it encodes or by geometric constraints. With their large genomes and giant particle size, amoebal viruses (AVs) are ideally suited to study the relationship between genome and virion size and to explore the role of genome plasticity in their evolutionary success. Different genomic regions of AVs exhibit distinct genealogies. Although the vertically transferred core genes and their functions are universally conserved across the nucleocytoplasmic large DNA virus (NCLDV) families and are essential for their replication, the horizontally acquired genes are variable across families and are lineage-specific. When compared with other giant virus families, we observed a near-linear increase in the number of genes encoding repeat domain-containing proteins (RDCPs) with the increase in the genome size of AVs. From what is known about the functions of RDCPs in bacteria and eukaryotes and their prevalence in the AV genomes, we envisage important roles for RDCPs in the life cycle of AVs, their genome expansion, and plasticity. This observation also supports the evolution of AVs from a smaller viral ancestor by the acquisition of diverse gene families from the environment, including RDCPs that might have helped in host adaptation. PMID:29308275
Spatial and temporal expression of the Grainyhead-like transcription factor family during murine development.

PubMed

Auden, Alana; Caddy, Jacinta; Wilanowski, Tomasz; Ting, Stephen B; Cunningham, John M; Jane, Stephen M

2006-10-01

The Drosophila transcription factor Grainyhead (grh) is expressed in ectoderm-derived tissues, where it regulates several key developmental events including cuticle formation, tracheal elongation and dorsal closure. Our laboratory has recently identified three novel mammalian homologues of the grh gene, Grainyhead-like 1, -2 and -3 (Grhl1-3), that rewrite the phylogeny of this family. Using gene targeting in mice, we have shown that Grhl3 is essential for neural tube closure, skin barrier formation and wound healing. Despite their extensive sequence homology, Grhl1 and Grhl2 are unable to compensate for loss of Grhl3 in these developmental processes. To explore this lack of redundancy, and to gain further insights into the functions of this gene family in mammalian development, we have performed an extensive in situ hybridisation analysis. We demonstrate that, although all three Grhl genes are highly expressed in the developing epidermis, they display subtle differences in the timing and level of expression. Surprisingly, we also demonstrate differential expression patterns in non-ectoderm-derived tissues, including the heart, the lung, and the metanephric kidney. These findings expand our understanding of the unique role of Grhl3 in neurulation and epidermal morphogenesis, and provide a focus for further functional analysis of the Grhl genes during mouse embryogenesis.
Four novel mutations of the BCKDHA, BCKDHB and DBT genes in Iranian patients with maple syrup urine disease.

PubMed

Zeynalzadeh, Monica; Tafazoli, Alireza; Aarabi, Azadeh; Moghaddassian, Morteza; Ashrafzadeh, Farah; Houshmand, Massoud; Taghehchian, Negin; Abbaszadegan, Mohammad Reza

2018-01-26

Maple syrup urine disease (MSUD) is a rare autosomal recessive metabolic disorder caused by dysfunction of the branched-chain α-ketoacid dehydrogenase (BCKDH) complex. Mutations in the BCKDHA, BCKDHB and DBT genes are responsible for MSUD. The current study analyzed seven Iranian MSUD patients genetically and explored probable correlations between their genotype and phenotype. A panel of genes, including BCKDHA, BCKDHB and DBT, was evaluated using the routine polymerase chain reaction (PCR)-sequencing method. In addition, protein modeling (homology and threading modeling) of the deduced novel mutations was performed. The resulting structures were then analyzed using state-of-the-art bioinformatics tools to better understand the structural and functional effects caused by the mutations. Seven mutations were detected in seven patients, including four novel pathogenic mutations in the BCKDHA (c.1198delA, c.629C>T), BCKDHB (c.652C>T) and DBT (c.1150A>G) genes. Molecular modeling of the novel mutations revealed clear changes in the molecular energy levels and stereochemical traits of the modeled proteins, which may be indicative of strong correlations with the functional modifications of the genes. Structural deficiencies were compatible with the observed phenotypes. Any type of MSUD can show heterogeneous clinical manifestations in different ethnic groups. Comprehensive molecular investigations would be necessary for differential diagnosis.
Insect Phylogenomics: Exploring the Source of Incongruence Using New Transcriptomic Data

PubMed Central

Simon, Sabrina; Narechania, Apurva; DeSalle, Rob; Hadrys, Heike

2012-01-01

The evolution of the diverse insect lineages is one of the most fascinating issues in evolutionary biology. Despite extensive research in this area, the resolution of insect phylogeny, especially of interordinal relationships, remains a great challenge. One of the challenges for insect systematics is the radiation of the polyneopteran lineages, with several contradictory and/or unresolved relationships. Here, we provide the first transcriptomic data for three enigmatic polyneopteran orders (Dermaptera, Plecoptera, and Zoraptera) to clarify one of the most debated issues in higher insect systematics. We applied different approaches to generate three data sets comprising 78 species and 1,579 clusters of orthologous genes. Using these three matrices, we explored several key mechanistic problems of phylogenetic reconstruction, including missing data, matrix selection, gene and taxa number/choice, and the biological function of the genes. Based on the first phylogenomic approach including these three ambiguous polyneopteran orders, we provide here conclusive support for monophyletic Polyneoptera, contesting the hypothesis of Zoraptera + Paraneoptera and Plecoptera + remaining Neoptera. In addition, we employ various approaches to evaluate data quality and highlight problematic nodes within the Insect Tree that still exist despite our phylogenomic approach. We further show how the support for these nodes or alternative hypotheses might depend on the taxon and/or gene sampling. PMID:23175716
Transcriptome profiling and digital gene expression analysis of sweet potato for the identification of putative genes involved in the defense response against Fusarium oxysporum f. sp. batatas.

PubMed

Lin, Yuli; Zou, Weikun; Lin, Shiqiang; Onofua, Dennis; Yang, Zhijian; Chen, Haizhou; Wang, Songliang; Chen, Xuanyang

2017-01-01

Sweet potato production is constrained by Fusarium wilt, which is caused by Fusarium oxysporum f. sp. batatas (Fob). The identification of genes related to disease resistance and the underlying mechanisms will contribute to improving disease resistance via sweet potato breeding programs. In the present study, we performed de novo transcriptome assembly and digital gene expression (DGE) profiling of sweet potato challenged with Fob using Illumina HiSeq technology. In total, 89,944,188 clean reads were generated from 12 samples and assembled into 101,988 unigenes with an average length of 666 bp; of these unigenes, 62,605 (61.38%) were functionally annotated in the NCBI non-redundant protein database by BLASTX with a cutoff E-value of 10^-5. Clusters of Orthologous Groups (COG), Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations were examined to explore the unigenes' functions. We constructed four DGE libraries for the sweet potato cultivars JinShan57 (JS57, highly resistant) and XinZhongHua (XZH, highly susceptible), which were challenged with pathogenic Fob. Genes that were differentially expressed in the four libraries were identified by comparing the transcriptomes. Various genes that were differentially expressed during defense, including chitin elicitor receptor kinase 1 (CERK), mitogen-activated protein kinase (MAPK), WRKY, NAC, MYB, and ethylene-responsive transcription factor (ERF), as well as resistance genes, pathogenesis-related genes, and genes involved in salicylic acid (SA) and jasmonic acid (JA) signaling pathways, were identified. These data represent a sequence resource for genetic and genomic studies of sweet potato that will enhance the understanding of the mechanism of disease resistance.
Mycobacterium tuberculosis strains exhibit differential and strain-specific molecular signatures in pulmonary epithelial cells.

PubMed

Mvubu, Nontobeko Eunice; Pillay, Balakrishna; Gamieldien, Junaid; Bishai, William; Pillay, Manormoney

2016-12-01

Although pulmonary epithelial cells are integral to innate and adaptive immune responses during Mycobacterium tuberculosis infection, global transcriptomic changes in these cells remain largely unknown. Changes in gene expression induced in pulmonary epithelial cells infected with M. tuberculosis F15/LAM4/KZN, F11, F28, Beijing and Unique genotypes were investigated by RNA sequencing (RNA-Seq). The Illumina HiSeq 2000 platform generated 50 bp reads that were mapped to the human genome (Hg19) using Tophat (2.0.10). Differential gene expression induced by the different strains in infected relative to uninfected cells was quantified and compared using Cufflinks (2.1.0) and MeV (4.0.9), respectively. Gene expression varied among the strains, with the total number of genes as follows: F15/LAM4/KZN (1187), Beijing (1252), F11 (1639), F28 (870), Unique (886) and H37Rv (1179). A subset of 292 genes was commonly induced by all strains, of which 52 genes were down-regulated and 240 genes were up-regulated. Differentially expressed genes were compared among the strains, and the numbers of induced strain-specific gene signatures were as follows: F15/LAM4/KZN (138), Beijing (52), F11 (255), F28 (55), Unique (186) and H37Rv (125). Strain-specific molecular gene signatures associated with functional pathways were observed only for the Unique and H37Rv strains, while certain biological functions may be associated with other strain signatures. This study demonstrated that strains of M. tuberculosis induce differential gene expression and strain-specific molecular signatures in pulmonary epithelial cells. Specific signatures induced by clinical strains of M. tuberculosis can be further explored for novel host-associated biomarkers and adjunctive immunotherapies. Copyright © 2016 Elsevier Ltd. All rights reserved.
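The distinction the abstract above draws between genes induced by all strains and strain-specific signatures can be sketched with plain set operations in Python. This is only an illustration under stated assumptions: the function `common_and_specific`, the strain labels and the gene labels are hypothetical, and the study itself used Cufflinks and MeV rather than anything like this code.

```python
def common_and_specific(deg_by_strain):
    """Separate shared and strain-specific differentially expressed genes.

    `deg_by_strain` maps a strain label to the set of genes that are
    differentially expressed for that strain. Returns a pair of
    (genes found for every strain, {strain: genes found only for it}).
    """
    strains = list(deg_by_strain)
    common = set.intersection(*(set(deg_by_strain[s]) for s in strains))
    specific = {}
    for s in strains:
        # Union of the gene sets of every other strain.
        others = set().union(*(deg_by_strain[o] for o in strains if o != s))
        specific[s] = set(deg_by_strain[s]) - others
    return common, specific


# Placeholder strains and genes, invented for illustration only.
toy = {
    "strain1": {"g1", "g2", "g3"},
    "strain2": {"g1", "g3", "g4"},
    "strain3": {"g1", "g5"},
}

common, specific = common_and_specific(toy)
```

A gene shared by some but not all strains (here `g3`) lands in neither group, which is why the common subset and the strain-specific counts in the abstract do not add up to the per-strain totals.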
Identification of Differentially Expressed Genes in Blood Cells of Narcolepsy Patients

PubMed Central

Tanaka, Susumu; Honda, Yutaka; Honda, Makoto

2007-01-01

Study Objective: A close association between the human leukocyte antigen (HLA)-DRB1*1501/DQB1*0602 and abnormalities in some inflammatory cytokines have been demonstrated in narcolepsy. Specific alterations in the immune system have been suggested to occur in this disorder. We attempted to identify alterations in gene expression underlying the abnormalities in the blood cells of narcoleptic patients. Designs: Total RNA from 12 narcolepsy-cataplexy patients and from 12 age- and sex-matched healthy controls were pooled. The pooled samples were initially screened for candidate genes for narcolepsy by differential display analysis using annealing control primers (ACP). The second screening of the samples was carried out by semiquantitative PCR using gene-specific primers. Finally, the expression levels of the candidate genes were further confirmed by quantitative real-time PCR using a new set of samples (20 narcolepsy-cataplexy patients and 20 healthy controls). Results: The second screening revealed differential expression of 4 candidate genes. Among them, MX2 was confirmed as a significantly down-regulated gene in the white blood cells of narcoleptic patients by quantitative real-time PCR. Conclusion: We found the MX2 gene to be significantly less expressed in comparison with normal subjects in the white blood cells of narcoleptic patients. This gene is relevant to the immune system. Although differential display analysis using ACP technology has a limitation in that it does not help in determining the functional mechanism underlying sleep/wakefulness dysregulation, it is useful for identifying novel genetic factors related to narcolepsy, such as HLA molecules. Further studies are required to explore the functional relationship between the MX2 gene and narcolepsy pathophysiology. Citation: Tanaka S; Honda Y; Honda M. Identification of differentially expressed genes in blood cells of narcolepsy patients. SLEEP 2007;30(8):974-979. PMID:17702266
Silencing the Honey Bee (Apis mellifera) Naked Cuticle Gene (nkd) Improves Host Immune Function and Reduces Nosema ceranae Infections

PubMed Central

Li, Wenfeng; Evans, Jay D.; Huang, Qiang; Rodríguez-García, Cristina; Liu, Jie; Hamilton, Michele; Grozinger, Christina M.; Webster, Thomas C.; Su, Songkun

2016-01-01

Nosema ceranae is a new and emerging microsporidian parasite of European honey bees, Apis mellifera, that has been implicated in colony losses worldwide. RNA interference (RNAi), a posttranscriptional gene silencing mechanism, has emerged as a potent and specific strategy for controlling infections of parasites and pathogens in honey bees. While previous studies have focused on the silencing of parasite/pathogen virulence factors, we explore here the possibility of silencing a host factor as a mechanism for reducing parasite load. Specifically, we used an RNAi strategy to reduce the expression of a honey bee gene, naked cuticle (nkd), which is a negative regulator of host immune function. Our studies found that nkd mRNA levels in adult bees were upregulated by N. ceranae infection (and thus, the parasite may use this mechanism to suppress host immune function) and that ingestion of double-stranded RNA (dsRNA) specific to nkd efficiently silenced its expression. Furthermore, we found that RNAi-mediated knockdown of nkd transcripts in Nosema-infected bees resulted in upregulation of the expression of several immune genes (Abaecin, Apidaecin, Defensin-1, and PGRP-S2), reduction of Nosema spore loads, and extension of honey bee life span. The results of our studies clearly indicate that silencing the host nkd gene can activate honey bee immune responses, suppress the reproduction of N. ceranae, and improve the overall health of honey bees. This study represents a novel host-derived therapeutic for honey bee disease treatment that merits further exploration. Importance: Given the critical role of honey bees in the pollination of agricultural crops, it is urgent to develop strategies to prevent the colony decline induced by the infection of parasites/pathogens. Targeting parasites and pathogens directly by RNAi has been proven to be useful for controlling infections in honey bees, but little is known about the disease impacts of RNAi silencing of host factors. 
Here, we demonstrate that knocking down the honey bee immune repressor-encoding nkd gene can suppress the reproduction of N. ceranae and improve the overall health of honey bees, which highlights the potential role of host-derived and RNAi-based therapeutics in controlling the infections in honey bees. The information obtained from this study will have positive implications for honey bee disease management practices. PMID:27613683
Silencing the Honey Bee (Apis mellifera) Naked Cuticle Gene (nkd) Improves Host Immune Function and Reduces Nosema ceranae Infections.

PubMed

Li, Wenfeng; Evans, Jay D; Huang, Qiang; Rodríguez-García, Cristina; Liu, Jie; Hamilton, Michele; Grozinger, Christina M; Webster, Thomas C; Su, Songkun; Chen, Yan Ping

2016-11-15

Nosema ceranae is a new and emerging microsporidian parasite of European honey bees, Apis mellifera, that has been implicated in colony losses worldwide. RNA interference (RNAi), a posttranscriptional gene silencing mechanism, has emerged as a potent and specific strategy for controlling infections of parasites and pathogens in honey bees. While previous studies have focused on the silencing of parasite/pathogen virulence factors, we explore here the possibility of silencing a host factor as a mechanism for reducing parasite load. Specifically, we used an RNAi strategy to reduce the expression of a honey bee gene, naked cuticle (nkd), which is a negative regulator of host immune function. Our studies found that nkd mRNA levels in adult bees were upregulated by N. ceranae infection (and thus, the parasite may use this mechanism to suppress host immune function) and that ingestion of double-stranded RNA (dsRNA) specific to nkd efficiently silenced its expression. Furthermore, we found that RNAi-mediated knockdown of nkd transcripts in Nosema-infected bees resulted in upregulation of the expression of several immune genes (Abaecin, Apidaecin, Defensin-1, and PGRP-S2), reduction of Nosema spore loads, and extension of honey bee life span. The results of our studies clearly indicate that silencing the host nkd gene can activate honey bee immune responses, suppress the reproduction of N. ceranae, and improve the overall health of honey bees. This study represents a novel host-derived therapeutic for honey bee disease treatment that merits further exploration. Given the critical role of honey bees in the pollination of agricultural crops, it is urgent to develop strategies to prevent the colony decline induced by the infection of parasites/pathogens. Targeting parasites and pathogens directly by RNAi has been proven to be useful for controlling infections in honey bees, but little is known about the disease impacts of RNAi silencing of host factors. 
Here, we demonstrate that knocking down the honey bee immune repressor-encoding nkd gene can suppress the reproduction of N. ceranae and improve the overall health of honey bees, which highlights the potential role of host-derived and RNAi-based therapeutics in controlling the infections in honey bees. The information obtained from this study will have positive implications for honey bee disease management practices. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Genomes by design

PubMed Central

Haimovich, Adrian D.; Muir, Paul; Isaacs, Farren J.

2016-01-01

Next-generation DNA sequencing has revealed the complete genome sequences of numerous organisms, establishing a fundamental and growing understanding of genetic variation and phenotypic diversity. Engineering at the gene, network and whole-genome scale aims to introduce targeted genetic changes both to explore emergent phenotypes and to introduce new functionalities. Expansion of these approaches into massively parallel platforms establishes the ability to generate targeted genome modifications, elucidating causal links between genotype and phenotype, as well as the ability to design and reprogramme organisms. In this Review, we explore techniques and applications in genome engineering, outlining key advances and defining challenges. PMID:26260262
Down-regulated energy metabolism genes associated with mitochondria oxidative phosphorylation and fatty acid metabolism in viral cardiomyopathy mouse heart.

PubMed

Xu, Jing; Nie, Hong-gang; Zhang, Xiao-dong; Tian, Ye; Yu, Bo

2011-08-01

Most experimental and clinical studies indicate that the hypertrophied and failing myocardium is characterized by changes in energy and substrate metabolism that are attributed to changes at the genomic level in the failing heart. Because heart failure is caused by various diseases whose energy and substrate metabolism involve different genetic variations, the potential significance of these molecular mechanisms for the aetiology of heart failure needs to be evaluated. Persistent viral infection of the myocardium (especially by coxsackievirus group B3) in viral myocarditis and viral dilated cardiomyopathy has long received attention from experts. This study aimed to explore the role and regulatory mechanism of altered expression of energy metabolism genes involved in mitochondrial oxidative phosphorylation and fatty acid metabolism in viral dilated cardiomyopathy. cDNA microarray technology was used to evaluate the expression of >35,852 genes in a mouse model of viral dilated cardiomyopathy. Among a total of 1385 highly differentially expressed genes, we analyzed 33 altered energy metabolism genes involved in mitochondrial oxidative phosphorylation and fatty acid metabolism. We further used real-time PCR to quantify UCP2, one of the regulators of energy metabolism including fatty acid metabolism, and assayed cytochrome C oxidase activity by spectrophotometry to explore mitochondrial oxidative phosphorylation function. We found markedly different expression of 33 energy metabolism genes associated with mitochondrial oxidative phosphorylation and fatty acid metabolism in the cardiomyopathy mouse heart; the energy metabolism regulatory gene UCP2 was down-regulated and cytochrome C oxidase activity was decreased. 
Genes involved in both fatty acid metabolism and mitochondrial oxidative phosphorylation were down-regulated. Expression of mitochondrial uncoupling protein 2 (UCP2) decreased rather than increased, which might be an adaptive protective response that regulates energy metabolism for ATP production.

Integration of lncRNA and mRNA Transcriptome Analyses Reveals Genes and Pathways Potentially Involved in Calf Intestinal Growth and Development during the Early Weeks of Life

PubMed Central

Do, Duy N.; Dudemaine, Pier-Luc; Fomenky, Bridget E.

2018-01-01

A better understanding of the factors that regulate growth and immune response of the gastrointestinal tract (GIT) of calves will promote informed management practices in calf rearing. This study aimed to explore genomics (messenger RNA (mRNA)) and epigenomics (long non-coding RNA (lncRNA)) mechanisms regulating the development of the rumen and ileum in calves. Thirty-two calves (≈5 days old) were reared for 96 days following standard procedures. Sixteen calves were humanely euthanized on experiment day 33 (D33) (pre-weaning) and another 16 on D96 (post-weaning) for collection of ileum and rumen tissues. RNA from tissues was subjected to next generation sequencing and 3310 and 4217 mRNAs were differentially expressed (DE) between D33 and D96 in ileum and rumen tissues, respectively. Gene ontology and pathway enrichment of DE genes confirmed their roles in developmental processes, immunity and lipid metabolism. A total of 1568 (63 known and 1505 novel) and 4243 (88 known and 4155 novel) lncRNAs were detected in ileum and rumen tissues, respectively. Cis target gene analysis identified BMPR1A, an important gene for a GIT disease (juvenile polyposis syndrome) in humans, as a candidate cis target gene for lncRNAs in both tissues. LncRNA cis target gene enrichment suggested that lncRNAs might regulate growth and development in both tissues as well as posttranscriptional gene silencing by RNA or microRNA processing in rumen, or disease resistance mechanisms in ileum. This study provides a catalog of bovine lncRNAs and sets a baseline for exploring their functions in calf GIT development. PMID:29510583
Integration of lncRNA and mRNA Transcriptome Analyses Reveals Genes and Pathways Potentially Involved in Calf Intestinal Growth and Development during the Early Weeks of Life.

PubMed

Ibeagha-Awemu, Eveline M; Do, Duy N; Dudemaine, Pier-Luc; Fomenky, Bridget E; Bissonnette, Nathalie

2018-03-05

A better understanding of the factors that regulate growth and immune response of the gastrointestinal tract (GIT) of calves will promote informed management practices in calf rearing. This study aimed to explore genomics (messenger RNA (mRNA)) and epigenomics (long non-coding RNA (lncRNA)) mechanisms regulating the development of the rumen and ileum in calves. Thirty-two calves (≈5 days old) were reared for 96 days following standard procedures. Sixteen calves were humanely euthanized on experiment day 33 (D33) (pre-weaning) and another 16 on D96 (post-weaning) for collection of ileum and rumen tissues. RNA from tissues was subjected to next generation sequencing and 3310 and 4217 mRNAs were differentially expressed (DE) between D33 and D96 in ileum and rumen tissues, respectively. Gene ontology and pathway enrichment of DE genes confirmed their roles in developmental processes, immunity and lipid metabolism. A total of 1568 (63 known and 1505 novel) and 4243 (88 known and 4155 novel) lncRNAs were detected in ileum and rumen tissues, respectively. Cis target gene analysis identified BMPR1A, an important gene for a GIT disease (juvenile polyposis syndrome) in humans, as a candidate cis target gene for lncRNAs in both tissues. LncRNA cis target gene enrichment suggested that lncRNAs might regulate growth and development in both tissues as well as posttranscriptional gene silencing by RNA or microRNA processing in rumen, or disease resistance mechanisms in ileum. This study provides a catalog of bovine lncRNAs and sets a baseline for exploring their functions in calf GIT development.
Functional genomics of the evolution of increased resistance to parasitism in Drosophila.

PubMed

Wertheim, Bregje; Kraaijeveld, Alex R; Hopkins, Meirion G; Walther Boer, Mark; Godfray, H Charles J

2011-03-01

Individual hosts normally respond to parasite attack by launching an acute immune response (a phenotypically plastic response), while host populations can respond in the longer term by evolving a higher level of defence against parasites. Little is known about the genetics of the evolved response: the identity and number of genes involved and whether it involves a pre-activation of the regulatory systems governing the plastic response. We explored these questions by surveying transcriptional changes in a Drosophila melanogaster strain artificially selected for resistance against the hymenopteran endoparasitoid Asobara tabida. Using microarrays, we profiled gene expression at seven time points during development (from the egg to the second instar larva) and found a large number of genes (almost 900) with altered expression levels. Bioinformatic analysis showed that some were involved in immunity or defence-associated functions but many were not. Previously, we had defined a set of genes whose level of expression changed after parasitoid attack, and a comparison with the present set showed a significant though comparatively small overlap. This suggests that the evolutionary response to parasitism is not a simple pre-activation of the plastic, acute response. We also found overlap in the genes involved in the evolutionary response to parasitism and to other biotic and abiotic stressors, perhaps suggesting a 'module' of genes involved in a generalized stress response as has been found in other organisms. © 2010 Blackwell Publishing Ltd.
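The abstract above reports a "significant though comparatively small overlap" between two gene sets. One common way to assess such an overlap is a hypergeometric upper-tail test. The sketch below is only an illustration of that general technique, not the authors' analysis: the function name `overlap_p_value` and all numbers in the usage lines are hypothetical.

```python
from math import comb


def overlap_p_value(universe: int, set_a: int, set_b: int, overlap: int) -> float:
    """Hypergeometric upper tail: probability of seeing at least `overlap`
    shared genes when `set_b` genes are drawn at random from a universe of
    `universe` genes, `set_a` of which belong to the first gene set."""
    max_overlap = min(set_a, set_b)
    total = comb(universe, set_b)  # all ways to choose the second set
    tail = sum(
        comb(set_a, k) * comb(universe - set_a, set_b - k)
        for k in range(overlap, max_overlap + 1)
    )
    return tail / total


# Hypothetical numbers, chosen only to show the call; they are not from the study.
p = overlap_p_value(universe=13000, set_a=900, set_b=150, overlap=25)
print(f"P(overlap >= 25) = {p:.3g}")
```

Exact integer arithmetic via `math.comb` keeps the sketch dependency-free; for very large gene universes a library routine such as a hypergeometric survival function would usually be preferred.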
The Role of Zic Genes in Inner Ear Development in the Mouse: Exploring Mutant Mouse Phenotypes

PubMed Central

Chervenak, Andrew P.; Bank, Lisa M.; Thomsen, Nicole; Glanville-Jones, Hannah C; Skibo, Jonathan; Millen, Kathleen J.; Arkell, Ruth M.; Barald, Kate F.

2014-01-01

Background: Murine Zic genes (Zic1-5) are expressed in the dorsal hindbrain and in periotic mesenchyme (POM) adjacent to the developing inner ear. Zic genes are involved in developmental signaling pathways in many organ systems, including the ear, although their exact roles haven't been fully elucidated. This report examines the role of Zic1, Zic2, and Zic4 during inner ear development in mouse mutants in which these Zic genes are affected. Results: Zic1/Zic4 double mutants don't exhibit any apparent defects in inner ear morphology. By contrast, inner ears from Zic2kd/kd and Zic2Ku/Ku mutants have severe but variable morphological defects in endolymphatic duct/sac and semicircular canal formation and in cochlear extension in the inner ear. Analysis of otocyst patterning in the Zic2Ku/Ku mutants by in situ hybridization showed changes in the expression patterns of Gbx2 and Pax2. Conclusions: The experiments provide the first genetic evidence that the Zic genes are required for morphogenesis of the inner ear. Zic2 loss-of-function doesn't prevent initial otocyst patterning but leads to molecular abnormalities concomitant with morphogenesis of the endolymphatic duct. Functional hearing deficits often accompany inner ear dysmorphologies, making Zic2 a novel candidate gene for ongoing efforts to identify the genetic basis of human hearing loss. PMID:25178196
Abundance and Genetic Diversity of nifH Gene Sequences in Anthropogenically Affected Brazilian Mangrove Sediments

PubMed Central

Dias, Armando Cavalcante Franco; Pereira e Silva, Michele de Cassia; Cotta, Simone Raposo; Dini-Andreote, Francisco; Soares, Fábio Lino; Salles, Joana Falcão; Azevedo, João Lúcio; van Elsas, Jan Dirk

2012-01-01

Although mangroves represent ecosystems of global importance, the genetic diversity and abundance of functional genes that are key to their functioning have scarcely been explored. Here, we present a survey based on the nifH gene across transects of sediments of two mangrove systems located along the coastline of São Paulo state (Brazil) which differed by degree of disturbance, i.e., an oil-spill-affected and an unaffected mangrove. The diazotrophic communities were assessed by denaturing gradient gel electrophoresis (DGGE), quantitative PCR (qPCR), and clone libraries. The nifH gene abundance was similar across the two mangrove sediment systems, as evidenced by qPCR. However, the nifH-based PCR-DGGE profiles revealed clear differences between the mangroves. Moreover, shifts in the nifH gene diversities were noted along the land-sea transect within the previously oiled mangrove. The nifH gene diversity depicted the presence of nitrogen-fixing bacteria affiliated with a wide range of taxa, encompassing members of the Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Firmicutes, and also a group of anaerobic sulfate-reducing bacteria. We also detected a unique mangrove-specific cluster of sequences denoted Mgv-nifH. Our results indicate that nitrogen-fixing bacterial guilds can be partially endemic to mangroves, and these communities are modulated by oil contamination, which has important implications for conservation strategies. PMID:22941088
Abundance and genetic diversity of nifH gene sequences in anthropogenically affected Brazilian mangrove sediments.

PubMed

Dias, Armando Cavalcante Franco; Pereira e Silva, Michele de Cassia; Cotta, Simone Raposo; Dini-Andreote, Francisco; Soares, Fábio Lino; Salles, Joana Falcão; Azevedo, João Lúcio; van Elsas, Jan Dirk; Andreote, Fernando Dini

2012-11-01

Although mangroves represent ecosystems of global importance, the genetic diversity and abundance of functional genes that are key to their functioning have scarcely been explored. Here, we present a survey based on the nifH gene across transects of sediments of two mangrove systems located along the coastline of São Paulo state (Brazil) which differed by degree of disturbance, i.e., an oil-spill-affected and an unaffected mangrove. The diazotrophic communities were assessed by denaturing gradient gel electrophoresis (DGGE), quantitative PCR (qPCR), and clone libraries. The nifH gene abundance was similar across the two mangrove sediment systems, as evidenced by qPCR. However, the nifH-based PCR-DGGE profiles revealed clear differences between the mangroves. Moreover, shifts in the nifH gene diversities were noted along the land-sea transect within the previously oiled mangrove. The nifH gene diversity depicted the presence of nitrogen-fixing bacteria affiliated with a wide range of taxa, encompassing members of the Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Firmicutes, and also a group of anaerobic sulfate-reducing bacteria. We also detected a unique mangrove-specific cluster of sequences denoted Mgv-nifH. Our results indicate that nitrogen-fixing bacterial guilds can be partially endemic to mangroves, and these communities are modulated by oil contamination, which has important implications for conservation strategies.
Turning publicly available gene expression data into discoveries using gene set context analysis.

PubMed

Ji, Zhicheng; Vokes, Steven A; Dang, Chi V; Ji, Hongkai

2016-01-08

Gene Set Context Analysis (GSCA) is an open source software package to help researchers use massive amounts of publicly available gene expression data (PED) to make discoveries. Users can interactively visualize and explore gene and gene set activities in 25,000+ consistently normalized human and mouse gene expression samples representing diverse biological contexts (e.g. different cells, tissues and disease types, etc.). By providing one or multiple genes or gene sets as input and specifying a gene set activity pattern of interest, users can query the expression compendium to systematically identify biological contexts associated with the specified gene set activity pattern. In this way, researchers with new gene sets from their own experiments may discover previously unknown contexts of gene set functions and hence increase the value of their experiments. GSCA has a graphical user interface (GUI). The GUI makes the analysis convenient and customizable. Analysis results can be conveniently exported as publication quality figures and tables. GSCA is available at https://github.com/zji90/GSCA. This software significantly lowers the bar for biomedical investigators to use PED in their daily research for generating and screening hypotheses, which was previously difficult because of the complexity, heterogeneity and size of the data. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Whole-exome sequencing in obsessive-compulsive disorder identifies rare mutations in immunological and neurodevelopmental pathways

PubMed Central

Cappi, C; Brentani, H; Lima, L; Sanders, S J; Zai, G; Diniz, B J; Reis, V N S; Hounie, A G; Conceição do Rosário, M; Mariani, D; Requena, G L; Puga, R; Souza-Duran, F L; Shavitt, R G; Pauls, D L; Miguel, E C; Fernandez, T V

2016-01-01

Studies of rare genetic variation have identified molecular pathways conferring risk for developmental neuropsychiatric disorders. To date, no published whole-exome sequencing studies have been reported in obsessive-compulsive disorder (OCD). We sequenced all the genome coding regions in 20 sporadic OCD cases and their unaffected parents to identify rare de novo (DN) single-nucleotide variants (SNVs). The primary aim of this pilot study was to determine whether DN variation contributes to OCD risk. To this aim, we evaluated whether there is an elevated rate of DN mutations in OCD, which would justify this approach toward gene discovery in larger studies of the disorder. Furthermore, to explore functional molecular correlations among genes with nonsynonymous DN SNVs in OCD probands, a protein–protein interaction (PPI) network was generated based on databases of direct molecular interactions. We applied Degree-Aware Disease Gene Prioritization (DADA) to rank the PPI network genes based on their relatedness to a set of OCD candidate genes from two OCD genome-wide association studies (Stewart et al., 2013; Mattheisen et al., 2014). In addition, we performed a pathway analysis with genes from the PPI network. The rate of DN SNVs in OCD was 2.51 × 10^-8 per base per generation, significantly higher than a previously estimated rate in unaffected subjects using the same sequencing platform and analytic pipeline. Several genes harboring DN SNVs in OCD were highly interconnected in the PPI network and ranked high in the DADA analysis. Nearly all the DN SNVs in this study are in genes expressed in the human brain, and a pathway analysis revealed enrichment in immunological and central nervous system functioning and development. 
The results of this pilot study indicate that further investigation of DN variation in larger OCD cohorts is warranted to identify specific risk genes and to confirm our preliminary finding with regard to PPI network enrichment for particular biological pathways and functions. PMID:27023170
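The abstract above quotes a de novo SNV rate "per base per generation". A per-base rate of this kind is commonly obtained by dividing the number of de novo variants by the number of haploid bases that could be assessed across all trios. The sketch below illustrates that arithmetic under this assumption; it is not the authors' pipeline, and the function name and the numbers in the usage lines are hypothetical.

```python
def de_novo_rate(n_de_novo: int, n_trios: int, callable_bases_per_child: int) -> float:
    """Per-base, per-generation de novo rate.

    Assumes every child contributes the same number of callable diploid
    positions; the factor 2 counts both parental haplotypes.
    """
    haploid_bases_assessed = 2 * n_trios * callable_bases_per_child
    return n_de_novo / haploid_bases_assessed


# Hypothetical inputs for illustration only (not taken from the study).
rate = de_novo_rate(n_de_novo=10, n_trios=5, callable_bases_per_child=1_000_000)
print(f"{rate:.2e} de novo SNVs per base per generation")
```

In practice the callable-base count differs between samples, so a real analysis would sum the per-trio callable bases rather than assume a single shared value.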
A remarkable synergistic effect at the transcriptomic level in peach fruits doubly infected by prunus necrotic ringspot virus and peach latent mosaic viroid.

PubMed

Herranz, Mari Carmen; Niehl, Annette; Rosales, Marlene; Fiore, Nicola; Zamorano, Alan; Granell, Antonio; Pallas, Vicente

2013-05-28

Microarray profiling is a powerful technique to investigate expression changes of large amounts of genes in response to specific environmental conditions. The majority of the studies investigating gene expression changes in virus-infected plants are limited to interactions between a virus and a model host plant, which usually is Arabidopsis thaliana or Nicotiana benthamiana. In the present work, we performed microarray profiling to explore changes in the expression profile of field-grown Prunus persica (peach) originating from Chile upon single and double infection with Prunus necrotic ringspot virus (PNRSV) and Peach latent mosaic viroid (PLMVd), worldwide natural pathogens of peach trees. Upon single PLMVd or PNRSV infection, the number of statistically significant gene expression changes was relatively low. By contrast, doubly-infected fruits presented a high number of differentially regulated genes. Among these, down-regulated genes were prevalent. Functional categorization of the gene expression changes upon double PLMVd and PNRSV infection revealed protein modification and degradation as the functional category with the highest percentage of repressed genes whereas induced genes encoded mainly proteins related to phosphate, C-compound and carbohydrate metabolism and also protein modification. Overrepresentation analysis upon double infection with PLMVd and PNRSV revealed specific functional categories over- and underrepresented among the repressed genes indicating active counter-defense mechanisms of the pathogens during infection. Our results identify a novel synergistic effect of PLMVd and PNRSV on the transcriptome of peach fruits. We demonstrate that mixed infections, which occur frequently in field conditions, result in a more complex transcriptional response than that observed in single infections. 
Thus, our data demonstrate for the first time that the simultaneous infection of a viroid and a plant virus synergistically affects the host transcriptome in infected peach fruits. These field studies can help to fully understand plant-pathogen interactions and to develop appropriate crop protection strategies.
A remarkable synergistic effect at the transcriptomic level in peach fruits doubly infected by prunus necrotic ringspot virus and peach latent mosaic viroid

PubMed Central

2013-01-01

Background: Microarray profiling is a powerful technique to investigate expression changes of large amounts of genes in response to specific environmental conditions. The majority of the studies investigating gene expression changes in virus-infected plants are limited to interactions between a virus and a model host plant, which usually is Arabidopsis thaliana or Nicotiana benthamiana. In the present work, we performed microarray profiling to explore changes in the expression profile of field-grown Prunus persica (peach) originating from Chile upon single and double infection with Prunus necrotic ringspot virus (PNRSV) and Peach latent mosaic viroid (PLMVd), worldwide natural pathogens of peach trees. Results: Upon single PLMVd or PNRSV infection, the number of statistically significant gene expression changes was relatively low. By contrast, doubly-infected fruits presented a high number of differentially regulated genes. Among these, down-regulated genes were prevalent. Functional categorization of the gene expression changes upon double PLMVd and PNRSV infection revealed protein modification and degradation as the functional category with the highest percentage of repressed genes whereas induced genes encoded mainly proteins related to phosphate, C-compound and carbohydrate metabolism and also protein modification. Overrepresentation analysis upon double infection with PLMVd and PNRSV revealed specific functional categories over- and underrepresented among the repressed genes indicating active counter-defense mechanisms of the pathogens during infection. Conclusions: Our results identify a novel synergistic effect of PLMVd and PNRSV on the transcriptome of peach fruits. We demonstrate that mixed infections, which occur frequently in field conditions, result in a more complex transcriptional response than that observed in single infections. 
Thus, our data demonstrate for the first time that the simultaneous infection of a viroid and a plant virus synergistically affects the host transcriptome in infected peach fruits. These field studies can help to fully understand plant-pathogen interactions and to develop appropriate crop protection strategies. PMID:23710752
PNPLA3, the triacylglycerol synthesis/hydrolysis/storage dilemma, and nonalcoholic fatty liver disease

PubMed Central

Sookoian, Silvia; Pirola, Carlos J

2012-01-01

Genome-wide and candidate gene association studies have identified several variants that predispose individuals to developing nonalcoholic fatty liver disease (NAFLD). However, the gene that has been consistently involved in the genetic susceptibility of NAFLD in humans is patatin-like phospholipase domain containing 3 (PNPLA3, also known as adiponutrin). A nonsynonymous single nucleotide polymorphism in PNPLA3 (rs738409 C/G, a coding variant that encodes an amino acid substitution I148M) is significantly associated with fatty liver and histological disease severity, not only in adults but also in children. Nevertheless, how PNPLA3 influences the biology of fatty liver disease is still an open question. A recent article describes new aspects of PNPLA3 gene/protein function and suggests that the I148M variant promotes hepatic lipid synthesis due to a gain of function. We review here the published data about the role of the I148M variant in lipogenesis/lipolysis, and suggest putative areas of future research. For instance, we explored in silico whether the rs738409 C or G alleles have the ability to modify miRNA binding sites and miRNA gene regulation, and we found that prediction of PNPLA3 target miRNAs shows two miRNAs potentially interacting in the 3′ UTR region (hsa-miR-769-3p and hsa-miR-516a-3p). In addition, interesting unanswered questions remain to be explored. For example, PNPLA3 lies between two CCCTC-binding factor-bound sites that could be tested for insulator activity, and an intronic histone 3 lysine 4 trimethylation peak predicts an enhancer element, corroborated by the DNase I hypersensitivity site peak. Finally, an interaction between PNPLA3 and glycerol-3-phosphate acyltransferase 2 is suggested by data mining. PMID:23155331
[Biological characteristics of an enteroinvasive Escherichia coli strain with tatABC deletion].

PubMed

Gong, Zhaolong; Ye, Changyun; Liu, Xiaobing; Zhang, Min; Zhuo, Qin

2013-05-04

This study examined the relationship between the twin-arginine translocation (Tat) system and the biological characteristics of enteroinvasive Escherichia coli (EIEC). Through homologous recombination, we constructed a tatABC gene deletion strain and a complemented strain of EIEC, and explored their effects on bacterial morphology, substrate transport function, and invasiveness in HeLa cells and guinea pig cornea. The tatABC deletion strain showed apparent changes in bacterial morphology, loss of substrate transport function, and significantly weakened invasiveness (the number of deletion-strain bacteria invading HeLa cells decreased significantly, and its capacity to cause corneal lesions in guinea pigs was significantly weakened), while the complemented strain was similar to the wild-type strain in these respects. The Tat protein transport system of EIEC is closely related to the biological characteristics of EIEC.
Gene expression variations during Drosophila metamorphosis in real and simulated gravity

NASA Astrophysics Data System (ADS)

Marco, R.; Leandro-García, L. J.; Benguría, A.; Herranz, R.; Zeballos, A.; Gassert, G.; van Loon, J. J.; Medina, F. J.

Establishing the extent and significance of the effects of exposing complex living organisms to microgravity is critical if the long-term exploration of nearby planets involving human beings is going to take place in the future. As a first step in this direction, we have started to look into the patterns of gene expression during Drosophila development in real and simulated microgravity, using microarray analysis of mRNA isolated from samples exposed to different environmental conditions. In these experiments we used Affymetrix chips, version 1.0, containing probes for more than 14,000 genes, almost the complete Drosophila genome; 55% of these are tagged with some molecular or functional designation, while 45% are still waiting to be identified in functional terms. The real microgravity exposure was imposed on the samples during the crew-exchanging Soyuz 8 Mission to the ISS in October 2003, when, after 11 days in microgravity, the Spanish-born astronaut Pedro Duque returned in the Soyuz 7 capsule carrying the experiments prepared by our team. Due to the constraints on current ISS experiments in these missions, we limited the stages explored in our experiment to the developmental processes occurring during Drosophila metamorphosis. As the experimental conditions at the launch site, Baikonur, were fairly limited, we prepared the experiment in Madrid/Toulouse and transported the samples at 15 °C in a temperature-controlled container to slow down the developmental process a
Xylella fastidiosa gene expression analysis by DNA microarrays.

PubMed

Travensolo, Regiane F; Carareto-Alves, Lucia M; Costa, Maria V C G; Lopes, Tiago J S; Carrilho, Emanuel; Lemos, Eliana G M

2009-04-01

Xylella fastidiosa genome sequencing has generated valuable data by identifying genes acting either on metabolic pathways or in associated pathogenicity and virulence. Based on available information on these genes, new strategies for studying their expression patterns, such as microarray technology, were employed. A total of 2,600 primer pairs were synthesized and then used to generate fragments using the PCR technique. The arrays were hybridized against cDNAs labeled during reverse transcription reactions and which were obtained from bacteria grown under two different conditions (liquid XDM(2) and liquid BCYE). All data were statistically analyzed to verify which genes were differentially expressed. In addition to exploring conditions for X. fastidiosa genome-wide transcriptome analysis, the present work observed the differential expression of several classes of genes (energy, protein, amino acid and nucleotide metabolism, transport, degradation of substances, toxins and hypothetical proteins, among others). The understanding of expressed genes in these two different media will be useful in comprehending the metabolic characteristics of X. fastidiosa, and in evaluating how important certain genes are for the functioning and survival of these bacteria in plants.
[New advances in animal transgenic technology].

PubMed

Sun, Zhen-Hong; Miao, Xiang-Yang; Zhu, Rui-Liang

2010-06-01

Animal transgenic technology is one of the fastest growing biotechnologies of the 21st century. It is used to integrate foreign genes into the animal genome by genetic engineering technology so that foreign genes can be expressed and inherited by offspring. The transgenic efficiency and precise control of gene expression are the key limiting factors in the preparation of transgenic animals. A variety of transgenic techniques are available, each of which has its own advantages and disadvantages and still needs further study because of unresolved technical and safety issues. With in-depth research, transgenic technology will have broad application prospects in the fields of exploration of gene function, animal genetic improvement, bioreactors, animal disease models, organ transplantation and so on. This article reviews the recently developed animal gene transfer techniques, including the germline stem cell mediated method to improve efficiency, gene targeting to improve accuracy, RNA interference (RNAi)-mediated gene silencing technology, and induced pluripotent stem cell (iPS) transgenic technology. The new transgenic techniques can provide a better platform for the study of transgenic animals and promote the development of medical sciences, livestock production, and other fields.
Recent advances in the development of new transgenic animal technology.

PubMed

Miao, Xiangyang

2013-03-01

Transgenic animal technology is one of the fastest growing biotechnology areas. It is used to integrate exogenous genes into the animal genome by genetic engineering technology so that these genes can be inherited and expressed by offspring. The transgenic efficiency and precise control of gene expression are the key limiting factors in the production of transgenic animals. A variety of transgenic technologies are available. Each has its own advantages and disadvantages and needs further study because of unresolved technical and safety issues. Further studies will allow transgenic technology to explore gene function, animal genetic improvement, bioreactors, animal disease models, and organ transplantation. This article reviews the recently developed animal transgenic technologies, including the germ line stem cell-mediated method to improve efficiency, gene targeting to improve accuracy, RNA interference-mediated gene silencing technology, zinc-finger nuclease gene targeting technology and induced pluripotent stem cell technology. These new transgenic techniques can provide a better platform to develop transgenic animals for breeding new animal varieties and promote the development of medical sciences, livestock production, and other fields.
A system-level model for the microbial regulatory genome.

PubMed

Brooks, Aaron N; Reiss, David J; Allard, Antoine; Wu, Wei-Ju; Salvanha, Diego M; Plaisier, Christopher L; Chandrasekaran, Sriram; Pan, Min; Kaur, Amardeep; Baliga, Nitin S

2014-07-15

Microbes can tailor transcriptional responses to diverse environmental challenges despite having streamlined genomes and a limited number of regulators. Here, we present data-driven models that capture the dynamic interplay of the environment and genome-encoded regulatory programs of two types of prokaryotes: Escherichia coli (a bacterium) and Halobacterium salinarum (an archaeon). The models reveal how the genome-wide distributions of cis-acting gene regulatory elements and the conditional influences of transcription factors at each of those elements encode programs for eliciting a wide array of environment-specific responses. We demonstrate how these programs partition transcriptional regulation of genes within regulons and operons to re-organize gene-gene functional associations in each environment. The models capture fitness-relevant co-regulation by different transcriptional control mechanisms acting across the entire genome, to define a generalized, system-level organizing principle for prokaryotic gene regulatory networks that goes well beyond existing paradigms of gene regulation. An online resource (http://egrin2.systemsbiology.net) has been developed to facilitate multiscale exploration of conditional gene regulation in the two prokaryotes. © 2014 The Authors. Published under the terms of the CC BY 4.0 license.
Myocardin-related transcription factors are required for cardiac development and function

PubMed Central

Mokalled, Mayssa H.; Carroll, Kelli J.; Cenik, Bercin K.; Chen, Beibei; Liu, Ning; Olson, Eric N.; Bassel-Duby, Rhonda

2016-01-01

Myocardin-Related Transcription Factors A and B (MRTF-A and MRTF-B) are highly homologous proteins that function as powerful coactivators of serum response factor (SRF), a ubiquitously expressed transcription factor essential for cardiac development. The SRF/MRTF complex binds to CArG boxes found in the control regions of genes that regulate cytoskeletal dynamics and muscle contraction, among other processes. While SRF is required for heart development and function, the role of MRTFs in the developing or adult heart has not been explored. Through cardiac-specific deletion of MRTF alleles in mice, we show that either MRTF-A or MRTF-B is dispensable for cardiac development and function, whereas deletion of both MRTF-A and MRTF-B causes a spectrum of structural and functional cardiac abnormalities. Defects observed in MRTF-A/B null mice ranged from reduced cardiac contractility and adult onset heart failure to neonatal lethality accompanied by sarcomere disarray. RNA-seq analysis on neonatal hearts identified the most altered pathways in MRTF double knockout hearts as being involved in cytoskeletal organization. Together, these findings demonstrate redundant but essential roles of the MRTFs in maintenance of cardiac structure and function and as indispensable links in cardiac cytoskeletal gene regulatory networks. PMID:26386146
SWI/SNF Subunits SMARCA4, SMARCD2 and DPF2 Collaborate in MLL-Rearranged Leukaemia Maintenance.

PubMed

Cruickshank, V Adam; Sroczynska, Patrycja; Sankar, Aditya; Miyagi, Satoru; Rundsten, Carsten Friis; Johansen, Jens Vilstrup; Helin, Kristian

2015-01-01

Alterations in chromatin structure caused by deregulated epigenetic mechanisms collaborate with underlying genetic lesions to promote cancer. SMARCA4/BRG1, a core component of the SWI/SNF ATP-dependent chromatin-remodelling complex, has been implicated by its mutational spectrum as exerting a tumour-suppressor function in many solid tumours; recently, however, it has been reported to sustain leukaemogenic transformation in MLL-rearranged leukaemia in mice. Here we further explore the role of SMARCA4 and the two SWI/SNF subunits SMARCD2/BAF60B and DPF2/BAF45D in leukaemia. We observed a selective requirement for these proteins for leukaemic cell expansion and self-renewal in vitro as well as in leukaemia. Gene expression profiling in human cells of each of these three factors suggests that they have overlapping functions in leukaemia. The gene expression changes induced by loss of the three proteins demonstrate that they are required for the expression of haematopoietic stem cell associated genes but, in contrast to previous results obtained in mouse cells, the three proteins are not required for the expression of c-MYC regulated genes.
Does stress remove the HDAC brakes for the formation and persistence of long-term memory?

PubMed

White, André O; Wood, Marcelo A

2014-07-01

It has been known for numerous decades that gene expression is required for long-lasting forms of memory. In the past decade, the study of epigenetic mechanisms in memory processes has revealed yet another layer of complexity in the regulation of gene expression. Epigenetic mechanisms do not only provide complexity in the protein regulatory complexes that control coordinate transcription for specific cell function, but the epigenome encodes critical information that integrates experience and cellular history for specific cell functions as well. Thus, epigenetic mechanisms provide a unique mechanism of gene expression regulation for memory processes. This may be why critical negative regulators of gene expression, such as histone deacetylases (HDACs), have powerful effects on the formation and persistence of memory. For example, HDAC inhibition has been shown to transform a subthreshold learning event into robust long-term memory and also generate a form of long-term memory that persists beyond the point at which normal long-term memory fails. A key question that is explored in this review, from a learning and memory perspective, is whether stress-dependent signaling drives the formation and persistence of long-term memory via HDAC-dependent mechanisms. Copyright © 2013 Elsevier Inc. All rights reserved.

[Diversity and antimicrobial activities of cultivable bacteria isolated from Jiaozhou Bay].

PubMed

Wang, Yiting; Zhang, Chuanbo; Qi, Lin; Jia, Xiaoqiang; Lu, Wenyu

2016-12-04

Marine microorganisms have great potential for producing biologically active secondary metabolites. To study their diversity and antimicrobial activity, we explored 9 sediment samples from different observation sites in Jiaozhou Bay. We used YPD and Z2216E culture media to isolate bacteria from the sediments; the 16S rRNA gene was sequenced for classification and identification of the isolates. We then used the Oxford cup method to detect antimicrobial activities of the isolated bacteria against 7 test strains. Lastly, we selected 16 representative strains and screened them for secondary-metabolite biosynthesis genes (PKSI, NRPS, CYP, PhzE, dTGD) by specific PCR amplification. A total of 76 bacterial strains were isolated from Jiaozhou Bay. According to 16S rRNA gene sequence analysis, these strains could be sorted into 11 genera belonging to 8 different families: Aneurinibacillus, Brevibacillus, Microbacterium, Oceanisphae, Bacillus, Marinomonas, Staphylococcus, Kocuria, Arthrobacter, Micrococcus and Pseudoalteromonas. Of these, 34 strains showed antimicrobial activity against at least one of the tested strains. All 16 representative strains had at least one functional gene, and 5 strains possessed more than three functional genes. The Jiaozhou Bay area is rich in microbial resources with the potential to provide useful secondary metabolites.
Does stress remove the HDAC brakes for the formation and persistence of long-term memory?

PubMed Central

White, André O.; Wood, Marcelo A.

2013-01-01

It has been known for numerous decades that gene expression is required for long-lasting forms of memory. In the past decade, the study of epigenetic mechanisms in memory processes has revealed yet another layer of complexity in the regulation of gene expression. Epigenetic mechanisms do not only provide complexity in the protein regulatory complexes that control coordinate transcription for specific cell function, but the epigenome encodes critical information that integrates experience and cellular history for specific cell functions as well. Thus, epigenetic mechanisms provide a unique mechanism of gene expression regulation for memory processes. This may be why critical negative regulators of gene expression, such as histone deacetylases (HDACs), have powerful effects on the formation and persistence of memory. For example, HDAC inhibition has been shown to transform a subthreshold learning event into robust long-term memory and also generate a form of long-term memory that persists beyond the point at which normal long-term memory fails. A key question that is explored in this review, from a learning and memory perspective, is whether stress-dependent signaling drives the formation and persistence of long-term memory via HDAC-dependent mechanisms. PMID:24149059
Monoamine oxidase A gene promoter methylation and transcriptional downregulation in an offender population with antisocial personality disorder.

PubMed

Checknita, D; Maussion, G; Labonté, B; Comai, S; Tremblay, R E; Vitaro, F; Turecki, N; Bertazzo, A; Gobbi, G; Côté, G; Turecki, G

2015-03-01

Antisocial personality disorder (ASPD) is characterised by elevated impulsive aggression and increased risk for criminal behaviour and incarceration. Deficient activity of the monoamine oxidase A (MAOA) gene is suggested to contribute to serotonergic system dysregulation strongly associated with impulsive aggression and antisocial criminality. This study aimed to elucidate the role of epigenetic processes in altered MAOA expression and serotonin regulation in a population of incarcerated offenders with ASPD compared with a healthy non-incarcerated control population. Participants were 86 incarcerated individuals with ASPD and 73 healthy controls. MAOA promoter methylation was compared between case and control groups. We explored the functional impact of MAOA promoter methylation on gene expression in vitro and blood 5-HT levels in a subset of the case group. Results suggest that MAOA promoter hypermethylation is associated with ASPD and may contribute to downregulation of MAOA gene expression, as indicated by functional assays in vitro, and regression analysis with whole-blood serotonin levels in offenders with ASPD. These results are consistent with prior literature suggesting MAOA and serotonergic dysregulation in antisocial populations. Our results offer the first evidence suggesting epigenetic mechanisms may contribute to MAOA dysregulation in antisocial offenders. Royal College of Psychiatrists.
Sex determining gene on the X chromosome short arm: dosage sensitive sex reversal.

PubMed

Ogata, T; Matsuo, N

1996-08-01

The present review article summarizes current knowledge concerning the sex determining gene on Xp21, termed DSS (dosage sensitive sex reversal). The presence of DSS has been based on the finding that, in the presence of SRY, partial active Xp duplications encompassing the middle part of Xp result in sex reversal, whereas those of the distal or proximal part of Xp permit male sex development. Because Klinefelter patients develop as males, it is believed that DSS is normally subject to X-inactivation, and that two active copies of DSS override the function of SRY, resulting in gonadal dysgenesis because of meiotic pairing failure. It may be possible that DSS encodes a target sequence for repressing function of SRY or that DSS is involved in an X chromosome-counting mechanism. Molecular approaches have localized DSS to a 160 kb region and isolated candidate genes such as DAX-1 and MAGE-Xp, but there has been no formal evidence equating the candidate gene with DSS. In addition to its clinical importance, the exploration of DSS must provide a useful clue to phylogenetic studies of sex chromosomes and dosage compensation.
A method to identify differential expression profiles of time-course gene data with Fourier transformation.

PubMed

Kim, Jaehee; Ogden, Robert Todd; Kim, Haseong

2013-10-18

Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible to study gene expression changes over time and thereby to detect differentially expressed genes. Much of the early work on analyzing time series expression data relied on methods developed originally for static data and thus there is a need for improved methodology. Since time series expression is a temporal process, its unique features such as autocorrelation between successive points should be incorporated into the analysis. This work aims to identify genes that show different gene expression profiles across time. We propose a statistical procedure to discover gene groups with similar profiles using a nonparametric representation that accounts for the autocorrelation in the data. In particular, we first represent each profile in terms of a Fourier basis, and then we screen out genes that are not differentially expressed based on the Fourier coefficients. Finally, we cluster the remaining gene profiles using a model-based approach in the Fourier domain. We evaluate the screening results in terms of sensitivity, specificity, FDR and FNR, compare with the Gaussian process regression screening in a simulation study and illustrate the results by application to yeast cell-cycle microarray expression data with alpha-factor synchronization. The key elements of the proposed methodology are: (i) representation of gene profiles in the Fourier domain; (ii) automatic screening of genes based on the Fourier coefficients and taking into account autocorrelation in the data, while controlling the false discovery rate (FDR); (iii) model-based clustering of the remaining gene profiles. Using this method, we identified a set of cell-cycle-regulated time-course yeast genes.
The proposed method is general and can potentially be used to identify genes that share the same patterns or biological processes, and to help face the present and forthcoming challenges of data analysis in functional genomics.
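
The first step named in the abstract above, representing each expression profile in a Fourier basis, can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes evenly spaced samples covering exactly one period, it ignores autocorrelation and FDR control, and the function names and the simple "oscillation energy" summary are hypothetical, introduced here only for illustration.

```python
import math

def fourier_coefficients(profile, n_harmonics=2):
    """Project an evenly sampled profile onto a truncated Fourier basis.

    Assumes the samples cover exactly one period (an illustrative
    simplification). Returns [a0, a1, b1, a2, b2, ...], where a0 is the
    mean and (a_k, b_k) are the cosine and sine coefficients of harmonic k.
    """
    n = len(profile)
    if n_harmonics >= n / 2:
        raise ValueError("need more than 2 * n_harmonics samples")
    coeffs = [sum(profile) / n]
    for k in range(1, n_harmonics + 1):
        a_k = 2.0 / n * sum(
            y * math.cos(2.0 * math.pi * k * i / n) for i, y in enumerate(profile)
        )
        b_k = 2.0 / n * sum(
            y * math.sin(2.0 * math.pi * k * i / n) for i, y in enumerate(profile)
        )
        coeffs.extend([a_k, b_k])
    return coeffs

def oscillation_energy(coeffs):
    """Sum of squared non-constant coefficients (hypothetical summary statistic).

    A flat profile gives a value near zero, so a small value suggests
    little change over time.
    """
    return sum(c * c for c in coeffs[1:])

# Hypothetical profiles: one oscillating gene and one flat gene.
oscillating = [2.0 + 3.0 * math.cos(2.0 * math.pi * i / 12) for i in range(12)]
flat = [5.0] * 12
print(oscillation_energy(fourier_coefficients(oscillating)))
print(oscillation_energy(fourier_coefficients(flat)))
```

A real screening step would replace the energy summary with a test statistic that accounts for autocorrelation and would control the false discovery rate, as the abstract describes.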
Identification, distribution and molecular evolution of the pacifastin gene family in Metazoa

PubMed Central

Breugelmans, Bert; Simonet, Gert; van Hoef, Vincent; Van Soest, Sofie; Broeck, Jozef Vanden

2009-01-01

Background: Members of the pacifastin family are serine peptidase inhibitors, most of which are produced as multi-domain precursor proteins. Structural and biochemical characteristics of insect pacifastin-like peptides have been studied intensively, but only one inhibitor has been functionally characterised. Recent sequencing projects of metazoan genomes have created an unprecedented opportunity to explore the distribution, evolution and functional diversification of pacifastin genes in the animal kingdom. Results: A large scale in silico data mining search led to the identification of 83 pacifastin members with 284 inhibitor domains, distributed over 55 species from three metazoan phyla. In contrast to previous assumptions, members of this family were also found in phyla other than Arthropoda, including the sister phylum Onychophora and the 'primitive', non-bilaterian Placozoa. In Arthropoda, pacifastin members were found to be distributed among insect families of nearly all insect orders and for the first time also among crustacean species other than crayfish and the Chinese mitten crab. Contrary to precursors from Crustacea, the majority of insect pacifastin members contain dibasic cleavage sites, indicative of posttranslational processing into numerous inhibitor peptides. Whereas some insect species have lost the pacifastin gene, others were found to have several (often clustered) paralogous genes. Amino acids corresponding to the reactive site or involved in the folding of the inhibitor domain were analysed as a basis for the biochemical properties. Conclusion: The absence of the pacifastin gene in some insect genomes and the extensive gene expansion in other insects are indicative of the rapid (adaptive) evolution of this gene family.
In addition, differential processing mechanisms and a high variability in the reactive site residues and the inner core interactions contribute to a broad functional diversification of inhibitor peptides, indicating wide ranging roles in different physiological processes. Based on the observation of a pacifastin gene in Placozoa, it can be hypothesized that the ancestral pacifastin gene arose before the divergence of bilaterian animals. However, considering differences in gene structure between the placozoan and other pacifastin genes and the existence of a 'pacifastin gene gap' between Placozoa and Onychophora/Arthropoda, it cannot be excluded that the pacifastin signature originated twice by convergent evolution. PMID:19435517
Pathway Distiller - multisource biological pathway consolidation

PubMed Central

2012-01-01

Background: One method to understand and evaluate an experiment that produces a large set of genes, such as a gene expression microarray analysis, is to identify overrepresentation or enrichment for biological pathways. Because pathways are able to functionally describe the set of genes, much effort has been made to collect curated biological pathways into publicly accessible databases. When combining disparate databases, highly related or redundant pathways exist, making their consolidation into pathway concepts essential. This will facilitate unbiased, comprehensive yet streamlined analysis of experiments that result in large gene sets. Methods: After gene set enrichment finds representative pathways for large gene sets, pathways are consolidated into representative pathway concepts. Three complementary, but different methods of pathway consolidation are explored. Enrichment Consolidation combines the set of the pathways enriched for the signature gene list through iterative combining of enriched pathways with other pathways with similar signature gene sets; Weighted Consolidation utilizes a Protein-Protein Interaction network based gene-weighting approach that finds clusters of both enriched and non-enriched pathways limited to the experiments' resultant gene list; and finally the de novo Consolidation method uses several measurements of pathway similarity, that finds static pathway clusters independent of any given experiment. Results: We demonstrate that the three consolidation methods provide unified yet different functional insights of a resultant gene set derived from a genome-wide profiling experiment. Results from the methods are presented, demonstrating their applications in biological studies and comparing with a pathway web-based framework that also combines several pathway databases.
Additionally, a web-based consolidation framework that encompasses all three methods discussed in this paper, Pathway Distiller (http://cbbiweb.uthscsa.edu/PathwayDistiller), is established to allow researchers access to the methods and example microarray data described in this manuscript, and the ability to analyze their own gene list by using our unique consolidation methods. Conclusions: By combining several pathway systems, implementing different, but complementary pathway consolidation methods, and providing a user-friendly web-accessible tool, we have enabled users to extract functional explanations of their genome wide experiments. PMID:23134636
Pathway Distiller - multisource biological pathway consolidation.

PubMed

Doderer, Mark S; Anguiano, Zachry; Suresh, Uthra; Dashnamoorthy, Ravi; Bishop, Alexander J R; Chen, Yidong

2012-01-01

One method to understand and evaluate an experiment that produces a large set of genes, such as a gene expression microarray analysis, is to identify overrepresentation or enrichment for biological pathways. Because pathways are able to functionally describe the set of genes, much effort has been made to collect curated biological pathways into publicly accessible databases. When combining disparate databases, highly related or redundant pathways exist, making their consolidation into pathway concepts essential. This will facilitate unbiased, comprehensive yet streamlined analysis of experiments that result in large gene sets. After gene set enrichment finds representative pathways for large gene sets, pathways are consolidated into representative pathway concepts. Three complementary, but different methods of pathway consolidation are explored. Enrichment Consolidation combines the set of the pathways enriched for the signature gene list through iterative combining of enriched pathways with other pathways with similar signature gene sets; Weighted Consolidation utilizes a Protein-Protein Interaction network based gene-weighting approach that finds clusters of both enriched and non-enriched pathways limited to the experiments' resultant gene list; and finally the de novo Consolidation method uses several measurements of pathway similarity, that finds static pathway clusters independent of any given experiment. We demonstrate that the three consolidation methods provide unified yet different functional insights of a resultant gene set derived from a genome-wide profiling experiment. Results from the methods are presented, demonstrating their applications in biological studies and comparing with a pathway web-based framework that also combines several pathway databases. 
Additionally, a web-based consolidation framework that encompasses all three methods discussed in this paper, Pathway Distiller (http://cbbiweb.uthscsa.edu/PathwayDistiller), is established to allow researchers access to the methods and example microarray data described in this manuscript, and the ability to analyze their own gene list by using our unique consolidation methods. By combining several pathway systems, implementing different, but complementary pathway consolidation methods, and providing a user-friendly web-accessible tool, we have enabled users to extract functional explanations of their genome wide experiments.
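
The general idea of consolidating redundant pathways into pathway concepts can be sketched with a simple gene-set similarity measure. The example below is not the Pathway Distiller algorithm and does not reproduce any of its three methods: it assumes Jaccard similarity between gene sets and a greedy merge at a fixed threshold, and the pathway names, gene identifiers and threshold are all hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity between two gene sets (0.0 if both are empty)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def consolidate(pathways, threshold=0.5):
    """Greedily merge pathways whose gene sets overlap strongly.

    `pathways` maps a pathway name to an iterable of gene identifiers.
    Returns a list of (names, genes) pairs, one per consolidated concept.
    Merging repeats until no pair of concepts reaches the threshold.
    """
    concepts = [({name}, set(genes)) for name, genes in pathways.items()]
    merged = True
    while merged:
        merged = False
        for i in range(len(concepts)):
            for j in range(i + 1, len(concepts)):
                if jaccard(concepts[i][1], concepts[j][1]) >= threshold:
                    # Combine names and genes of the two concepts into one.
                    concepts[i] = (
                        concepts[i][0] | concepts[j][0],
                        concepts[i][1] | concepts[j][1],
                    )
                    del concepts[j]
                    merged = True
                    break
            if merged:
                break
    return concepts

# Hypothetical pathways from two databases with overlapping membership.
example = {
    "db1:cell_cycle": {"g1", "g2", "g3"},
    "db2:cell_cycle_control": {"g2", "g3", "g4"},
    "db1:lipid_metabolism": {"g9", "g10"},
}
for names, genes in consolidate(example):
    print(sorted(names), sorted(genes))
```

A fixed threshold on a single similarity measure is the simplest possible choice; the abstract describes richer strategies, including network-based gene weighting and several similarity measurements, which this sketch does not attempt.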
NRASG12V oncogene facilitates self-renewal in a murine model of acute myelogenous leukemia

PubMed Central

LaRue, Rebecca S.; Nguyen, Hanh T.; Sachs, Karen; Noble, Klara E.; Mohd Hassan, Nurul Azyan; Diaz-Flores, Ernesto; Rathe, Susan K.; Sarver, Aaron L.; Bendall, Sean C.; Ha, Ngoc A.; Diers, Miechaleen D.; Nolan, Garry P.; Shannon, Kevin M.; Largaespada, David A.

2014-01-01

Mutant RAS oncoproteins activate signaling molecules that drive oncogenesis in multiple human tumors including acute myelogenous leukemia (AML). However, the specific functions of these pathways in AML are unclear, thwarting the rational application of targeted therapeutics. To elucidate the downstream functions of activated NRAS in AML, we used a murine model that harbors Mll-AF9 and a tetracycline-repressible, activated NRAS (NRASG12V). Using computational approaches to explore our gene-expression data sets, we found that NRASG12V enforced the leukemia self-renewal gene-expression signature and was required to maintain an MLL-AF9– and Myb-dependent leukemia self-renewal gene-expression program. NRASG12V was required for leukemia self-renewal independent of its effects on growth and survival. Analysis of the gene-expression patterns of leukemic subpopulations revealed that the NRASG12V-mediated leukemia self-renewal signature is preferentially expressed in the leukemia stem cell–enriched subpopulation. In a multiplexed analysis of RAS-dependent signaling, Mac-1Low cells, which harbor leukemia stem cells, were preferentially sensitive to NRASG12V withdrawal. NRASG12V maintained leukemia self-renewal through mTOR and MEK pathway activation, implicating these pathways as potential targets for cancer stem cell–specific therapies. Together, these experimental results define a RAS oncogene–driven function that is critical for leukemia maintenance and represents a novel mechanism of oncogene addiction. PMID:25316678
Exploration of G-quadruplex function in c-Myb gene and its transcriptional regulation by topotecan.

PubMed

Li, Fangyuan; Zhou, Jiang; Xu, Ming; Yuan, Gu

2018-02-01

Our bioinformatics analysis shows that there are four G-rich sequences (S1-S4) upstream of the transcription start site of the c-Myb gene, and we have shown that these sequences can form G-quadruplex structures. This work focuses on G-quadruplex function, recognition and transcriptional regulation in the c-Myb gene, revealing a novel regulatory element in the c-Myb proximal promoter region and its transcriptional regulation by a G-quadruplex binder. We found that the enhancer effect on c-Myb transcription was primarily attributable to the G-quadruplex formed by the S1 sequence, and that the up-regulation may be due to the removal of the repressive effect of MZF-1 through stabilization of the G-quadruplex. Attention was paid to the development of G-quadruplex binders for selective recognition; topotecan was found to have high binding affinity in vitro and could effectively affect c-Myb transcriptional activity in cells. The regulation of the G-quadruplex by binders at the transcriptional and translational levels, assessed by Q-RT-PCR and western blot, is expected to provide a strategy for gene expression modulation. In conclusion, our study revealed a G-quadruplex structure in the c-Myb proximal promoter region that is of great importance in the regulation of c-Myb function. Copyright © 2017 Elsevier B.V. All rights reserved.
TRANSPARENT TESTA GLABRA 1 ubiquitously regulates plant growth and development from Arabidopsis to foxtail millet (Setaria italica).

PubMed

Liu, Kaige; Qi, Shuanghui; Li, Dong; Jin, Changyu; Gao, Chenhao; Duan, Shaowei; Feng, Baili; Chen, Mingxun

2017-01-01

TRANSPARENT TESTA GLABRA 1 of Arabidopsis thaliana (AtTTG1) is a WD40 repeat transcription factor that plays multiple roles in plant growth and development, particularly in seed metabolite production. In the present study, to determine whether SiTTG1 of the phylogenetically distant monocot foxtail millet (Setaria italica) has similar functions, we used transgenic Arabidopsis and Nicotiana systems to explore its activities. We found that SiTTG1 functions as a transcription factor. Overexpression of the SiTTG1 gene rescued many of the mutant phenotypes in Arabidopsis ttg1-13 plants. Additionally, SiTTG1 overexpression fully corrected the reduced expression of mucilage biosynthetic genes, and the induced expression of genes involved in accumulation of seed fatty acids and storage proteins in developing seeds of ttg1-13 plants. Ectopic expression of SiTTG1 restored the sensitivity of the ttg1-13 mutant to salinity and high glucose stresses during germination and seedling establishment, and restored altered expression levels of some stress-responsive genes in ttg1-13 seedlings to the wild type level under salinity and glucose stresses. Our results provide information that will be valuable for understanding the function of TTG1 from monocot to dicot species and identifying a promising target for genetic manipulation of foxtail millet to improve the amount of seed metabolites. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Voltage-gated Na+ channel SCN5A is a key regulator of a gene transcriptional network that controls colon cancer invasion

PubMed Central

House, Carrie D.; Vaske, Charles J.; Schwartz, Arnold M.; Obias, Vincent; Frank, Bryan; Luu, Truong; Sarvazyan, Narine; Irby, Rosalyn; Strausberg, Robert L.; Hales, Tim G.; Stuart, Joshua M.; Lee, Norman H.

2010-01-01

Voltage-gated Na+ channels (VGSCs) have been implicated in the metastatic potential of human breast, prostate and lung cancer cells. Specifically, the SCN5A gene encoding the VGSC isotype Nav1.5 has been defined as a key driver of human cancer cell invasion. In this study, we examined the expression and function of VGSCs in a panel of colon cancer cell lines by electrophysiological recordings. Na+ channel activity and invasive potential were inhibited pharmacologically by tetrodotoxin or genetically by siRNAs specifically targeting SCN5A. Clinical relevance was established by immunohistochemistry of patient biopsies, where there was strong Nav1.5 protein staining in colon cancer specimens but little to no staining in matched-paired normal colon tissues. We explored the mechanism of VGSC-mediated invasive potential on the basis of reported links between VGSC activity and gene expression in excitable cells. Probabilistic modeling of loss-of-function screens and microarray data established an unequivocal role of VGSC SCN5A as a high level regulator of a colon cancer invasion network, involving genes that encompass Wnt signaling, cell migration, ectoderm development, response to biotic stimulus, steroid metabolic process and cell cycle control. siRNA-mediated knockdown of predicted downstream network components caused a loss of invasive behavior, demonstrating network connectivity and its function in driving colon cancer invasion. PMID:20651255
A functional phylogenomic view of the seed plants.

PubMed

Lee, Ernest K; Cibrian-Jaramillo, Angelica; Kolokotronis, Sergios-Orestis; Katari, Manpreet S; Stamatakis, Alexandros; Ott, Michael; Chiu, Joanna C; Little, Damon P; Stevenson, Dennis Wm; McCombie, W Richard; Martienssen, Robert A; Coruzzi, Gloria; Desalle, Rob

2011-12-01

A novel result of the current research is the development and implementation of a unique functional phylogenomic approach that explores the genomic origins of seed plant diversification. We first use 22,833 sets of orthologs from the nuclear genomes of 101 genera across land plants to reconstruct their phylogenetic relationships. One of the more salient results is the resolution of some enigmatic relationships in seed plant phylogeny, such as the placement of Gnetales as sister to the rest of the gymnosperms. In using this novel phylogenomic approach, we were also able to identify overrepresented functional gene ontology categories in genes that provide positive branch support for major nodes prompting new hypotheses for genes associated with the diversification of angiosperms. For example, RNA interference (RNAi) has played a significant role in the divergence of monocots from other angiosperms, which has experimental support in Arabidopsis and rice. This analysis also implied that the second largest subunit of RNA polymerase IV and V (NRPD2) played a prominent role in the divergence of gymnosperms. This hypothesis is supported by the lack of 24nt siRNA in conifers, the maternal control of small RNA in the seeds of flowering plants, and the emergence of double fertilization in angiosperms. Our approach takes advantage of genomic data to define orthologs, reconstruct relationships, and narrow down candidate genes involved in plant evolution within a phylogenomic view of species' diversification.
A Functional Phylogenomic View of the Seed Plants

PubMed Central

Katari, Manpreet S.; Stamatakis, Alexandros; Ott, Michael; Chiu, Joanna C.; Little, Damon P.; Stevenson, Dennis Wm.; McCombie, W. Richard; Martienssen, Robert A.; Coruzzi, Gloria; DeSalle, Rob

2011-01-01

A novel result of the current research is the development and implementation of a unique functional phylogenomic approach that explores the genomic origins of seed plant diversification. We first use 22,833 sets of orthologs from the nuclear genomes of 101 genera across land plants to reconstruct their phylogenetic relationships. One of the more salient results is the resolution of some enigmatic relationships in seed plant phylogeny, such as the placement of Gnetales as sister to the rest of the gymnosperms. In using this novel phylogenomic approach, we were also able to identify overrepresented functional gene ontology categories in genes that provide positive branch support for major nodes prompting new hypotheses for genes associated with the diversification of angiosperms. For example, RNA interference (RNAi) has played a significant role in the divergence of monocots from other angiosperms, which has experimental support in Arabidopsis and rice. This analysis also implied that the second largest subunit of RNA polymerase IV and V (NRPD2) played a prominent role in the divergence of gymnosperms. This hypothesis is supported by the lack of 24nt siRNA in conifers, the maternal control of small RNA in the seeds of flowering plants, and the emergence of double fertilization in angiosperms. Our approach takes advantage of genomic data to define orthologs, reconstruct relationships, and narrow down candidate genes involved in plant evolution within a phylogenomic view of species' diversification. PMID:22194700
Genome-wide identification and characterization of aquaporin gene family in Beta vulgaris

PubMed Central

Kong, Weilong; Yang, Shaozong; Wang, Yulu; Bendahmane, Mohammed

2017-01-01

Aquaporins (AQPs) are essential channel proteins that execute multiple functions throughout plant growth and development, including water transport, uptake of uncharged solutes, and stress response. Here, we report the first genome-wide identification and characterization of AQP (BvAQP) genes in sugar beet (Beta vulgaris), an important crop widely cultivated for feed, for sugar production and for bioethanol production. Twenty-eight sugar beet AQPs (BvAQPs) were identified and assigned into five subfamilies based on phylogenetic analyses: seven of plasma membrane (PIPs), eight of tonoplast (TIPs), nine of NOD26-like (NIPs), three of small basic (SIPs), and one of x-intrinsic proteins (XIPs). BvAQP genes mapped unevenly on all chromosomes, except on chromosome 4. Gene structure and motif analyses revealed that BvAQPs have conserved exon-intron organization and that they exhibit conserved motifs within each subfamily. Prediction of BvAQP functions, based on conservation of key protein domains, showed a remarkable difference in substrate specificity among the five subfamilies. Analyses of BvAQP expression, by means of RNA-seq, in different plant organs and in response to various abiotic stresses revealed that they were ubiquitously expressed and that their expression was induced by heat and salt stresses. These results provide a reference base to address further the function of sugar beet aquaporins and to explore future applications for plant growth and development improvements as well as in response to environmental stresses. PMID:28948097
Val158Met polymorphism in the COMT gene is associated with hypersomnia and mental health-related quality of life in a Colombian sample.

PubMed

Jiménez, Karen M; Pereira-Morales, Angela J; Forero, Diego A

2017-03-22

The identification of genes that are risk factors for major depressive disorder remains a main task for global psychiatric research. The Catechol-O-methyltransferase (COMT) gene has been an important candidate risk factor for several psychiatric disorders. Previous studies have shown that a functional polymorphism (Val158Met) in this gene has an effect on several brain circuits and endophenotypes of psychiatric relevance. The aim of this study was to explore the association of a functional polymorphism in the COMT gene with psychological distress, sleep problems and health-related quality of life. Two hundred seventy young Colombian subjects (mean age: 21.3 years; range: 18-57 years) completed the Patient Health Questionnaire-9, the Perceived Stress Scale, the Oviedo Sleep Questionnaire and the 12-Item Short-Form Health Survey and were genotyped for the Val158Met polymorphism (rs4680) in the COMT gene. A linear regression analysis, adjusting for potential confounding factors, was carried out. Subjects that were Met carriers (Val/Met and Met/Met genotypes) showed higher scores for hypersomnia (p=0.001) and lower scores for mental health-related quality of life (p=0.007); these associations remained significant after correcting for multiple testing. These findings support the hypothesis of a broad effect of the Val158Met polymorphism in the COMT gene on several dimensions of behavior and neuropsychiatric symptoms. Copyright © 2017 Elsevier B.V. All rights reserved.
Gene doping: possibilities and practicalities.

PubMed

Wells, Dominic J

2009-01-01

Our ever-increasing understanding of the genetic control of cardiovascular and musculoskeletal function together with recent technical improvements in genetic manipulation generates mounting concern over the possibility of such technology being abused by athletes in their quest for improved performance. Genetic manipulation in the context of athletic performance is commonly referred to as gene doping. A review of the literature was performed to identify the genes and methodologies most likely to be used for gene doping and the technologies that might be used to identify such doping. A large number of candidate performance-enhancing genes have been identified from animal studies, many of them using transgenic mice. Only a limited number have been shown to be effective following gene transfer into adults. Those that seem most likely to be abused are genes that exert their effects locally and leave little, if any, trace in blood or urine. There is currently no evidence that gene doping has yet been undertaken in competitive athletes but the anti-doping authorities will need to remain vigilant in reviewing this rapidly emerging technology. The detection of gene doping involves some different challenges from other agents and a number of promising approaches are currently being explored. 2009 S. Karger AG, Basel
Differential Connectivity in Colorectal Cancer Gene Expression Network

PubMed

Izadi, Fereshteh

2018-05-30

Colorectal cancer (CRC) is one of the challenging types of cancers; thus, exploring effective biomarkers related to colorectal cancer could lead to significant progress toward the treatment of this disease. In the present study, CRC gene expression datasets have been reanalyzed. Mutual differentially expressed genes across 294 normal mucosa and adjacent tumoral samples were then utilized in order to build two independent transcriptional regulatory networks. By analyzing the networks topologically, genes with differential global connectivity related to cancer state were determined for which the potential transcriptional regulators including transcription factors were identified. The majority of differentially connected genes (DCGs) were up-regulated in colorectal transcriptome experiments. Moreover, a number of these genes have been experimentally validated as cancer or CRC-associated genes. The DCGs, including GART, TGFB1, ITGA2, SLC16A5, SOX9, and MMP7, were investigated across 12 cancer types. Functional enrichment analysis followed by detailed data mining exhibited that these candidate genes could be related to CRC by mediating in the metastatic cascade in addition to shared pathways with 12 cancer types by triggering inflammatory events. Our study uncovered correlated alterations in gene expression related to CRC susceptibility and progression that the potent candidate biomarkers could provide a link to disease.
The clock gene Period1 regulates innate routine behaviour in mice

PubMed Central

Bechstein, Philipp; Rehbach, Nils-Jörn; Yuhasingham, Gowzekan; Schürmann, Christoph; Göpfert, Melanie; Kössl, Manfred; Maronde, Erik

2014-01-01

Laboratory mice are well capable of performing innate routine behaviour programmes necessary for courtship, nest-building and exploratory activities although housed for decades in animal facilities. We found that in mice inactivation of the clock gene Period1 profoundly changes innate routine behaviour programmes like those necessary for courtship, nest building, exploration and learning. These results in wild-type and Period1 mutant mice, together with earlier findings on courtship behaviour in wild-type and period-mutant Drosophila melanogaster, suggest a conserved role of Period-genes on innate routine behaviour. Additionally, both per-mutant flies and Period1-mutant mice display spatial learning and memory deficits. The profound influence of Period1 on routine behaviour programmes in mice, including female partner choice, may be independent of its function as a circadian clock gene, since Period1-deficient mice display normal circadian behaviour. PMID:24598427
The clock gene Period1 regulates innate routine behaviour in mice.

PubMed

Bechstein, Philipp; Rehbach, Nils-Jörn; Yuhasingham, Gowzekan; Schürmann, Christoph; Göpfert, Melanie; Kössl, Manfred; Maronde, Erik

2014-04-22

Laboratory mice are well capable of performing innate routine behaviour programmes necessary for courtship, nest-building and exploratory activities although housed for decades in animal facilities. We found that in mice inactivation of the clock gene Period1 profoundly changes innate routine behaviour programmes like those necessary for courtship, nest building, exploration and learning. These results in wild-type and Period1 mutant mice, together with earlier findings on courtship behaviour in wild-type and period-mutant Drosophila melanogaster, suggest a conserved role of Period-genes on innate routine behaviour. Additionally, both per-mutant flies and Period1-mutant mice display spatial learning and memory deficits. The profound influence of Period1 on routine behaviour programmes in mice, including female partner choice, may be independent of its function as a circadian clock gene, since Period1-deficient mice display normal circadian behaviour.

Sex determination in insects: a binary decision based on alternative splicing.

PubMed

Salz, Helen K

2011-08-01

The gene regulatory networks that control sex determination vary between species. Despite these differences, comparative studies in insects have found that alternative splicing is reiteratively used in evolution to control expression of the key sex-determining genes. Sex determination is best understood in Drosophila where activation of the RNA binding protein-encoding gene Sex-lethal is the central female-determining event. Sex-lethal serves as a genetic switch because once activated it controls its own expression by a positive feedback splicing mechanism. Sex fate choice is also maintained by self-sustaining positive feedback splicing mechanisms in other dipteran and hymenopteran insects, although different RNA binding protein-encoding genes function as the binary switch. Studies exploring the mechanisms of sex-specific splicing have revealed the extent to which sex determination is integrated with other developmental regulatory networks. Copyright © 2011 Elsevier Ltd. All rights reserved.
Apatinib inhibits cellular invasion and migration by fusion kinase KIF5B-RET via suppressing RET/Src signaling pathway

PubMed Central

Xie, Weiwei; Zheng, Rongliang; Gan, Yu; Chang, Jianhua

2016-01-01

The Rearranged during transfection (RET) fusion gene is a newly identified oncogenic mutation in non-small cell lung cancer (NSCLC). The aim of this study is to explore the biological functions of the gene in tumorigenesis and metastasis in RET gene fusion-driven preclinical models. We also investigate the anti-tumor activity of Apatinib, a potent inhibitor of VEGFR-2, PDGFR-β, c-Src and RET, in RET-rearranged lung adenocarcinoma, together with the mechanisms underlying. Our results suggested that KIF5B-RET fusion gene promoted cell invasion and migration, which were probably mediated through Src signaling pathway. Apatinib exerted its anti-cancer effect not only via cytotoxicity, but also via inhibition of migration and invasion by suppressing RET/Src signaling pathway, supporting a potential role for Apatinib in the treatment of KIF5B-RET driven tumors. PMID:27494860
Apatinib inhibits cellular invasion and migration by fusion kinase KIF5B-RET via suppressing RET/Src signaling pathway.

PubMed

Lin, Chen; Wang, Shanshan; Xie, Weiwei; Zheng, Rongliang; Gan, Yu; Chang, Jianhua

2016-09-13

The Rearranged during transfection (RET) fusion gene is a newly identified oncogenic mutation in non-small cell lung cancer (NSCLC). The aim of this study is to explore the biological functions of the gene in tumorigenesis and metastasis in RET gene fusion-driven preclinical models. We also investigate the anti-tumor activity of Apatinib, a potent inhibitor of VEGFR-2, PDGFR-β, c-Src and RET, in RET-rearranged lung adenocarcinoma, together with the mechanisms underlying. Our results suggested that KIF5B-RET fusion gene promoted cell invasion and migration, which were probably mediated through Src signaling pathway. Apatinib exerted its anti-cancer effect not only via cytotoxicity, but also via inhibition of migration and invasion by suppressing RET/Src signaling pathway, supporting a potential role for Apatinib in the treatment of KIF5B-RET driven tumors.
Predicting taxonomic and functional structure of microbial communities in acid mine drainage

PubMed Central

Kuang, Jialiang; Huang, Linan; He, Zhili; Chen, Linxing; Hua, Zhengshuang; Jia, Pu; Li, Shengjin; Liu, Jun; Li, Jintian; Zhou, Jizhong; Shu, Wensheng

2016-01-01

Predicting the dynamics of community composition and functional attributes responding to environmental changes is an essential goal in community ecology but remains a major challenge, particularly in microbial ecology. Here, by targeting a model system with low species richness, we explore the spatial distribution of taxonomic and functional structure of 40 acid mine drainage (AMD) microbial communities across Southeast China profiled by 16S ribosomal RNA pyrosequencing and a comprehensive microarray (GeoChip). Similar environmentally dependent patterns of dominant microbial lineages and key functional genes were observed regardless of the large-scale geographical isolation. Functional and phylogenetic β-diversities were significantly correlated, whereas functional metabolic potentials were strongly influenced by environmental conditions and community taxonomic structure. Using advanced modeling approaches based on artificial neural networks, we successfully predicted the taxonomic and functional dynamics with significantly higher prediction accuracies of metabolic potentials (average Bray–Curtis similarity 87.8) as compared with relative microbial abundances (similarity 66.8), implying that natural AMD microbial assemblages may be better predicted at the functional genes level rather than at taxonomic level. Furthermore, relative metabolic potentials of genes involved in many key ecological functions (for example, nitrogen and phosphate utilization, metals resistance and stress response) were extrapolated to increase under more acidic and metal-rich conditions, indicating a critical strategy of stress adaptation in these extraordinary communities. Collectively, our findings indicate that natural selection rather than geographic distance has a more crucial role in shaping the taxonomic and functional patterns of AMD microbial community that is readily predicted by modeling methods and suggest that the model-based approach is essential to better understand natural acidophilic microbial communities. PMID:26943622
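The Bray–Curtis similarity quoted in the abstract above compares a predicted abundance profile with an observed one. The following is only a minimal sketch of that metric, not the authors' evaluation code; it assumes the reported values (87.8 and 66.8) are on a 0-100 scale, and the function name and the example profiles are hypothetical.

```python
def bray_curtis_similarity(observed, predicted):
    """Bray-Curtis similarity, as a percentage, between two non-negative profiles.

    Computed as 100 * 2 * sum(min(a, b)) / (sum(observed) + sum(predicted)).
    The 0-100 scaling is an assumption made for this sketch.
    """
    if len(observed) != len(predicted):
        raise ValueError("profiles must have the same length")
    total = sum(observed) + sum(predicted)
    if total == 0:
        raise ValueError("at least one profile must be non-zero")
    # Abundance shared by both profiles, feature by feature.
    shared = sum(min(a, b) for a, b in zip(observed, predicted))
    return 100.0 * 2.0 * shared / total


# Hypothetical abundances for three taxa or functional genes.
observed = [10, 20, 30]
predicted = [12, 18, 30]
similarity = bray_curtis_similarity(observed, predicted)
```

Identical profiles score 100 and profiles with no shared features score 0, so a higher average indicates predictions closer to the measured community.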
Predicting taxonomic and functional structure of microbial communities in acid mine drainage.

PubMed

Kuang, Jialiang; Huang, Linan; He, Zhili; Chen, Linxing; Hua, Zhengshuang; Jia, Pu; Li, Shengjin; Liu, Jun; Li, Jintian; Zhou, Jizhong; Shu, Wensheng

2016-06-01

Predicting the dynamics of community composition and functional attributes responding to environmental changes is an essential goal in community ecology but remains a major challenge, particularly in microbial ecology. Here, by targeting a model system with low species richness, we explore the spatial distribution of taxonomic and functional structure of 40 acid mine drainage (AMD) microbial communities across Southeast China profiled by 16S ribosomal RNA pyrosequencing and a comprehensive microarray (GeoChip). Similar environmentally dependent patterns of dominant microbial lineages and key functional genes were observed regardless of the large-scale geographical isolation. Functional and phylogenetic β-diversities were significantly correlated, whereas functional metabolic potentials were strongly influenced by environmental conditions and community taxonomic structure. Using advanced modeling approaches based on artificial neural networks, we successfully predicted the taxonomic and functional dynamics with significantly higher prediction accuracies of metabolic potentials (average Bray-Curtis similarity 87.8) as compared with relative microbial abundances (similarity 66.8), implying that natural AMD microbial assemblages may be better predicted at the functional genes level rather than at taxonomic level. Furthermore, relative metabolic potentials of genes involved in many key ecological functions (for example, nitrogen and phosphate utilization, metals resistance and stress response) were extrapolated to increase under more acidic and metal-rich conditions, indicating a critical strategy of stress adaptation in these extraordinary communities. Collectively, our findings indicate that natural selection rather than geographic distance has a more crucial role in shaping the taxonomic and functional patterns of AMD microbial community that is readily predicted by modeling methods and suggest that the model-based approach is essential to better understand natural acidophilic microbial communities.
Expression, function and regulation of mouse cytochrome P450 enzymes: comparison with human P450 enzymes.

PubMed

Hrycay, E G; Bandiera, S M

2009-12-01

The present review focuses on the expression, function and regulation of mouse cytochrome P450 (Cyp) enzymes. Information compiled for mouse Cyp enzymes is compared with data collected for human CYP enzymes. To date, approximately 40 pairs of orthologous mouse-human CYP genes have been identified that encode enzymes performing similar metabolic functions. Recent knowledge concerning the tissue expression of mouse Cyp enzymes from families 1 to 51 is summarized. The catalytic activities of microsomal, mitochondrial and recombinant mouse Cyp enzymes are discussed and their involvement in the metabolism of exogenous and endogenous compounds is highlighted. The role of nuclear receptors, such as the aryl hydrocarbon receptor, constitutive androstane receptor and pregnane X receptor, in regulating the expression of mouse Cyp enzymes is examined. Targeted disruption of selected Cyp genes has generated numerous Cyp null mouse lines used to decipher the role of Cyp enzymes in metabolic, toxicological and biological processes. In conclusion, the laboratory mouse is an indispensable model for exploring human CYP-mediated activities.
Exploring the Transcriptome of Ciliated Cells Using In Silico Dissection of Human Tissues

PubMed Central

Ivliev, Alexander E.; 't Hoen, Peter A. C.; van Roon-Mom, Willeke M. C.; Peters, Dorien J. M.; Sergeeva, Marina G.

2012-01-01

Cilia are cell organelles that play important roles in cell motility, sensory and developmental functions and are involved in a range of human diseases, known as ciliopathies. Here, we search for novel human genes related to cilia using a strategy that exploits the previously reported tendency of cell type-specific genes to be coexpressed in the transcriptome of complex tissues. Gene coexpression networks were constructed using the noise-resistant WGCNA algorithm in 12 publicly available microarray datasets from human tissues rich in motile cilia: airways, fallopian tubes and brain. A cilia-related coexpression module was detected in 10 out of the 12 datasets. A consensus analysis of this module's gene composition recapitulated 297 known and predicted 74 novel cilia-related genes. 82% of the novel candidates were supported by tissue-specificity expression data from GEO and/or proteomic data from the Human Protein Atlas. The novel findings included a set of genes (DCDC2, DYX1C1, KIAA0319) related to a neurological disease dyslexia suggesting their potential involvement in ciliary functions. Furthermore, we searched for differences in gene composition of the ciliary module between the tissues. A multidrug-and-toxin extrusion transporter MATE2 (SLC47A2) was found as a brain-specific central gene in the ciliary module. We confirm the localization of MATE2 in cilia by immunofluorescence staining using MDCK cells as a model. While MATE2 has previously gained attention as a pharmacologically relevant transporter, its potential relation to cilia is suggested for the first time. Taken together, our large-scale analysis of gene coexpression networks identifies novel genes related to human cell cilia. PMID:22558177
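The coexpression networks described in the abstract above rest on correlating gene expression profiles across samples. As a rough illustration only, and not the WGCNA package or the authors' pipeline, the sketch below builds an unsigned soft-thresholded adjacency (absolute Pearson correlation raised to a power beta) and sums it into a per-gene connectivity. The gene labels, expression values and the choice beta=6 are hypothetical.

```python
import math


def pearson(u, v):
    """Pearson correlation between two equal-length expression profiles."""
    n = len(u)
    mean_u, mean_v = sum(u) / n, sum(v) / n
    cov = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    sd_u = math.sqrt(sum((a - mean_u) ** 2 for a in u))
    sd_v = math.sqrt(sum((b - mean_v) ** 2 for b in v))
    return cov / (sd_u * sd_v)


def soft_threshold_adjacency(expr, beta=6):
    """Unsigned adjacency |cor|**beta for every ordered pair of distinct genes."""
    adjacency = {}
    for g in expr:
        for h in expr:
            if g != h:
                adjacency[(g, h)] = abs(pearson(expr[g], expr[h])) ** beta
    return adjacency


def connectivity(adjacency, gene):
    """Sum of a gene's adjacencies to all other genes."""
    return sum(w for (g, _), w in adjacency.items() if g == gene)


# Hypothetical expression values for three genes across four samples.
expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.0],
    "geneC": [4.0, 1.0, 3.0, 2.0],
}
adjacency = soft_threshold_adjacency(expr)
```

Raising correlations to a power suppresses weak, noisy associations while keeping strong ones, which is why tightly coexpressed genes such as geneA and geneB above end up with much higher connectivity than geneC.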
Genome-Wide Identification and Comprehensive Expression Profiling of Ribosomal Protein Small Subunit (RPS) Genes and their Comparative Analysis with the Large Subunit (RPL) Genes in Rice

PubMed Central

Saha, Anusree; Das, Shubhajit; Moin, Mazahar; Dutta, Mouboni; Bakshi, Achala; Madhav, M. S.; Kirti, P. B.

2017-01-01

Ribosomal proteins (RPs) are indispensable in ribosome biogenesis and protein synthesis, and play a crucial role in diverse developmental processes. Our previous studies on Ribosomal Protein Large subunit (RPL) genes provided insights into their stress responsive roles in rice. In the present study, we have explored the developmental and stress regulated expression patterns of Ribosomal Protein Small (RPS) subunit genes for their differential expression in a spatiotemporal and stress dependent manner. We have also performed an in silico analysis of gene structure, cis-elements in upstream regulatory regions, protein properties and phylogeny. Expression studies of the 34 RPS genes in 13 different tissues of rice covering major growth and developmental stages revealed that their expression was substantially elevated, mostly in shoots and leaves indicating their possible involvement in the development of vegetative organs. The majority of the RPS genes have manifested significant expression under all abiotic stress treatments with ABA, PEG, NaCl, and H2O2. Infection with important rice pathogens, Xanthomonas oryzae pv. oryzae (Xoo) and Rhizoctonia solani also induced the up-regulation of several of the RPS genes. RPS4, 13a, 18a, and 4a have shown higher transcript levels under all the abiotic stresses, whereas, RPS4 is up-regulated in both the biotic stress treatments. The information obtained from the present investigation would be useful in appreciating the possible stress-regulatory attributes of the genes coding for rice ribosomal small subunit proteins apart from their functions as house-keeping proteins. A detailed functional analysis of independent genes is required to study their roles in stress tolerance and generating stress-tolerant crops. PMID:28966624
Adaptive evolution of the myo6 gene in old world fruit bats (family: pteropodidae).

PubMed

Shen, Bin; Han, Xiuqun; Jones, Gareth; Rossiter, Stephen J; Zhang, Shuyi

2013-01-01

Myosin VI (encoded by the Myo6 gene) is highly expressed in the inner and outer hair cells of the ear, retina, and polarized epithelial cells such as kidney proximal tubule cells and intestinal enterocytes. The Myo6 gene is thought to be involved in a wide range of physiological functions such as hearing, vision, and clathrin-mediated endocytosis. Bats (Chiroptera) represent one of the most fascinating mammal groups for molecular evolutionary studies of the Myo6 gene. A diversity of specialized adaptations occur among different bat lineages, such as echolocation and associated high-frequency hearing in laryngeal echolocating bats, large eyes and a strong dependence on vision in Old World fruit bats (Pteropodidae), and specialized high-carbohydrate but low-nitrogen diets in both Old World and New World fruit bats (Phyllostomidae). To investigate what role(s) the Myo6 gene might fulfill in bats, we sequenced the coding region of the Myo6 gene in 15 bat species and used molecular evolutionary analyses to detect evidence of positive selection in different bat lineages. We also conducted real-time PCR assays to explore the expression levels of Myo6 in a range of tissues from three representative bat species. Molecular evolutionary analyses revealed that the Myo6 gene, which was widely considered as a hearing gene, has undergone adaptive evolution in the Old World fruit bats which lack laryngeal echolocation and associated high-frequency hearing. Real-time PCR showed the highest expression level of the Myo6 gene in the kidney among ten tissues examined in three bat species, indicating an important role for this gene in kidney function. We suggest that Myo6 has undergone adaptive evolution in Old World fruit bats in relation to receptor-mediated endocytosis for the preservation of protein and essential nutrients.
An RNAi Screen for Genes Involved in Nanoscale Protrusion Formation on Corneal Lens in Drosophila melanogaster.

PubMed

Minami, Ryunosuke; Sato, Chiaki; Yamahama, Yumi; Kubo, Hideo; Hariyama, Takahiko; Kimura, Ken-Ichi

2016-12-01

The "moth-eye" structure, which is observed on the surface of corneal lens in several insects, supports anti-reflective and self-cleaning functions due to nanoscale protrusions known as corneal nipples. Although the morphology and function of the "moth-eye" structure are relatively well studied, the mechanism of protrusion formation from cell-secreted substances is unknown. In Drosophila melanogaster, a compound eye consists of approximately 800 facets, the surface of which is formed by the corneal lens with nanoscale protrusions. In the present study, we sought to identify genes involved in "moth-eye" structure formation in order to elucidate the developmental mechanism of the protrusions in Drosophila. We re-examined the aberrant patterns in classical glossy-eye mutants by scanning electron microscope and classified the aberrant patterns into groups. Next, we screened genes encoding putative structural cuticular proteins and genes involved in cuticular formation using eye specific RNAi silencing methods combined with the Gal4/UAS expression system. We identified 12 of 100 candidate genes, such as cuticular proteins family genes (Cuticular protein 23B and Cuticular protein 49Ah), cuticle secretion-related genes (Syntaxin 1A and Sec61 β subunit), ecdysone signaling and biosynthesis-related genes (Ecdysone receptor, Blimp-1, and shroud), and genes involved in cell polarity/cell architecture (Actin 5C, shotgun, armadillo, discs large1, and coracle). Although some of the genes we identified may affect corneal protrusion formation indirectly through general patterning defects in eye formation, these initial findings have encouraged us to more systematically explore the precise mechanisms underlying the formation of nanoscale protrusions in Drosophila.
Understanding how long‐acting β2‐adrenoceptor agonists enhance the clinical efficacy of inhaled corticosteroids in asthma – an update

PubMed Central

Giembycz, Mark A

2016-01-01

In moderate‐to‐severe asthma, adding an inhaled long‐acting β2‐adrenoceptor agonist (LABA) to an inhaled corticosteroid (ICS) provides better disease control than simply increasing the dose of ICS. Acting on the glucocorticoid receptor (GR, gene NR3C1), ICSs promote anti‐inflammatory/anti‐asthma gene expression. In vitro, LABAs synergistically enhance the maximal expression of many glucocorticoid‐induced genes. Other genes, including dual‐specificity phosphatase 1 (DUSP1) in human airways smooth muscle (ASM) and epithelial cells, are up‐regulated additively by both drug classes. Synergy may also occur for LABA‐induced genes, as illustrated by the bronchoprotective gene, regulator of G‐protein signalling 2 (RGS2) in ASM. Such effects cannot be produced by either drug alone and may explain the therapeutic efficacy of ICS/LABA combination therapies. While the molecular basis of synergy remains unclear, mechanistic interpretations must accommodate gene‐specific regulation. We explore the concept that each glucocorticoid‐induced gene is an independent signal transducer optimally activated by a specific, ligand‐directed, GR conformation. In addition to explaining partial agonism, this realization provides opportunities to identify novel GR ligands that exhibit gene expression bias. Translating this into improved therapeutic ratios requires consideration of GR density in target tissues and further understanding of gene function. Similarly, the ability of a LABA to interact with a glucocorticoid may be suboptimal due to low β2‐adrenoceptor density or biased β2‐adrenoceptor signalling. Strategies to overcome these limitations include adding‐on a phosphodiesterase inhibitor and using agonists of other Gs‐coupled receptors. In all cases, the rational design of ICS/LABA, and derivative, combination therapies requires functional knowledge of induced (and repressed) genes for therapeutic benefit to be maximized. PMID:27646470
Computational analysis of TRAPPC9: candidate gene for autosomal recessive non-syndromic mental retardation.

PubMed

Khattak, Naureen Aslam; Mir, Asif

2014-01-01

Mental retardation (MR)/intellectual disability (ID) is a neuro-developmental disorder characterized by a low intellectual quotient (IQ) and deficits in adaptive behavior related to everyday life tasks such as delayed language acquisition, social skills or self-help skills with onset before age 18. To date, a few genes (PRSS12, CRBN, CC2D1A, GRIK2, TUSC3, TRAPPC9, TECR, ST3GAL3, MED23, MAN1B1, NSUN1) for autosomal-recessive forms of non syndromic MR (NS-ARMR) have been identified and established in various families with ID. The recently reported candidate gene TRAPPC9 was selected for computational analysis to explore its potentially important role in pathology as it is the only gene for ID reported in more than five different familial cases worldwide. YASARA (12.4.1) was utilized to generate three dimensional structures of the candidate gene TRAPPC9. Hybrid structure prediction was employed. Crystal Structure of a Conserved Metalloprotein From Bacillus Cereus (3D19-C) was selected as best suitable template using position-specific iteration-BLAST. Template (3D19-C) parameters were based on E-value, Z-score and resolution and quality score of 0.32, -1.152, 2.30 Å and 0.684 respectively. Model reliability showed 93.1% residues placed in the most favored region with 96.684 quality factor, and overall 0.20 G-factor (dihedrals 0.06 and covalent 0.39 respectively). Protein-Protein docking analysis demonstrated that TRAPPC9 showed strong interactions of the amino acid residues S(253), S(251), Y(256), G(243), D(131) with R(105), Q(425), W(226), N(255), S(233), its functional partner 1KBKB. Protein-protein interacting residues could facilitate the exploration of structural and functional outcomes of wild type and mutated TRAPPC9 protein. Actively involved residues can be used to elucidate the binding properties of the protein, and to develop drug therapy for NS-ARMR patients.
Exploring long-wave infrared transmitting materials with AxBy form: First-principles gene-like studies.

PubMed

Du, Jia-Ren; Chen, Nian-Ke; Li, Xian-Bin; Xie, Sheng-Yi; Tian, Wei Quan; Wang, Xian-Yin; Tu, Hai-Ling; Sun, Hong-Bo

2016-02-23

Long-wave infrared (8-12 μm) transmitting materials play critical roles in space science and electronic science. However, the paradox between their mechanical strength and infrared transmitting performance seriously prohibits their applications in harsh external environment. From the experimental view, searching a good window material compatible with both properties is a vast trial-and-error engineering project, which is not readily achieved efficiently. In this work, we propose a very simple and efficient method to explore potential infrared window materials with suitable mechanical property by first-principles gene-like searching. Two hundred and fifty-three potential materials are evaluated to find their bulk modulus (for mechanical performance) and phonon vibrational frequency (for optical performance). Seven new potential candidates are selected, namely TiSe, TiS, MgS, CdF2, HgF2, CdO, and SrO. Especially, the performances of TiS and CdF2 can be comparable to that of the most popular commercial ZnS at high temperature. Finally, we propose possible ranges of infrared transmission for halogen, chalcogen and nitrogen compounds respectively to guide further exploration. The present strategy to explore IR window materials can significantly speed up the new development progress. The same idea can be used for other material rapid searching towards special functions and applications.
Microbial oceanography in a sea of opportunity.

PubMed

Bowler, Chris; Karl, David M; Colwell, Rita R

2009-05-14

Plankton use solar energy to drive the nutrient cycles that make the planet habitable for larger organisms. We can now explore the diversity and functions of plankton using genomics, revealing the gene repertoires associated with survival in the oceans. Such studies will help us to appreciate the sensitivity of ocean systems and of the ocean's response to climate change, improving the predictive power of climate models.
deFUME: Dynamic exploration of functional metagenomic sequencing data.

PubMed

van der Helm, Eric; Geertz-Hansen, Henrik Marcus; Genee, Hans Jasper; Malla, Sailesh; Sommer, Morten Otto Alexander

2015-07-31

Functional metagenomic selections represent a powerful technique that is widely applied for identification of novel genes from complex metagenomic sources. However, whereas hundreds to thousands of clones can be easily generated and sequenced over a few days of experiments, analyzing the data is time-consuming and constitutes a major bottleneck for experimental researchers in the field. Here we present the deFUME web server, an easy-to-use web-based interface for processing, annotation and visualization of functional metagenomics sequencing data, tailored to meet the requirements of non-bioinformaticians. The web server integrates multiple analysis steps into one single workflow: read assembly, open reading frame prediction, and annotation with BLAST, InterPro and GO classifiers. Analysis results are visualized in an online dynamic web interface. The deFUME web server provides a fast track from raw sequence to a comprehensive visual data overview that facilitates effortless inspection of gene function, clustering and distribution. The web server is available at cbs.dtu.dk/services/deFUME/ and the source code is distributed at github.com/EvdH0/deFUME.
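The open reading frame prediction step named in the workflow above can be illustrated with a deliberately naive sketch. It is not deFUME's implementation, and the abstract does not say which ORF predictor the server uses. The function name `find_orfs` and the `min_codons` parameter are hypothetical. The sketch scans only the three forward-strand frames for an ATG followed by an in-frame stop codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq: str, min_codons: int = 2) -> list[tuple[int, int]]:
    """Return (start, end) half-open index pairs of forward-strand ORFs.

    An ORF here starts at the first ATG seen in a frame and ends at the next
    in-frame stop codon (stop included). min_codons counts the stop codon.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # open a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if (i + 3 - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None  # close it and keep scanning this frame
    return orfs


if __name__ == "__main__":
    dna = "CCATGAAATAGGG"
    for begin, end in find_orfs(dna):
        print(begin, end, dna[begin:end])
```

A practical predictor would also scan the reverse complement and handle alternative start codons and partial ORFs at contig ends; those are omitted here to keep the idea visible.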
GADD45β, an anti-tumor gene, inhibits avian leukosis virus subgroup J replication in chickens.

PubMed

Zhang, Xinheng; Yan, Zhuanqiang; Li, Xinjian; Lin, Wencheng; Dai, Zhenkai; Yan, Yiming; Lu, Piaopiao; Chen, Weiguo; Zhang, Huanmin; Chen, Feng; Ma, Jingyun; Xie, Qingmei

2016-10-18

Avian leukosis virus subgroup J (ALV-J) is a retrovirus that induces neoplasia, hepatomegaly, immunosuppression and poor performance in chickens. The tumorigenic and pathogenic mechanisms of ALV-J remain a hot topic. To explore anti-tumor genes that promote resistance to ALV-J infection in chickens, we bred ALV-J resistant and susceptible chickens (F3 generation). RNA-sequencing (RNA-Seq) of liver tissue from the ALV-J resistant and susceptible chickens identified 216 differentially expressed genes; 88 of those genes were up-regulated in the ALV-J resistant chickens (compared to the susceptible ones). We screened for significantly up-regulated genes (P < 0.01) of interest in the ALV-J resistant chickens, based on their involvement in biological signaling pathways. Functional analyses showed that overexpression of GADD45β inhibited ALV-J replication. GADD45β could enhance defense against ALV-J infection and may be used as a molecular marker to identify ALV-J infections.
GADD45β, an anti-tumor gene, inhibits avian leukosis virus subgroup J replication in chickens

PubMed Central

Zhang, Xinheng; Yan, Zhuanqiang; Li, Xinjian; Lin, Wencheng; Dai, Zhenkai; Yan, Yiming; Lu, Piaopiao; Chen, Weiguo; Zhang, Huanmin; Chen, Feng; Ma, Jingyun; Xie, Qingmei

2016-01-01

Avian leukosis virus subgroup J (ALV-J) is a retrovirus that induces neoplasia, hepatomegaly, immunosuppression and poor performance in chickens. The tumorigenic and pathogenic mechanisms of ALV-J remain a hot topic. To explore anti-tumor genes that promote resistance to ALV-J infection in chickens, we bred ALV-J resistant and susceptible chickens (F3 generation). RNA-sequencing (RNA-Seq) of liver tissue from the ALV-J resistant and susceptible chickens identified 216 differentially expressed genes; 88 of those genes were up-regulated in the ALV-J resistant chickens (compared to the susceptible ones). We screened for significantly up-regulated genes (P < 0.01) of interest in the ALV-J resistant chickens, based on their involvement in biological signaling pathways. Functional analyses showed that overexpression of GADD45β inhibited ALV-J replication. GADD45β could enhance defense against ALV-J infection and may be used as a molecular marker to identify ALV-J infections. PMID:27655697
DOE Office of Scientific and Technical Information (OSTI.GOV)

Williams, Kelly Porter

Key goals towards national biosecurity include methods for analyzing pathogens, predicting their emergence, and developing countermeasures. These goals are served by studying bacterial genes that promote pathogenicity and the pathogenicity islands that mobilize them. Cyberinfrastructure promoting an island database advances this field and enables deeper bioinformatic analysis that may identify novel pathogenicity genes. New automated methods and rich visualizations were developed for identifying pathogenicity islands, based on the principle that islands occur sporadically among closely related strains. The chromosomally-ordered pan-genome organizes all genes from a clade of strains; gaps in this visualization indicate islands, and decorations of the gene matrix facilitate exploration of island gene functions. A “learned phyloblocks” method was developed for automated island identification, which trains on the phylogenetic patterns of islands identified by other methods. Learned phyloblocks better defined termini of previously identified islands in multidrug-resistant Klebsiella pneumoniae ATCC BAA-2146, and found its only antibiotic resistance island.
DOSE: an R/Bioconductor package for disease ontology semantic and enrichment analysis.

PubMed

Yu, Guangchuang; Wang, Li-Gen; Yan, Guang-Rong; He, Qing-Yu

2015-02-15

Disease ontology (DO) annotates human genes in the context of disease. DO is an important annotation for translating molecular findings from high-throughput data into clinical relevance. DOSE is an R package providing semantic similarity computations among DO terms and genes, which allows biologists to explore the similarities of diseases and of gene functions from a disease perspective. Enrichment analyses, including a hypergeometric model and gene set enrichment analysis, are also implemented to support the discovery of disease associations in high-throughput biological data. This allows biologists to verify disease relevance in a biological experiment and identify unexpected disease associations. Comparison among gene clusters is also supported. DOSE is released under the Artistic-2.0 License. The source code and documents are freely available through Bioconductor (http://www.bioconductor.org/packages/release/bioc/html/DOSE.html). Supplementary data are available at Bioinformatics online. Contact: gcyu@connect.hku.hk or tqyhe@jnu.edu.cn. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
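The hypergeometric model mentioned in the record above can be sketched as a one-sided over-representation test. This is a generic illustration in Python, not DOSE's R code or its API. The function name and the argument names (`background`, `annotated`, `selected`, `overlap`) are hypothetical, and the numbers in the usage lines are made up for demonstration only.

```python
from math import comb


def hypergeom_enrichment_p(background: int, annotated: int,
                           selected: int, overlap: int) -> float:
    """Return P(X >= overlap) for a hypergeometric X.

    background: total genes considered
    annotated:  genes in the background annotated to the disease term
    selected:   genes in the list under study
    overlap:    selected genes that carry the annotation
    """
    if not (0 <= overlap <= min(annotated, selected) <= background):
        raise ValueError("inconsistent counts")
    total = comb(background, selected)
    # Sum the upper tail; math.comb returns 0 for impossible draws.
    tail = sum(
        comb(annotated, i) * comb(background - annotated, selected - i)
        for i in range(overlap, min(annotated, selected) + 1)
    )
    return tail / total


if __name__ == "__main__":
    # Illustrative counts only: 10 background genes, 5 annotated,
    # 5 selected, and all 5 selected genes annotated.
    print(hypergeom_enrichment_p(10, 5, 5, 5))
```

When many terms are tested at once, the resulting p-values would normally be adjusted for multiple testing; that step is left out of this sketch.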
Gene network analysis shows immune-signaling and ERK1/2 as novel genetic markers for multiple addiction phenotypes: alcohol, smoking and opioid addiction.

PubMed

Reyes-Gibby, Cielito C; Yuan, Christine; Wang, Jian; Yeung, Sai-Ching J; Shete, Sanjay

2015-06-05

Addictions to alcohol and tobacco, known risk factors for cancer, are complex heritable disorders. Addictive behaviors have a bidirectional relationship with pain. We hypothesize that the associations between alcohol, smoking, and opioid addiction observed in cancer patients have a genetic basis. Therefore, using bioinformatics tools, we explored the underlying genetic basis and identified new candidate genes and common biological pathways for smoking, alcohol, and opioid addiction. Literature search showed 56 genes associated with alcohol, smoking and opioid addiction. Using Core Analysis function in Ingenuity Pathway Analysis software, we found that ERK1/2 was strongly interconnected across all three addiction networks. Genes involved in immune signaling pathways were shown across all three networks. Connect function from IPA My Pathway toolbox showed that DRD2 is the gene common to both the list of genetic variations associated with all three addiction phenotypes and the components of the brain neuronal signaling network involved in substance addiction. The top canonical pathways associated with the 56 genes were: 1) calcium signaling, 2) GPCR signaling, 3) cAMP-mediated signaling, 4) GABA receptor signaling, and 5) G-alpha i signaling. Cancer patients are often prescribed opioids for cancer pain thus increasing their risk for opioid abuse and addiction. Our findings provide candidate genes and biological pathways underlying addiction phenotypes, which may be future targets for treatment of addiction. Further study of the variations of the candidate genes could allow physicians to make more informed decisions when treating cancer pain with opioid analgesics.
