identify genes essential: Topics by Science.gov

Sample records for identify genes essential

Combining Genome-Scale Experimental and Computational Methods To Identify Essential Genes in Rhodobacter sphaeroides

DOE PAGES

Burger, Brian T.; Imam, Saheed; Scarborough, Matthew J.; ...

2017-06-06

Rhodobacter sphaeroides is one of the best-studied alphaproteobacteria from biochemical, genetic, and genomic perspectives. To gain a better systems-level understanding of this organism, we generated a large transposon mutant library and used transposon sequencing (Tn-seq) to identify genes that are essential under several growth conditions. Using newly developed Tn-seq analysis software (TSAS), we identified 493 genes as essential for aerobic growth on a rich medium. We then used the mutant library to identify conditionally essential genes under two laboratory growth conditions, identifying 85 additional genes required for aerobic growth in a minimal medium and 31 additional genes required for photosynthetic growth. In all instances, our analyses confirmed essentiality for many known genes and identified genes not previously considered to be essential. We used the resulting Tn-seq data to refine and improve a genome-scale metabolic network model (GEM) for R. sphaeroides. Together, we demonstrate how genetic, genomic, and computational approaches can be combined to obtain a systems-level understanding of the genetic framework underlying metabolic diversity in bacterial species.
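
The Tn-seq idea in the record above can be illustrated with a minimal sketch. It assumes a deliberately simple rule: a gene is a candidate essential gene when it carries fewer unique transposon insertion sites per kilobase than a chosen threshold. This is not the TSAS algorithm, whose details are not given in the abstract; the function names, the threshold, and the toy coordinates are all hypothetical.

```python
from bisect import bisect_left, bisect_right


def insertion_density(gene_start, gene_end, sorted_sites):
    """Unique insertion sites per kilobase inside [gene_start, gene_end]."""
    first = bisect_left(sorted_sites, gene_start)
    last = bisect_right(sorted_sites, gene_end)
    length_kb = (gene_end - gene_start + 1) / 1000.0
    return (last - first) / length_kb


def call_candidate_essential(genes, insertion_sites, max_density=1.0):
    """Return genes whose insertion density is below max_density.

    genes: mapping of gene name -> (start, end), 1-based inclusive.
    insertion_sites: iterable of genome positions with an insertion.
    max_density: hypothetical cutoff in unique sites per kb.
    """
    sites = sorted(set(insertion_sites))  # collapse duplicate reads
    return [
        name
        for name, (start, end) in genes.items()
        if insertion_density(start, end, sites) < max_density
    ]


# Toy data (hypothetical): geneA tolerates no insertions.
genes = {"geneA": (1, 1000), "geneB": (1001, 2000), "geneC": (2001, 3000)}
sites = [1100, 1250, 1250, 1400, 1900, 2100, 2500, 2950]
candidates = call_candidate_essential(genes, sites)
print(candidates)
```

Real analyses typically also account for library saturation, insertion-site bias, and gene length, so a fixed density cutoff should be read only as a starting point.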
Methods for identifying an essential gene in a prokaryotic microorganism

DOEpatents

Shizuya, Hiroaki

2006-01-31

Methods are provided for the rapid identification of essential or conditionally essential DNA segments in any species of haploid cell (one copy chromosome per cell) that is capable of being transformed by artificial means and is capable of undergoing DNA recombination. This system offers an enhanced means of identifying essential function genes in diploid pathogens, such as gram-negative and gram-positive bacteria.
Predicting essential genes for identifying potential drug targets in Aspergillus fumigatus.

PubMed

Lu, Yao; Deng, Jingyuan; Rhodes, Judith C; Lu, Hui; Lu, Long Jason

2014-06-01

Aspergillus fumigatus (Af) is a ubiquitous and opportunistic pathogen capable of causing acute, invasive pulmonary disease in susceptible hosts. Despite current therapeutic options, mortality associated with invasive Af infections remains unacceptably high, increasing 357% since 1980. Therefore, there is an urgent need for the development of novel therapeutic strategies, including more efficacious drugs acting on new targets. Thus, as noted in a recent review, "the identification of essential genes in fungi represents a crucial step in the development of new antifungal drugs". Expanding the target space by rapidly identifying new essential genes has thus been described as "the most important task of genomics-based target validation". In previous research, we were the first to show that essential gene annotation can be reliably transferred between four distantly related prokaryotic species. In this study, we extend our machine learning approach to the much more complex eukaryotic fungal species. A compendium of essential genes is predicted in Af by transferring known essential gene annotations from another filamentous fungus, Neurospora crassa. This approach predicts essential genes by integrating diverse types of intrinsic and context-dependent genomic features encoded in microbial genomes. The predicted essential datasets contained 1674 genes. We validated our results by comparing our predictions with known essential genes in Af, comparing our predictions with those predicted by homology mapping, and testing conditionally expressed alleles. We applied several layers of filters and selected a set of potential drug targets from the predicted essential genes. Finally, we have conducted wet lab knockout experiments to verify our predictions, which further validates the accuracy and wide applicability of the machine learning approach. The approach presented here significantly extended our ability to predict essential genes beyond orthologs and made it possible to
A Genome-wide CRISPR Screen in Toxoplasma Identifies Essential Apicomplexan Genes.

PubMed

Sidik, Saima M; Huet, Diego; Ganesan, Suresh M; Huynh, My-Hang; Wang, Tim; Nasamu, Armiyaw S; Thiru, Prathapan; Saeij, Jeroen P J; Carruthers, Vern B; Niles, Jacquin C; Lourido, Sebastian

2016-09-08

Apicomplexan parasites are leading causes of human and livestock diseases such as malaria and toxoplasmosis, yet most of their genes remain uncharacterized. Here, we present the first genome-wide genetic screen of an apicomplexan. We adapted CRISPR/Cas9 to assess the contribution of each gene from the parasite Toxoplasma gondii during infection of human fibroblasts. Our analysis defines ∼200 previously uncharacterized, fitness-conferring genes unique to the phylum, from which 16 were investigated, revealing essential functions during infection of human cells. Secondary screens identify as an invasion factor the claudin-like apicomplexan microneme protein (CLAMP), which resembles mammalian tight-junction proteins and localizes to secretory organelles, making it critical to the initiation of infection. CLAMP is present throughout sequenced apicomplexan genomes and is essential during the asexual stages of the malaria parasite Plasmodium falciparum. These results provide broad-based functional information on T. gondii genes and will facilitate future approaches to expand the horizon of antiparasitic interventions. Copyright © 2016 Elsevier Inc. All rights reserved.
A genome-wide inducible phenotypic screen identifies antisense RNA constructs silencing Escherichia coli essential genes

PubMed Central

Meng, Jia; Kanzaki, Gregory; Meas, Diane; Lam, Christopher K.; Crummer, Heather; Tain, Justina; Xu, H. Howard

2013-01-01

Regulated antisense RNA (asRNA) expression has been employed successfully in Gram-positive bacteria for genome-wide essential gene identification and drug target determination. However, there have been no published reports describing the application of asRNA gene silencing for comprehensive analyses of essential genes in Gram-negative bacteria. In this study, we report the first genome-wide identification of asRNA constructs for essential genes in Escherichia coli. We screened 250,000 library transformants for conditional growth-inhibitory recombinant clones from two shot-gun genomic libraries of E. coli using a paired-termini expression vector (pHN678). After sequencing plasmid inserts of 675 confirmed inducer-sensitive cell clones, we identified 152 separate asRNA constructs of which 134 inserts came from essential genes while 18 originated from non-essential genes (but share operons with essential genes). Among the 79 individual essential genes silenced by these asRNA constructs, 61 genes (77%) engage in processes related to protein synthesis. The cell-based assays of an asRNA clone targeting fusA (encoding elongation factor G) showed that the induced cells were sensitized 12 fold to fusidic acid, a known specific inhibitor. Our results demonstrate the utility of the paired-termini expression vector and feasibility of large-scale gene silencing in E. coli using regulated asRNA expression. PMID:22268863
A novel essential domain perspective for exploring gene essentiality.

PubMed

Lu, Yao; Lu, Yulan; Deng, Jingyuan; Peng, Hai; Lu, Hui; Lu, Long Jason

2015-09-15

Genes with indispensable functions are identified as essential; however, the traditional gene-level studies of essentiality have several limitations. In this study, we characterized gene essentiality from a new perspective of protein domains, the independent structural or functional units of a polypeptide chain. To identify such essential domains, we have developed an Expectation-Maximization (EM) algorithm-based Essential Domain Prediction (EDP) Model. With simulated datasets, the model provided convergent results given different initial values and offered accurate predictions even with noise. We then applied the EDP model to six microbial species and predicted 1879 domains to be essential in at least one species, ranging from 10% to 23% in each species. The predicted essential domains were more conserved than either non-essential domains or essential genes. Comparing essential domains in prokaryotes and eukaryotes revealed an evolutionary distance consistent with that inferred from ribosomal RNA. When utilizing these essential domains to reproduce the annotation of essential genes, we obtained accurate results that suggest protein domains are more basic units for the essentiality of genes. Furthermore, we presented several examples to illustrate how the combination of essential and non-essential domains can lead to genes with divergent essentiality. In summary, we have described the first systematic analysis of gene essentiality at the level of domains. Contact: huilu.bioinfo@gmail.com or Long.Lu@cchmc.org. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
A genome-wide inducible phenotypic screen identifies antisense RNA constructs silencing Escherichia coli essential genes.

PubMed

Meng, Jia; Kanzaki, Gregory; Meas, Diane; Lam, Christopher K; Crummer, Heather; Tain, Justina; Xu, H Howard

2012-04-01

Regulated antisense RNA (asRNA) expression has been employed successfully in Gram-positive bacteria for genome-wide essential gene identification and drug target determination. However, there have been no published reports describing the application of asRNA gene silencing for comprehensive analyses of essential genes in Gram-negative bacteria. In this study, we report the first genome-wide identification of asRNA constructs for essential genes in Escherichia coli. We screened 250 000 library transformants for conditional growth inhibitory recombinant clones from two shotgun genomic libraries of E. coli using a paired-termini expression vector (pHN678). After sequencing plasmid inserts of 675 confirmed inducer sensitive cell clones, we identified 152 separate asRNA constructs of which 134 inserts came from essential genes, while 18 originated from nonessential genes (but share operons with essential genes). Among the 79 individual essential genes silenced by these asRNA constructs, 61 genes (77%) engage in processes related to protein synthesis. The cell-based assays of an asRNA clone targeting fusA (encoding elongation factor G) showed that the induced cells were sensitized 12-fold to fusidic acid, a known specific inhibitor. Our results demonstrate the utility of the paired-termini expression vector and feasibility of large-scale gene silencing in E. coli using regulated asRNA expression. © 2012 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
Pseudomonas aeruginosa essentials: an update on investigation of essential genes.

PubMed

Juhas, Mario

2015-11-01

Pseudomonas aeruginosa is the leading cause of nosocomial infections, particularly in immunocompromised, cancer, burn and cystic fibrosis patients. Development of novel antimicrobials against P. aeruginosa is therefore of the highest importance. Although the first reports on P. aeruginosa essential genes date back to the early 2000s, a number of more sensitive genomic approaches have been used recently to better define essential genes in this organism. These analyses highlight the evolution of the definition of an 'essential' gene from the traditional to the context-dependent. Essential genes, particularly those indispensable under the clinically relevant conditions, are considered to be promising targets of novel antibiotics against P. aeruginosa. This review provides an update on the investigation of P. aeruginosa essential genes. Special focus is on recently identified P. aeruginosa essential genes and their exploitation for the development of antimicrobials.
Transposon Mutagenesis Identified Chromosomal and Plasmid Genes Essential for Adaptation of the Marine Bacterium Dinoroseobacter shibae to Anaerobic Conditions

PubMed Central

Ebert, Matthias; Laaß, Sebastian; Burghartz, Melanie; Petersen, Jörn; Koßmehl, Sebastian; Wöhlbrand, Lars; Rabus, Ralf; Wittmann, Christoph; Jahn, Dieter

2013-01-01

Anaerobic growth and survival are integral parts of the life cycle of many marine bacteria. To identify genes essential for the anoxic life of Dinoroseobacter shibae, a transposon library was screened for strains impaired in anaerobic denitrifying growth. Transposon insertions in 35 chromosomal and 18 plasmid genes were detected. The essential contribution of plasmid genes to anaerobic growth was confirmed with plasmid-cured D. shibae strains. A combined transcriptome and proteome approach identified oxygen tension-regulated genes. Transposon insertion sites of a total of 1,527 mutants without an anaerobic growth phenotype were determined to identify anaerobically induced but not essential genes. A surprisingly small overlap of only three genes (napA, phaA, and the Na+/Pi antiporter gene Dshi_0543) between anaerobically essential and induced genes was found. Interestingly, transposon mutations in genes involved in dissimilatory and assimilatory nitrate reduction (napA, nasA) and corresponding cofactor biosynthesis (genomic moaB, moeB, and dsbC and plasmid-carried dsbD and ccmH) were found to cause anaerobic growth defects. In contrast, mutation of anaerobically induced genes encoding proteins required for the later denitrification steps (nirS, nirJ, nosD), dimethyl sulfoxide reduction (dmsA1), and fermentation (pdhB1, arcA, aceE, pta, acs) did not result in decreased anaerobic growth under the conditions tested. Additional essential components (ferredoxin, cccA) of the anaerobic electron transfer chain and central metabolism (pdhB) were identified. Another surprise was the importance of sodium gradient-dependent membrane processes and genomic rearrangements via viruses, transposons, and insertion sequence elements for anaerobic growth. These processes and the observed contributions of cell envelope restructuring (lysM, mipA, fadK), C4-dicarboxylate transport (dctM1, dctM3), and protease functions to anaerobic growth require further investigation to unravel the
Guided genetic screen to identify genes essential in the regeneration of hair cells and other tissues.

PubMed

Pei, Wuhong; Xu, Lisha; Huang, Sunny C; Pettie, Kade; Idol, Jennifer; Rissone, Alberto; Jimenez, Erin; Sinclair, Jason W; Slevin, Claire; Varshney, Gaurav K; Jones, MaryPat; Carrington, Blake; Bishop, Kevin; Huang, Haigen; Sood, Raman; Lin, Shuo; Burgess, Shawn M

2018-01-01

Regenerative medicine holds great promise for both degenerative diseases and traumatic tissue injury which represent significant challenges to the health care system. Hearing loss, which affects hundreds of millions of people worldwide, is caused primarily by a permanent loss of the mechanosensory receptors of the inner ear known as hair cells. This failure to regenerate hair cells after loss is limited to mammals, while all other non-mammalian vertebrates tested were able to completely regenerate these mechanosensory receptors after injury. To understand the mechanism of hair cell regeneration and its association with regeneration of other tissues, we performed a guided mutagenesis screen using zebrafish lateral line hair cells as a screening platform to identify genes that are essential for hair cell regeneration, and further investigated how genes essential for hair cell regeneration were involved in the regeneration of other tissues. We created genetic mutations either by retroviral insertion or CRISPR/Cas9 approaches, and developed a high-throughput screening pipeline for analyzing hair cell development and regeneration. We screened 254 gene mutations and identified 7 genes specifically affecting hair cell regeneration. These hair cell regeneration genes fell into distinct and somewhat surprising functional categories. By examining the regeneration of caudal fin and liver, we found these hair cell regeneration genes often also affected other types of tissue regeneration. Therefore, our results demonstrate guided screening is an effective approach to discover regeneration candidates, and hair cell regeneration is associated with other tissue regeneration.
Genes essential for phototrophic growth by a purple alphaproteobacterium: Genes for phototrophic growth

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yang, Jianming; Yin, Liang; Lessner, Faith H.

Anoxygenic purple phototrophic bacteria have served as important models for studies of photophosphorylation. The pigment-protein complexes responsible for converting light energy to ATP are relatively simple and these bacteria can grow heterotrophically under aerobic conditions, thus allowing for the study of mutants defective in photophosphorylation. In the past, genes responsible for anoxygenic phototrophic growth have been identified in a number of different bacterial species. Here we systematically studied the genetic basis for this metabolism by using Tn-seq to identify genes essential for the anaerobic growth of the purple bacterium Rhodopseudomonas palustris on acetate in light. We identified 171 genes required for growth in this condition, 35 of which are annotated as photosynthesis genes. Among these are a few new genes not previously shown to be essential for phototrophic growth. We verified the essentiality of many of the genes we identified by analyzing the phenotypes of mutants we generated by Tn mutagenesis that had altered pigmentation. We used directed mutagenesis to verify that the R. palustris NADH:quinone oxidoreductase complex IE is essential for phototrophic growth. As a complement to the genetic data, we carried out proteomics experiments in which we found that 429 proteins were present in significantly higher amounts in cells grown anaerobically in light compared to aerobically. Among these were proteins encoded by a subset of the phototrophic growth-essential genes.
Properties of genes essential for mouse development

PubMed Central

Kabir, Mitra; Barradas, Ana; Tzotzos, George T.; Hentges, Kathryn E.

2017-01-01

Essential genes are those that are critical for life. In the specific case of the mouse, they are the set of genes whose deletion means that a mouse is unable to survive after birth. As such, they are the key minimal set of genes needed for all the steps of development to produce an organism capable of life ex utero. We explored a wide range of sequence and functional features to characterise essential (lethal) and non-essential (viable) genes in mice. Experimental data curated manually identified 1301 essential genes and 3451 viable genes. Very many sequence features show highly significant differences between essential and viable mouse genes. Essential genes generally encode complex proteins, with multiple domains and many introns. These genes tend to be: long, highly expressed, old and evolutionarily conserved. These genes tend to encode ligases, transferases, phosphorylated proteins, intracellular proteins, nuclear proteins, and hubs in protein-protein interaction networks. They are involved with regulating protein-protein interactions, gene expression and metabolic processes, cell morphogenesis, cell division, cell proliferation, DNA replication, cell differentiation, DNA repair and transcription, cell differentiation and embryonic development. Viable genes tend to encode: membrane proteins or secreted proteins, and are associated with functions such as cellular communication, apoptosis, behaviour and immune response, as well as housekeeping and tissue specific functions. Viable genes are linked to transport, ion channels, signal transduction, calcium binding and lipid binding, consistent with their location in membranes and involvement with cell-cell communication. From the analysis of the composite features of essential and viable genes, we conclude that essential genes tend to be required for intracellular functions, and viable genes tend to be involved with extracellular functions and cell-cell communication. Knowledge of the features that are over
The essential gene set of a photosynthetic organism

DOE PAGES

Rubin, Benjamin E.; Wetmore, Kelly M.; Price, Morgan N.; ...

2015-10-27

Synechococcus elongatus PCC 7942 is a model organism used for studying photosynthesis and the circadian clock, and it is being developed for the production of fuel, industrial chemicals, and pharmaceuticals. To identify a comprehensive set of genes and intergenic regions that impacts fitness in S. elongatus, we created a pooled library of ~250,000 transposon mutants and used sequencing to identify the insertion locations. By analyzing the distribution and survival of these mutants, we identified 718 of the organism's 2,723 genes as essential for survival under laboratory conditions. The validity of the essential gene set is supported by its tight overlap with well-conserved genes and its enrichment for core biological processes. The differences noted between our dataset and these predictors of essentiality, however, have led to surprising biological insights. One such finding is that genes in a large portion of the TCA cycle are dispensable, suggesting that S. elongatus does not require a cyclic TCA process. Furthermore, the density of the transposon mutant library enabled individual and global statements about the essentiality of noncoding RNAs, regulatory elements, and other intergenic regions. In this way, a group I intron located in tRNA-Leu, which has been used extensively for phylogenetic studies, was shown here to be essential for the survival of S. elongatus. Our survey of essentiality for every locus in the S. elongatus genome serves as a powerful resource for understanding the organism's physiology and defines the essential gene set required for the growth of a photosynthetic organism.
A CRISPR-Based Screen Identifies Genes Essential for West-Nile-Virus-Induced Cell Death.

PubMed

Ma, Hongming; Dang, Ying; Wu, Yonggan; Jia, Gengxiang; Anaya, Edgar; Zhang, Junli; Abraham, Sojan; Choi, Jang-Gi; Shi, Guojun; Qi, Ling; Manjunath, N; Wu, Haoquan

2015-07-28

West Nile virus (WNV) causes an acute neurological infection attended by massive neuronal cell death. However, the mechanism(s) behind the virus-induced cell death is poorly understood. Using a library containing 77,406 sgRNAs targeting 20,121 genes, we performed a genome-wide screen followed by a second screen with a sub-library. Among the genes identified, seven genes, EMC2, EMC3, SEL1L, DERL2, UBE2G2, UBE2J1, and HRD1, stood out as having the strongest phenotype, whose knockout conferred strong protection against WNV-induced cell death with two different WNV strains and in three cell lines. Interestingly, knockout of these genes did not block WNV replication. Thus, these appear to be essential genes that link WNV replication to downstream cell death pathway(s). In addition, the fact that all of these genes belong to the ER-associated protein degradation (ERAD) pathway suggests that this might be the primary driver of WNV-induced cell death. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Genome-wide essential gene identification in Streptococcus sanguinis

PubMed Central

Xu, Ping; Ge, Xiuchun; Chen, Lei; Wang, Xiaojing; Dou, Yuetan; Xu, Jerry Z.; Patel, Jenishkumar R.; Stone, Victoria; Trinh, My; Evans, Karra; Kitten, Todd; Bonchev, Danail; Buck, Gregory A.

2011-01-01

A clear perception of gene essentiality in bacterial pathogens is pivotal for identifying drug targets to combat emergence of new pathogens and antibiotic-resistant bacteria, for synthetic biology, and for understanding the origins of life. We have constructed a comprehensive set of deletion mutants and systematically identified a clearly defined set of essential genes for Streptococcus sanguinis. Our results were confirmed by growing S. sanguinis in minimal medium and by double-knockout of paralogous or isozyme genes. Careful examination revealed that these essential genes were associated with only three basic categories of biological functions: maintenance of the cell envelope, energy production, and processing of genetic information. Our finding was subsequently validated in two other pathogenic streptococcal species, Streptococcus pneumoniae and Streptococcus mutans and in two other gram-positive pathogens, Bacillus subtilis and Staphylococcus aureus. Our analysis has thus led to a simplified model that permits reliable prediction of gene essentiality. PMID:22355642
Analysis of pan-genome to identify the core genes and essential genes of Brucella spp.

PubMed

Yang, Xiaowen; Li, Yajie; Zang, Juan; Li, Yexia; Bie, Pengfei; Lu, Yanli; Wu, Qingmin

2016-04-01

Brucella spp. are facultative intracellular pathogens that cause a contagious zoonotic disease, which can result in outcomes such as abortion or sterility in susceptible animal hosts and grave, debilitating illness in humans. To decipher the survival mechanism of Brucella spp. in vivo, 42 complete Brucella genomes from NCBI were analyzed to define the pan-genome and core genome and to characterize their composition and function. The results showed that the total of 132,143 protein-coding genes in these genomes were divided into 5369 clusters. Among these, 1710 clusters were associated with the core genome, 1182 clusters with strain-specific genes and 2477 clusters with dispensable genomes. COG analysis indicated that 44% of the core genes were devoted to metabolism, mainly energy production and conversion (COG category C) and amino acid transport and metabolism (COG category E). Meanwhile, approximately 35% of the core genes were under positive selection. In addition, 1252 potential essential genes were predicted in the core genome by comparison with a prokaryote database of essential genes. The results suggested that the core genes in Brucella genomes are relatively conserved, and that energy and amino acid metabolism play a more important role in the growth and reproduction of Brucella spp. This study might help us to better understand the mechanisms of Brucella persistent infection and provide some clues for further exploring the gene modules of intracellular survival in Brucella spp.
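
The three-way partition of gene clusters described in the record above can be sketched as follows. The sketch assumes the common working definitions (core: present in every genome; strain-specific: present in exactly one genome; dispensable: everything in between), which the abstract does not spell out. The function name and the toy presence table are hypothetical.

```python
def classify_clusters(presence, n_genomes):
    """Count core, dispensable, and strain-specific gene clusters.

    presence: mapping of cluster id -> iterable of genome ids that
              contain at least one member of the cluster.
    n_genomes: total number of genomes in the comparison.
    """
    counts = {"core": 0, "dispensable": 0, "strain_specific": 0}
    for genomes in presence.values():
        k = len(set(genomes))
        if k == n_genomes:
            counts["core"] += 1
        elif k == 1:
            counts["strain_specific"] += 1
        else:
            counts["dispensable"] += 1
    return counts


# Toy presence table (hypothetical) for three genomes.
presence = {
    "c1": {"g1", "g2", "g3"},  # found everywhere -> core
    "c2": {"g1"},              # one genome only -> strain-specific
    "c3": {"g1", "g3"},        # some genomes -> dispensable
    "c4": {"g2"},              # one genome only -> strain-specific
}
counts = classify_clusters(presence, n_genomes=3)
print(counts)

# The cluster counts reported in the abstract add up to the stated total.
reported_total = 1710 + 1182 + 2477
print(reported_total)
```

The three categories are mutually exclusive, which is why the reported counts (1710 core, 1182 strain-specific, 2477 dispensable) sum to the 5369 clusters given in the abstract.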
OGEE v2: an update of the online gene essentiality database with special focus on differentially essential genes in human cancer cell lines.

PubMed

Chen, Wei-Hua; Lu, Guanting; Chen, Xiao; Zhao, Xing-Ming; Bork, Peer

2017-01-04

OGEE is an Online GEne Essentiality database. To enhance our understanding of the essentiality of genes, in OGEE we collected experimentally tested essential and non-essential genes, as well as associated gene properties known to contribute to gene essentiality. We focus on large-scale experiments, and complement our data with text-mining results. We organized tested genes into data sets according to their sources, and tagged those with variable essentiality statuses across data sets as conditionally essential genes, intending to highlight the complex interplay between gene functions and environments/experimental perturbations. Developments since the last public release include increased numbers of species and gene essentiality data sets, inclusion of non-coding essential sequences and genes with intermediate essentiality statuses. In addition, we included 16 essentiality data sets from cancer cell lines, corresponding to 9 human cancers; with OGEE, users can easily explore the shared and differentially essential genes within and between cancer types. These genes, especially those derived from cell lines that are similar to tumor samples, could reveal the oncogenic drivers, paralogous gene expression pattern and chromosomal structure of the corresponding cancer types, and can be further screened to identify targets for cancer therapy and/or new drug development. OGEE is freely available at http://ogee.medgenius.info. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
ZCURVE 3.0: identify prokaryotic genes with higher accuracy as well as automatically and accurately select essential genes

PubMed Central

Hua, Zhi-Gang; Lin, Yan; Yuan, Ya-Zhou; Yang, De-Chang; Wei, Wen; Guo, Feng-Biao

2015-01-01

In 2003, we developed an ab initio program, ZCURVE 1.0, to find genes in bacterial and archaeal genomes. In this work, we present the updated version (i.e. ZCURVE 3.0). Using 422 prokaryotic genomes, the average accuracy was 93.7% with the updated version, compared with 88.7% with the original version. Such results also demonstrate that ZCURVE 3.0 is comparable with Glimmer 3.02 and may provide complementary predictions to it. In fact, the joint application of the two programs generated better results by correctly finding more annotated genes while also containing fewer false-positive predictions. As the exclusive function, ZCURVE 3.0 contains one post-processing program that can identify essential genes with high accuracy (generally >90%). We hope ZCURVE 3.0 will receive wide use with the web-based running mode. The updated ZCURVE can be freely accessed from http://cefg.uestc.edu.cn/zcurve/ or http://tubic.tju.edu.cn/zcurveb/ without any restrictions. PMID:25977299
Identification of essential genes in Streptococcus pneumoniae by allelic replacement mutagenesis.

PubMed

Song, Jae-Hoon; Ko, Kwan Soo; Lee, Ji-Young; Baek, Jin Yang; Oh, Won Sup; Yoon, Ha Sik; Jeong, Jin-Yong; Chun, Jongsik

2005-06-30

To find potential targets of novel antimicrobial agents, we identified essential genes of Streptococcus pneumoniae using comparative genomics and allelic replacement mutagenesis. We compared the genome of S. pneumoniae R6 with those of Bacillus subtilis, Enterococcus faecalis, Escherichia coli, and Staphylococcus aureus, and selected 693 candidate target genes with > 40% amino acid sequence identity to the corresponding genes in at least two of the other species. The 693 genes were disrupted and 133 were found to be essential for growth. Of these, 32 encoded proteins of unknown function, and we were able to identify orthologues of 22 of these genes by genomic comparisons. The experimental method used in this study is easy to perform, rapid and efficient for identifying essential genes of bacterial pathogens.
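
The candidate-selection rule in the record above (more than 40% amino acid identity to the corresponding gene in at least two of the other species) can be written as a short filter. This is a sketch under stated assumptions: the input is taken to be a precomputed table of best-hit identities, and the gene names and identity values are hypothetical, not data from the study.

```python
def is_candidate(identities, min_identity=40.0, min_species=2):
    """True if identity exceeds min_identity in at least min_species species.

    identities: mapping of species name -> percent amino acid identity of
                the best corresponding gene (omit species with no hit).
    """
    hits = sum(1 for value in identities.values() if value > min_identity)
    return hits >= min_species


# Hypothetical identity table for three S. pneumoniae genes.
table = {
    "gene1": {"B. subtilis": 62.0, "E. faecalis": 55.5,
              "E. coli": 38.0, "S. aureus": 47.2},
    "gene2": {"B. subtilis": 41.0, "E. coli": 22.0},
    "gene3": {"E. faecalis": 40.0, "S. aureus": 71.3},
}
selected = [gene for gene, ids in table.items() if is_candidate(ids)]
print(selected)

# Share of disrupted candidates found essential, from the abstract's counts.
essential_fraction = round(100.0 * 133 / 693, 1)
print(essential_fraction)
```

Note the strict inequality: a gene with exactly 40% identity in one species, such as the hypothetical gene3, does not count that species toward the threshold.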
Gene essentiality, conservation index and co-evolution of genes in cyanobacteria.

PubMed

Tiruveedula, Gopi Siva Sai; Wangikar, Pramod P

2017-01-01

Cyanobacteria, a group of photosynthetic prokaryotes, dominate the earth with ~10^15 g wet biomass. Despite diversity in habitats and an ancient origin, the cyanobacterial phylum has retained a significant core genome. Cyanobacteria are being explored for direct conversion of solar energy and carbon dioxide into biofuels. For this, efficient cyanobacterial strains will need to be designed via metabolic engineering. This will require identification of target knockouts to channel the flow of carbon toward the product of interest while minimizing deletions of essential genes. We propose "Gene Conservation Index" (GCI) as a quick measure to predict gene essentiality in cyanobacteria. GCI is based on the phylogenetic profile of a gene constructed with a reduced dataset of cyanobacterial genomes. GCI is the percentage of organism clusters in which the query gene is present in the reduced dataset. Of the 750 genes deemed to be essential in the experimental study on S. elongatus PCC 7942, we found 494 to be conserved across the phylum, which largely comprise the essential metabolic pathways. In contrast, the conserved but non-essential genes broadly comprise genes required under stress conditions. Exceptions to this rule include genes such as the glycogen synthesis and degradation enzymes, deoxyribose-phosphate aldolase (DERA), glucose-6-phosphate 1-dehydrogenase (zwf) and fructose-1,6-bisphosphatase class 1, which are conserved but non-essential. While the essential genes are to be avoided during gene knockout studies as potentially lethal deletions, the non-essential but conserved set of genes could be interesting targets for metabolic engineering. Further, we identify clusters of co-evolving genes (CCG), which provide insights that may be useful in annotation. Principal component analysis (PCA) plots of the CCGs are demonstrated as data visualization tools that are complementary to the conventional heatmaps. Our dataset consists of phylogenetic profiles for 23
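
The Gene Conservation Index defined in the record above (the percentage of organism clusters in which the query gene is present) reduces to a one-line calculation once a phylogenetic profile is available. The sketch below assumes the profile is a mapping from organism cluster to a presence flag; the cluster names and values are hypothetical.

```python
def gene_conservation_index(profile):
    """Percentage of organism clusters in which the gene is present.

    profile: mapping of organism-cluster id -> True/False presence flag.
    """
    if not profile:
        raise ValueError("phylogenetic profile is empty")
    present = sum(1 for flag in profile.values() if flag)
    return 100.0 * present / len(profile)


# Hypothetical phylogenetic profile over five organism clusters.
profile = {
    "cluster_1": True,
    "cluster_2": True,
    "cluster_3": False,
    "cluster_4": True,
    "cluster_5": True,
}
gci = gene_conservation_index(profile)
print(gci)
```

A gene with a GCI of 100 would be present in every organism cluster of the reduced dataset; how high a GCI should be before a gene is treated as likely essential is a choice the abstract does not state.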

An ensemble framework for identifying essential proteins.

PubMed

Zhang, Xue; Xiao, Wangxin; Acencio, Marcio Luis; Lemke, Ney; Wang, Xujing

2016-08-25

Many centrality measures have been proposed to mine and characterize the correlations between network topological properties and protein essentiality. However, most of them show limited prediction accuracy, and the number of common predicted essential proteins by different methods is very small. In this paper, an ensemble framework is proposed which integrates gene expression data and protein-protein interaction networks (PINs). It aims to improve the prediction accuracy of basic centrality measures. The idea behind this ensemble framework is that different protein-protein interactions (PPIs) may show different contributions to protein essentiality. Five standard centrality measures (degree centrality, betweenness centrality, closeness centrality, eigenvector centrality, and subgraph centrality) are integrated into the ensemble framework respectively. We evaluated the performance of the proposed ensemble framework using yeast PINs and gene expression data. The results show that it can considerably improve the prediction accuracy of the five centrality measures individually. It can also remarkably increase the number of common predicted essential proteins among those predicted by each centrality measure individually and enable each centrality measure to find more low-degree essential proteins. This paper demonstrates that it is valuable to differentiate the contributions of different PPIs for identifying essential proteins based on network topological characteristics. The proposed ensemble framework is a successful paradigm to this end.
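Two of the five standard centrality measures named in this abstract, degree and closeness centrality, can be sketched in plain Python as follows. This is only an illustration of the baseline measures on a hypothetical toy network; it does not implement the authors' ensemble framework or its use of gene expression data, and the protein names are invented.

```python
from collections import deque


def degree_centrality(graph):
    """Degree of each node divided by the maximum possible degree (n - 1)."""
    n = len(graph)
    return {node: len(neighbors) / (n - 1) for node, neighbors in graph.items()}


def closeness_centrality(graph, source):
    """Number of reachable nodes divided by the sum of their BFS distances."""
    distances = {source: 0}
    queue = deque([source])
    while queue:
        current = queue.popleft()
        for neighbor in graph[current]:
            if neighbor not in distances:
                distances[neighbor] = distances[current] + 1
                queue.append(neighbor)
    total = sum(distances.values())
    return (len(distances) - 1) / total if total > 0 else 0.0


# Hypothetical toy protein interaction network as an undirected adjacency map.
toy_pin = {
    "P1": {"P2", "P3", "P4"},
    "P2": {"P1"},
    "P3": {"P1", "P4"},
    "P4": {"P1", "P3"},
}
degree = degree_centrality(toy_pin)
# Rank proteins from most to least connected; top-ranked ones would be the
# candidates a centrality-based predictor flags as likely essential.
ranked = sorted(toy_pin, key=lambda protein: degree[protein], reverse=True)
print(ranked[0], degree[ranked[0]], closeness_centrality(toy_pin, "P2"))
```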
Defining the ABC of gene essentiality in streptococci.

PubMed

Charbonneau, Amelia R L; Forman, Oliver P; Cain, Amy K; Newland, Graham; Robinson, Carl; Boursnell, Mike; Parkhill, Julian; Leigh, James A; Maskell, Duncan J; Waller, Andrew S

2017-05-31

Utilising next generation sequencing to interrogate saturated bacterial mutant libraries provides unprecedented information for the assignment of genome-wide gene essentiality. Exposure of saturated mutant libraries to specific conditions and subsequent sequencing can be exploited to uncover gene essentiality relevant to the condition. Here we present a barcoded transposon directed insertion-site sequencing (TraDIS) system to define an essential gene list for Streptococcus equi subsp. equi, the causative agent of strangles in horses, for the first time. The gene essentiality data for this group C Streptococcus was compared to that of group A and B streptococci. Six barcoded variants of pGh9:ISS1 were designed and used to generate mutant libraries containing between 33,000-66,000 unique mutants. TraDIS was performed on DNA extracted from each library and data were analysed separately and as a combined master pool. Gene essentiality determined that 19.5% of the S. equi genome was essential. Gene essentialities were compared to those of group A and group B streptococci, identifying concordances of 90.2% and 89.4%, respectively and an overall concordance of 83.7% between the three species. The use of barcoded pGh9:ISS1 to generate mutant libraries provides a highly useful tool for the assignment of gene function in S. equi and other streptococci. The shared essential gene set of group A, B and C streptococci provides further evidence of the close genetic relationships between these important pathogenic bacteria. Therefore, the ABC of gene essentiality reported here provides a solid foundation towards reporting the functional genome of streptococci.
Combining Shigella Tn-seq data with gold-standard E. coli gene deletion data suggests rare transitions between essential and non-essential gene functionality.

PubMed

Freed, Nikki E; Bumann, Dirk; Silander, Olin K

2016-09-06

Gene essentiality - whether or not a gene is necessary for cell growth - is a fundamental component of gene function. It is not well established how quickly gene essentiality can change, as few studies have compared empirical measures of essentiality between closely related organisms. Here we present the results of a Tn-seq experiment designed to detect essential protein coding genes in the bacterial pathogen Shigella flexneri 2a 2457T on a genome-wide scale. Superficial analysis of this data suggested that 481 protein-coding genes in this Shigella strain are critical for robust cellular growth on rich media. Comparison of this set of genes with a gold-standard data set of essential genes in the closely related Escherichia coli K12 BW25113 revealed that an excessive number of genes appeared essential in Shigella but non-essential in E. coli. Importantly, and in converse to this comparison, we found no genes that were essential in E. coli and non-essential in Shigella, implying that many genes were artefactually inferred as essential in Shigella. Controlling for such artefacts resulted in a much smaller set of discrepant genes. Among these, we identified three sets of functionally related genes, two of which have previously been implicated as critical for Shigella growth, but which are dispensable for E. coli growth. The data presented here highlight the small number of protein coding genes for which we have strong evidence that their essentiality status differs between the closely related bacterial taxa E. coli and Shigella. A set of genes involved in acetate utilization provides a canonical example. These results leave open the possibility of developing strain-specific antibiotic treatments targeting such differentially essential genes, but suggest that such opportunities may be rare in closely related bacteria.
Defining the Role of Essential Genes in Human Disease

PubMed Central

Robertson, David L.; Hentges, Kathryn E.

2011-01-01

A greater understanding of the causes of human disease can come from identifying characteristics that are specific to disease genes. However, a full understanding of the contribution of essential genes to human disease is lacking, due to the premise that these genes tend to cause developmental abnormalities rather than adult disease. We tested the hypothesis that human orthologs of mouse essential genes are associated with a variety of human diseases, rather than only those related to miscarriage and birth defects. We segregated human disease genes according to whether the knockout phenotype of their mouse ortholog was lethal or viable, defining those with orthologs producing lethal knockouts as essential disease genes. We show that the human orthologs of mouse essential genes are associated with a wide spectrum of diseases affecting diverse physiological systems. Notably, human disease genes with essential mouse orthologs are over-represented among disease genes associated with cancer, suggesting links between adult cellular abnormalities and developmental functions. The proteins encoded by essential genes are highly connected in protein-protein interaction networks, which we find correlates with an over-representation of nuclear proteins amongst essential disease genes. Disease genes associated with essential orthologs also are more likely than those with non-essential orthologs to contribute to disease through an autosomal dominant inheritance pattern, suggesting that these diseases may actually result from semi-dominant mutant alleles. Overall, we have described attributes found in disease genes according to the essentiality status of their mouse orthologs. These findings demonstrate that disease genes do occupy highly connected positions in protein-protein interaction networks, and that due to the complexity of disease-associated alleles, essential genes cannot be ignored as candidates for causing diverse human diseases. PMID:22096564
ZCURVE 3.0: identify prokaryotic genes with higher accuracy as well as automatically and accurately select essential genes.

PubMed

Hua, Zhi-Gang; Lin, Yan; Yuan, Ya-Zhou; Yang, De-Chang; Wei, Wen; Guo, Feng-Biao

2015-07-01

In 2003, we developed an ab initio program, ZCURVE 1.0, to find genes in bacterial and archaeal genomes. In this work, we present the updated version (i.e. ZCURVE 3.0). Using 422 prokaryotic genomes, the average accuracy was 93.7% with the updated version, compared with 88.7% with the original version. Such results also demonstrate that ZCURVE 3.0 is comparable with Glimmer 3.02 and may provide complementary predictions to it. In fact, the joint application of the two programs generated better results by correctly finding more annotated genes while also containing fewer false-positive predictions. As the exclusive function, ZCURVE 3.0 contains one post-processing program that can identify essential genes with high accuracy (generally >90%). We hope ZCURVE 3.0 will receive wide use with the web-based running mode. The updated ZCURVE can be freely accessed from http://cefg.uestc.edu.cn/zcurve/ or http://tubic.tju.edu.cn/zcurveb/ without any restrictions. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways.

PubMed

Chen, Lei; Zhang, Yu-Hang; Wang, ShaoPeng; Zhang, YunHua; Huang, Tao; Cai, Yu-Dong

2017-01-01

Identifying essential genes in a given organism is important for research on their fundamental roles in organism survival. Furthermore, if possible, uncovering the links between core functions or pathways and these essential genes will further help us obtain deep insight into the key roles of these genes. In this study, we investigated the essential and non-essential genes reported in a previous study and extracted gene ontology (GO) terms and biological pathways that are important for the determination of essential genes. Through the enrichment theory of GO and KEGG pathways, we encoded each essential/non-essential gene into a vector in which each component represented the relationship between the gene and one GO term or KEGG pathway. To analyze these relationships, the maximum relevance minimum redundancy (mRMR) method was adopted. Then, the incremental feature selection (IFS) and support vector machine (SVM) were employed to extract important GO terms and KEGG pathways. A prediction model was built simultaneously using the extracted GO terms and KEGG pathways, which yielded nearly perfect performance, with a Matthews correlation coefficient of 0.951, for distinguishing essential and non-essential genes. To fully investigate the key factors influencing the fundamental roles of essential genes, the 21 most important GO terms and three KEGG pathways were analyzed in detail. In addition, several genes were provided in this study, which were predicted to be essential genes by our prediction model. We suggest that this study provides more functional and pathway information on the essential genes and provides a new way to investigate related problems.
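For readers unfamiliar with the Matthews correlation coefficient (MCC) quoted in this abstract, the following Python sketch computes it from the four cells of a binary confusion matrix using the standard formula. The counts in the example are hypothetical and are not the study's results.

```python
import math


def matthews_corrcoef(tp, tn, fp, fn):
    """MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN)).

    Returns 0.0 when any marginal total is zero, a common convention.
    """
    numerator = tp * tn - fp * fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return numerator / denominator if denominator else 0.0


# Hypothetical counts: essential genes are the positive class.
mcc = matthews_corrcoef(tp=90, tn=95, fp=5, fn=10)
print(f"MCC = {mcc:.3f}")
```

MCC ranges from -1 (total disagreement) through 0 (no better than chance) to +1 (perfect prediction), and it stays informative when essential genes are a small minority of the genome, which is why it is often preferred to plain accuracy in this setting.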
A Noise Trimming and Positional Significance of Transposon Insertion System to Identify Essential Genes in Yersinia pestis

NASA Astrophysics Data System (ADS)

Yang, Zheng Rong; Bullifent, Helen L.; Moore, Karen; Paszkiewicz, Konrad; Saint, Richard J.; Southern, Stephanie J.; Champion, Olivia L.; Senior, Nicola J.; Sarkar-Tyson, Mitali; Oyston, Petra C. F.; Atkins, Timothy P.; Titball, Richard W.

2017-02-01

Massively parallel sequencing technology coupled with saturation mutagenesis has provided new and global insights into gene functions and roles. At a simplistic level, the frequency of mutations within genes can indicate the degree of essentiality. However, this approach neglects to take account of the positional significance of mutations - the function of a gene is less likely to be disrupted by a mutation close to the distal ends. Therefore, a systematic bioinformatics approach to improve the reliability of essential gene identification is desirable. We report here a parametric model which introduces a novel mutation feature together with a noise trimming approach to predict the biological significance of Tn5 mutations. We show improved performance of essential gene prediction in the bacterium Yersinia pestis, the causative agent of plague. This method would have broad applicability to other organisms and to the identification of genes which are essential for competitiveness or survival under a broad range of stresses.
A Noise Trimming and Positional Significance of Transposon Insertion System to Identify Essential Genes in Yersinia pestis

PubMed Central

Yang, Zheng Rong; Bullifent, Helen L.; Moore, Karen; Paszkiewicz, Konrad; Saint, Richard J.; Southern, Stephanie J.; Champion, Olivia L.; Senior, Nicola J.; Sarkar-Tyson, Mitali; Oyston, Petra C. F.; Atkins, Timothy P.; Titball, Richard W.

2017-01-01

Massively parallel sequencing technology coupled with saturation mutagenesis has provided new and global insights into gene functions and roles. At a simplistic level, the frequency of mutations within genes can indicate the degree of essentiality. However, this approach neglects to take account of the positional significance of mutations - the function of a gene is less likely to be disrupted by a mutation close to the distal ends. Therefore, a systematic bioinformatics approach to improve the reliability of essential gene identification is desirable. We report here a parametric model which introduces a novel mutation feature together with a noise trimming approach to predict the biological significance of Tn5 mutations. We show improved performance of essential gene prediction in the bacterium Yersinia pestis, the causative agent of plague. This method would have broad applicability to other organisms and to the identification of genes which are essential for competitiveness or survival under a broad range of stresses. PMID:28165493
The Essential Gene EMB1611 Maintains Shoot Apical Meristem Function During Arabidopsis Development

USDA-ARS's Scientific Manuscript database

The Arabidopsis thaliana genome contains hundreds of genes essential for seed development. Because null mutations in these genes cause embryo lethality, their specific molecular and developmental functions are largely unknown. Here, we identify a role for EMB1611/MEE22, an essential gene in Arabidop...
Predicting Essential Genes and Proteins Based on Machine Learning and Network Topological Features: A Comprehensive Review

PubMed Central

Zhang, Xue; Acencio, Marcio Luis; Lemke, Ney

2016-01-01

Essential proteins/genes are indispensable to the survival or reproduction of an organism, and the deletion of such essential proteins will result in lethality or infertility. The identification of essential genes is very important not only for understanding the minimal requirements for survival of an organism, but also for finding human disease genes and new drug targets. Experimental methods for identifying essential genes are costly, time-consuming, and laborious. With the accumulation of sequenced genomes data and high-throughput experimental data, many computational methods for identifying essential proteins are proposed, which are useful complements to experimental methods. In this review, we show the state-of-the-art methods for identifying essential genes and proteins based on machine learning and network topological features, point out the progress and limitations of current methods, and discuss the challenges and directions for further research. PMID:27014079
Statistical Analysis of Hurst Exponents of Essential/Nonessential Genes in 33 Bacterial Genomes

PubMed Central

Liu, Xiao; Wang, Baojin; Xu, Luo

2015-01-01

Methods for identifying essential genes currently depend predominantly on biochemical experiments. However, there is demand for improved computational methods for determining gene essentiality. In this study, we used the Hurst exponent, a characteristic parameter to describe long-range correlation in DNA, and analyzed its distribution in 33 bacterial genomes. In most genomes (31 out of 33) the significance levels of the Hurst exponents of the essential genes were significantly higher than for the corresponding full-gene-set, whereas the significance levels of the Hurst exponents of the nonessential genes remained unchanged or increased only slightly. All of the Hurst exponents of essential genes followed a normal distribution, with one exception. We therefore propose that the distribution feature of Hurst exponents of essential genes can be used as a classification index for essential gene prediction in bacteria. For computer-aided design in the field of synthetic biology, this feature can build a restraint for pre- or post-design checking of bacterial essential genes. Moreover, considering the relationship between gene essentiality and evolution, the Hurst exponents could be used as a descriptive parameter related to evolutionary level, or be added to the annotation of each gene. PMID:26067107
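As a rough illustration of how a Hurst exponent can be estimated for a gene sequence, the Python sketch below uses classical rescaled-range (R/S) analysis. Both the estimator and the numeric encoding of DNA (purines as +1, pyrimidines as -1) are assumptions made for the example; the abstract does not say which estimator or encoding the authors used, and the sequence is invented.

```python
import math


def dna_to_walk(sequence):
    """Assumed encoding: purines (A, G) -> +1.0, pyrimidines (C, T) -> -1.0."""
    return [1.0 if base in "AG" else -1.0 for base in sequence.upper()]


def rescaled_range(values):
    """R/S: range of cumulative mean-adjusted sums over the standard deviation."""
    n = len(values)
    mean = sum(values) / n
    cumulative, running = [], 0.0
    for value in values:
        running += value - mean
        cumulative.append(running)
    spread = max(cumulative) - min(cumulative)
    std = math.sqrt(sum((value - mean) ** 2 for value in values) / n)
    return spread / std if std > 0 else 0.0


def hurst_exponent(values, window_sizes):
    """Least-squares slope of log(mean R/S) against log(window size)."""
    xs, ys = [], []
    for size in window_sizes:
        chunks = [values[i:i + size] for i in range(0, len(values) - size + 1, size)]
        rs_values = [r for r in (rescaled_range(chunk) for chunk in chunks) if r > 0]
        if rs_values:
            xs.append(math.log(size))
            ys.append(math.log(sum(rs_values) / len(rs_values)))
    if len(xs) < 2:
        raise ValueError("need at least two usable window sizes")
    x_mean, y_mean = sum(xs) / len(xs), sum(ys) / len(ys)
    covariance = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    variance = sum((x - x_mean) ** 2 for x in xs)
    return covariance / variance


# Hypothetical sequence, repeated only to make it long enough for the windows.
sequence = "ATGGCGTACGTTAGCCGATAGGCTAACGTTAGC" * 8
hurst = hurst_exponent(dna_to_walk(sequence), window_sizes=[8, 16, 32, 64])
print(f"Estimated Hurst exponent: {hurst:.3f}")
```

Under this estimator, values near 0.5 indicate no long-range correlation and values above 0.5 indicate persistent correlation. Comparing the per-gene estimates of essential genes with those of the full gene set is the kind of analysis the abstract describes.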
Identification of essential genes and synthetic lethal gene combinations in Escherichia coli K-12.

PubMed

Mori, Hirotada; Baba, Tomoya; Yokoyama, Katsushi; Takeuchi, Rikiya; Nomura, Wataru; Makishi, Kazuichi; Otsuka, Yuta; Dose, Hitomi; Wanner, Barry L

2015-01-01

Here we describe the systematic identification of single genes and gene pairs whose knockout causes lethality in Escherichia coli K-12. During construction of a precise single-gene knockout library of E. coli K-12, we identified 328 essential gene candidates for growth in complex (LB) medium. Upon establishment of the Keio single-gene deletion library, we undertook the development of the ASKA single-gene deletion library carrying a different antibiotic resistance. In addition, we developed tools for identification of synthetic lethal gene combinations by systematic construction of double-gene knockout mutants. We introduce these methods herein.
Systematic exploration of essential yeast gene function with temperature-sensitive mutants

PubMed Central

Li, Zhijian; Vizeacoumar, Franco J; Bahr, Sondra; Li, Jingjing; Warringer, Jonas; Vizeacoumar, Frederick S; Min, Renqiang; VanderSluis, Benjamin; Bellay, Jeremy; DeVit, Michael; Fleming, James A; Stephens, Andrew; Haase, Julian; Lin, Zhen-Yuan; Baryshnikova, Anastasia; Lu, Hong; Yan, Zhun; Jin, Ke; Barker, Sarah; Datti, Alessandro; Giaever, Guri; Nislow, Corey; Bulawa, Chris; Myers, Chad L; Costanzo, Michael; Gingras, Anne-Claude; Zhang, Zhaolei; Blomberg, Anders; Bloom, Kerry; Andrews, Brenda; Boone, Charles

2012-01-01

Conditional temperature-sensitive (ts) mutations are valuable reagents for studying essential genes in the yeast Saccharomyces cerevisiae. We constructed 787 ts strains, covering 497 (~45%) of the 1,101 essential yeast genes, with ~30% of the genes represented by multiple alleles. All of the alleles are integrated into their native genomic locus in the S288C common reference strain and are linked to a kanMX selectable marker, allowing further genetic manipulation by synthetic genetic array (SGA)–based, high-throughput methods. We show two such manipulations: barcoding of 440 strains, which enables chemical-genetic suppression analysis, and the construction of arrays of strains carrying different fluorescent markers of subcellular structure, which enables quantitative analysis of phenotypes using high-content screening. Quantitative analysis of a GFP-tubulin marker identified roles for cohesin and condensin genes in spindle disassembly. This mutant collection should facilitate a wide range of systematic studies aimed at understanding the functions of essential genes. PMID:21441928
A new computational strategy for predicting essential genes.

PubMed

Cheng, Jian; Wu, Wenwu; Zhang, Yinwen; Li, Xiangchen; Jiang, Xiaoqian; Wei, Gehong; Tao, Shiheng

2013-12-21

Determination of the minimum gene set for cellular life is one of the central goals in biology. Genome-wide essential gene identification has progressed rapidly in certain bacterial species; however, it remains difficult to achieve in most eukaryotic species. Several computational models have recently been developed to integrate gene features and used as alternatives to transfer gene essentiality annotations between organisms. We first collected features that were widely used by previous predictive models and assessed the relationships between gene features and gene essentiality using a stepwise regression model. We found two issues that could significantly reduce model accuracy: (i) the effect of multicollinearity among gene features and (ii) the diverse and even contrasting correlations between gene features and gene essentiality existing within and among different species. To address these issues, we developed a novel model called feature-based weighted Naïve Bayes model (FWM), which is based on Naïve Bayes classifiers, logistic regression, and genetic algorithm. The proposed model assesses features and filters out the effects of multicollinearity and diversity. The performance of FWM was compared with other popular models, such as support vector machine, Naïve Bayes model, and logistic regression model, by applying FWM to reciprocally predict essential genes among and within 21 species. Our results showed that FWM significantly improves the accuracy and robustness of essential gene prediction. FWM can remarkably improve the accuracy of essential gene prediction and may be used as an alternative method for other classification work. This method can contribute substantially to the knowledge of the minimum gene sets required for living organisms and the discovery of new drug targets.
Transcription factor genes essential for cell proliferation and replicative lifespan in budding yeast

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kamei, Yuka; Tai, Akiko; Dakeyama, Shota

Many of the lifespan-related genes have been identified in eukaryotes ranging from the yeast to human. However, there is limited information available on the longevity genes that are essential for cell proliferation. Here, we investigated whether the essential genes encoding DNA-binding transcription factors modulated the replicative lifespan of Saccharomyces cerevisiae. Heterozygous diploid knockout strains for FHL1, RAP1, REB1, and MCM1 genes showed significantly short lifespan. ¹H-nuclear magnetic resonance analysis indicated a characteristic metabolic profile in the Δfhl1/FHL1 mutant. These results strongly suggest that FHL1 regulates the transcription of lifespan related metabolic genes. Thus, heterozygous knockout strains could be the potential materials for discovering further novel lifespan genes.
Highlights:
• Involvement of yeast TF genes essential for cell growth in lifespan was evaluated.
• The essential TF genes, FHL1, RAP1, REB1, and MCM1, regulate replicative lifespan.
• Heterozygous deletion of FHL1 changes cellular metabolism related to lifespan.
A genomic approach to identify hybrid incompatibility genes.

PubMed

Cooper, Jacob C; Phadnis, Nitin

2016-07-02

Uncovering the genetic and molecular basis of barriers to gene flow between populations is key to understanding how new species are born. Intrinsic postzygotic reproductive barriers such as hybrid sterility and hybrid inviability are caused by deleterious genetic interactions known as hybrid incompatibilities. The difficulty in identifying these hybrid incompatibility genes remains a rate-limiting step in our understanding of the molecular basis of speciation. We recently described how whole genome sequencing can be applied to identify hybrid incompatibility genes, even from genetically terminal hybrids. Using this approach, we discovered a new hybrid incompatibility gene, gfzf, between Drosophila melanogaster and Drosophila simulans, and found that it plays an essential role in cell cycle regulation. Here, we discuss the history of the hunt for incompatibility genes between these species, discuss the molecular roles of gfzf in cell cycle regulation, and explore how intragenomic conflict drives the evolution of fundamental cellular mechanisms that lead to the developmental arrest of hybrids.
A genomic approach to identify hybrid incompatibility genes

PubMed Central

Cooper, Jacob C.; Phadnis, Nitin

2016-01-01

Uncovering the genetic and molecular basis of barriers to gene flow between populations is key to understanding how new species are born. Intrinsic postzygotic reproductive barriers such as hybrid sterility and hybrid inviability are caused by deleterious genetic interactions known as hybrid incompatibilities. The difficulty in identifying these hybrid incompatibility genes remains a rate-limiting step in our understanding of the molecular basis of speciation. We recently described how whole genome sequencing can be applied to identify hybrid incompatibility genes, even from genetically terminal hybrids. Using this approach, we discovered a new hybrid incompatibility gene, gfzf, between Drosophila melanogaster and Drosophila simulans, and found that it plays an essential role in cell cycle regulation. Here, we discuss the history of the hunt for incompatibility genes between these species, discuss the molecular roles of gfzf in cell cycle regulation, and explore how intragenomic conflict drives the evolution of fundamental cellular mechanisms that lead to the developmental arrest of hybrids. PMID:27230814
Functional requirements for bacteriophage growth: gene essentiality and expression in mycobacteriophage Giles.

PubMed

Dedrick, Rebekah M; Marinelli, Laura J; Newton, Gerald L; Pogliano, Kit; Pogliano, Joseph; Hatfull, Graham F

2013-05-01

Bacteriophages represent a majority of all life forms, and the vast, dynamic population with early origins is reflected in their enormous genetic diversity. A large number of bacteriophage genomes have been sequenced. They are replete with novel genes without known relatives. We know little about their functions, which genes are required for lytic growth, and how they are expressed. Furthermore, the diversity is such that even genes with required functions - such as virion proteins and repressors - cannot always be recognized. Here we describe a functional genomic dissection of mycobacteriophage Giles, in which the virion proteins are identified, genes required for lytic growth are determined, the repressor is identified, and the transcription patterns determined. We find that although all of the predicted phage genes are expressed either in lysogeny or in lytic growth, 45% of the predicted genes are non-essential for lytic growth. We also describe genes required for DNA replication, show that recombination is required for lytic growth, and that Giles encodes a novel repressor. RNAseq analysis reveals abundant expression of a small non-coding RNA in a lysogen and in late lytic growth, although it is non-essential for lytic growth and does not alter lysogeny. © 2013 Blackwell Publishing Ltd.
Genomewide Identification of Essential Genes and Fitness Determinants of Streptococcus mutans UA159

PubMed Central

Zeng, Lin; Culp, David J.

2018-01-01

Transposon mutagenesis coupled with next-generation DNA sequencing (Tn-seq) is a powerful tool for discovering regions of the genome that are required for the survival of bacteria in different environments. We adapted this technique to the dental caries pathogen Streptococcus mutans UA159 and identified 11% of the genome as essential, with many genes encoding products required for replication, translation, lipid metabolism, and cell wall biogenesis. Comparison of the essential genome of S. mutans UA159 with those of selected other streptococci for which such information is available revealed several metabolic pathways and genes that are required in S. mutans, but not in some Streptococcus spp. We further identified genes that are essential for sustained growth in rich or defined medium, as well as for persistence in vivo in a rodent model of oral infection. Collectively, our results provide a novel and comprehensive view of the genes required for essential processes of S. mutans, many of which could represent potential targets for therapeutics. IMPORTANCE: Tooth decay (dental caries) is a common cause of pain, impaired quality of life, and tooth loss in children and adults. It begins because of a compositional change in the microorganisms that colonize the tooth surface driven by repeated and sustained carbohydrate intake. Although several bacterial species are associated with tooth decay, Streptococcus mutans is the most common cause. Therefore, it is important to identify biological processes that contribute to the survival of S. mutans in the human mouth, with the aim of disrupting the processes with antimicrobial agents. We successfully applied Tn-seq to S. mutans, discovering genes that are required for survival, growth, and persistence, both in laboratory environments and in a mouse model of tooth decay. This work highlights new avenues for the control of an important human pathogen. PMID:29435491
Analysis of essential gene dynamics under antibiotic stress in Streptococcus sanguinis

PubMed Central

El-Rami, Fadi; Kong, Xiangzhen; Parikh, Hardik; Zhu, Bin; Stone, Victoria; Kitten, Todd; Xu, Ping

2018-01-01

The paradoxical response of Streptococcus sanguinis to drugs prescribed for dental and clinical practices has complicated treatment guidelines and raised the need for further investigation. We conducted a high throughput study on concomitant transcriptome and proteome dynamics in a time course to assess S. sanguinis behaviour under a sub-inhibitory concentration of ampicillin. Temporal changes at the transcriptome and proteome level were monitored to cover essential genes and proteins over a physiological map of intricate pathways. Our findings revealed that translation was the functional category in S. sanguinis that was most enriched in essential proteins. Moreover, essential proteins in this category demonstrated the greatest conservation across 2774 bacterial proteomes, in comparison to other essential functional categories like cell wall biosynthesis and energy production. In comparison to non-essential proteins, essential proteins were less likely to contain ‘degradation-prone’ amino acids at their N-terminal position, suggesting a longer half-life. Despite the ampicillin-induced stress, the transcriptional up-regulation of amino acid-tRNA synthetases and proteomic elevation of amino acid biosynthesis enzymes favoured the enriched components of essential proteins revealing ‘proteomic signatures’ that can be used to bridge the genotype–phenotype gap of S. sanguinis under ampicillin stress. Furthermore, we identified a significant correlation between the levels of mRNA and protein for essential genes and detected essential protein-enriched pathways differentially regulated through a persistent stress response pattern at late time points. We propose that the current findings will help characterize a bacterial model to study the dynamics of essential genes and proteins under clinically relevant stress conditions. PMID:29393020

Training set selection for the prediction of essential genes.

PubMed

Cheng, Jian; Xu, Zhao; Wu, Wenwu; Zhao, Li; Li, Xiangchen; Liu, Yanlin; Tao, Shiheng

2014-01-01

Various computational models have been developed to transfer annotations of gene essentiality between organisms. However, despite the increasing number of microorganisms with well-characterized sets of essential genes, selection of appropriate training sets for predicting the essential genes of poorly-studied or newly sequenced organisms remains challenging. In this study, a machine learning approach was applied reciprocally to predict the essential genes in 21 microorganisms. Results showed that training set selection greatly influenced predictive accuracy. We determined four criteria for training set selection: (1) essential genes in the selected training set should be reliable; (2) the growth conditions in which essential genes are defined should be consistent in training and prediction sets; (3) species used as training set should be closely related to the target organism; and (4) organisms used as training and prediction sets should exhibit similar phenotypes or lifestyles. We then analyzed the performance of an incomplete training set and an integrated training set with multiple organisms. We found that the size of the training set should be at least 10% of the total genes to yield accurate predictions. Additionally, the integrated training sets exhibited remarkable increase in stability and accuracy compared with single sets. Finally, we compared the performance of the integrated training sets with the four criteria and with random selection. The results revealed that a rational selection of training sets based on our criteria yields better performance than random selection. Thus, our results provide empirical guidance on training set selection for the identification of essential genes on a genome-wide scale.
Dissecting the Gene Network of Dietary Restriction to Identify Evolutionarily Conserved Pathways and New Functional Genes

PubMed Central

Wuttke, Daniel; Connor, Richard; Vora, Chintan; Craig, Thomas; Li, Yang; Wood, Shona; Vasieva, Olga; Shmookler Reis, Robert; Tang, Fusheng; de Magalhães, João Pedro

2012-01-01

Dietary restriction (DR), limiting nutrient intake from diet without causing malnutrition, delays the aging process and extends lifespan in multiple organisms. The conserved life-extending effect of DR suggests the involvement of fundamental mechanisms, although these remain a subject of debate. To help decipher the life-extending mechanisms of DR, we first compiled a list of genes that if genetically altered disrupt or prevent the life-extending effects of DR. We called these DR–essential genes and identified more than 100 in model organisms such as yeast, worms, flies, and mice. In order for other researchers to benefit from this first curated list of genes essential for DR, we established an online database called GenDR (http://genomics.senescence.info/diet/). To dissect the interactions of DR–essential genes and discover the underlying lifespan-extending mechanisms, we then used a variety of network and systems biology approaches to analyze the gene network of DR. We show that DR–essential genes are more conserved at the molecular level and have more molecular interactions than expected by chance. Furthermore, we employed a guilt-by-association method to predict novel DR–essential genes. In budding yeast, we predicted nine genes related to vacuolar functions; we show experimentally that mutations deleting eight of those genes prevent the life-extending effects of DR. Three of these mutants (OPT2, FRE6, and RCR2) had extended lifespan under ad libitum, indicating that the lack of further longevity under DR is not caused by a general compromise of fitness. These results demonstrate how network analyses of DR using GenDR can be used to make phenotypically relevant predictions. Moreover, gene-regulatory circuits reveal that the DR–induced transcriptional signature in yeast involves nutrient-sensing, stress responses and meiotic transcription factors. Finally, comparing the influence of gene expression changes during DR on the interactomes of multiple
Increased burden of deleterious variants in essential genes in autism spectrum disorder.

PubMed

Ji, Xiao; Kember, Rachel L; Brown, Christopher D; Bućan, Maja

2016-12-27

Autism spectrum disorder (ASD) is a heterogeneous, highly heritable neurodevelopmental syndrome characterized by impaired social interaction, communication, and repetitive behavior. It is estimated that hundreds of genes contribute to ASD. We asked if genes with a strong effect on survival and fitness contribute to ASD risk. Human orthologs of genes with an essential role in pre- and postnatal development in the mouse [essential genes (EGs)] are enriched for disease genes and under strong purifying selection relative to human orthologs of mouse genes with a known nonlethal phenotype [nonessential genes (NEGs)]. This intolerance to deleterious mutations, commonly observed haploinsufficiency, and the importance of EGs in development suggest a possible cumulative effect of deleterious variants in EGs on complex neurodevelopmental disorders. With a comprehensive catalog of 3,915 mammalian EGs, we provide compelling evidence for a stronger contribution of EGs to ASD risk compared with NEGs. By examining the exonic de novo and inherited variants from 1,781 ASD quartet families, we show a significantly higher burden of damaging mutations in EGs in ASD probands compared with their non-ASD siblings. The analysis of EGs in the developing brain identified clusters of coexpressed EGs implicated in ASD. Finally, we suggest a high-priority list of 29 EGs with potential ASD risk as targets for future functional and behavioral studies. Overall, we show that large-scale studies of gene function in model organisms provide a powerful approach for prioritization of genes and pathogenic variants identified by sequencing studies of human disease.
Increased burden of deleterious variants in essential genes in autism spectrum disorder

PubMed Central

Kember, Rachel L.; Brown, Christopher D.; Bućan, Maja

2016-01-01

Autism spectrum disorder (ASD) is a heterogeneous, highly heritable neurodevelopmental syndrome characterized by impaired social interaction, communication, and repetitive behavior. It is estimated that hundreds of genes contribute to ASD. We asked if genes with a strong effect on survival and fitness contribute to ASD risk. Human orthologs of genes with an essential role in pre- and postnatal development in the mouse [essential genes (EGs)] are enriched for disease genes and under strong purifying selection relative to human orthologs of mouse genes with a known nonlethal phenotype [nonessential genes (NEGs)]. This intolerance to deleterious mutations, commonly observed haploinsufficiency, and the importance of EGs in development suggest a possible cumulative effect of deleterious variants in EGs on complex neurodevelopmental disorders. With a comprehensive catalog of 3,915 mammalian EGs, we provide compelling evidence for a stronger contribution of EGs to ASD risk compared with NEGs. By examining the exonic de novo and inherited variants from 1,781 ASD quartet families, we show a significantly higher burden of damaging mutations in EGs in ASD probands compared with their non-ASD siblings. The analysis of EGs in the developing brain identified clusters of coexpressed EGs implicated in ASD. Finally, we suggest a high-priority list of 29 EGs with potential ASD risk as targets for future functional and behavioral studies. Overall, we show that large-scale studies of gene function in model organisms provide a powerful approach for prioritization of genes and pathogenic variants identified by sequencing studies of human disease. PMID:27956632
Identification of candidate genes for familial early-onset essential tremor.

PubMed

Liu, Xinmin; Hernandez, Nora; Kisselev, Sergey; Floratos, Aris; Sawle, Ashley; Ionita-Laza, Iuliana; Ottman, Ruth; Louis, Elan D; Clark, Lorraine N

2016-07-01

Essential tremor (ET) is one of the most common causes of tremor in humans. Despite its high heritability and prevalence, few susceptibility genes for ET have been identified. To identify ET genes, whole-exome sequencing was performed in 37 early-onset ET families with an autosomal-dominant inheritance pattern. We identified candidate genes for follow-up functional studies in five ET families. In two independent families, we identified variants predicted to affect function in the nitric oxide (NO) synthase 3 gene (NOS3) that cosegregated with disease. NOS3 is highly expressed in the central nervous system (including cerebellum), neurons and endothelial cells, and is one of three enzymes that convert L-arginine to the neurotransmitter NO. In one family, a heterozygous variant, c.46G>A (p.(Gly16Ser)), in NOS3, was identified in three affected ET cases and was absent in an unaffected family member; and in a second family, a heterozygous variant, c.164C>T (p.(Pro55Leu)), was identified in three affected ET cases (dizygotic twins and their mother). Both variants result in amino-acid substitutions of highly conserved amino-acid residues that are predicted to be deleterious and damaging by in silico analysis. In three independent families, variants predicted to affect function were also identified in other genes, including KCNS2 (KV9.2), HAPLN4 (BRAL2) and USP46. These genes are highly expressed in the cerebellum and Purkinje cells, and influence function of the gamma-aminobutyric acid (GABA)-ergic system. This is in concordance with recent evidence that the pathophysiological process in ET involves cerebellar dysfunction and possibly cerebellar degeneration with a reduction in Purkinje cells, and a decrease in GABA-ergic tone.
Promoter mapping of the mouse Tcp-10bt gene in transgenic mice identifies essential male germ cell regulatory sequences.

PubMed

Ewulonu, U K; Snyder, L; Silver, L M; Schimenti, J C

1996-03-01

Transgenic mice were generated to localize essential promoter elements in the mouse testis-expressed Tcp-10 genes. These genes are expressed exclusively in male germ cells, and exhibit a diffuse range of transcriptional start sites, possibly due to the absence of a TATA box. A series of transgene constructs containing different amounts of 5' flanking DNA revealed that all sequences necessary for appropriate temporal and tissue-specific transcription of Tcp-10 reside between positions -1 to -973. All transgenic animals containing these sequences expressed a chimeric transgene at high levels, in a pattern that paralleled the endogenous genes. These experiments further defined a 227 bp fragment from -746 to -973 that was absolutely essential for expression. In a gel-shift assay, this 227-bp fragment bound nuclear protein from testis, but not other tissues, to yield two retarded bands. Sequence analysis of this fragment revealed a half-site for the AP-2 transcription factor recognition sequence. Gel shift assays using native or mutant oligonucleotides demonstrated that the putative AP-2 recognition sequence was essential for generating the retarded bands. Since the binding activity is testis-specific, but AP-2 expression is not exclusive to male germ cells, it is possible that transcription of Tcp-10 requires interaction between AP-2 and a germ cell-specific transcription factor.
Prediction of essential proteins based on gene expression programming.

PubMed

Zhong, Jiancheng; Wang, Jianxin; Peng, Wei; Zhang, Zhen; Pan, Yi

2013-01-01

Essential proteins are indispensable for cell survival. Identifying essential proteins is important for improving our understanding of how a cell works. Various types of features are related to the essentiality of proteins, and many methods have been proposed that combine some of them to predict essential proteins. However, it remains a challenge to design an effective method that predicts essential proteins by integrating different features and that explains how the selected features determine protein essentiality. Gene expression programming (GEP) is a learning algorithm that learns relationships between variables in sets of data and then builds models to explain these relationships. In this work, we propose a GEP-based method to predict essential proteins by combining biological features and topological features. We carry out experiments on S. cerevisiae data. The experimental results show that our method achieves better prediction performance than methods using individual features. Moreover, our method outperforms some machine learning methods and performs as well as a method obtained by combining the outputs of eight machine learning methods. The accuracy of predicting essential proteins can be improved by using the GEP method to combine topological features and biological features.
A Rare SNP Identified a TCP Transcription Factor Essential for Tendril Development in Cucumber.

PubMed

Wang, Shenhao; Yang, Xueyong; Xu, Mengnan; Lin, Xingzhong; Lin, Tao; Qi, Jianjian; Shao, Guangjin; Tian, Nana; Yang, Qing; Zhang, Zhonghua; Huang, Sanwen

2015-12-07

Rare genetic variants are abundant in genomes but less tractable in genome-wide association study. Here we exploit a strategy of rare variation mapping to discover a gene essential for tendril development in cucumber (Cucumis sativus L.). In a collection of >3000 lines, we discovered a unique tendril-less line that forms branches instead of tendrils and, therefore, loses its climbing ability. We hypothesized that this unusual phenotype was caused by a rare variation and subsequently identified the causative single nucleotide polymorphism. The affected gene TEN encodes a TCP transcription factor conserved within the cucurbits and is expressed specifically in tendrils, representing a new organ identity gene. The variation occurs within a protein motif unique to the cucurbits and impairs its function as a transcriptional activator. Analyses of transcriptomes from near-isogenic lines identified downstream genes required for the tendril's capability to sense and climb a support. This study provides an example to explore rare functional variants in plant genomes. Copyright © 2015 The Author. Published by Elsevier Inc. All rights reserved.
Systems Biology-Based Investigation of Cellular Antiviral Drug Targets Identified by Gene-Trap Insertional Mutagenesis.

PubMed

Cheng, Feixiong; Murray, James L; Zhao, Junfei; Sheng, Jinsong; Zhao, Zhongming; Rubin, Donald H

2016-09-01

Viruses require host cellular factors for successful replication. A comprehensive systems-level investigation of the virus-host interactome is critical for understanding the roles of host factors with the end goal of discovering new druggable antiviral targets. Gene-trap insertional mutagenesis is a high-throughput forward genetics approach to randomly disrupt (trap) host genes and discover host genes that are essential for viral replication, but not for host cell survival. In this study, we used libraries of randomly mutagenized cells to discover cellular genes that are essential for the replication of 10 distinct cytotoxic mammalian viruses, 1 gram-negative bacterium, and 5 toxins. We report 712 candidate cellular genes that show distinct topological network and evolutionary signatures and occupy central hubs in the human interactome. Cell cycle phase-specific network analysis showed that host cell cycle programs played critical roles during viral replication (e.g. MYC and TAF4 regulating G0/1 phase). Moreover, the viral perturbation of host cellular networks reflected disease etiology in that host genes (e.g. CTCF, RHOA, and CDKN1B) identified were frequently essential and significantly associated with Mendelian and orphan diseases, or somatic mutations in cancer. A computational drug repositioning framework that incorporates drug-gene signatures from the Connectivity Map into the virus-host interactome identified 110 putative druggable antiviral targets and prioritized several existing drugs (e.g. ajmaline) with potential antiviral indications (e.g. anti-Ebola). In summary, this work provides a powerful methodology with a tight integration of gene-trap insertional mutagenesis testing and systems biology to identify new antiviral targets and drugs for the development of broadly acting and targeted clinical antiviral therapeutics.
Gene essentiality and the topology of protein interaction networks

PubMed Central

Coulomb, Stéphane; Bauer, Michel; Bernard, Denis; Marsolier-Kergoat, Marie-Claude

2005-01-01

The mechanistic bases for gene essentiality and for cell mutational resistance have long been disputed. The recent availability of large protein interaction databases has fuelled the analysis of protein interaction networks and several authors have proposed that gene dispensability could be strongly related to some topological parameters of these networks. However, many results were based on protein interaction data whose biases were not taken into account. In this article, we show that the essentiality of a gene in yeast is poorly related to the number of interactants (or degree) of the corresponding protein and that the physiological consequences of gene deletions are unrelated to several other properties of proteins in the interaction networks, such as the average degrees of their nearest neighbours, their clustering coefficients or their relative distances. We also found that yeast protein interaction networks lack degree correlation, i.e. a propensity for their vertices to associate according to their degrees. Gene essentiality and more generally cell resistance against mutations thus seem largely unrelated to many parameters of protein network topology. PMID:16087428
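The topological parameters named in this abstract (degree, clustering coefficient, average degree of nearest neighbours) can be illustrated on a toy network. The sketch below is only an illustration under stated assumptions: the five-protein network and all function names are hypothetical and are not taken from the study.

```python
from itertools import combinations

def build_graph(edges):
    """Build an undirected adjacency map from a list of (a, b) interactions."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph

def degree(graph, node):
    """Number of interactants of a protein."""
    return len(graph[node])

def clustering_coefficient(graph, node):
    """Fraction of neighbour pairs that also interact with each other."""
    neighbours = graph[node]
    k = len(neighbours)
    if k < 2:
        return 0.0
    links = sum(1 for u, v in combinations(neighbours, 2) if v in graph[u])
    return 2.0 * links / (k * (k - 1))

def mean_neighbour_degree(graph, node):
    """Average degree of the nearest neighbours of a protein."""
    neighbours = graph[node]
    if not neighbours:
        return 0.0
    return sum(len(graph[n]) for n in neighbours) / len(neighbours)

# Hypothetical toy interaction network (not real data).
edges = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C"), ("D", "E")]
graph = build_graph(edges)

for protein in sorted(graph):
    print(protein,
          degree(graph, protein),
          round(clustering_coefficient(graph, protein), 3),
          round(mean_neighbour_degree(graph, protein), 3))
```

In an analysis of the kind described, values such as these would be compared between essential and non-essential genes; the abstract reports that the relationship is weak in yeast.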
The Essential Genome of Escherichia coli K-12

PubMed Central

2018-01-01

Transposon-directed insertion site sequencing (TraDIS) is a high-throughput method coupling transposon mutagenesis with short-fragment DNA sequencing. It is commonly used to identify essential genes. Single gene deletion libraries are considered the gold standard for identifying essential genes. Currently, the TraDIS method has not been benchmarked against such libraries, and therefore, it remains unclear whether the two methodologies are comparable. To address this, a high-density transposon library was constructed in Escherichia coli K-12. Essential genes predicted from sequencing of this library were compared to existing essential gene databases. To decrease false-positive identification of essential genes, statistical data analysis included corrections for both gene length and genome length. Through this analysis, new essential genes and genes previously incorrectly designated essential were identified. We show that manual analysis of TraDIS data reveals novel features that would not have been detected by statistical analysis alone. Examples include short essential regions within genes, orientation-dependent effects, and fine-resolution identification of genome and protein features. Recognition of these insertion profiles in transposon mutagenesis data sets will assist genome annotation of less well characterized genomes and provides new insights into bacterial physiology and biochemistry. PMID:29463657
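The abstract mentions correcting for gene length but does not give the statistic used. As a rough illustration only, a length-normalized insertion density per gene could be computed as below; the gene coordinates, insertion sites, and function name are hypothetical, and this is not presented as the authors' analysis.

```python
def insertion_density(gene_start, gene_end, insertion_sites):
    """Transposon insertions inside a gene, divided by gene length in bp."""
    length = gene_end - gene_start + 1
    hits = sum(1 for pos in insertion_sites if gene_start <= pos <= gene_end)
    return hits / length

# Hypothetical gene coordinates and insertion positions (not real data).
genes = {"geneA": (1, 1000), "geneB": (1001, 1500)}
sites = [1100, 1150, 1200, 1300, 1450, 1499, 2000]

densities = {
    name: insertion_density(start, end, sites)
    for name, (start, end) in genes.items()
}

for name, value in densities.items():
    print(name, value)
```

Dividing by length keeps long and short genes comparable. A gene with no or very few insertions relative to its length, such as the hypothetical geneA here, would be a candidate for essentiality pending proper statistical testing.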
The Essential Genome of Escherichia coli K-12.

PubMed

Goodall, Emily C A; Robinson, Ashley; Johnston, Iain G; Jabbari, Sara; Turner, Keith A; Cunningham, Adam F; Lund, Peter A; Cole, Jeffrey A; Henderson, Ian R

2018-02-20

Transposon-directed insertion site sequencing (TraDIS) is a high-throughput method coupling transposon mutagenesis with short-fragment DNA sequencing. It is commonly used to identify essential genes. Single gene deletion libraries are considered the gold standard for identifying essential genes. Currently, the TraDIS method has not been benchmarked against such libraries, and therefore, it remains unclear whether the two methodologies are comparable. To address this, a high-density transposon library was constructed in Escherichia coli K-12. Essential genes predicted from sequencing of this library were compared to existing essential gene databases. To decrease false-positive identification of essential genes, statistical data analysis included corrections for both gene length and genome length. Through this analysis, new essential genes and genes previously incorrectly designated essential were identified. We show that manual analysis of TraDIS data reveals novel features that would not have been detected by statistical analysis alone. Examples include short essential regions within genes, orientation-dependent effects, and fine-resolution identification of genome and protein features. Recognition of these insertion profiles in transposon mutagenesis data sets will assist genome annotation of less well characterized genomes and provides new insights into bacterial physiology and biochemistry. IMPORTANCE Incentives to define lists of genes that are essential for bacterial survival include the identification of potential targets for antibacterial drug development, genes required for rapid growth for exploitation in biotechnology, and discovery of new biochemical pathways. To identify essential genes in Escherichia coli, we constructed a transposon mutant library of unprecedented density. Initial automated analysis of the resulting data revealed many discrepancies compared to the literature. We now report more extensive statistical analysis supported by both
Clustering approaches to identifying gene expression patterns from DNA microarray data.

PubMed

Do, Jin Hwan; Choi, Dong-Kug

2008-04-30

Analysis of microarray data is essential for interpreting large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weaknesses and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.
Functional Study of Genes Essential for Autogamy and Nuclear Reorganization in Paramecium

PubMed Central

Nowak, Jacek K.; Gromadka, Robert; Juszczuk, Marek; Jerka-Dziadosz, Maria; Maliszewska, Kamila; Mucchielli, Marie-Hélène; Gout, Jean-François; Arnaiz, Olivier; Agier, Nicolas; Tang, Thomas; Aggerbeck, Lawrence P.; Cohen, Jean; Delacroix, Hervé; Sperling, Linda; Herbert, Christopher J.; Zagulski, Marek; Bétermier, Mireille

2011-01-01

Like all ciliates, Paramecium tetraurelia is a unicellular eukaryote that harbors two kinds of nuclei within its cytoplasm. At each sexual cycle, a new somatic macronucleus (MAC) develops from the germ line micronucleus (MIC) through a sequence of complex events, which includes meiosis, karyogamy, and assembly of the MAC genome from MIC sequences. The latter process involves developmentally programmed genome rearrangements controlled by noncoding RNAs and a specialized RNA interference machinery. We describe our first attempts to identify genes and biological processes that contribute to the progression of the sexual cycle. Given the high percentage of unknown genes annotated in the P. tetraurelia genome, we applied a global strategy to monitor gene expression profiles during autogamy, a self-fertilization process. We focused this pilot study on the genes carried by the largest somatic chromosome and designed dedicated DNA arrays covering 484 genes from this chromosome (1.2% of all genes annotated in the genome). Transcriptome analysis revealed four major patterns of gene expression, including two successive waves of gene induction. Functional analysis of 15 upregulated genes revealed four that are essential for vegetative growth, one of which is involved in the maintenance of MAC integrity and another in cell division or membrane trafficking. Two additional genes, encoding a MIC-specific protein and a putative RNA helicase localizing to the old and then to the new MAC, are specifically required during sexual processes. Our work provides a proof of principle that genes essential for meiosis and nuclear reorganization can be uncovered following genome-wide transcriptome analysis. PMID:21257794
ICan: an integrated co-alteration network to identify ovarian cancer-related genes.

PubMed

Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

2015-01-01

Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data.
Silencing of Essential Genes within a Highly Coordinated Operon in Escherichia coli

PubMed Central

Hohmeier, Angela; Stone, Timothy C.; Offord, Victoria; Sarabia, Francisco; Garcia-Ruiz, Cristina; Good, Liam

2015-01-01

Essential bacterial genes located within operons are particularly challenging to study independently because of coordinated gene expression and the nonviability of knockout mutants. Essentiality scores for many operon genes remain uncertain. Antisense RNA (asRNA) silencing or in-frame gene disruption of genes may help establish essentiality but can lead to polar effects on genes downstream or upstream of the target gene. Here, the Escherichia coli ribF-ileS-lspA-fkpB-ispH operon was used to evaluate the possibility of independently studying an essential gene using expressed asRNA and target gene overexpression to deregulate coupled expression. The gene requirement for growth in conditional silencing strains was determined by the relationship of target mRNA reduction with growth inhibition as the minimum transcript level required for 50% growth (MTL50). Mupirocin and globomycin, the protein inhibitors of IleS and LspA, respectively, were used in sensitization assays of strains containing both asRNA-expressing and open reading frame-expressing plasmids to examine deregulation of the overlapping ileS-lspA genes. We found upstream and downstream polar silencing effects when either ileS or lspA was silenced, indicating coupled expression. Weighted MTL50 values (means and standard deviations) of ribF, ileS, and lspA were 0.65 ± 0.18, 0.64 ± 0.06, and 0.76 ± 0.10, respectively. However, they were not significantly different (P = 0.71 by weighted one-way analysis of variance). The gene requirement for ispH could not be determined due to insufficient growth reduction. Mupirocin and globomycin sensitization experiments indicated that ileS-lspA expression could not be decoupled. The results highlight the inherent challenges associated with genetic analyses of operons; however, coupling of essential genes may provide opportunities to improve RNA-silencing antimicrobials. PMID:26070674
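The MTL50 described above is the transcript level at which growth falls to 50%. One simple way to estimate such a value from paired measurements is linear interpolation, sketched below. The interpolation approach, the data points, and the function name are assumptions made for illustration; the abstract does not state how the authors fitted the relationship.

```python
def mtl50(points):
    """Estimate the relative transcript level giving 50% relative growth.

    points: iterable of (relative_transcript_level, relative_growth) pairs,
    both expressed as fractions of the unsilenced control.
    Returns None when growth never crosses 0.5 in the measured range.
    """
    pts = sorted(points)
    for (x0, y0), (x1, y1) in zip(pts, pts[1:]):
        if y0 <= 0.5 <= y1 and y1 != y0:
            # Linear interpolation between the two bracketing measurements.
            return x0 + (0.5 - y0) * (x1 - x0) / (y1 - y0)
    return None

# Hypothetical silencing series (not data from the study).
series = [(0.2, 0.1), (0.5, 0.3), (0.7, 0.6), (1.0, 1.0)]
estimate = mtl50(series)
print(estimate)

# A series in which growth never drops to 50% yields no estimate,
# analogous to a gene whose requirement cannot be determined.
no_crossing = mtl50([(0.2, 0.8), (1.0, 1.0)])
print(no_crossing)
```

The sketch assumes that growth rises monotonically with transcript level across the measured range; noisy real data would need a fitted curve rather than point-to-point interpolation.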
The Goddard and Saturn Genes Are Essential for Drosophila Male Fertility and May Have Arisen De Novo

PubMed Central

Gubala, Anna M.; Schmitz, Jonathan F.; Kearns, Michael J.; Vinh, Tery T.; Bornberg-Bauer, Erich; Wolfner, Mariana F.

2017-01-01

New genes arise through a variety of mechanisms, including the duplication of existing genes and the de novo birth of genes from noncoding DNA sequences. While there are numerous examples of duplicated genes with important functional roles, the functions of de novo genes remain largely unexplored. Many newly evolved genes are expressed in the male reproductive tract, suggesting that these evolutionary innovations may provide advantages to males experiencing sexual selection. Using testis-specific RNA interference, we screened 11 putative de novo genes in Drosophila melanogaster for effects on male fertility and identified two, goddard and saturn, that are essential for spermatogenesis and sperm function. Goddard knockdown (KD) males fail to produce mature sperm, while saturn KD males produce few sperm, and these function inefficiently once transferred to females. Consistent with a de novo origin, both genes are identifiable only in Drosophila and are predicted to encode proteins with no sequence similarity to any annotated protein. However, since high levels of divergence prevented the unambiguous identification of the noncoding sequences from which each gene arose, we consider goddard and saturn to be putative de novo genes. Within Drosophila, both genes have been lost in certain lineages, but show conserved, male-specific patterns of expression in the species in which they are found. Goddard is consistently found in single-copy and evolves under purifying selection. In contrast, saturn has diversified through gene duplication and positive selection. These data suggest that de novo genes can acquire essential roles in male reproduction. PMID:28104747
An integrative machine learning strategy for improved prediction of essential genes in Escherichia coli metabolism using flux-coupled features.

PubMed

Nandi, Sutanu; Subramanian, Abhishek; Sarkar, Ram Rup

2017-07-25

Prediction of essential genes helps to identify a minimal set of genes that are absolutely required for the appropriate functioning and survival of a cell. The available machine learning techniques for essential gene prediction have inherent problems, like imbalanced provision of training datasets, biased choice of the best model for a given balanced dataset, choice of a complex machine learning algorithm, and data-based automated selection of biologically relevant features for classification. Here, we propose a simple support vector machine-based learning strategy for the prediction of essential genes in Escherichia coli K-12 MG1655 metabolism that integrates a non-conventional combination of an appropriate sample balanced training set, a unique organism-specific genotype, phenotype attributes that characterize essential genes, and optimal parameters of the learning algorithm to generate the best machine learning model (the model with the highest accuracy among all the models trained for different sample training sets). For the first time, we also introduce flux-coupled metabolic subnetwork-based features for enhancing the classification performance. Our strategy proves to be superior as compared to previous SVM-based strategies in obtaining a biologically relevant classification of genes with high sensitivity and specificity. This methodology was also trained with datasets of other recent supervised classification techniques for essential gene classification and tested using reported test datasets. The testing accuracy was always high as compared to the known techniques, proving that our method outperforms known methods. Observations from our study indicate that essential genes are conserved among homologous bacterial species, demonstrate high codon usage bias, GC content and gene expression, and predominantly possess a tendency to form physiological flux modules in metabolism.
ICan: An Integrated Co-Alteration Network to Identify Ovarian Cancer-Related Genes

PubMed Central

Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

2015-01-01

Background: Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. Results: We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). Conclusion: In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data. PMID:25803614
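The area under the curve values quoted above summarize how well a method ranks known cancer genes above other genes. A minimal sketch of that calculation, using the pairwise-ranking form of the ROC AUC, is shown below; the scores and the function name are hypothetical and are unrelated to the ICan or CNAmet outputs.

```python
def auc(scores_pos, scores_neg):
    """ROC AUC as the probability that a positive outranks a negative.

    Ties between a positive and a negative score count as half a win.
    """
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))

# Hypothetical prediction scores (not data from the study).
known_cancer_genes = [0.9, 0.8, 0.4]
other_genes = [0.7, 0.3, 0.2]

value = auc(known_cancer_genes, other_genes)
print(round(value, 4))
```

An AUC near 0.5 indicates ranking no better than chance, and a value near 1.0 indicates that nearly every known positive is scored above nearly every negative.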
Silencing of Essential Genes within a Highly Coordinated Operon in Escherichia coli.

PubMed

Goh, Shan; Hohmeier, Angela; Stone, Timothy C; Offord, Victoria; Sarabia, Francisco; Garcia-Ruiz, Cristina; Good, Liam

2015-08-15

Essential bacterial genes located within operons are particularly challenging to study independently because of coordinated gene expression and the nonviability of knockout mutants. Essentiality scores for many operon genes remain uncertain. Antisense RNA (asRNA) silencing or in-frame gene disruption of genes may help establish essentiality but can lead to polar effects on genes downstream or upstream of the target gene. Here, the Escherichia coli ribF-ileS-lspA-fkpB-ispH operon was used to evaluate the possibility of independently studying an essential gene using expressed asRNA and target gene overexpression to deregulate coupled expression. The gene requirement for growth in conditional silencing strains was determined by the relationship of target mRNA reduction with growth inhibition as the minimum transcript level required for 50% growth (MTL50). Mupirocin and globomycin, the protein inhibitors of IleS and LspA, respectively, were used in sensitization assays of strains containing both asRNA-expressing and open reading frame-expressing plasmids to examine deregulation of the overlapping ileS-lspA genes. We found upstream and downstream polar silencing effects when either ileS or lspA was silenced, indicating coupled expression. Weighted MTL50 values (means and standard deviations) of ribF, ileS, and lspA were 0.65 ± 0.18, 0.64 ± 0.06, and 0.76 ± 0.10, respectively. However, they were not significantly different (P = 0.71 by weighted one-way analysis of variance). The gene requirement for ispH could not be determined due to insufficient growth reduction. Mupirocin and globomycin sensitization experiments indicated that ileS-lspA expression could not be decoupled. The results highlight the inherent challenges associated with genetic analyses of operons; however, coupling of essential genes may provide opportunities to improve RNA-silencing antimicrobials. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
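The MTL50 described in this abstract relates target transcript level to growth. A minimal sketch of one way to estimate such a value follows, assuming paired measurements expressed as fractions of an unsilenced control and simple linear interpolation between neighbouring points; the abstract does not state the authors' fitting procedure, and the function name and data are hypothetical.

```python
def mtl50(points, threshold=0.5):
    """Estimate the transcript level at which relative growth equals threshold.

    points: iterable of (relative_transcript_level, relative_growth) pairs,
    both as fractions of the unsilenced control.
    Returns None if growth never reaches or crosses the threshold.
    """
    pts = sorted(points)  # ascending transcript level
    for (x0, y0), (x1, y1) in zip(pts, pts[1:]):
        if y0 == threshold:
            return x0
        if (y0 - threshold) * (y1 - threshold) < 0:
            # Linear interpolation between the two bracketing measurements.
            return x0 + (threshold - y0) * (x1 - x0) / (y1 - y0)
    if pts and pts[-1][1] == threshold:
        return pts[-1][0]
    return None


# Hypothetical silencing series: growth drops as transcript level drops.
series = [(0.2, 0.1), (0.5, 0.3), (0.7, 0.6), (1.0, 1.0)]
estimate = mtl50(series)

# Hypothetical series in which growth is never reduced to 50%.
undetermined = mtl50([(0.5, 0.9), (1.0, 1.0)])
```

The second call returns `None`, mirroring the situation reported for ispH, where growth reduction was insufficient to determine the requirement.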

Discovery of rice essential genes by characterizing a CRISPR-edited mutation of closely related rice MAP kinase genes.

PubMed

Minkenberg, Bastian; Xie, Kabin; Yang, Yinong

2017-02-01

The clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated protein 9 nuclease (Cas9) system depends on a guide RNA (gRNA) to specify its target. By efficiently co-expressing multiple gRNAs that target different genomic sites, the polycistronic tRNA-gRNA gene (PTG) strategy enables multiplex gene editing in the family of closely related mitogen-activated protein kinase (MPK) genes in Oryza sativa (rice). In this study, we identified MPK1 and MPK6 (Arabidopsis AtMPK6 and AtMPK4 orthologs, respectively) as essential genes for rice development by finding the preservation of MPK functional alleles and normal phenotypes in CRISPR-edited mutants. The true knock-out mutants of MPK1 were severely dwarfed and sterile, and homozygous mpk1 seeds from heterozygous parents were defective in embryo development. By contrast, heterozygous mpk6 mutant plants completely failed to produce homozygous mpk6 seeds. In addition, the functional importance of specific MPK features could be evaluated by characterizing CRISPR-induced allelic variation in the conserved kinase domain of MPK6. By simultaneously targeting between two and eight genomic sites in the closely related MPK genes, we demonstrated 45-86% frequency of biallelic mutations and the successful creation of single, double and quadruple gene mutants. Indels and fragment deletion were both stably inherited to the next generations, and transgene-free mutants of rice MPK genes were readily obtained via genetic segregation, thereby eliminating any positional effects of transgene insertions. Taken together, our study reveals the essentiality of MPK1 and MPK6 in rice development, and enables the functional discovery of previously inaccessible genes or domains with phenotypes masked by lethality or redundancy. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.
The Goddard and Saturn Genes Are Essential for Drosophila Male Fertility and May Have Arisen De Novo.

PubMed

Gubala, Anna M; Schmitz, Jonathan F; Kearns, Michael J; Vinh, Tery T; Bornberg-Bauer, Erich; Wolfner, Mariana F; Findlay, Geoffrey D

2017-05-01

New genes arise through a variety of mechanisms, including the duplication of existing genes and the de novo birth of genes from noncoding DNA sequences. While there are numerous examples of duplicated genes with important functional roles, the functions of de novo genes remain largely unexplored. Many newly evolved genes are expressed in the male reproductive tract, suggesting that these evolutionary innovations may provide advantages to males experiencing sexual selection. Using testis-specific RNA interference, we screened 11 putative de novo genes in Drosophila melanogaster for effects on male fertility and identified two, goddard and saturn, that are essential for spermatogenesis and sperm function. Goddard knockdown (KD) males fail to produce mature sperm, while saturn KD males produce few sperm, and these function inefficiently once transferred to females. Consistent with a de novo origin, both genes are identifiable only in Drosophila and are predicted to encode proteins with no sequence similarity to any annotated protein. However, since high levels of divergence prevented the unambiguous identification of the noncoding sequences from which each gene arose, we consider goddard and saturn to be putative de novo genes. Within Drosophila, both genes have been lost in certain lineages, but show conserved, male-specific patterns of expression in the species in which they are found. Goddard is consistently found in single-copy and evolves under purifying selection. In contrast, saturn has diversified through gene duplication and positive selection. These data suggest that de novo genes can acquire essential roles in male reproduction. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
High-throughput analysis of Yersinia pseudotuberculosis gene essentiality in optimised in vitro conditions, and implications for the speciation of Yersinia pestis.

PubMed

Willcocks, Samuel J; Stabler, Richard A; Atkins, Helen S; Oyston, Petra F; Wren, Brendan W

2018-05-31

Yersinia pseudotuberculosis is a zoonotic pathogen, causing mild gastrointestinal infection in humans. From this comparatively benign pathogenic species emerged the highly virulent plague bacillus, Yersinia pestis, which has experienced significant genetic divergence in a relatively short time span. Much of our knowledge of Yersinia spp. evolution stems from genomic comparison and gene expression studies. Here we apply transposon-directed insertion site sequencing (TraDIS) to describe the essential gene set of Y. pseudotuberculosis IP32953 in optimised in vitro growth conditions, and contrast these with the published essential genes of Y. pestis. The essential genes of an organism are the core genetic elements required for basic survival processes in a given growth condition, and are therefore attractive targets for antimicrobials. One such gene we identified is yptb3665, which encodes a peptide deformylase, and here we report, for the first time, the sensitivity of Y. pseudotuberculosis to actinonin, a deformylase inhibitor. Comparison of the essential genes of Y. pseudotuberculosis with those of Y. pestis revealed the genes whose importance is shared by both species, as well as genes that were differentially required for growth. In particular, we find that the two species uniquely rely upon different iron acquisition and respiratory metabolic pathways under similar in vitro conditions. The genes uniquely essential to each of the closely related Yersinia spp. represent some of the fundamental, species-defining points of divergence that arose during the evolution of Y. pestis from its ancestor. Furthermore, the shared essential genes represent ideal candidates for the development of novel antimicrobials against both species.
Neuron-specific feeding RNAi in C. elegans and its use in a screen for essential genes required for GABA neuron function.

PubMed

Firnhaber, Christopher; Hammarlund, Marc

2013-11-01

Forward genetic screens are important tools for exploring the genetic requirements for neuronal function. However, conventional forward screens often have difficulty identifying genes whose relevant functions are masked by pleiotropy. In particular, if loss of gene function results in sterility, lethality, or other severe pleiotropy, neuronal-specific functions cannot be readily analyzed. Here we describe a method in C. elegans for generating cell-specific knockdown in neurons using feeding RNAi and its application in a screen for the role of essential genes in GABAergic neurons. We combine manipulations that increase the sensitivity of select neurons to RNAi with manipulations that block RNAi in other cells. We produce animal strains in which feeding RNAi results in restricted gene knockdown in either GABA-, acetylcholine-, dopamine-, or glutamate-releasing neurons. In these strains, we observe neuron cell-type specific behavioral changes when we knock down genes required for these neurons to function, including genes encoding the basal neurotransmission machinery. These reagents enable high-throughput, cell-specific knockdown in the nervous system, facilitating rapid dissection of the site of gene action and screening for neuronal functions of essential genes. Using the GABA-specific RNAi strain, we screened 1,320 RNAi clones targeting essential genes on chromosomes I, II, and III for their effect on GABA neuron function. We identified 48 genes whose GABA cell-specific knockdown resulted in reduced GABA motor output. This screen extends our understanding of the genetic requirements for continued neuronal function in a mature organism.
Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes.

PubMed

Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

2012-02-01

The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information.
Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes

PubMed Central

Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

2012-01-01

The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information. PMID:22384404
Mutagenesis Screen Identifies agtpbp1 and eps15L1 as Essential for T lymphocyte Development in Zebrafish.

PubMed

Seiler, Christoph; Gebhart, Nichole; Zhang, Yong; Shinton, Susan A; Li, Yue-sheng; Ross, Nicola L; Liu, Xingjun; Li, Qin; Bilbee, Alison N; Varshney, Gaurav K; LaFave, Matthew C; Burgess, Shawn M; Balciuniene, Jorune; Balciunas, Darius; Hardy, Richard R; Kappes, Dietmar J; Wiest, David L; Rhodes, Jennifer

2015-01-01

Genetic screens are a powerful tool to discover genes that are important in immune cell development and function. The evolutionarily conserved development of lymphoid cells paired with the genetic tractability of zebrafish make this a powerful model system for this purpose. We used a Tol2-based gene-breaking transposon to induce mutations in the zebrafish (Danio rerio, AB strain) genome, which served the dual purpose of fluorescently tagging cells and tissues that express the disrupted gene and provided a means of identifying the disrupted gene. We identified 12 lines in which hematopoietic tissues expressed green fluorescent protein (GFP) during embryonic development, as detected by microscopy. Subsequent analysis of young adult fish, using a novel approach in which single cell suspensions of whole fish were analyzed by flow cytometry, revealed that 8 of these lines also exhibited GFP expression in young adult cells. An additional 15 lines that did not have embryonic GFP+ hematopoietic tissue by microscopy, nevertheless exhibited GFP+ cells in young adults. RT-PCR analysis of purified GFP+ populations for expression of T and B cell-specific markers identified 18 lines in which T and/or B cells were fluorescently tagged at 6 weeks of age. As transposon insertion is expected to cause gene disruption, these lines can be used to assess the requirement for the disrupted genes in immune cell development. Focusing on the lines with embryonic GFP+ hematopoietic tissue, we identified three lines in which homozygous mutants exhibited impaired T cell development at 6 days of age. In two of the lines we identified the disrupted genes, agtpbp1 and eps15L1. Morpholino-mediated knockdown of these genes mimicked the T cell defects in the corresponding mutant embryos, demonstrating the previously unrecognized, essential roles of agtpbp1 and eps15L1 in T cell development.
NIH Researchers Identify OCD Risk Gene

MedlinePlus

... News From NIH (Summer 2006) ... and Alcoholism (NIAAA) have identified a previously unknown gene variant that doubles an individual's risk for obsessive- ...
The ciliopathy gene Rpgrip1l is essential for hair follicle development.

PubMed

Chen, Jiang; Laclef, Christine; Moncayo, Alejandra; Snedecor, Elizabeth R; Yang, Ning; Li, Li; Takemaru, Ken-Ichi; Paus, Ralf; Schneider-Maunoury, Sylvie; Clark, Richard A

2015-03-01

The primary cilium is essential for skin morphogenesis through regulating the Notch, Wnt, and hedgehog signaling pathways. Prior studies on the functions of primary cilia in the skin were based on the investigations of genes that are essential for cilium formation. However, none of these ciliogenic genes has been linked to ciliopathy, a group of disorders caused by abnormal formation or function of cilia. To determine whether there is a genetic and molecular link between ciliopathies and skin morphogenesis, we investigated the role of RPGRIP1L, a gene mutated in Joubert (JBTS) and Meckel (MKS) syndromes, two severe forms of ciliopathy, in the context of skin development. We found that RPGRIP1L is essential for hair follicle morphogenesis. Specifically, disrupting the Rpgrip1l gene in mice resulted in reduced proliferation and differentiation of follicular keratinocytes, leading to hair follicle developmental defects. These defects were associated with significantly decreased primary cilium formation and attenuated hedgehog signaling. In contrast, we found that hair follicle induction and polarization and the development of interfollicular epidermis were unaffected. This study indicates that RPGRIP1L, a ciliopathy gene, is essential for hair follicle morphogenesis likely through regulating primary cilia formation and the hedgehog signaling pathway.
High-Throughput Screening to Identify Regulators of Meiosis-Specific Gene Expression in Saccharomyces cerevisiae.

PubMed

Kassir, Yona

2017-01-01

Meiosis and gamete formation are processes that are essential for sexual reproduction in all eukaryotic organisms. Multiple intracellular and extracellular signals feed into pathways that converge on transcription factors that induce the expression of meiosis-specific genes. Once triggered, the meiosis-specific gene expression program proceeds in a cascade that drives progress through the events of meiosis and gamete formation. Meiosis-specific gene expression is tightly controlled by a balance of positive and negative regulatory factors that respond to a plethora of signaling pathways. The budding yeast Saccharomyces cerevisiae has proven to be an outstanding model for the dissection of gametogenesis owing to the sophisticated genetic manipulations that can be performed with the cells. It is possible to use a variety of selection and screening methods to identify genes and their functions. High-throughput screening technology has been developed to allow an array of all viable yeast gene deletion mutants to be screened for phenotypes and for regulators of gene expression. This chapter describes a protocol that has been used to screen a library of homozygous diploid yeast deletion strains to identify regulators of the meiosis-specific IME1 gene.
Microarray and differential display identify genes involved in jasmonate-dependent anther development.

PubMed

Mandaokar, Ajin; Kumar, V Dinesh; Amway, Matt; Browse, John

2003-07-01

Jasmonate (JA) is a signaling compound essential for anther development and pollen fertility in Arabidopsis. Mutations that block the pathway of JA synthesis result in male sterility. To understand the processes of anther and pollen maturation, we used microarray and differential display approaches to compare gene expression patterns in anthers of wild-type Arabidopsis and the male-sterile mutant, opr3. The microarray experiment revealed 25 genes that were up-regulated more than 1.8-fold in wild-type anthers as compared to mutant anthers. Experiments based on differential display identified 13 additional genes up-regulated in wild-type anthers compared to opr3 for a total of 38 differentially expressed genes. Searches of the Arabidopsis and non-redundant databases disclosed known or likely functions for 28 of the 38 genes identified, while 10 genes encode proteins of unknown function. Northern blot analysis using eight representative clones as probes confirmed low expression in opr3 anthers compared with wild-type anthers. JA responsiveness of these same genes was also investigated by northern blot analysis of anther RNA isolated from wild-type and opr3 plants. In these experiments, four genes were induced in opr3 anthers within 0.5-1 h of JA treatment while the remaining genes were up-regulated only 1-8 h after JA application. None of these genes was induced by JA in anthers of the coi1 mutant that is deficient in JA responsiveness. The four early-induced genes in opr3 encode lipoxygenase, a putative bHLH transcription factor, epithiospecifier protein and an unknown protein. We propose that these and other early components may be involved in JA signaling and in the initiation of developmental processes. The four late genes encode an extensin-like protein, a peptide transporter and two unknown proteins, which may represent components required later in anther and pollen maturation. Transcript profiling has provided a successful approach to identify genes involved in
An Evolutionary Genomic Approach to Identify Genes Involved in Human Birth Timing

PubMed Central

Orabona, Guilherme; Morgan, Thomas; Haataja, Ritva; Hallman, Mikko; Puttonen, Hilkka; Menon, Ramkumar; Kuczynski, Edward; Norwitz, Errol; Snegovskikh, Victoria; Palotie, Aarno; Fellman, Vineta; DeFranco, Emily A.; Chaudhari, Bimal P.; McGregor, Tracy L.; McElroy, Jude J.; Oetjens, Matthew T.; Teramo, Kari; Borecki, Ingrid; Fay, Justin; Muglia, Louis

2011-01-01

Coordination of fetal maturation with birth timing is essential for mammalian reproduction. In humans, preterm birth is a disorder of profound global health significance. The signals initiating parturition in humans have remained elusive, due to divergence in physiological mechanisms between humans and model organisms typically studied. Because of relatively large human head size and narrow birth canal cross-sectional area compared to other primates, we hypothesized that genes involved in parturition would display accelerated evolution along the human and/or higher primate phylogenetic lineages to decrease the length of gestation and promote delivery of a smaller fetus that transits the birth canal more readily. Further, we tested whether current variation in such accelerated genes contributes to preterm birth risk. Evidence from allometric scaling of gestational age suggests human gestation has been shortened relative to other primates. Consistent with our hypothesis, many genes involved in reproduction show human acceleration in their coding or adjacent noncoding regions. We screened >8,400 SNPs in 150 human accelerated genes in 165 Finnish preterm and 163 control mothers for association with preterm birth. In this cohort, the most significant association was in FSHR, and 8 of the 10 most significant SNPs were in this gene. Further evidence for association of a linkage disequilibrium block of SNPs in FSHR, rs11686474, rs11680730, rs12473870, and rs1247381 was found in African Americans. By considering human acceleration, we identified a novel gene that may be associated with preterm birth, FSHR. We anticipate other human accelerated genes will similarly be associated with preterm birth risk and elucidate essential pathways for human parturition. PMID:21533219
Towards the prediction of essential genes by integration of network topology, cellular localization and biological process information

PubMed Central

2009-01-01

Background The identification of essential genes is important for the understanding of the minimal requirements for cellular life and for practical purposes, such as drug design. However, the experimental techniques for essential genes discovery are labor-intensive and time-consuming. Considering these experimental constraints, a computational approach capable of accurately predicting essential genes would be of great value. We therefore present here a machine learning-based computational approach relying on network topological features, cellular localization and biological process information for prediction of essential genes. Results We constructed a decision tree-based meta-classifier and trained it on datasets with individual and grouped attributes (network topological features, cellular compartments and biological processes) to generate various predictors of essential genes. We showed that the predictors with better performances are those generated by datasets with integrated attributes. Using the predictor with all attributes, i.e., network topological features, cellular compartments and biological processes, we obtained the best predictor of essential genes that was then used to classify yeast genes with unknown essentiality status. Finally, we generated decision trees by training the J48 algorithm on datasets with all network topological features, cellular localization and biological process information to discover cellular rules for essentiality. We found that the number of protein physical interactions, the nuclear localization of proteins and the number of regulating transcription factors are the most important factors determining gene essentiality. Conclusion We were able to demonstrate that network topological features, cellular localization and biological process information are reliable predictors of essential genes. Moreover, by constructing decision trees based on these data, we could discover cellular rules governing essentiality. PMID:19758426
A Caenorhabditis elegans RNA polymerase II gene, ama-1 IV, and nearby essential genes.

PubMed

Rogalski, T M; Riddle, D L

1988-01-01

The amanitin-binding subunit of RNA polymerase II in Caenorhabditis elegans is encoded by the ama-1 gene, located approximately 0.05 map unit to the right of dpy-13 IV. Using the amanitin-resistant ama-1(m118) strain as a parent, we have isolated amanitin-sensitive mutants that carry recessive-lethal ama-1 alleles. Of the six ethyl methanesulfonate-induced mutants examined, two are arrested late in embryogenesis. One of these is a large deficiency, mDf9, but the second may be a novel point mutation. The four other mutants are hypomorphs, and presumably produce altered RNA polymerase II enzymes with some residual function. Two of these mutants develop into sterile adults at 20 degrees but are arrested as larvae at 25 degrees, and two others are fertile at 20 degrees and sterile at 25 degrees. Temperature-shift experiments performed with the adult sterile mutant, ama-1(m118m238ts), have revealed a temperature-sensitive period that begins late in gonadogenesis and is centered around the initiation of egg-laying. Postembryonic development at 25 degrees is slowed by 30%. By contrast, the amanitin-resistant allele of ama-1 has very little effect on developmental rate or fertility. We have identified 15 essential genes in an interval of 4.5 map units surrounding ama-1, as well as four gamma-ray-induced deficiencies and two duplications that include the ama-1 gene. The larger duplication, mDp1, may include the entire left arm of chromosome IV, and it recombines with the normal homologue at a low frequency. The smallest deficiency, mDf10, complements all but three identified genes: let-278, dpy-13 and ama-1, which define an interval of only 0.1 map unit. The terminal phenotype of mDf10 homozygotes is developmental arrest during the first larval stage, suggesting that there is sufficient maternal RNA polymerase II to complete embryonic development.
ChIP-Seq Analysis for Identifying Genome-Wide Histone Modifications Associated with Stress-Responsive Genes in Plants.

PubMed

Li, Guosheng; Jagadeeswaran, Guru; Mort, Andrew; Sunkar, Ramanjulu

2017-01-01

Histone modifications represent the crux of epigenetic gene regulation essential for most biological processes including abiotic stress responses in plants. Thus, identification of histone modifications at the genome-scale can provide clues for how some genes are "turned-on" while some others are "turned-off" in response to stress. This chapter details a step-by-step protocol for identifying genome-wide histone modifications associated with stress-responsive gene regulation using chromatin immunoprecipitation (ChIP) followed by sequencing of the DNA (ChIP-seq).
In vivo and in silico determination of essential genes of Campylobacter jejuni.

PubMed

Metris, Aline; Reuter, Mark; Gaskin, Duncan J H; Baranyi, Jozsef; van Vliet, Arnoud H M

2011-11-01

In the United Kingdom, the thermophilic Campylobacter species C. jejuni and C. coli are the most frequent causes of food-borne gastroenteritis in humans. While campylobacteriosis is usually a relatively mild infection, it has a significant public health and economic impact, and possible complications include reactive arthritis and the autoimmune disease Guillain-Barré syndrome. The rapid developments in "omics" technologies have resulted in the availability of diverse datasets allowing predictions of metabolism and physiology of pathogenic micro-organisms. When combined, these datasets may allow for the identification of potential weaknesses that can be used for development of new antimicrobials to reduce or eliminate C. jejuni and C. coli from the food chain. A metabolic model of C. jejuni was constructed using the annotation of the NCTC 11168 genome sequence, a published model of the related bacterium Helicobacter pylori, and extensive literature mining. Using this model, we have used in silico Flux Balance Analysis (FBA) to determine key metabolic routes that are essential for generating energy and biomass, thus creating a list of genes potentially essential for growth under laboratory conditions. To complement this in silico approach, candidate essential genes have been determined using a whole genome transposon mutagenesis method. FBA and transposon mutagenesis (both this study and a published study) predict a similar number of essential genes (around 200). The analysis of the intersection between the three approaches highlights the shikimate pathway where genes are predicted to be essential by one or more methods, and tend to be network hubs, based on a previously published Campylobacter protein-protein interaction network, and could therefore be targets for novel antimicrobial therapy. We have constructed the first curated metabolic model for the food-borne pathogen Campylobacter jejuni and have presented the resulting metabolic insights. We have shown that
A screen to identify Drosophila genes required for integrin-mediated adhesion.

PubMed Central

Walsh, E P; Brown, N H

1998-01-01

Drosophila integrins have essential adhesive roles during development, including adhesion between the two wing surfaces. Most position-specific integrin mutations cause lethality, and clones of homozygous mutant cells in the wing do not adhere to the apposing surface, causing blisters. We have used FLP-FRT induced mitotic recombination to generate clones of randomly induced mutations in the F1 generation and screened for mutations that cause wing blisters. This phenotype is highly selective, since only 14 lethal complementation groups were identified in screens of the five major chromosome arms. Of the loci identified, 3 are PS integrin genes, 2 are blistered and bloated, and the remaining 9 appear to be newly characterized loci. All 11 nonintegrin loci are required on both sides of the wing, in contrast to integrin alpha subunit genes. Mutations in 8 loci only disrupt adhesion in the wing, similar to integrin mutations, while mutations in the 3 other loci cause additional wing defects. Mutations in 4 loci, like the strongest integrin mutations, cause a "tail-up" embryonic lethal phenotype, and mutant alleles of 1 of these loci strongly enhance an integrin mutation. Thus several of these loci are good candidates for genes encoding cytoplasmic proteins required for integrin function. PMID:9755209
Utilizing Gene Tree Variation to Identify Candidate Effector Genes in Zymoseptoria tritici

PubMed Central

McDonald, Megan C.; McGinness, Lachlan; Hane, James K.; Williams, Angela H.; Milgate, Andrew; Solomon, Peter S.

2016-01-01

Zymoseptoria tritici is a host-specific, necrotrophic pathogen of wheat. Infection by Z. tritici is characterized by its extended latent period, which typically lasts 2 weeks, and is followed by extensive host cell death and rapid proliferation of fungal biomass. This work characterizes the level of genomic variation in 13 isolates, for which we have measured virulence on 11 wheat cultivars with differential resistance genes. Between the reference isolate, IPO323, and the 13 Australian isolates we identified over 800,000 single nucleotide polymorphisms, of which ∼10% had an effect on the coding regions of the genome. Furthermore, we identified over 1700 probable presence/absence polymorphisms in genes across the Australian isolates using de novo assembly. Finally, we developed a gene tree sorting method that quickly identifies groups of isolates within a single gene alignment whose sequence haplotypes correspond with virulence scores on a single wheat cultivar. Using this method, we have identified < 100 candidate effector genes whose gene sequence correlates with virulence toward a wheat cultivar carrying a major resistance gene. PMID:26837952
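The idea of sorting isolates by haplotype and comparing the groups with virulence can be illustrated with a minimal sketch. This is not the authors' gene tree sorting method: it assumes a simplified binary virulent/avirulent call per isolate instead of virulence scores, groups isolates by identical aligned sequence rather than by tree topology, and uses hypothetical isolate names and sequences.

```python
from collections import defaultdict

def haplotype_groups(alignment):
    """Group isolate names by identical aligned sequence (one group per haplotype)."""
    groups = defaultdict(list)
    for isolate, sequence in alignment.items():
        groups[sequence].append(isolate)
    return list(groups.values())

def separates_virulence(alignment, virulent):
    """Return True if every haplotype group is uniformly virulent or uniformly
    avirulent, and both classes occur among the groups (simplified assumption)."""
    classes_seen = set()
    for group in haplotype_groups(alignment):
        states = {isolate in virulent for isolate in group}
        if len(states) > 1:          # mixed group: haplotype does not track virulence
            return False
        classes_seen |= states
    return len(classes_seen) == 2

# Hypothetical data: four isolates, two genes, virulence on one cultivar.
virulent = {"iso1", "iso2"}
gene_x = {"iso1": "ATGC", "iso2": "ATGC", "iso3": "ATGA", "iso4": "ATGA"}
gene_y = {"iso1": "ATGC", "iso2": "ATGA", "iso3": "ATGC", "iso4": "ATGA"}

candidate_x = separates_virulence(gene_x, virulent)
candidate_y = separates_virulence(gene_y, virulent)
```

In this toy case the haplotypes of gene_x split the isolates exactly along the virulence call, so it would be flagged as a candidate, while gene_y would not.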
Implementation of a model for identifying Essentially Derived Varieties in vegetatively propagated Calluna vulgaris varieties.

PubMed

Borchert, Thomas; Krueger, Joerg; Hohe, Annette

2008-08-20

Variety protection is of high relevance for the horticultural community and juridical cases have become more frequent in a globalized economy due to essential derivation of varieties. This applies equally to Calluna vulgaris, a vegetatively propagated species from the Ericaceae family that belongs to the top-selling pot plants in Europe. We therefore analyzed the genetic diversity of 74 selected varieties and genotypes of C. vulgaris and 3 of Erica spp. by means of RAPD and iSSR fingerprinting using 168 mono- and polymorphisms. The same data set was utilized to generate a system to reliably identify Essentially Derived Varieties (EDVs) in C. vulgaris, which was adapted from a method suggested for lettuce and barley. This system was developed, validated and used for selected tests of interest in C. vulgaris. As expected following personal communications with breeders, a very small genetic diversity became evident within C. vulgaris when investigated using our molecular methods. Thus, a dendrogram-based assay to detect Essentially Derived Varieties in this species is not suitable, although varieties are propagated vegetatively. In contrast, the system applied in lettuce, which itself applies pairwise comparisons using appropriate reference sets, proved functional with this species. The narrow gene pool detected in C. vulgaris may be the genetic basis for juridical conflicts between breeders. We successfully tested a methodology for identification of Essentially Derived Varieties in highly identical C. vulgaris genotypes and recommend this for future proof of essential derivation in C. vulgaris and other vegetatively propagated crops.
An Updated Collection of Sequence Barcoded Temperature-Sensitive Alleles of Yeast Essential Genes

PubMed Central

Kofoed, Megan; Milbury, Karissa L.; Chiang, Jennifer H.; Sinha, Sunita; Ben-Aroya, Shay; Giaever, Guri; Nislow, Corey; Hieter, Philip; Stirling, Peter C.

2015-01-01

Systematic analyses of essential gene function using mutant collections in Saccharomyces cerevisiae have been conducted using collections of heterozygous diploids, promoter shut-off alleles, through alleles with destabilized mRNA, destabilized protein, or bearing mutations that lead to a temperature-sensitive (ts) phenotype. We previously described a method for construction of barcoded ts alleles in a systematic fashion. Here we report the completion of this collection of alleles covering 600 essential yeast genes. This resource covers a larger gene repertoire than previous collections and provides a complementary set of strains suitable for single gene and genomic analyses. We use deep sequencing to characterize the amino acid changes leading to the ts phenotype in half of the alleles. We also use high-throughput approaches to describe the relative ts behavior of the alleles. Finally, we demonstrate the experimental usefulness of the collection in a high-content, functional genomic screen for ts alleles that increase spontaneous P-body formation. By increasing the number of alleles and improving the annotation, this ts collection will serve as a community resource for probing new aspects of biology for essential yeast genes. PMID:26175450

An Updated Collection of Sequence Barcoded Temperature-Sensitive Alleles of Yeast Essential Genes.

PubMed

Kofoed, Megan; Milbury, Karissa L; Chiang, Jennifer H; Sinha, Sunita; Ben-Aroya, Shay; Giaever, Guri; Nislow, Corey; Hieter, Philip; Stirling, Peter C

2015-07-14

Systematic analyses of essential gene function using mutant collections in Saccharomyces cerevisiae have been conducted using collections of heterozygous diploids, promoter shut-off alleles, through alleles with destabilized mRNA, destabilized protein, or bearing mutations that lead to a temperature-sensitive (ts) phenotype. We previously described a method for construction of barcoded ts alleles in a systematic fashion. Here we report the completion of this collection of alleles covering 600 essential yeast genes. This resource covers a larger gene repertoire than previous collections and provides a complementary set of strains suitable for single gene and genomic analyses. We use deep sequencing to characterize the amino acid changes leading to the ts phenotype in half of the alleles. We also use high-throughput approaches to describe the relative ts behavior of the alleles. Finally, we demonstrate the experimental usefulness of the collection in a high-content, functional genomic screen for ts alleles that increase spontaneous P-body formation. By increasing the number of alleles and improving the annotation, this ts collection will serve as a community resource for probing new aspects of biology for essential yeast genes. Copyright © 2015 Kofoed et al.
Genome-wide RNAi screening identifies protein damage as a regulator of osmoprotective gene expression.

PubMed

Lamitina, Todd; Huang, Chunyi George; Strange, Kevin

2006-08-08

The detection, stabilization, and repair of stress-induced damage are essential requirements for cellular life. All cells respond to osmotic stress-induced water loss with increased expression of genes that mediate accumulation of organic osmolytes, solutes that function as chemical chaperones and restore osmotic homeostasis. The signals and signaling mechanisms that regulate osmoprotective gene expression in animal cells are poorly understood. Here, we show that gpdh-1 and gpdh-2, genes that mediate the accumulation of the organic osmolyte glycerol, are essential for survival of the nematode Caenorhabditis elegans during osmotic stress. Expression of GFP driven by the gpdh-1 promoter (P(gpdh-1)::GFP) is detected only during hypertonic stress but is not induced by other stressors. Using P(gpdh-1)::GFP expression as a phenotype, we screened approximately 16,000 genes by RNAi feeding and identified 122 that cause constitutive activation of gpdh-1 expression and glycerol accumulation. Many of these genes function to regulate protein translation and cotranslational protein folding and to target and degrade denatured proteins, suggesting that the accumulation of misfolded proteins functions as a signal to activate osmoprotective gene expression and organic osmolyte accumulation in animal cells. Consistent with this hypothesis, 73% of these protein-homeostasis genes have been shown to slow age-dependent protein aggregation in C. elegans. Because diverse environmental stressors and numerous disease states result in protein misfolding, mechanisms must exist that discriminate between osmotically induced and other forms of stress-induced protein damage. Our findings provide a foundation for understanding how these damage-selectivity mechanisms function.
Genome-wide RNAi screening identifies protein damage as a regulator of osmoprotective gene expression

PubMed Central

Lamitina, Todd; Huang, Chunyi George; Strange, Kevin

2006-01-01

The detection, stabilization, and repair of stress-induced damage are essential requirements for cellular life. All cells respond to osmotic stress-induced water loss with increased expression of genes that mediate accumulation of organic osmolytes, solutes that function as chemical chaperones and restore osmotic homeostasis. The signals and signaling mechanisms that regulate osmoprotective gene expression in animal cells are poorly understood. Here, we show that gpdh-1 and gpdh-2, genes that mediate the accumulation of the organic osmolyte glycerol, are essential for survival of the nematode Caenorhabditis elegans during osmotic stress. Expression of GFP driven by the gpdh-1 promoter (Pgpdh-1::GFP) is detected only during hypertonic stress but is not induced by other stressors. Using Pgpdh-1::GFP expression as a phenotype, we screened ≈16,000 genes by RNAi feeding and identified 122 that cause constitutive activation of gpdh-1 expression and glycerol accumulation. Many of these genes function to regulate protein translation and cotranslational protein folding and to target and degrade denatured proteins, suggesting that the accumulation of misfolded proteins functions as a signal to activate osmoprotective gene expression and organic osmolyte accumulation in animal cells. Consistent with this hypothesis, 73% of these protein-homeostasis genes have been shown to slow age-dependent protein aggregation in C. elegans. Because diverse environmental stressors and numerous disease states result in protein misfolding, mechanisms must exist that discriminate between osmotically induced and other forms of stress-induced protein damage. Our findings provide a foundation for understanding how these damage-selectivity mechanisms function. PMID:16880390
An Approach for Predicting Essential Genes Using Multiple Homology Mapping and Machine Learning Algorithms.

PubMed

Hua, Hong-Li; Zhang, Fa-Zhan; Labena, Abraham Alemayehu; Dong, Chuan; Jin, Yan-Ting; Guo, Feng-Biao

Investigation of essential genes is significant for understanding the minimal gene set of a cell and for discovering potential drug targets. In this study, a novel approach based on multiple homology mapping and machine learning was introduced to predict essential genes. We focused on 25 bacteria which have characterized essential genes. The predictions yielded the highest area under the receiver operating characteristic (ROC) curve (AUC) of 0.9716 through tenfold cross-validation. Proper features were utilized to construct models to make predictions in distantly related bacteria. The accuracy of predictions was evaluated via the consistency between the predictions and the known essential genes of the target species. The highest AUC of 0.9552 and an average AUC of 0.8314 were achieved when making predictions across organisms. An independent dataset from Synechococcus elongatus, which was released recently, was obtained for further assessment of the performance of our model. The AUC score of these predictions is 0.7855, which is higher than that of other methods. This research shows that features obtained by homology mapping alone can achieve results as good as, or even better than, those obtained with integrated features. Meanwhile, the work indicates that machine learning-based methods can assign more efficient weight coefficients than empirical formulas based on biological knowledge.
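Because the abstract reports its results as AUC values, a short illustration of how an AUC can be computed from predicted scores may help. The sketch below uses the rank-based (Mann-Whitney) formulation: the AUC is the fraction of (essential, non-essential) gene pairs in which the essential gene receives the higher score, with ties counted as one half. The labels and scores are made-up illustrative values, not data from the study, and the function name is hypothetical.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: 1 for essential genes, 0 for non-essential genes.
    scores: predicted essentiality scores (higher = more likely essential).
    """
    positives = [s for l, s in zip(labels, scores) if l == 1]
    negatives = [s for l, s in zip(labels, scores) if l == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0      # essential gene ranked above non-essential gene
            elif p == n:
                wins += 0.5      # ties count as half
    return wins / (len(positives) * len(negatives))

# Illustrative values only.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
auc = roc_auc(labels, scores)   # 8 of 9 pairs are correctly ordered
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, which is the scale on which the reported values of 0.9716, 0.9552, 0.8314 and 0.7855 should be read.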
Islands of non-essential genes, including a DNA translocation operon, in the genome of bacteriophage 0305ϕ8-36

PubMed Central

Pathria, Saurav; Rolando, Mandy; Lieman, Karen; Hayes, Shirley; Hardies, Stephen; Serwer, Philip

2012-01-01

We investigate genes of lytic, Bacillus thuringiensis bacteriophage 0305ϕ8-36 that are non-essential for laboratory propagation, but might have a function in the wild. We isolate deletion mutants to identify these genes. The non-permutation of the genome (218.948 Kb, with a 6.479 Kb terminal repeat and 247 identified orfs) simplifies isolation of deletion mutants. We find two islands of non-essential genes. The first island (3.01% of the genomic DNA) has an informatically identified DNA translocation operon. Deletion causes no detectable growth defect during propagation in a dilute agarose overlay. Identification of the DNA translocation operon begins with a DNA relaxase and continues with a translocase and membrane-binding anchor proteins. The relaxase is in a family, first identified here, with homologs in other bacteriophages. The second deleted island (3.71% of the genome) has genes for two metallo-protein chaperonins and two tRNAs. Deletion causes a significant growth defect. In addition, (1) we find by “in situ” (in-plaque) single-particle fluorescence microscopy that adsorption to the host occurs at the tip of the 486 nm long tail, (2) we develop a procedure of 0305ϕ8-36 purification that does not cause tail contraction, and (3) we then find by electron microscopy that 0305ϕ8-36 undergoes tail tip-tail tip dimerization that potentially blocks adsorption to host cells, presumably with effectiveness that increases as the bacteriophage particle concentration increases. These observations provide an explanation of the previous observation that 0305ϕ8-36 does not lyse liquid cultures, even though 0305ϕ8-36 is genomically lytic. PMID:22666654
Identifying Cancer Driver Genes Using Replication-Incompetent Retroviral Vectors

PubMed Central

Bii, Victor M.; Trobridge, Grant D.

2016-01-01

Identifying novel genes that drive tumor metastasis and drug resistance has significant potential to improve patient outcomes. High-throughput sequencing approaches have identified cancer genes, but distinguishing driver genes from passengers remains challenging. Insertional mutagenesis screens using replication-incompetent retroviral vectors have emerged as a powerful tool to identify cancer genes. Unlike replicating retroviruses and transposons, replication-incompetent retroviral vectors lack additional mutagenesis events that can complicate the identification of driver mutations from passenger mutations. They can also be used for almost any human cancer due to the broad tropism of the vectors. Replication-incompetent retroviral vectors have the ability to dysregulate nearby cancer genes via several mechanisms including enhancer-mediated activation of gene promoters. The integrated provirus acts as a unique molecular tag for nearby candidate driver genes which can be rapidly identified using well established methods that utilize next generation sequencing and bioinformatics programs. Recently, retroviral vector screens have been used to efficiently identify candidate driver genes in prostate, breast, liver and pancreatic cancers. Validated driver genes can be potential therapeutic targets and biomarkers. In this review, we describe the emergence of retroviral insertional mutagenesis screens using replication-incompetent retroviral vectors as a novel tool to identify cancer driver genes in different cancer types. PMID:27792127
A forward genetic screen reveals essential and non-essential RNAi factors in Paramecium tetraurelia

PubMed Central

Marker, Simone; Carradec, Quentin; Tanty, Véronique; Arnaiz, Olivier; Meyer, Eric

2014-01-01

In most eukaryotes, small RNA-mediated gene silencing pathways form complex interacting networks. In the ciliate Paramecium tetraurelia, at least two RNA interference (RNAi) mechanisms coexist, involving distinct but overlapping sets of protein factors and producing different types of short interfering RNAs (siRNAs). One is specifically triggered by high-copy transgenes, and the other by feeding cells with double-stranded RNA (dsRNA)-producing bacteria. In this study, we designed a forward genetic screen for mutants deficient in dsRNA-induced silencing, and a powerful method to identify the relevant mutations by whole-genome sequencing. We present a set of 47 mutant alleles for five genes, revealing two previously unknown RNAi factors: a novel Paramecium-specific protein (Pds1) and a Cid1-like nucleotidyl transferase. Analyses of allelic diversity distinguish non-essential and essential genes and suggest that the screen is saturated for non-essential, single-copy genes. We show that non-essential genes are specifically involved in dsRNA-induced RNAi while essential ones are also involved in transgene-induced RNAi. One of the latter, the RNA-dependent RNA polymerase RDR2, is further shown to be required for all known types of siRNAs, as well as for sexual reproduction. These results open the way for the dissection of the genetic complexity, interconnection, mechanisms and natural functions of RNAi pathways in P. tetraurelia. PMID:24860163
Identifying the Essential Elements of Effective Science Communication: What Do the Experts Say?

ERIC Educational Resources Information Center

Bray, Belinda; France, Bev; Gilbert, John K.

2012-01-01

Experts in science communication were asked to identify the essential elements of a science communication course for post-graduate students. A Delphi methodology provided a framework for a research design that accessed their opinions and allowed them to contribute to, reflect on and identify 10 essential elements. There was a high level of…
Candidate genes for panhypopituitarism identified by gene expression profiling

PubMed Central

Mortensen, Amanda H.; MacDonald, James W.; Ghosh, Debashis

2011-01-01

Mutations in the transcription factors PROP1 and PIT1 (POU1F1) lead to pituitary hormone deficiency and hypopituitarism in mice and humans. The dysmorphology of developing Prop1 mutant pituitaries readily distinguishes them from those of Pit1 mutants and normal mice. This and other features suggest that Prop1 controls the expression of genes besides Pit1 that are important for pituitary cell migration, survival, and differentiation. To identify genes involved in these processes we used microarray analysis of gene expression to compare pituitary RNA from newborn Prop1 and Pit1 mutants and wild-type littermates. Significant differences in gene expression were noted between each mutant and their normal littermates, as well as between Prop1 and Pit1 mutants. Otx2, a gene critical for normal eye and pituitary development in humans and mice, exhibited elevated expression specifically in Prop1 mutant pituitaries. We report the spatial and temporal regulation of Otx2 in normal mice and Prop1 mutants, and the results suggest Otx2 could influence pituitary development by affecting signaling from the ventral diencephalon and regulation of gene expression in Rathke's pouch. The discovery that Otx2 expression is affected by Prop1 deficiency provides support for our hypothesis that identifying molecular differences in mutants will contribute to understanding the molecular mechanisms that control pituitary organogenesis and lead to human pituitary disease. PMID:21828248
Use of essential gene, encoding porphobilinogen deaminase from extreme psychrophilic Colwellia sp. C1, to generate temperature-sensitive strain of Francisella novicida.

PubMed

Pankowski, J A

2016-08-01

Previously, several essential genes from psychrophilic bacteria have been substituted for their homologues in mesophilic bacterial pathogens to make the latter temperature sensitive. It has been noted that an essential ligA gene from an extreme psychrophile, Colwellia sp. C1, yielded a gene product that is inactivated at 27°C, the lowest that has been observed for any psychrophilic enzyme, and hypothesized that other essential proteins of that strain would also have low inactivation temperatures. This work describes the partial sequencing of the genome of Colwellia sp. C1 strain and the identification of 24 open reading frames encoding homologues of highly conserved bacterial essential genes. The gene encoding porphobilinogen deaminase (hemC), which is involved in the pathway of haem synthesis, has been tested for its ability to convert Francisella novicida into a temperature-sensitive strain. The hybrid strain carrying the C1-derived hemC gene exhibited a temperature-sensitive phenotype with a restrictive temperature of 36°C. These results support the conclusion that Colwellia sp. C1 is a rich source of heat-labile enzymes. The issue of biosafety is often raised when it comes to work with pathogenic organisms. The main concern is caused by the risk of researchers being exposed to infectious doses of dangerous microbes. This paper analyses essential genes identified in partial genomic sequence of the psychrophilic bacterium Colwellia sp. C1. These sequences can be used as a means of generating temperature-sensitive strains of pathogenic bacteria. Such strains are incapable of surviving at the temperature of the human body. This means they could be applied as vaccines or for safer work with dangerous organisms. © 2016 The Society for Applied Microbiology.
Large-scale identification of differentially expressed genes during pupa development reveals solute carrier gene is essential for pupal pigmentation in Chilo suppressalis.

PubMed

Sun, Yang; Huang, Shuijin; Wang, Shuping; Guo, Dianhao; Ge, Chang; Xiao, Huamei; Jie, Wencai; Yang, Qiupu; Teng, Xiaolu; Li, Fei

2017-04-01

Insects undergo metamorphosis, involving an abrupt change in body structure through cell growth and differentiation. The rice striped stem borer (SSB), Chilo suppressalis, is one of the most destructive rice pests. However, little is known about the regulatory mechanisms of metamorphic development in this notorious insect pest. Here, we studied the expression of 22,197 SSB genes at seven time points during pupa development with a customized microarray, identifying 622 differentially expressed genes (DEGs) during pupa development. Gene ontology (GO) analysis of these DEGs indicated that genes related to substance metabolism were highly expressed in the early pupa and participate in the physiological processes of larval tissue disintegration at these stages. In comparison, highly expressed genes in the late pupal stages were mainly associated with substance biosynthesis, consistent with adult organ formation at these stages. There were 27 solute carrier (SLC) genes that were highly expressed during pupa development. We knocked down SLC22A3 at the prepupal stage, demonstrating that silencing SLC22A3 induced a deficiency in pupa stiffness and pigmentation. The RNAi-treated individuals formed white, soft pupae, suggesting that this gene has an essential role in pupal development. Copyright © 2016 Elsevier Ltd. All rights reserved.
Identifying potential maternal genes of Bombyx mori using digital gene expression profiling

PubMed Central

Xu, Pingzhen

2018-01-01

Maternal genes present in mature oocytes play a crucial role in the early development of silkworm. Although maternal genes have been widely studied in many other species, there has been limited research in Bombyx mori. High-throughput next generation sequencing provides a practical method for gene discovery on a genome-wide level. Herein, a transcriptome study was used to identify maternal-related genes from silkworm eggs. Unfertilized eggs from five different stages of early development were used to detect changes in gene expression. The expressed genes showed different patterns over time. Seventy-six maternal genes were annotated according to homology analysis with Drosophila melanogaster. More than half of the differentially expressed maternal genes fell into four expression patterns, while the expression patterns showed a downward trend over time. The functional annotation of these maternal genes was mainly related to transcription factor activity, growth factor activity, nucleic acid binding, RNA binding, ATP binding, and ion binding. Additionally, twenty-two gene clusters including maternal genes were identified from 18 scaffolds. Altogether, we plotted a profile for the maternal genes of Bombyx mori using a digital gene expression profiling method. This will provide the basis for maternal-specific signature research and improve the understanding of the early development of silkworm. PMID:29462160
A functional screen for copper homeostasis genes identifies a pharmacologically tractable cellular system

PubMed Central

2014-01-01

Background: Copper is essential for the survival of aerobic organisms. If copper is not properly regulated in the body, however, it can be extremely cytotoxic, and genetic mutations that compromise copper homeostasis result in severe clinical phenotypes. Understanding how cells maintain optimal copper levels is therefore highly relevant to human health. Results: We found that addition of copper (Cu) to culture medium leads to increased respiratory growth of yeast, a phenotype which we then systematically and quantitatively measured in 5050 homozygous diploid deletion strains. Cu’s positive effect on respiratory growth was quantitatively reduced in deletion strains representing 73 different genes, the functions of which identify increased iron uptake as a cause of the increase in growth rate. Conversely, these effects were enhanced in strains representing 93 genes. Many of these strains exhibited respiratory defects that were specifically rescued by supplementing the growth medium with Cu. Among the genes identified are known and direct regulators of copper homeostasis, genes required to maintain low vacuolar pH, and genes where evidence supporting a functional link with Cu has been heretofore lacking. Roughly half of the genes are conserved in man, and several of these are associated with Mendelian disorders, including the Cu-imbalance syndromes Menkes and Wilson’s disease. We additionally demonstrate that pharmacological agents, including the approved drug disulfiram, can rescue Cu-deficiencies of both environmental and genetic origin. Conclusions: A functional screen in yeast has expanded the list of genes required for Cu-dependent fitness, revealing a complex cellular system with implications for human health. Respiratory fitness defects arising from perturbations in this system can be corrected with pharmacological agents that increase intracellular copper concentrations. PMID:24708151
A gene family for acidic ribosomal proteins in Schizosaccharomyces pombe: two essential and two nonessential genes.

PubMed Central

Beltrame, M; Bianchi, M E

1990-01-01

We have cloned the genes for small acidic ribosomal proteins (A-proteins) of the fission yeast Schizosaccharomyces pombe. S. pombe contains four transcribed genes for small A-proteins per haploid genome, as is the case for Saccharomyces cerevisiae. In contrast, multicellular eucaryotes contain two transcribed genes per haploid genome. The four proteins of S. pombe, besides sharing a high overall similarity, form two couples of nearly identical sequences. Their corresponding genes have a very conserved structure and are transcribed to a similar level. Surprisingly, of each couple of genes coding for nearly identical proteins, one is essential for cell growth, whereas the other is not. We suggest that the unequal importance of the four small A-proteins for cell survival is related to their physical organization in 60S ribosomal subunits. PMID:2325655
Systematic bacterialization of yeast genes identifies a near-universally swappable pathway

PubMed Central

Kachroo, Aashiq H; Laurent, Jon M; Akhmetov, Azat; Szilagyi-Jones, Madelyn; McWhite, Claire D; Zhao, Alice; Marcotte, Edward M

2017-01-01

Eukaryotes and prokaryotes last shared a common ancestor ~2 billion years ago, and while many present-day genes in these lineages predate this divergence, the extent to which these genes still perform their ancestral functions is largely unknown. To test principles governing retention of ancient function, we asked if prokaryotic genes could replace their essential eukaryotic orthologs. We systematically replaced essential genes in yeast by their 1:1 orthologs from Escherichia coli. After accounting for mitochondrial localization and alternative start codons, 31 out of 51 bacterial genes tested (61%) could complement a lethal growth defect and replace their yeast orthologs with minimal effects on growth rate. Replaceability was determined on a pathway-by-pathway basis; codon usage, abundance, and sequence similarity contributed predictive power. The heme biosynthesis pathway was particularly amenable to inter-kingdom exchange, with each yeast enzyme replaceable by its bacterial, human, or plant ortholog, suggesting it as a near-universally swappable pathway. DOI: http://dx.doi.org/10.7554/eLife.25093.001 PMID:28661399
Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

PubMed

Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

2017-08-01

This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in a total of 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs, sanguinarine and papaverine, were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.
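The abstract defines module significance as the average gene significance of the genes in a module. A minimal sketch of that calculation follows, assuming gene significance is taken as the absolute Pearson correlation between a gene's expression and disease status (a common choice in WGCNA-style analyses, though the abstract does not state which measure was used). The gene names, expression values and function names are hypothetical, and this is not a reimplementation of the WGCNA R package.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

def module_significance(expression, trait, module_genes):
    """Mean gene significance over a module; gene significance is assumed
    here to be |correlation(expression, trait)|."""
    significances = [abs(pearson(expression[g], trait)) for g in module_genes]
    return sum(significances) / len(significances)

# Hypothetical data: 1 = RA sample, 0 = healthy control.
trait = [1, 1, 0, 0]
expression = {
    "geneA": [2.0, 2.0, 1.0, 1.0],   # perfectly correlated with status
    "geneB": [1.0, 1.0, 3.0, 3.0],   # perfectly anti-correlated
    "geneC": [1.0, 2.0, 1.0, 2.0],   # uncorrelated with status
}
ms = module_significance(expression, trait, ["geneA", "geneB", "geneC"])
```

Modules with a higher value are more strongly associated with the trait; in this toy module two of the three genes track disease status and the third does not.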
Yeast Two-Hybrid and One-Hybrid Screenings Identify Regulators of hsp70 Gene Expression.

PubMed

Saito, Youhei; Nakagawa, Takanobu; Kakihana, Ayana; Nakamura, Yoshia; Nabika, Tomomi; Kasai, Michihiro; Takamori, Mai; Yamagishi, Nobuyuki; Kuga, Takahisa; Hatayama, Takumi; Nakayama, Yuji

2016-09-01

The mammalian stress protein Hsp105β, which is specifically expressed during mild heat shock and localizes to the nucleus, induces the major stress protein Hsp70. In the present study, we performed yeast two-hybrid and one-hybrid screenings to identify the regulators of Hsp105β-mediated hsp70 gene expression. Six and two proteins were detected as Hsp105β- and hsp70 promoter-binding proteins, respectively. A luciferase reporter gene assay revealed that hsp70 promoter activation is enhanced by the transcriptional co-activator AF9 and splicing mediator SNRPE, but suppressed by the coiled-coil domain-containing protein CCDC127. Of these proteins, the knockdown of SNRPE suppressed the expression of Hsp70 irrespective of the presence of Hsp105β, indicating that SNRPE essentially functions as a transcriptional activator of hsp70 gene expression. The overexpression of HSP70 in tumor cells has been associated with cell survival and drug resistance. We here identified novel regulators of Hsp70 expression in stress signaling and also provided important insights into Hsp70-targeted anti-cancer therapy. J. Cell. Biochem. 117: 2109-2117, 2016. © 2016 Wiley Periodicals, Inc.
The Transposon impala Is Activated by Low Temperatures: Use of a Controlled Transposition System To Identify Genes Critical for Viability of Aspergillus fumigatus

PubMed Central

Carr, Paul D.; Tuckwell, Danny; Hey, Peter M.; Simon, Laurence; d'Enfert, Christophe; Birch, Mike; Oliver, Jason D.; Bromley, Michael J.

2010-01-01

Genes that are essential for viability represent potential targets for the development of anti-infective agents. However, relatively few have been determined in the filamentous fungal pathogen Aspergillus fumigatus. A novel solution employing parasexual genetics coupled with transposon mutagenesis using the Fusarium oxysporum transposon impala had previously enabled the identification of 20 essential genes from A. fumigatus; however, further use of this system required a better understanding of the mode of action of the transposon itself. Examination of a range of conditions indicated that impala is activated by prolonged exposure to low temperatures. This newly identified property was then harnessed to identify 96 loci that are critical for viability in A. fumigatus, including genes required for RNA metabolism, organelle organization, protein transport, ribosome biogenesis, and transcription, as well as a number of noncoding RNAs. A number of these genes represent potential targets for much-needed novel antifungal drugs. PMID:20097738
Identifying gene networks underlying the neurobiology of ethanol and alcoholism.

PubMed

Wolen, Aaron R; Miles, Michael F

2012-01-01

For complex disorders such as alcoholism, identifying the genes linked to these diseases and their specific roles is difficult. Traditional genetic approaches, such as genetic association studies (including genome-wide association studies) and analyses of quantitative trait loci (QTLs) in both humans and laboratory animals already have helped identify some candidate genes. However, because of technical obstacles, such as the small impact of any individual gene, these approaches only have limited effectiveness in identifying specific genes that contribute to complex diseases. The emerging field of systems biology, which allows for analyses of entire gene networks, may help researchers better elucidate the genetic basis of alcoholism, both in humans and in animal models. Such networks can be identified using approaches such as high-throughput molecular profiling (e.g., through microarray-based gene expression analyses) or strategies referred to as genetical genomics, such as the mapping of expression QTLs (eQTLs). Characterization of gene networks can shed light on the biological pathways underlying complex traits and provide the functional context for identifying those genes that contribute to disease development.
Gene Unprediction with Spurio: A tool to identify spurious protein sequences.

PubMed

Höps, Wolfram; Jeffryes, Matt; Bateman, Alex

2018-01-01

We now have access to the sequences of tens of millions of proteins. These protein sequences are essential for modern molecular biology and computational biology. The vast majority of protein sequences are derived from gene prediction tools and have no experimental supporting evidence for their translation. Despite the increasing accuracy of gene prediction tools there likely exists a large number of spurious protein predictions in the sequence databases. We have developed the Spurio tool to help identify spurious protein predictions in prokaryotes. Spurio searches the query protein sequence against a prokaryotic nucleotide database using tblastn and identifies homologous sequences. The tblastn matches are used to score the query sequence's likelihood of being a spurious protein prediction using a Gaussian process model. The most informative feature is the appearance of stop codons within the presumed translation of homologous DNA sequences. Benchmarking shows that the Spurio tool is able to distinguish spurious from true proteins. However, transposon proteins are prone to be predicted as spurious because of the frequency of degraded homologs found in the DNA sequence databases. Our initial experiments suggest that less than 1% of the proteins in the UniProtKB sequence database are likely to be spurious and that Spurio is able to identify over 60 times more spurious proteins than the AntiFam resource. The Spurio software and source code is available under an MIT license at the following URL: https://bitbucket.org/bateman-group/spurio.
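The abstract names in-frame stop codons in homologous DNA as the most informative feature. The sketch below only illustrates counting that feature for one reading frame; it is not Spurio's scoring (which the abstract describes as a Gaussian process model over tblastn matches). The function name and the use of the three standard stop codons are assumptions of this example.

```python
# Standard stop codons (assumed; also the stops of the bacterial genetic code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def count_in_frame_stops(dna, frame=0):
    """Count stop codons when reading `dna` in steps of three from offset `frame`."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return sum(codon in STOP_CODONS for codon in codons)


# Frame 0 reads ATG AAA TAG GGC TGA: one internal and one terminal stop.
stops_frame0 = count_in_frame_stops("ATGAAATAGGGCTGA")
```

A homologous region whose presumed translation contains many such stops would count as evidence against the query being a real protein.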

Identifying key genes in glaucoma based on a benchmarked dataset and the gene regulatory network.

PubMed

Chen, Xi; Wang, Qiao-Ling; Zhang, Meng-Hui

2017-10-01

The current study aimed to identify key genes in glaucoma based on a benchmarked dataset and gene regulatory network (GRN). Local and global noise was added to the gene expression dataset to produce a benchmarked dataset. Differentially-expressed genes (DEGs) between patients with glaucoma and normal controls were identified utilizing the Linear Models for Microarray Data (Limma) package based on the benchmarked dataset. A total of 5 GRN inference methods, including Zscore, GeneNet, context likelihood of relatedness (CLR) algorithm, Partial Correlation coefficient with Information Theory (PCIT) and GEne Network Inference with Ensemble of Trees (Genie3) were evaluated using receiver operating characteristic (ROC) and precision and recall (PR) curves. The inference method with the best performance was selected to construct the GRN. Subsequently, topological centrality analysis (degree, closeness and betweenness) was conducted to identify key genes in the GRN of glaucoma. Finally, the key genes were validated by performing reverse transcription-quantitative polymerase chain reaction (RT-qPCR). A total of 176 DEGs were detected from the benchmarked dataset. The ROC and PR curves of the 5 methods were analyzed and it was determined that Genie3 had a clear advantage over the other methods; thus, Genie3 was used to construct the GRN. Following topological centrality analysis, 14 key genes for glaucoma were identified, including IL6, EPHA2 and GSTT1, and 5 of these 14 key genes were validated by RT-qPCR. Therefore, the current study identified 14 key genes in glaucoma, which may be potential biomarkers to use in the diagnosis of glaucoma and aid in identifying the molecular mechanism of this disease.
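The three topological centrality measures named in this abstract can be illustrated on a small graph. The sketch below assumes an undirected, unweighted network and uses a breadth-first search with Brandes-style accumulation for betweenness; the study's own GRN, edge weights, and any normalization are not given in the abstract, and the node names are placeholders.

```python
from collections import deque


def centralities(adj):
    """Degree, closeness and (unnormalized) betweenness for an undirected graph.

    adj: node -> set of neighbouring nodes.
    """
    nodes = list(adj)
    degree = {v: len(adj[v]) for v in nodes}
    closeness = {}
    betweenness = {v: 0.0 for v in nodes}
    for s in nodes:
        dist = {s: 0}                      # shortest distance from s
        sigma = {v: 0 for v in nodes}      # number of shortest paths from s
        sigma[s] = 1
        preds = {v: [] for v in nodes}     # predecessors on shortest paths
        order = []
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        total = sum(dist.values())
        closeness[s] = (len(dist) - 1) / total if total else 0.0
        delta = {v: 0.0 for v in nodes}    # dependency of s on each node
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                betweenness[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    betweenness = {v: b / 2 for v, b in betweenness.items()}
    return degree, closeness, betweenness


# Placeholder star network: one hub gene linked to three others.
star = {
    "hub": {"geneA", "geneB", "geneC"},
    "geneA": {"hub"},
    "geneB": {"hub"},
    "geneC": {"hub"},
}
degree, closeness, betweenness = centralities(star)
```

In the star network the hub has the highest value on all three measures, which is the kind of node such an analysis would flag as a key gene.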
RAV transcription factors are essential for disease resistance against cassava bacterial blight via activation of melatonin biosynthesis genes.

PubMed

Wei, Yunxie; Chang, Yanli; Zeng, Hongqiu; Liu, Guoyin; He, Chaozu; Shi, Haitao

2018-01-01

With 1 AP2 domain and 1 B3 domain, 7 MeRAVs in apetala2/ethylene response factor (AP2/ERF) gene family have been identified in cassava. However, the in vivo roles of these remain unknown. Gene expression assays showed that the transcripts of MeRAVs were commonly regulated after Xanthomonas axonopodis pv manihotis (Xam) and MeRAVs were specifically located in plant cell nuclei. Through virus-induced gene silencing (VIGS) in cassava, we found that MeRAV1 and MeRAV2 are essential for plant disease resistance against cassava bacterial blight, as shown by the bacterial propagation of Xam in plant leaves. Through VIGS in cassava leaves and overexpression in cassava leaf protoplasts, we found that MeRAV1 and MeRAV2 positively regulated melatonin biosynthesis genes and the endogenous melatonin level. Further investigation showed that MeRAV1 and MeRAV2 are direct transcriptional activators of 3 melatonin biosynthesis genes in cassava, as evidenced by chromatin immunoprecipitation-PCR in cassava leaf protoplasts and electrophoretic mobility shift assay. Moreover, cassava melatonin biosynthesis genes also positively regulated plant disease resistance. Taken together, this study identified MeRAV1 and MeRAV2 as common and upstream transcription factors of melatonin synthesis genes in cassava and revealed a model of MeRAV1 and MeRAV2-melatonin biosynthesis genes-melatonin level in plant disease resistance against cassava bacterial blight. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Diametrical clustering for identifying anti-correlated gene clusters.

PubMed

Dhillon, Inderjit S; Marcotte, Edward M; Roshan, Usman

2003-09-01

Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive: genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i) re-partitioning the genes and (ii) computing the dominant singular vector of each gene cluster, with each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteasome, along with evidence for the inverse transcriptional regulation of a number of cellular systems.
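The two alternating steps in this abstract can be sketched as follows. This is a minimal reading of the description, not the authors' implementation. It assumes each profile is mean-centred and scaled to unit length, that a gene joins the cluster whose prototype gives the largest squared dot product (so correlated and anti-correlated genes land together), and that the dominant singular vector is approximated by power iteration. The farthest-first initialization and the toy profiles are also assumptions of this example.

```python
from math import sqrt


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def standardize(profile):
    """Mean-centre and scale to unit length (non-constant profiles assumed)."""
    mean = sum(profile) / len(profile)
    centred = [a - mean for a in profile]
    norm = sqrt(dot(centred, centred))
    return [a / norm for a in centred]


def dominant_direction(members, start, iters=50):
    """Power iteration for the dominant singular vector of the member profiles."""
    v = start
    for _ in range(iters):
        new = [0.0] * len(v)
        for x in members:
            c = dot(x, v)
            for i, xi in enumerate(x):
                new[i] += c * xi
        norm = sqrt(dot(new, new))
        if norm == 0.0:
            break
        v = [a / norm for a in new]
    return v


def diametrical_clustering(profiles, k, iters=20):
    X = [standardize(p) for p in profiles]
    # Farthest-first initialization (an assumption): start from the first gene,
    # then add the gene least aligned with the prototypes chosen so far.
    prototypes = [X[0]]
    while len(prototypes) < k:
        prototypes.append(min(X, key=lambda x: max(dot(x, p) ** 2 for p in prototypes)))
    labels = [0] * len(X)
    for _ in range(iters):
        # (i) re-partition: squared similarity ignores the sign of the correlation
        labels = [max(range(k), key=lambda j: dot(x, prototypes[j]) ** 2) for x in X]
        # (ii) recompute each prototype from its current members
        for j in range(k):
            members = [x for x, label in zip(X, labels) if label == j]
            if members:
                prototypes[j] = dominant_direction(members, prototypes[j])
    return labels, prototypes


# Toy profiles: a rising trend, its mirror image, and an alternating pattern with its mirror.
profiles = [
    [1, 2, 3, 4], [4, 3, 2, 1], [2, 4, 6, 8.5],
    [1, 3, 1, 3], [3, 1, 3, 1], [1, 3.2, 1, 2.9],
]
labels, prototypes = diametrical_clustering(profiles, k=2)
```

On the toy input the rising profile and its mirror image share a cluster, which an ordinary positive-correlation clustering would split apart.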
Cross-species microarray hybridization to identify developmentally regulated genes in the filamentous fungus Sordaria macrospora.

PubMed

Nowrousian, Minou; Ringelberg, Carol; Dunlap, Jay C; Loros, Jennifer J; Kück, Ulrich

2005-04-01

The filamentous fungus Sordaria macrospora forms complex three-dimensional fruiting bodies that protect the developing ascospores and ensure their proper discharge. Several regulatory genes essential for fruiting body development were previously isolated by complementation of the sterile mutants pro1, pro11 and pro22. To establish the genetic relationships between these genes and to identify downstream targets, we have conducted cross-species microarray hybridizations using cDNA arrays derived from the closely related fungus Neurospora crassa and RNA probes prepared from wild-type S. macrospora and the three developmental mutants. Of the 1,420 genes which gave a signal with the probes from all the strains used, 172 (12%) were regulated differently in at least one of the three mutants compared to the wild type, and 17 (1.2%) were regulated differently in all three mutant strains. Microarray data were verified by Northern analysis or quantitative real time PCR. Among the genes that are up- or down-regulated in the mutant strains are genes encoding the pheromone precursors, enzymes involved in melanin biosynthesis and a lectin-like protein. Analysis of gene expression in double mutants revealed a complex network of interaction between the pro gene products.
Genexpi: a toolset for identifying regulons and validating gene regulatory networks using time-course expression data.

PubMed

Modrák, Martin; Vohradský, Jiří

2018-04-13

Identifying regulons of sigma factors is a vital subtask of gene network inference. Integrating multiple sources of data is essential for correct identification of regulons and complete gene regulatory networks. Time series of expression data measured with microarrays or RNA-seq combined with static binding experiments (e.g., ChIP-seq) or literature mining may be used for inference of sigma factor regulatory networks. We introduce Genexpi: a tool to identify sigma factors by combining candidates obtained from ChIP experiments or literature mining with time-course gene expression data. While Genexpi can be used to infer other types of regulatory interactions, it was designed and validated on real biological data from bacterial regulons. In this paper, we put primary focus on CyGenexpi: a plugin integrating Genexpi with the Cytoscape software for ease of use. As a part of this effort, a plugin for handling time series data in Cytoscape called CyDataseries has been developed and made available. Genexpi is also available as a standalone command line tool and an R package. Genexpi is a useful part of the gene network inference toolbox. It provides meaningful information about the composition of regulons and delivers biologically interpretable results.
Large-Scale Gene-Centric Analysis Identifies Novel Variants for Coronary Artery Disease

PubMed Central

2011-01-01

Coronary artery disease (CAD) has a significant genetic contribution that is incompletely characterized. To complement genome-wide association (GWA) studies, we conducted a large and systematic candidate gene study of CAD susceptibility, including analysis of many uncommon and functional variants. We examined 49,094 genetic variants in ∼2,100 genes of cardiovascular relevance, using a customised gene array in 15,596 CAD cases and 34,992 controls (11,202 cases and 30,733 controls of European descent; 4,394 cases and 4,259 controls of South Asian origin). We attempted to replicate putative novel associations in an additional 17,121 CAD cases and 40,473 controls. Potential mechanisms through which the novel variants could affect CAD risk were explored through association tests with vascular risk factors and gene expression. We confirmed associations of several previously known CAD susceptibility loci (eg, 9p21.3:p<10^−33; LPA:p<10^−19; 1p13.3:p<10^−17) as well as three recently discovered loci (COL4A1/COL4A2, ZC3HC1, CYP17A1:p<5×10^−7). However, we found essentially null results for most previously suggested CAD candidate genes. In our replication study of 24 promising common variants, we identified novel associations of variants in or near LIPA, IL5, TRIB1, and ABCG5/ABCG8, with per-allele odds ratios for CAD risk with each of the novel variants ranging from 1.06–1.09. Associations with variants at LIPA, TRIB1, and ABCG5/ABCG8 were supported by gene expression data or effects on lipid levels. Apart from the previously reported variants in LPA, none of the other ∼4,500 low frequency and functional variants showed a strong effect. Associations in South Asians did not differ appreciably from those in Europeans, except for 9p21.3 (per-allele odds ratio: 1.14 versus 1.27 respectively; P for heterogeneity = 0.003). This large-scale gene-centric analysis has identified several novel genes for CAD that relate to diverse biochemical and cellular functions and
Large-scale gene-centric analysis identifies novel variants for coronary artery disease.

PubMed

2011-09-01

Coronary artery disease (CAD) has a significant genetic contribution that is incompletely characterized. To complement genome-wide association (GWA) studies, we conducted a large and systematic candidate gene study of CAD susceptibility, including analysis of many uncommon and functional variants. We examined 49,094 genetic variants in ∼2,100 genes of cardiovascular relevance, using a customised gene array in 15,596 CAD cases and 34,992 controls (11,202 cases and 30,733 controls of European descent; 4,394 cases and 4,259 controls of South Asian origin). We attempted to replicate putative novel associations in an additional 17,121 CAD cases and 40,473 controls. Potential mechanisms through which the novel variants could affect CAD risk were explored through association tests with vascular risk factors and gene expression. We confirmed associations of several previously known CAD susceptibility loci (eg, 9p21.3:p<10(-33); LPA:p<10(-19); 1p13.3:p<10(-17)) as well as three recently discovered loci (COL4A1/COL4A2, ZC3HC1, CYP17A1:p<5×10(-7)). However, we found essentially null results for most previously suggested CAD candidate genes. In our replication study of 24 promising common variants, we identified novel associations of variants in or near LIPA, IL5, TRIB1, and ABCG5/ABCG8, with per-allele odds ratios for CAD risk with each of the novel variants ranging from 1.06-1.09. Associations with variants at LIPA, TRIB1, and ABCG5/ABCG8 were supported by gene expression data or effects on lipid levels. Apart from the previously reported variants in LPA, none of the other ∼4,500 low frequency and functional variants showed a strong effect. Associations in South Asians did not differ appreciably from those in Europeans, except for 9p21.3 (per-allele odds ratio: 1.14 versus 1.27 respectively; P for heterogeneity = 0.003). This large-scale gene-centric analysis has identified several novel genes for CAD that relate to diverse biochemical and cellular functions and
ENU Mutagenesis in Mice Identifies Candidate Genes For Hypogonadism

PubMed Central

Weiss, Jeffrey; Hurley, Lisa A.; Harris, Rebecca M.; Finlayson, Courtney; Tong, Minghan; Fisher, Lisa A.; Moran, Jennifer L.; Beier, David R.; Mason, Christopher; Jameson, J. Larry

2012-01-01

Genome-wide mutagenesis was performed in mice to identify candidate genes for male infertility, for which the predominant causes remain idiopathic. Mice were mutagenized using N-ethyl-N-nitrosourea (ENU), bred, and screened for phenotypes associated with the male urogenital system. Fifteen heritable lines were isolated and chromosomal loci were assigned using low density genome-wide SNP arrays. Ten of the fifteen lines were pursued further using higher resolution SNP analysis to narrow the candidate gene regions. Exon sequencing of candidate genes identified mutations in mice with cystic kidneys (Bicc1), cryptorchidism (Rxfp2), restricted germ cell deficiency (Plk4), and severe germ cell deficiency (Prdm9). In two other lines with severe hypogonadism candidate sequencing failed to identify mutations, suggesting defects in genes with previously undocumented roles in gonadal function. These genomic intervals were sequenced in their entirety and a candidate mutation was identified in SnrpE in one of the two lines. The line harboring the SnrpE variant retains substantial spermatogenesis despite small testis size, an unusual phenotype. In addition to the reproductive defects, heritable phenotypes were observed in mice with ataxia (Myo5a), tremors (Pmp22), growth retardation (unknown gene), and hydrocephalus (unknown gene). These results demonstrate that the ENU screen is an effective tool for identifying potential causes of male infertility. PMID:22258617
SSER: Species specific essential reactions database.

PubMed

Labena, Abraham A; Ye, Yuan-Nong; Dong, Chuan; Zhang, Fa-Z; Guo, Feng-Biao

2017-04-19

Essential reactions are vital components of cellular networks. They are the foundations of synthetic biology and are potential candidate targets for antimetabolic drug design. In particular, if a single reaction is catalyzed by multiple enzymes, inhibiting the reaction would be a better option than targeting the enzymes or the corresponding enzyme-encoding genes. Existing databases such as BRENDA, BiGG, KEGG, Bio-models, Biosilico, and many others offer useful and comprehensive information on biochemical reactions, but none of them focuses specifically on essential reactions. Therefore, building a centralized repository for this class of reactions would be of great value. Here, we present a species-specific essential reactions database (SSER). The current version comprises essential biochemical and transport reactions of twenty-six organisms which are identified via flux balance analysis (FBA) combined with manual curation on experimentally validated metabolic network models. Quantitative data on the number of essential reactions, number of the essential reactions associated with their respective enzyme-encoding genes and shared essential reactions across organisms are the main contents of the database. SSER would be a prime source for essential reaction data and related gene and metabolite information, and it can significantly facilitate metabolic network model reconstruction and analysis, and drug target discovery studies. Users can browse, search, compare and download the essential reactions of organisms of their interest through the website http://cefg.uestc.edu.cn/sser.
LNDriver: identifying driver genes by integrating mutation and expression data based on gene-gene interaction network.

PubMed

Wei, Pi-Jing; Zhang, Di; Xia, Junfeng; Zheng, Chun-Hou

2016-12-23

Cancer is a complex disease which is characterized by the accumulation of genetic alterations during the patient's lifetime. With the development of next-generation sequencing technology, multiple omics data, such as cancer genomic, epigenomic and transcriptomic data, can be measured from each individual. Correspondingly, one of the key challenges is to pinpoint functional driver mutations or pathways, which contribute to tumorigenesis, from millions of functionally neutral passenger mutations. In this paper, in order to identify driver genes effectively, we applied a generalized additive model to mutation profiles to filter genes with long length and constructed a new gene-gene interaction network. Then we integrated the mutation data and expression data into the gene-gene interaction network. Lastly, a greedy algorithm was used to prioritize candidate driver genes from the integrated data. We named the proposed method Length-Net-Driver (LNDriver). Experiments on three TCGA datasets, i.e., head and neck squamous cell carcinoma, kidney renal clear cell carcinoma and thyroid carcinoma, demonstrated that the proposed method was effective. Also, it can identify not only frequently mutated drivers, but also rare candidate driver genes.
New genes often acquire male-specific functions but rarely become essential in Drosophila.

PubMed

Kondo, Shu; Vedanayagam, Jeffrey; Mohammed, Jaaved; Eizadshenass, Sogol; Kan, Lijuan; Pang, Nan; Aradhya, Rajaguru; Siepel, Adam; Steinhauer, Josefa; Lai, Eric C

2017-09-15

Relatively little is known about the in vivo functions of newly emerging genes, especially in metazoans. Although prior RNAi studies reported prevalent lethality among young gene knockdowns, our phylogenomic analyses reveal that young Drosophila genes are frequently restricted to the nonessential male reproductive system. We performed large-scale CRISPR/Cas9 mutagenesis of "conserved, essential" and "young, RNAi-lethal" genes and broadly confirmed the lethality of the former but the viability of the latter. Nevertheless, certain young gene mutants exhibit defective spermatogenesis and/or male sterility. Moreover, we detected widespread signatures of positive selection on young male-biased genes. Thus, young genes have a preferential impact on male reproductive system function. © 2017 Kondo et al.; Published by Cold Spring Harbor Laboratory Press.
Identifying key genes associated with acute myocardial infarction.

PubMed

Cheng, Ming; An, Shoukuan; Li, Junquan

2017-10-01

This study aimed to identify key genes associated with acute myocardial infarction (AMI) by reanalyzing microarray data. Three gene expression profile datasets GSE66360, GSE34198, and GSE48060 were downloaded from GEO database. After data preprocessing, genes without heterogeneity across different platforms were subjected to differential expression analysis between the AMI group and the control group using metaDE package. P < .05 was used as the cutoff for a differentially expressed gene (DEG). The expression data matrices of DEGs were imported in ReactomeFIViz to construct a gene functional interaction (FI) network. Then, DEGs in each module were subjected to pathway enrichment analysis using DAVID. MiRNAs and transcription factors predicted to regulate target DEGs were identified. Quantitative real-time polymerase chain reaction (RT-PCR) was applied to verify the expression of genes. A total of 913 upregulated genes and 1060 downregulated genes were identified in the AMI group. A FI network consists of 21 modules and DEGs in 12 modules were significantly enriched in pathways. The transcription factor-miRNA-gene network contains 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p. RT-PCR validations showed that expression levels of FOXO3 and MYBL2 were significantly increased in AMI, and expression levels of hsa-miR-21-5p and hsa-miR-30c-5p were obviously decreased in AMI. A total of 41 DEGs, such as SOCS3, VAPA, and COL5A2, are speculated to have roles in the pathogenesis of AMI; 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p may be involved in the regulation of the expression of these DEGs.
Dissecting the regulon of the two-component system CvsSR: Identifying new virulence genes in Pseudomonas syringae pv. tomato DC3000

USDA-ARS's Scientific Manuscript database

Recognition of environmental changes and regulation of genes that allow for adaption to those changes is essential for survival of bacteria. Two-component systems (TCSs) allow bacteria to sense and adapt to their environment. We previously identified the TCS CvsSR in the bacterial plant pathogen Pse...
The etiology of essential tremor: Genes versus environment.

PubMed

Hopfner, Franziska; Helmich, Rick C

2018-01-01

Essential tremor (ET) is characterized by bilateral upper limb action tremor. Here we review the pathophysiology (cerebral mechanisms) and etiology (genetic and environmental risk factors) of ET. We reviewed the literature (until June 2017) by searching PubMed for relevant papers. The pathophysiology of ET involves oscillatory activity in the cortico-olivo-cerebello-thalamic circuit, evidenced by electrophysiological and metabolic imaging. Possible underlying mechanisms include GABA-ergic dysfunction, cerebellar neurodegeneration, olivary dysfunction, or a combination. Genetic studies have examined affected ET families (linkage studies and whole-exome sequencing studies). These studies revealed several chromosomal regions and genes associated with ET, but the findings have not been replicated across different ET families. Genetic studies also assessed the sporadic occurrence of ET using genome-wide genotyping of single nucleotide polymorphisms (SNPs) and candidate gene studies. Several SNPs are associated with ET, and this has been replicated across different cohorts. Interestingly, some of the involved genes are linked to the cerebellum and inferior olive. Environmental studies point to an association between ET and beta-carboline alkaloids (such as harmane), which have been found in the cerebellum. Genetic and environmental risk factors may influence cerebellar and/or olivary function, resulting in abnormal cortico-olivo-cerebello-thalamic activity, and ultimately ET. Copyright © 2017 Elsevier Ltd. All rights reserved.
Network-Based Method for Identifying Co-Regeneration Genes in Bone, Dentin, Nerve and Vessel Tissues

PubMed Central

Pan, Hongying; Zhang, Yu-Hang; Feng, Kaiyan; Kong, XiangYin; Cai, Yu-Dong

2017-01-01

Bone and dental diseases are serious public health problems. Most current clinical treatments for these diseases can produce side effects. Regeneration is a promising therapy for bone and dental diseases, yielding natural tissue recovery with few side effects. Because soft tissues inside the bone and dentin are densely populated with nerves and vessels, the study of bone and dentin regeneration should also consider the co-regeneration of nerves and vessels. In this study, a network-based method to identify co-regeneration genes for bone, dentin, nerve and vessel was constructed based on an extensive network of protein–protein interactions. Three procedures were applied in the network-based method. The first procedure, searching, sought the shortest paths connecting regeneration genes of one tissue type with regeneration genes of other tissues, thereby extracting possible co-regeneration genes. The second procedure, testing, employed a permutation test to evaluate whether possible genes were false discoveries; these genes were excluded by the testing procedure. The last procedure, screening, employed two rules, the betweenness ratio rule and interaction score rule, to select the most essential genes. A total of seventeen genes were inferred by the method, which were deemed to contribute to co-regeneration of at least two tissues. All these seventeen genes were extensively discussed to validate the utility of the method. PMID:28974058
Network-Based Method for Identifying Co-Regeneration Genes in Bone, Dentin, Nerve and Vessel Tissues.

PubMed

Chen, Lei; Pan, Hongying; Zhang, Yu-Hang; Feng, Kaiyan; Kong, XiangYin; Huang, Tao; Cai, Yu-Dong

2017-10-02

Bone and dental diseases are serious public health problems. Most current clinical treatments for these diseases can produce side effects. Regeneration is a promising therapy for bone and dental diseases, yielding natural tissue recovery with few side effects. Because soft tissues inside the bone and dentin are densely populated with nerves and vessels, the study of bone and dentin regeneration should also consider the co-regeneration of nerves and vessels. In this study, a network-based method to identify co-regeneration genes for bone, dentin, nerve and vessel was constructed based on an extensive network of protein-protein interactions. Three procedures were applied in the network-based method. The first procedure, searching, sought the shortest paths connecting regeneration genes of one tissue type with regeneration genes of other tissues, thereby extracting possible co-regeneration genes. The second procedure, testing, employed a permutation test to evaluate whether possible genes were false discoveries; these genes were excluded by the testing procedure. The last procedure, screening, employed two rules, the betweenness ratio rule and interaction score rule, to select the most essential genes. A total of seventeen genes were inferred by the method, which were deemed to contribute to co-regeneration of at least two tissues. All these seventeen genes were extensively discussed to validate the utility of the method.
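The first procedure in this abstract (searching) can be sketched with a breadth-first search over an unweighted interaction network. The example assumes an undirected graph, keeps only one shortest path per gene pair (ties broken alphabetically), and uses placeholder gene names. The edge weighting, permutation test, and screening rules are not specified in the abstract and are not reproduced here.

```python
from collections import deque


def undirected(edges):
    """Build an adjacency map from a list of (u, v) interaction pairs."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def shortest_path(adj, source, target):
    """One shortest path from source to target by breadth-first search, or None."""
    prev = {source: None}
    queue = deque([source])
    while queue:
        v = queue.popleft()
        if v == target:
            path = []
            while v is not None:
                path.append(v)
                v = prev[v]
            return path[::-1]
        for w in sorted(adj.get(v, ())):
            if w not in prev:
                prev[w] = v
                queue.append(w)
    return None


def candidate_genes(adj, genes_a, genes_b):
    """Interior nodes of shortest paths linking the two known gene sets."""
    known = set(genes_a) | set(genes_b)
    candidates = set()
    for a in genes_a:
        for b in genes_b:
            path = shortest_path(adj, a, b)
            if path:
                candidates.update(g for g in path[1:-1] if g not in known)
    return candidates


# Placeholder network: "x" bridges the two tissues directly, "y" and "z" form a longer detour.
network = undirected([
    ("bone1", "x"), ("x", "nerve1"),
    ("bone1", "y"), ("y", "z"), ("z", "nerve1"),
    ("bone2", "x"),
])
candidates = candidate_genes(network, ["bone1", "bone2"], ["nerve1"])
```

Only the bridging node on the shortest routes is kept; in the described method such candidates would then go through the testing and screening procedures.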
Transcriptome profiling of two maize inbreds with distinct responses to Gibberella ear rot disease to identify candidate resistance genes.

PubMed

Kebede, Aida Z; Johnston, Anne; Schneiderman, Danielle; Bosnich, Whynn; Harris, Linda J

2018-02-09

Gibberella ear rot (GER) is one of the most economically important fungal diseases of maize in the temperate zone due to moldy grain contaminated with health threatening mycotoxins. To develop resistant genotypes and control the disease, understanding the host-pathogen interaction is essential. RNA-Seq-derived transcriptome profiles of fungal- and mock-inoculated developing kernel tissues of two maize inbred lines were used to identify differentially expressed transcripts and propose candidate genes mapping within GER resistance quantitative trait loci (QTL). A total of 1255 transcripts were significantly (P ≤ 0.05) up regulated due to fungal infection in both susceptible and resistant inbreds. A greater number of transcripts were up regulated in the former (1174) than the latter (497) and increased as the infection progressed from 1 to 2 days after inoculation. Focusing on differentially expressed genes located within QTL regions for GER resistance, we identified 81 genes involved in membrane transport, hormone regulation, cell wall modification, cell detoxification, and biosynthesis of pathogenesis related proteins and phytoalexins as candidate genes contributing to resistance. Applying droplet digital PCR, we validated the expression profiles of a subset of these candidate genes from QTL regions contributed by the resistant inbred on chromosomes 1, 2 and 9. By screening global gene expression profiles for differentially expressed genes mapping within resistance QTL regions, we have identified candidate genes for gibberella ear rot resistance on several maize chromosomes which could potentially lead to a better understanding of Fusarium resistance mechanisms.
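The final filtering step in this abstract, keeping differentially expressed genes that map within resistance QTL regions, amounts to an interval-overlap check. The sketch below assumes genes and QTL are given as chromosome plus start and end coordinates; all names and coordinates are invented for illustration.

```python
def genes_in_qtl(genes, qtls):
    """Names of genes whose interval overlaps any QTL interval on the same chromosome.

    genes: list of (name, chromosome, start, end)
    qtls:  list of (chromosome, start, end)
    """
    hits = []
    for name, chrom, start, end in genes:
        if any(chrom == q_chrom and start <= q_end and end >= q_start
               for q_chrom, q_start, q_end in qtls):
            hits.append(name)
    return hits


# Hypothetical coordinates for illustration only.
deg_positions = [
    ("geneA", "chr1", 100, 200),
    ("geneB", "chr1", 5000, 5200),
    ("geneC", "chr2", 150, 300),
]
qtl_regions = [("chr1", 150, 1000), ("chr9", 0, 10000)]
qtl_candidates = genes_in_qtl(deg_positions, qtl_regions)
```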
Functional characterization of MAT1-1-specific mating-type genes in the homothallic ascomycete Sordaria macrospora provides new insights into essential and nonessential sexual regulators.

PubMed

Klix, V; Nowrousian, M; Ringelberg, C; Loros, J J; Dunlap, J C; Pöggeler, S

2010-06-01

Mating-type genes in fungi encode regulators of mating and sexual development. Heterothallic ascomycete species require different sets of mating-type genes to control nonself-recognition and mating of compatible partners of different mating types. Homothallic (self-fertile) species also carry mating-type genes in their genome that are essential for sexual development. To analyze the molecular basis of homothallism and the role of mating-type genes during fruiting-body development, we deleted each of the three genes, SmtA-1 (MAT1-1-1), SmtA-2 (MAT1-1-2), and SmtA-3 (MAT1-1-3), contained in the MAT1-1 part of the mating-type locus of the homothallic ascomycete species Sordaria macrospora. Phenotypic analysis of deletion mutants revealed that the PPF domain protein-encoding gene SmtA-2 is essential for sexual reproduction, whereas the alpha domain protein-encoding genes SmtA-1 and SmtA-3 play no role in fruiting-body development. By means of cross-species microarray analysis using Neurospora crassa oligonucleotide microarrays hybridized with S. macrospora targets and quantitative real-time PCR, we identified genes expressed under the control of SmtA-1 and SmtA-2. Both genes are involved in the regulation of gene expression, including that of pheromone genes.
Transcriptional profiling identifies differentially expressed genes in developing turkey skeletal muscle

PubMed Central

2011-01-01

Background Skeletal muscle growth and development from embryo to adult consists of a series of carefully regulated changes in gene expression. Understanding these developmental changes in agriculturally important species is essential to the production of high quality meat products. For example, consumer demand for lean, inexpensive meat products has driven the turkey industry to unprecedented production through intensive genetic selection. However, achievements of increased body weight and muscle mass have been countered by an increased incidence of myopathies and meat quality defects. In a previous study, we developed and validated a turkey skeletal muscle-specific microarray as a tool for functional genomics studies. The goals of the current study were to utilize this microarray to elucidate functional pathways of genes responsible for key events in turkey skeletal muscle development and to compare differences in gene expression between two genetic lines of turkeys. To achieve these goals, skeletal muscle samples were collected at three critical stages in muscle development: 18d embryo (hyperplasia), 1d post-hatch (shift from myoblast-mediated growth to satellite cell-modulated growth by hypertrophy), and 16wk (market age) from two genetic lines: a randombred control line (RBC2) maintained without selection pressure, and a line (F) selected from the RBC2 line for increased 16wk body weight. Array hybridizations were performed in two experiments: Experiment 1 directly compared the developmental stages within genetic line, while Experiment 2 directly compared the two lines within each developmental stage. Results A total of 3474 genes were differentially expressed (false discovery rate; FDR < 0.001) by overall effect of development, while 16 genes were differentially expressed (FDR < 0.10) by overall effect of genetic line. Ingenuity Pathways Analysis was used to group annotated genes into networks, functions, and canonical pathways. The expression of 28 genes
RAD25 (SSL2), the yeast homolog of the human xeroderma pigmentosum group B DNA repair gene, is essential for viability

DOE Office of Scientific and Technical Information (OSTI.GOV)

Park, E.; Prakash, L.; Guzder, S.N.

1992-12-01

Xeroderma pigmentosum (XP) patients are extremely sensitive to ultraviolet (UV) light and suffer from a high incidence of skin cancers, due to a defect in nucleotide excision repair. The disease is genetically heterogeneous, and seven complementation groups, A-G, have been identified. Homologs of human excision repair genes ERCC1, XPDC/ERCC2, and XPAC have been identified in the yeast Saccharomyces cerevisiae. Since no homolog of human XPBC/ERCC3 existed among the known yeast genes, we cloned the yeast homolog by using XPBC cDNA as a hybridization probe. The yeast homolog, RAD25 (SSL2), encodes a protein of 843 amino acids (M_r 95,356). The RAD25 (SSL2)- and XPBC-encoded proteins share 55% identical and 72% conserved amino acid residues, and the two proteins resemble one another in containing the conserved DNA helicase sequence motifs. A nonsense mutation at codon 799 that deletes the 45 C-terminal amino acid residues in RAD25 (SSL2) confers UV sensitivity. This mutation shows epistasis with genes in the excision repair group, whereas a synergistic increase in UV sensitivity occurs when it is combined with mutations in genes in other DNA repair pathways, indicating that RAD25 (SSL2) functions in excision repair but not in other repair pathways. We also show that RAD25 (SSL2) is an essential gene. A mutation of the Lys392 residue to arginine in the conserved Walker type A nucleotide-binding motif is lethal, suggesting an essential role of the putative RAD25 (SSL2) ATPase/DNA helicase activity in viability. 40 refs., 3 figs., 1 tab.

Phylogenomic analysis of UDP glycosyltransferase 1 multigene family in Linum usitatissimum identified genes with varied expression patterns.

PubMed

Barvkar, Vitthal T; Pardeshi, Varsha C; Kale, Sandip M; Kadoo, Narendra Y; Gupta, Vidya S

2012-05-08

The glycosylation process, catalyzed by ubiquitous glycosyltransferase (GT) family enzymes, is a prevalent modification of plant secondary metabolites that regulates various functions such as hormone homeostasis, detoxification of xenobiotics and biosynthesis and storage of secondary metabolites. Flax (Linum usitatissimum L.) is a commercially grown oilseed crop, important because of its essential fatty acids and health promoting lignans. Identification and characterization of UDP glycosyltransferase (UGT) genes from flax could provide valuable basic information about this important gene family and help to explain the seed specific glycosylated metabolite accumulation and other processes in plants. Plant genome sequencing projects are useful to discover complexity within this gene family and also pave way for the development of functional genomics approaches. Taking advantage of the newly assembled draft genome sequence of flax, we identified 137 UDP glycosyltransferase (UGT) genes from flax using a conserved signature motif. Phylogenetic analysis of these protein sequences clustered them into 14 major groups (A-N). Expression patterns of these genes were investigated using publicly available expressed sequence tag (EST), microarray data and reverse transcription quantitative real time PCR (RT-qPCR). Seventy-three per cent of these genes (100 out of 137) showed expression evidence in 15 tissues examined and indicated varied expression profiles. The RT-qPCR results of 10 selected genes were also coherent with the digital expression analysis. Interestingly, five duplicated UGT genes were identified, which showed differential expression in various tissues. Of the seven intron loss/gain positions detected, two intron positions were conserved among most of the UGTs, although a clear relationship about the evolution of these genes could not be established. Comparison of the flax UGTs with orthologs from four other sequenced dicot genomes indicated that seven UGTs were
Essential pitfalls in "essential" tremor

PubMed Central

Espay, AJ; Lang, AE; Erro, R; Merola, A; Fasano, A; Berardelli, A; Bhatia, KP

2016-01-01

While essential tremor has been considered the most common movement disorder, it has largely remained a diagnosis of exclusion: many tremor and non-tremor features must be absent for the clinical diagnosis to stand. The clinical features of “essential tremor” overlap with or may be part of other tremor disorders and, not surprisingly, this prevalent familial disorder has remained without a gene identified, without a consistent natural history, and without an acceptable pathology or pathophysiologic underpinning. The collective evidence suggests that under the rubric of essential tremor there exist multiple unique diseases, some of which represent cerebellar dysfunction, but for which there is no intrinsic “essence” other than a common oscillatory behavior on posture and action. One approach may be to use the term “essential tremor” only as a transitional node in the deep phenotyping of tremor disorders based on historical, phenomenological, and neurophysiological features, to facilitate its etiologic diagnosis or serve for future gene- and biomarker-discovery efforts. This approach deemphasizes essential tremor as a diagnostic entity and facilitates the understanding of the underlying disorders in order to develop biologically tailored diagnostic and therapeutic strategies. PMID:28116753
A new computational strategy for identifying essential proteins based on network topological properties and biological information.

PubMed

Qin, Chao; Sun, Yongqi; Dong, Yadong

2017-01-01

Essential proteins are the proteins that are indispensable to the survival and development of an organism. Deleting a single essential protein will cause lethality or infertility. Identifying and analysing essential proteins are key to understanding the molecular mechanisms of living cells. There are two types of methods for predicting essential proteins: experimental methods, which require considerable time and resources, and computational methods, which overcome the shortcomings of experimental methods. However, the prediction accuracy of computational methods for essential proteins requires further improvement. In this paper, we propose a new computational strategy named CoTB for identifying essential proteins based on a combination of topological properties, subcellular localization information and orthologous protein information. First, we introduce several topological properties of the protein-protein interaction (PPI) network. Second, we propose new methods for measuring orthologous information and subcellular localization and a new computational strategy that uses a random forest prediction model to obtain a probability score for the proteins being essential. Finally, we conduct experiments on four different Saccharomyces cerevisiae datasets. The experimental results demonstrate that our strategy for identifying essential proteins outperforms traditional computational methods and the most recently developed method, SON. In particular, our strategy improves the prediction accuracy to 89, 78, 79, and 85 percent on the YDIP, YMIPS, YMBD and YHQ datasets at the top 100 level, respectively.
Identifying a gene expression signature of cluster headache in blood

PubMed Central

Eising, Else; Pelzer, Nadine; Vijfhuizen, Lisanne S.; Vries, Boukje de; Ferrari, Michel D.; 't Hoen, Peter A. C.; Terwindt, Gisela M.; van den Maagdenberg, Arn M. J. M.

2017-01-01

Cluster headache is a relatively rare headache disorder, typically characterized by multiple daily, short-lasting attacks of excruciating, unilateral (peri-)orbital or temporal pain associated with autonomic symptoms and restlessness. To better understand the pathophysiology of cluster headache, we used RNA sequencing to identify differentially expressed genes and pathways in whole blood of patients with episodic (n = 19) or chronic (n = 20) cluster headache in comparison with headache-free controls (n = 20). Gene expression data were analysed by gene and by module of co-expressed genes with particular attention to previously implicated disease pathways including hypocretin dysregulation. Only moderate gene expression differences were identified and no associations were found with previously reported pathogenic mechanisms. At the level of functional gene sets, associations were observed for genes involved in several brain-related mechanisms such as GABA receptor function and voltage-gated channels. In addition, genes and modules of co-expressed genes showed a role for intracellular signalling cascades, mitochondria and inflammation. Although larger study samples may be required to identify the full range of involved pathways, these results indicate a role for mitochondria, intracellular signalling and inflammation in cluster headache. PMID:28074859
Integron associated mobile genes: Just a collection of plug in apps or essential components of cell network hardware?

PubMed

Labbate, Maurizio; Boucher, Yan; Luu, Ivan; Chowdhury, Piklu Roy; Stokes, H W

2012-01-01

Lateral gene transfer (LGT) impacts on the evolution of prokaryotes in both the short and long-term. The short-term impacts of mobilized genes are a concern to humans since LGT explains the global rise of multi drug resistant pathogens seen in the past 70 years. However, LGT has been a feature of prokaryotes from the earliest days of their existence and the concept of a bifurcating tree of life is not entirely applicable to prokaryotes since most genes in extant prokaryotic genomes have probably been acquired from other lineages. Successful transfer and maintenance of a gene in a new host is understandable if it acts independently of cell networks and confers an advantage. Antibiotic resistance provides an example of this whereby a gene can be advantageous in virtually any cell across broad species backgrounds. In a longer evolutionary context however laterally transferred genes can be assimilated into even essential cell networks. How this happens is not well understood and we discuss recent work that identifies a mobile gene, unique to a cell lineage, which is detrimental to the cell when lost. We also present some additional data and believe our emerging model will be helpful in understanding how mobile genes integrate into cell networks.
Identifying key genes associated with acute myocardial infarction

PubMed Central

Cheng, Ming; An, Shoukuan; Li, Junquan

2017-01-01

Abstract Background: This study aimed to identify key genes associated with acute myocardial infarction (AMI) by reanalyzing microarray data. Methods: Three gene expression profile datasets GSE66360, GSE34198, and GSE48060 were downloaded from GEO database. After data preprocessing, genes without heterogeneity across different platforms were subjected to differential expression analysis between the AMI group and the control group using metaDE package. P < .05 was used as the cutoff for a differentially expressed gene (DEG). The expression data matrices of DEGs were imported in ReactomeFIViz to construct a gene functional interaction (FI) network. Then, DEGs in each module were subjected to pathway enrichment analysis using DAVID. MiRNAs and transcription factors predicted to regulate target DEGs were identified. Quantitative real-time polymerase chain reaction (RT-PCR) was applied to verify the expression of genes. Result: A total of 913 upregulated genes and 1060 downregulated genes were identified in the AMI group. A FI network consists of 21 modules and DEGs in 12 modules were significantly enriched in pathways. The transcription factor-miRNA-gene network contains 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p. RT-PCR validations showed that expression levels of FOXO3 and MYBL2 were significantly increased in AMI, and expression levels of hsa-miR-21-5p and hsa-miR-30c-5p were obviously decreased in AMI. Conclusion: A total of 41 DEGs, such as SOCS3, VAPA, and COL5A2, are speculated to have roles in the pathogenesis of AMI; 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p may be involved in the regulation of the expression of these DEGs. PMID:29049183
Gene-based rare allele analysis identified a risk gene of Alzheimer's disease.

PubMed

Kim, Jong Hun; Song, Pamela; Lim, Hyunsun; Lee, Jae-Hyung; Lee, Jun Hong; Park, Sun Ah

2014-01-01

Alzheimer's disease (AD) has a strong propensity to run in families. However, the known risk genes excluding APOE are not clinically useful. In various complex diseases, gene studies have targeted rare alleles for unsolved heritability. Our study aims to elucidate previously unknown risk genes for AD by targeting rare alleles. We used data from five publicly available genetic studies from the Alzheimer's Disease Neuroimaging Initiative (ADNI) and the database of Genotypes and Phenotypes (dbGaP). A total of 4,171 cases and 9,358 controls were included. The genotype information of rare alleles was imputed using 1000 Genomes data. We performed gene-based analysis of rare alleles (minor allele frequency ≤ 3%). The genome-wide significance level was defined as meta P < 1.8×10^-6 (0.05/number of genes in human genome = 0.05/28,517). ZNF628, which is located at chromosome 19q13.42, showed a genome-wide significant association with AD. The association of ZNF628 with AD was not dependent on APOE ε4. APOE and TREM2 were also significantly associated with AD, although not at genome-wide significance levels. Other genes identified by targeting common alleles could not be replicated in our gene-based rare allele analysis. We identified that rare variants in ZNF628 are associated with AD. The protein encoded by ZNF628 is known as a transcription factor. Furthermore, the associations of APOE and TREM2 with AD were highly significant, even in gene-based rare allele analysis, which implies that further deep sequencing of these genes is required in AD heritability studies.
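The genome-wide significance level quoted above is a Bonferroni-style correction: the nominal alpha divided by the number of genes tested. A minimal sketch of that arithmetic, using the two figures given in the abstract (the function name is illustrative, not from the study):

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance level after Bonferroni correction."""
    return alpha / n_tests


# Values taken from the abstract: alpha = 0.05, 28,517 genes.
threshold = bonferroni_threshold(0.05, 28517)
print(f"{threshold:.2e}")
```

The result rounds to the 1.8×10^-6 stated in the abstract when reported to two significant figures.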
Essentiality of threonylcarbamoyladenosine (t6A), a universal tRNA modification, in bacteria

PubMed Central

Thiaville, Patrick C.; Yacoubi, Basma El; Köhrer, Caroline; Thiaville, Jennifer J.; Deutsch, Chris; Iwata-Reuyl, Dirk; Bacusmo, Jo Marie; Armengaud, Jean; Bessho, Yoshitaka; Wetzel, Collin; Cao, Xiaoyu; Limbach, Patrick A.; RajBhandary, Uttam L.; de Crécy-Lagard, Valérie

2016-01-01

Threonylcarbamoyladenosine (t6A) is a modified nucleoside universally conserved in tRNAs in all three kingdoms of life. The recently discovered genes for t6A synthesis, including tsaC and tsaD, are essential in model prokaryotes but not essential in yeast. These genes had been identified as antibacterial targets even before their functions were known. However, the molecular basis for this prokaryotic-specific essentiality has remained a mystery. Here, we show that t6A is a strong positive determinant for aminoacylation of tRNA by bacterial-type but not by eukaryotic-type isoleucyl-tRNA synthetases and might also be a determinant for the essential enzyme tRNAIle-lysidine synthetase. We confirm that t6A is essential in Escherichia coli and a survey of genome-wide essentiality studies shows that genes for t6A synthesis are essential in most prokaryotes. This essentiality phenotype is not universal in Bacteria as t6A is dispensable in Deinococcus radiodurans, Thermus thermophilus, Synechocystis PCC6803 and Streptococcus mutans. Proteomic analysis of t6A- D. radiodurans strains revealed an induction of the proteotoxic stress response and identified genes whose translation is most affected by the absence of t6A in tRNAs. Thus, although t6A is universally conserved in tRNAs, its role in translation might vary greatly between organisms. PMID:26337258
Gene expression meta-analysis identifies chromosomal regions and candidate genes involved in breast cancer metastasis.

PubMed

Thomassen, Mads; Tan, Qihua; Kruse, Torben A

2009-01-01

Breast cancer cells exhibit complex karyotypic alterations causing deregulation of numerous genes. Some of these genes are probably causal for cancer formation and local growth whereas others are causal for the various steps of metastasis. In a fraction of tumors deregulation of the same genes might be caused by epigenetic modulations, point mutations or the influence of other genes. We have investigated the relation of gene expression and chromosomal position, using eight datasets including more than 1200 breast tumors, to identify chromosomal regions and candidate genes possibly causal for breast cancer metastasis. By use of "Gene Set Enrichment Analysis" we have ranked chromosomal regions according to their relation to metastasis. Overrepresentation analysis identified regions with increased expression for chromosome 1q41-42, 8q24, 12q14, 16q22, 16q24, 17q12-21.2, 17q21-23, 17q25, 20q11, and 20q13 among metastasizing tumors and reduced gene expression at 1p31-21, 8p22-21, and 14q24. By analysis of genes with extremely imbalanced expression in these regions we identified DIRAS3 at 1p31, PSD3, LPL, EPHX2 at 8p21-22, and FOS at 14q24 as candidate metastasis suppressor genes. Potential metastasis promoting genes include RECQL4 at 8q24, PRMT7 at 16q22, GINS2 at 16q24, and AURKA at 20q13.
Functional Characterization of MAT1-1-Specific Mating-Type Genes in the Homothallic Ascomycete Sordaria macrospora Provides New Insights into Essential and Nonessential Sexual Regulators

PubMed Central

Klix, V.; Nowrousian, M.; Ringelberg, C.; Loros, J. J.; Dunlap, J. C.; Pöggeler, S.

2010-01-01

Mating-type genes in fungi encode regulators of mating and sexual development. Heterothallic ascomycete species require different sets of mating-type genes to control nonself-recognition and mating of compatible partners of different mating types. Homothallic (self-fertile) species also carry mating-type genes in their genome that are essential for sexual development. To analyze the molecular basis of homothallism and the role of mating-type genes during fruiting-body development, we deleted each of the three genes, SmtA-1 (MAT1-1-1), SmtA-2 (MAT1-1-2), and SmtA-3 (MAT1-1-3), contained in the MAT1-1 part of the mating-type locus of the homothallic ascomycete species Sordaria macrospora. Phenotypic analysis of deletion mutants revealed that the PPF domain protein-encoding gene SmtA-2 is essential for sexual reproduction, whereas the α domain protein-encoding genes SmtA-1 and SmtA-3 play no role in fruiting-body development. By means of cross-species microarray analysis using Neurospora crassa oligonucleotide microarrays hybridized with S. macrospora targets and quantitative real-time PCR, we identified genes expressed under the control of SmtA-1 and SmtA-2. Both genes are involved in the regulation of gene expression, including that of pheromone genes. PMID:20435701
Identifying RNA splicing factors using IFT genes in Chlamydomonas reinhardtii.

PubMed

Lin, Huawen; Zhang, Zhengyan; Iomini, Carlo; Dutcher, Susan K

2018-03-01

Intraflagellar transport moves proteins in and out of flagella/cilia and it is essential for the assembly of these organelles. Using whole-genome sequencing, we identified splice site mutations in two IFT genes, IFT81 (fla9) and IFT121 (ift121-2), which lead to flagellar assembly defects in the unicellular green alga Chlamydomonas reinhardtii. The splicing defects in these ift mutants are partially corrected by mutations in two conserved spliceosome proteins, DGR14 and FRA10. We identified a dgr14 deletion mutant, which suppresses the 3' splice site mutation in IFT81, and a frameshift mutant of FRA10, which suppresses the 5' splice site mutation in IFT121. Surprisingly, we found dgr14-1 and fra10 mutations suppress both splice site mutations. We suggest these two proteins are involved in facilitating splice site recognition/interaction; in their absence some splice site mutations are tolerated. Nonsense mutations in SMG1, which is involved in nonsense-mediated decay, lead to accumulation of aberrant transcripts and partial restoration of flagellar assembly in the ift mutants. The high density of introns and the conservation of noncore splicing factors, together with the ease of scoring the ift mutant phenotype, make Chlamydomonas an attractive organism to identify new proteins involved in splicing through suppressor screening. © 2018 The Authors.
Construction of a Bacterial Cell that Contains Only the Set of Essential Genes Necessary to Impart Life

DTIC Science & Technology

2014-05-16

native uncharacterized genes for characterized genes from Bacillus subtilis, that is presented in a constitutive expression module. If the B... subtilis gene containing M. mycoides mutant is viable then the function of the conserved hypothetical gene is the same as the input B. subtilis gene...Characterized genes from B. subtilis were swapped with similar, but not so similar as to be clearly the same, essential genes from M. mycoides. The B. subtilis
Cas9 Nickase-Assisted RNA Repression Enables Stable and Efficient Manipulation of Essential Metabolic Genes in Clostridium cellulolyticum.

PubMed

Xu, Tao; Li, Yongchao; He, Zhili; Van Nostrand, Joy D; Zhou, Jizhong

2017-01-01

Essential gene functions remain largely underexplored in bacteria. Clostridium cellulolyticum is a promising candidate for consolidated bioprocessing; however, its genetic manipulation to reduce the formation of less-valuable acetate is technically challenging due to the essentiality of acetate-producing genes. Here we developed a Cas9 nickase-assisted chromosome-based RNA repression to stably manipulate essential genes in C. cellulolyticum. Our plasmid-based expression of antisense RNA (asRNA) molecules targeting the phosphotransacetylase (pta) gene successfully reduced the enzymatic activity by 35% in cellobiose-grown cells, metabolically decreased the acetate titer by 15 and 52% in wildtype transformants on cellulose and xylan, respectively. To control both acetate and lactate simultaneously, we transformed the repression plasmid into lactate production-deficient mutant and found the plasmid delivery reduced acetate titer by more than 33%, concomitant with negligible lactate formation. The strains with pta gene repression generally diverted more carbon into ethanol. However, further testing on chromosomal integrants that were created by double-crossover recombination exhibited only very weak repression because DNA integration dramatically lessened gene dosage. With the design of a tandem repetitive promoter-driven asRNA module and the use of a new Cas9 nickase genome editing tool, a chromosomal integrant (LM3P) was generated in a single step and successfully enhanced RNA repression, with a 27% decrease in acetate titer on cellulose in antibiotic-free medium. These results indicate the effectiveness of tandem promoter-driven RNA repression modules in promoting gene repression in chromosomal integrants. Our combinatorial method using a Cas9 nickase genome editing tool to integrate the gene repression module demonstrates easy-to-use and high-efficiency advantages, paving the way for stably manipulating genes, even essential ones, for functional characterization
Cas9 Nickase-Assisted RNA Repression Enables Stable and Efficient Manipulation of Essential Metabolic Genes in Clostridium cellulolyticum

PubMed Central

Xu, Tao; Li, Yongchao; He, Zhili; Van Nostrand, Joy D.; Zhou, Jizhong

2017-01-01

Essential gene functions remain largely underexplored in bacteria. Clostridium cellulolyticum is a promising candidate for consolidated bioprocessing; however, its genetic manipulation to reduce the formation of less-valuable acetate is technically challenging due to the essentiality of acetate-producing genes. Here we developed a Cas9 nickase-assisted chromosome-based RNA repression to stably manipulate essential genes in C. cellulolyticum. Our plasmid-based expression of antisense RNA (asRNA) molecules targeting the phosphotransacetylase (pta) gene successfully reduced the enzymatic activity by 35% in cellobiose-grown cells, metabolically decreased the acetate titer by 15 and 52% in wildtype transformants on cellulose and xylan, respectively. To control both acetate and lactate simultaneously, we transformed the repression plasmid into lactate production-deficient mutant and found the plasmid delivery reduced acetate titer by more than 33%, concomitant with negligible lactate formation. The strains with pta gene repression generally diverted more carbon into ethanol. However, further testing on chromosomal integrants that were created by double-crossover recombination exhibited only very weak repression because DNA integration dramatically lessened gene dosage. With the design of a tandem repetitive promoter-driven asRNA module and the use of a new Cas9 nickase genome editing tool, a chromosomal integrant (LM3P) was generated in a single step and successfully enhanced RNA repression, with a 27% decrease in acetate titer on cellulose in antibiotic-free medium. These results indicate the effectiveness of tandem promoter-driven RNA repression modules in promoting gene repression in chromosomal integrants. Our combinatorial method using a Cas9 nickase genome editing tool to integrate the gene repression module demonstrates easy-to-use and high-efficiency advantages, paving the way for stably manipulating genes, even essential ones, for functional characterization and
A gene-trap strategy identifies quiescence-induced genes in synchronized myoblasts.

PubMed

Sambasivan, Ramkumar; Pavlath, Grace K; Dhawan, Jyotsna

2008-03-01

Cellular quiescence is characterized not only by reduced mitotic and metabolic activity but also by altered gene expression. Growing evidence suggests that quiescence is not merely a basal state but is regulated by active mechanisms. To understand the molecular programme that governs reversible cell cycle exit, we focused on quiescence-related gene expression in a culture model of myogenic cell arrest and activation. Here we report the identification of quiescence-induced genes using a gene-trap strategy. Using a retroviral vector, we generated a library of gene traps in C2C12 myoblasts that were screened for arrest-induced insertions by live cell sorting (FACS-gal). Several independent gene-trap lines revealed arrest-dependent induction of β-gal activity, confirming the efficacy of the FACS screen. The locus of integration was identified in 15 lines. In three lines, insertion occurred in genes previously implicated in the control of quiescence, i.e. EMSY (a BRCA2-interacting protein), p8/com1 (a p300 HAT-binding protein) and MLL5 (a SET domain protein). Our results demonstrate that expression of chromatin modulatory genes is induced in G0, providing support to the notion that this reversibly arrested state is actively regulated.
Construction of a Bacterial Cell that Contains Only the Set of Essential Genes Necessary to Impart Life

DTIC Science & Technology

2014-08-15

characterized genes from Bacillus subtilis, that is presented in a constitutive expression module. If the B. subtilis gene containing M. mycoides mutant is...essential gene MMYC_0361 with the rlmH gene from Bacillus subtilis. Mycoplasma mycoides containing the B. subtilis rlmH was viable. This tells us the...viable then the function of the conserved hypothetical gene is the same as the input B. subtilis gene. Table of Contents: Section
A P-Norm Robust Feature Extraction Method for Identifying Differentially Expressed Genes

PubMed Central

Liu, Jian; Liu, Jin-Xing; Gao, Ying-Lian; Kong, Xiang-Zhen; Wang, Xue-Song; Wang, Dong

2015-01-01

In current molecular biology, it becomes more and more important to identify differentially expressed genes closely correlated with a key biological process from gene expression data. In this paper, based on the Schatten p-norm and Lp-norm, a novel p-norm robust feature extraction method is proposed to identify the differentially expressed genes. In our method, the Schatten p-norm is used as the regularization function to obtain a low-rank matrix and the Lp-norm is taken as the error function to improve the robustness to outliers in the gene expression data. The results on simulation data show that our method can obtain higher identification accuracies than the competitive methods. Numerous experiments on real gene expression data sets demonstrate that our method can identify more differentially expressed genes than the others. Moreover, we confirmed that the identified genes are closely correlated with the corresponding gene expression data. PMID:26201006
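For reference, the two norms named in the abstract have the following standard definitions for a matrix X of size m × n with singular values σ_i(X), and an error matrix E with entries e_ij. These are the textbook forms only; the abstract does not give the exact objective function the authors optimize, so no such objective is reconstructed here.

```latex
\|X\|_{S_p} = \Bigl(\sum_{i=1}^{\min(m,n)} \sigma_i(X)^{p}\Bigr)^{1/p},
\qquad
\|E\|_{p} = \Bigl(\sum_{i=1}^{m}\sum_{j=1}^{n} |e_{ij}|^{p}\Bigr)^{1/p}
```

With p = 1 the Schatten norm is the nuclear norm, a common convex surrogate for matrix rank; for 0 < p < 1 both expressions are quasi-norms rather than true norms.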
A P-Norm Robust Feature Extraction Method for Identifying Differentially Expressed Genes.

PubMed

Liu, Jian; Liu, Jin-Xing; Gao, Ying-Lian; Kong, Xiang-Zhen; Wang, Xue-Song; Wang, Dong

2015-01-01

In current molecular biology, it becomes more and more important to identify differentially expressed genes closely correlated with a key biological process from gene expression data. In this paper, based on the Schatten p-norm and Lp-norm, a novel p-norm robust feature extraction method is proposed to identify the differentially expressed genes. In our method, the Schatten p-norm is used as the regularization function to obtain a low-rank matrix and the Lp-norm is taken as the error function to improve the robustness to outliers in the gene expression data. The results on simulation data show that our method can obtain higher identification accuracies than the competitive methods. Numerous experiments on real gene expression data sets demonstrate that our method can identify more differentially expressed genes than the others. Moreover, we confirmed that the identified genes are closely correlated with the corresponding gene expression data.
An essential cell cycle regulation gene causes hybrid inviability in Drosophila

PubMed Central

Phadnis, Nitin; Baker, EmilyClare P.; Cooper, Jacob C.; Frizzell, Kimberly A.; Hsieh, Emily; de la Cruz, Aida Flor A.; Shendure, Jay; Kitzman, Jacob O.; Malik, Harmit S.

2015-01-01

Speciation, the process by which new biological species arise, involves the evolution of reproductive barriers such as hybrid sterility or inviability between populations. However, identifying hybrid incompatibility genes remains a key obstacle in understanding the molecular basis of reproductive isolation. We devised a genomic screen, which identified a cell cycle regulation gene as the cause of male inviability in hybrids between Drosophila melanogaster and D. simulans. Ablation of the D. simulans allele of this gene is sufficient to rescue the adult viability of hybrid males. This dominantly acting cell cycle regulator causes mitotic arrest and, thereby, inviability of male hybrid larvae. Our genomic method provides a facile means to accelerate the identification of hybrid incompatibility genes in other model and non-model systems. PMID:26680200
Phylogenomic analysis of UDP glycosyltransferase 1 multigene family in Linum usitatissimum identified genes with varied expression patterns

PubMed Central

2012-01-01

Background: The glycosylation process, catalyzed by ubiquitous glycosyltransferase (GT) family enzymes, is a prevalent modification of plant secondary metabolites that regulates various functions such as hormone homeostasis, detoxification of xenobiotics and biosynthesis and storage of secondary metabolites. Flax (Linum usitatissimum L.) is a commercially grown oilseed crop, important because of its essential fatty acids and health-promoting lignans. Identification and characterization of UDP glycosyltransferase (UGT) genes from flax could provide valuable basic information about this important gene family and help to explain the seed-specific glycosylated metabolite accumulation and other processes in plants. Plant genome sequencing projects are useful to discover complexity within this gene family and also pave the way for the development of functional genomics approaches. Results: Taking advantage of the newly assembled draft genome sequence of flax, we identified 137 UDP glycosyltransferase (UGT) genes from flax using a conserved signature motif. Phylogenetic analysis of these protein sequences clustered them into 14 major groups (A-N). Expression patterns of these genes were investigated using publicly available expressed sequence tag (EST), microarray data and reverse transcription quantitative real time PCR (RT-qPCR). Seventy-three per cent of these genes (100 out of 137) showed expression evidence in 15 tissues examined and indicated varied expression profiles. The RT-qPCR results of 10 selected genes were also coherent with the digital expression analysis. Interestingly, five duplicated UGT genes were identified, which showed differential expression in various tissues. Of the seven intron loss/gain positions detected, two intron positions were conserved among most of the UGTs, although a clear relationship about the evolution of these genes could not be established. Comparison of the flax UGTs with orthologs from four other sequenced dicot genomes indicated that

A Strategy for Identifying Quantitative Trait Genes Using Gene Expression Analysis and Causal Analysis.

PubMed

Ishikawa, Akira

2017-11-27

Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.
Selection of recombinant MVA by rescue of the essential D4R gene.

PubMed

Ricci, Patricia S; Schäfer, Birgit; Kreil, Thomas R; Falkner, Falko G; Holzer, Georg W

2011-12-12

Modified vaccinia virus Ankara (MVA) has become a promising vaccine vector due to its immunogenicity and its proven safety in humans. As a general approach for stringent and rapid selection of recombinant MVA, we assessed marker rescue of the essential viral D4R gene in an engineered deletion mutant that is fully replication defective in wild-type cells. Recombinant, replicating virus was obtained by re-introduction of the deleted viral gene as a dominant selection marker into the deletion mutant.
Gene-Trap Mutagenesis Identifies Mammalian Genes Contributing to Intoxication by Clostridium perfringens ε-Toxin

PubMed Central

Ivie, Susan E.; Fennessey, Christine M.; Sheng, Jinsong; Rubin, Donald H.; McClain, Mark S.

2011-01-01

The Clostridium perfringens ε-toxin is an extremely potent toxin associated with lethal toxemias in domesticated ruminants and may be toxic to humans. Intoxication results in fluid accumulation in various tissues, most notably in the brain and kidneys. Previous studies suggest that the toxin is a pore-forming toxin, leading to dysregulated ion homeostasis and ultimately cell death. However, mammalian host factors that likely contribute to ε-toxin-induced cytotoxicity are poorly understood. A library of insertional mutant Madin Darby canine kidney (MDCK) cells, which are highly susceptible to the lethal effects of ε-toxin, was used to select clones of cells resistant to ε-toxin-induced cytotoxicity. The genes mutated in 9 surviving resistant cell clones were identified. We focused additional experiments on one of the identified genes as a means of validating the experimental approach. Gene expression microarray analysis revealed that one of the identified genes, hepatitis A virus cellular receptor 1 (HAVCR1, KIM-1, TIM1), is more abundantly expressed in human kidney cell lines than it is expressed in human cells known to be resistant to ε-toxin. One human kidney cell line, ACHN, was found to be sensitive to the toxin and expresses a larger isoform of the HAVCR1 protein than the HAVCR1 protein expressed by other, toxin-resistant human kidney cell lines. RNA interference studies in MDCK and in ACHN cells confirmed that HAVCR1 contributes to ε-toxin-induced cytotoxicity. Additionally, ε-toxin was shown to bind to HAVCR1 in vitro. The results of this study indicate that HAVCR1 and the other genes identified through the use of gene-trap mutagenesis and RNA interference strategies represent important targets for investigation of the process by which ε-toxin induces cell death and new targets for potential therapeutic intervention. PMID:21412435
Gene-trap mutagenesis identifies mammalian genes contributing to intoxication by Clostridium perfringens ε-toxin.

PubMed

Ivie, Susan E; Fennessey, Christine M; Sheng, Jinsong; Rubin, Donald H; McClain, Mark S

2011-03-11

The Clostridium perfringens ε-toxin is an extremely potent toxin associated with lethal toxemias in domesticated ruminants and may be toxic to humans. Intoxication results in fluid accumulation in various tissues, most notably in the brain and kidneys. Previous studies suggest that the toxin is a pore-forming toxin, leading to dysregulated ion homeostasis and ultimately cell death. However, mammalian host factors that likely contribute to ε-toxin-induced cytotoxicity are poorly understood. A library of insertional mutant Madin Darby canine kidney (MDCK) cells, which are highly susceptible to the lethal effects of ε-toxin, was used to select clones of cells resistant to ε-toxin-induced cytotoxicity. The genes mutated in 9 surviving resistant cell clones were identified. We focused additional experiments on one of the identified genes as a means of validating the experimental approach. Gene expression microarray analysis revealed that one of the identified genes, hepatitis A virus cellular receptor 1 (HAVCR1, KIM-1, TIM1), is more abundantly expressed in human kidney cell lines than it is expressed in human cells known to be resistant to ε-toxin. One human kidney cell line, ACHN, was found to be sensitive to the toxin and expresses a larger isoform of the HAVCR1 protein than the HAVCR1 protein expressed by other, toxin-resistant human kidney cell lines. RNA interference studies in MDCK and in ACHN cells confirmed that HAVCR1 contributes to ε-toxin-induced cytotoxicity. Additionally, ε-toxin was shown to bind to HAVCR1 in vitro. The results of this study indicate that HAVCR1 and the other genes identified through the use of gene-trap mutagenesis and RNA interference strategies represent important targets for investigation of the process by which ε-toxin induces cell death and new targets for potential therapeutic intervention.
A Penalized Robust Method for Identifying Gene-Environment Interactions

PubMed Central

Shi, Xingjie; Liu, Jin; Huang, Jian; Zhou, Yong; Xie, Yang; Ma, Shuangge

2015-01-01

In high-throughput studies, an important objective is to identify gene-environment interactions associated with disease outcomes and phenotypes. Many commonly adopted methods assume specific parametric or semiparametric models, which may be subject to model mis-specification. In addition, they usually use significance level as the criterion for selecting important interactions. In this study, we adopt the rank-based estimation, which is much less sensitive to model specification than some of the existing methods and includes several commonly encountered data and models as special cases. Penalization is adopted for the identification of gene-environment interactions. It achieves simultaneous estimation and identification and does not rely on significance level. For computation feasibility, a smoothed rank estimation is further proposed. Simulation shows that under certain scenarios, for example with contaminated or heavy-tailed data, the proposed method can significantly outperform the existing alternatives with more accurate identification. We analyze a lung cancer prognosis study with gene expression measurements under the AFT (accelerated failure time) model. The proposed method identifies interactions different from those using the alternatives. Some of the identified genes have important implications. PMID:24616063
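As a rough illustration of rank-based estimation under the AFT model with penalization, the sketch below evaluates a Gehan-type rank loss plus an L1 penalty. Both choices are assumptions made for illustration: the abstract does not say which rank loss, smoothing or penalty the authors use, and the function names and toy data are hypothetical.

```python
def gehan_loss(beta, X, log_time, event):
    """Gehan-type rank loss for an AFT model (illustrative, unsmoothed).

    beta: coefficient list; X: list of covariate rows;
    log_time: log of observed times; event: 1 if observed, 0 if censored.
    """
    # AFT residuals e_i = log(T_i) - x_i' beta
    residuals = [
        lt - sum(b * x for b, x in zip(beta, row))
        for lt, row in zip(log_time, X)
    ]
    n = len(residuals)
    total = 0.0
    for i in range(n):
        if not event[i]:
            continue  # only uncensored subjects index the outer sum
        for j in range(n):
            # negative part of (e_i - e_j), i.e. max(0, e_j - e_i)
            total += max(0.0, residuals[j] - residuals[i])
    return total / n ** 2


def penalized_objective(beta, X, log_time, event, lam):
    """Rank loss plus an L1 penalty (a stand-in for the paper's penalty)."""
    return gehan_loss(beta, X, log_time, event) + lam * sum(abs(b) for b in beta)


# Toy data: two subjects, one covariate fixed at zero, both events observed
X = [[0.0], [0.0]]
log_time = [1.0, 2.0]
event = [1, 1]
loss_at_zero = gehan_loss([0.0], X, log_time, event)
objective = penalized_objective([2.0], X, log_time, event, lam=0.5)
```

Because the loss depends on the residuals only through their pairwise ordering and differences, it is less tied to a specific error distribution than a likelihood-based fit, which is the robustness property the abstract emphasizes.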
Predicting hepatocellular carcinoma through cross-talk genes identified by risk pathways

PubMed Central

Shao, Zhuo; Huo, Diwei; Zhang, Denan; Xie, Hongbo; Yang, Jingbo; Liu, Qiuqi; Chen, Xiujie

2018-01-01

Hepatocellular carcinoma (HCC) is the most frequent type of liver cancer, with a poor survival rate and high mortality. Despite efforts to elucidate the mechanism of HCC, new molecular markers are needed for accurate diagnosis, evaluation and treatment. Here, we combined the HCC transcriptome with networks and pathways to identify reliable molecular markers. By integrating 249 differentially expressed genes with syncretic protein interaction networks, we constructed an HCC-specific network, from which we further extracted 480 pivotal genes. Based on the cross-talk between the enriched pathways of the pivotal genes, we finally identified an HCC signature of 45 genes, which could accurately distinguish HCC patients from normal individuals and reveal the prognosis of HCC patients. Among these 45 genes, 15 showed dysregulated expression patterns and some have been reported to be associated with HCC and/or other cancers. These findings suggest that the identified 45-gene signature could provide potentially valuable molecular markers for diagnosis and evaluation of HCC. PMID:29765536
Inferring Gene Family Histories in Yeast Identifies Lineage Specific Expansions

PubMed Central

Ames, Ryan M.; Money, Daniel; Lovell, Simon C.

2014-01-01

The complement of genes found in the genome is a balance between gene gain and gene loss. Knowledge of the specific genes that are gained and lost over evolutionary time allows an understanding of the evolution of biological functions. Here we use new evolutionary models to infer gene family histories across complete yeast genomes; these models allow us to estimate the relative genome-wide rates of gene birth, death, innovation and extinction (loss of an entire family) for the first time. We show that the rates of gene family evolution vary both between gene families and between species. We are also able to identify those families that have experienced rapid lineage specific expansion/contraction and show that these families are enriched for specific functions. Moreover, we find that families with specific functions are repeatedly expanded in multiple species, suggesting the presence of common adaptations and that these family expansions/contractions are not random. Additionally, we identify potential specialisations, unique to specific species, in the functions of lineage specific expanded families. These results suggest that an important mechanism in the evolution of genome content is the presence of lineage-specific gene family changes. PMID:24921666
An essential cell cycle regulation gene causes hybrid inviability in Drosophila.

PubMed

Phadnis, Nitin; Baker, EmilyClare P; Cooper, Jacob C; Frizzell, Kimberly A; Hsieh, Emily; de la Cruz, Aida Flor A; Shendure, Jay; Kitzman, Jacob O; Malik, Harmit S

2015-12-18

Speciation, the process by which new biological species arise, involves the evolution of reproductive barriers, such as hybrid sterility or inviability between populations. However, identifying hybrid incompatibility genes remains a key obstacle in understanding the molecular basis of reproductive isolation. We devised a genomic screen, which identified a cell cycle-regulation gene as the cause of male inviability in hybrids resulting from a cross between Drosophila melanogaster and D. simulans. Ablation of the D. simulans allele of this gene is sufficient to rescue the adult viability of hybrid males. This dominantly acting cell cycle regulator causes mitotic arrest and, thereby, inviability of male hybrid larvae. Our genomic method provides a facile means to accelerate the identification of hybrid incompatibility genes in other model and nonmodel systems. Copyright © 2015, American Association for the Advancement of Science.
30 CFR 285.803 - How must I conduct my approved activities to protect essential fish habitats identified and...

Code of Federal Regulations, 2010 CFR

2010-07-01

... protect essential fish habitats identified and described under the Magnuson-Stevens Fishery Conservation... fish habitats identified and described under the Magnuson-Stevens Fishery Conservation and Management Act? (a) If, during the conduct of your approved activities, MMS finds that essential fish habitat or...
Functional analysis of UMOD gene and its effect on inflammatory cytokines in serum of essential hypertension patients.

PubMed

Jian, Liguo; Fa, Xian'en; Zhou, Zheng; Liu, Shichao

2015-01-01

The study aimed to investigate the function of the uromodulin (UMOD) gene and its effect on inflammatory cytokines in the serum of essential hypertension patients. Online databases and software were used for bioinformatics analysis of the UMOD gene and of the structure and function of its encoded protein. In addition, radioimmunoassay and enzyme-linked immunosorbent assay were used to measure urinary UMOD protein and serum inflammatory cytokines in essential hypertension patients. UMOD is an alkaline, hydrophilic protein with no transmembrane region but with a signal peptide sequence. It is located mainly extracellularly and is a secreted protein whose secondary structure consists mainly of random coil (58.44%). Function prediction indicated that the UMOD protein has a stress-response function and may participate in the inflammatory reaction. In an experiment designed on the basis of the correlation between inflammation and essential hypertension, urinary UMOD protein in stage I essential hypertension patients was (28.71 ± 10.53) mg/24 h, and the difference from the control group (30.15 ± 14.10 mg/24 h) was not significant. Urinary UMOD protein in stage II and stage III patients was (18.24 ± 6.12) mg/24 h and (9.43 ± 3.16) mg/24 h, respectively, both significantly lower than in the control group (P<0.01). Additionally, serum levels of inflammatory cytokines such as TNF-α, IL-6 and IL1-α in essential hypertension patients were all markedly higher than in the control group (P<0.05). In essential hypertension patients, the expression level of the UMOD gene is closely related to inflammatory cytokines, with a negative correlation between the gene's expression level and inflammatory cytokine levels. That has
Analysis of genomic aberrations and gene expression profiling identifies novel lesions and pathways in myeloproliferative neoplasms

PubMed Central

Rice, K L; Lin, X; Wolniak, K; Ebert, B L; Berkofsky-Fessler, W; Buzzai, M; Sun, Y; Xi, C; Elkin, P; Levine, R; Golub, T; Gilliland, D G; Crispino, J D; Licht, J D; Zhang, W

2011-01-01

Polycythemia vera (PV), essential thrombocythemia and primary myelofibrosis are myeloproliferative neoplasms (MPNs) with distinct clinical features and are associated with the JAK2V617F mutation. To identify genomic anomalies involved in the pathogenesis of these disorders, we profiled 87 MPN patients using Affymetrix 250K single-nucleotide polymorphism (SNP) arrays. Aberrations affecting chr9 were the most frequently observed and included 9pLOH (n=16), trisomy 9 (n=6) and amplifications of 9p13.3–23.3 (n=1), 9q33.1–34.13 (n=1) and 9q34.13 (n=6). Patients with trisomy 9 were associated with elevated JAK2V617F mutant allele burden, suggesting that gain of chr9 represents an alternative mechanism for increasing JAK2V617F dosage. Gene expression profiling of patients with and without chr9 abnormalities (+9, 9pLOH) identified genes potentially involved in disease pathogenesis including JAK2, STAT5B and MAPK14. We also observed recurrent gains of 1p36.31–36.33 (n=6), 17q21.2–q21.31 (n=5) and 17q25.1–25.3 (n=5) and deletions affecting 18p11.31–11.32 (n=8). Combined SNP and gene expression analysis identified aberrations affecting components of a non-canonical PRC2 complex (EZH1, SUZ12 and JARID2) and genes comprising a ‘HSC signature’ (MLLT3, SMARCA2 and PBX1). We show that NFIB, which is amplified in 7/87 MPN patients and upregulated in PV CD34+ cells, protects cells from apoptosis induced by cytokine withdrawal. PMID:22829077
Acquired RhD mosaicism identifies fibrotic transformation of thrombopoietin receptor-mutated essential thrombocythemia.

PubMed

Montemayor-Garcia, Celina; Coward, Rebecca; Albitar, Maher; Udani, Rupa; Jain, Prachi; Koklanaris, Eleftheria; Battiwalla, Minoo; Keel, Siobán; Klein, Harvey G; Barrett, A John; Ito, Sawa

2017-09-01

Acquired copy-neutral loss of heterozygosity has been described in myeloid malignant progression with an otherwise normal karyotype. A 65-year-old woman with MPL-mutated essential thrombocythemia and progression to myelofibrosis was noted upon routine pretransplant testing to have mixed field reactivity with anti-D and an historic discrepancy in RhD type. The patient had never received transfusions or transplantation. Gel immunoagglutination revealed group A red blood cells and a mixed-field reaction for the D phenotype, with a predominant D-negative population and a small subset of circulating red blood cells carrying the D antigen. Subsequent genomic microarray single nucleotide polymorphism profiling revealed copy-neutral loss of heterozygosity of chromosome 1 p36.33-p34.2, a known molecular mechanism underlying fibrotic progression of MPL-mutated essential thrombocythemia. The chromosomal region affected by this copy-neutral loss of heterozygosity encompassed the RHD, RHCE, and MPL genes. We propose a model of chronological molecular events that is supported by RHD zygosity assays in peripheral lymphoid and myeloid-derived cells. Copy-neutral loss of heterozygosity events that lead to clonal selection and myeloid malignant progression may also affect the expression of adjacent unrelated genes, including those encoding for blood group antigens. Detection of mixed-field reactions and investigation of discrepant blood typing results are important for proper transfusion support of these patients and can provide useful surrogate markers of myeloproliferative disease progression. © 2017 AABB.
The genetics of alcoholism: identifying specific genes through family studies.

PubMed

Edenberg, Howard J; Foroud, Tatiana

2006-09-01

Alcoholism is a complex disorder with both genetic and environmental risk factors. Studies in humans have begun to elucidate the genetic underpinnings of the risk for alcoholism. Here we briefly review strategies for identifying individual genes in which variations affect the risk for alcoholism and related phenotypes, in the context of one large study that has successfully identified such genes. The Collaborative Study on the Genetics of Alcoholism (COGA) is a family-based study that has collected detailed phenotypic data on individuals in families with multiple alcoholic members. A genome-wide linkage approach led to the identification of chromosomal regions containing genes that influenced alcoholism risk and related phenotypes. Subsequently, single nucleotide polymorphisms (SNPs) were genotyped in positional candidate genes located within the linked chromosomal regions, and analyzed for association with these phenotypes. Using this sequential approach, COGA has detected association with GABRA2, CHRM2 and ADH4; these associations have all been replicated by other researchers. COGA has detected association to additional genes including GABRG3, TAS2R16, SNCA, OPRK1 and PDYN, results that are awaiting confirmation. These successes demonstrate that genes contributing to the risk for alcoholism can be reliably identified using human subjects.
Gene expression patterns combined with bioinformatics analysis identify genes associated with cholangiocarcinoma.

PubMed

Li, Chen; Shen, Weixing; Shen, Sheng; Ai, Zhilong

2013-12-01

To explore the molecular mechanisms of cholangiocarcinoma (CC), microarray technology was used to find biomarkers for early detection and diagnosis. The gene expression profiles from 6 patients with CC and 5 normal controls were downloaded from Gene Expression Omnibus and compared. As a result, 204 differentially co-expressed genes (DCGs) in CC patients compared to normal controls were identified using a computational bioinformatics analysis. These genes were mainly involved in coenzyme metabolic process, peptidase activity and oxidation reduction. A regulatory network was constructed by mapping the DCGs to known regulation data. Four transcription factors, FOXC1, ZIC2, NKX2-2 and GCGR, were hub nodes in the network. In conclusion, this study provides a set of targets useful for future investigations into molecular biomarker studies. Copyright © 2013 Elsevier Ltd. All rights reserved.
Comparative phylogenetic analysis and transcriptional profiling of MADS-box gene family identified DAM and FLC-like genes in apple (Malus × domestica)

PubMed Central

Kumar, Gulshan; Arya, Preeti; Gupta, Khushboo; Randhawa, Vinay; Acharya, Vishal; Singh, Anil Kumar

2016-01-01

The MADS-box transcription factors play essential roles in various processes of plant growth and development. In the present study, phylogenetic analysis of 142 apple MADS-box proteins with that of other dicotyledonous species identified six putative Dormancy-Associated MADS-box (DAM) and four putative Flowering Locus C-like (FLC-like) proteins. In order to study the expression of apple MADS-box genes, RNA-seq analysis of 3 apical and 5 spur bud stages during dormancy, 6 flower stages and 7 fruit development stages was performed. The dramatic reduction in expression of two MdDAMs, MdMADS063 and MdMADS125 and two MdFLC-like genes, MdMADS135 and MdMADS136 during dormancy release suggests their role as flowering-repressors in apple. Apple orthologs of Arabidopsis genes, FLOWERING LOCUS T, FRIGIDA, SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1 and LEAFY exhibit similar expression patterns as reported in Arabidopsis, suggesting functional conservation in floral signal integration and meristem determination pathways. Gene ontology enrichment analysis of predicted targets of DAM revealed their involvement in regulation of reproductive processes and meristematic activities, indicating functional conservation of SVP orthologs (DAM) in apple. This study provides valuable insights into the functions of MADS-box proteins during apple phenology, which may help in devising strategies to improve important traits in apple. PMID:26856238
Comparative phylogenetic analysis and transcriptional profiling of MADS-box gene family identified DAM and FLC-like genes in apple (Malus × domestica).

PubMed

Kumar, Gulshan; Arya, Preeti; Gupta, Khushboo; Randhawa, Vinay; Acharya, Vishal; Singh, Anil Kumar

2016-02-09

The MADS-box transcription factors play essential roles in various processes of plant growth and development. In the present study, phylogenetic analysis of 142 apple MADS-box proteins with that of other dicotyledonous species identified six putative Dormancy-Associated MADS-box (DAM) and four putative Flowering Locus C-like (FLC-like) proteins. In order to study the expression of apple MADS-box genes, RNA-seq analysis of 3 apical and 5 spur bud stages during dormancy, 6 flower stages and 7 fruit development stages was performed. The dramatic reduction in expression of two MdDAMs, MdMADS063 and MdMADS125 and two MdFLC-like genes, MdMADS135 and MdMADS136 during dormancy release suggests their role as flowering-repressors in apple. Apple orthologs of Arabidopsis genes, FLOWERING LOCUS T, FRIGIDA, SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1 and LEAFY exhibit similar expression patterns as reported in Arabidopsis, suggesting functional conservation in floral signal integration and meristem determination pathways. Gene ontology enrichment analysis of predicted targets of DAM revealed their involvement in regulation of reproductive processes and meristematic activities, indicating functional conservation of SVP orthologs (DAM) in apple. This study provides valuable insights into the functions of MADS-box proteins during apple phenology, which may help in devising strategies to improve important traits in apple.
Negative selection in tumor genome evolution acts on essential cellular functions and the immunopeptidome.

PubMed

Zapata, Luis; Pich, Oriol; Serrano, Luis; Kondrashov, Fyodor A; Ossowski, Stephan; Schaefer, Martin H

2018-05-31

Natural selection shapes cancer genomes. Previous studies used signatures of positive selection to identify genes driving malignant transformation. However, the contribution of negative selection against somatic mutations that affect essential tumor functions or specific domains remains a controversial topic. Here, we analyze 7546 individual exomes from 26 tumor types from TCGA data to explore the portion of the cancer exome under negative selection. Although we find most of the genes neutrally evolving in a pan-cancer framework, we identify essential cancer genes and immune-exposed protein regions under significant negative selection. Moreover, our simulations suggest that the amount of negative selection is underestimated. We therefore choose an empirical approach to identify genes, functions, and protein regions under negative selection. We find that expression and mutation status of negatively selected genes is indicative of patient survival. Processes that are most strongly conserved are those that play fundamental cellular roles such as protein synthesis, glucose metabolism, and molecular transport. Intriguingly, we observe strong signals of selection in the immunopeptidome and proteins controlling peptide exposition, highlighting the importance of immune surveillance evasion. Additionally, tumor type-specific immune activity correlates with the strength of negative selection on human epitopes. In summary, our results show that negative selection is a hallmark of cell essentiality and immune response in cancer. The functional domains identified could be exploited therapeutically, ultimately allowing for the development of novel cancer treatments.
A Systems Biology Approach To Identify the Combination Effects of Human Herpesvirus 8 Genes on NF-κB Activation

PubMed Central

Konrad, Andreas; Wies, Effi; Thurau, Mathias; Marquardt, Gaby; Naschberger, Elisabeth; Hentschel, Sonja; Jochmann, Ramona; Schulz, Thomas F.; Erfle, Holger; Brors, Benedikt; Lausen, Berthold; Neipel, Frank; Stürzl, Michael

2009-01-01

Human herpesvirus 8 (HHV-8) is the etiologic agent of Kaposi's sarcoma and primary effusion lymphoma. Activation of the cellular transcription factor nuclear factor-kappa B (NF-κB) is essential for latent persistence of HHV-8, survival of HHV-8-infected cells, and disease progression. We used reverse-transfected cell microarrays (RTCM) as an unbiased systems biology approach to systematically analyze the effects of HHV-8 genes on the NF-κB signaling pathway. All HHV-8 genes individually (n = 86) and, additionally, all K and latent genes in pairwise combinations (n = 231) were investigated. Statistical analyses of more than 14,000 transfections identified ORF75 as a novel and confirmed K13 as a known HHV-8 activator of NF-κB. K13 and ORF75 showed cooperative NF-κB activation. Small interfering RNA-mediated knockdown of ORF75 expression demonstrated that this gene contributes significantly to NF-κB activation in HHV-8-infected cells. Furthermore, our approach confirmed K10.5 as an NF-κB inhibitor and newly identified K1 as an inhibitor of both K13- and ORF75-mediated NF-κB activation. All results obtained with RTCM were confirmed with classical transfection experiments. Our work describes the first successful application of RTCM for the systematic analysis of pathofunctions of genes of an infectious agent. With this approach, ORF75 and K1 were identified as novel HHV-8 regulatory molecules on the NF-κB signal transduction pathway. The genes identified may be involved in fine-tuning of the balance between latency and lytic replication, since this depends critically on the state of NF-κB activity. PMID:19129458
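The pairwise count reported in this abstract can be sanity-checked with simple combinatorics: 231 is exactly the number of unordered pairs that can be formed from 22 items. The abstract does not state how many K and latent genes were combined, so the figure of 22 below is an inference from the arithmetic, and the gene labels are hypothetical placeholders.

```python
from itertools import combinations
from math import comb


def genes_for_pair_count(n_pairs, max_genes=1000):
    """Return n such that C(n, 2) == n_pairs, or None if no such n exists."""
    for n in range(2, max_genes + 1):
        if comb(n, 2) == n_pairs:
            return n
    return None


# 231 pairwise combinations correspond to 22 genes taken two at a time
inferred_gene_count = genes_for_pair_count(231)

# Enumerate the pairs explicitly using placeholder labels (not real gene names)
genes = [f"gene_{i}" for i in range(inferred_gene_count)]
pairs = list(combinations(genes, 2))

# Single-gene conditions (n = 86) plus pairwise conditions (n = 231)
total_conditions = 86 + len(pairs)
```

Under this reading, the screen covers 317 distinct single or pairwise expression conditions before replicates and controls are counted.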
Integrated in silico analyses of regulatory and metabolic networks of Synechococcus sp. PCC 7002 reveal relationships between gene centrality and essentiality

DOE PAGES

Song, Hyun-Seob; McClure, Ryan S.; Bernstein, Hans C.; ...

2015-03-27

Cyanobacteria dynamically relay environmental inputs to intracellular adaptations through a coordinated adjustment of photosynthetic efficiency and carbon processing rates. The output of such adaptations is reflected through changes in transcriptional patterns and metabolic flux distributions that ultimately define growth strategy. To address interrelationships between metabolism and regulation, we performed integrative analyses of metabolic and gene co-expression networks in a model cyanobacterium, Synechococcus sp. PCC 7002. Centrality analyses using the gene co-expression network identified a set of key genes, which were defined here as ‘topologically important.’ Parallel in silico gene knock-out simulations, using the genome-scale metabolic network, classified what we termed as ‘functionally important’ genes, deletion of which affected growth or metabolism. A strong positive correlation was observed between topologically and functionally important genes. Functionally important genes exhibited variable levels of topological centrality; however, the majority of topologically central genes were found to be functionally essential for growth. Subsequent functional enrichment analysis revealed that both functionally and topologically important genes in Synechococcus sp. PCC 7002 are predominantly associated with translation and energy metabolism, two cellular processes critical for growth. This research demonstrates how synergistic network-level analyses can be used for reconciliation of metabolic and gene expression data to uncover fundamental biological principles.
Integrated in silico analyses of regulatory and metabolic networks of Synechococcus sp. PCC 7002 reveal relationships between gene centrality and essentiality

DOE Office of Scientific and Technical Information (OSTI.GOV)

Song, Hyun-Seob; McClure, Ryan S.; Bernstein, Hans C.

Cyanobacteria dynamically relay environmental inputs to intracellular adaptations through a coordinated adjustment of photosynthetic efficiency and carbon processing rates. The output of such adaptations is reflected through changes in transcriptional patterns and metabolic flux distributions that ultimately define growth strategy. To address interrelationships between metabolism and regulation, we performed integrative analyses of metabolic and gene co-expression networks in a model cyanobacterium, Synechococcus sp. PCC 7002. Centrality analyses using the gene co-expression network identified a set of key genes, which were defined here as ‘topologically important.’ Parallel in silico gene knock-out simulations, using the genome-scale metabolic network, classified what we termed as ‘functionally important’ genes, deletion of which affected growth or metabolism. A strong positive correlation was observed between topologically and functionally important genes. Functionally important genes exhibited variable levels of topological centrality; however, the majority of topologically central genes were found to be functionally essential for growth. Subsequent functional enrichment analysis revealed that both functionally and topologically important genes in Synechococcus sp. PCC 7002 are predominantly associated with translation and energy metabolism, two cellular processes critical for growth. This research demonstrates how synergistic network-level analyses can be used for reconciliation of metabolic and gene expression data to uncover fundamental biological principles.

Association of ACE, FABP2 and GST genes polymorphism with essential hypertension risk among a North Indian population.

PubMed

Abbas, Shania; Raza, Syed Tasleem; Chandra, Anu; Rizvi, Saliha; Ahmed, Faisal; Eba, Ale; Mahdi, Farzana

2015-01-01

Hypertension has a multi-factorial background based on genetic and environmental interactive factors. ACE, FABP2 and GST genes have been suggested to be involved in the development of hypertension. However, the results have been inconsistent. The present study was carried out to investigate the association of ACE (rs4646994), FABP2 (rs1799883) and GST (GSTM1 null or positive genotype and GSTT1 null or positive genotype) gene polymorphisms with essential hypertension (HTN) in cases and controls. This study includes 138 essential HTN patients and 116 age-, sex- and ethnicity-matched control subjects. GST (GSTM1 null or positive genotype and GSTT1 null or positive genotype) gene polymorphisms were evaluated by multiplex PCR, ACE (rs4646994) gene polymorphisms by PCR and FABP2 (rs1799883) gene polymorphisms by PCR-RFLP method. Significant differences were obtained in the frequencies of ACE DD, II genotype (p = 0.006, 0.003), GSTT1 null, GSTM1 positive genotype (p = 0.048, 0.010) and FABP2 Ala54/Ala54 genotype (p = 0.049) between essential HTN cases and controls. It is concluded that ACE (rs4646994), FABP2 (rs1799883) and GST (GSTM1 null or positive genotype and GSTT1 null or positive genotype) gene polymorphisms are associated with HTN. Further investigation with a larger sample size may be required to validate this study.
An Integrative Genetics Approach to Identify Candidate Genes Regulating BMD: Combining Linkage, Gene Expression, and Association

PubMed Central

Farber, Charles R; van Nas, Atila; Ghazalpour, Anatole; Aten, Jason E; Doss, Sudheer; Sos, Brandon; Schadt, Eric E; Ingram-Drake, Leslie; Davis, Richard C; Horvath, Steve; Smith, Desmond J; Drake, Thomas A; Lusis, Aldons J

2009-01-01

Numerous quantitative trait loci (QTLs) affecting bone traits have been identified in the mouse; however, few of the underlying genes have been discovered. To improve the process of transitioning from QTL to gene, we describe an integrative genetics approach, which combines linkage analysis, expression QTL (eQTL) mapping, causality modeling, and genetic association in outbred mice. In C57BL/6J × C3H/HeJ (BXH) F2 mice, nine QTLs regulating femoral BMD were identified. To select candidate genes from within each QTL region, microarray gene expression profiles from individual F2 mice were used to identify 148 genes whose expression was correlated with BMD and regulated by local eQTLs. Many of the genes that were the most highly correlated with BMD have been previously shown to modulate bone mass or skeletal development. Candidates were further prioritized by determining whether their expression was predicted to underlie variation in BMD. Using network edge orienting (NEO), a causality modeling algorithm, 18 of the 148 candidates were predicted to be causally related to differences in BMD. To fine-map QTLs, markers in outbred MF1 mice were tested for association with BMD. Three chromosome 11 SNPs were identified that were associated with BMD within the Bmd11 QTL. Finally, our approach provides strong support for Wnt9a, Rasd1, or both underlying Bmd11. Integration of multiple genetic and genomic data sets can substantially improve the efficiency of QTL fine-mapping and candidate gene identification. PMID:18767929
The clp1 gene of the mushroom Coprinus cinereus is essential for A-regulated sexual development.

PubMed Central

Inada, K; Morimoto, Y; Arima, T; Murata, Y; Kamada, T

2001-01-01

Sexual development in the mushroom Coprinus cinereus is under the control of the A and B mating-type loci, both of which must be different for a compatible, dikaryotic mycelium to form between two parents. The A genes, encoding proteins with homeodomain motifs, regulate conjugate division of the two nuclei from each mating partner and promote the formation of clamp connections. The latter are hyphal configurations required for the maintenance of the nuclear status in the dikaryotic phase of basidiomycetes. The B genes encode pheromones and pheromone receptors. They regulate the cellular fusions that complete clamp connections during growth, as well as the nuclear migration required for dikaryosis. The AmutBmut strain (326) of C. cinereus, in which both A- and B-regulated pathways are constitutively activated by mutations, produces, without mating, dikaryon-like, fertile hyphae with clamp connections. In this study we isolated and characterized clampless1-1 (clp1-1), a mutation that blocks clamp formation, an essential step in A-regulated sexual development, in the AmutBmut background. A genomic DNA fragment that rescues the clp1-1 mutation was identified by transformations. Sequencing of the genomic DNA, together with RACE experiments, identified an ORF interrupted by one intron, encoding a novel protein of 365 amino acids. The clp1-1 mutant allele carries a deletion of four nucleotides, which is predicted to cause elimination of codon 128 and frameshifts thereafter. The clp1 transcript was normally detected only in the presence of the A protein heterodimer formed when homokaryons with compatible A genes were mated. Forced expression of clp1 by promoter replacements induced clamp development without the need for a compatible A gene combination. These results indicate that expression of clp1 is necessary and sufficient for induction of the A-regulated pathway that leads to clamp development. PMID:11139497
Exome sequencing of Pakistani consanguineous families identifies 30 novel candidate genes for recessive intellectual disability

PubMed Central

Riazuddin, S; Hussain, M; Razzaq, A; Iqbal, Z; Shahzad, M; Polla, D L; Song, Y; van Beusekom, E; Khan, A A; Tomas-Roca, L; Rashid, M; Zahoor, M Y; Wissink-Lindhout, W M; Basra, M A R; Ansar, M; Agha, Z; van Heeswijk, K; Rasheed, F; Van de Vorst, M; Veltman, J A; Gilissen, C; Akram, J; Kleefstra, T; Assir, M Z; Grozeva, D; Carss, K; Raymond, F L; O'Connor, T D; Riazuddin, S A; Khan, S N; Ahmed, Z M; de Brouwer, A P M; van Bokhoven, H; Riazuddin, S

2017-01-01

Intellectual disability (ID) is a clinically and genetically heterogeneous disorder, affecting 1–3% of the general population. Although research into the genetic causes of ID has recently gained momentum, identification of pathogenic mutations that cause autosomal recessive ID (ARID) has lagged behind, predominantly due to non-availability of sizeable families. Here we present the results of exome sequencing in 121 large consanguineous Pakistani ID families. In 60 families, we identified homozygous or compound heterozygous DNA variants in a single gene, 30 affecting reported ID genes and 30 affecting novel candidate ID genes. Potential pathogenicity of these alleles was supported by co-segregation with the phenotype, low frequency in control populations and the application of stringent bioinformatics analyses. In another eight families segregation of multiple pathogenic variants was observed, affecting 19 genes that were either known or are novel candidates for ID. Transcriptome profiles of normal human brain tissues showed that the novel candidate ID genes formed a network significantly enriched for transcriptional co-expression (P<0.0001) in the frontal cortex during fetal development and in the temporal–parietal and sub-cortex during infancy through adulthood. In addition, proteins encoded by 12 novel ID genes directly interact with previously reported ID proteins in six known pathways essential for cognitive function (P<0.0001). These results suggest that disruptions of temporal parietal and sub-cortical neurogenesis during infancy are critical to the pathophysiology of ID. These findings further expand the existing repertoire of genes involved in ARID, and provide new insights into the molecular mechanisms and the transcriptome map of ID. PMID:27457812
Exome sequencing of Pakistani consanguineous families identifies 30 novel candidate genes for recessive intellectual disability.

PubMed

Riazuddin, S; Hussain, M; Razzaq, A; Iqbal, Z; Shahzad, M; Polla, D L; Song, Y; van Beusekom, E; Khan, A A; Tomas-Roca, L; Rashid, M; Zahoor, M Y; Wissink-Lindhout, W M; Basra, M A R; Ansar, M; Agha, Z; van Heeswijk, K; Rasheed, F; Van de Vorst, M; Veltman, J A; Gilissen, C; Akram, J; Kleefstra, T; Assir, M Z; Grozeva, D; Carss, K; Raymond, F L; O'Connor, T D; Riazuddin, S A; Khan, S N; Ahmed, Z M; de Brouwer, A P M; van Bokhoven, H; Riazuddin, S

2017-11-01

Intellectual disability (ID) is a clinically and genetically heterogeneous disorder, affecting 1-3% of the general population. Although research into the genetic causes of ID has recently gained momentum, identification of pathogenic mutations that cause autosomal recessive ID (ARID) has lagged behind, predominantly due to non-availability of sizeable families. Here we present the results of exome sequencing in 121 large consanguineous Pakistani ID families. In 60 families, we identified homozygous or compound heterozygous DNA variants in a single gene, 30 affecting reported ID genes and 30 affecting novel candidate ID genes. Potential pathogenicity of these alleles was supported by co-segregation with the phenotype, low frequency in control populations and the application of stringent bioinformatics analyses. In another eight families segregation of multiple pathogenic variants was observed, affecting 19 genes that were either known or are novel candidates for ID. Transcriptome profiles of normal human brain tissues showed that the novel candidate ID genes formed a network significantly enriched for transcriptional co-expression (P<0.0001) in the frontal cortex during fetal development and in the temporal-parietal and sub-cortex during infancy through adulthood. In addition, proteins encoded by 12 novel ID genes directly interact with previously reported ID proteins in six known pathways essential for cognitive function (P<0.0001). These results suggest that disruptions of temporal parietal and sub-cortical neurogenesis during infancy are critical to the pathophysiology of ID. These findings further expand the existing repertoire of genes involved in ARID, and provide new insights into the molecular mechanisms and the transcriptome map of ID.
Identifying conserved gene clusters in the presence of homology families.

PubMed

He, Xin; Goldwasser, Michael H

2005-01-01

The study of conserved gene clusters is important for understanding the forces behind genome organization and evolution, as well as the function of individual genes or gene groups. In this paper, we present a new model and algorithm for identifying conserved gene clusters from pairwise genome comparison. This generalizes a recent model called "gene teams." A gene team is a set of genes that appear homologously in two or more species, possibly in a different order yet with the distance of adjacent genes in the team for each chromosome always no more than a certain threshold. We remove the constraint in the original model that each gene must have a unique occurrence in each chromosome and thus allow the analysis on complex prokaryotic or eukaryotic genomes with extensive paralogs. Our algorithm analyzes a pair of chromosomes in O(mn) time and uses O(m+n) space, where m and n are the number of genes in the respective chromosomes. We demonstrate the utility of our methods by studying two bacterial genomes, E. coli K-12 and B. subtilis. Many of the teams identified by our algorithm correlate with documented E. coli operons, while several others match predicted operons, previously suggested by computational techniques. Our implementation and data are publicly available at euler.slu.edu/~goldwasser/homologyteams/.
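
To make the "gene team" definition in the abstract above concrete, here is a minimal sketch of the simpler original model only, in which each gene occurs exactly once per chromosome. It is not the authors' O(mn) algorithm for homology families; it is a naive split-and-recheck procedure, and the function names, input format (gene-to-position dictionaries), and example coordinates are all assumptions made for illustration.

```python
def split_by_gap(genes, pos, delta):
    """Split genes into runs whose adjacent positions differ by at most delta."""
    ordered = sorted(genes, key=pos.__getitem__)
    runs, current = [], [ordered[0]]
    for gene in ordered[1:]:
        if pos[gene] - pos[current[-1]] > delta:
            runs.append(current)   # gap too large: close the current run
            current = [gene]
        else:
            current.append(gene)
    runs.append(current)
    return runs


def gene_teams(pos_a, pos_b, delta):
    """Return maximal gene sets that are delta-connected on both chromosomes.

    pos_a and pos_b map gene name -> position; each gene is assumed to
    occur once per chromosome (the original, paralog-free model).
    """
    common = set(pos_a) & set(pos_b)
    if not common:
        return []
    teams, stack = [], [common]
    while stack:
        genes = stack.pop()
        # Try to split on chromosome A first, then on chromosome B.
        for pos in (pos_a, pos_b):
            runs = split_by_gap(genes, pos, delta)
            if len(runs) > 1:
                stack.extend(set(run) for run in runs)
                break
        else:
            # No split on either chromosome: this set is a team.
            teams.append(frozenset(genes))
    return teams


# Hypothetical coordinates, chosen only to exercise the function.
chrom_a = {"a": 1, "b": 2, "c": 3, "d": 10, "e": 11}
chrom_b = {"a": 5, "b": 20, "c": 6, "d": 1, "e": 2}
teams = gene_teams(chrom_a, chrom_b, delta=2)
print(sorted(sorted(team) for team in teams))
```

In this toy input, gene b is adjacent to a and c on the first chromosome but far from them on the second, so it is separated into its own singleton team while a and c remain together.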
Genes of the N-Methylglutamate Pathway Are Essential for Growth of Methylobacterium extorquens DM4 with Monomethylamine

PubMed Central

Gruffaz, Christelle; Muller, Emilie E. L.; Louhichi-Jelail, Yousra; Nelli, Yella R.; Guichard, Gilles

2014-01-01

Monomethylamine (MMA, CH3NH2) can be used as a carbon and nitrogen source by many methylotrophic bacteria. Methylobacterium extorquens DM4 lacks the MMA dehydrogenase encoded by mau genes, which in M. extorquens AM1 is essential for growth on MMA. Identification and characterization of minitransposon mutants with an MMA-dependent phenotype showed that strain DM4 grows with MMA as the sole source of carbon, energy, and nitrogen by the N-methylglutamate (NMG) pathway. Independent mutations were found in a chromosomal region containing the genes gmaS, mgsABC, and mgdABCD for the three enzymes of the pathway, γ-glutamylmethylamide (GMA) synthetase, NMG synthase, and NMG dehydrogenase, respectively. Reverse transcription-PCR confirmed the operonic structure of the two divergent gene clusters mgsABC-gmaS and mgdABCD and their induction during growth with MMA. The genes mgdABCD and mgsABC were found to be essential for utilization of MMA as a carbon and nitrogen source. The gene gmaS was essential for MMA utilization as a carbon source, but residual growth of mutant DM4gmaS growing with succinate and MMA as a nitrogen source was observed. Plasmid copies of gmaS and the gmaS homolog METDI4690, which encodes a protein 39% identical to GMA synthetase, fully restored the ability of mutants DM4gmaS and DM4gmaSΔmetdi4690 to use MMA as a carbon and nitrogen source. Similarly, chemically synthesized GMA, the product of GMA synthetase, could be used as a nitrogen source for growth in the wild-type strain, as well as in DM4gmaS and DM4gmaSΔmetdi4690 mutants. The NADH:ubiquinone oxidoreductase respiratory complex component NuoG was also found to be essential for growth with MMA as a carbon source. PMID:24682302
Gene Signature in Sessile Serrated Polyps Identifies Colon Cancer Subtype

PubMed Central

Kanth, Priyanka; Bronner, Mary P.; Boucher, Kenneth M.; Burt, Randall W.; Neklason, Deborah W.; Hagedorn, Curt H.; Delker, Don A.

2016-01-01

Sessile serrated colon adenoma/polyps (SSA/Ps) are found during routine screening colonoscopy and may account for 20–30% of colon cancers. However, differentiating SSA/Ps from hyperplastic polyps (HP) with little risk of cancer is challenging and complementary molecular markers are needed. Additionally, the molecular mechanisms of colon cancer development from SSA/Ps are poorly understood. RNA sequencing was performed on 21 SSA/Ps, 10 HPs, 10 adenomas, 21 uninvolved colon and 20 control colon specimens. Differential expression and leave-one-out cross validation methods were used to define a unique gene signature of SSA/Ps. Our SSA/P gene signature was evaluated in colon cancer RNA-Seq data from The Cancer Genome Atlas (TCGA) to identify a subtype of colon cancers that may develop from SSA/Ps. A total of 1422 differentially expressed genes were found in SSA/Ps relative to controls. Serrated polyposis syndrome (n=12) and sporadic SSA/Ps (n=9) exhibited almost complete (96%) gene overlap. A 51-gene panel in SSA/P showed similar expression in a subset of TCGA colon cancers with high microsatellite instability (MSI-H). A smaller seven-gene panel showed high sensitivity and specificity in identifying BRAF mutant, CpG island methylator phenotype high (CIMP-H) and MLH1 silenced colon cancers. We describe a unique gene signature in SSA/Ps that identifies a subset of colon cancers likely to develop through the serrated pathway. These gene panels may be utilized for improved differentiation of SSA/Ps from HPs and provide insights into novel molecular pathways altered in colon cancer arising from the serrated pathway. PMID:27026680
Genome-Scale Approaches to Identify Genes Essential for Haemophilus influenzae Pathogenesis

PubMed Central

Wong, Sandy M. S.; Akerley, Brian J.

2012-01-01

Haemophilus influenzae is a Gram-negative bacterium that has no identified natural niche outside of the human host. It primarily colonizes the nasopharyngeal mucosa in an asymptomatic mode, but has the ability to disseminate to other anatomical sites to cause otitis media, upper, and lower respiratory tract infections, septicemia, and meningitis. To persist in diverse environments the bacterium must exploit and utilize the nutrients and other resources available in these sites for optimal growth/survival. Recent evidence suggests that regulatory factors that direct such adaptations also control virulence determinants required to resist and evade immune clearance mechanisms. In this review, we describe the recent application of whole-genome approaches that together provide insight into distinct survival mechanisms of H. influenzae in the context of different sites of pathogenesis. PMID:22919615
Genome-scale approaches to identify genes essential for Haemophilus influenzae pathogenesis.

PubMed

Wong, Sandy M S; Akerley, Brian J

2012-01-01

Haemophilus influenzae is a Gram-negative bacterium that has no identified natural niche outside of the human host. It primarily colonizes the nasopharyngeal mucosa in an asymptomatic mode, but has the ability to disseminate to other anatomical sites to cause otitis media, upper, and lower respiratory tract infections, septicemia, and meningitis. To persist in diverse environments the bacterium must exploit and utilize the nutrients and other resources available in these sites for optimal growth/survival. Recent evidence suggests that regulatory factors that direct such adaptations also control virulence determinants required to resist and evade immune clearance mechanisms. In this review, we describe the recent application of whole-genome approaches that together provide insight into distinct survival mechanisms of H. influenzae in the context of different sites of pathogenesis.
RNA-seq methods for identifying differentially expressed gene in human pancreatic islet cells treated with pro-inflammatory cytokines.

PubMed

Li, Bo; Bi, Chang Long; Lang, Ning; Li, Yu Ze; Xu, Chao; Zhang, Ying Qi; Zhai, Ai Xia; Cheng, Zhi Feng

2014-01-01

Type 1 diabetes is a chronic autoimmune disease in which pancreatic beta cells are killed by the infiltrating immune cells as well as the cytokines released by these cells. Many studies indicate that inflammatory mediators have an essential role in this disease. In the present study, we profiled the transcriptome in human islets of Langerhans under control conditions or following exposure to pro-inflammatory cytokines, based on an RNA sequencing dataset downloaded from the SRA database. After low-quality reads were filtered out, the RNA reads were aligned to the human genome hg19 by TopHat and then assembled by Cufflinks. The expression value of each transcript was calculated and differentially expressed genes were then screened out. Finally, a total of 63 differentially expressed genes were identified, including 60 up-regulated and three down-regulated genes. GBP5 and CXCL9 stood out as the top two most up-regulated genes in cytokine-treated samples, with log2 fold changes of 12.208 and 10.901, respectively. Meanwhile, PTF1A and REG3G were identified as the top two most down-regulated genes, with log2 fold changes of -3.759 and -3.606, respectively. Of note, we also found 262 lncRNAs (long non-coding RNAs), 177 of which were inferred as novel lncRNAs. Further in-depth follow-up analysis of the transcriptional regulation reported in this study may shed light on the specific function of these lncRNAs.
Drosophila nemo is an essential gene involved in the regulation of programmed cell death.

PubMed

Mirkovic, Ivana; Charish, Kristi; Gorski, Sharon M; McKnight, Kristen; Verheyen, Esther M

2002-11-01

Nemo-like kinases define a novel family of serine/threonine kinases that are involved in integrating multiple signaling pathways. They are conserved regulators of Wnt/Wingless pathways, which may coordinate Wnt with TGFbeta-mediated signaling. Drosophila nemo was identified through its involvement in epithelial planar polarity, a process regulated by a non-canonical Wnt pathway. We have previously found that ectopic expression of Nemo using the Gal4-UAS system resulted in embryonic lethality associated with defects in patterning and head development. In this study we present our analyses of the phenotypes of germline clone-derived embryos. We observe lethality associated with head defects and reduction of programmed cell death and conclude that nmo is an essential gene. We also present data showing that nmo is involved in regulating apoptosis during eye development, based on both loss of function phenotypes and on genetic interactions with the pro-apoptotic gene reaper. Finally, we present genetic data from the adult wing that suggest the activity of ectopically expressed Nemo can be modulated by Jun N-terminal kinase (JNK) signaling. Such an observation supports the model that there is cross-talk between Wnt, TGFbeta and JNK signaling at multiple stages of development. Copyright 2002 Elsevier Science Ireland Ltd.
RNA-Seq analysis identifies key genes associated with haustorial development in the root hemiparasite Santalum album

PubMed Central

Zhang, Xinhua; Berkowitz, Oliver; Teixeira da Silva, Jaime A.; Zhang, Muhan; Ma, Guohua; Whelan, James; Duan, Jun

2015-01-01

Santalum album (sandalwood) is one of the economically important plant species in the Santalaceae for its production of highly valued perfume oils. Sandalwood is also a hemiparasitic tree that obtains some of its water and simple nutrients by tapping into other plants through haustoria which are highly specialized organs in parasitic angiosperms. However, an understanding of the molecular mechanisms involved in haustorium development is limited. In this study, RNA sequencing (RNA-seq) analyses were performed to identify changes in gene expression and metabolic pathways associated with the development of the S. album haustorium. A total of 56,011 non-redundant contigs with a mean contig size of 618 bp were obtained by de novo assembly of the transcriptome of haustoria and non-haustorial seedling roots. A substantial number of the identified differentially expressed genes were involved in cell wall metabolism and protein metabolism, as well as mitochondrial electron transport functions. Phytohormone-mediated regulation might play an important role during haustorial development. Especially, auxin signaling is likely to be essential for haustorial initiation, and genes related to cytokinin and gibberellin biosynthesis and metabolism are involved in haustorial development. Our results suggest that genes encoding nodulin-like proteins may be important for haustorial morphogenesis in S. album. The obtained sequence data will become a rich resource for future research in this interesting species. This information improves our understanding of haustorium development in root hemiparasitic species and will allow further exploration of the detailed molecular mechanisms underlying plant parasitism. PMID:26388878
Phenoscape: Identifying Candidate Genes for Evolutionary Phenotypes

PubMed Central

Edmunds, Richard C.; Su, Baofeng; Balhoff, James P.; Eames, B. Frank; Dahdul, Wasila M.; Lapp, Hilmar; Lundberg, John G.; Vision, Todd J.; Dunham, Rex A.; Mabee, Paula M.; Westerfield, Monte

2016-01-01

Phenotypes resulting from mutations in genetic model organisms can help reveal candidate genes for evolutionarily important phenotypic changes in related taxa. Although testing candidate gene hypotheses experimentally in nonmodel organisms is typically difficult, ontology-driven information systems can help generate testable hypotheses about developmental processes in experimentally tractable organisms. Here, we tested candidate gene hypotheses suggested by expert use of the Phenoscape Knowledgebase, specifically looking for genes that are candidates responsible for evolutionarily interesting phenotypes in the ostariophysan fishes that bear resemblance to mutant phenotypes in zebrafish. For this, we searched ZFIN for genetic perturbations that result in either loss of basihyal element or loss of scales phenotypes, because these are the ancestral phenotypes observed in catfishes (Siluriformes). We tested the identified candidate genes by examining their endogenous expression patterns in the channel catfish, Ictalurus punctatus. The experimental results were consistent with the hypotheses that these features evolved through disruption in developmental pathways at, or upstream of, brpf1 and eda/edar for the ancestral losses of basihyal element and scales, respectively. These results demonstrate that ontological annotations of the phenotypic effects of genetic alterations in model organisms, when aggregated within a knowledgebase, can be used effectively to generate testable, and useful, hypotheses about evolutionary changes in morphology. PMID:26500251
A Medium-Throughput Single Cell CRISPR-Cas9 Assay to Assess Gene Essentiality.

PubMed

Grassian, A R; Scales, T M E; Knutson, S K; Kuntz, K W; McCarthy, N J; Lowe, C E; Moore, J D; Copeland, R A; Keilhack, H; Smith, J J; Wickenden, J A; Ribich, S

2015-01-01

Target selection for oncology is a crucial step in the successful development of therapeutics. Clustered regularly interspaced short palindromic repeats (CRISPR)-Cas9 editing of specific loci offers an alternative method to RNA interference and small molecule inhibitors for determining whether a cell line is dependent on a specific gene product for proliferation or survival. In our initial studies using CRISPR-Cas9 to verify the dependence on EZH2 activity for proliferation of a SMARCB1/SNF5/INI1 mutant malignant rhabdoid tumor (MRT) cell line, we noted that the initial reduction in proliferation was lost over time. We hypothesized that in the few cells that retain proliferative capacity, at least one allele of EZH2 remains functional. To verify this, we developed an assay to analyze 10s-100s of clonal cell populations for target gene disruption using restriction digest and fluorescent fragment length analyses. Our results clearly show that in cell lines in which EZH2 is essential for proliferation, at least one potentially functional allele of EZH2 is retained in the clones that survive. This assay clearly indicates whether or not a specific gene is essential for survival and/or proliferation in a given cell line. Such data can aid the development of more robust therapeutics by increasing confidence in target selection.
Common Marker Genes Identified from Various Sample Types for Systemic Lupus Erythematosus.

PubMed

Bing, Peng-Fei; Xia, Wei; Wang, Lan; Zhang, Yong-Hong; Lei, Shu-Feng; Deng, Fei-Yan

2016-01-01

Systemic lupus erythematosus (SLE) is a complex auto-immune disease. Gene expression studies have been conducted to identify SLE-related genes in various types of samples. It is unknown whether there are common marker genes significant for SLE but independent of sample types, which may have potential for follow-up translational research. The aim of this study is to identify common marker genes across various sample types for SLE. Based on four public microarray gene expression datasets for SLE covering three representative types of blood-borne samples (monocyte; peripheral blood mononuclear cell, PBMC; whole blood), we utilized three statistics (fold-change, FC; t-test p value; false discovery rate adjusted p value) to scrutinize genes simultaneously regulated with SLE across various sample types. For common marker genes, we conducted the Gene Ontology enrichment analysis and Protein-Protein Interaction analysis to gain insights into their functions. We identified 10 common marker genes associated with SLE (IFI6, IFI27, IFI44L, OAS1, OAS2, EIF2AK2, PLSCR1, STAT1, RNASE2, and GSTO1). Significant up-regulation of IFI6, IFI27, and IFI44L with SLE was observed in all the studied sample types, though the FC was most striking in monocyte, compared with PBMC and whole blood (8.82-251.66 vs. 3.73-74.05 vs. 1.19-1.87). Eight of the above 10 genes, except RNASE2 and GSTO1, interact with each other and with known SLE susceptibility genes, participate in immune response, RNA and protein catabolism, and cell death. Our data suggest that there exist common marker genes across various sample types for SLE. The 10 common marker genes, identified herein, deserve follow-up studies to dissect their potential as diagnostic or therapeutic markers to predict SLE or treatment response.
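
The abstract above mentions a "false discovery rate adjusted p value" without naming the procedure. One common choice is the Benjamini-Hochberg step-up adjustment; the sketch below assumes that method purely for illustration, and the function name and example p values are hypothetical rather than taken from the study.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p values in the input order.

    Assumption: the study's FDR adjustment is not specified in the
    abstract; BH is used here only as a representative method.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # indices by ascending p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical per-gene p values from a t-test.
raw = [0.01, 0.04, 0.03, 0.005]
adj = benjamini_hochberg(raw)
print([round(p, 4) for p in adj])
```

A gene would then typically be retained only if it passes thresholds on all three statistics (fold change, raw p value, adjusted p value); the specific cutoffs used in the study are not given in the abstract.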
Integrative Analysis of GWASs, Human Protein Interaction, and Gene Expression Identified Gene Modules Associated With BMDs

PubMed Central

He, Hao; Zhang, Lei; Li, Jian; Wang, Yu-Ping; Zhang, Ji-Gang; Shen, Jie; Guo, Yan-Fang

2014-01-01

Context: To date, few systems genetics studies in the bone field have been performed. We designed our study from a systems-level perspective by integrating genome-wide association studies (GWASs), human protein-protein interaction (PPI) network, and gene expression to identify gene modules contributing to osteoporosis risk. Methods: First we searched for modules significantly enriched with bone mineral density (BMD)-associated genes in human PPI network by using 2 large meta-analysis GWAS datasets through a dense module search algorithm. One included 7 individual GWAS samples (Meta7). The other was from the Genetic Factors for Osteoporosis Consortium (GEFOS2). One was assigned as a discovery dataset and the other as an evaluation dataset, and vice versa. Results: In total, 42 modules and 129 modules were identified significantly in both Meta7 and GEFOS2 datasets for femoral neck and spine BMD, respectively. There were 3340 modules identified for hip BMD only in Meta7. As candidate modules, they were assessed for the biological relevance to BMD by gene set enrichment analysis in 2 expression profiles generated from circulating monocytes in subjects with low versus high BMD values. Interestingly, there were 2 modules significantly enriched in monocytes from the low BMD group in both gene expression datasets (nominal P value <.05). Two modules had 16 nonredundant genes. Functional enrichment analysis revealed that both modules were enriched for genes involved in Wnt receptor signaling and osteoblast differentiation. Conclusion: We highlighted 2 modules and novel genes playing important roles in the regulation of bone mass, providing important clues for therapeutic approaches for osteoporosis. PMID:25119315
A shell regeneration assay to identify biomineralization candidate genes in mytilid mussels.

PubMed

Hüning, Anne K; Lange, Skadi M; Ramesh, Kirti; Jacob, Dorrit E; Jackson, Daniel J; Panknin, Ulrike; Gutowska, Magdalena A; Philipp, Eva E R; Rosenstiel, Philip; Lucassen, Magnus; Melzner, Frank

2016-06-01

Biomineralization processes in bivalve molluscs are still poorly understood. Here we provide an analysis of specifically expressed sequences from a mantle transcriptome of the blue mussel, Mytilus edulis. We then developed a novel, integrative shell injury assay to test whether biomineralization candidate genes highly expressed in marginal and pallial mantle could be induced in central mantle tissue underlying the damaged shell areas. This experimental approach makes it possible to identify gene products that control the chemical micro-environment during calcification as well as organic matrix components. This is unlike existing methodological approaches that work retroactively to characterize calcification-relevant molecules and are only able to examine organic matrix components that are present in completed shells. In our assay an orthogonal array of nine 1 mm holes was drilled into the left valve, and mussels were suspended in net cages for 20, 29 and 36 days to regenerate. Structural observations using stereo-microscopy, SEM and Raman spectroscopy revealed organic sheet synthesis (day 20) as the first step of shell repair, followed by the deposition of calcite crystals (days 20 and 29) and aragonite tablets (day 36). The regeneration period was characterized by time-dependent shifts in gene expression in left central mantle tissue underlying the injured shell: (i) increased expression of two tyrosinase isoforms (TYR3: 29-fold and TYR6: 5-fold) at day 20 with a decline thereafter, (ii) an increase in expression of a gene encoding a nacrein-like protein (max. 100-fold) on day 29. The expression of an acidic Asp-Ser-rich protein was enhanced during the entire regeneration process. This proof-of-principle study demonstrates that genes that are specifically expressed in pallial and marginal mantle tissue can be induced (4 out of 10 genes) in central mantle following experimental injury of the overlying shell. Our findings suggest that regeneration assays can be used
Gene expression patterns combined with network analysis identify hub genes associated with bladder cancer.

PubMed

Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia

2015-06-01

To explore molecular mechanisms of bladder cancer (BC), a network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using the empirical Bayes method of the linear models for microarray data (limma) package. Co-expression networks were constructed from differentially co-expressed genes and links. The regulatory impact factor (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed with the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING), and clusters were obtained through the molecular complex detection (MCODE) algorithm. Centrality analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were the top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential biomarkers of BC. However, further validation is required and in-depth studies are needed to elucidate the pathogenesis of BC. Copyright © 2015 Elsevier Ltd. All rights reserved.
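Ranking hub genes "according to analysis of degree" can be pictured with a small sketch of normalized degree centrality on an undirected interaction network. The edge list format, the function names, and the placeholder node labels are assumptions made for illustration; the study itself built its networks with STRING and MCODE, which are not reproduced here.

```python
from collections import defaultdict


def degree_centrality(edges):
    """Normalized degree (neighbors / (n - 1)) for an undirected edge list."""
    neighbors = defaultdict(set)
    for a, b in edges:
        if a != b:  # ignore self-loops
            neighbors[a].add(b)
            neighbors[b].add(a)
    n = len(neighbors)
    if n < 2:
        return {node: 0.0 for node in neighbors}
    return {node: len(nbrs) / (n - 1) for node, nbrs in neighbors.items()}


def top_hubs(edges, k=3):
    """The k highest-degree nodes, ties broken alphabetically."""
    scores = degree_centrality(edges)
    return sorted(scores, key=lambda gene: (-scores[gene], gene))[:k]


# Hypothetical toy network; the labels are placeholders, not real interactions.
toy_edges = [("g1", "g2"), ("g1", "g3"), ("g1", "g4"), ("g2", "g3")]
print(top_hubs(toy_edges, k=2))
```

Stress and betweenness, the other two centralities mentioned, depend on shortest paths and would need a separate traversal; degree alone is shown because it is the measure the abstract ties to the reported hub genes.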
A cross-species bi-clustering approach to identifying conserved co-regulated genes.

PubMed

Sun, Jiangwen; Jiang, Zongliang; Tian, Xiuchun; Bi, Jinbo

2016-06-15

A growing number of studies have explored the process of pre-implantation embryonic development of multiple mammalian species. However, the conservation and variation among different species in their developmental programming are poorly defined due to the lack of effective computational methods for detecting co-regulated genes that are conserved across species. The most sophisticated method to date for identifying conserved co-regulated genes is a two-step approach. This approach first identifies gene clusters for each species by a cluster analysis of gene expression data, and subsequently computes the overlaps of clusters identified from different species to reveal common subgroups. This approach is ineffective at dealing with the noise in the expression data introduced by the complicated procedures in quantifying gene expression. Furthermore, due to the sequential nature of the approach, the gene clusters identified in the first step may have little overlap among different species in the second step, making it difficult to detect conserved co-regulated genes. We propose a cross-species bi-clustering approach which first denoises the gene expression data of each species into a data matrix. The rows of the data matrices of different species represent the same set of genes that are characterized by their expression patterns over the developmental stages of each species as columns. A novel bi-clustering method is then developed to cluster genes into subgroups by a joint sparse rank-one factorization of all the data matrices. This method decomposes a data matrix into a product of a column vector and a row vector, where the column vector is a consistent indicator across the matrices (species) to identify the same gene cluster and the row vector specifies for each species the developmental stages over which the clustered genes are co-regulated. An efficient optimization algorithm has been developed with convergence analysis. This approach was first validated on synthetic data and compared

Genome-Wide Identification by Transposon Insertion Sequencing of Escherichia coli K1 Genes Essential for In Vitro Growth, Gastrointestinal Colonizing Capacity, and Survival in Serum.

PubMed

McCarthy, Alex J; Stabler, Richard A; Taylor, Peter W

2018-04-01

Escherichia coli K1 strains are major causative agents of invasive disease of newborn infants. The age dependency of infection can be reproduced in neonatal rats. Colonization of the small intestine following oral administration of K1 bacteria leads rapidly to invasion of the blood circulation; bacteria that avoid capture by the mesenteric lymphatic system and evade antibacterial mechanisms in the blood may disseminate to cause organ-specific infections such as meningitis. Some E. coli K1 surface constituents, in particular the polysialic acid capsule, are known to contribute to invasive potential, but a comprehensive picture of the factors that determine the fully virulent phenotype has not emerged so far. We constructed a library and constituent sublibraries of ∼775,000 Tn5 transposon mutants of E. coli K1 strain A192PP and employed transposon-directed insertion site sequencing (TraDIS) to identify genes required for fitness for infection of 2-day-old rats. Transposon insertions were lacking in 357 genes following recovery on selective agar; these genes were considered essential for growth in nutrient-replete medium. Colonization of the midsection of the small intestine was facilitated by 167 E. coli K1 gene products. Restricted bacterial translocation across epithelial barriers precluded TraDIS analysis of gut-to-blood and blood-to-brain transits; 97 genes were required for survival in human serum. This study revealed that a large number of bacterial genes, many of which were not previously associated with systemic E. coli K1 infection, are required to realize full invasive potential. IMPORTANCE Escherichia coli K1 strains cause life-threatening infections in newborn infants. They are acquired from the mother at birth and colonize the small intestine, from where they invade the blood and central nervous system. It is difficult to obtain information from acutely ill patients that sheds light on physiological and bacterial factors determining invasive disease
Genome-Wide Identification by Transposon Insertion Sequencing of Escherichia coli K1 Genes Essential for In Vitro Growth, Gastrointestinal Colonizing Capacity, and Survival in Serum

PubMed Central

McCarthy, Alex J.

2018-01-01

ABSTRACT Escherichia coli K1 strains are major causative agents of invasive disease of newborn infants. The age dependency of infection can be reproduced in neonatal rats. Colonization of the small intestine following oral administration of K1 bacteria leads rapidly to invasion of the blood circulation; bacteria that avoid capture by the mesenteric lymphatic system and evade antibacterial mechanisms in the blood may disseminate to cause organ-specific infections such as meningitis. Some E. coli K1 surface constituents, in particular the polysialic acid capsule, are known to contribute to invasive potential, but a comprehensive picture of the factors that determine the fully virulent phenotype has not emerged so far. We constructed a library and constituent sublibraries of ∼775,000 Tn5 transposon mutants of E. coli K1 strain A192PP and employed transposon-directed insertion site sequencing (TraDIS) to identify genes required for fitness for infection of 2-day-old rats. Transposon insertions were lacking in 357 genes following recovery on selective agar; these genes were considered essential for growth in nutrient-replete medium. Colonization of the midsection of the small intestine was facilitated by 167 E. coli K1 gene products. Restricted bacterial translocation across epithelial barriers precluded TraDIS analysis of gut-to-blood and blood-to-brain transits; 97 genes were required for survival in human serum. This study revealed that a large number of bacterial genes, many of which were not previously associated with systemic E. coli K1 infection, are required to realize full invasive potential. IMPORTANCE Escherichia coli K1 strains cause life-threatening infections in newborn infants. They are acquired from the mother at birth and colonize the small intestine, from where they invade the blood and central nervous system. It is difficult to obtain information from acutely ill patients that sheds light on physiological and bacterial factors determining invasive
Antibiotic discovery throughout the Small World Initiative: A molecular strategy to identify biosynthetic gene clusters involved in antagonistic activity.

PubMed

Davis, Elizabeth; Sloan, Tyler; Aurelius, Krista; Barbour, Angela; Bodey, Elijah; Clark, Brigette; Dennis, Celeste; Drown, Rachel; Fleming, Megan; Humbert, Allison; Glasgo, Elizabeth; Kerns, Trent; Lingro, Kelly; McMillin, MacKenzie; Meyer, Aaron; Pope, Breanna; Stalevicz, April; Steffen, Brittney; Steindl, Austin; Williams, Carolyn; Wimberley, Carmen; Zenas, Robert; Butela, Kristen; Wildschutte, Hans

2017-06-01

The emergence of bacterial pathogens resistant to all known antibiotics is a global health crisis. Adding to this problem is that major pharmaceutical companies have shifted away from antibiotic discovery due to low profitability. As a result, the pipeline of new antibiotics is essentially dry and many bacteria now resist the effects of most commonly used drugs. To address this global health concern, citizen science through the Small World Initiative (SWI) was formed in 2012. As part of SWI, students isolate bacteria from their local environments, characterize the strains, and assay for antibiotic production. During the 2015 fall semester at Bowling Green State University, students isolated 77 soil-derived bacteria and genetically characterized strains using the 16S rRNA gene, identified strains exhibiting antagonistic activity, and performed an expanded SWI workflow using transposon mutagenesis to identify a biosynthetic gene cluster involved in toxigenic compound production. We identified one mutant with loss of antagonistic activity and through subsequent whole-genome sequencing and linker-mediated PCR identified a 24.9 kb biosynthetic gene locus likely involved in inhibitory activity in that mutant. Further assessment against human pathogens demonstrated the inhibition of Bacillus cereus, Listeria monocytogenes, and methicillin-resistant Staphylococcus aureus in the presence of this compound, thus supporting our molecular strategy as an effective research pipeline for SWI antibiotic discovery and genetic characterization. © 2017 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
Identifying differentially expressed genes in cancer patients using a non-parameter Ising model.

PubMed

Li, Xumeng; Feltus, Frank A; Sun, Xiaoqian; Wang, James Z; Luo, Feng

2011-10-01

Identification of genes and pathways involved in diseases and physiological conditions is a major task in systems biology. In this study, we developed a novel non-parameter Ising model to integrate protein-protein interaction network and microarray data for identifying differentially expressed (DE) genes. We also proposed a simulated annealing algorithm to find the optimal configuration of the Ising model. The Ising model was applied to two breast cancer microarray data sets. The results showed that more cancer-related DE sub-networks and genes were identified by the Ising model than those by the Markov random field model. Furthermore, cross-validation experiments showed that DE genes identified by Ising model can improve classification performance compared with DE genes identified by Markov random field model. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
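The idea of placing an Ising model on an interaction network and searching for its optimal configuration by simulated annealing can be sketched generically. The energy below, E(s) = -Σ h_i s_i - J Σ s_i s_j over network edges with s_i = +1 for "differentially expressed" and -1 otherwise, is a textbook form chosen for illustration; it is not the paper's non-parameter formulation, and the per-gene evidence values `field`, the `coupling` strength, and the cooling schedule are all assumptions.

```python
import math
import random


def energy(state, field, edges, coupling=1.0):
    """Ising energy: per-gene evidence term plus agreement between neighbors."""
    total = -sum(field[g] * state[g] for g in state)
    total -= coupling * sum(state[a] * state[b] for a, b in edges)
    return total


def anneal(field, edges, coupling=1.0, steps=5000,
           t_start=2.0, t_end=0.01, seed=0):
    """Single-spin-flip simulated annealing; returns the best state seen."""
    rng = random.Random(seed)
    genes = sorted(field)
    state = {g: rng.choice((-1, 1)) for g in genes}
    current = energy(state, field, edges, coupling)
    best, best_energy = dict(state), current
    for step in range(steps):
        # Geometric cooling from t_start down to t_end.
        temp = t_start * (t_end / t_start) ** (step / max(steps - 1, 1))
        gene = rng.choice(genes)
        state[gene] = -state[gene]  # propose a flip
        proposed = energy(state, field, edges, coupling)
        delta = proposed - current
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            current = proposed  # accept the move
            if current < best_energy:
                best, best_energy = dict(state), current
        else:
            state[gene] = -state[gene]  # reject: undo the flip
    return best, best_energy
```

The coupling term is what lets a gene with weak evidence of its own be labeled together with strongly supported neighbors, which is the intuition behind identifying sub-networks as well as single genes. Recomputing the full energy at each step keeps the sketch short; a practical version would update only the terms touched by the flipped gene.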
A recellularized human colon model identifies cancer driver genes

PubMed Central

Chen, Huanhuan Joyce; Wei, Zhubo; Sun, Jian; Bhattacharya, Asmita; Savage, David J; Serda, Rita; Mackeyev, Yuri; Curley, Steven A.; Bu, Pengcheng; Wang, Lihua; Chen, Shuibing; Cohen-Gould, Leona; Huang, Emina; Shen, Xiling; Lipkin, Steven M.; Copeland, Neal G.; Jenkins, Nancy A.; Shuler, Michael L.

2016-01-01

Refined cancer models are needed to bridge the gap between cell-line, animal and clinical research. Here we describe the engineering of an organotypic colon cancer model by recellularization of a native human matrix that contains cell-populated mucosa and an intact muscularis mucosa layer. This ex vivo system recapitulates the pathophysiological progression from APC-mutant neoplasia to submucosal invasive tumor. We used it to perform a Sleeping Beauty transposon mutagenesis screen to identify genes that cooperate with mutant APC in driving invasive neoplasia. 38 candidate invasion driver genes were identified, 17 of which have been previously implicated in colorectal cancer progression, including TCF7L2, TWIST2, MSH2, DCC and EPHB1/2. Six invasion driver genes that to our knowledge have not been previously described were validated in vitro using cell proliferation, migration and invasion assays, and ex vivo using recellularized human colon. These results demonstrate the utility of our organoid model for studying cancer biology. PMID:27398792
De Novo Sequencing and Analysis of Lemongrass Transcriptome Provide First Insights into the Essential Oil Biosynthesis of Aromatic Grasses.

PubMed

Meena, Seema; Kumar, Sarma R; Venkata Rao, D K; Dwivedi, Varun; Shilpashree, H B; Rastogi, Shubhra; Shasany, Ajit K; Nagegowda, Dinesh A

2016-01-01

Aromatic grasses of the genus Cymbopogon (Poaceae family) represent a unique group of plants that produce monoterpene-rich essential oils of diverse composition, which have great value in the flavor, fragrance, cosmetic, and aromatherapy industries. Despite the commercial importance of these natural aromatic oils, their biosynthesis at the molecular level remains unexplored. As the first step toward understanding essential oil biosynthesis, we performed de novo transcriptome assembly and analysis of C. flexuosus (lemongrass) by employing Illumina sequencing. Mining of transcriptome data and subsequent phylogenetic analysis led to identification of terpene synthases, pyrophosphatases, alcohol dehydrogenases, aldo-keto reductases, carotenoid cleavage dioxygenases, alcohol acetyltransferases, and aldehyde dehydrogenases, which are potentially involved in essential oil biosynthesis. Comparative essential oil profiling and mRNA expression analysis in three Cymbopogon species (C. flexuosus, aldehyde type; C. martinii, alcohol type; and C. winterianus, intermediate type) with varying essential oil composition indicated the involvement of identified candidate genes in the formation of alcohols, aldehydes, and acetates. Molecular modeling and docking further supported the role of identified protein sequences in aroma formation in Cymbopogon. Also, simple sequence repeats were found in the transcriptome, with many linked to terpene pathway genes including the genes potentially involved in aroma biosynthesis. This work provides the first insights into the essential oil biosynthesis of aromatic grasses, and the identified candidate genes and markers can be a great resource for biotechnological and molecular breeding approaches to modulate the essential oil composition.
De Novo Sequencing and Analysis of Lemongrass Transcriptome Provide First Insights into the Essential Oil Biosynthesis of Aromatic Grasses

PubMed Central

Meena, Seema; Kumar, Sarma R.; Venkata Rao, D. K.; Dwivedi, Varun; Shilpashree, H. B.; Rastogi, Shubhra; Shasany, Ajit K.; Nagegowda, Dinesh A.

2016-01-01

Aromatic grasses of the genus Cymbopogon (Poaceae family) represent a unique group of plants that produce monoterpene-rich essential oils of diverse composition, which have great value in the flavor, fragrance, cosmetic, and aromatherapy industries. Despite the commercial importance of these natural aromatic oils, their biosynthesis at the molecular level remains unexplored. As the first step toward understanding essential oil biosynthesis, we performed de novo transcriptome assembly and analysis of C. flexuosus (lemongrass) by employing Illumina sequencing. Mining of transcriptome data and subsequent phylogenetic analysis led to identification of terpene synthases, pyrophosphatases, alcohol dehydrogenases, aldo-keto reductases, carotenoid cleavage dioxygenases, alcohol acetyltransferases, and aldehyde dehydrogenases, which are potentially involved in essential oil biosynthesis. Comparative essential oil profiling and mRNA expression analysis in three Cymbopogon species (C. flexuosus, aldehyde type; C. martinii, alcohol type; and C. winterianus, intermediate type) with varying essential oil composition indicated the involvement of identified candidate genes in the formation of alcohols, aldehydes, and acetates. Molecular modeling and docking further supported the role of identified protein sequences in aroma formation in Cymbopogon. Also, simple sequence repeats were found in the transcriptome, with many linked to terpene pathway genes including the genes potentially involved in aroma biosynthesis. This work provides the first insights into the essential oil biosynthesis of aromatic grasses, and the identified candidate genes and markers can be a great resource for biotechnological and molecular breeding approaches to modulate the essential oil composition. PMID:27516768
Gene expression profiling combined with bioinformatics analysis identify biomarkers for Parkinson disease.

PubMed

Diao, Hongyu; Li, Xinxing; Hu, Sheng; Liu, Yunhui

2012-01-01

Parkinson disease (PD) progresses relentlessly and affects approximately 4% of the population aged over 80 years. It is difficult to diagnose in its early stages. The purpose of our study is to identify molecular biomarkers for PD initiation using a computational bioinformatics analysis of gene expression. We downloaded the gene expression profile of PD from Gene Expression Omnibus and identified differentially coexpressed genes (DCGs) and dysfunctional pathways in PD patients compared to controls. In addition, we built a regulatory network by mapping the DCGs to known regulatory data between transcription factors (TFs) and target genes and calculated the regulatory impact factor of each transcription factor. As a result, a total of 1004 genes associated with PD initiation were identified. Pathway enrichment of these genes suggests that biological processes of protein turnover were impaired in PD. In the regulatory network, HLF, E2F1 and STAT4 were found to have altered expression levels in PD patients. The expression levels of other transcription factors, NKX3-1, TAL1, RFX1 and EGR3, were not found to be altered. However, they regulated differentially expressed genes. In conclusion, we suggest that HLF, E2F1 and STAT4 may be used as molecular biomarkers for PD; however, more work is needed to validate our result.
Gene Expression Profiling Combined with Bioinformatics Analysis Identify Biomarkers for Parkinson Disease

PubMed Central

Diao, Hongyu; Li, Xinxing; Hu, Sheng; Liu, Yunhui

2012-01-01

Parkinson disease (PD) progresses relentlessly and affects approximately 4% of the population aged over 80 years. It is difficult to diagnose in its early stages. The purpose of our study is to identify molecular biomarkers for PD initiation using a computational bioinformatics analysis of gene expression. We downloaded the gene expression profile of PD from Gene Expression Omnibus and identified differentially coexpressed genes (DCGs) and dysfunctional pathways in PD patients compared to controls. In addition, we built a regulatory network by mapping the DCGs to known regulatory data between transcription factors (TFs) and target genes and calculated the regulatory impact factor of each transcription factor. As a result, a total of 1004 genes associated with PD initiation were identified. Pathway enrichment of these genes suggests that biological processes of protein turnover were impaired in PD. In the regulatory network, HLF, E2F1 and STAT4 were found to have altered expression levels in PD patients. The expression levels of other transcription factors, NKX3-1, TAL1, RFX1 and EGR3, were not found to be altered. However, they regulated differentially expressed genes. In conclusion, we suggest that HLF, E2F1 and STAT4 may be used as molecular biomarkers for PD; however, more work is needed to validate our result. PMID:23284986
Axon Regeneration Genes Identified by RNAi Screening in C. elegans

PubMed Central

Nix, Paola; Hammarlund, Marc; Hauth, Linda; Lachnit, Martina; Jorgensen, Erik M.

2014-01-01

Axons of the mammalian CNS lose the ability to regenerate soon after development due to both an inhibitory CNS environment and the loss of cell-intrinsic factors necessary for regeneration. The complex molecular events required for robust regeneration of mature neurons are not fully understood, particularly in vivo. To identify genes affecting axon regeneration in Caenorhabditis elegans, we performed both an RNAi-based screen for defective motor axon regeneration in unc-70/β-spectrin mutants and a candidate gene screen. From these screens, we identified at least 50 conserved genes with growth-promoting or growth-inhibiting functions. Through our analysis of mutants, we shed new light on certain aspects of regeneration, including the role of β-spectrin and membrane dynamics, the antagonistic activity of MAP kinase signaling pathways, and the role of stress in promoting axon regeneration. Many gene candidates had not previously been associated with axon regeneration and implicate new pathways of interest for therapeutic intervention. PMID:24403161
Gene expression profiles analysis identifies key genes for acute lung injury in patients with sepsis.

PubMed

Guo, Zhiqiang; Zhao, Chuncheng; Wang, Zheng

2014-09-26

To identify critical genes and biological pathways in acute lung injury (ALI), gene expression profiles of patients with ALI + sepsis were compared with those of patients with sepsis alone using bioinformatic tools. The GSE10474 dataset, comprising 13 whole blood samples with ALI + sepsis and 21 whole blood samples with sepsis alone, was downloaded from Gene Expression Omnibus. After preprocessing with the robust multichip averaging (RMA) method, differential analysis was conducted using the simpleaffy package based on the t-test and fold change. Hierarchical clustering was also performed using the function hclust from the package stats. In addition, functional enrichment analysis was conducted using iGepros. Moreover, the gene regulatory network was constructed with information from Kyoto Encyclopedia of Genes and Genomes (KEGG) and then visualized by Cytoscape. A total of 128 differentially expressed genes (DEGs) were identified, including 47 up- and 81 down-regulated genes. The significantly enriched functions included negative regulation of cell proliferation, regulation of response to stimulus and cellular component morphogenesis. A total of 27 DEGs were significantly enriched in 16 KEGG pathways, such as protein digestion and absorption, fatty acid metabolism, amoebiasis, etc. Furthermore, the regulatory network of these 27 DEGs was constructed, which involved several key genes, including protein tyrosine kinase 2 (PTK2), v-src avian sarcoma (SRC) and Caveolin 2 (CAV2). PTK2, SRC and CAV2 may be potential markers for diagnosis and treatment of ALI. The virtual slide(s) for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/5865162912987143.
An enhanced genome-scale metabolic reconstruction of Streptomyces clavuligerus identifies novel strain improvement strategies.

PubMed

Toro, León; Pinilla, Laura; Avignone-Rossa, Claudio; Ríos-Estepa, Rigoberto

2018-05-01

In this work, we expanded and updated a genome-scale metabolic model of Streptomyces clavuligerus. The model includes 1021 genes and 1494 biochemical reactions; genome-reaction information was curated and new features related to clavam metabolism and to the biomass synthesis equation were incorporated. The model was validated using experimental data from the literature and simulations were performed to predict cellular growth and clavulanic acid biosynthesis. Flux balance analysis (FBA) showed that limiting concentrations of phosphate and an excess of ammonia accumulation are unfavorable for growth and clavulanic acid biosynthesis. The evaluation of different objective functions for FBA showed that maximization of ATP yields the best predictions for cellular behavior in continuous cultures, while the maximization of growth rate provides better predictions for batch cultures. Through gene essentiality analysis, 130 essential genes were found using a limited in silico medium, while 100 essential genes were identified in amino acid-supplemented media. Finally, a strain design was carried out to identify candidate genes to be overexpressed or knocked out so as to maximize antibiotic biosynthesis. Interestingly, potential metabolic engineering targets, identified in this study, have not been tested experimentally.
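Why an in silico gene essentiality scan can report fewer essential genes on a supplemented medium than on a limited one can be shown with a deliberately crude sketch. It is not flux balance analysis: there is no stoichiometry and no flux optimization, only a reachability test on a hypothetical toy network in which deleting a gene disables every reaction that has no remaining enzyme option. All reaction, metabolite, and gene names below are invented for illustration.

```python
def active_reactions(reactions, deleted):
    """Reactions that keep at least one enzyme option untouched by the deletion.

    Each reaction lists alternative gene sets: isozymes are separate sets,
    and a multi-gene set stands for a complex that needs every member.
    """
    return [r for r in reactions
            if any(not (option & deleted) for option in r["genes"])]


def producible(reactions, medium):
    """Metabolites reachable from the medium by repeatedly firing reactions."""
    available = set(medium)
    changed = True
    while changed:
        changed = False
        for r in reactions:
            if r["substrates"] <= available and not r["products"] <= available:
                available |= r["products"]
                changed = True
    return available


def essential_genes(reactions, medium, target="biomass"):
    """Genes whose single deletion makes the target unreachable."""
    genes = set().union(*(opt for r in reactions for opt in r["genes"]))
    return sorted(
        g for g in genes
        if target not in producible(active_reactions(reactions, {g}), medium)
    )


# Hypothetical toy network (placeholder names, not from the published model).
toy = [
    {"substrates": {"glc"}, "products": {"pyr"}, "genes": [{"gA"}]},
    {"substrates": {"pyr"}, "products": {"aa"}, "genes": [{"gB"}]},
    {"substrates": {"aa_ext"}, "products": {"aa"}, "genes": [{"gT"}]},
    {"substrates": {"pyr", "aa"}, "products": {"biomass"},
     "genes": [{"gD", "gE"}]},
]
print(essential_genes(toy, {"glc"}))            # limited medium
print(essential_genes(toy, {"glc", "aa_ext"}))  # amino acid supplemented
```

In the toy case the biosynthetic gene stops being essential once the amino acid can be taken up from the medium, which mirrors the direction of the 130 versus 100 result reported above. A genuine analysis would instead solve a linear program over the stoichiometric matrix for each knockout.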
A Functional Genomics Approach to Identify Novel Breast Cancer Gene Targets in Yeast

DTIC Science & Technology

2004-05-01

Award Number: DAMD17-03-1-0232. Title: A Functional Genomics Approach to Identify Novel Breast Cancer Gene Targets in Yeast. Author: Craig Bennett, Ph.D. Abstract (maximum 200 words): We are using the yeast Saccharomyces cerevisiae to identify new cancer gene targets that interact with the
DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.

PubMed

Mistry, Divya; Wise, Roger P; Dickerson, Julie A

2017-01-01

Identification of central genes and proteins in biomolecular networks provides credible candidates for pathway analysis, functional analysis, and essentiality prediction. The DiffSLC centrality measure predicts central and essential genes and proteins using a protein-protein interaction network. Network centrality measures prioritize nodes and edges based on their importance to the network topology. These measures helped identify critical genes and proteins in biomolecular networks. The proposed centrality measure, DiffSLC, combines the number of interactions of a protein and the gene coexpression values of genes from which those proteins were translated, as a weighting factor to bias the identification of essential proteins in a protein interaction network. Potentially essential proteins with low node degree are promoted through eigenvector centrality. Thus, the gene coexpression values are used in conjunction with the eigenvector of the network's adjacency matrix and edge clustering coefficient to improve essentiality prediction. The outcome of this prediction is shown using three variations: (1) inclusion or exclusion of gene co-expression data, (2) impact of different coexpression measures, and (3) impact of different gene expression data sets. For a total of seven networks, DiffSLC is compared to other centrality measures using Saccharomyces cerevisiae protein interaction networks and gene expression data. Comparisons are also performed for the top ranked proteins against the known essential genes from the Saccharomyces Gene Deletion Project, which show that DiffSLC detects more essential proteins and has a higher area under the ROC curve than other compared methods. This makes DiffSLC a stronger alternative to other centrality methods for detecting essential genes using a protein-protein interaction network that obeys the centrality-lethality principle. DiffSLC is implemented using the igraph package in R and the networkx package in Python. The Python package can be
MMTV insertional mutagenesis identifies genes, gene families and pathways involved in mammary cancer.

PubMed

Theodorou, Vassiliki; Kimm, Melanie A; Boer, Mandy; Wessels, Lodewyk; Theelen, Wendy; Jonkers, Jos; Hilkens, John

2007-06-01

We performed a high-throughput retroviral insertional mutagenesis screen in mouse mammary tumor virus (MMTV)-induced mammary tumors and identified 33 common insertion sites, of which 17 genes were previously not known to be associated with mammary cancer and 13 had not previously been linked to cancer in general. Although members of the Wnt and fibroblast growth factors (Fgf) families were frequently tagged, our exhaustive screening for MMTV insertion sites uncovered a new repertoire of candidate breast cancer oncogenes. We validated one of these genes, Rspo3, as an oncogene by overexpression in a p53-deficient mammary epithelial cell line. The human orthologs of the candidate oncogenes were frequently deregulated in human breast cancers and associated with several tumor parameters. Computational analysis of all MMTV-tagged genes uncovered specific gene families not previously associated with cancer and showed a significant overrepresentation of protein domains and signaling pathways mainly associated with development and growth factor signaling. Comparison of all tagged genes in MMTV and Moloney murine leukemia virus-induced malignancies showed that both viruses target mostly different genes that act predominantly in distinct pathways.
A genetic replacement system for selection-based engineering of essential proteins

PubMed Central

2012-01-01

Background Essential genes represent the core of biological functions required for viability. Molecular understanding of essentiality as well as design of synthetic cellular systems includes the engineering of essential proteins. An impediment to this effort is the lack of growth-based selection systems suitable for directed evolution approaches. Results We established a simple strategy for genetic replacement of an essential gene by a (library of) variant(s) during a transformation. The system was validated using three different essential genes and plasmid combinations and it reproducibly shows transformation efficiencies on the order of 10^7 transformants per microgram of DNA without any identifiable false positives. This allowed for reliable recovery of functional variants out of at least a 10^5-fold excess of non-functional variants. This outperformed selection in conventional bleach-out strains by at least two orders of magnitude, where recombination between functional and non-functional variants interfered with reliable recovery even in recA negative strains. Conclusions We propose that this selection system is extremely suitable for evaluating large libraries of engineered essential proteins resulting in the reliable isolation of functional variants in a clean strain background which can readily be used for in vivo applications as well as expression and purification for use in in vitro studies. PMID:22898007
Essential protein discovery based on a combination of modularity and conservatism.

PubMed

Zhao, Bihai; Wang, Jianxin; Li, Xueyong; Wu, Fang-Xiang

2016-11-01

Essential proteins are indispensable for the survival of a living organism and play important roles in the emerging field of synthetic biology. Many computational methods have been proposed to identify essential proteins by using the topological features of interactome networks. However, most of these methods ignore the intrinsic biological meaning of proteins. Research shows that essentiality is tied not only to the protein or gene itself, but also to the molecular modules to which that protein belongs. The results of this study reveal the modularity of essential proteins. On the other hand, essential proteins are more evolutionarily conserved than nonessential proteins and frequently bind each other. That is to say, conservatism is another important feature of essential proteins. Multiple networks are constructed by integrating protein-protein interaction (PPI) networks, time course gene expression data and protein domain information. Based on these networks, a new essential protein identification method is proposed that combines the modularity and conservatism of proteins. Experimental results show that the proposed method outperforms other essential protein identification methods in terms of the number of essential proteins among the top-ranked candidates. Copyright © 2016. Published by Elsevier Inc.
Identifying Mendelian disease genes with the Variant Effect Scoring Tool

PubMed Central

2013-01-01

Background Whole exome sequencing studies identify hundreds to thousands of rare protein coding variants of ambiguous significance for human health. Computational tools are needed to accelerate the identification of specific variants and genes that contribute to human disease. Results We have developed the Variant Effect Scoring Tool (VEST), a supervised machine learning-based classifier, to prioritize rare missense variants with likely involvement in human disease. The VEST classifier training set comprised ~ 45,000 disease mutations from the latest Human Gene Mutation Database release and another ~45,000 high frequency (allele frequency >1%) putatively neutral missense variants from the Exome Sequencing Project. VEST outperforms some of the most popular methods for prioritizing missense variants in carefully designed holdout benchmarking experiments (VEST ROC AUC = 0.91, PolyPhen2 ROC AUC = 0.86, SIFT4.0 ROC AUC = 0.84). VEST estimates variant score p-values against a null distribution of VEST scores for neutral variants not included in the VEST training set. These p-values can be aggregated at the gene level across multiple disease exomes to rank genes for probable disease involvement. We tested the ability of an aggregate VEST gene score to identify candidate Mendelian disease genes, based on whole-exome sequencing of a small number of disease cases. We used whole-exome data for two Mendelian disorders for which the causal gene is known. Considering only genes that contained variants in all cases, the VEST gene score ranked dihydroorotate dehydrogenase (DHODH) number 2 of 2253 genes in four cases of Miller syndrome, and myosin-3 (MYH3) number 2 of 2313 genes in three cases of Freeman Sheldon syndrome. Conclusions Our results demonstrate the potential power gain of aggregating bioinformatics variant scores into gene-level scores and the general utility of bioinformatics in assisting the search for disease genes in large-scale exome sequencing studies. VEST is
Gene-Based Genome-Wide Association Analysis in European and Asian Populations Identified Novel Genes for Rheumatoid Arthritis.

PubMed

Zhu, Hong; Xia, Wei; Mo, Xing-Bo; Lin, Xiang; Qiu, Ying-Hua; Yi, Neng-Jun; Zhang, Yong-Hong; Deng, Fei-Yan; Lei, Shu-Feng

2016-01-01

Rheumatoid arthritis (RA) is a complex autoimmune disease. Using a gene-based association research strategy, the present study aims to detect unknown susceptibility to RA and to address the ethnic differences in genetic susceptibility to RA between European and Asian populations. Gene-based association analyses were performed with KGG 2.5 by using publicly available large RA datasets (14,361 RA cases and 43,923 controls of European subjects, 4,873 RA cases and 17,642 controls of Asian subjects). For the newly identified RA-associated genes, gene set enrichment analyses and protein-protein interaction analyses were carried out with DAVID and STRING version 10.0, respectively. Differential expression verification was conducted using 4 GEO datasets. The expression levels of three selected 'highly verified' genes were measured by ELISA among our in-house RA cases and controls. A total of 221 RA-associated genes were newly identified by gene-based association study, including 71 'overlapped', 76 'European-specific' and 74 'Asian-specific' genes. Among them, 105 genes were significantly differentially expressed between RA patients and healthy controls in at least one dataset, especially 20 genes, including 11 'overlapped' (ABCF1, FLOT1, HLA-F, IER3, TUBB, ZKSCAN4, BTN3A3, HSP90AB1, CUTA, BRD2, HLA-DMA), 5 'European-specific' (PHTF1, RPS18, BAK1, TNFRSF14, SUOX) and 4 'Asian-specific' (RNASET2, HFE, BTN2A2, MAPK13) genes, whose differential expression was significant in at least three datasets. The protein expression levels of two selected genes, FLOT1 (P value = 1.70E-02) and HLA-DMA (P value = 4.70E-02), in plasma were significantly different in our in-house samples. Our study identified 221 novel RA-associated genes and especially highlighted the importance of 20 candidate genes in RA. The results addressed ethnic genetic background differences for RA susceptibility between European and Asian populations and detected a long list of overlapped or ethnic-specific RA genes. The
Systematic analysis of microarray datasets to identify Parkinson's disease‑associated pathways and genes.

PubMed

Feng, Yinling; Wang, Xuefeng

2017-03-01

In order to investigate commonly disturbed genes and pathways in various brain regions of patients with Parkinson's disease (PD), microarray datasets from previous studies were collected and systematically analyzed. Different normalization methods were applied to microarray datasets from different platforms. A strategy combining gene co‑expression networks and clinical information was adopted, using weighted gene co‑expression network analysis (WGCNA) to screen for commonly disturbed genes in different brain regions of patients with PD. Functional enrichment analysis of commonly disturbed genes was performed using the Database for Annotation, Visualization, and Integrated Discovery (DAVID). Co‑pathway relationships were identified with Pearson's correlation coefficient tests and a hypergeometric distribution‑based test. Common genes in pathway pairs were selected out and regarded as risk genes. A total of 17 microarray datasets from 7 platforms were retained for further analysis. Five gene coexpression modules were identified, containing 9,745, 736, 233, 101 and 93 genes, respectively. One module was significantly correlated with PD samples and thus the 736 genes it contained were considered to be candidate PD‑associated genes. Functional enrichment analysis demonstrated that these genes were implicated in oxidative phosphorylation and PD. A total of 44 pathway pairs and 52 risk genes were revealed, and a risk gene pathway relationship network was constructed. Eight modules were identified and were revealed to be associated with PD, cancers and metabolism. A number of disturbed pathways and risk genes were unveiled in PD, and these findings may help advance understanding of PD pathogenesis.
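The hypergeometric distribution-based test mentioned in this abstract can be sketched in a few lines. The abstract does not give the exact test setup, so the background size (taken here as the sum of the five module sizes), the pathway size, and the overlap count below are hypothetical values chosen only for illustration.

```python
from math import comb


def hypergeom_sf(k, N, K, n):
    """Return P(X >= k) for X ~ Hypergeometric.

    N: genes in the background, K: genes annotated to the pathway,
    n: genes in the candidate list, k: observed overlap.
    """
    upper = min(K, n)
    total = comb(N, n)
    # math.comb returns 0 when the lower index exceeds the upper one,
    # so impossible overlaps contribute nothing to the sum.
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / total


# Hypothetical example: 736 candidate genes drawn from a background of
# 9745 + 736 + 233 + 101 + 93 genes; a pathway with 100 annotated genes,
# 20 of which fall in the candidate list.
background = 9745 + 736 + 233 + 101 + 93
p_value = hypergeom_sf(20, background, 100, 736)
```

A small `p_value` indicates that the overlap is larger than expected by chance. When many pathways are tested, a multiple-testing correction would normally follow.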

A risk of essential thrombocythemia in carriers of constitutional CHEK2 gene mutations.

PubMed

Janiszewska, Hanna; Bak, Aneta; Pilarska, Maria; Heise, Marta; Junkiert-Czarnecka, Anna; Kuliszkiewicz-Janus, Małgorzata; Całbecka, Małgorzata; Jaźwiec, Bozena; Wołowiec, Dariusz; Kuliczkowski, Kazimierz; Haus, Olga

2012-03-01

Germline mutations of the CHEK2 gene have been reported in some myeloid and lymphoid malignancies, but their impact on development of essential thrombocythemia has not been studied. In 16 out of 106 (15.1%) consecutive patients, newly diagnosed with essential thrombocythemia, we found one of four analyzed CHEK2 mutations: I157T, 1100delC, IVS2+1G>A or del5395. They were associated with the increased risk of disease (OR=3.8; P=0.002). The median age at ET diagnosis among CHEK2+/JAK2V617F+ patients was seven years lower than that among CHEK2-/JAK2V617F+ (52 vs. 59 years; P=0.04), whereas there was no difference in the medians of hematologic parameters between these groups. The results obtained suggest that CHEK2 mutations could potentially contribute to the susceptibility to essential thrombocythemia. The germline inactivation of CHEK2, as it seems, has no direct impact on the development of disease, but it could cause disruption of cell cycle checkpoints and initiate or support the cancerogenic process of essential thrombocythemia at a younger age.
A risk of essential thrombocythemia in carriers of constitutional CHEK2 gene mutations

PubMed Central

Janiszewska, Hanna; Bąk, Aneta; Pilarska, Maria; Heise, Marta; Junkiert-Czarnecka, Anna; Kuliszkiewicz-Janus, Małgorzata; Całbecka, Małgorzata; Jaźwiec, Bożena; Wołowiec, Dariusz; Kuliczkowski, Kazimierz; Haus, Olga

2012-01-01

Germline mutations of the CHEK2 gene have been reported in some myeloid and lymphoid malignancies, but their impact on development of essential thrombocythemia has not been studied. In 16 out of 106 (15.1%) consecutive patients, newly diagnosed with essential thrombocythemia, we found one of four analyzed CHEK2 mutations: I157T, 1100delC, IVS2+1G>A or del5395. They were associated with the increased risk of disease (OR=3.8; P=0.002). The median age at ET diagnosis among CHEK2+/JAK2V617F+ patients was seven years lower than that among CHEK2−/JAK2V617F+ (52 vs. 59 years; P=0.04), whereas there was no difference in the medians of hematologic parameters between these groups. The results obtained suggest that CHEK2 mutations could potentially contribute to the susceptibility to essential thrombocythemia. The germline inactivation of CHEK2, as it seems, has no direct impact on the development of disease, but it could cause disruption of cell cycle checkpoints and initiate or support the cancerogenic process of essential thrombocythemia at a younger age. PMID:22058216
Identifying Candidate Reprogramming Genes in Mouse Induced Pluripotent Stem Cells.

PubMed

Gao, Fang; Li, Jingyu; Zhang, Heng; Yang, Xu; An, Tiezhu

2017-08-01

Factor-based induced reprogramming approaches have tremendous potential for human regenerative medicine, but the efficiencies of these approaches are still low. In this study, we analyzed the global transcriptional profiles of mouse induced pluripotent stem cells (miPSCs) and mouse embryonic stem cells (mESCs) from seven different labs and present here the first successful clustering according to cell type, not by lab of origin. We identified 2131 differentially expressed genes (DEs) as candidate pluripotency-associated genes by comparing mESCs/miPSCs with somatic cells and 720 DEs between miPSCs and mESCs. Interestingly, there was a significant overlap between the two DE sets. Therefore, we defined the overlap DEs as "consensus DEs" including 313 miPSC-specific genes expressed at a higher level in miPSCs versus mESCs and 184 mESC-specific genes in total and reasoned that these may contribute to the differences in pluripotency between mESCs and miPSCs. A classification of "consensus DEs" according to their different expression levels between somatic cells and mESCs/miPSCs shows that 86% of the miPSC-specific genes are more highly expressed in somatic cells, while 73% of mESC-specific genes are highly expressed in mESCs/miPSCs, indicating that the miPSCs have not efficiently silenced the expression pattern of the somatic cells from which they are derived and have failed to completely induce the genes with high expression levels in mESCs. We further revealed a strong correlation between oocyte-enriched factors and insufficiently induced mESC-specific genes and identified 11 hub genes via network analysis. In light of these findings, we postulated that these key hub genes might not only drive somatic cell nuclear transfer (SCNT) reprogramming but also augment the efficiency and quality of miPSC reprogramming.
Exome Sequencing Identifies Three Novel Candidate Genes Implicated in Intellectual Disability

PubMed Central

Azam, Maleeha; Ayub, Humaira; Vissers, Lisenka E. L. M.; Gilissen, Christian; Ali, Syeda Hafiza Benish; Riaz, Moeen; Veltman, Joris A.; Pfundt, Rolph; van Bokhoven, Hans; Qamar, Raheel

2014-01-01

Intellectual disability (ID) is a major health problem mostly with an unknown etiology. Recently exome sequencing of individuals with ID identified novel genes implicated in the disease. Therefore the purpose of the present study was to identify the genetic cause of ID in one syndromic and two non-syndromic Pakistani families. Whole exome of three ID probands was sequenced. Missense variations in two plausible novel genes implicated in autosomal recessive ID were identified: lysine (K)-specific methyltransferase 2B (KMT2B), zinc finger protein 589 (ZNF589), as well as hedgehog acyltransferase (HHAT) with a de novo mutation with autosomal dominant mode of inheritance. The KMT2B recessive variant is the first report of recessive Kleefstra syndrome-like phenotype. Identification of plausible causative mutations for two recessive and a dominant type of ID, in genes not previously implicated in disease, underscores the large genetic heterogeneity of ID. These results also support the viewpoint that large number of ID genes converge on limited number of common networks i.e. ZNF589 belongs to KRAB-domain zinc-finger proteins previously implicated in ID, HHAT is predicted to affect sonic hedgehog, which is involved in several disorders with ID, KMT2B associated with syndromic ID fits the epigenetic module underlying the Kleefstra syndromic spectrum. The association of these novel genes in three different Pakistani ID families highlights the importance of screening these genes in more families with similar phenotypes from different populations to confirm the involvement of these genes in pathogenesis of ID. PMID:25405613
Integrating Gene Expression with Summary Association Statistics to Identify Genes Associated with 30 Complex Traits.

PubMed

Mancuso, Nicholas; Shi, Huwenbo; Goddard, Pagé; Kichaev, Gleb; Gusev, Alexander; Pasaniuc, Bogdan

2017-03-02

Although genome-wide association studies (GWASs) have identified thousands of risk loci for many complex traits and diseases, the causal variants and genes at these loci remain largely unknown. Here, we introduce a method for estimating the local genetic correlation between gene expression and a complex trait and utilize it to estimate the genetic correlation due to predicted expression between pairs of traits. We integrated gene expression measurements from 45 expression panels with summary GWAS data to perform 30 multi-tissue transcriptome-wide association studies (TWASs). We identified 1,196 genes whose expression is associated with these traits; of these, 168 reside more than 0.5 Mb away from any previously reported GWAS significant variant. We then used our approach to find 43 pairs of traits with significant genetic correlation at the level of predicted expression; of these, eight were not found through genetic correlation at the SNP level. Finally, we used bi-directional regression to find evidence that BMI causally influences triglyceride levels and that triglyceride levels causally influence low-density lipoprotein. Together, our results provide insight into the role of gene expression in the susceptibility of complex traits and diseases. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Using SCOPE to identify potential regulatory motifs in coregulated genes.

PubMed

Martyanov, Viktor; Gross, Robert H

2011-05-31

SCOPE is an ensemble motif finder that uses three component algorithms in parallel to identify potential regulatory motifs by over-representation and motif position preference. Each component algorithm is optimized to find a different kind of motif. By taking the best of these three approaches, SCOPE performs better than any single algorithm, even in the presence of noisy data. In this article, we utilize a web version of SCOPE to examine genes that are involved in telomere maintenance. SCOPE has been incorporated into at least two other motif finding programs and has been used in other studies. The three algorithms that comprise SCOPE are BEAM, which finds non-degenerate motifs (ACCGGT), PRISM, which finds degenerate motifs (ASCGWT), and SPACER, which finds longer bipartite motifs (ACCnnnnnnnnGGT). These three algorithms have been optimized to find their corresponding type of motif. Together, they allow SCOPE to perform extremely well. Once a gene set has been analyzed and candidate motifs identified, SCOPE can look for other genes that contain the motif which, when added to the original set, will improve the motif score. This can occur through over-representation or motif position preference. Working with partial gene sets that have biologically verified transcription factor binding sites, SCOPE was able to identify most of the rest of the genes also regulated by the given transcription factor. Output from SCOPE shows candidate motifs, their significance, and other information both as a table and as a graphical motif map. FAQs and video tutorials are available at the SCOPE web site which also includes a "Sample Search" button that allows the user to perform a trial run. SCOPE has a very friendly user interface that enables novice users to access the algorithm's full power without having to become an expert in the bioinformatics of motif finding. As input, SCOPE can take a list of genes, or FASTA sequences. These can be entered in browser text fields, or read from
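The three motif types named in this abstract (non-degenerate, degenerate, and bipartite) can all be written with IUPAC nucleotide codes, where for example S stands for C or G, W for A or T, and N for any base. The sketch below only scans a sequence for occurrences of such a motif; it is not the BEAM, PRISM, or SPACER algorithm, and the function names and test sequence are hypothetical.

```python
import re

# Standard IUPAC nucleotide codes mapped to regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def motif_to_pattern(motif):
    """Translate an IUPAC motif such as 'ASCGWT' into a regex string."""
    return "".join(IUPAC[ch] for ch in motif.upper())


def find_motif(motif, sequence):
    """Return 0-based start positions of all matches, including overlapping ones."""
    # A lookahead makes the regex engine test every start position.
    pattern = re.compile("(?=" + motif_to_pattern(motif) + ")")
    return [m.start() for m in pattern.finditer(sequence.upper())]


# Degenerate motif from the abstract scanned against a made-up sequence.
hits = find_motif("ASCGWT", "TTACCGATGGAGCGTTAA")
print(hits)  # [2, 10]
```

The same function handles the bipartite form, since each lowercase `n` in `ACCnnnnnnnnGGT` is upper-cased and expanded to `[ACGT]`.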
[Key effect genes responding to nerve injury identified by gene ontology and computer pattern recognition].

PubMed

Pan, Qian; Peng, Jin; Zhou, Xue; Yang, Hao; Zhang, Wei

2012-07-01

To screen for important genes in large microarray datasets generated after nerve injury, we combined the gene ontology (GO) method with computer pattern recognition to find key genes responding to nerve injury, and then verified one of the screened genes. Data mining and gene ontology analysis of the gene chip dataset GSE26350 were carried out in MATLAB. Cd44 was selected from the screened key genes by comparing the genes' GO terms and their positions on the principal component score map. Functional interference was used to disrupt the normal binding of Cd44 to one of its ligands, chondroitin sulfate C (CSC), in order to observe neurite extension. Gene ontology analysis showed that the first genes on the score map (marked by red *) were mainly distributed in molecular transducer activity, receptor activity, protein binding, and other molecular function GO terms. Cd44 is one of six effector protein genes and attracted our attention because of its functional diversity. After different reagents were added to the medium to interfere with the normal binding of CSC and Cd44, CSC's inhibition of neurite extension was relieved to varying degrees. CSC can inhibit neurite extension by binding Cd44 on the neuron membrane. This verifies that important genes in given physiological processes can be identified by gene ontology analysis of gene chip data.
The effect of drought stress on the expression of key genes involved in the biosynthesis of phenylpropanoids and essential oil components in basil (Ocimum basilicum L.).

PubMed

Abdollahi Mandoulakani, Babak; Eyvazpour, Elham; Ghadimzadeh, Morteza

2017-07-01

Basil (Ocimum basilicum L.), a medicinal plant of the Lamiaceae family, is used in traditional medicine; its essential oil is a rich source of phenylpropanoids. Methylchavicol and methyleugenol are the most important constituents of basil essential oil. Drought stress is proposed to enhance the essential oil composition and expression levels of the genes involved in its biosynthesis. In the current investigation, an experiment based on a completely randomized design (CRD) with three replications was conducted in the greenhouse to study the effect of drought stress on the expression level of four genes involved in the phenylpropanoid biosynthesis pathway in O. basilicum cv. Keshkeni luvelou. The genes studied were chavicol O-methyl transferase (CVOMT), eugenol O-methyl transferase (EOMT), cinnamate 4-hydroxylase (C4H), 4-coumarate CoA ligase (4CL), and cinnamyl alcohol dehydrogenase (CAD). The effect of drought stress on the essential oil compounds and their relationship with the expression levels of the studied genes were also investigated. Plants were subjected to levels of 100%, 75%, and 50% of field capacity (FC) at the 6-8 leaf stage. Essential oil compounds were identified by gas chromatography/mass spectrometry (GC-MS) at flowering stage and the levels of gene expression were determined by real-time PCR in plant leaves at the same stage. Results showed that drought stress increased the amount of methylchavicol, methyleugenol, β-myrcene and α-bergamotene. The maximum amount of these compounds was observed at 50% FC. Real-time PCR analysis revealed that severe drought stress (50% FC) increased the expression level of CVOMT and EOMT by about 6.46 and 46.33 times, respectively, whereas that of CAD remained relatively unchanged. The expression levels of 4CL and C4H were reduced under drought stress conditions. Our results also demonstrated that changes in the expression levels of CVOMT and EOMT are significantly correlated with methylchavicol (r = 0.94, P ≤ 0
Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development

PubMed Central

Takeda, Haruna; Rust, Alistair G.; Ward, Jerrold M.; Yew, Christopher Chin Kuan; Jenkins, Nancy A.; Copeland, Neal G.

2016-01-01

Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4+/− mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC. PMID:27006499
Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development.

PubMed

Takeda, Haruna; Rust, Alistair G; Ward, Jerrold M; Yew, Christopher Chin Kuan; Jenkins, Nancy A; Copeland, Neal G

2016-04-05

Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4(+/-) mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC.
Novel Myopia Genes and Pathways Identified From Syndromic Forms of Myopia

PubMed Central

Loughman, James; Wildsoet, Christine F.; Williams, Cathy; Guggenheim, Jeremy A.

2018-01-01

Purpose To test the hypothesis that genes known to cause clinical syndromes featuring myopia also harbor polymorphisms contributing to nonsyndromic refractive errors. Methods Clinical phenotypes and syndromes that have refractive errors as a recognized feature were identified using the Online Mendelian Inheritance in Man (OMIM) database. One hundred fifty-four unique causative genes were identified, of which 119 were specifically linked with myopia and 114 represented syndromic myopia (i.e., myopia and at least one other clinical feature). Myopia was the only refractive error listed for 98 genes and hyperopia the only refractive error noted for 28 genes, with the remaining 28 genes linked to phenotypes with multiple forms of refractive error. Pathway analysis was carried out to find biological processes overrepresented within these sets of genes. Genetic variants located within 50 kb of the 119 myopia-related genes were evaluated for involvement in refractive error by analysis of summary statistics from genome-wide association studies (GWAS) conducted by the CREAM Consortium and 23andMe, using both single-marker and gene-based tests. Results Pathway analysis identified several biological processes already implicated in refractive error development through prior GWAS analyses and animal studies, including extracellular matrix remodeling, focal adhesion, and axon guidance, supporting the research hypothesis. Novel pathways also implicated in myopia development included mannosylation, glycosylation, lens development, gliogenesis, and Schwann cell differentiation. Hyperopia was found to be linked to a different pattern of biological processes, mostly related to organogenesis. Comparison with GWAS findings further confirmed that syndromic myopia genes were enriched for genetic variants that influence refractive errors in the general population. Gene-based analyses implicated 21 novel candidate myopia genes (ADAMTS18, ADAMTS2, ADAMTSL4, AGK, ALDH18A1, ASXL1, COL4A1
Expression profiling identifies novel Hh/Gli regulated genes in developing zebrafish embryos.

PubMed Central

Bergeron, Sadie A.; Milla, Luis A.; Villegas, Rosario; Shen, Meng-Chieh; Burgess, Shawn M.; Allende, Miguel L.; Karlstrom, Rolf O.; Palma, Verónica

2008-01-01

The Hedgehog (Hh) signaling pathway plays critical instructional roles during embryonic development. Mis-regulation of Hh/Gli signaling is a major causative factor in human congenital disorders and in a variety of cancers. The zebrafish is a powerful genetic model for the study of Hh signaling during embryogenesis, as a large number of mutants have been identified affecting different components of the Hh/Gli signaling system. By performing global profiling of gene expression in different Hh/Gli gain- and loss-of-function scenarios we identified several known (e.g. ptc1 and nkx2.2a) as well as a large number of novel Hh-regulated genes that are differentially expressed in embryos with altered Hh/Gli signaling function. By uncovering changes in tissue-specific gene expression, we revealed new embryological processes that are influenced by Hh signaling. We thus provide a comprehensive survey of Hh/Gli-regulated genes during embryogenesis and we identify new Hh-regulated genes that may be targets of mis-regulation during tumorigenesis. PMID:18055165
Identifying Stress Transcription Factors Using Gene Expression and TF-Gene Association Data

PubMed Central

Wu, Wei-Sheng; Chen, Bor-Sen

2007-01-01

Unicellular organisms such as yeasts have evolved to survive environmental stresses by rapidly reorganizing the genomic expression program to meet the challenges of harsh environments. The complex adaptation mechanisms to stress remain to be elucidated. In this study, we developed the Stress Transcription Factor Identification Algorithm (STFIA), which integrates gene expression and TF-gene association data to identify the stress transcription factors (TFs) of six kinds of stresses. We identified some general stress TFs that respond to various stresses, and some specific stress TFs that respond to one specific stress. The biological significance of our findings is validated by the literature. We found that a small number of TFs may be sufficient to control a wide variety of expression patterns in yeast under different stresses. Two implications can be inferred from this observation. First, the adaptation mechanisms to different stresses may have a bow-tie structure. Second, there may exist extensive regulatory cross-talk among different stress responses. In conclusion, this study proposes a network of the regulators of stress responses and their mechanism of action. PMID:20066130
Cloning and sequencing of Staphylococcus aureus murC, a gene essential for cell wall biosynthesis.

PubMed

Lowe, A M; Deresiewicz, R L

1999-01-01

Staphylococcus aureus is a major human pathogen that is increasingly resistant to clinically useful antimicrobial agents. While screening for S. aureus genes expressed during mammalian infection, we isolated murC. This gene encodes UDP-N-acetylmuramoyl-L-alanine synthetase, an enzyme essential for cell wall biosynthesis in a number of bacteria. S. aureus MurC has a predicted mass of 49,182 Da and complements the temperature-sensitive murC mutation of E. coli ST222. Sequence data on the DNA flanking staphylococcal murC suggest that the local gene organization there parallels that found in B. subtilis, but differs from that found in gram-negative bacterial pathogens. MurC proteins represent promising targets for broad-spectrum antimicrobial drug development.
GeneCOST: a novel scoring-based prioritization framework for identifying disease causing genes.

PubMed

Ozer, Bugra; Sağıroğlu, Mahmut; Demirci, Hüseyin

2015-11-15

Due to the big data produced by next-generation sequencing studies, there is an evident need for methods to extract the valuable information gathered from these experiments. In this work, we propose GeneCOST, a novel scoring-based method to evaluate every gene for its disease association. Without any prior filtering or prior knowledge, we assign a disease likelihood score to each gene according to its variations. Then, we rank all genes based on frequency, conservation, pedigree and detailed variation information to find the cause of the disease state. We demonstrate the usage of GeneCOST with public and real-life Mendelian disease cases including recessive, dominant, compound heterozygous and sporadic models. As a result, we were able to identify the causative reason behind the disease state in the top rankings of our list, showing that this novel prioritization framework provides a powerful environment for analysis in genetic disease studies as an alternative to filtering-based approaches. GeneCOST software is freely available at www.igbam.bilgem.tubitak.gov.tr/en/softwares/genecost-en/index.html. Contact: buozer@gmail.com. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved.
Pathway-driven gene stability selection of two rheumatoid arthritis GWAS identifies and validates new susceptibility genes in receptor mediated signalling pathways.

PubMed

Eleftherohorinou, Hariklia; Hoggart, Clive J; Wright, Victoria J; Levin, Michael; Coin, Lachlan J M

2011-09-01

Rheumatoid arthritis (RA) is the commonest chronic, systemic, inflammatory disorder, affecting ∼1% of the world population. It has a strong genetic component and a growing number of associated genes have been discovered in genome-wide association studies (GWAS), which nevertheless only account for 23% of the total genetic risk. We aimed to identify additional susceptibility loci through the analysis of GWAS in the context of biological function. We bridge the gap between pathway and gene-oriented analyses of GWAS by introducing a pathway-driven gene stability-selection methodology that identifies potential causal genes in the top-associated disease pathways that may be driving the pathway association signals. We analysed the WTCCC and the NARAC studies of ∼5000 and ∼2000 subjects, respectively. We examined 700 pathways comprising ∼8000 genes. Ranking pathways by significance revealed that the NARAC top-ranked ∼6% lay within the top 10% of WTCCC. Gene selection on those pathways identified 58 genes in WTCCC and 61 in NARAC; 21 of those were common (P(overlap) < 10⁻²¹), of which 16 were novel discoveries. Among the identified genes, we validated 10 known RA associations in WTCCC and 13 in NARAC, not discovered using single-SNP approaches on the same data. Gene ontology functional enrichment analysis on the identified genes showed significant over-representation of signalling activity (P < 10⁻²⁹) in both studies. Our findings suggest a novel model of RA genetic predisposition, which involves cell-membrane receptors and genes in second messenger signalling systems, in addition to genes that regulate immune responses, which have been the focus of interest previously.
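The overlap significance quoted above can be illustrated with a hypergeometric tail probability. The following is a minimal sketch, not the authors' procedure: the abstract does not state which test was used, and the gene universe of 8000 is only the approximate figure given in the text. The function name `overlap_p_value` and its parameters are hypothetical.

```python
from fractions import Fraction
from math import comb


def overlap_p_value(universe, set_a, set_b, shared):
    """Upper-tail hypergeometric probability of at least `shared` genes in
    common when two gene sets are drawn independently at random."""
    total = comb(universe, set_b)
    tail = sum(
        comb(set_a, k) * comb(universe - set_a, set_b - k)
        for k in range(shared, min(set_a, set_b) + 1)
    )
    # Exact integer arithmetic first, converted to float only at the end.
    return float(Fraction(tail, total))


# Counts from the abstract; 8000 is an assumed, approximate gene universe.
p_overlap = overlap_p_value(universe=8000, set_a=58, set_b=61, shared=21)
print(f"P(overlap >= 21) = {p_overlap:.2e}")
```

Under these assumptions the expected chance overlap is well below one gene, so an overlap of 21 is extremely unlikely; the exact value depends on the universe size chosen.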
Use of RNA-seq to identify cardiac genes and gene pathways differentially expressed between dogs with and without dilated cardiomyopathy

PubMed Central

Friedenberg, Steven G.; Chdid, Lhoucine; Keene, Bruce; Sherry, Barbara; Motsinger-Reif, Alison; Meurs, Kathryn M.

2017-01-01

OBJECTIVE To identify cardiac tissue genes and gene pathways differentially expressed between dogs with and without dilated cardiomyopathy (DCM). ANIMALS 8 dogs with and 5 dogs without DCM. PROCEDURES Following euthanasia, samples of left ventricular myocardium were collected from each dog. Total RNA was extracted from tissue samples, and RNA sequencing was performed on each sample. Samples from dogs with and without DCM were grouped to identify genes that were differentially regulated between the 2 populations. Overrepresentation analysis was performed on upregulated and downregulated gene sets to identify altered molecular pathways in dogs with DCM. RESULTS Genes involved in cellular energy metabolism, especially metabolism of carbohydrates and fats, were significantly downregulated in dogs with DCM. Expression of cardiac structural proteins was also altered in affected dogs. CONCLUSIONS AND CLINICAL RELEVANCE Results suggested that RNA sequencing may provide important insights into the pathogenesis of DCM in dogs and highlight pathways that should be explored to identify causative mutations and develop novel therapeutic interventions. PMID:27347821
Use of RNA-seq to identify cardiac genes and gene pathways differentially expressed between dogs with and without dilated cardiomyopathy.

PubMed

Friedenberg, Steven G; Chdid, Lhoucine; Keene, Bruce; Sherry, Barbara; Motsinger-Reif, Alison; Meurs, Kathryn M

2016-07-01

OBJECTIVE To identify cardiac tissue genes and gene pathways differentially expressed between dogs with and without dilated cardiomyopathy (DCM). ANIMALS 8 dogs with and 5 dogs without DCM. PROCEDURES Following euthanasia, samples of left ventricular myocardium were collected from each dog. Total RNA was extracted from tissue samples, and RNA sequencing was performed on each sample. Samples from dogs with and without DCM were grouped to identify genes that were differentially regulated between the 2 populations. Overrepresentation analysis was performed on upregulated and downregulated gene sets to identify altered molecular pathways in dogs with DCM. RESULTS Genes involved in cellular energy metabolism, especially metabolism of carbohydrates and fats, were significantly downregulated in dogs with DCM. Expression of cardiac structural proteins was also altered in affected dogs. CONCLUSIONS AND CLINICAL RELEVANCE Results suggested that RNA sequencing may provide important insights into the pathogenesis of DCM in dogs and highlight pathways that should be explored to identify causative mutations and develop novel therapeutic interventions.
Genetic predictors of antipsychotic response to lurasidone identified in a genome wide association study and by schizophrenia risk genes.

PubMed

Li, Jiang; Yoshikawa, Akane; Brennan, Mark D; Ramsey, Timothy L; Meltzer, Herbert Y

2018-02-01

Biomarkers which predict response to atypical antipsychotic drugs (AAPDs) increase their benefit/risk ratio. We sought to identify common variants in genes which predict response to lurasidone, an AAPD, by associating genome-wide association study (GWAS) data and changes (Δ) in Positive And Negative Syndrome Scale (PANSS) scores from two 6-week randomized, placebo-controlled trials of lurasidone in schizophrenia (SCZ) patients. We also included SCZ risk SNPs identified by the Psychiatric Genomics Consortium using a polygenic risk analysis. The top genomic loci, with uncorrected p < 10⁻⁴, include: 1) synaptic adhesion (PTPRD, LRRC4C, NRXN1, IL1RAPL1, SLITRK1) and scaffolding (MAGI1, MAGI2, NBEA) genes, both essential for synaptic function; 2) other synaptic plasticity-related genes (NRG1/3 and KALRN); 3) the neuron-specific RNA splicing regulator, RBFOX1; and 4) ion channel genes (e.g. KCNA10, KCNAB1, KCNK9 and CACNA2D3). Some genes predicted response for patients with both European and African ancestries. We replicated some SNPs reported to predict response to other AAPDs in other GWAS. Although none of the biomarkers reached genome-wide significance, many of the genes and associated pathways have previously been linked to SCZ. Two polygenic modeling approaches, GCTA-GREML and PLINK-Polygenic Risk Score, demonstrated that some risk genes related to neurodevelopment, synaptic biology, immune response, and histones also contributed to prediction of response. The top hits predicting response to lurasidone did not predict improvement with placebo. This is the first evidence from clinical trials that SCZ risk SNPs are related to clinical response to an AAPD. These results need to be replicated in an independent sample. Copyright © 2017. Published by Elsevier B.V.
A general method for identifying major hybrid male sterility genes in Drosophila.

PubMed

Zeng, L W; Singh, R S

1995-10-01

The genes responsible for hybrid male sterility in species crosses are usually identified by introgressing chromosome segments, monitored by visible markers, between closely related species by continuous backcrosses. This commonly used method, however, suffers from two problems. First, it relies on the availability of markers to monitor the introgressed regions and so the portion of the genome examined is limited to the marked regions. Second, the introgressed regions are usually large and it is impossible to tell if the effects of the introgressed regions are the result of single (or few) major genes or many minor genes (polygenes). Here we introduce a simple and general method for identifying putative major hybrid male sterility genes which is free of these problems. In this method, the actual hybrid male sterility genes (rather than markers), or tightly linked gene complexes with large effects, are selectively introgressed from one species into the background of another species by repeated backcrosses. This is performed by selectively backcrossing heterozygous (for hybrid male sterility gene or genes) females producing fertile and sterile sons in roughly equal proportions to males of either parental species. As no marker gene is required for this procedure, this method can be used with any species pairs that produce unisexual sterility. With the application of this method, a small X chromosome region of Drosophila mauritiana which produces complete hybrid male sterility (aspermic testes) in the background of D. simulans was identified. Recombination analysis reveals that this region contains a second major hybrid male sterility gene linked to the forked locus located at either 62.7 ± 0.66 map units or at the centromere region of the X chromosome of D. mauritiana.

Epidermal growth factor gene is a newly identified candidate gene for gout.

PubMed

Han, Lin; Cao, Chunwei; Jia, Zhaotong; Liu, Shiguo; Liu, Zhen; Xin, Ruosai; Wang, Can; Li, Xinde; Ren, Wei; Wang, Xuefeng; Li, Changgui

2016-08-10

Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed a two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 region in 480 male gout patients and 480 controls. The SNP rs12504538, located in the elongation of very-long-chain-fatty-acid-like family member 6 gene (Elovl6), was found to be associated with gout susceptibility (Padjusted = 0.00595). In the second stage of analysis, we performed fine mapping analysis of 93 tag SNPs in Elovl6 and in the epidermal growth factor gene (EGF) and its flanking regions in 1017 male gout patients and 1897 healthy male controls. We observed a significant association between the T allele of EGF rs2298999 and gout (odds ratio = 0.77, 95% confidence interval = 0.67-0.88, Padjusted = 6.42 × 10⁻³). These results provide the first evidence for an association between the EGF rs2298999 C/T polymorphism and gout. Our findings should be validated in additional populations.
Identifying the essential components of cultural competence in a Chinese nursing context: A qualitative study.

PubMed

Cai, Duanying; Kunaviktikul, Wipada; Klunklin, Areewan; Sripusanapan, Acharaporn; Avant, Patricia Kay

2017-06-01

This qualitative study using semi-structured interviews was conducted to identify the essential components of cultural competence from the perspective of Chinese nurses. A purposive sample of 20 nurse experts, including senior clinical nurses, nurse administrators, and educators in transcultural nursing, was recruited. Using thematic analysis, four themes: awareness, attitudes, knowledge, and skills, with two subthemes for each, were identified. Notably, culture in China was understood in a broad way. The participants' responses focused upon demographic attributes, individuality, and efforts to facilitate quality care rather than on the cultural differences of ethnicity and race and developing the capacity to change discrimination or health disparities. A greater understanding of cultural competence in the Chinese nursing context, in which a dominant cultural group exists, is essential to facilitate the provision of culturally competent care to diverse populations. © 2016 John Wiley & Sons Australia, Ltd.
Rapid evolution of RNA editing sites in a small non-essential plastid gene

PubMed Central

Fiebig, Andreas; Stegemann, Sandra; Bock, Ralph

2004-01-01

Chloroplast RNA editing proceeds by C-to-U transitions at highly specific sites. Here, we provide a phylogenetic analysis of RNA editing in a small plastid gene, petL, encoding subunit VI of the cytochrome b6f complex. Analyzing representatives from most major groups of seed plants, we find an unexpectedly high frequency and dynamics of RNA editing. High-frequency editing has previously been observed in plastid ndh genes, which are remarkable in that their mutational inactivation does not produce an obvious mutant phenotype. In order to test the idea that reduced functional constraints allow for more flexible evolution of RNA editing sites, we have created petL knockout plants by tobacco chloroplast transformation. We find that, in the higher plant tobacco, targeted inactivation of petL does not impair plant growth under a variety of conditions, in marked contrast to the important role of petL in photosynthesis in the green alga Chlamydomonas reinhardtii. Together with a low number of editing sites in plastid genes that are essential to gene expression and photosynthetic activity, these data suggest that RNA editing sites may evolve more readily in those genes whose transitory loss of function can be tolerated. Accumulated evidence for this ‘relative neutrality hypothesis for the evolution of plastid editing sites’ is discussed. PMID:15240834
Identification of Arabidopsis GPAT9 (At5g60620) as an Essential Gene Involved in Triacylglycerol Biosynthesis.

PubMed

Shockey, Jay; Regmi, Anushobha; Cotton, Kimberly; Adhikari, Neil; Browse, John; Bates, Philip D

2016-01-01

The first step in the biosynthesis of nearly all plant membrane phospholipids and storage triacylglycerols is catalyzed by a glycerol-3-phosphate acyltransferase (GPAT). The requirement for an endoplasmic reticulum (ER)-localized GPAT for both of these critical metabolic pathways was recognized more than 60 years ago. However, identification of the gene(s) encoding this GPAT activity has remained elusive. Here, we present the results of a series of in vivo, in vitro, and in silico experiments in Arabidopsis (Arabidopsis thaliana) designed to assign this essential function to AtGPAT9. This gene has been highly conserved throughout evolution and is largely present as a single copy in most plants, features consistent with essential housekeeping functions. A knockout mutant of AtGPAT9 demonstrates both male and female gametophytic lethality phenotypes, consistent with the role in essential membrane lipid synthesis. Significant expression of developing seed AtGPAT9 is required for wild-type levels of triacylglycerol accumulation, and the transcript level is directly correlated to the level of microsomal GPAT enzymatic activity in seeds. Finally, the AtGPAT9 protein interacts with other enzymes involved in ER glycerolipid biosynthesis, suggesting the possibility of ER-localized lipid biosynthetic complexes. Together, these results suggest that GPAT9 is the ER-localized GPAT enzyme responsible for plant membrane lipid and oil biosynthesis. © 2016 American Society of Plant Biologists. All Rights Reserved.
Whole exome sequencing identifies novel candidate genes that modify chronic obstructive pulmonary disease susceptibility.

PubMed

Bruse, Shannon; Moreau, Michael; Bromberg, Yana; Jang, Jun-Ho; Wang, Nan; Ha, Hongseok; Picchi, Maria; Lin, Yong; Langley, Raymond J; Qualls, Clifford; Klensney-Tait, Julia; Zabner, Joseph; Leng, Shuguang; Mao, Jenny; Belinsky, Steven A; Xing, Jinchuan; Nyunoya, Toru

2016-01-07

Chronic obstructive pulmonary disease (COPD) is characterized by an irreversible airflow limitation in response to inhalation of noxious stimuli, such as cigarette smoke. However, only 15-20% of smokers manifest COPD, suggesting a role for genetic predisposition. Although genome-wide association studies have identified common genetic variants that are associated with susceptibility to COPD, effect sizes of the identified variants are modest, as is the total heritability accounted for by these variants. In this study, an extreme phenotype exome sequencing study was combined with in vitro modeling to identify COPD candidate genes. We performed whole exome sequencing of 62 highly susceptible smokers and 30 exceptionally resistant smokers to identify rare variants that may contribute to disease risk or resistance to COPD. This was a cross-sectional case-control study without therapeutic intervention or longitudinal follow-up information. We identified candidate genes based on rare variant analyses and evaluated exonic variants to pinpoint individual genes whose function was computationally established to be significantly different between susceptible and resistant smokers. Top scoring candidate genes from these analyses were further filtered by requiring that each gene be expressed in human bronchial epithelial cells (HBECs). A total of 81 candidate genes were thus selected for in vitro functional testing in cigarette smoke extract (CSE)-exposed HBECs. Using small interfering RNA (siRNA)-mediated gene silencing experiments, we showed that silencing of several candidate genes augmented CSE-induced cytotoxicity in vitro. Our integrative analysis through both genetic and functional approaches identified two candidate genes (TACC2 and MYO1E) that augment cigarette smoke (CS)-induced cytotoxicity and, potentially, COPD susceptibility.
Genes Important for Schizosaccharomyces pombe Meiosis Identified Through a Functional Genomics Screen

PubMed Central

Blyth, Julie; Makrantoni, Vasso; Barton, Rachael E.; Spanos, Christos; Rappsilber, Juri; Marston, Adele L.

2018-01-01

Meiosis is a specialized cell division that generates gametes, such as eggs and sperm. Errors in meiosis result in miscarriages and are the leading cause of birth defects; however, the molecular origins of these defects remain unknown. Studies in model organisms are beginning to identify the genes and pathways important for meiosis, but the parts list is still poorly defined. Here we present a comprehensive catalog of genes important for meiosis in the fission yeast, Schizosaccharomyces pombe. Our genome-wide functional screen surveyed all nonessential genes for roles in chromosome segregation and spore formation. Novel genes important at distinct stages of the meiotic chromosome segregation and differentiation program were identified. Preliminary characterization implicated three of these genes in centrosome/spindle pole body, centromere, and cohesion function. Our findings represent a near-complete parts list of genes important for meiosis in fission yeast, providing a valuable resource to advance our molecular understanding of meiosis. PMID:29259000
The Ep152R ORF of African swine fever virus strain Georgia encodes for an essential gene that interacts with host protein BAG6.

PubMed

Borca, Manuel V; O'Donnell, Vivian; Holinka, Lauren G; Rai, Devendra K; Sanford, Brenton; Alfano, Marialexia; Carlson, Jolene; Azzinaro, Paul A; Alonso, Covadonga; Gladue, Douglas P

2016-09-02

African swine fever virus (ASFV) is the etiological agent of a contagious and often lethal disease of domestic pigs that has significant economic consequences for the swine industry. The viral genome encodes more than 150 genes, and only a select few of these genes have been studied in some detail. Here we report the characterization of open reading frame Ep152R, which has a predicted complement control module/SCR domain. This domain is found in Vaccinia virus proteins that are involved in blocking the immune response during viral infection. A recombinant ASFV harboring an HA-tagged version of the Ep152R protein was developed (ASFV-G-Ep152R-HA) and used to demonstrate that Ep152R is an early virus protein. Attempts to construct recombinant viruses having a deleted Ep152R gene were consistently unsuccessful, indicating that Ep152R is an essential gene. Interestingly, analysis of host-protein interactions for Ep152R using a yeast two-hybrid screen identified BAG6, a protein previously identified as being required for ASFV replication. Furthermore, fluorescence microscopy analysis confirms that the Ep152R-BAG6 interaction actually occurs in cells infected with ASFV. Published by Elsevier B.V.
A system view and analysis of essential hypertension.

PubMed

Botzer, Alon; Grossman, Ehud; Moult, John; Unger, Ron

2018-05-01

The goal of this study was to investigate genes associated with essential hypertension from a system perspective, making use of bioinformatic tools to gain insights that are not evident when focusing at a detail-based resolution. Using various databases (pathways, Genome Wide Association Studies, knockouts etc.), we compiled a set of about 200 genes that play a major role in hypertension and identified the interactions between them. This enabled us to create a protein-protein interaction network graph, from which we identified key elements, based on graph centrality analysis. Enriched gene regulatory elements (transcription factors and microRNAs) were extracted by motif finding techniques and knowledge-based tools. We found that the network is composed of modules associated with functions such as water retention, endothelial vasoconstriction, sympathetic activity and others. We identified the transcription factor SP1 and the two microRNAs miR27 (a and b) and miR548c-3p that seem to play a major role in regulating the network as they exert their control over several modules and are not restricted to specific functions. We also noticed that genes involved in metabolic diseases (e.g. insulin) are central to the network. We view the blood-pressure regulation mechanism as a system-of-systems, composed of several contributing subsystems and pathways rather than a single module. The system is regulated by distributed elements. Understanding this mode of action can lead to a more precise treatment and drug target discovery. Our analysis suggests that insulin plays a primary role in hypertension, highlighting the tight link between essential hypertension and diseases associated with the metabolic syndrome.
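The idea of picking key elements from an interaction network by graph centrality can be sketched as follows. The abstract does not say which centrality measure was used, so degree centrality is an assumption here, and the toy edge list uses placeholder node names rather than real protein-protein interactions.

```python
from collections import defaultdict


def degree_centrality(edges):
    """Return each node's degree divided by (number of nodes - 1)."""
    neighbors = defaultdict(set)
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)
    scale = len(neighbors) - 1
    return {node: len(adj) / scale for node, adj in neighbors.items()}


# Hypothetical toy network; g1..g5 are placeholders, not genes from the study.
toy_edges = [("g1", "g2"), ("g1", "g3"), ("g1", "g4"), ("g2", "g3"), ("g4", "g5")]
centrality = degree_centrality(toy_edges)
hub = max(centrality, key=centrality.get)
print(hub, centrality[hub])
```

Other measures, such as betweenness or closeness centrality, can rank nodes differently, so the choice of measure affects which elements appear central.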
Lentiviral vector-based insertional mutagenesis identifies genes associated with liver cancer

PubMed Central

Ranzani, Marco; Cesana, Daniela; Bartholomae, Cynthia C.; Sanvito, Francesca; Pala, Mauro; Benedicenti, Fabrizio; Gallina, Pierangela; Sergi, Lucia Sergi; Merella, Stefania; Bulfone, Alessandro; Doglioni, Claudio; von Kalle, Christof; Kim, Yoon Jun; Schmidt, Manfred; Tonon, Giovanni; Naldini, Luigi; Montini, Eugenio

2013-01-01

Transposons and γ-retroviruses have been efficiently used as insertional mutagens in different tissues to identify molecular culprits of cancer. However, these systems are characterized by recurring integrations that accumulate in tumor cells, hampering the identification of early cancer-driving events amongst bystander and progression-related events. We developed an insertional mutagenesis platform based on lentiviral vectors (LVV) by which we could efficiently induce hepatocellular carcinoma (HCC) in 3 different mouse models. By virtue of LVV’s replication-deficient nature and broad genome-wide integration pattern, LVV-based insertional mutagenesis allowed identification of 4 new liver cancer genes from a limited number of integrations. We validated the oncogenic potential of all the identified genes in vivo, with different levels of penetrance. Our newly identified cancer genes are likely to play a role in human disease, since they are upregulated and/or amplified/deleted in human HCCs and can predict clinical outcome of patients. PMID:23314173
Update on genetics of essential tremor.

PubMed

Jiménez-Jiménez, F J; Alonso-Navarro, H; García-Martín, E; Lorenzo-Betancor, O; Pastor, P; Agúndez, J A G

2013-12-01

Despite the research, few advances in understanding the etiopathogenesis of essential tremor (ET) have been made to date. The high frequency of positive family history of ET and the observed high concordance rates in monozygotic compared with dizygotic twins support a major role of genetic factors in the development of ET. In addition, a possible role of environmental factors has been suggested in the etiology of ET (at least in non-familial forms). Although several gene variants in the LINGO1 gene may increase the risk of ET, to date no causative mutated genes have been identified. In this review, we summarize the studies performed on families with tremor, twin studies, linkage studies, case-control association studies, and exome sequencing in familial ET. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature.

PubMed

Ye, Ning; Yin, Hengfu; Liu, Jingjing; Dai, Xiaogang; Yin, Tongming

2015-01-01

The huge amount of gene expression data generated by microarray and next-generation sequencing technologies presents challenges for extracting its biological meaning. When searching for coexpressed genes, the data mining process is largely affected by the selection of algorithms. Thus, it is highly desirable to provide multiple algorithm options in a user-friendly analytical toolkit to explore gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI) toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, namely the mean, the regression, the delegate, and the ensemble models, to identify coexpressed genes, and enables users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. The utility of this analytical toolkit is demonstrated by analyzing two real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows users to choose among multiple algorithms for analyzing gene expression signatures.
Identifying candidate genes for Type 2 Diabetes Mellitus and obesity through gene expression profiling in multiple tissues or cells.

PubMed

Chen, Junhui; Meng, Yuhuan; Zhou, Jinghui; Zhuo, Min; Ling, Fei; Zhang, Yu; Du, Hongli; Wang, Xiaoning

2013-01-01

Type 2 Diabetes Mellitus (T2DM) and obesity have become increasingly prevalent in recent years. Recent studies have focused on identifying causal variations or candidate genes for obesity and T2DM via analysis of expression quantitative trait loci (eQTL) within a single tissue. T2DM and obesity are affected by comprehensive sets of genes in multiple tissues. In the current study, gene expression levels in multiple human tissues from GEO datasets were analyzed, and 21 candidate genes displaying high percentages of differential expression were filtered out. Specifically, DENND1B, LYN, MRPL30, POC1B, PRKCB, RP4-655J12.3, HIBADH, and TMBIM4 were identified from the T2DM-control study, and BCAT1, BMP2K, CSRNP2, MYNN, NCKAP5L, SAP30BP, SLC35B4, SP1, BAP1, GRB14, HSP90AB1, ITGA5, and TOMM5 were identified from the obesity-control study. The majority of these genes are known to be involved in T2DM and obesity. Therefore, analysis of gene expression in various tissues using GEO datasets may be an effective and feasible method to determine novel or causal genes associated with T2DM and obesity.
Applying Multivariate Adaptive Splines to Identify Genes With Expressions Varying After Diagnosis in Microarray Experiments.

PubMed

Duan, Fenghai; Xu, Ye

2017-01-01

The aim was to analyze a microarray experiment to identify the genes with expressions varying after the diagnosis of breast cancer. A total of 44 928 probe sets in an Affymetrix microarray dataset publicly available on Gene Expression Omnibus from 249 patients with breast cancer were analyzed by nonparametric multivariate adaptive splines. Then, the identified genes with turning points were grouped by K-means clustering, and their network relationship was subsequently analyzed by Ingenuity Pathway Analysis. In total, 1640 probe sets (genes) were reliably identified to have turning points along with the age at diagnosis in their expression profiling, of which 927 were expressed at lower levels after the turning points and 713 were expressed at higher levels after the turning points. K-means clustered them into 3 groups with turning points centering at 54, 62.5, and 72, respectively. The pathway analysis showed that the identified genes were actively involved in various cancer-related functions or networks. In this article, we applied the nonparametric multivariate adaptive splines method to a publicly available gene expression dataset and successfully identified genes with expressions varying before and after breast cancer diagnosis.
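The notion of a turning point can be illustrated with the hinge basis function that multivariate adaptive splines are built from. The sketch below is a deliberate simplification, assuming a single hinge and a grid search over candidate knots; a full adaptive splines fit uses paired hinges with forward selection and backward pruning. The data are synthetic, and the names `hinge` and `fit_single_hinge` are hypothetical.

```python
def hinge(x, knot):
    """Hinge basis function max(0, x - knot)."""
    return max(0.0, x - knot)


def fit_single_hinge(xs, ys, knots):
    """Grid-search one knot; for each, fit y = a + b * hinge(x, knot) by least squares."""
    best = None
    n = len(xs)
    mean_y = sum(ys) / n
    for knot in knots:
        hs = [hinge(x, knot) for x in xs]
        mean_h = sum(hs) / n
        var_h = sum((h - mean_h) ** 2 for h in hs)
        if var_h == 0:
            continue  # hinge is constant over the data, so the slope is undefined
        slope = sum((h - mean_h) * (y - mean_y) for h, y in zip(hs, ys)) / var_h
        intercept = mean_y - slope * mean_h
        sse = sum((y - intercept - slope * h) ** 2 for h, y in zip(hs, ys))
        if best is None or sse < best[0]:
            best = (sse, knot, intercept, slope)
    return best


# Synthetic data (not from the study): flat expression until age 60, rising afterwards.
ages = list(range(30, 91))
expression = [5.0 + 0.2 * hinge(age, 60) for age in ages]
sse, turning_point, intercept, slope = fit_single_hinge(ages, expression, range(31, 90))
print(turning_point, round(slope, 3))
```

A positive fitted slope corresponds to higher expression after the turning point and a negative slope to lower expression, which mirrors the two groups of probe sets described in the abstract.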
PAF53 is essential in mammalian cells: CRISPR/Cas9 fails to eliminate PAF53 expression.

PubMed

Rothblum, Lawrence I; Rothblum, Katrina; Chang, Eugenie

2017-05-15

When mammalian cells are nutrient and/or growth factor deprived, exposed to inhibitors of protein synthesis, stressed by heat shock or grown to confluence, rDNA transcription is essentially shut off. Various mechanisms are available to accomplish this downshift in ribosome biogenesis. Muramatsu's laboratory (Hanada et al., 1996) first demonstrated that mammalian PAF53 was essential for specific rDNA transcription and that PAF53 levels were regulated in response to growth factors. While S. cerevisiae A49, the homologue of vertebrate PAF53, is not essential for viability (Liljelund et al., 1992), deletion of yA49 results in colonies that grow at 6% of the wild type rate at 25°C. Experiments described by Wang et al. (2015) identified PAF53 as a gene "essential for optimal proliferation". However, they did not discriminate genes essential for viability. Hence, in order to resolve this question, we designed a series of experiments to determine if PAF53 was essential for cell survival. We set out to delete the gene product from mammalian cells using CRISPR/Cas9 technology. Human 293 cells were transfected with lentiCRISPR v2 carrying genes for various sgRNAs that targeted PAF53. In some experiments, the cells were cotransfected in parallel with plasmids encoding FLAG-tagged mouse PAF53. After treating the transfected cells with puromycin (to select for the lentiCRISPR backbone), cells were cloned and analyzed by western blots for PAF53 expression. Genomic DNA was amplified across the "CRISPRd" exon, cloned and sequenced to identify mutated PAF53 genes. We obtained cell lines in which the endogenous PAF53 gene was "knocked out" only when we rescued with FLAG-PAF53. DNA sequencing demonstrated that in the absence of ectopic PAF53 expression, cells demonstrated unique means of surviving, including recombination or the utilization of alternative reading frames. We never observed a clone in which one PAF53 gene is expressed, unless there was also ectopic expression In the
TGMI: an efficient algorithm for identifying pathway regulators through evaluation of triple-gene mutual interaction

PubMed Central

Gunasekara, Chathura; Zhang, Kui; Deng, Wenping; Brown, Laura

2018-01-01

Despite their important roles, the regulators of most metabolic pathways and biological processes remain elusive, and methods for identifying them are intensively sought. We developed a novel algorithm called triple-gene mutual interaction (TGMI) for identifying these regulators using high-throughput gene expression data. It first calculates the regulatory interactions among triple gene blocks (two pathway genes and one transcription factor (TF)) using conditional mutual information, and then identifies significantly interacting triple genes using a novel mutual interaction measure (MIM), which was substantiated to reflect the strengths of regulatory interactions within each triple gene block. TGMI calculates the MIM for each triple gene block and then examines its statistical significance using the bootstrap. Finally, the frequencies of all TFs present in all significantly interacting triple gene blocks are calculated and ranked. We showed that the TFs with higher frequencies were usually genuine pathway regulators upon evaluating multiple pathways in plants, animals and yeast. Comparison of TGMI with several other algorithms demonstrated its higher accuracy. Therefore, TGMI will be a valuable tool that can help biologists identify regulators of metabolic pathways and biological processes from the rapidly growing high-throughput gene expression data in public repositories. PMID:29579312
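The conditional mutual information mentioned in this abstract can be illustrated with a minimal sketch. It assumes expression values have already been discretized into labels (for example low/high), and it conditions the two pathway genes on the TF; both choices are assumptions made for illustration. It is not the MIM defined by the TGMI authors, and the function and variable names are hypothetical.

```python
from collections import Counter
from math import log2


def conditional_mutual_information(x, y, z):
    """Estimate I(X;Y|Z) in bits from three equal-length sequences of discrete labels."""
    n = len(x)
    if n == 0 or not (n == len(y) == len(z)):
        raise ValueError("x, y and z must be non-empty and of equal length")
    count_xyz = Counter(zip(x, y, z))
    count_xz = Counter(zip(x, z))
    count_yz = Counter(zip(y, z))
    count_z = Counter(z)
    total = 0.0
    for (xi, yi, zi), n_xyz in count_xyz.items():
        # p(x,y,z) * log2[ p(z) p(x,y,z) / (p(x,z) p(y,z)) ], written with raw counts
        ratio = (count_z[zi] * n_xyz) / (count_xz[(xi, zi)] * count_yz[(yi, zi)])
        total += (n_xyz / n) * log2(ratio)
    return total


# Hypothetical discretized expression (0 = low, 1 = high) across four samples.
pathway_gene_1 = [0, 1, 0, 1]
pathway_gene_2 = [0, 1, 0, 1]   # identical to gene 1, so fully informative given the TF
tf = [0, 0, 1, 1]
cmi_dependent = conditional_mutual_information(pathway_gene_1, pathway_gene_2, tf)

# Two genes that vary independently while the TF is constant.
cmi_independent = conditional_mutual_information([0, 0, 1, 1], [0, 1, 0, 1], [0, 0, 0, 0])
```

In practice the estimate depends strongly on how expression is discretized and on sample size, which is one reason a resampling-based significance test such as the bootstrap named in the abstract is needed.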
Identifying essential proteins based on sub-network partition and prioritization by integrating subcellular localization information.

PubMed

Li, Min; Li, Wenkai; Wu, Fang-Xiang; Pan, Yi; Wang, Jianxin

2018-06-14

Essential proteins are important participants in various life activities and play a vital role in the survival and reproduction of living organisms. Identification of essential proteins from protein-protein interaction (PPI) networks has great significance for the study of human complex diseases, the design of drugs and the development of bioinformatics and computational science. Studies have shown that highly connected proteins in a PPI network tend to be essential. A series of computational methods have been proposed to identify essential proteins by analyzing topological structures of PPI networks. However, the high noise in the PPI data can degrade the accuracy of essential protein prediction. Moreover, proteins must be located in the appropriate subcellular compartment to perform their functions, and proteins can interact with each other only when they are located in the same subcellular compartment. In this paper, we propose a new network-based essential protein discovery method based on sub-network partition and prioritization by integrating subcellular localization information, named SPP. The proposed method SPP was tested on two different yeast PPI networks obtained from the DIP database and the BioGRID database. The experimental results show that SPP can effectively reduce the effect of false positives in PPI networks and predict essential proteins more accurately than other existing computational methods (DC, BC, CC, SC, EC, IC, NC). Copyright © 2018 Elsevier Ltd. All rights reserved.
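The two ideas this abstract combines, that highly connected proteins tend to be essential and that interactions are only plausible between co-localized proteins, can be sketched as a degree count restricted to edges whose endpoints share a compartment. This is an illustration only and is not the SPP method itself. The protein names, compartment annotations and function names below are hypothetical, and each interaction is assumed to appear once in the edge list.

```python
from collections import defaultdict


def localized_degree(edges, compartments):
    """Count, per protein, the interaction partners that share at least one compartment with it."""
    degree = defaultdict(int)
    for a, b in edges:
        if a == b:
            continue  # ignore self-interactions
        if compartments.get(a, set()) & compartments.get(b, set()):
            degree[a] += 1
            degree[b] += 1
    return dict(degree)


def rank_proteins(edges, compartments):
    """Rank proteins by co-localized degree, highest first, breaking ties by name."""
    degree = localized_degree(edges, compartments)
    return sorted(degree, key=lambda protein: (-degree[protein], protein))


# Hypothetical toy network.
toy_edges = [("A", "B"), ("A", "C"), ("A", "D")]
toy_compartments = {
    "A": {"nucleus"},
    "B": {"nucleus"},
    "C": {"nucleus", "cytoplasm"},
    "D": {"mitochondrion"},
}
toy_degree = localized_degree(toy_edges, toy_compartments)
toy_ranking = rank_proteins(toy_edges, toy_compartments)
```

The A-D edge is dropped because the two proteins share no compartment, which is the kind of likely false positive that localization filtering is meant to remove.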
Epidermal growth factor gene is a newly identified candidate gene for gout

PubMed Central

Han, Lin; Cao, Chunwei; Jia, Zhaotong; Liu, Shiguo; Liu, Zhen; Xin, Ruosai; Wang, Can; Li, Xinde; Ren, Wei; Wang, Xuefeng; Li, Changgui

2016-01-01

Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed a two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 region in 480 male gout patients and 480 controls. The SNP rs12504538, located in the elongation of very-long-chain-fatty-acid-like family member 6 gene (Elovl6), was found to be associated with gout susceptibility (Padjusted = 0.00595). In the second stage of analysis, we performed fine mapping analysis of 93 tag SNPs in Elovl6 and in the epidermal growth factor gene (EGF) and its flanking regions in 1017 male gout patients and 1897 healthy male controls. We observed a significant association between the T allele of EGF rs2298999 and gout (odds ratio = 0.77, 95% confidence interval = 0.67–0.88, Padjusted = 6.42 × 10⁻³). These results provide the first evidence for an association between the EGF rs2298999 C/T polymorphism and gout. Our findings should be validated in additional populations. PMID:27506295
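For readers unfamiliar with how an odds ratio and its 95% confidence interval are obtained from allele counts, the following sketch uses the standard log-odds (Woolf) approximation. The counts are hypothetical and are not taken from the study. The adjusted values reported above presumably come from a model with covariates, which this unadjusted calculation does not reproduce.

```python
from math import exp, log, sqrt


def odds_ratio_ci(case_allele, case_other, control_allele, control_other, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval computed on the log scale."""
    counts = (case_allele, case_other, control_allele, control_other)
    if min(counts) <= 0:
        raise ValueError("all four counts must be positive")
    odds_ratio = (case_allele * control_other) / (case_other * control_allele)
    standard_error = sqrt(sum(1.0 / count for count in counts))
    lower = exp(log(odds_ratio) - z * standard_error)
    upper = exp(log(odds_ratio) + z * standard_error)
    return odds_ratio, lower, upper


# Hypothetical allele counts: test allele vs other allele, in cases and in controls.
example_or, example_lower, example_upper = odds_ratio_ci(10, 20, 20, 10)
```

An interval that lies entirely below 1, as in the reported result, indicates that the allele is associated with lower odds of disease at the chosen confidence level.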
Identification of an activator protein required for the induction of fruA, a gene essential for fruiting body development in Myxococcus xanthus

PubMed Central

Ueki, Toshiyuki; Inouye, Sumiko

2003-01-01

Myxococcus xanthus exhibits social behavior and multicellular development. FruA is an essential transcription factor for fruiting body development in M. xanthus. In the present study, the upstream promoter region was found to be necessary for the induction of fruA expression during development. A cis-acting element required for the induction was identified and was located between nucleotides –154 and –107 with respect to the transcription initiation site. In addition, it was found that two binding sites exist within this element of the fruA promoter. By using DNA affinity column chromatography containing the cis-acting element, a fruA promoter-binding protein was purified. The purified protein was shown by N-terminal sequence analysis to be identical to MrpC, a protein identified previously by transposon insertion mutagenesis as an essential locus for fruiting body development [Sun, H. & Shi, W. (2001) J. Bacteriol. 183, 4786–4795]. Furthermore, fruA mRNA was not detectable in the mrpC::km strain, demonstrating that MrpC is essential for fruA expression. Moreover, mutational analysis of the binding sites for MrpC in the fruA promoter indicates that binding of MrpC activates transcription of fruA in vivo. This report provides evidence for a direct molecular interaction involved in temporally regulated gene expression in M. xanthus. PMID:12851461
Thioredoxin-2 (TRX-2) is an essential gene regulating mitochondria-dependent apoptosis.

PubMed

Tanaka, Toru; Hosoi, Fumihito; Yamaguchi-Iwai, Yuko; Nakamura, Hajime; Masutani, Hiroshi; Ueda, Shugo; Nishiyama, Akira; Takeda, Shunichi; Wada, Hiromi; Spyrou, Giannis; Yodoi, Junji

2002-04-02

Thioredoxin-2 (Trx-2) is a mitochondria-specific member of the thioredoxin superfamily. Mitochondria have a crucial role in the signal transduction for apoptosis. To investigate the biological significance of Trx-2, we cloned chicken TRX-2 cDNA and generated clones of conditional Trx-2-deficient cells using the chicken B-cell line DT40. Here we show that TRX-2 is an essential gene and that Trx-2-deficient cells undergo apoptosis upon repression of the TRX-2 transgene, showing an accumulation of intracellular reactive oxygen species (ROS). Cytochrome c is released from mitochondria, while caspase-9 and caspase-3, but not caspase-8, are activated upon inhibition of the TRX-2 transgene. In addition, Trx-2 and cytochrome c are co-immunoprecipitated in an in vitro assay. These results suggest that mitochondrial Trx-2 is essential for cell viability, playing a crucial role in scavenging ROS in mitochondria and regulating the mitochondrial apoptosis signaling pathway.
Identifying the genes of unconventional high temperature superconductors.

PubMed

Hu, Jiangping

We elucidate a recently emergent framework for unifying the two families of high temperature (high $T_c$) superconductors, cuprates and iron-based superconductors. The unification suggests that the latter is simply the counterpart of the former to realize robust extended s-wave pairing symmetries in a square lattice. The unification identifies the key ingredient (the gene) of high $T_c$ superconductors as a quasi-two-dimensional electronic environment in which the d-orbitals of cations that participate in strong in-plane couplings to the p-orbitals of anions are isolated near the Fermi energy. With this gene, the superexchange magnetic interactions mediated by anions could maximize their contributions to superconductivity. Creating the gene requires special arrangements between local electronic structures and crystal lattice structures. This special requirement explains why high $T_c$ superconductors are so rare. An explicit prediction is made to realize high $T_c$ superconductivity in Co/Ni-based materials with a quasi-two-dimensional hexagonal lattice structure formed by trigonal bipyramidal complexes.

Combining gene expression and genetic analyses to identify candidate genes involved in cold responses in pea.

PubMed

Legrand, Sylvain; Marque, Gilles; Blassiau, Christelle; Bluteau, Aurélie; Canoy, Anne-Sophie; Fontaine, Véronique; Jaminon, Odile; Bahrman, Nasser; Mautord, Julie; Morin, Julie; Petit, Aurélie; Baranger, Alain; Rivière, Nathalie; Wilmer, Jeroen; Delbreil, Bruno; Lejeune-Hénaut, Isabelle

2013-09-01

Cold stress affects plant growth and development. In order to better understand the responses to cold (chilling or freezing tolerance), we used two contrasting pea lines. Following a chilling period, the Champagne line becomes tolerant to frost whereas the Terese line remains sensitive. Four suppression subtractive hybridisation libraries were obtained using mRNAs isolated from pea genotypes Champagne and Terese. Using quantitative polymerase chain reaction (qPCR) performed on 159 genes, 43 and 54 genes were identified as differentially expressed at the initial time point and during the time course study, respectively. Molecular markers were developed from the differentially expressed genes and were genotyped on a population of 164 RILs derived from a cross between Champagne and Terese. We identified 5 candidate genes colocalizing with 3 different frost damage quantitative trait loci (QTL) intervals and a previously reported region rich in protein quantity loci (PQL). This investigation revealed the role of constitutive differences between the two genotypes in the cold responses, in particular with genes related to the glycine degradation pathway that could confer better frost tolerance on Champagne. We showed that freezing tolerance involves a decrease in the expression of genes related to photosynthesis and the expression of a gene involved in the production of cysteine and methionine, which could act as cryoprotectant molecules. Although it remains to be confirmed, this study could also reveal the involvement of the jasmonate pathway in the cold responses, since we observed that two genes related to this pathway were mapped in a frost damage QTL interval and in a PQL rich region interval, respectively. Copyright © 2013 Elsevier GmbH. All rights reserved.
De novo transcriptome sequencing in Bixa orellana to identify genes involved in methylerythritol phosphate, carotenoid and bixin biosynthesis

DOE PAGES

Cárdenas-Conejo, Yair; Carballo-Uicab, Víctor; Lieberman, Meric; ...

2015-10-28

Bixin or annatto is a commercially important natural orange-red pigment derived from lycopene that is produced and stored in seeds of Bixa orellana L. An enzymatic pathway for bixin biosynthesis was inferred from homology of putative proteins encoded by differentially expressed seed cDNAs. Some activities were later validated in a heterologous system. Nevertheless, much of the pathway remains to be clarified. For example, it is essential to identify the methylerythritol phosphate (MEP) and carotenoid pathway genes. In order to investigate the MEP, carotenoid, and bixin pathway genes, total RNA from young leaves and two different developmental stages of seeds from B. orellana was used for the construction of indexed mRNA libraries, which were sequenced on the Illumina HiSeq 2500 platform and assembled de novo using Velvet, CLC Genomics Workbench and CAP3 software. A total of 52,549 contigs were obtained with an average length of 1,924 bp. Two phylogenetic analyses of inferred proteins, in one case encoded by thirteen general, single-copy cDNAs, in the other from carotenoid and MEP cDNAs, indicated that B. orellana is closely related to sister Malvales species cacao and cotton. Using homology, we identified 7 and 14 core gene products from the MEP and carotenoid pathways, respectively. Surprisingly, previously defined bixin pathway cDNAs were not present in our transcriptome. Here we propose a new set of gene products involved in the bixin pathway. In conclusion, the identification and qRT-PCR quantification of cDNAs involved in annatto production suggest a hypothetical model for bixin biosynthesis that involves coordinated activation of some MEP, carotenoid and bixin pathway genes. These findings provide a better understanding of the mechanisms regulating these pathways and will facilitate the genetic improvement of B. orellana.
A gene (ETM) for essential tremor maps to chromosome 2p22-p25.

PubMed

Higgins, J J; Pho, L T; Nee, L E

1997-11-01

We report the results of linkage analysis in a large American family of Czech descent with dominantly inherited "pure" essential tremor (ET) and genetic anticipation. Genetic loci on chromosome 2p22-p25 establish linkage to this region with a maximum LOD score (Zmax) = 5.92 for the locus, D2S272. Obligate recombinant events place the ETM gene in a 15-cM candidate interval between the genetic loci D2S168 and D2S224. Repeat expansion detection analysis suggests that expanded CAG trinucleotide sequences are associated with ET. These findings will facilitate the search for an ETM gene and may further our understanding of the human motor system.
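As background for the LOD score quoted in this abstract, the sketch below evaluates the textbook two-point LOD for the simplest case: fully informative, phase-known meioses. This is an assumption for illustration. The pedigree analysis in the study would have used dedicated linkage software and a more general likelihood, and the counts here are hypothetical.

```python
from math import log10


def lod_score(recombinants, meioses, theta):
    """log10 of the likelihood at recombination fraction theta relative to theta = 0.5 (no linkage)."""
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must lie between 0 and meioses")
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    likelihood_linked = theta ** recombinants * (1.0 - theta) ** (meioses - recombinants)
    if likelihood_linked == 0.0:
        return float("-inf")  # theta = 0 is impossible once a recombinant has been observed
    likelihood_unlinked = 0.5 ** meioses
    return log10(likelihood_linked / likelihood_unlinked)


# Hypothetical example: 10 informative meioses with no recombinants, evaluated at theta = 0.
lod_no_recombinants = lod_score(0, 10, 0.0)
# At theta = 0.5 the two hypotheses coincide, so the LOD is zero.
lod_at_half = lod_score(5, 10, 0.5)
```

A LOD of 3 or more is the conventional threshold for declaring linkage, so the reported maximum of 5.92 is well above it.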
An Arrayed Genome-Scale Lentiviral-Enabled Short Hairpin RNA Screen Identifies Lethal and Rescuer Gene Candidates

PubMed Central

Bhinder, Bhavneet; Antczak, Christophe; Ramirez, Christina N.; Shum, David; Liu-Sullivan, Nancy; Radu, Constantin; Frattini, Mark G.

2013-01-01

RNA interference technology is becoming an integral tool for target discovery and validation. With the exception of a few studies using arrayed short hairpin RNA (shRNA) libraries, most published reports have used pooled siRNA or shRNA libraries, or arrayed siRNA libraries. For this purpose, we have developed a workflow and performed an arrayed genome-scale shRNA lethality screen against the TRC1 library in HeLa cells. The resulting targets would be a valuable resource of candidates toward a better understanding of cellular homeostasis. Using a high-stringency hit nomination method encompassing criteria of at least three active hairpins per gene and filtered for potential off-target effects (OTEs), referred to as the Bhinder–Djaballah analysis method, we identified 1,252 lethal and 6 rescuer gene candidates, knockdown of which resulted in severe cell death or enhanced growth, respectively. Cross-referencing individual hairpins with the TRC1 validated clone database, 239 of the 1,252 candidates were deemed independently validated with at least three validated clones. Through our systematic OTE analysis, we have identified 31 microRNAs (miRNAs) in lethal and 2 in rescuer genes; all have a seed heptamer mimic in the corresponding shRNA hairpins, which is the likely cause of the OTE observed in our screen, perhaps revealing a previously unknown, plausible essentiality of these miRNAs in cellular viability. Taken together, we report on a methodology for performing large-scale arrayed shRNA screens, a comprehensive analysis method to nominate high-confidence hits, and a performance assessment of the TRC1 library highlighting the intracellular inefficiencies of shRNA processing in general. PMID:23198867
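The nomination rule described in this abstract, at least three active hairpins per gene after filtering for potential off-target effects, can be sketched as a simple grouping step. The record layout, gene names and flags below are hypothetical, and the way the authors actually score activity and flag OTEs is not given in the abstract.

```python
from collections import defaultdict


def nominate_hits(hairpins, min_active=3):
    """Return genes with at least `min_active` distinct active hairpins not flagged as off-target.

    `hairpins` is an iterable of (gene, hairpin_id, is_active, is_off_target) tuples.
    """
    clean_active = defaultdict(set)
    for gene, hairpin_id, is_active, is_off_target in hairpins:
        if is_active and not is_off_target:
            clean_active[gene].add(hairpin_id)
    return sorted(gene for gene, ids in clean_active.items() if len(ids) >= min_active)


# Hypothetical screen results.
screen = [
    ("GENE_A", "sh1", True, False),
    ("GENE_A", "sh2", True, False),
    ("GENE_A", "sh3", True, False),
    ("GENE_B", "sh4", True, False),
    ("GENE_B", "sh5", True, False),
    ("GENE_B", "sh6", True, True),    # active, but flagged as a potential off-target effect
    ("GENE_C", "sh7", True, False),
    ("GENE_C", "sh8", True, False),
    ("GENE_C", "sh9", False, False),  # inactive hairpin
]
hits = nominate_hits(screen)
```

Requiring several independent hairpins per gene trades sensitivity for confidence, since a single active hairpin is more easily explained by an off-target effect.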
Microarray expression profiling identifies genes with altered expression in HDL-deficient mice

DOE Office of Scientific and Technical Information (OSTI.GOV)

Callow, Matthew J.; Dudoit, Sandrine; Gong, Elaine L.

2000-05-05

Based on the assumption that severe alterations in the expression of genes known to be involved in HDL metabolism may affect the expression of other genes, we screened an array of over 5000 mouse expressed sequence tags (ESTs) for altered gene expression in the livers of two lines of mice with dramatic decreases in HDL plasma concentrations. Labeled cDNA from livers of apolipoprotein AI (apo AI) knockout mice, Scavenger Receptor BI (SR-BI) transgenic mice and control mice were co-hybridized to microarrays. Two-sample t-statistics were used to identify genes with altered expression levels in the knockout or transgenic mice compared with the control mice. In the SR-BI group we found 9 array elements representing at least 5 genes to be significantly altered on the basis of an adjusted p value of less than 0.05. In the apo AI knockout group 8 array elements representing 4 genes were altered compared with the control group (p < 0.05). Several of the genes identified in the SR-BI transgenic mice suggest altered sterol metabolism and oxidative processes. These studies illustrate the use of multiple-testing methods for the identification of genes with altered expression in replicated microarray experiments of apo AI knockout and SR-BI transgenic mice.
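The combination of two-sample t-statistics and adjusted p values mentioned in this abstract can be sketched with a permutation-based adjustment. The abstract does not say which multiple-testing procedure was used, so the single-step maxT adjustment below is an assumption chosen for illustration, as are the Welch form of the statistic, the toy data and all names. The sketch enumerates every relabeling exactly, which is only feasible for small sample sizes, and it assumes no group has zero variance in both arms.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, variance


def welch_t(group_a, group_b):
    """Two-sample t-statistic allowing unequal variances (Welch form)."""
    standard_error = sqrt(variance(group_a) / len(group_a) + variance(group_b) / len(group_b))
    return (mean(group_a) - mean(group_b)) / standard_error


def maxt_adjusted_p(rows, n_treated):
    """Single-step maxT adjusted p values; the first `n_treated` columns are the treated samples."""
    n_samples = len(rows[0])

    def abs_t(row, treated_idx):
        treated = [row[i] for i in treated_idx]
        control = [row[i] for i in range(n_samples) if i not in treated_idx]
        return abs(welch_t(treated, control))

    observed = [abs_t(row, tuple(range(n_treated))) for row in rows]
    exceed = [0] * len(rows)
    n_relabelings = 0
    for treated_idx in combinations(range(n_samples), n_treated):
        n_relabelings += 1
        # Largest statistic over all genes under this relabeling of the samples.
        max_t = max(abs_t(row, treated_idx) for row in rows)
        for gene, t_obs in enumerate(observed):
            if max_t >= t_obs - 1e-12:
                exceed[gene] += 1
    return [count / n_relabelings for count in exceed]


# Hypothetical log-expression values: 4 treated samples followed by 4 controls, one row per gene.
expression = [
    [10.1, 10.3, 9.9, 10.2, 1.0, 1.2, 0.9, 1.1],  # strongly altered gene
    [5.0, 3.1, 4.2, 6.0, 4.9, 3.3, 5.8, 4.1],     # no clear difference
]
adjusted = maxt_adjusted_p(expression, n_treated=4)
```

Comparing each observed statistic against the permutation distribution of the maximum over all genes controls the family-wise error rate, which matters when thousands of array elements are tested at once.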
Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent.

PubMed

Allman, Elizabeth S; Degnan, James H; Rhodes, John A

2011-06-01

Gene trees are evolutionary trees representing the ancestry of genes sampled from multiple populations. Species trees represent populations of individuals, each with many genes, splitting into new populations or species. The coalescent process, which models ancestry of gene copies within populations, is often used to model the probability distribution of gene trees given a fixed species tree. This multispecies coalescent model provides a framework for phylogeneticists to infer species trees from gene trees using maximum likelihood or Bayesian approaches. Because the coalescent models a branching process over time, all trees are typically assumed to be rooted in this setting. Often, however, gene trees inferred by traditional phylogenetic methods are unrooted. We investigate probabilities of unrooted gene trees under the multispecies coalescent model. We show that when there are four species with one gene sampled per species, the distribution of unrooted gene tree topologies identifies the unrooted species tree topology and some, but not all, information in the species tree edges (branch lengths). The location of the root on the species tree is not identifiable in this situation. However, for five or more species with one gene sampled per species, we show that the distribution of unrooted gene tree topologies identifies the rooted species tree topology and all its internal branch lengths. The length of any pendant branch leading to a leaf of the species tree is also identifiable for any species from which more than one gene is sampled.
Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines

PubMed Central

Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J

2016-01-01

Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours’ biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina Infinium HumanMethylation450 BeadChip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription–quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes. PMID:29263807
Haplotype Analysis in Multiple Crosses to Identify a QTL Gene

PubMed Central

Wang, Xiaosong; Korstanje, Ron; Higgins, David; Paigen, Beverly

2004-01-01

Identifying quantitative trait locus (QTL) genes is a challenging task. Herein, we report using a two-step process to identify Apoa2 as the gene underlying Hdlq5, a QTL for plasma high-density lipoprotein cholesterol (HDL) levels on mouse chromosome 1. First, we performed a sequence analysis of the Apoa2 coding region in 46 genetically diverse mouse strains and found five different APOA2 protein variants, which we named APOA2a to APOA2e. Second, we conducted a haplotype analysis of the strains in 21 crosses that have so far detected HDL QTLs; we found that Hdlq5 was detected only in the nine crosses where one parent had the APOA2b protein variant characterized by an Ala61-to-Val61 substitution. We then found that strains with the APOA2b variant had significantly higher (P ≤ 0.002) plasma HDL levels than those with either the APOA2a or the APOA2c variant. These findings support Apoa2 as the underlying Hdlq5 gene and suggest the Apoa2 polymorphisms responsible for the Hdlq5 phenotype. Therefore, haplotype analysis in multiple crosses can be used to support a candidate QTL gene. PMID:15310659
Haplotype analysis in multiple crosses to identify a QTL gene.

PubMed

Wang, Xiaosong; Korstanje, Ron; Higgins, David; Paigen, Beverly

2004-09-01

Identifying quantitative trait locus (QTL) genes is a challenging task. Herein, we report using a two-step process to identify Apoa2 as the gene underlying Hdlq5, a QTL for plasma high-density lipoprotein cholesterol (HDL) levels on mouse chromosome 1. First, we performed a sequence analysis of the Apoa2 coding region in 46 genetically diverse mouse strains and found five different APOA2 protein variants, which we named APOA2a to APOA2e. Second, we conducted a haplotype analysis of the strains in 21 crosses that have so far detected HDL QTLs; we found that Hdlq5 was detected only in the nine crosses where one parent had the APOA2b protein variant characterized by an Ala61-to-Val61 substitution. We then found that strains with the APOA2b variant had significantly higher (P ≤ 0.002) plasma HDL levels than those with either the APOA2a or the APOA2c variant. These findings support Apoa2 as the underlying Hdlq5 gene and suggest the Apoa2 polymorphisms responsible for the Hdlq5 phenotype. Therefore, haplotype analysis in multiple crosses can be used to support a candidate QTL gene.
Genes associated with thermosensitive genic male sterility in rice identified by comparative expression profiling.

PubMed

Pan, Yufang; Li, Qiaofeng; Wang, Zhizheng; Wang, Yang; Ma, Rui; Zhu, Lili; He, Guangcun; Chen, Rongzhi

2014-12-16

Thermosensitive genic male sterile (TGMS) lines and photoperiod-sensitive genic male sterile (PGMS) lines have been successfully used in hybridization to improve rice yields. However, the molecular mechanisms underlying male sterility transitions in most PGMS/TGMS rice lines are unclear. In the recently developed TGMS-Co27 line, the male sterility is based on co-suppression of a UDP-glucose pyrophosphorylase gene (Ugp1), but further study is needed to fully elucidate the molecular mechanisms involved. Microarray-based transcriptome profiling of TGMS-Co27 and wild-type Hejiang 19 (H1493) plants grown at high and low temperatures revealed that 15462 probe sets representing 8303 genes were differentially expressed in the two lines, under the two conditions, or both. Environmental factors strongly affected global gene expression. Some genes important for pollen development were strongly repressed in TGMS-Co27 at high temperature. More significantly, series-cluster analysis of differentially expressed genes (DEGs) between TGMS-Co27 plants grown under the two conditions showed that low temperature induced the expression of a gene cluster. This cluster was found to be essential for sterility transition. It includes many meiosis stage-related genes that are probably important for thermosensitive male sterility in TGMS-Co27, inter alia: Arg/Ser-rich domain (RS)-containing zinc finger proteins, polypyrimidine tract-binding proteins (PTBs), DEAD/DEAH box RNA helicases, ZOS (C2H2 zinc finger proteins of Oryza sativa), at least one polyadenylate-binding protein and some other RNA recognition motif (RRM) domain-containing proteins involved in post-transcriptional processes, eukaryotic initiation factor 5B (eIF5B), ribosomal proteins (L37, L1p/L10e, L27 and L24), aminoacyl-tRNA synthetases (ARSs), eukaryotic elongation factor Tu (eEF-Tu) and a peptide chain release factor protein involved in translation. The differential expression of 12 DEGs that are important for pollen
Essentiality, conservation, evolutionary pressure and codon bias in bacterial genomes.

PubMed

Dilucca, Maddalena; Cimini, Giulio; Giansanti, Andrea

2018-07-15

Essential genes constitute the core of genes which cannot be mutated too much nor lost along the evolutionary history of a species. Natural selection is expected to be stricter on essential genes and on conserved (highly shared) genes, than on genes that are either nonessential or peculiar to a single or a few species. In order to further assess this expectation, we study here how essentiality of a gene is connected with its degree of conservation among several unrelated bacterial species, each one characterised by its own codon usage bias. Confirming previous results on E. coli, we show the existence of a universal exponential relation between gene essentiality and conservation in bacteria. Moreover, we show that, within each bacterial genome, there are at least two groups of functionally distinct genes, characterised by different levels of conservation and codon bias: i) a core of essential genes, mainly related to cellular information processing; ii) a set of less conserved nonessential genes with prevalent functions related to metabolism. In particular, the genes in the first group are more retained among species, are subject to a stronger purifying conservative selection and display a more limited repertoire of synonymous codons. The core of essential genes is close to the minimal bacterial genome, which is in the focus of recent studies in synthetic biology, though we confirm that orthologs of genes that are essential in one species are not necessarily essential in other species. We also list a set of highly shared genes which, reasonably, could constitute a reservoir of targets for new anti-microbial drugs. Copyright © 2018 Elsevier B.V. All rights reserved.
A transposon-based genetic screen in mice identifies genes altered in colorectal cancer.

PubMed

Starr, Timothy K; Allaei, Raha; Silverstein, Kevin A T; Staggs, Rodney A; Sarver, Aaron L; Bergemann, Tracy L; Gupta, Mihir; O'Sullivan, M Gerard; Matise, Ilze; Dupuy, Adam J; Collier, Lara S; Powers, Scott; Oberg, Ann L; Asmann, Yan W; Thibodeau, Stephen N; Tessarollo, Lino; Copeland, Neal G; Jenkins, Nancy A; Cormier, Robert T; Largaespada, David A

2009-03-27

Human colorectal cancers (CRCs) display a large number of genetic and epigenetic alterations, some of which are causally involved in tumorigenesis (drivers) and others that have little functional impact (passengers). To help distinguish between these two classes of alterations, we used a transposon-based genetic screen in mice to identify candidate genes for CRC. Mice harboring mutagenic Sleeping Beauty (SB) transposons were crossed with mice expressing SB transposase in gastrointestinal tract epithelium. Most of the offspring developed intestinal lesions, including intraepithelial neoplasia, adenomas, and adenocarcinomas. Analysis of over 16,000 transposon insertions identified 77 candidate CRC genes, 60 of which are mutated and/or dysregulated in human CRC and thus are most likely to drive tumorigenesis. These genes include APC, PTEN, and SMAD4. The screen also identified 17 candidate genes that had not previously been implicated in CRC, including POLI, PTPRK, and RSPO2.
Comparison of gene expression in segregating families identifies genes and genomic regions involved in a novel adaptation, zinc hyperaccumulation.

PubMed

Filatov, Victor; Dowdle, John; Smirnoff, Nicholas; Ford-Lloyd, Brian; Newbury, H John; Macnair, Mark R

2006-09-01

One of the challenges of comparative genomics is to identify specific genetic changes associated with the evolution of a novel adaptation or trait. We need to be able to disassociate the genes involved with a particular character from all the other genetic changes that take place as lineages diverge. Here we show that by comparing the transcriptional profile of segregating families with that of parent species differing in a novel trait, it is possible to narrow down substantially the list of potential target genes. In addition, by assuming synteny with a related model organism for which the complete genome sequence is available, it is possible to use the cosegregation of markers differing in transcription level to identify regions of the genome which probably contain quantitative trait loci (QTLs) for the character. This novel combination of genomics and classical genetics provides a very powerful tool to identify candidate genes. We use this methodology to investigate zinc hyperaccumulation in Arabidopsis halleri, the sister species to the model plant, Arabidopsis thaliana. We compare the transcriptional profile of A. halleri with that of its sister nonaccumulator species, Arabidopsis petraea, and between accumulator and nonaccumulator F(3)s derived from the cross between the two species. We identify eight genes which consistently show greater expression in accumulator phenotypes in both roots and shoots, including two metal transporter genes (NRAMP3 and ZIP6), and cytoplasmic aconitase, a gene involved in iron homeostasis in mammals. We also show that there appear to be two QTLs for zinc accumulation, on chromosomes 3 and 7.
The Autographa californica Multiple Nucleopolyhedrovirus ac83 Gene Contains a cis-Acting Element That Is Essential for Nucleocapsid Assembly.

PubMed

Huang, Zhihong; Pan, Mengjia; Zhu, Silei; Zhang, Hao; Wu, Wenbi; Yuan, Meijin; Yang, Kai

2017-03-01

Baculoviridae is a family of insect-specific viruses that have a circular double-stranded DNA genome packaged within a rod-shaped capsid. The mechanism of baculovirus nucleocapsid assembly remains unclear. Previous studies have shown that deletion of the ac83 gene of Autographa californica multiple nucleopolyhedrovirus (AcMNPV) blocks viral nucleocapsid assembly. Interestingly, the ac83-encoded protein Ac83 is not a component of the nucleocapsid, implying a particular role for ac83 in nucleocapsid assembly that may be independent of its protein product. To examine this possibility, Ac83 synthesis was disrupted by insertion of a chloramphenicol resistance gene into its coding sequence or by deleting its promoter and translation start codon. Both mutants produced progeny viruses normally, indicating that the Ac83 protein is not required for nucleocapsid assembly. Subsequently, complementation assays showed that the production of progeny viruses required the presence of ac83 in the AcMNPV genome instead of its presence in trans. Therefore, we reasoned that ac83 is involved in nucleocapsid assembly via an internal cis-acting element, which we named the nucleocapsid assembly-essential element (NAE). The NAE was identified to lie within nucleotides 1651 to 1850 of ac83 and had 8 conserved A/T-rich regions. Sequences homologous to the NAE were found only in alphabaculoviruses and have a conserved positional relationship with another essential cis-acting element that was recently identified. The identification of the NAE may help to connect the data of viral cis-acting elements and related proteins in the baculovirus nucleocapsid assembly, which is important for elucidating DNA-protein interaction events during this process. IMPORTANCE Virus nucleocapsid assembly usually requires specific cis-acting elements in the viral genome for various processes, such as the selection of the viral genome from the cellular nucleic acids, the cleavage of concatemeric viral genome
Integrating genome-wide association study and expression quantitative trait loci data identifies multiple genes and gene set associated with neuroticism.

PubMed

Fan, Qianrui; Wang, Wenyu; Hao, Jingcan; He, Awen; Wen, Yan; Guo, Xiong; Wu, Cuiyan; Ning, Yujie; Wang, Xi; Wang, Sen; Zhang, Feng

2017-08-01

Neuroticism is a fundamental personality trait with a significant genetic determinant. To identify novel susceptibility genes for neuroticism, we conducted an integrative analysis of genomic and transcriptomic data from genome-wide association studies (GWAS) and an expression quantitative trait locus (eQTL) study. GWAS summary data were derived from published studies of neuroticism, involving a total of 170,906 subjects. An eQTL dataset containing 927,753 eQTLs was obtained from an eQTL meta-analysis of 5311 samples. Integrative analysis of GWAS and eQTL data was conducted with summary data-based Mendelian randomization (SMR) analysis software. To identify neuroticism-associated gene sets, the SMR analysis results were further subjected to gene set enrichment analysis (GSEA). The gene set annotation dataset (containing 13,311 annotated gene sets) of the GSEA Molecular Signatures Database was used. SMR single-gene analysis identified 6 significant genes for neuroticism: MSRA (p value=2.27×10^-10), MGC57346 (p value=6.92×10^-7), BLK (p value=1.01×10^-6), XKR6 (p value=1.11×10^-6), C17ORF69 (p value=1.12×10^-6) and KIAA1267 (p value=4.00×10^-6). Gene set enrichment analysis detected a significant association for the Chr8p23 gene set (false discovery rate=0.033). Our results provide novel clues for studies of the genetic mechanism of neuroticism. Copyright © 2017. Published by Elsevier Inc.
A Multiomics Approach to Identify Genes Associated with Childhood Asthma Risk and Morbidity.

PubMed

Forno, Erick; Wang, Ting; Yan, Qi; Brehm, John; Acosta-Perez, Edna; Colon-Semidey, Angel; Alvarez, Maria; Boutaoui, Nadia; Cloutier, Michelle M; Alcorn, John F; Canino, Glorisa; Chen, Wei; Celedón, Juan C

2017-10-01

Childhood asthma is a complex disease. In this study, we aim to identify genes associated with childhood asthma through a multiomics "vertical" approach that integrates multiple analytical steps using linear and logistic regression models. In a case-control study of childhood asthma in Puerto Ricans (n = 1,127), we used adjusted linear or logistic regression models to evaluate associations between several analytical steps of omics data, including genome-wide (GW) genotype data, GW methylation, GW expression profiling, cytokine levels, asthma-intermediate phenotypes, and asthma status. At each point, only the top genes/single-nucleotide polymorphisms/probes/cytokines were carried forward for subsequent analysis. In step 1, asthma modified the gene expression-protein level association for 1,645 genes; pathway analysis showed an enrichment of these genes in the cytokine signaling system (n = 269 genes). In steps 2-3, expression levels of 40 genes were associated with intermediate phenotypes (asthma onset age, forced expiratory volume in 1 second, exacerbations, eosinophil counts, and skin test reactivity); of those, methylation of seven genes was also associated with asthma. Of these seven candidate genes, IL5RA was also significant in analytical steps 4-8. We then measured plasma IL-5 receptor α levels, which were associated with asthma age of onset and moderate-severe exacerbations. In addition, in silico database analysis showed that several of our identified IL5RA single-nucleotide polymorphisms are associated with transcription factors related to asthma and atopy. This approach integrates several analytical steps and is able to identify biologically relevant asthma-related genes, such as IL5RA. It differs from other methods that rely on complex statistical models with various assumptions.
The pnk/pnl gene (ORF 86) of Autographa californica nucleopolyhedrovirus is a non-essential, immediate early gene.

PubMed

Durantel, D; Croizier, L; Ayres, M D; Croizier, G; Possee, R D; López-Ferber, M

1998-03-01

Autographa californica nucleopolyhedrovirus (AcMNPV) ORF 86, located within the HindIII C fragment, potentially encodes a protein which shares sequence similarity with two T4 bacteriophage gene products, RNA ligase and polynucleotide kinase. This AcMNPV gene has been designated pnk/pnl but has yet to be assigned a function in virus replication. It has been classified as an immediate early virus gene, since the promoter was active in uninfected insect cells and mRNA transcripts were detectable from 4 to 48 h post-infection and in the presence of cycloheximide or aphidicolin in virus-infected cells. The extremities of the transcript have been mapped by primer extension and 3' RACE-PCR to positions -18 from the translational start codon and +15 downstream of the stop codon. The function of pnk/pnl was investigated by producing a recombinant virus (Acdel86lacZ) with the coding region replaced with that of lacZ. This virus replicated normally in Spodoptera frugiperda (Sf 21) cells, indicating that pnk/pnl is not essential for propagation in these cells. Virus protein production in Acdel86lacZ-infected Sf 21 cells also appeared to be unaffected, with normal synthesis of the IE-1, GP64, VP39 and polyhedrin proteins. Shut-down of host protein synthesis was not abolished in recombinant infection. When other baculovirus genomes were examined for the presence of pnk/pnl by restriction enzyme digestion and PCR, a deletion was found in AcMNPV 1.2, Galleria mellonella NPV (GmMNPV) and Bombyx mori NPV (BmNPV), suggesting that in many isolates this gene has either never been acquired or has been lost during genome evolution. This is one of the first baculovirus immediate early genes that appears to be nonessential for virus survival.
A large-scale RNA interference screen identifies genes that regulate autophagy at different stages.

PubMed

Guo, Sujuan; Pridham, Kevin J; Virbasius, Ching-Man; He, Bin; Zhang, Liqing; Varmark, Hanne; Green, Michael R; Sheng, Zhi

2018-02-12

Dysregulated autophagy is central to the pathogenesis and therapeutic development of cancer. However, how autophagy is regulated in cancer is not well understood and genes that modulate cancer autophagy are not fully defined. To gain more insights into autophagy regulation in cancer, we performed a large-scale RNA interference screen in K562 human chronic myeloid leukemia cells using monodansylcadaverine staining, an autophagy-detecting approach equivalent to immunoblotting of the autophagy marker LC3B or fluorescence microscopy of GFP-LC3B. By coupling monodansylcadaverine staining with fluorescence-activated cell sorting, we successfully isolated autophagic K562 cells, in which we identified 336 short hairpin RNAs. After candidate validation using Cyto-ID fluorescence spectrophotometry, LC3B immunoblotting, and quantitative RT-PCR, 82 genes were identified as autophagy-regulating genes. Twenty of these genes have been reported previously and the remaining 62 candidates are novel autophagy mediators. Bioinformatic analyses revealed that most candidate genes were involved in molecular pathways regulating autophagy, rather than directly participating in the autophagy process. Further autophagy flux assays revealed that 57 autophagy-regulating genes suppressed autophagy initiation, whereas 21 candidates promoted autophagy maturation. Our RNA interference screen identified genes that regulate autophagy at different stages, which helps decode autophagy regulation in cancer and offers novel avenues to develop autophagy-related therapies for cancer.

Overexpression screens identify conserved dosage chromosome instability genes in yeast and human cancer

PubMed Central

Duffy, Supipi; Fam, Hok Khim; Wang, Yi Kan; Styles, Erin B.; Kim, Jung-Hyun; Ang, J. Sidney; Singh, Tejomayee; Larionov, Vladimir; Shah, Sohrab P.; Andrews, Brenda; Boerkoel, Cornelius F.; Hieter, Philip

2016-01-01

Somatic copy number amplification and gene overexpression are common features of many cancers. To determine the role of gene overexpression in chromosome instability (CIN), we performed genome-wide screens in the budding yeast for yeast genes that cause CIN when overexpressed, a phenotype we refer to as dosage CIN (dCIN), and identified 245 dCIN genes. This catalog of genes reveals human orthologs known to be recurrently overexpressed and/or amplified in tumors. We show that two genes, TDP1, a tyrosyl-DNA-phosphodiesterase, and TAF12, an RNA polymerase II TATA-box binding factor, cause CIN when overexpressed in human cells. Rhabdomyosarcoma lines with elevated human Tdp1 levels also exhibit CIN that can be partially rescued by siRNA-mediated knockdown of TDP1. Overexpression of dCIN genes represents a genetic vulnerability that could be leveraged for selective killing of cancer cells through targeting of an unlinked synthetic dosage lethal (SDL) partner. Using SDL screens in yeast, we identified a set of genes that, when deleted, specifically kill cells with high levels of Tdp1. One gene was the histone deacetylase RPD3, for which there are known inhibitors. Both HT1080 cells overexpressing hTDP1 and rhabdomyosarcoma cells with elevated levels of hTdp1 were more sensitive to the histone deacetylase inhibitors valproic acid (VPA) and trichostatin A (TSA), recapitulating the SDL interaction in human cells and suggesting VPA and TSA as potential therapeutic agents for tumors with elevated levels of hTdp1. The catalog of dCIN genes presented here provides a candidate list to identify genes that cause CIN when overexpressed in cancer, which can then be leveraged through SDL to selectively target tumors. PMID:27551064
Integrating mean and variance heterogeneities to identify differentially expressed genes.

PubMed

Ouyang, Weiwei; An, Qiang; Zhao, Jinying; Qin, Huaizhen

2016-12-06

In functional genomics studies, tests on mean heterogeneity have been widely employed to identify differentially expressed genes with distinct mean expression levels under different experimental conditions. Variance heterogeneity (i.e., the difference between condition-specific variances) of gene expression levels is simply neglected or calibrated for as an impediment. The mean heterogeneity in the expression level of a gene reflects one aspect of its distribution alteration, and variance heterogeneity induced by condition change may reflect another aspect. Change in condition may alter both the mean and some higher-order characteristics of the distributions of expression levels of susceptible genes. In this report, we put forth a conception of mean-variance differentially expressed (MVDE) genes, whose expression means and variances are sensitive to the change in experimental condition. We mathematically proved the null independence of existing mean heterogeneity tests and variance heterogeneity tests. Based on this independence, we proposed an integrative mean-variance test (IMVT) to combine gene-wise mean heterogeneity and variance heterogeneity induced by condition change. The IMVT outperformed its competitors under comprehensive simulations of normality and Laplace settings. For moderate samples, the IMVT controlled type I error rates well, as did the existing mean heterogeneity tests (i.e., the Welch t test (WT) and the moderated Welch t test (MWT)) and the procedure of separate tests on mean and variance heterogeneities (SMVT), but the likelihood ratio test (LRT) severely inflated type I error rates. In the presence of variance heterogeneity, the IMVT appeared noticeably more powerful than all the valid mean heterogeneity tests. Application to the gene profiles of peripheral circulating B raised solid evidence of informative variance heterogeneity. After adjusting for background data structure, the IMVT replicated previous discoveries and identified novel experiment
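The record above says the IMVT relies on the null independence of the mean and variance heterogeneity tests, but the excerpt does not give the combining rule. As a minimal sketch of how two independent p-values can be merged, the example below uses Fisher's method; this is an assumption chosen for illustration, not necessarily the authors' statistic, and the function name `fisher_combine_two` is hypothetical.

```python
import math


def fisher_combine_two(p_mean: float, p_var: float) -> float:
    """Combine two independent p-values with Fisher's method.

    Assumption: this stands in for the combining step only; the abstract
    does not say which rule the IMVT actually uses.

    Under the null, X = -2 * (ln p_mean + ln p_var) follows a chi-square
    distribution with 4 degrees of freedom, whose survival function has
    the closed form exp(-x / 2) * (1 + x / 2).
    """
    for p in (p_mean, p_var):
        if not 0.0 < p <= 1.0:
            raise ValueError("p-values must lie in (0, 1]")
    x = -2.0 * (math.log(p_mean) + math.log(p_var))
    return math.exp(-x / 2.0) * (1.0 + x / 2.0)


# Hypothetical gene: modest evidence from each test on its own.
combined = fisher_combine_two(0.05, 0.05)
print(f"combined p-value: {combined:.4f}")
```

The closed form only holds when the two tests are independent under the null, which is the property the abstract says was proved for the mean and variance heterogeneity tests.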
G20210A prothrombin gene mutation identified in patients with venous leg ulcers.

PubMed

Jebeleanu, G; Procopciuc, L

2001-01-01

The G20210A variant of the prothrombin gene is the second most frequent mutation identified in patients with deep venous thrombosis, after factor V Leiden. The risk of developing deep venous thrombosis is high in patients identified as heterozygous for the G20210A mutation. To identify this polymorphism in the gene coding for prothrombin, a 345-bp fragment in the 3'-untranslated region of the prothrombin gene was amplified by polymerase chain reaction and digested with the restriction endonuclease HindIII. The amplification and digestion products were analyzed using agarose gel electrophoresis. We investigated 20 patients with venous leg ulcers and found 2 (10%) who were heterozygous for the G20210A mutation. None of the patients in the control group had the G20210A mutation. Our study confirms the presence of the G20210A mutation in the Romanian population. Our study also shows the link between venous leg ulcers and this polymorphism in the prothrombin gene.
A Novel Yeast Genomics Method for Identifying New Breast Cancer Susceptibility Genes

DTIC Science & Technology

2007-05-01

find new candidate genes for breast cancer susceptibility in women and identifying these human genes can further improve monitoring and treatment...breast cancer susceptibility genes in humans that are currently unknown and not deducible from current methodologies. It is a fundamental...template to faithfully repair the broken strand. In human cancer it is loss of HR, rather than NHEJ, that is more important in increasing cancer
Nutrigenomics of essential oils and their potential domestic use for improving health.

PubMed

Cayuela Sánchez, José Antonio; Elamrani, Abdelaziz

2014-11-01

The use of essential oils as industrial food additives is well known, as are their medicinal properties. However, their use in household food seasoning is for now limited. In this work, we review the nutrigenomic actions exerted by their bioactive components, to promote awareness of their ability to modulate gene expression and the potential that this implies. Also considered is how essential oils can be used as flavoring and seasoning after cooking and before consumption, as diet components that can improve human health. Genetic mechanisms involved in the medicinal properties of essential oils for food use are identified from the literature. These genetic mechanisms reveal nutrigenomic actions. Reviews on the medicinal properties of essential oils have been particularly considered. A wide diversity of nutrigenomic effects from essential oils potentially useful for food seasoning is reviewed. General ideas are discussed about essential oils and their properties, such as anti-inflammatory, analgesic, immunomodulatory, anticancer, hepatoprotective, hypolipidemic, anti-diabetic, antioxidant, bone-reparation, anti-depressant and mitigatory for Alzheimer's disease. Essential oils for food use are potentially health-promoting agents and are therefore worth using as flavorings and condiments. Becoming aware of the gene expression-modulating actions of essential oils is important for understanding their potential for use in household dishes as spices to improve health.
Next-generation sequencing to identify candidate genes and develop diagnostic markers for a novel Phytophthora resistance gene, RpsHC18, in soybean.

PubMed

Zhong, Chao; Sun, Suli; Li, Yinping; Duan, Canxing; Zhu, Zhendong

2018-03-01

A novel Phytophthora sojae resistance gene, RpsHC18, was identified and finely mapped on soybean chromosome 3. Two NBS-LRR candidate genes were identified and two diagnostic markers of RpsHC18 were developed. Phytophthora root rot caused by Phytophthora sojae is a destructive disease of soybean. The most effective disease-control strategy is to deploy resistant cultivars carrying Phytophthora-resistant Rps genes. The soybean cultivar Huachun 18 has a broad and distinct resistance spectrum to 12 P. sojae isolates. Quantitative trait loci sequencing (QTL-seq), based on the whole-genome resequencing (WGRS) of two extreme resistant and susceptible phenotype bulks from an F2:3 population, was performed, and one 767-kb genomic region with ΔSNP-index ≥ 0.9 on chromosome 3 was identified as the RpsHC18 candidate region in Huachun 18. The candidate region was reduced to a 146-kb region by fine mapping. Nonsynonymous SNP and haplotype analyses were carried out in the 146-kb region among ten soybean genotypes using WGRS. Four specific nonsynonymous SNPs were identified in two nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes, RpsHC18-NBL1 and RpsHC18-NBL2, which were considered to be the candidate genes. Finally, one specific SNP marker in each candidate gene was successfully developed using a tetra-primer ARMS-PCR assay, and the two markers were verified to be specific for RpsHC18 and to effectively distinguish other known Rps genes. In this study, we applied an integrated genomic-based strategy combining WGRS with traditional genetic mapping to identify RpsHC18 candidate genes and develop diagnostic markers. These results suggest that next-generation sequencing is a precise, rapid and cost-effective way to identify candidate genes and develop diagnostic markers, and it can accelerate Rps gene cloning and marker-assisted selection for breeding of P. sojae-resistant soybean cultivars.
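The ΔSNP-index threshold mentioned in the record above can be illustrated with a short sketch. It assumes the usual QTL-seq convention, in which the SNP-index of a bulk is the fraction of reads carrying the non-reference allele and the ΔSNP-index is the resistant-bulk index minus the susceptible-bulk index. The abstract does not spell out its computation or sign convention, and the read counts, positions, and function names below are hypothetical.

```python
def snp_index(alt_reads: int, total_reads: int) -> float:
    """Fraction of reads at a site that carry the non-reference allele."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return alt_reads / total_reads


def delta_snp_index(r_alt: int, r_total: int, s_alt: int, s_total: int) -> float:
    """SNP-index of the resistant bulk minus that of the susceptible bulk."""
    return snp_index(r_alt, r_total) - snp_index(s_alt, s_total)


def candidate_sites(sites, threshold=0.9):
    """Return positions whose delta SNP-index meets the threshold.

    `sites` maps a position to (r_alt, r_total, s_alt, s_total) read counts.
    The 0.9 default mirrors the cutoff quoted in the abstract.
    """
    return sorted(
        pos for pos, counts in sites.items()
        if delta_snp_index(*counts) >= threshold
    )


# Hypothetical read counts at three sites (not data from the study).
example_sites = {
    1_000: (20, 20, 1, 20),   # delta = 1.00 - 0.05 = 0.95
    2_000: (12, 20, 8, 20),   # delta = 0.60 - 0.40 = 0.20
    3_000: (30, 30, 0, 25),   # delta = 1.00 - 0.00 = 1.00
}
print(candidate_sites(example_sites))
```

In practice the index is usually averaged over sliding windows before thresholding; per-site filtering is used here only to keep the sketch short.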
A 6-gene signature identifies four molecular subgroups of neuroblastoma

PubMed Central

2011-01-01

Background There are currently three postulated genomic subtypes of the childhood tumour neuroblastoma (NB): Type 1, Type 2A, and Type 2B. The most aggressive forms of NB are characterized by amplification of the oncogene MYCN (MNA) and low expression of the favourable marker NTRK1. Recently, mutations or high expression of the familial predisposition gene Anaplastic Lymphoma Kinase (ALK) were associated with unfavourable biology of sporadic NB. Also, various other genes have been linked to NB pathogenesis. Results The present study explores subgroup discrimination by gene expression profiling using three published microarray studies on NB (47 samples). Four distinct clusters were identified by Principal Components Analysis (PCA) in two separate data sets, which could be verified by an unsupervised hierarchical clustering in a third independent data set (101 NB samples) using a set of 74 discriminative genes. The expression signature of six NB-associated genes, ALK, BIRC5, CCND1, MYCN, NTRK1, and PHOX2B, significantly discriminated the four clusters (p < 0.05, one-way ANOVA test). PCA clusters p1, p2, and p3 were found to correspond well to the postulated subtypes 1, 2A, and 2B, respectively. Remarkably, a fourth novel cluster was detected in all three independent data sets. This cluster comprised mainly 11q-deleted MNA-negative tumours with low expression of ALK, BIRC5, and PHOX2B, and was significantly associated with higher tumour stage, poor outcome and poor survival compared to the Type 1-corresponding favourable group (INSS stage 4 and/or dead of disease, p < 0.05, Fisher's exact test). Conclusions Based on expression profiling we have identified four molecular subgroups of neuroblastoma, which can be distinguished by a 6-gene signature. The fourth subgroup has not been described elsewhere, and efforts are currently being made to further investigate this group's specific characteristics. PMID:21492432
Delimitation of essential genes of cassava latent virus DNA 2.

PubMed Central

Etessami, P; Callis, R; Ellwood, S; Stanley, J

1988-01-01

Insertion and deletion mutagenesis of both extended open reading frames (ORFs) of cassava latent virus DNA 2 destroys infectivity. Infectivity is restored by coinoculating constructs that contain single mutations within different ORFs. Although frequent intermolecular recombination produces dominant parental-type virus, mutants can be retained within the virus population, indicating that they are competent for replication and suggesting that rescue can occur by complementation of trans-acting gene products. By cloning specific fragments into DNA 1 coat protein deletion vectors we have delimited the DNA 2 coding regions and provide substantive evidence that both are essential for virus infection. Although a DNA 2 component is unique to whitefly-transmitted geminiviruses, the results demonstrate that neither coding region is involved solely in insect transmission. The requirement for a bipartite genome for whitefly-transmitted geminiviruses is discussed. PMID:3387209
Gene disruptions indicate an essential function for the LmmCRK1 cdc2-related kinase of Leishmania mexicana.

PubMed

Mottram, J C; McCready, B P; Brown, K G; Grant, K M

1996-11-01

The generation of homozygous null mutants for the crk1 Cdc2-Related Kinase of Leishmania mexicana was attempted using targeted gene disruption. Promastigote mutants heterozygous for crk1 were readily isolated with a hyg-targeting fragment, but attempts to create null mutants by second-round transfections with a ble-targeting fragment yielded two classes of mutant, neither of which was null. First, the transfected fragment formed an episome; second, the cloned transfectants were found to contain wild-type crk1 alleles as well as hyg and ble integrations. DNA-content analysis revealed that these mutants were triploid or tetraploid. Plasticity in chromosome number following targeting has been proposed as a means by which Leishmania avoids deletion of essential genes. These data support this theory and implicate crk1 as an essential gene, validating CRK1 as a potential drug target. L. mexicana transfected with a Trypanosoma brucei homologue, tbcrk1, was shown to be viable in an lmmcrk1 null background, thus showing complementation of function between these trypanosomatid genes. The expression of crk1 was further manipulated by engineering a six-histidine tag at the C-terminus of the kinase, allowing purification of the active complex by affinity selection on Ni(2+)-nitrilotriacetic acid (NTA) agarose.
Comparative study of Saccharomyces cerevisiae wine strains to identify potential marker genes correlated to desiccation stress tolerance.

PubMed

Capece, Angela; Votta, Sonia; Guaragnella, Nicoletta; Zambuto, Marianna; Romaniello, Rossana; Romano, Patrizia

2016-05-01

The most widespread starter formulation for winemaking is active dry yeast (ADY). The ADY production process is essentially characterized by air-drying stress, a combination of several stresses, including thermal, hyperosmotic and oxidative stress, and the capacity of the cell to counteract such multiple stresses will determine its survival. The molecular mechanisms underlying the cell stress response to desiccation have been mostly studied in laboratory and commercial yeast strains, but interest is growing in indigenous yeast strains, which represent a valuable alternative source of genetic and molecular biodiversity to be exploited. In this work, a comparative study of different Saccharomyces cerevisiae indigenous wine strains, previously selected for their technological traits, has been carried out to identify potentially relevant genes involved in desiccation stress tolerance. Cell viability was evaluated during desiccation treatment and gene expression was analyzed by real-time PCR before and during the stress. Our data show that the observed differences in individual strain sensitivity to desiccation stress could be associated with specific gene expression over time. In particular, either the basal or the stress-induced mRNA levels of certain genes, such as HSP12, SSA3, TPS1, TPS2, CTT1 and SOD1, are tightly correlated with the strain survival advantage. This study provides a reliable and sensitive method to predict desiccation stress tolerance of indigenous wine yeast strains, which could be a preliminary step toward biotechnological applications. © FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Exome sequencing of a large family identifies potential candidate genes contributing risk to bipolar disorder.

PubMed

Zhang, Tianxiao; Hou, Liping; Chen, David T; McMahon, Francis J; Wang, Jen-Chyong; Rice, John P

2018-03-01

Bipolar disorder is a mental illness with a lifetime prevalence of about 1%. Previous genetic studies have identified multiple chromosomal linkage regions and candidate genes that might be associated with bipolar disorder. The present study aimed to identify potential susceptibility variants for bipolar disorder using 6 related case samples from a four-generation family. A combination of exome sequencing and linkage analysis was performed to identify potential susceptibility variants for bipolar disorder. Our study identified a list of five potential candidate genes for bipolar disorder. Among these five genes, GRID1 (Glutamate Receptor Delta-1 Subunit), which was previously reported to be associated with several psychiatric disorders and brain-related traits, is particularly interesting. Variants with functional significance in this gene were identified from two cousins in our bipolar disorder pedigree. Our findings suggest a potential role for these genes and the related rare variants in the onset and development of bipolar disorder in this one family. Additional research is needed to replicate these findings and evaluate their patho-biological significance. Copyright © 2017 Elsevier B.V. All rights reserved.
Systems approach identifies an organic nitrogen-responsive gene network that is regulated by the master clock control gene CCA1.

PubMed

Gutiérrez, Rodrigo A; Stokes, Trevor L; Thum, Karen; Xu, Xiaodong; Obertello, Mariana; Katari, Manpreet S; Tanurdzic, Milos; Dean, Alexis; Nero, Damion C; McClung, C Robertson; Coruzzi, Gloria M

2008-03-25

Understanding how nutrients affect gene expression will help us to understand the mechanisms controlling plant growth and development as a function of nutrient availability. Nitrate has been shown to serve as a signal for the control of gene expression in Arabidopsis. There is also evidence, on a gene-by-gene basis, that downstream products of nitrogen (N) assimilation such as glutamate (Glu) or glutamine (Gln) might serve as signals of organic N status that in turn regulate gene expression. To identify genome-wide responses to such organic N signals, Arabidopsis seedlings were transiently treated with ammonium nitrate in the presence or absence of MSX, an inhibitor of glutamine synthetase, resulting in a block of Glu/Gln synthesis. Genes that responded to organic N were identified as those whose response to ammonium nitrate treatment was blocked in the presence of MSX. We showed that some genes previously identified to be regulated by nitrate are under the control of an organic N-metabolite. Using an integrated network model of molecular interactions, we uncovered a subnetwork regulated by organic N that included CCA1 and target genes involved in N-assimilation. We validated some of the predicted interactions and showed that regulation of the master clock control gene CCA1 by Glu or a Glu-derived metabolite in turn regulates the expression of key N-assimilatory genes. Phase response curve analysis shows that distinct N-metabolites can advance or delay the CCA1 phase. Regulation of CCA1 by organic N signals may represent a novel input mechanism for N-nutrients to affect plant circadian clock function.
Weighted gene co-expression network analysis of expression data of monozygotic twins identifies specific modules and hub genes related to BMI.

PubMed

Wang, Weijing; Jiang, Wenjie; Hou, Lin; Duan, Haiping; Wu, Yili; Xu, Chunsheng; Tan, Qihua; Li, Shuxia; Zhang, Dongfeng

2017-11-13

The therapeutic management of obesity is challenging; hence, further elucidating the underlying mechanisms of obesity development and identifying new diagnostic biomarkers and therapeutic targets are urgent and necessary. Here, we performed differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) to identify significant genes and specific modules related to BMI based on gene expression profile data of 7 discordant monozygotic twins. In the differential gene expression analysis, 32 differentially expressed genes (DEGs) showed a trend of up-regulation in twins with higher BMI when compared to their siblings. Categories of positive regulation of nitric-oxide synthase biosynthetic process, positive regulation of NF-kappa B import into nucleus, and peroxidase activity were significantly enriched within the GO database, as was the NF-kappa B signaling pathway within the KEGG database. The DEGs NAMPT, TLR9, PTGS2, HBD, and PCSK1N might be associated with obesity. In the WGCNA, among the total of 20 distinct co-expression modules identified, the coral1 module (68 genes) had the strongest positive correlation with BMI (r = 0.56, P = 0.04) and disease status (r = 0.56, P = 0.04). Categories of positive regulation of phospholipase activity, high-density lipoprotein particle clearance, chylomicron remnant clearance, reverse cholesterol transport, intermediate-density lipoprotein particle, chylomicron, low-density lipoprotein particle, very-low-density lipoprotein particle, voltage-gated potassium channel complex, cholesterol transporter activity, and neuropeptide hormone activity were significantly enriched within the GO database for this module. Alcoholism and cell adhesion molecules pathways were also significantly enriched within the KEGG database. Several hub genes, such as GAL, ASB9, NPPB, TBX2, IL17C, APOE, ABCG4, and APOC2, were also identified. The module eigengene of saddlebrown module (212 genes) was also significantly
Analysis of global gene expression profiles to identify differentially expressed genes critical for embryo development in Brassica rapa.

PubMed

Zhang, Yu; Peng, Lifang; Wu, Ya; Shen, Yanyue; Wu, Xiaoming; Wang, Jianbo

2014-11-01

Embryo development represents a crucial developmental period in the life cycle of flowering plants. To gain insights into the genetic programs that control embryo development in Brassica rapa L., RNA sequencing technology was used to perform transcriptome profiling analysis of B. rapa developing embryos. The results generated 42,906,229 sequence reads aligned with 32,941 genes. In total, 27,760, 28,871, 28,384, and 25,653 genes were identified from embryos at globular, heart, early cotyledon, and mature developmental stages, respectively, and analysis between stages revealed a subset of stage-specific genes. We next investigated 9,884 differentially expressed genes with more than fivefold changes in expression and false discovery rate ≤ 0.001 from three adjacent-stage comparisons; 1,514, 3,831, and 6,633 genes were detected between globular and heart stage embryo libraries, heart stage and early cotyledon stage, and early cotyledon and mature stage, respectively. Large numbers of genes related to cellular process, metabolism process, response to stimulus, and biological process were expressed during the early and middle stages of embryo development. Fatty acid biosynthesis, biosynthesis of secondary metabolites, and photosynthesis-related genes were expressed predominantly in embryos at the middle stage. Genes for lipid metabolism and storage proteins were highly expressed in the middle and late stages of embryo development. We also identified 911 transcription factor genes that show differential expression across embryo developmental stages. These results increase our understanding of the complex molecular and cellular events during embryo development in B. rapa and provide a foundation for future studies on other oilseed crops.
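The selection rule quoted in this abstract (more than fivefold change in expression, false discovery rate ≤ 0.001) can be illustrated with a small sketch. It is not the authors' pipeline: the abstract does not say how the false discovery rate was computed, so Benjamini-Hochberg adjustment is assumed here, and the gene names, fold changes and p-values are invented for illustration.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so monotonicity is a running minimum.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def select_degs(records, min_fold=5.0, max_fdr=0.001):
    """Keep genes with more than `min_fold` change (up or down) and FDR <= `max_fdr`.

    `records` is a list of (gene, fold_change, p_value) tuples, where
    fold_change is the ratio between two adjacent developmental stages.
    """
    qvalues = benjamini_hochberg([p for _, _, p in records])
    selected = []
    for (gene, fold, _), q in zip(records, qvalues):
        strong_change = fold > min_fold or fold < 1.0 / min_fold
        if strong_change and q <= max_fdr:
            selected.append(gene)
    return selected


# Hypothetical records, not data from the study.
toy_records = [
    ("g1", 8.0, 0.0001),   # large change, small p-value
    ("g2", 2.0, 0.0002),   # significant but below the fold-change cutoff
    ("g3", 10.0, 0.03),    # large change but weak statistical support
    ("g4", 0.1, 0.5),      # strong decrease, not significant
]
selected = select_degs(toy_records)
```

Only `g1` passes both cutoffs in this toy input, which shows why the fold-change and FDR filters are applied together rather than separately.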
Genome-wide screen in Saccharomyces cerevisiae identifies vacuolar protein sorting, autophagy, biosynthetic, and tRNA methylation genes involved in life span regulation.

PubMed

Fabrizio, Paola; Hoon, Shawn; Shamalnasab, Mehrnaz; Galbani, Abdulaye; Wei, Min; Giaever, Guri; Nislow, Corey; Longo, Valter D

2010-07-15

The study of the chronological life span of Saccharomyces cerevisiae, which measures the survival of populations of non-dividing yeast, has resulted in the identification of homologous genes and pathways that promote aging in organisms ranging from yeast to mammals. Using a competitive genome-wide approach, we performed a screen of a complete set of approximately 4,800 viable deletion mutants to identify genes that either increase or decrease chronological life span. Half of the putative short-/long-lived mutants retested from the primary screen were confirmed, demonstrating the utility of our approach. Deletion of genes involved in vacuolar protein sorting, autophagy, and mitochondrial function shortened life span, confirming that respiration and degradation processes are essential for long-term survival. Among the genes whose deletion significantly extended life span are ACB1, CKA2, and TRM9, implicated in fatty acid transport and biosynthesis, cell signaling, and tRNA methylation, respectively. Deletion of these genes conferred heat-shock resistance, supporting the link between life span extension and cellular protection observed in several model organisms. The high degree of conservation of these novel yeast longevity determinants in other species raises the possibility that their role in senescence might be conserved.
Evolutionary Inference across Eukaryotes Identifies Specific Pressures Favoring Mitochondrial Gene Retention.

PubMed

Johnston, Iain G; Williams, Ben P

2016-02-24

Since their endosymbiotic origin, mitochondria have lost most of their genes. Although many selective mechanisms underlying the evolution of mitochondrial genomes have been proposed, a data-driven exploration of these hypotheses is lacking, and a quantitatively supported consensus remains absent. We developed HyperTraPS, a methodology coupling stochastic modeling with Bayesian inference, to identify the ordering of evolutionary events and suggest their causes. Using 2015 complete mitochondrial genomes, we inferred evolutionary trajectories of mtDNA gene loss across the eukaryotic tree of life. We find that proteins comprising the structural cores of the electron transport chain are preferentially encoded within mitochondrial genomes across eukaryotes. A combination of high GC content and high protein hydrophobicity is required to explain patterns of mtDNA gene retention; a model that accounts for these selective pressures can also predict the success of artificial gene transfer experiments in vivo. This work provides a general method for data-driven inference of the ordering of evolutionary and progressive events, here identifying the distinct features shaping mitochondrial genomes of present-day species. Copyright © 2016 Elsevier Inc. All rights reserved.
Biosynthesis of Essential Polyunsaturated Fatty Acids in Wheat Triggered by Expression of Artificial Gene

PubMed Central

Mihálik, Daniel; Klčová, Lenka; Ondreičková, Katarína; Hudcovicová, Martina; Gubišová, Marcela; Klempová, Tatiana; Čertík, Milan; Pauk, János; Kraic, Ján

2015-01-01

The artificial gene D6D encoding the enzyme Δ6-desaturase was designed and synthesized using the sequence of the same gene from the fungus Thamnidium elegans. The original start codon was replaced by the signal sequence derived from the wheat gene for high-molecular-weight glutenin subunit and the codon usage was completely changed for optimal expression in wheat. The synthesized artificial D6D gene was delivered into plants of the spring wheat line CY-45, and the gene itself, as well as transcribed D6D mRNA, was confirmed in plants of the T0 and T1 generations. The desired product of the wheat genetic modification by the artificial D6D gene was γ-linolenic acid. Its presence was confirmed in mature grains of transgenic wheat plants in the amount of 0.04%–0.32% (v/v) of the total amount of fatty acids. Both newly synthesized γ-linolenic acid and stearidonic acid were also detected in leaves, stems, roots, awns, paleas, rachillas, and immature grains of the T1 generation as well as in immature and mature grains of the T2 generation. Contents of γ-linolenic acid and stearidonic acid varied in the range 0%–1.40% (v/v) and 0%–1.53% (v/v) of the total amount of fatty acids, respectively. This approach has opened the pathway of desaturation of fatty acids and production of essential polyunsaturated fatty acids in wheat. PMID:26694368
Coalitional game theory as a promising approach to identify candidate autism genes.

PubMed

Gupta, Anika; Sun, Min Woo; Paskov, Kelley Marie; Stockham, Nate Tyler; Jung, Jae-Yoon; Wall, Dennis Paul

2018-01-01

Despite mounting evidence for the strong role of genetics in the phenotypic manifestation of Autism Spectrum Disorder (ASD), the specific genes responsible for the variable forms of ASD remain undefined. ASD may be best explained by a combinatorial genetic model with varying epistatic interactions across many small-effect mutations. Coalitional or cooperative game theory is a technique that studies the combined effects of groups of players, known as coalitions, seeking to identify players who tend to improve the performance (the relationship to a specific disease phenotype) of any coalition they join. This method has previously been shown to boost biologically informative signal in gene expression data but to date has not been applied to the search for cooperative mutations among putative ASD genes. We describe our approach to highlight genes relevant to ASD using coalitional game theory on alteration data of 1,965 fully sequenced genomes from 756 multiplex families. Alterations were encoded into binary matrices for ASD (case) and unaffected (control) samples, indicating likely gene-disrupting, inherited mutations in altered genes. To determine individual gene contributions given an ASD phenotype, a "player" metric, referred to as the Shapley value, was calculated for each gene in the case and control cohorts. Sixty-seven genes were found to have significantly elevated player scores and likely represent significant contributors to the genetic coordination underlying ASD. Using network and cross-study analysis, we found that these genes are involved in biological pathways known to be affected in the autism cases and that a subset directly interact with several genes known to have strong associations to autism. These findings suggest that coalitional game theory can be applied to large-scale genomic data to identify hidden yet influential players in complex polygenic disorders such as autism.
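The Shapley value named in this abstract can be illustrated on a toy game. The sketch below is not the authors' method: the characteristic function `coverage` (the number of case samples hit by at least one gene in a coalition), the gene names and the sample sets are all hypothetical, and exact enumeration over orderings is only feasible for a handful of players.

```python
from fractions import Fraction
from itertools import permutations

# Hypothetical toy data: case samples carrying a disrupting variant in each gene.
ALTERED = {
    "geneA": {"s1", "s2"},
    "geneB": {"s2", "s3"},
    "geneC": {"s3"},
}


def coverage(coalition):
    """Characteristic function (an assumption): samples hit by any gene in the coalition."""
    hit = set()
    for gene in coalition:
        hit |= ALTERED[gene]
    return len(hit)


def shapley_values(players, value):
    """Exact Shapley values: average marginal contribution over all player orderings."""
    totals = {p: Fraction(0) for p in players}
    orderings = list(permutations(players))
    for order in orderings:
        joined = []
        for player in order:
            before = value(joined)
            joined.append(player)
            totals[player] += value(joined) - before
    return {p: total / len(orderings) for p, total in totals.items()}


scores = shapley_values(list(ALTERED), coverage)
for gene, score in sorted(scores.items()):
    print(gene, score)
```

The scores sum to the value of the full coalition, which is the efficiency property of the Shapley value. A gene that mostly duplicates what other genes already cover (here `geneC`) receives a lower score than one that contributes samples no other gene reaches.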
LmaPA2G4, a Homolog of Human Ebp1, Is an Essential Gene and Inhibits Cell Proliferation in L. major

PubMed Central

Joyce, Michelle V.; Morales, Miguel A.

2014-01-01

We have identified LmaPA2G4, a homolog of the human proliferation-associated 2G4 protein (also termed Ebp1), in a phosphoproteomic screening. Multiple sequence alignment and cluster analysis revealed that LmaPA2G4 is a non-peptidase member of the M24 family of metallopeptidases. This pseudoenzyme is structurally related to methionine aminopeptidases. A null mutant system based on negative selection allowed us to demonstrate that LmaPA2G4 is an essential gene in Leishmania major. Over-expression of LmaPA2G4 did not alter cell morphology or the ability to differentiate into metacyclic and amastigote stages. Interestingly, the over-expression affected cell proliferation and virulence in mouse footpad analysis. LmaPA2G4 binds a synthetic double-stranded RNA polyriboinosinic polyribocytidylic acid [poly(I:C)] as shown in an electrophoretic mobility shift assay (EMSA). Quantitative proteomics revealed that the over-expression of LmaPA2G4 led to accumulation of factors involved in translation initiation and elongation. Significantly, we found a strong reduction of de novo protein biosynthesis in transgenic parasites using a non-radioactive metabolic labeling assay. In conclusion, LmaPA2G4 is an essential gene and is potentially implicated in fundamental biological mechanisms, such as translation, making it an attractive target for therapeutic intervention. PMID:24421916
The FUN of identifying gene function in bacterial pathogens; insights from Salmonella functional genomics.

PubMed

Hammarlöf, Disa L; Canals, Rocío; Hinton, Jay C D

2013-10-01

The availability of thousands of genome sequences of bacterial pathogens poses a particular challenge because each genome contains hundreds of genes of unknown function (FUN). How can we easily discover which FUN genes encode important virulence factors? One solution is to combine two different functional genomic approaches. First, transcriptomics identifies bacterial FUN genes that show differential expression during the process of mammalian infection. Second, global mutagenesis identifies individual FUN genes that the pathogen requires to cause disease. The intersection of these datasets can reveal a small set of candidate genes most likely to encode novel virulence attributes. We demonstrate this approach with the Salmonella infection model, and propose that a similar strategy could be used for other bacterial pathogens. Copyright © 2013 Elsevier Ltd. All rights reserved.
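The intersection strategy described in this abstract amounts to a set operation on two gene lists. The sketch below is only an illustration of that idea; the identifiers and both input sets are hypothetical and do not come from the Salmonella datasets.

```python
# Hypothetical gene identifiers; not taken from the study.
differentially_expressed = {"FUN_0012", "FUN_0147", "FUN_0303", "FUN_0450"}
required_for_infection = {"FUN_0147", "FUN_0450", "FUN_0999"}


def candidate_virulence_genes(expression_hits, mutagenesis_hits):
    """Return genes supported by both datasets, sorted for stable output."""
    return sorted(expression_hits & mutagenesis_hits)


candidates = candidate_virulence_genes(differentially_expressed, required_for_infection)
print(candidates)
```

Requiring support from both approaches shrinks the candidate list, at the cost of discarding genes that only one method can detect.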

Transposon mutagenesis identifies genes that cooperate with mutant Pten in breast cancer progression

PubMed Central

Rangel, Roberto; Lee, Song-Choon; Hon-Kim Ban, Kenneth; Guzman-Rojas, Liliana; Mann, Michael B.; Newberg, Justin Y.; McNoe, Leslie A.; Selvanesan, Luxmanan; Ward, Jerrold M.; Rust, Alistair G.; Chin, Kuan-Yew; Black, Michael A.; Jenkins, Nancy A.; Copeland, Neal G.

2016-01-01

Triple-negative breast cancer (TNBC) has the worst prognosis of any breast cancer subtype. To better understand the genetic forces driving TNBC, we performed a transposon mutagenesis screen in phosphatase and tensin homolog (Pten) mutant mice and identified 12 candidate trunk drivers and a much larger number of progression genes. Validation studies identified eight TNBC tumor suppressor genes, including the GATA-like transcriptional repressor TRPS1. Down-regulation of TRPS1 in TNBC cells promoted epithelial-to-mesenchymal transition (EMT) by deregulating multiple EMT pathway genes, in addition to increasing the expression of SERPINE1 and SERPINB2 and the subsequent migration, invasion, and metastasis of tumor cells. Transposon mutagenesis has thus provided a better understanding of the genetic forces driving TNBC and discovered genes with potential clinical importance in TNBC. PMID:27849608
Identifying Novel Transcriptional and Epigenetic Features of Nuclear Lamina-associated Genes.

PubMed

Wu, Feinan; Yao, Jie

2017-03-07

Because a large portion of the mammalian genome is associated with the nuclear lamina (NL), it is interesting to study how native genes residing there are transcribed and regulated. In this study, we report unique transcriptional and epigenetic features of nearly 3,500 NL-associated genes (NL genes). Promoter regions of active NL genes are often excluded from NL-association, suggesting that NL-promoter interactions may repress transcription. Active NL genes with higher RNA polymerase II (Pol II) recruitment levels tend to display Pol II promoter-proximal pausing, while Pol II recruitment and Pol II pausing are not correlated among non-NL genes. At the genome-wide scale, NL-association and H3K27me3 distinguish two large gene classes with low transcriptional activities. Notably, NL-association is anti-correlated with both transcription and active histone mark levels among genes not significantly enriched with H3K9me3 or H3K27me3, suggesting that NL-association may represent a novel gene repression pathway. Interestingly, an NL gene subgroup is not significantly enriched with H3K9me3 or H3K27me3 and is transcribed at higher levels than the rest of the NL genes. Furthermore, we identified distal enhancers associated with active NL genes and reported their epigenetic features.
Identification of Arabidopsis GPAT9 (At5g60620) as an Essential Gene Involved in Triacylglycerol Biosynthesis

PubMed Central

Browse, John

2016-01-01

The first step in the biosynthesis of nearly all plant membrane phospholipids and storage triacylglycerols is catalyzed by a glycerol-3-phosphate acyltransferase (GPAT). The requirement for an endoplasmic reticulum (ER)-localized GPAT for both of these critical metabolic pathways was recognized more than 60 years ago. However, identification of the gene(s) encoding this GPAT activity has remained elusive. Here, we present the results of a series of in vivo, in vitro, and in silico experiments in Arabidopsis (Arabidopsis thaliana) designed to assign this essential function to AtGPAT9. This gene has been highly conserved throughout evolution and is largely present as a single copy in most plants, features consistent with essential housekeeping functions. A knockout mutant of AtGPAT9 demonstrates both male and female gametophytic lethality phenotypes, consistent with the role in essential membrane lipid synthesis. Significant expression of developing seed AtGPAT9 is required for wild-type levels of triacylglycerol accumulation, and the transcript level is directly correlated to the level of microsomal GPAT enzymatic activity in seeds. Finally, the AtGPAT9 protein interacts with other enzymes involved in ER glycerolipid biosynthesis, suggesting the possibility of ER-localized lipid biosynthetic complexes. Together, these results suggest that GPAT9 is the ER-localized GPAT enzyme responsible for plant membrane lipid and oil biosynthesis. PMID:26586834
Using reporter gene assays to identify cis regulatory differences between humans and chimpanzees.

PubMed

Chabot, Adrien; Shrit, Ralla A; Blekhman, Ran; Gilad, Yoav

2007-08-01

Most phenotypic differences between human and chimpanzee are likely to result from differences in gene regulation, rather than changes to protein-coding regions. To date, however, only a handful of human-chimpanzee nucleotide differences leading to changes in gene regulation have been identified. To home in on differences in regulatory elements between human and chimpanzee, we focused on 10 genes that were previously found to be differentially expressed between the two species. We then designed reporter gene assays for the putative human and chimpanzee promoters of the 10 genes. Of seven promoters that we found to be active in human liver cell lines, human and chimpanzee promoters had significantly different activity in four cases, three of which recapitulated the gene expression difference seen in the microarray experiment. For these three genes, we were therefore able to demonstrate that a change in cis influences expression differences between humans and chimpanzees. Moreover, using site-directed mutagenesis on one construct, the promoter for the DDA3 gene, we were able to identify three nucleotides that together lead to a cis regulatory difference between the species. High-throughput application of this approach can provide a map of regulatory element differences between humans and our close evolutionary relatives.
Genome-Wide Association Study Identifies Candidate Genes for Starch Content Regulation in Maize Kernels

PubMed Central

Liu, Na; Xue, Yadong; Guo, Zhanyong; Li, Weihua; Tang, Jihua

2016-01-01

Kernel starch content is an important trait in maize (Zea mays L.) as it accounts for 65–75% of the dry kernel weight and positively correlates with seed yield. A number of starch synthesis-related genes have been identified in maize in recent years. However, many loci underlying variation in starch content among maize inbred lines still remain to be identified. The current study is a genome-wide association study that used a set of 263 maize inbred lines. In this panel, the average kernel starch content was 66.99%, ranging from 60.60 to 71.58% over the three study years. These inbred lines were genotyped with the SNP50 BeadChip maize array, which is comprised of 56,110 evenly spaced, random SNPs. Population structure was controlled by a mixed linear model (MLM) as implemented in the software package TASSEL. After the statistical analyses, four SNPs were identified as significantly associated with starch content (P ≤ 0.0001), among which one each are located on chromosomes 1 and 5 and two are on chromosome 2. Furthermore, 77 candidate genes associated with starch synthesis were found within the 100-kb intervals containing these four QTLs, and four highly associated genes were within 20-kb intervals of the associated SNPs. Among the four genes, Glucose-1-phosphate adenylyltransferase (APS1; Gene ID GRMZM2G163437) is known as an important regulator of kernel starch content. The identified SNPs, QTLs, and candidate genes may not only be readily used for germplasm improvement by marker-assisted selection in breeding, but can also elucidate the genetic basis of starch content. Further studies on these identified candidate genes may help determine the molecular mechanisms regulating kernel starch content in maize and other important cereal crops. PMID:27512395
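The candidate-gene step in this abstract (collecting genes that fall within 100-kb or 20-kb intervals around associated SNPs) can be sketched as an interval-overlap query. This is an illustration under stated assumptions, not the authors' procedure: the window is assumed to be centered on the SNP, and all gene names and coordinates are invented.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gene:
    name: str
    chrom: str
    start: int  # 1-based start coordinate (bp)
    end: int    # inclusive end coordinate (bp)


def genes_near_snp(genes, chrom, pos, window_bp):
    """Return names of genes overlapping a window of `window_bp` centered on the SNP.

    Centering the window on the SNP is an assumption; the abstract only
    states the interval sizes.
    """
    half = window_bp // 2
    lo, hi = pos - half, pos + half
    return [g.name for g in genes
            if g.chrom == chrom and g.start <= hi and g.end >= lo]


# Hypothetical annotation, not maize gene models.
annotation = [
    Gene("g1", "1", 1_000, 5_000),
    Gene("g2", "1", 55_000, 70_000),
    Gene("g3", "2", 1_000, 2_000),
    Gene("g4", "1", 140_000, 150_000),
]

wide = genes_near_snp(annotation, "1", 50_000, 100_000)   # 100-kb interval
narrow = genes_near_snp(annotation, "1", 50_000, 20_000)  # 20-kb interval
print(wide, narrow)
```

Narrowing the window from 100 kb to 20 kb drops `g1` and keeps only `g2`, mirroring how the study moves from 77 candidates to four highly associated genes.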
MicroRNA Regulation of Human Protease Genes Essential for Influenza Virus Replication

PubMed Central

Meliopoulos, Victoria A.; Andersen, Lauren E.; Brooks, Paula; Yan, Xiuzhen; Bakre, Abhijeet; Coleman, J. Keegan; Tompkins, S. Mark; Tripp, Ralph A.

2012-01-01

Influenza A virus causes seasonal epidemics and periodic pandemics threatening the health of millions of people each year. Vaccination is an effective strategy for reducing morbidity and mortality, and in the absence of drug resistance, the efficacy of chemoprophylaxis is comparable to that of vaccines. However, the rapid emergence of drug resistance has emphasized the need for new drug targets. Knowledge of the host cell components required for influenza replication has been an area targeted for disease intervention. In this study, the human protease genes required for influenza virus replication were determined and validated using RNA interference approaches. The genes validated as critical for influenza virus replication were ADAMTS7, CPE, DPP3, MST1, and PRSS12, and pathway analysis showed these genes were in global host cell pathways governing inflammation (NF-κB), cAMP/calcium signaling (CRE/CREB), and apoptosis. Analyses of host microRNAs predicted to govern expression of these genes showed that eight miRNAs regulated gene expression during virus replication. These findings identify unique host genes and microRNAs important for influenza replication providing potential new targets for disease intervention strategies. PMID:22606348
GENE EXPRESSION PROFILING TO IDENTIFY MECHANISMS OF MALE REPRODUCTIVE TOXICITY

EPA Science Inventory

Gene Expression Profiling to Identify Mechanisms of Male Reproductive Toxicity
David J. Dix
National Health and Environmental Effects Research Laboratory, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC, 27711, USA.
Ab...
FLASH is essential during early embryogenesis and cooperates with p73 to regulate histone gene transcription.

PubMed

De Cola, A; Bongiorno-Borbone, L; Bianchi, E; Barcaroli, D; Carletti, E; Knight, R A; Di Ilio, C; Melino, G; Sette, C; De Laurenzi, V

2012-02-02

Replication-dependent histone gene expression is a fundamental process occurring in S-phase under the control of the cyclin-E/CDK2 complex. This process is regulated by a number of proteins, including Flice-Associated Huge Protein (FLASH) (CASP8AP2), concentrated in specific nuclear organelles known as HLBs. FLASH regulates both histone gene transcription and mRNA maturation, and its downregulation in vitro results in the depletion of the histone pool and cell-cycle arrest in S-phase. Here we show that the transcription factor p73 binds to FLASH and is part of the complex that regulates histone gene transcription. Moreover, we created a novel gene trap to disrupt FLASH in mice, and we show that homozygous deletion of FLASH results in early embryonic lethality, owing to arrest of FLASH(-/-) embryos at the morula stage. These results indicate that FLASH is an essential, non-redundant regulator of histone transcription and cell cycle during embryogenesis.
Intrinsic biocontainment: Multiplex genome safeguards combine transcriptional and recombinational control of essential yeast genes

PubMed Central

Cai, Yizhi; Agmon, Neta; Choi, Woo Jin; Ubide, Alba; Stracquadanio, Giovanni; Caravelli, Katrina; Hao, Haiping; Bader, Joel S.; Boeke, Jef D.

2015-01-01

Biocontainment may be required in a wide variety of situations such as work with pathogens, field release applications of engineered organisms, and protection of intellectual properties. Here, we describe the control of growth of the brewer’s yeast, Saccharomyces cerevisiae, using both transcriptional and recombinational “safeguard” control of essential gene function. Practical biocontainment strategies dependent on the presence of small molecules require them to be active at very low concentrations, rendering them inexpensive and difficult to detect. Histone genes were placed under an inducible promoter controlled by 30 nM estradiol. The stability of the engineered genes was separately regulated by the expression of a site-specific recombinase. The combined frequency of generating viable derivatives when both systems were active was below detection (<10^-10), consistent with their orthogonal nature and the individual escape frequencies of <10^-6. Evaluation of escaper mutants suggests strategies for reducing their emergence. Transcript profiling and growth tests suggest high fitness of safeguarded strains, an important characteristic for wide acceptance. PMID:25624482
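The arithmetic behind the combined escape frequency in this abstract can be checked with a short sketch. It assumes the two safeguards fail independently, which is how the abstract's "orthogonal" is read here; the function name is hypothetical and the three numbers are the bounds quoted in the abstract (one in a million for each system, one in ten billion for the detection limit).

```python
def combined_escape_bound(*individual_bounds):
    """Upper bound on joint escape frequency, assuming independent safeguards."""
    product = 1.0
    for bound in individual_bounds:
        product *= bound
    return product


transcriptional_bound = 1e-6    # escape frequency bound for the promoter safeguard
recombinational_bound = 1e-6    # escape frequency bound for the recombinase safeguard
detection_limit = 1e-10         # lowest frequency the assay could measure

combined = combined_escape_bound(transcriptional_bound, recombinational_bound)
below_detection = combined < detection_limit
print(combined, below_detection)
```

Under the independence assumption the product of the two bounds is about one in a trillion, two orders of magnitude below the detection limit, which is consistent with no escapers being observed when both systems were active.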
NPY genes play an essential role in root gravitropic responses in Arabidopsis.

PubMed

Li, Yuanting; Dai, Xinhua; Cheng, Youfa; Zhao, Yunde

2011-01-01

Plants can sense the direction of gravity and orient their growth to ensure that roots are anchored in soil and that shoots grow upward. Gravitropism has been studied extensively using Arabidopsis genetics, but the exact mechanisms for gravitropism are not fully understood. Here, we demonstrate that five NPY genes play a key role in Arabidopsis root gravitropism. NPY genes were previously identified as regulators of auxin-mediated organogenesis in a genetic pathway with the AGC kinases PID, PID2, WAG1, and WAG2. We show that all five NPY genes are highly expressed in primary root tips. The single npy mutants do not display obvious gravitropism defects, but the npy1 npy2 npy3 npy4 npy5 quintuple mutants show dramatic gravitropic phenotypes. Systematic analysis of all the npy double, triple, and quadruple combinations demonstrates that the five NPY genes all contribute to gravitropism. Our work indicates that gravitropism, phototropism, and organogenesis use analogous mechanisms in which at least one AGC kinase, one NPH3/NPY gene, and one ARF are required.
Expression screening of cancer/testis genes in prostate cancer identifies NR6A1 as a novel marker of disease progression and aggressiveness.

PubMed

Mathieu, Romain; Evrard, Bertrand; Fromont, Gaëlle; Rioux-Leclercq, Nathalie; Godet, Julie; Cathelineau, Xavier; Guillé, François; Primig, Michael; Chalmel, Frédéric

2013-07-01

Cancer/Testis (CT) genes are expressed in male gonads, repressed in most healthy somatic tissues and de-repressed in various somatic malignancies including prostate cancers (PCa). Because of their specific expression signature and their associations with tumor aggressiveness and poor outcomes, CT genes are considered to be useful biomarkers and they are also targets for the development of new anti-cancer immunotherapies. The aim of this study was to identify novel CT genes associated with hormone-sensitive prostate cancer (HSPC), and castration-resistant prostate cancer (CRPC). To identify novel CT genes we screened genes for which transcripts were detected by RNA profiling specifically in normal testis and in either HSPC or CRPC as compared to normal prostate and 44 other healthy tissues using GeneChips. The expression and clinicopathological significance of a promising candidate--NR6A1--was examined in HSPC, CRPC, and metastatic site samples using tissue microarrays. We report the identification of 98 genes detected in CRPC, HSPC and testicular samples but not in the normal controls. Among them, cellular levels of NR6A1 were found to be higher in HSPC compared to normal prostate and further increased in metastatic lesions and CRPC. Furthermore, increased NR6A1 immunoreactivity was significantly associated with a high Gleason score, advanced pT stage and cancer cell proliferation. Our results show that cellular levels of NR6A1 are correlated with disease progression in PCa. We suggest that this essential orphan nuclear receptor is a potential therapeutic target as well as a biomarker of PCa aggressiveness. Copyright © 2013 Wiley Periodicals, Inc.
A Large-Scale RNAi Screen Identifies SGK1 as a Key Survival Kinase for GBM Stem Cells.

PubMed

Kulkarni, Shreya; Goel-Bhattacharya, Surbhi; Sengupta, Sejuti; Cochran, Brent H

2018-01-01

Glioblastoma multiforme (GBM) is the most common type of primary malignant brain cancer and has a very poor prognosis. A subpopulation of cells known as GBM stem-like cells (GBM-SC) have the capacity to initiate and sustain tumor growth and possess molecular characteristics similar to the parental tumor. GBM-SCs are known to be enriched in hypoxic niches and may contribute to therapeutic resistance. Therefore, to identify genetic determinants important for the proliferation and survival of GBM stem cells, an unbiased pooled shRNA screen of 10,000 genes was conducted under normoxic as well as hypoxic conditions. A number of essential genes were identified that are required for GBM-SC growth, under either or both oxygen conditions, in two different GBM-SC lines. Interestingly, only about a third of the essential genes were common to both cell lines. The oxygen environment significantly impacts the cellular genetic dependencies as 30% of the genes required under hypoxia were not required under normoxic conditions. In addition to identifying essential genes already implicated in GBM such as CDK4, KIF11, and RAN, the screen also identified new genes that have not been previously implicated in GBM stem cell biology. The importance of the serum and glucocorticoid-regulated kinase 1 (SGK1) for cellular survival was validated in multiple patient-derived GBM stem cell lines using shRNA, CRISPR, and pharmacologic inhibitors. However, SGK1 depletion and inhibition have little effect on traditional serum-grown glioma lines and on differentiated GBM-SCs, indicating its specific importance in GBM stem cell survival. Implications: This study identifies genes required for the growth and survival of GBM stem cells under both normoxic and hypoxic conditions and finds SGK1 as a novel potential drug target for GBM. Mol Cancer Res; 16(1); 103-14. ©2017 AACR.
A data mining paradigm for identifying key factors in biological processes using gene expression data.

PubMed

Li, Jin; Zheng, Le; Uchiyama, Akihiko; Bin, Lianghua; Mauro, Theodora M; Elias, Peter M; Pawelczyk, Tadeusz; Sakowicz-Burkiewicz, Monika; Trzeciak, Magdalena; Leung, Donald Y M; Morasso, Maria I; Yu, Peng

2018-06-13

A large volume of biological data is being generated for studying mechanisms of various biological processes. These precious data enable large-scale computational analyses to gain biological insights. However, it remains a challenge to mine the data efficiently for knowledge discovery. The heterogeneity of these data makes it difficult to consistently integrate them, slowing down the process of biological discovery. We introduce a data processing paradigm to identify key factors in biological processes via systematic collection of gene expression datasets, primary analysis of data, and evaluation of consistent signals. To demonstrate its effectiveness, our paradigm was applied to epidermal development and identified many genes that play a potential role in this process. Besides the known epidermal development genes, a substantial proportion of the identified genes are still not supported by gain- or loss-of-function studies, yielding many novel genes for future studies. Among them, we selected a top gene for loss-of-function experimental validation and confirmed its function in epidermal differentiation, proving the ability of this paradigm to identify new factors in biological processes. In addition, this paradigm revealed many key genes in cold-induced thermogenesis using data from cold-challenged tissues, demonstrating its generalizability. This paradigm can lead to fruitful results for studying molecular mechanisms in an era of explosive accumulation of publicly available biological data.
The Chromatin Remodeler BPTF Activates a Stemness Gene-Expression Program Essential for the Maintenance of Adult Hematopoietic Stem Cells.

PubMed

Xu, Bowen; Cai, Ling; Butler, Jason M; Chen, Dongliang; Lu, Xiongdong; Allison, David F; Lu, Rui; Rafii, Shahin; Parker, Joel S; Zheng, Deyou; Wang, Gang Greg

2018-03-13

Self-renewal and differentiation of adult stem cells are tightly regulated partly through configuration of chromatin structure by chromatin remodelers. Using knockout mice, we here demonstrate that bromodomain PHD finger transcription factor (BPTF), a component of the nucleosome remodeling factor (NURF) chromatin-remodeling complex, is essential for maintaining the population size of hematopoietic stem/progenitor cells (HSPCs), including long-term hematopoietic stem cells (HSCs). Bptf-deficient HSCs are defective in reconstituted hematopoiesis, and hematopoietic-specific knockout of Bptf caused profound defects including bone marrow failure and anemia. Genome-wide transcriptome profiling revealed that BPTF loss caused downregulation of HSC-specific gene-expression programs, which contain several master transcription factors (Meis1, Pbx1, Mn1, and Lmo2) required for HSC maintenance and self-renewal. Furthermore, we show that BPTF potentiates the chromatin accessibility of key HSC "stemness" genes. These results demonstrate an essential requirement of the chromatin remodeler BPTF and NURF for activation of "stemness" gene-expression programs and proper function of adult HSCs. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
Multi-species comparative analysis of the equine ACE gene identifies a highly conserved potential transcription factor binding site in intron 16.

PubMed

Hamilton, Natasha A; Tammen, Imke; Raadsma, Herman W

2013-01-01

Angiotensin converting enzyme (ACE) is essential for control of blood pressure. The human ACE gene contains an intronic Alu indel (I/D) polymorphism that has been associated with variation in serum enzyme levels, although the functional mechanism has not been identified. The polymorphism has also been associated with cardiovascular disease, type II diabetes, renal disease and elite athleticism. We have characterized the ACE gene in horses of breeds selected for differing physical abilities. The equine gene has a similar structure to that of all known mammalian ACE genes. Nine common single nucleotide polymorphisms (SNPs) discovered in pooled DNA were found to be inherited in nine haplotypes. Three of these SNPs were located in intron 16, homologous to that containing the Alu polymorphism in the human. A highly conserved 18 bp sequence, also within that intron, was identified as being a potential binding site for the transcription factors Oct-1, HFH-1 and HNF-3β, and lies within a larger area of higher than normal homology. This putative regulatory element may contribute to regulation of the documented inter-individual variation in human circulating enzyme levels, for which a functional mechanism is yet to be defined. Two equine SNPs occurred within the conserved area in intron 16, although neither of them disrupted the putative binding site. We propose a possible regulatory mechanism of the ACE gene in mammalian species which was previously unknown. This advance will allow further analysis leading to a better understanding of the mechanisms underpinning the associations seen between the human Alu polymorphism and enzyme levels, cardiovascular disease states and elite athleticism.
Multi-Species Comparative Analysis of the Equine ACE Gene Identifies a Highly Conserved Potential Transcription Factor Binding Site in Intron 16

PubMed Central

Hamilton, Natasha A.; Tammen, Imke; Raadsma, Herman W.

2013-01-01

Angiotensin converting enzyme (ACE) is essential for control of blood pressure. The human ACE gene contains an intronic Alu indel (I/D) polymorphism that has been associated with variation in serum enzyme levels, although the functional mechanism has not been identified. The polymorphism has also been associated with cardiovascular disease, type II diabetes, renal disease and elite athleticism. We have characterized the ACE gene in horses of breeds selected for differing physical abilities. The equine gene has a similar structure to that of all known mammalian ACE genes. Nine common single nucleotide polymorphisms (SNPs) discovered in pooled DNA were found to be inherited in nine haplotypes. Three of these SNPs were located in intron 16, homologous to that containing the Alu polymorphism in the human. A highly conserved 18 bp sequence, also within that intron, was identified as being a potential binding site for the transcription factors Oct-1, HFH-1 and HNF-3β, and lies within a larger area of higher than normal homology. This putative regulatory element may contribute to regulation of the documented inter-individual variation in human circulating enzyme levels, for which a functional mechanism is yet to be defined. Two equine SNPs occurred within the conserved area in intron 16, although neither of them disrupted the putative binding site. We propose a possible regulatory mechanism of the ACE gene in mammalian species which was previously unknown. This advance will allow further analysis leading to a better understanding of the mechanisms underpinning the associations seen between the human Alu polymorphism and enzyme levels, cardiovascular disease states and elite athleticism. PMID:23408978
X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes.

PubMed

Hu, H; Haas, S A; Chelly, J; Van Esch, H; Raynaud, M; de Brouwer, A P M; Weinert, S; Froyen, G; Frints, S G M; Laumonnier, F; Zemojtel, T; Love, M I; Richard, H; Emde, A-K; Bienek, M; Jensen, C; Hambrock, M; Fischer, U; Langnick, C; Feldkamp, M; Wissink-Lindhout, W; Lebrun, N; Castelnau, L; Rucci, J; Montjean, R; Dorseuil, O; Billuart, P; Stuhlmann, T; Shaw, M; Corbett, M A; Gardner, A; Willis-Owen, S; Tan, C; Friend, K L; Belet, S; van Roozendaal, K E P; Jimenez-Pocquet, M; Moizard, M-P; Ronce, N; Sun, R; O'Keeffe, S; Chenna, R; van Bömmel, A; Göke, J; Hackett, A; Field, M; Christie, L; Boyle, J; Haan, E; Nelson, J; Turner, G; Baynam, G; Gillessen-Kaesbach, G; Müller, U; Steinberger, D; Budny, B; Badura-Stronka, M; Latos-Bieleńska, A; Ousager, L B; Wieacker, P; Rodríguez Criado, G; Bondeson, M-L; Annerén, G; Dufke, A; Cohen, M; Van Maldergem, L; Vincent-Delorme, C; Echenne, B; Simon-Bouy, B; Kleefstra, T; Willemsen, M; Fryns, J-P; Devriendt, K; Ullmann, R; Vingron, M; Wrogemann, K; Wienker, T F; Tzschach, A; van Bokhoven, H; Gecz, J; Jentsch, T J; Chen, W; Ropers, H-H; Kalscheuer, V M

2016-01-01

X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of these males were previously tested negative for copy number variations and for mutations in a subset of known XLID genes by Sanger sequencing. In total, 745 X-chromosomal genes were screened. After stringent filtering, a total of 1297 non-recurrent exonic variants remained for prioritization. Co-segregation analysis of potential clinically relevant changes revealed that 80 families (20%) carried pathogenic variants in established XLID genes. In 19 families, we detected likely causative protein truncating and missense variants in 7 novel and validated XLID genes (CLCN4, CNKSR2, FRMPD4, KLHL15, LAS1L, RLIM and USP27X) and potentially deleterious variants in 2 novel candidate XLID genes (CDK16 and TAF1). We show that the CLCN4 and CNKSR2 variants impair protein functions as indicated by electrophysiological studies and altered differentiation of cultured primary neurons from Clcn4(-/-) mice or after mRNA knock-down. The newly identified and candidate XLID proteins belong to pathways and networks with established roles in cognitive function and intellectual disability in particular. We suggest that systematic sequencing of all X-chromosomal genes in a cohort of patients with genetic evidence for X-chromosome locus involvement may resolve up to 58% of Fragile X-negative cases.
X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes

PubMed Central

Hu, H; Haas, S A; Chelly, J; Van Esch, H; Raynaud, M; de Brouwer, A P M; Weinert, S; Froyen, G; Frints, S G M; Laumonnier, F; Zemojtel, T; Love, M I; Richard, H; Emde, A-K; Bienek, M; Jensen, C; Hambrock, M; Fischer, U; Langnick, C; Feldkamp, M; Wissink-Lindhout, W; Lebrun, N; Castelnau, L; Rucci, J; Montjean, R; Dorseuil, O; Billuart, P; Stuhlmann, T; Shaw, M; Corbett, M A; Gardner, A; Willis-Owen, S; Tan, C; Friend, K L; Belet, S; van Roozendaal, K E P; Jimenez-Pocquet, M; Moizard, M-P; Ronce, N; Sun, R; O'Keeffe, S; Chenna, R; van Bömmel, A; Göke, J; Hackett, A; Field, M; Christie, L; Boyle, J; Haan, E; Nelson, J; Turner, G; Baynam, G; Gillessen-Kaesbach, G; Müller, U; Steinberger, D; Budny, B; Badura-Stronka, M; Latos-Bieleńska, A; Ousager, L B; Wieacker, P; Rodríguez Criado, G; Bondeson, M-L; Annerén, G; Dufke, A; Cohen, M; Van Maldergem, L; Vincent-Delorme, C; Echenne, B; Simon-Bouy, B; Kleefstra, T; Willemsen, M; Fryns, J-P; Devriendt, K; Ullmann, R; Vingron, M; Wrogemann, K; Wienker, T F; Tzschach, A; van Bokhoven, H; Gecz, J; Jentsch, T J; Chen, W; Ropers, H-H; Kalscheuer, V M

2016-01-01

X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of these males were previously tested negative for copy number variations and for mutations in a subset of known XLID genes by Sanger sequencing. In total, 745 X-chromosomal genes were screened. After stringent filtering, a total of 1297 non-recurrent exonic variants remained for prioritization. Co-segregation analysis of potential clinically relevant changes revealed that 80 families (20%) carried pathogenic variants in established XLID genes. In 19 families, we detected likely causative protein truncating and missense variants in 7 novel and validated XLID genes (CLCN4, CNKSR2, FRMPD4, KLHL15, LAS1L, RLIM and USP27X) and potentially deleterious variants in 2 novel candidate XLID genes (CDK16 and TAF1). We show that the CLCN4 and CNKSR2 variants impair protein functions as indicated by electrophysiological studies and altered differentiation of cultured primary neurons from Clcn4−/− mice or after mRNA knock-down. The newly identified and candidate XLID proteins belong to pathways and networks with established roles in cognitive function and intellectual disability in particular. We suggest that systematic sequencing of all X-chromosomal genes in a cohort of patients with genetic evidence for X-chromosome locus involvement may resolve up to 58% of Fragile X-negative cases. PMID:25644381
A multicopper oxidase is essential for manganese oxidation and laccase-like activity in Pedomicrobium sp. ACM 3067.

PubMed

Ridge, Justin P; Lin, Marianne; Larsen, Eloise I; Fegan, Mark; McEwan, Alastair G; Sly, Lindsay I

2007-04-01

Pedomicrobium sp. ACM 3067 is a budding-hyphal bacterium belonging to the alpha-Proteobacteria which is able to oxidize soluble Mn2+ to insoluble manganese oxide. A cosmid, from a whole-genome library, containing the putative genes responsible for manganese oxidation was identified and a primer-walking approach yielded 4350 bp of novel sequence. Analysis of this sequence showed the presence of a predicted three-gene operon, moxCBA. The moxA gene product showed homology to multicopper oxidases (MCOs) and contained the characteristic four copper-binding motifs (A, B, C and D) common to MCOs. An insertion mutation of moxA showed that this gene was essential for both manganese oxidation and laccase-like activity. The moxB gene product showed homology to a family of outer membrane proteins which are essential for Type I secretion in Gram-negative bacteria. moxBA has not been observed in other manganese-oxidizing bacteria but homologues were identified in the genomes of several bacteria including Sinorhizobium meliloti 1021 and Agrobacterium tumefaciens C58. These results suggest that moxBA and its homologues constitute a family of genes encoding an MCO and a predicted component of the Type I secretion system.

A Genome-Wide Knockout Screen to Identify Genes Involved in Acquired Carboplatin Resistance

DTIC Science & Technology

2016-07-01

library screen to identify genes that when knocked out render human ovarian cells > 2.5-fold resistant to CBDCA; 2) Validate the ability of...a GeCKOv2 library screen to identify genes that when knocked out render human ovarian cells > 2.5-fold resistant to CBDCA; 2) validate the ability of...resistance in either cell lines or clinical samples. The CRISPR-Cas9 technology now provides us with a major new tool to introduce knock out mutations
A computational approach to identify cellular heterogeneity and tissue-specific gene regulatory networks.

PubMed

Jambusaria, Ankit; Klomp, Jeff; Hong, Zhigang; Rafii, Shahin; Dai, Yang; Malik, Asrar B; Rehman, Jalees

2018-06-07

The heterogeneity of cells across tissue types represents a major challenge for studying biological mechanisms as well as for therapeutic targeting of distinct tissues. Computational prediction of tissue-specific gene regulatory networks may provide important insights into the mechanisms underlying the cellular heterogeneity of cells in distinct organs and tissues. Using three pathway analysis techniques, gene set enrichment analysis (GSEA), parametric analysis of gene set enrichment (PGSEA), alongside our novel model (HeteroPath), which assesses heterogeneously upregulated and downregulated genes within the context of pathways, we generated distinct tissue-specific gene regulatory networks. We analyzed gene expression data derived from freshly isolated heart, brain, and lung endothelial cells and populations of neurons in the hippocampus, cingulate cortex, and amygdala. In both datasets, we found that HeteroPath segregated the distinct cellular populations by identifying regulatory pathways that were not identified by GSEA or PGSEA. Using simulated datasets, HeteroPath demonstrated robustness that was comparable to what was seen using existing gene set enrichment methods. Furthermore, we generated tissue-specific gene regulatory networks involved in vascular heterogeneity and neuronal heterogeneity by performing motif enrichment of the heterogeneous genes identified by HeteroPath and linking the enriched motifs to regulatory transcription factors in the ENCODE database. HeteroPath assesses contextual bidirectional gene expression within pathways and thus allows for transcriptomic assessment of cellular heterogeneity. Unraveling tissue-specific heterogeneity of gene expression can lead to a better understanding of the molecular underpinnings of tissue-specific phenotypes.
Essential Role of Chromatin Remodeling Protein Bptf in Early Mouse Embryos and Embryonic Stem Cells

PubMed Central

Landry, Joseph; Sharov, Alexei A.; Piao, Yulan; Sharova, Lioudmila V.; Xiao, Hua; Southon, Eileen; Matta, Jennifer; Tessarollo, Lino; Zhang, Ying E.; Ko, Minoru S. H.; Kuehn, Michael R.; Yamaguchi, Terry P.; Wu, Carl

2008-01-01

We have characterized the biological functions of the chromatin remodeling protein Bptf (Bromodomain PHD-finger Transcription Factor), the largest subunit of NURF (Nucleosome Remodeling Factor) in a mammal. Bptf mutants manifest growth defects at the post-implantation stage and are reabsorbed by E8.5. Histological analyses of lineage markers show that Bptf−/− embryos implant but fail to establish a functional distal visceral endoderm. Microarray analysis at early stages of differentiation has identified Bptf-dependent gene targets including homeobox transcription factors and genes essential for the development of ectoderm, mesoderm, and both definitive and visceral endoderm. Differentiation of Bptf−/− embryonic stem cell lines into embryoid bodies revealed its requirement for development of mesoderm, endoderm, and ectoderm tissue lineages, and uncovered many genes whose activation or repression is Bptf-dependent. We also provide functional and physical links between the Bptf-containing NURF complex and the Smad transcription factors. These results suggest that Bptf may co-regulate some gene targets of this pathway, which is essential for establishment of the visceral endoderm. We conclude that Bptf likely regulates genes and signaling pathways essential for the development of key tissues of the early mouse embryo. PMID:18974875
Identifying candidate driver genes by integrative ovarian cancer genomics data

NASA Astrophysics Data System (ADS)

Lu, Xinguo; Lu, Jibo

2017-08-01

Integrative analysis of molecular mechanics underlying cancer can distinguish interactions that cannot be revealed based on one kind of data for the appropriate diagnosis and treatment of cancer patients. Tumor samples exhibit heterogeneity in omics data, such as somatic mutations, Copy Number Variations (CNVs), gene expression profiles and so on. In this paper we combined gene co-expression modules and mutation modulators separately in tumor patients to obtain the candidate driver genes for resistant and sensitive tumors from the heterogeneous data. The modulators in the final list are well known in biological processes associated with ovarian cancer, such as CCL17, CACTIN, CCL16, CCL22, APOB, KDF1, CCL11, HNF1B, LRG1, MED1 and so on, which can help to facilitate the discovery of biomarkers, molecular diagnostics, and drug discovery.
Identifying novel genes and chemicals related to nasopharyngeal cancer in a heterogeneous network.

PubMed

Li, Zhandong; An, Lifeng; Li, Hao; Wang, ShaoPeng; Zhou, You; Yuan, Fei; Li, Lin

2016-05-05

Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer.
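The searching stage described in this abstract can be illustrated with a small sketch. The abstract does not say how the heterogeneous network is searched, so the shortest-path criterion below is an assumption made for illustration, and the node names (geneA, chemX, and so on) are hypothetical placeholders rather than data from the study. The screening stage is not shown.

```python
from collections import deque
from itertools import combinations


def shortest_path(graph, start, goal):
    """Breadth-first search; returns one shortest path as a list of nodes, or None."""
    parents = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for neighbor in graph.get(node, ()):
            if neighbor not in parents:
                parents[neighbor] = node
                queue.append(neighbor)
    return None


def candidate_nodes(graph, known):
    """Collect interior nodes on shortest paths between pairs of known nodes.

    Assumption: a gene or chemical lying between two known disease-related
    nodes is treated as a candidate. This is an illustrative criterion only.
    """
    candidates = set()
    for a, b in combinations(sorted(known), 2):
        path = shortest_path(graph, a, b)
        if path:
            candidates.update(path[1:-1])
    return candidates - set(known)


# Hypothetical toy network mixing gene and chemical nodes (undirected adjacency).
toy_graph = {
    "geneA": ["chemX"],
    "chemX": ["geneA", "geneB"],
    "geneB": ["chemX", "geneC"],
    "geneC": ["geneB"],
}
found = candidate_nodes(toy_graph, {"geneA", "geneB"})
print(found)
```

In this toy network the only node lying between the two known genes is the chemical chemX, so it is the only candidate returned; geneC is reachable but sits on no path between known nodes.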
Identifying novel genes and chemicals related to nasopharyngeal cancer in a heterogeneous network

PubMed Central

Li, Zhandong; An, Lifeng; Li, Hao; Wang, ShaoPeng; Zhou, You; Yuan, Fei; Li, Lin

2016-01-01

Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer. PMID:27149165
A whole-blood transcriptome meta-analysis identifies gene expression signatures of cigarette smoking

PubMed Central

Huan, Tianxiao; Joehanes, Roby; Schurmann, Claudia; Schramm, Katharina; Pilling, Luke C.; Peters, Marjolein J.; Mägi, Reedik; DeMeo, Dawn; O'Connor, George T.; Ferrucci, Luigi; Teumer, Alexander; Homuth, Georg; Biffar, Reiner; Völker, Uwe; Herder, Christian; Waldenberger, Melanie; Peters, Annette; Zeilinger, Sonja; Metspalu, Andres; Hofman, Albert; Uitterlinden, André G.; Hernandez, Dena G.; Singleton, Andrew B.; Bandinelli, Stefania; Munson, Peter J.; Lin, Honghuang; Benjamin, Emelia J.; Esko, Tõnu; Grabe, Hans J.; Prokisch, Holger; van Meurs, Joyce B.J.; Melzer, David; Levy, Daniel

2016-01-01

Cigarette smoking is a leading modifiable cause of death worldwide. We hypothesized that cigarette smoking induces extensive transcriptomic changes that lead to target-organ damage and smoking-related diseases. We performed a meta-analysis of transcriptome-wide gene expression using whole blood-derived RNA from 10,233 participants of European ancestry in six cohorts (including 1421 current and 3955 former smokers) to identify associations between smoking and altered gene expression levels. At a false discovery rate (FDR) <0.1, we identified 1270 differentially expressed genes in current vs. never smokers, and 39 genes in former vs. never smokers. Expression levels of 12 genes remained elevated up to 30 years after smoking cessation, suggesting that the molecular consequence of smoking may persist for decades. Gene ontology analysis revealed enrichment of smoking-related genes for activation of platelets and lymphocytes, immune response, and apoptosis. Many of the top smoking-related differentially expressed genes, including LRRN3 and GPR15, have DNA methylation loci in promoter regions that were recently reported to be hypomethylated among smokers. By linking differential gene expression with smoking-related disease phenotypes, we demonstrated that stroke and pulmonary function show enrichment for smoking-related gene expression signatures. Mediation analysis revealed the expression of several genes (e.g. ALAS2) to be putative mediators of the associations between smoking and inflammatory biomarkers (IL6 and C-reactive protein levels). Our transcriptomic study provides potential insights into the effects of cigarette smoking on gene expression in whole blood and their relations to smoking-related diseases. The results of such analyses may highlight attractive targets for treating or preventing smoking-related health effects. PMID:28158590
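The FDR < 0.1 threshold mentioned in this abstract can be made concrete with a short sketch. The abstract does not name the FDR procedure, so the Benjamini-Hochberg step-up rule is assumed here, and the p-values are made-up illustrative numbers, not results from the study.

```python
def benjamini_hochberg(p_values, alpha=0.1):
    """Return a list of booleans, True where a test is rejected at FDR level alpha.

    Benjamini-Hochberg step-up rule: sort the p-values, find the largest
    rank k with p_(k) <= (k / m) * alpha, and reject the k smallest.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            cutoff_rank = rank
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= cutoff_rank:
            rejected[idx] = True
    return rejected


# Hypothetical per-gene p-values for a current-vs-never smoker comparison.
p = [0.001, 0.04, 0.03, 0.2, 0.8]
flags = benjamini_hochberg(p, alpha=0.1)
print(flags)
```

With these five made-up p-values the three smallest pass their rank-scaled thresholds (0.02, 0.04, 0.06), so the first three genes would be called differentially expressed.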
Resistance gene candidates identified by PCR with degenerate oligonucleotide primers map to clusters of resistance genes in lettuce.

PubMed

Shen, K A; Meyers, B C; Islam-Faridi, M N; Chin, D B; Stelly, D M; Michelmore, R W

1998-08-01

The recent cloning of genes for resistance against diverse pathogens from a variety of plants has revealed that many share conserved sequence motifs. This provides the possibility of isolating numerous additional resistance genes by polymerase chain reaction (PCR) with degenerate oligonucleotide primers. We amplified resistance gene candidates (RGCs) from lettuce with multiple combinations of primers with low degeneracy designed from motifs in the nucleotide binding sites (NBSs) of RPS2 of Arabidopsis thaliana and N of tobacco. Genomic DNA, cDNA, and bacterial artificial chromosome (BAC) clones were successfully used as templates. Four families of sequences were identified that had the same similarity to each other as to resistance genes from other species. The relationship of the amplified products to resistance genes was evaluated by several sequence and genetic criteria. The amplified products contained open reading frames with additional sequences characteristic of NBSs. Hybridization of RGCs to genomic DNA and to BAC clones revealed large numbers of related sequences. Genetic analysis demonstrated the existence of clustered multigene families for each of the four RGC sequences. This parallels classical genetic data on clustering of disease resistance genes. Two of the four families mapped to known clusters of resistance genes; these two families were therefore studied in greater detail. Additional evidence that these RGCs could be resistance genes was gained by the identification of leucine-rich repeat (LRR) regions in sequences adjoining the NBS similar to those in RPM1 and RPS2 of A. thaliana. Fluorescent in situ hybridization confirmed the clustered genomic distribution of these sequences. The use of PCR with degenerate oligonucleotide primers is therefore an efficient method to identify numerous RGCs in plants.
Gene-environment interaction involving recently identified colorectal cancer susceptibility loci

PubMed Central

Kantor, Elizabeth D.; Hutter, Carolyn M.; Minnier, Jessica; Berndt, Sonja I.; Brenner, Hermann; Caan, Bette J.; Campbell, Peter T.; Carlson, Christopher S.; Casey, Graham; Chan, Andrew T.; Chang-Claude, Jenny; Chanock, Stephen J.; Cotterchio, Michelle; Du, Mengmeng; Duggan, David; Fuchs, Charles S.; Giovannucci, Edward L.; Gong, Jian; Harrison, Tabitha A.; Hayes, Richard B.; Henderson, Brian E.; Hoffmeister, Michael; Hopper, John L.; Jenkins, Mark A.; Jiao, Shuo; Kolonel, Laurence N.; Le Marchand, Loic; Lemire, Mathieu; Ma, Jing; Newcomb, Polly A.; Ochs-Balcom, Heather M.; Pflugeisen, Bethann M.; Potter, John D.; Rudolph, Anja; Schoen, Robert E.; Seminara, Daniela; Slattery, Martha L.; Stelling, Deanna L.; Thomas, Fridtjof; Thornquist, Mark; Ulrich, Cornelia M.; Warnick, Greg S.; Zanke, Brent W.; Peters, Ulrike; Hsu, Li; White, Emily

2014-01-01

BACKGROUND: Genome-wide association studies have identified several single nucleotide polymorphisms (SNPs) that are associated with risk of colorectal cancer (CRC). Prior research has evaluated the presence of gene-environment interaction involving the first 10 identified susceptibility loci, but little work has been conducted on interaction involving SNPs at recently identified susceptibility loci, including: rs10911251, rs6691170, rs6687758, rs11903757, rs10936599, rs647161, rs1321311, rs719725, rs1665650, rs3824999, rs7136702, rs11169552, rs59336, rs3217810, rs4925386, and rs2423279. METHODS: Data on 9160 cases and 9280 controls from the Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) and Colon Cancer Family Registry (CCFR) were used to evaluate the presence of interaction involving the above-listed SNPs and sex, body mass index (BMI), alcohol consumption, smoking, aspirin use, post-menopausal hormone (PMH) use, as well as intake of dietary calcium, dietary fiber, dietary folate, red meat, processed meat, fruit, and vegetables. Interaction was evaluated using a fixed-effects meta-analysis of an efficient Empirical Bayes estimator, and permutation was used to account for multiple comparisons. RESULTS: None of the permutation-adjusted p-values reached statistical significance. CONCLUSIONS: The associations between recently identified genetic susceptibility loci and CRC are not strongly modified by sex, BMI, alcohol, smoking, aspirin, PMH use, and various dietary factors. IMPACT: Results suggest no evidence of strong gene-environment interactions involving the recently identified 16 susceptibility loci for CRC taken one at a time. PMID:24994789
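The fixed-effects meta-analysis step named in the methods can be sketched as inverse-variance pooling of per-study interaction estimates. This is a generic illustration only: the Empirical Bayes estimator and the permutation adjustment used in the study are not reproduced, and the per-study estimates and standard errors below are hypothetical.

```python
import math


def fixed_effects_pool(estimates, std_errors):
    """Inverse-variance weighted pooled estimate and its standard error."""
    weights = [1.0 / se ** 2 for se in std_errors]
    total = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, estimates)) / total
    pooled_se = math.sqrt(1.0 / total)
    return pooled, pooled_se


def two_sided_p(z):
    """Two-sided p-value for a standard normal test statistic."""
    return math.erfc(abs(z) / math.sqrt(2))


# Hypothetical SNP-by-exposure interaction log odds ratios from two studies.
betas = [0.10, 0.20]
ses = [0.1, 0.1]
pooled, pooled_se = fixed_effects_pool(betas, ses)
p_value = two_sided_p(pooled / pooled_se)
print(pooled, pooled_se, p_value)
```

Because both hypothetical studies have the same standard error, they receive equal weight and the pooled estimate is simply their average; studies with smaller standard errors would otherwise dominate.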
LGscore: A method to identify disease-related genes using biological literature and Google data.

PubMed

Kim, Jeongwoo; Kim, Hyunjin; Yoon, Youngmi; Park, Sanghyun

2015-04-01

Since the genome project in the 1990s, a number of studies associated with genes have been conducted and researchers have confirmed that genes are involved in disease. For this reason, the identification of the relationships between diseases and genes is important in biology. We propose a method called LGscore, which identifies disease-related genes using Google data and literature data. To implement this method, first, we construct a disease-related gene network using text-mining results. We then extract gene-gene interactions based on co-occurrences in abstract data obtained from PubMed, and calculate the weights of edges in the gene network by means of Z-scoring. The weights contain two values: the frequency and the Google search results. The frequency value is extracted from literature data, and the Google search result is obtained using Google. We assign a score to each gene through a network analysis. We assume that genes with a large number of links and numerous Google search results and frequency values are more likely to be involved in disease. For validation, we investigated the top 20 inferred genes for five different diseases using answer sets. The answer sets comprised six databases that contain information on disease-gene relationships. We identified a significant number of disease-related genes as well as candidate genes for Alzheimer's disease, diabetes, colon cancer, lung cancer, and prostate cancer. Our method was up to 40% more accurate than existing methods. Copyright © 2015 Elsevier Inc. All rights reserved.
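The co-occurrence and Z-scoring steps described above can be sketched as follows. This is a toy illustration under stated assumptions, not the LGscore implementation: the function names, the three-abstract input, and the gene symbols are hypothetical, only the literature frequency is scored, and the Google search component and the network-analysis scoring are omitted.

```python
from collections import Counter
from itertools import combinations
from statistics import mean, pstdev

def cooccurrence_counts(abstract_gene_sets):
    """Count how many abstracts mention each unordered gene pair together."""
    counts = Counter()
    for genes in abstract_gene_sets:
        # Sort so that each pair has a single canonical (a, b) key.
        for a, b in combinations(sorted(set(genes)), 2):
            counts[(a, b)] += 1
    return counts

def z_scores(counts):
    """Standardize pair frequencies into Z-scores usable as edge weights."""
    values = list(counts.values())
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        # All pairs are equally frequent, so no pair stands out.
        return {pair: 0.0 for pair in counts}
    return {pair: (c - mu) / sigma for pair, c in counts.items()}

# Hypothetical gene mentions extracted from three abstracts.
abstracts = [{"APP", "PSEN1"}, {"APP", "PSEN1", "APOE"}, {"APP", "APOE"}]
edge_weights = z_scores(cooccurrence_counts(abstracts))
print(edge_weights)
```

Using the population standard deviation (`pstdev`) is a design choice made for this sketch; the abstract does not say which variant the authors used.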
Identifying candidate genes affecting developmental time in Drosophila melanogaster: pervasive pleiotropy and gene-by-environment interaction

PubMed Central

Mensch, Julián; Lavagnino, Nicolás; Carreira, Valeria Paula; Massaldi, Ana; Hasson, Esteban; Fanara, Juan José

2008-01-01

Background Understanding the genetic architecture of ecologically relevant adaptive traits requires the contribution of developmental and evolutionary biology. The time to reach the age of reproduction is a complex life history trait commonly known as developmental time. In particular, in holometabolous insects that occupy ephemeral habitats, like fruit flies, the impact of developmental time on fitness is further exaggerated. The present work is one of the first systematic studies of the genetic basis of developmental time, in which we also evaluate the impact of environmental variation on the expression of the trait. Results We analyzed 179 co-isogenic single P[GT1]-element insertion lines of Drosophila melanogaster to identify novel genes affecting developmental time in flies reared at 25°C. Sixty percent of the lines showed a heterochronic phenotype, suggesting that a large number of genes affect this trait. Mutant lines for the genes Merlin and Karl showed the most extreme phenotypes exhibiting a developmental time reduction and increase, respectively, of over 2 days and 4 days relative to the control (a co-isogenic P-element insertion free line). In addition, a subset of 42 lines selected at random from the initial set of 179 lines was screened at 17°C. Interestingly, the gene-by-environment interaction accounted for 52% of total phenotypic variance. Plastic reaction norms were found for a large number of developmental time candidate genes. Conclusion We identified components of several integrated time-dependent pathways affecting egg-to-adult developmental time in Drosophila. At the same time, we also show that many heterochronic phenotypes may arise from changes in genes involved in several developmental mechanisms that do not explicitly control the timing of specific events. We also demonstrate that many developmental time genes have pleiotropic effects on several adult traits and that the action of most of them is sensitive to temperature during
The application of artificial intelligence to microarray data: identification of a novel gene signature to identify bladder cancer progression.

PubMed

Catto, James W F; Abbod, Maysam F; Wild, Peter J; Linkens, Derek A; Pilarsky, Christian; Rehman, Ishtiaq; Rosario, Derek J; Denzinger, Stefan; Burger, Maximilian; Stoehr, Robert; Knuechel, Ruth; Hartmann, Arndt; Hamdy, Freddie C

2010-03-01

New methods for identifying bladder cancer (BCa) progression are required. Gene expression microarrays can reveal insights into disease biology and identify novel biomarkers. However, these experiments produce large datasets that are difficult to interpret. To develop a novel method of microarray analysis combining two forms of artificial intelligence (AI): neurofuzzy modelling (NFM) and artificial neural networks (ANN) and validate it in a BCa cohort. We used AI and statistical analyses to identify progression-related genes in a microarray dataset (n=66 tumours, n=2800 genes). The AI-selected genes were then investigated in a second cohort (n=262 tumours) using immunohistochemistry. We compared the accuracy of AI and statistical approaches to identify tumour progression. AI identified 11 progression-associated genes (odds ratio [OR]: 0.70; 95% confidence interval [CI], 0.56-0.87; p=0.0004), and these were more discriminate than genes chosen using statistical analyses (OR: 1.24; 95% CI, 0.96-1.60; p=0.09). The expression of six AI-selected genes (LIG3, FAS, KRT18, ICAM1, DSG2, and BRCA2) was determined using commercial antibodies and successfully identified tumour progression (concordance index: 0.66; log-rank test: p=0.01). AI-selected genes were more discriminate than pathologic criteria at determining progression (Cox multivariate analysis: p=0.01). Limitations include the use of statistical correlation to identify 200 genes for AI analysis and that we did not compare regression identified genes with immunohistochemistry. AI and statistical analyses use different techniques of inference to determine gene-phenotype associations and identify distinct prognostic gene signatures that are equally valid. We have identified a prognostic gene signature whose members reflect a variety of carcinogenic pathways that could identify progression in non-muscle-invasive BCa. 2009 European Association of Urology. Published by Elsevier B.V. All rights reserved.
Genetic organization of the unc-22 IV gene and the adjacent region in Caenorhabditis elegans.

PubMed

Rogalski, T M; Baillie, D L

1985-01-01

The genetic organization of the region immediately adjacent to the unc-22 IV gene in Caenorhabditis elegans has been studied. We have identified twenty essential genes in this interval of approximately 1.5-map units on Linkage Group IV. The mutations that define these genes were positioned by recombination mapping and complementation with several deficiencies. With few exceptions, the positions obtained by these two methods agreed. Eight of the twenty essential genes identified are represented by more than one allele. Three possible internal deletions of the unc-22 gene have been located by intra-genic mapping. In addition, the right end point of a deficiency or an inversion affecting the adjacent genes let-56 and unc-22 has been positioned inside the unc-22 gene.
Identification of essential genes of Pseudomonas aeruginosa for its growth in airway mucus.

PubMed

Alrahman, Mohammed Abd; Yoon, Sang Sun

2017-01-01

Pseudomonas aeruginosa has been identified as an important causative agent of airway infection, mainly in cystic fibrosis. This disease is characterized by defective mucociliary clearance induced in part by mucus hyper-production. Mucin is a major component of airway mucus and is heavily O-glycosylated, with a protein backbone. Airway infection is known to be established with bacterial adhesion to mucin. However, the genes involved in mucin degradation or utilization remain elusive. In this study, we sought to provide a genetic basis of P. aeruginosa airway growth by identifying those genes. First, using RNASeq analyses, we compared genome-wide expression profiles of PAO1, a prototype P. aeruginosa laboratory strain, grown in M9-mucin (M9M) and M9-glucose (M9G) media. Additionally, a PAO1 transposon (Tn) insertion mutants library was screened for mutants defective in growth in M9M medium. One mutant with a Tn insertion in the xcpU gene (PA3100) was determined to exhibit faulty growth in M9M medium. This gene contributes to the type II secretion system, suggesting that P. aeruginosa uses this secretion system to produce a number of proteins to break down and assimilate the mucin molecule. Furthermore, we screened the PAO1 genome for genes with protease activity. Of 13 mutants, one with mutation in PA3247 gene exhibited defective growth in M9M, suggesting that the PA3247-encoded protease plays a role in mucin utilization. Further mechanistic dissection of this particular process will reveal new drug targets, the inhibition of which could control recalcitrant P. aeruginosa infections.
Comprehensive Ex Vivo Transposon Mutagenesis Identifies Genes That Promote Growth Factor Independence and Leukemogenesis.

PubMed

Guo, Yabin; Updegraff, Barrett L; Park, Sunho; Durakoglugil, Deniz; Cruz, Victoria H; Maddux, Sarah; Hwang, Tae Hyun; O'Donnell, Kathryn A

2016-02-15

Aberrant signaling through cytokine receptors and their downstream signaling pathways is a major oncogenic mechanism underlying hematopoietic malignancies. To better understand how these pathways become pathologically activated and to potentially identify new drivers of hematopoietic cancers, we developed a high-throughput functional screening approach using ex vivo mutagenesis with the Sleeping Beauty transposon. We analyzed over 1,100 transposon-mutagenized pools of Ba/F3 cells, an IL3-dependent pro-B-cell line, which acquired cytokine independence and tumor-forming ability. Recurrent transposon insertions could be mapped to genes in the JAK/STAT and MAPK pathways, confirming the ability of this strategy to identify known oncogenic components of cytokine signaling pathways. In addition, recurrent insertions were identified in a large set of genes that have been found to be mutated in leukemia or associated with survival, but were not previously linked to the JAK/STAT or MAPK pathways nor shown to functionally contribute to leukemogenesis. Forced expression of these novel genes resulted in IL3-independent growth in vitro and tumorigenesis in vivo, validating this mutagenesis-based approach for identifying new genes that promote cytokine signaling and leukemogenesis. Therefore, our findings provide a broadly applicable approach for classifying functionally relevant genes in diverse malignancies and offer new insights into the impact of cytokine signaling on leukemia development. ©2015 American Association for Cancer Research.
The Association of Mitofusion-2 Gene Polymorphisms with Susceptibility of Essential Hypertension in Northern Han Chinese Population.

PubMed

Li, Mei; Zhang, Bei; Li, Chuang; Liu, Jielin; Liu, Ya; Sun, Dongdong; Ma, Hanying; Wen, Shaojun

2016-01-01

Mitofusion-2 (Mfn2) played an important role in regulating vascular smooth muscle cells proliferation, insulin resistance and endoplasmic reticulum stress, which were found to be involved in the development of hypertension. So we inferred that the Mfn2 gene may participate in the pathogenesis of hypertension. The aim of this study was to determine whether common single nucleotide polymorphisms (SNPs) in Mfn2 gene were associated with essential hypertension (EH) in northern Han Chinese. We genotyped 6 tagging SNPs of Mfn2 gene (rs2336384, rs2295281, rs17037564, rs2236057, rs2236058 and rs3766741) with the TaqMan assay in 626 hypertensive patients and 618 controls. Logistic regression analysis indicated that CC+CA genotype of rs2336384 and AA+AG genotype of rs2236057 were significantly associated with increased risk of EH (OR=1.617, P=0.005; OR=1.418, P=0.031, respectively). GG genotype of rs2236058 and GG+CG genotype of rs3766741 were found to be significantly associated with decreased risk of EH (OR=0.662, P=0.023; OR=0.639, P=0.024). When stratified by gender, for rs2336384, rs2236057 and rs2236058, significant association was observed in males, but not in females. Haplotype analysis indicated that the CCAACC haplotype was positively correlated with EH and there was a negative correlation between ACAGGG haplotype and EH. This study demonstrated that Mfn2 gene polymorphisms were associated with essential hypertension in northern Han Chinese population, especially in male subjects.
The Association of Mitofusion-2 Gene Polymorphisms with Susceptibility of Essential Hypertension in Northern Han Chinese Population

PubMed Central

Li, Mei; Zhang, Bei; Li, Chuang; Liu, Jielin; Liu, Ya; Sun, Dongdong; Ma, Hanying; Wen, Shaojun

2016-01-01

Background: Mitofusion-2 (Mfn2) played an important role in regulating vascular smooth muscle cells proliferation, insulin resistance and endoplasmic reticulum stress, which were found to be involved in the development of hypertension. So we inferred that the Mfn2 gene may participate in the pathogenesis of hypertension. The aim of this study was to determine whether common single nucleotide polymorphisms (SNPs) in Mfn2 gene were associated with essential hypertension (EH) in northern Han Chinese. Methods: We genotyped 6 tagging SNPs of Mfn2 gene (rs2336384, rs2295281, rs17037564, rs2236057, rs2236058 and rs3766741) with the TaqMan assay in 626 hypertensive patients and 618 controls. Results: Logistic regression analysis indicated that CC+CA genotype of rs2336384 and AA+AG genotype of rs2236057 were significantly associated with increased risk of EH (OR=1.617, P=0.005; OR=1.418, P=0.031, respectively). GG genotype of rs2236058 and GG+CG genotype of rs3766741 were found to be significantly associated with decreased risk of EH (OR=0.662, P=0.023; OR=0.639, P=0.024). When stratified by gender, for rs2336384, rs2236057 and rs2236058, significant association was observed in males, but not in females. Haplotype analysis indicated that the CCAACC haplotype was positively correlated with EH and there was a negative correlation between ACAGGG haplotype and EH. Conclusions: This study demonstrated that Mfn2 gene polymorphisms were associated with essential hypertension in northern Han Chinese population, especially in male subjects. PMID:26816493
The Flavin-Containing Monooxygenase 3 Gene and Essential Hypertension: The Joint Effect of Polymorphism E158K and Cigarette Smoking on Disease Susceptibility

PubMed Central

Bushueva, Olga; Solodilova, Maria; Churnosov, Mikhail; Ivanov, Vladimir; Polonikov, Alexey

2014-01-01

Gene encoding flavin-containing monooxygenase 3 (FMO3), a microsomal antioxidant defense enzyme, has been suggested to contribute to essential hypertension (EH). The present study was designed to investigate whether common functional polymorphism E158K (rs2266782) of the FMO3 gene is associated with EH susceptibility in a Russian population. A total of 2 995 unrelated subjects from Kursk (1 362 EH patients and 843 healthy controls) and Belgorod (357 EH patients and 422 population controls) regions of Central Russia were recruited for this study. DNA samples from all study participants were genotyped for the FMO3 gene polymorphism through PCR followed by RFLP analysis. We found that the polymorphism E158K is associated with increased risk of essential hypertension in both discovery population from Kursk region (OR 1.36 95% CI 1.09–1.69, P = 0.01) and replication population from Belgorod region (OR 1.54 95% CI 1.07–1.89, P = 0.02) after adjustment for gender and age using logistic regression analysis. Further analysis showed that the increased hypertension risk in carriers of genotype 158KK gene occurred in cigarette smokers, whereas nonsmoker carriers of this genotype did not show the disease risk. This is the first study reporting the association of the FMO3 gene polymorphism and the risk of essential hypertension. PMID:25243081
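For readers unfamiliar with the "OR ... 95% CI" notation used above, the sketch below computes an unadjusted odds ratio with a Wald confidence interval from a 2x2 genotype-by-status table. It is a simplification and not the authors' analysis: the study reports estimates adjusted for gender and age by logistic regression, whereas this example uses no covariates, and the function name `odds_ratio_ci` and the cell counts are hypothetical.

```python
import math

def odds_ratio_ci(exposed_cases, exposed_controls,
                  unexposed_cases, unexposed_controls, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval (default 95%)."""
    a, b, c, d = exposed_cases, exposed_controls, unexposed_cases, unexposed_controls
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) from the four cell counts.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

# Hypothetical counts: risk-genotype carriers vs. non-carriers, cases vs. controls.
print(odds_ratio_ci(120, 80, 300, 280))
```

An interval that excludes 1 corresponds to a nominally significant association at the chosen level. The interval is symmetric around the odds ratio on the log scale, so the estimate is the geometric mean of its bounds.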
Computational correction of copy number effect improves specificity of CRISPR-Cas9 essentiality screens in cancer cells.

PubMed

Meyers, Robin M; Bryan, Jordan G; McFarland, James M; Weir, Barbara A; Sizemore, Ann E; Xu, Han; Dharia, Neekesh V; Montgomery, Phillip G; Cowley, Glenn S; Pantel, Sasha; Goodale, Amy; Lee, Yenarae; Ali, Levi D; Jiang, Guozhi; Lubonja, Rakela; Harrington, William F; Strickland, Matthew; Wu, Ting; Hawes, Derek C; Zhivich, Victor A; Wyatt, Meghan R; Kalani, Zohra; Chang, Jaime J; Okamoto, Michael; Stegmaier, Kimberly; Golub, Todd R; Boehm, Jesse S; Vazquez, Francisca; Root, David E; Hahn, William C; Tsherniak, Aviad

2017-12-01

The CRISPR-Cas9 system has revolutionized gene editing both at single genes and in multiplexed loss-of-function screens, thus enabling precise genome-scale identification of genes essential for proliferation and survival of cancer cells. However, previous studies have reported that a gene-independent antiproliferative effect of Cas9-mediated DNA cleavage confounds such measurement of genetic dependency, thereby leading to false-positive results in copy number-amplified regions. We developed CERES, a computational method to estimate gene-dependency levels from CRISPR-Cas9 essentiality screens while accounting for the copy number-specific effect. In our efforts to define a cancer dependency map, we performed genome-scale CRISPR-Cas9 essentiality screens across 342 cancer cell lines and applied CERES to this data set. We found that CERES decreased false-positive results and estimated sgRNA activity for both this data set and previously published screens performed with different sgRNA libraries. We further demonstrate the utility of this collection of screens, after CERES correction, for identifying cancer-type-specific vulnerabilities.
Selection on plant male function genes identifies candidates for reproductive isolation of yellow monkeyflowers.

PubMed

Aagaard, Jan E; George, Renee D; Fishman, Lila; Maccoss, Michael J; Swanson, Willie J

2013-01-01

Understanding the genetic basis of reproductive isolation promises insight into speciation and the origins of biological diversity. While progress has been made in identifying genes underlying barriers to reproduction that function after fertilization (post-zygotic isolation), we know much less about earlier acting pre-zygotic barriers. Of particular interest are barriers involved in mating and fertilization that can evolve extremely rapidly under sexual selection, suggesting they may play a prominent role in the initial stages of reproductive isolation. A significant challenge to the field of speciation genetics is developing new approaches for identification of candidate genes underlying these barriers, particularly among non-traditional model systems. We employ powerful proteomic and genomic strategies to study the genetic basis of conspecific pollen precedence, an important component of pre-zygotic reproductive isolation among yellow monkeyflowers (Mimulus spp.) resulting from male pollen competition. We use isotopic labeling in combination with shotgun proteomics to identify more than 2,000 male function (pollen tube) proteins within maternal reproductive structures (styles) of M. guttatus flowers where pollen competition occurs. We then sequence array-captured pollen tube exomes from a large outcrossing population of M. guttatus, and identify those genes with evidence of selective sweeps or balancing selection consistent with their role in pollen competition. We also test for evidence of positive selection on these genes more broadly across yellow monkeyflowers, because a signal of adaptive divergence is a common feature of genes causing reproductive isolation. Together the molecular evolution studies identify 159 pollen tube proteins that are candidate genes for conspecific pollen precedence. Our work demonstrates how powerful proteomic and genomic tools can be readily adapted to non-traditional model systems, allowing for genome-wide screens towards the

Comparative Transcriptional Profiling of the Axolotl Limb Identifies a Tripartite Regeneration-Specific Gene Program

PubMed Central

Knapp, Dunja; Schulz, Herbert; Rascon, Cynthia Alexander; Volkmer, Michael; Scholz, Juliane; Nacu, Eugen; Le, Mu; Novozhilov, Sergey; Tazaki, Akira; Protze, Stephanie; Jacob, Tina; Hubner, Norbert; Habermann, Bianca; Tanaka, Elly M.

2013-01-01

Understanding how the limb blastema is established after the initial wound healing response is an important aspect of regeneration research. Here we performed parallel expression profile time courses of healing lateral wounds versus amputated limbs in axolotl. This comparison between wound healing and regeneration allowed us to identify amputation-specific genes. By clustering the expression profiles of these samples, we could detect three distinguishable phases of gene expression – early wound healing followed by a transition-phase leading to establishment of the limb development program, which correspond to the three phases of limb regeneration that had been defined by morphological criteria. By focusing on the transition-phase, we identified 93 strictly amputation-associated genes many of which are implicated in oxidative-stress response, chromatin modification, epithelial development or limb development. We further classified the genes based on whether they were or were not significantly expressed in the developing limb bud. The specific localization of 53 selected candidates within the blastema was investigated by in situ hybridization. In summary, we identified a set of genes that are expressed specifically during regeneration and are therefore, likely candidates for the regulation of blastema formation. PMID:23658691
Genome-wide gene by lead exposure interaction analysis identifies UNC5D as a candidate gene for neurodevelopment.

PubMed

Wang, Zhaoxi; Claus Henn, Birgit; Wang, Chaolong; Wei, Yongyue; Su, Li; Sun, Ryan; Chen, Han; Wagner, Peter J; Lu, Quan; Lin, Xihong; Wright, Robert; Bellinger, David; Kile, Molly; Mazumdar, Maitreyi; Tellez-Rojo, Martha Maria; Schnaas, Lourdes; Christiani, David C

2017-07-28

Neurodevelopment is a complex process involving both genetic and environmental factors. Prenatal exposure to lead (Pb) has been associated with lower performance on neurodevelopmental tests. Adverse neurodevelopmental outcomes are more frequent and/or more severe when toxic exposures interact with genetic susceptibility. To explore possible loci associated with increased susceptibility to prenatal Pb exposure, we performed a genome-wide gene-environment interaction study (GWIS) in young children from Mexico (n = 390) and Bangladesh (n = 497). Prenatal Pb exposure was estimated by cord blood Pb concentration. Neurodevelopment was assessed using the Bayley Scales of Infant Development. We identified a locus on chromosome 8, containing UNC5D, and demonstrated evidence of its genome-wide significance with mental composite scores (rs9642758, p_meta = 4.35 × 10^-6). Within this locus, the joint effects of two independent single nucleotide polymorphisms (SNPs, rs9642758 and rs10503970) had a p-value of 4.38 × 10^-9 for mental composite scores. Correlating GWIS results with in vitro transcriptomic profiles identified one common gene, SLC1A5, which is involved in synaptic function, neuronal development, and excitotoxicity. Further analysis revealed interconnected interactions that formed a large network of 52 genes enriched with oxidative stress genes and neurodevelopmental genes. Our findings suggest that certain genetic polymorphisms within/near genes relevant to neurodevelopment might modify the toxic effects of Pb exposure via oxidative stress.
Preferential Allele Expression Analysis Identifies Shared Germline and Somatic Driver Genes in Advanced Ovarian Cancer

PubMed Central

Halabi, Najeeb M.; Martinez, Alejandra; Al-Farsi, Halema; Mery, Eliane; Puydenus, Laurence; Pujol, Pascal; Khalak, Hanif G.; McLurcan, Cameron; Ferron, Gwenael; Querleu, Denis; Al-Azwani, Iman; Al-Dous, Eman; Mohamoud, Yasmin A.; Malek, Joel A.; Rafii, Arash

2016-01-01

Identifying genes where a variant allele is preferentially expressed in tumors could lead to a better understanding of cancer biology and optimization of targeted therapy. However, tumor sample heterogeneity complicates standard approaches for detecting preferential allele expression. We therefore developed a novel approach combining genome and transcriptome sequencing data from the same sample that corrects for sample heterogeneity and identifies significant preferentially expressed alleles. We applied this analysis to epithelial ovarian cancer samples consisting of matched primary ovary and peritoneum and lymph node metastasis. We find that preferentially expressed variant alleles include germline and somatic variants, are shared at a relatively high frequency between patients, and are in gene networks known to be involved in cancer processes. Analysis at a patient level identifies patient-specific preferentially expressed alleles in genes that are targets for known drugs. Analysis at a site level identifies patterns of site specific preferential allele expression with similar pathways being impacted in the primary and metastasis sites. We conclude that genes with preferentially expressed variant alleles can act as cancer drivers and that targeting those genes could lead to new therapeutic strategies. PMID:26735499
An Optimal Mean Based Block Robust Feature Extraction Method to Identify Colorectal Cancer Genes with Integrated Data.

PubMed

Liu, Jian; Cheng, Yuhu; Wang, Xuesong; Zhang, Lin; Liu, Hui

2017-08-17

It is urgent to diagnose colorectal cancer in the early stage. Some feature genes which are important to colorectal cancer development have been identified. However, for the early stage of colorectal cancer, less is known about the identity of specific cancer genes that are associated with advanced clinical stage. In this paper, we conducted a feature extraction method named Optimal Mean based Block Robust Feature Extraction method (OMBRFE) to identify feature genes associated with advanced colorectal cancer in clinical stage by using the integrated colorectal cancer data. Firstly, based on the optimal mean and L2,1-norm, a novel feature extraction method called Optimal Mean based Robust Feature Extraction method (OMRFE) is proposed to identify feature genes. Then the OMBRFE method which introduces the block ideology into OMRFE method is put forward to process the colorectal cancer integrated data which includes multiple genomic data: copy number alterations, somatic mutations, methylation expression alteration, as well as gene expression changes. Experimental results demonstrate that the OMBRFE is more effective than previous methods in identifying the feature genes. Moreover, genes identified by OMBRFE are verified to be closely associated with advanced colorectal cancer in clinical stage.
The polyketide synthase gene pks4 is essential for sexual development and regulates fruiting body morphology in Sordaria macrospora.

PubMed

Schindler, Daniel; Nowrousian, Minou

2014-07-01

Filamentous ascomycetes have long been known as producers of a variety of secondary metabolites, many of which have toxic effects on other organisms. However, the role of these metabolites in the biology of the fungi that produce them remains in most cases enigmatic. A major group of fungal secondary metabolites are polyketides. They are chemically diverse, but have in common that their chemical scaffolds are synthesized by polyketide synthases (PKSs). In a previous study, we analyzed development-dependent expression of pks genes in the filamentous ascomycete Sordaria macrospora. Here, we show that a deletion mutant of the pks4 gene is sterile, producing only protoperithecia but no mature perithecia, whereas overexpression of pks4 leads to enlarged, malformed fruiting bodies. Thus, correct expression levels of pks4 are essential for wild type-like perithecia formation. The predicted PKS4 protein has a domain structure that is similar to homologs in other fungi, but conserved residues of a methyl transferase domain present in other fungi are mutated in PKS4. Expression of several developmental genes is misregulated in the pks4 mutant. Surprisingly, the development-associated app gene is not downregulated in the mutant, in contrast to all other previously studied mutants with a block at the protoperithecial stage. Our data show that the polyketide synthase gene pks4 is essential for sexual development and plays a role in regulating fruiting body morphology. Copyright © 2014 Elsevier Inc. All rights reserved.
The transcription factor MTF-1 is essential for basal and heavy metal-induced metallothionein gene expression.

PubMed

Heuchel, R; Radtke, F; Georgiev, O; Stark, G; Aguet, M; Schaffner, W

1994-06-15

We have described and cloned previously a factor (MTF-1) that binds specifically to heavy metal-responsive DNA sequence elements in the enhancer/promoter region of metallothionein genes. MTF-1 is a protein of 72.5 kDa that contains six zinc fingers and multiple domains for transcriptional activation. Here we report the disruption of both alleles of the MTF-1 gene in mouse embryonic stem cells by homologous recombination. The resulting null mutant cell line fails to produce detectable amounts of MTF-1. Moreover, due to the loss of MTF-1, the endogenous metallothionein I and II genes are silent, indicating that MTF-1 is required for both their basal and zinc-induced transcription. In addition to zinc, other heavy metals, including cadmium, copper, nickel and lead, also fail to activate metal-responsive promoters in null mutant cells. However, cotransfection of an MTF-1 expression vector and metal-responsive reporter genes yields strong basal transcription that can be further boosted by zinc treatment of cells. These results demonstrate that MTF-1 is essential for metallothionein gene regulation. Finally, we present evidence that MTF-1 itself is a zinc sensor, which exhibits increased DNA binding activity upon zinc treatment.
Exome sequencing in amyotrophic lateral sclerosis identifies risk genes and pathways.

PubMed

Cirulli, Elizabeth T; Lasseigne, Brittany N; Petrovski, Slavé; Sapp, Peter C; Dion, Patrick A; Leblond, Claire S; Couthouis, Julien; Lu, Yi-Fan; Wang, Quanli; Krueger, Brian J; Ren, Zhong; Keebler, Jonathan; Han, Yujun; Levy, Shawn E; Boone, Braden E; Wimbish, Jack R; Waite, Lindsay L; Jones, Angela L; Carulli, John P; Day-Williams, Aaron G; Staropoli, John F; Xin, Winnie W; Chesi, Alessandra; Raphael, Alya R; McKenna-Yasek, Diane; Cady, Janet; Vianney de Jong, J M B; Kenna, Kevin P; Smith, Bradley N; Topp, Simon; Miller, Jack; Gkazi, Athina; Al-Chalabi, Ammar; van den Berg, Leonard H; Veldink, Jan; Silani, Vincenzo; Ticozzi, Nicola; Shaw, Christopher E; Baloh, Robert H; Appel, Stanley; Simpson, Ericka; Lagier-Tourenne, Clotilde; Pulst, Stefan M; Gibson, Summer; Trojanowski, John Q; Elman, Lauren; McCluskey, Leo; Grossman, Murray; Shneider, Neil A; Chung, Wendy K; Ravits, John M; Glass, Jonathan D; Sims, Katherine B; Van Deerlin, Vivianna M; Maniatis, Tom; Hayes, Sebastian D; Ordureau, Alban; Swarup, Sharan; Landers, John; Baas, Frank; Allen, Andrew S; Bedlack, Richard S; Harper, J Wade; Gitler, Aaron D; Rouleau, Guy A; Brown, Robert; Harms, Matthew B; Cooper, Gregory M; Harris, Tim; Myers, Richard M; Goldstein, David B

2015-03-27

Amyotrophic lateral sclerosis (ALS) is a devastating neurological disease with no effective treatment. We report the results of a moderate-scale sequencing study aimed at increasing the number of genes known to contribute to predisposition for ALS. We performed whole-exome sequencing of 2869 ALS patients and 6405 controls. Several known ALS genes were found to be associated, and TBK1 (the gene encoding TANK-binding kinase 1) was identified as an ALS gene. TBK1 is known to bind to and phosphorylate a number of proteins involved in innate immunity and autophagy, including optineurin (OPTN) and p62 (SQSTM1/sequestosome), both of which have also been implicated in ALS. These observations reveal a key role of the autophagic pathway in ALS and suggest specific targets for therapeutic intervention. Copyright © 2015, American Association for the Advancement of Science.
Essential Genes for In Vitro Growth of the Endophyte Herbaspirillum seropedicae SmR1 as Revealed by Transposon Insertion Site Sequencing.

PubMed

Rosconi, Federico; de Vries, Stefan P W; Baig, Abiyad; Fabiano, Elena; Grant, Andrew J

2016-11-15

The interior of plants contains microorganisms (referred to as endophytes) that are distinct from those present at the root surface or in the surrounding soil. Herbaspirillum seropedicae strain SmR1, belonging to the betaproteobacteria, is an endophyte that colonizes crops, including rice, maize, sugarcane, and sorghum. Different approaches have revealed genes and pathways regulated during the interactions of H. seropedicae with its plant hosts. However, functional genomic analysis of transposon (Tn) mutants has been hampered by the lack of genetic tools. Here we successfully employed a combination of in vivo high-density mariner Tn mutagenesis and targeted Tn insertion site sequencing (Tn-seq) in H. seropedicae SmR1. The analysis of multiple gene-saturating Tn libraries revealed that 395 genes are essential for the growth of H. seropedicae SmR1 in tryptone-yeast extract medium. A comparative analysis with the Database of Essential Genes (DEG) showed that 25 genes are uniquely essential in H. seropedicae SmR1. The Tn mutagenesis protocol developed and the gene-saturating Tn libraries generated will facilitate elucidation of the genetic mechanisms of the H. seropedicae endophytic lifestyle. A focal point in the study of endophytes is the development of effective biofertilizers that could help to reduce the input of agrochemicals in croplands. Besides the ability to promote plant growth, a good biofertilizer should be successful in colonizing its host and competing against the native microbiota. By using a systematic Tn-based gene-inactivation strategy and massively parallel sequencing of Tn insertion sites (Tn-seq), it is possible to study the fitness of thousands of Tn mutants in a single experiment. We have applied the combination of these techniques to the plant-growth-promoting endophyte Herbaspirillum seropedicae SmR1. The Tn mutant libraries generated will enable studies into the genetic mechanisms of H. seropedicae-plant interactions. The approach that we
Essential Genes for In Vitro Growth of the Endophyte Herbaspirillum seropedicae SmR1 as Revealed by Transposon Insertion Site Sequencing

PubMed Central

Rosconi, Federico; de Vries, Stefan P. W.; Baig, Abiyad; Fabiano, Elena

2016-01-01

ABSTRACT The interior of plants contains microorganisms (referred to as endophytes) that are distinct from those present at the root surface or in the surrounding soil. Herbaspirillum seropedicae strain SmR1, belonging to the betaproteobacteria, is an endophyte that colonizes crops, including rice, maize, sugarcane, and sorghum. Different approaches have revealed genes and pathways regulated during the interactions of H. seropedicae with its plant hosts. However, functional genomic analysis of transposon (Tn) mutants has been hampered by the lack of genetic tools. Here we successfully employed a combination of in vivo high-density mariner Tn mutagenesis and targeted Tn insertion site sequencing (Tn-seq) in H. seropedicae SmR1. The analysis of multiple gene-saturating Tn libraries revealed that 395 genes are essential for the growth of H. seropedicae SmR1 in tryptone-yeast extract medium. A comparative analysis with the Database of Essential Genes (DEG) showed that 25 genes are uniquely essential in H. seropedicae SmR1. The Tn mutagenesis protocol developed and the gene-saturating Tn libraries generated will facilitate elucidation of the genetic mechanisms of the H. seropedicae endophytic lifestyle. IMPORTANCE A focal point in the study of endophytes is the development of effective biofertilizers that could help to reduce the input of agrochemicals in croplands. Besides the ability to promote plant growth, a good biofertilizer should be successful in colonizing its host and competing against the native microbiota. By using a systematic Tn-based gene-inactivation strategy and massively parallel sequencing of Tn insertion sites (Tn-seq), it is possible to study the fitness of thousands of Tn mutants in a single experiment. We have applied the combination of these techniques to the plant-growth-promoting endophyte Herbaspirillum seropedicae SmR1. The Tn mutant libraries generated will enable studies into the genetic mechanisms of H. seropedicae-plant interactions. The
Marek's disease virus protein kinase gene identified within the short unique region of the viral genome is not essential for viral replication in cell culture and vaccine-induced immunity in chickens.

PubMed

Sakaguchi, M; Urakawa, T; Hirayama, Y; Miki, N; Yamamoto, M; Zhu, G S; Hirai, K

1993-07-01

The open reading frame (ORF) of 1206 bp within the short unique region (Us) of Marek's disease virus type 1 (MDV1) shows significant homology with the herpes simplex virus type 1 US3 gene encoding protein kinase (PK). The lacZ gene of Escherichia coli was inserted within the ORF, designated MDV1-US3, of MDV1 K544 strain DNA by homologous recombination. The plaque-purified recombinant MDV1 stably expressed the beta-galactosidase encoded by the inserted lacZ gene in infected cells and replicated well as the parental K544 strain. Antibodies against both MDV1 antigen and beta-galactosidase were detected in the sera of chickens immunized with recombinant MDV1. Chickens vaccinated with the recombinant MDV1 were protected from challenge with virulent MDV1. The MDV1 US3 gene expressed by a baculovirus vector encoded a 44-kDa protein. Mouse antisera against the 44-kDa protein reacted with two proteins of 44 and 45 kDa in extracts of cells infected with MDV1 but not with MDV types 2 or 3. The PK activity was detected in immune complexes of the anti-44-kDa sera with extracts of cells infected with MDV1 but not with the recombinant MDV1. Thus, PK encoded from the MDV1-US3 is not essential for virus replication in cell culture and vaccine-induced immunity.
PIMMS (Pragmatic Insertional Mutation Mapping System) Laboratory Methodology a Readily Accessible Tool for Identification of Essential Genes in Streptococcus

PubMed Central

Blanchard, Adam M.; Egan, Sharon A.; Emes, Richard D.; Warry, Andrew; Leigh, James A.

2016-01-01

The Pragmatic Insertional Mutation Mapping (PIMMS) laboratory protocol was developed alongside various bioinformatics packages (Blanchard et al., 2015) to enable detection of essential and conditionally essential genes in Streptococcus and related bacteria. This extended the methodology commonly used to locate insertional mutations in individual mutants to the analysis of mutations in populations of bacteria. In Streptococcus uberis, a pyogenic Streptococcus associated with intramammary infection and mastitis in ruminants, the mutagen pGhost9:ISS1 was shown to integrate across the entire genome. Analysis of >80,000 mutations revealed 196 coding sequences that were not mutated and a further 67 where mutation only occurred beyond the 90th percentile of the coding sequence. These sequences showed good concordance with sequences within the Database of Essential Genes and typically matched sequences known to be associated with basic cellular functions. Due to the broad utility of this mutagen and the simplicity of the methodology, it is anticipated that PIMMS will be of value to a wide range of laboratories in functional genomic analysis of a wide range of Gram-positive bacteria (Streptococcus, Enterococcus, and Lactococcus) of medical, veterinary, and industrial significance. PMID:27826289
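The two categories reported in the abstract above (coding sequences with no insertions, and coding sequences mutated only beyond the 90th percentile of their length) can be illustrated with a minimal Python sketch. This is not the PIMMS software: the `CodingSequence` type, the `classify` function, the coordinates, and the strand handling are all hypothetical and serve only to show the kind of rule being described.

```python
from dataclasses import dataclass


@dataclass
class CodingSequence:
    """Hypothetical gene record: 1-based inclusive genome coordinates."""
    name: str
    start: int
    end: int
    strand: str = "+"


def relative_position(cds: CodingSequence, site: int) -> float:
    """Fraction of the coding sequence (0 = 5' end) at which an insertion falls."""
    length = cds.end - cds.start + 1
    # On the minus strand the 5' end of the gene is at the higher coordinate.
    offset = site - cds.start if cds.strand == "+" else cds.end - site
    return offset / length


def classify(cds: CodingSequence, insertion_sites, tail_cutoff: float = 0.90) -> str:
    """Label a gene by where (if anywhere) insertions were observed in it."""
    inside = [s for s in insertion_sites if cds.start <= s <= cds.end]
    if not inside:
        return "no_insertions"  # candidate essential gene
    if all(relative_position(cds, s) > tail_cutoff for s in inside):
        return "tail_only"      # insertions tolerated only near the 3' end
    return "mutated"


# Made-up genes and insertion coordinates, for illustration only.
genes = [
    CodingSequence("geneA", 1, 1000),
    CodingSequence("geneB", 2001, 3000),
    CodingSequence("geneC", 4001, 5000, strand="-"),
    CodingSequence("geneD", 6001, 7000),
]
sites = [2950, 2990, 4020, 6100, 6990]
labels = {g.name: classify(g, sites) for g in genes}
```

A real analysis would also have to account for library saturation and gene length, since a short gene can lack insertions by chance.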
MADGiC: a model-based approach for identifying driver genes in cancer

PubMed Central

Korthauer, Keegan D.; Kendziorski, Christina

2015-01-01

Motivation: Identifying and prioritizing somatic mutations is an important and challenging area of cancer research that can provide new insights into gene function as well as new targets for drug development. Most methods for prioritizing mutations rely primarily on frequency-based criteria, where a gene is identified as having a driver mutation if it is altered in significantly more samples than expected according to a background model. Although useful, frequency-based methods are limited in that all mutations are treated equally. It is well known, however, that some mutations have no functional consequence, while others may have a major deleterious impact. The spatial pattern of mutations within a gene provides further insight into their functional consequence. Properly accounting for these factors improves both the power and accuracy of inference. Also important is an accurate background model. Results: Here, we develop a Model-based Approach for identifying Driver Genes in Cancer (termed MADGiC) that incorporates both frequency and functional impact criteria and accommodates a number of factors to improve the background model. Simulation studies demonstrate advantages of the approach, including a substantial increase in power over competing methods. Further advantages are illustrated in an analysis of ovarian and lung cancer data from The Cancer Genome Atlas (TCGA) project. Availability and implementation: R code to implement this method is available at http://www.biostat.wisc.edu/~kendzior/MADGiC/. Contact: kendzior@biostat.wisc.edu. Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25573922
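The frequency-based criterion that the abstract above describes as the common baseline (a gene is flagged when it is altered in significantly more samples than a background model predicts) can be sketched as a one-sided binomial test. This is only an illustration of that baseline, not MADGiC itself, which additionally models functional impact and spatial patterns. The single shared `background_rate`, the Bonferroni correction, the gene names, and the counts are all assumptions made for the example.

```python
from math import comb


def binomial_upper_tail(k: int, n: int, p: float) -> float:
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def frequency_based_drivers(mutated_counts, n_samples, background_rate, alpha=0.05):
    """Return {gene: p-value} for genes mutated in more samples than expected.

    mutated_counts maps gene -> number of samples carrying a mutation.
    background_rate is the assumed per-sample probability that a gene is
    mutated by chance; real background models vary by gene and sample.
    """
    threshold = alpha / len(mutated_counts)  # Bonferroni correction (assumed)
    flagged = {}
    for gene, k in mutated_counts.items():
        p_value = binomial_upper_tail(k, n_samples, background_rate)
        if p_value < threshold:
            flagged[gene] = p_value
    return flagged


# Hypothetical counts: two genes observed across 100 tumour samples.
counts = {"GENE_A": 30, "GENE_B": 3}
drivers = frequency_based_drivers(counts, n_samples=100, background_rate=0.02)
```

Note that the test treats every mutation in a gene identically, which is exactly the limitation the abstract raises.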
A Kinome RNAi Screen in Drosophila Identifies Novel Genes Interacting with Lgl, aPKC, and Crb Cell Polarity Genes in Epithelial Tissues.

PubMed

Parsons, Linda M; Grzeschik, Nicola A; Amaratunga, Kasun; Burke, Peter; Quinn, Leonie M; Richardson, Helena E

2017-08-07

In both Drosophila melanogaster and mammalian systems, epithelial structure and underlying cell polarity are essential for proper tissue morphogenesis and organ growth. Cell polarity interfaces with multiple cellular processes that are regulated by the phosphorylation status of large protein networks. To gain insight into the molecular mechanisms that coordinate cell polarity with tissue growth, we screened a boutique collection of RNAi stocks targeting the kinome for their capacity to modify Drosophila "cell polarity" eye and wing phenotypes. Initially, we identified kinase or phosphatase genes whose depletion modified adult eye phenotypes associated with the manipulation of cell polarity complexes (via overexpression of Crb or aPKC). We next conducted a secondary screen to test whether these cell polarity modifiers altered tissue overgrowth associated with depletion of Lgl in the wing. These screens identified Hippo, Jun kinase (JNK), and Notch signaling pathways, previously linked to cell polarity regulation of tissue growth. Furthermore, novel pathways not previously connected to cell polarity regulation of tissue growth were identified, including Wingless (Wg/Wnt), Ras, and lipid/Phospho-inositol-3-kinase (PI3K) signaling pathways. Additionally, we demonstrated that the "nutrient sensing" kinases Salt Inducible Kinase 2 and 3 (SIK2 and 3) are potent modifiers of cell polarity phenotypes and regulators of tissue growth. Overall, our screen has revealed novel cell polarity-interacting kinases and phosphatases that affect tissue growth, providing a platform for investigating molecular mechanisms coordinating cell polarity and tissue growth during development. Copyright © 2017 Parsons et al.
Novel Association of WNK4 Gene, Ala589Ser Polymorphism in Essential Hypertension, and Type 2 Diabetes Mellitus in Malaysia.

PubMed

Ghodsian, Nooshin; Ismail, Patimah; Ahmadloo, Salma; Heidari, Farzad; Haghvirdizadeh, Polin; Ataollahi Eshkoor, Sima; Etemad, Ali

2016-01-01

With-no-lysine (K) kinase-4 (WNK4) belongs to a family of unique serine/threonine protein kinases genetically associated with an autosomal dominant form of hypertension. Conflicting results have recently been reported on the association between specific single nucleotide polymorphisms of the WNK4 gene and essential hypertension (EHT). The aim of this study was to determine the association of the Ala589Ser polymorphism of the WNK4 gene with essential hypertension in patients in Malaysia. The WNK4 gene polymorphism was genotyped using mutagenically separated polymerase chain reaction (PCR) and the restriction fragment length polymorphism (RFLP) method in 320 subjects, including 163 cases and 157 controls. A close relation between the Ala589Ser polymorphism and elevated systolic and diastolic blood pressure (SBP and DBP) was observed. Sociodemographic factors including body mass index (BMI), age, and the levels of fasting blood sugar (FBS), low density lipoprotein (LDL), and triglyceride (TG) differed significantly between the cases and healthy subjects (p < 0.05). The distribution of allele frequency and genotype of the WNK4 gene Ala589Ser polymorphism differed significantly (p < 0.05) between EHT subjects with or without type 2 diabetes mellitus (T2DM) and normotensive subjects. The WNK4 gene variation significantly influences blood pressure increase. Ala589Ser probably affects enzymatic activity, leading to enhanced predisposition to the disorder.
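A case-control comparison of allele frequencies such as the one summarized above is often carried out with a Pearson chi-square test on a 2x2 table of allele counts. The abstract does not name the test that was used, so the choice of test here is an assumption, and the allele counts below are invented; only the group sizes (163 cases and 157 controls, giving 326 and 314 alleles) come from the abstract.

```python
from math import erfc, sqrt


def allele_chi_square(case_minor, case_major, control_minor, control_major):
    """Pearson chi-square statistic and p-value (1 degree of freedom) for a 2x2 allele table."""
    a, b, c, d = case_minor, case_major, control_minor, control_major
    n = a + b + c + d
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    if denominator == 0:
        raise ValueError("every row and column of the table must be non-empty")
    chi2 = n * (a * d - b * c) ** 2 / denominator
    # For 1 degree of freedom the chi-square survival function is erfc(sqrt(x / 2)).
    p_value = erfc(sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical minor-allele counts; 2 alleles per subject.
chi2, p_value = allele_chi_square(
    case_minor=60, case_major=266,        # 163 cases -> 326 alleles
    control_minor=30, control_major=284,  # 157 controls -> 314 alleles
)
significant = p_value < 0.05
```

With small expected counts an exact test would be the safer choice; the sketch omits continuity correction for brevity.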
Delta-amino-levulinic acid dehydratase gene and essential tremor.

PubMed

Agúndez, José A G; García-Martín, Elena; Alonso-Navarro, Hortensia; Ayuso, Pedro; Esguevillas, Gara; Benito-León, Julián; Ortega-Cubero, Sara; Pastor, Pau; López-Alburquerque, Tomás; Jiménez-Jiménez, Félix Javier

2017-05-01

Several reports found a relationship between increased serum lead levels and the risk for essential tremor (ET), especially in carriers of the minor allele of the single nucleotide polymorphism (SNP) rs1800435 in the aminolevulinate dehydratase (ALAD) gene, which is involved in the synthesis of haem groups. Our group reported decreased risk for ET in carriers of the minor alleles of the rs2071746 and rs1051308 SNPs in the haem-oxygenases 1 and 2 (HMOX1 and HMOX2), respectively, involved in haem metabolism. We analysed whether ALAD rs1800435 alone and its interactions with the four common SNPs in the HMOX1 and HMOX2 genes are associated with the risk for ET. We analysed the genotype and allele variant frequencies of ALAD rs1800435 in 202 patients with familial ET and 218 healthy controls using a TaqMan method. We also analysed the role of the interaction between ALAD rs1800435 and the HMOX1 rs2071746, HMOX1 rs2071747, HMOX2 rs2270363 and HMOX2 rs1051308 in the risk of developing ET. The frequencies of genotype and allelic variants of ALAD rs1800435 did not differ significantly between patients with ET and controls, and were not influenced by gender. Subjects carrying the ALAD rs1800435CC genotype (wild-type) and the HMOX2 rs1051308GG genotype or the HMOX2 rs1051308G allele had significantly decreased risk for ET. These results suggest that the ALAD rs1800435 SNP is not related to the risk for ET, but its interaction with the HMOX2 rs1051308 SNP could be weakly associated with the risk for this disease. © 2017 Stichting European Society for Clinical Investigation Journal Foundation.
Genetic effects on gene expression across human tissues

PubMed Central

2017-01-01

Characterization of the molecular function of the human genome and its variation across individuals is essential for identifying the cellular mechanisms that underlie human genetic traits and diseases. The Genotype-Tissue Expression (GTEx) project aims to characterize variation in gene expression levels across individuals and diverse tissues of the human body, many of which are not easily accessible. Here we describe genetic effects on gene expression levels across 44 human tissues. We find that local genetic variation affects gene expression levels for the majority of genes, and we further identify inter-chromosomal genetic effects for 93 genes and 112 loci. On the basis of the identified genetic effects, we characterize patterns of tissue specificity, compare local and distal effects, and evaluate the functional properties of the genetic effects. We also demonstrate that multi-tissue, multi-individual data can be used to identify genes and pathways affected by human disease-associated variation, enabling a mechanistic interpretation of gene regulation and the genetic basis of disease. PMID:29022597
Genetic effects on gene expression across human tissues.

PubMed

Battle, Alexis; Brown, Christopher D; Engelhardt, Barbara E; Montgomery, Stephen B

2017-10-11

Characterization of the molecular function of the human genome and its variation across individuals is essential for identifying the cellular mechanisms that underlie human genetic traits and diseases. The Genotype-Tissue Expression (GTEx) project aims to characterize variation in gene expression levels across individuals and diverse tissues of the human body, many of which are not easily accessible. Here we describe genetic effects on gene expression levels across 44 human tissues. We find that local genetic variation affects gene expression levels for the majority of genes, and we further identify inter-chromosomal genetic effects for 93 genes and 112 loci. On the basis of the identified genetic effects, we characterize patterns of tissue specificity, compare local and distal effects, and evaluate the functional properties of the genetic effects. We also demonstrate that multi-tissue, multi-individual data can be used to identify genes and pathways affected by human disease-associated variation, enabling a mechanistic interpretation of gene regulation and the genetic basis of disease.
The DEAD-box RNA helicase Ddx39ab is essential for myocyte and lens development in zebrafish.

PubMed

Zhang, Linlin; Yang, Yuxi; Li, Beibei; Scott, Ian C; Lou, Xin

2018-04-23

RNA helicases from the DEAD-box family are found in almost all organisms and have important roles in RNA metabolism, including RNA synthesis, processing and degradation. The function and mechanism of action of most of these helicases in animal development and human disease remain largely unexplored. In a zebrafish mutagenesis screen to identify genes essential for heart development we identified a mutant that disrupts the gene encoding the RNA helicase DEAD-box 39ab (ddx39ab). Homozygous ddx39ab mutant embryos exhibit profound cardiac and trunk muscle dystrophy, along with lens abnormalities, caused by abrupt terminal differentiation of cardiomyocyte, myoblast and lens fiber cells. Loss of ddx39ab hindered splicing of mRNAs encoding epigenetic regulatory factors, including members of the KMT2 gene family, leading to misregulation of structural gene expression in cardiomyocyte, myoblast and lens fiber cells. Taken together, these results show that Ddx39ab plays an essential role in establishment of the proper epigenetic status during differentiation of multiple cell lineages. © 2018. Published by The Company of Biologists Ltd.
A Sleeping Beauty forward genetic screen identifies new genes and pathways driving osteosarcoma development and metastasis

PubMed Central

Moriarity, Branden S; Otto, George M; Rahrmann, Eric P; Rathe, Susan K; Wolf, Natalie K; Weg, Madison T; Manlove, Luke A; LaRue, Rebecca S; Temiz, Nuri A; Molyneux, Sam D; Choi, Kwangmin; Holly, Kevin J; Sarver, Aaron L; Scott, Milcah C; Forster, Colleen L; Modiano, Jaime F; Khanna, Chand; Hewitt, Stephen M; Khokha, Rama; Yang, Yi; Gorlick, Richard; Dyer, Michael A; Largaespada, David A

2016-01-01

Osteosarcomas are sarcomas of the bone, derived from osteoblasts or their precursors, with a high propensity to metastasize. Osteosarcoma is associated with massive genomic instability, making it problematic to identify driver genes using human tumors or prototypical mouse models, many of which involve loss of Trp53 function. To identify the genes driving osteosarcoma development and metastasis, we performed a Sleeping Beauty (SB) transposon-based forward genetic screen in mice with and without somatic loss of Trp53. Common insertion site (CIS) analysis of 119 primary tumors and 134 metastatic nodules identified 232 sites associated with osteosarcoma development and 43 sites associated with metastasis, respectively. Analysis of CIS-associated genes identified numerous known and new osteosarcoma-associated genes enriched in the ErbB, PI3K-AKT-mTOR and MAPK signaling pathways. Lastly, we identified several oncogenes involved in axon guidance, including Sema4d and Sema6d, which we functionally validated as oncogenes in human osteosarcoma. PMID:25961939
Male specific genes from dioecious white campion identified by fluorescent differential display.

PubMed

Scutt, Charles P; Jenkins, Tom; Furuya, Masaki; Gilmartin, Philip M

2002-05-01

Fluorescent differential display (FDD) has been used to screen for cDNAs that are differentially up-regulated in male flowers of the dioecious plant Silene latifolia in which an X/Y chromosome system of sex determination operates. To adapt FDD to the cloning of large numbers of differential cDNAs, a novel method of confirming the differential expression of these has been devised. FDD gels were Southern electro-blotted and probed with mixtures of individual cDNA clones derived from different FDD product ligation reactions. These Southern blots were then stripped and re-probed with further mixtures of individual cloned FDD products to identify the maximum number of recombinant clones carrying the true differential amplification products. Of 135 differential bands identified by FDD, 56 differential amplification products were confirmed; these represent 23 unique differentially expressed genes as determined by virtual Northern analysis and two genes expressed at or below the level of detection by virtual Northern analysis. These two low expressed genes show bands of hybridization on genomic Southern blots that are specific to male plants, indicating that they are derived from, or closely related to, Y chromosome genes.

Gene-set analysis based on the pharmacological profiles of drugs to identify repurposing opportunities in schizophrenia.

PubMed

de Jong, Simone; Vidler, Lewis R; Mokrab, Younes; Collier, David A; Breen, Gerome

2016-08-01

Genome-wide association studies (GWAS) have identified thousands of novel genetic associations for complex genetic disorders, leading to the identification of potential pharmacological targets for novel drug development. In schizophrenia, 108 conservatively defined loci that meet genome-wide significance have been identified and hundreds of additional sub-threshold associations harbour information on the genetic aetiology of the disorder. In the present study, we used gene-set analysis based on the known binding targets of chemical compounds to identify the 'drug pathways' most strongly associated with schizophrenia-associated genes, with the aim of identifying potential drug repositioning opportunities and clues for novel treatment paradigms, especially in multi-target drug development. We compiled 9389 gene sets (2496 with unique gene content) and interrogated gene-based p-values from the PGC2-SCZ analysis. Although no single drug exceeded experiment-wide significance (corrected p<0.05), highly ranked gene-sets reaching suggestive significance included the dopamine receptor antagonists metoclopramide and trifluoperazine and the tyrosine kinase inhibitor neratinib. This is a proof-of-principle analysis showing the potential utility of GWAS data of schizophrenia for the direct identification of candidate drugs and molecules that show polypharmacy. © The Author(s) 2016.
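The idea of ranking drug-target gene sets against disease-associated genes can be illustrated with a simple over-representation test. This is a deliberately simplified stand-in: the study above worked from gene-based p-values, whereas the sketch below thresholds genes into "associated" or not and applies a hypergeometric test with a Bonferroni correction. The function names, drug labels, and gene identifiers are all hypothetical.

```python
from math import comb


def hypergeom_upper_tail(overlap: int, set_size: int, n_hits: int, n_genes: int) -> float:
    """P(X >= overlap) when set_size genes are drawn from n_genes, n_hits of which are associated."""
    total = comb(n_genes, set_size)
    upper = min(set_size, n_hits)
    favourable = sum(
        comb(n_hits, i) * comb(n_genes - n_hits, set_size - i)
        for i in range(overlap, upper + 1)
    )
    return favourable / total


def rank_gene_sets(gene_sets, associated, universe):
    """Return (set name, raw p, Bonferroni-corrected p) tuples, most enriched first."""
    associated = set(associated) & set(universe)
    ranked = []
    for name, genes in gene_sets.items():
        genes = set(genes) & set(universe)
        p = hypergeom_upper_tail(len(genes & associated), len(genes),
                                 len(associated), len(universe))
        ranked.append((name, p, min(1.0, p * len(gene_sets))))
    return sorted(ranked, key=lambda row: row[1])


# Toy data: 20 genes, 5 of them labelled as disease-associated.
universe = [f"g{i}" for i in range(20)]
associated = ["g0", "g1", "g2", "g3", "g4"]
drug_sets = {
    "drug_x": ["g0", "g1", "g2", "g3"],      # targets overlap the associated genes
    "drug_y": ["g10", "g11", "g12", "g13"],  # no overlap
}
ranking = rank_gene_sets(drug_sets, associated, universe)
```

Thresholding discards the sub-threshold signal that the abstract emphasises, which is one reason methods that use the full distribution of gene-based p-values are usually preferred.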
Genome-wide transposon mutagenesis of Proteus mirabilis: Essential genes, fitness factors for catheter-associated urinary tract infection, and the impact of polymicrobial infection on fitness requirements.

PubMed

Armbruster, Chelsie E; Forsyth-DeOrnellas, Valerie; Johnson, Alexandra O; Smith, Sara N; Zhao, Lili; Wu, Weisheng; Mobley, Harry L T

2017-06-01

The Gram-negative bacterium Proteus mirabilis is a leading cause of catheter-associated urinary tract infections (CAUTIs), which are often polymicrobial. Numerous prior studies have uncovered virulence factors for P. mirabilis pathogenicity in a murine model of ascending UTI, but little is known concerning pathogenesis during CAUTI or polymicrobial infection. In this study, we utilized five pools of 10,000 transposon mutants each and transposon insertion-site sequencing (Tn-Seq) to identify the full arsenal of P. mirabilis HI4320 fitness factors for single-species versus polymicrobial CAUTI with Providencia stuartii BE2467. 436 genes in the input pools lacked transposon insertions and were therefore concluded to be essential for P. mirabilis growth in rich medium. 629 genes were identified as P. mirabilis fitness factors during single-species CAUTI. Tn-Seq from coinfection with P. stuartii revealed 217/629 (35%) of the same genes as identified by single-species Tn-Seq, and 1353 additional factors that specifically contribute to colonization during coinfection. Mutants were constructed in eight genes of interest to validate the initial screen: 7/8 (88%) mutants exhibited the expected phenotypes for single-species CAUTI, and 3/3 (100%) validated the expected phenotypes for polymicrobial CAUTI. This approach provided validation of numerous previously described P. mirabilis fitness determinants from an ascending model of UTI, the discovery of novel fitness determinants specifically for CAUTI, and a stringent assessment of how polymicrobial infection influences fitness requirements. For instance, we describe a requirement for branched-chain amino acid biosynthesis by P. mirabilis during coinfection due to high-affinity import of leucine by P. stuartii. Further investigation of genes and pathways that provide a competitive advantage during both single-species and polymicrobial CAUTI will likely provide robust targets for therapeutic intervention to reduce P. mirabilis
Genome-wide transposon mutagenesis of Proteus mirabilis: Essential genes, fitness factors for catheter-associated urinary tract infection, and the impact of polymicrobial infection on fitness requirements

PubMed Central

Smith, Sara N.; Zhao, Lili; Wu, Weisheng

2017-01-01

The Gram-negative bacterium Proteus mirabilis is a leading cause of catheter-associated urinary tract infections (CAUTIs), which are often polymicrobial. Numerous prior studies have uncovered virulence factors for P. mirabilis pathogenicity in a murine model of ascending UTI, but little is known concerning pathogenesis during CAUTI or polymicrobial infection. In this study, we utilized five pools of 10,000 transposon mutants each and transposon insertion-site sequencing (Tn-Seq) to identify the full arsenal of P. mirabilis HI4320 fitness factors for single-species versus polymicrobial CAUTI with Providencia stuartii BE2467. 436 genes in the input pools lacked transposon insertions and were therefore concluded to be essential for P. mirabilis growth in rich medium. 629 genes were identified as P. mirabilis fitness factors during single-species CAUTI. Tn-Seq from coinfection with P. stuartii revealed 217/629 (35%) of the same genes as identified by single-species Tn-Seq, and 1353 additional factors that specifically contribute to colonization during coinfection. Mutants were constructed in eight genes of interest to validate the initial screen: 7/8 (88%) mutants exhibited the expected phenotypes for single-species CAUTI, and 3/3 (100%) validated the expected phenotypes for polymicrobial CAUTI. This approach provided validation of numerous previously described P. mirabilis fitness determinants from an ascending model of UTI, the discovery of novel fitness determinants specifically for CAUTI, and a stringent assessment of how polymicrobial infection influences fitness requirements. For instance, we describe a requirement for branched-chain amino acid biosynthesis by P. mirabilis during coinfection due to high-affinity import of leucine by P. stuartii. Further investigation of genes and pathways that provide a competitive advantage during both single-species and polymicrobial CAUTI will likely provide robust targets for therapeutic intervention to reduce P. mirabilis
Exome Sequencing and Linkage Analysis Identified Novel Candidate Genes in Recessive Intellectual Disability Associated with Ataxia.

PubMed

Jazayeri, Roshanak; Hu, Hao; Fattahi, Zohreh; Musante, Luciana; Abedini, Seyedeh Sedigheh; Hosseini, Masoumeh; Wienker, Thomas F; Ropers, Hans Hilger; Najmabadi, Hossein; Kahrizi, Kimia

2015-10-01

Intellectual disability (ID) is a neuro-developmental disorder which causes considerable socio-economic problems. Some ID individuals are also affected by ataxia, and the condition includes different mutations affecting several genes. We used whole exome sequencing (WES) in combination with homozygosity mapping (HM) to identify the genetic defects in five consanguineous families among our cohort study, with two affected children with ID and ataxia as major clinical symptoms. We identified three novel candidate genes, RIPPLY1, MRPL10, SNX14, and a new mutation in known gene SURF1. All are autosomal genes, except RIPPLY1, which is located on the X chromosome. Two are housekeeping genes, implicated in transcription and translation regulation and intracellular trafficking, and two encode mitochondrial proteins. The pathogenesis of these variants was evaluated by mutation classification, bioinformatic methods, review of medical and biological relevance, co-segregation studies in the particular family, and a normal population study. Linkage analysis and exome sequencing of a small number of affected family members is a powerful new technique which can be used to decrease the number of candidate genes in heterogenic disorders such as ID, and may even identify the responsible gene(s).
Transposon mutagenesis identifies genes and cellular processes driving epithelial-mesenchymal transition in hepatocellular carcinoma

PubMed Central

Kodama, Takahiro; Newberg, Justin Y.; Kodama, Michiko; Rangel, Roberto; Yoshihara, Kosuke; Tien, Jean C.; Parsons, Pamela H.; Wu, Hao; Finegold, Milton J.; Copeland, Neal G.; Jenkins, Nancy A.

2016-01-01

Epithelial-mesenchymal transition (EMT) is thought to contribute to metastasis and chemoresistance in patients with hepatocellular carcinoma (HCC), leading to their poor prognosis. The genes driving EMT in HCC are not yet fully understood, however. Here, we show that mobilization of Sleeping Beauty (SB) transposons in immortalized mouse hepatoblasts induces mesenchymal liver tumors on transplantation to nude mice. These tumors show significant down-regulation of epithelial markers, along with up-regulation of mesenchymal markers and EMT-related transcription factors (EMT-TFs). Sequencing of transposon insertion sites from tumors identified 233 candidate cancer genes (CCGs) that were enriched for genes and cellular processes driving EMT. Subsequent trunk driver analysis identified 23 CCGs that are predicted to function early in tumorigenesis and whose mutation or alteration in patients with HCC is correlated with poor patient survival. Validation of the top trunk drivers identified in the screen, including MET (MET proto-oncogene, receptor tyrosine kinase), GRB2-associated binding protein 1 (GAB1), HECT, UBA, and WWE domain containing 1 (HUWE1), lysine-specific demethylase 6A (KDM6A), and protein-tyrosine phosphatase, nonreceptor-type 12 (PTPN12), showed that deregulation of these genes activates an EMT program in human HCC cells that enhances tumor cell migration. Finally, deregulation of these genes in human HCC was found to confer sorafenib resistance through apoptotic tolerance and reduced proliferation, consistent with recent studies showing that EMT contributes to the chemoresistance of tumor cells. Our unique cell-based transposon mutagenesis screen appears to be an excellent resource for discovering genes involved in EMT in human HCC and potentially for identifying new drug targets. PMID:27247392
Two Closely Related Genes of Arabidopsis Encode Plastidial Cytidinediphosphate Diacylglycerol Synthases Essential for Photoautotrophic Growth

PubMed Central

Haselier, André; Akbari, Hana; Weth, Agnes; Baumgartner, Werner; Frentzen, Margrit

2010-01-01

Cytidinediphosphate diacylglycerol synthase (CDS) catalyzes the formation of cytidinediphosphate diacylglycerol, an essential precursor of anionic phosphoglycerolipids like phosphatidylglycerol or -inositol. In plant cells, CDS isozymes are located in plastids, mitochondria, and microsomes. Here, we show that these isozymes are encoded by five genes in Arabidopsis (Arabidopsis thaliana). Alternative translation initiation or alternative splicing of CDS2 and CDS4 transcripts can result in up to 10 isoforms. Most of the cDNAs encoding the various plant isoforms were functionally expressed in yeast and rescued the nonviable phenotype of the mutant strain lacking CDS activity. The closely related genes CDS4 and CDS5 were found to encode plastidial isozymes with similar catalytic properties. Inactivation of both genes was required to obtain Arabidopsis mutant lines with a visible phenotype, suggesting that the genes have redundant functions. Analysis of these Arabidopsis mutants provided further independent evidence for the importance of plastidial phosphatidylglycerol for structure and function of thylakoid membranes and, hence, for photoautotrophic growth. PMID:20442275
Transcriptomic Analysis Using Olive Varieties and Breeding Progenies Identifies Candidate Genes Involved in Plant Architecture.

PubMed

González-Plaza, Juan J; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R

2016-01-01

Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, where most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group, has already been applied to identify candidate genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated with differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated with plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species.
Systematic analysis of mutation distribution in three dimensional protein structures identifies cancer driver genes.

PubMed

Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki

2016-05-26

Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes.
Systematic analysis of mutation distribution in three dimensional protein structures identifies cancer driver genes

PubMed Central

Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A.; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki

2016-01-01

Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes. PMID:27225414
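The idea of a permutation test on mutation positions in a tertiary structure can be sketched as follows. This is a minimal illustration, not the authors' published procedure: the test statistic (mean pairwise distance between mutated residues), the null model (random residue sets of the same size), and the names `mean_pairwise_distance` and `clustering_p_value` are all assumptions made for the example, and each mutation is assumed to fall on a distinct residue.

```python
import itertools
import math
import random

def mean_pairwise_distance(coords, idx):
    """Mean Euclidean distance over all pairs of the selected residues."""
    pairs = list(itertools.combinations(idx, 2))
    return sum(math.dist(coords[i], coords[j]) for i, j in pairs) / len(pairs)

def clustering_p_value(coords, mutated, n_perm=2000, seed=0):
    """One-sided permutation p-value for spatial clustering of mutations.

    Null model (an assumption of this sketch): mutated residues are a
    uniformly random subset of all residues with known coordinates.
    """
    rng = random.Random(seed)
    observed = mean_pairwise_distance(coords, mutated)
    hits = 0
    for _ in range(n_perm):
        sample = rng.sample(range(len(coords)), len(mutated))
        if mean_pairwise_distance(coords, sample) <= observed:
            hits += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return observed, (hits + 1) / (n_perm + 1)

# Toy structure: 50 residues spaced 3.8 units apart along a straight line.
coords = [(3.8 * i, 0.0, 0.0) for i in range(50)]
# Hypothetical mutations at three adjacent residues.
observed, p_value = clustering_p_value(coords, [10, 11, 12])
print(f"mean pairwise distance = {observed:.2f}, p = {p_value:.4f}")
```

A small p-value here means the mutated residues sit closer together than random residue sets of the same size, which is the kind of skew the abstract describes for candidate driver genes.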
Specific PCR primers directed to identify cryI and cryIII genes within a Bacillus thuringiensis strain collection.

PubMed Central

Cerón, J; Ortíz, A; Quintero, R; Güereca, L; Bravo, A

1995-01-01

In this paper we describe a PCR strategy that can be used to rapidly identify Bacillus thuringiensis strains that harbor any of the known cryI or cryIII genes. Four general PCR primers which amplify DNA fragments from the known cryI or cryIII genes were selected from conserved regions. Once a strain was identified as an organism that contains a particular type of cry gene, it could be easily characterized by performing additional PCR with specific cryI and cryIII primers selected from variable regions. The method described in this paper can be used to identify the 10 different cryI genes and the five different cryIII genes. One feature of this screening method is that each cry gene is expected to produce a PCR product having a precise molecular weight. The genes which produce PCR products having different sizes probably represent strains that harbor a potentially novel cry gene. Finally, we present evidence that novel crystal genes can be identified by the method described in this paper. PMID:8526493
A method to identify differential expression profiles of time-course gene data with Fourier transformation.

PubMed

Kim, Jaehee; Ogden, Robert Todd; Kim, Haseong

2013-10-18

Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible to study gene expression changes over time and thereby to detect differential genes. Much of the early work on analyzing time series expression data relied on methods developed originally for static data and thus there is a need for improved methodology. Since time series expression is a temporal process, its unique features such as autocorrelation between successive points should be incorporated into the analysis. This work aims to identify genes that show different gene expression profiles across time. We propose a statistical procedure to discover gene groups with similar profiles using a nonparametric representation that accounts for the autocorrelation in the data. In particular, we first represent each profile in terms of a Fourier basis, and then we screen out genes that are not differentially expressed based on the Fourier coefficients. Finally, we cluster the remaining gene profiles using a model-based approach in the Fourier domain. We evaluate the screening results in terms of sensitivity, specificity, FDR and FNR, compare with the Gaussian process regression screening in a simulation study and illustrate the results by application to yeast cell-cycle microarray expression data with alpha-factor synchronization. The key elements of the proposed methodology: (i) representation of gene profiles in the Fourier domain; (ii) automatic screening of genes based on the Fourier coefficients and taking into account autocorrelation in the data, while controlling the false discovery rate (FDR); (iii) model-based clustering of the remaining gene profiles. Using this method, we identified a set of cell-cycle-regulated time-course yeast genes. The proposed method is general and can be
A method to identify differential expression profiles of time-course gene data with Fourier transformation

PubMed Central

2013-01-01

Background: Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible to study gene expression changes over time and thereby to detect differential genes. Much of the early work on analyzing time series expression data relied on methods developed originally for static data and thus there is a need for improved methodology. Since time series expression is a temporal process, its unique features such as autocorrelation between successive points should be incorporated into the analysis. Results: This work aims to identify genes that show different gene expression profiles across time. We propose a statistical procedure to discover gene groups with similar profiles using a nonparametric representation that accounts for the autocorrelation in the data. In particular, we first represent each profile in terms of a Fourier basis, and then we screen out genes that are not differentially expressed based on the Fourier coefficients. Finally, we cluster the remaining gene profiles using a model-based approach in the Fourier domain. We evaluate the screening results in terms of sensitivity, specificity, FDR and FNR, compare with the Gaussian process regression screening in a simulation study and illustrate the results by application to yeast cell-cycle microarray expression data with alpha-factor synchronization. The key elements of the proposed methodology: (i) representation of gene profiles in the Fourier domain; (ii) automatic screening of genes based on the Fourier coefficients and taking into account autocorrelation in the data, while controlling the false discovery rate (FDR); (iii) model-based clustering of the remaining gene profiles. Conclusions: Using this method, we identified a set of cell-cycle-regulated time-course yeast genes. The
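The first step named in this record, representing each time-course profile in a Fourier basis, can be illustrated with a short sketch. It covers only the basis representation and a crude "is there any non-constant signal" score. The FDR-controlled screening, the handling of autocorrelation, and the model-based clustering described in the abstract are not reproduced. The function names and the choice of three harmonics are assumptions for the example.

```python
import cmath
import math

def fourier_coefficients(profile, n_harmonics):
    """Normalized discrete Fourier coefficients c_0 .. c_K of one profile."""
    n = len(profile)
    coeffs = []
    for k in range(n_harmonics + 1):
        c = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(profile)
        ) / n
        coeffs.append(c)
    return coeffs

def non_constant_energy(profile, n_harmonics=3):
    """Squared magnitude of the coefficients above the constant term.

    A value near zero suggests a flat profile; this stands in for the
    paper's screening statistic, which is not specified in the abstract.
    """
    coeffs = fourier_coefficients(profile, n_harmonics)
    return sum(abs(c) ** 2 for c in coeffs[1:])

# Two hypothetical 12-point profiles: one flat, one with a single cycle.
flat = [5.0] * 12
periodic = [5.0 + 2.0 * math.sin(2 * math.pi * t / 12) for t in range(12)]

flat_energy = non_constant_energy(flat)
periodic_energy = non_constant_energy(periodic)
print(flat_energy, periodic_energy)
```

Working with a handful of low-order coefficients instead of the raw time points is what lets screening and clustering operate "in the Fourier domain", as the record puts it.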
Transcriptome Sequencing Identified Genes and Gene Ontologies Associated with Early Freezing Tolerance in Maize

PubMed Central

Li, Zhao; Hu, Guanghui; Liu, Xiangfeng; Zhou, Yao; Li, Yu; Zhang, Xu; Yuan, Xiaohui; Zhang, Qian; Yang, Deguang; Wang, Tianyu; Zhang, Zhiwu

2016-01-01

Originating in a tropical climate, maize has faced great challenges as cultivation has expanded to the majority of the world's temperate zones. In these zones, frost and cold temperatures are major factors that prevent maize from reaching its full yield potential. Among 30 elite maize inbred lines adapted to northern China, we identified two lines of extreme, but opposite, freezing tolerance levels—highly tolerant and highly sensitive. During the seedling stage of these two lines, we used RNA-seq to measure changes in maize whole genome transcriptome before and after freezing treatment. In total, 19,794 genes were expressed, of which 4550 exhibited differential expression due to either treatment (before or after freezing) or line type (tolerant or sensitive). Of the 4550 differentially expressed genes, 948 exhibited differential expression due to treatment within line or lines under freezing condition. Analysis of gene ontology found that these 948 genes were significantly enriched for binding functions (DNA binding, ATP binding, and metal ion binding), protein kinase activity, and peptidase activity. Based on their enrichment, literature support, and significant levels of differential expression, 30 of these 948 genes were selected for quantitative real-time PCR (qRT-PCR) validation. The validation confirmed our RNA-Seq-based findings, with squared correlation coefficients of 80% and 50% in the tolerant and sensitive lines, respectively. This study provided valuable resources for further studies to enhance understanding of the molecular mechanisms underlying maize early freezing response and enable targeted breeding strategies for developing varieties with superior frost resistance to achieve yield potential. PMID:27774095
The Ornithine Decarboxylase Gene Is Essential for Cell Survival during Early Murine Development

PubMed Central

Pendeville, Hélène; Carpino, Nick; Marine, Jean-Christophe; Takahashi, Yutaka; Muller, Marc; Martial, Joseph A.; Cleveland, John L.

2001-01-01

Overexpression and inhibitor studies have suggested that the c-Myc target gene for ornithine decarboxylase (ODC), the enzyme which converts ornithine to putrescine, plays an important role in diverse biological processes, including cell growth, differentiation, transformation, and apoptosis. To explore the physiological function of ODC in mammalian development, we generated mice harboring a disrupted ODC gene. ODC-heterozygous mice were viable, normal, and fertile. Although zygotic ODC is expressed throughout the embryo prior to implantation, loss of ODC did not block normal development to the blastocyst stage. Embryonic day E3.5 ODC-deficient embryos were capable of uterine implantation and induced maternal decidualization yet failed to develop substantially thereafter. Surprisingly, analysis of ODC-deficient blastocysts suggests that loss of ODC does not affect cell growth per se but rather is required for survival of the pluripotent cells of the inner cell mass. Therefore, ODC plays an essential role in murine development, and proper homeostasis of polyamine pools appears to be required for cell survival prior to gastrulation. PMID:11533243
A large shRNA library approach identifies lncRNA Ntep as an essential regulator of cell proliferation

PubMed Central

Beermann, Julia; Kirste, Dominique; Iwanov, Katharina; Lu, Dongchao; Kleemiß, Felix; Kumarswamy, Regalla; Schimmel, Katharina; Bär, Christian; Thum, Thomas

2018-01-01

The mammalian cell cycle is a complex and tightly controlled event. Myriads of different control mechanisms are involved in its regulation. Long non-coding RNAs (lncRNA) have emerged as important regulators of many cellular processes including cellular proliferation. However, a more global and unbiased approach to identify lncRNAs with importance for cell proliferation is missing. Here, we present a lentiviral shRNA library-based approach for functional lncRNA profiling. We validated our library approach in NIH3T3 (3T3) fibroblasts by identifying lncRNAs critically involved in cell proliferation. Using stringent selection criteria we identified lncRNA NR_015491.1 out of 3842 different RNA targets represented in our library. We termed this transcript Ntep (non-coding transcript essential for proliferation), as a bona fide lncRNA essential for cell cycle progression. Inhibition of Ntep in 3T3 and primary fibroblasts prevented normal cell growth and expression of key fibroblast markers. Mechanistically, we discovered that Ntep is important to activate P53 concomitant with increased apoptosis and cell cycle blockade in late G2/M. Our findings suggest Ntep to serve as an important regulator of fibroblast proliferation and function. In summary, our study demonstrates the applicability of an innovative shRNA library approach to identify long non-coding RNA functions in a massive parallel approach. PMID:29099486
Utility and Limitations of Using Gene Expression Data to Identify Functional Associations

PubMed Central

Peng, Cheng; Shiu, Shin-Han

2016-01-01

Gene co-expression has been widely used to hypothesize gene function through guilt-by association. However, it is not clear to what degree co-expression is informative, whether it can be applied to genes involved in different biological processes, and how the type of dataset impacts inferences about gene functions. Here our goal is to assess the utility and limitations of using co-expression as a criterion to recover functional associations between genes. By determining the percentage of gene pairs in a metabolic pathway with significant expression correlation, we found that many genes in the same pathway do not have similar transcript profiles and the choice of dataset, annotation quality, gene function, expression similarity measure, and clustering approach significantly impacts the ability to recover functional associations between genes using Arabidopsis thaliana as an example. Some datasets are more informative in capturing coordinated expression profiles and larger data sets are not always better. In addition, to recover the maximum number of known pathways and identify candidate genes with similar functions, it is important to explore rather exhaustively multiple dataset combinations, similarity measures, clustering algorithms and parameters. Finally, we validated the biological relevance of co-expression cluster memberships with an independent phenomics dataset and found that genes that consistently cluster with leucine degradation genes tend to have similar leucine levels in mutants. This study provides a framework for obtaining gene functional associations by maximizing the information that can be obtained from gene expression datasets. PMID:27935950
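The measure described in this record, the percentage of gene pairs within a pathway whose expression is correlated, can be sketched in a few lines. The sketch below uses Pearson correlation with a fixed cutoff in place of a significance test, and the gene names, expression values, and the 0.8 threshold are all made up for illustration. The abstract itself stresses that results depend on the similarity measure and dataset chosen.

```python
import itertools
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def fraction_coexpressed(expr, genes, threshold=0.8):
    """Fraction of gene pairs whose correlation meets the threshold."""
    pairs = list(itertools.combinations(genes, 2))
    hits = sum(1 for g, h in pairs if pearson(expr[g], expr[h]) >= threshold)
    return hits / len(pairs)

# Hypothetical expression profiles for three genes annotated to one pathway.
expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0],
    "geneB": [2.0, 4.0, 6.0, 8.0, 10.5],
    "geneC": [5.0, 3.0, 4.0, 1.0, 2.0],
}
frac = fraction_coexpressed(expr, ["geneA", "geneB", "geneC"])
print(f"{frac:.2f} of pathway gene pairs are co-expressed")
```

In this toy pathway only one of the three pairs passes the cutoff, mirroring the record's point that many genes in the same pathway do not share similar transcript profiles.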
Gene Network for Identifying the Entropy Changes of Different Modules in Pediatric Sepsis.

PubMed

Yang, Jing; Zhang, Pingli; Wang, Lumin

2016-01-01

Pediatric sepsis is a disease that threatens the lives of children. The incidence of pediatric sepsis is higher in developing countries due to various reasons, such as insufficient immunization and nutrition, water and air pollution, etc. Exploring the potential genes via different methods is of significance for the prevention and treatment of pediatric sepsis. This study aimed to identify potential genes associated with pediatric sepsis utilizing analysis of gene network and entropy. The mRNA expression in the blood samples collected from 20 septic children and 30 healthy controls was quantified by using Affymetrix HG-U133A microarray. Two condition-specific protein-protein interaction networks (PINs), one for the healthy control and the other one for the children with sepsis, were deduced by combining the fundamental human PINs with gene expression profiles in the two phenotypes. Subsequently, distinct modules from the two conditional networks were extracted by adopting a maximal clique-merging approach. Delta entropy (ΔS) was calculated between sepsis and control modules. Then, key genes displaying changes in gene composition were identified by matching the control and sepsis modules. Two objective modules were obtained, in which ribosomal proteins RPL4 and RPL9 as well as TOP2A were considered the probable key genes differentiating sepsis from healthy controls. According to previous reports and this work, TOP2A is a potential gene therapy target for pediatric sepsis. The relationship between pediatric sepsis and RPL4 and RPL9 needs further investigation. © 2016 The Author(s) Published by S. Karger AG, Basel.
Gene expression dynamics during embryonic development in rainbow trout

USDA-ARS's Scientific Manuscript database

The supply of maternal RNAs in fertilized egg and activation of embryonic genome during maternal-zygotic transition (MZT) are important for normal embryonic development. In order to identify genes and gene products that are essential in the regulation of embryonic development in rainbow trout, RNA-S...
Combining Functional and Structural Genomics to Sample the Essential Burkholderia Structome

PubMed Central

Baugh, Loren; Gallagher, Larry A.; Patrapuvich, Rapatbhorn; Clifton, Matthew C.; Gardberg, Anna S.; Edwards, Thomas E.; Armour, Brianna; Begley, Darren W.; Dieterich, Shellie H.; Dranow, David M.; Abendroth, Jan; Fairman, James W.; Fox, David; Staker, Bart L.; Phan, Isabelle; Gillespie, Angela; Choi, Ryan; Nakazawa-Hewitt, Steve; Nguyen, Mary Trang; Napuli, Alberto; Barrett, Lynn; Buchko, Garry W.; Stacy, Robin; Myler, Peter J.; Stewart, Lance J.; Manoil, Colin; Van Voorhis, Wesley C.

2013-01-01

Background: The genus Burkholderia includes pathogenic gram-negative bacteria that cause melioidosis, glanders, and pulmonary infections of patients with cancer and cystic fibrosis. Drug resistance has made development of new antimicrobials critical. Many approaches to discovering new antimicrobials, such as structure-based drug design and whole cell phenotypic screens followed by lead refinement, require high-resolution structures of proteins essential to the parasite. Methodology/Principal Findings: We experimentally identified 406 putative essential genes in B. thailandensis, a low-virulence species phylogenetically similar to B. pseudomallei, the causative agent of melioidosis, using saturation-level transposon mutagenesis and next-generation sequencing (Tn-seq). We selected 315 protein products of these genes based on structure-determination criteria, such as excluding very large and/or integral membrane proteins, and entered them into the Seattle Structural Genomics Center for Infectious Disease (SSGCID) structure determination pipeline. To maximize structural coverage of these targets, we applied an “ortholog rescue” strategy for those producing insoluble or difficult to crystallize proteins, resulting in the addition of 387 orthologs (or paralogs) from seven other Burkholderia species into the SSGCID pipeline. This structural genomics approach yielded structures from 31 putative essential targets from B. thailandensis, and 25 orthologs from other Burkholderia species, yielding an overall structural coverage for 49 of the 406 essential gene families, with a total of 88 depositions into the Protein Data Bank. Of these, 25 proteins have properties of a potential antimicrobial drug target i.e., no close human homolog, part of an essential metabolic pathway, and a deep binding pocket. We describe the structures of several potential drug targets in detail. Conclusions/Significance: This collection of structures, solubility and experimental essentiality data
Combining functional and structural genomics to sample the essential Burkholderia structome.

PubMed

Baugh, Loren; Gallagher, Larry A; Patrapuvich, Rapatbhorn; Clifton, Matthew C; Gardberg, Anna S; Edwards, Thomas E; Armour, Brianna; Begley, Darren W; Dieterich, Shellie H; Dranow, David M; Abendroth, Jan; Fairman, James W; Fox, David; Staker, Bart L; Phan, Isabelle; Gillespie, Angela; Choi, Ryan; Nakazawa-Hewitt, Steve; Nguyen, Mary Trang; Napuli, Alberto; Barrett, Lynn; Buchko, Garry W; Stacy, Robin; Myler, Peter J; Stewart, Lance J; Manoil, Colin; Van Voorhis, Wesley C

2013-01-01

The genus Burkholderia includes pathogenic gram-negative bacteria that cause melioidosis, glanders, and pulmonary infections of patients with cancer and cystic fibrosis. Drug resistance has made development of new antimicrobials critical. Many approaches to discovering new antimicrobials, such as structure-based drug design and whole cell phenotypic screens followed by lead refinement, require high-resolution structures of proteins essential to the parasite. We experimentally identified 406 putative essential genes in B. thailandensis, a low-virulence species phylogenetically similar to B. pseudomallei, the causative agent of melioidosis, using saturation-level transposon mutagenesis and next-generation sequencing (Tn-seq). We selected 315 protein products of these genes based on structure-determination criteria, such as excluding very large and/or integral membrane proteins, and entered them into the Seattle Structural Genomics Center for Infectious Disease (SSGCID) structure determination pipeline. To maximize structural coverage of these targets, we applied an "ortholog rescue" strategy for those producing insoluble or difficult to crystallize proteins, resulting in the addition of 387 orthologs (or paralogs) from seven other Burkholderia species into the SSGCID pipeline. This structural genomics approach yielded structures from 31 putative essential targets from B. thailandensis, and 25 orthologs from other Burkholderia species, yielding an overall structural coverage for 49 of the 406 essential gene families, with a total of 88 depositions into the Protein Data Bank. Of these, 25 proteins have properties of a potential antimicrobial drug target i.e., no close human homolog, part of an essential metabolic pathway, and a deep binding pocket. We describe the structures of several potential drug targets in detail. This collection of structures, solubility and experimental essentiality data provides a resource for development of drugs against infections and diseases

Evolutionary analysis of vision genes identifies potential drivers of visual differences between giraffe and okapi

PubMed Central

Ishengoma, Edson; Agaba, Morris; Cavener, Douglas R.

2017-01-01

Background: The capacity of visually oriented species to perceive and respond to visual signal is integral to their evolutionary success. Giraffes are closely related to okapi, but the two species have broad range of phenotypic differences including their visual capacities. Vision studies rank giraffe’s visual acuity higher than all other artiodactyls despite sharing similar vision ecological determinants with many of them. The extent to which the giraffe’s unique visual capacity and its difference with okapi is reflected by changes in their vision genes is not understood. Methods: The recent availability of giraffe and okapi genomes provided opportunity to identify giraffe and okapi vision genes. Multiple strategies were employed to identify thirty-six candidate mammalian vision genes in giraffe and okapi genomes. Quantification of selection pressure was performed by a combination of branch-site tests of positive selection and clade models of selection divergence through comparing giraffe and okapi vision genes and orthologous sequences from other mammals. Results: Signatures of selection were identified in key genes that could potentially underlie giraffe and okapi visual adaptations. Importantly, some genes that contribute to optical transparency of the eye and those that are critical in light signaling pathway were found to show signatures of adaptive evolution or selection divergence. Comparison between giraffe and other ruminants identifies significant selection divergence in CRYAA and OPN1LW. Significant selection divergence was identified in SAG while positive selection was detected in LUM when okapi is compared with ruminants and other mammals. Sequence analysis of OPN1LW showed that at least one of the sites known to affect spectral sensitivity of the red pigment is uniquely divergent between giraffe and other ruminants. Discussion: By taking a systemic approach to gene function in vision, the results provide the first molecular clues associated with
Evolutionary analysis of vision genes identifies potential drivers of visual differences between giraffe and okapi.

PubMed

Ishengoma, Edson; Agaba, Morris; Cavener, Douglas R

2017-01-01

The capacity of visually oriented species to perceive and respond to visual signal is integral to their evolutionary success. Giraffes are closely related to okapi, but the two species have broad range of phenotypic differences including their visual capacities. Vision studies rank giraffe's visual acuity higher than all other artiodactyls despite sharing similar vision ecological determinants with many of them. The extent to which the giraffe's unique visual capacity and its difference with okapi is reflected by changes in their vision genes is not understood. The recent availability of giraffe and okapi genomes provided opportunity to identify giraffe and okapi vision genes. Multiple strategies were employed to identify thirty-six candidate mammalian vision genes in giraffe and okapi genomes. Quantification of selection pressure was performed by a combination of branch-site tests of positive selection and clade models of selection divergence through comparing giraffe and okapi vision genes and orthologous sequences from other mammals. Signatures of selection were identified in key genes that could potentially underlie giraffe and okapi visual adaptations. Importantly, some genes that contribute to optical transparency of the eye and those that are critical in light signaling pathway were found to show signatures of adaptive evolution or selection divergence. Comparison between giraffe and other ruminants identifies significant selection divergence in CRYAA and OPN1LW. Significant selection divergence was identified in SAG while positive selection was detected in LUM when okapi is compared with ruminants and other mammals. Sequence analysis of OPN1LW showed that at least one of the sites known to affect spectral sensitivity of the red pigment is uniquely divergent between giraffe and other ruminants. By taking a systemic approach to gene function in vision, the results provide the first molecular clues associated with giraffe and okapi vision adaptations. At
A conserved BDNF, glutamate- and GABA-enriched gene module related to human depression identified by coexpression meta-analysis and DNA variant genome-wide association studies.

PubMed

Chang, Lun-Ching; Jamain, Stephane; Lin, Chien-Wei; Rujescu, Dan; Tseng, George C; Sibille, Etienne

2014-01-01

Large scale gene expression (transcriptome) analysis and genome-wide association studies (GWAS) for single nucleotide polymorphisms have generated a considerable amount of gene- and disease-related information, but heterogeneity and various sources of noise have limited the discovery of disease mechanisms. As systematic dataset integration is becoming essential, we developed methods and performed meta-clustering of gene coexpression links in 11 transcriptome studies from postmortem brains of human subjects with major depressive disorder (MDD) and non-psychiatric control subjects. We next sought enrichment in the top 50 meta-analyzed coexpression modules for genes otherwise identified by GWAS for various sets of disorders. One coexpression module of 88 genes was consistently and significantly associated with GWAS for MDD, other neuropsychiatric disorders and brain functions, and for medical illnesses with elevated clinical risk of depression, but not for other diseases. In support of the superior discriminative power of this novel approach, we observed no significant enrichment for GWAS-related genes in coexpression modules extracted from single studies or in meta-modules using gene expression data from non-psychiatric control subjects. Genes in the identified module encode proteins implicated in neuronal signaling and structure, including glutamate metabotropic receptors (GRM1, GRM7), GABA receptors (GABRA2, GABRA4), and neurotrophic and development-related proteins [BDNF, reelin (RELN), Ephrin receptors (EPHA3, EPHA5)]. These results are consistent with the current understanding of molecular mechanisms of MDD and provide a set of putative interacting molecular partners, potentially reflecting components of a functional module across cells and biological pathways that are synchronously recruited in MDD, other brain disorders and MDD-related illnesses. Collectively, this study demonstrates the importance of integrating transcriptome data, gene coexpression modules
Unique attributes of cyanobacterial metabolism revealed by improved genome-scale metabolic modeling and essential gene analysis

DOE PAGES

Broddrick, Jared T.; Rubin, Benjamin E.; Welkie, David G.; ...

2016-12-20

The model cyanobacterium, Synechococcus elongatus PCC 7942, is a genetically tractable obligate phototroph that is being developed for the bioproduction of high-value chemicals. Genome-scale models (GEMs) have been successfully used to assess and engineer cellular metabolism; however, GEMs of phototrophic metabolism have been limited by the lack of experimental datasets for model validation and the challenges of incorporating photon uptake. In this paper, we develop a GEM of metabolism in S. elongatus using random barcode transposon site sequencing (RB-TnSeq) essential gene and physiological data specific to photoautotrophic metabolism. The model explicitly describes photon absorption and accounts for shading, resulting in the characteristic linear growth curve of photoautotrophs. GEM predictions of gene essentiality were compared with data obtained from recent dense-transposon mutagenesis experiments. This dataset allowed major improvements to the accuracy of the model. Furthermore, discrepancies between GEM predictions and the in vivo dataset revealed biological characteristics, such as the importance of a truncated, linear TCA pathway, low flux toward amino acid synthesis from photorespiration, and knowledge gaps within nucleotide metabolism. Finally, coupling of strong experimental support and photoautotrophic modeling methods thus resulted in a highly accurate model of S. elongatus metabolism that highlights previously unknown areas of S. elongatus biology.
Unique attributes of cyanobacterial metabolism revealed by improved genome-scale metabolic modeling and essential gene analysis

PubMed Central

Broddrick, Jared T.; Rubin, Benjamin E.; Welkie, David G.; Du, Niu; Mih, Nathan; Diamond, Spencer; Lee, Jenny J.; Golden, Susan S.; Palsson, Bernhard O.

2016-01-01

The model cyanobacterium, Synechococcus elongatus PCC 7942, is a genetically tractable obligate phototroph that is being developed for the bioproduction of high-value chemicals. Genome-scale models (GEMs) have been successfully used to assess and engineer cellular metabolism; however, GEMs of phototrophic metabolism have been limited by the lack of experimental datasets for model validation and the challenges of incorporating photon uptake. Here, we develop a GEM of metabolism in S. elongatus using random barcode transposon site sequencing (RB-TnSeq) essential gene and physiological data specific to photoautotrophic metabolism. The model explicitly describes photon absorption and accounts for shading, resulting in the characteristic linear growth curve of photoautotrophs. GEM predictions of gene essentiality were compared with data obtained from recent dense-transposon mutagenesis experiments. This dataset allowed major improvements to the accuracy of the model. Furthermore, discrepancies between GEM predictions and the in vivo dataset revealed biological characteristics, such as the importance of a truncated, linear TCA pathway, low flux toward amino acid synthesis from photorespiration, and knowledge gaps within nucleotide metabolism. Coupling of strong experimental support and photoautotrophic modeling methods thus resulted in a highly accurate model of S. elongatus metabolism that highlights previously unknown areas of S. elongatus biology. PMID:27911809
Unique attributes of cyanobacterial metabolism revealed by improved genome-scale metabolic modeling and essential gene analysis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Broddrick, Jared T.; Rubin, Benjamin E.; Welkie, David G.

The model cyanobacterium, Synechococcus elongatus PCC 7942, is a genetically tractable obligate phototroph that is being developed for the bioproduction of high-value chemicals. Genome-scale models (GEMs) have been successfully used to assess and engineer cellular metabolism; however, GEMs of phototrophic metabolism have been limited by the lack of experimental datasets for model validation and the challenges of incorporating photon uptake. In this paper, we develop a GEM of metabolism in S. elongatus using random barcode transposon site sequencing (RB-TnSeq) essential gene and physiological data specific to photoautotrophic metabolism. The model explicitly describes photon absorption and accounts for shading, resulting in the characteristic linear growth curve of photoautotrophs. GEM predictions of gene essentiality were compared with data obtained from recent dense-transposon mutagenesis experiments. This dataset allowed major improvements to the accuracy of the model. Furthermore, discrepancies between GEM predictions and the in vivo dataset revealed biological characteristics, such as the importance of a truncated, linear TCA pathway, low flux toward amino acid synthesis from photorespiration, and knowledge gaps within nucleotide metabolism. Finally, coupling of strong experimental support and photoautotrophic modeling methods thus resulted in a highly accurate model of S. elongatus metabolism that highlights previously unknown areas of S. elongatus biology.
Genome-Wide and Gene-Based Meta-Analyses Identify Novel Loci Influencing Blood Pressure Response to Hydrochlorothiazide.

PubMed

Salvi, Erika; Wang, Zhiying; Rizzi, Federica; Gong, Yan; McDonough, Caitrin W; Padmanabhan, Sandosh; Hiltunen, Timo P; Lanzani, Chiara; Zaninello, Roberta; Chittani, Martina; Bailey, Kent R; Sarin, Antti-Pekka; Barcella, Matteo; Melander, Olle; Chapman, Arlene B; Manunta, Paolo; Kontula, Kimmo K; Glorioso, Nicola; Cusi, Daniele; Dominiczak, Anna F; Johnson, Julie A; Barlassina, Cristina; Boerwinkle, Eric; Cooper-DeHoff, Rhonda M; Turner, Stephen T

2017-01-01

This study aimed to identify novel loci influencing the antihypertensive response to hydrochlorothiazide monotherapy. A genome-wide meta-analysis of blood pressure (BP) response to hydrochlorothiazide was performed in 1739 white hypertensives from 6 clinical trials within the International Consortium for Antihypertensive Pharmacogenomics Studies, making it the largest study to date of its kind. No signals reached genome-wide significance (P<5×10⁻⁸), and the suggestive regions (P<10⁻⁵) were cross-validated in 2 black cohorts treated with hydrochlorothiazide. In addition, a gene-based analysis was performed on candidate genes with previous evidence of involvement in diuretic response, in BP regulation, or in hypertension susceptibility. Using the genome-wide meta-analysis approach, with validation in blacks, we identified 2 suggestive regulatory regions linked to gap junction protein α1 gene (GJA1) and forkhead box A1 gene (FOXA1), relevant for cardiovascular and kidney function. With the gene-based approach, we identified hydroxy-delta-5-steroid dehydrogenase, 3 β- and steroid δ-isomerase 1 gene (HSD3B1) as significantly associated with BP response (P<2.28×10⁻⁴). HSD3B1 encodes the 3β-hydroxysteroid dehydrogenase enzyme and plays a crucial role in the biosynthesis of aldosterone and endogenous ouabain. By amassing all of the available pharmacogenomic studies of BP response to hydrochlorothiazide, and using 2 different analytic approaches, we identified 3 novel loci influencing BP response to hydrochlorothiazide. The gene-based analysis, never before applied to pharmacogenomics of antihypertensive drugs to our knowledge, provided a powerful strategy to identify a locus of interest, which was not identified in the genome-wide meta-analysis because of high allelic heterogeneity. These data pave the way for future investigations on new pathways and drug targets to enhance the current understanding of personalized antihypertensive treatment. © 2016
A transcriptome-wide association study of 229,000 women identifies new candidate susceptibility genes for breast cancer.

PubMed

Wu, Lang; Shi, Wei; Long, Jirong; Guo, Xingyi; Michailidou, Kyriaki; Beesley, Jonathan; Bolla, Manjeet K; Shu, Xiao-Ou; Lu, Yingchang; Cai, Qiuyin; Al-Ejeh, Fares; Rozali, Esdy; Wang, Qin; Dennis, Joe; Li, Bingshan; Zeng, Chenjie; Feng, Helian; Gusev, Alexander; Barfield, Richard T; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Aronson, Kristan J; Auer, Paul L; Barrdahl, Myrto; Baynes, Caroline; Beckmann, Matthias W; Benitez, Javier; Bermisheva, Marina; Blomqvist, Carl; Bogdanova, Natalia V; Bojesen, Stig E; Brauch, Hiltrud; Brenner, Hermann; Brinton, Louise; Broberg, Per; Brucker, Sara Y; Burwinkel, Barbara; Caldés, Trinidad; Canzian, Federico; Carter, Brian D; Castelao, J Esteban; Chang-Claude, Jenny; Chen, Xiaoqing; Cheng, Ting-Yuan David; Christiansen, Hans; Clarke, Christine L; Collée, Margriet; Cornelissen, Sten; Couch, Fergus J; Cox, David; Cox, Angela; Cross, Simon S; Cunningham, Julie M; Czene, Kamila; Daly, Mary B; Devilee, Peter; Doheny, Kimberly F; Dörk, Thilo; Dos-Santos-Silva, Isabel; Dumont, Martine; Dwek, Miriam; Eccles, Diana M; Eilber, Ursula; Eliassen, A Heather; Engel, Christoph; Eriksson, Mikael; Fachal, Laura; Fasching, Peter A; Figueroa, Jonine; Flesch-Janys, Dieter; Fletcher, Olivia; Flyger, Henrik; Fritschi, Lin; Gabrielson, Marike; Gago-Dominguez, Manuela; Gapstur, Susan M; García-Closas, Montserrat; Gaudet, Mia M; Ghoussaini, Maya; Giles, Graham G; Goldberg, Mark S; Goldgar, David E; González-Neira, Anna; Guénel, Pascal; Hahnen, Eric; Haiman, Christopher A; Håkansson, Niclas; Hall, Per; Hallberg, Emily; Hamann, Ute; Harrington, Patricia; Hein, Alexander; Hicks, Belynda; Hillemanns, Peter; Hollestelle, Antoinette; Hoover, Robert N; Hopper, John L; Huang, Guanmengqian; Humphreys, Keith; Hunter, David J; Jakubowska, Anna; Janni, Wolfgang; John, Esther M; Johnson, Nichola; Jones, Kristine; Jones, Michael E; Jung, Audrey; Kaaks, Rudolf; Kerin, Michael J; Khusnutdinova, Elza; Kosma, Veli-Matti; Kristensen, Vessela N; Lambrechts, Diether; Le Marchand, Loic; Li, Jingmei; Lindström, Sara; Lissowska, Jolanta; Lo, Wing-Yee; Loibl, Sibylle; Lubinski, Jan; Luccarini, Craig; Lux, Michael P; MacInnis, Robert J; Maishman, Tom; Kostovska, Ivana Maleva; Mannermaa, Arto; Manson, JoAnn E; Margolin, Sara; Mavroudis, Dimitrios; Meijers-Heijboer, Hanne; Meindl, Alfons; Menon, Usha; Meyer, Jeffery; Mulligan, Anna Marie; Neuhausen, Susan L; Nevanlinna, Heli; Neven, Patrick; Nielsen, Sune F; Nordestgaard, Børge G; Olopade, Olufunmilayo I; Olson, Janet E; Olsson, Håkan; Peterlongo, Paolo; Peto, Julian; Plaseska-Karanfilska, Dijana; Prentice, Ross; Presneau, Nadege; Pylkäs, Katri; Rack, Brigitte; Radice, Paolo; Rahman, Nazneen; Rennert, Gad; Rennert, Hedy S; Rhenius, Valerie; Romero, Atocha; Romm, Jane; Rudolph, Anja; Saloustros, Emmanouil; Sandler, Dale P; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Schneeweiss, Andreas; Scott, Rodney J; Scott, Christopher G; Seal, Sheila; Shah, Mitul; Shrubsole, Martha J; Smeets, Ann; Southey, Melissa C; Spinelli, John J; Stone, Jennifer; Surowy, Harald; Swerdlow, Anthony J; Tamimi, Rulla M; Tapper, William; Taylor, Jack A; Terry, Mary Beth; Tessier, Daniel C; Thomas, Abigail; Thöne, Kathrin; Tollenaar, Rob A E M; Torres, Diana; Truong, Thérèse; Untch, Michael; Vachon, Celine; Van Den Berg, David; Vincent, Daniel; Waisfisz, Quinten; Weinberg, Clarice R; Wendt, Camilla; Whittemore, Alice S; Wildiers, Hans; Willett, Walter C; Winqvist, Robert; Wolk, Alicja; Xia, Lucy; Yang, Xiaohong R; Ziogas, Argyrios; Ziv, Elad; Dunning, Alison M; Pharoah, Paul D P; Simard, Jacques; Milne, Roger L; Edwards, Stacey L; Kraft, Peter; Easton, Douglas F; Chenevix-Trench, Georgia; Zheng, Wei

2018-06-18

The breast cancer risk variants identified in genome-wide association studies explain only a small fraction of the familial relative risk, and the genes responsible for these associations remain largely unknown. To identify novel risk loci and likely causal genes, we performed a transcriptome-wide association study evaluating associations of genetically predicted gene expression with breast cancer risk in 122,977 cases and 105,974 controls of European ancestry. We used data from the Genotype-Tissue Expression Project to establish genetic models to predict gene expression in breast tissue and evaluated model performance using data from The Cancer Genome Atlas. Of the 8,597 genes evaluated, significant associations were identified for 48 at a Bonferroni-corrected threshold of P < 5.82 × 10⁻⁶, including 14 genes at loci not yet reported for breast cancer. We silenced 13 genes and showed an effect for 11 on cell proliferation and/or colony-forming efficiency. Our study provides new insights into breast cancer genetics and biology.
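The Bonferroni-corrected threshold quoted above can be reproduced with a short calculation. The sketch below assumes a family-wise error rate of 0.05, which the abstract does not state; under that assumption, dividing by the 8,597 genes tested gives the reported value. The function name is illustrative only.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold that controls the family-wise error rate at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# alpha = 0.05 is an assumption; 8,597 is the number of genes evaluated in the abstract.
threshold = bonferroni_threshold(0.05, 8597)
print(f"{threshold:.2e}")  # 5.82e-06
```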
Identifying signatures of positive selection in pigmentation genes in two South Asian populations.

PubMed

Jonnalagadda, Manjari; Bharti, Neeraj; Patil, Yatish; Ozarkar, Shantanu; K, Sunitha Manjari; Joshi, Rajendra; Norton, Heather

2017-09-10

Skin pigmentation is a polygenic trait showing wide phenotypic variation among global populations. While numerous pigmentation genes have been identified as being under positive selection in European and East Asian populations, genes contributing to phenotypic variation in skin pigmentation within and among South Asian populations are still poorly understood. The present study uses data from Phase 3 of the 1000 Genomes Project, focusing on two South Asian populations, GIH (Gujarati Indian from Houston, Texas) and ITU (Indian Telugu from UK), to decode the genetic architecture involved in adaptation to ultraviolet radiation in South Asian populations. The statistical tests included (1) tests to identify deviations of the Site Frequency Spectrum (SFS) from neutral expectations (Tajima's D, Fay and Wu's H and Fu and Li's D* and F*), (2) tests focused on the identification of high-frequency haplotypes with extended linkage disequilibrium (iHS and Rsb), and (3) tests based on genetic differentiation between populations (LSBL). Twenty-two pigmentation genes fall in the top 1% for at least one statistic in the GIH population, 5 of which (LYST, OCA2, SLC24A5, SLC45A2, and TYR) have been previously associated with normal variation in skin, hair, or eye color. In comparison, 17 genes fall in the top 1% for at least one statistic in the ITU population. Twelve loci identified as outliers in the ITU scan were also identified in the GIH population. These results suggest that selection may have affected these loci broadly across the region. © 2017 Wiley Periodicals, Inc.
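Of the SFS-based tests listed above, Tajima's D has a compact closed form. The following is a minimal sketch of the standard formula (Tajima, 1989), not the pipeline used in the study; the function name and the input values are hypothetical. D compares the mean number of pairwise differences (pi) with the estimate expected from the number of segregating sites (S) under neutrality.

```python
from math import sqrt


def tajimas_d(n: int, seg_sites: int, pi: float) -> float:
    """Tajima's D for n sequences, seg_sites segregating sites, and mean pairwise difference pi."""
    if n < 4 or seg_sites < 1:
        raise ValueError("need at least 4 sequences and 1 segregating site")
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)
    # Numerator: pi minus Watterson's estimate S / a1; denominator: its standard deviation.
    return (pi - seg_sites / a1) / sqrt(e1 * seg_sites + e2 * seg_sites * (seg_sites - 1))


# Hypothetical inputs: 10 sequences, 10 segregating sites, low pairwise diversity.
print(tajimas_d(10, 10, 1.0) < 0)  # an excess of rare variants gives a negative D
```

A negative D indicates an excess of low-frequency variants, as expected after a selective sweep or population expansion; a positive D indicates an excess of intermediate-frequency variants.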
Transcriptomic Analysis Using Olive Varieties and Breeding Progenies Identifies Candidate Genes Involved in Plant Architecture

PubMed Central

González-Plaza, Juan J.; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F.; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R.; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R.

2016-01-01

Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, where most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group, has already been applied to identify candidate genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated with differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated with plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species. PMID:26973682
SPK1 is an essential S-phase-specific gene of Saccharomyces cerevisiae that encodes a nuclear serine/threonine/tyrosine kinase.

PubMed

Zheng, P; Fay, D S; Burton, J; Xiao, H; Pinkham, J L; Stern, D F

1993-09-01

SPK1 was originally discovered in an immunoscreen for tyrosine-protein kinases in Saccharomyces cerevisiae. We have used biochemical and genetic techniques to investigate the function of this gene and its encoded protein. Hybridization of an SPK1 probe to an ordered genomic library showed that SPK1 is adjacent to PEP4 (chromosome XVI L). Sporulation of spk1/+ heterozygotes gave rise to spk1 spores that grew into microcolonies but could not be further propagated. These colonies were greatly enriched for budded cells, especially those with large buds. Similarly, eviction of CEN plasmids bearing SPK1 from cells with a chromosomal SPK1 disruption yielded viable cells only at low frequency. Spk1 protein was identified by immunoprecipitation and immunoblotting. It was associated with protein-Ser, Thr, and Tyr kinase activity in immune complex kinase assays. Spk1 was localized to the nucleus by immunofluorescence. The nucleotide sequence of the SPK1 5' noncoding region revealed that SPK1 contains two MluI cell cycle box elements. These elements confer S-phase-specific transcription to many genes involved in DNA synthesis. Northern (RNA) blotting of synchronized cells verified that the SPK1 transcript is coregulated with other MluI box-regulated genes. The SPK1 upstream region also includes a domain highly homologous to sequences involved in induction of RAD2 and other excision repair genes by agents that induce DNA damage. spk1 strains were hypersensitive to UV irradiation. Taken together, these findings indicate that SPK1 is a dual-specificity (Ser/Thr and Tyr) protein kinase that is essential for viability. The cell cycle-dependent transcription, presence of DNA damage-related sequences, requirement for UV resistance, and nuclear localization of Spk1 all link this gene to a crucial S-phase-specific role, probably as a positive regulator of DNA synthesis.
Robust Principal Component Analysis Regularized by Truncated Nuclear Norm for Identifying Differentially Expressed Genes.

PubMed

Wang, Ya-Xuan; Gao, Ying-Lian; Liu, Jin-Xing; Kong, Xiang-Zhen; Li, Hai-Jun

2017-09-01

Identifying differentially expressed genes from among thousands of genes is a challenging task. Robust principal component analysis (RPCA) is an efficient method for identifying differentially expressed genes. The RPCA method uses the nuclear norm to approximate the rank function. However, theoretical studies have shown that the nuclear norm minimizes all singular values, so it may not be the best approximation of the rank function. The truncated nuclear norm is defined as the sum of some smaller singular values, which may achieve a better approximation of the rank function than the nuclear norm. In this paper, a novel method is proposed by replacing the nuclear norm of RPCA with the truncated nuclear norm, which is named robust principal component analysis regularized by truncated nuclear norm (TRPCA). The method decomposes the observation matrix of genomic data into a low-rank matrix and a sparse matrix. Because the significant genes can be considered as sparse signals, the differentially expressed genes are viewed as the sparse perturbation signals. Thus, the differentially expressed genes can be identified according to the sparse matrix. The experimental results on The Cancer Genome Atlas data illustrate that the TRPCA method outperforms other state-of-the-art methods in the identification of differentially expressed genes.
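The difference between the two norms described above can be illustrated numerically. This is a minimal sketch using NumPy, not the TRPCA solver itself; the function names, the truncation parameter r, and the example matrices are assumptions made for illustration. The truncated nuclear norm is computed here as the sum of all singular values except the r largest.

```python
import numpy as np


def nuclear_norm(matrix: np.ndarray) -> float:
    """Sum of all singular values."""
    return float(np.linalg.svd(matrix, compute_uv=False).sum())


def truncated_nuclear_norm(matrix: np.ndarray, r: int) -> float:
    """Sum of the singular values that remain after dropping the r largest."""
    singular_values = np.linalg.svd(matrix, compute_uv=False)  # sorted in descending order
    if not 0 <= r <= singular_values.size:
        raise ValueError("r must lie between 0 and the number of singular values")
    return float(singular_values[r:].sum())


# A rank-one matrix: its nuclear norm is large, but truncating r = 1 value leaves ~0.
rank_one = np.outer([1.0, 2.0, 3.0], [4.0, 5.0])
print(nuclear_norm(rank_one), truncated_nuclear_norm(rank_one, 1))
```

For a matrix of rank at most r the truncated norm is (numerically) zero regardless of how large the leading singular values are, which is why it can track the rank function more closely than the full nuclear norm.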
Identifying prognostic signature in ovarian cancer using DirGenerank

PubMed Central

Wang, Jian-Yong; Chen, Ling-Ling; Zhou, Xiong-Hui

2017-01-01

Identifying the prognostic genes in cancer is essential not only for the treatment of cancer patients, but also for drug discovery. However, it's still a big challenge to select the prognostic genes that can distinguish the risk of cancer patients across various data sets because of tumor heterogeneity. In this situation, the selected genes whose expression levels are statistically related to prognostic risks may be passengers. In this paper, based on gene expression data and prognostic data of ovarian cancer patients, we used conditional mutual information to construct a gene dependency network in which the nodes (genes) with more out-degrees have more chances to be the modulators of cancer prognosis. After that, we proposed the DirGenerank (Generank in direct network) algorithm, which concerns both the gene dependency network and genes’ correlations to prognostic risks, to identify the gene signature that can predict the prognostic risks of ovarian cancer patients. Using the ovarian cancer data set from TCGA (The Cancer Genome Atlas) as the training data set, 40 genes with the highest importance were selected as the prognostic signature. Survival analysis of these patients divided by the prognostic signature in the testing data set and four independent data sets showed the signature can distinguish the prognostic risks of cancer patients significantly. Enrichment analysis of the signature with curated cancer genes and the drugs selected by CMAP showed the genes in the signature may be drug targets for therapy. In summary, we have proposed a useful pipeline to identify prognostic genes of cancer patients. PMID:28615526
Genome-Wide Temporal Expression Profiling in Caenorhabditis elegans Identifies a Core Gene Set Related to Long-Term Memory.

PubMed

Freytag, Virginie; Probst, Sabine; Hadziselimovic, Nils; Boglari, Csaba; Hauser, Yannick; Peter, Fabian; Gabor Fenyves, Bank; Milnik, Annette; Demougin, Philippe; Vukojevic, Vanja; de Quervain, Dominique J-F; Papassotiropoulos, Andreas; Stetak, Attila

2017-07-12

The identification of genes related to encoding, storage, and retrieval of memories is a major interest in neuroscience. In the current study, we analyzed the temporal gene expression changes in a neuronal mRNA pool during an olfactory long-term associative memory (LTAM) in Caenorhabditis elegans hermaphrodites. Here, we identified a core set of 712 (538 upregulated and 174 downregulated) genes that follows three distinct temporal peaks demonstrating multiple gene regulation waves in LTAM. Compared with the previously published positive LTAM gene set (Lakhina et al., 2015), 50% of the identified upregulated genes here overlap with the previous dataset, possibly representing stimulus-independent memory-related genes. On the other hand, the remaining genes were not previously identified in positive associative memory and may specifically regulate aversive LTAM. Our results suggest a multistep gene activation process during the formation and retrieval of long-term memory and define general memory-implicated genes as well as conditioning-type-dependent gene sets. SIGNIFICANCE STATEMENT The identification of genes regulating different steps of memory is of major interest in neuroscience. Identification of common memory genes across different learning paradigms and the temporal activation of the genes are poorly studied. Here, we investigated the temporal aspects of Caenorhabditis elegans gene expression changes using aversive olfactory associative long-term memory (LTAM) and identified three major gene activation waves. Like in previous studies, aversive LTAM is also CREB dependent, and CREB activity is necessary immediately after training. Finally, we define a list of memory paradigm-independent core gene sets as well as conditioning-dependent genes. Copyright © 2017 the authors 0270-6474/17/376661-12$15.00/0.
DNA methylome profiling identifies novel methylated genes in African American patients with colorectal neoplasia.

PubMed

Ashktorab, Hassan; Daremipouran, M; Goel, Ajay; Varma, Sudhir; Leavitt, R; Sun, Xueguang; Brim, Hassan

2014-04-01

The identification of genes that are differentially methylated in colorectal cancer (CRC) has potential value for both diagnostic and therapeutic interventions specifically in high-risk populations such as African Americans (AAs). However, DNA methylation patterns in CRC, especially in AAs, have not been systematically explored and remain poorly understood. Here, we performed DNA methylome profiling to identify the methylation status of CpG islands within candidate genes involved in critical pathways important in the initiation and development of CRC. We used reduced representation bisulfite sequencing (RRBS) in colorectal cancer and adenoma tissues that were compared with DNA methylome from a healthy AA subject's colon tissue and peripheral blood DNA. The identified methylation markers were validated in fresh frozen CRC tissues and corresponding normal tissues from AA patients diagnosed with CRC at Howard University Hospital. We identified and validated the methylation status of 355 CpG sites located within 16 gene promoter regions associated with CpG islands. Fifty CpG sites located within CpG islands, in the genes ATXN7L1 (2), BMP3 (7), EID3 (15), GAS7 (1), GPR75 (24), and TNFAIP2 (1), were significantly hypermethylated in tumor vs. normal tissues (P<0.05). The methylation status of BMP3, EID3, GAS7, and GPR75 was confirmed in an independent validation cohort. Ingenuity pathway analysis mapped three of these markers (GAS7, BMP3 and GPR) in the insulin and TGF-β1 network, the two key pathways in CRC. In addition to hypermethylated genes, our analysis also revealed that LINE-1 repeat elements were progressively hypomethylated in the normal-adenoma-cancer sequence. We conclude that DNA methylome profiling based on RRBS is an effective method for screening aberrantly methylated genes in CRC. While previous studies focused on the limited identification of hypermethylated genes, ours is the first study to systematically and comprehensively identify novel hypermethylated
High-resolution genome-wide scan of genes, gene-networks and cellular systems impacting the yeast ionome

USDA-ARS's Scientific Manuscript database

To balance the demand for uptake of essential elements with their potential toxicity, living cells have complex regulatory mechanisms. Here, we describe a genome-wide screen to identify genes that impact the elemental composition (‘ionome’) of the yeast Saccharomyces cerevisiae. Using inductively coupled...
Identifying Stable Reference Genes for qRT-PCR Normalisation in Gene Expression Studies of Narrow-Leafed Lupin (Lupinus angustifolius L.).

PubMed

Taylor, Candy M; Jost, Ricarda; Erskine, William; Nelson, Matthew N

2016-01-01

Quantitative Reverse Transcription PCR (qRT-PCR) is currently one of the most popular, high-throughput and sensitive technologies available for quantifying gene expression. Its accurate application depends heavily upon normalisation of gene-of-interest data with reference genes that are uniformly expressed under experimental conditions. The aim of this study was to provide the first validation of reference genes for Lupinus angustifolius (narrow-leafed lupin, a significant grain legume crop) using a selection of seven genes previously trialed as reference genes for the model legume, Medicago truncatula. In a preliminary evaluation, the seven candidate reference genes were assessed on the basis of primer specificity for their respective targeted region, PCR amplification efficiency, and ability to discriminate between cDNA and gDNA. Following this assessment, expression of the three most promising candidates [Ubiquitin C (UBC), Helicase (HEL), and Polypyrimidine tract-binding protein (PTB)] was evaluated using the NormFinder and RefFinder statistical algorithms in two narrow-leafed lupin lines, both with and without vernalisation treatment, and across seven organ types (cotyledons, stem, leaves, shoot apical meristem, flowers, pods and roots) encompassing three developmental stages. UBC was consistently identified as the most stable candidate and has sufficiently uniform expression that it may be used as a sole reference gene under the experimental conditions tested here. However, as organ type and developmental stage were associated with greater variability in relative expression, it is recommended using UBC and HEL as a pair to achieve optimal normalisation. These results highlight the importance of rigorously assessing candidate reference genes for each species across a diverse range of organs and developmental stages. With emerging technologies, such as RNAseq, and the completion of valuable transcriptome data sets, it is possible that other potentially more
Identifying Stable Reference Genes for qRT-PCR Normalisation in Gene Expression Studies of Narrow-Leafed Lupin (Lupinus angustifolius L.)

PubMed Central

Erskine, William; Nelson, Matthew N.

2016-01-01

Quantitative Reverse Transcription PCR (qRT-PCR) is currently one of the most popular, high-throughput and sensitive technologies available for quantifying gene expression. Its accurate application depends heavily upon normalisation of gene-of-interest data with reference genes that are uniformly expressed under experimental conditions. The aim of this study was to provide the first validation of reference genes for Lupinus angustifolius (narrow-leafed lupin, a significant grain legume crop) using a selection of seven genes previously trialed as reference genes for the model legume, Medicago truncatula. In a preliminary evaluation, the seven candidate reference genes were assessed on the basis of primer specificity for their respective targeted region, PCR amplification efficiency, and ability to discriminate between cDNA and gDNA. Following this assessment, expression of the three most promising candidates [Ubiquitin C (UBC), Helicase (HEL), and Polypyrimidine tract-binding protein (PTB)] was evaluated using the NormFinder and RefFinder statistical algorithms in two narrow-leafed lupin lines, both with and without vernalisation treatment, and across seven organ types (cotyledons, stem, leaves, shoot apical meristem, flowers, pods and roots) encompassing three developmental stages. UBC was consistently identified as the most stable candidate and has sufficiently uniform expression that it may be used as a sole reference gene under the experimental conditions tested here. However, as organ type and developmental stage were associated with greater variability in relative expression, it is recommended using UBC and HEL as a pair to achieve optimal normalisation. These results highlight the importance of rigorously assessing candidate reference genes for each species across a diverse range of organs and developmental stages. With emerging technologies, such as RNAseq, and the completion of valuable transcriptome data sets, it is possible that other potentially more
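The normalisation against a pair of reference genes recommended above can be sketched as a delta-Ct calculation. This is a minimal illustration, not the procedure used in the study: it assumes every primer pair amplifies with the same efficiency (a factor of 2 per cycle by default), and the Ct values below are hypothetical. Under equal efficiencies, averaging the reference Ct values is equivalent to taking the geometric mean of the reference quantities.

```python
from statistics import mean


def relative_expression(ct_target: float, ct_references: list[float], efficiency: float = 2.0) -> float:
    """Expression of a target gene relative to the mean Ct of the reference genes."""
    if not ct_references:
        raise ValueError("at least one reference gene Ct is required")
    reference_ct = mean(ct_references)
    # A lower Ct means more template, so the exponent is reference minus target.
    return efficiency ** (reference_ct - ct_target)


# Hypothetical Ct values: target gene 24.0; reference genes UBC 20.0 and HEL 22.0.
print(relative_expression(24.0, [20.0, 22.0]))  # 0.125
```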
Cytotoxicity and gene induction by some essential oils in the yeast Saccharomyces cerevisiae.

PubMed

Bakkali, F; Averbeck, S; Averbeck, D; Zhiri, A; Idaomar, M

2005-08-01

In order to get an insight into the possible genotoxicity of essential oils (EOs) used in traditional pharmacological applications we tested five different oils extracted from the medicinal plants Origanum compactum, Coriandrum sativum, Artemisia herba alba, Cinnamomum camphora (Ravintsara aromatica) and Helichrysum italicum (Calendula officinalis) for genotoxic effects using the yeast Saccharomyces cerevisiae. Clear cytotoxic effects were observed in the diploid yeast strain D7, with the cells being more sensitive to EOs in exponential than in stationary growth phase. The cytotoxicity decreased in the following order: Origanum compactum>Coriandrum sativum>Artemisia herba alba>Cinnamomum camphora>Helichrysum italicum. In the same order, all EOs, except that derived from Helichrysum italicum, clearly induced cytoplasmic petite mutations indicating damage to mitochondrial DNA. However, no nuclear genetic events such as point mutations or mitotic intragenic or intergenic recombination were induced. The capacity of EOs to induce nuclear DNA damage-responsive genes was tested using suitable Lac-Z fusion strains for RNR3 and RAD51, which are genes involved in DNA metabolism and DNA repair, respectively. At equitoxic doses, all EOs demonstrated significant gene induction, approximately the same as that caused by hydrogen peroxide, but much lower than that caused by methyl methanesulfonate (MMS). EOs affect mitochondrial structure and function and can stimulate the transcriptional expression of DNA damage-responsive genes. The induction of mitochondrial damage by EOs appears to be closely linked to overall cellular cytotoxicity and appears to mask the occurrence of nuclear genetic events. EO-induced cytotoxicity involves oxidative stress, as is evident from the protection observed in the presence of ROS inhibitors such as glutathione, catalase or the iron-chelating agent deferoxamine.
A stratified transcriptomics analysis of polygenic fat and lean mouse adipose tissues identifies novel candidate obesity genes.

PubMed

Morton, Nicholas M; Nelson, Yvonne B; Michailidou, Zoi; Di Rollo, Emma M; Ramage, Lynne; Hadoke, Patrick W F; Seckl, Jonathan R; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J; Dunbar, Donald R

2011-01-01

Obesity and metabolic syndrome result from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. To enrich for adipose tissue obesity genes a 'snap-shot' pooled-sample transcriptome comparison of key fat depots and non-adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1 and Npr3) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria were characterised, revealing novel functional effects consistent with a contribution to obesity. A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity.
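
The two-stage filtering described in this abstract (adipose enrichment, then a positional QTL filter) can be sketched roughly as follows. This is only an illustration, not the authors' pipeline: the gene names, expression values, QTL intervals, and the two-fold threshold are all hypothetical.

```python
# Hypothetical sketch of a two-stage candidate filter:
# (1) adipose enrichment, (2) overlap with known QTL intervals.
# All names, values and thresholds below are made up for illustration.

def adipose_enriched(adipose, others, min_fold=2.0):
    """True if adipose expression is at least min_fold times every other tissue."""
    return adipose >= min_fold * max(others)

def in_qtl(chrom, pos, qtls):
    """True if (chrom, pos) falls inside any (chrom, start, end) interval."""
    return any(c == chrom and start <= pos <= end for c, start, end in qtls)

# gene -> (adipose expression, [muscle, liver, kidney], chromosome, position in Mb)
genes = {
    "geneA": (900.0, [100.0, 120.0, 80.0], "2", 55.0),
    "geneB": (300.0, [250.0, 310.0, 200.0], "2", 60.0),
    "geneC": (800.0, [50.0, 40.0, 60.0], "7", 10.0),
}
qtls = [("2", 50.0, 70.0), ("15", 5.0, 20.0)]

# Stage 1: keep genes enriched in adipose tissue relative to the other tissues.
enriched = {g for g, (a, o, _, _) in genes.items() if adipose_enriched(a, o)}
# Stage 2: keep enriched genes that also lie inside a known QTL interval.
positional = {g for g in enriched if in_qtl(genes[g][2], genes[g][3], qtls)}
# Genes that pass enrichment but lie outside every QTL interval.
non_qtl = enriched - positional
```

In this toy data set, `non_qtl` plays the role the abstract assigns to Gys2: a gene that passes the enrichment criteria without being a positional QTL candidate.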

Genome‐Wide MicroRNA and Gene Analysis of Mesenchymal Stem Cell Chondrogenesis Identifies an Essential Role and Multiple Targets for miR‐140‐5p

PubMed Central

Tselepi, Maria; Gómez, Rodolfo; Woods, Steven; Hui, Wang; Smith, Graham R.; Shanley, Daryl P.; Clark, Ian M.; Young, David A.

2015-01-01

MicroRNAs (miRNAs) are abundantly expressed in development where they are critical determinants of cell differentiation and phenotype. Accordingly, miRNAs are essential for normal skeletal development and chondrogenesis in particular. However, the question of which miRNAs are specific to the chondrocyte phenotype has not been fully addressed. Using microarray analysis of miRNA expression during mesenchymal stem cell chondrogenic differentiation and detailed examination of the role of essential differentiation factors, such as SOX9, TGF‐β, and the cell condensation phase, we characterize the repertoire of specific miRNAs involved in chondrocyte development, highlighting in particular miR‐140 and miR‐455. Further, with the use of mRNA microarray data we integrate miRNA expression and mRNA expression during chondrogenesis to underline the particular importance of miR‐140, especially the ‐5p strand. We provide a detailed identification and validation of direct targets of miR‐140‐5p in both chondrogenesis and adult chondrocytes with the use of microarray and 3′UTR analysis. This emphasizes the diverse array of targets and pathways regulated by miR‐140‐5p. We are also able to confirm previous experimentally identified targets but, additionally, identify a novel positive regulation of the Wnt signaling pathway by miR‐140‐5p. Wnt signaling has a complex role in chondrogenesis and skeletal development and these findings illustrate a previously unidentified role for miR‐140‐5p in regulation of Wnt signaling in these processes. Together these developments further highlight the role of miRNAs during chondrogenesis to improve our understanding of chondrocyte development and guide cartilage tissue engineering. Stem Cells 2015;33:3266–3280 PMID:26175215
Comparative transcript profiling by SuperSAGE identifies novel candidate genes for controlling potato quantitative resistance to late blight not compromised by late maturity.

PubMed

Draffehn, Astrid M; Li, Li; Krezdorn, Nicolas; Ding, Jia; Lübeck, Jens; Strahwald, Josef; Muktar, Meki S; Walkemeier, Birgit; Rotter, Björn; Gebhardt, Christiane

2013-01-01

Resistance to pathogens is essential for survival of wild and cultivated plants. Pathogen susceptibility causes major losses of crop yield and quality. Durable field resistance combined with high yield and other superior agronomic characters is therefore an important objective in every crop breeding program. Precision and efficacy of resistance breeding can be enhanced by molecular diagnostic tools, which result from knowledge of the molecular basis of resistance and susceptibility. Breeding uses resistance conferred by single R genes and polygenic quantitative resistance. The latter is partial but considered more durable. Molecular mechanisms of plant pathogen interactions are elucidated mainly in experimental systems involving single R genes, whereas most genes important for quantitative resistance in crops like potato are unknown. Quantitative resistance of potato to Phytophthora infestans causing late blight is often compromised by late plant maturity, a negative agronomic character. Our objective was to identify candidate genes for quantitative resistance to late blight not compromised by late plant maturity. We used diagnostic DNA-markers to select plants with different field levels of maturity corrected resistance (MCR) to late blight and compared their leaf transcriptomes before and after infection with P. infestans using SuperSAGE (serial analysis of gene expression) technology and next generation sequencing. We identified 2034 transcripts up or down regulated upon infection, including a homolog of the kiwi fruit allergen kiwellin. 806 transcripts showed differential expression between groups of genotypes with contrasting MCR levels. The observed expression patterns suggest that MCR is in part controlled by differential transcript levels in uninfected plants. Functional annotation suggests that, besides biotic and abiotic stress responses, general cellular processes such as photosynthesis, protein biosynthesis, and degradation play a role in MCR.
Ddx18 is essential for cell-cycle progression in zebrafish hematopoietic cells and is mutated in human AML

PubMed Central

Bolli, Niccolò; Rhodes, Jennifer; Abdel-Wahab, Omar I.; Levine, Ross; Hedvat, Cyrus V.; Stone, Richard; Khanna-Gupta, Arati; Sun, Hong; Kanki, John P.; Gazda, Hanna T.; Beggs, Alan H.; Cotter, Finbarr E.

2011-01-01

In a zebrafish mutagenesis screen to identify genes essential for myelopoiesis, we identified an insertional allele hi1727, which disrupts the gene encoding RNA helicase dead-box 18 (Ddx18). Homozygous Ddx18 mutant embryos exhibit a profound loss of myeloid and erythroid cells along with cardiovascular abnormalities and reduced size. These mutants also display prominent apoptosis and a G1 cell-cycle arrest. Loss of p53, but not Bcl-xl overexpression, rescues myeloid cells to normal levels, suggesting that the hematopoietic defect is because of p53-dependent G1 cell-cycle arrest. We then sequenced primary samples from 262 patients with myeloid malignancies because genes essential for myelopoiesis are often mutated in human leukemias. We identified 4 nonsynonymous sequence variants (NSVs) of DDX18 in acute myeloid leukemia (AML) patient samples. RNA encoding wild-type DDX18 and 3 NSVs rescued the hematopoietic defect, indicating normal DDX18 activity. RNA encoding one mutation, DDX18-E76del, was unable to rescue hematopoiesis, and resulted in reduced myeloid cell numbers in ddx18hi1727/+ embryos, indicating this NSV likely functions as a dominant-negative allele. These studies demonstrate the use of the zebrafish as a robust in vivo system for assessing the function of genes mutated in AML, which will become increasingly important as more sequence variants are identified by next-generation resequencing technologies. PMID:21653321
Flanking genes of an essential gene give information about the evolution of metazoa.

PubMed

Zimek, Alexander; Weber, Klaus

2011-04-01

We collected as much information as possible on new lamin genes and their flanking genes. The number of lamin genes varies from 1 to 4 depending more or less on the phylogenetic position of the species. Strong genome drift is recognised by fewer and unusually placed introns and a change in flanking genes. This applies to the nematode Caenorhabditis elegans, the insect Drosophila melanogaster, the urochordate Ciona intestinalis, the annelid Capitella teleta and the planaria Schmidtea mediterranea. In contrast stable genomes show astonishing conservation of the flanking genes. These are identical in the sea anemone Nematostella vectensis and the cephalochordate Branchiostoma floridae lamin B1 gene. Even in the lamin B1 genes from Xenopus tropicalis and man one of the flanking genes is conserved. Finally our analysis forms the basis for a molecular analysis of metazoan phylogeny. Copyright © 2010 Elsevier GmbH. All rights reserved.
Genomic Analyses Yield Markers for Identifying Agronomically Important Genes in Potato

USDA-ARS's Scientific Manuscript database

This study explores the genetic architecture underlying potato evolution through a comprehensive assessment of wild and cultivated potato species based on the re-sequencing of 201 accessions of Solanum section Petota with >12 × genome coverage. We identified 450 domesticated genes, which showed e...
Engineering and Functional Characterization of Fusion Genes Identifies Novel Oncogenic Drivers of Cancer.

PubMed

Lu, Hengyu; Villafane, Nicole; Dogruluk, Turgut; Grzeskowiak, Caitlin L; Kong, Kathleen; Tsang, Yiu Huen; Zagorodna, Oksana; Pantazi, Angeliki; Yang, Lixing; Neill, Nicholas J; Kim, Young Won; Creighton, Chad J; Verhaak, Roel G; Mills, Gordon B; Park, Peter J; Kucherlapati, Raju; Scott, Kenneth L

2017-07-01

Oncogenic gene fusions drive many human cancers, but tools to more quickly unravel their functional contributions are needed. Here we describe methodology permitting fusion gene construction for functional evaluation. Using this strategy, we engineered the known fusion oncogenes, BCR-ABL1, EML4-ALK, and ETV6-NTRK3, as well as 20 previously uncharacterized fusion genes identified in The Cancer Genome Atlas datasets. In addition to confirming oncogenic activity of the known fusion oncogenes engineered by our construction strategy, we validated five novel fusion genes involving MET, NTRK2, and BRAF kinases that exhibited potent transforming activity and conferred sensitivity to FDA-approved kinase inhibitors. Our fusion construction strategy also enabled domain-function studies of BRAF fusion genes. Our results confirmed other reports that the transforming activity of BRAF fusions results from truncation-mediated loss of inhibitory domains within the N-terminus of the BRAF protein. BRAF mutations residing within this inhibitory region may provide a means for BRAF activation in cancer, therefore we leveraged the modular design of our fusion gene construction methodology to screen N-terminal domain mutations discovered in tumors that are wild-type at the BRAF mutation hotspot, V600. We identified an oncogenic mutation, F247L, whose expression robustly activated the MAPK pathway and sensitized cells to BRAF and MEK inhibitors. When applied broadly, these tools will facilitate rapid fusion gene construction for subsequent functional characterization and translation into personalized treatment strategies. Cancer Res; 77(13); 3502-12. ©2017 American Association for Cancer Research.
Novel linkage disequilibrium clustering algorithm identifies new lupus genes on meta-analysis of GWAS datasets.

PubMed

Saeed, Mohammad

2017-05-01

Systemic lupus erythematosus (SLE) is a complex disorder. Genetic association studies of complex disorders suffer from the following three major issues: phenotypic heterogeneity, false positive (type I error), and false negative (type II error) results. Hence, genes with low to moderate effects are missed in standard analyses, especially after statistical corrections. OASIS is a novel linkage disequilibrium clustering algorithm that can potentially address false positives and negatives in genome-wide association studies (GWAS) of complex disorders such as SLE. OASIS was applied to two SLE dbGAP GWAS datasets (6077 subjects; ∼0.75 million single-nucleotide polymorphisms). OASIS identified three known SLE genes viz. IFIH1, TNIP1, and CD44, not previously reported using these GWAS datasets. In addition, 22 novel loci for SLE were identified and the 5 SLE genes previously reported using these datasets were verified. OASIS methodology was validated using single-variant replication and gene-based analysis with GATES. This led to the verification of 60% of OASIS loci. New SLE genes that OASIS identified and were further verified include TNFAIP6, DNAJB3, TTF1, GRIN2B, MON2, LATS2, SNX6, RBFOX1, NCOA3, and CHAF1B. This study presents the OASIS algorithm, software, and the meta-analyses of two publicly available SLE GWAS datasets along with the novel SLE genes. Hence, OASIS is a novel linkage disequilibrium clustering method that can be universally applied to existing GWAS datasets for the identification of new genes.
An unbiased approach to identify genes involved in development in a turtle with temperature-dependent sex determination.

PubMed

Chojnowski, Jena L; Braun, Edward L

2012-07-15

Many reptiles exhibit temperature-dependent sex determination (TSD). The initial cue in TSD is incubation temperature, unlike genotypic sex determination (GSD) where it is determined by the presence of specific alleles (or genetic loci). We used patterns of gene expression to identify candidates for genes with a role in TSD and other developmental processes without making a priori assumptions about the identity of these genes (ortholog-based approach). We identified genes with sexually dimorphic mRNA accumulation during the temperature sensitive period of development in the Red-eared slider turtle (Trachemys scripta), a turtle with TSD. Genes with differential mRNA accumulation in response to estrogen (estradiol-17β; E(2)) exposure and developmental stages were also identified. Sequencing 767 clones from three suppression-subtractive hybridization libraries yielded a total of 581 unique sequences. Screening a macroarray with a subset of those sequences revealed a total of 26 genes that exhibited differential mRNA accumulation: 16 female biased and 10 male biased. Additional analyses revealed that C16ORF62 (an unknown gene) and MALAT1 (a long noncoding RNA) exhibited increased mRNA accumulation at the male producing temperature relative to the female producing temperature during embryonic sexual development. Finally, we identified four genes (C16ORF62, CCT3, MMP2, and NFIB) that exhibited a stage effect and five genes (C16ORF62, CCT3, MMP2, NFIB and NOTCH2) showed a response to E(2) exposure. Here we report a survey of genes identified using patterns of mRNA accumulation during embryonic development in a turtle with TSD. Many previous studies have focused on examining the turtle orthologs of genes involved in mammalian development. Although valuable, the limitations of this approach are exemplified by our identification of two genes (MALAT1 and C16ORF62) that are sexually dimorphic during embryonic development. MALAT1 is a noncoding RNA that has not been implicated
An ant colony optimization based algorithm for identifying gene regulatory elements.

PubMed

Liu, Wei; Chen, Hanwu; Chen, Ling

2013-08-01

Identifying the regulatory elements in gene sequences is one of the most important tasks in bioinformatics. Most existing algorithms for identifying regulatory elements tend to converge to a local optimum and have high time complexity. Ant Colony Optimization (ACO) is a meta-heuristic method based on swarm intelligence and is derived from a model inspired by the collective foraging behavior of real ants. Taking advantage of ACO traits such as self-organization and robustness, this paper designs and implements an ACO based algorithm named ACRI (ant-colony-regulatory-identification) for identifying all possible transcription factor binding sites in the upstream regions of co-expressed genes. To accelerate the ants' searching process, a strategy of local optimization is presented to adjust the ants' start positions on the searched sequences. By exploiting the powerful optimization ability of ACO, the algorithm ACRI can not only improve precision of the results, but also achieve a very high speed. Experimental results on real world datasets show that ACRI can outperform other traditional algorithms in terms of speed and solution quality. Copyright © 2013 Elsevier Ltd. All rights reserved.
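
A minimal sketch of how an ant-colony search over motif start positions might look is given below. It is not the ACRI algorithm itself: the pheromone update rule, the consensus-count score, the parameter values, and the toy sequences are all assumptions made for illustration, and the local optimization of start positions mentioned in the abstract is omitted.

```python
import random

def consensus_score(seqs, starts, width):
    """Sum over motif columns of the count of the most frequent base."""
    score = 0
    for col in range(width):
        column = [s[p + col] for s, p in zip(seqs, starts)]
        score += max(column.count(b) for b in "ACGT")
    return score

def aco_motif_search(seqs, width, ants=20, iterations=50, rho=0.1, seed=0):
    """Toy ACO: one pheromone value per candidate start position per sequence."""
    rng = random.Random(seed)
    pher = [[1.0] * (len(s) - width + 1) for s in seqs]
    best_starts, best_score = None, -1
    for _ in range(iterations):
        for _ in range(ants):
            # Each ant picks one start per sequence, biased by pheromone.
            starts = [rng.choices(range(len(p)), weights=p)[0] for p in pher]
            score = consensus_score(seqs, starts, width)
            if score > best_score:
                best_starts, best_score = starts, score
        # Evaporate pheromone everywhere.
        for p in pher:
            for i in range(len(p)):
                p[i] *= 1.0 - rho
        # Reinforce the best alignment found so far (assumed update rule).
        for p, s in zip(pher, best_starts):
            p[s] += best_score / (width * len(seqs))
    return best_starts, best_score

# Hypothetical upstream sequences, each containing a planted TATAAT-like site.
seqs = ["ACGTTATAATGCA", "GGTATAATCCGTA", "CTTGATATAATGC", "TATAATGGCATCG"]
width = 6
starts, score = aco_motif_search(seqs, width)
```

The search is stochastic, so the returned alignment depends on the seed and is not guaranteed to recover the planted site; a higher `score` simply means the chosen windows agree more closely column by column.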
Similarity of markers identified from cancer gene expression studies: observations from GEO.

PubMed

Shi, Xingjie; Shen, Shihao; Liu, Jin; Huang, Jian; Zhou, Yong; Ma, Shuangge

2014-09-01

Gene expression profiling has been extensively conducted in cancer research. The analysis of multiple independent cancer gene expression datasets may provide additional information and complement single-dataset analysis. In this study, we conduct multi-dataset analysis and are interested in evaluating the similarity of cancer-associated genes identified from different datasets. The first objective of this study is to briefly review some statistical methods that can be used for such evaluation. Both marginal analysis and joint analysis methods are reviewed. The second objective is to apply those methods to 26 Gene Expression Omnibus (GEO) datasets on five types of cancers. Our analysis suggests that for the same cancer, the marker identification results may vary significantly across datasets, and different datasets share few common genes. In addition, datasets on different cancers share few common genes. The shared genetic basis of datasets on the same or different cancers, which has been suggested in the literature, is not observed in the analysis of GEO data. © The Author 2013. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
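
As a rough illustration of what "sharing few common genes" means in practice, the overlap between marker lists from different datasets can be quantified with a simple set similarity. The Jaccard index used here is a stand-in chosen for brevity, not one of the marginal or joint analysis methods reviewed in the study, and the dataset and gene names are hypothetical.

```python
from itertools import combinations

def jaccard(a, b):
    """Jaccard index of two gene lists: |intersection| / |union|."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def pairwise_overlap(marker_sets):
    """Jaccard index for every pair of datasets, keyed by (name1, name2)."""
    return {
        (x, y): jaccard(marker_sets[x], marker_sets[y])
        for x, y in combinations(sorted(marker_sets), 2)
    }

# Hypothetical marker lists identified from three datasets.
markers = {
    "dataset_A": ["G1", "G2", "G3", "G4"],
    "dataset_B": ["G3", "G4", "G5"],
    "dataset_C": ["G6", "G7"],
}
overlap = pairwise_overlap(markers)
```

Values near zero for most pairs would correspond to the low concordance the abstract reports.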
Gene Network Construction from Microarray Data Identifies a Key Network Module and Several Candidate Hub Genes in Age-Associated Spatial Learning Impairment

PubMed Central

Uddin, Raihan; Singh, Shiva M.

2017-01-01

As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in “learning and memory” related functions and pathways. Subsequent differential network analysis of this “learning and memory” module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known functions of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken
Gene Network Construction from Microarray Data Identifies a Key Network Module and Several Candidate Hub Genes in Age-Associated Spatial Learning Impairment.

PubMed

Uddin, Raihan; Singh, Shiva M

2017-01-01

As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in "learning and memory" related functions and pathways. Subsequent differential network analysis of this "learning and memory" module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known functions of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they
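
The notion of a hub gene in a weighted co-expression network can be illustrated with a small sketch. It uses the unsigned soft-threshold adjacency commonly associated with WGCNA, a_ij = |cor(x_i, x_j)|^beta, and ranks genes by summed adjacency (connectivity). The expression values, gene names, and the choice beta = 6 are assumptions for illustration and are not taken from the study.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)

def connectivity(expr, beta=6):
    """Sum of soft-thresholded adjacencies |r|**beta to all other genes."""
    genes = sorted(expr)
    k = {g: 0.0 for g in genes}
    for i, g in enumerate(genes):
        for h in genes[i + 1:]:
            a = abs(pearson(expr[g], expr[h])) ** beta
            k[g] += a
            k[h] += a
    return k

# Hypothetical expression profiles across four samples.
expr = {
    "hub":   [7.0, 5.0, 5.0, 3.0],
    "geneX": [6.0, 4.0, 6.0, 4.0],
    "geneY": [6.0, 6.0, 4.0, 4.0],
    "geneZ": [6.0, 4.0, 4.0, 6.0],
}
k = connectivity(expr)
top_gene = max(k, key=k.get)
```

Here the gene labelled `hub` correlates with both `geneX` and `geneY`, which are uncorrelated with each other, so it ends up with the highest connectivity. Comparing such connectivity values between two networks (for example young versus aged) is one simple way to think about the differential connectivity the abstract mentions.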
Use of an activated beta-catenin to identify Wnt pathway target genes in Caenorhabditis elegans, including a subset of collagen genes expressed in late larval development.

PubMed

Jackson, Belinda M; Abete-Luzi, Patricia; Krause, Michael W; Eisenmann, David M

2014-04-16

The Wnt signaling pathway plays a fundamental role during metazoan development, where it regulates diverse processes, including cell fate specification, cell migration, and stem cell renewal. Activation of the beta-catenin-dependent/canonical Wnt pathway up-regulates expression of Wnt target genes to mediate a cellular response. In the nematode Caenorhabditis elegans, a canonical Wnt signaling pathway regulates several processes during larval development; however, few target genes of this pathway have been identified. To address this deficit, we used a novel approach of conditionally activated Wnt signaling during a defined stage of larval life by overexpressing an activated beta-catenin protein, then used microarray analysis to identify genes showing altered expression compared with control animals. We identified 166 differentially expressed genes, of which 104 were up-regulated. A subset of the up-regulated genes was shown to have altered expression in mutants with decreased or increased Wnt signaling; we consider these genes to be bona fide C. elegans Wnt pathway targets. Among these was a group of six genes, including the cuticular collagen genes bli-1, col-38, col-49, and col-71. These genes show a peak of expression in the mid L4 stage during normal development, suggesting a role in adult cuticle formation. Consistent with this finding, reduction of function for several of the genes causes phenotypes suggestive of defects in cuticle function or integrity. Therefore, this work has identified a large number of putative Wnt pathway target genes during larval life, including a small subset of Wnt-regulated collagen genes that may function in synthesis of the adult cuticle.
Genomic characterization of biliary tract cancers identifies driver genes and predisposing mutations.

PubMed

Wardell, Christopher P; Fujita, Masashi; Yamada, Toru; Simbolo, Michele; Fassan, Matteo; Karlic, Rosa; Polak, Paz; Kim, Jaegil; Hatanaka, Yutaka; Maejima, Kazuhiro; Lawlor, Rita T; Nakanishi, Yoshitsugu; Mitsuhashi, Tomoko; Fujimoto, Akihiro; Furuta, Mayuko; Ruzzenente, Andrea; Conci, Simone; Oosawa, Ayako; Sasaki-Oku, Aya; Nakano, Kaoru; Tanaka, Hiroko; Yamamoto, Yujiro; Michiaki, Kubo; Kawakami, Yoshiiku; Aikata, Hiroshi; Ueno, Masaki; Hayami, Shinya; Gotoh, Kunihito; Ariizumi, Shun-Ichi; Yamamoto, Masakazu; Yamaue, Hiroki; Chayama, Kazuaki; Miyano, Satoru; Getz, Gad; Scarpa, Aldo; Hirano, Satoshi; Nakamura, Toru; Nakagawa, Hidewaki

2018-05-01

Biliary tract cancers (BTCs) are clinically and pathologically heterogeneous and respond poorly to treatment. Genomic profiling can offer a clearer understanding of their carcinogenesis, classification and treatment strategy. We performed large-scale genome sequencing analyses on BTCs to investigate their somatic and germline driver events and characterize their genomic landscape. We analyzed 412 BTC samples from Japanese and Italian populations, 107 by whole-exome sequencing (WES), 39 by whole-genome sequencing (WGS), and a further 266 samples by targeted sequencing. The subtypes were 136 intrahepatic cholangiocarcinomas (ICCs), 101 distal cholangiocarcinomas (DCCs), 109 peri-hilar type cholangiocarcinomas (PHCs), and 66 gallbladder or cystic duct cancers (GBCs/CDCs). We identified somatic alterations and searched for driver genes in BTCs, finding pathogenic germline variants of cancer-predisposing genes. We predicted cell-of-origin for BTCs by combining somatic mutation patterns and epigenetic features. We identified 32 significantly and commonly mutated genes including TP53, KRAS, SMAD4, NF1, ARID1A, PBRM1, and ATR, some of which negatively affected patient prognosis. A novel deletion of MUC17 at 7q22.1 affected patient prognosis. Cell-of-origin predictions using WGS and epigenetic features suggest hepatocyte-origin of hepatitis-related ICCs. Deleterious germline mutations of cancer-predisposing genes such as BRCA1, BRCA2, RAD51D, MLH1, or MSH2 were detected in 11% (16/146) of BTC patients. BTCs have distinct genetic features including somatic events and germline predisposition. These findings could be useful to establish treatment and diagnostic strategies for BTCs based on genetic information. We here analyzed genomic features of 412 BTC samples from Japanese and Italian populations. A total of 32 significantly and commonly mutated genes were identified, some of which negatively affected patient prognosis, including a novel deletion of MUC17 at 7q22.1. Cell
Identifying marker genes in transcription profiling data using a mixture of feature relevance experts.

PubMed

Chow, M L; Moler, E J; Mian, I S

2001-03-08

Transcription profiling experiments permit the expression levels of many genes to be measured simultaneously. Given profiling data from two types of samples, genes that most distinguish the samples (marker genes) are good candidates for subsequent in-depth experimental studies and developing decision support systems for diagnosis, prognosis, and monitoring. This work proposes a mixture of feature relevance experts as a method for identifying marker genes and illustrates the idea using published data from samples labeled as acute lymphoblastic and myeloid leukemia (ALL, AML). A feature relevance expert implements an algorithm that calculates how well a gene distinguishes samples, reorders genes according to this relevance measure, and uses a supervised learning method [here, support vector machines (SVMs)] to determine the generalization performances of different nested gene subsets. The mixture of three feature relevance experts examined implement two existing and one novel feature relevance measures. For each expert, a gene subset consisting of the top 50 genes distinguished ALL from AML samples as completely as all 7,070 genes. The 125 genes at the union of the top 50s are plausible markers for a prototype decision support system. Chromosomal aberration and other data support the prediction that the three genes at the intersection of the top 50s, cystatin C, azurocidin, and adipsin, are good targets for investigating the basic biology of ALL/AML. The same data were employed to identify markers that distinguish samples based on their labels of T cell/B cell, peripheral blood/bone marrow, and male/female. Selenoprotein W may discriminate T cells from B cells. Results from analysis of transcription profiling data from tumor/nontumor colon adenocarcinoma samples support the general utility of the aforementioned approach. Theoretical issues such as choosing SVM kernels and their parameters, training and evaluating feature relevance experts, and the impact of
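
The ranking-and-combining step described in this abstract can be sketched as follows. Each "expert" below scores how well a gene separates two sample classes, ranks the genes, and keeps its top k; the union and intersection of the experts' top lists are then formed. The three scoring functions, the toy expression data, and k = 2 are hypothetical, and the SVM evaluation of nested gene subsets is left out, so this is an outline of the idea and not the authors' method.

```python
from statistics import mean, pstdev

def mean_diff(x, y):
    """Absolute difference of class means."""
    return abs(mean(x) - mean(y))

def signal_to_noise(x, y):
    """Mean difference scaled by the summed class standard deviations."""
    denom = pstdev(x) + pstdev(y)
    return abs(mean(x) - mean(y)) / denom if denom else 0.0

def range_gap(x, y):
    """Gap between the two class ranges; 0 when the ranges overlap."""
    return max(0.0, min(x) - max(y), min(y) - max(x))

def top_k(expr, labels, score, k):
    """Rank genes by a relevance score and return the k best as a set."""
    scores = {}
    for gene, values in expr.items():
        a = [v for v, lab in zip(values, labels) if lab == "ALL"]
        b = [v for v, lab in zip(values, labels) if lab == "AML"]
        scores[gene] = score(a, b)
    ranked = sorted(scores, key=lambda g: (-scores[g], g))
    return set(ranked[:k])

# Hypothetical expression values for six samples.
labels = ["ALL", "ALL", "ALL", "AML", "AML", "AML"]
expr = {
    "geneA": [1.0, 1.2, 0.8, 5.0, 5.1, 4.9],
    "geneB": [2.0, 6.0, 1.0, 8.0, 3.0, 9.0],
    "geneC": [3.0, 3.1, 2.9, 3.5, 3.6, 3.4],
    "geneD": [4.0, 4.0, 4.0, 4.1, 4.0, 3.9],
}

experts = [mean_diff, signal_to_noise, range_gap]
tops = [top_k(expr, labels, f, k=2) for f in experts]
union = set.union(*tops)
intersection = set.intersection(*tops)
```

In the abstract's terms, `union` corresponds to the pool of plausible markers and `intersection` to the small set of genes on which all experts agree.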
Integrative strategies to identify candidate genes in rodent models of human alcoholism.

PubMed

Treadwell, Julie A

2006-01-01

The search for genes underlying alcohol-related behaviours in rodent models of human alcoholism has been ongoing for many years with only limited success. Recently, new strategies that integrate several of the traditional approaches have provided new insights into the molecular mechanisms underlying ethanol's actions in the brain. We have used alcohol-preferring C57BL/6J (B6) and alcohol-avoiding DBA/2J (D2) genetic strains of mice in an integrative strategy combining high-throughput gene expression screening, genetic segregation analysis, and mapping to previously published quantitative trait loci to uncover candidate genes for the ethanol-preference phenotype. In our study, 2 genes, retinaldehyde binding protein 1 (Rlbp1) and syntaxin 12 (Stx12), were found to be strong candidates for ethanol preference. Such experimental approaches have the power and the potential to greatly speed up the laborious process of identifying candidate genes for the animal models of human alcoholism.
Gene co-expression analysis identifies gene clusters associated with isotropic and polarized growth in Aspergillus fumigatus conidia.

PubMed

Baltussen, Tim J H; Coolen, Jordy P M; Zoll, Jan; Verweij, Paul E; Melchers, Willem J G

2018-04-26

Aspergillus fumigatus is a saprophytic fungus that extensively produces conidia. These microscopic asexually reproductive structures are small enough to reach the lungs. Germination of conidia followed by hyphal growth inside human lungs is a key step in the establishment of infection in immunocompromised patients. RNA-Seq was used to analyze the transcriptome of dormant and germinating A. fumigatus conidia. Construction of a gene co-expression network revealed four gene clusters (modules) correlated with a growth phase (dormant, isotropic growth, polarized growth). Transcript levels of genes encoding secondary metabolites were high in dormant conidia. During isotropic growth, transcript levels of genes involved in cell wall modifications increased. Two modules encoding growth and cell cycle/DNA processing were associated with polarized growth. In addition, the co-expression network was used to identify highly connected intermodular hub genes. These genes may have a pivotal role in the respective module and could therefore be compelling therapeutic targets. Generally, cell wall remodeling is an important process during isotropic and polarized growth, characterized by an increase of transcripts coding for hyphal growth and cell cycle/DNA processing when polarized growth is initiated. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Targeted sequencing identifies 91 neurodevelopmental disorder risk genes with autism and developmental disability biases

PubMed Central

Stessman, Holly A. F.; Xiong, Bo; Coe, Bradley P.; Wang, Tianyun; Hoekzema, Kendra; Fenckova, Michaela; Kvarnung, Malin; Gerdts, Jennifer; Trinh, Sandy; Cosemans, Nele; Vives, Laura; Lin, Janice; Turner, Tychele N.; Santen, Gijs; Ruivenkamp, Claudia; Kriek, Marjolein; van Haeringen, Arie; Aten, Emmelien; Friend, Kathryn; Liebelt, Jan; Barnett, Christopher; Haan, Eric; Shaw, Marie; Gecz, Jozef; Anderlid, Britt-Marie; Nordgren, Ann; Lindstrand, Anna; Schwartz, Charles; Kooy, R. Frank; Vandeweyer, Geert; Helsmoortel, Celine; Romano, Corrado; Alberti, Antonino; Vinci, Mirella; Avola, Emanuela; Giusto, Stefania; Courchesne, Eric; Pramparo, Tiziano; Pierce, Karen; Nalabolu, Srinivasa; Amaral, David; Scheffer, Ingrid E.; Delatycki, Martin B.; Lockhart, Paul J.; Hormozdiari, Fereydoun; Harich, Benjamin; Castells-Nobau, Anna; Xia, Kun; Peeters, Hilde; Nordenskjöld, Magnus; Schenck, Annette; Bernier, Raphael A.; Eichler, Evan E.

2017-01-01

Gene-disruptive mutations contribute to the biology of neurodevelopmental disorders (NDDs), but most pathogenic genes are not known. We sequenced 208 candidate genes from >11,730 patients and >2,867 controls. We report 91 genes with an excess of de novo mutations or private disruptive mutations in 5.7% of patients, including 38 novel NDD genes. Drosophila functional assays of a subset bolster their involvement in NDDs. We identify 25 genes that show a bias for autism versus intellectual disability and highlight a network associated with high-functioning autism (FSIQ>100). Clinical follow-up for NAA15, KMT5B, and ASH1L reveals novel syndromic and non-syndromic forms of disease. PMID:28191889
Tbx2/3 is an essential mediator within the Brachyury gene network during Ciona notochord development

PubMed Central

José-Edwards, Diana S.; Oda-Ishii, Izumi; Nibu, Yutaka; Di Gregorio, Anna

2013-01-01

T-box genes are potent regulators of mesoderm development in many metazoans. In chordate embryos, the T-box transcription factor Brachyury (Bra) is required for specification and differentiation of the notochord. In some chordates, including the ascidian Ciona, members of the Tbx2 subfamily of T-box genes are also expressed in this tissue; however, their regulatory relationships with Bra and their contributions to the development of the notochord remain uncharacterized. We determined that the notochord expression of Ciona Tbx2/3 (Ci-Tbx2/3) requires Ci-Bra, and identified a Ci-Tbx2/3 notochord CRM that necessitates multiple Ci-Bra binding sites for its activity. Expression of mutant forms of Ci-Tbx2/3 in the developing notochord revealed a role for this transcription factor primarily in convergent extension. Through microarray screens, we uncovered numerous Ci-Tbx2/3 targets, some of which overlap with known Ci-Bra-downstream notochord genes. Among the Ci-Tbx2/3 notochord targets are evolutionarily conserved genes, including caspases, lineage-specific genes, such as Noto4, and newly identified genes, such as MLKL. This work sheds light on a large section of the notochord regulatory circuitry controlled by T-box factors, and reveals new components of the complement of genes required for the proper formation of this structure. PMID:23674602
Tbx2/3 is an essential mediator within the Brachyury gene network during Ciona notochord development.

PubMed

José-Edwards, Diana S; Oda-Ishii, Izumi; Nibu, Yutaka; Di Gregorio, Anna

2013-06-01

T-box genes are potent regulators of mesoderm development in many metazoans. In chordate embryos, the T-box transcription factor Brachyury (Bra) is required for specification and differentiation of the notochord. In some chordates, including the ascidian Ciona, members of the Tbx2 subfamily of T-box genes are also expressed in this tissue; however, their regulatory relationships with Bra and their contributions to the development of the notochord remain uncharacterized. We determined that the notochord expression of Ciona Tbx2/3 (Ci-Tbx2/3) requires Ci-Bra, and identified a Ci-Tbx2/3 notochord CRM that necessitates multiple Ci-Bra binding sites for its activity. Expression of mutant forms of Ci-Tbx2/3 in the developing notochord revealed a role for this transcription factor primarily in convergent extension. Through microarray screens, we uncovered numerous Ci-Tbx2/3 targets, some of which overlap with known Ci-Bra-downstream notochord genes. Among the Ci-Tbx2/3 notochord targets are evolutionarily conserved genes, including caspases, lineage-specific genes, such as Noto4, and newly identified genes, such as MLKL. This work sheds light on a large section of the notochord regulatory circuitry controlled by T-box factors, and reveals new components of the complement of genes required for the proper formation of this structure.

Integrative Analysis of DNA Methylation and Gene Expression Data Identifies EPAS1 as a Key Regulator of COPD

PubMed Central

Yoo, Seungyeul; Takikawa, Sachiko; Geraghty, Patrick; Argmann, Carmen; Campbell, Joshua; Lin, Luan; Huang, Tao; Tu, Zhidong; Feronjy, Robert; Spira, Avrum; Schadt, Eric E.; Powell, Charles A.; Zhu, Jun

2015-01-01

Chronic Obstructive Pulmonary Disease (COPD) is a complex disease. Genetic, epigenetic, and environmental factors are known to contribute to COPD risk and disease progression. Therefore, we developed a systematic approach to identify key regulators of COPD that integrates genome-wide DNA methylation, gene expression, and phenotype data in lung tissue from COPD and control samples. Our integrative analysis identified 126 key regulators of COPD. We identified EPAS1 as the only key regulator whose downstream genes significantly overlapped with multiple gene sets associated with COPD disease severity. EPAS1 is distinct in comparison with other key regulators in terms of methylation profile and downstream target genes. Genes predicted to be regulated by EPAS1 were enriched for biological processes including signaling, cell communications, and system development. We confirmed that EPAS1 protein levels are lower in human COPD lung tissue compared to non-disease controls and that Epas1 gene expression is reduced in mice chronically exposed to cigarette smoke. As EPAS1 downstream genes were significantly enriched for hypoxia responsive genes in endothelial cells, we tested EPAS1 function in human endothelial cells. EPAS1 knockdown by siRNA in endothelial cells impacted genes that significantly overlapped with EPAS1 downstream genes in lung tissue including hypoxia responsive genes, and genes associated with emphysema severity. Our first integrative analysis of genome-wide DNA methylation and gene expression profiles illustrates that not only does DNA methylation play a ‘causal’ role in the molecular pathophysiology of COPD, but it can be leveraged to directly identify novel key mediators of this pathophysiology. PMID:25569234
Co-expression network analysis identified six hub genes in association with metastasis risk and prognosis in hepatocellular carcinoma

PubMed Central

Feng, Juerong; Zhou, Rui; Chang, Ying; Liu, Jing; Zhao, Qiu

2017-01-01

Hepatocellular carcinoma (HCC) has a high incidence and mortality worldwide, and its carcinogenesis and progression are influenced by a complex network of gene interactions. A weighted gene co-expression network was constructed to identify gene modules associated with the clinical traits in HCC (n = 214). Among the 13 modules, high correlation was only found between the red module and metastasis risk (classified by the HCC metastasis gene signature) (R2 = −0.74). Moreover, in the red module, 34 network hub genes for metastasis risk were identified, six of which (ABAT, AGXT, ALDH6A1, CYP4A11, DAO and EHHADH) were also hub nodes in the protein-protein interaction network of the module genes. Thus, a total of six hub genes were identified. In validation, all hub genes showed a negative correlation with the four-stage HCC progression (P for trend < 0.05) in the test set. Furthermore, in the training set, HCC samples with any hub gene lowly expressed demonstrated a higher recurrence rate and poorer survival rate (hazard ratios with 95% confidence intervals > 1). RNA-sequencing data of 142 HCC samples showed consistent results in the prognosis. Gene set enrichment analysis (GSEA) demonstrated that in the samples with any hub gene highly expressed, a total of 24 functional gene sets were enriched, most of which focused on amino acid metabolism and oxidation. In conclusion, co-expression network analysis identified six hub genes in association with HCC metastasis risk and prognosis, which might improve the prognosis by influencing amino acid metabolism and oxidation. PMID:28430663
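The hub-gene selection described in the record above (genes that are hubs in the weighted co-expression module and also hub nodes in the protein-protein interaction network) can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the toy expression values, the gene names, the soft-threshold power `beta = 6`, the hypothetical `ppi_hubs` set, and the "top n by connectivity" rule are not taken from the study, which does not report its parameters in the abstract.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def connectivity(expr, beta=6):
    """Sum of soft-thresholded adjacencies |r|**beta for each gene.

    expr maps gene name -> expression values across samples.
    beta is an assumed soft-threshold power, not a value from the study.
    """
    genes = list(expr)
    k = {g: 0.0 for g in genes}
    for i, g in enumerate(genes):
        for h in genes[i + 1:]:
            adjacency = abs(pearson(expr[g], expr[h])) ** beta
            k[g] += adjacency
            k[h] += adjacency
    return k

def top_hubs(k, n):
    """Return the n genes with the highest connectivity."""
    return set(sorted(k, key=k.get, reverse=True)[:n])

# Toy data (hypothetical): geneA, geneB and geneC are tightly co-expressed,
# geneD is only weakly correlated with the rest.
expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0],
    "geneB": [2.1, 3.9, 6.2, 8.0, 9.9],
    "geneC": [5.0, 4.1, 2.9, 2.0, 1.1],
    "geneD": [3.0, 1.0, 4.0, 1.0, 5.0],
}
ppi_hubs = {"geneA", "geneC", "geneX"}  # hypothetical PPI hub nodes

k = connectivity(expr)
coexpr_hubs = top_hubs(k, 3)
shared_hubs = coexpr_hubs & ppi_hubs  # hubs supported by both networks
print(sorted(shared_hubs))
```

Taking the intersection of the two hub sets is the design choice that mirrors the abstract: a gene is retained only if both the co-expression network and the interaction network mark it as highly connected.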
Integrative analysis of DNA methylation and gene expression data identifies EPAS1 as a key regulator of COPD.

PubMed

Yoo, Seungyeul; Takikawa, Sachiko; Geraghty, Patrick; Argmann, Carmen; Campbell, Joshua; Lin, Luan; Huang, Tao; Tu, Zhidong; Foronjy, Robert F; Feronjy, Robert; Spira, Avrum; Schadt, Eric E; Powell, Charles A; Zhu, Jun

2015-01-01

Chronic Obstructive Pulmonary Disease (COPD) is a complex disease. Genetic, epigenetic, and environmental factors are known to contribute to COPD risk and disease progression. Therefore, we developed a systematic approach to identify key regulators of COPD that integrates genome-wide DNA methylation, gene expression, and phenotype data in lung tissue from COPD and control samples. Our integrative analysis identified 126 key regulators of COPD. We identified EPAS1 as the only key regulator whose downstream genes significantly overlapped with multiple gene sets associated with COPD disease severity. EPAS1 is distinct in comparison with other key regulators in terms of methylation profile and downstream target genes. Genes predicted to be regulated by EPAS1 were enriched for biological processes including signaling, cell communications, and system development. We confirmed that EPAS1 protein levels are lower in human COPD lung tissue compared to non-disease controls and that Epas1 gene expression is reduced in mice chronically exposed to cigarette smoke. As EPAS1 downstream genes were significantly enriched for hypoxia responsive genes in endothelial cells, we tested EPAS1 function in human endothelial cells. EPAS1 knockdown by siRNA in endothelial cells impacted genes that significantly overlapped with EPAS1 downstream genes in lung tissue including hypoxia responsive genes, and genes associated with emphysema severity. Our first integrative analysis of genome-wide DNA methylation and gene expression profiles illustrates that not only does DNA methylation play a 'causal' role in the molecular pathophysiology of COPD, but it can be leveraged to directly identify novel key mediators of this pathophysiology.
Microarray analysis identified Puccinia striiformis f. sp. tritici genes involved in infection and sporulation.

USDA-ARS's Scientific Manuscript database

Puccinia striiformis f. sp. tritici (Pst) causes stripe rust, one of the most important diseases of wheat worldwide. To identify Pst genes involved in infection and sporulation, a custom oligonucleotide Genechip was made using sequences of 442 genes selected from Pst cDNA libraries. Microarray analy...
Digital gene expression profiling of flax (Linum usitatissimum L.) stem peel identifies genes enriched in fiber-bearing phloem tissue.

PubMed

Guo, Yuan; Qiu, Caisheng; Long, Songhua; Chen, Ping; Hao, Dongmei; Preisner, Marta; Wang, Hui; Wang, Yufu

2017-08-30

To better understand the molecular mechanisms and gene expression characteristics associated with the development of bast fiber cells within flax stem phloem, the gene expression profiles of flax stem peels and leaves were screened using Illumina's Digital Gene Expression (DGE) analysis. Four DGE libraries (2 for stem peel and 2 for leaf), ranging from 6.7 to 9.2 million clean reads, were obtained, which produced 7.0 million and 6.8 million mapped reads for flax stem peel and leaf, respectively. By differential gene expression analysis, a total of 975 genes, of which 708 (73%) have protein-coding annotation, were identified as phloem-enriched genes putatively involved in polysaccharide and cell wall metabolism. Differentially expressed genes (DEGs) were validated using quantitative RT-PCR; the expression patterns of all nine genes determined by qRT-PCR agreed well with those obtained by sequencing analysis. Cluster and Gene Ontology (GO) analysis revealed that a large number of genes related to metabolic process, catalytic activity and binding category were expressed predominantly in the stem peels. The Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis of the phloem-enriched genes suggested approximately 111 biological pathways. The large number of genes and pathways produced from DGE sequencing will expand our understanding of the complex molecular and cellular events in flax bast fiber development and provide a foundation for future studies on fiber development in other bast fiber crops. Copyright © 2017 Elsevier B.V. All rights reserved.
Immunity-Associated Programmed Cell Death as a Tool for the Identification of Genes Essential for Plant Innate Immunity.

PubMed

Zhou, Bangjun; Zeng, Lirong

2018-01-01

Plants have evolved a sophisticated innate immune system to contend with potential infection by various pathogens. Understanding and manipulation of key molecular mechanisms that plants use to defend against various pathogens are critical for developing novel strategies in plant disease control. In plants, resistance to attempted pathogen infection is often associated with hypersensitive response (HR), a form of rapid programmed cell death (PCD) at the site of attempted pathogen invasion. In this chapter, we describe a method for rapid identification of genes that are essential for plant innate immunity. It combines virus-induced gene silencing (VIGS), a tool that is suitable for studying gene function in high-throughput, with the utilization of immunity-associated PCD, particularly HR-linked PCD, as the readout of changes in plant innate immunity. The chapter covers the design of gene fragments for VIGS, the agroinfiltration of Nicotiana benthamiana plants, and the use of immunity-associated PCD induced by twelve elicitors as the indicator of activation of plant immunity.
Gene expression profiles of breast biopsies from healthy women identify a group with claudin-low features

PubMed Central

2011-01-01

Background Increased understanding of the variability in normal breast biology will enable us to identify mechanisms of breast cancer initiation and the origin of different subtypes, and to better predict breast cancer risk. Methods Gene expression patterns in breast biopsies from 79 healthy women referred to breast diagnostic centers in Norway were explored by unsupervised hierarchical clustering and supervised analyses, such as gene set enrichment analysis and gene ontology analysis and comparison with previously published genelists and independent datasets. Results Unsupervised hierarchical clustering identified two separate clusters of normal breast tissue based on gene-expression profiling, regardless of clustering algorithm and gene filtering used. Comparison of the expression profile of the two clusters with several published gene lists describing breast cells revealed that the samples in cluster 1 share characteristics with stromal cells and stem cells, and to a certain degree with mesenchymal cells and myoepithelial cells. The samples in cluster 1 also share many features with the newly identified claudin-low breast cancer intrinsic subtype, which also shows characteristics of stromal and stem cells. More women belonging to cluster 1 have a family history of breast cancer and there is a slight overrepresentation of nulliparous women in cluster 1. Similar findings were seen in a separate dataset consisting of histologically normal tissue from both breasts harboring breast cancer and from mammoplasty reductions. Conclusion This is the first study to explore the variability of gene expression patterns in whole biopsies from normal breasts and identified distinct subtypes of normal breast tissue. Further studies are needed to determine the specific cell contribution to the variation in the biology of normal breasts, how the clusters identified relate to breast cancer risk and their possible link to the origin of the different molecular subtypes of breast
Gene expression profiles of breast biopsies from healthy women identify a group with claudin-low features.

PubMed

Haakensen, Vilde D; Lingjaerde, Ole Christian; Lüders, Torben; Riis, Margit; Prat, Aleix; Troester, Melissa A; Holmen, Marit M; Frantzen, Jan Ole; Romundstad, Linda; Navjord, Dina; Bukholm, Ida K; Johannesen, Tom B; Perou, Charles M; Ursin, Giske; Kristensen, Vessela N; Børresen-Dale, Anne-Lise; Helland, Aslaug

2011-11-01

Increased understanding of the variability in normal breast biology will enable us to identify mechanisms of breast cancer initiation and the origin of different subtypes, and to better predict breast cancer risk. Gene expression patterns in breast biopsies from 79 healthy women referred to breast diagnostic centers in Norway were explored by unsupervised hierarchical clustering and supervised analyses, such as gene set enrichment analysis and gene ontology analysis and comparison with previously published genelists and independent datasets. Unsupervised hierarchical clustering identified two separate clusters of normal breast tissue based on gene-expression profiling, regardless of clustering algorithm and gene filtering used. Comparison of the expression profile of the two clusters with several published gene lists describing breast cells revealed that the samples in cluster 1 share characteristics with stromal cells and stem cells, and to a certain degree with mesenchymal cells and myoepithelial cells. The samples in cluster 1 also share many features with the newly identified claudin-low breast cancer intrinsic subtype, which also shows characteristics of stromal and stem cells. More women belonging to cluster 1 have a family history of breast cancer and there is a slight overrepresentation of nulliparous women in cluster 1. Similar findings were seen in a separate dataset consisting of histologically normal tissue from both breasts harboring breast cancer and from mammoplasty reductions. This is the first study to explore the variability of gene expression patterns in whole biopsies from normal breasts and identified distinct subtypes of normal breast tissue. Further studies are needed to determine the specific cell contribution to the variation in the biology of normal breasts, how the clusters identified relate to breast cancer risk and their possible link to the origin of the different molecular subtypes of breast cancer.
CRISPR/Cas9-mediated gene knockout screens and target identification via whole-genome sequencing uncover host genes required for picornavirus infection.

PubMed

Kim, Heon Seok; Lee, Kyungjin; Bae, Sangsu; Park, Jeongbin; Lee, Chong-Kyo; Kim, Meehyein; Kim, Eunji; Kim, Minju; Kim, Seokjoong; Kim, Chonsaeng; Kim, Jin-Soo

2017-06-23

Several groups have used genome-wide libraries of lentiviruses encoding small guide RNAs (sgRNAs) for genetic screens. In most cases, sgRNA expression cassettes are integrated into cells by using lentiviruses, and target genes are statistically estimated by the readout of sgRNA sequences after targeted sequencing. We present a new virus-free method for human gene knockout screens using a genome-wide library of CRISPR/Cas9 sgRNAs based on plasmids and target gene identification via whole-genome sequencing (WGS) confirmation of authentic mutations rather than statistical estimation through targeted amplicon sequencing. We used 30,840 pairs of individually synthesized oligonucleotides to construct the genome-scale sgRNA library, collectively targeting 10,280 human genes (i.e., three sgRNAs per gene). These plasmid libraries were co-transfected with a Cas9-expression plasmid into human cells, which were then treated with cytotoxic drugs or viruses. Only cells lacking key factors essential for cytotoxic drug metabolism or viral infection were able to survive. Genomic DNA isolated from cells that survived these challenges was subjected to WGS to directly identify CRISPR/Cas9-mediated causal mutations essential for cell survival. With this approach, we were able to identify known and novel genes essential for viral infection in human cells. We propose that genome-wide sgRNA screens based on plasmids coupled with WGS are powerful tools for forward genetics studies and drug target discovery. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
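The library-size figures in the record above are internally consistent, as a short check shows. This is only an arithmetic sketch of the stated numbers (30,840 oligonucleotide pairs, 10,280 genes); the variable names are chosen here for illustration.

```python
# Figures quoted in the abstract above.
oligo_pairs = 30_840      # individually synthesized oligonucleotide pairs
genes_targeted = 10_280   # human genes covered by the library

# Each oligonucleotide pair is taken to encode one sgRNA.
sgrnas_per_gene, remainder = divmod(oligo_pairs, genes_targeted)
print(sgrnas_per_gene, remainder)
```

A zero remainder means the library divides evenly, which agrees with the abstract's statement of three sgRNAs per gene.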
A reference gene set for sex pheromone biosynthesis and degradation genes from the diamondback moth, Plutella xylostella, based on genome and transcriptome digital gene expression analyses.

PubMed

He, Peng; Zhang, Yun-Fei; Hong, Duan-Yang; Wang, Jun; Wang, Xing-Liang; Zuo, Ling-Hua; Tang, Xian-Fu; Xu, Wei-Ming; He, Ming

2017-03-01

Female moths synthesize species-specific sex pheromone components and release them to attract male moths, which depend on a precise sex pheromone chemosensory system to locate females. Two types of genes involved in the sex pheromone biosynthesis and degradation pathways play essential roles in this important moth behavior. To understand the function of genes in the sex pheromone pathway, this study investigated the genome-wide and digital gene expression of sex pheromone biosynthesis and degradation genes in various adult tissues in the diamondback moth (DBM), Plutella xylostella, which is a notorious vegetable pest worldwide. A large transcriptome dataset (at least 39.04 Gb) was generated by sequencing 6 adult tissues including male antennae, female antennae, heads, legs, abdomen and female pheromone glands from DBM by using Illumina 4000 next-generation sequencing and mapping to a published DBM genome. Bioinformatics analysis yielded a total of 89,332 unigenes among which 87 transcripts were putatively related to seven gene families in the sex pheromone biosynthesis pathway. Among these, seven [two desaturases (DES), three fatty acyl-CoA reductases (FAR), one acetyltransferase (ACT) and one alcohol dehydrogenase (AD)] were mainly expressed in the pheromone glands with likely function in the three essential sex pheromone biosynthesis steps: desaturation, reduction, and esterification. We also identified 210 odorant-degradation related genes (including sex pheromone-degradation related genes) from seven major enzyme groups. Among these genes, 100 are newly identified, and two aldehyde oxidases (AOXs), one aldehyde dehydrogenase (ALDH), five carboxyl/cholinesterases (CCEs), five UDP-glycosyltransferases (UGTs), eight cytochrome P450 (CYP) and three glutathione S-transferases (GSTs) displayed more robust expression in the antennae, and thus are proposed to participate in the degradation of sex pheromone components and plant volatiles. To date, this is the most
Suppressors of systemin signaling identify genes in the tomato wound response pathway.

PubMed Central

Howe, G A; Ryan, C A

1999-01-01

In tomato plants, systemic induction of defense genes in response to herbivory or mechanical wounding is regulated by an 18-amino-acid peptide signal called systemin. Transgenic plants that overexpress prosystemin, the systemin precursor, from a 35S::prosystemin (35S::prosys) transgene exhibit constitutive expression of wound-inducible defense proteins including proteinase inhibitors and polyphenol oxidase. To study further the role of (pro)systemin in the wound response pathway, we isolated and characterized mutations that suppress 35S::prosys-mediated phenotypes. Ten recessive, extragenic suppressors were identified. Two of these define new alleles of def-1, a previously identified mutation that blocks both wound- and systemin-induced gene expression and renders plants susceptible to herbivory. The remaining mutants defined four loci designated Spr-1, Spr-2, Spr-3, and Spr-4 (for Suppressed in 35S::prosystemin-mediated responses). spr-3 and spr-4 mutants were not significantly affected in their response to either systemin or mechanical wounding. In contrast, spr-1 and spr-2 plants lacked systemic wound responses and were insensitive to systemin. These results confirm the function of (pro)systemin in the transduction of systemic wound signals and further establish that wounding, systemin, and 35S::prosys induce defensive gene expression through a common signaling pathway defined by at least three genes (Def-1, Spr-1, and Spr-2). PMID:10545469
Integrated microarray and ChIP analysis identifies multiple Foxa2 dependent target genes in the notochord.

PubMed

Tamplin, Owen J; Cox, Brian J; Rossant, Janet

2011-12-15

The node and notochord are key tissues required for patterning of the vertebrate body plan. Understanding the gene regulatory network that drives their formation and function is therefore important. Foxa2 is a key transcription factor at the top of this genetic hierarchy and finding its targets will help us to better understand node and notochord development. We performed an extensive microarray-based gene expression screen using sorted embryonic notochord cells to identify early notochord-enriched genes. We validated their specificity to the node and notochord by whole mount in situ hybridization. This provides the largest available resource of notochord-expressed genes, and therefore candidate Foxa2 target genes in the notochord. Using existing Foxa2 ChIP-seq data from adult liver, we were able to identify a set of genes expressed in the notochord that had associated regions of Foxa2-bound chromatin. Given that Foxa2 is a pioneer transcription factor, we reasoned that these sites might represent notochord-specific enhancers. Candidate Foxa2-bound regions were tested for notochord specific enhancer function in a zebrafish reporter assay and 7 novel notochord enhancers were identified. Importantly, sequence conservation or predictive models could not have readily identified these regions. Mutation of putative Foxa2 binding elements in two of these novel enhancers abrogated reporter expression and confirmed their Foxa2 dependence. The combination of highly specific gene expression profiling and genome-wide ChIP analysis is a powerful means of understanding developmental pathways, even for small cell populations such as the notochord. Copyright © 2011 Elsevier Inc. All rights reserved.
Relationship between ADD1 Gly460Trp gene polymorphism and essential hypertension in Madeira Island.

PubMed

Sousa, Ana Célia; Palma Dos Reis, Roberto; Pereira, Andreia; Borges, Sofia; Freitas, Ana Isabel; Guerra, Graça; Góis, Teresa; Rodrigues, Mariana; Henriques, Eva; Freitas, Sónia; Ornelas, Ilídio; Pereira, Décio; Brehm, António; Mendonça, Maria Isabel

2017-10-01

Essential hypertension (EH) is a complex disease in which physiological, environmental, and genetic factors are involved in its genesis. The genetic variant of the alpha-adducin gene (ADD1) has been described as a risk factor for EH, but with controversial results. The objective of this study was to evaluate the association of ADD1 (Gly460Trp) gene polymorphism with the EH risk in a population from Madeira Island. A case-control study with 1614 individuals of Caucasian origin was performed, including 817 individuals with EH and 797 controls. Cases and controls were matched for sex and age, by frequency-matching method. All participants collected blood for biochemical and genotypic analysis for the Gly460Trp polymorphism. We further investigated which variables were independently associated to EH, and, consequently, analyzed their interactions. In our study, we found a significant association between the ADD1 gene polymorphism and EH (odds ratio 2.484, P = .01). This association remained statistically significant after the multivariate analysis (odds ratio 2.548, P = .02). The ADD1 Gly460Trp gene polymorphism is significantly and independently associated with EH risk in our population. The knowledge of genetic polymorphisms associated with EH is of paramount importance because it leads to a better understanding of the etiology and pathophysiology of this pathology.
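An odds ratio such as the one reported in the record above is typically computed from a 2x2 table of genotype carriers versus non-carriers among cases and controls. The sketch below shows that calculation with a Woolf (log-based) 95% confidence interval. The genotype counts are invented for illustration: only the group sizes (817 cases, 797 controls) come from the abstract, the genetic model used by the authors is not stated there, and the result is not expected to reproduce the published odds ratio.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio with a Woolf (log-scale) confidence interval.

    a: cases carrying the risk genotype     b: cases without it
    c: controls carrying the risk genotype  d: controls without it
    z: normal quantile (1.96 for an approximate 95% interval)
    """
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper

# Hypothetical genotype counts; only the row totals match the abstract.
cases_with, cases_without = 30, 787        # 817 cases in total
controls_with, controls_without = 12, 785  # 797 controls in total

or_value, ci_low, ci_high = odds_ratio_ci(
    cases_with, cases_without, controls_with, controls_without
)
print(f"OR = {or_value:.3f}, 95% CI = ({ci_low:.3f}, {ci_high:.3f})")
```

An interval that excludes 1 corresponds to a statistically significant association at roughly the 5% level; the multivariate odds ratio quoted in the abstract would instead come from a regression model adjusting for other variables.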
Relationship between ADD1 Gly460Trp gene polymorphism and essential hypertension in Madeira Island

PubMed Central

Sousa, Ana Célia; Palma dos Reis, Roberto; Pereira, Andreia; Borges, Sofia; Freitas, Ana Isabel; Guerra, Graça; Góis, Teresa; Rodrigues, Mariana; Henriques, Eva; Freitas, Sónia; Ornelas, Ilídio; Pereira, Décio; Brehm, António; Mendonça, Maria Isabel

2017-01-01

Essential hypertension (EH) is a complex disease in which physiological, environmental, and genetic factors are involved in its genesis. The genetic variant of the alpha-adducin gene (ADD1) has been described as a risk factor for EH, but with controversial results. The objective of this study was to evaluate the association of ADD1 (Gly460Trp) gene polymorphism with the EH risk in a population from Madeira Island. A case-control study with 1614 individuals of Caucasian origin was performed, including 817 individuals with EH and 797 controls. Cases and controls were matched for sex and age, by frequency-matching method. All participants collected blood for biochemical and genotypic analysis for the Gly460Trp polymorphism. We further investigated which variables were independently associated to EH, and, consequently, analyzed their interactions. In our study, we found a significant association between the ADD1 gene polymorphism and EH (odds ratio 2.484, P = .01). This association remained statistically significant after the multivariate analysis (odds ratio 2.548, P = .02). The ADD1 Gly460Trp gene polymorphism is significantly and independently associated with EH risk in our population. The knowledge of genetic polymorphisms associated with EH is of paramount importance because it leads to a better understanding of the etiology and pathophysiology of this pathology. PMID:29049185
Anther-preferential expressing gene PMR is essential for the mitosis of pollen development in rice.

PubMed

Liu, Yaqin; Xu, Ya; Ling, Sheng; Liu, Shasha; Yao, Jialing

2017-06-01

Phenotype identification, expression examination, and function prediction indicated that the anther-preferential expressing gene PMR may participate in regulation of male gametophyte development in rice. Male germline development in flowering plants produces the pair of sperm cells for double fertilization, and pollen mitosis is a key process in it. Although the structural features of the male gametophyte have been defined, the molecular mechanisms regulating the mitotic cell cycle are not well elucidated in rice. Here, we report an anther-preferential expressing gene in rice, PMR (Pollen Mitosis Relative), that plays an essential role in male gametogenesis. When the PMR gene was suppressed via RNAi, microspore mitosis was severely impaired, and the plants formed immature pollen containing only one or two nuclei at anthesis, ultimately leading to a serious reduction in pollen fertility and seed setting. The CRISPR mutants, pmr-1 and pmr-2, both showed defects similar to those of the PMR-RNAi lines. Further analysis revealed that PMR, together with its co-expressed genes, likely participates in the regulation of DNA metabolism in the nucleus and affects the activities of some enzymes related to the cell cycle. Finally, we discuss how the unknown protein PMR contains PHD, SWIB and Plus-3 domains, which might have coordinating functions in the regulatory pathway of pollen mitosis in rice.
Essential Oils Modulate Gene Expression and Ochratoxin A Production in Aspergillus carbonarius.

PubMed

El Khoury, Rachelle; Atoui, Ali; Verheecke, Carol; Maroun, Richard; El Khoury, Andre; Mathieu, Florence

2016-08-19

Ochratoxin A (OTA) is a mycotoxin, mainly produced on grapes by Aspergillus carbonarius, that causes massive health problems for humans. This study aims to reduce the occurrence of OTA by using the following ten essential oils (E.Os): fennel, cardamom, anise, chamomile, celery, cinnamon, thyme, taramira, oregano and rosemary at 1 µL/mL and 5 µL/mL for each E.O. Their effects on OTA production and on the growth of A. carbonarius S402 cultures were evaluated after four days at 28 °C on a Synthetic Grape Medium (SGM). Results showed that A. carbonarius growth was reduced by up to 100% when cultured with the E.Os of cinnamon, taramira, and oregano at both concentrations and with thyme at 5 µL/mL. As for the other six E.Os, their effect on A. carbonarius growth was insignificant, but their effect on OTA production was substantial. Interestingly, the fennel E.O at 5 µL/mL reduced OTA production by up to 88.9% compared to the control, with only 13.8% of fungal growth reduction. We further investigated the effect of these E.Os on the expression levels of the genes responsible for OTA biosynthesis (acOTApks and acOTAnrps along with the acpks gene) as well as the two regulatory genes laeA and veA, using the quantitative Reverse Transcription-Polymerase Chain Reaction (qRT-PCR) method. The results revealed that these six E.Os reduced the expression of the five studied genes, where acpks was downregulated by 99.2% (the highest downregulation in this study) with 5 µL/mL of fennel E.O. As for acOTApks, acOTAnrps, veA and laeA, their reduction levels ranged between 10% and 96% depending on the nature of the E.O and its concentration in the medium.
Pinus Roxburghii essential oil anticancer activity and chemical composition evaluation

PubMed Central

Sajid, Arfaa; Manzoor, Qaisar; Iqbal, Munawar; Tyagi, Amit Kumar; Sarfraz, Raja Adil; Sajid, Anam

2018-01-01

The present study was conducted to appraise the anticancer activity of Pinus roxburghii essential oil along with chemical composition evaluation. MTT assay revealed cytotoxicity induction in colon, leukemia, multiple myeloma, pancreatic, head and neck and lung cancer cells exposed to essential oil. Cancer cell death was also observed through live/dead cell viability assay and FACS analysis. Apoptosis induced by essential oil was confirmed by cleavage of PARP and caspase-3 that suppressed the colony-forming ability of tumor cells and 50 % inhibition occurred at a dose of 25 μg/mL. Moreover, essential oil inhibited the activation of inflammatory transcription factor NF-κB and inhibited expression of NF-κB regulated gene products linked to cell survival (survivin, c-FLIP, Bcl-2, Bcl-xL, c-Myc, c-IAP2), proliferation (Cyclin D1) and metastasis (MMP-9). P. roxburghii essential oil has considerable anticancer activity and could be used as anticancer agent, which needs further investigation to identify and purify the bioactive compounds followed by in vivo studies. PMID:29743861
Genetic regulation of gene expression in the lung identifies CST3 and CD22 as potential causal genes for airflow obstruction.

PubMed

Lamontagne, Maxime; Timens, Wim; Hao, Ke; Bossé, Yohan; Laviolette, Michel; Steiling, Katrina; Campbell, Joshua D; Couture, Christian; Conti, Massimo; Sherwood, Karen; Hogg, James C; Brandsma, Corry-Anke; van den Berge, Maarten; Sandford, Andrew; Lam, Stephen; Lenburg, Marc E; Spira, Avrum; Paré, Peter D; Nickle, David; Sin, Don D; Postma, Dirkje S

2014-11-01

COPD is a complex chronic disease with poorly understood pathogenesis. Integrative genomic approaches have the potential to elucidate the biological networks underlying COPD and lung function. We recently combined genome-wide genotyping and gene expression in 1111 human lung specimens to map expression quantitative trait loci (eQTL). We aimed to determine causal associations between COPD and lung function-associated single nucleotide polymorphisms (SNPs) and lung tissue gene expression changes in our lung eQTL dataset. We evaluated causality between SNPs and gene expression for three COPD phenotypes: FEV(1)% predicted, FEV(1)/FVC and COPD as a categorical variable. Different models were assessed in the three cohorts independently and in a meta-analysis. SNPs associated with a COPD phenotype and gene expression were subjected to causal pathway modelling and manual curation. In silico analyses evaluated functional enrichment of biological pathways among newly identified causal genes. Biologically relevant causal genes were validated in two separate gene expression datasets of lung tissues and bronchial airway brushings. High reliability causal relations were found in SNP-mRNA-phenotype triplets for FEV(1)% predicted (n=169) and FEV(1)/FVC (n=80). Several genes of potential biological relevance for COPD were revealed. eQTL-SNPs upregulating cystatin C (CST3) and CD22 were associated with worse lung function. Signalling pathways enriched with causal genes included xenobiotic metabolism, apoptosis, protease-antiprotease and oxidant-antioxidant balance. By using integrative genomics and analysing the relationships of COPD phenotypes with SNPs and gene expression in lung tissue, we identified CST3 and CD22 as potential causal genes for airflow obstruction. This study also augmented the understanding of previously described COPD pathways. Published by the BMJ Publishing Group Limited.
A novel approach to identify genes that determine grain protein deviation in cereals.

PubMed

Mosleth, Ellen F; Wan, Yongfang; Lysenko, Artem; Chope, Gemma A; Penson, Simon P; Shewry, Peter R; Hawkesford, Malcolm J

2015-06-01

Grain yield and protein content were determined for six wheat cultivars grown over 3 years at multiple sites and at multiple nitrogen (N) fertilizer inputs. Although grain protein content was negatively correlated with yield, some grain samples had higher protein contents than expected based on their yields, a trait referred to as grain protein deviation (GPD). We used novel statistical approaches to identify gene transcripts significantly related to GPD across environments. The yield and protein content were initially adjusted for nitrogen fertilizer inputs and then adjusted for yield (to remove the negative correlation with protein content), resulting in a parameter termed corrected GPD. Significant genetic variation in corrected GPD was observed for six cultivars grown over a range of environmental conditions (a total of 584 samples). Gene transcript profiles were determined in a subset of 161 samples of developing grain to identify transcripts contributing to GPD. Principal component analysis (PCA), analysis of variance (ANOVA) and means of scores regression (MSR) were used to identify individual principal components (PCs) correlating with GPD alone. Scores of the selected PCs, which were significantly related to GPD and protein content but not to the yield and significantly affected by cultivar, were identified as reflecting a multivariate pattern of gene expression related to genetic variation in GPD. Transcripts with consistent variation along the selected PCs were identified by an approach hereby called one-block means of scores regression (one-block MSR). © 2014 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
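The central idea of grain protein deviation, protein content higher or lower than expected from yield, can be sketched as the residual from a straight-line regression of protein on yield. This is only a minimal illustration under stated assumptions. The yield and protein values below are made up, the helper names `fit_line` and `protein_deviation` are hypothetical, and the sketch omits the prior adjustment for nitrogen input as well as the PCA, ANOVA and MSR steps described in the abstract.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def protein_deviation(yields, proteins):
    """Residual protein content after removing the linear trend with yield."""
    slope, intercept = fit_line(yields, proteins)
    return [p - (intercept + slope * y) for y, p in zip(yields, proteins)]

# MADE-UP data: yield (t/ha) and grain protein (%) for five samples.
yields = [6.0, 7.0, 8.0, 9.0, 10.0]
proteins = [13.0, 12.6, 11.8, 11.9, 10.7]

slope, intercept = fit_line(yields, proteins)
gpd = protein_deviation(yields, proteins)
for y, p, dev in zip(yields, proteins, gpd):
    print(f"yield={y:4.1f}  protein={p:4.1f}  deviation={dev:+.2f}")
```

A positive deviation marks a sample with more protein than its yield would predict. In this toy data set the slope is negative, mirroring the negative protein-yield correlation the abstract reports.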

Immunogenetic mechanisms leading to thyroid autoimmunity: recent advances in identifying susceptibility genes and regions.

PubMed

Brand, Oliver J; Gough, Stephen C L

2011-12-01

The autoimmune thyroid diseases (AITD) include Graves' disease (GD) and Hashimoto's thyroiditis (HT), which are characterised by a breakdown in immune tolerance to thyroid antigens. Unravelling the genetic architecture of AITD is vital to better understanding of AITD pathogenesis, required to advance therapeutic options in both disease management and prevention. The early whole-genome linkage and candidate gene association studies provided the first evidence that the HLA region and CTLA-4 represented AITD risk loci. Recent improvements in high throughput genotyping technologies, collection of larger disease cohorts and cataloguing of genome-scale variation have facilitated genome-wide association studies and more thorough screening of candidate gene regions. This has allowed identification of many novel AITD risk genes and more detailed association mapping. The growing number of confirmed AITD susceptibility loci implicates a number of putative disease mechanisms, most of which are tightly linked with aspects of immune system function. The unprecedented advances in genetic study will allow future studies to identify further novel disease risk genes and to identify aetiological variants within specific gene regions, which will undoubtedly lead to a better understanding of AITD patho-physiology.
Immunogenetic Mechanisms Leading to Thyroid Autoimmunity: Recent Advances in Identifying Susceptibility Genes and Regions

PubMed Central

Brand, Oliver J; Gough, Stephen C.L

2011-01-01

The autoimmune thyroid diseases (AITD) include Graves’ disease (GD) and Hashimoto’s thyroiditis (HT), which are characterised by a breakdown in immune tolerance to thyroid antigens. Unravelling the genetic architecture of AITD is vital to better understanding of AITD pathogenesis, required to advance therapeutic options in both disease management and prevention. The early whole-genome linkage and candidate gene association studies provided the first evidence that the HLA region and CTLA-4 represented AITD risk loci. Recent improvements in high throughput genotyping technologies, collection of larger disease cohorts and cataloguing of genome-scale variation have facilitated genome-wide association studies and more thorough screening of candidate gene regions. This has allowed identification of many novel AITD risk genes and more detailed association mapping. The growing number of confirmed AITD susceptibility loci implicates a number of putative disease mechanisms, most of which are tightly linked with aspects of immune system function. The unprecedented advances in genetic study will allow future studies to identify further novel disease risk genes and to identify aetiological variants within specific gene regions, which will undoubtedly lead to a better understanding of AITD patho-physiology. PMID:22654554
Estrogen-related receptor α is essential for the expression of antioxidant protection genes and mitochondrial function

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rangwala, Shamina M.; Li, Xiaoyan; Lindsley, Loren

2007-05-25

Estrogen-related receptor α (ERRα) is an important mediator of mitochondrial biogenesis and function. To investigate the transcriptional network controlling these phenomena, we investigated mitochondrial gene expression in embryonic fibroblasts isolated from ERRα null mice. Peroxisome proliferator-activated receptor γ coactivator-1α (PGC-1α) stimulated mitochondrial gene expression program in control cells, but not in the ERRα null cells. Interestingly, the induction of levels of mitochondrial oxidative stress protection genes in response to increased PGC-1α levels was dependent on ERRα. Furthermore, we found that the PGC-1α-mediated induction of estrogen-related receptor γ and nuclear respiratory factor 2 (NRF-2), was dependent on the presence of ERRα. Basal levels of NRF-2 were decreased in the absence of ERRα. The absence of ERRα resulted in a decrease in citrate synthase enzyme activity in response to PGC-1α overexpression. Our results indicate an essential role for ERRα as a key regulator of oxidative metabolism.
A Stratified Transcriptomics Analysis of Polygenic Fat and Lean Mouse Adipose Tissues Identifies Novel Candidate Obesity Genes

PubMed Central

Morton, Nicholas M.; Nelson, Yvonne B.; Michailidou, Zoi; Di Rollo, Emma M.; Ramage, Lynne; Hadoke, Patrick W. F.; Seckl, Jonathan R.; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J.; Dunbar, Donald R.

2011-01-01

Background Obesity and metabolic syndrome result from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. Results To enrich for adipose tissue obesity genes a ‘snap-shot’ pooled-sample transcriptome comparison of key fat depots and non adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1 and Npr3) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding, which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria, were characterised, revealing novel functional effects consistent with a contribution to obesity. Conclusions A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes
Gene panel sequencing in familial breast/ovarian cancer patients identifies multiple novel mutations also in genes others than BRCA1/2.

PubMed

Kraus, Cornelia; Hoyer, Juliane; Vasileiou, Georgia; Wunderle, Marius; Lux, Michael P; Fasching, Peter A; Krumbiegel, Mandy; Uebe, Steffen; Reuter, Miriam; Beckmann, Matthias W; Reis, André

2017-01-01

Breast and ovarian cancer (BC/OC) predisposition has been attributed to a number of high- and moderate to low-penetrance susceptibility genes. With the advent of next generation sequencing (NGS) simultaneous testing of these genes has become feasible. In this monocentric study, we report results of panel-based screening of 14 BC/OC susceptibility genes (BRCA1, BRCA2, RAD51C, RAD51D, CHEK2, PALB2, ATM, NBN, CDH1, TP53, MLH1, MSH2, MSH6 and PMS2) in a group of 581 consecutive individuals from a German population with BC and/or OC fulfilling diagnostic criteria for BRCA1 and BRCA2 testing including 179 with a triple-negative tumor. Altogether we identified 106 deleterious mutations in 105 (18%) patients in 10 different genes, including seven different exon deletions. Of these 106 mutations, 16 (15%) were novel and only six were found in BRCA1/2. To further characterize mutations located in or nearby splicing consensus sites we performed RT-PCR analysis which allowed confirmation of pathogenicity in 7 of 9 mutations analyzed. In PALB2, we identified a deleterious variant in six cases. All but one were associated with early onset BC and a positive family history indicating that penetrance for PALB2 mutations is comparable to BRCA2. Overall, extended testing beyond BRCA1/2 identified a deleterious mutation in further 6% of patients. As a downside, 89 variants of uncertain significance were identified highlighting the need for comprehensive variant databases. In conclusion, panel testing yields more accurate information on genetic cancer risk than assessing BRCA1/2 alone and wide-spread testing will help improve penetrance assessment of variants in these risk genes. © 2016 UICC.
Genome-Wide association study identifies candidate genes for Parkinson's disease in an Ashkenazi Jewish population

PubMed Central

2011-01-01

Background To date, nine Parkinson disease (PD) genome-wide association studies in North American, European and Asian populations have been published. The majority of studies have confirmed the association of the previously identified genetic risk factors, SNCA and MAPT, and two studies have identified three new PD susceptibility loci/genes (PARK16, BST1 and HLA-DRB5). In a recent meta-analysis of datasets from five of the published PD GWAS an additional 6 novel candidate genes (SYT11, ACMSD, STK39, MCCC1/LAMP3, GAK and CCDC62/HIP1R) were identified. Collectively the associations identified in these GWAS account for only a small proportion of the estimated total heritability of PD suggesting that an 'unknown' component of the genetic architecture of PD remains to be identified. Methods We applied a GWAS approach to a relatively homogeneous Ashkenazi Jewish (AJ) population from New York to search for both 'rare' and 'common' genetic variants that confer risk of PD by examining any SNPs with allele frequencies exceeding 2%. We have focused on a genetic isolate, the AJ population, as a discovery dataset since this cohort has a higher sharing of genetic background and historically experienced a significant bottleneck. We also conducted a replication study using two publicly available datasets from dbGaP. The joint analysis dataset had a combined sample size of 2,050 cases and 1,836 controls. Results We identified the top 57 SNPs showing the strongest evidence of association in the AJ dataset (p < 9.9 × 10(-5)). Six SNPs located within gene regions had positive signals in at least one other independent dbGaP dataset: LOC100505836 (Chr3p24), LOC153328/SLC25A48 (Chr5q31.1), UNC13B (9p13.3), SLCO3A1 (15q26.1), WNT3 (17q21.3) and NSF (17q21.3). We also replicated published associations for the gene regions SNCA (Chr4q21; rs3775442, p = 0.037), PARK16 (Chr1q32.1; rs823114 (NUCKS1), p = 6.12 × 10(-4)), BST1 (Chr4p15; rs12502586, p = 0.027), STK39 (Chr2q24.3; rs3754775, p = 0
De novo Transcriptome Analysis of Miscanthus lutarioriparius Identifies Candidate Genes in Rhizome Development

PubMed Central

Hu, Ruibo; Yu, Changjiang; Wang, Xiaoyu; Jia, Chunlin; Pei, Shengqiang; He, Kang; He, Guo; Kong, Yingzhen; Zhou, Gongke

2017-01-01

HIGHLIGHT De novo transcriptome profiling of five tissues reveals candidate genes putatively involved in rhizome development in M. lutarioriparius. Miscanthus lutarioriparius is a promising lignocellulosic feedstock for second-generation bioethanol production. However, the genomic resources for this species are relatively limited, which hampers our understanding of the molecular mechanisms underlying many important biological processes. In this study, we performed the first de novo transcriptome analysis of five tissues (leaf, stem, root, lateral bud and rhizome bud) of M. lutarioriparius with an emphasis on identifying putative genes involved in rhizome development. Approximately 66 gigabases (Gb) of paired-end clean reads were obtained and assembled into 169,064 unigenes with an average length of 759 bp. Among these unigenes, 103,899 (61.5%) were annotated in seven public protein databases. Differential gene expression profiling analysis revealed that 4,609, 3,188, 1,679, 1,218, and 1,077 genes were predominantly expressed in root, leaf, stem, lateral bud, and rhizome bud, respectively. Their expression patterns were further classified into 12 distinct clusters. Pathway enrichment analysis revealed that genes predominantly expressed in rhizome bud were mainly involved in primary metabolism and hormone signaling and transduction pathways. Notably, 19 transcription factors (TFs) and 16 hormone signaling pathway-related genes were identified as predominantly expressed in rhizome bud compared with the other tissues, suggesting putative roles in rhizome formation and development. In addition, a predictive regulatory network was constructed between four TFs and six auxin- and abscisic acid (ABA)-related genes. Furthermore, the expression of 24 rhizome-specific genes was validated by quantitative real-time RT-PCR (qRT-PCR) analysis. Taken together, this study provides a global portrait of gene expression across five different tissues and reveals preliminary insights
Functional screen of MSI2 interactors identifies an essential role for SYNCRIP in myeloid leukemia stem cells

PubMed Central

Vu, Ly P.; Prieto, Camila; Amin, Elianna M.; Chhangawala, Sagar; Krivtsov, Andrei; Calvo-Vidal, M. Nieves; Chou, Timothy; Chow, Arthur; Minuesa, Gerard; Park, Sun Mi; Barlowe, Trevor S.; Taggart, James; Tivnan, Patrick; Deering, Raquel P.; Chu, Lisa P; Kwon, Jeong-Ah; Meydan, Cem; Perales-Paton, Javier; Arshi, Arora; Gönen, Mithat; Famulare, Christopher; Patel, Minal; Paietta, Elisabeth; Tallman, Martin S.; Lu, Yuheng; Glass, Jacob; Garret-Bakelman, Francine; Melnick, Ari; Levine, Ross; Al-Shahrour, Fatima; Järås, Marcus; Hacohen, Nir; Hwang, Alexia; Garippa, Ralph; Lengner, Christopher J.; Armstrong, Scott A; Cerchietti, Leandro; Cowley, Glenn S; Root, David; Doench, John; Leslie, Christina; Ebert, Benjamin L; Kharas, Michael G.

2017-01-01

The identity of the RNA binding proteins (RBPs) that govern cancer stem cells remains poorly characterized. The MSI2 RBP is a central regulator of translation of cancer stem cell programs. Through proteomics analysis of the MSI2 interacting RBP network and functional shRNA screening, we identified 24 genes required for in vivo leukemia, and SYNCRIP was the most differentially required gene between normal and myeloid leukemia cells. SYNCRIP depletion increased apoptosis and differentiation while delaying leukemogenesis. Gene expression profiling of SYNCRIP depleted cells demonstrated a loss of the MLL and HOXA9 leukemia stem cell gene associated program. SYNCRIP and MSI2 interact indirectly through shared mRNA targets. SYNCRIP maintains HOXA9 translation, and MSI2 or HOXA9 overexpression rescued the effects of SYNCRIP depletion. We validated SYNCRIP as a novel RBP that controls the myeloid leukemia stem cell program and propose that targeting these functional complexes might provide a novel therapeutic strategy in leukemia. PMID:28436985
Blood pressure loci identified with a gene-centric array.

PubMed

Johnson, Toby; Gaunt, Tom R; Newhouse, Stephen J; Padmanabhan, Sandosh; Tomaszewski, Maciej; Kumari, Meena; Morris, Richard W; Tzoulaki, Ioanna; O'Brien, Eoin T; Poulter, Neil R; Sever, Peter; Shields, Denis C; Thom, Simon; Wannamethee, Sasiwarang G; Whincup, Peter H; Brown, Morris J; Connell, John M; Dobson, Richard J; Howard, Philip J; Mein, Charles A; Onipinla, Abiodun; Shaw-Hawkins, Sue; Zhang, Yun; Davey Smith, George; Day, Ian N M; Lawlor, Debbie A; Goodall, Alison H; Fowkes, F Gerald; Abecasis, Gonçalo R; Elliott, Paul; Gateva, Vesela; Braund, Peter S; Burton, Paul R; Nelson, Christopher P; Tobin, Martin D; van der Harst, Pim; Glorioso, Nicola; Neuvrith, Hani; Salvi, Erika; Staessen, Jan A; Stucchi, Andrea; Devos, Nabila; Jeunemaitre, Xavier; Plouin, Pierre-François; Tichet, Jean; Juhanson, Peeter; Org, Elin; Putku, Margus; Sõber, Siim; Veldre, Gudrun; Viigimaa, Margus; Levinsson, Anna; Rosengren, Annika; Thelle, Dag S; Hastie, Claire E; Hedner, Thomas; Lee, Wai K; Melander, Olle; Wahlstrand, Björn; Hardy, Rebecca; Wong, Andrew; Cooper, Jackie A; Palmen, Jutta; Chen, Li; Stewart, Alexandre F R; Wells, George A; Westra, Harm-Jan; Wolfs, Marcel G M; Clarke, Robert; Franzosi, Maria Grazia; Goel, Anuj; Hamsten, Anders; Lathrop, Mark; Peden, John F; Seedorf, Udo; Watkins, Hugh; Ouwehand, Willem H; Sambrook, Jennifer; Stephens, Jonathan; Casas, Juan-Pablo; Drenos, Fotios; Holmes, Michael V; Kivimaki, Mika; Shah, Sonia; Shah, Tina; Talmud, Philippa J; Whittaker, John; Wallace, Chris; Delles, Christian; Laan, Maris; Kuh, Diana; Humphries, Steve E; Nyberg, Fredrik; Cusi, Daniele; Roberts, Robert; Newton-Cheh, Christopher; Franke, Lude; Stanton, Alice V; Dominiczak, Anna F; Farrall, Martin; Hingorani, Aroon D; Samani, Nilesh J; Caulfield, Mark J; Munroe, Patricia B

2011-12-09

Raised blood pressure (BP) is a major risk factor for cardiovascular disease. Previous studies have identified 47 distinct genetic variants robustly associated with BP, but collectively these explain only a few percent of the heritability for BP phenotypes. To find additional BP loci, we used a bespoke gene-centric array to genotype an independent discovery sample of 25,118 individuals that combined hypertensive case-control and general population samples. We followed up four SNPs associated with BP at our p < 8.56 × 10(-7) study-specific significance threshold and six suggestively associated SNPs in a further 59,349 individuals. We identified and replicated a SNP at LSP1/TNNT3, a SNP at MTHFR-NPPB independent (r(2) = 0.33) of previous reports, and replicated SNPs at AGT and ATP2B1 reported previously. An analysis of combined discovery and follow-up data identified SNPs significantly associated with BP at p < 8.56 × 10(-7) at four further loci (NPR3, HFE, NOS3, and SOX6). The high number of discoveries made with modest genotyping effort can be attributed to using a large-scale yet targeted genotyping array and to the development of a weighting scheme that maximized power when meta-analyzing results from samples ascertained with extreme phenotypes, in combination with results from nonascertained or population samples. Chromatin immunoprecipitation and transcript expression data highlight potential gene regulatory mechanisms at the MTHFR and NOS3 loci. These results provide candidates for further study to help dissect mechanisms affecting BP and highlight the utility of studying SNPs and samples that are independent of those studied previously even when the sample size is smaller than that in previous studies. Copyright © 2011 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Methods to identify and analyze gene products involved in neuronal intracellular transport using Drosophila

PubMed Central

Neisch, Amanda L.; Avery, Adam W.; Machame, James B.; Li, Min-gang; Hays, Thomas S.

2017-01-01

Proper neuronal function critically depends on efficient intracellular transport and disruption of transport leads to neurodegeneration. Molecular pathways that support or regulate neuronal transport are not fully understood. A greater understanding of these pathways will help reveal the pathological mechanisms underlying disease. Drosophila melanogaster is the premier model system for performing large-scale genetic functional screens. Here we describe methods to carry out primary and secondary genetic screens in Drosophila aimed at identifying novel gene products and pathways that impact neuronal intracellular transport. These screens are performed using whole animal or live cell imaging of intact neural tissue to ensure integrity of neurons and their cellular environment. The primary screen is used to identify gross defects in neuronal function indicative of a disruption in microtubule-based transport. The secondary screens, conducted in both motoneurons and dendritic arborization neurons, will confirm the function of candidate gene products in intracellular transport. Together, the methodologies described here will support labs interested in identifying and characterizing gene products that alter intracellular transport in Drosophila. PMID:26794520
Discrimination of germline V genes at different sequencing lengths and mutational burdens: A new tool for identifying and evaluating the reliability of V gene assignment.

PubMed

Zhang, Bochao; Meng, Wenzhao; Prak, Eline T Luning; Hershberg, Uri

2015-12-01

Immune repertoires are collections of lymphocytes that express diverse antigen receptor gene rearrangements consisting of Variable (V), (Diversity (D) in the case of heavy chains) and Joining (J) gene segments. Clonally related cells typically share the same germline gene segments and have highly similar junctional sequences within their third complementarity determining regions. Identifying clonal relatedness of sequences is a key step in the analysis of immune repertoires. The V gene is the most important for clone identification because it has the longest sequence and the greatest number of sequence variants. However, accurate identification of a clone's germline V gene source is challenging because there is a high degree of similarity between different germline V genes. This difficulty is compounded in antibodies, which can undergo somatic hypermutation. Furthermore, high-throughput sequencing experiments often generate partial sequences and have significant error rates. To address these issues, we describe a novel method to estimate which germline V genes (or alleles) cannot be discriminated under different conditions (read lengths, sequencing errors or somatic hypermutation frequencies). Starting with any set of germline V genes, this method measures their similarity using different sequencing lengths and calculates their likelihood of unambiguous assignment under different levels of mutation. Hence, one can identify, under different experimental and biological conditions, the germline V genes (or alleles) that cannot be uniquely identified and bundle them together into groups of specific V genes with highly similar sequences. Copyright © 2015 Elsevier B.V. All rights reserved.
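The bundling idea in the abstract above can be illustrated with a much simpler stand-in for the authors' method. The sketch takes aligned germline sequences, looks only at the part a read of a given length would cover, and groups together genes whose covered regions differ by no more than a tolerated number of mismatches. Everything here is an assumption made for illustration. The three toy sequences are invented, the function names are hypothetical, reads are assumed to cover the 3' end of the V gene, and a plain Hamming-distance cutoff replaces the likelihood calculation the paper describes.

```python
from itertools import combinations

def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))

def bundle_genes(germlines, read_length, max_mismatches=0):
    """Group germline genes that cannot be told apart at this read length.

    germlines: dict of name -> aligned sequence (all the same length).
    Two genes are linked when the last `read_length` bases differ by at most
    `max_mismatches`; linked genes are merged into one group (union-find).
    """
    names = sorted(germlines)
    parent = {name: name for name in names}

    def find(name):
        while parent[name] != name:
            parent[name] = parent[parent[name]]  # path compression
            name = parent[name]
        return name

    for g1, g2 in combinations(names, 2):
        covered1 = germlines[g1][-read_length:]
        covered2 = germlines[g2][-read_length:]
        if hamming(covered1, covered2) <= max_mismatches:
            parent[find(g1)] = find(g2)

    groups = {}
    for name in names:
        groups.setdefault(find(name), []).append(name)
    return sorted(groups.values())

# INVENTED toy germline sequences, already aligned and of equal length.
toy_germlines = {
    "V1": "ACGTACGTACGT",
    "V2": "TCGTACGTACGT",  # differs from V1 only at the first base
    "V3": "ACGTACGTTCGA",  # differs from V1 at two bases near the 3' end
}

full_length = bundle_genes(toy_germlines, read_length=12)
short_reads = bundle_genes(toy_germlines, read_length=6)
with_mutation = bundle_genes(toy_germlines, read_length=12, max_mismatches=1)
print(full_length, short_reads, with_mutation)
```

With full-length reads and no tolerated mismatches all three toy genes stay separate. Shortening the read or allowing one mismatch merges V1 and V2, which is the kind of ambiguity the abstract says the method is meant to expose.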
Inference of cancer-specific gene regulatory networks using soft computing rules.

PubMed

Wang, Xiaosheng; Gotoh, Osamu

2010-03-24

Perturbations of gene regulatory networks are essentially responsible for oncogenesis. Therefore, inferring the gene regulatory networks is a key step to overcoming cancer. In this work, we propose a method for inferring directed gene regulatory networks based on soft computing rules, which can identify important cause-effect regulatory relations of gene expression. First, we identify important genes associated with a specific cancer (colon cancer) using a supervised learning approach. Next, we reconstruct the gene regulatory networks by inferring the regulatory relations among the identified genes, and their regulated relations by other genes within the genome. We obtain two meaningful findings. One is that upregulated genes are regulated by more genes than downregulated ones, while downregulated genes regulate more genes than upregulated ones. The other one is that tumor suppressors suppress tumor activators and activate other tumor suppressors strongly, while tumor activators activate other tumor activators and suppress tumor suppressors weakly, indicating the robustness of biological systems. These findings provide valuable insights into the pathogenesis of cancer.
Functional analysis of ars gene cluster of Pannonibacter indicus strain HT23(T) (DSM 23407(T)) and identification of a proline residue essential for arsenate reductase activity.

PubMed

Bandyopadhyay, Saumya; Das, Subrata K

2016-04-01

Arsenic is a naturally occurring, ubiquitous, highly toxic metalloid. In this study, we have identified the ars gene cluster in Pannonibacter indicus strain HT23(T) (DSM 23407(T)), responsible for reduction of toxic pentavalent arsenate. The ars gene cluster comprises four non-overlapping open reading frames (ORFs) encoding a transcriptional regulator (ArsR), a low molecular weight protein tyrosine phosphatase (LMW-PTPase) with hypothetical function, an arsenite efflux pump (Acr3), and an arsenate reductase (ArsC). Heterologous expression of the arsenic-inducible ars gene cluster conferred arsenic resistance to Escherichia coli ∆ars mutant strain AW3110. The recombinant ArsC was purified and assayed. Site-directed mutagenesis was employed to ascertain the role of specific amino acids in ArsC catalysis. Pro94X (X = Ala, Arg, Cys, and His) amino acid substitutions led to enzyme inactivation. Circular dichroism spectra analysis suggested Pro94 as an essential amino acid for enzyme catalytic activity as it is indispensable for optimum protein folding in P. indicus Grx-coupled ArsC.
Genome-wide transcriptional analysis of flagellar regeneration in Chlamydomonas reinhardtii identifies orthologs of ciliary disease genes

NASA Technical Reports Server (NTRS)

Stolc, Viktor; Samanta, Manoj Pratim; Tongprasit, Waraporn; Marshall, Wallace F.

2005-01-01

The important role that cilia and flagella play in human disease creates an urgent need to identify genes involved in ciliary assembly and function. The strong and specific induction of flagellar-coding genes during flagellar regeneration in Chlamydomonas reinhardtii suggests that transcriptional profiling of such cells would reveal new flagella-related genes. We have conducted a genome-wide analysis of RNA transcript levels during flagellar regeneration in Chlamydomonas by using DNA oligonucleotide microarrays, produced by maskless photolithography, with unique probe sequences for all exons of the 19,803 predicted genes. This analysis represents a previously uncharacterized whole-genome transcriptional activity profiling study in this important model organism. Analysis of strongly induced genes reveals a large set of known flagellar components and also identifies a number of important disease-related proteins as being involved with cilia and flagella, including the zebrafish polycystic kidney genes Qilin, Reptin, and Pontin, as well as the testis-expressed tubby-like protein TULP2.
A gene expression biomarker identifies in vitro and in vivo ERα modulators in a human gene expression compendium

EPA Science Inventory

We propose the use of gene expression profiling to complement the chemical characterization currently based on HTS assay data and present a case study relevant to the Endocrine Disruptor Screening Program. We have developed computational methods to identify estrogen receptor α...
Genome-wide association study to identify candidate loci and genes for Mn toxicity tolerance in rice

PubMed Central

Shrestha, Asis; Dziwornu, Ambrose Kwaku; Ueda, Yoshiaki; Wu, Lin-Bo; Mathew, Boby

2018-01-01

Manganese (Mn) is an essential micro-nutrient for plants, but flooded rice fields can accumulate high levels of Mn²⁺ leading to Mn toxicity. Here, we present a genome-wide association study (GWAS) to identify candidate loci conferring Mn toxicity tolerance in rice (Oryza sativa L.). A diversity panel of 288 genotypes was grown in hydroponic solutions in a greenhouse under optimal and toxic Mn concentrations. We applied a Mn toxicity treatment (5 ppm Mn²⁺, 3 weeks) at twelve days after transplanting. Mn toxicity caused moderate damage in rice in terms of biomass loss and symptom formation despite extremely high shoot Mn concentrations ranging from 2.4 to 17.4 mg g⁻¹. The tropical japonica subpopulation was more sensitive to Mn toxicity than other subpopulations. Leaf damage symptoms were significantly correlated with Mn uptake into shoots. Association mapping was conducted for seven traits using 416,741 single nucleotide polymorphism (SNP) markers using a mixed linear model, and detected six significant associations for the traits shoot manganese concentration and relative shoot length. Candidate regions contained genes coding for a heavy metal transporter, peroxidase precursor and Mn²⁺ ion binding proteins. The significant marker SNP-2.22465867 caused an amino acid change in a gene (LOC_Os02g37170) with unknown function. This study demonstrated significant natural variation in rice for Mn toxicity tolerance and the possibility of using GWAS to unravel genetic factors responsible for such complex traits. PMID:29425206
Genome-wide association study to identify candidate loci and genes for Mn toxicity tolerance in rice.

PubMed

Shrestha, Asis; Dziwornu, Ambrose Kwaku; Ueda, Yoshiaki; Wu, Lin-Bo; Mathew, Boby; Frei, Michael

2018-01-01

Manganese (Mn) is an essential micro-nutrient for plants, but flooded rice fields can accumulate high levels of Mn²⁺ leading to Mn toxicity. Here, we present a genome-wide association study (GWAS) to identify candidate loci conferring Mn toxicity tolerance in rice (Oryza sativa L.). A diversity panel of 288 genotypes was grown in hydroponic solutions in a greenhouse under optimal and toxic Mn concentrations. We applied a Mn toxicity treatment (5 ppm Mn²⁺, 3 weeks) at twelve days after transplanting. Mn toxicity caused moderate damage in rice in terms of biomass loss and symptom formation despite extremely high shoot Mn concentrations ranging from 2.4 to 17.4 mg g⁻¹. The tropical japonica subpopulation was more sensitive to Mn toxicity than other subpopulations. Leaf damage symptoms were significantly correlated with Mn uptake into shoots. Association mapping was conducted for seven traits using 416,741 single nucleotide polymorphism (SNP) markers using a mixed linear model, and detected six significant associations for the traits shoot manganese concentration and relative shoot length. Candidate regions contained genes coding for a heavy metal transporter, peroxidase precursor and Mn²⁺ ion binding proteins. The significant marker SNP-2.22465867 caused an amino acid change in a gene (LOC_Os02g37170) with unknown function. This study demonstrated significant natural variation in rice for Mn toxicity tolerance and the possibility of using GWAS to unravel genetic factors responsible for such complex traits.
Identifying core gene modules in glioblastoma based on multilayer factor-mediated dysfunctional regulatory networks through integrating multi-dimensional genomic data

PubMed Central

Ping, Yanyan; Deng, Yulan; Wang, Li; Zhang, Hongyi; Zhang, Yong; Xu, Chaohan; Zhao, Hongying; Fan, Huihui; Yu, Fulong; Xiao, Yun; Li, Xia

2015-01-01

The driver genetic aberrations collectively regulate core cellular processes underlying cancer development. However, identifying the modules of driver genetic alterations and characterizing their functional mechanisms are still major challenges for cancer studies. Here, we developed an integrative multi-omics method CMDD to identify the driver modules and their affecting dysregulated genes through characterizing genetic alteration-induced dysregulated networks. Applied to glioblastoma (GBM), the CMDD identified a core gene module of 17 genes, including seven known GBM drivers, and their dysregulated genes. The module showed significant association with shorter survival of GBM. When classifying driver genes in the module into two gene sets according to their genetic alteration patterns, we found that one gene set directly participated in the glioma pathway, while the other indirectly regulated the glioma pathway, mostly, via their dysregulated genes. Both of the two gene sets were significant contributors to survival and helpful for classifying GBM subtypes, suggesting their critical roles in GBM pathogenesis. Also, by applying the CMDD to other six cancers, we identified some novel core modules associated with overall survival of patients. Together, these results demonstrate integrative multi-omics data can identify driver modules and uncover their dysregulated genes, which is useful for interpreting cancer genome. PMID:25653168
Integration of QTL and bioinformatic tools to identify candidate genes for triglycerides in mice

PubMed Central

Leduc, Magalie S.; Hageman, Rachael S.; Verdugo, Ricardo A.; Tsaih, Shirng-Wern; Walsh, Kenneth; Churchill, Gary A.; Paigen, Beverly

2011-01-01

To identify genetic loci influencing lipid levels, we performed quantitative trait loci (QTL) analysis between inbred mouse strains MRL/MpJ and SM/J, measuring triglyceride levels at 8 weeks of age in F2 mice fed a chow diet. We identified one significant QTL on chromosome (Chr) 15 and three suggestive QTL on Chrs 2, 7, and 17. We also carried out microarray analysis on the livers of parental strains of 282 F2 mice and used these data to find cis-regulated expression QTL. We then narrowed the list of candidate genes under significant QTL using a “toolbox” of bioinformatic resources, including haplotype analysis; parental strain comparison for gene expression differences and nonsynonymous coding single nucleotide polymorphisms (SNP); cis-regulated eQTL in livers of F2 mice; correlation between gene expression and phenotype; and conditioning of expression on the phenotype. We suggest Slc25a7 as a candidate gene for the Chr 7 QTL and, based on expression differences, five genes (Polr3h, Cyp2d22, Cyp2d26, Tspo, and Ttll12) as candidate genes for Chr 15 QTL. This study shows how bioinformatics can be used effectively to reduce candidate gene lists for QTL related to complex traits. PMID:21622629
Transcriptome and metabolite analysis identifies nitrogen utilization genes in tea plant (Camellia sinensis).

PubMed

Li, Wei; Xiang, Fen; Zhong, Micai; Zhou, Lingyun; Liu, Hongyan; Li, Saijun; Wang, Xuewen

2017-05-10

Applied nitrogen (N) fertilizer significantly increases leaf yield. However, most N is not utilized by the plant, which negatively impacts the environment. To date, little is known regarding N utilization genes and mechanisms in leaf production. To understand this, we investigated transcriptomes using RNA-seq and amino acid levels with N treatment in tea (Camellia sinensis), the most popular beverage crop. We identified 196 and 29 common differentially expressed genes in roots and leaves, respectively, in response to ammonium in two tea varieties. Among those genes, AMT, NRT and AQP for N uptake and GOGAT and GS for N assimilation were the key genes, validated by RT-qPCR, which were expressed in a network manner with tissue specificity. Importantly, only AQP and three novel DEGs associated with stress, manganese binding, and gibberellin-regulated transcription factor were common in N responses across all tissues and varieties. A hypothesized gene regulatory network for N was proposed. A strong statistical correlation between key genes' expression and amino acid content was revealed. The key genes and regulatory network improve our understanding of the molecular mechanism of N usage and offer gene targets for plant improvement.

Mistaken Identifiers: Gene name errors can be introduced inadvertently when using Excel in bioinformatics

PubMed Central

Zeeberg, Barry R; Riss, Joseph; Kane, David W; Bussey, Kimberly J; Uchio, Edward; Linehan, W Marston; Barrett, J Carl; Weinstein, John N

2004-01-01

Background: When processing microarray data sets, we recently noticed that some gene names were being changed inadvertently to non-gene names. Results: A little detective work traced the problem to default date format conversions and floating-point format conversions in the very useful Excel program package. The date conversions affect at least 30 gene names; the floating-point conversions affect at least 2,000 if Riken identifiers are included. These conversions are irreversible; the original gene names cannot be recovered. Conclusions: Users of Excel for analyses involving gene names should be aware of this problem, which can cause genes, including medically important ones, to be lost from view and which has contaminated even carefully curated public databases. We provide work-arounds and scripts for circumventing the problem. PMID:15214961
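To make the two failure modes in the abstract above concrete, here is a minimal Python sketch that flags identifiers likely to be rewritten on import into a spreadsheet. It is not one of the authors' scripts: the month-prefix list, both regular expressions, and the function name `risky_identifiers` are illustrative assumptions that only approximate spreadsheet auto-formatting.

```python
import re

# Month-like prefixes that spreadsheet software may read as dates
# (for example "SEPT2" displayed as "2-Sep"). Illustrative list only.
MONTH_PREFIXES = ("JAN", "FEB", "MAR", "MARCH", "APR", "MAY", "JUN",
                  "JUL", "AUG", "SEP", "SEPT", "OCT", "NOV", "DEC")

# A month prefix, an optional hyphen, then a one- or two-digit number.
DATE_LIKE = re.compile(
    r"^(?:%s)-?\d{1,2}$" % "|".join(MONTH_PREFIXES), re.IGNORECASE)

# Digits, the letter E, digits: can be parsed as scientific notation.
FLOAT_LIKE = re.compile(r"^\d+E\d+$", re.IGNORECASE)


def risky_identifiers(names):
    """Return the names that match either at-risk pattern."""
    return [n for n in names if DATE_LIKE.match(n) or FLOAT_LIKE.match(n)]


# Example identifiers chosen for illustration.
flagged = risky_identifiers(["SEPT2", "MARCH1", "DEC1", "TP53", "2310009E13"])
print(flagged)
```

A check like this can be run on a gene list before it is opened in a spreadsheet; importing the identifier column explicitly as text is the usual way to avoid the conversion in the first place.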
Regulatory elements of the floral homeotic gene AGAMOUS identified by phylogenetic footprinting and shadowing.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hong, R. L., Hamaguchi, L., Busch, M. A., and Weigel, D.

2003-06-01

OAK-B135 In Arabidopsis thaliana, cis-regulatory sequences of the floral homeotic gene AGAMOUS (AG) are located in the second intron. This 3 kb intron contains binding sites for two direct activators of AG, LEAFY (LFY) and WUSCHEL (WUS), along with other putative regulatory elements. We have used phylogenetic footprinting and the related technique of phylogenetic shadowing to identify putative cis-regulatory elements in this intron. Among 29 Brassicaceae, several other motifs, but not the LFY and WUS binding sites previously identified, are largely invariant. Using reporter gene analyses, we tested six of these motifs and found that they are all functionally important for activity of AG regulatory sequences in A. thaliana. Although there is little obvious sequence similarity outside the Brassicaceae, the intron from cucumber AG has at least partial activity in A. thaliana. Our studies underscore the value of the comparative approach as a tool that complements gene-by-gene promoter dissection, but also highlight that sequence-based studies alone are insufficient for a complete identification of cis-regulatory sites.
Genetic Susceptibility to Vitiligo: GWAS Approaches for Identifying Vitiligo Susceptibility Genes and Loci

PubMed Central

Shen, Changbing; Gao, Jing; Sheng, Yujun; Dou, Jinfa; Zhou, Fusheng; Zheng, Xiaodong; Ko, Randy; Tang, Xianfa; Zhu, Caihong; Yin, Xianyong; Sun, Liangdan; Cui, Yong; Zhang, Xuejun

2016-01-01

Vitiligo is an autoimmune disease with a strong genetic component, characterized by areas of depigmented skin resulting from loss of epidermal melanocytes. Genetic factors are known to play key roles in vitiligo through discoveries in association studies and family studies. Previously, vitiligo susceptibility genes were mainly revealed through linkage analysis and candidate gene studies. Recently, our understanding of the genetic basis of vitiligo has been rapidly advancing through genome-wide association studies (GWAS). More than 40 robust susceptibility loci have been identified and confirmed to be associated with vitiligo by using GWAS. Most of these associated genes participate in important pathways involved in the pathogenesis of vitiligo. Many susceptibility loci with unknown functions in the pathogenesis of vitiligo have also been identified, indicating that additional molecular mechanisms may contribute to the risk of developing vitiligo. In this review, we summarize the key loci of genome-wide significance that have been shown to influence vitiligo risk. These genetic loci may help build the foundation for genetic diagnosis and personalized treatment for patients with vitiligo in the future. However, substantial additional studies, including gene-targeted and functional studies, are required to confirm the causality of the genetic variants and their biological relevance in the development of vitiligo. PMID:26870082
Rare copy number variations in congenital heart disease patients identify unique genes in left-right patterning

PubMed Central

Fakhro, Khalid A.; Choi, Murim; Ware, Stephanie M.; Belmont, John W.; Towbin, Jeffrey A.; Lifton, Richard P.; Khokha, Mustafa K.; Brueckner, Martina

2011-01-01

Dominant human genetic diseases that impair reproductive fitness and have high locus heterogeneity constitute a problem for gene discovery because the usual criterion of finding more mutations in specific genes than expected by chance may require extremely large populations. Heterotaxy (Htx), a congenital heart disease resulting from abnormalities in left-right (LR) body patterning, has features suggesting that many cases fall into this category. In this setting, appropriate model systems may provide a means to support implication of specific genes. By high-resolution genotyping of 262 Htx subjects and 991 controls, we identify a twofold excess of subjects with rare genic copy number variations in Htx (14.5% vs. 7.4%, P = 1.5 × 10⁻⁴). Although 7 of 45 Htx copy number variations were large chromosomal abnormalities, 38 smaller copy number variations altered a total of 61 genes, 22 of which had Xenopus orthologs. In situ hybridization identified 7 of these 22 genes with expression in the ciliated LR organizer (gastrocoel roof plate), a marked enrichment compared with 40 of 845 previously studied genes (sevenfold enrichment, P < 10⁻⁶). Morpholino knockdown in Xenopus of Htx candidates demonstrated that five (NEK2, ROCK2, TGFBR2, GALNT11, and NUP188) strongly disrupted both morphological LR development and expression of pitx2, a molecular marker of LR patterning. These effects were specific, because 0 of 13 control genes from rare Htx or control copy number variations produced significant LR abnormalities (P = 0.001). These findings identify genes not previously implicated in LR patterning. PMID:21282601
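The "twofold" and "sevenfold" figures in the abstract above follow directly from the quoted percentages and counts. The short Python check below reproduces only that arithmetic; the variable names are illustrative, and no attempt is made to recompute the reported P values, since the abstract does not name the statistical test.

```python
# Figures quoted in the abstract.
htx_cnv_rate = 14.5      # % of Htx subjects with rare genic CNVs
control_cnv_rate = 7.4   # % of controls with rare genic CNVs
cnv_fold = htx_cnv_rate / control_cnv_rate

candidates_expressed, candidates_total = 7, 22   # Htx candidate genes
reference_expressed, reference_total = 40, 845   # previously studied genes
expression_fold = (candidates_expressed / candidates_total) / (
    reference_expressed / reference_total)

print(f"CNV excess: {cnv_fold:.2f}-fold")                                 # 1.96-fold
print(f"LR-organizer expression enrichment: {expression_fold:.2f}-fold")  # 6.72-fold
```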
Rare copy number variations in congenital heart disease patients identify unique genes in left-right patterning.

PubMed

Fakhro, Khalid A; Choi, Murim; Ware, Stephanie M; Belmont, John W; Towbin, Jeffrey A; Lifton, Richard P; Khokha, Mustafa K; Brueckner, Martina

2011-02-15

Dominant human genetic diseases that impair reproductive fitness and have high locus heterogeneity constitute a problem for gene discovery because the usual criterion of finding more mutations in specific genes than expected by chance may require extremely large populations. Heterotaxy (Htx), a congenital heart disease resulting from abnormalities in left-right (LR) body patterning, has features suggesting that many cases fall into this category. In this setting, appropriate model systems may provide a means to support implication of specific genes. By high-resolution genotyping of 262 Htx subjects and 991 controls, we identify a twofold excess of subjects with rare genic copy number variations in Htx (14.5% vs. 7.4%, P = 1.5 × 10⁻⁴). Although 7 of 45 Htx copy number variations were large chromosomal abnormalities, 38 smaller copy number variations altered a total of 61 genes, 22 of which had Xenopus orthologs. In situ hybridization identified 7 of these 22 genes with expression in the ciliated LR organizer (gastrocoel roof plate), a marked enrichment compared with 40 of 845 previously studied genes (sevenfold enrichment, P < 10⁻⁶). Morpholino knockdown in Xenopus of Htx candidates demonstrated that five (NEK2, ROCK2, TGFBR2, GALNT11, and NUP188) strongly disrupted both morphological LR development and expression of pitx2, a molecular marker of LR patterning. These effects were specific, because 0 of 13 control genes from rare Htx or control copy number variations produced significant LR abnormalities (P = 0.001). These findings identify genes not previously implicated in LR patterning.
A framework to identify gene expression profiles in a model of inflammation induced by lipopolysaccharide after treatment with thalidomide

PubMed Central

2012-01-01

Background: Thalidomide is an anti-inflammatory and anti-angiogenic drug currently used for the treatment of several diseases, including erythema nodosum leprosum, which occurs in patients with lepromatous leprosy. In this research, we use DNA microarray analysis to identify the impact of thalidomide on gene expression responses in human cells after lipopolysaccharide (LPS) stimulation. We employed a two-stage framework. Initially, we identified 1584 altered genes in response to LPS. Modulation of this set of genes was then analyzed in the LPS stimulated cells treated with thalidomide. Results: We identified 64 genes with altered expression induced by thalidomide using the rank product method. In addition, the lists of up-regulated and down-regulated genes were investigated by means of bioinformatics functional analysis, which allowed for the identification of biological processes affected by thalidomide. Confirmatory analysis was done in five of the identified genes using real time PCR. Conclusions: The results showed some genes that can further our understanding of the biological mechanisms in the action of thalidomide. Of the five genes evaluated with real time PCR, three were down regulated and two were up regulated, confirming the initial results of the microarray analysis. PMID:22695124
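The rank product statistic mentioned in the abstract above can be sketched in a few lines of Python. This is a generic, simplified version for illustration, not the study's pipeline: the gene names and log fold changes are made up, ties are not handled, and the permutation step normally used to attach significance to each rank product is omitted.

```python
import math


def rank_products(fold_changes: dict[str, list[float]]) -> dict[str, float]:
    """Geometric mean, per gene, of its rank in each replicate.

    Rank 1 is the largest fold change in a replicate, so small rank
    products point to consistently up-regulated genes.
    """
    genes = list(fold_changes)
    n_reps = len(next(iter(fold_changes.values())))
    log_sum = {g: 0.0 for g in genes}
    for rep in range(n_reps):
        ordered = sorted(genes, key=lambda g: fold_changes[g][rep],
                         reverse=True)
        for rank, gene in enumerate(ordered, start=1):
            log_sum[gene] += math.log(rank)
    return {g: math.exp(log_sum[g] / n_reps) for g in genes}


# Hypothetical log fold changes for four genes in three replicates.
toy = {
    "geneA": [2.1, 1.8, 2.5],
    "geneB": [0.3, 0.9, 0.1],
    "geneC": [1.2, 2.0, 1.1],
    "geneD": [-1.0, -0.7, -1.4],
}
rp = rank_products(toy)
top_gene = min(rp, key=rp.get)  # gene with the smallest rank product
```

Sorting in ascending order instead would rank down-regulated genes first, which is how separate up-regulated and down-regulated lists are usually obtained.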
no blokes Is Essential for Male Viability and X Chromosome Gene Expression in the Australian Sheep Blowfly.

PubMed

Davis, Rebecca J; Belikoff, Esther J; Scholl, Elizabeth H; Li, Fang; Scott, Maxwell J

2018-06-18

It has been hypothesized that the Drosophila 4th chromosome is derived from an ancient X chromosome [1]. In the Australian sheep blowfly, Lucilia cuprina, the heterochromatic X chromosome contains few active genes and orthologs of Drosophila X-linked genes are autosomal. Of 8 X-linked genes identified previously in L. cuprina, 6 were orthologs of Drosophila 4th-chromosome genes [2]. The X-linked genes were expressed equally in males and females. Here we identify an additional 51 X-linked genes and show that most are dosage compensated. Orthologs of 49 of the 59 X-linked genes are on the 4th chromosome in D. melanogaster. Because painting of fourth (Pof) is important for expression of Drosophila 4th-chromosome genes [3], we used Cas9 to make a loss-of-function knockin mutation in an L. cuprina Pof ortholog we call no blokes (nbl). Homozygous nbl males derived from homozygous nbl mothers die at the late pupal stage. Homozygous nbl females are viable, fertile, and live longer than heterozygous nbl females. RNA expression of most X-linked genes was reduced in homozygous nbl male pupae and to a lesser extent in nbl females compared to heterozygous siblings. The results suggest that NBL could be important for X chromosome dosage compensation in L. cuprina. NBL may also facilitate gene expression in the heterochromatic environment of the X chromosome in both sexes. This study supports the hypothesis on the origin of the Drosophila 4th chromosome and that a POF-like protein was required for normal gene expression on the ancient X chromosome. Copyright © 2018 Elsevier Ltd. All rights reserved.
Bacterial reference genes for gene expression studies by RT-qPCR: survey and analysis.

PubMed

Rocha, Danilo J P; Santos, Carolina S; Pacheco, Luis G C

2015-09-01

The appropriate choice of reference genes is essential for accurate normalization of gene expression data obtained by the method of reverse transcription quantitative real-time PCR (RT-qPCR). In 2009, a guideline called the Minimum Information for Publication of Quantitative Real-Time PCR Experiments (MIQE) highlighted the importance of the selection and validation of more than one suitable reference gene for obtaining reliable RT-qPCR results. Herein, we searched the recent literature in order to identify the bacterial reference genes that have been most commonly validated in gene expression studies by RT-qPCR (in the first 5 years following publication of the MIQE guidelines). Through a combination of different search parameters with the text mining tool MedlineRanker, we identified 145 unique bacterial genes that were recently tested as candidate reference genes. Of these, 45 genes were experimentally validated and, in most of the cases, their expression stabilities were verified using the software tools geNorm and NormFinder. It is noteworthy that only 10 of these reference genes had been validated in two or more of the studies evaluated. An enrichment analysis using Gene Ontology classifications demonstrated that genes belonging to the functional categories of DNA Replication (GO: 0006260) and Transcription (GO: 0006351) rendered a proportionally higher number of validated reference genes. Three genes in the former functional class were also among the top five most stable genes identified through an analysis of gene expression data obtained from the Pathosystems Resource Integration Center. These results may provide a guideline for the initial selection of candidate reference genes for RT-qPCR studies in several different bacterial species.
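For readers unfamiliar with how expression stability is scored, the sketch below computes a geNorm-style M value in plain Python: for each candidate gene, the average over all other candidates of the standard deviation of their log2 expression ratios across samples. It is a from-scratch simplification, not the geNorm software itself, and the gene names and relative quantities are invented for illustration.

```python
import math
from statistics import stdev


def stability_m(expr: dict[str, list[float]]) -> dict[str, float]:
    """Return an M value per gene; lower means more stable expression.

    `expr` maps each candidate gene to its relative quantities
    across the same ordered set of samples.
    """
    genes = list(expr)
    m_values = {}
    for j in genes:
        variations = []
        for k in genes:
            if k == j:
                continue
            log_ratios = [math.log2(a / b) for a, b in zip(expr[j], expr[k])]
            variations.append(stdev(log_ratios))  # pairwise variation
        m_values[j] = sum(variations) / len(variations)
    return m_values


# Hypothetical relative quantities in four samples.
toy = {
    "refA": [1.0, 1.1, 0.9, 1.0],
    "refB": [2.0, 2.2, 1.8, 2.0],  # proportional to refA in every sample
    "refC": [1.0, 4.0, 0.5, 2.0],  # varies independently
}
m = stability_m(toy)
least_stable = max(m, key=m.get)
```

In the toy data, refA and refB keep a constant ratio, so their pairwise variation is close to zero and refC ends up with the highest M value.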
Mrp--a new auxiliary gene essential for optimal expression of methicillin resistance in Staphylococcus aureus.

PubMed

Wu, S W; De Lencastre, H

1999-01-01

Screening of a library of Tn551 insertional mutants selected for reduction in the methicillin resistance level of the parental Staphylococcus aureus strain COL resulted in the isolation of mutant RUSA266 in which the minimal inhibitory concentration (MIC) of the parent was reduced from 1,600 to 1.5 micrograms/mL. Cloning and sequencing of the vicinity of the insertion site omega 726 identified an open reading frame (orf1365) encoding a very large polypeptide of more than 1,365 amino acids. A unique feature of the deduced amino acid sequence was the presence of multiple tandem repeats of 75 amino acids in the polypeptide, reminiscent of the structure of high-molecular-weight cell-surface proteins EF* and Emb identified in some streptococcal strains. Mutant RUSA266 with the inactivated gene, which we shall provisionally refer to as mrp (for multiple repeat polypeptide), produced a peptidoglycan with altered muropeptide composition, and both the reduced antibiotic resistance and the altered cell wall composition were co-transduced in back-crosses into the parental strain COL. Additional sequencing upstream of mrp has revealed that this gene was part of a five-gene cluster occupying a 9.2-kb region of the staphylococcal chromosome and was composed of glmM (directly upstream of mrp), two open reading frames orf310 and orf269 coding for two hypothetical proteins, and the gene encoding the staphylococcal arginase (arg). Transcriptional analysis demonstrated that the five genes in the cluster were transcribed together.
Multiple genome alignment for identifying the core structure among moderately related microbial genomes.

PubMed

Uchiyama, Ikuo

2008-10-31

Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it becomes more difficult when more distantly related genomes are compared. Here, we consider the core structure as a set of sufficiently long segments in which gene orders are conserved so that they are likely to have been inherited mainly through vertical transfer, and developed a method for identifying the core structure by finding the order of pre-identified orthologous groups (OGs) that maximally retains the conserved gene orders. The method was applied to genome comparisons of two well-characterized families, Bacillaceae and Enterobacteriaceae, and identified their core structures comprising 1438 and 2125 OGs, respectively. The core sets contained most of the essential genes and their related genes, which were primarily included in the intersection of the two core sets comprising around 700 OGs. The definition of the genomic core based on gene order conservation was demonstrated to be more robust than the simpler approach based only on gene conservation. We also investigated the core structures in terms of G+C content homogeneity and phylogenetic congruence, and found that the core genes primarily exhibited the expected characteristic, i.e., being indigenous and sharing the same history, more than the non-core genes. The results demonstrate that our strategy of genome alignment based on gene order conservation can provide an effective approach to identify the genomic core among moderately related microbial genomes.
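A much-reduced version of the gene-order idea in the abstract above can be shown for just two genomes: treat each genome as an ordered list of orthologous group (OG) identifiers and keep the longest subsequence of OGs that appears in the same order in both. The Python sketch below uses a classic longest-common-subsequence dynamic program as a stand-in; it is not the published multiple-alignment method, it ignores strand, circular chromosomes, and minimum segment length, and the OG labels are made up.

```python
def conserved_order(genome_a: list[str], genome_b: list[str]) -> list[str]:
    """Longest common subsequence of OG identifiers (classic DP)."""
    n, m = len(genome_a), len(genome_b)
    # dp[i][j] = LCS length of genome_a[i:] and genome_b[j:]
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n - 1, -1, -1):
        for j in range(m - 1, -1, -1):
            if genome_a[i] == genome_b[j]:
                dp[i][j] = 1 + dp[i + 1][j + 1]
            else:
                dp[i][j] = max(dp[i + 1][j], dp[i][j + 1])

    # Walk the table from the start to recover one optimal subsequence.
    core, i, j = [], 0, 0
    while i < n and j < m:
        if genome_a[i] == genome_b[j]:
            core.append(genome_a[i])
            i += 1
            j += 1
        elif dp[i + 1][j] >= dp[i][j + 1]:
            i += 1
        else:
            j += 1
    return core


# Hypothetical gene orders; OG7 is present in both genomes but relocated.
genome_a = ["OG1", "OG2", "OG3", "OG7", "OG4", "OG5"]
genome_b = ["OG1", "OG2", "OG9", "OG3", "OG4", "OG5", "OG7"]
core = conserved_order(genome_a, genome_b)
```

Here OG7 is shared by both genomes yet excluded from the result because its position is not conserved, which is the distinction the abstract draws between gene-order conservation and simple gene conservation.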
The Essential Role of the Deinococcus radiodurans ssb Gene in Cell Survival and Radiation Tolerance

PubMed Central

Lockhart, J. Scott; DeVeaux, Linda C.

2013-01-01

Recent evidence has implicated single-stranded DNA-binding protein (SSB) expression level as an important factor in microbial radiation resistance. The genome of the extremely radiation resistant bacterium Deinococcus radiodurans contains genes for two SSB homologs: the homodimeric, canonical Ssb, encoded by the gene ssb, and a novel pentameric protein encoded by the gene ddrB. ddrB is highly induced upon exposure to radiation, and deletions result in decreased radiation-resistance, suggesting an integral role of the protein in the extreme resistance exhibited by this organism. Although expression of ssb is also induced after irradiation, Ssb is thought to be involved primarily in replication. In this study, we demonstrate that Ssb in D. radiodurans is essential for cell survival. The lethality of an ssb deletion cannot be complemented by providing ddrB in trans. In addition, the radiation-sensitive phenotype conferred by a ddrB deletion is not alleviated by providing ssb in trans. By altering expression of the ssb gene, we also show that lower levels of transcription are required for optimal growth than are necessary for high radiation resistance. When expression is reduced to that of E. coli, ionizing radiation resistance is similarly reduced. UV resistance is also decreased under low ssb transcript levels where growth is unimpaired. These results indicate that the expression of ssb is a key component of both normal cellular metabolism as well as pathways responsible for the high radiation tolerance of D. radiodurans. PMID:23951213
A novel gammaretroviral shuttle vector insertional mutagenesis screen identifies SHARPIN as a breast cancer metastasis gene and prognostic biomarker.

PubMed

Bii, Victor M; Rae, Dustin T; Trobridge, Grant D

2015-11-24

Breast cancer (BC) is the second leading cause of malignancy among U.S. women. Metastasis results in a poor prognosis and increased mortality, but the molecular mechanisms by which metastatic tumors occur are not well understood. Identifying the genes that drive the metastatic process could provide targets for improved therapy and biomarkers to improve BC patient outcomes. Using a forward mutagenesis screen, BC cells mutagenized with a replication-incompetent gammaretroviral vector (γRV) were xenotransplanted into the mammary fat pad of immunodeficient mice. In this approach the vector provirus dysregulates nearby genes, providing a selective advantage to transduced cells to form metastases. Metastatic tumors were analyzed for proviral integration sites to identify nearby candidate metastasis genes. The γRV has a transgene cassette that allows for rescue in bacteria and rapid identification of vector integration sites. Using this approach, we identified the previously described metastasis gene WWTR1 (TAZ), and three other novel candidate metastasis genes including SHARPIN. SHARPIN was independently validated in vivo as a BC metastasis gene. Analysis of patient data showed that SHARPIN expression predicts metastasis-free survival after adjuvant therapy. Our approach has broad potential to identify genes involved in oncogenic processes for BC and other cancers. We show here it can identify both known (WWTR1) and novel (SHARPIN) BC metastasis genes.
A comparative gene analysis with rice identified orthologous group II HKT genes and their association with Na⁺ concentration in bread wheat.

PubMed

Ariyarathna, H A Chandima K; Oldach, Klaus H; Francki, Michael G

2016-01-19

Although the HKT transporter genes ascertain some of the key determinants of crop salt tolerance mechanisms, the diversity and functional role of group II HKT genes are not clearly understood in bread wheat. The advanced knowledge on rice HKT and whole genome sequence was, therefore, used in comparative gene analysis to identify orthologous wheat group II HKT genes and their role in trait variation under different saline environments. The four group II HKTs in rice identified two orthologous gene families from bread wheat, including the known TaHKT2;1 gene family and a new distinctly different gene family designated as TaHKT2;2. A single copy of TaHKT2;2 was found on each homeologous chromosome arm 7AL, 7BL and 7DL and each gene was expressed in leaf blade, sheath and root tissues under non-stressed and 200 mM salt-stressed conditions. The proteins encoded by genes of the TaHKT2;2 family revealed more than 93% amino acid sequence identity but ≤52% amino acid identity compared to the proteins encoded by the TaHKT2;1 family. Specifically, variations in known critical domains predicted functional differences between the two protein families. Similar to orthologous rice genes on chromosome 6L, TaHKT2;1 and TaHKT2;2 genes were located approximately 3 kb apart on wheat chromosomes 7AL, 7BL and 7DL, forming a static syntenic block in the two species. The chromosomal region on 7AL containing TaHKT2;1 7AL-1 co-located with QTL for shoot Na⁺ concentration and yield in some saline environments. The differences in copy number, gene sequences and encoded proteins between TaHKT2;2 homeologous genes and other group II HKT gene families within and across species likely reflect functional diversity for ion selectivity and transport in plants. Evidence indicated that neither TaHKT2;2 nor TaHKT2;1 was associated with primary root Na⁺ uptake but TaHKT2;1 may be associated with trait variation for Na⁺ exclusion and yield in some but not all saline environments.
Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression

PubMed Central

Poole, William; Leinonen, Kalle; Shmulevich, Ilya

2017-01-01

Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C. PMID:28170390
Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression.

PubMed

Poole, William; Leinonen, Kalle; Shmulevich, Ilya; Knijnenburg, Theo A; Bernard, Brady

2017-02-01

Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C.
A Morpholino-based screen to identify novel genes involved in craniofacial morphogenesis

PubMed Central

Melvin, Vida Senkus; Feng, Weiguo; Hernandez-Lagunas, Laura; Artinger, Kristin Bruk; Williams, Trevor

2014-01-01

BACKGROUND The regulatory mechanisms underpinning facial development are conserved between diverse species. Therefore, results from model systems provide insight into the genetic causes of human craniofacial defects. Previously, we generated a comprehensive dataset examining gene expression during development and fusion of the mouse facial prominences. Here, we used this resource to identify genes that have dynamic expression patterns in the facial prominences, but for which only limited information exists concerning developmental function. RESULTS This set of ~80 genes was used for a high throughput functional analysis in the zebrafish system using Morpholino gene knockdown technology. This screen revealed three classes of cranial cartilage phenotypes depending upon whether knockdown of the gene affected the neurocranium, viscerocranium, or both. The targeted genes that produced consistent phenotypes encoded proteins linked to transcription (meis1, meis2a, tshz2, vgll4l), signaling (pkdcc, vlk, macc1, wu:fb16h09), and extracellular matrix function (smoc2). The majority of these phenotypes were not altered by reduction of p53 levels, demonstrating that both p53 dependent and independent mechanisms were involved in the craniofacial abnormalities. CONCLUSIONS This Morpholino-based screen highlights new genes involved in development of the zebrafish craniofacial skeleton with wider relevance to formation of the face in other species, particularly mouse and human. PMID:23559552
Use of deep whole-genome sequencing data to identify structure risk variants in breast cancer susceptibility genes.

PubMed

Guo, Xingyi; Shi, Jiajun; Cai, Qiuyin; Shu, Xiao-Ou; He, Jing; Wen, Wanqing; Allen, Jamie; Pharoah, Paul; Dunning, Alison; Hunter, David J; Kraft, Peter; Easton, Douglas F; Zheng, Wei; Long, Jirong

2018-03-01

Functional disruptions of susceptibility genes by large genomic structure variant (SV) deletions in germlines are known to be associated with cancer risk. However, few studies have been conducted to systematically search for SV deletions in breast cancer susceptibility genes. We analysed deep (> 30x) whole-genome sequencing (WGS) data generated in blood samples from 128 breast cancer patients of Asian and European descent with either a strong family history of breast cancer or early cancer onset disease. To identify SV deletions in known or suspected breast cancer susceptibility genes, we used multiple SV calling tools including Genome STRiP, Delly, Manta, BreakDancer and Pindel. SV deletions were detected by at least three of these bioinformatics tools in five genes. Specifically, we identified heterozygous deletions covering a fraction of the coding regions of BRCA1 (with approximately 80kb in two patients), and TP53 genes (with ∼1.6 kb in two patients), and of intronic regions (∼1 kb) of the PALB2 (one patient), PTEN (three patients) and RAD51C genes (one patient). We confirmed the presence of these deletions using real-time quantitative PCR (qPCR). Our study identified novel SV deletions in breast cancer susceptibility genes and the identification of such SV deletions may improve clinical testing.
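The "detected by at least three of these tools" rule in the abstract above can be illustrated with a minimal sketch. It assumes each caller's output has already been reduced to a set of gene-level labels; real pipelines usually match deletions by breakpoint overlap, which the abstract does not describe. The function name and the toy gene labels are hypothetical and are not data from the study.

```python
from collections import Counter


def consensus_calls(calls_by_tool, min_tools=3):
    """Return the events reported by at least `min_tools` callers, sorted."""
    support = Counter()
    for events in calls_by_tool.values():
        # Converting to a set ensures each tool votes at most once per event.
        support.update(set(events))
    return sorted(event for event, n in support.items() if n >= min_tools)


# Toy input with placeholder gene labels (hypothetical, not study results).
toy_calls = {
    "Genome STRiP": {"GENE_A", "GENE_B"},
    "Delly": {"GENE_A", "GENE_C"},
    "Manta": {"GENE_A", "GENE_B"},
    "BreakDancer": {"GENE_B", "GENE_D"},
    "Pindel": {"GENE_C"},
}

print(consensus_calls(toy_calls))  # ['GENE_A', 'GENE_B']
```

Raising `min_tools` trades sensitivity for specificity: fewer events survive, but each has support from more independent callers.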
Novel mutations in the homogentisate 1,2 dioxygenase gene identified in Jordanian patients with alkaptonuria.

PubMed

Al-sbou, Mohammed

2012-06-01

This study was conducted to identify mutations in the homogentisate 1,2 dioxygenase gene (HGD) in alkaptonuria patients in the Jordanian population. Blood samples were collected from four alkaptonuria patients, four carriers, and two healthy volunteers. DNA was isolated from peripheral blood. All 14 exons of the HGD gene were amplified using the polymerase chain reaction (PCR) technique. The PCR products were then purified and analyzed by sequencing. Five mutations were identified in our samples. Four of them (C1273A, T1046G, 551-552insG, T533G) were novel and had not been previously reported, and one mutation (T847C) has been described before. The types of mutations identified were two missense mutations, one splice site mutation, one frameshift mutation, and one polymorphism. We present the first molecular study of the HGD gene in Jordanian alkaptonuria patients. This study provides valuable information about the molecular basis of alkaptonuria in the Jordanian population.
Gene interactions in the DNA damage-response pathway identified by genome-wide RNA-interference analysis of synthetic lethality

PubMed Central

van Haaften, Gijs; Vastenhouw, Nadine L.; Nollen, Ellen A. A.; Plasterk, Ronald H. A.; Tijsterman, Marcel

2004-01-01

Here, we describe a systematic search for synthetic gene interactions in a multicellular organism, the nematode Caenorhabditis elegans. We established a high-throughput method to determine synthetic gene interactions by genome-wide RNA interference and identified genes that are required to protect the germ line against DNA double-strand breaks. Besides known DNA-repair proteins such as the C. elegans orthologs of TopBP1, RPA2, and RAD51, eight genes previously unassociated with a double-strand-break response were identified. Knockdown of these genes increased sensitivity to ionizing radiation and camptothecin and resulted in increased chromosomal nondisjunction. All genes have human orthologs that may play a role in human carcinogenesis. PMID:15326288
Integrated database for identifying candidate genes for Aspergillus flavus resistance in maize

PubMed Central

2010-01-01

Background Aspergillus flavus Link:Fr, an opportunistic fungus that produces aflatoxin, is pathogenic to maize and other oilseed crops. Aflatoxin is a potent carcinogen, and its presence markedly reduces the value of grain. Understanding and enhancing host resistance to A. flavus infection and/or subsequent aflatoxin accumulation is generally considered an efficient means of reducing grain losses to aflatoxin. Different proteomic, genomic and genetic studies of maize (Zea mays L.) have generated large data sets with the goal of identifying genes responsible for conferring resistance to A. flavus, or aflatoxin. Results In order to maximize the usage of different data sets in new studies, including association mapping, we have constructed a relational database with web interface integrating the results of gene expression, proteomic (both gel-based and shotgun), Quantitative Trait Loci (QTL) genetic mapping studies, and sequence data from the literature to facilitate selection of candidate genes for continued investigation. The Corn Fungal Resistance Associated Sequences Database (CFRAS-DB) (http://agbase.msstate.edu/) was created with the main goal of identifying genes important to aflatoxin resistance. CFRAS-DB is implemented using MySQL as the relational database management system running on a Linux server, using an Apache web server, and Perl CGI scripts as the web interface. The database and the associated web-based interface allow researchers to examine many lines of evidence (e.g. microarray, proteomics, QTL studies, SNP data) to assess the potential role of a gene or group of genes in the response of different maize lines to A. flavus infection and subsequent production of aflatoxin by the fungus. Conclusions CFRAS-DB provides the first opportunity to integrate data pertaining to the problem of A. flavus and aflatoxin resistance in maize in one resource and to support queries across different datasets. The web-based interface gives researchers different query

Integrated database for identifying candidate genes for Aspergillus flavus resistance in maize.

PubMed

Kelley, Rowena Y; Gresham, Cathy; Harper, Jonathan; Bridges, Susan M; Warburton, Marilyn L; Hawkins, Leigh K; Pechanova, Olga; Peethambaran, Bela; Pechan, Tibor; Luthe, Dawn S; Mylroie, J E; Ankala, Arunkanth; Ozkan, Seval; Henry, W B; Williams, W P

2010-10-07

Aspergillus flavus Link:Fr, an opportunistic fungus that produces aflatoxin, is pathogenic to maize and other oilseed crops. Aflatoxin is a potent carcinogen, and its presence markedly reduces the value of grain. Understanding and enhancing host resistance to A. flavus infection and/or subsequent aflatoxin accumulation is generally considered an efficient means of reducing grain losses to aflatoxin. Different proteomic, genomic and genetic studies of maize (Zea mays L.) have generated large data sets with the goal of identifying genes responsible for conferring resistance to A. flavus, or aflatoxin. In order to maximize the usage of different data sets in new studies, including association mapping, we have constructed a relational database with web interface integrating the results of gene expression, proteomic (both gel-based and shotgun), Quantitative Trait Loci (QTL) genetic mapping studies, and sequence data from the literature to facilitate selection of candidate genes for continued investigation. The Corn Fungal Resistance Associated Sequences Database (CFRAS-DB) (http://agbase.msstate.edu/) was created with the main goal of identifying genes important to aflatoxin resistance. CFRAS-DB is implemented using MySQL as the relational database management system running on a Linux server, using an Apache web server, and Perl CGI scripts as the web interface. The database and the associated web-based interface allow researchers to examine many lines of evidence (e.g. microarray, proteomics, QTL studies, SNP data) to assess the potential role of a gene or group of genes in the response of different maize lines to A. flavus infection and subsequent production of aflatoxin by the fungus. CFRAS-DB provides the first opportunity to integrate data pertaining to the problem of A. flavus and aflatoxin resistance in maize in one resource and to support queries across different datasets. The web-based interface gives researchers different query options for mining the database
Genome-wide association meta-analysis of 78,308 individuals identifies new loci and genes influencing human intelligence.

PubMed

Sniekers, Suzanne; Stringer, Sven; Watanabe, Kyoko; Jansen, Philip R; Coleman, Jonathan R I; Krapohl, Eva; Taskesen, Erdogan; Hammerschlag, Anke R; Okbay, Aysu; Zabaneh, Delilah; Amin, Najaf; Breen, Gerome; Cesarini, David; Chabris, Christopher F; Iacono, William G; Ikram, M Arfan; Johannesson, Magnus; Koellinger, Philipp; Lee, James J; Magnusson, Patrik K E; McGue, Matt; Miller, Mike B; Ollier, William E R; Payton, Antony; Pendleton, Neil; Plomin, Robert; Rietveld, Cornelius A; Tiemeier, Henning; van Duijn, Cornelia M; Posthuma, Danielle

2017-07-01

Intelligence is associated with important economic and health-related life outcomes. Despite intelligence having substantial heritability (0.54) and a confirmed polygenic nature, initial genetic studies were mostly underpowered. Here we report a meta-analysis for intelligence of 78,308 individuals. We identify 336 associated SNPs (METAL P < 5 × 10⁻⁸) in 18 genomic loci, of which 15 are new. Around half of the SNPs are located inside a gene, implicating 22 genes, of which 11 are new findings. Gene-based analyses identified an additional 30 genes (MAGMA P < 2.73 × 10⁻⁶), of which all but one had not been implicated previously. We show that the identified genes are predominantly expressed in brain tissue, and pathway analysis indicates the involvement of genes regulating cell development (MAGMA competitive P = 3.5 × 10⁻⁶). Despite the well-known difference in twin-based heritability for intelligence in childhood (0.45) and adulthood (0.80), we show substantial genetic correlation (r_g = 0.89, LD score regression P = 5.4 × 10⁻²⁹). These findings provide new insight into the genetic architecture of intelligence.
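The two significance thresholds quoted in the abstract above can be read as Bonferroni-style corrections. The sketch below works through that arithmetic under two assumptions that the abstract does not state: a family-wise error rate of 0.05, and a plain Bonferroni correction behind the gene-based threshold. The function names are illustrative.

```python
ALPHA = 0.05  # assumed family-wise error rate; not stated in the abstract


def bonferroni_threshold(n_tests, alpha=ALPHA):
    """Per-test significance threshold for n_tests independent tests."""
    return alpha / n_tests


def implied_tests(threshold, alpha=ALPHA):
    """Number of tests a Bonferroni threshold would correspond to."""
    return alpha / threshold


# Conventional genome-wide SNP threshold of 5e-8.
print(round(implied_tests(5e-8)))     # 1000000
# Gene-based threshold of 2.73e-6 quoted for the MAGMA analysis.
print(round(implied_tests(2.73e-6)))  # 18315
```

Under these assumptions the gene-based threshold corresponds to roughly 18,000 tested genes. That figure is an inference from the arithmetic, not a number reported in the abstract.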
Mutant characterization and in vivo conditional repression identify aromatic amino acid biosynthesis to be essential for Aspergillus fumigatus virulence

PubMed Central

Sasse, Anna; Hamer, Stefanie N; Amich, Jorge; Binder, Jasmin; Krappmann, Sven

2016-01-01

Pathogenicity of the saprobe Aspergillus fumigatus strictly depends on nutrient acquisition during infection, as fungal growth determines colonisation and invasion of a susceptible host. Primary metabolism has to be considered as a valid target for antimycotic therapy, based on the fact that several fungal anabolic pathways are not conserved in higher eukaryotes. To test whether fungal proliferation during invasive aspergillosis relies on endogenous biosynthesis of aromatic amino acids, defined auxotrophic mutants of A. fumigatus were generated and assessed for their infectious capacities in neutropenic mice and found to be strongly attenuated in virulence. Moreover, essentiality of the complete biosynthetic pathway could be demonstrated, corroborated by conditional gene expression in infected animals and inhibitor studies. This brief report not only validates the aromatic amino acid biosynthesis pathway of A. fumigatus to be a promising antifungal target but furthermore demonstrates feasibility of conditional gene expression in a murine infection model of aspergillosis. PMID:26605426
Computational modeling identifies key gene regulatory interactions underlying phenobarbital-mediated tumor promotion

PubMed Central

Luisier, Raphaëlle; Unterberger, Elif B.; Goodman, Jay I.; Schwarz, Michael; Moggs, Jonathan; Terranova, Rémi; van Nimwegen, Erik

2014-01-01

Gene regulatory interactions underlying the early stages of non-genotoxic carcinogenesis are poorly understood. Here, we have identified key candidate regulators of phenobarbital (PB)-mediated mouse liver tumorigenesis, a well-characterized model of non-genotoxic carcinogenesis, by applying a new computational modeling approach to a comprehensive collection of in vivo gene expression studies. We have combined our previously developed motif activity response analysis (MARA), which models gene expression patterns in terms of computationally predicted transcription factor binding sites with singular value decomposition (SVD) of the inferred motif activities, to disentangle the roles that different transcriptional regulators play in specific biological pathways of tumor promotion. Furthermore, transgenic mouse models enabled us to identify which of these regulatory activities was downstream of constitutive androstane receptor and β-catenin signaling, both crucial components of PB-mediated liver tumorigenesis. We propose novel roles for E2F and ZFP161 in PB-mediated hepatocyte proliferation and suggest that PB-mediated suppression of ESR1 activity contributes to the development of a tumor-prone environment. Our study shows that combining MARA with SVD allows for automated identification of independent transcription regulatory programs within a complex in vivo tissue environment and provides novel mechanistic insights into PB-mediated hepatocarcinogenesis. PMID:24464994
A fast and high performance multiple data integration algorithm for identifying human disease genes

PubMed Central

2015-01-01

Background Integrating multiple data sources is indispensable in improving disease gene identification. It is not only due to the fact that disease genes associated with similar genetic diseases tend to lie close to each other in various biological networks, but also due to the fact that gene-disease associations are complex. Although various algorithms have been proposed to identify disease genes, their prediction performance and computational time still need improvement. Results In this study, we propose a fast and high performance multiple data integration algorithm for identifying human disease genes. A posterior probability of each candidate gene associated with individual diseases is calculated by using a Bayesian analysis method and a binary logistic regression model. Two prior probability estimation strategies and two feature vector construction methods are developed to test the performance of the proposed algorithm. Conclusions The proposed algorithm not only generates predictions with high AUC scores, but also runs very fast. When only a single PPI network is employed, the AUC score is 0.769 by using F2 as feature vectors. The average running time for each leave-one-out experiment is only around 1.5 seconds. When three biological networks are integrated, the AUC score using F3 as feature vectors increases to 0.830, and the average running time for each leave-one-out experiment is only about 12.54 seconds. It is better than many existing algorithms. PMID:26399620
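The abstract above names two ingredients, a posterior probability per candidate gene and an AUC score, without specifying how the prior and the regression output are combined. The sketch below shows one generic reading and is not the authors' algorithm: it assumes the odds form of Bayes' rule for the posterior and the pairwise-ranking definition of AUC. All function names and inputs are hypothetical.

```python
def posterior(prior, likelihood_ratio):
    """Posterior probability from a prior and a likelihood ratio (odds form of Bayes' rule)."""
    odds = prior / (1.0 - prior) * likelihood_ratio
    return odds / (1.0 + odds)


def auc(scores_pos, scores_neg):
    """Probability that a random positive outscores a random negative; ties count half."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Toy scores for known disease genes (positives) and other genes (negatives).
print(auc([0.9, 0.8], [0.1, 0.85]))  # 0.75
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, which gives the 0.769 and 0.830 figures in the abstract their scale.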
Epigenomic Elements Analyses for Promoters Identify ESRRG as a New Susceptibility Gene for Obesity-related Traits

PubMed Central

Dong, Shan-Shan; Guo, Yan; Zhu, Dong-Li; Chen, Xiao-Feng; Wu, Xiao-Ming; Shen, Hui; Chen, Xiang-Ding; Tan, Li-Jun; Tian, Qing; Deng, Hong-Wen; Yang, Tie-Lin

2016-01-01

OBJECTIVES With ENCODE epigenomic data and results from published genome-wide association studies (GWASs), we aimed to find regulatory signatures of obesity genes and discover novel susceptibility genes. METHODS Obesity genes were obtained from public GWASs databases and their promoters were annotated based on the regulatory elements information. Significantly enriched or depleted epigenomic elements in the promoters of obesity genes were evaluated and all human genes were then prioritized according to the existence of the selected elements to predict new candidate genes. Top ranked genes were subsequently applied to validate their associations with obesity-related traits in three independent in-house GWASs samples. RESULTS We identified RAD21 and EZH2 as over-represented, STAT2 and IRF3 as depleted transcription factors. Histone modification of H3K9me3 and chromatin state segmentation of “poised promoter” and “repressed” were overrepresented. All genes were prioritized and we selected the top five genes for validation at population level. Combined results from the three GWASs samples, rs7522101 in ESRRG remained significantly associated with BMI after multiple testing corrections (P = 7.25 × 10⁻⁵). It was also associated with β-cell function (P = 1.99 × 10⁻³) and fasting glucose level (P < 0.05) in the meta-analyses of glucose and insulin-related traits consortium (MAGIC) dataset. CONCLUSIONS In summary, we identified epigenomic characteristics for obesity genes and suggested ESRRG as a novel obesity susceptibility gene. PMID:27113491
In-Silico Integration Approach to Identify a Key miRNA Regulating a Gene Network in Aggressive Prostate Cancer

PubMed Central

Colaprico, Antonio; Bontempi, Gianluca; Castiglioni, Isabella

2018-01-01

Like other cancer diseases, prostate cancer (PC) is caused by the accumulation of genetic alterations in the cells that drives malignant growth. These alterations are revealed by gene profiling and copy number alteration (CNA) analysis. Moreover, recent evidence suggests that microRNAs also have an important role in PC development. Despite efforts to profile PC, the alterations (gene, CNA, and miRNA) and biological processes that correlate with disease development and progression remain partially elusive. Many gene signatures proposed as diagnostic or prognostic tools in cancer poorly overlap. Identifying co-expressed, functionally related genes can reveal a core network of genes associated with PC with better reproducibility. By combining different approaches, including the integration of mRNA expression profiles, CNAs, and miRNA expression levels, we identified a gene signature of four genes overlapping with other published gene signatures and able to distinguish, in silico, high Gleason-scored PC from normal human tissue, which was further enriched to 19 genes by gene co-expression analysis. From the analysis of miRNAs possibly regulating this network, we found that hsa-miR-153 was highly connected to the genes in the network. Our results identify a four-gene signature with diagnostic and prognostic value in PC and suggest an interesting gene network that could play a key regulatory role in PC development and progression. Furthermore, hsa-miR-153, controlling this network, could be a potential biomarker for theranostics in high Gleason-scored PC. PMID:29562723
Genome-wide significant localization for working and spatial memory: Identifying genes for psychosis using models of cognition.

PubMed

Knowles, Emma E M; Carless, Melanie A; de Almeida, Marcio A A; Curran, Joanne E; McKay, D Reese; Sprooten, Emma; Dyer, Thomas D; Göring, Harald H; Olvera, Rene; Fox, Peter; Almasy, Laura; Duggirala, Ravi; Kent, Jack W; Blangero, John; Glahn, David C

2014-01-01

It is well established that risk for developing psychosis is largely mediated by the influence of genes, but identifying precisely which genes underlie that risk has been problematic. Focusing on endophenotypes, rather than illness risk, is one solution to this problem. Impaired cognition is a well-established endophenotype of psychosis. Here we aimed to characterize the genetic architecture of cognition using phenotypically detailed models as opposed to relying on general IQ or individual neuropsychological measures. In so doing we hoped to identify genes that mediate cognitive ability, which might also contribute to psychosis risk. Hierarchical factor models of genetically clustered cognitive traits were subjected to linkage analysis followed by QTL region-specific association analyses in a sample of 1,269 Mexican American individuals from extended pedigrees. We identified four genome wide significant QTLs, two for working and two for spatial memory, and a number of plausible and interesting candidate genes. The creation of detailed models of cognition seemingly enhanced the power to detect genetic effects on cognition and provided a number of possible candidate genes for psychosis. © 2013 Wiley Periodicals, Inc.
Microarray analysis and scale-free gene networks identify candidate regulators in drought-stressed roots of loblolly pine (P. taeda L.)

PubMed Central

2011-01-01

Background Global transcriptional analysis of loblolly pine (Pinus taeda L.) is challenging due to limited molecular tools. PtGen2, a 26,496 feature cDNA microarray, was fabricated and used to assess drought-induced gene expression in loblolly pine propagule roots. Statistical analysis of differential expression and weighted gene correlation network analysis were used to identify drought-responsive genes and further characterize the molecular basis of drought tolerance in loblolly pine. Results Microarrays were used to interrogate root cDNA populations obtained from 12 genotype × treatment combinations (four genotypes, three watering regimes). Comparison of drought-stressed roots with roots from the control treatment identified 2445 genes displaying at least a 1.5-fold expression difference (false discovery rate = 0.01). Genes commonly associated with drought response in pine and other plant species, as well as a number of abiotic and biotic stress-related genes, were up-regulated in drought-stressed roots. Only 76 genes were identified as differentially expressed in drought-recovered roots, indicating that the transcript population can return to the pre-drought state within 48 hours. Gene correlation analysis predicts a scale-free network topology and identifies eleven co-expression modules that ranged in size from 34 to 938 members. Network topological parameters identified a number of central nodes (hubs) including those with significant homology (E-values ≤ 2 × 10⁻³⁰) to 9-cis-epoxycarotenoid dioxygenase, zeatin O-glucosyltransferase, and ABA-responsive protein. Identified hubs also include genes that have been associated previously with osmotic stress, phytohormones, enzymes that detoxify reactive oxygen species, and several genes of unknown function. Conclusion PtGen2 was used to evaluate transcriptome responses in loblolly pine and was leveraged to identify 2445 differentially expressed genes responding to severe drought stress in roots. Many of the
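The selection criterion quoted in the record above (at least a 1.5-fold expression difference at a false discovery rate of 0.01) can be sketched as a two-step filter. The abstract does not name the FDR method, so the Benjamini-Hochberg step-up procedure here is an assumption, and the function names and toy records are hypothetical.

```python
def benjamini_hochberg(pvalues, fdr=0.01):
    """Return sorted indices of hypotheses rejected by the Benjamini-Hochberg procedure."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    cutoff_rank = 0
    for rank, i in enumerate(order, start=1):
        # Keep the largest rank whose p-value is under its step-up threshold.
        if pvalues[i] <= rank / m * fdr:
            cutoff_rank = rank
    return sorted(order[:cutoff_rank])


def select_genes(records, min_fold=1.5, fdr=0.01):
    """records: list of (gene, fold_change_ratio, pvalue) with ratio > 0."""
    passed = set(benjamini_hochberg([r[2] for r in records], fdr))
    selected = []
    for idx, (gene, fold, _pvalue) in enumerate(records):
        # Treat up- and down-regulation symmetrically (0.5 counts as 2-fold).
        magnitude = max(fold, 1.0 / fold)
        if idx in passed and magnitude >= min_fold:
            selected.append(gene)
    return selected


# Toy records with made-up values (not data from the study).
toy = [("g1", 2.0, 0.0001), ("g2", 1.2, 0.0002), ("g3", 0.5, 0.004), ("g4", 3.0, 0.5)]
print(select_genes(toy))  # ['g1', 'g3']
```

In the toy input, g2 is statistically significant but changes too little, while g4 changes a lot but is not significant; only genes meeting both conditions are kept.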
Transcriptome analysis identifies genes involved in ethanol response of Saccharomyces cerevisiae in Agave tequilana juice.

PubMed

Ramirez-Córdova, Jesús; Drnevich, Jenny; Madrigal-Pulido, Jaime Alberto; Arrizon, Javier; Allen, Kirk; Martínez-Velázquez, Moisés; Alvarez-Maya, Ikuri

2012-08-01

During ethanol fermentation, yeast cells are exposed to stress due to the accumulation of ethanol, cell growth is altered and the output of the target product is reduced. For Agave beverages, like tequila, no reports have been published on the global gene expression under ethanol stress. In this work, we used microarray analysis to identify Saccharomyces cerevisiae genes involved in the ethanol response. Gene expression of a tequila yeast strain of S. cerevisiae (AR5) was explored by comparing global gene expression with that of laboratory strain S288C, both after ethanol exposure. Additionally, we used two different culture conditions, cells grown in Agave tequilana juice as a natural fermentation media or grown in yeast-extract peptone dextrose as artificial media. Of the 6368 S. cerevisiae genes in the microarray, 657 genes were identified that had different expression responses to ethanol stress due to strain and/or media. A cluster of 28 genes was found over-expressed specifically in the AR5 tequila strain that could be involved in the adaptation to tequila yeast fermentation, 14 of which are unknown such as yor343c, ylr162w, ygr182c, ymr265c, yer053c-a or ydr415c. These could be the most suitable genes for transforming tequila yeast to increase ethanol tolerance in the tequila fermentation process. Other genes involved in response to stress (RFC4, TSA1, MLH1, PAU3, RAD53) or transport (CYB2, TIP20, QCR9) were expressed in the same cluster. Unknown genes could be good candidates for the development of recombinant yeasts with ethanol tolerance for use in industrial tequila fermentation.
Parallel analysis of tagged deletion mutants efficiently identifies genes involved in endoplasmic reticulum biogenesis.

PubMed

Wright, Robin; Parrish, Mark L; Cadera, Emily; Larson, Lynnelle; Matson, Clinton K; Garrett-Engele, Philip; Armour, Chris; Lum, Pek Yee; Shoemaker, Daniel D

2003-07-30

Increased levels of HMG-CoA reductase induce cell type- and isozyme-specific proliferation of the endoplasmic reticulum. In yeast, the ER proliferations induced by Hmg1p consist of nuclear-associated stacks of smooth ER membranes known as karmellae. To identify genes required for karmellae assembly, we compared the composition of populations of homozygous diploid S. cerevisiae deletion mutants following 20 generations of growth with and without karmellae. Using an initial population of 1,557 deletion mutants, 120 potential mutants were identified as a result of three independent experiments. Each experiment produced a largely non-overlapping set of potential mutants, suggesting that differences in specific growth conditions could be used to maximize the comprehensiveness of similar parallel analysis screens. Only two genes, UBC7 and YAL011W, were identified in all three experiments. Subsequent analysis of individual mutant strains confirmed that each experiment was identifying valid mutations, based on the mutant's sensitivity to elevated HMG-CoA reductase and inability to assemble normal karmellae. The largest class of HMG-CoA reductase-sensitive mutations was a subset of genes that are involved in chromatin structure and transcriptional regulation, suggesting that karmellae assembly requires changes in transcription or that the presence of karmellae may interfere with normal transcriptional regulation. Copyright 2003 John Wiley & Sons, Ltd.
Genome-wide association study identifies the SERPINB gene cluster as a susceptibility locus for food allergy.

PubMed

Marenholz, Ingo; Grosche, Sarah; Kalb, Birgit; Rüschendorf, Franz; Blümchen, Katharina; Schlags, Rupert; Harandi, Neda; Price, Mareike; Hansen, Gesine; Seidenberg, Jürgen; Röblitz, Holger; Yürek, Songül; Tschirner, Sebastian; Hong, Xiumei; Wang, Xiaobin; Homuth, Georg; Schmidt, Carsten O; Nöthen, Markus M; Hübner, Norbert; Niggemann, Bodo; Beyer, Kirsten; Lee, Young-Ae

2017-10-20

Genetic factors and mechanisms underlying food allergy are largely unknown. Due to heterogeneity of symptoms a reliable diagnosis is often difficult to make. Here, we report a genome-wide association study on food allergy diagnosed by oral food challenge in 497 cases and 2387 controls. We identify five loci at genome-wide significance, the clade B serpin (SERPINB) gene cluster at 18q21.3, the cytokine gene cluster at 5q31.1, the filaggrin gene, the C11orf30/LRRC32 locus, and the human leukocyte antigen (HLA) region. Stratifying the results for the causative food demonstrates that association of the HLA locus is peanut allergy-specific whereas the other four loci increase the risk for any food allergy. Variants in the SERPINB gene cluster are associated with SERPINB10 expression in leukocytes. Moreover, SERPINB genes are highly expressed in the esophagus. All identified loci are involved in immunological regulation or epithelial barrier function, emphasizing the role of both mechanisms in food allergy.
Effectively identifying regulatory hotspots while capturing expression heterogeneity in gene expression studies

PubMed Central

2014-01-01

Expression quantitative trait loci (eQTL) mapping is a tool that can systematically identify genetic variation affecting gene expression. eQTL mapping studies have shown that certain genomic locations, referred to as regulatory hotspots, may affect the expression levels of many genes. Recently, studies have shown that various confounding factors may induce spurious regulatory hotspots. Here, we introduce a novel statistical method that effectively eliminates spurious hotspots while retaining genuine hotspots. Applied to simulated and real datasets, we validate that our method achieves greater sensitivity while retaining low false discovery rates compared to previous methods. PMID:24708878
Systems Biology in Animal Breeding: Identifying relationships among markers, genes, and phenotypes

USDA-ARS's Scientific Manuscript database

The Breeding and Genetics Symposium titled “Systems Biology in Animal Breeding: Identifying relationships among markers, genes, and phenotypes” was held at the Joint Annual Meeting of the American Dairy Science Association and the American Society of Animal Science in Phoenix, AZ, July 15 to 19, 201...
MethylMix 2.0: an R package for identifying DNA methylation genes. | Office of Cancer Genomics

Cancer.gov

DNA methylation is an important mechanism regulating gene transcription, and its role in carcinogenesis has been extensively studied. Hyper- and hypomethylation of genes is a major mechanism of gene expression deregulation in a wide range of diseases. At the same time, high-throughput DNA methylation assays have been developed, generating vast amounts of genome-wide DNA methylation measurements. We developed MethylMix, an algorithm implemented in R to identify disease-specific hyper- and hypomethylated genes.
High density array screening to identify the genetic requirements for transition metal tolerance in Saccharomyces cerevisiae.

PubMed

Bleackley, Mark R; Young, Barry P; Loewen, Christopher J R; MacGillivray, Ross T A

2011-02-01

Biological systems have developed with a strong dependence on transition metals for accomplishing a number of biochemical reactions. Iron, copper, manganese and zinc are essential for virtually all forms of life with their unique chemistries contributing to a variety of physiological processes including oxygen transport, generation of cellular energy and protein structure and function. Properties of these metals (and to a lesser extent nickel and cobalt) that make them so essential to life also make them extremely cytotoxic in many cases through the formation of damaging oxygen radicals via Fenton chemistry. While life has evolved to exploit the chemistries of transition metals to drive physiological reactions, systems have concomitantly evolved to protect against the damaging effects of these same metals. Saccharomyces cerevisiae is a valuable tool for studying metal homeostasis with many of the genes identified thus far having homologs in higher eukaryotes including humans. Using high density arrays, we have screened a haploid S. cerevisiae deletion set containing 4786 non-essential gene deletions for strains sensitive to each of Fe, Cu, Mn, Ni, Zn and Co and then integrated the six screens using cluster analysis to identify pathways that are unique to individual metals and others with function shared between metals. Genes with no previous implication in metal homeostasis were found to contribute to sensitivity to each metal. Significant overlap was observed between the strains that were sensitive to Mn, Ni, Zn and Co with many of these strains lacking genes for the high affinity Fe transport pathway and genes involved in vacuolar transport and acidification. The results from six genome-wide metal tolerance screens show that there is some commonality between the cellular defenses against the toxicity of Mn, Ni, Zn and Co with Fe and Cu requiring different systems. Additionally, potential new factors have been identified that function in tolerance to each of the six
An essential role of a FoxD gene in notochord induction in Ciona embryos.

PubMed

Imai, Kaoru S; Satoh, Nori; Satou, Yutaka

2002-07-01

A key issue for understanding the early development of the chordate body plan is how the endoderm induces notochord formation. In the ascidian Ciona, nuclear accumulation of beta-catenin is the first step in the process of endoderm specification. We show that nuclear accumulation of beta-catenin directly activates the gene (Cs-FoxD) for a winged helix/forkhead transcription factor and that this gene is expressed transiently at the 16- and 32-cell stages in endodermal cells. The function of Cs-FoxD, however, is not associated with differentiation of the endoderm itself but is essential for notochord differentiation or induction. In addition, it is likely that the inductive signal that appears to act downstream of Cs-FoxD does not act over a long range. It has been suggested that FGF or Notch signal transduction pathway mediates ascidian notochord induction. Our previous study suggests that Cs-FGF4/6/9 is partially involved in the notochord induction. The present experimental results suggest that the expression and function of Cs-FGF4/6/9 and Cs-FoxD are not interdependent, and that the Notch pathway is involved in B-line notochord induction downstream of Cs-FoxD.
Regulation of Msx genes by a Bmp gradient is essential for neural crest specification.

PubMed

Tribulo, Celeste; Aybar, Manuel J; Nguyen, Vu H; Mullins, Mary C; Mayor, Roberto

2003-12-01

genetic cascade. In order to study the hierarchical relationship between msx1 and snail/slug we performed several rescue experiments using dominant negatives for these genes. The rescuing activity by snail and slug on neural crest development of the msx1 dominant negative, together with the inability of msx1 to rescue the dominant negatives of slug and snail strongly argue that msx1 is upstream of snail and slug in the genetic cascade that specifies the neural crest in the ectoderm. We propose a model where a gradient of Bmp activity specifies the expression of Msx genes in the neural folds, and that this expression is essential for the early specification of the neural crest.
Co-fuse: a new class discovery analysis tool to identify and prioritize recurrent fusion genes from RNA-sequencing data.

PubMed

Paisitkriangkrai, Sakrapee; Quek, Kelly; Nievergall, Eva; Jabbour, Anissa; Zannettino, Andrew; Kok, Chung Hoow

2018-06-07

Recurrent oncogenic fusion genes play a critical role in the development of various cancers and diseases and provide, in some cases, excellent therapeutic targets. To date, analysis tools that can identify and compare recurrent fusion genes across multiple samples have not been available to researchers. To address this deficiency, we developed Co-occurrence Fusion (Co-fuse), a new and easy to use software tool that enables biologists to merge RNA-seq information, allowing them to identify recurrent fusion genes without the need for exhaustive data processing. Notably, Co-fuse is based on pattern mining and statistical analysis, which enables the identification of hidden patterns of recurrent fusion genes. In this report, we show that Co-fuse can be used to identify 2 distinct groups within a set of 49 leukemic cell lines based on their recurrent fusion genes: a multiple myeloma (MM) samples-enriched cluster and an acute myeloid leukemia (AML) samples-enriched cluster. Our experimental results further demonstrate that Co-fuse can identify known driver fusion genes (e.g., IGH-MYC, IGH-WHSC1) in MM, when compared to AML samples, indicating the potential of Co-fuse to aid the discovery of yet unknown driver fusion genes through cohort comparisons. Additionally, using an RNA-seq dataset of 272 primary glioma samples, Co-fuse was able to validate recurrent fusion genes, further demonstrating the power of this analysis tool to identify recurrent fusion genes. Taken together, Co-fuse is a powerful new analysis tool that can be readily applied to large RNA-seq datasets, and may lead to the discovery of new disease subgroups and potentially new driver genes for which targeted therapies could be developed. The Co-fuse R source code is publicly available at https://github.com/sakrapee/co-fuse.
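The idea of recurrence across samples can be illustrated with a minimal Python sketch. This is not the Co-fuse algorithm (which the abstract describes as pattern mining plus statistical analysis, and which is implemented in R); it only counts in how many samples each fusion appears. The function name, the `min_samples` cutoff, and the sample labels are assumptions made for illustration, and the input calls are invented, not data from the study.

```python
from collections import Counter

def recurrent_fusions(samples, min_samples=2):
    """Return (fusion, n_samples) pairs seen in at least `min_samples` samples."""
    counts = Counter()
    for fusions in samples.values():
        # Count each fusion at most once per sample, even if called repeatedly.
        counts.update(set(fusions))
    hits = [(fusion, n) for fusion, n in counts.items() if n >= min_samples]
    # Most recurrent first; ties broken alphabetically for a stable order.
    return sorted(hits, key=lambda item: (-item[1], item[0]))

# Hypothetical per-sample fusion calls (illustrative only).
calls = {
    "sample_A": ["IGH-MYC", "IGH-WHSC1"],
    "sample_B": ["IGH-MYC", "IGH-MYC"],
    "sample_C": ["IGH-WHSC1", "GENE1-GENE2"],
}
result = recurrent_fusions(calls)
print(result)
```

Counting per sample rather than per call keeps a single sample with many duplicate calls from looking recurrent.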
Gene expression in bovine rumen epithelium during weaning identifies molecular regulators of rumen development and growth.

PubMed

Connor, Erin E; Baldwin, Ransom L; Li, Cong-jun; Li, Robert W; Chung, Hoyoung

2013-03-01

During weaning, epithelial cell function in the rumen transitions in response to conversion from a pre-ruminant to a true ruminant environment to ensure efficient nutrient absorption and metabolism. To identify gene networks affected by weaning in bovine rumen, Holstein bull calves were fed commercial milk replacer only (MRO) until 42 days of age, then were provided diets of either milk + orchardgrass hay (MH) or milk + grain-based calf starter (MG). Rumen epithelial RNA was extracted from calves sacrificed at four time points: day 14 (n = 3) and day 42 (n = 3) of age while fed the MRO diet and day 56 (n = 3/diet) and day 70 (n = 3/diet) while fed the MH and MG diets for transcript profiling by microarray hybridization. Five two-group comparisons were made using Permutation Analysis of Differential Expression® to identify differentially expressed genes over time and developmental stage between days 14 and 42 within the MRO diet, between day 42 on the MRO diet and day 56 on the MG or MH diets, and between the MG and MH diets at days 56 and 70. Ingenuity Pathway Analysis (IPA) of differentially expressed genes during weaning indicated the top 5 gene networks involving molecules participating in lipid metabolism, cell morphology and death, cellular growth and proliferation, molecular transport, and the cell cycle. Putative genes functioning in the establishment of the rumen microbial population and associated rumen epithelial inflammation during weaning were identified. Activation of transcription factor PPAR-α was identified by IPA software as an important regulator of molecular changes in rumen epithelium that function in papillary development and fatty acid oxidation during the transition from pre-rumination to rumination. Thus, molecular markers of rumen development and gene networks regulating differentiation and growth of rumen epithelium were identified for selecting targets and methods for improving and assessing rumen development and

Genome-wide association study identifies candidate genes for male fertility traits in humans.

PubMed

Kosova, Gülüm; Scott, Nicole M; Niederberger, Craig; Prins, Gail S; Ober, Carole

2012-06-08

Despite the fact that hundreds of genes are known to affect fertility in animal models, relatively little is known about genes that influence natural fertility in humans. To broadly survey genes contributing to variation in male fertility, we conducted a genome-wide association study (GWAS) of two fertility traits (family size and birth rate) in 269 married men who are members of a founder population of European descent that proscribes contraception and has large family sizes. Associations between ∼250,000 autosomal SNPs and the fertility traits were examined. A total of 41 SNPs with p ≤ 1 × 10⁻⁴ for either trait were taken forward to a validation study of 123 ethnically diverse men from Chicago who had previously undergone semen analyses. Nine (22%) of the SNPs associated with reduced fertility in the GWAS were also associated with one or more of the ten measures of reduced sperm quantity and/or function, yielding 27 associations with p values < 0.05 and seven with p values < 0.01 in the validation study. On the basis of 5,000 permutations of our data, the probabilities of observing this many or more small p values were 0.0014 and 5.6 × 10⁻⁴, respectively. Among the nine associated loci, outstanding candidates for male fertility genes include USP8, an essential deubiquitinating enzyme that has a role in acrosome assembly; UBD and EPSTI1, which have potential roles in innate immunity; and LRRC32, which encodes a latent transforming growth factor β (TGF-β) receptor on regulatory T cells. We suggest that mutations in these genes that are more severe may account for some of the unexplained infertility (or subfertility) in the general population. Copyright © 2012 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
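The permutation reasoning in this abstract (how often shuffled data yield as many "hits" as the real data) can be sketched in Python. This is a simplified illustration under stated assumptions, not the authors' procedure: it swaps their per-SNP association p values for a plain difference in mean phenotype between carriers and non-carriers, and the function names, threshold, and toy data are all hypothetical.

```python
import random

def count_hits(genotypes, phenotype, threshold):
    """Count SNPs whose carrier vs. non-carrier mean difference is >= threshold."""
    hits = 0
    for snp in genotypes:
        carriers = [p for g, p in zip(snp, phenotype) if g == 1]
        others = [p for g, p in zip(snp, phenotype) if g == 0]
        if carriers and others:
            diff = abs(sum(carriers) / len(carriers) - sum(others) / len(others))
            if diff >= threshold:
                hits += 1
    return hits

def permutation_p(genotypes, phenotype, threshold, n_perm=1000, seed=0):
    """Fraction of phenotype shuffles giving at least as many hits as observed."""
    rng = random.Random(seed)
    observed = count_hits(genotypes, phenotype, threshold)
    shuffled = list(phenotype)
    as_extreme = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # breaks any genotype-phenotype link
        if count_hits(genotypes, shuffled, threshold) >= observed:
            as_extreme += 1
    return observed, as_extreme / n_perm

# Toy data (invented): 2 SNPs typed in 6 individuals, one quantitative trait.
toy_genotypes = [[1, 1, 1, 0, 0, 0], [1, 0, 1, 0, 1, 0]]
toy_phenotype = [5, 6, 7, 1, 2, 3]
observed, p_value = permutation_p(toy_genotypes, toy_phenotype, threshold=3)
print(observed, p_value)
```

Shuffling the phenotype while leaving the genotypes untouched preserves the correlation structure among SNPs, which is why a permutation null is often preferred over assuming independent tests.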
Genome-wide Association Study Identifies Candidate Genes for Male Fertility Traits in Humans

PubMed Central

Kosova, Gülüm; Scott, Nicole M.; Niederberger, Craig; Prins, Gail S.; Ober, Carole

2012-01-01

Despite the fact that hundreds of genes are known to affect fertility in animal models, relatively little is known about genes that influence natural fertility in humans. To broadly survey genes contributing to variation in male fertility, we conducted a genome-wide association study (GWAS) of two fertility traits (family size and birth rate) in 269 married men who are members of a founder population of European descent that proscribes contraception and has large family sizes. Associations between ∼250,000 autosomal SNPs and the fertility traits were examined. A total of 41 SNPs with p ≤ 1 × 10⁻⁴ for either trait were taken forward to a validation study of 123 ethnically diverse men from Chicago who had previously undergone semen analyses. Nine (22%) of the SNPs associated with reduced fertility in the GWAS were also associated with one or more of the ten measures of reduced sperm quantity and/or function, yielding 27 associations with p values < 0.05 and seven with p values < 0.01 in the validation study. On the basis of 5,000 permutations of our data, the probabilities of observing this many or more small p values were 0.0014 and 5.6 × 10⁻⁴, respectively. Among the nine associated loci, outstanding candidates for male fertility genes include USP8, an essential deubiquitinating enzyme that has a role in acrosome assembly; UBD and EPSTI1, which have potential roles in innate immunity; and LRRC32, which encodes a latent transforming growth factor β (TGF-β) receptor on regulatory T cells. We suggest that mutations in these genes that are more severe may account for some of the unexplained infertility (or subfertility) in the general population. PMID:22633400
Comparative Transcriptomics to Identify Novel Genes and Pathways in Dinoflagellates

NASA Astrophysics Data System (ADS)

Ryan, D.

2016-02-01

The unarmored dinoflagellate Karenia brevis is among the most prominent harmful, bloom-forming phytoplankton species in the Gulf of Mexico. During blooms, the polyketides PbTx-1 and PbTx-2 (brevetoxins) are produced by K. brevis. Brevetoxins negatively impact human health and the Gulf shellfish harvest. However, the genes underlying brevetoxin synthesis are currently unknown. Because the K. brevis genome is extremely large (1 × 10¹¹ base pairs long), and with a high proportion of repetitive, non-coding DNA, it has not been sequenced. In fact, large, repetitive genomes are common among the dinoflagellate group. High-throughput RNA sequencing technology enabled us to assemble Karenia transcriptomes de novo and investigate potential genes in the brevetoxin pathway through comparative transcriptomics. The brevetoxin profile varies among K. brevis clonal cultures. For example, well-documented Wilson-CCFWC268 typically produces 8-10 pg PbTx per cell, whereas SP1 produces < 2 pg PbTx/cell, and the mutant low-toxin Wilson clone produces undetectable to low (<0.05 pg/cell) amounts. Further, PbTx-2 has been measured in Karenia papilionacea but not Karenia mikimotoi. We compared the transcriptomes of four K. brevis clones (Wilson-CCFWC268, SP3, SP1, and mutant low-toxin Wilson) with K. papilionacea and K. mikimotoi to investigate nucleotide-level genetic variations and differences in gene expression. Of the 85,000 transcripts in the K. brevis transcriptome, 4,600 transcripts, including novel unannotated orthologs and putative polyketide synthases (PKSs), were only expressed by brevetoxin-producing K. brevis and K. papilionacea, not K. mikimotoi. Examination of gene expression between the typical- and low-toxin Wilson clones identified about 3,500 genes with significantly different expression levels, including 2 putative PKSs. One of the 2 PKSs was only found in the brevetoxin-producing Karenia species. These transcriptomes could not have been characterized without high
Epigenomic elements analyses for promoters identify ESRRG as a new susceptibility gene for obesity-related traits.

PubMed

Dong, S-S; Guo, Y; Zhu, D-L; Chen, X-F; Wu, X-M; Shen, H; Chen, X-D; Tan, L-J; Tian, Q; Deng, H-W; Yang, T-L

2016-07-01

With ENCODE epigenomic data and results from published genome-wide association studies (GWASs), we aimed to find regulatory signatures of obesity genes and discover novel susceptibility genes. Obesity genes were obtained from public GWAS databases and their promoters were annotated based on the regulatory element information. Significantly enriched or depleted epigenomic elements in the promoters of obesity genes were evaluated and all human genes were then prioritized according to the existence of the selected elements to predict new candidate genes. Top-ranked genes were subsequently applied to validate their associations with obesity-related traits in three independent in-house GWAS samples. We identified RAD21 and EZH2 as over-represented, and STAT2 (signal transducer and activator of transcription 2) and IRF3 (interferon regulatory transcription factor 3) as depleted transcription factors. Histone modification of H3K9me3 and chromatin state segmentation of 'poised promoter' and 'repressed' were over-represented. All genes were prioritized and we selected the top five genes for validation at the population level. Combining results from the three GWAS samples, rs7522101 in ESRRG (estrogen-related receptor-γ) remained significantly associated with body mass index after multiple testing corrections (P=7.25 × 10⁻⁵). It was also associated with β-cell function (P=1.99 × 10⁻³) and fasting glucose level (P<0.05) in the meta-analyses of glucose and insulin-related traits consortium (MAGIC) data set. Conclusions: In summary, we identified epigenomic characteristics for obesity genes and suggested ESRRG as a novel obesity-susceptibility gene.
Weighted gene co‑expression network analysis in identification of key genes and networks for ischemic‑reperfusion remodeling myocardium.

PubMed

Guo, Nan; Zhang, Nan; Yan, Liqiu; Lian, Zheng; Wang, Jiawang; Lv, Fengfeng; Wang, Yunfei; Cao, Xufen

2018-06-14

Acute myocardial infarction induces ventricular remodeling, which is implicated in dilated heart and heart failure. The pathogenic mechanism of myocardium remodeling remains to be elucidated. The aim of the present study was to identify key genes and networks for myocardium remodeling following ischemia‑reperfusion (IR). First, the mRNA expression data from the National Center for Biotechnology Information database were downloaded to identify differences in mRNA expression of the IR heart at days 2 and 7. Then, weighted gene co‑expression network analysis, hierarchical clustering, protein‑protein interaction (PPI) network, Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses were used to identify key genes and networks for the heart remodeling process following IR. A total of 3,321 differentially expressed genes were identified during the heart remodeling process. A total of 6 modules were identified through gene co‑expression network analysis. GO and KEGG analysis results suggested that each module represented a different biological function and was associated with different pathways. Finally, hub genes of each module were identified by PPI network construction. The present study revealed that heart remodeling following IR is a complicated process, involving extracellular matrix organization, neural development, apoptosis and energy metabolism. The dysregulated genes, including SRC proto‑oncogene, non‑receptor tyrosine kinase, discs large MAGUK scaffold protein 1, ATP citrate lyase, RAN, member RAS oncogene family, tumor protein p53, and polo like kinase 2, may be essential for heart remodeling following IR and may be used as potential targets for the inhibition of heart remodeling following acute myocardial infarction.
Isocitrate Lyase Is Essential for Pathogenicity of the Fungus Leptosphaeria maculans to Canola (Brassica napus)

PubMed Central

Idnurm, Alexander; Howlett, Barbara J.

2002-01-01

A pathogenicity gene has been identified in Leptosphaeria maculans, the ascomycetous fungus that causes blackleg disease of canola (Brassica napus). This gene encodes isocitrate lyase, a component of the glyoxylate cycle, and is essential for the successful colonization of B. napus. It was identified by a reverse genetics approach whereby a plasmid conferring hygromycin resistance was inserted randomly into the L. maculans genome. Twelve of 516 transformants tested had reduced pathogenicity on cotyledons of B. juncea and B. napus, and 1 of these 12 had a deletion of the isocitrate lyase gene, as well as an insertion of the hygromycin resistance gene. This mutant was unable to grow on fatty acids, including monolaurate, and the isocitrate lyase transcript was not detected. When the wild-type gene was reintroduced into the mutant, growth on monolaurate was restored and pathogenicity was partially restored. L. maculans isocitrate lyase is produced during infection of B. napus cotyledons, while the plant homologue is not. When 2.5% glucose was added to the inoculum of the isocitrate lyase mutant, lesions of sizes similar to those caused by wild-type isolate M1 developed on B. napus cotyledons. These findings suggest that the glyoxylate pathway is essential for disease development by this plant-pathogenic fungus, as has been shown recently for a fungal and bacterial pathogen of animals and a bacterial pathogen of plants. Involvement of the glyoxylate pathway in pathogenesis in animals and plants presents potential drug targets for control of diseases. PMID:12455691
GeneMachine: gene prediction and sequence annotation.

PubMed

Makalowska, I; Ryan, J F; Baxevanis, A D

2001-09-01

A number of free-standing programs have been developed in order to help researchers find potential coding regions and deduce gene structure for long stretches of what is essentially 'anonymous DNA'. As these programs apply inherently different criteria to the question of what is and is not a coding region, multiple algorithms should be used in the course of positional cloning and positional candidate projects to assure that all potential coding regions within a previously-identified critical region are identified. We have developed a gene identification tool called GeneMachine which allows users to query multiple exon and gene prediction programs in an automated fashion. BLAST searches are also performed in order to see whether a previously-characterized coding region corresponds to a region in the query sequence. A suite of Perl programs and modules are used to run MZEF, GENSCAN, GRAIL 2, FGENES, RepeatMasker, Sputnik, and BLAST. The results of these runs are then parsed and written into ASN.1 format. Output files can be opened using NCBI Sequin, in essence using Sequin as both a workbench and as a graphical viewer. The main feature of GeneMachine is that the process is fully automated; the user is only required to launch GeneMachine and then open the resulting file with Sequin. Annotations can then be made to these results prior to submission to GenBank, thereby increasing the intrinsic value of these data. GeneMachine is freely-available for download at http://genome.nhgri.nih.gov/genemachine. A public Web interface to the GeneMachine server for academic and not-for-profit users is available at http://genemachine.nhgri.nih.gov. The Web supplement to this paper may be found at http://genome.nhgri.nih.gov/genemachine/supplement/.
Pharmacological Validation of Candidate Causal Sleep Genes Identified in an N2 Cross

PubMed Central

Brunner, Joseph I.; Gotter, Anthony L.; Millstein, Joshua; Garson, Susan; Binns, Jacquelyn; Fox, Steven V.; Savitz, Alan T.; Yang, He S.; Fitzpatrick, Karrie; Zhou, Lili; Owens, Joseph R.; Webber, Andrea L.; Vitaterna, Martha H.; Kasarskis, Andrew; Uebele, Victor N.; Turek, Fred; Renger, John J.; Winrow, Christopher J.

2013-01-01

Despite the substantial impact of sleep disturbances on human health and the many years of study dedicated to understanding sleep pathologies, the underlying genetic mechanisms that govern sleep and wake largely remain unknown. Recently, we completed large scale genetic and gene expression analyses in a segregating inbred mouse cross and identified candidate causal genes that regulate the mammalian sleep-wake cycle, across multiple traits including total sleep time, amounts of REM, non-REM, sleep bout duration and sleep fragmentation. Here we describe a novel approach toward validating candidate causal genes, while also identifying potential targets for sleep-related indications. Select small molecule antagonists and agonists were used to interrogate candidate causal gene function in rodent sleep polysomnography assays to determine impact on overall sleep architecture and to evaluate alignment with associated sleep-wake traits. Significant effects on sleep architecture were observed in validation studies using compounds targeting the muscarinic acetylcholine receptor M3 subunit (Chrm3)(wake promotion), nicotinic acetylcholine receptor alpha4 subunit (Chrna4)(wake promotion), dopamine receptor D5 subunit (Drd5)(sleep induction), serotonin 1D receptor (Htr1d)(altered REM fragmentation), glucagon-like peptide-1 receptor (Glp1r)(light sleep promotion and reduction of deep sleep), and Calcium channel, voltage-dependent, T type, alpha 1I subunit (Cacna1i)(increased bout duration slow wave sleep). Taken together, these results show the complexity of genetic components that regulate sleep-wake traits and highlight the importance of evaluating this complex behavior at a systems level. Pharmacological validation of genetically identified putative targets provides a rapid alternative to generating knock out or transgenic animal models, and may ultimately lead towards new therapeutic opportunities. PMID:22091728
QTL Mapping and CRISPR/Cas9 Editing to Identify a Drug Resistance Gene in Toxoplasma gondii.

PubMed

Shen, Bang; Powell, Robin H; Behnke, Michael S

2017-06-22

Scientific knowledge is intrinsically linked to available technologies and methods. This article will present two methods that allowed for the identification and verification of a drug resistance gene in the Apicomplexan parasite Toxoplasma gondii: Quantitative Trait Locus (QTL) mapping using a Whole Genome Sequence (WGS)-based genetic map, and Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/Cas9-based gene editing. The approach of QTL mapping allows one to test if there is a correlation between a genomic region(s) and a phenotype. Two datasets are required to run a QTL scan: a genetic map based on the progeny of a recombinant cross and a quantifiable phenotype assessed in each of the progeny of that cross. These datasets are then formatted to be compatible with R/qtl software, which generates a QTL scan to identify significant loci correlated with the phenotype. Although this can greatly narrow the search window of possible candidates, QTLs span regions containing a number of genes from which the causal gene needs to be identified. Having WGS of the progeny was critical to identify the causal drug resistance mutation at the gene level. Once identified, the candidate mutation can be verified by genetic manipulation of drug sensitive parasites. The most facile and efficient method to genetically modify T. gondii is the CRISPR/Cas9 system. This system comprises just 2 components, both encoded on a single plasmid: a single guide RNA (gRNA) containing a 20 bp sequence complementary to the genomic target, and the Cas9 endonuclease, which generates a double-strand DNA break (DSB) at the target, repair of which allows for insertion or deletion of sequences around the break site. This article provides detailed protocols to use CRISPR/Cas9 based genome editing tools to verify the gene responsible for sinefungin resistance and to construct transgenic parasites.
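The two inputs the abstract names for a QTL scan (a genetic map of the progeny and a phenotype per progeny) can be made concrete with a small Python sketch of single-marker regression. This is not R/qtl and does not reproduce its interface; it only computes, for each marker, the LOD score LOD = (n/2) · log10(RSS0/RSS1), where RSS0 is the residual sum of squares around the overall mean and RSS1 around the per-allele means. Haploid progeny with alleles coded 0/1, the marker names, and the phenotype values are all assumptions made for illustration.

```python
import math

def lod_score(genotypes, phenotype):
    """LOD for one marker: (n/2) * log10(RSS_null / RSS_by_allele)."""
    n = len(phenotype)
    mean_all = sum(phenotype) / n
    rss0 = sum((y - mean_all) ** 2 for y in phenotype)
    rss1 = 0.0
    for allele in (0, 1):
        group = [y for g, y in zip(genotypes, phenotype) if g == allele]
        if group:
            mean_group = sum(group) / len(group)
            rss1 += sum((y - mean_group) ** 2 for y in group)
    if rss0 == 0:
        return 0.0       # no phenotypic variation to explain
    if rss1 == 0:
        return math.inf  # marker explains the phenotype perfectly
    return (n / 2) * math.log10(rss0 / rss1)

# Hypothetical genetic map: parental allele (0 or 1) of each progeny at each marker.
markers = {
    "m1": [0, 0, 0, 1, 1, 1],
    "m2": [0, 1, 0, 1, 0, 1],
}
# Hypothetical phenotype (e.g. a drug tolerance measure) for the same six progeny.
phenotype = [1.0, 1.2, 0.9, 3.1, 2.8, 3.0]

scan = {name: lod_score(alleles, phenotype) for name, alleles in markers.items()}
best_marker = max(scan, key=scan.get)
print(scan, best_marker)
```

In a real analysis the significance threshold for a LOD peak is usually set by permuting the phenotype, and the peak only delimits a region; as the abstract notes, the causal gene within it still has to be identified and verified.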
MVisAGe Identifies Concordant and Discordant Genomic Alterations of Driver Genes in Squamous Tumors.

PubMed

Walter, Vonn; Du, Ying; Danilova, Ludmila; Hayward, Michele C; Hayes, D Neil

2018-06-15

Integrated analyses of multiple genomic datatypes are now common in cancer profiling studies. Such data present opportunities for numerous computational experiments, yet analytic pipelines are limited. Tools such as the cBioPortal and Regulome Explorer, although useful, are not easy to access programmatically or to implement locally. Here, we introduce the MVisAGe R package, which allows users to quantify gene-level associations between two genomic datatypes to investigate the effect of genomic alterations (e.g., DNA copy number changes on gene expression). Visualizing Pearson/Spearman correlation coefficients according to the genomic positions of the underlying genes provides a powerful yet novel tool for conducting exploratory analyses. We demonstrate its utility by analyzing three publicly available cancer datasets. Our approach highlights canonical oncogenes in chr11q13 that displayed the strongest associations between expression and copy number, including CCND1 and CTTN, genes not identified by copy number analysis in the primary reports. We demonstrate highly concordant usage of shared oncogenes on chr3q, yet strikingly diverse oncogene usage on chr11q as a function of HPV infection status. Regions of chr19 that display remarkable associations between methylation and gene expression were identified, as were previously unreported miRNA-gene expression associations that may contribute to the epithelial-to-mesenchymal transition. Significance: This study presents an important bioinformatics tool that will enable integrated analyses of multiple genomic datatypes. Cancer Res; 78(12); 3375-85. ©2018 American Association for Cancer Research.
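The core computation described here, a per-gene correlation between two datatypes reported in genomic order, can be sketched in a few lines of Python. This is not the MVisAGe package or its API (MVisAGe is an R package); the gene names, positions, and measurements below are invented placeholders, and only the Pearson coefficient is shown.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)

# Hypothetical records: (gene, genomic position, copy number per sample, expression per sample).
records = [
    ("gene_x", 300, [1, 2, 3, 4], [8, 6, 4, 2]),
    ("gene_y", 100, [1, 2, 3, 4], [2, 4, 6, 8]),
]

# One coefficient per gene, ordered by genomic position so neighbouring genes can be compared.
by_position = sorted(records, key=lambda rec: rec[1])
correlations = [(gene, pearson(cn, expr)) for gene, _, cn, expr in by_position]
print(correlations)
```

Ordering by position is what turns a table of coefficients into something that can be plotted along a chromosome arm, which is the exploratory view the abstract emphasizes.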
Inhibition of Escherichia coli viability by external guide sequences complementary to two essential genes

PubMed Central

McKinney, Jeffrey; Guerrier-Takada, Cecilia; Wesolowski, Donna; Altman, Sidney

2001-01-01

Narrow spectrum antimicrobial activity has been designed to reduce the expression of two essential genes, one coding for the protein subunit of RNase P (C5 protein) and one for gyrase (gyrase A). In both cases, external guide sequences (EGS) have been designed to complex with either mRNA. Using the EGS technology, the level of microbial viability is reduced to less than 10% of the wild-type strain. The EGSs are additive when used together and depend on the number of nucleotides paired when attacking gyrase A mRNA. In the case of gyrase A, three nucleotides unpaired out of a 15-mer EGS still favor complete inhibition by the EGS but five unpaired nucleotides do not. PMID:11381134
Genome-wide localization and expression profiling establish Sp2 as a sequence-specific transcription factor regulating vitally important genes

PubMed Central

Terrados, Gloria; Finkernagel, Florian; Stielow, Bastian; Sadic, Dennis; Neubert, Juliane; Herdt, Olga; Krause, Michael; Scharfe, Maren; Jarek, Michael; Suske, Guntram

2012-01-01

The transcription factor Sp2 is essential for early mouse development and for proliferation of mouse embryonic fibroblasts in culture. Yet its mechanisms of action and its target genes are largely unknown. In this study, we have combined RNA interference, in vitro DNA binding, chromatin immunoprecipitation sequencing and global gene-expression profiling to investigate the role of Sp2 for cellular functions, to define target sites and to identify genes regulated by Sp2. We show that Sp2 is important for cellular proliferation, and that it binds to GC-boxes and occupies proximal promoters of genes essential for vital cellular processes including gene expression, replication, metabolism and signalling. Moreover, we identified important key target genes and cellular pathways that are directly regulated by Sp2. Most significantly, Sp2 binds and activates numerous sequence-specific transcription factor and co-activator genes, and represses the whole battery of cholesterol synthesis genes. Our results establish Sp2 as a sequence-specific regulator of vitally important genes. PMID:22684502
Genomic convergence to identify candidate genes for Alzheimer disease on chromosome 10

PubMed Central

Liang, Xueying; Slifer, Michael; Martin, Eden R.; Schnetz-Boutaud, Nathalie; Bartlett, Jackie; Anderson, Brent; Züchner, Stephan; Gwirtsman, Harry; Gilbert, John R.; Pericak-Vance, Margaret A.; Haines, Jonathan L.

2009-01-01

A broad region of chromosome 10 (chr10) has engendered continued interest in the etiology of late-onset Alzheimer Disease (LOAD) from both linkage and candidate gene studies. However, there is extensive heterogeneity on chr10. We combined linkage analysis and gene expression data using the concept of genomic convergence, which suggests that genes showing positive results across multiple different data types are more likely to be involved in AD. We identified and examined 28 genes on chr10 for association with AD in a Caucasian case-control dataset of 506 cases and 558 controls with substantial clinical information. The cases were all LOAD (minimum age at onset ≥ 60 years). Both single marker and haplotypic associations were tested in the overall dataset and 8 subsets defined by age, gender, ApoE and clinical status. PTPLA showed allelic, genotypic and haplotypic association in the overall dataset. SORCS1 was significant in the overall dataset (p=0.0025) and most significant in the female subset (allelic association p=0.00002; a 3-locus haplotype had p=0.0005). The odds ratio for SORCS1 in the female subset was 1.7 (p<0.0001). SORCS1 is an interesting candidate gene involved in the Aβ pathway. Therefore, genetic variations in PTPLA and SORCS1 may be associated with AD and have a modest effect on its risk by affecting the Aβ pathway. Replication of the effect of these genes in different study populations, a search for susceptibility variants, and functional studies of these genes are necessary to better understand their roles in Alzheimer disease. PMID:19241460
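The abstract above reports an allelic odds ratio without showing how such a figure is obtained. As a minimal sketch, assuming a simple 2x2 table of allele counts and a Woolf (log-based) 95% confidence interval, the calculation could look like the following. The function name and the counts are hypothetical and are not taken from the study.

```python
import math


def allelic_odds_ratio(case_risk, case_other, ctrl_risk, ctrl_other):
    """Return the odds ratio and a Woolf 95% CI for a 2x2 allele-count table.

    All four arguments are allele counts (hypothetical inputs, not study data).
    """
    odds_ratio = (case_risk * ctrl_other) / (case_other * ctrl_risk)
    # Standard error of log(OR) under the Woolf approximation.
    se_log_or = math.sqrt(
        1 / case_risk + 1 / case_other + 1 / ctrl_risk + 1 / ctrl_other
    )
    log_or = math.log(odds_ratio)
    lower = math.exp(log_or - 1.96 * se_log_or)
    upper = math.exp(log_or + 1.96 * se_log_or)
    return odds_ratio, (lower, upper)


# Hypothetical allele counts chosen only to illustrate the arithmetic.
or_value, ci = allelic_odds_ratio(120, 80, 90, 110)
print(f"OR = {or_value:.2f}, 95% CI = ({ci[0]:.2f}, {ci[1]:.2f})")
```

An interval that excludes 1 would be consistent with an association at roughly the 5% level; the study itself may have used a different test or model.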
Enhancer connectome in primary human cells identifies target genes of disease-associated DNA elements

PubMed Central

Mumbach, Maxwell R; Satpathy, Ansuman T; Boyle, Evan A; Dai, Chao; Gowen, Benjamin G; Cho, Seung Woo; Nguyen, Michelle L; Rubin, Adam J; Granja, Jeffrey M; Kazane, Katelynn R; Wei, Yuning; Nguyen, Trieu; Greenside, Peyton G; Corces, M Ryan; Tycko, Josh; Simeonov, Dimitre R; Suliman, Nabeela; Li, Rui; Xu, Jin; Flynn, Ryan A; Kundaje, Anshul; Khavari, Paul A; Marson, Alexander; Corn, Jacob E; Quertermous, Thomas; Greenleaf, William J; Chang, Howard Y

2018-01-01

The challenge of linking intergenic mutations to target genes has limited molecular understanding of human diseases. Here we show that H3K27ac HiChIP generates high-resolution contact maps of active enhancers and target genes in rare primary human T cell subtypes and coronary artery smooth muscle cells. Differentiation of naive T cells into T helper 17 cells or regulatory T cells creates subtype-specific enhancer–promoter interactions, specifically at regions of shared DNA accessibility. These data provide a principled means of assigning molecular functions to autoimmune and cardiovascular disease risk variants, linking hundreds of noncoding variants to putative gene targets. Target genes identified with HiChIP are further supported by CRISPR interference and activation at linked enhancers, by the presence of expression quantitative trait loci, and by allele-specific enhancer loops in patient-derived primary cells. The majority of disease-associated enhancers contact genes beyond the nearest gene in the linear genome, leading to a fourfold increase in the number of potential target genes for autoimmune and cardiovascular diseases. PMID:28945252
An elm EST database for identifying leaf beetle egg-induced defense genes

PubMed Central

2012-01-01

Background: Plants can defend themselves against herbivorous insects prior to the onset of larval feeding by responding to the eggs laid on their leaves. In the European field elm (Ulmus minor), egg laying by the elm leaf beetle (Xanthogaleruca luteola) activates the emission of volatiles that attract specialised egg parasitoids, which in turn kill the eggs. Little is known about the transcriptional changes that insect eggs trigger in plants and how such indirect defense mechanisms are orchestrated in the context of other biological processes. Results: Here we present the first large scale study of egg-induced changes in the transcriptional profile of a tree. Five cDNA libraries were generated from leaves of (i) untreated control elms, and elms treated with (ii) egg laying and feeding by elm leaf beetles, (iii) feeding, (iv) artificial transfer of egg clutches, and (v) methyl jasmonate. A total of 361,196 expressed sequence tags (ESTs) were identified, which clustered into 52,823 unique transcripts (Unitrans) and were stored in a database with a public web interface. Among the analyzed Unitrans, 73% could be annotated by homology to known genes in the UniProt (Plant) database, particularly to those from Vitis, Ricinus, Populus and Arabidopsis. Comparative in silico analysis among the different treatments revealed differences in Gene Ontology term abundances. Defense- and stress-related gene transcripts were present in high abundance in leaves after herbivore egg laying, but transcripts involved in photosynthesis showed decreased abundance. Many pathogen-related genes and genes involved in phytohormone signaling were expressed, indicative of jasmonic acid biosynthesis and activation of jasmonic acid responsive genes. Cross-comparisons between different libraries based on expression profiles allowed the identification of genes with a potential relevance in egg-induced defenses, as well as other biological processes, including signal transduction, transport and
An elm EST database for identifying leaf beetle egg-induced defense genes.

PubMed

Büchel, Kerstin; McDowell, Eric; Nelson, Will; Descour, Anne; Gershenzon, Jonathan; Hilker, Monika; Soderlund, Carol; Gang, David R; Fenning, Trevor; Meiners, Torsten

2012-06-15

Plants can defend themselves against herbivorous insects prior to the onset of larval feeding by responding to the eggs laid on their leaves. In the European field elm (Ulmus minor), egg laying by the elm leaf beetle (Xanthogaleruca luteola) activates the emission of volatiles that attract specialised egg parasitoids, which in turn kill the eggs. Little is known about the transcriptional changes that insect eggs trigger in plants and how such indirect defense mechanisms are orchestrated in the context of other biological processes. Here we present the first large scale study of egg-induced changes in the transcriptional profile of a tree. Five cDNA libraries were generated from leaves of (i) untreated control elms, and elms treated with (ii) egg laying and feeding by elm leaf beetles, (iii) feeding, (iv) artificial transfer of egg clutches, and (v) methyl jasmonate. A total of 361,196 expressed sequence tags (ESTs) were identified, which clustered into 52,823 unique transcripts (Unitrans) and were stored in a database with a public web interface. Among the analyzed Unitrans, 73% could be annotated by homology to known genes in the UniProt (Plant) database, particularly to those from Vitis, Ricinus, Populus and Arabidopsis. Comparative in silico analysis among the different treatments revealed differences in Gene Ontology term abundances. Defense- and stress-related gene transcripts were present in high abundance in leaves after herbivore egg laying, but transcripts involved in photosynthesis showed decreased abundance. Many pathogen-related genes and genes involved in phytohormone signaling were expressed, indicative of jasmonic acid biosynthesis and activation of jasmonic acid responsive genes. Cross-comparisons between different libraries based on expression profiles allowed the identification of genes with a potential relevance in egg-induced defenses, as well as other biological processes, including signal transduction, transport and primary metabolism
Suppression subtractive hybridization identified differentially expressed genes in lung adenocarcinoma: ERGIC3 as a novel lung cancer-related gene

PubMed Central

2013-01-01

Background: To understand the carcinogenesis caused by accumulated genetic and epigenetic alterations and to seek novel biomarkers for various cancers, studying differentially expressed genes between cancerous and normal tissues is crucial. In this study, two cDNA libraries of lung cancer were constructed and screened for identification of differentially expressed genes. Methods: Two cDNA libraries of differentially expressed genes were constructed using lung adenocarcinoma tissue and adjacent nonmalignant lung tissue by suppression subtractive hybridization. The data of the cDNA libraries were then analyzed and compared using bioinformatics analysis. Levels of mRNA and protein were measured by quantitative real-time polymerase chain reaction (q-RT-PCR) and western blot, respectively, and the expression and localization of proteins were determined by immunostaining. Gene functions were investigated using proliferation and migration assays after gene silencing and gene over-expression. Results: Two libraries of differentially expressed genes were obtained. The forward-subtracted library (FSL) and the reverse-subtracted library (RSL) contained 177 and 59 genes, respectively. Bioinformatic analysis demonstrated that these genes were involved in a wide range of cellular functions. The vast majority of these genes were newly identified as abnormally expressed in lung cancer. In the first stage of the screening for 16 genes, we compared lung cancer tissues with their adjacent non-malignant tissues at the mRNA level, and found that six genes (ERGIC3, DDR1, HSP90B1, SDC1, RPSA, and LPCAT1) from the FSL were significantly up-regulated while two genes (GPX3 and TIMP3) from the RSL were significantly down-regulated (P < 0.05). The ERGIC3 protein was also over-expressed in lung cancer tissues and cultured cells, and expression of ERGIC3 was correlated with the degree of differentiation and histological type of lung cancer. The up-regulation of ERGIC3 could promote cellular migration
Mapping eQTLs in the Norfolk Island Genetic Isolate Identifies Candidate Genes for CVD Risk Traits

PubMed Central

Benton, Miles C.; Lea, Rod A.; Macartney-Coxson, Donia; Carless, Melanie A.; Göring, Harald H.; Bellis, Claire; Hanna, Michelle; Eccles, David; Chambers, Geoffrey K.; Curran, Joanne E.; Harper, Jacquie L.; Blangero, John; Griffiths, Lyn R.

2013-01-01

Cardiovascular disease (CVD) affects millions of people worldwide and is influenced by numerous factors, including lifestyle and genetics. Expression quantitative trait loci (eQTLs) influence gene expression and are good candidates for CVD risk. Founder-effect pedigrees can provide additional power to map genes associated with disease risk. Therefore, we identified eQTLs in the genetic isolate of Norfolk Island (NI) and tested for associations between these and CVD risk factors. We measured genome-wide transcript levels of blood lymphocytes in 330 individuals and used pedigree-based heritability analysis to identify heritable transcripts. eQTLs were identified by genome-wide association testing of these transcripts. Testing for association between CVD risk factors (i.e., blood lipids, blood pressure, and body fat indices) and eQTLs revealed 1,712 heritable transcripts (p < 0.05) with heritability values ranging from 0.18 to 0.84. From these, we identified 200 cis-acting and 70 trans-acting eQTLs (p < 1.84 × 10^-7). An eQTL-centric analysis of CVD risk traits revealed multiple associations, including 12 previously associated with CVD-related traits. Trait versus eQTL regression modeling identified four CVD risk candidates (NAAA, PAPSS1, NME1, and PRDX1), all of which have known biological roles in disease. In addition, we implicated several genes previously associated with CVD risk traits, including MTHFR and FN3KRP. We have successfully identified a panel of eQTLs in the NI pedigree and used this to implicate several genes in CVD risk. Future studies are required for further assessing the functional importance of these eQTLs and whether the findings here also relate to outbred populations. PMID:24314549
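The abstract above distinguishes cis-acting from trans-acting eQTLs but does not state the rule used. As a minimal sketch, assuming the common convention that a variant is cis if it lies on the same chromosome as the gene and within a fixed window of its transcription start site, the labelling could be written as follows. The 1 Mb window, the `Eqtl` record and the coordinates are assumptions for illustration, not details from the study.

```python
from dataclasses import dataclass

# Assumed window size; the abstract does not specify the one actually used.
CIS_WINDOW_BP = 1_000_000


@dataclass
class Eqtl:
    """Hypothetical record pairing a variant with the transcript it affects."""
    snp_chrom: str
    snp_pos: int
    gene_chrom: str
    gene_tss: int  # transcription start site of the regulated gene


def classify_eqtl(eqtl: Eqtl, window: int = CIS_WINDOW_BP) -> str:
    """Label an eQTL 'cis' if the variant is near its gene, else 'trans'."""
    same_chrom = eqtl.snp_chrom == eqtl.gene_chrom
    if same_chrom and abs(eqtl.snp_pos - eqtl.gene_tss) <= window:
        return "cis"
    return "trans"


# Hypothetical coordinates, not taken from the Norfolk Island data.
near = Eqtl("1", 1_500_000, "1", 1_200_000)
far = Eqtl("1", 50_000_000, "1", 1_200_000)
other_chrom = Eqtl("2", 1_200_000, "1", 1_200_000)
print(classify_eqtl(near), classify_eqtl(far), classify_eqtl(other_chrom))
```

Different studies choose different windows, so counts of cis and trans eQTLs are only comparable when the same definition is applied.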
Association Analysis Suggests SOD2 as a Newly Identified Candidate Gene Associated With Leprosy Susceptibility.

PubMed

Ramos, Geovana Brotto; Salomão, Heloisa; Francio, Angela Schneider; Fava, Vinícius Medeiros; Werneck, Renata Iani; Mira, Marcelo Távora

2016-08-01

Genetic studies have identified several genes and genomic regions contributing to the control of host susceptibility to leprosy. Here, we test variants of the positional and functional candidate gene SOD2 for association with leprosy in 2 independent population samples. Family-based analysis revealed an association between leprosy and allele G of marker rs295340 (P = .042) and borderline evidence of an association between leprosy and alleles C and A of markers rs4880 (P = .077) and rs5746136 (P = .071), respectively. Findings were validated in an independent case-control sample for markers rs295340 (P = .049) and rs4880 (P = .038). These results suggest SOD2 as a newly identified gene conferring susceptibility to leprosy. © The Author 2016. Published by Oxford University Press for the Infectious Diseases Society of America.
Expression of Essential B Cell Development Genes in Horses with Common Variable Immunodeficiency

PubMed Central

Tallmadge, R.L.; Such, K.A.; Miller, K.C.; Matychak, M.B.; Felippe, M.J.B.

2012-01-01

Common variable immunodeficiency (CVID) is a heterogeneous disorder of B cell differentiation or function with inadequate antibody production. Our laboratory studies a natural form of CVID in horses characterized by late-onset B cell lymphopenia due to impaired B cell production in the bone marrow. This study was undertaken to assess the status of B cell differentiation in the bone marrow of CVID-affected horses by measuring the expression of genes essential for early B cell commitment and development. Standard RT-PCR revealed that most of the transcription factors and key signaling molecules that directly regulate B cell differentiation in the bone marrow and precede PAX5 are expressed in the affected horses. Yet, the expression of PAX5 and relevant target genes was variable. Quantitative RT-PCR analysis confirmed that the mRNA expression of E2A, PAX5, CD19, and IGHD was significantly reduced in equine CVID patients when compared to healthy horses (p < 0.05). In addition, the PAX5/EBF1 and PAX5/B220 ratios were significantly reduced in CVID patients (p < 0.01). Immunohistochemical analysis confirmed the absence of PAX5-BSAP expression in the bone marrow of affected horses. Our data suggest that B cell development seems to be impaired at the transition between pre-pro-B cells and pro-B cells in equine CVID patients. PMID:22464097
