gene generate multiple: Topics by Science.gov

Sample records for gene generate multiple

One-step generation of complete gene knockout mice and monkeys by CRISPR/Cas9-mediated gene editing with multiple sgRNAs.

PubMed

Zuo, Erwei; Cai, Yi-Jun; Li, Kui; Wei, Yu; Wang, Bang-An; Sun, Yidi; Liu, Zhen; Liu, Jiwei; Hu, Xinde; Wei, Wei; Huo, Xiaona; Shi, Linyu; Tang, Cheng; Liang, Dan; Wang, Yan; Nie, Yan-Hong; Zhang, Chen-Chen; Yao, Xuan; Wang, Xing; Zhou, Changyang; Ying, Wenqin; Wang, Qifang; Chen, Ren-Chao; Shen, Qi; Xu, Guo-Liang; Li, Jinsong; Sun, Qiang; Xiong, Zhi-Qi; Yang, Hui

2017-07-01

The CRISPR/Cas9 system is an efficient gene-editing method, but the majority of gene-edited animals showed mosaicism, with editing occurring only in a portion of cells. Here we show that single gene or multiple genes can be completely knocked out in mouse and monkey embryos by zygotic injection of Cas9 mRNA and multiple adjacent single-guide RNAs (spaced 10-200 bp apart) that target only a single key exon of each gene. Phenotypic analysis of F0 mice following targeted deletion of eight genes on the Y chromosome individually demonstrated the robustness of this approach in generating knockout mice. Importantly, this approach delivers complete gene knockout at high efficiencies (100% on Arntl and 91% on Prrt2) in monkey embryos. Finally, we could generate a complete Prrt2 knockout monkey in a single step, demonstrating the usefulness of this approach in rapidly establishing gene-edited monkey models.
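The spacing rule in this abstract (several adjacent sgRNAs, 10-200 bp apart, all within one key exon) can be checked mechanically when designing guides. The following is a minimal sketch, not the authors' pipeline: the function names, the use of a single coordinate per sgRNA, and the example coordinates are all hypothetical, and the abstract does not say how the spacing is measured.

```python
def adjacent_gaps(sites):
    """Return the distances (bp) between neighbouring sgRNA positions.

    `sites` holds one coordinate per sgRNA (an assumption; the abstract
    does not specify which reference point is used for spacing).
    """
    ordered = sorted(sites)
    return [b - a for a, b in zip(ordered, ordered[1:])]


def spacing_ok(sites, exon_start, exon_end, min_gap=10, max_gap=200):
    """True if every site lies in the exon and every adjacent gap is 10-200 bp."""
    in_exon = all(exon_start <= s <= exon_end for s in sites)
    gaps = adjacent_gaps(sites)
    return in_exon and all(min_gap <= g <= max_gap for g in gaps)


if __name__ == "__main__":
    # Hypothetical coordinates for four sgRNAs in one exon spanning 1000-1400.
    sites = [1040, 1105, 1230, 1290]
    print(adjacent_gaps(sites))
    print(spacing_ok(sites, 1000, 1400))
```

A design whose guides fall outside the exon, or whose neighbours are closer than 10 bp or farther than 200 bp, is rejected by `spacing_ok`.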
Functional comparison of microarray data across multiple platforms using the method of percentage of overlapping functions.

PubMed

Li, Zhiguang; Kwekel, Joshua C; Chen, Tao

2012-01-01

Functional comparison across microarray platforms is used to assess the comparability or similarity of the biological relevance associated with the gene expression data generated by multiple microarray platforms. Comparisons at the functional level are very important considering that the ultimate purpose of microarray technology is to determine the biological meaning behind the gene expression changes under a specific condition, not just to generate a list of genes. Herein, we present a method named percentage of overlapping functions (POF) and illustrate how it is used to perform the functional comparison of microarray data generated across multiple platforms. This method facilitates the determination of functional differences or similarities in microarray data generated from multiple array platforms across all the functions that are presented on these platforms. This method can also be used to compare the functional differences or similarities between experiments, projects, or laboratories.
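The abstract names the percentage of overlapping functions (POF) but does not give its formula. As an illustration only, the sketch below assumes one plausible reading: the share of functions (for example, enriched pathways or GO terms) that two platforms have in common, taken over the union of both lists. The function name and the example terms are hypothetical, and the published definition may differ.

```python
def percent_overlap(functions_a, functions_b):
    """Percentage of functions shared by two platforms.

    Assumed definition (not taken from the abstract):
    100 * |A intersect B| / |A union B|.
    """
    a, b = set(functions_a), set(functions_b)
    union = a | b
    if not union:
        return 0.0  # no functions reported on either platform
    return 100.0 * len(a & b) / len(union)


if __name__ == "__main__":
    # Hypothetical lists of significant functions from two array platforms.
    platform_a = {"apoptosis", "cell cycle", "DNA repair"}
    platform_b = {"cell cycle", "DNA repair", "lipid metabolism"}
    print(percent_overlap(platform_a, platform_b))
```

The same function could be applied to lists from two experiments or two laboratories, which matches the broader use the abstract describes.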
A versatile system for rapid multiplex genome-edited CAR T cell generation

PubMed Central

Ren, Jiangtao; Zhang, Xuhua; Liu, Xiaojun; Fang, Chongyun; Jiang, Shuguang; June, Carl H.; Zhao, Yangbing

2017-01-01

The therapeutic potential of the CRISPR system has already been demonstrated in many instances and has begun to overlap with the rapidly expanding field of cancer immunotherapy, especially in the production of genetically modified T cell receptor or chimeric antigen receptor (CAR) T cells. Efficient genomic disruption of multiple gene loci to generate universal donor cells, as well as potent effector T cells resistant to multiple inhibitory pathways such as PD-1 and CTLA-4, is an attractive strategy for cell therapy. In this study, we accomplished rapid and efficient multiplex genomic editing and redirected T cells with an antigen-specific CAR via a one-shot CRISPR protocol by incorporating multiple gRNAs in a CAR lentiviral vector. Highly efficient double knockout of endogenous TCR and HLA class I could be easily achieved to generate allogeneic universal CAR T cells. We also generated Fas-resistant universal CAR T cells by triple gene disruption. Simultaneous gene editing of four gene loci using the one-shot CRISPR protocol to generate allogeneic universal T cells deficient in both PD-1 and CTLA-4 was also attempted. PMID:28199983
A BAC-bacterial recombination method to generate physically linked multiple gene reporter DNA constructs.

PubMed

Maye, Peter; Stover, Mary Louise; Liu, Yaling; Rowe, David W; Gong, Shiaochin; Lichtler, Alexander C

2009-03-13

Reporter gene mice are valuable animal models for biological research providing a gene expression readout that can contribute to cellular characterization within the context of a developmental process. With the advancement of bacterial recombination techniques to engineer reporter gene constructs from BAC genomic clones and the generation of optically distinguishable fluorescent protein reporter genes, there is an unprecedented capability to engineer more informative transgenic reporter mouse models relative to what has been traditionally available. We demonstrate here our first effort on the development of a three stage bacterial recombination strategy to physically link multiple genes together with their respective fluorescent protein (FP) reporters in one DNA fragment. This strategy uses bacterial recombination techniques to: (1) subclone genes of interest into BAC linking vectors, (2) insert desired reporter genes into respective genes and (3) link different gene-reporters together. As proof of concept, we have generated a single DNA fragment containing the genes Trap, Dmp1, and Ibsp driving the expression of ECFP, mCherry, and Topaz FP reporter genes, respectively. Using this DNA construct, we have successfully generated transgenic reporter mice that retain two to three gene readouts. The three stage methodology to link multiple genes with their respective fluorescent protein reporter works with reasonable efficiency. Moreover, gene linkage allows for their common chromosomal integration into a single locus. However, the testing of this multi-reporter DNA construct by transgenesis does suggest that the linkage of two different genes together, despite their large size, can still create a positional effect. We believe that gene choice, genomic DNA fragment size and the presence of endogenous insulator elements are critical variables.
Seven gene deletions in seven days: Fast generation of Escherichia coli strains tolerant to acetate and osmotic stress

PubMed Central

Jensen, Sheila I.; Lennen, Rebecca M.; Herrgård, Markus J.; Nielsen, Alex T.

2015-01-01

Generation of multiple genomic alterations is currently a time consuming process. Here, a method was established that enables highly efficient and simultaneous deletion of multiple genes in Escherichia coli. A temperature sensitive plasmid containing arabinose inducible lambda Red recombineering genes and a rhamnose inducible flippase recombinase was constructed to facilitate fast marker-free deletions. To further speed up the procedure, we integrated the arabinose inducible lambda Red recombineering genes and the rhamnose inducible FLP into the genome of E. coli K-12 MG1655. This system enables growth at 37 °C, thereby facilitating removal of integrated antibiotic cassettes and deletion of additional genes in the same day. Phosphorothioated primers were demonstrated to enable simultaneous deletions during one round of electroporation. Utilizing these methods, we constructed strains in which four to seven genes were deleted in E. coli W and E. coli K-12. The growth rate of an E. coli K-12 quintuple deletion strain was significantly improved in the presence of high concentrations of acetate and NaCl. In conclusion, we have generated a method that enables efficient and simultaneous deletion of multiple genes in several E. coli variants. The method enables deletion of up to seven genes in as little as seven days. PMID:26643270
Golden Gate Assembly of CRISPR gRNA expression array for simultaneously targeting multiple genes.

PubMed

Vad-Nielsen, Johan; Lin, Lin; Bolund, Lars; Nielsen, Anders Lade; Luo, Yonglun

2016-11-01

The engineered CRISPR/Cas9 technology has developed into the most efficient and broadly used genome editing tool. However, simultaneously targeting multiple genes (or genomic loci) in the same individual cells using CRISPR/Cas9 remains a technical challenge. In this article, we have developed a Golden Gate Assembly method for the generation of CRISPR gRNA expression arrays, thus enabling simultaneous gene targeting. Using this method, the generation of a CRISPR gRNA expression array can be accomplished in 2 weeks, and the array can contain up to 30 gRNA expression cassettes. We demonstrated in the study that simultaneous targeting of 10 genomic loci or simultaneous inhibition of multiple endogenous genes could be achieved using the multiplexed gRNA expression array vector in human cells. The complete set of plasmids is available through the non-profit plasmid repository Addgene.
Molecular basis of the polydispersity of mucins: implications for the generation of saccharide diversity.

PubMed

Bhavanandan, V P; Gupta, D; Woitach, J; Guo, X; Jiang, W

1999-06-01

Secreted epithelial mucins are large macromolecules which exhibit extreme polydispersity, the molecular basis of which is not fully understood. We have obtained partial sequences of two genes (BSM1 and BSM2) coding for two distinct molecules. This is the first time that such closely-related genes have been identified for any mucin from an animal. We propose that a combination of multiple homologous genes, alternative splicing, differential glycosylation, and additional post-translational processing all contribute to the extreme polydispersity of mucins. The multiple domain structure and non-identical tandem repeats are also very important for the generation of the saccharide diversities of mucins.
Small RNAs Reflect Grandparental Environments in Apomictic Dandelion

PubMed Central

Morgado, Lionel; Preite, Veronica; Oplaat, Carla; Anava, Sarit; Ferreira de Carvalho, Julie; Rechavi, Oded; Johannes, Frank; Verhoeven, Koen J.F.

2017-01-01

Plants can show long-term effects of environmental stresses and in some cases a stress “memory” has been reported to persist across generations, potentially mediated by epigenetic mechanisms. However, few documented cases exist of transgenerational effects that persist for multiple generations and it remains unclear if or how epigenetic mechanisms are involved. Here, we show that the composition of small regulatory RNAs in apomictic dandelion lineages reveals a footprint of drought stress and salicylic acid treatment experienced two generations ago. Overall proportions of 21 and 24 nt RNA pools were shifted due to grandparental treatments. While individual genes did not show strong up- or downregulation of associated sRNAs, the subset of genes that showed the strongest shifts in sRNA abundance was significantly enriched for several GO terms including stress-specific functions. This suggests that a stress-induced signal was transmitted across multiple unexposed generations leading to persistent changes in epigenetic gene regulation. PMID:28472380
Generation of HIV-1 based bi-cistronic lentiviral vectors for stable gene expression and live cell imaging.

PubMed

Sehgal, Lalit; Budnar, Srikanth; Bhatt, Khyati; Sansare, Sneha; Mukhopadhaya, Amitabha; Kalraiya, Rajiv D; Dalal, Sorab N

2012-10-01

The study of protein-protein interactions, protein localization, protein organization into higher order structures and organelle dynamics in live cells, has greatly enhanced the understanding of various cellular processes. Live cell imaging experiments employ plasmid or viral vectors to express the protein/proteins of interest fused to a fluorescent protein. Unlike plasmid vectors, lentiviral vectors can be introduced into both dividing and non dividing cells, can be pseudotyped to infect a broad or narrow range of cells, and can be used to generate transgenic animals. However, the currently available lentiviral vectors are limited by the choice of fluorescent protein tag, choice of restriction enzyme sites in the Multiple Cloning Sites (MCS) and promoter choice for gene expression. In this report, HIV-1 based bi-cistronic lentiviral vectors have been generated that drive the expression of multiple fluorescent tags (EGFP, mCherry, ECFP, EYFP and dsRed), using two different promoters. The presence of a unique MCS with multiple restriction sites allows the generation of fusion proteins with the fluorescent tag of choice, allowing analysis of multiple fusion proteins in live cell imaging experiments. These novel lentiviral vectors are improved delivery vehicles for gene transfer applications and are important tools for live cell imaging in vivo.
In vivo simultaneous transcriptional activation of multiple genes in the brain using CRISPR-dCas9-activator transgenic mice.

PubMed

Zhou, Haibo; Liu, Junlai; Zhou, Changyang; Gao, Ni; Rao, Zhiping; Li, He; Hu, Xinde; Li, Changlin; Yao, Xuan; Shen, Xiaowen; Sun, Yidi; Wei, Yu; Liu, Fei; Ying, Wenqin; Zhang, Junming; Tang, Cheng; Zhang, Xu; Xu, Huatai; Shi, Linyu; Cheng, Leping; Huang, Pengyu; Yang, Hui

2018-03-01

Despite rapid progress in the genome-editing field, in vivo simultaneous overexpression of multiple genes remains challenging. We generated a transgenic mouse using an improved dCas9 system that enables simultaneous and precise in vivo transcriptional activation of multiple genes and long noncoding RNAs in the nervous system. As proof of concept, we were able to use targeted activation of endogenous neurogenic genes in these transgenic mice to directly and efficiently convert astrocytes into functional neurons in vivo. This system provides a flexible and rapid screening platform for studying complex gene networks and gain-of-function phenotypes in the mammalian brain.
A Robust CRISPR/Cas9 System for Convenient, High-Efficiency Multiplex Genome Editing in Monocot and Dicot Plants.

PubMed

Ma, Xingliang; Zhang, Qunyu; Zhu, Qinlong; Liu, Wei; Chen, Yan; Qiu, Rong; Wang, Bin; Yang, Zhongfang; Li, Heying; Lin, Yuru; Xie, Yongyao; Shen, Rongxin; Chen, Shuifu; Wang, Zhi; Chen, Yuanling; Guo, Jingxin; Chen, Letian; Zhao, Xiucai; Dong, Zhicheng; Liu, Yao-Guang

2015-08-01

CRISPR/Cas9 genome targeting systems have been applied to a variety of species. However, most CRISPR/Cas9 systems reported for plants can only modify one or a few target sites. Here, we report a robust CRISPR/Cas9 vector system, utilizing a plant codon optimized Cas9 gene, for convenient and high-efficiency multiplex genome editing in monocot and dicot plants. We designed PCR-based procedures to rapidly generate multiple sgRNA expression cassettes, which can be assembled into the binary CRISPR/Cas9 vectors in one round of cloning by Golden Gate ligation or Gibson Assembly. With this system, we edited 46 target sites in rice with an average 85.4% rate of mutation, mostly in biallelic and homozygous status. We reasoned that about 16% of the homozygous mutations in rice were generated through the non-homologous end-joining mechanism followed by homologous recombination-based repair. We also obtained uniform biallelic, heterozygous, homozygous, and chimeric mutations in Arabidopsis T1 plants. The targeted mutations in both rice and Arabidopsis were heritable. We provide examples of loss-of-function gene mutations in T0 rice and T1 Arabidopsis plants by simultaneous targeting of multiple (up to eight) members of a gene family, multiple genes in a biosynthetic pathway, or multiple sites in a single gene. This system has provided a versatile toolbox for studying functions of multiple genes and gene families in plants for basic research and genetic improvement. Copyright © 2015 The Author. Published by Elsevier Inc. All rights reserved.
GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature.

PubMed

Ye, Ning; Yin, Hengfu; Liu, Jingjing; Dai, Xiaogang; Yin, Tongming

2015-01-01

The huge amount of gene expression data generated by microarray and next-generation sequencing technologies presents challenges for exploiting its biological meaning. When searching for coexpressed genes, the data mining process is largely affected by the selection of algorithms. Thus, it is highly desirable to provide multiple algorithm options in a user-friendly analytical toolkit for exploring gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI) toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models (the mean, the regression, the delegate, and the ensemble models) to identify coexpressed genes, and enables users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. Subsequently, the utility of this analytical toolkit is demonstrated by analyzing two real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows users to choose among multiple algorithms for analyzing gene expression signatures.
Three gene expression vector sets for concurrently expressing multiple genes in Saccharomyces cerevisiae.

PubMed

Ishii, Jun; Kondo, Takashi; Makino, Harumi; Ogura, Akira; Matsuda, Fumio; Kondo, Akihiko

2014-05-01

Yeast has the potential to be used in bulk-scale fermentative production of fuels and chemicals due to its tolerance for low pH and robustness for autolysis. However, expression of multiple external genes in one host yeast strain is considerably labor-intensive due to the lack of polycistronic transcription. To promote the metabolic engineering of yeast, we generated systematic and convenient genetic engineering tools to express multiple genes in Saccharomyces cerevisiae. We constructed a series of multi-copy and integration vector sets for concurrently expressing two or three genes in S. cerevisiae by embedding three classical promoters. The comparative expression capabilities of the constructed vectors were monitored with green fluorescent protein, and the concurrent expression of genes was monitored with three different fluorescent proteins. Our multiple gene expression tool will be helpful to the advanced construction of genetically engineered yeast strains in a variety of research fields other than metabolic engineering. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Precise Sequential DNA Ligation on A Solid Substrate: Solid-Based Rapid Sequential Ligation of Multiple DNA Molecules

PubMed Central

Takita, Eiji; Kohda, Katsunori; Tomatsu, Hajime; Hanano, Shigeru; Moriya, Kanami; Hosouchi, Tsutomu; Sakurai, Nozomu; Suzuki, Hideyuki; Shinmyo, Atsuhiko; Shibata, Daisuke

2013-01-01

Ligation, the joining of DNA fragments, is a fundamental procedure in molecular cloning and is indispensable to the production of genetically modified organisms that can be used for basic research, the applied biosciences, or both. Given that many genes cooperate in various pathways, incorporating multiple gene cassettes in tandem in a transgenic DNA construct for the purpose of genetic modification is often necessary when generating organisms that produce multiple foreign gene products. Here, we describe a novel method, designated PRESSO (precise sequential DNA ligation on a solid substrate), for the tandem ligation of multiple DNA fragments. We amplified donor DNA fragments with non-palindromic ends, and ligated the fragment to acceptor DNA fragments on solid beads. After the final donor DNA fragments, which included vector sequences, were joined to the construct that contained the array of fragments, the ligation product (the construct) was thereby released from the beads via digestion with a rare-cut meganuclease; the freed linear construct was circularized via an intra-molecular ligation. PRESSO allowed us to rapidly and efficiently join multiple genes in an optimized order and orientation. This method can overcome many technical challenges in functional genomics during the post-sequencing generation. PMID:23897972
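The abstract specifies donor fragments with non-palindromic ends. A cohesive end is palindromic when it equals its own reverse complement, in which case it can pair with an identical copy of itself. The sketch below shows that check only; it is not the PRESSO design procedure, and the helper names and example overhangs are hypothetical.

```python
# Translation table for complementing uppercase DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def is_palindromic(overhang):
    """True if the overhang equals its own reverse complement."""
    return overhang.upper() == reverse_complement(overhang)


def non_palindromic(overhangs):
    """Keep only overhangs that cannot pair with themselves."""
    return [o for o in overhangs if not is_palindromic(o)]


if __name__ == "__main__":
    # Hypothetical 4-nt overhangs; AATT and GATC are palindromic.
    print(non_palindromic(["AATT", "ACTG", "GATC", "AGGT"]))
```

Filtering on this property is one way to favour joining in a defined order and orientation, which is the outcome the abstract reports.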
The effects of in utero bisphenol A exposure on the ovaries in multiple generations of mice

PubMed Central

Berger, Amelia; Ziv-Gal, Ayelet; Cudiamat, Jonathan; Wang, Wei; Zhou, Changqing; Flaws, Jodi A.

2016-01-01

Bisphenol A is used in polycarbonate plastics and epoxy resins. Previous studies show that in utero BPA exposure inhibits germ cell nest breakdown in the F1 generation of mice, but its effects on germ cell nest breakdown and on the ovary in the F2–F3 generations were unknown. Thus, we tested the hypothesis that BPA has transgenerational effects on the ovary. Mice were exposed to BPA in utero (BPA 0.5, 20, or 50 µg/kg/day), and ovaries were collected at postnatal days (PND) 4 and 21 from the F1–F3 generations and subjected to histological evaluation and gene expression analyses. In utero BPA exposure did not have transgenerational effects on germ cell nest breakdown and gene expression on PND 4, but it caused transgenerational changes in expression in multiple genes on PND 21. Collectively, these data indicate that in utero BPA exposure has some transgenerational effects in mice. PMID:26746108
High Resolution Melt (HRM) analysis is an efficient tool to genotype EMS mutants in complex crop genomes.

PubMed

Lochlainn, Seosamh Ó; Amoah, Stephen; Graham, Neil S; Alamer, Khalid; Rios, Juan J; Kurup, Smita; Stoute, Andrew; Hammond, John P; Østergaard, Lars; King, Graham J; White, Phillip J; Broadley, Martin R

2011-12-08

Targeted Induced Loci Lesions IN Genomes (TILLING) is increasingly being used to generate and identify mutations in target genes of crop genomes. TILLING populations of several thousand lines have been generated in a number of crop species including Brassica rapa. Genetic analysis of mutants identified by TILLING requires an efficient, high-throughput and cost effective genotyping method to track the mutations through numerous generations. High resolution melt (HRM) analysis has been used in a number of systems to identify single nucleotide polymorphisms (SNPs) and insertion/deletions (IN/DELs) enabling the genotyping of different types of samples. HRM is ideally suited to high-throughput genotyping of multiple TILLING mutants in complex crop genomes. To date it has been used to identify mutants and genotype single mutations. The aim of this study was to determine if HRM can facilitate downstream analysis of multiple mutant lines identified by TILLING in order to characterise allelic series of EMS induced mutations in target genes across a number of generations in complex crop genomes. We demonstrate that HRM can be used to genotype allelic series of mutations in two genes, BraA.CAX1.a and BraA.MET1.a in Brassica rapa. We analysed 12 mutations in BraA.CAX1.a and five in BraA.MET1.a over two generations including a back-cross to the wild-type. Using a commercially available HRM kit and the Lightscanner™ system we were able to detect mutations in heterozygous and homozygous states for both genes. Using HRM genotyping on TILLING derived mutants, it is possible to generate an allelic series of mutations within multiple target genes rapidly. Lines suitable for phenotypic analysis can be isolated approximately 8-9 months (3 generations) from receiving M3 seed of Brassica rapa from the RevGenUK TILLING service.
High Resolution Melt (HRM) analysis is an efficient tool to genotype EMS mutants in complex crop genomes

PubMed Central

2011-01-01

Background: Targeted Induced Loci Lesions IN Genomes (TILLING) is increasingly being used to generate and identify mutations in target genes of crop genomes. TILLING populations of several thousand lines have been generated in a number of crop species including Brassica rapa. Genetic analysis of mutants identified by TILLING requires an efficient, high-throughput and cost effective genotyping method to track the mutations through numerous generations. High resolution melt (HRM) analysis has been used in a number of systems to identify single nucleotide polymorphisms (SNPs) and insertion/deletions (IN/DELs) enabling the genotyping of different types of samples. HRM is ideally suited to high-throughput genotyping of multiple TILLING mutants in complex crop genomes. To date it has been used to identify mutants and genotype single mutations. The aim of this study was to determine if HRM can facilitate downstream analysis of multiple mutant lines identified by TILLING in order to characterise allelic series of EMS induced mutations in target genes across a number of generations in complex crop genomes. Results: We demonstrate that HRM can be used to genotype allelic series of mutations in two genes, BraA.CAX1.a and BraA.MET1.a in Brassica rapa. We analysed 12 mutations in BraA.CAX1.a and five in BraA.MET1.a over two generations including a back-cross to the wild-type. Using a commercially available HRM kit and the Lightscanner™ system we were able to detect mutations in heterozygous and homozygous states for both genes. Conclusions: Using HRM genotyping on TILLING derived mutants, it is possible to generate an allelic series of mutations within multiple target genes rapidly. Lines suitable for phenotypic analysis can be isolated approximately 8-9 months (3 generations) from receiving M3 seed of Brassica rapa from the RevGenUK TILLING service. PMID:22152063
Gene selection with multiple ordering criteria.

PubMed

Chen, James J; Tsai, Chen-An; Tzeng, Shengli; Chen, Chun-Houh

2007-03-05

A microarray study may select different differentially expressed gene sets because of different selection criteria. For example, the fold-change and p-value are two commonly known criteria to select differentially expressed genes under two experimental conditions. These two selection criteria often result in incompatible selected gene sets. Also, in a two-factor, say, treatment by time experiment, the investigator may be interested in one gene list that responds to both treatment and time effects. We propose three layer ranking algorithms, point-admissible, line-admissible (convex), and Pareto, to provide a preference gene list from multiple gene lists generated by different ranking criteria. Using the public colon data as an example, the layer ranking algorithms are applied to the three univariate ranking criteria, fold-change, p-value, and frequency of selections by the SVM-RFE classifier. A simulation experiment shows that for experiments with small or moderate sample sizes (less than 20 per group) and detecting a 4-fold change or less, the two-dimensional (p-value and fold-change) convex layer ranking selects differentially expressed genes with generally lower FDR and higher power than the standard p-value ranking. Three applications are presented. The first application illustrates a use of the layer rankings to potentially improve predictive accuracy. The second application illustrates an application to a two-factor experiment involving two dose levels and two time points. The layer rankings are applied to selecting differentially expressed genes relating to the dose and time effects. In the third application, the layer rankings are applied to a benchmark data set consisting of three dilution concentrations to provide a ranking system from a long list of differentially expressed genes generated from the three dilution concentrations. 
The layer ranking algorithms are useful to help investigators in selecting the most promising genes from multiple gene lists generated by different filter, normalization, or analysis methods for various objectives.
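Of the three layer ranking algorithms named in this abstract, the Pareto variant is the simplest to illustrate. The sketch below is a generic non-dominated layering, not the authors' implementation: each gene carries a tuple of criteria in which larger is better (for example, absolute log fold-change and negative log p-value), layer 1 is the set of genes that no other gene dominates, and the process repeats on what remains. Gene names and scores are hypothetical; the point-admissible and convex variants are not shown.

```python
def dominates(u, v):
    """u dominates v if u is at least as good on every criterion
    and strictly better on at least one (larger = better)."""
    return (all(a >= b for a, b in zip(u, v))
            and any(a > b for a, b in zip(u, v)))


def pareto_layers(scores):
    """Peel off successive non-dominated sets.

    `scores` maps gene name -> tuple of criteria (larger = better).
    Returns a list of layers, each a sorted list of gene names.
    """
    remaining = dict(scores)
    layers = []
    while remaining:
        front = sorted(
            g for g, s in remaining.items()
            if not any(dominates(t, s)
                       for h, t in remaining.items() if h != g)
        )
        layers.append(front)
        for g in front:
            del remaining[g]
    return layers


if __name__ == "__main__":
    # Hypothetical (|log2 fold-change|, -log10 p-value) per gene.
    scores = {
        "g1": (3.0, 5.0),
        "g2": (2.0, 6.0),
        "g3": (1.0, 1.0),
        "g4": (2.5, 4.0),
        "g5": (0.5, 0.5),
    }
    for rank, layer in enumerate(pareto_layers(scores), start=1):
        print(rank, layer)
```

Genes in an earlier layer are preferred; genes within a layer are not ordered relative to each other, which is why g1 and g2 share the first layer even though they disagree on which criterion they win.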
Generation of gene-targeted mice using embryonic stem cells derived from a transgenic mouse model of Alzheimer's disease.

PubMed

Yamamoto, Satoshi; Ooshima, Yuki; Nakata, Mitsugu; Yano, Takashi; Matsuoka, Kunio; Watanabe, Sayuri; Maeda, Ryouta; Takahashi, Hideki; Takeyama, Michiyasu; Matsumoto, Yoshio; Hashimoto, Tadatoshi

2013-06-01

Gene-targeting technology using mouse embryonic stem (ES) cells has become the "gold standard" for analyzing gene functions and producing disease models. Recently, genetically modified mice with multiple mutations have increasingly been produced to study the interaction between proteins and polygenic diseases. However, introduction of an additional mutation into mice already harboring several mutations by conventional natural crossbreeding is an extremely time- and labor-intensive process. Moreover, to do so in mice with a complex genetic background, several years may be required if the genetic background is to be retained. Establishing ES cells from multiple-mutant mice, or disease-model mice with a complex genetic background, would offer a possible solution. Here, we report the establishment and characterization of novel ES cell lines from a mouse model of Alzheimer's disease (3xTg-AD mouse, Oddo et al. in Neuron 39:409-421, 2003) harboring 3 mutated genes (APPswe, TauP301L, and PS1M146V) and a complex genetic background. Thirty blastocysts were cultured and 15 stable ES cell lines (male: 11; female: 4) obtained. By injecting these ES cells into diploid or tetraploid blastocysts, we generated germline-competent chimeras. Subsequently, we confirmed that F1 mice derived from these animals showed similar biochemical and behavioral characteristics to the original 3xTg-AD mice. Furthermore, we introduced a gene-targeting vector into the ES cells and successfully obtained gene-targeted ES cells, which were then used to generate knockout mice for the targeted gene. These results suggest that the present methodology is effective for introducing an additional mutation into mice already harboring multiple mutated genes and/or a complex genetic background.
Male germline transmits fetal alcohol epigenetic marks for multiple generations: a review.

PubMed

Sarkar, Dipak K

2016-01-01

Alcohol exposure during fetal and early postnatal development can lead to an increased incidence of later life adult-onset diseases. Examples include central nervous system dysfunction, depression, anxiety, hyperactivity, and an inability to deal with stressful situations, increased infection and cancer. Direct effects of alcohol leading to developmental abnormalities often involve epigenetic modifications of genes that regulate cellular functions. Epigenetic marks carried over from the parents are known to undergo molecular programming events that happen early in embryonic development by a wave of DNA demethylation, which leaves the embryo with a fresh genomic composition. The proopiomelanocortin (Pomc) gene controls neuroendocrine-immune functions and is imprinted by fetal alcohol exposure. Recently, this gene has been shown to be hypermethylated through three generations. Additionally, the alcohol epigenetic marks on the Pomc gene are maintained in the male but not in the female germline during this transgenerational transmission. These data suggest that the male-specific chromosome might be involved in transmitting alcohol epigenetic marks through multiple generations. © 2015 Society for the Study of Addiction.

Fast-tracking determination of homozygous transgenic lines and transgene stacking using a reliable quantitative real-time PCR assay.

PubMed

Wang, Xianghong; Jiang, Daiming; Yang, Daichang

2015-01-01

The selection of homozygous lines is a crucial step in the characterization of newly generated transgenic plants. This is particularly time- and labor-consuming when transgenic stacking is required. Here, we report a fast and accurate method based on quantitative real-time PCR with a rice gene RBE4 as a reference gene for selection of homozygous lines when using multiple transgenic stacking in rice. This method can be used to determine the stacking of up to three transgenes within four generations. Selection accuracy reached 100 % for a single locus and 92.3 % for two loci. This method confers distinct advantages over current transgenic research methodologies, as it is more accurate, rapid, and reliable. Therefore, this protocol could be used to efficiently select homozygous plants and to expedite time- and labor-consuming processes normally required for multiple transgene stacking. This protocol was standardized for determination of multiple gene stacking in molecular breeding via marker-assisted selection.
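The abstract does not state how qPCR signal is converted into a zygosity call. One common approach, shown here purely as an assumption, is the comparative Ct calculation: the transgene Ct is normalised to the reference gene (RBE4 in this study) and compared with a calibrator plant known to carry one copy, giving a relative copy number of 2^(-ddCt). The function names, the 1.5 cutoff, the calibrator, and the Ct values below are hypothetical, and the calculation assumes roughly 100% amplification efficiency for both amplicons.

```python
def relative_copy_number(ct_target, ct_ref, cal_ct_target, cal_ct_ref):
    """Relative transgene dose by the comparative Ct (2^-ddCt) calculation.

    ct_target / ct_ref: transgene and reference-gene Ct of the test plant.
    cal_ct_target / cal_ct_ref: the same for a calibrator plant assumed
    to be hemizygous (one transgene copy).
    """
    delta_sample = ct_target - ct_ref
    delta_calibrator = cal_ct_target - cal_ct_ref
    return 2.0 ** -(delta_sample - delta_calibrator)


def call_zygosity(ratio, cutoff=1.5):
    """Classify a plant from its dose relative to the hemizygous calibrator.

    The cutoff is an illustrative midpoint between 1 and 2 copies,
    not a value taken from the paper.
    """
    return "homozygous" if ratio >= cutoff else "hemizygous"


if __name__ == "__main__":
    # Hypothetical Ct values: calibrator (25.0, 24.0), test plant (24.0, 24.0).
    ratio = relative_copy_number(24.0, 24.0, 25.0, 24.0)
    print(ratio, call_zygosity(ratio))
```

Plants with no transgene signal (null segregants) would need separate handling, since they produce no usable transgene Ct.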
Restricted VH gene usage and generation of antibody diversity in rabbit.

PubMed

Knight, K L

1992-01-01

The presence of VHa allotypic specificities on nearly all rabbit Ig molecules has perplexed immunologists for many years. How could these allotypic specificities be inherited as if controlled by alleles if the germline has hundreds of VHa allotype-encoding genes and if most of these genes are used in VDJ gene rearrangements? I review recent data indicating that the allelic inheritance of the VHa allotypes can be explained by preferential utilization of the D-proximal VH gene VH1 in VDJ gene rearrangements. The preferential usage of one VH gene, however, limits the contribution of combinatorial joining of multiple VH, D and JH gene segments to the generation of antibody diversity. The roles of somatic gene conversion and somatic mutation in generating antibody diversity are discussed. Further, the limited usage of germline VH genes in normal, allotype-suppressed and the mutant Alicia rabbit as well as the molecular basis of latent allotypes and VH/CH recombinants is reviewed.
Generation of mammalian cells stably expressing multiple genes at predetermined levels.

PubMed

Liu, X; Constantinescu, S N; Sun, Y; Bogan, J S; Hirsch, D; Weinberg, R A; Lodish, H F

2000-04-10

Expression of cloned genes at desired levels in cultured mammalian cells is essential for studying protein function. Controlled levels of expression have been difficult to achieve, especially for cell lines with low transfection efficiency or when expression of multiple genes is required. An internal ribosomal entry site (IRES) has been incorporated into many types of expression vectors to allow simultaneous expression of two genes. However, there has been no systematic quantitative analysis of expression levels in individual cells of genes linked by an IRES, and thus the broad use of these vectors in functional analysis has been limited. We constructed a set of retroviral expression vectors containing an IRES followed by a quantitative selectable marker such as green fluorescent protein (GFP) or truncated cell surface proteins CD2 or CD4. The gene of interest is placed in a multiple cloning site 5' of the IRES sequence under the control of the retroviral long terminal repeat (LTR) promoter. These vectors exploit the approximately 100-fold differences in levels of expression of a retrovirus vector depending on its site of insertion in the host chromosome. We show that the level of expression of the gene downstream of the IRES and the expression level and functional activity of the gene cloned upstream of the IRES are highly correlated in stably infected target cells. This feature makes our vectors extremely useful for the rapid generation of stably transfected cell populations or clonal cell lines expressing specific amounts of a desired protein simply by fluorescent activated cell sorting (FACS) based on the level of expression of the gene downstream of the IRES. We show how these vectors can be used to generate cells expressing high levels of the erythropoietin receptor (EpoR) or a dominant negative Smad3 protein and to generate cells expressing two different cloned proteins, Ski and Smad4. 
Correlation of a biologic effect with the level of expression of the protein downstream of the IRES provides strong evidence for the function of the protein placed upstream of the IRES.
Biotechnologically generating 'super chickpea' for food and nutritional security.

PubMed

Acharjee, Sumita; Sarmah, Bidyut Kumar

2013-06-01

Chickpea productivity is affected by various constraints that are biotic (Helicoverpa, Aphids, Callosobruchus, Bromus and Orobanche) and abiotic (drought and salinity). In addition, the grains of this legume are deficient in sulfur amino acids, methionine and cysteine. The possibilities for genetic improvement by marker-assisted breeding and selection approaches are limited in chickpeas due to their sexually incompatible gene pool. Transgenic chickpeas expressing either the cry1Ac/b or the cry2Aa gene and the bean α-amylase inhibitor gene are resistant to Helicoverpa and bruchids, respectively, but these chickpeas have yet to be commercialized. Unfortunately, attempts to generate transgenic chickpeas with increased tolerance to drought and salinity or with increased methionine content have been less successful. The commercialization of transgenic chickpeas containing a single transgene may not give adequate yield advantage, as chickpeas are affected by many production constraints in the field and in storage. Gene pyramiding by incorporating two or more genes may be useful because improving one trait at a time will be time-consuming, labor-intensive and costly. Use of modern multi-gene vectors that contain recognition sites for zinc finger nucleases (ZFNs) and homing endonucleases may simplify the incorporation of multiple genes into chickpeas. This approach necessitates a collaborative effort between individuals, public and private organizations to generate 'super chickpeas' that harbor multiple transgenic traits. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Circadian Enhancers Coordinate Multiple Phases of Rhythmic Gene Transcription In Vivo

PubMed Central

Fang, Bin; Everett, Logan J.; Jager, Jennifer; Briggs, Erika; Armour, Sean M.; Feng, Dan; Roy, Ankur; Gerhart-Hines, Zachary; Sun, Zheng; Lazar, Mitchell A.

2014-01-01

SUMMARY Mammalian transcriptomes display complex circadian rhythms with multiple phases of gene expression that cannot be accounted for by current models of the molecular clock. We have determined the underlying mechanisms by measuring nascent RNA transcription around the clock in mouse liver. Unbiased examination of eRNAs that cluster in specific circadian phases identified functional enhancers driven by distinct transcription factors (TFs). We further identify on a global scale the components of the TF cistromes that function to orchestrate circadian gene expression. Integrated genomic analyses also revealed novel mechanisms by which a single circadian factor controls opposing transcriptional phases. These findings shed new light on the diversity and specificity of TF function in the generation of multiple phases of circadian gene transcription in a mammalian organ. PMID:25416951
Circadian enhancers coordinate multiple phases of rhythmic gene transcription in vivo.

PubMed

Fang, Bin; Everett, Logan J; Jager, Jennifer; Briggs, Erika; Armour, Sean M; Feng, Dan; Roy, Ankur; Gerhart-Hines, Zachary; Sun, Zheng; Lazar, Mitchell A

2014-11-20

Mammalian transcriptomes display complex circadian rhythms with multiple phases of gene expression that cannot be accounted for by current models of the molecular clock. We have determined the underlying mechanisms by measuring nascent RNA transcription around the clock in mouse liver. Unbiased examination of enhancer RNAs (eRNAs) that cluster in specific circadian phases identified functional enhancers driven by distinct transcription factors (TFs). We further identify on a global scale the components of the TF cistromes that function to orchestrate circadian gene expression. Integrated genomic analyses also revealed mechanisms by which a single circadian factor controls opposing transcriptional phases. These findings shed light on the diversity and specificity of TF function in the generation of multiple phases of circadian gene transcription in a mammalian organ.
Molecular Typing of Lung Adenocarcinoma on Cytological Samples Using a Multigene Next Generation Sequencing Panel

PubMed Central

Fassan, Matteo; Rachiglio, Anna Maria; Cappellesso, Rocco; Antonello, Davide; Amato, Eliana; Mafficini, Andrea; Lambiase, Matilde; Esposito, Claudia; Bria, Emilio; Simonato, Francesca; Scardoni, Maria; Turri, Giona; Chilosi, Marco; Tortora, Giampaolo; Fassina, Ambrogio; Normanno, Nicola

2013-01-01

Identification of driver mutations in lung adenocarcinoma has led to development of targeted agents that are already approved for clinical use or are in clinical trials. Therefore, the number of biomarkers that will need to be assessed is expected to increase rapidly. This calls for the implementation of methods probing the mutational status of multiple genes for inoperable cases, for which limited cytological or bioptic material is available. Cytology specimens from 38 lung adenocarcinomas were subjected to the simultaneous assessment of 504 mutational hotspots of 22 lung cancer-associated genes using 10 nanograms of DNA and Ion Torrent PGM next-generation sequencing. Thirty-six cases were successfully sequenced (95%). In 24/36 cases (67%) at least one mutated gene was observed, including EGFR, KRAS, PIK3CA, BRAF, TP53, PTEN, MET, SMAD4, FGFR3, STK11, MAP2K1. EGFR and KRAS mutations, respectively found in 6/36 (16%) and 10/36 (28%) cases, were mutually exclusive. Nine samples (25%) showed concurrent alterations in different genes. The next-generation sequencing test used is superior to current standard methodologies, as it interrogates multiple genes and requires limited amounts of DNA. Its applicability to routine cytology samples might allow a significant increase in the fraction of lung cancer patients eligible for personalized therapy. PMID:24236184
Characterization of fusion genes and the significantly expressed fusion isoforms in breast cancer by hybrid sequencing

PubMed Central

Weirather, Jason L.; Afshar, Pegah Tootoonchi; Clark, Tyson A.; Tseng, Elizabeth; Powers, Linda S.; Underwood, Jason G.; Zabner, Joseph; Korlach, Jonas; Wong, Wing Hung; Au, Kin Fai

2015-01-01

We developed an innovative hybrid sequencing approach, IDP-fusion, to detect fusion genes, determine fusion sites and identify and quantify fusion isoforms. IDP-fusion is the first method to study gene fusion events by integrating Third Generation Sequencing long reads and Second Generation Sequencing short reads. We applied IDP-fusion to PacBio data and Illumina data from the MCF-7 breast cancer cells. Compared with the existing tools, IDP-fusion detects fusion genes at higher precision and a very low false positive rate. The results show that IDP-fusion will be useful for unraveling the complexity of multiple fusion splices and fusion isoforms within tumorigenesis-relevant fusion genes. PMID:26040699
CD8 single-cell gene coexpression reveals three different effector types present at distinct phases of the immune response

PubMed Central

Peixoto, António; Evaristo, César; Munitic, Ivana; Monteiro, Marta; Charbit, Alain; Rocha, Benedita; Veiga-Fernandes, Henrique

2007-01-01

To study in vivo CD8 T cell differentiation, we quantified the coexpression of multiple genes in single cells throughout immune responses. After in vitro activation, CD8 T cells rapidly express effector molecules and cease their expression when the antigen is removed. Gene behavior after in vivo activation, in contrast, was quite heterogeneous. Different mRNAs were induced at very different time points of the response, were transcribed during different time periods, and could decline or persist independently of the antigen load. Consequently, distinct gene coexpression patterns/different cell types were generated at the various phases of the immune responses. During primary stimulation, inflammatory molecules were induced and down-regulated shortly after activation, generating early cells that only mediated inflammation. Cytotoxic T cells were generated at the peak of the primary response, when individual cells simultaneously expressed multiple killer molecules, whereas memory cells lost killer capacity because they no longer coexpressed killer genes. Surprisingly, during secondary responses gene transcription became permanent. Secondary cells recovered after antigen elimination were more efficient killers than cytotoxic T cells present at the peak of the primary response. Thus, primary responses produced two transient effector types. However, after boosting, CD8 T cells differentiate into long-lived killer cells that persist in vivo in the absence of antigen. PMID:17485515
BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses.

PubMed

Spinelli, Lionel; Carpentier, Sabrina; Montañana Sanchis, Frédéric; Dalod, Marc; Vu Manh, Thien-Phong

2015-10-19

Recent advances in the analysis of high-throughput expression data have led to the development of tools that have scaled up their focus from the single-gene to the gene-set level. For example, the popular Gene Set Enrichment Analysis (GSEA) algorithm can detect moderate but coordinated expression changes of groups of presumably related genes between pairs of experimental conditions. This considerably improves extraction of information from high-throughput gene expression data. However, although many gene sets covering a large panel of biological fields are available in public databases, the ability to generate home-made gene sets relevant to one's biological question is crucial but remains a substantial challenge to most biologists lacking statistical or bioinformatic expertise. This is all the more the case when attempting to define a gene set specific to one condition compared to many other ones. Thus, there is a crucial need for easy-to-use software for generation of relevant home-made gene sets from complex datasets, their use in GSEA, and the correction of the results when applied to multiple comparisons of many experimental conditions. We developed BubbleGUM (GSEA Unlimited Map), a tool that automatically extracts molecular signatures from transcriptomic data and performs exhaustive GSEA with multiple testing correction. One original feature of BubbleGUM notably resides in its capacity to integrate and compare numerous GSEA results into an easy-to-grasp graphical representation. We applied our method to generate transcriptomic fingerprints for murine cell types and to assess their enrichments in human cell types. This analysis allowed us to confirm homologies between mouse and human immunocytes. BubbleGUM is open-source software that automatically generates molecular signatures out of complex expression datasets and directly assesses their enrichment by GSEA on independent datasets. Enrichments are displayed in a graphical output that helps in interpreting the results. This innovative methodology has recently been used to answer important questions in functional genomics, such as the degree of similarities between microarray datasets from different laboratories or with different experimental models or clinical cohorts. BubbleGUM is executable through an intuitive interface so that both bioinformaticians and biologists can use it. It is available at http://www.ciml.univ-mrs.fr/applications/BubbleGUM/index.html .
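The abstract mentions multiple testing correction across many GSEA comparisons but does not name the procedure. As an illustration only, the sketch below applies the Benjamini-Hochberg false discovery rate adjustment to a list of enrichment p-values; the choice of this procedure and the example p-values are assumptions, not a description of what BubbleGUM implements.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest to the smallest p-value so that adjusted
    # values never increase as the raw p-value decreases.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical p-values from four gene-set comparisons
raw = [0.01, 0.04, 0.03, 0.005]
adjusted = benjamini_hochberg(raw)
print(adjusted)
```

Gene sets whose adjusted value falls below the chosen false discovery rate would be reported as enriched.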
Evaluating Gene Set Enrichment Analysis Via a Hybrid Data Model

PubMed Central

Hua, Jianping; Bittner, Michael L.; Dougherty, Edward R.

2014-01-01

Gene set enrichment analysis (GSA) methods have been widely adopted by biological labs to analyze data and generate hypotheses for validation. Most of the existing comparison studies focus on whether the existing GSA methods can produce accurate P-values; however, practitioners are often more concerned with the correct gene-set ranking generated by the methods. The ranking performance is closely related to two critical goals associated with GSA methods: the ability to reveal biological themes and ensuring reproducibility, especially for small-sample studies. We have conducted a comprehensive simulation study focusing on the ranking performance of seven representative GSA methods. We overcome the limitation on the availability of real data sets by creating hybrid data models from existing large data sets. To build the data model, we pick a master gene from the data set to form the ground truth and artificially generate the phenotype labels. Multiple hybrid data models can be constructed from one data set and multiple data sets of smaller sizes can be generated by resampling the original data set. This approach enables us to generate a large batch of data sets to check the ranking performance of GSA methods. Our simulation study reveals that for the proposed data model, the Q2 type GSA methods have in general better performance than other GSA methods and the global test has the most robust results. The properties of a data set play a critical role in the performance. For the data sets with highly connected genes, all GSA methods suffer significantly in performance. PMID:24558298
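The construction described above (pick a master gene, derive phenotype labels from it, then resample to obtain smaller data sets) can be sketched roughly as follows. The labeling rule used here, a split at the master gene's median, is an assumption made for illustration, since the abstract does not state how labels are generated; the gene names and expression values are made up.

```python
import random
import statistics


def make_labels(expression, master_gene):
    """Binary phenotype labels from one master gene, split at its median (assumed rule)."""
    values = expression[master_gene]
    cutoff = statistics.median(values)
    return [1 if v > cutoff else 0 for v in values]


def resample(expression, labels, size, rng):
    """Draw a smaller data set by sampling columns (samples) without replacement."""
    picked = rng.sample(range(len(labels)), size)
    sub_expr = {gene: [vals[j] for j in picked] for gene, vals in expression.items()}
    sub_labels = [labels[j] for j in picked]
    return sub_expr, sub_labels


rng = random.Random(0)
# Toy expression table: gene -> values across 8 samples (made-up numbers)
expression = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0],
    "geneB": [8.0, 1.0, 6.0, 3.0, 4.0, 5.0, 2.0, 7.0],
}
labels = make_labels(expression, "geneA")
small_expr, small_labels = resample(expression, labels, 4, rng)
```

Because the labels are derived from a known gene, any gene set containing that gene has a known ground-truth status, which is what makes ranking performance measurable.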
Stochastic and epigenetic changes of gene expression in Arabidopsis polyploids.

PubMed

Wang, Jianlin; Tian, Lu; Madlung, Andreas; Lee, Hyeon-Se; Chen, Meng; Lee, Jinsuk J; Watson, Brian; Kagochi, Trevor; Comai, Luca; Chen, Z Jeffrey

2004-08-01

Polyploidization is an abrupt speciation mechanism for eukaryotes and is especially common in plants. However, little is known about patterns and mechanisms of gene regulation during early stages of polyploid formation. Here we analyzed differential expression patterns of the progenitors' genes among successive selfing generations and independent lineages. The synthetic Arabidopsis allotetraploid lines were produced by a genetic cross between A. thaliana and A. arenosa autotetraploids. We found that some progenitors' genes are differentially expressed in early generations, whereas other genes are silenced in late generations or among different siblings within a selfing generation, suggesting that the silencing of progenitors' genes is rapidly and/or stochastically established. Moreover, a subset of genes is affected in autotetraploid and multiple independent allotetraploid lines and in A. suecica, a natural allotetraploid derived from A. thaliana and A. arenosa, indicating locus-specific susceptibility to ploidy-dependent gene regulation. The role of DNA methylation in silencing progenitors' genes is tested in DNA-hypomethylation transgenic lines of A. suecica using RNA interference (RNAi). Two silenced genes are reactivated in both ddm1- and met1-RNAi lines, consistent with the demethylation of centromeric repeats and gene-specific regions in the genome. A rapid and stochastic process of differential gene expression is reinforced by epigenetic regulation during polyploid formation and evolution. Copyright 2004 Genetics Society of America
[Analysis of genetic models and gene effects on main agronomy characters in rapeseed].

PubMed

Li, J; Qiu, J; Tang, Z; Shen, L

1992-01-01

According to four different genetic models, the genetic patterns of 8 agronomic traits were analysed using data from 24 generations, including reciprocal crosses of 81008 x Tower, both of which are good-quality varieties. The results showed that none of the 8 characters fit additive-dominance models. Epistasis was found in all of these characters, and it had a significant effect on generation means. Seed weight/plant and some other main yield characters are controlled by duplicate interaction genes. The interaction between triple genes or multiple genes needs to be utilized in yield heterosis.
Employment of Near Full-Length Ribosome Gene TA-Cloning and Primer-Blast to Detect Multiple Species in a Natural Complex Microbial Community Using Species-Specific Primers Designed with Their Genome Sequences.

PubMed

Zhang, Huimin; He, Hongkui; Yu, Xiujuan; Xu, Zhaohui; Zhang, Zhizhou

2016-11-01

It remains an unsolved problem to quantify a natural microbial community by rapidly and conveniently measuring multiple species with functional significance. Most widely used high throughput next-generation sequencing methods can only generate information mainly for genus-level taxonomic identification and quantification, and detection of multiple species in a complex microbial community is still heavily dependent on approaches based on near full-length ribosome RNA gene or genome sequence information. In this study, we used near full-length rRNA gene library sequencing plus Primer-Blast to design species-specific primers based on whole microbial genome sequences. The primers were intended to be specific at the species level within relevant microbial communities, i.e., a defined genomics background. The primers were tested with samples collected from the Daqu (also called fermentation starters) and pit mud of a traditional Chinese liquor production plant. Sixteen pairs of primers were found to be suitable for identification of individual species. Among them, seven pairs were chosen to measure the abundance of microbial species through quantitative PCR. The combination of near full-length ribosome RNA gene library sequencing and Primer-Blast may represent a broadly useful protocol to quantify multiple species in complex microbial population samples with species-specific primers.
Molecular profiling of multiple myeloma: from gene expression analysis to next-generation sequencing.

PubMed

Agnelli, Luca; Tassone, Pierfrancesco; Neri, Antonino

2013-06-01

Multiple myeloma is a fatal malignant proliferation of clonal bone marrow Ig-secreting plasma cells, characterized by wide clinical, biological, and molecular heterogeneity. Herein, global gene and microRNA expression, genome-wide DNA profilings, and next-generation sequencing technology used to investigate the genomic alterations underlying the bio-clinical heterogeneity in multiple myeloma are discussed. High-throughput technologies have undoubtedly allowed a better comprehension of the molecular basis of the disease, a fine stratification, and early identification of high-risk patients, and have provided insights toward targeted therapy studies. However, such technologies are at risk of being affected by laboratory- or cohort-specific biases, and are moreover influenced by a high number of expected false positives. This aspect has a major weight in myeloma, which is characterized by large molecular heterogeneity. Therefore, meta-analysis as well as multiple approaches are desirable if not mandatory to validate the results obtained, in line with commonly accepted recommendations for tumor diagnostic/prognostic biomarker studies.
Hybrid-Lambda: simulation of multiple merger and Kingman gene genealogies in species networks and species trees.

PubMed

Zhu, Sha; Degnan, James H; Goldstien, Sharyn J; Eldon, Bjarki

2015-09-15

There has been increasing interest in coalescent models that admit multiple mergers of ancestral lineages, and in modeling hybridization and coalescence simultaneously. Hybrid-Lambda is a software package that simulates gene genealogies under multiple merger and Kingman's coalescent processes within species networks or species trees. Hybrid-Lambda allows different coalescent processes to be specified for different populations, and allows for time to be converted between generations and coalescent units, by specifying a population size for each population. In addition, Hybrid-Lambda can generate simulated datasets, assuming the infinitely many sites mutation model, and compute the F_ST statistic. As an illustration, we apply Hybrid-Lambda to infer the time of subdivision of certain marine invertebrates under different coalescent processes. Hybrid-Lambda makes it possible to investigate biogeographic concordance among high fecundity species exhibiting skewed offspring distribution.
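The time conversion mentioned above can be illustrated for the simplest case. Under the standard Kingman coalescent for a diploid Wright-Fisher population of size N, one coalescent time unit corresponds to 2N generations. The sketch below covers only that case; the function names are hypothetical, the diploid 2N scaling is an assumed convention rather than a statement about Hybrid-Lambda's internals, and multiple merger coalescents generally use a different time scaling that is not modeled here.

```python
def generations_to_coalescent_units(generations, pop_size, ploidy=2):
    """Kingman scaling: one coalescent unit equals ploidy * pop_size generations."""
    return generations / (ploidy * pop_size)


def coalescent_units_to_generations(units, pop_size, ploidy=2):
    """Inverse of the conversion above."""
    return units * ploidy * pop_size


# Hypothetical branch of 20,000 generations in a population of 10,000 diploids
units = generations_to_coalescent_units(20_000, 10_000)
back = coalescent_units_to_generations(units, 10_000)
print(units, back)
```

Specifying a different population size per branch, as the abstract describes, would amount to calling the conversion with a branch-specific `pop_size`.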
Identification of aberrant gene expression associated with aberrant promoter methylation in primordial germ cells between E13 and E16 rat F3 generation vinclozolin lineage.

PubMed

Taguchi, Y-h

2015-01-01

Transgenerational epigenetics (TGE) are currently considered important in disease, but the mechanisms involved are not yet fully understood. TGE abnormalities expected to cause disease are likely to be initiated during development and to be mediated by aberrant gene expression associated with aberrant promoter methylation that is heritable between generations. However, because methylation is removed and then re-established during development, it is not easy to identify promoter methylation abnormalities by comparing normal lineages with those expected to exhibit TGE abnormalities. This study applied the recently proposed principal component analysis (PCA)-based unsupervised feature extraction to previously reported and publicly available gene expression/promoter methylation profiles of rat primordial germ cells, between E13 and E16 of the F3 generation vinclozolin lineage that are expected to exhibit TGE abnormalities, to identify multiple genes that exhibited aberrant gene expression/promoter methylation during development. The biological feasibility of the identified genes was tested via enrichment analyses of various biological concepts including pathway analysis, gene ontology terms and protein-protein interactions. All validations suggested superiority of the proposed method over three conventional and popular supervised methods that employed t test, limma and significance analysis of microarrays, respectively. The identified genes were globally related to tumors, the prostate, kidney, testis and the immune system and were previously reported to be related to various diseases caused by TGE. Among the genes reported by PCA-based unsupervised feature extraction, we propose that chemokine signaling pathways and leucine rich repeat proteins are key factors that initiate transgenerational epigenetic-mediated diseases, because multiple genes included in these two categories were identified in this study.
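The abstract does not detail how PCA-based unsupervised feature extraction selects genes. A rough sketch of the general idea, under stated assumptions, is to project each gene onto a leading principal component of a genes-by-samples matrix and keep genes whose scores are outliers, without using any sample labels. The version below uses power iteration for the first component only; the z-score cutoff, the choice of component, and the toy data are all illustrative assumptions, not the procedure from the paper.

```python
import random
import statistics


def first_pc_scores(matrix, iterations=100):
    """Score each row (gene) on the first principal component via power iteration."""
    centered = []
    for row in matrix:
        mean = statistics.fmean(row)
        centered.append([v - mean for v in row])
    n = len(centered[0])
    # Sample-by-sample cross-product matrix of the centered data
    cross = [[sum(r[a] * r[b] for r in centered) for b in range(n)] for a in range(n)]
    vec = [1.0] + [0.0] * (n - 1)
    for _ in range(iterations):
        vec = [sum(cross[a][b] * vec[b] for b in range(n)) for a in range(n)]
        norm = sum(v * v for v in vec) ** 0.5
        vec = [v / norm for v in vec]
    return [sum(x * v for x, v in zip(row, vec)) for row in centered]


def outlier_genes(scores, z_cutoff=3.0):
    """Indices of genes whose component score is an outlier (|z| above the cutoff)."""
    mean = statistics.fmean(scores)
    sd = statistics.pstdev(scores)
    return [i for i, s in enumerate(scores) if abs(s - mean) / sd > z_cutoff]


# Toy data: 100 genes x 6 samples of noise, with a strong pattern planted in genes 0-2
rng = random.Random(0)
pattern = [5, 5, 5, -5, -5, -5]
data = [[rng.gauss(0, 0.1) + (pattern[j] if i < 3 else 0) for j in range(6)]
        for i in range(100)]
selected = outlier_genes(first_pc_scores(data))
print(selected)
```

The selected genes would then be passed to enrichment analyses such as those listed in the abstract.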
Identification of aberrant gene expression associated with aberrant promoter methylation in primordial germ cells between E13 and E16 rat F3 generation vinclozolin lineage

PubMed Central

2015-01-01

Background: Transgenerational epigenetics (TGE) are currently considered important in disease, but the mechanisms involved are not yet fully understood. TGE abnormalities expected to cause disease are likely to be initiated during development and to be mediated by aberrant gene expression associated with aberrant promoter methylation that is heritable between generations. However, because methylation is removed and then re-established during development, it is not easy to identify promoter methylation abnormalities by comparing normal lineages with those expected to exhibit TGE abnormalities. Methods: This study applied the recently proposed principal component analysis (PCA)-based unsupervised feature extraction to previously reported and publicly available gene expression/promoter methylation profiles of rat primordial germ cells, between E13 and E16 of the F3 generation vinclozolin lineage that are expected to exhibit TGE abnormalities, to identify multiple genes that exhibited aberrant gene expression/promoter methylation during development. Results: The biological feasibility of the identified genes was tested via enrichment analyses of various biological concepts including pathway analysis, gene ontology terms and protein-protein interactions. All validations suggested superiority of the proposed method over three conventional and popular supervised methods that employed t test, limma and significance analysis of microarrays, respectively. The identified genes were globally related to tumors, the prostate, kidney, testis and the immune system and were previously reported to be related to various diseases caused by TGE. Conclusions: Among the genes reported by PCA-based unsupervised feature extraction, we propose that chemokine signaling pathways and leucine rich repeat proteins are key factors that initiate transgenerational epigenetic-mediated diseases, because multiple genes included in these two categories were identified in this study. PMID:26677731
Generation of gene edited birds in one generation using sperm transfection assisted gene editing (STAGE).

PubMed

Cooper, Caitlin A; Challagulla, Arjun; Jenkins, Kristie A; Wise, Terry G; O'Neil, Terri E; Morris, Kirsten R; Tizard, Mark L; Doran, Timothy J

2017-06-01

Generating transgenic and gene edited mammals involves in vitro manipulation of oocytes or single cell embryos. Due to the comparative inaccessibility of avian oocytes and single cell embryos, novel protocols have been developed to produce transgenic and gene edited birds. While these protocols are relatively efficient, they involve two generation intervals before reaching complete somatic and germline expressing transgenic or gene edited birds. Most of this work has been done with chickens, and many protocols require in vitro culturing of primordial germ cells (PGCs). However, for many other bird species no methodology for long term culture of PGCs exists. Developing methodologies to produce germline transgenic or gene edited birds in the first generation would save significant amounts of time and resource. Furthermore, developing protocols that can be readily adapted to a wide variety of avian species would open up new research opportunities. Here we report a method using sperm as a delivery mechanism for gene editing vectors which we call sperm transfection assisted gene editing (STAGE). We have successfully used this method to generate GFP knockout embryos and chickens, as well as generate embryos with mutations in the doublesex and mab-3 related transcription factor 1 (DMRT1) gene using the CRISPR/Cas9 system. The efficiency of the method varies from as low as 0% to as high as 26% with multiple factors such as CRISPR guide efficiency and mRNA stability likely impacting the outcome. This straightforward methodology could simplify gene editing in many bird species including those for which no methodology currently exists.
Comparative analysis of expressed sequence tags of conifers and angiosperms reveals sequences specifically conserved in conifers.

PubMed

Ujino-Ihara, Tokuko; Kanamori, Hiroyuki; Yamane, Hiroko; Taguchi, Yuriko; Namiki, Nobukazu; Mukai, Yuzuru; Yoshimura, Kensuke; Tsumura, Yoshihiko

2005-12-01

To identify and characterize lineage-specific genes of conifers, two sets of ESTs (with 12791 and 5902 ESTs, representing 5373 and 3018 gene transcripts, respectively) were generated from the Cupressaceae species Cryptomeria japonica and Chamaecyparis obtusa. These transcripts were compared with non-redundant sets of genes generated from Pinaceae species, other gymnosperms and angiosperms. About 6% of tentative unique genes (Unigenes) of C. japonica and C. obtusa had homologs in other conifers but not angiosperms, and about 70% had apparent homologs in angiosperms. The calculated GC contents of orthologous genes showed that GC contents of coniferous genes are likely to be lower than those of angiosperms. Comparisons of the numbers of homologous genes in each species suggest that copy numbers of genes may be correlated between diverse seed plants. This correlation suggests that the multiplicity of such genes may have arisen before the divergence of gymnosperms and angiosperms.

Generation of six multiple sclerosis patient-derived induced pluripotent stem cell lines.

PubMed

Miquel-Serra, L; Duarri, A; Muñoz, Y; Kuebler, B; Aran, B; Costa, C; Martí, M; Comabella, M; Malhotra, S; Montalban, X; Veiga, A; Raya, A

2017-10-01

Multiple sclerosis (MS) is considered a chronic autoimmune disease of the central nervous system that leads to gliosis, demyelination, axonal damage and neuronal death. The MS disease aetiology is unknown, though a polymorphism of the TNFRSF1A gene, rs1800693, is known to confer an increased risk for MS. Using retroviral delivery of reprogramming transgenes, we generated six MS patient-specific iPSC lines with two distinct genotypes, CC or TT, of the polymorphism rs1800693. iPSC lines had normal karyotype, expressed pluripotency genes and differentiated into the three germ layers. These lines offer a good tool to study MS pathomechanisms and for drug testing. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
The Interaction Network Ontology-supported modeling and mining of complex interactions represented with multiple keywords in biomedical literature.

PubMed

Özgür, Arzucan; Hur, Junguk; He, Yongqun

2016-01-01

The Interaction Network Ontology (INO) logically represents biological interactions, pathways, and networks. INO has been demonstrated to be valuable in providing a set of structured ontological terms and associated keywords to support literature mining of gene-gene interactions from biomedical literature. However, previous work using INO focused on single keyword matching, while many interactions are represented with two or more interaction keywords used in combination. This paper reports our extension of INO to include combinatory patterns of two or more literature mining keywords co-existing in one sentence to represent specific INO interaction classes. Such keyword combinations and related INO interaction type information could be automatically obtained via SPARQL queries, saved in Excel format, and used in an INO-supported SciMiner, an in-house literature mining program. We studied the gene interaction sentences from the commonly used benchmark Learning Logic in Language (LLL) dataset and one internally generated vaccine-related dataset to identify and analyze interaction types containing multiple keywords. Patterns obtained from the dependency parse trees of the sentences were used to identify the interaction keywords that are related to each other and collectively represent an interaction type. The INO ontology currently has 575 terms including 202 terms under the interaction branch. The relations between the INO interaction types and associated keywords are represented using the INO annotation relations: 'has literature mining keywords' and 'has keyword dependency pattern'. The keyword dependency patterns were generated via running the Stanford Parser to obtain dependency relation types. Out of the 107 interactions in the LLL dataset represented with two-keyword interaction types, 86 were identified by using the direct dependency relations. The LLL dataset contained 34 gene regulation interaction types, each of which was associated with multiple keywords. A hierarchical display of these 34 interaction types and their ancestor terms in INO resulted in the identification of specific gene-gene interaction patterns from the LLL dataset. The phenomenon of having multi-keyword interaction types was also frequently observed in the vaccine dataset. By modeling and representing multiple textual keywords for interaction types, the extended INO enabled the identification of complex biological gene-gene interactions represented with multiple keywords.
Homology-integrated CRISPR-Cas (HI-CRISPR) system for one-step multigene disruption in Saccharomyces cerevisiae.

PubMed

Bao, Zehua; Xiao, Han; Liang, Jing; Zhang, Lu; Xiong, Xiong; Sun, Ning; Si, Tong; Zhao, Huimin

2015-05-15

One-step multiple gene disruption in the model organism Saccharomyces cerevisiae is a highly useful tool for both basic and applied research, but it remains a challenge. Here, we report a rapid, efficient, and potentially scalable strategy based on the type II Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-CRISPR associated proteins (Cas) system to generate multiple gene disruptions simultaneously in S. cerevisiae. A 100 bp dsDNA mutagenizing homologous recombination donor is inserted between two direct repeats for each target gene in a CRISPR array consisting of multiple donor and guide sequence pairs. An ultrahigh copy number plasmid carrying iCas9, a variant of wild-type Cas9, trans-encoded RNA (tracrRNA), and a homology-integrated crRNA cassette is designed to greatly increase the gene disruption efficiency. As proof of concept, three genes, CAN1, ADE2, and LYP1, were simultaneously disrupted in 4 days with an efficiency ranging from 27 to 87%. Another three genes involved in an artificial hydrocortisone biosynthetic pathway, ATF2, GCY1, and YPR1, were simultaneously disrupted in 6 days with 100% efficiency. This homology-integrated CRISPR (HI-CRISPR) strategy represents a powerful tool for creating yeast strains with multiple gene knockouts.
EnRICH: Extraction and Ranking using Integration and Criteria Heuristics.

PubMed

Zhang, Xia; Greenlee, M Heather West; Serb, Jeanne M

2013-01-15

High throughput screening technologies enable biologists to generate candidate genes at a rate that, due to time and cost constraints, cannot be studied by experimental approaches in the laboratory. Thus, it has become increasingly important to prioritize candidate genes for experiments. To accomplish this, researchers need to apply selection requirements based on their knowledge, which necessitates qualitative integration of heterogeneous data sources and filtration using multiple criteria. A similar approach can also be applied to putative candidate gene relationships. While automation can assist in this routine and imperative procedure, flexibility of data sources and criteria must not be sacrificed. A tool that can optimize the trade-off between automation and flexibility to simultaneously filter and qualitatively integrate data is needed to prioritize candidate genes and generate composite networks from heterogeneous data sources. We developed the Java application EnRICH (Extraction and Ranking using Integration and Criteria Heuristics) to address this need. Here we present a case study in which we used EnRICH to integrate and filter multiple candidate gene lists in order to identify potential retinal disease genes. As a result of this procedure, a candidate pool of several hundred genes was narrowed down to five candidate genes, of which four are confirmed retinal disease genes and one is associated with a retinal disease state. We developed a platform-independent tool that is able to qualitatively integrate multiple heterogeneous datasets and use different selection criteria to filter each of them, provided the datasets are tables that have distinct identifiers (required) and attributes (optional). With the flexibility to specify data sources and filtering criteria, EnRICH automatically prioritizes candidate genes or gene relationships for biologists based on their specific requirements. Here, we also demonstrate that this tool can be effectively and easily used to apply highly specific user-defined criteria and can efficiently identify high quality candidate genes from relatively sparse datasets.
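The filter-then-integrate idea in this abstract can be sketched in Python as follows. This is not EnRICH's implementation (EnRICH is a Java application and the abstract does not describe its internals). The names `filter_table`, `prioritize`, and `min_support`, the rule of ranking by the number of supporting datasets, and the example gene identifiers are all assumptions made for illustration.

```python
from typing import Callable, Dict, List, Set

Table = Dict[str, dict]            # identifier (required) -> attributes (optional)
Criterion = Callable[[dict], bool]


def filter_table(table: Table, keep: Criterion) -> Set[str]:
    """Return the identifiers whose attributes satisfy this table's criterion."""
    return {gene for gene, attrs in table.items() if keep(attrs)}


def prioritize(tables: List[Table], criteria: List[Criterion],
               min_support: int) -> List[str]:
    """Filter each table with its own criterion, then rank identifiers by the
    number of tables in which they survive (ties broken alphabetically)."""
    counts: Dict[str, int] = {}
    for table, keep in zip(tables, criteria):
        for gene in filter_table(table, keep):
            counts[gene] = counts.get(gene, 0) + 1
    kept = [g for g, c in counts.items() if c >= min_support]
    return sorted(kept, key=lambda g: (-counts[g], g))


# Hypothetical inputs: an expression table and a plain candidate list.
expression = {"geneA": {"fold_change": 3.0}, "geneB": {"fold_change": 1.2}}
linkage = {"geneA": {}, "geneC": {}}
criteria = [lambda a: a.get("fold_change", 0.0) >= 2.0, lambda a: True]
print(prioritize([expression, linkage], criteria, min_support=2))
```

Keeping one criterion per table mirrors the abstract's point that each heterogeneous dataset is filtered with its own selection rule before integration.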
A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets.

PubMed

Liu, Li-Zhi; Wu, Fang-Xiang; Zhang, Wen-Jun

2014-01-01

As an abstract mapping of the gene regulations in the cell, a gene regulatory network is important to both biological research and practical applications. The reverse engineering of gene regulatory networks from microarray gene expression data is a challenging research problem in systems biology. With the development of biological technologies, multiple time-course gene expression datasets might be collected for a specific gene network under different circumstances. The inference of a gene regulatory network can be improved by integrating these multiple datasets. It is also known that gene expression data may be contaminated with large errors or outliers, which may affect the inference results. A novel method, Huber group LASSO, is proposed to infer the same underlying network topology from multiple time-course gene expression datasets while remaining robust to large errors or outliers. To solve the optimization problem involved in the proposed method, an efficient algorithm which combines the ideas of auxiliary function minimization and block descent is developed. A stability selection method is adapted to our method to find a network topology consisting of edges with scores. The proposed method is applied to both simulation datasets and real experimental datasets. The results show that Huber group LASSO outperforms the group LASSO in terms of both areas under receiver operating characteristic curves and areas under the precision-recall curves. The convergence analysis of the algorithm theoretically shows that the sequence generated from the algorithm converges to the optimal solution of the problem. The simulation and real data examples demonstrate the effectiveness of the Huber group LASSO in integrating multiple time-course gene expression datasets and improving the resistance to large errors or outliers.
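The two ingredients named in this abstract, the Huber loss and the group LASSO penalty, can be illustrated with the short Python sketch below. It shows only the textbook definitions and is not the authors' algorithm, whose exact objective and update rules are not given in the abstract. Treating a group as the coefficients of one candidate edge across all datasets is an assumption, and the function names are hypothetical.

```python
import math
from typing import List


def huber(r: float, delta: float) -> float:
    """Huber loss: quadratic for small residuals, linear for large ones,
    so outliers contribute less than under squared error."""
    a = abs(r)
    if a <= delta:
        return 0.5 * r * r
    return delta * (a - 0.5 * delta)


def group_soft_threshold(v: List[float], t: float) -> List[float]:
    """Proximal operator of t * ||v||_2, the building block of group LASSO.
    The whole group is shrunk together and set to zero if its norm is <= t."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm <= t:
        return [0.0] * len(v)
    scale = 1.0 - t / norm
    return [scale * x for x in v]


# Assumed grouping: one edge's coefficients in two datasets.
print(huber(0.5, 1.0), huber(3.0, 1.0))
print(group_soft_threshold([3.0, 4.0], 2.5))
```

Because the threshold acts on the whole group, an edge is either retained in every dataset or removed from all of them, which is one way to obtain a single shared topology.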
Milestone reached for ORFeome Collaboration | Office of Cancer Genomics

Cancer.gov

The ORFeome Collaboration (OC) is a team of academic and commercial entities which have generated the largest collection of clones containing verified open-reading frames (ORFs) of known human genes. The clones are made available to researchers worldwide through multiple distributors. This valuable resource allows researchers to easily express and study human genes.
MultiSite Gateway-Compatible Cell Type-Specific Gene-Inducible System for Plants

PubMed Central

Siligato, Riccardo; Wang, Xin; Yadav, Shri Ram; Lehesranta, Satu; Ma, Guojie; Ursache, Robertas; Sevilem, Iris; Zhang, Jing; Gorte, Maartje; Prasad, Kalika; Heidstra, Renze

2016-01-01

A powerful method to study gene function is expression or overexpression in an inducible, cell type-specific system followed by observation of consequent phenotypic changes and visualization of linked reporters in the target tissue. Multiple inducible gene overexpression systems have been developed for plants, but very few of these combine plant selection markers, control of expression domains, access to multiple promoters and protein fusion reporters, chemical induction, and high-throughput cloning capabilities. Here, we introduce a MultiSite Gateway-compatible inducible system for Arabidopsis (Arabidopsis thaliana) plants that provides the capability to generate such constructs in a single cloning step. The system is based on the tightly controlled, estrogen-inducible XVE system. We demonstrate that the transformants generated with this system exhibit the expected cell type-specific expression, similar to what is observed with constitutively expressed native promoters. With this new system, cloning of inducible constructs is no longer limited to a few special cases but can be used as a standard approach when gene function is studied. In addition, we present a set of entry clones consisting of histochemical and fluorescent reporter variants designed for gene and promoter expression studies. PMID:26644504
Refinement of light-responsive transcript lists using rice oligonucleotide arrays: evaluation of gene-redundancy.

PubMed

Jung, Ki-Hong; Dardick, Christopher; Bartley, Laura E; Cao, Peijian; Phetsom, Jirapa; Canlas, Patrick; Seo, Young-Su; Shultz, Michael; Ouyang, Shu; Yuan, Qiaoping; Frank, Bryan C; Ly, Eugene; Zheng, Li; Jia, Yi; Hsia, An-Ping; An, Kyungsook; Chou, Hui-Hsien; Rocke, David; Lee, Geun Cheol; Schnable, Patrick S; An, Gynheung; Buell, C Robin; Ronald, Pamela C

2008-10-06

Studies of gene function are often hampered by gene-redundancy, especially in organisms with large genomes such as rice (Oryza sativa). We present an approach for using transcriptomics data to focus functional studies and address redundancy. To this end, we have constructed and validated an inexpensive and publicly available rice oligonucleotide near-whole genome array, called the rice NSF45K array. We generated expression profiles for light- vs. dark-grown rice leaf tissue and validated the biological significance of the data by analyzing sources of variation and confirming expression trends with reverse transcription polymerase chain reaction. We examined trends in the data by evaluating enrichment of gene ontology terms at multiple false discovery rate thresholds. To compare data generated with the NSF45K array with published results, we developed publicly available, web-based tools (www.ricearray.org). The Oligo and EST Anatomy Viewer enables visualization of EST-based expression profiling data for all genes on the array. The Rice Multi-platform Microarray Search Tool facilitates comparison of gene expression profiles across multiple rice microarray platforms. Finally, we incorporated gene expression and biochemical pathway data to reduce the number of candidate gene products putatively participating in the eight steps of the photorespiration pathway from 52 to 10, based on expression levels of putatively functionally redundant genes. We confirmed the efficacy of this method to cope with redundancy by correctly predicting participation in photorespiration of a gene with five paralogs. Applying these methods will accelerate rice functional genomics.
Characterization of fusion genes and the significantly expressed fusion isoforms in breast cancer by hybrid sequencing.

PubMed

Weirather, Jason L; Afshar, Pegah Tootoonchi; Clark, Tyson A; Tseng, Elizabeth; Powers, Linda S; Underwood, Jason G; Zabner, Joseph; Korlach, Jonas; Wong, Wing Hung; Au, Kin Fai

2015-10-15

We developed an innovative hybrid sequencing approach, IDP-fusion, to detect fusion genes, determine fusion sites and identify and quantify fusion isoforms. IDP-fusion is the first method to study gene fusion events by integrating Third Generation Sequencing long reads and Second Generation Sequencing short reads. We applied IDP-fusion to PacBio data and Illumina data from the MCF-7 breast cancer cells. Compared with the existing tools, IDP-fusion detects fusion genes at higher precision and a very low false positive rate. The results show that IDP-fusion will be useful for unraveling the complexity of multiple fusion splices and fusion isoforms within tumorigenesis-relevant fusion genes. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Sex-specific Effects of Exercise Ancestry on Metabolic, Morphological, and Gene Expression Phenotypes in Multiple Generations of Mouse Offspring

PubMed Central

Guth, Lisa M.; Ludlow, Andrew T.; Witkowski, Sarah; Marshall, Mallory R.; Lima, Laila C. J.; Venezia, Andrew C.; Xiao, Tao; Lee, Mei-Ling Ting; Spangenburg, Espen E.; Roth, Stephen M.

2013-01-01

Early life and pre-conception environmental stimuli can affect adult health-related phenotypes. Exercise training is an environmental stimulus affecting many systems throughout the body and appears to alter offspring phenotypes. The aim of this study was to examine the influence of parental exercise training, or “exercise ancestry,” on morphological and metabolic phenotypes in two generations of mouse offspring. F0 C57BL/6 mice were exposed to voluntary exercise or sedentary lifestyle and bred with like-exposed mates to produce an F1 generation. F1 mice of both ancestries were sedentary and sacrificed at 8 wk or bred with littermates to produce an F2 generation, which was also sedentary and sacrificed at 8 wk. Small, but broad generation- and sex-specific effects of exercise ancestry were observed for body mass, fat and muscle mass, serum insulin, glucose tolerance, and muscle gene expression. F1 EX females were lighter than F1 SED females, and had lower absolute tibialis anterior and omental fat masses. Serum insulin was higher in F1 EX females compared to F1 SED females. F2 EX females had impaired glucose tolerance compared to F2 SED females. Analysis of skeletal muscle mRNA levels revealed several generation- and sex-specific differences in mRNA levels for multiple genes, especially those related to metabolic genes (e.g., F1 EX males had lower mRNA levels of Hk2, Ppard, Ppargc1α, Adipoq, and Scd1 than F1 SED males). These results provide preliminary evidence that parental exercise training can influence health-related phenotypes in mouse offspring. PMID:23771910
Genetic variation and biological activity of isolates of Lymantria dispar multiple nucleopolyhedrovirus from North America, Europe, and Asia

USDA-ARS's Scientific Manuscript database

Little is known about genetic variation of Lymantria dispar multiple nucleopolyhedrovirus (LdMNPV; Baculoviridae: Alphabaculovirus) at the nucleotide sequence level. To obtain a more comprehensive view of genetic diversity among isolates of LdMNPV, partial sequences of the lef-8 gene were generated...
A Prototype System for Retrieval of Gene Functional Information

PubMed Central

Folk, Lillian C.; Patrick, Timothy B.; Pattison, James S.; Wolfinger, Russell D.; Mitchell, Joyce A.

2003-01-01

Microarrays allow researchers to gather data about the expression patterns of thousands of genes simultaneously. Statistical analysis can reveal which genes show statistically significant results. Making biological sense of those results requires the retrieval of functional information about the genes thus identified, typically a manual gene-by-gene retrieval of information from various on-line databases. For experiments generating thousands of genes of interest, retrieval of functional information can become a significant bottleneck. To address this issue, we are currently developing a prototype system to automate the process of retrieval of functional information from multiple on-line sources. PMID:14728346
In Vivo Imaging of Transgenic Gene Expression in Individual Retinal Progenitors in Chimeric Zebrafish Embryos to Study Cell Nonautonomous Influences.

PubMed

Dudczig, Stefanie; Currie, Peter D; Poggi, Lucia; Jusuf, Patricia R

2017-03-22

Its genetic and technical strengths have made the zebrafish a key vertebrate model organism in which the consequences of gene manipulations can be traced in vivo throughout the rapid developmental period. Multiple processes can be studied including cell proliferation, gene expression, cell migration and morphogenesis. Importantly, the generation of chimeras through transplantations can be easily performed, allowing mosaic labeling and tracking of individual cells under the influence of the host environment. For example, by combining functional gene manipulations of the host embryo (e.g., through morpholino microinjection) and live imaging, the effects of extrinsic, cell nonautonomous signals (provided by the genetically modified environment) on individual transplanted donor cells can be assessed. Here we demonstrate how this approach is used to compare the onset of fluorescent transgene expression as a proxy for the timing of cell fate determination in different genetic host environments. In this article, we provide the protocol for microinjecting zebrafish embryos to mark donor cells and to cause gene knockdown in host embryos, a description of the transplantation technique used to generate chimeric embryos, and the protocol for preparing and running in vivo time-lapse confocal imaging of multiple embryos. In particular, performing multiposition imaging is crucial when comparing timing of events such as the onset of gene expression. This requires data collection from multiple control and experimental embryos processed simultaneously. Such an approach can easily be extended for studies of extrinsic influences in any organ or tissue of choice accessible to live imaging, provided that transplantations can be targeted easily according to established embryonic fate maps.
The "11K" gene family members sf68, sf95 and sf138 modulate transmissibility and insecticidal properties of Spodoptera frugiperda multiple nucleopolyhedrovirus.

PubMed

Beperet, Inés; Simón, Oihane; Williams, Trevor; López-Ferber, Miguel; Caballero, Primitivo

2015-05-01

The "11K" gene family is notable for having homologs in both baculoviruses and entomopoxviruses and is classified as either type 145 or type 150, according to their similarity with the ac145 or ac150 genes of Autographa californica multiple nucleopolyhedrovirus (AcMNPV). One homolog of ac145 (sf138) and two homologs of ac150 (sf68 and sf95) are present in Spodoptera frugiperda multiple nucleopolyhedrovirus (SfMNPV). Recombinant bacmids lacking sf68, sf95 or sf138 (Sf68null, Sf95null and Sf138null, respectively) and the respective repair bacmids were generated from a bacmid comprising the complete virus genome. Occlusion bodies (OBs) of the Sf138null virus were ∼15-fold less orally infective to insects, which was attributed to a 100-fold reduction in ODV infectious titer. Inoculation of insects with Sf138null OBs in mixtures with an optical brightener failed to restore the pathogenicity of Sf138null OBs to that of the parental virus, indicating that the effects of sf138 deletion on OB pathogenicity were unlikely to involve an interaction with the gut peritrophic matrix. In contrast, deletion of sf68 and sf95 resulted in a slower speed-of-kill by 9h, and a concurrent increase in the yield of OBs. Phylogenetic analysis indicated that sf68 and sf95 were not generated after a duplication event of an ancestral gene homologous to the ac150 gene. We conclude that type 145 genes modulate the primary infection process of the virus, whereas type 150 genes appear to have a role in spreading systemic infection within the insect. Copyright © 2015 Elsevier Inc. All rights reserved.
Speed control: cogs and gears that drive the circadian clock.

PubMed

Zheng, Xiangzhong; Sehgal, Amita

2012-09-01

In most organisms, an intrinsic circadian (~24-h) timekeeping system drives rhythms of physiology and behavior. Within cells that contain a circadian clock, specific transcriptional activators and repressors reciprocally regulate each other to generate a basic molecular oscillator. A mismatch of the period generated by this oscillator with the external environment creates circadian disruption, which can have adverse effects on neural function. Although several clock genes have been extensively characterized, a fundamental question remains: how do these genes work together to generate a ~24-h period? Period-altering mutations in clock genes can affect any of multiple regulated steps in the molecular oscillator. In this review, we examine the regulatory mechanisms that contribute to setting the pace of the circadian oscillator. Copyright © 2012 Elsevier Ltd. All rights reserved.
MU OPIOID RECEPTORS IN PAIN MANAGEMENT

PubMed Central

Pasternak, Gavril; Pan, Ying-Xian

2014-01-01

Most of the potent analgesics currently in use act through the mu opioid receptor. Although they are classified as mu opioids, clinical experience suggests differences among them. The relative potencies of the agents can vary from patient to patient, as well as the side-effect profiles. These observations, coupled with pharmacological approaches in preclinical models, led to the suggestion of multiple subtypes of mu receptors. The explosion in molecular biology has led to the identification of a single gene encoding mu opioid receptors. It now appears that this gene undergoes extensive splicing, in which a single gene can generate multiple proteins. Evidence now suggests that these splice variants may help explain the clinical variability in responses among patients. PMID:21453899
Development of a Recombination System for the Generation of Occlusion Positive Genetically Modified Anticarsia Gemmatalis Multiple Nucleopolyhedrovirus

PubMed Central

Haase, Santiago; McCarthy, Christina B.; Ferrelli, M. Leticia; Pidre, Matias L.; Sciocco-Cap, Alicia; Romanowski, Victor

2015-01-01

Anticarsia gemmatalis is an important pest in legume crops in South America and it has been successfully controlled using Anticarsia gemmatalis Multiple Nucleopolyhedrovirus (AgMNPV) in subtropical climate zones. Nevertheless, in temperate climates its speed of kill is too slow. Taking this into account, genetic modification of AgMNPV could lead to improvements of its biopesticidal properties. Here we report the generation of a two-component system that allows the production of recombinant AgMNPV. This system is based on a parental AgMNPV in which the polyhedrin gene (polh) was replaced by a bacterial β-galactosidase (lacZ) gene flanked by two target sites for the homing endonuclease I-PpoI. Co-transfection of insect cells with linearized (I-PpoI-digested) parental genome and a transfer vector allowed the restitution of polh and the expression of a heterologous gene upon homologous recombination, with a low background of non-recombinant AgMNPV. The system was validated by constructing a recombinant occlusion-positive (polh+) AgMNPV expressing the green fluorescent protein gene (gfp). This recombinant virus infected larvae normally per os and led to the expression of GFP in cell culture as well as in A. gemmatalis larvae. These results demonstrate that the system is an efficient method for the generation of recombinant AgMNPV expressing heterologous genes, which can be used for manifold purposes, including biotechnological and pharmaceutical applications and the production of orally infectious recombinants with improved biopesticidal properties. PMID:25835531
Expression of the lef5 gene from Spodoptera exigua multiple nucleopolyhedrovirus contributes to the baculovirus stability in cell culture.

PubMed

Martínez-Solís, María; Jakubowska, Agata K; Herrero, Salvador

2017-10-01

Baculoviruses are a broad group of viruses infecting insects, predominately of the order Lepidoptera. They are used worldwide as biological insecticides and as expression vectors to produce recombinant proteins. Baculoviruses replicate in their host, although several cell lines have been developed for in vitro replication. Nevertheless, replication of baculoviruses in cell culture involves the generation of defective viruses with a decrease in productivity and virulence. Transcriptional studies of the Spodoptera exigua multiple nucleopolyhedrovirus (SeMNPV) and the Autographa californica multiple nucleopolyhedrovirus (AcMNPV) infective process revealed differences in the expression patterns when the virus replicated under in vitro (Se301 cells) or in vivo (S. exigua larvae) conditions. The late expression factor 5 (lef5) gene was found to be highly overexpressed when the virus replicates in larvae. To test the possible role of lef5 expression in viral stability, recombinant AcMNPV expressing the lef5 gene from SeMNPV (Se-lef5) was generated and its stability was monitored during successive infection passages in Sf21 cells by evaluating the loss of several essential and non-essential genes. The gfp transgene was more stable in those viruses expressing the Se-LEF5 protein and the GFP-defective viruses were accumulated at a lower level when compared to its control viruses, confirming the positive influence of lef5 in viral stability during the multiplication process. This work describes for the first time a viral factor involved in transgene stability when baculoviruses replicate in cell culture, opening new ways to facilitate the in vitro production of recombinant proteins using baculovirus.
STOP using just GO: a multi-ontology hypothesis generation tool for high throughput experimentation

PubMed Central

2013-01-01

Background: Gene Ontology (GO) enrichment analysis remains one of the most common methods for hypothesis generation from high throughput datasets. However, we believe that researchers strive to test other hypotheses that fall outside of GO. Here, we developed and evaluated a tool for hypothesis generation from gene or protein lists using ontological concepts present in manually curated text that describes those genes and proteins. Results: As a consequence, we have developed the method Statistical Tracking of Ontological Phrases (STOP), which expands the realm of testable hypotheses in gene set enrichment analyses by integrating automated annotations of genes to terms from over 200 biomedical ontologies. While not as precise as manually curated terms, we find that the additional enriched concepts have value when coupled with traditional enrichment analyses using curated terms. Conclusion: Multiple ontologies have been developed for gene and protein annotation. By using a dataset of both manually curated GO terms and automatically recognized concepts from curated text, we can expand the realm of hypotheses that can be discovered. The web application STOP is available at http://mooneygroup.org/stop/. PMID:23409969
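For readers unfamiliar with term enrichment, the Python sketch below shows a one-sided hypergeometric test, a statistic commonly used for this kind of analysis. The abstract does not state which statistic STOP uses, so this choice is an assumption, and the function name and the example counts are hypothetical.

```python
from math import comb


def enrichment_p_value(population: int, annotated: int,
                       drawn: int, hits: int) -> float:
    """P(X >= hits) when `drawn` genes are sampled without replacement from
    `population` genes, of which `annotated` carry the term of interest."""
    upper = min(annotated, drawn)
    total = comb(population, drawn)
    tail = sum(comb(annotated, i) * comb(population - annotated, drawn - i)
               for i in range(hits, upper + 1))
    return tail / total


# Made-up counts: 20000 genes, 150 annotated to a term, 300 in the gene list,
# 12 of which carry the term.
print(enrichment_p_value(20000, 150, 300, 12))
```

The same calculation applies whether the term comes from curated GO annotation or from automatically recognized ontology concepts. Only the annotation counts change.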
Origins of extrinsic variability in eukaryotic gene expression

NASA Astrophysics Data System (ADS)

Volfson, Dmitri; Marciniak, Jennifer; Blake, William J.; Ostroff, Natalie; Tsimring, Lev S.; Hasty, Jeff

2006-02-01

Variable gene expression within a clonal population of cells has been implicated in a number of important processes including mutation and evolution, determination of cell fates and the development of genetic disease. Recent studies have demonstrated that a significant component of expression variability arises from extrinsic factors thought to influence multiple genes simultaneously, yet the biological origins of this extrinsic variability have received little attention. Here we combine computational modelling with fluorescence data generated from multiple promoter-gene inserts in Saccharomyces cerevisiae to identify two major sources of extrinsic variability. One unavoidable source arising from the coupling of gene expression with population dynamics leads to a ubiquitous lower limit for expression variability. A second source, which is modelled as originating from a common upstream transcription factor, exemplifies how regulatory networks can convert noise in upstream regulator expression into extrinsic noise at the output of a target gene. Our results highlight the importance of the interplay of gene regulatory networks with population heterogeneity for understanding the origins of cellular diversity.

Origins of extrinsic variability in eukaryotic gene expression

NASA Astrophysics Data System (ADS)

Volfson, Dmitri; Marciniak, Jennifer; Blake, William J.; Ostroff, Natalie; Tsimring, Lev S.; Hasty, Jeff

2006-03-01

Variable gene expression within a clonal population of cells has been implicated in a number of important processes including mutation and evolution, determination of cell fates and the development of genetic disease. Recent studies have demonstrated that a significant component of expression variability arises from extrinsic factors thought to influence multiple genes in concert, yet the biological origins of this extrinsic variability have received little attention. Here we combine computational modeling with fluorescence data generated from multiple promoter-gene inserts in Saccharomyces cerevisiae to identify two major sources of extrinsic variability. One unavoidable source arising from the coupling of gene expression with population dynamics leads to a ubiquitous noise floor in expression variability. A second source which is modeled as originating from a common upstream transcription factor exemplifies how regulatory networks can convert noise in upstream regulator expression into extrinsic noise at the output of a target gene. Our results highlight the importance of the interplay of gene regulatory networks with population heterogeneity for understanding the origins of cellular diversity.
YY1 Regulates Melanocyte Development and Function by Cooperating with MITF

PubMed Central

Bell, Robert J. A.; Tran, Thanh-Nga T.; Haq, Rizwan; Liu, Huifei; Love, Kevin T.; Langer, Robert; Anderson, Daniel G.; Larue, Lionel; Fisher, David E.

2012-01-01

Studies of coat color mutants have greatly contributed to the discovery of genes that regulate melanocyte development and function. Here, we generated Yy1 conditional knockout mice in the melanocyte-lineage and observed profound melanocyte deficiency and premature gray hair, similar to the loss of melanocytes in human piebaldism and Waardenburg syndrome. Although YY1 is a ubiquitous transcription factor, YY1 interacts with M-MITF, the Waardenburg Syndrome IIA gene and a master transcriptional regulator of melanocytes. YY1 cooperates with M-MITF in regulating the expression of piebaldism gene KIT and multiple additional pigmentation genes. Moreover, ChIP–seq identified genome-wide YY1 targets in the melanocyte lineage. These studies mechanistically link genes implicated in human conditions of melanocyte deficiency and reveal how a ubiquitous factor (YY1) gains lineage-specific functions by co-regulating gene expression with a lineage-restricted factor (M-MITF)—a general mechanism which may confer tissue-specific gene expression in multiple lineages. PMID:22570637
Transgenerational Epigenetic Programming of the Embryonic Testis Transcriptome

PubMed Central

Anway, Matthew D.; Rekow, Stephen S.; Skinner, Michael K.

2008-01-01

Embryonic exposure to the endocrine disruptor vinclozolin during gonadal sex determination appears to promote an epigenetic reprogramming of the male germ-line that is associated with transgenerational adult onset disease states. Transgenerational effects on the embryonic day 16 (E16) testis demonstrated reproducible changes in the testis transcriptome for multiple generations (F1-F3). The expression of 196 genes was found to be influenced, with the majority of gene expression being decreased or silenced. Dramatic changes in the gene expression of methyltransferases during gonadal sex determination were observed in the F1 and F2 vinclozolin generation (E16) embryonic testis, but the majority returned to control generation levels by the F3 generation. The most dramatic effects were on the germ-line associated Dnmt3A and Dnmt3L isoforms. Observations demonstrate that an embryonic exposure to vinclozolin appears to promote an epigenetic reprogramming of the male germ-line that correlates with transgenerational alterations in the testis transcriptome in subsequent generations. PMID:18042343
Intrinsic incompatibilities evolving as a by-product of divergent ecological selection: Considering them in empirical studies on divergence with gene flow.

PubMed

Kulmuni, J; Westram, A M

2017-06-01

The possibility of intrinsic barriers to gene flow is often neglected in empirical research on local adaptation and speciation with gene flow, for example when interpreting patterns observed in genome scans. However, we draw attention to the fact that, even with gene flow, divergent ecological selection may generate intrinsic barriers involving both ecologically selected and other interacting loci. Mechanistically, the link between the two types of barriers may be generated by genes that have multiple functions (i.e., pleiotropy), and/or by gene interaction networks. Because most genes function in complex networks, and their evolution is not independent of other genes, changes evolving in response to ecological selection can generate intrinsic barriers as a by-product. A crucial question is to what extent such by-product barriers contribute to divergence and speciation, that is, whether they stably reduce gene flow. We discuss under which conditions by-product barriers may increase isolation. However, we also highlight that, depending on the conditions (e.g., the amount of gene flow and the strength of selection acting on the intrinsic vs. the ecological barrier component), the intrinsic incompatibility may actually destabilize barriers to gene flow. In practice, intrinsic barriers generated as a by-product of divergent ecological selection may generate peaks in genome scans that cannot easily be interpreted. We argue that empirical studies on divergence with gene flow should consider the possibility of both ecological and intrinsic barriers. Future progress will likely come from work combining population genomic studies, experiments quantifying fitness and molecular studies on protein function and interactions. © 2017 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.
Molecular Characterization of Transgene Integration by Next-Generation Sequencing in Transgenic Cattle

PubMed Central

Zhang, Ran; Yin, Yinliang; Zhang, Yujun; Li, Kexin; Zhu, Hongxia; Gong, Qin; Wang, Jianwu; Hu, Xiaoxiang; Li, Ning

2012-01-01

As the number of transgenic livestock increases, reliable detection and molecular characterization of transgene integration sites and copy number are crucial not only for interpreting the relationship between the integration site and the specific phenotype but also for commercial and economic demands. However, the ability of conventional PCR techniques to detect incomplete and multiple integration events is limited, making it technically challenging to characterize transgenes. Next-generation sequencing has enabled cost-effective, routine and widespread high-throughput genomic analysis. Here, we demonstrate the use of next-generation sequencing to extensively characterize cattle harboring a 150-kb human lactoferrin transgene that was initially analyzed by chromosome walking without success. Using this approach, the sites upstream and downstream of the target gene integration site in the host genome were identified at the single nucleotide level. The sequencing result was verified by event-specific PCR for the integration sites and FISH for the chromosomal location. Sequencing depth analysis revealed that multiple copies of the incomplete target gene and the vector backbone were present in the host genome. Upon integration, complex recombination was also observed between the target gene and the vector backbone. These findings indicate that next-generation sequencing is a reliable and accurate approach for the molecular characterization of the transgene sequence, integration sites and copy number in transgenic species. PMID:23185606
Molecular processes of transgenerational acclimation to a warming ocean

NASA Astrophysics Data System (ADS)

Veilleux, Heather D.; Ryu, Taewoo; Donelson, Jennifer M.; van Herwerden, Lynne; Seridi, Loqmane; Ghosheh, Yanal; Berumen, Michael L.; Leggat, William; Ravasi, Timothy; Munday, Philip L.

2015-12-01

Some animals have the remarkable capacity to acclimate across generations to projected future climate change; however, the underlying molecular processes are unknown. We sequenced and assembled de novo transcriptomes of adult tropical reef fish exposed developmentally or transgenerationally to projected future ocean temperatures and correlated the resulting expression profiles with acclimated metabolic traits from the same fish. We identified 69 contigs representing 53 key genes involved in thermal acclimation of aerobic capacity. Metabolic genes were among the most upregulated transgenerationally, suggesting shifts in energy production for maintaining performance at elevated temperatures. Furthermore, immune- and stress-responsive genes were upregulated transgenerationally, indicating a new complement of genes allowing the second generation of fish to better cope with elevated temperatures. Other differentially expressed genes were involved with tissue development and transcriptional regulation. Overall, we found a similar suite of differentially expressed genes among developmental and transgenerational treatments. Heat-shock protein genes were surprisingly unresponsive, indicating that short-term heat-stress responses may not be a good indicator of long-term acclimation capacity. Our results are the first to reveal the molecular processes that may enable marine fishes to adjust to a future warmer environment over multiple generations.
Simple method for assembly of CRISPR synergistic activation mediator gRNA expression array.

PubMed

Vad-Nielsen, Johan; Nielsen, Anders Lade; Luo, Yonglun

2018-05-20

When studying complex interconnected regulatory networks, effective methods for simultaneously manipulating the expression of multiple genes are paramount. Previously, we developed a simple method for generation of an all-in-one CRISPR gRNA expression array. Here we present a Golden Gate Assembly-based system of synergistic activation mediator (SAM) compatible CRISPR/dCas9 gRNA expression array for the simultaneous activation of multiple genes. Using this system, we demonstrated the simultaneous activation of the transcription factors TWIST, SNAIL, SLUG, and ZEB1 in a human breast cancer cell line. Copyright © 2018 Elsevier B.V. All rights reserved.
Multiple-strand displacement and identification of single nucleotide polymorphisms as markers of genotypic variation of Pasteuria penetrans biotypes infecting root-knot nematodes.

PubMed

Nong, Guang; Chow, Virginia; Schmidt, Liesbeth M; Dickson, Don W; Preston, James F

2007-08-01

Pasteuria species are endospore-forming obligate bacterial parasites of soil-inhabiting nematodes and water-inhabiting cladocerans, e.g. water fleas, and are closely related to Bacillus spp. by 16S rRNA gene sequence. As naturally occurring bacteria, biotypes of Pasteuria penetrans are attractive candidates for the biocontrol of various Meloidogyne spp. (root-knot nematodes). Failure to culture these bacteria outside their hosts has prevented isolation of genomic DNA in quantities sufficient for identification of genes associated with host recognition and virulence. We have applied multiple-strand displacement amplification (MDA) to generate DNA for comparative genomics of biotypes exhibiting different host preferences. Using the genome of Bacillus subtilis as a paradigm, MDA allowed quantitative detection and sequencing of 12 marker genes from 2000 cells. Meloidogyne spp. infected with P. penetrans P20 or B4 contained single nucleotide polymorphisms (SNPs) in the spoIIAB gene that did not change the amino acid sequence, or that substituted amino acids with similar chemical properties. Individual nematodes infected with P. penetrans P20 or B4 contained SNPs in the spoIIAB gene sequenced in MDA-generated products. Detection of SNPs in the spoIIAB gene in a nematode indicates infection by more than one genotype, supporting the need to sequence genomes of Pasteuria spp. derived from single spore isolates.
Allen Brain Atlas-Driven Visualizations: a web-based gene expression energy visualization tool.

PubMed

Zaldivar, Andrew; Krichmar, Jeffrey L

2014-01-01

The Allen Brain Atlas-Driven Visualizations (ABADV) is a publicly accessible web-based tool created to retrieve and visualize expression energy data from the Allen Brain Atlas (ABA) across multiple genes and brain structures. Though the ABA offers their own search engine and software for researchers to view their growing collection of online public data sets, including extensive gene expression and neuroanatomical data from human and mouse brain, many of their tools limit the number of genes and brain structures researchers can view at once. To complement their work, ABADV generates multiple pie charts, bar charts and heat maps of expression energy values for any given set of genes and brain structures. Such a suite of free and easy-to-understand visualizations allows for easy comparison of gene expression across multiple brain areas. In addition, each visualization links back to the ABA so researchers may view a summary of the experimental detail. ABADV is currently supported on modern web browsers and is compatible with expression energy data from the Allen Mouse Brain Atlas in situ hybridization data. With this web application, researchers can immediately obtain and survey large amounts of expression energy data from the ABA, which they can then use to supplement their work or perform meta-analysis. In the future, we hope to enable ABADV across multiple data resources.
A methodology to migrate the gene ontology to a description logic environment using DAML+OIL.

PubMed

Wroe, C J; Stevens, R; Goble, C A; Ashburner, M

2003-01-01

The Gene Ontology Next Generation Project (GONG) is developing a staged methodology to evolve the current representation of the Gene Ontology into DAML+OIL in order to take advantage of the richer formal expressiveness and the reasoning capabilities of the underlying description logic. Each stage provides a step level increase in formal explicit semantic content with a view to supporting validation, extension and multiple classification of the Gene Ontology. The paper introduces DAML+OIL and demonstrates the activity within each stage of the methodology and the functionality gained.
The long tail of molecular alterations in non-small cell lung cancer: a single-institution experience of next-generation sequencing in clinical molecular diagnostics.

PubMed

Fumagalli, Caterina; Vacirca, Davide; Rappa, Alessandra; Passaro, Antonio; Guarize, Juliana; Rafaniello Raviele, Paola; de Marinis, Filippo; Spaggiari, Lorenzo; Casadio, Chiara; Viale, Giuseppe; Barberis, Massimo; Guerini-Rocco, Elena

2018-03-13

Molecular profiling of advanced non-small cell lung cancers (NSCLC) is essential to identify patients who may benefit from targeted treatments. In recent years, the number of potentially actionable molecular alterations has rapidly increased. Next-generation sequencing allows for the analysis of multiple genes simultaneously. The aim was to evaluate the feasibility and the throughput of next-generation sequencing in clinical molecular diagnostics of advanced NSCLC. A single-institution cohort of 535 non-squamous NSCLC was profiled using a next-generation sequencing panel targeting 22 actionable and cancer-related genes. 441 non-squamous NSCLC (82.4%) harboured at least one gene alteration, including 340 cases (63.6%) with clinically relevant molecular aberrations. Mutations were detected in all but one gene (FGFR1) of the panel. Recurrent alterations were observed in KRAS, TP53, EGFR, STK11 and MET genes, whereas the remaining genes were mutated in <5% of the cases. Concurrent mutations were detected in 183 tumours (34.2%), mostly impairing KRAS or EGFR in association with TP53 alterations. The study highlights the feasibility of targeted next-generation sequencing in the clinical setting. The majority of NSCLC harboured mutations in clinically relevant genes, thus identifying patients who might benefit from different targeted therapies. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
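As a quick arithmetic check on the figures quoted in the abstract above, the short sketch below recomputes each percentage from the stated case counts and the cohort size of 535. The variable and function names are illustrative only and do not come from the paper.

```python
# Counts and percentages as reported in the abstract above.
COHORT_SIZE = 535

reported = {
    "at least one gene alteration": (441, 82.4),
    "clinically relevant aberrations": (340, 63.6),
    "concurrent mutations": (183, 34.2),
}


def percent(count, total):
    """Return count/total as a percentage rounded to one decimal place."""
    return round(100 * count / total, 1)


for label, (count, stated) in reported.items():
    computed = percent(count, COHORT_SIZE)
    print(f"{label}: {count}/{COHORT_SIZE} = {computed}% (stated {stated}%)")
```

All three recomputed values agree with the percentages stated in the abstract.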
The CanOE strategy: integrating genomic and metabolic contexts across multiple prokaryote genomes to find candidate genes for orphan enzymes.

PubMed

Smith, Adam Alexander Thil; Belda, Eugeni; Viari, Alain; Medigue, Claudine; Vallenet, David

2012-05-01

Of all biochemically characterized metabolic reactions formalized by the IUBMB, over one out of four have yet to be associated with a nucleic or protein sequence, i.e. are sequence-orphan enzymatic activities. Few bioinformatics annotation tools are able to propose candidate genes for such activities by exploiting context-dependent rather than sequence-dependent data, and none are readily accessible and propose result integration across multiple genomes. Here, we present CanOE (Candidate genes for Orphan Enzymes), a four-step bioinformatics strategy that proposes ranked candidate genes for sequence-orphan enzymatic activities (or orphan enzymes for short). The first step locates "genomic metabolons", i.e. groups of co-localized genes coding proteins catalyzing reactions linked by shared metabolites, in one genome at a time. These metabolons can be particularly helpful for aiding bioanalysts to visualize relevant metabolic data. In the second step, they are used to generate candidate associations between un-annotated genes and gene-less reactions. The third step integrates these gene-reaction associations over several genomes using gene families, and summarizes the strength of family-reaction associations by several scores. In the final step, these scores are used to rank members of gene families which are proposed for metabolic reactions. These associations are of particular interest when the metabolic reaction is a sequence-orphan enzymatic activity. Our strategy found over 60,000 genomic metabolons in more than 1,000 prokaryote organisms from the MicroScope platform, generating candidate genes for many metabolic reactions, including more than 70 distinct orphan reactions. A computational validation of the approach is discussed. Finally, we present a case study on the anaerobic allantoin degradation pathway in Escherichia coli K-12.
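The first step described above (grouping co-localized genes whose reactions share metabolites) can be illustrated with a toy sketch. This is not the CanOE implementation: the function name `find_metabolons`, the `max_gap` threshold, the gene names and the metabolite labels are all hypothetical, and the grouping rule (connected components over "close together and sharing a metabolite") is a simplifying assumption.

```python
from itertools import combinations


def find_metabolons(gene_positions, gene_metabolites, max_gap=2):
    """Group genes that are close on the chromosome and share a metabolite.

    gene_positions: gene name -> ordinal position along the chromosome.
    gene_metabolites: gene name -> set of metabolites used by its reaction(s).
    Two genes are linked when they lie within max_gap positions of each
    other and have at least one metabolite in common; groups are the
    connected components of that link graph with more than one gene.
    """
    genes = sorted(gene_positions, key=gene_positions.get)
    parent = {g: g for g in genes}

    def find(g):
        # Union-find root lookup with path halving.
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    for a, b in combinations(genes, 2):
        close = abs(gene_positions[a] - gene_positions[b]) <= max_gap
        shared = gene_metabolites[a] & gene_metabolites[b]
        if close and shared:
            parent[find(a)] = find(b)

    groups = {}
    for g in genes:
        groups.setdefault(find(g), []).append(g)
    return [members for members in groups.values() if len(members) > 1]


# Hypothetical toy genome: geneD shares a metabolite with geneC but is far away.
positions = {"geneA": 1, "geneB": 2, "geneC": 3, "geneD": 10}
metabolites = {
    "geneA": {"m1", "m2"},
    "geneB": {"m2", "m3"},
    "geneC": {"m3", "m4"},
    "geneD": {"m4", "m5"},
}
print(find_metabolons(positions, metabolites))
```

In this toy input, geneA, geneB and geneC form one group because each adjacent pair shares a metabolite, while geneD is excluded by distance.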
Further enhanced production of heterologous proteins by double-gene disruption (ΔAosedD ΔAovps10) in a hyper-producing mutant of Aspergillus oryzae.

PubMed

Zhu, Lin; Maruyama, Jun-ichi; Kitamoto, Katsuhiko

2013-07-01

The filamentous fungus Aspergillus oryzae is used as one of the most favored hosts for heterologous protein production due to its ability to secrete large amounts of proteins into the culture medium. We previously generated a hyper-producing mutant strain of A. oryzae, AUT1, which produced 3.2- and 2.6-fold higher levels of bovine chymosin (CHY) and human lysozyme (HLY), respectively, compared with the wild-type strain. However, further enhancement of heterologous protein production by multiple gene disruption is difficult because of the low gene-targeting efficiency in strain AUT1. Here, we disrupted the ligD gene, which is involved in nonhomologous recombination, and the pyrG gene to create uridine/uracil auxotrophy in strain AUT1, to generate a hyper-producing mutant applicable to pyrG marker recycling with highly efficient gene targeting. We generated single and double disruptants of the tripeptidyl peptidase gene AosedD and vacuolar sorting receptor gene Aovps10 in the hyper-producing mutant background, and found that all disruptants showed significant increases in heterologous protein production. Particularly, double disruption of the Aovps10 and AosedD genes increased the production levels of CHY and HLY by 1.6- and 2.1-fold, respectively, compared with the parental strain. Thus, we successfully generated a fungal host for further enhancing the heterologous protein production ability by combining mutational and molecular breeding techniques.
Development of a Markerless Genetic Exchange System in Desulfovibrio vulgaris Hildenborough and Its Use in Generating a Strain with Increased Transformation Efficiency

DOE Office of Scientific and Technical Information (OSTI.GOV)

Keller, Kimberly L.; Bender, Kelly S.; Wall, Judy D.

2009-07-21

In recent years, the genetic manipulation of the sulfate-reducing bacterium Desulfovibrio vulgaris Hildenborough has seen enormous progress. In spite of this progress, the current marker exchange deletion method does not allow for easy selection of multiple sequential gene deletions in a single strain because of the limited number of selectable markers available in D. vulgaris. To broaden the repertoire of genetic tools for manipulation, an in-frame, markerless deletion system has been developed. The counterselectable marker that makes this deletion system possible is the pyrimidine salvage enzyme, uracil phosphoribosyltransferase, encoded by upp. In wild-type D. vulgaris, growth was shown to be inhibited by the toxic pyrimidine analog 5-fluorouracil (5-FU), whereas a mutant bearing a deletion of the upp gene was resistant to 5-FU. When a plasmid containing the wild-type upp gene expressed constitutively from the aph(3')-II promoter (promoter for the kanamycin resistance gene in Tn5) was introduced into the upp deletion strain, sensitivity to 5-FU was restored. This observation allowed us to develop a two-step integration and excision strategy for the deletion of genes of interest. Since this in-frame deletion strategy does not retain an antibiotic cassette, multiple deletions can be generated in a single strain without the accumulation of genes conferring antibiotic resistances. We used this strategy to generate a deletion strain lacking the endonuclease (hsdR, DVU1703) of a type I restriction-modification system, which we designated JW7035. The transformation efficiency of the JW7035 strain was found to be 100 to 1000 times greater than that of the wild-type strain when stable plasmids were introduced via electroporation.
A Network Approach of Gene Co-expression in the Zea mays/Aspergillus flavus Pathosystem to Map Host/Pathogen Interaction Pathways.

PubMed

Musungu, Bryan M; Bhatnagar, Deepak; Brown, Robert L; Payne, Gary A; OBrian, Greg; Fakhoury, Ahmad M; Geisler, Matt

2016-01-01

A gene co-expression network (GEN) was generated using a dual RNA-seq study with the fungal pathogen Aspergillus flavus and its plant host Zea mays during the initial 3 days of infection. The analysis deciphered novel pathways and mapped genes of interest in both organisms during the infection. This network revealed a high degree of connectivity in many of the previously recognized pathways in Z. mays such as jasmonic acid, ethylene, and reactive oxygen species (ROS). For the pathogen A. flavus, a link between aflatoxin production and vesicular transport was identified within the network. There was significant interspecies correlation of expression between Z. mays and A. flavus for a subset of 104 Z. mays, and 1942 A. flavus genes. This resulted in an interspecies subnetwork enriched in multiple Z. mays genes involved in the production of ROS. In addition to the ROS from Z. mays, there was enrichment in the vesicular transport pathways and the aflatoxin pathway for A. flavus. Included in these genes, a key aflatoxin cluster regulator, AflS, was found to be co-regulated with multiple Z. mays ROS producing genes within the network, suggesting AflS may be monitoring host ROS levels. The entire GEN for both host and pathogen, and the subset of interspecies correlations, is presented as a tool for hypothesis generation and discovery for events in the early stages of fungal infection of Z. mays by A. flavus.
A Network Approach of Gene Co-expression in the Zea mays/Aspergillus flavus Pathosystem to Map Host/Pathogen Interaction Pathways

PubMed Central

Musungu, Bryan M.; Bhatnagar, Deepak; Brown, Robert L.; Payne, Gary A.; OBrian, Greg; Fakhoury, Ahmad M.; Geisler, Matt

2016-01-01

A gene co-expression network (GEN) was generated using a dual RNA-seq study with the fungal pathogen Aspergillus flavus and its plant host Zea mays during the initial 3 days of infection. The analysis deciphered novel pathways and mapped genes of interest in both organisms during the infection. This network revealed a high degree of connectivity in many of the previously recognized pathways in Z. mays such as jasmonic acid, ethylene, and reactive oxygen species (ROS). For the pathogen A. flavus, a link between aflatoxin production and vesicular transport was identified within the network. There was significant interspecies correlation of expression between Z. mays and A. flavus for a subset of 104 Z. mays, and 1942 A. flavus genes. This resulted in an interspecies subnetwork enriched in multiple Z. mays genes involved in the production of ROS. In addition to the ROS from Z. mays, there was enrichment in the vesicular transport pathways and the aflatoxin pathway for A. flavus. Included in these genes, a key aflatoxin cluster regulator, AflS, was found to be co-regulated with multiple Z. mays ROS producing genes within the network, suggesting AflS may be monitoring host ROS levels. The entire GEN for both host and pathogen, and the subset of interspecies correlations, is presented as a tool for hypothesis generation and discovery for events in the early stages of fungal infection of Z. mays by A. flavus. PMID:27917194
Extensive Error in the Number of Genes Inferred from Draft Genome Assemblies

PubMed Central

Denton, James F.; Lugo-Martinez, Jose; Tucker, Abraham E.; Schrider, Daniel R.; Warren, Wesley C.; Hahn, Matthew W.

2014-01-01

Current sequencing methods produce large amounts of data, but genome assemblies based on these data are often woefully incomplete. These incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. In this paper we investigate the magnitude of the problem, both in terms of total gene number and the number of copies of genes in specific families. To do this, we compare multiple draft assemblies against higher-quality versions of the same genomes, using several new assemblies of the chicken genome based on both traditional and next-generation sequencing technologies, as well as published draft assemblies of chimpanzee. We find that upwards of 40% of all gene families are inferred to have the wrong number of genes in draft assemblies, and that these incorrect assemblies both add and subtract genes. Using simulated genome assemblies of Drosophila melanogaster, we find that the major cause of increased gene numbers in draft genomes is the fragmentation of genes onto multiple individual contigs. Finally, we demonstrate the usefulness of RNA-Seq in improving the gene annotation of draft assemblies, largely by connecting genes that have been fragmented in the assembly process. PMID:25474019
Extensive error in the number of genes inferred from draft genome assemblies.

PubMed

Denton, James F; Lugo-Martinez, Jose; Tucker, Abraham E; Schrider, Daniel R; Warren, Wesley C; Hahn, Matthew W

2014-12-01

Current sequencing methods produce large amounts of data, but genome assemblies based on these data are often woefully incomplete. These incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. In this paper we investigate the magnitude of the problem, both in terms of total gene number and the number of copies of genes in specific families. To do this, we compare multiple draft assemblies against higher-quality versions of the same genomes, using several new assemblies of the chicken genome based on both traditional and next-generation sequencing technologies, as well as published draft assemblies of chimpanzee. We find that upwards of 40% of all gene families are inferred to have the wrong number of genes in draft assemblies, and that these incorrect assemblies both add and subtract genes. Using simulated genome assemblies of Drosophila melanogaster, we find that the major cause of increased gene numbers in draft genomes is the fragmentation of genes onto multiple individual contigs. Finally, we demonstrate the usefulness of RNA-Seq in improving the gene annotation of draft assemblies, largely by connecting genes that have been fragmented in the assembly process.
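The fragmentation effect described above can be made concrete with a toy sketch: when one gene is split across several contigs, a naive annotation counts each fragment as a separate gene, and transcript evidence that spans fragments can merge them back. The code below is only an illustration under that assumption, not the authors' pipeline; the fragment identifiers and the `count_genes` helper are hypothetical.

```python
def count_genes(fragments, links):
    """Count gene models after merging fragments joined by transcript evidence.

    fragments: list of annotated gene-fragment identifiers.
    links: pairs of fragments supported by the same transcript
           (for example, an RNA-Seq read pair spanning two contigs).
    With no links, every fragment is counted as its own gene.
    """
    parent = {f: f for f in fragments}

    def find(f):
        # Union-find root lookup with path halving.
        while parent[f] != f:
            parent[f] = parent[parent[f]]
            f = parent[f]
        return f

    for a, b in links:
        parent[find(a)] = find(b)

    return len({find(f) for f in fragments})


# Hypothetical draft assembly: one true gene is split across contig1 and contig2.
fragments = ["contig1:frag1", "contig2:frag1", "contig3:frag1", "contig4:frag1"]
rna_seq_links = [("contig1:frag1", "contig2:frag1")]

print("naive gene count:", count_genes(fragments, []))
print("after RNA-Seq merging:", count_genes(fragments, rna_seq_links))
```

Here the naive count is inflated by one relative to the merged count, which mirrors the direction of the error the abstract attributes to fragmentation.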
Synthetic Gene Network with Positive Feedback Loop Amplifies Cellulase Gene Expression in Neurospora crassa.

PubMed

Matsu-Ura, Toru; Dovzhenok, Andrey A; Coradetti, Samuel T; Subramanian, Krithika R; Meyer, Daniel R; Kwon, Jaesang J; Kim, Caleb; Salomonis, Nathan; Glass, N Louise; Lim, Sookkyung; Hong, Christian I

2018-05-18

Second-generation or lignocellulosic biofuels are a tangible source of renewable energy, which is critical to combat climate change by reducing the carbon footprint. Filamentous fungi secrete cellulose-degrading enzymes called cellulases, which are used for production of lignocellulosic biofuels. However, inefficient production of cellulases is a major obstacle for industrial-scale production of second-generation biofuels. We used computational simulations to design and implement synthetic positive feedback loops to increase gene expression of a key transcription factor, CLR-2, that activates a large number of cellulases in a filamentous fungus, Neurospora crassa. Overexpression of CLR-2 reveals previously unappreciated roles of CLR-2 in lignocellulosic gene network, which enabled simultaneous induction of approximately 50% of 78 lignocellulosic degradation-related genes in our engineered Neurospora strains. This engineering results in dramatically increased cellulase activity due to cooperative orchestration of multiple enzymes involved in the cellulose degradation pathway. Our work provides a proof of principle in utilizing mathematical modeling and synthetic biology to improve the efficiency of cellulase synthesis for second-generation biofuel production.
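To illustrate why a positive feedback loop can raise expression of a transcription factor, the following minimal sketch integrates a generic one-variable model in which the factor activates its own production through a Hill function. This is a textbook-style assumption, not the model used in the paper; the equation form and every parameter value (`basal`, `beta`, `K`, `n`, `gamma`) are hypothetical.

```python
def simulate(feedback, basal=0.2, beta=2.0, K=1.0, n=2, gamma=1.0,
             dt=0.01, t_end=50.0):
    """Forward-Euler integration of dx/dt = basal + feedback_term - gamma * x.

    With feedback=True the production term includes a Hill activation
    beta * x^n / (K^n + x^n) driven by x itself; with feedback=False only
    the basal production rate is present. Returns x at time t_end,
    starting from x = 0.
    """
    x = 0.0
    steps = int(t_end / dt)
    for _ in range(steps):
        production = basal
        if feedback:
            production += beta * x**n / (K**n + x**n)
        x += dt * (production - gamma * x)
    return x


without_loop = simulate(feedback=False)
with_loop = simulate(feedback=True)
print(f"steady level without feedback: {without_loop:.3f}")
print(f"steady level with feedback:    {with_loop:.3f}")
```

With these illustrative parameters the self-activating circuit settles at a several-fold higher level than the open-loop circuit; other parameter choices can instead produce bistability, which is one reason such designs are usually explored by simulation first.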
Multiplexed resequencing analysis to identify rare variants in pooled DNA with barcode indexing using next-generation sequencer.

PubMed

Mitsui, Jun; Fukuda, Yoko; Azuma, Kyo; Tozaki, Hirokazu; Ishiura, Hiroyuki; Takahashi, Yuji; Goto, Jun; Tsuji, Shoji

2010-07-01

We have recently found that multiple rare variants of the glucocerebrosidase gene (GBA) confer a robust risk for Parkinson disease, supporting the 'common disease-multiple rare variants' hypothesis. To develop an efficient method of identifying rare variants in a large number of samples, we applied multiplexed resequencing using a next-generation sequencer to identification of rare variants of GBA. Sixteen sets of pooled DNAs from six pooled DNA samples were prepared. Each set of pooled DNAs was subjected to polymerase chain reaction to amplify the target gene (GBA) covering 6.5 kb, pooled into one tube with barcode indexing, and then subjected to extensive sequence analysis using the SOLiD System. Individual samples were also subjected to direct nucleotide sequence analysis. With the optimization of data processing, we were able to extract all the variants from 96 samples with acceptable rates of false-positive single-nucleotide variants.
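The pooling design above (sixteen pools of six samples each, combined with barcode indexing) can be sketched as follows. The pool arithmetic comes from the abstract; the demultiplexing function, the barcode sequences and the read layout (barcode as a prefix of the read) are hypothetical simplifications, and a real instrument pipeline may handle barcodes differently.

```python
from collections import defaultdict

N_POOLS = 16           # pooled DNA sets, from the abstract
SAMPLES_PER_POOL = 6   # individual samples per pool, from the abstract
TOTAL_SAMPLES = N_POOLS * SAMPLES_PER_POOL


def demultiplex(reads, barcode_to_pool, barcode_len):
    """Assign reads to pools by their leading barcode.

    Reads whose barcode is not recognised are dropped. The barcode is
    trimmed off before the remaining sequence is stored.
    """
    bins = defaultdict(list)
    for read in reads:
        barcode, insert = read[:barcode_len], read[barcode_len:]
        pool = barcode_to_pool.get(barcode)
        if pool is not None:
            bins[pool].append(insert)
    return dict(bins)


# Hypothetical 4-base barcodes for two of the sixteen pools.
barcodes = {"ACGT": "pool01", "TTAG": "pool02"}
reads = ["ACGTTTGA", "TTAGGGCC", "NNNNAAAA"]

print("samples covered:", TOTAL_SAMPLES)
print(demultiplex(reads, barcodes, barcode_len=4))
```

Because each pool mixes six individuals, a variant called in a pool still has to be traced to a specific sample, which is consistent with the abstract's note that individual samples were also sequenced directly.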

Comparative Analysis of Evolutionary Mechanisms of the Hemagglutinin and Three Internal Protein Genes of Influenza B Virus: Multiple Cocirculating Lineages and Frequent Reassortment of the NP, M, and NS Genes

PubMed Central

Lindstrom, Stephen E.; Hiromoto, Yasuaki; Nishimura, Hidekazu; Saito, Takehiko; Nerome, Reiko; Nerome, Kuniaki

1999-01-01

Phylogenetic profiles of the genes coding for the hemagglutinin (HA) protein, nucleoprotein (NP), matrix (M) protein, and nonstructural (NS) proteins of influenza B viruses isolated from 1940 to 1998 were analyzed in a parallel manner in order to understand the evolutionary mechanisms of these viruses. Unlike human influenza A (H3N2) viruses, the evolutionary pathways of all four genes of recent influenza B viruses revealed similar patterns of genetic divergence into two major lineages. Although evolutionary rates of the HA, NP, M, and NS genes of influenza B viruses were estimated to be generally lower than those of human influenza A viruses, genes of influenza B viruses demonstrated complex phylogenetic patterns, indicating alternative mechanisms for generation of virus variability. Topologies of the evolutionary trees of each gene were determined to be quite distinct from one another, showing that these genes were evolving in an independent manner. Furthermore, variable topologies were apparently the result of frequent genetic exchange among cocirculating epidemic viruses. Evolutionary analysis done in the present study provided further evidence for cocirculation of multiple lineages as well as sequestering and reemergence of phylogenetic lineages of the internal genes. In addition, comparison of deduced amino acid sequences revealed a novel amino acid deletion in the HA1 domain of the HA protein of recent isolates from 1998 belonging to the B/Yamagata/16/88-like lineage. It thus became apparent that, despite lower evolutionary rates, influenza B viruses were able to generate genetic diversity among circulating viruses through a combination of evolutionary mechanisms involving cocirculating lineages and genetic reassortment by which new variants with distinct gene constellations emerged. PMID:10196339
An Unbiased Assessment of the Role of Imprinted Genes in an Intergenerational Model of Developmental Programming

PubMed Central

Radford, Elizabeth J.; Isganaitis, Elvira; Jimenez-Chillaron, Josep; Schroeder, Joshua; Molla, Michael; Andrews, Simon; Didier, Nathalie; Charalambous, Marika; McEwen, Kirsten; Marazzi, Giovanna; Sassoon, David; Patti, Mary-Elizabeth; Ferguson-Smith, Anne C.

2012-01-01

Environmental factors during early life are critical for the later metabolic health of the individual and of future progeny. In our obesogenic environment, it is of great socioeconomic importance to investigate the mechanisms that contribute to the risk of metabolic ill health. Imprinted genes, a class of functionally mono-allelic genes critical for early growth and metabolic axis development, have been proposed to be uniquely susceptible to environmental change. Furthermore, it has also been suggested that perturbation of the epigenetic reprogramming of imprinting control regions (ICRs) may play a role in phenotypic heritability following early life insults. Alternatively, the presence of multiple layers of epigenetic regulation may in fact protect imprinted genes from such perturbation. Unbiased investigation of these alternative hypotheses requires assessment of imprinted gene expression in the context of the response of the whole transcriptome to environmental assault. We therefore analyse the role of imprinted genes in multiple tissues in two affected generations of an established murine model of the developmental origins of health and disease using microarrays and quantitative RT–PCR. We demonstrate that, despite the functional mono-allelicism of imprinted genes and their unique mechanisms of epigenetic dosage control, imprinted genes as a class are neither more susceptible nor protected from expression perturbation induced by maternal undernutrition in either the F1 or the F2 generation compared to other genes. Nor do we find any evidence that the epigenetic reprogramming of ICRs in the germline is susceptible to nutritional restriction. However, we propose that those imprinted genes that are affected may play important roles in the foetal response to undernutrition and potentially its long-term sequelae. We suggest that recently described instances of dosage regulation by relaxation of imprinting are rare and likely to be highly regulated. PMID:22511876
A high-throughput method for the detection of homoeologous gene deletions in hexaploid wheat

PubMed Central

2010-01-01

Background Mutational inactivation of plant genes is an essential tool in gene function studies. Plants with inactivated or deleted genes may also be exploited for crop improvement if such mutations/deletions produce a desirable agronomical and/or quality phenotype. However, the use of mutational gene inactivation/deletion has been impeded in polyploid plant species by genetic redundancy, as polyploids contain multiple copies of the same genes (homoeologous genes) encoded by each of the ancestral genomes. Similar to many other crop plants, bread wheat (Triticum aestivum L.) is polyploid; specifically allohexaploid possessing three progenitor genomes designated as 'A', 'B', and 'D'. Recently modified TILLING protocols have been developed specifically for mutation detection in wheat. Whilst extremely powerful in detecting single nucleotide changes and small deletions, these methods are not suitable for detecting whole gene deletions. Therefore, high-throughput methods for screening of candidate homoeologous gene deletions are needed for application to wheat populations generated by the use of certain mutagenic agents (e.g. heavy ion irradiation) that frequently generate whole-gene deletions. Results To facilitate the screening for specific homoeologous gene deletions in hexaploid wheat, we have developed a TaqMan qPCR-based method that allows high-throughput detection of deletions in homoeologous copies of any gene of interest, provided that sufficient polymorphism (as little as a single nucleotide difference) amongst homoeologues exists for specific probe design. We used this method to identify deletions of individual TaPFT1 homoeologues, a wheat orthologue of the disease susceptibility and flowering regulatory gene PFT1 in Arabidopsis. This method was applied to wheat nullisomic-tetrasomic lines as well as other chromosomal deletion lines to locate the TaPFT1 gene to the long arm of chromosome 5. 
By screening of individual DNA samples from 4500 M2 mutant wheat lines generated by heavy ion irradiation, we detected multiple mutants with deletions of each TaPFT1 homoeologue, and confirmed these deletions using a CAPS method. We have subsequently designed, optimized, and applied this method for the screening of homoeologous deletions of three additional wheat genes putatively involved in plant disease resistance. Conclusions We have developed a method for automated, high-throughput screening to identify deletions of individual homoeologues of a wheat gene. This method is also potentially applicable to other polyploid plants. PMID:21114819
Genome structure drives patterns of gene family evolution in ciliates, a case study using Chilodonella uncinata (Protista, Ciliophora, Phyllopharyngea)

PubMed Central

Gao, Feng; Song, Weibo; Katz, Laura A.

2014-01-01

In most lineages, diversity among gene family members results from gene duplication followed by sequence divergence. Because of the genome rearrangements during the development of somatic nuclei, gene family evolution in ciliates involves more complex processes. Previous work on the ciliate Chilodonella uncinata revealed that macronuclear β-tubulin gene family members are generated by alternative processing, in which germline regions are alternatively used in multiple macronuclear chromosomes. To further study genome evolution in this ciliate, we analyzed its transcriptome and found that: 1) alternative processing is extensive among gene families; and 2) such gene families are likely to be C. uncinata-specific. We characterized additional macronuclear and micronuclear copies of one candidate alternatively processed gene family -- a protein kinase domain containing protein (PKc) -- from two C. uncinata strains. Analysis of the PKc sequences reveals: 1) multiple PKc gene family members in the macronucleus share some identical regions flanked by divergent regions; and 2) the shared identical regions are processed from a single micronuclear chromosome. We discuss analogous processes in lineages across the eukaryotic tree of life to provide further insights on the impact of genome structure on gene family evolution in eukaryotes. PMID:24749903
MYD88 and functionally related genes are associated with multiple infections in a model population of Kenyan village dogs.

PubMed

Necesankova, Michaela; Vychodilova, Leona; Albrechtova, Katerina; Kennedy, Lorna J; Hlavac, Jan; Sedlak, Kamil; Modry, David; Janova, Eva; Vyskocil, Mirko; Horin, Petr

2016-12-01

The purpose of this study was to seek associations between immunity-related molecular markers and endemic infections in a model population of African village dogs from Northern Kenya with no veterinary care and no selective breeding. A population of village dogs from Northern Kenya composed of three sub-populations from three different areas (84, 50 and 55 dogs) was studied. Canine distemper virus (CDV), Hepatozoon canis, Microfilariae (Acanthocheilonema dracunculoides, Acanthocheilonema reconditum) and Neospora caninum were the pathogens studied. The presence of antibodies (CDV, Neospora), light microscopy (Hepatozoon) and diagnostic PCR (Microfilariae) were the methods used for diagnosing infection. Genes involved in innate immune mechanisms, NOS3, IL6, TLR1, TLR2, TLR4, TLR7, TLR9, LY96, MYD88, and three major histocompatibility complex class II genes were selected as candidates. Single nucleotide polymorphism (SNP) markers were detected by Sanger sequencing, next generation sequencing and PCR-RFLP. Fisher's exact test for additive and non-additive models was used for association analyses. Three SNPs within the MYD88 gene and one TLR4 SNP marker were associated with more than one infection. Combined genotypes and further markers identified by next generation sequencing confirmed associations observed for individual genes. The genes associated with infection and their combinations in specific genotypes match well our knowledge on their biological role and on the role of the relevant biological pathways, respectively. Associations with multiple infections observed between the MYD88 and TLR4 genes suggest their involvement in the mechanisms of anti-infectious defenses in dogs.
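The association step named in this abstract can be illustrated with a two-sided Fisher's exact test on a 2x2 table (for example, carriers versus non-carriers of an allele against infected versus uninfected animals). The sketch below is a minimal standard-library version and not the authors' analysis code; the function name and the toy counts are assumptions made for illustration, and the additive model mentioned in the abstract would need a larger genotype table than the 2x2 case shown here.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell with fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum all tables that are as likely as, or less likely than, the observed one.
    # The small relative tolerance guards against floating-point ties.
    return sum(
        p for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= p_obs * (1 + 1e-9)
    )


# Toy table (invented counts): 3 infected carriers, 0 uninfected carriers,
# 0 infected non-carriers, 3 uninfected non-carriers.
print(fisher_exact_two_sided(3, 0, 0, 3))  # approximately 0.1
```

With many markers and several infections tested, the resulting p-values would normally also need a multiple-testing correction, which this sketch leaves out.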
Hairpin RNA Targeting Multiple Viral Genes Confers Strong Resistance to Rice Black-Streaked Dwarf Virus.

PubMed

Wang, Fangquan; Li, Wenqi; Zhu, Jinyan; Fan, Fangjun; Wang, Jun; Zhong, Weigong; Wang, Ming-Bo; Liu, Qing; Zhu, Qian-Hao; Zhou, Tong; Lan, Ying; Zhou, Yijun; Yang, Jie

2016-05-11

Rice black-streaked dwarf virus (RBSDV) belongs to the genus Fijivirus in the family of Reoviridae and causes severe yield loss in rice-producing areas in Asia. RNA silencing, as a natural defence mechanism against plant viruses, has been successfully exploited for engineering virus resistance in plants, including rice. In this study, we generated transgenic rice lines harbouring a hairpin RNA (hpRNA) construct targeting four RBSDV genes, S1, S2, S6 and S10, encoding the RNA-dependent RNA polymerase, the putative core protein, the RNA silencing suppressor and the outer capsid protein, respectively. Both field nursery and artificial inoculation assays of three generations of the transgenic lines showed that they had strong resistance to RBSDV infection. The RBSDV resistance in the segregating transgenic populations correlated perfectly with the presence of the hpRNA transgene. Furthermore, the hpRNA transgene was expressed in the highly resistant transgenic lines, giving rise to abundant levels of 21-24 nt small interfering RNA (siRNA). By small RNA deep sequencing, the RBSDV-resistant transgenic lines detected siRNAs from all four viral gene sequences in the hpRNA transgene, indicating that the whole chimeric fusion sequence can be efficiently processed by Dicer into siRNAs. Taken together, our results suggest that long hpRNA targeting multiple viral genes can be used to generate stable and durable virus resistance in rice, as well as other plant species.
A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

PubMed Central

Marques, M Carmen; Alonso-Cantabrana, Hugo; Forment, Javier; Arribas, Raquel; Alamar, Santiago; Conejero, Vicente; Perez-Amador, Miguel A

2009-01-01

Background Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. 
Conclusion The new EST collection denotes an important step towards the identification of all genes in the citrus genome. Furthermore, public availability of the cDNA clones generated in this study, and not only their sequence, enables testing of the biological function of the genes represented in the collection. Expression of the citrus SEP3 homologue, CitrSEP, in Arabidopsis results in early flowering, along with other phenotypes resembling the over-expression of the Arabidopsis SEPALLATA genes. Our findings suggest that the members of the SEP gene family play similar roles in these quite distant plant species. PMID:19747386
Darwin Assembly: fast, efficient, multi-site bespoke mutagenesis

PubMed Central

Cozens, Christopher

2018-01-01

Abstract Engineering proteins for designer functions and biotechnological applications almost invariably requires (or at least benefits from) multiple mutations to non-contiguous residues. Several methods for multiple site-directed mutagenesis exist, but there remains a need for fast and simple methods to efficiently introduce such mutations – particularly for generating large, high quality libraries for directed evolution. Here, we present Darwin Assembly, which can deliver high quality libraries of >10^8 transformants, targeting multiple (>10) distal sites with minimal wild-type contamination (<0.25% of total population) and which takes a single working day from purified plasmid to library transformation. We demonstrate its efficacy with whole gene codon reassignment of chloramphenicol acetyl transferase, mutating 19 codons in a single reaction in KOD DNA polymerase and generating high quality, multiple-site libraries in T7 RNA polymerase and Tgo DNA polymerase. Darwin Assembly uses commercially available enzymes, can be readily automated, and offers a cost-effective route to highly complex and customizable library generation. PMID:29409059
NON-MENDELIAN ETIOLOGIC FACTORS IN NEUROPSYCHIATRIC ILLNESS: PLEIOTROPY, EPIGENETICS, AND CONVERGENCE

PubMed Central

Deutsch, Curtis K; McIlvane, William J

2013-01-01

The target article by Charney on behavior genetics/genomics discusses how numerous molecular factors can inform heritability estimations and genetic association studies. These factors find application in the search for genes for behavioral phenotypes, including neuropsychiatric disorders. We elaborate upon how single causal factors can generate multiple phenotypes, and discuss how multiple causal factors may converge on common neurodevelopmental mechanisms. PMID:23095384
A method for release and multiple strand amplification of small quantities of DNA from endospores of the fastidious bacterium Pasteuria penetrans.

PubMed

Mauchline, T H; Mohan, S; Davies, K G; Schaff, J E; Opperman, C H; Kerry, B R; Hirsch, P R

2010-05-01

To establish a reliable protocol to extract DNA from Pasteuria penetrans endospores for use as template in multiple strand amplification, thus providing sufficient material for genetic analyses. To develop a highly sensitive PCR-based diagnostic tool for P. penetrans. An optimized method to decontaminate endospores, release and purify DNA enabled multiple strand amplification. DNA purity was assessed by cloning and sequencing gyrB and 16S rRNA gene fragments obtained from PCR using generic primers. Samples indicated to be 100% P. penetrans by the gyrB assay were estimated at 46% using the 16S rRNA gene. No bias was detected on cloning and sequencing 12 housekeeping and sporulation gene fragments from amplified DNA. The detection limit by PCR with Pasteuria-specific 16S rRNA gene primers following multiple strand amplification of DNA extracted using the method was a single endospore. Generation of large quantities of DNA will facilitate genomic sequencing of P. penetrans. Apparent differences in sample purity are explained by variations in 16S rRNA gene copy number in Eubacteria leading to exaggerated estimations of sample contamination. Detection of single endospores will facilitate investigations of P. penetrans molecular ecology. These methods will advance studies on P. penetrans and facilitate research on other obligate and fastidious micro-organisms where it is currently impractical to obtain DNA in sufficient quantity and quality.
Generation of siRNA Nanosheets for Efficient RNA Interference

NASA Astrophysics Data System (ADS)

Kim, Hyejin; Lee, Jae Sung; Lee, Jong Bum

2016-04-01

After the discovery of small interfering RNA (siRNA), nanostructured siRNA delivery systems have been introduced to achieve efficient regulation of target gene expression. Here we report a new siRNA-generating two-dimensional nanostructure in the form of a nanosized sheet. Inspired by the tunable mechanical and functional properties of the previously reported RNA membrane, siRNA nanosized sheets (siRNA-NS) with multiple Dicer cleavage sites were prepared. The siRNA-NS has a two-dimensional structure, providing a large surface area for Dicer to cleave the siRNA-NS for the generation of functional siRNAs. Furthermore, downregulation of the cellular target gene expression was achieved by delivery of siRNA-NS without chemical modification of RNA strands or conjugation to other substances.
Multiple independent insertions of 5S rRNA genes in the spliced-leader gene family of trypanosome species.

PubMed

Beauparlant, Marc A; Drouin, Guy

2014-02-01

Analyses of the 5S rRNA genes found in the spliced-leader (SL) gene repeat units of numerous trypanosome species suggest that such linkages were not inherited from a common ancestor, but were the result of independent 5S rRNA gene insertions. In trypanosomes, 5S rRNA genes are found either in the tandemly repeated units coding for SL genes or in independent tandemly repeated units. Given that trypanosome species where 5S rRNA genes are within the tandemly repeated units coding for SL genes are phylogenetically related, one might hypothesize that this arrangement is the result of an ancestral insertion of 5S rRNA genes into the tandemly repeated SL gene family of trypanosomes. Here, we use the types of 5S rRNA genes found associated with SL genes, the flanking regions of the inserted 5S rRNA genes and the position of these insertions to show that most of the 5S rRNA genes found within SL gene repeat units of trypanosome species were not acquired from a common ancestor but are the results of independent insertions. These multiple 5S rRNA gene insertion events in trypanosomes are likely the result of frequent founder events in different hosts and/or geographical locations in species having short generation times.
Structural and functional diversity of CLAVATA3/ESR (CLE)-like genes from the potato cyst nematode Globodera rostochiensis.

PubMed

Lu, Shun-Wen; Chen, Shiyan; Wang, Jianying; Yu, Hang; Chronis, Demosthenis; Mitchum, Melissa G; Wang, Xiaohong

2009-09-01

Plant CLAVATA3/ESR-related (CLE) peptides have diverse roles in plant growth and development. Here, we report the isolation and functional characterization of five new CLE genes from the potato cyst nematode Globodera rostochiensis. Unlike typical plant CLE peptides that contain a single CLE motif, four of the five Gr-CLE genes encode CLE proteins with multiple CLE motifs. These Gr-CLE genes were found to be specifically expressed within the dorsal esophageal gland cell of nematode parasitic stages, suggesting a role for their encoded proteins in plant parasitism. Overexpression phenotypes of Gr-CLE genes in Arabidopsis mimicked those of plant CLE genes, and Gr-CLE proteins could rescue the Arabidopsis clv3-2 mutant phenotype when expressed within meristems. A short root phenotype was observed when synthetic GrCLE peptides were exogenously applied to roots of Arabidopsis or potato similar to the overexpression of Gr-CLE genes in Arabidopsis and potato hairy roots. These results reveal that G. rostochiensis CLE proteins with either single or multiple CLE motifs function similarly to plant CLE proteins and that CLE signaling components are conserved in both Arabidopsis and potato roots. Furthermore, our results provide evidence to suggest that the evolution of multiple CLE motifs may be an important mechanism for generating functional diversity in nematode CLE proteins to facilitate parasitism.
SNPGenie: estimating evolutionary parameters to detect natural selection using pooled next-generation sequencing data.

PubMed

Nelson, Chase W; Moncla, Louise H; Hughes, Austin L

2015-11-15

New applications of next-generation sequencing technologies use pools of DNA from multiple individuals to estimate population genetic parameters. However, no publicly available tools exist to analyse single-nucleotide polymorphism (SNP) calling results directly for evolutionary parameters important in detecting natural selection, including nucleotide diversity and gene diversity. We have developed SNPGenie to fill this gap. The user submits a FASTA reference sequence(s), a Gene Transfer Format (.GTF) file with CDS information and a SNP report(s) in an increasing selection of formats. The program estimates nucleotide diversity, distance from the reference and gene diversity. Sites are flagged for multiple overlapping reading frames, and are categorized by polymorphism type: nonsynonymous, synonymous, or ambiguous. The results allow single nucleotide, single codon, sliding window, whole gene and whole genome/population analyses that aid in the detection of positive and purifying natural selection in the source population. SNPGenie version 1.2 is a Perl program with no additional dependencies. It is free, open-source, and available for download at https://github.com/hugheslab/snpgenie. Contact: nelsoncw@email.sc.edu or austin@biol.sc.edu. Supplementary data are available at Bioinformatics online.
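To make the notion of nucleotide diversity from pooled reads concrete, the following sketch computes the mean number of pairwise differences at a site from per-nucleotide read counts and averages it over a region. It is written in Python for illustration only and is not SNPGenie's code (SNPGenie is a Perl program); the function names, the input layout and the example counts are assumptions, and the split into nonsynonymous and synonymous diversity that the abstract describes is not attempted.

```python
from itertools import combinations


def site_pairwise_diversity(allele_counts):
    """Mean pairwise difference at one site.

    allele_counts maps each nucleotide to its read count in the pool,
    e.g. {"A": 5, "G": 5}.
    """
    n = sum(allele_counts.values())
    if n < 2:
        return 0.0
    # Number of read pairs that carry two different nucleotides.
    differing_pairs = sum(
        c1 * c2 for c1, c2 in combinations(allele_counts.values(), 2)
    )
    total_pairs = n * (n - 1) / 2
    return differing_pairs / total_pairs


def nucleotide_diversity(variable_sites, region_length):
    """Average per-site diversity; invariant sites contribute zero."""
    return sum(site_pairwise_diversity(s) for s in variable_sites) / region_length


# Hypothetical pool: one variable site in a 10-nucleotide region.
print(nucleotide_diversity([{"A": 5, "G": 5}], region_length=10))
```

Treating each read as an independent draw from the pool is itself an assumption; real pooled data also carry sequencing error and uneven coverage.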
Pyviko: an automated Python tool to design gene knockouts in complex viruses with overlapping genes.

PubMed

Taylor, Louis J; Strebel, Klaus

2017-01-07

Gene knockouts are a common tool used to study gene function in various organisms. However, designing gene knockouts is complicated in viruses, which frequently contain sequences that code for multiple overlapping genes. Designing mutants that can be traced by the creation of new or elimination of existing restriction sites further compounds the difficulty in experimental design of knockouts of overlapping genes. While software is available to rapidly identify restriction sites in a given nucleotide sequence, no existing software addresses experimental design of mutations involving multiple overlapping amino acid sequences in generating gene knockouts. Pyviko performed well on a test set of over 240,000 gene pairs collected from viral genomes deposited in the National Center for Biotechnology Information Nucleotide database, identifying a point mutation which added a premature stop codon within the first 20 codons of the target gene in 93.2% of all tested gene-overprinted gene pairs. This shows that Pyviko can be used successfully in a wide variety of contexts to facilitate the molecular cloning and study of viral overprinted genes. Pyviko is an extensible and intuitive Python tool for designing knockouts of overlapping genes. Freely available as both a Python package and a web-based interface ( http://louiejtaylor.github.io/pyViKO/ ), Pyviko simplifies the experimental design of gene knockouts in complex viruses with overlapping genes.
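The design problem described here, adding a premature stop codon to one gene without altering the protein encoded by an overlapping reading frame, can be sketched as a brute-force search over point mutations. The code below is an illustrative assumption and does not use or reproduce Pyviko's API; the function names, the 0-based coordinates and the toy sequence are invented for the example, and only the standard genetic code is considered.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, listed in TCAG order for each codon position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq, frame=0):
    """Translate every complete codon of seq, starting at offset frame."""
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


def find_knockout_mutations(seq, overlap_frame, max_codons=20):
    """List (position, new_base, codon_index) point mutations that create a stop
    codon in the frame-0 gene while leaving the overlapping frame's protein
    unchanged. Positions are 0-based; the start codon is never mutated."""
    original_overlap = translate(seq, overlap_frame)
    hits = []
    n_codons = min(max_codons, len(seq) // 3)
    for pos in range(3, n_codons * 3):
        start = pos - pos % 3
        if CODON_TABLE[seq[start:start + 3]] == "*":
            continue  # already a stop codon, nothing to gain
        for base in BASES:
            if base == seq[pos]:
                continue
            mutant = seq[:pos] + base + seq[pos + 1:]
            if (CODON_TABLE[mutant[start:start + 3]] == "*"
                    and translate(mutant, overlap_frame) == original_overlap):
                hits.append((pos, base, pos // 3))
    return hits


# Toy sequence: frame 0 reads M-Q-G-L and frame +1 reads C-K-V.
toy = "ATGCAAGGTCTC"
print(find_knockout_mutations(toy, overlap_frame=1))  # [(3, 'T', 1)]
```

In the toy case the C-to-T change turns CAA into TAA in the target frame, while the overlapping codon changes from TGC to TGT and still encodes cysteine. Tracing the mutant by a gained or lost restriction site, which the abstract also mentions, would be an additional filter on the returned list.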
Analysis of the New Zealand Black contribution to lupus-like renal disease

DOE Office of Scientific and Technical Information (OSTI.GOV)

Drake, C.G.; Rozzo, S.J.; Hirschfeld, H.F.

1995-03-01

F1 progeny of New Zealand Black (NZB) and New Zealand White (NZW) mice spontaneously develop an autoimmune process remarkably similar to human systemic lupus erythematosus. Previous studies have implicated major genetic contributions from the NZW MHC and from a dominant NZB gene on chromosome 4. To identify additional NZB contributions to lupus-like disease, (NZB x SM/J)F1 x NZW backcross mice were followed for the development of severe renal disease and were comprehensively genotyped. Despite a 50% incidence of disease, significant associations between the presence of the NZB genotype and disease were noted on chromosomes 1, 4, 7, 10, 13, and 19. The data indicated that multiple NZB genes, in different combinations, contribute to severe renal disease, and that no single gene is required. To further investigate this NZB contribution, NZB x SM/J (NXSM) recombinant inbred (RI) strains were crossed with NZW mice, and F1 progeny were analyzed for the presence of lupus-like renal disease. Interestingly, nearly all of the (RI x NZW)F1 cohorts studied expressed some level of disease. Five RI strains generated a high incidence of disease, similar to (NZB x NZW)F1 mice, and nearly one-half of the cohorts developed disease at intermediate levels. Only two cohorts demonstrated very little disease, supporting the conclusion that multiple genes are capable of disease induction. Experiments correlating the genotypes of these RI strains with their ability to generate disease revealed that none of the disease-associated loci defined by the backcross analysis were present in all five RI strains that generated disease at high levels. Overall, both the backcross data and RI analysis provide additional support for the genetic complexity of lupus nephritis and uphold the conclusion that heterogeneous combinations of contributing NZB genes seem to operate in a threshold manner to generate the disease phenotype. 31 refs., 3 figs., 2 tabs.
Animal models of pituitary neoplasia

PubMed Central

Lines, K.E.; Stevenson, M.; Thakker, R.V.

2016-01-01

Pituitary neoplasias can occur as part of a complex inherited disorder, or more commonly as sporadic (non-familial) disease. Studies of the molecular and genetic mechanisms causing such pituitary tumours have identified dysregulation of >35 genes, with many revealed by studies in mice, rats and zebrafish. Strategies used to generate these animal models have included gene knockout, gene knockin and transgenic over-expression, as well as chemical mutagenesis and drug induction. These animal models provide an important resource for investigation of tissue-specific tumourigenic mechanisms, and evaluations of novel therapies, illustrated by studies into multiple endocrine neoplasia type 1 (MEN1), a hereditary syndrome in which ∼30% of patients develop pituitary adenomas. This review describes animal models of pituitary neoplasia that have been generated, together with some recent advances in gene editing technologies, and an illustration of the use of the Men1 mouse as a pre clinical model for evaluating novel therapies. PMID:26320859
Multiple point mutations in a shuttle vector propagated in human cells: evidence for an error-prone DNA polymerase activity.

PubMed

Seidman, M M; Bredberg, A; Seetharam, S; Kraemer, K H

1987-07-01

Mutagenesis was studied at the DNA-sequence level in human fibroblast and lymphoid cells by use of a shuttle vector plasmid, pZ189, containing a suppressor tRNA marker gene. In a series of experiments, 62 plasmids were recovered that had two to six base substitutions in the 160-base-pair marker gene. Approximately 20-30% of the mutant plasmids that were recovered after passing ultraviolet-treated pZ189 through a repair-proficient human fibroblast line contained these multiple mutations. In contrast, passage of ultraviolet-treated pZ189 through an excision-repair-deficient (xeroderma pigmentosum) line yielded only 2% multiple base substitution mutants. Introducing a single-strand nick in otherwise unmodified pZ189 adjacent to the marker, followed by passage through the xeroderma pigmentosum cells, resulted in about 66% multiple base substitution mutants. The multiple mutations were found in a 160-base-pair region containing the marker gene but were rarely found in an adjacent 170-base-pair region. Passing ultraviolet-treated or nicked pZ189 through a repair-proficient human B-cell line also yielded multiple base substitution mutations in 20-33% of the mutant plasmids. An explanation for these multiple mutations is that they were generated by an error-prone polymerase while filling gaps. These mutations share many of the properties displayed by mutations in the immunoglobulin hypervariable regions.
TRAM (Transcriptome Mapper): database-driven creation and analysis of transcriptome maps from multiple sources

PubMed Central

2011-01-01

Background Several tools have been developed to perform global gene expression profile data analysis, to search for specific chromosomal regions whose features meet defined criteria as well as to study neighbouring gene expression. However, most of these tools are tailored for a specific use in a particular context (e.g. they are species-specific, or limited to a particular data format) and they typically accept only gene lists as input. Results TRAM (Transcriptome Mapper) is a new general tool that allows the simple generation and analysis of quantitative transcriptome maps, starting from any source listing gene expression values for a given gene set (e.g. expression microarrays), implemented as a relational database. It includes a parser able to assign univocal and updated gene symbols to gene identifiers from different data sources. Moreover, TRAM is able to perform intra-sample and inter-sample data normalization, including an original variant of quantile normalization (scaled quantile), useful to normalize data from platforms with highly different numbers of investigated genes. When in 'Map' mode, the software generates a quantitative representation of the transcriptome of a sample (or of a pool of samples) and identifies if segments of defined lengths are over/under-expressed compared to the desired threshold. When in 'Cluster' mode, the software searches for a set of over/under-expressed consecutive genes. Statistical significance for all results is calculated with respect to genes localized on the same chromosome or to all genome genes. Transcriptome maps, showing differential expression between two sample groups, relative to two different biological conditions, may be easily generated. We present the results of a biological model test, based on a meta-analysis comparison between a sample pool of human CD34+ hematopoietic progenitor cells and a sample pool of megakaryocytic cells. 
Biologically relevant chromosomal segments and gene clusters with differential expression during the differentiation toward megakaryocyte were identified. Conclusions TRAM is designed to create, and statistically analyze, quantitative transcriptome maps, based on gene expression data from multiple sources. The release includes FileMaker Pro database management runtime application and it is freely available at http://apollo11.isto.unibo.it/software/, along with preconfigured implementations for mapping of human, mouse and zebrafish transcriptomes. PMID:21333005
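The idea behind the 'Cluster' mode, a search for runs of consecutive over-expressed genes along a chromosome, can be sketched in a few lines. This is a simplified illustration under stated assumptions and not TRAM's implementation (TRAM is distributed as a FileMaker Pro database application and also computes statistical significance); the function name, the threshold, the minimum run length and the gene symbols are all hypothetical.

```python
from itertools import groupby


def overexpressed_runs(genes, threshold, min_length=3):
    """Find runs of consecutive over-expressed genes.

    genes is a list of (symbol, expression_ratio) tuples already ordered by
    chromosomal position; a gene counts as over-expressed when its ratio is
    at or above threshold.
    """
    runs = []
    for is_over, group in groupby(genes, key=lambda g: g[1] >= threshold):
        members = [symbol for symbol, _ in group]
        if is_over and len(members) >= min_length:
            runs.append(members)
    return runs


# Invented expression ratios for six neighbouring genes on one chromosome.
chromosome = [("g1", 0.9), ("g2", 2.1), ("g3", 2.5),
              ("g4", 3.0), ("g5", 1.0), ("g6", 2.2)]
print(overexpressed_runs(chromosome, threshold=2.0))  # [['g2', 'g3', 'g4']]
```

Under-expressed clusters could be found the same way by reversing the comparison.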
Effects of 60 Hz sinusoidal magnetic field on in vitro establishment, multiplication, and acclimatization phases of Coffea arabica seedlings.

PubMed

Isaac Alemán, Elizabeth; Oliveira Moreira, Rafael; Almeida Lima, Andre; Chaves Silva, Samuel; González-Olmedo, Justo Lorenzo; Chalfun-Junior, Antonio

2014-09-01

The influence of extremely low frequency electromagnetic fields on net photosynthesis, transpiration, photosynthetic pigment concentration, and gene expression of ribulose 1,5-bisphosphate carboxylase/oxygenase small subunit (RBCS1) was investigated during the in vitro establishment, in vitro multiplication and acclimatization phases of coffee seedlings. Untreated coffee plants were considered as control, whereas treated plants were exposed to a 60 Hz sinusoidal magnetic field of 2 mT of magnetic induction for 3 min. This magnetic field was generated by an electromagnet connected to a wave generator. The results revealed that magnetically treated plants showed a significant increase in net photosynthesis (85.4% and 117.9%, in multiplication and acclimatization phases, respectively), and in photosynthetic pigment concentration (66.6% for establishment phase, 79.9% for multiplication phase, and 43.8% for acclimatization phase). They also showed a differential RBCS1 gene expression (approximately twofold) and a decrease of transpiration rates relative to their control plants. In conclusion, the findings suggest that the application of a 60 Hz magnetic field to in vitro coffee plants may improve seedling quality by modifying some photosynthetic physiological and molecular processes, increasing their vigor, and ensuring better plant development in later stages.

Unity in defence: honeybee workers exhibit conserved molecular responses to diverse pathogens.

PubMed

Doublet, Vincent; Poeschl, Yvonne; Gogol-Döring, Andreas; Alaux, Cédric; Annoscia, Desiderato; Aurori, Christian; Barribeau, Seth M; Bedoya-Reina, Oscar C; Brown, Mark J F; Bull, James C; Flenniken, Michelle L; Galbraith, David A; Genersch, Elke; Gisder, Sebastian; Grosse, Ivo; Holt, Holly L; Hultmark, Dan; Lattorff, H Michael G; Le Conte, Yves; Manfredini, Fabio; McMahon, Dino P; Moritz, Robin F A; Nazzi, Francesco; Niño, Elina L; Nowick, Katja; van Rij, Ronald P; Paxton, Robert J; Grozinger, Christina M

2017-03-02

Organisms typically face infection by diverse pathogens, and hosts are thought to have developed specific responses to each type of pathogen they encounter. The advent of transcriptomics now makes it possible to test this hypothesis and compare host gene expression responses to multiple pathogens at a genome-wide scale. Here, we performed a meta-analysis of multiple published and new transcriptomes using a newly developed bioinformatics approach that filters genes based on their expression profile across datasets. Thereby, we identified common and unique molecular responses of a model host species, the honey bee (Apis mellifera), to its major pathogens and parasites: the Microsporidia Nosema apis and Nosema ceranae, RNA viruses, and the ectoparasitic mite Varroa destructor, which transmits viruses. We identified a common suite of genes and conserved molecular pathways that respond to all investigated pathogens, a result that suggests a commonality in response mechanisms to diverse pathogens. We found that genes differentially expressed after infection exhibit a higher evolutionary rate than non-differentially expressed genes. Using our new bioinformatics approach, we unveiled additional pathogen-specific responses of honey bees; we found that apoptosis appeared to be an important response following microsporidian infection, while genes from the immune signalling pathways, Toll and Imd, were differentially expressed after Varroa/virus infection. Finally, we applied our bioinformatics approach and generated a gene co-expression network to identify highly connected (hub) genes that may represent important mediators and regulators of anti-pathogen responses. Our meta-analysis generated a comprehensive overview of the host metabolic and other biological processes that mediate interactions between insects and their pathogens. We identified key host genes and pathways that respond to phylogenetically diverse pathogens, representing an important source for future functional studies as well as offering new routes to identify or generate pathogen-resilient honey bee stocks. The statistical and bioinformatics approaches that were developed for this study are broadly applicable to synthesize information across transcriptomic datasets. These approaches will likely have utility in addressing a variety of biological questions.
Removal of Heterologous Sequences from Plasmodium falciparum Mutants Using FLPe-Recombinase

PubMed Central

van Schaijk, Ben C. L.; Vos, Martijn W.; Janse, Chris J.; Sauerwein, Robert W.; Khan, Shahid M.

2010-01-01

Genetically-modified mutants are now indispensable Plasmodium gene-function reagents, which are also being pursued as genetically attenuated parasite vaccines. Currently, the generation of transgenic malaria-parasites requires the use of drug-resistance markers. Here we present the development of an FRT/FLP-recombinase system that enables the generation of transgenic parasites free of resistance genes. We demonstrate in the human malaria parasite, P. falciparum, the complete and efficient removal of the introduced resistance gene. We targeted two neighbouring genes, p52 and p36, using a construct that has a selectable marker cassette flanked by FRT-sequences. This permitted the subsequent removal of the selectable marker cassette by transient transfection of a plasmid that expressed a 37°C thermostable and enhanced FLP-recombinase. This method of removing heterologous DNA sequences from the genome opens up new possibilities in Plasmodium research to sequentially target multiple genes and for using genetically-modified parasites as live, attenuated malaria vaccines. PMID:21152048
Analysis of the dynamic co-expression network of heart regeneration in the zebrafish

PubMed Central

Rodius, Sophie; Androsova, Ganna; Götz, Lou; Liechti, Robin; Crespo, Isaac; Merz, Susanne; Nazarov, Petr V.; de Klein, Niek; Jeanty, Céline; González-Rosa, Juan M.; Muller, Arnaud; Bernardin, Francois; Niclou, Simone P.; Vallar, Laurent; Mercader, Nadia; Ibberson, Mark; Xenarios, Ioannis; Azuaje, Francisco

2016-01-01

The zebrafish has the capacity to regenerate its heart after severe injury. While the function of a few genes during this process has been studied, we are far from fully understanding how genes interact to coordinate heart regeneration. To enable systematic insights into this phenomenon, we generated and integrated a dynamic co-expression network of heart regeneration in the zebrafish and linked systems-level properties to the underlying molecular events. Across multiple post-injury time points, the network displays topological attributes of biological relevance. We show that regeneration steps are mediated by modules of transcriptionally coordinated genes, and by genes acting as network hubs. We also established direct associations between hubs and validated drivers of heart regeneration with murine and human orthologs. The resulting models and interactive analysis tools are available at http://infused.vital-it.ch. Using a worked example, we demonstrate the usefulness of this unique open resource for hypothesis generation and in silico screening for genes involved in heart regeneration. PMID:27241320
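The idea of a co-expression network with hub genes can be illustrated with a small sketch. The abstract does not state how the network was built, so everything below is an assumption made for illustration: genes are linked when the absolute Pearson correlation of their expression profiles across time points reaches a chosen threshold, and a gene's "hubness" is simply its number of links. The gene names, expression values, and threshold are hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def coexpression_degree(expr, threshold=0.9):
    """Count, for each gene, how many other genes it is co-expressed with.

    Two genes are linked when |r| >= threshold (an illustrative rule,
    not the method used in the study).
    """
    degree = {gene: 0 for gene in expr}
    for g1, g2 in combinations(expr, 2):
        if abs(pearson(expr[g1], expr[g2])) >= threshold:
            degree[g1] += 1
            degree[g2] += 1
    return degree


# Hypothetical expression values at four post-injury time points.
expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.1, 6.0, 8.2],
    "geneC": [4.0, 3.1, 2.0, 0.9],
    "geneD": [1.0, 3.0, 1.0, 3.0],
}
degree = coexpression_degree(expr)
# Genes with the highest degree are the hub candidates.
ranked = sorted(degree, key=lambda g: (-degree[g], g))
```

In this toy data geneA, geneB and geneC form one tightly correlated module, while geneD stays unconnected. Real analyses typically use many more samples and more robust network construction methods.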
Analysis of the dynamic co-expression network of heart regeneration in the zebrafish

NASA Astrophysics Data System (ADS)

Rodius, Sophie; Androsova, Ganna; Götz, Lou; Liechti, Robin; Crespo, Isaac; Merz, Susanne; Nazarov, Petr V.; de Klein, Niek; Jeanty, Céline; González-Rosa, Juan M.; Muller, Arnaud; Bernardin, Francois; Niclou, Simone P.; Vallar, Laurent; Mercader, Nadia; Ibberson, Mark; Xenarios, Ioannis; Azuaje, Francisco

2016-05-01

The zebrafish has the capacity to regenerate its heart after severe injury. While the function of a few genes during this process has been studied, we are far from fully understanding how genes interact to coordinate heart regeneration. To enable systematic insights into this phenomenon, we generated and integrated a dynamic co-expression network of heart regeneration in the zebrafish and linked systems-level properties to the underlying molecular events. Across multiple post-injury time points, the network displays topological attributes of biological relevance. We show that regeneration steps are mediated by modules of transcriptionally coordinated genes, and by genes acting as network hubs. We also established direct associations between hubs and validated drivers of heart regeneration with murine and human orthologs. The resulting models and interactive analysis tools are available at http://infused.vital-it.ch. Using a worked example, we demonstrate the usefulness of this unique open resource for hypothesis generation and in silico screening for genes involved in heart regeneration.
Genome-Level Longitudinal Expression of Signaling Pathways and Gene Networks in Pediatric Septic Shock

PubMed Central

Shanley, Thomas P; Cvijanovich, Natalie; Lin, Richard; Allen, Geoffrey L; Thomas, Neal J; Doctor, Allan; Kalyanaraman, Meena; Tofil, Nancy M; Penfil, Scott; Monaco, Marie; Odoms, Kelli; Barnes, Michael; Sakthivel, Bhuvaneswari; Aronow, Bruce J; Wong, Hector R

2007-01-01

We have conducted longitudinal studies focused on the expression profiles of signaling pathways and gene networks in children with septic shock. Genome-level expression profiles were generated from whole blood-derived RNA of children with septic shock (n = 30) corresponding to day one and day three of septic shock, respectively. Based on sequential statistical and expression filters, day one and day three of septic shock were characterized by differential regulation of 2,142 and 2,504 gene probes, respectively, relative to controls (n = 15). Venn analysis demonstrated 239 unique genes in the day one dataset, 598 unique genes in the day three dataset, and 1,906 genes common to both datasets. Functional analyses demonstrated time-dependent, differential regulation of genes involved in multiple signaling pathways and gene networks primarily related to immunity and inflammation. Notably, multiple and distinct gene networks involving T cell- and MHC antigen-related biology were persistently downregulated on both day one and day three. Further analyses demonstrated large scale, persistent downregulation of genes corresponding to functional annotations related to zinc homeostasis. These data represent the largest reported cohort of patients with septic shock subjected to longitudinal genome-level expression profiling. The data further advance our genome-level understanding of pediatric septic shock and support novel hypotheses. PMID:17932561
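The Venn analysis described above is a partition of two gene lists into the genes unique to each list and the genes shared by both. A minimal sketch follows, with hypothetical gene identifiers standing in for the day one and day three lists; it shows the set arithmetic only and does not reproduce the study's filters or counts.

```python
def venn_partition(day_one, day_three):
    """Split two gene lists into unique and shared subsets."""
    day_one, day_three = set(day_one), set(day_three)
    return {
        "day_one_only": day_one - day_three,
        "day_three_only": day_three - day_one,
        "common": day_one & day_three,
    }


# Hypothetical identifiers, used purely for illustration.
parts = venn_partition({"g1", "g2", "g3"}, {"g2", "g3", "g4", "g5"})
counts = {name: len(genes) for name, genes in parts.items()}
```

Each input list is recovered as the union of its "only" subset and the common subset, which is a quick consistency check on any reported Venn counts.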
Genome structure drives patterns of gene family evolution in ciliates, a case study using Chilodonella uncinata (Protista, Ciliophora, Phyllopharyngea).

PubMed

Gao, Feng; Song, Weibo; Katz, Laura A

2014-08-01

In most lineages, diversity among gene family members results from gene duplication followed by sequence divergence. Because of the genome rearrangements during the development of somatic nuclei, gene family evolution in ciliates involves more complex processes. Previous work on the ciliate Chilodonella uncinata revealed that macronuclear β-tubulin gene family members are generated by alternative processing, in which germline regions are alternatively used in multiple macronuclear chromosomes. To further study genome evolution in this ciliate, we analyzed its transcriptome and found that (1) alternative processing is extensive among gene families; and (2) such gene families are likely to be C. uncinata specific. We characterized additional macronuclear and micronuclear copies of one candidate alternatively processed gene family, a protein kinase domain-containing protein (PKc), from two C. uncinata strains. Analysis of the PKc sequences reveals that (1) multiple PKc gene family members in the macronucleus share some identical regions flanked by divergent regions; and (2) the shared identical regions are processed from a single micronuclear chromosome. We discuss analogous processes in lineages across the eukaryotic tree of life to provide further insights on the impact of genome structure on gene family evolution in eukaryotes. © 2014 The Author(s). Evolution © 2014 The Society for the Study of Evolution.
Three copies of a single protein II-encoding sequence in the genome of Neisseria gonorrhoeae JS3: evidence for gene conversion and gene duplication.

PubMed

van der Ley, P

1988-11-01

Gonococci express a family of related outer membrane proteins designated protein II (P.II). These surface proteins are subject to both phase variation and antigenic variation. The P.II gene repertoire of Neisseria gonorrhoeae strain JS3 was found to consist of at least ten genes, eight of which were cloned. Sequence analysis and DNA hybridization studies revealed that one particular P.II-encoding sequence is present in three distinct, but almost identical, copies in the JS3 genome. These genes encode the P.II protein that was previously identified as P.IIc. Comparison of their sequences shows that the multiple copies of this P.IIc-encoding gene might have been generated by both gene conversion and gene duplication.
Overexpression of Multiple Detoxification Genes in Deltamethrin Resistant Laodelphax striatellus (Hemiptera: Delphacidae) in China

PubMed Central

Xu, Lu; Wu, Min; Han, Zhaojun

2013-01-01

Background: The small brown planthopper (SBPH), Laodelphax striatellus (Fallén), is one of the major rice pests in Asia and has developed resistance to multiple classes of insecticides. Understanding resistance mechanisms is essential to the management of this pest. Biochemical and molecular assays were performed in this study to systematically characterize deltamethrin resistance mechanisms with laboratory-selected resistant and susceptible strains of SBPH. Methodology/Principal Findings: Deltamethrin-resistant strains of SBPH (JH-del) were derived from a field population by continuous selection (up to 30 generations) in the laboratory, while a susceptible strain (JHS) was obtained from the same population by removing insecticide pressure for 30 generations. The role of detoxification enzymes in the resistance was investigated using synergism and enzyme activity assays with strains of different resistance levels. Furthermore, 71 cytochrome P450, 93 esterase and 12 glutathione-S-transferase cDNAs were cloned based on transcriptome data of a field-collected population. Semi-quantitative RT-PCR screening of the 176 identified detoxification genes demonstrated that multiple P450 and esterase genes were overexpressed (>2-fold) in JH-del strains (G4 and G30) compared with JHS, and the results of quantitative PCR coincided with the semi-quantitative RT-PCR results. Target mutation at IIS3–IIS6 regions encoded by the voltage-gated sodium channel gene was ruled out for conferring the observed resistance. Conclusion/Significance: As the first attempt to discover genes potentially involved in SBPH pyrethroid resistance, this study putatively identified several candidate detoxification enzyme genes that were significantly overexpressed in the resistant strain, which matched the synergism and enzyme activity testing. The biochemical and molecular evidence suggests that the high-level pyrethroid resistance in L. striatellus could be due to enhanced detoxification rather than target insensitivity. The findings lay solid groundwork for further elucidation of resistance mechanisms. PMID:24324548
SYNTHETIC BIOLOGY. Emergent genetic oscillations in a synthetic microbial consortium.

PubMed

Chen, Ye; Kim, Jae Kyoung; Hirning, Andrew J; Josić, Krešimir; Bennett, Matthew R

2015-08-28

A challenge of synthetic biology is the creation of cooperative microbial systems that exhibit population-level behaviors. Such systems use cellular signaling mechanisms to regulate gene expression across multiple cell types. We describe the construction of a synthetic microbial consortium consisting of two distinct cell types—an "activator" strain and a "repressor" strain. These strains produced two orthogonal cell-signaling molecules that regulate gene expression within a synthetic circuit spanning both strains. The two strains generated emergent, population-level oscillations only when cultured together. Certain network topologies of the two-strain circuit were better at maintaining robust oscillations than others. The ability to program population-level dynamics through the genetic engineering of multiple cooperative strains points the way toward engineering complex synthetic tissues and organs with multiple cell types. Copyright © 2015, American Association for the Advancement of Science.
Flume experiments elucidate relationships between microbial genetics, nitrogen species and hydraulics in controlling nitrous oxide production in the hyporheic zone

NASA Astrophysics Data System (ADS)

Quick, A. M.; Farrell, T. B.; Reeder, W. J.; Feris, K. P.; Tonina, D.; Benner, S. G.

2014-12-01

The hyporheic zone is a potentially important producer of nitrous oxide, a powerful greenhouse gas. The location and magnitude of nitrous oxide generation within the hyporheic zone involves complex interactions between multiple nitrogen species, redox conditions, microbial communities, and hydraulics. To better understand nitrous oxide generation and emissions from streams, we conducted large-scale flume experiments in which we monitored pore waters along hyporheic flow paths within stream dune structures. Measured dissolved oxygen, ammonia, nitrate, nitrite, and dissolved nitrous oxide showed distinct spatial relationships reflecting redox changes along flow paths. Denitrifying genes (nosZ, nirS, and nirK), determined using qPCR, were spatially associated with abundances of nitrogen species. Using residence times along a flow path, clear trends in oxygen conditions, genes encoding for microbial catalysis, and nitrogen species were observed. Hotspots of targeted genes correlated with hotspots for conversion of nitrogen species, including nitrous oxide production and conversion to dinitrogen. Trends were apparent regardless of dune size, allowing for the possibility to apply observed relationships to multiple streambed morphologies. Relating streambed morphology and loading of nitrogen species allows for prediction of nitrous oxide production in the hyporheic zone.
CRISPR/Cas9 and TALENs generate heritable mutations for genes involved in small RNA processing of Glycine max and Medicago truncatula.

PubMed

Curtin, Shaun J; Xiong, Yer; Michno, Jean-Michel; Campbell, Benjamin W; Stec, Adrian O; Čermák, Tomas; Starker, Colby; Voytas, Daniel F; Eamens, Andrew L; Stupar, Robert M

2018-06-01

Processing of double-stranded RNA precursors into small RNAs is an essential regulator of gene expression in plant development and stress response. Small RNA processing requires the combined activity of a functionally diverse group of molecular components. However, in most of the plant species, there are insufficient mutant resources to functionally characterize each encoding gene. Here, mutations in loci encoding protein machinery involved in small RNA processing in soya bean and Medicago truncatula were generated using the CRISPR/Cas9 and TAL-effector nuclease (TALEN) mutagenesis platforms. An efficient CRISPR/Cas9 reagent was used to create a bi-allelic double mutant for the two soya bean paralogous Double-stranded RNA-binding2 (GmDrb2a and GmDrb2b) genes. These mutations, along with a CRISPR/Cas9-generated mutation of the M. truncatula Hua enhancer1 (MtHen1) gene, were determined to be germ-line transmissible. Furthermore, TALENs were used to generate a mutation within the soya bean Dicer-like2 gene. CRISPR/Cas9 mutagenesis of the soya bean Dicer-like3 gene and the GmHen1a gene was observed in the T0 generation, but these mutations failed to transmit to the T1 generation. The irregular transmission of induced mutations and the corresponding transgenes was investigated by whole-genome sequencing to reveal a spectrum of non-germ-line-targeted mutations and multiple transgene insertion events. Finally, a suite of combinatorial mutant plants were generated by combining the previously reported Gmdcl1a, Gmdcl1b and Gmdcl4b mutants with the Gmdrb2ab double mutant. Altogether, this study demonstrates the synergistic use of different genome engineering platforms to generate a collection of useful mutant plant lines for future study of small RNA processing in legume crops. © 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
MGAS: a powerful tool for multivariate gene-based genome-wide association analysis.

PubMed

Van der Sluis, Sophie; Dolan, Conor V; Li, Jiang; Song, Youqiang; Sham, Pak; Posthuma, Danielle; Li, Miao-Xin

2015-04-01

Standard genome-wide association studies, testing the association between one phenotype and a large number of single nucleotide polymorphisms (SNPs), are limited in two ways: (i) traits are often multivariate, and analysis of composite scores entails loss in statistical power and (ii) gene-based analyses may be preferred, e.g. to decrease the multiple testing problem. Here we present a new method, multivariate gene-based association test by extended Simes procedure (MGAS), that allows gene-based testing of multivariate phenotypes in unrelated individuals. Through extensive simulation, we show that under most trait-generating genotype-phenotype models MGAS has superior statistical power to detect associated genes compared with gene-based analyses of univariate phenotypic composite scores (i.e. GATES, multiple regression), and multivariate analysis of variance (MANOVA). Re-analysis of metabolic data revealed 32 False Discovery Rate controlled genome-wide significant genes, and 12 regions harboring multiple genes; of these 44 regions, 30 were not reported in the original analysis. MGAS allows researchers to conduct their multivariate gene-based analyses efficiently, and without the loss of power that is often associated with incorrectly specified genotype-phenotype models. MGAS is freely available in KGG v3.0 (http://statgenpro.psychiatry.hku.hk/limx/kgg/download.php). Access to the metabolic dataset can be requested at dbGaP (https://dbgap.ncbi.nlm.nih.gov/). The R-simulation code is available from http://ctglab.nl/people/sophie_van_der_sluis. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.
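For orientation, the classical Simes test that the "extended Simes procedure" builds on can be sketched in a few lines. This is not MGAS itself: the extension described in the abstract additionally handles correlated SNPs and multiple phenotypes, which the sketch below ignores. It only shows how m individual p-values are combined into one gene-level p-value as the minimum of m · p(i) / i over the sorted p-values. The function name and the example p-values are hypothetical.

```python
def simes_pvalue(pvalues):
    """Classical Simes combined p-value: min over i of m * p_(i) / i."""
    m = len(pvalues)
    if m == 0:
        raise ValueError("at least one p-value is required")
    ranked = sorted(pvalues)
    return min(m * p / rank for rank, p in enumerate(ranked, start=1))


# Hypothetical SNP-level p-values for one gene.
gene_p = simes_pvalue([0.01, 0.04, 0.03])
```

Because the largest p-value enters the minimum with weight m / m = 1, the combined value never exceeds 1 and needs no clipping.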
Generation of 2A-linked multicistronic cassettes by recombinant PCR.

PubMed

Szymczak-Workman, Andrea L; Vignali, Kate M; Vignali, Dario A A

2012-02-01

The need for reliable, multicistronic vectors for multigene delivery is at the forefront of biomedical technology. It is now possible to express multiple proteins from a single open reading frame (ORF) using 2A peptide-linked multicistronic vectors. These small sequences, when cloned between genes, allow for efficient, stoichiometric production of discrete protein products within a single vector through a novel "cleavage" event within the 2A peptide sequence. Expression of more than two genes using conventional approaches has several limitations, most notably imbalanced protein expression and large size. The use of 2A peptide sequences alleviates these concerns. They are small (18-22 amino acids) and have divergent amino-terminal sequences, which minimizes the chance for homologous recombination and allows for multiple, different 2A peptide sequences to be used within a single vector. Importantly, separation of genes placed between 2A peptide sequences is nearly 100%, which allows for stoichiometric and concordant expression of the genes, regardless of the order of placement within the vector. This protocol describes the use of recombinant polymerase chain reaction (PCR) to connect multiple 2A-linked protein sequences. The final construct is subcloned into an expression vector.
KNQ1, a Kluyveromyces lactis gene encoding a transmembrane protein, may be involved in iron homeostasis.

PubMed

Marchi, Emmanuela; Lodi, Tiziana; Donnini, Claudia

2007-08-01

The original purpose of the experiments described in this article was to identify, in the biotechnologically important yeast Kluyveromyces lactis, gene(s) that are potentially involved in oxidative protein folding within the endoplasmic reticulum (ER), which often represents a bottleneck for heterologous protein production. Because treatment with the membrane-permeable reducing agent dithiothreitol inhibits disulfide bond formation and mimics the reducing effect that the normal transit of folding proteins has in the ER environment, the strategy was to search for genes that conferred higher levels of resistance to dithiothreitol when present in multiple copies. We identified a gene (KNQ1) encoding a drug efflux permease for several toxic compounds that in multiple copies conferred increased dithiothreitol resistance. However, the KNQ1 product is not involved in the excretion of dithiothreitol or in recombinant protein secretion. We generated a knq1 null mutant, and showed that both overexpression and deletion of the KNQ1 gene resulted in increased resistance to dithiothreitol. KNQ1 amplification and deletion resulted in enhanced transcription of iron transport genes, suggesting, for the membrane-associated protein Knq1p, a new, unexpected role in iron homeostasis on which dithiothreitol tolerance may depend.
Genomic approaches for the elucidation of genes and gene networks underlying cardiovascular traits.

PubMed

Adriaens, M E; Bezzina, C R

2018-06-22

Genome-wide association studies have shed light on the association between natural genetic variation and cardiovascular traits. However, linking a cardiovascular trait-associated locus to a candidate gene or set of candidate genes for prioritization for follow-up mechanistic studies is anything but straightforward. Genomic technologies based on next-generation sequencing now offer multiple opportunities to dissect gene regulatory networks underlying genetic cardiovascular trait associations, thereby aiding in the identification of candidate genes at unprecedented scale. RNA sequencing in particular becomes a powerful tool when combined with genotyping to identify loci that modulate transcript abundance, known as expression quantitative trait loci (eQTL), or loci modulating transcript splicing, known as splicing quantitative trait loci (sQTL). Additionally, the allele-specific resolution of RNA-sequencing technology enables estimation of allelic imbalance, a state where the two alleles of a gene are expressed at a ratio differing from the expected 1:1 ratio. When multiple high-throughput approaches are combined with deep phenotyping in a single study, a comprehensive elucidation of the relationship between genotype and phenotype comes into view, an approach known as systems genetics. In this review, we cover key applications of systems genetics in the broad cardiovascular field.
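The notion of allelic imbalance as a departure from the expected 1:1 ratio can be made concrete with a simple test on allele-specific read counts. The sketch below is an illustration under simplifying assumptions, not a method taken from the review: it applies an exact two-sided binomial test against a 0.5 reference-allele fraction and ignores mapping bias and overdispersion, both of which real pipelines usually model. The function name and read counts are hypothetical.

```python
from math import comb


def allelic_imbalance_pvalue(ref_reads, alt_reads):
    """Exact two-sided binomial test of a 1:1 allelic ratio."""
    n = ref_reads + alt_reads
    if n == 0:
        raise ValueError("no allele-specific reads at this site")
    k = min(ref_reads, alt_reads)
    # Probability of seeing k or fewer reads from the rarer allele
    # when both alleles are equally likely.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    # The null distribution is symmetric, so double the tail and cap at 1.
    return min(1.0, 2 * tail)


balanced = allelic_imbalance_pvalue(10, 10)   # consistent with 1:1
skewed = allelic_imbalance_pvalue(0, 10)      # all reads from one allele
```

A small p-value flags a heterozygous site as a candidate for allelic imbalance; with many sites, the p-values would still need multiple-testing correction.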
Rare Variant Association Test with Multiple Phenotypes

PubMed Central

Lee, Selyeong; Won, Sungho; Kim, Young Jin; Kim, Yongkang; Kim, Bong-Jo; Park, Taesung

2016-01-01

Although genome-wide association studies (GWAS) have now discovered thousands of genetic variants associated with common traits, such variants cannot explain the large degree of “missing heritability,” likely due to rare variants. The advent of next generation sequencing technology has allowed rare variant detection and association with common traits, often by investigating specific genomic regions for rare variant effects on a trait. Although multiple correlated phenotypes are often concurrently observed in GWAS, most studies analyze only single phenotypes, which may lessen statistical power. To increase power, multivariate analyses, which consider correlations between multiple phenotypes, can be used. However, few existing multivariate analyses can identify rare variants for assessing multiple phenotypes. Here, we propose Multivariate Association Analysis using Score Statistics (MAAUSS), to identify rare variants associated with multiple phenotypes, based on the widely used Sequence Kernel Association Test (SKAT) for a single phenotype. We applied MAAUSS to Whole Exome Sequencing (WES) data from a Korean population of 1,058 subjects, to discover genes associated with multiple traits of liver function. We then validated those genes in a replication study, using an independent dataset of 3,445 individuals. Notably, we detected the gene ZNF620 among five significant genes. We then performed a simulation study to compare MAAUSS's performance with existing methods. Overall, MAAUSS successfully preserved type 1 error rates and, in many cases, had higher power than the existing methods. This study illustrates a feasible and straightforward approach for identifying rare variants correlated with multiple phenotypes, with likely relevance to missing heritability. PMID:28039885
Targeted next-generation sequencing in steroid-resistant nephrotic syndrome: mutations in multiple glomerular genes may influence disease severity.

PubMed

Bullich, Gemma; Trujillano, Daniel; Santín, Sheila; Ossowski, Stephan; Mendizábal, Santiago; Fraga, Gloria; Madrid, Álvaro; Ariceta, Gema; Ballarín, José; Torra, Roser; Estivill, Xavier; Ars, Elisabet

2015-09-01

Genetic diagnosis of steroid-resistant nephrotic syndrome (SRNS) using Sanger sequencing is complicated by the high genetic heterogeneity and phenotypic variability of this disease. We aimed to improve the genetic diagnosis of SRNS by simultaneously sequencing 26 glomerular genes using massive parallel sequencing and to study whether mutations in multiple genes increase disease severity. High-throughput mutation analysis was performed in 50 SRNS and/or focal segmental glomerulosclerosis (FSGS) patients: a validation cohort of 25 patients with known pathogenic mutations and a discovery cohort of 25 uncharacterized patients with probable genetic etiology. In the validation cohort, we identified the 42 previously known pathogenic mutations across NPHS1, NPHS2, WT1, TRPC6, and INF2 genes. In the discovery cohort, disease-causing mutations in SRNS/FSGS genes were found in nine patients. We detected three patients with mutations in an SRNS/FSGS gene and COL4A3. Two of them were familial cases and presented a more severe phenotype than family members with a mutation in only one gene. In conclusion, our results show that massive parallel sequencing is feasible and robust for genetic diagnosis of SRNS/FSGS. Our results indicate that patients carrying mutations in an SRNS/FSGS gene and also in the COL4A3 gene have increased disease severity.
Deletion of a target gene in Indica rice via CRISPR/Cas9.

PubMed

Wang, Ying; Geng, Lizhao; Yuan, Menglong; Wei, Juan; Jin, Chen; Li, Min; Yu, Kun; Zhang, Ya; Jin, Huaibing; Wang, Eric; Chai, Zhijian; Fu, Xiangdong; Li, Xianggan

2017-08-01

Using CRISPR/Cas9, we successfully deleted large fragments of the yield-related gene DENSE AND ERECT PANICLE1 in Indica rice at relatively high frequency and generated gain-of-function dep1 mutants. CRISPR (clustered regularly interspaced short palindromic repeats)/Cas9 is a rapidly developing technology used to produce gene-specific modifications in both mammalian and plant systems. Most CRISPR-induced modifications in plants reported to date have been small insertions or deletions. Few large target gene deletions have thus far been reported, especially for Indica rice. In this study, we designed multiple CRISPR sgRNAs and successfully deleted DNA fragments in the gene DENSE AND ERECT PANICLE1 (DEP1) in the elite Indica rice line IR58025B. We achieved deletion frequencies of up to 21% for a 430 bp target and 9% for a 10 kb target among T0 events. Constructs with four sgRNAs did not generate higher full-length deletion frequencies than constructs with two sgRNAs. The multiple mutagenesis frequency reached 93% for four targets, and the homozygous mutation frequency reached 21% at the T0 stage. Important yield-related trait characteristics, such as dense and erect panicles and reduced plant height, were observed in dep1 homozygous T0 mutant plants produced by CRISPR/Cas9. Therefore, we successfully obtained deletions in DEP1 in the Indica background using the CRISPR/Cas9 editing tool at relatively high frequency.
A versatile modular vector system for rapid combinatorial mammalian genetics.

PubMed

Albers, Joachim; Danzer, Claudia; Rechsteiner, Markus; Lehmann, Holger; Brandt, Laura P; Hejhal, Tomas; Catalano, Antonella; Busenhart, Philipp; Gonçalves, Ana Filipa; Brandt, Simone; Bode, Peter K; Bode-Lesniewska, Beata; Wild, Peter J; Frew, Ian J

2015-04-01

Here, we describe the multiple lentiviral expression (MuLE) system that allows multiple genetic alterations to be introduced simultaneously into mammalian cells. We created a toolbox of MuLE vectors that constitute a flexible, modular system for the rapid engineering of complex polycistronic lentiviruses, allowing combinatorial gene overexpression, gene knockdown, Cre-mediated gene deletion, or CRISPR/Cas9-mediated (where CRISPR indicates clustered regularly interspaced short palindromic repeats) gene mutation, together with expression of fluorescent or enzymatic reporters for cellular assays and animal imaging. Examples of tumor engineering were used to illustrate the speed and versatility of performing combinatorial genetics using the MuLE system. By transducing cultured primary mouse cells with single MuLE lentiviruses, we engineered tumors containing up to 5 different genetic alterations, identified genetic dependencies of molecularly defined tumors, conducted genetic interaction screens, and induced the simultaneous CRISPR/Cas9-mediated knockout of 3 tumor-suppressor genes. Intramuscular injection of MuLE viruses expressing oncogenic H-RasG12V together with combinations of knockdowns of the tumor suppressors cyclin-dependent kinase inhibitor 2A (Cdkn2a), transformation-related protein 53 (Trp53), and phosphatase and tensin homolog (Pten) allowed the generation of 3 murine sarcoma models, demonstrating that genetically defined autochthonous tumors can be rapidly generated and quantitatively monitored via direct injection of polycistronic MuLE lentiviruses into mouse tissues. Together, our results demonstrate that the MuLE system provides genetic power for the systematic investigation of the molecular mechanisms that underlie human diseases.
An attenuated quadruple gene mutant of Mycobacterium tuberculosis imparts protection against tuberculosis in guinea pigs

PubMed Central

Chauhan, Priyanka

2018-01-01

Previously we had developed a triple gene mutant of Mycobacterium tuberculosis (MtbΔmms) harboring disruptions in three genes, namely mptpA, mptpB and sapM. Though vaccination with the MtbΔmms strain induced protection in the lungs of guinea pigs, the mutant strain failed to control the hematogenous spread of the challenge strain to the spleen. Additionally, inoculation with MtbΔmms resulted in some pathological damage to the spleens in the early phase of infection. In order to generate a strain that overcomes the pathology caused by MtbΔmms in the spleen of guinea pigs and controls dissemination of the challenge strain, MtbΔmms was genetically modified by disrupting the bioA gene to generate the MtbΔmmsb strain. Further, in vivo attenuation of MtbΔmmsb was evaluated and its protective efficacy was assessed against virulent M. tuberculosis challenge in guinea pigs. The MtbΔmmsb mutant strain was highly attenuated for growth and virulence in guinea pigs. Vaccination with the MtbΔmmsb mutant generated significant protection in comparison to sham-immunized animals at 4 and 12 weeks post-infection in the lungs and spleen of infected animals. However, the protection imparted by MtbΔmmsb was significantly less than that in BCG-immunized animals. This study indicates the importance of attenuated multiple gene deletion mutants of M. tuberculosis for generating protection against tuberculosis. PMID:29242198

A fast and high performance multiple data integration algorithm for identifying human disease genes

PubMed Central

2015-01-01

Background Integrating multiple data sources is indispensable in improving disease gene identification. This is not only because disease genes associated with similar genetic diseases tend to lie close to each other in various biological networks, but also because gene-disease associations are complex. Although various algorithms have been proposed to identify disease genes, their prediction performance and computational time still need to be improved. Results In this study, we propose a fast and high performance multiple data integration algorithm for identifying human disease genes. A posterior probability of each candidate gene associated with individual diseases is calculated by using a Bayesian analysis method and a binary logistic regression model. Two prior probability estimation strategies and two feature vector construction methods are developed to test the performance of the proposed algorithm. Conclusions The proposed algorithm not only generates predictions with high AUC scores, but also runs very fast. When only a single PPI network is employed, the AUC score is 0.769 by using F2 as feature vectors. The average running time for each leave-one-out experiment is only around 1.5 seconds. When three biological networks are integrated, the AUC score using F3 as feature vectors increases to 0.830, and the average running time for each leave-one-out experiment is only about 12.54 seconds. It is better than many existing algorithms. PMID:26399620
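The AUC scores quoted in this record summarize how well candidate genes are ranked. The following is a minimal sketch of how an AUC can be computed from per-gene scores, assuming hypothetical posterior probabilities and labels; it illustrates the metric only and is not the authors' integration algorithm.

```python
def auc(scores, labels):
    """Area under the ROC curve in its pairwise (Mann-Whitney) form.

    scores: predicted score per candidate gene (higher = more likely disease gene)
    labels: 1 for a known disease gene, 0 otherwise
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Hypothetical scores for six candidate genes (illustrative values only).
scores = [0.9, 0.8, 0.35, 0.6, 0.2, 0.1]
labels = [1, 1, 1, 0, 0, 0]
print(f"AUC = {auc(scores, labels):.3f}")
```

An AUC of 0.5 corresponds to random ranking and 1.0 to a perfect ranking, which is the scale on which the reported 0.769 and 0.830 should be read.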
Systematic comparison of co-expression of multiple recombinant thermophilic enzymes in Escherichia coli BL21(DE3).

PubMed

Chen, Hui; Huang, Rui; Zhang, Y-H Percival

2017-06-01

The precise control of multiple heterologous enzyme expression levels in one Escherichia coli strain is important for cascade biocatalysis, metabolic engineering, synthetic biology, natural product synthesis, and studies of complexed proteins. We systematically investigated the co-expression of up to four thermophilic enzymes (i.e., α-glucan phosphorylase (αGP), phosphoglucomutase (PGM), glucose 6-phosphate dehydrogenase (G6PDH), and 6-phosphogluconate dehydrogenase (6PGDH)) in E. coli BL21(DE3) by adding T7 promoter or T7 terminator of each gene for multiple genes in tandem, changing gene alignment, and comparing one or two plasmid systems. It was found that the addition of T7 terminator after each gene was useful to decrease the influence of the upstream gene. The co-expression of the four enzymes in E. coli BL21(DE3) was demonstrated to generate two NADPH molecules from one glucose unit of maltodextrin, where NADPH was oxidized to convert xylose to xylitol. The best four-gene co-expression system was based on two plasmids (pET and pACYC) which harbored two genes. As a result, apparent enzymatic activities of the four enzymes were regulated to be at similar levels and the overall four-enzyme activity was the highest based on the formation of xylitol. This study provides useful information for the precise control of multi-enzyme-coordinated expression in E. coli BL21(DE3).
A multigenerational family with multiple sclerosis.

PubMed

Dyment, D A; Cader, M Z; Willer, C J; Risch, N; Sadovnick, A D; Ebers, G C

2002-07-01

We report a family with 15 individuals affected with multiple sclerosis present in three and possibly four generations. The segregation of multiple sclerosis within this pedigree is consistent with an autosomal dominant mode of inheritance with reduced penetrance. The clinical characteristics of the affected individuals are indistinguishable from those seen in sporadic multiple sclerosis with respect to sex ratio, age at onset, onset symptom, MRI and clinical course. Eleven of 14 cases (78.6%) were positive for the known multiple sclerosis-associated major histocompatibility complex (MHC) Class II HLA DRB1*15 allele. Parametric linkage analysis gave a non-significant LOD score of 0.31 (θ = 0.33) for the DRB1 gene. However, among 11 affected children with at least one DRB1*15 bearing parent, all 11 out of 11 received at least one copy of this known susceptibility allele. A transmission disequilibrium test analysis was significant for the DRB1*15 allele within this single family; P = 0.0054. The inheritance pattern in this family suggests the presence of a single major locus responsible for multiple sclerosis susceptibility, with DRB1 acting as an important modifier. This family could be an important resource for the identification of a multiple sclerosis susceptibility gene.
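For readers unfamiliar with the transmission disequilibrium test mentioned in this record, the sketch below shows the standard McNemar-type form of the statistic, (b - c)^2 / (b + c), where b and c count transmissions and non-transmissions of the allele from heterozygous parents. The counts used here are hypothetical and are not taken from the study, so the output is not expected to reproduce the reported P = 0.0054.

```python
import math


def tdt(transmitted, not_transmitted):
    """McNemar-type TDT statistic and its 1-df chi-square p-value.

    transmitted / not_transmitted: counts of heterozygous parents who did or
    did not pass the allele of interest to an affected child.
    """
    total = transmitted + not_transmitted
    if total == 0:
        raise ValueError("no informative transmissions")
    chi2 = (transmitted - not_transmitted) ** 2 / total
    # Survival function of a chi-square variable with 1 degree of freedom:
    # P(X > x) = erfc(sqrt(x / 2))
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Hypothetical counts for illustration only.
chi2, p_value = tdt(transmitted=15, not_transmitted=3)
print(f"chi2 = {chi2:.2f}, p = {p_value:.4f}")
```

With small counts, as in a single pedigree, an exact binomial version of the test is often preferred over the chi-square approximation.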
Horizontal acquisition of multiple mitochondrial genes from a parasitic plant followed by gene conversion with host mitochondrial genes

PubMed Central

2010-01-01

Background Horizontal gene transfer (HGT) is relatively common in plant mitochondrial genomes but the mechanisms, extent and consequences of transfer remain largely unknown. Previous results indicate that parasitic plants are often involved as either transfer donors or recipients, suggesting that direct contact between parasite and host facilitates genetic transfer among plants. Results In order to uncover the mechanistic details of plant-to-plant HGT, the extent and evolutionary fate of transfer was investigated between two groups: the parasitic genus Cuscuta and a small clade of Plantago species. A broad polymerase chain reaction (PCR) survey of mitochondrial genes revealed that at least three genes (atp1, atp6 and matR) were recently transferred from Cuscuta to Plantago. Quantitative PCR assays show that these three genes have a mitochondrial location in the one species line of Plantago examined. Patterns of sequence evolution suggest that these foreign genes degraded into pseudogenes shortly after transfer and reverse transcription (RT)-PCR analyses demonstrate that none are detectably transcribed. Three cases of gene conversion were detected between native and foreign copies of the atp1 gene. The identical phylogenetic distribution of the three foreign genes within Plantago and the retention of cytidines at ancestral positions of RNA editing indicate that these genes were probably acquired via a single, DNA-mediated transfer event. However, samplings of multiple individuals from two of the three species in the recipient Plantago clade revealed complex and perplexing phylogenetic discrepancies and patterns of sequence divergence for all three of the foreign genes. Conclusions This study reports the best evidence to date that multiple mitochondrial genes can be transferred via a single HGT event and that transfer occurred via a strictly DNA-level intermediate. 
The discovery of gene conversion between co-resident foreign and native mitochondrial copies suggests that transferred genes may be evolutionarily important in generating mitochondrial genetic diversity. Finally, the complex relationships within each lineage of transferred genes imply a surprisingly complicated history of these genes in Plantago subsequent to their acquisition via HGT and this history probably involves some combination of additional transfers (including intracellular transfer), gene duplication, differential loss and mutation-rate variation. Unravelling this history will probably require sequencing multiple mitochondrial and nuclear genomes from Plantago. See Commentary: http://www.biomedcentral.com/1741-7007/8/147. PMID:21176201
Electronic nose for detecting multiple targets

NASA Astrophysics Data System (ADS)

Chakraborty, Anirban; Parthasarathi, Ganga; Poddar, Rakesh; Zhao, Weiqiang; Luo, Cheng

2006-05-01

The discovery of high conductivity in doped polyacetylene in 1977 (garnering the 2000 Nobel Prize in Chemistry for the three discovering scientists) has attracted considerable interest in the application of polymers as the semiconducting and conducting materials due to their promising potential to replace silicon and metals in building devices. Previous and current efforts in developing conducting polymer microsystems mainly focus on generating a device of a single function. When multiple micropatterns made of different conducting polymers are produced on the same substrate, many microsystems of multiple functions can be envisioned. For example, analogous to the mammalian olfactory system which includes over 1,000 receptor genes in detecting various odors (e.g., beer, soda etc.), a sensor consisting of multiple distinct conducting polymer sensing elements will be capable of detecting a number of analytes simultaneously. However, existing techniques present significant technical challenges of degradation, low throughput, low resolution, depth of field, and/or residual layer in producing conducting polymer microstructures. To circumvent these challenges, an intermediate-layer lithography method developed in our group is used to generate multiple micropatterns made of different, commonly used conducting polymers, Polypyrrole (PPy), Poly(3,4-ethylenedioxy)thiophene (PEDOT) and Polyaniline (PANI). The generated multiple micropatterns are further used in an "electronic nose" to detect water vapor, glucose, toluene and acetone.
Rapid Sequencing of Complete env Genes from Primary HIV-1 Samples.

PubMed

Laird Smith, Melissa; Murrell, Ben; Eren, Kemal; Ignacio, Caroline; Landais, Elise; Weaver, Steven; Phung, Pham; Ludka, Colleen; Hepler, Lance; Caballero, Gemma; Pollner, Tristan; Guo, Yan; Richman, Douglas; Poignard, Pascal; Paxinos, Ellen E; Kosakovsky Pond, Sergei L; Smith, Davey M

2016-07-01

The ability to study rapidly evolving viral populations has been constrained by the read length of next-generation sequencing approaches and the sampling depth of single-genome amplification methods. Here, we develop and characterize a method using Pacific Biosciences' Single Molecule, Real-Time (SMRT®) sequencing technology to sequence multiple, intact full-length human immunodeficiency virus-1 env genes amplified from viral RNA populations circulating in blood, and provide computational tools for analyzing and visualizing these data.
Diversity in copy number and structure of a silkworm morphogenetic gene as a result of domestication.

PubMed

Sakudoh, Takashi; Nakashima, Takeharu; Kuroki, Yoko; Fujiyama, Asao; Kohara, Yuji; Honda, Naoko; Fujimoto, Hirofumi; Shimada, Toru; Nakagaki, Masao; Banno, Yutaka; Tsuchida, Kozo

2011-03-01

The carotenoid-binding protein (CBP) of the domesticated silkworm, Bombyx mori, a major determinant of cocoon color, is likely to have been substantially influenced by domestication of this species. We analyzed the structure of the CBP gene in multiple strains of B. mori, in multiple individuals of the wild silkworm, B. mandarina (the putative wild ancestor of B. mori), and in a number of other lepidopterans. We found the CBP gene copy number in genomic DNA to vary widely among B. mori strains, ranging from 1 to 20. The copies of CBP are of several types, based on the presence of a retrotransposon or partial deletion of the coding sequence. In contrast to B. mori, B. mandarina was found to possess a single copy of CBP without the retrotransposon insertion, regardless of habitat. Several other lepidopterans were found to contain sequences homologous to CBP, revealing that this gene is evolutionarily conserved in the lepidopteran lineage. Thus, domestication can generate significant diversity of gene copy number and structure over a relatively short evolutionary time. © 2011 by the Genetics Society of America
Diversity in Copy Number and Structure of a Silkworm Morphogenetic Gene as a Result of Domestication

PubMed Central

Sakudoh, Takashi; Nakashima, Takeharu; Kuroki, Yoko; Fujiyama, Asao; Kohara, Yuji; Honda, Naoko; Fujimoto, Hirofumi; Shimada, Toru; Nakagaki, Masao; Banno, Yutaka; Tsuchida, Kozo

2011-01-01

The carotenoid-binding protein (CBP) of the domesticated silkworm, Bombyx mori, a major determinant of cocoon color, is likely to have been substantially influenced by domestication of this species. We analyzed the structure of the CBP gene in multiple strains of B. mori, in multiple individuals of the wild silkworm, B. mandarina (the putative wild ancestor of B. mori), and in a number of other lepidopterans. We found the CBP gene copy number in genomic DNA to vary widely among B. mori strains, ranging from 1 to 20. The copies of CBP are of several types, based on the presence of a retrotransposon or partial deletion of the coding sequence. In contrast to B. mori, B. mandarina was found to possess a single copy of CBP without the retrotransposon insertion, regardless of habitat. Several other lepidopterans were found to contain sequences homologous to CBP, revealing that this gene is evolutionarily conserved in the lepidopteran lineage. Thus, domestication can generate significant diversity of gene copy number and structure over a relatively short evolutionary time. PMID:21242537
A Simple Screening Approach To Prioritize Genes for Functional Analysis Identifies a Role for Interferon Regulatory Factor 7 in the Control of Respiratory Syncytial Virus Disease

PubMed Central

McDonald, Jacqueline U.; Kaforou, Myrsini; Clare, Simon; Hale, Christine; Ivanova, Maria; Huntley, Derek; Dorner, Marcus; Wright, Victoria J.; Levin, Michael; Martinon-Torres, Federico; Herberg, Jethro A.

2016-01-01

ABSTRACT Greater understanding of the functions of host gene products in response to infection is required. While many of these genes enable pathogen clearance, some enhance pathogen growth or contribute to disease symptoms. Many studies have profiled transcriptomic and proteomic responses to infection, generating large data sets, but selecting targets for further study is challenging. Here we propose a novel data-mining approach combining multiple heterogeneous data sets to prioritize genes for further study by using respiratory syncytial virus (RSV) infection as a model pathogen with a significant health care impact. The assumption was that the more frequently a gene is detected across multiple studies, the more important its role is. A literature search was performed to find data sets of genes and proteins that change after RSV infection. The data sets were standardized, collated into a single database, and then panned to determine which genes occurred in multiple data sets, generating a candidate gene list. This candidate gene list was validated by using both a clinical cohort and in vitro screening. We identified several genes that were frequently expressed following RSV infection with no assigned function in RSV control, including IFI27, IFIT3, IFI44L, GBP1, OAS3, IFI44, and IRF7. Drilling down into the function of these genes, we demonstrate a role in disease for the gene for interferon regulatory factor 7, which was highly ranked on the list, but not for IRF1, which was not. Thus, we have developed and validated an approach for collating published data sets into a manageable list of candidates, identifying novel targets for future analysis. IMPORTANCE Making the most of “big data” is one of the core challenges of current biology. There is a large array of heterogeneous data sets of host gene responses to infection, but these data sets do not inform us about gene function and require specialized skill sets and training for their utilization. 
Here we describe an approach that combines and simplifies these data sets, distilling this information into a single list of genes commonly upregulated in response to infection with RSV as a model pathogen. Many of the genes on the list have unknown functions in RSV disease. We validated the gene list with new clinical, in vitro, and in vivo data. This approach allows the rapid selection of genes of interest for further, more-detailed studies, thus reducing time and costs. Furthermore, the approach is simple to use and widely applicable to a range of diseases. PMID:27822537
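The prioritization idea in this record (the more data sets a gene appears in, the higher it ranks) can be sketched as a simple count across gene lists. The gene symbols below are borrowed from the abstract, but the data-set memberships and the `min_count` threshold are invented for illustration; this is a rough sketch, not the authors' pipeline.

```python
from collections import Counter


def rank_genes(datasets, min_count=2):
    """Rank genes by the number of data sets in which they appear."""
    counts = Counter()
    for genes in datasets:
        counts.update(set(genes))  # count each gene at most once per data set
    ranked = [(gene, n) for gene, n in counts.items() if n >= min_count]
    # Most frequent first; ties broken alphabetically for a stable order.
    ranked.sort(key=lambda item: (-item[1], item[0]))
    return ranked


# Hypothetical data sets of genes changed after infection.
datasets = [
    ["IRF7", "IFI27", "OAS3"],
    ["IRF7", "IFI27", "GBP1"],
    ["IRF7", "IFIT3", "IRF1"],
]
for gene, n in rank_genes(datasets):
    print(gene, n)
```

Converting each list to a set before counting keeps a gene that is reported several times within one study from inflating its rank.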
Use of Network Inference to Elucidate Common and Chemical-specific Effects on Steroidogenesis

EPA Science Inventory

Microarray data is a key source for modeling gene regulatory interactions. Regulatory network models based on multiple datasets are potentially more robust and can provide greater confidence. In this study, we used network modeling on microarray data generated by exposing the fat...
Two alleles of the AtCesA3 gene in Arabidopsis thaliana display intragenic complementation.

PubMed

Pysh, Leonard D

2015-09-01

Cellulose is the most abundant biomolecule on the planet, yet the mechanism by which it is synthesized by higher plants remains largely unknown. In Arabidopsis thaliana (L.) Heynh, synthesis of cellulose in the primary cell wall requires three different cellulose synthase genes (AtCesA1, AtCesA3, and AtCesA6-related genes [AtCesA2, AtCesA5, and AtCesA6]). The multiple response expansion1 (mre1) mutant contains a hypomorphic AtCesA3 allele that results in significantly shorter, expanded roots. Crosses between mre1 and another allele of AtCesA3 (constitutive expression of VSP1, cev1) yielded an F1 with roots considerably longer and thinner than either parent, suggesting intragenic complementation. The F2 generation resulting from self-crossing these F1 showed three different root phenotypes: roots like mre1, roots like cev1, and roots like the F1. The segregation patterns of the three root phenotypes in multiple F2 and F3 generations were determined. Multiple characteristics of the roots and shoots were analyzed both qualitatively and quantitatively at different developmental stages, both on plates and on soil. The trans-heterozygous plants differed significantly from the parental mre1 and cev1 lines. The two alleles display intragenic complementation. A classic genetic interpretation of these results would suggest that cellulose synthesis requires homo-multimerization of cellulose synthase monomers. © 2015 Botanical Society of America.
Development of a gene synthesis platform for the efficient large scale production of small genes encoding animal toxins.

PubMed

Sequeira, Ana Filipa; Brás, Joana L A; Guerreiro, Catarina I P D; Vincentelli, Renaud; Fontes, Carlos M G A

2016-12-01

Gene synthesis is becoming an important tool in many fields of recombinant DNA technology, including recombinant protein production. De novo gene synthesis is quickly replacing the classical cloning and mutagenesis procedures and allows generating nucleic acids for which no template is available. In addition, when coupled with efficient gene design algorithms that optimize codon usage, it leads to high levels of recombinant protein expression. Here, we describe the development of an optimized gene synthesis platform that was applied to the large scale production of small genes encoding venom peptides. This improved gene synthesis method uses a PCR-based protocol to assemble synthetic DNA from pools of overlapping oligonucleotides and was developed to synthesise multiple genes simultaneously. This technology incorporates an accurate, automated and cost effective ligation independent cloning step to directly integrate the synthetic genes into an effective Escherichia coli expression vector. The robustness of this technology to generate large libraries of dozens to thousands of synthetic nucleic acids was demonstrated through the parallel and simultaneous synthesis of 96 genes encoding animal toxins. An automated platform was developed for the large-scale synthesis of small genes encoding eukaryotic toxins. Large scale recombinant expression of synthetic genes encoding eukaryotic toxins will allow exploring the extraordinary potency and pharmacological diversity of animal venoms, an increasingly valuable but unexplored source of lead molecules for drug discovery.
CLINICAL PROGRESS IN INHERITED RETINAL DEGENERATIONS: GENE THERAPY CLINICAL TRIALS AND ADVANCES IN GENETIC SEQUENCING.

PubMed

Hafler, Brian P

2017-03-01

Inherited retinal dystrophies are a significant cause of vision loss and are characterized by the loss of photoreceptors and the retinal pigment epithelium (RPE). Mutations in approximately 250 genes cause inherited retinal degenerations with a high degree of genetic heterogeneity. New techniques in next-generation sequencing are allowing the comprehensive analysis of all retinal disease genes, thus changing the approach to the molecular diagnosis of inherited retinal dystrophies. This review serves to analyze clinical progress in genetic diagnostic testing and implications for retinal gene therapy. A literature search of PubMed and OMIM was conducted to identify relevant articles on inherited retinal dystrophies. Next-generation genetic sequencing allows the simultaneous analysis of all the approximately 250 genes that cause inherited retinal dystrophies. Reported diagnostic rates are high, ranging from 51% to 57%. These new sequencing tools are highly accurate with sensitivities of 97.9% and specificities of 100%. Retinal gene therapy clinical trials are underway for multiple genes including RPE65, ABCA4, CHM, RS1, MYO7A, CNGA3, CNGB3, ND4, and MERTK for which a molecular diagnosis may be beneficial for patients. Comprehensive next-generation genetic sequencing of all retinal dystrophy genes is changing the paradigm for how retinal specialists perform genetic testing for inherited retinal degenerations. Not only are high diagnostic yields obtained, but mutations in genes with novel clinical phenotypes are also identified. In the era of retinal gene therapy clinical trials, identifying specific genetic defects will increasingly be of use to identify patients who may enroll in clinical studies and benefit from novel therapies.
Gene expression analysis of induced pluripotent stem cells from aneuploid chromosomal syndromes

PubMed Central

2013-01-01

Background Human aneuploidy is the leading cause of early pregnancy loss, mental retardation, and multiple congenital anomalies. Due to the high mortality associated with aneuploidy, the pathophysiological mechanisms of aneuploidy syndrome remain largely unknown. Previous studies focused mostly on whether dosage compensation occurs, and the next generation transcriptomics sequencing technology RNA-seq is expected to eventually uncover the mechanisms of gene expression regulation and the related pathological phenotypes in human aneuploidy. Results Using next generation transcriptomics sequencing technology RNA-seq, we profiled the transcriptomes of four human aneuploid induced pluripotent stem cell (iPSC) lines generated from monosomy X (Turner syndrome), trisomy 8 (Warkany syndrome 2), trisomy 13 (Patau syndrome), and partial trisomy 11:22 (Emanuel syndrome) as well as two umbilical cord matrix iPSC lines as euploid controls to examine how phenotypic abnormalities develop with aberrant karyotype. A total of 466 M (50-bp) reads were obtained from the six iPSC lines, and over 13,000 mRNAs were identified by gene annotation. Global analysis of gene expression profiles and functional analysis of differentially expressed (DE) genes were implemented. Over 5000 DE genes are determined between aneuploidy and euploid iPSCs respectively while 9 KEGG pathways are enriched in common across the four aneuploidy samples. Conclusions Our results demonstrate that the extra or missing chromosome has extensive effects on the whole transcriptome. Functional analysis of differentially expressed genes reveals that the genes most affected in aneuploid individuals are related to central nervous system development and tumorigenesis. PMID:24564826
Lynx web services for annotations and systems analysis of multi-gene disorders.

PubMed

Sulakhe, Dinanath; Taylor, Andrew; Balasubramanian, Sandhya; Feng, Bo; Xie, Bingqing; Börnigen, Daniela; Dave, Utpal J; Foster, Ian T; Gilliam, T Conrad; Maltsev, Natalia

2014-07-01

Lynx is a web-based integrated systems biology platform that supports annotation and analysis of experimental data and generation of weighted hypotheses on molecular mechanisms contributing to human phenotypes and disorders of interest. Lynx has integrated multiple classes of biomedical data (genomic, proteomic, pathways, phenotypic, toxicogenomic, contextual and others) from various public databases as well as manually curated data from our group and collaborators (LynxKB). Lynx provides tools for gene list enrichment analysis using multiple functional annotations and network-based gene prioritization. Lynx provides access to the integrated database and the analytical tools via REST based Web Services (http://lynx.ci.uchicago.edu/webservices.html). This comprises data retrieval services for specific functional annotations, services to search across the complete LynxKB (powered by Lucene), and services to access the analytical tools built within the Lynx platform. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Strategies for comparing gene expression profiles from different microarray platforms: application to a case-control experiment.

PubMed

Severgnini, Marco; Bicciato, Silvio; Mangano, Eleonora; Scarlatti, Francesca; Mezzelani, Alessandra; Mattioli, Michela; Ghidoni, Riccardo; Peano, Clelia; Bonnal, Raoul; Viti, Federica; Milanesi, Luciano; De Bellis, Gianluca; Battaglia, Cristina

2006-06-01

Meta-analysis of microarray data is increasingly important, considering both the availability of multiple platforms using disparate technologies and the accumulation in public repositories of data sets from different laboratories. We addressed the issue of comparing gene expression profiles from two microarray platforms by devising a standardized investigative strategy. We tested this procedure by studying MDA-MB-231 cells, which undergo apoptosis on treatment with resveratrol. Gene expression profiles were obtained using high-density, short-oligonucleotide, single-color microarray platforms: GeneChip (Affymetrix) and CodeLink (Amersham). Interplatform analyses were carried out on 8414 common transcripts represented on both platforms, as identified by LocusLink ID, representing 70.8% and 88.6% of annotated GeneChip and CodeLink features, respectively. We identified 105 differentially expressed genes (DEGs) on CodeLink and 42 DEGs on GeneChip. Among them, only 9 DEGs were commonly identified by both platforms. Multiple analyses (BLAST alignment of probes with target sequences, gene ontology, literature mining, and quantitative real-time PCR) permitted us to investigate the factors contributing to the generation of platform-dependent results in single-color microarray experiments. An effective approach to cross-platform comparison involves microarrays of similar technologies, samples prepared by identical methods, and a standardized battery of bioinformatic and statistical analyses.
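The cross-platform agreement reported in this record (105 DEGs on CodeLink, 42 on GeneChip, 9 shared) can be expressed as a set overlap. The sketch below uses synthetic gene identifiers sized to match those reported counts; the identifiers themselves are placeholders, and the Jaccard index is one common summary chosen here for illustration, not a measure the authors are stated to have used.

```python
def overlap_summary(a, b):
    """Return the shared items, the Jaccard index, and per-list shared fractions."""
    shared = a & b
    union = a | b
    jaccard = len(shared) / len(union) if union else 0.0
    frac_a = len(shared) / len(a) if a else 0.0
    frac_b = len(shared) / len(b) if b else 0.0
    return shared, jaccard, frac_a, frac_b


# Synthetic placeholder IDs: 105 and 42 genes with exactly 9 in common.
codelink_degs = {f"g{i}" for i in range(105)}        # g0 .. g104
genechip_degs = {f"g{i}" for i in range(96, 138)}    # g96 .. g137

shared, jaccard, frac_codelink, frac_genechip = overlap_summary(
    codelink_degs, genechip_degs
)
print(f"shared = {len(shared)}, Jaccard = {jaccard:.3f}")
print(f"share of CodeLink DEGs = {frac_codelink:.1%}, "
      f"share of GeneChip DEGs = {frac_genechip:.1%}")
```

Reporting the shared fraction relative to each list separately is useful here because the two platforms returned DEG lists of very different sizes.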
Novel application of multi-stimuli network inference to synovial fibroblasts of rheumatoid arthritis patients

PubMed Central

2014-01-01

Background Network inference of gene expression data is an important challenge in systems biology. Novel algorithms may provide more detailed gene regulatory networks (GRN) for complex, chronic inflammatory diseases such as rheumatoid arthritis (RA), in which activated synovial fibroblasts (SFBs) play a major role. Since the detailed mechanisms underlying this activation are still unclear, simultaneous investigation of multi-stimuli activation of SFBs offers the possibility to elucidate the regulatory effects of multiple mediators and to gain new insights into disease pathogenesis. Methods A GRN was therefore inferred from RA-SFBs treated with 4 different stimuli (IL-1β, TNF-α, TGF-β, and PDGF-D). Data from time series microarray experiments (0, 1, 2, 4, 12 h; Affymetrix HG-U133 Plus 2.0) were batch-corrected applying ‘ComBat’, analyzed for differentially expressed genes over time with ‘Limma’, and used for the inference of a robust GRN with NetGenerator V2.0, a heuristic ordinary differential equation-based method with soft integration of prior knowledge. Results Using all genes differentially expressed over time in RA-SFBs for any stimulus, and selecting the genes belonging to the most significant gene ontology (GO) term, i.e., ‘cartilage development’, a dynamic, robust, moderately complex multi-stimuli GRN was generated with 24 genes and 57 edges in total, 31 of which were gene-to-gene edges. Prior literature-based knowledge derived from Pathway Studio or manual searches was reflected in the final network by 25/57 confirmed edges (44%). The model contained known network motifs crucial for dynamic cellular behavior, e.g., cross-talk among pathways, positive feed-back loops, and positive feed-forward motifs (including suppression of the transcriptional repressor OSR2 by all 4 stimuli). Conclusion A multi-stimuli GRN highly concordant with literature data was successfully generated by network inference from the gene expression of stimulated RA-SFBs.
The GRN showed high reliability, since 10 predicted edges were independently validated by literature findings post network inference. The selected GO term ‘cartilage development’ contained a number of differentiation markers, growth factors, and transcription factors with potential relevance for RA. Finally, the model provided new insight into the response of RA-SFBs to multiple stimuli implicated in the pathogenesis of RA, in particular to the ‘novel’ potent growth factor PDGF-D. PMID:24989895
A structured sparse regression method for estimating isoform expression level from multi-sample RNA-seq data.

PubMed

Zhang, L; Liu, X J

2016-06-03

With the rapid development of next-generation high-throughput sequencing technology, RNA-seq has become a standard and important technique for transcriptome analysis. For multi-sample RNA-seq data, the existing expression estimation methods usually deal with each single-RNA-seq sample, and ignore that the read distributions are consistent across multiple samples. In the current study, we propose a structured sparse regression method, SSRSeq, to estimate isoform expression using multi-sample RNA-seq data. SSRSeq uses a non-parameter model to capture the general tendency of non-uniformity read distribution for all genes across multiple samples. Additionally, our method adds a structured sparse regularization, which not only incorporates the sparse specificity between a gene and its corresponding isoform expression levels, but also reduces the effects of noisy reads, especially for lowly expressed genes and isoforms. Four real datasets were used to evaluate our method on isoform expression estimation. Compared with other popular methods, SSRSeq reduced the variance between multiple samples, and produced more accurate isoform expression estimations, and thus more meaningful biological interpretations.
Severe sensory neuropathy in patients with adult-onset multiple acyl-CoA dehydrogenase deficiency.

PubMed

Wang, Zhaoxia; Hong, Daojun; Zhang, Wei; Li, Wurong; Shi, Xin; Zhao, Danhua; Yang, Xu; Lv, He; Yuan, Yun

2016-02-01

Multiple Acyl-CoA dehydrogenase deficiency (MADD) is an autosomal recessive disorder of fatty acid oxidation. Most patients with late-onset MADD are clinically characterized by lipid storage myopathy with dramatic responsiveness to riboflavin treatment. Abnormalities of peripheral neuropathy have rarely been reported in patients with late-onset MADD. We describe six patients who presented with proximal limb weakness and loss of sensation in the distal limbs. Muscle biopsy revealed typical myopathological patterns of lipid storage myopathy and blood acylcarnitine profiles showed a combined elevation of multiple acylcarnitines supporting the diagnosis of MADD. However, nerve conduction investigations and sural nerve biopsies in these patients indicated severe axonal sensory neuropathy. Causative ETFDH gene mutations were found in all six cases. No other causative gene mutations were identified in mitochondrial DNA and genes associated with hereditary neuropathies through next-generation-sequencing panel. Late-onset patients with ETFDH mutations can present with proximal muscle weakness and distal sensory neuropathy, which might be a new phenotypic variation, but the precise underlying pathogenesis remains to be elucidated. Copyright © 2015. Published by Elsevier B.V.
Comparative genomics and evolution of eukaryotic phospholipid biosynthesis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lykidis, Athanasios

2006-12-01

Phospholipid biosynthetic enzymes produce diverse molecular structures and are often present in multiple forms encoded by different genes. This work utilizes comparative genomics and phylogenetics for exploring the distribution, structure and evolution of phospholipid biosynthetic genes and pathways in 26 eukaryotic genomes. Although the basic structure of the pathways was formed early in eukaryotic evolution, the emerging picture indicates that individual enzyme families followed unique evolutionary courses. For example, choline and ethanolamine kinases and cytidylyltransferases emerged in ancestral eukaryotes, whereas multiple forms of the corresponding phosphatidyltransferases evolved mainly in a lineage-specific manner. Furthermore, several unicellular eukaryotes maintain bacterial-type enzymes and reactions for the synthesis of phosphatidylglycerol and cardiolipin. Also, base-exchange phosphatidylserine synthases are widespread and ancestral enzymes. The multiplicity of phospholipid biosynthetic enzymes has been largely generated by gene expansion in a lineage-specific manner. Thus, these observations suggest that phospholipid biosynthesis has been an actively evolving system. Finally, comparative genomic analysis indicates the existence of novel phosphatidyltransferases and provides a candidate for the uncharacterized eukaryotic phosphatidylglycerol phosphate phosphatase.

Genetic anticipation in a special form of hypertrophic cardiomyopathy with sudden cardiac death in a family with 74 members across 5 generations.

PubMed

Guo, Xiying; Fan, Chaomei; Wang, Yanping; Wang, Miao; Cai, Chi; Yang, Yinjian; Zhao, Shihua; Duan, Fujian; Li, Yishi

2017-03-01

Hypertrophic cardiomyopathy (HCM) is the most common heritable heart disease. The genetic anticipation of HCM and its associated etiology, sudden cardiac death (SCD), remains unclear. The aim of this study was to investigate the mechanism underlying the genetic anticipation of HCM and associated SCD. An HCM family including 5 generations and 74 members was studied. Two-dimensional echocardiography was performed to diagnose HCM. The age of onset of HCM was defined as the age at first diagnosis according to hospital records. The information on SCD was confirmed by verification by ≥2 family members and a review of hospital records. Whole-genome sequencing was performed on 4 HCM subjects and 1 healthy control in the family. The identified mutations were screened in all available family members and 216 unrelated healthy controls by Sanger sequencing. The median ages of onset of HCM were 63.5, 38.5, and 18.0 years in members of the second, third, and fourth generations of the family, respectively, and the differences between the generations were significant (P < 0.001). The age at SCD also decreased with each subsequent generation (P < 0.05). In particular, among the third-generation family members, SCD occurred between 30 and 40 years of age at approximately 8 AM, whereas among the fourth-generation family members, all 5 males who experienced SCD were 16 years of age and died at approximately 8 AM. The sarcomere gene mutations MYH7-A719H and MYOZ2-L169G were detected in the HCM individuals in this pedigree. Increases in the number of mutations and the frequency of multiple gene mutations were observed in the younger generations. Moreover, a structural variant was present in the HCM phenotype-positive subjects but was absent in the HCM phenotype-negative subjects. HCM may exhibit genetic anticipation, with a decreased age of onset and increased severity in successive generations. Multiple gene mutations may contribute to genetic anticipation in HCM and thus may be of prognostic value.
Genetic analysis of tachyzoite to bradyzoite differentiation mutants in Toxoplasma gondii reveals a hierarchy of gene induction.

PubMed

Singh, Upinder; Brewer, Jeremy L; Boothroyd, John C

2002-05-01

Developmental switching in Toxoplasma gondii, from the virulent tachyzoite to the relatively quiescent bradyzoite stage, is responsible for disease propagation and reactivation. We have generated tachyzoite to bradyzoite differentiation (Tbd-) mutants in T. gondii and used these in combination with a cDNA microarray to identify developmental pathways in bradyzoite formation. Four independently generated Tbd- mutants were analysed and had defects in bradyzoite development in response to multiple bradyzoite-inducing conditions, a stable phenotype after in vivo passages and a markedly reduced brain cyst burden in a murine model of chronic infection. Transcriptional profiles of mutant and wild-type parasites, growing under bradyzoite conditions, revealed a hierarchy of developmentally regulated genes, including many bradyzoite-induced genes whose transcripts were reduced in all mutants. A set of non-developmentally regulated genes whose transcripts were less abundant in Tbd- mutants were also identified. These may represent genes that mediate downstream effects and/or whose expression is dependent on the same transcription factors as the bradyzoite-induced set. Using these data, we have generated a model of transcription regulation during bradyzoite development in T. gondii. Our approach shows the utility of this system as a model to study developmental biology in single-celled eukaryotes including protozoa and fungi.
Large-scale atlas of microarray data reveals biological landscape of gene expression in Arabidopsis

USDA-ARS's Scientific Manuscript database

Transcriptome datasets from thousands of samples of the model plant Arabidopsis thaliana have been collectively generated by multiple individual labs. Although integration and meta-analysis of these samples has become routine in the plant research community, it is often hampered by the lack of metad...
The Democratization of the Oncogene

PubMed Central

Le, Anh T.; Doebele, Robert C.

2014-01-01

Summary: The identification of novel, oncogenic gene rearrangements in inflammatory myofibroblastic tumor (IMT) demonstrates the potential of next-generation sequencing (NGS) platforms for the detection of therapeutically relevant oncogenes across multiple tumor types, but raises significant questions relating to the investigation of targeted therapies in this new era of widespread NGS testing. PMID:25092743
Major soybean maturity gene haplotypes revealed by SNPViz analysis of 72 sequenced soybean genomes

USDA-ARS's Scientific Manuscript database

In this Genomics Era, vast amounts of next generation sequencing data have become publicly-available for multiple genomes across hundreds of species. Analysis of these large-scale datasets can become cumbersome, especially when comparing nucleotide polymorphisms across many samples within a dataset...
High-throughput discovery of novel developmental phenotypes.

PubMed

Dickinson, Mary E; Flenniken, Ann M; Ji, Xiao; Teboul, Lydia; Wong, Michael D; White, Jacqueline K; Meehan, Terrence F; Weninger, Wolfgang J; Westerberg, Henrik; Adissu, Hibret; Baker, Candice N; Bower, Lynette; Brown, James M; Caddle, L Brianna; Chiani, Francesco; Clary, Dave; Cleak, James; Daly, Mark J; Denegre, James M; Doe, Brendan; Dolan, Mary E; Edie, Sarah M; Fuchs, Helmut; Gailus-Durner, Valerie; Galli, Antonella; Gambadoro, Alessia; Gallegos, Juan; Guo, Shiying; Horner, Neil R; Hsu, Chih-Wei; Johnson, Sara J; Kalaga, Sowmya; Keith, Lance C; Lanoue, Louise; Lawson, Thomas N; Lek, Monkol; Mark, Manuel; Marschall, Susan; Mason, Jeremy; McElwee, Melissa L; Newbigging, Susan; Nutter, Lauryl M J; Peterson, Kevin A; Ramirez-Solis, Ramiro; Rowland, Douglas J; Ryder, Edward; Samocha, Kaitlin E; Seavitt, John R; Selloum, Mohammed; Szoke-Kovacs, Zsombor; Tamura, Masaru; Trainor, Amanda G; Tudose, Ilinca; Wakana, Shigeharu; Warren, Jonathan; Wendling, Olivia; West, David B; Wong, Leeyean; Yoshiki, Atsushi; MacArthur, Daniel G; Tocchini-Valentini, Glauco P; Gao, Xiang; Flicek, Paul; Bradley, Allan; Skarnes, William C; Justice, Monica J; Parkinson, Helen E; Moore, Mark; Wells, Sara; Braun, Robert E; Svenson, Karen L; de Angelis, Martin Hrabe; Herault, Yann; Mohun, Tim; Mallon, Ann-Marie; Henkelman, R Mark; Brown, Steve D M; Adams, David J; Lloyd, K C Kent; McKerlie, Colin; Beaudet, Arthur L; Bućan, Maja; Murray, Stephen A

2016-09-22

Approximately one-third of all mammalian genes are essential for life. Phenotypes resulting from knockouts of these genes in mice have provided tremendous insight into gene function and congenital disorders. As part of the International Mouse Phenotyping Consortium effort to generate and phenotypically characterize 5,000 knockout mouse lines, here we identify 410 lethal genes during the production of the first 1,751 unique gene knockouts. Using a standardized phenotyping platform that incorporates high-resolution 3D imaging, we identify phenotypes at multiple time points for previously uncharacterized genes and additional phenotypes for genes with previously reported mutant phenotypes. Unexpectedly, our analysis reveals that incomplete penetrance and variable expressivity are common even on a defined genetic background. In addition, we show that human disease genes are enriched for essential genes, thus providing a dataset that facilitates the prioritization and validation of mutations identified in clinical sequencing efforts.
High-throughput discovery of novel developmental phenotypes

PubMed Central

Dickinson, Mary E.; Flenniken, Ann M.; Ji, Xiao; Teboul, Lydia; Wong, Michael D.; White, Jacqueline K.; Meehan, Terrence F.; Weninger, Wolfgang J.; Westerberg, Henrik; Adissu, Hibret; Baker, Candice N.; Bower, Lynette; Brown, James M.; Caddle, L. Brianna; Chiani, Francesco; Clary, Dave; Cleak, James; Daly, Mark J.; Denegre, James M.; Doe, Brendan; Dolan, Mary E.; Edie, Sarah M.; Fuchs, Helmut; Gailus-Durner, Valerie; Galli, Antonella; Gambadoro, Alessia; Gallegos, Juan; Guo, Shiying; Horner, Neil R.; Hsu, Chih-wei; Johnson, Sara J.; Kalaga, Sowmya; Keith, Lance C.; Lanoue, Louise; Lawson, Thomas N.; Lek, Monkol; Mark, Manuel; Marschall, Susan; Mason, Jeremy; McElwee, Melissa L.; Newbigging, Susan; Nutter, Lauryl M.J.; Peterson, Kevin A.; Ramirez-Solis, Ramiro; Rowland, Douglas J.; Ryder, Edward; Samocha, Kaitlin E.; Seavitt, John R.; Selloum, Mohammed; Szoke-Kovacs, Zsombor; Tamura, Masaru; Trainor, Amanda G; Tudose, Ilinca; Wakana, Shigeharu; Warren, Jonathan; Wendling, Olivia; West, David B.; Wong, Leeyean; Yoshiki, Atsushi; MacArthur, Daniel G.; Tocchini-Valentini, Glauco P.; Gao, Xiang; Flicek, Paul; Bradley, Allan; Skarnes, William C.; Justice, Monica J.; Parkinson, Helen E.; Moore, Mark; Wells, Sara; Braun, Robert E.; Svenson, Karen L.; de Angelis, Martin Hrabe; Herault, Yann; Mohun, Tim; Mallon, Ann-Marie; Henkelman, R. Mark; Brown, Steve D.M.; Adams, David J.; Lloyd, K.C. Kent; McKerlie, Colin; Beaudet, Arthur L.; Bucan, Maja; Murray, Stephen A.

2016-01-01

Approximately one third of all mammalian genes are essential for life. Phenotypes resulting from mouse knockouts of these genes have provided tremendous insight into gene function and congenital disorders. As part of the International Mouse Phenotyping Consortium effort to generate and phenotypically characterize 5000 knockout mouse lines, we have identified 410 lethal genes during the production of the first 1751 unique gene knockouts. Using a standardised phenotyping platform that incorporates high-resolution 3D imaging, we identified novel phenotypes at multiple time points for previously uncharacterized genes and additional phenotypes for genes with previously reported mutant phenotypes. Unexpectedly, our analysis reveals that incomplete penetrance and variable expressivity are common even on a defined genetic background. In addition, we show that human disease genes are enriched for essential genes identified in our screen, thus providing a novel dataset that facilitates prioritization and validation of mutations identified in clinical sequencing efforts. PMID:27626380
Using multi-locus allelic sequence data to estimate genetic divergence among four Lilium (Liliaceae) cultivars

PubMed Central

Shahin, Arwa; Smulders, Marinus J. M.; van Tuyl, Jaap M.; Arens, Paul; Bakker, Freek T.

2014-01-01

Next-generation sequencing (NGS) may make it possible to estimate relationships among genotypes using allelic variation of multiple nuclear genes simultaneously. We explored the potential and caveats of this strategy in four genetically distant Lilium cultivars to estimate their genetic divergence from transcriptome sequences using three approaches: POFAD (Phylogeny of Organisms from Allelic Data, uses allelic information of sequence data), RAxML (Randomized Accelerated Maximum Likelihood, tree building based on concatenated consensus sequences) and Consensus Network (constructing a network summarizing conflicts among gene trees). Twenty-six gene contigs were chosen based on the presence of orthologous sequences in all cultivars, seven of which also had an orthologous sequence in Tulipa, used as out-group. The three approaches generated the same topology. Although the resolution offered by these approaches is high, in this case there was no extra benefit in using allelic information. We conclude that these 26 genes can be widely applied to construct a species tree for the genus Lilium. PMID:25368628
Repression by PRDM13 is critical for generating precision in neuronal identity

PubMed Central

Kollipara, Rahul K; Ma, Zhenzhong; Borromeo, Mark D; Chang, Joshua C

2017-01-01

The mechanisms that activate some genes while silencing others are critical to ensure precision in lineage specification as multipotent progenitors become restricted in cell fate. During neurodevelopment, these mechanisms are required to generate the diversity of neuronal subtypes found in the nervous system. Here we report interactions between basic helix-loop-helix (bHLH) transcriptional activators and the transcriptional repressor PRDM13 that are critical for specifying dorsal spinal cord neurons. PRDM13 inhibits gene expression programs for excitatory neuronal lineages in the dorsal neural tube. Strikingly, PRDM13 also ensures a battery of ventral neural tube specification genes such as Olig1, Olig2 and Prdm12 are excluded dorsally. PRDM13 does this via recruitment to chromatin by multiple neural bHLH factors to restrict gene expression in specific neuronal lineages. Together these findings highlight the function of PRDM13 in repressing the activity of bHLH transcriptional activators that together are required to achieve precise neuronal specification during mouse development. PMID:28850031
ExprAlign - the identification of ESTs in non-model species by alignment of cDNA microarray expression profiles

PubMed Central

2009-01-01

Background: Sequence identification of ESTs from non-model species offers distinct challenges, particularly when these species have duplicated genomes and when they are phylogenetically distant from sequenced model organisms. For the common carp, an environmental model of aquacultural interest, large numbers of ESTs remained unidentified using BLAST sequence alignment. We have used the expression profiles from large-scale microarray experiments to suggest gene identities. Results: Expression profiles from ~700 cDNA microarrays describing responses of 7 major tissues to multiple environmental stressors were used to define a co-expression landscape. This was based on the Pearson correlation coefficient relating each gene with all other genes, from which a network description provided clusters of highly correlated genes as 'mountains'. We show that these contain genes with known identities and genes with unknown identities, and that the correlation constitutes evidence of identity in the latter. This procedure has suggested identities for 522 of 2701 unknown carp EST sequences. We also discriminate several common carp genes and gene isoforms that were not discriminated by BLAST sequence alignment alone. Precision in identification was substantially improved by use of data from multiple tissues and treatments. Conclusion: The detailed analysis of co-expression landscapes is a sensitive technique for suggesting an identity for the large number of BLAST-unidentified cDNAs generated in EST projects. It is capable of detecting even subtle changes in expression profiles, and thereby of separating genes that share a common BLAST identity into different identities. It benefits from the use of multiple treatments or contrasts, and from large-scale microarray data. PMID:19939286
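The correlation-based reasoning in this abstract can be sketched in a few lines. The example below computes Pearson correlation between expression profiles and, for each unidentified EST, reports the most strongly correlated EST of known identity above a cutoff. This is a simplified nearest-neighbour stand-in for the network clustering the authors describe, not the ExprAlign procedure itself. The profile values, the EST and gene labels, the `threshold` cutoff, and the function names are all hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0  # a flat profile carries no co-expression signal
    return cov / (sd_x * sd_y)


def suggest_identities(profiles, known, threshold=0.9):
    """Map each unknown EST to (identity, r) of its best-correlated known EST."""
    suggestions = {}
    for est, profile in profiles.items():
        if est in known:
            continue
        best = None
        for other, identity in known.items():
            r = pearson(profile, profiles[other])
            if r >= threshold and (best is None or r > best[1]):
                best = (identity, r)
        if best is not None:
            suggestions[est] = best
    return suggestions


# Hypothetical expression values across five arrays.
profiles = {
    "est_known_1": [1, 2, 3, 4, 5],
    "est_known_2": [5, 1, 4, 2, 3],
    "est_unknown_1": [2, 4, 6, 8, 10.5],
    "est_unknown_2": [3, 3, 1, 5, 2],
}
known = {"est_known_1": "gene_A", "est_known_2": "gene_B"}
suggestions = suggest_identities(profiles, known)
print(suggestions)
```

Only `est_unknown_1` receives a suggestion here, because `est_unknown_2` is not positively correlated with either known profile. As the abstract notes, such a correlation is evidence toward an identity rather than proof of it.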
Disrupting the male germ line to find infertility and contraception targets.

PubMed

Archambeault, Denise R; Matzuk, Martin M

2014-05-01

Genetically manipulated mouse models have become indispensable for broadening our understanding of genes and pathways related to male germ cell development. Until suitable in vitro systems for studying spermatogenesis are perfected, in vivo models will remain the gold standard for inquiry into testicular function. Here, we discuss exciting advances that are allowing researchers faster, easier, and more customizable access to their mouse models of interest. Specifically, the trans-NIH Knockout Mouse Project (KOMP) is working to generate knockout mouse models of every gene in the mouse genome. The related Knockout Mouse Phenotyping Program (KOMP2) is performing systematic phenotypic analysis of this genome-wide collection of knockout mice, including fertility screening. Together, these programs will not only uncover new genes involved in male germ cell development but also provide the research community with the mouse models necessary for further investigations. In addition to KOMP/KOMP2, another promising development in the field of mouse models is the advent of CRISPR (clustered regularly interspaced short palindromic repeat)-Cas technology. Utilizing 20-nucleotide guide sequences, CRISPR/Cas has the potential to introduce sequence-specific insertions, deletions, and point mutations to produce null, conditional, activated, or reporter-tagged alleles. CRISPR/Cas can also successfully target multiple genes in a single experimental step, forgoing the multiple generations of breeding traditionally required to produce mouse models with deletions, insertions, or mutations in multiple genes. In addition, CRISPR/Cas can be used to create mouse models carrying variants identical to those identified in infertile human patients, providing the opportunity to explore the effects of such mutations in an in vivo system. Both the KOMP/KOMP2 projects and the CRISPR/Cas system provide powerful, accessible genetic approaches to the study of male germ cell development in the mouse. A more complete understanding of male germ cell biology is critical for the identification of novel targets for potential non-hormonal contraceptive intervention. Copyright © 2014. Published by Elsevier Masson SAS.
Integration of heterogeneous molecular networks to unravel gene-regulation in Mycobacterium tuberculosis.

PubMed

van Dam, Jesse C J; Schaap, Peter J; Martins dos Santos, Vitor A P; Suárez-Diez, María

2014-09-26

Different methods have been developed to infer regulatory networks from heterogeneous omics datasets and to construct co-expression networks. Each algorithm produces different networks and efforts have been devoted to automatically integrate them into consensus sets. However, each separate set has an intrinsic value that is diluted and partly lost when building a consensus network. Here we present a methodology to generate co-expression networks and, instead of a consensus network, we propose an integration framework where the different networks are kept and analysed with additional tools to efficiently combine the information extracted from each network. We developed a workflow to efficiently analyse information generated by different inference and prediction methods. Our methodology relies on providing the user the means to simultaneously visualise and analyse the coexisting networks generated by different algorithms, heterogeneous datasets, and a suite of analysis tools. As a showcase, we have analysed the gene co-expression networks of Mycobacterium tuberculosis generated using over 600 expression experiments. Regarding DNA damage repair, we identified SigC as a key control element, 12 new targets for LexA, an updated LexA binding motif, and a potential mismatch repair system. We expanded the DevR regulon with 27 genes while identifying 9 targets wrongly assigned to this regulon. We discovered 10 new genes linked to zinc uptake and a new regulatory mechanism for ZuR. The use of co-expression networks to perform system-level analysis allows the development of custom-made methodologies. As showcases, we implemented a pipeline to integrate ChIP-seq data and another method to uncover multiple regulatory layers. Our workflow is based on representing the multiple types of information as network representations and presenting these networks in a synchronous framework that allows their simultaneous visualization while keeping specific associations from the different networks. By simultaneously exploring these networks and metadata, we gained insights into regulatory mechanisms in M. tuberculosis that could not be obtained through the separate analysis of each data type.
Rapid Sequencing of Complete env Genes from Primary HIV-1 Samples

PubMed Central

Eren, Kemal; Ignacio, Caroline; Landais, Elise; Weaver, Steven; Phung, Pham; Ludka, Colleen; Hepler, Lance; Caballero, Gemma; Pollner, Tristan; Guo, Yan; Richman, Douglas; Poignard, Pascal; Paxinos, Ellen E.; Kosakovsky Pond, Sergei L.

2016-01-01

Abstract: The ability to study rapidly evolving viral populations has been constrained by the read length of next-generation sequencing approaches and the sampling depth of single-genome amplification methods. Here, we develop and characterize a method using Pacific Biosciences’ Single Molecule, Real-Time (SMRT®) sequencing technology to sequence multiple, intact full-length human immunodeficiency virus-1 env genes amplified from viral RNA populations circulating in blood, and provide computational tools for analyzing and visualizing these data. PMID:29492273
Genomic and Epigenomic Alterations in Cancer.

PubMed

Chakravarthi, Balabhadrapatruni V S K; Nepal, Saroj; Varambally, Sooryanarayana

2016-07-01

Multiple genetic and epigenetic events characterize tumor progression and define the identity of the tumors. Advances in high-throughput technologies, like gene expression profiling, next-generation sequencing, proteomics, and metabolomics, have enabled detailed molecular characterization of various tumors. The integration and analyses of these high-throughput data have unraveled many novel molecular aberrations and network alterations in tumors. These molecular alterations include multiple cancer-driving mutations, gene fusions, amplification, deletion, and post-translational modifications, among others. Many of these genomic events are being used in cancer diagnosis, whereas others are therapeutically targeted with small-molecule inhibitors. Multiple genes/enzymes that play a role in DNA and histone modifications are also altered in various cancers, changing the epigenomic landscape during cancer initiation and progression. Apart from protein-coding genes, studies are uncovering the critical regulatory roles played by noncoding RNAs and noncoding regions of the genome during cancer progression. Many of these genomic and epigenetic events function in tandem to drive tumor development and metastasis. Concurrent advances in genome-modulating technologies, like gene silencing and genome editing, are providing the ability to understand in detail the process of cancer initiation, progression, and signaling, as well as opening up avenues for therapeutic targeting. In this review, we discuss some of the recent advances in cancer genomic and epigenomic research. Copyright © 2016 American Society for Investigative Pathology. Published by Elsevier Inc. All rights reserved.
A non-inheritable maternal Cas9-based multiple-gene editing system in mice.

PubMed

Sakurai, Takayuki; Kamiyoshi, Akiko; Kawate, Hisaka; Mori, Chie; Watanabe, Satoshi; Tanaka, Megumu; Uetake, Ryuichi; Sato, Masahiro; Shindo, Takayuki

2016-01-28

The CRISPR/Cas9 system is capable of editing multiple genes through one-step zygote injection. The preexisting method is largely based on the co-injection of Cas9 DNA (or mRNA) and guide RNAs (gRNAs); however, it is unclear how many genes can be simultaneously edited by this method, and a reliable means to generate transgenic (Tg) animals with multiple gene editing has yet to be developed. Here, we employed non-inheritable maternal Cas9 (maCas9) protein derived from Tg mice with systemic Cas9 overexpression (Cas9 mice). The maCas9 protein in zygotes derived from mating or in vitro fertilization of Tg/+ oocytes and +/+ sperm could successfully edit the target genome. The efficiency of such maCas9-based genome editing was comparable to that of zygote microinjection-based genome editing widely used at present. Furthermore, we demonstrated a novel approach to create "Cas9 transgene-free" gene-modified mice using non-Tg (+/+) zygotes carrying maCas9. The maCas9 protein in mouse zygotes edited nine target loci simultaneously after injection with nine different gRNAs alone. Cas9 mouse-derived zygotes have the potential to facilitate the creation of genetically modified animals carrying the Cas9 transgene, enabling repeatable genome engineering and the production of Cas9 transgene-free mice.
Ectopic Integration of Transforming DNA Is Rare among Neurospora Transformants Selected for Gene Replacement

PubMed Central

Miao, V. P. W.; Rountree, M. R.; Selker, E. U.

1995-01-01

In a variety of organisms, DNA-mediated transformation experiments commonly produce transformants with multiple copies of the transforming DNA, including both selected and unselected molecules. Such "cotransformants" are much more common than expected from the individual transformation frequencies, suggesting that subpopulations of cells, or nuclei, are particularly competent for transformation. We found that Neurospora crassa transformants selected for gene replacement at the am gene had not efficiently incorporated additional DNA, suggesting that nuclei that undergo transformation by homologous recombination are not highly competent at integration of DNA by illegitimate recombination. Spheroplasts were treated with DNA fragments homologous to am and with an Escherichia coli hph plasmid. Transformants were initially selected for hph (hygromycin(R)), allowed to conidiate to generate homokaryons and then selected for either Am(-) (gene replacements) or hph. Surprisingly, most am replacement strains were hygromycin(S) (124/140) and carried no extraneous DNA (116/140). Most transformants selected for hph also had ectopic copies of am DNA and/or multiple copies of hph sequences (32/35), generally at multiple sites, confirming that efficient cotransformation could occur. To test the implication that cotransformation involving gene replacement and ectopic integration is rare, we compared the yields of am replacement strains with or without prior selection for hph. The initial selection did not appreciably help (or hinder) recovery of strains with replacements. PMID:7789758
Conversion events in gene clusters

PubMed Central

2011-01-01

Background: Gene clusters containing multiple similar genomic regions in close proximity are of great interest for biomedical studies because of their associations with inherited diseases. However, such regions are difficult to analyze due to their structural complexity and their complicated evolutionary histories, reflecting a variety of large-scale mutational events. In particular, conversion events can mislead inferences about the relationships among these regions, as traced by traditional methods such as construction of phylogenetic trees or multi-species alignments. Results: To correct the distorted information generated by such methods, we have developed an automated pipeline called CHAP (Cluster History Analysis Package) for detecting conversion events. We used this pipeline to analyze the conversion events that affected two well-studied gene clusters (α-globin and β-globin) and three gene clusters for which comparative sequence data were generated from seven primate species: CCL (chemokine ligand), IFN (interferon), and CYP2abf (part of cytochrome P450 family 2). CHAP is freely available at http://www.bx.psu.edu/miller_lab. Conclusions: These studies reveal the value of characterizing conversion events in the context of studying gene clusters in complex genomes. PMID:21798034
The analysis of genomic structures in the L1 family of cell adhesion molecules provides no evidence for exon shuffling events after the separation of arthropod and chordate lineages.

PubMed

Zhao, G; Hortsch, M

1998-07-17

Members of the L1 family of neural cell adhesion molecules consist of multiple extracellular immunoglobulin and fibronectin type III domains that mediate the adhesive properties of this group of transmembrane proteins. In vertebrate genomes, these protein domains are separated by introns, and it has been suggested that L1-type genes might have been subject to exon-shuffling events during evolution. However, comparison of the human L1-CAM and the chicken neurofascin gene with the genomic structure of their Drosophila homologue, neuroglian, indicates that no major rearrangement of protein domains has taken place subsequent to the split of the arthropod and chordate phyla. The Drosophila neuroglian gene appears to have lost most of the introns that have been conserved in the human L1-CAM and the chicken neurofascin gene. Nevertheless, exon shuffling or the generation of new exons by mutational changes might have been responsible for the generation of additional, alternatively spliced exons in L1-type genes.
Next-generation sequencing to solve complex inherited retinal dystrophy: A case series of multiple genes contributing to disease in extended families.

PubMed

Jones, Kaylie D; Wheaton, Dianna K; Bowne, Sara J; Sullivan, Lori S; Birch, David G; Chen, Rui; Daiger, Stephen P

2017-01-01

With recent availability of next-generation sequencing (NGS), it is becoming more common to pursue disease-targeted panel testing rather than traditional sequential gene-by-gene dideoxy sequencing. In this report, we describe using NGS to identify multiple disease-causing mutations that contribute concurrently or independently to retinal dystrophy in three relatively small families. Family members underwent comprehensive visual function evaluations, and genetic counseling including a detailed family history. A preliminary genetic inheritance pattern was assigned and updated as additional family members were tested. Family 1 (FAM1) and Family 2 (FAM2) were clinically diagnosed with retinitis pigmentosa (RP) and had a suspected autosomal dominant pedigree with non-penetrance (n.p.). Family 3 (FAM3) consisted of a large family with a diagnosis of RP and an overall dominant pedigree, but the proband had phenotypically cone-rod dystrophy. Initial genetic analysis was performed on one family member with traditional Sanger single gene sequencing and/or panel-based testing, and ultimately, retinal gene-targeted NGS was required to identify the underlying cause of disease for individuals within the three families. Results obtained in these families necessitated further genetic and clinical testing of additional family members to determine the complex genetic and phenotypic etiology of each family. Genetic testing of FAM1 (n = 4 affected; 1 n.p.) identified a dominant mutation in RP1 (p.Arg677Ter) that was present for two of the four affected individuals but absent in the proband and the presumed non-penetrant individual. Retinal gene-targeted NGS in the fourth affected family member revealed compound heterozygous mutations in USH2A (p. Cys419Phe, p.Glu767Serfs*21). Genetic testing of FAM2 (n = 3 affected; 1 n.p.) identified three retinal dystrophy genes ( PRPH2 , PRPF8 , and USH2A ) with disease-causing mutations in varying combinations among the affected family members. 
Genetic testing of FAM3 (n = 7 affected) identified a mutation in PRPH2 (p.Pro216Leu) tracking with disease in six of the seven affected individuals. Additional retinal gene-targeted NGS testing determined that the proband also harbored a multiple exon deletion in the CRX gene likely accounting for her cone-rod phenotype; her son harbored only the mutation in CRX, not the familial mutation in PRPH2. Multiple genes contributing to the retinal dystrophy genotypes within a family were discovered using retinal gene-targeted NGS. Families with noted examples of phenotypic variation or apparent non-penetrant individuals may offer a clue to suspect complex inheritance. Furthermore, this finding underscores that caution should be taken when attributing a single gene disease-causing mutation (or inheritance pattern) to a family as a whole. Identification of a disease-causing mutation in a proband, even with a clear inheritance pattern in hand, may not be sufficient for targeted, known mutation analysis in other family members.
Overexpressing the Multiple-Stress Responsive Gene At1g74450 Reduces Plant Height and Male Fertility in Arabidopsis thaliana

PubMed Central

Visscher, Anne M.; Belfield, Eric J.; Vlad, Daniela; Irani, Niloufer; Moore, Ian; Harberd, Nicholas P.

2015-01-01

A subset of genes in Arabidopsis thaliana is known to be up-regulated in response to a wide range of different environmental stress factors. However, not all of these genes are characterized as yet with respect to their functions. In this study, we used transgenic knockout, overexpression and reporter gene approaches to try to elucidate the biological roles of five unknown multiple-stress responsive genes in Arabidopsis. The selected genes have the following locus identifiers: At1g18740, At1g74450, At4g27652, At4g29780 and At5g12010. Firstly, T-DNA insertion knockout lines were identified for each locus and screened for altered phenotypes. None of the lines were found to be visually different from wildtype Col-0. Secondly, 35S-driven overexpression lines were generated for each open reading frame. Analysis of these transgenic lines showed altered phenotypes for lines overexpressing the At1g74450 ORF. Plants overexpressing the multiple-stress responsive gene At1g74450 are stunted in height and have reduced male fertility. Alexander staining of anthers from flowers at developmental stage 12–13 showed either an absence or a reduction in viable pollen compared to wildtype Col-0 and At1g74450 knockout lines. Interestingly, the effects of stress on crop productivity are most severe at developmental stages such as male gametophyte development. However, the molecular factors and regulatory networks underlying environmental stress-induced male gametophytic alterations are still largely unknown. Our results indicate that the At1g74450 gene provides a potential link between multiple environmental stresses, plant height and pollen development. In addition, ruthenium red staining analysis showed that At1g74450 may affect the composition of the inner seed coat mucilage layer. Finally, C-terminal GFP fusion proteins for At1g74450 were shown to localise to the cytosol. PMID:26485022

Multi site polyadenylation and transcriptional response to stress of a vacuolar type H+-ATPase subunit A gene in Arabidopsis thaliana

PubMed Central

Magnotta, Scot M; Gogarten, Johann Peter

2002-01-01

Background Vacuolar type H+-ATPases play a critical role in the maintenance of vacuolar homeostasis in plant cells. V-ATPases are also involved in plants' defense against environmental stress. This research examined the expression and regulation of the catalytic subunit of the vacuolar type H+-ATPase in Arabidopsis thaliana and the effect of environmental stress on multiple transcripts generated by this gene. Results Evidence suggests that subunit A of the vacuolar type H+-ATPase is encoded by a single gene in Arabidopsis thaliana. Genome blot analysis showed no indication of a second subunit A gene being present. The single gene identified was shown by whole RNA blot analysis to be transcribed in all organs of the plant. Subunit A was shown by sequencing the 3' end of multiple cDNA clones to exhibit multi site polyadenylation. Four different poly(A) tail attachment sites were revealed. Experiments were performed to determine the response of transcript levels for subunit A to environmental stress. A PCR-based strategy was devised to amplify the four different transcripts from the subunit A gene. Conclusions Amplification of cDNA generated from seedlings exposed to cold, salt stress, and etiolation showed that transcript levels for subunit A of the vacuolar type H+-ATPase in Arabidopsis were responsive to stress conditions. Cold and salt stress resulted in a 2- to 4-fold increase in all four subunit A transcripts evaluated. Etiolation resulted in a slight increase in transcript levels. All four transcripts appeared to behave identically with respect to stress conditions tested with no significant differential regulation. PMID:11985780
A two-step hierarchical hypothesis set testing framework, with applications to gene expression data on ordered categories

PubMed Central

2014-01-01

Background In complex large-scale experiments, in addition to simultaneously considering a large number of features, multiple hypotheses are often being tested for each feature. This leads to a problem of multi-dimensional multiple testing. For example, in gene expression studies over ordered categories (such as time-course or dose-response experiments), interest is often in testing differential expression across several categories for each gene. In this paper, we consider a framework for testing multiple sets of hypotheses, which can be applied to a wide range of problems. Results We adopt the concept of the overall false discovery rate (OFDR) for controlling false discoveries on the hypothesis set level. Based on an existing procedure for identifying differentially expressed gene sets, we discuss a general two-step hierarchical hypothesis set testing procedure, which controls the overall false discovery rate under independence across hypothesis sets. In addition, we discuss the concept of the mixed-directional false discovery rate (mdFDR), and extend the general procedure to enable directional decisions for two-sided alternatives. We applied the framework to the case of microarray time-course/dose-response experiments, and proposed three procedures for testing differential expression and making multiple directional decisions for each gene. Simulation studies confirm the control of the OFDR and mdFDR by the proposed procedures under independence and positive correlations across genes. Simulation results also show that two of our new procedures achieve higher power than previous methods. Finally, the proposed methodology is applied to a microarray dose-response study, to identify 17β-estradiol-sensitive genes in breast cancer cells that are induced at low concentrations. Conclusions The framework we discuss provides a platform for multiple testing procedures covering situations involving two (or potentially more) sources of multiplicity.
The framework is easy to use and adaptable to various practical settings that frequently occur in large-scale experiments. Procedures generated from the framework are shown to maintain control of the OFDR and mdFDR, quantities that are especially relevant in the case of multiple hypothesis set testing. The procedures work well in both simulations and real datasets, and are shown to have better power than existing methods. PMID:24731138
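The two-step idea in this abstract (screen whole hypothesis sets first, then test individual hypotheses only inside the sets that pass) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' exact procedures: the screening statistic (a Bonferroni-combined minimum p-value per set), the within-set Bonferroni rule, and the names `benjamini_hochberg` and `two_step_test` are all choices made for this sketch, and the p-values are invented.

```python
from typing import Dict, List


def benjamini_hochberg(pvals: List[float], q: float) -> List[bool]:
    """Return one rejection flag per p-value using the BH step-up rule at level q."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    # Find the largest 1-based rank k with p_(k) <= k * q / m.
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if pvals[idx] <= rank * q / m:
            k_max = rank
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= k_max:
            rejected[idx] = True
    return rejected


def two_step_test(sets: Dict[str, List[float]], q: float) -> Dict[str, List[bool]]:
    """Screen hypothesis sets, then test hypotheses inside the selected sets."""
    names = list(sets)
    # Step 1 (assumed screening rule): Bonferroni-combined minimum p-value per set,
    # followed by BH across sets at level q.
    screen = [min(1.0, len(sets[n]) * min(sets[n])) for n in names]
    selected = benjamini_hochberg(screen, q)
    n_selected = sum(selected)

    results: Dict[str, List[bool]] = {}
    for name, keep in zip(names, selected):
        if not keep:
            continue
        # Step 2 (assumed within-set rule): Bonferroni at the reduced level R * q / n,
        # where R is the number of selected sets and n the total number of sets.
        level = n_selected * q / len(names)
        threshold = level / len(sets[name])
        results[name] = [p <= threshold for p in sets[name]]
    return results


# Hypothetical p-values: one list per gene, one entry per dose comparison.
example_sets = {
    "geneA": [0.0001, 0.2, 0.03],
    "geneB": [0.4, 0.6, 0.5],
    "geneC": [0.002, 0.001, 0.9],
    "geneD": [0.3, 0.7, 0.04],
}
example_result = two_step_test(example_sets, q=0.05)
print(example_result)
```

Genes that fail the screening step do not appear in the result at all, which is the point of the hierarchy: within-set decisions are reported only for sets that were themselves declared significant.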
Comparative description of ten transcriptomes of newly sequenced invertebrates and efficiency estimation of genomic sampling in non-model taxa

PubMed Central

2012-01-01

Introduction Traditionally, genomic or transcriptomic data have been restricted to a few model or emerging model organisms, and to a handful of species of medical and/or environmental importance. Next-generation sequencing techniques have the capability of yielding massive amounts of gene sequence data for virtually any species at a modest cost. Here we provide a comparative analysis of de novo assembled transcriptomic data for ten non-model species of previously understudied animal taxa. Results cDNA libraries of ten species belonging to five animal phyla (2 Annelida [including Sipuncula], 2 Arthropoda, 2 Mollusca, 2 Nemertea, and 2 Porifera) were sequenced in different batches with an Illumina Genome Analyzer II (read length 100 or 150 bp), rendering between ca. 25 and 52 million reads per species. Read thinning, trimming, and de novo assembly were performed under different parameters to optimize output. Between 67,423 and 207,559 contigs were obtained across the ten species, post-optimization. Of those, 9,069 to 25,681 contigs retrieved blast hits against the NCBI non-redundant database, and approximately 50% of these were assigned with Gene Ontology terms, covering all major categories, and with similar percentages in all species. Local blasts against our datasets, using selected genes from major signaling pathways and housekeeping genes, revealed high efficiency in gene recovery compared to available genomes of closely related species. Intriguingly, our transcriptomic datasets detected multiple paralogues in all phyla and in nearly all gene pathways, including housekeeping genes that are traditionally used in phylogenetic applications for their purported single-copy nature. 
Conclusions We generated the first study of comparative transcriptomics across multiple animal phyla (comparing two species per phylum in most cases), established the first Illumina-based transcriptomic datasets for sponge, nemertean, and sipunculan species, and generated a tractable catalogue of annotated genes (or gene fragments) and protein families for ten newly sequenced non-model organisms, some of commercial importance (i.e., Octopus vulgaris). These comprehensive sets of genes can be readily used for phylogenetic analysis, gene expression profiling, developmental analysis, and can also be a powerful resource for gene discovery. The characterization of the transcriptomes of such a diverse array of animal species permitted the comparison of sequencing depth, functional annotation, and efficiency of genomic sampling using the same pipelines, which proved to be similar for all considered species. In addition, the datasets revealed their potential as a resource for paralogue detection, a recurrent concern in various aspects of biological inquiry, including phylogenetics, molecular evolution, development, and cellular biochemistry. PMID:23190771
Methylation analysis of plasma cell-free DNA for breast cancer early detection using bisulfite next-generation sequencing.

PubMed

Li, Zibo; Guo, Xinwu; Tang, Lili; Peng, Limin; Chen, Ming; Luo, Xipeng; Wang, Shouman; Xiao, Zhi; Deng, Zhongping; Dai, Lizhong; Xia, Kun; Wang, Jun

2016-10-01

Circulating cell-free DNA (cfDNA) has been considered as a potential biomarker for non-invasive cancer detection. To evaluate the methylation levels of six candidate genes (EGFR, GREM1, PDGFRB, PPM1E, SOX17, and WRN) in plasma cfDNA as biomarkers for breast cancer early detection, quantitative analysis of the promoter methylation of these genes from 86 breast cancer patients and 67 healthy controls was performed by using microfluidic-PCR-based target enrichment and next-generation bisulfite sequencing technology. The predictive performance of different logistic models based on methylation status of candidate genes was investigated by means of the area under the ROC curve (AUC) and odds ratio (OR) analysis. Results revealed that EGFR, PPM1E, and 8 gene-specific CpG sites showed significant hypermethylation in cancer patients' plasma and were significantly associated with breast cancer (OR ranging from 2.51 to 9.88). The AUC values for these biomarkers ranged from 0.66 to 0.75. Combinations of multiple hypermethylated genes or CpG sites substantially improved the predictive performance for breast cancer detection. Our study demonstrated the feasibility of quantitative measurement of candidate gene methylation in cfDNA by using microfluidic-PCR-based target enrichment and bisulfite next-generation sequencing, which is worthy of further validation and potentially benefits a broad range of applications in clinical oncology practice. Quantitative analysis of methylation pattern of plasma cfDNA by next-generation sequencing might be a valuable non-invasive tool for early detection of breast cancer.
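The two performance measures named in this abstract, AUC and odds ratio, can be illustrated with a short sketch. The scores and the 2x2 counts below are hypothetical and chosen only so that the group sizes match the reported cohort (86 patients, 67 controls); the function names are likewise assumptions for illustration and do not reproduce the study's logistic models.

```python
from typing import List


def auc_mann_whitney(cases: List[float], controls: List[float]) -> float:
    """AUC as the probability that a random case scores above a random control.

    Ties between a case and a control count as half a win.
    """
    wins = 0.0
    for case_score in cases:
        for control_score in controls:
            if case_score > control_score:
                wins += 1.0
            elif case_score == control_score:
                wins += 0.5
    return wins / (len(cases) * len(controls))


def odds_ratio(exposed_cases: int, unexposed_cases: int,
               exposed_controls: int, unexposed_controls: int) -> float:
    """Odds ratio from a 2x2 table (here, exposure = hypermethylated marker)."""
    return (exposed_cases * unexposed_controls) / (unexposed_cases * exposed_controls)


# Hypothetical methylation scores for a handful of samples.
case_scores = [0.8, 0.6, 0.7, 0.3]
control_scores = [0.2, 0.4, 0.5, 0.1]
example_auc = auc_mann_whitney(case_scores, control_scores)

# Hypothetical counts: 30 of 86 patients and 10 of 67 controls hypermethylated.
example_or = odds_ratio(30, 56, 10, 57)

print(f"AUC = {example_auc:.3f}, OR = {example_or:.2f}")
```

An AUC of 0.5 corresponds to a marker that does not separate the groups, and an odds ratio of 1 to no association, which is the scale against which the reported ranges (AUC 0.66 to 0.75, OR 2.51 to 9.88) are read.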
Extensive diversification of IgD-, IgY-, and truncated IgY(ΔFc)-encoding genes in the red-eared turtle (Trachemys scripta elegans).

PubMed

Li, Lingxiao; Wang, Tao; Sun, Yi; Cheng, Gang; Yang, Hui; Wei, Zhiguo; Wang, Ping; Hu, Xiaoxiang; Ren, Liming; Meng, Qingyong; Zhang, Ran; Guo, Ying; Hammarström, Lennart; Li, Ning; Zhao, Yaofeng

2012-10-15

IgY(ΔFc), containing only CH1 and CH2 domains, is expressed in the serum of some birds and reptiles, such as ducks and turtles. The duck IgY(ΔFc) is produced by the same υ gene that expresses the intact IgY form (CH1-4) using different transcriptional termination sites. In this study, we show that intact IgY and IgY(ΔFc) are encoded by distinct genes in the red-eared turtle (Trachemys scripta elegans). At least eight IgY and five IgY(ΔFc) transcripts were found in a single turtle. Together with Southern blotting, our data suggest that multiple genes encoding both IgY forms are present in the turtle genome. Both of the IgY forms were detected in the serum using rabbit polyclonal Abs. In addition, we show that multiple copies of the turtle δ gene are present in the genome and that alternative splicing is extensively involved in the generation of both the secretory and membrane-bound forms of the IgD H chain transcripts. Although a single μ gene was identified, the α gene was not identified in this species.
Identification of evolutionarily conserved DNA damage response genes that alter sensitivity to cisplatin

PubMed Central

Gaponova, Anna V.; Deneka, Alexander Y.; Beck, Tim N.; Liu, Hanqing; Andrianov, Gregory; Nikonova, Anna S.; Nicolas, Emmanuelle; Einarson, Margret B.; Golemis, Erica A.; Serebriiskii, Ilya G.

2017-01-01

Ovarian, head and neck, and other cancers are commonly treated with cisplatin and other DNA damaging cytotoxic agents. Altered DNA damage response (DDR) contributes to resistance of these tumors to chemotherapies, some targeted therapies, and radiation. DDR involves multiple protein complexes and signaling pathways, some of which are evolutionarily ancient and involve protein orthologs conserved from yeast to humans. To identify new regulators of cisplatin-resistance in human tumors, we integrated high throughput and curated datasets describing yeast genes that regulate sensitivity to cisplatin and/or ionizing radiation. Next, we clustered highly validated genes based on chemogenomic profiling, and then mapped orthologs of these genes in expanded genomic networks for multiple metazoans, including humans. This approach identified an enriched candidate set of genes involved in the regulation of resistance to radiation and/or cisplatin in humans. Direct functional assessment of selected candidate genes using RNA interference confirmed their activity in influencing cisplatin resistance, degree of γH2AX focus formation and ATR phosphorylation, in ovarian and head and neck cancer cell lines, suggesting impaired DDR signaling as the driving mechanism. This work enlarges the set of genes that may contribute to chemotherapy resistance and provides a new contextual resource for interpreting next generation sequencing (NGS) genomic profiling of tumors. PMID:27863405
A Guide to Approaching Regulatory Considerations for Lentiviral-Mediated Gene Therapies.

PubMed

White, Michael; Whittaker, Roger; Gándara, Carolina; Stoll, Elizabeth A

2017-08-01

Lentiviral vectors are increasingly the gene transfer tool of choice for gene or cell therapies, with multiple clinical investigations showing promise for this viral vector in terms of both safety and efficacy. The third-generation vector system is well characterized, effectively delivers genetic material and maintains long-term stable expression in target cells, delivers larger amounts of genetic material than other methods, is nonpathogenic, and does not cause an inflammatory response in the recipient. This report aims to help academic scientists and regulatory managers negotiate the governance framework to achieve successful translation of a lentiviral vector-based gene therapy. The focus is on European regulations and how they are administered in the United Kingdom, although many of the principles will be similar for other regions, including the United States. The report justifies the rationale for using third-generation lentiviral vectors to achieve gene delivery for in vivo and ex vivo applications; briefly summarizes the extant regulatory guidance for gene therapies, categorized as advanced therapeutic medicinal products (ATMPs); provides guidance on specific regulatory issues regarding gene therapies; presents an overview of the key stakeholders to be approached when pursuing clinical trials authorization for an ATMP; and includes a brief catalogue of the documentation required to submit an application for regulatory approval of a new gene therapy.
A Guide to Approaching Regulatory Considerations for Lentiviral-Mediated Gene Therapies

PubMed Central

White, Michael; Whittaker, Roger; Gándara, Carolina; Stoll, Elizabeth A.

2017-01-01

Lentiviral vectors are increasingly the gene transfer tool of choice for gene or cell therapies, with multiple clinical investigations showing promise for this viral vector in terms of both safety and efficacy. The third-generation vector system is well characterized, effectively delivers genetic material and maintains long-term stable expression in target cells, delivers larger amounts of genetic material than other methods, is nonpathogenic, and does not cause an inflammatory response in the recipient. This report aims to help academic scientists and regulatory managers negotiate the governance framework to achieve successful translation of a lentiviral vector-based gene therapy. The focus is on European regulations and how they are administered in the United Kingdom, although many of the principles will be similar for other regions, including the United States. The report justifies the rationale for using third-generation lentiviral vectors to achieve gene delivery for in vivo and ex vivo applications; briefly summarizes the extant regulatory guidance for gene therapies, categorized as advanced therapeutic medicinal products (ATMPs); provides guidance on specific regulatory issues regarding gene therapies; presents an overview of the key stakeholders to be approached when pursuing clinical trials authorization for an ATMP; and includes a brief catalogue of the documentation required to submit an application for regulatory approval of a new gene therapy. PMID:28817344
Gene Therapy with the Sleeping Beauty Transposon System.

PubMed

Kebriaei, Partow; Izsvák, Zsuzsanna; Narayanavari, Suneel A; Singh, Harjeet; Ivics, Zoltán

2017-11-01

The widespread clinical implementation of gene therapy requires the ability to stably integrate genetic information through gene transfer vectors in a safe, effective, and economical manner. The latest generation of Sleeping Beauty (SB) transposon vectors fulfills these requirements, and may overcome limitations associated with viral gene transfer vectors and transient nonviral gene delivery approaches that are prevalent in ongoing clinical trials. The SB system enables high-level stable gene transfer and sustained transgene expression in multiple primary human somatic cell types, thereby representing a highly attractive gene transfer strategy for clinical use. Here, we review the most important aspects of using SB for gene therapy, including vectorization as well as genomic integration features. We also illustrate the path to successful clinical implementation by highlighting the application of chimeric antigen receptor (CAR)-modified T cells in cancer immunotherapy. Copyright © 2017 Elsevier Ltd. All rights reserved.
Elimination of both E1 and E2 from adenovirus vectors further improves prospects for in vivo human gene therapy.

PubMed Central

Gorziglia, M I; Kadan, M J; Yei, S; Lim, J; Lee, G M; Luthra, R; Trapnell, B C

1996-01-01

A novel recombinant adenovirus vector, Av3nBg, was constructed with deletions in adenovirus E1, E2a, and E3 regions and expressing a beta-galactosidase reporter gene. Av3nBg can be propagated at a high titer in a corresponding A549-derived cell line, AE1-2a, which contains the adenovirus E1 and E2a region genes inducibly expressed from separate glucocorticoid-responsive promoters. Av3nBg demonstrated gene transfer and expression comparable to that of Av1nBg, a first-generation adenovirus vector with deletions in E1 and E3. Several lines of evidence suggest that this vector is significantly more attenuated than E1 and E3 deletion vectors. Metabolic DNA labeling studies showed no detectable de novo vector DNA synthesis or accumulation, and metabolic protein labeling demonstrated no detectable de novo hexon protein synthesis for Av3nBg in naive A549 cells even at a multiplicity of infection of up to 3,000 PFU per cell. Additionally, naive A549 cells infected by Av3nBg did not accumulate infectious virions. In contrast, both Av1nBg and Av2Lu vectors showed DNA replication and hexon protein synthesis at multiplicities of infection of 500 PFU per cell. Av2Lu has a deletion in E1 and also carries a temperature-sensitive mutation in E2a. Thus, molecular characterization has demonstrated that the Av3nBg vector is improved with respect to the potential for vector DNA replication and hexon protein expression compared with both first-generation (Av1nBg) and second-generation (Av2Lu) adenoviral vectors. These observations may have important implications for potential use of adenovirus vectors in human gene therapy. PMID:8648763
Identification of a Hyphantria cunea nucleopolyhedrovirus (NPV) gene that is involved in global protein synthesis shutdown and restricted Bombyx mori NPV multiplication in a B. mori cell line.

PubMed

Shirata, Noriko; Ikeda, Motoko; Kobayashi, Michihiro

2010-03-15

We previously demonstrated that Bombyx mori nucleopolyhedrovirus (BmNPV) multiplication is restricted in permissive BmN-4 cells upon coinfection with Hyphantria cunea NPV (HycuNPV). Here, we show that HycuNPV-encoded hycu-ep32 gene is responsible for the restricted BmNPV multiplication in HycuNPV-coinfected BmN-4 cells. The only homologue for hycu-ep32 is in Orgyia pseudotsugata NPV. hycu-ep32 could encode a polypeptide of 312 amino acids, and it contains no characteristic domains or motifs to suggest its possible functions. hycu-ep32 is an early gene, and Hycu-EP32 expression reaches a maximum by 6 h postinfection. hycu-ep32-defective HycuNPV, vHycuDeltaep32, was generated, indicating that hycu-ep32 is nonessential in permissive SpIm cells. In BmN-4 cells, HycuNPV infection resulted in a severe global protein synthesis shutdown, while vHycuDeltaep32 did not cause any specific protein synthesis shutdown. These results indicate that the restriction of BmNPV multiplication by HycuNPV is caused by a global protein synthesis shutdown induced by hycu-ep32 upon coinfection with HycuNPV. Copyright 2009 Elsevier Inc. All rights reserved.
The evolution of resistance genes in multi-protein plant resistance systems.

PubMed

Friedman, Aaron R; Baker, Barbara J

2007-12-01

The genomic perspective aids in integrating the analysis of single resistance (R-) genes into a higher order model of complex plant resistance systems. The majority of R-genes encode a class of proteins with nucleotide binding (NB) and leucine-rich repeat (LRR) domains. Several R-proteins act in multi-protein R-complexes that mediate interaction with pathogen effectors to induce resistance signaling. The complexity of these systems seems to have resulted from multiple rounds of plant-pathogen co-evolution. R-gene evolution is thought to be facilitated by the formation of R-gene clusters, which permit sequence exchanges via recombinatorial mispairing and generate high haplotypic diversity. This pattern of evolution may also generate diversity at other loci that contribute to the R-complex. The rate of recombination at R-clusters is not necessarily homogeneous or consistent over evolutionary time: recent evidence suggests that recombination at R-clusters is increased following pathogen infection, suggesting a mechanism that induces temporary genome instability in response to extreme stress. DNA methylation and chromatin modifications may allow this instability to be conditionally regulated and targeted to specific genome regions. Knowledge of natural R-gene evolution may contribute to strategies for artificial evolution of novel resistance specificities.
Spectrum of mutations in leiomyosarcomas identified by clinical targeted next-generation sequencing.

PubMed

Lee, Paul J; Yoo, Naomi S; Hagemann, Ian S; Pfeifer, John D; Cottrell, Catherine E; Abel, Haley J; Duncavage, Eric J

2017-02-01

Recurrent genomic mutations in uterine and non-uterine leiomyosarcomas have not been well established. Using a next generation sequencing (NGS) panel of common cancer-associated genes, 25 leiomyosarcomas arising from multiple sites were examined to explore genetic alterations, including single nucleotide variants (SNV), small insertions/deletions (indels), and copy number alterations (CNA). Sequencing showed 86 non-synonymous, coding region somatic variants within 151 gene targets in 21 cases, with a mean of 4.1 variants per case; 4 cases had no putative mutations in the panel of genes assayed. The most frequently altered genes were TP53 (36%), ATM and ATRX (16%), and EGFR and RB1 (12%). CNA were identified in 85% of cases, with the most frequent copy number losses observed in chromosomes 10 and 13 including PTEN and RB1; the most frequent gains were seen in chromosomes 7 and 17. Our data show that deletions in canonical cancer-related genes are common in leiomyosarcomas. Further, the spectrum of gene mutations observed shows that defects in DNA repair and chromosomal maintenance are central to the biology of leiomyosarcomas, and that activating mutations observed in other common cancer types are rare in leiomyosarcomas. Copyright © 2017 Elsevier Inc. All rights reserved.
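The summary figures in this abstract can be cross-checked with a few lines of arithmetic. The sketch below assumes that the gene frequencies are fractions of all 25 cases and that ATM and ATRX (and likewise EGFR and RB1) are each altered at the stated percentage; neither assumption is spelled out in the abstract.

```python
total_cases = 25
cases_with_variants = 21
somatic_variants = 86

# Mean variants per case among the cases that carried at least one variant.
mean_variants = somatic_variants / cases_with_variants

# Cases with no putative mutation in the assayed panel.
cases_without_variants = total_cases - cases_with_variants

# Assumption: each percentage refers to all 25 cases.
gene_frequency_percent = {"TP53": 36, "ATM": 16, "ATRX": 16, "EGFR": 12, "RB1": 12}
implied_case_counts = {
    gene: round(percent * total_cases / 100)
    for gene, percent in gene_frequency_percent.items()
}

print(round(mean_variants, 1), cases_without_variants, implied_case_counts)
```

Under these assumptions every percentage maps to a whole number of cases, which is consistent with a denominator of 25 for the gene frequencies.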
Bypass of lethality with mosaic mice generated by Cre-loxP-mediated recombination.

PubMed

Betz, U A; Vosshenrich, C A; Rajewsky, K; Müller, W

1996-10-01

The analysis of gene function based on the generation of mutant mice by homologous recombination in embryonic stem cells is limited if gene disruption results in embryonic lethality. Mosaic mice, which contain a certain proportion of mutant cells in all organs, allow lethality to be circumvented and the potential of mutant cells to contribute to different cell lineages to be analyzed. To generate mosaic animals, we used the bacteriophage P1-derived Cre-loxP recombination system, which allows gene alteration by Cre-mediated deletion of loxP-flanked gene segments. We generated nestin-cre transgenic mouse lines, which expressed the Cre recombinase under the control of the rat nestin promoter and its second intron enhancer. In crosses to animals carrying a loxP-flanked target gene, partial deletion of the loxP-flanked allele occurred before day 10.5 post coitum and was detectable in all adult organs examined, including germ-line cells. Using this approach, we generated mosaic mice containing cells deficient in the gamma-chain of the interleukin-2 receptor (IL-2R gamma); in these animals, the IL-2R gamma-deficient cells were underrepresented in the thymus and spleen. Because mice deficient in DNA polymerase beta die perinatally, we studied the effects of DNA polymerase beta deficiency in mosaic animals. We found that some of the mosaic polymerase beta-deficient animals were viable, but were often reduced in size and weight. The fraction of DNA polymerase beta-deficient cells in mosaic embryos decreased during embryonic development, presumably because wild-type cells had a competitive advantage. The nestin-cre transgenic mice can be used to generate mosaic animals in which target genes are mutated by Cre-mediated recombination of loxP-flanked target genes. By using mosaic animals, embryonic lethality can be bypassed and cell lineages for whose development a given target gene is critical can be identified. 
In the case of DNA polymerase beta, deficient cells are already selected against during embryonic development, demonstrating the general importance of this protein in multiple cell types.
CLINICAL PROGRESS IN INHERITED RETINAL DEGENERATIONS: GENE THERAPY CLINICAL TRIALS AND ADVANCES IN GENETIC SEQUENCING

PubMed Central

HAFLER, BRIAN P.

2017-01-01

Purpose Inherited retinal dystrophies are a significant cause of vision loss and are characterized by the loss of photoreceptors and the retinal pigment epithelium (RPE). Mutations in approximately 250 genes cause inherited retinal degenerations with a high degree of genetic heterogeneity. New techniques in next-generation sequencing are allowing the comprehensive analysis of all retinal disease genes, thus changing the approach to the molecular diagnosis of inherited retinal dystrophies. This review serves to analyze clinical progress in genetic diagnostic testing and implications for retinal gene therapy. Methods A literature search of PubMed and OMIM was conducted to identify relevant articles on inherited retinal dystrophies. Results Next-generation genetic sequencing allows the simultaneous analysis of all the approximately 250 genes that cause inherited retinal dystrophies. Reported diagnostic rates are high, ranging from 51% to 57%. These new sequencing tools are highly accurate with sensitivities of 97.9% and specificities of 100%. Retinal gene therapy clinical trials are underway for multiple genes including RPE65, ABCA4, CHM, RS1, MYO7A, CNGA3, CNGB3, ND4, and MERTK for which a molecular diagnosis may be beneficial for patients. Conclusion Comprehensive next-generation genetic sequencing of all retinal dystrophy genes is changing the paradigm for how retinal specialists perform genetic testing for inherited retinal degenerations. Not only are high diagnostic yields obtained, but mutations in genes with novel clinical phenotypes are also identified. In the era of retinal gene therapy clinical trials, identifying specific genetic defects will increasingly be of use to identify patients who may enroll in clinical studies and benefit from novel therapies. PMID:27753762
Lignocellulosic sugar management for xylitol and ethanol fermentation with multiple cell recycling by Kluyveromyces marxianus IIPE453.

PubMed

Dasgupta, Diptarka; Ghosh, Debashish; Bandhu, Sheetal; Adhikari, Dilip K

2017-07-01

Optimum utilization of fermentable sugars from lignocellulosic biomass to deliver multiple products under the biorefinery concept is reported in this work. Alcohol fermentation was carried out with multiple cell recycling of Kluyveromyces marxianus IIPE453. The yeast utilized the xylose-rich fraction from acid- and steam-treated biomass for cell generation and xylitol production with an average yield of 0.315 ± 0.01 g/g, while the entire glucose-rich saccharified fraction was fermented to ethanol with a high productivity of 0.9 ± 0.08 g/L/h. A detailed insight into its genome illustrated the strain's complete set of genes associated with sugar transport and metabolism for high-temperature fermentation. A set of flocculation proteins was identified that aided high cell recovery in successive fermentation cycles to achieve alcohols with high productivity. We have brought biomass-derived sugars, yeast cell biomass generation, and ethanol and xylitol fermentation onto one platform and validated the overall material balance. From 2 kg of sugarcane bagasse, 193.4 g of yeast cells were obtained, and repeated cell recycling generated 125.56 g xylitol and 289.2 g ethanol (366 mL). Copyright © 2017 Elsevier GmbH. All rights reserved.
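The overall material balance quoted at the end of this abstract can be re-expressed per kilogram of bagasse and checked against the stated ethanol volume. The masses come from the abstract; the ethanol density of about 0.789 g/mL is an assumption introduced here (a commonly quoted value near room temperature), not a figure from the source.

```python
bagasse_g = 2000.0
products_g = {"yeast cells": 193.4, "xylitol": 125.56, "ethanol": 289.2}

# Assumption: approximate density of pure ethanol near 20 degrees Celsius.
ETHANOL_DENSITY_G_PER_ML = 0.789

# Product obtained per kilogram of bagasse processed.
yield_g_per_kg = {
    name: mass / (bagasse_g / 1000.0) for name, mass in products_g.items()
}

# Volume implied by the reported ethanol mass.
ethanol_ml = products_g["ethanol"] / ETHANOL_DENSITY_G_PER_ML

# Share of the starting bagasse mass recovered as the three listed products.
total_fraction = sum(products_g.values()) / bagasse_g

for name, value in yield_g_per_kg.items():
    print(f"{name}: {value:.2f} g per kg bagasse")
print(f"ethanol volume: {ethanol_ml:.1f} mL")
print(f"recovered as listed products: {total_fraction:.1%}")
```

With the assumed density, the computed ethanol volume lands within about 1 mL of the 366 mL given in the abstract, so the reported mass and volume are mutually consistent.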
A Tol2 Gateway-Compatible Toolbox for the Study of the Nervous System and Neurodegenerative Disease.

PubMed

Don, Emily K; Formella, Isabel; Badrock, Andrew P; Hall, Thomas E; Morsch, Marco; Hortle, Elinor; Hogan, Alison; Chow, Sharron; Gwee, Serene S L; Stoddart, Jack J; Nicholson, Garth; Chung, Roger; Cole, Nicholas J

2017-02-01

Currently there is a lack of fundamental understanding of disease progression in most neurodegenerative diseases, and, therefore, treatments and preventative measures are limited. Consequently, there is a great need for adaptable, yet robust model systems to both investigate elementary disease mechanisms and discover effective therapeutics. We have generated a Tol2 Gateway-compatible toolbox to study neurodegenerative disorders in zebrafish, which includes promoters for astrocytes, microglia and motor neurons, multiple fluorophores, and compatibility for the introduction of genes of interest or disease-linked genes. This toolbox will advance the rapid and flexible generation of zebrafish models to discover the biology of the nervous system and the disease processes that lead to neurodegeneration.
Flexible CRISPR library construction using parallel oligonucleotide retrieval

PubMed Central

Read, Abigail; Gao, Shaojian; Batchelor, Eric

2017-01-01

Abstract CRISPR/Cas9-based gene knockout libraries have emerged as a powerful tool for functional screens. We present here a set of pre-designed human and mouse sgRNA sequences that are optimized for both high on-target potency and low off-target effect. To maximize the chance of target gene inactivation, sgRNAs were curated to target both 5′ constitutive exons and exons that encode conserved protein domains. We describe here a robust and cost-effective method to construct multiple small-sized CRISPR libraries from a single oligo pool generated by array synthesis using parallel oligonucleotide retrieval. Together, these resources provide a convenient means for individual labs to generate customized CRISPR libraries of variable size and coverage depth for functional genomics applications. PMID:28334828
Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks.

PubMed

Deeter, Anthony; Dalman, Mark; Haddad, Joseph; Duan, Zhong-Hui

2017-01-01

The PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval as well as curated gene expression profiles. Genetic interaction databases, including Reactome and Ingenuity Pathway Analysis, offer pathway and experiment data analysis using data curated from these publications and data repositories. We have created a method to generate and analyze consensus networks, inferring potential gene interactions, using large numbers of Bayesian networks generated by data mining publications in the PubMed database. Through the concept of network resolution, these consensus networks can be tailored to represent possible genetic interactions. We designed a set of experiments to confirm that our method is stable across variation in both sample and topological input sizes. Using gene product interactions from the KEGG pathway database and data mining PubMed publication abstracts, we verify that regardless of the network resolution or the inferred consensus network, our method is capable of inferring meaningful gene interactions through consensus Bayesian network generation with multiple, randomized topological orderings. Our method can not only confirm the existence of currently accepted interactions, but has the potential to hypothesize new ones as well. We show that our method confirms the existence of known gene interactions such as JAK-STAT-PI3K-AKT-mTOR, infers novel gene interactions such as RAS-Bcl-2 and RAS-AKT, and finds significant pathway-pathway interactions between the JAK-STAT signaling and Cardiac Muscle Contraction KEGG pathways.
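The consensus step described in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define "network resolution", so the sketch assumes it is the minimum fraction of learned networks in which an edge must appear. The function name `consensus_edges` and the placeholder node names are hypothetical.

```python
from collections import Counter

def consensus_edges(networks, resolution):
    """Keep edges that occur in at least `resolution` (a fraction) of the networks.

    Assumption: `resolution` is a frequency cutoff; the abstract does not
    specify how network resolution is actually computed.
    """
    counts = Counter()
    for edges in networks:
        counts.update(set(edges))  # count each edge at most once per network
    cutoff = resolution * len(networks)
    return {edge for edge, n in counts.items() if n >= cutoff}

# Three toy networks with placeholder node names (not real data),
# each given as a list of directed (parent, child) edges.
networks = [
    [("G1", "G2"), ("G3", "G4")],
    [("G1", "G2"), ("G5", "G4")],
    [("G1", "G2"), ("G3", "G4"), ("G5", "G4")],
]
strict = consensus_edges(networks, 1.0)  # edge must appear in every network
loose = consensus_edges(networks, 0.6)   # edge must appear in at least 60%
```

Raising the cutoff keeps only edges that nearly every network agrees on; lowering it admits more speculative edges, which is one plausible reading of tailoring a consensus network.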
A cluster merging method for time series microarray with production values.

PubMed

Chira, Camelia; Sedano, Javier; Camara, Monica; Prieto, Carlos; Villar, Jose R; Corchado, Emilio

2014-09-01

A challenging task in time-course microarray data analysis is to cluster genes meaningfully combining the information provided by multiple replicates covering the same key time points. This paper proposes a novel cluster merging method to accomplish this goal obtaining groups with highly correlated genes. The main idea behind the proposed method is to generate a clustering starting from groups created based on individual temporal series (representing different biological replicates measured in the same time points) and merging them by taking into account the frequency by which two genes are assembled together in each clustering. The gene groups at the level of individual time series are generated using several shape-based clustering methods. This study is focused on a real-world time series microarray task with the aim to find co-expressed genes related to the production and growth of a certain bacteria. The shape-based clustering methods used at the level of individual time series rely on identifying similar gene expression patterns over time which, in some models, are further matched to the pattern of production/growth. The proposed cluster merging method is able to produce meaningful gene groups which can be naturally ranked by the level of agreement on the clustering among individual time series. The list of clusters and genes is further sorted based on the information correlation coefficient and new problem-specific relevant measures. Computational experiments and results of the cluster merging method are analyzed from a biological perspective and further compared with the clustering generated based on the mean value of time series and the same shape-based algorithm.
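The merging idea in the abstract above (combine per-replicate clusterings according to how often two genes are grouped together) can be sketched as follows. This is an illustrative co-association approach under stated assumptions, not the published algorithm: the agreement threshold, the union-find merge, and the name `merge_clusterings` are all hypothetical choices.

```python
from itertools import combinations

def merge_clusterings(clusterings, min_agreement):
    """Group genes that share a cluster in at least `min_agreement` of the clusterings.

    Each clustering is a dict mapping gene name -> cluster label, and all
    clusterings are assumed to cover the same genes.
    """
    genes = sorted(clusterings[0])
    parent = {g: g for g in genes}

    def find(g):
        # Union-find root lookup with path halving.
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    for a, b in combinations(genes, 2):
        together = sum(1 for c in clusterings if c[a] == c[b])
        if together / len(clusterings) >= min_agreement:
            parent[find(a)] = find(b)  # merge the two groups

    groups = {}
    for g in genes:
        groups.setdefault(find(g), []).append(g)
    return sorted(groups.values())

# Toy clusterings from three hypothetical replicates (labels are arbitrary).
replicates = [
    {"g1": 0, "g2": 0, "g3": 1, "g4": 1},
    {"g1": 0, "g2": 0, "g3": 0, "g4": 1},
    {"g1": 1, "g2": 1, "g3": 0, "g4": 0},
]
unanimous = merge_clusterings(replicates, min_agreement=1.0)
majority = merge_clusterings(replicates, min_agreement=0.6)
```

Because cluster labels are compared only within a replicate, the labels need not match across replicates. The agreement fraction also gives a natural way to rank the resulting groups.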

Rapid Assembly of Customized TALENs into Multiple Delivery Systems

PubMed Central

Zhang, Zhengxing; Zhang, Siliang; Huang, Xin; Orwig, Kyle E.; Sheng, Yi

2013-01-01

Transcriptional activator-like effector nucleases (TALENs) have become a powerful tool for genome editing. Here we present an efficient TALEN assembly approach in which TALENs are assembled by direct Golden Gate ligation into Gateway® Entry vectors from a repeat variable di-residue (RVD) plasmid array. We constructed TALEN pairs targeted to mouse Ddx3 subfamily genes, and demonstrated that our modified TALEN assembly approach efficiently generates accurate TALEN moieties that effectively introduce mutations into target genes. We generated “user friendly” TALEN Entry vectors containing TALEN expression cassettes with fluorescent reporter genes that can be efficiently transferred via Gateway (LR) recombination into different delivery systems. We demonstrated that the TALEN Entry vectors can be easily transferred to an adenoviral delivery system to expand application to cells that are difficult to transfect. Since TALENs work in pairs, we also generated a TALEN Entry vector set that combines a TALEN pair into one PiggyBac transposon-based destination vector. The approach described here can also be modified for construction of TALE transcriptional activators, repressors or other functional domains. PMID:24244669
A novel sgRNA selection system for CRISPR-Cas9 in mammalian cells.

PubMed

Zhang, Haiwei; Zhang, Xixi; Fan, Cunxian; Xie, Qun; Xu, Chengxian; Zhao, Qun; Liu, Yongbo; Wu, Xiaoxia; Zhang, Haibing

2016-03-18

The CRISPR-Cas9 mediated genome editing system has been developed as a powerful tool for elucidating the function of genes through genetic engineering in multiple cells and organisms. This system takes advantage of a single guide RNA (sgRNA) to direct the Cas9 endonuclease to a specific DNA site to generate mutant alleles. Since the targeting efficiency of sgRNAs to distinct DNA loci can vary widely, there remains a need for a rapid, simple and efficient sgRNA selection method to overcome this limitation of the CRISPR-Cas9 system. Here we report a novel system to select sgRNAs with high efficacy for DNA sequence modification by a luciferase assay. Using this sgRNA selection system, we further demonstrated successful examples of using a single sgRNA to generate single-gene knockout cell lines in which the targeted genes are shown to be functionally defective. This system provides a potential application to optimize the sgRNAs in different species and to generate a powerful CRISPR-Cas9 genome-wide screening system with minimum amounts of sgRNAs. Copyright © 2016 Elsevier Inc. All rights reserved.
The democratization of the oncogene.

PubMed

Le, Anh T; Doebele, Robert C

2014-08-01

The identification of novel, oncogenic gene rearrangements in inflammatory myofibroblastic tumor demonstrates the potential of next-generation sequencing (NGS) platforms for the detection of therapeutically relevant oncogenes across multiple tumor types, but raises significant questions relating to the investigation of targeted therapies in this new era of widespread NGS testing. ©2014 American Association for Cancer Research.
Modification of Hematopoietic Stem/Progenitor Cells with CD19-Specific Chimeric Antigen Receptors as a Novel Approach for Cancer Immunotherapy

PubMed Central

Ryan, Christine; Giannoni, Francesca; Hardee, Cinnamon L.; Tremcinska, Irena; Katebian, Behrod; Wherley, Jennifer; Sahaghian, Arineh; Tu, Andy; Grogan, Tristan; Elashoff, David; Cooper, Laurence J.N.; Hollis, Roger P.; Kohn, Donald B.

2013-01-01

Abstract Chimeric antigen receptors (CARs) against CD19 have been shown to direct T-cells to specifically target B-lineage malignant cells in animal models and clinical trials, with efficient tumor cell lysis. However, in some cases, there has been insufficient persistence of effector cells, limiting clinical efficacy. We propose gene transfer to hematopoietic stem/progenitor cells (HSPC) as a novel approach to deliver the CD19-specific CAR, with potential for ensuring persistent production of effector cells of multiple lineages targeting B-lineage malignant cells. Assessments were performed using in vitro myeloid or natural killer (NK) cell differentiation of human HSPCs transduced with lentiviral vectors carrying first and second generations of CD19-specific CAR. Gene transfer did not impair hematopoietic differentiation and cell proliferation when transduced at 1–2 copies/cell. CAR-bearing myeloid and NK cells specifically lysed CD19-positive cells, with second-generation CAR including CD28 domains being more efficient in NK cells. Our results provide evidence for the feasibility and efficacy of the modification of HSPC with CAR as a strategy for generating multiple lineages of effector cells for immunotherapy against B-lineage malignancies to augment graft-versus-leukemia activity. PMID:23978226
ePlant: Visualizing and Exploring Multiple Levels of Data for Hypothesis Generation in Plant Biology

PubMed Central

Waese, Jamie; Fan, Jim; Yu, Hans; Fucile, Geoffrey; Shi, Ruian; Cumming, Matthew; Town, Chris; Stuerzlinger, Wolfgang

2017-01-01

A big challenge in current systems biology research arises when different types of data must be accessed from separate sources and visualized using separate tools. The high cognitive load required to navigate such a workflow is detrimental to hypothesis generation. Accordingly, there is a need for a robust research platform that incorporates all data and provides integrated search, analysis, and visualization features through a single portal. Here, we present ePlant (http://bar.utoronto.ca/eplant), a visual analytic tool for exploring multiple levels of Arabidopsis thaliana data through a zoomable user interface. ePlant connects to several publicly available web services to download genome, proteome, interactome, transcriptome, and 3D molecular structure data for one or more genes or gene products of interest. Data are displayed with a set of visualization tools that are presented using a conceptual hierarchy from big to small, and many of the tools combine information from more than one data type. We describe the development of ePlant in this article and present several examples illustrating its integrative features for hypothesis generation. We also describe the process of deploying ePlant as an “app” on Araport. Building on readily available web services, the code for ePlant is freely available for any other biological species research. PMID:28808136
Co-existence of Blau syndrome and NAID? Diagnostic challenges associated with presence of multiple pathogenic variants in NOD2 gene: a case report.

PubMed

Dziedzic, Magdalena; Marjańska, Agata; Bąbol-Pokora, Katarzyna; Urbańczyk, Anna; Grześk, Elżbieta; Młynarski, Wojciech; Kołtan, Sylwia

2017-07-27

Pediatric autoinflammatory diseases are rare and still poorly understood conditions resulting from defective genetic control of the innate immune system, inter alia from anomalies of the NOD2 gene. The product of this gene is the Nod2 protein, which takes part in the maintenance of immune homeostasis. The clinical form of the resultant autoinflammatory condition depends on the NOD2 genotype; usually patients with NOD2 defects present with Blau syndrome, NOD2-associated autoinflammatory disease (NAID) or Crohn's disease. We present the case of a 7-year-old girl with co-existing symptoms of two rare diseases, Blau syndrome and NAID. Overlapping manifestations of the two syndromes raised a significant diagnostic challenge, until a next-generation sequencing (NGS) molecular test identified the presence of three pathogenic variants of the NOD2 gene (P268S, IVS8+158, 1007fs) and established the ultimate diagnosis. The presence of multiple genetic abnormalities resulted in an ambiguous clinical presentation with overlapping symptoms of Blau syndrome and NAID. The final diagnosis of autoinflammatory disease opened new therapeutic possibilities, including the use of biological treatments.
Transcriptomic Identification of ADH1B as a Novel Candidate Gene for Obesity and Insulin Resistance in Human Adipose Tissue in Mexican Americans from the Veterans Administration Genetic Epidemiology Study (VAGES)

PubMed Central

Winnier, Deidre A.; Fourcaudot, Marcel; Norton, Luke; Abdul-Ghani, Muhammad A.; Hu, Shirley L.; Farook, Vidya S.; Coletta, Dawn K.; Kumar, Satish; Puppala, Sobha; Chittoor, Geetha; Dyer, Thomas D.; Arya, Rector; Carless, Melanie; Lehman, Donna M.; Curran, Joanne E.; Cromack, Douglas T.; Tripathy, Devjit; Blangero, John; Duggirala, Ravindranath; Göring, Harald H. H.; DeFronzo, Ralph A.; Jenkinson, Christopher P.

2015-01-01

Type 2 diabetes (T2D) is a complex metabolic disease that is more prevalent in ethnic groups such as Mexican Americans, and is strongly associated with the risk factors obesity and insulin resistance. The goal of this study was to perform whole genome gene expression profiling in adipose tissue to detect common patterns of gene regulation associated with obesity and insulin resistance. We used phenotypic and genotypic data from 308 Mexican American participants from the Veterans Administration Genetic Epidemiology Study (VAGES). Basal fasting RNA was extracted from adipose tissue biopsies from a subset of 75 unrelated individuals, and gene expression data generated on the Illumina BeadArray platform. The number of gene probes with significant expression above baseline was approximately 31,000. We performed multiple regression analysis of all probes with 15 metabolic traits. Adipose tissue had 3,012 genes significantly associated with the traits of interest (false discovery rate, FDR ≤ 0.05). The significance of gene expression changes was used to select 52 genes with significant (FDR ≤ 10⁻⁴) gene expression changes across multiple traits. Gene sets/Pathways analysis identified one gene, alcohol dehydrogenase 1B (ADH1B) that was significantly enriched (P < 10⁻⁶⁰) as a prime candidate for involvement in multiple relevant metabolic pathways. Illumina BeadChip derived ADH1B expression data was consistent with quantitative real time PCR data. We observed significant inverse correlations with waist circumference (2.8 × 10⁻⁹), BMI (5.4 × 10⁻⁶), and fasting plasma insulin (P < 0.001). These findings are consistent with a central role for ADH1B in obesity and insulin resistance and provide evidence for a novel genetic regulatory mechanism for human metabolic diseases related to these traits. PMID:25830378
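The abstract above reports results at a false discovery rate threshold (FDR ≤ 0.05) but does not name the procedure used. As an illustration only, the sketch below assumes the widely used Benjamini-Hochberg step-up procedure. The function name and the p-values are made up for the example and are not taken from the study.

```python
def benjamini_hochberg(pvalues, alpha=0.05):
    """Return one boolean per test: True if significant at FDR <= alpha.

    Benjamini-Hochberg step-up: sort the p-values, find the largest rank k
    with p_(k) <= (k / m) * alpha, and reject the k smallest p-values.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    max_rank = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= rank / m * alpha:
            max_rank = rank
    significant = [False] * m
    for idx in order[:max_rank]:
        significant[idx] = True
    return significant

# Made-up p-values for five hypothetical probes.
pvals = [0.001, 0.008, 0.039, 0.041, 0.20]
flags = benjamini_hochberg(pvals)
```

With these five values, only the two smallest p-values fall under their rank-scaled thresholds (0.01 and 0.02), so only those two tests are flagged.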
An Improved Single-Step Cloning Strategy Simplifies the Agrobacterium tumefaciens-Mediated Transformation (ATMT)-Based Gene-Disruption Method for Verticillium dahliae.

PubMed

Wang, Sheng; Xing, Haiying; Hua, Chenlei; Guo, Hui-Shan; Zhang, Jie

2016-06-01

The soilborne fungal pathogen Verticillium dahliae infects a broad range of plant species to cause severe diseases. The availability of Verticillium genome sequences has provided opportunities for large-scale investigations of individual gene function in Verticillium strains using Agrobacterium tumefaciens-mediated transformation (ATMT)-based gene-disruption strategies. Traditional ATMT vectors require multiple cloning steps and elaborate characterization procedures to achieve successful gene replacement; thus, these vectors are not suitable for high-throughput ATMT-based gene deletion. Several advancements have been made that either involve simplification of the steps required for gene-deletion vector construction or increase the efficiency of the technique for rapid recombinant characterization. However, an ATMT binary vector that is both simple and efficient is still lacking. Here, we generated a USER-ATMT dual-selection (DS) binary vector, which combines both the advantages of the USER single-step cloning technique and the efficiency of the herpes simplex virus thymidine kinase negative-selection marker. Highly efficient deletion of three different genes in V. dahliae using the USER-ATMT-DS vector enabled verification that this newly-generated vector not only facilitates the cloning process but also simplifies the subsequent identification of fungal homologous recombinants. The results suggest that the USER-ATMT-DS vector is applicable for efficient gene deletion and suitable for large-scale gene deletion in V. dahliae.
Integrating machine learning techniques into robust data enrichment approach and its application to gene expression data.

PubMed

Erdoğdu, Utku; Tan, Mehmet; Alhajj, Reda; Polat, Faruk; Rokne, Jon; Demetrick, Douglas

2013-01-01

The availability of enough samples for effective analysis and knowledge discovery has been a challenge in the research community, especially in the area of gene expression data analysis. Thus, the approaches being developed for data analysis have mostly suffered from the lack of enough data to train and test the constructed models. We argue that the process of sample generation could be successfully automated by employing some sophisticated machine learning techniques. An automated sample generation framework could successfully complement the actual sample generation from real cases. This argument is validated in this paper by describing a framework that integrates multiple models (perspectives) for sample generation. We illustrate its applicability for producing new gene expression data samples, a highly demanding area that has not received attention. The three perspectives employed in the process are based on models that are not closely related. The independence eliminates the bias of having the produced approach covering only certain characteristics of the domain and leading to samples skewed towards one direction. The first model is based on the Probabilistic Boolean Network (PBN) representation of the gene regulatory network underlying the given gene expression data. The second model integrates Hierarchical Markov Model (HIMM) and the third model employs a genetic algorithm in the process. Each model learns as much as possible characteristics of the domain being analysed and tries to incorporate the learned characteristics in generating new samples. In other words, the models base their analysis on domain knowledge implicitly present in the data itself. The developed framework has been extensively tested by checking how the new samples complement the original samples. The produced results are very promising in showing the effectiveness, usefulness and applicability of the proposed multi-model framework.
Genome-Wide Mutagenesis in Borrelia burgdorferi.

PubMed

Lin, Tao; Gao, Lihui

2018-01-01

Signature-tagged mutagenesis (STM) is a functional genomics approach to identify bacterial virulence determinants and virulence factors by simultaneously screening multiple mutants in a single host animal, and has been utilized extensively for the study of bacterial pathogenesis, host-pathogen interactions, and spirochete and tick biology. Signature-tagged transposon mutagenesis has been developed to investigate virulence determinants and pathogenesis of Borrelia burgdorferi. Mutants in genes important in virulence are identified by negative selection in which the mutants fail to colonize or disseminate in the animal host and tick vector. The STM procedure combined with Luminex® FLEXMAP™ technology and next-generation sequencing (e.g., Tn-seq) provides powerful high-throughput tools for the determination of Borrelia burgdorferi virulence determinants. The assessment of multiple tissue sites and two DNA resources at two different time points using Luminex® FLEXMAP™ technology provides a robust data set. B. burgdorferi transposon mutant screening indicates that a high proportion of genes are novel virulence determinants that are required for mouse and tick infection. In this protocol, an effective signature-tagged Himar1-based transposon suicide vector was developed and used to generate a sequence-defined library of nearly 4800 mutants in the infectious B. burgdorferi B31 clone. In STM, signature-tagged suicide vectors are constructed by inserting unique DNA sequences (tags) into the transposable elements. The signature-tagged transposon mutants are generated when transposon suicide vectors are transformed into an infectious B. burgdorferi clone, and the transposable element is transposed into the 5'-TA-3' sequence in the B. burgdorferi genome with the signature tag. The transposon library consists of many sub-libraries, each containing several hundred mutants with the same tag.
A group of mice or ticks is infected with a mixed population of mutants with different tags. After recovery from different tissues of the infected mice and ticks, mutants from the output pool and the input pool are detected using high-throughput, semi-quantitative Luminex® FLEXMAP™ or next-generation sequencing (Tn-seq) technologies. Thus far, we have created a high-density, sequence-defined transposon library of over 6600 STM mutants for the efficient genome-wide investigation of genes and gene products required for wild-type pathogenesis, host-pathogen interactions, in vitro growth, in vivo survival, physiology, morphology, chemotaxis, motility, structure, metabolism, gene regulation, plasmid maintenance and replication, etc. The insertion sites of 4480 transposon mutants have been determined. About 800 predicted protein-encoding genes in the genome were disrupted in the STM transposon library. The infectivity and some functions of 800 mutants in 500 genes have been determined. Analysis of these transposon mutants has yielded valuable information regarding the genes and gene products important in the pathogenesis and biology of B. burgdorferi and its tick vectors.
Dealing with the incidental finding of secondary variants by the example of SRNS patients undergoing targeted next-generation sequencing.

PubMed

Weber, Stefanie; Büscher, Anja K; Hagmann, Henning; Liebau, Max C; Heberle, Christian; Ludwig, Michael; Rath, Sabine; Alberer, Martin; Beissert, Antje; Zenker, Martin; Hoyer, Peter F; Konrad, Martin; Klein, Hanns-Georg; Hoefele, Julia

2016-01-01

Steroid-resistant nephrotic syndrome (SRNS) is a severe cause of progressive renal disease. Genetic forms of SRNS can present with autosomal recessive or autosomal dominant inheritance. Recent studies have identified mutations in multiple podocyte genes responsible for SRNS. Improved sequencing methods (next-generation sequencing, NGS) now promise rapid mutational testing of SRNS genes. In the present study, a simultaneous screening of ten SRNS genes in 37 SRNS patients was performed by NGS. In 38 % of the patients, causative mutations in one SRNS gene were found. In 22 % of the patients, in addition to these mutations, a secondary variant in a different gene was identified. This high incidence of accumulating sequence variants was unexpected but, although they might have modifier effects, the pathogenic potential of these additional sequence variants seems unclear so far. The example of molecular diagnostics by NGS in SRNS patients shows that these new sequencing technologies might provide further insight into molecular pathogenicity in genetic disorders but will also generate results, which will be difficult to interpret and complicate genetic counseling. Although NGS promises more frequent identification of disease-causing mutations, the identification of causative mutations, the interpretation of incidental findings and possible pitfalls might pose problems, which hopefully will decrease by further experience and elucidation of molecular interactions.
Identification of fever and vaccine-associated gene interaction networks using ontology-based literature mining

PubMed Central

2012-01-01

Background Fever is one of the most common adverse events of vaccines. The detailed mechanisms of fever and vaccine-associated gene interaction networks are not fully understood. In the present study, we employed a genome-wide, Centrality and Ontology-based Network Discovery using Literature data (CONDL) approach to analyse the genes and gene interaction networks associated with fever or vaccine-related fever responses. Results Over 170,000 fever-related articles from PubMed abstracts and titles were retrieved and analysed at the sentence level using natural language processing techniques to identify genes and vaccines (including 186 Vaccine Ontology terms) as well as their interactions. This resulted in a generic fever network consisting of 403 genes and 577 gene interactions. A vaccine-specific fever sub-network consisting of 29 genes and 28 gene interactions was extracted from articles that are related to both fever and vaccines. In addition, gene-vaccine interactions were identified. Vaccines (including 4 specific vaccine names) were found to directly interact with 26 genes. Gene set enrichment analysis was performed using the genes in the generated interaction networks. Moreover, the genes in these networks were prioritized using network centrality metrics. Making scientific discoveries and generating new hypotheses were possible by using network centrality and gene set enrichment analyses. For example, our study found that the genes in the generic fever network were more enriched in cell death and responses to wounding, and the vaccine sub-network had more gene enrichment in leukocyte activation and phosphorylation regulation. The most central genes in the vaccine-specific fever network are predicted to be highly relevant to vaccine-induced fever, whereas genes that are central only in the generic fever network are likely to be highly relevant to generic fever responses. 
Interestingly, no Toll-like receptors (TLRs) were found in the gene-vaccine interaction network. Since multiple TLRs were found in the generic fever network, it is reasonable to hypothesize that vaccine-TLR interactions may play an important role in inducing fever response, which deserves a further investigation. Conclusions This study demonstrated that ontology-based literature mining is a powerful method for analyzing gene interaction networks and generating new scientific hypotheses. PMID:23256563
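The abstract above prioritizes genes with "network centrality metrics" without saying which ones. As a hedged illustration, the sketch below computes normalized degree centrality, one of the simplest such metrics, on a toy undirected network. The function name, node names, and edges are hypothetical and are not data from the study.

```python
from collections import defaultdict

def degree_centrality(edges):
    """Normalized degree centrality for an undirected network.

    Each node's score is its number of distinct neighbors divided by (n - 1),
    where n is the number of nodes. Assumes at least two nodes and no self-loops.
    """
    neighbors = defaultdict(set)
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)
    n = len(neighbors)
    return {node: len(adj) / (n - 1) for node, adj in neighbors.items()}

# Toy interaction network with placeholder node names.
edges = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C")]
scores = degree_centrality(edges)
# Rank nodes from most to least central; ties are broken alphabetically.
ranked = sorted(scores, key=lambda g: (-scores[g], g))
```

Other metrics, such as betweenness or eigenvector centrality, would rank nodes differently. Degree is used here only because it needs no external library.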
gsSKAT: Rapid gene set analysis and multiple testing correction for rare-variant association studies using weighted linear kernels.

PubMed

Larson, Nicholas B; McDonnell, Shannon; Cannon Albright, Lisa; Teerlink, Craig; Stanford, Janet; Ostrander, Elaine A; Isaacs, William B; Xu, Jianfeng; Cooney, Kathleen A; Lange, Ethan; Schleutker, Johanna; Carpten, John D; Powell, Isaac; Bailey-Wilson, Joan E; Cussenot, Olivier; Cancel-Tassin, Geraldine; Giles, Graham G; MacInnis, Robert J; Maier, Christiane; Whittemore, Alice S; Hsieh, Chih-Lin; Wiklund, Fredrik; Catalona, William J; Foulkes, William; Mandal, Diptasri; Eeles, Rosalind; Kote-Jarai, Zsofia; Ackerman, Michael J; Olson, Timothy M; Klein, Christopher J; Thibodeau, Stephen N; Schaid, Daniel J

2017-05-01

Next-generation sequencing technologies have afforded unprecedented characterization of low-frequency and rare genetic variation. Due to low power for single-variant testing, aggregative methods are commonly used to combine observed rare variation within a single gene. Causal variation may also aggregate across multiple genes within relevant biomolecular pathways. Kernel-machine regression and adaptive testing methods for aggregative rare-variant association testing have been demonstrated to be powerful approaches for pathway-level analysis, although these methods tend to be computationally intensive at high-variant dimensionality and require access to complete data. An additional analytical issue in scans of large pathway definition sets is multiple testing correction. Gene set definitions may exhibit substantial genic overlap, and the impact of the resultant correlation in test statistics on Type I error rate control for large agnostic gene set scans has not been fully explored. Herein, we first outline a statistical strategy for aggregative rare-variant analysis using component gene-level linear kernel score test summary statistics as well as derive simple estimators of the effective number of tests for family-wise error rate control. We then conduct extensive simulation studies to characterize the behavior of our approach relative to direct application of kernel and adaptive methods under a variety of conditions. We also apply our method to two case-control studies, respectively, evaluating rare variation in hereditary prostate cancer and schizophrenia. Finally, we provide open-source R code for public use to facilitate easy application of our methods to existing rare-variant analysis results. © 2017 WILEY PERIODICALS, INC.
Calcisponges have a ParaHox gene and dynamic expression of dispersed NK homeobox genes.

PubMed

Fortunato, Sofia A V; Adamski, Marcin; Ramos, Olivia Mendivil; Leininger, Sven; Liu, Jing; Ferrier, David E K; Adamska, Maja

2014-10-30

Sponges are simple animals with few cell types, but their genomes paradoxically contain a wide variety of developmental transcription factors, including homeobox genes belonging to the Antennapedia (ANTP) class, which in bilaterians encompass Hox, ParaHox and NK genes. In the genome of the demosponge Amphimedon queenslandica, no Hox or ParaHox genes are present, but NK genes are linked in a tight cluster similar to the NK clusters of bilaterians. It has been proposed that Hox and ParaHox genes originated from NK cluster genes after divergence of sponges from the lineage leading to cnidarians and bilaterians. On the other hand, synteny analysis lends support to the notion that the absence of Hox and ParaHox genes in Amphimedon is a result of secondary loss (the ghost locus hypothesis). Here we analysed complete suites of ANTP-class homeoboxes in two calcareous sponges, Sycon ciliatum and Leucosolenia complicata. Our phylogenetic analyses demonstrate that these calcisponges possess orthologues of bilaterian NK genes (Hex, Hmx and Msx), a varying number of additional NK genes and one ParaHox gene, Cdx. Despite the generation of scaffolds spanning multiple genes, we find no evidence of clustering of Sycon NK genes. All Sycon ANTP-class genes are developmentally expressed, with patterns suggesting their involvement in cell type specification in embryos and adults, metamorphosis and body plan patterning. These results demonstrate that ParaHox genes predate the origin of sponges, thus confirming the ghost locus hypothesis, and highlight the need to analyse the genomes of multiple sponge lineages to obtain a complete picture of the ancestral composition of the first animal genome.
Environmental "Omics" of International Space Station: Insights, Significance, and Consequences

NASA Astrophysics Data System (ADS)

Venkateswaran, Kasthuri

2016-07-01

The NASA Space Biology program funded two multi-year studies to catalogue the International Space Station (ISS) environmental microbiome. The first Microbial Observatory (MO) experiment will generate a microbial census of the ISS surfaces and atmosphere using advanced molecular microbial community analysis "omics" techniques, supported by traditional culture-based methods and state-of-the-art molecular techniques. The second MO experiment will measure the presence of viral and select bacterial and fungal pathogens on ISS surfaces and correlate their presence on crew. The "omics" methodologies of the MO experiments will serve as the foundation for an extensive microbial census, offering significant insight into spaceflight-induced changes in the populations of beneficial and potentially harmful microbes. The safety of crewmembers and the maintenance of hardware are the primary goals for monitoring microorganisms in this closed habitat. The statistical analysis of the ISS microbiomes showed that three bacterial phyla dominated both in ISS and Earth cleanrooms, but varied in their abundances. While members of Actinobacteria were predominant on ISS, Proteobacteria dominated the Earth cleanrooms. Alpha diversity estimators indicated a significant drop in viable microbial diversity. To better characterize the shared community composition among samples, beta-diversity metric analyses were conducted. At the bacterial species level, the microbial community composition is strongly associated with sampling site. Results of the study indicate significant differences between ISS and Earth cleanroom microbiomes in terms of community structure and composition. Bacterial strains isolated from ISS surfaces were also tested for their resistance to nine antibiotics using the conventional disc method and the Vitek 2 system. Most of the Staphylococcus aureus strains were resistant to penicillin. Five strains were specifically resistant to erythromycin and the ermA gene was also detected. The nine erythromycin-sensitive S. aureus strains exhibited spontaneous mutation when rifampin was tested. Some of the S. aureus strains tolerated gentamicin and tobramycin, but cefazolin, cefoxitin, ciprofloxacin and oxacillin inhibited the growth of S. aureus. Whole genome sequencing (WGS) of 21 ISS strains, exhibiting resistance to various antibiotics, was carried out. The antibiotic resistance genes deduced from the WGS were compared with the resistomes generated directly from the gene pool of the environmental samples. Using a targeted amplification panel consisting of over 500 antimicrobial resistance genes, we were able to confirm the results of the phenotypic assays. Specifically, the presence of multiple β-lactamase genes was observed. The class A β-lactamase genes, tem-1 (ampicillin-resistance) and ctx-M-14 (cefotaxime resistance-conferring gene), were found at multiple sites on the ISS. In addition, the presence of the mecA gene (penicillin clusters) was confirmed in several sampling locations from both ISS flights. Finally, the existence of the ermA gene (erythromycin) was established. These results suggest widespread and consistent distribution of multiple antibiotic resistance genes throughout the ISS. The resistome data generated via molecular methods will be extremely important in determining the microbial significance to the crew health and the ISS maintenance. These data sets will be placed in the NASA GeneLab bioinformatics environment - consisting of a database, computational tools, and improved methods - that would subsequently be made open to the scientific research community to encourage innovation.
Analysis, Characterization, and Loci of the tuf Genes in Lactobacillus and Bifidobacterium Species and Their Direct Application for Species Identification

PubMed Central

Ventura, Marco; Canchaya, Carlos; Meylan, Valèrie; Klaenhammer, Todd R.; Zink, Ralf

2003-01-01

We analyzed the tuf gene, encoding elongation factor Tu, from 33 strains representing 17 Lactobacillus species and 8 Bifidobacterium species. The tuf sequences were aligned and used to infer phylogenesis among species of lactobacilli and bifidobacteria. We demonstrated that the synonymous substitution affecting this gene renders elongation factor Tu a reliable molecular clock for investigating evolutionary distances of lactobacilli and bifidobacteria. In fact, the phylogeny generated by these tuf sequences is consistent with that derived from 16S rRNA analysis. The investigation of a multiple alignment of tuf sequences revealed regions conserved among strains belonging to the same species but distinct from those of other species. PCR primers complementary to these regions allowed species-specific identification of closely related species, such as Lactobacillus casei group members. The tuf gene-based assays developed in this study provide an alternative to present methods for the identification of lactic acid bacterial species. Since a variable number of tuf genes have been described for bacteria, the presence of multiple genes was examined. Southern analysis revealed one tuf gene in the genomes of lactobacilli and bifidobacteria, but the tuf gene was arranged differently in the genomes of these two taxa. Our results revealed that the tuf gene in bifidobacteria is flanked by the same gene constellation as the str operon, as originally reported for Escherichia coli. In contrast, bioinformatic and transcriptional analyses of the DNA region flanking the tuf gene in four Lactobacillus species indicated the same four-gene unit and suggested a novel tuf operon specific for the genus Lactobacillus. PMID:14602655
Growth factor transgenes interactively regulate articular chondrocytes.

PubMed

Shi, Shuiliang; Mercer, Scott; Eckert, George J; Trippel, Stephen B

2013-04-01

Adult articular chondrocytes lack an effective repair response to correct damage from injury or osteoarthritis. Polypeptide growth factors that stimulate articular chondrocyte proliferation and cartilage matrix synthesis may augment this response. Gene transfer is a promising approach to delivering such factors. Multiple growth factor genes regulate these cell functions, but multiple growth factor gene transfer remains unexplored. We tested the hypothesis that multiple growth factor gene transfer selectively modulates articular chondrocyte proliferation and matrix synthesis. We tested the hypothesis by delivering combinations of the transgenes encoding insulin-like growth factor I (IGF-I), fibroblast growth factor-2 (FGF-2), transforming growth factor beta1 (TGF-β1), bone morphogenetic protein-2 (BMP-2), and bone morphogenetic protein-7 (BMP-7) to articular chondrocytes and measured changes in the production of DNA, glycosaminoglycan, and collagen. The transgenes differentially regulated all these chondrocyte activities. In concert, the transgenes interacted to generate widely divergent responses from the cells. These interactions ranged from inhibitory to synergistic. The transgene pair encoding IGF-I and FGF-2 maximized cell proliferation. The three-transgene group encoding IGF-I, BMP-2, and BMP-7 maximized matrix production and also optimized the balance between cell proliferation and matrix production. These data demonstrate an approach to articular chondrocyte regulation that may be tailored to stimulate specific cell functions, and suggest that certain growth factor gene combinations have potential value for cell-based articular cartilage repair. Copyright © 2012 Wiley Periodicals, Inc.
Improved Statistical Methods Enable Greater Sensitivity in Rhythm Detection for Genome-Wide Data

PubMed Central

Hutchison, Alan L.; Maienschein-Cline, Mark; Chiang, Andrew H.; Tabei, S. M. Ali; Gudjonson, Herman; Bahroos, Neil; Allada, Ravi; Dinner, Aaron R.

2015-01-01

Robust methods for identifying patterns of expression in genome-wide data are important for generating hypotheses regarding gene function. To this end, several analytic methods have been developed for detecting periodic patterns. We improve one such method, JTK_CYCLE, by explicitly calculating the null distribution such that it accounts for multiple hypothesis testing and by including non-sinusoidal reference waveforms. We term this method empirical JTK_CYCLE with asymmetry search, and we compare its performance to JTK_CYCLE with Bonferroni and Benjamini-Hochberg multiple hypothesis testing correction, as well as to five other methods: cyclohedron test, address reduction, stable persistence, ANOVA, and F24. We find that ANOVA, F24, and JTK_CYCLE consistently outperform the other three methods when data are limited and noisy; empirical JTK_CYCLE with asymmetry search gives the greatest sensitivity while controlling for the false discovery rate. Our analysis also provides insight into experimental design and we find that, for a fixed number of samples, better sensitivity and specificity are achieved with higher numbers of replicates than with higher sampling density. Application of the methods to detecting circadian rhythms in a metadataset of microarrays that quantify time-dependent gene expression in whole heads of Drosophila melanogaster reveals annotations that are enriched among genes with highly asymmetric waveforms. These include a wide range of oxidation reduction and metabolic genes, as well as genes with transcripts that have multiple splice forms. PMID:25793520
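The two multiple-testing corrections named in the abstract, Bonferroni and Benjamini-Hochberg, can be sketched in a few lines. This is a generic illustration of the standard procedures, not code from JTK_CYCLE or its empirical variant; the function names and the example p-values are assumptions.

```python
def bonferroni(pvalues):
    """Bonferroni-adjusted p-values: multiply by the number of tests, cap at 1."""
    n = len(pvalues)
    return [min(1.0, p * n) for p in pvalues]


def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values (step-up false discovery rate control)."""
    n = len(pvalues)
    # Indices of the p-values from smallest to largest.
    order = sorted(range(n), key=lambda i: pvalues[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(n, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * n / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical per-gene rhythmicity p-values.
example = [0.01, 0.04, 0.03, 0.005]
print(bonferroni(example))
print(benjamini_hochberg(example))
```

Bonferroni controls the family-wise error rate and is the more conservative of the two; Benjamini-Hochberg controls the expected proportion of false discoveries, which is usually the more practical target in genome-wide screens.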
DTWscore: differential expression and cell clustering analysis for time-series single-cell RNA-seq data.

PubMed

Wang, Zhuo; Jin, Shuilin; Liu, Guiyou; Zhang, Xiurui; Wang, Nan; Wu, Deliang; Hu, Yang; Zhang, Chiping; Jiang, Qinghua; Xu, Li; Wang, Yadong

2017-05-23

The development of single-cell RNA sequencing has enabled profound discoveries in biology, ranging from the dissection of the composition of complex tissues to the identification of novel cell types and dynamics in some specialized cellular environments. However, the large-scale generation of single-cell RNA-seq (scRNA-seq) data collected at multiple time points remains a challenge to the effective measurement of gene expression patterns in transcriptome analysis. We present an algorithm based on the Dynamic Time Warping score (DTWscore) combined with time-series data, that enables the detection of gene expression changes across scRNA-seq samples and recovery of potential cell types from complex mixtures of multiple cell types. The DTWscore successfully classifies cells of different types with the most highly variable genes from time-series scRNA-seq data. The study was confined to methods that are implemented and available within the R framework. Sample datasets and R packages are available at https://github.com/xiaoxiaoxier/DTWscore .
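For readers unfamiliar with dynamic time warping, the sketch below shows the classic dynamic-programming distance between two expression time series. It illustrates plain DTW only; how the DTWscore package turns such distances into a per-gene score is not described in the abstract, and the function name and toy series here are assumptions.

```python
def dtw_distance(a, b):
    """Classic dynamic time warping distance with absolute difference as local cost."""
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        raise ValueError("both series must be non-empty")
    inf = float("inf")
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances while b stays on the same point
                cost[i][j - 1],      # b advances while a stays on the same point
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


# Hypothetical expression profiles of one gene in two cells over time.
print(dtw_distance([1.0, 2.0, 3.0], [1.0, 2.0, 2.0, 3.0]))
```

Because the alignment may stretch or compress the time axis, two profiles with the same shape but different timing score as close, which is the property that makes DTW attractive for time-series expression data.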
DOE Office of Scientific and Technical Information (OSTI.GOV)

Pirastu, M.; Galanello, R.; Doherty, M.A.

The predominant β-thalassemia in Sardinia is the β⁰ type in which no β-globin chains are synthesized in the homozygous state. The authors determined the β-thalassemia mutations in this population by the oligonucleotide-probe method and defined the chromosome haplotypes on which the mutation resides. The same β³⁹ (CAG→TAG) nonsense mutation was found on nine different chromosome haplotypes. Although this mutation may have arisen more than once, the multiple haplotypes could also be generated by crossing over and gene conversion events. These findings underscore the frequency of mutational events in the β-globin gene region.

A novel sodium bicarbonate cotransporter-like gene in an ancient duplicated region: SLC4A9 at 5q31

PubMed Central

Lipovich, Leonard; Lynch, Eric D; Lee, Ming K; King, Mary-Claire

2001-01-01

Background: Sodium bicarbonate cotransporter (NBC) genes encode proteins that execute coupled Na+ and HCO3- transport across epithelial cell membranes. We report the discovery, characterization, and genomic context of a novel human NBC-like gene, SLC4A9, on chromosome 5q31. Results: SLC4A9 was initially discovered by genomic sequence annotation and further characterized by sequencing of long-insert cDNA library clones. The predicted protein of 990 amino acids has 12 transmembrane domains and high sequence similarity to other NBCs. The 23-exon gene has 14 known mRNA isoforms. In three regions, mRNA sequence variation is generated by the inclusion or exclusion of portions of an exon. Noncoding SLC4A9 cDNAs were recovered multiple times from different libraries. The 3' untranslated region is fragmented into six alternatively spliced exons and contains expressed Alu, LINE and MER repeats. SLC4A9 has two alternative stop codons and six polyadenylation sites. Its expression is largely restricted to the kidney. In silico approaches were used to characterize two additional novel SLC4A genes and to place SLC4A9 within the context of multiple paralogous gene clusters containing members of the epidermal growth factor (EGF), ankyrin (ANK) and fibroblast growth factor (FGF) families. Seven human EGF-SLC4A-ANK-FGF clusters were found. Conclusion: The novel sodium bicarbonate cotransporter-like gene SLC4A9 demonstrates abundant alternative mRNA processing. It belongs to a growing class of functionally diverse genes characterized by inefficient highly variable splicing. The evolutionary history of the EGF-SLC4A-ANK-FGF gene clusters involves multiple rounds of duplication, apparently followed by large insertions and deletions at paralogous loci and genome-wide gene shuffling. PMID:11305939
Transgenic elite indica rice plants expressing CryIAc delta-endotoxin of Bacillus thuringiensis are resistant against yellow stem borer (Scirpophaga incertulas).

PubMed

Nayak, P; Basu, D; Das, S; Basu, A; Ghosh, D; Ramakrishnan, N A; Ghosh, M; Sen, S K

1997-03-18

Generation of insect-resistant, transgenic crop plants by expression of the insecticidal crystal protein (ICP) gene of Bacillus thuringiensis (Bt) is a standard crop improvement approach. In such cases, adequate expression of the most appropriate ICP against the target insect pest of the crop species is desirable. It is also considered advantageous to generate Bt-transgenics with multiple toxin systems to control rapid development of pest resistance to the ICP. Larvae of yellow stem borer (YSB), Scirpophaga incertulas, a major lepidopteran insect pest of rice, cause massive losses of rice yield. Studies on insect feeding and on the binding properties of ICP to brush border membrane receptors in the midgut of YSB larvae revealed that cryIAb and cryIAc are two individually suitable candidate genes for developing YSB-resistant rice. Programs were undertaken to develop Bt-transgenic rice with these ICP genes independently in a single cultivar. A cryIAc gene was reconstructed and placed under control of the maize ubiquitin 1 promoter, along with the first intron of the maize ubiquitin 1 gene, and the nos terminator. The gene construct was delivered to embryogenic calli of IR64, an elite indica rice cultivar, using the particle bombardment method. Six highly expressive independent transgenic ICP lines were identified. Molecular analyses and insect-feeding assays of two such lines revealed that the transferred synthetic cryIAc gene was expressed stably in the T2 generation of these lines and that the transgenic rice plants were highly toxic to YSB larvae and lessened the damage caused by their feeding.
Increased expression of a set of genes enriched in oxygen binding function discloses a predisposition of breast cancer bone metastases to generate metastasis spread in multiple organs.

PubMed

Capulli, Mattia; Angelucci, Adriano; Driouch, Keltouma; Garcia, Teresa; Clement-Lacroix, Philippe; Martella, Francesco; Ventura, Luca; Bologna, Mauro; Flamini, Stefano; Moreschini, Oreste; Lidereau, Rosette; Ricevuto, Enrico; Muraca, Maurizio; Teti, Anna; Rucci, Nadia

2012-11-01

Bone is the preferential site of distant metastasis in breast carcinoma (BrCa). Patients with metastasis restricted to bone (BO) usually show a longer overall survival compared to patients who rapidly develop multiple metastases also involving liver and lung. Hence, molecular predisposition to generate bone and visceral metastases (BV) represents a clear indication of poor clinical outcome. We performed microarray analysis with two different chip platforms, Affymetrix and Agilent, on bone metastasis samples from BO and BV patients. The unsupervised hierarchical clustering of the resulting transcriptomes correlated with the clinical progression, segregating the BO from the BV profiles. Matching the twofold significantly regulated genes from Affymetrix and Agilent chips resulted in a 15-gene signature with 13 upregulated and two downregulated genes in BV versus BO bone metastasis samples. In order to validate the resulting signature, we isolated different MDA-MB-231 clonal subpopulations that metastasize only in the bone (MDA-BO) or in bone and visceral tissues (MDA-BV). Six of the signature genes were also significantly upregulated in MDA-BV compared to MDA-BO clones. A group of upregulated genes, including Hemoglobin B (HBB), were involved in oxygen metabolism, and in vitro functional analysis of HBB revealed that its expression in the MDA subpopulations was associated with a reduced production of hydrogen peroxide. Expression of HBB was detected in primary BrCa tissue but not in normal breast epithelial cells. Metastatic lymph nodes were frequently more positive for HBB compared to the corresponding primary tumors, whereas BO metastases had a lower expression than BV metastases, suggesting a positive correlation between HBB and ability of bone metastasis to rapidly spread to other organs. We propose that HBB, along with other genes involved in oxygen metabolism, confers a more aggressive metastatic phenotype in BrCa cells disseminated to bone. Copyright © 2012 American Society for Bone and Mineral Research.
Comparative metagenomic and metatranscriptomic analyses of microbial communities in acid mine drainage.

PubMed

Chen, Lin-xing; Hu, Min; Huang, Li-nan; Hua, Zheng-shuang; Kuang, Jia-liang; Li, Sheng-jin; Shu, Wen-sheng

2015-07-01

The microbial communities in acid mine drainage have been extensively studied to reveal their roles in acid generation and adaptation to this environment. Lacking, however, are integrated community- and organism-wide comparative gene transcriptional analyses that could reveal the response and adaptation mechanisms of these extraordinary microorganisms to different environmental conditions. In this study, comparative metagenomics and metatranscriptomics were performed on microbial assemblages collected from four geochemically distinct acid mine drainage (AMD) sites. Taxonomic analysis uncovered unexpectedly high microbial biodiversity of these extremely acidophilic communities, and the abundant taxa of Acidithiobacillus, Leptospirillum and Acidiphilium exhibited high transcriptional activities. Community-wide comparative analyses clearly showed that the AMD microorganisms adapted to the different environmental conditions via regulating the expression of genes involved in multiple in situ functional activities, including low-pH adaptation, carbon, nitrogen and phosphate assimilation, energy generation, environmental stress resistance, and other functions. Organism-wide comparative analyses of the active taxa revealed environment-dependent gene transcriptional profiles, especially the distinct strategies used by Acidithiobacillus ferrivorans and Leptospirillum ferrodiazotrophum in nutrient assimilation and energy generation for survival under different conditions. Overall, these findings demonstrate that the gene transcriptional profiles of AMD microorganisms are closely related to the site physicochemical characteristics, providing clues into the microbial response and adaptation mechanisms in the oligotrophic, extremely acidic environments.
Integration of Immune Cell Populations, mRNA-Seq, and CpG Methylation to Better Predict Humoral Immunity to Influenza Vaccination: Dependence of mRNA-Seq/CpG Methylation on Immune Cell Populations

PubMed Central

Zimmermann, Michael T.; Kennedy, Richard B.; Grill, Diane E.; Oberg, Ann L.; Goergen, Krista M.; Ovsyannikova, Inna G.; Haralambieva, Iana H.; Poland, Gregory A.

2017-01-01

The development of a humoral immune response to influenza vaccines occurs on a multisystems level. Due to the orchestration required for robust immune responses when multiple genes and their regulatory components across multiple cell types are involved, we examined an influenza vaccination cohort using multiple high-throughput technologies. In this study, we sought a more thorough understanding of how immune cell composition and gene expression relate to each other and contribute to interindividual variation in response to influenza vaccination. We first hypothesized that many of the differentially expressed (DE) genes observed after influenza vaccination result from changes in the composition of participants’ peripheral blood mononuclear cells (PBMCs), which were assessed using flow cytometry. We demonstrated that DE genes in our study are correlated with changes in PBMC composition. We gathered DE genes from 128 other publicly available PBMC-based vaccine studies and identified that an average of 57% correlated with specific cell subset levels in our study (permutation used to control false discovery), suggesting that the associations we have identified are likely general features of PBMC-based transcriptomics. Second, we hypothesized that more robust models of vaccine response could be generated by accounting for the interplay between PBMC composition, gene expression, and gene regulation. We employed machine learning to generate predictive models of B-cell ELISPOT response outcomes and hemagglutination inhibition (HAI) antibody titers. The top HAI and B-cell ELISPOT models achieved areas under the receiver operating characteristic curve (AUC) of 0.64 and 0.79, respectively, with linear model coefficients of determination of 0.08 and 0.28. For the B-cell ELISPOT outcomes, CpG methylation had the greatest predictive ability, highlighting potentially novel regulatory features important for immune response. B-cell ELISPOT models using only PBMC composition had lower performance (AUC = 0.67), but highlighted well-known mechanisms. Our analysis demonstrated that each of the three data sets (cell composition, mRNA-Seq, and DNA methylation) may provide distinct information for the prediction of humoral immune response outcomes. We believe that these findings are important for the interpretation of current omics-based studies and set the stage for a more thorough understanding of interindividual immune responses to influenza vaccination. PMID:28484452
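The AUC values quoted in the abstract can be read as the probability that a randomly chosen responder receives a higher model score than a randomly chosen non-responder. The sketch below computes that quantity directly from pairwise comparisons; it is a generic illustration, and the function name, labels, and scores are hypothetical rather than taken from the study.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney formulation).

    labels: 1 for the positive class (e.g. high responder), 0 for the negative class.
    scores: model outputs, higher meaning more likely positive.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in positives:
        for q in negatives:
            if p > q:
                wins += 1.0   # positive correctly ranked above negative
            elif p == q:
                wins += 0.5   # ties count as half
    return wins / (len(positives) * len(negatives))


# Hypothetical responder labels and predicted scores.
print(roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, which puts the reported values of 0.64 to 0.79 in the range of modest to moderate discrimination.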
Proteasome, transporter associated with antigen processing, and class I genes in the nurse shark Ginglymostoma cirratum: evidence for a stable class I region and MHC haplotype lineages.

PubMed

Ohta, Yuko; McKinney, E Churchill; Criscitiello, Michael F; Flajnik, Martin F

2002-01-15

Cartilaginous fish (e.g., sharks) are derived from the oldest vertebrate ancestor having an adaptive immune system, and thus are key models for examining MHC evolution. Previously, family studies in two shark species showed that classical class I (UAA) and class II genes are genetically linked. In this study, we show that proteasome genes LMP2 and LMP7, shark-specific LMP7-like, and the TAP1/2 genes are linked to class I/II. Functional LMP7 and LMP7-like genes, as well as multiple LMP2 genes or gene fragments, are found only in some sharks, suggesting that different sets of peptides might be generated depending upon inherited MHC haplotypes. Cosmid clones bearing the MHC-linked classical class I genes were isolated and shown to contain proteasome gene fragments. A non-MHC-linked LMP7 gene also was identified on another cosmid, but only two exons of this gene were detected, closely linked to a class I pseudogene (UAA-NC2); this region probably resulted from a recent duplication and translocation from the functional MHC. Tight linkage of proteasome and class I genes, in comparison with gene organizations of other vertebrates, suggests a primordial MHC organization. Another nonclassical class I gene (UAA-NC1) was detected that is linked neither to MHC nor to UAA-NC2; its high level of sequence similarity to UAA suggests that UAA-NC1 also was recently derived from UAA and translocated from MHC. These data further support the principle of a primordial class I region with few class I genes. Finally, multiple paternities in one family were demonstrated, with potential segregation distortions.
PSAT: A web tool to compare genomic neighborhoods of multiple prokaryotic genomes

PubMed Central

Fong, Christine; Rohmer, Laurence; Radey, Matthew; Wasnick, Michael; Brittnacher, Mitchell J

2008-01-01

Background The conservation of gene order among prokaryotic genomes can provide valuable insight into gene function, protein interactions, or events by which genomes have evolved. Although some tools are available for visualizing and comparing the order of genes between genomes of study, few support an efficient and organized analysis between large numbers of genomes. The Prokaryotic Sequence homology Analysis Tool (PSAT) is a web tool for comparing gene neighborhoods among multiple prokaryotic genomes. Results PSAT utilizes a database that is preloaded with gene annotation, BLAST hit results, and gene-clustering scores designed to help identify regions of conserved gene order. Researchers use the PSAT web interface to find a gene of interest in a reference genome and efficiently retrieve the sequence homologs found in other bacterial genomes. The tool generates a graphic of the genomic neighborhood surrounding the selected gene and the corresponding regions for its homologs in each comparison genome. Homologs in each region are color coded to assist users with analyzing gene order among various genomes. In contrast to common comparative analysis methods that filter sequence homolog data based on alignment score cutoffs, PSAT leverages gene context information for homologs, including those with weak alignment scores, enabling a more sensitive analysis. Features for constraining or ordering results are designed to help researchers browse results from large numbers of comparison genomes in an organized manner. PSAT has been demonstrated to be useful for helping to identify gene orthologs and potential functional gene clusters, and detecting genome modifications that may result in loss of function. Conclusion PSAT allows researchers to investigate the order of genes within local genomic neighborhoods of multiple genomes. 
A PSAT web server for public use is available for performing analyses on a growing set of reference genomes through any web browser with no client side software setup or installation required. Source code is freely available to researchers interested in setting up a local version of PSAT for analysis of genomes not available through the public server. Access to the public web server and instructions for obtaining source code can be found at . PMID:18366802
PW1 gene/paternally expressed gene 3 (PW1/Peg3) identifies multiple adult stem and progenitor cell populations

PubMed Central

Besson, Vanessa; Smeriglio, Piera; Wegener, Amélie; Relaix, Frédéric; Nait Oumesmar, Brahim; Sassoon, David A.; Marazzi, Giovanna

2011-01-01

A variety of markers are invaluable for identifying and purifying stem/progenitor cells. Here we report the generation of a murine reporter line driven by Pw1 that reveals cycling and quiescent progenitor/stem cells in all adult tissues thus far examined, including the intestine, blood, testis, central nervous system, bone, skeletal muscle, and skin. Neurospheres generated from the adult PW1-reporter mouse show near 100% reporter-gene expression following a single passage. Furthermore, epidermal stem cells can be purified solely on the basis of reporter-gene expression. These cells are clonogenic, repopulate the epidermal stem-cell niches, and give rise to new hair follicles. Finally, we demonstrate that only PW1 reporter-expressing epidermal cells give rise to follicles that are capable of self-renewal following injury. Our data demonstrate that PW1 serves as an invaluable marker for competent self-renewing stem cells in a wide array of adult tissues, and the PW1-reporter mouse serves as a tool for rapid stem cell isolation and characterization. PMID:21709251
Molecular Diagnosis of Infantile Mitochondrial Disease with Targeted Next-Generation Sequencing

PubMed Central

Calvo, Sarah E.; Compton, Alison G.; Hershman, Steven G.; Lim, Sze Chern; Lieber, Daniel S.; Tucker, Elena J.; Laskowski, Adrienne; Garone, Caterina; Liu, Shangtao; Jaffe, David B.; Christodoulou, John; Fletcher, Janice M.; Bruno, Damien L; Goldblatt, Jack; DiMauro, Salvatore; Thorburn, David R.; Mootha, Vamsi K.

2012-01-01

Advances in next-generation sequencing (NGS) promise to facilitate diagnosis of inherited disorders. While in research settings NGS has pinpointed causal alleles using segregation in large families, the key challenge for clinical diagnosis is application to single individuals. To explore its diagnostic utility, we performed targeted NGS in 42 unrelated infants with clinical and biochemical evidence of mitochondrial oxidative phosphorylation disease, who were refractory to traditional molecular diagnosis. These devastating mitochondrial disorders are characterized by phenotypic and genetic heterogeneity, with over 100 causal genes identified to date. We performed “MitoExome” sequencing of the mitochondrial DNA (mtDNA) and exons of ~1000 nuclear genes encoding mitochondrial proteins and prioritized rare mutations predicted to disrupt function. Since patients and controls harbored a comparable number of such heterozygous alleles, we could not prioritize dominant-acting genes. However, patients showed a five-fold enrichment of genes with two such mutations that could underlie recessive disease. In total, 23/42 (55%) patients harbored such recessive genes or pathogenic mtDNA variants. Firm diagnoses were enabled in 10 patients (24%) who had mutations in genes previously linked to disease. Thirteen patients (31%) had mutations in nuclear genes never linked to disease. The pathogenicity of two such genes, NDUFB3 and AGK, was supported by cDNA complementation and evidence from multiple patients, respectively. The results underscore the immediate potential and challenges of deploying NGS in clinical settings. PMID:22277967
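The recessive-model filter described in the abstract, keeping genes that carry two prioritized mutations in one patient, might be sketched as follows. The input format, the function name, and the placeholder gene labels are assumptions made for illustration; the study's actual pipeline and variant-prioritization criteria are not given here.

```python
from collections import defaultdict


def recessive_candidate_genes(variants):
    """Return genes carrying at least two prioritized alleles in one patient.

    variants: iterable of (gene, zygosity) pairs for rare, predicted-deleterious
    variants, where zygosity is "het" (one allele) or "hom" (two alleles).
    """
    allele_count = defaultdict(int)
    for gene, zygosity in variants:
        if zygosity == "hom":
            allele_count[gene] += 2
        elif zygosity == "het":
            allele_count[gene] += 1
        else:
            raise ValueError(f"unknown zygosity: {zygosity!r}")
    # Two or more alleles: homozygous, or a possible compound heterozygote.
    return sorted(gene for gene, count in allele_count.items() if count >= 2)


# Hypothetical prioritized variants for one patient (placeholder gene names).
patient_variants = [
    ("GENE_A", "het"),
    ("GENE_A", "het"),
    ("GENE_B", "het"),
    ("GENE_C", "hom"),
]
print(recessive_candidate_genes(patient_variants))
```

Two heterozygous variants in the same gene are only a candidate compound heterozygote; without phasing or parental genotypes, the filter cannot tell whether they sit on different copies of the gene.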
Epigenetic Effects of Diet on Fruit Fly Lifespan: An Investigation to Teach Epigenetics to Biology Students

ERIC Educational Resources Information Center

Billingsley, James; Carlson, Kimberly A.

2010-01-01

Do our genes exclusively control us, or are other factors at play? Epigenetics can provide a means for students to use inquiry-based methods to understand a complex biological concept. Students research and design an experiment testing whether dietary supplements affect the lifespan of Drosophila melanogaster over multiple generations.
CRISPR-Cas9-Edited Site Sequencing (CRES-Seq): An Efficient and High-Throughput Method for the Selection of CRISPR-Cas9-Edited Clones.

PubMed

Veeranagouda, Yaligara; Debono-Lagneaux, Delphine; Fournet, Hamida; Thill, Gilbert; Didier, Michel

2018-01-16

The emergence of clustered regularly interspaced short palindromic repeats-Cas9 (CRISPR-Cas9) gene editing systems has enabled the creation of specific mutants at low cost, in a short time and with high efficiency, in eukaryotic cells. Since a CRISPR-Cas9 system typically creates an array of mutations in targeted sites, a successful gene editing project requires careful selection of edited clones. This process can be very challenging, especially when working with multiallelic genes and/or polyploid cells (such as cancer and plant cells). Here we describe a next-generation sequencing method called CRISPR-Cas9 Edited Site Sequencing (CRES-Seq) for the efficient and high-throughput screening of CRISPR-Cas9-edited clones. CRES-Seq facilitates the precise genotyping of up to 96 CRISPR-Cas9-edited sites (CRES) in a single MiniSeq (Illumina) run with an approximate sequencing cost of $6/clone. CRES-Seq is particularly useful when multiple genes are simultaneously targeted by CRISPR-Cas9, and also for screening of clones generated from multiallelic genes/polyploid cells. Copyright © 2018 John Wiley & Sons, Inc.
Performance Comparison of Bench-Top Next Generation Sequencers Using Microdroplet PCR-Based Enrichment for Targeted Sequencing in Patients with Autism Spectrum Disorder

PubMed Central

Okamoto, Nobuhiko; Nakashima, Mitsuko; Tsurusaki, Yoshinori; Miyake, Noriko; Saitsu, Hirotomo; Matsumoto, Naomichi

2013-01-01

Next-generation sequencing (NGS) combined with enrichment of target genes enables highly efficient and low-cost sequencing of multiple genes for genetic diseases. The aim of this study was to validate the accuracy and sensitivity of our method for comprehensive mutation detection in autism spectrum disorder (ASD). We assessed the performance of the bench-top Ion Torrent PGM and Illumina MiSeq platforms as optimized solutions for mutation detection, using microdroplet PCR-based enrichment of 62 ASD associated genes. Ten patients with known mutations were sequenced using NGS to validate the sensitivity of our method. The overall read quality was better with MiSeq, largely because of the increased indel-related error associated with PGM. The sensitivity of SNV detection was similar between the two platforms, suggesting they are both suitable for SNV detection in the human genome. Next, we used these methods to analyze 28 patients with ASD, and identified 22 novel variants in genes associated with ASD, with one mutation detected by MiSeq only. Thus, our results support the combination of target gene enrichment and NGS as a valuable molecular method for investigating rare variants in ASD. PMID:24066114
Gene panel testing for inherited cancer risk.

PubMed

Hall, Michael J; Forman, Andrea D; Pilarski, Robert; Wiesner, Georgia; Giri, Veda N

2014-09-01

Next-generation sequencing technologies have ushered in the capability to assess multiple genes in parallel for genetic alterations that may contribute to inherited risk for cancers in families. Thus, gene panel testing is now an option in the setting of genetic counseling and testing for cancer risk. This article describes the many gene panel testing options clinically available to assess inherited cancer susceptibility, the potential advantages and challenges associated with various types of panels, clinical scenarios in which gene panels may be particularly useful in cancer risk assessment, and testing and counseling considerations. Given the potential issues for patients and their families, gene panel testing for inherited cancer risk is recommended to be offered in conjunction or consultation with an experienced cancer genetic specialist, such as a certified genetic counselor or geneticist, as an integral part of the testing process. Copyright © 2014 by the National Comprehensive Cancer Network.
Genomic analysis reveals extensive gene duplication within the bovine TRB locus

PubMed Central

Connelley, Timothy; Aerts, Jan; Law, Andy; Morrison, W Ivan

2009-01-01

Background Diverse TR and IG repertoires are generated by V(D)J somatic recombination. Genomic studies have been pivotal in cataloguing the V, D, J and C genes present in the various TR/IG loci and describing how duplication events have expanded the number of these genes. Such studies have also provided insights into the evolution of these loci and the complex mechanisms that regulate TR/IG expression. In this study we analyze the sequence of the third bovine genome assembly to characterize the germline repertoire of bovine TRB genes and compare the organization, evolution and regulatory structure of the bovine TRB locus with that of humans and mice. Results The TRB locus in the third bovine genome assembly is distributed over 5 scaffolds, extending to ~730 Kb. The available sequence contains 134 TRBV genes, assigned to 24 subgroups, and 3 clusters of DJC genes, each comprising a single TRBD gene, 5–7 TRBJ genes and a single TRBC gene. Seventy-nine of the TRBV genes are predicted to be functional. Comparison with the human and murine TRB loci shows that the gene order, as well as the sequences of non-coding elements that regulate TRB expression, are highly conserved in the bovine. Dot-plot analyses demonstrate that expansion of the genomic TRBV repertoire has occurred via a complex and extensive series of duplications, predominantly involving DNA blocks containing multiple genes. These duplication events have resulted in massive expansion of several TRBV subgroups, most notably TRBV6, 9 and 21 which contain 40, 35 and 16 members respectively. Similarly, duplication has led to the generation of a third DJC cluster. Analyses of cDNA data confirm the diversity of the TRBV genes and, in addition, identify a substantial number of TRBV genes, predominantly from the larger subgroups, which are still absent from the genome assembly.
The observed gene duplication within the bovine TRB locus has created a repertoire of phylogenetically diverse functional TRBV genes, which is substantially larger than that described for humans and mice. Conclusion The analyses completed in this study reveal that, although the gene content and organization of the bovine TRB locus are broadly similar to that of humans and mice, multiple duplication events have led to a marked expansion in the number of TRB genes. Similar expansions in other ruminant TR loci suggest strong evolutionary pressures in this lineage have selected for the development of enlarged sets of TR genes that can contribute to diverse TR repertoires. PMID:19393068
Simultaneous learning of instantaneous and time-delayed genetic interactions using novel information theoretic scoring technique

PubMed Central

2012-01-01

Background Understanding gene interactions is a fundamental question in systems biology. Currently, modeling of gene regulations using the Bayesian Network (BN) formalism assumes that genes interact either instantaneously or with a certain amount of time delay. In reality, however, instantaneous and time-delayed biological regulations occur simultaneously. A framework that can detect and model both types of interaction simultaneously would represent gene regulatory networks more accurately. Results In this paper, we introduce a framework based on the Bayesian Network (BN) formalism that can represent both instantaneous and time-delayed interactions between genes simultaneously. A novel scoring metric having firm mathematical underpinnings is also proposed that, unlike other recent methods, can score both interactions concurrently and takes into account the reality that multiple regulators can regulate a gene jointly, rather than in an isolated pair-wise manner. Further, a gene regulatory network (GRN) inference method employing an evolutionary search that makes use of the framework and the scoring metric is also presented. Conclusion By taking into consideration the biological fact that both instantaneous and time-delayed regulations can occur among genes, our approach models gene interactions with greater accuracy. The proposed framework is efficient and can be used to infer gene networks having multiple orders of instantaneous and time-delayed regulations simultaneously. Experiments are carried out using three different synthetic networks (with three different mechanisms for generating synthetic data) as well as real life networks of Saccharomyces cerevisiae, E. coli and cyanobacteria gene expression data. The results show the effectiveness of our approach. PMID:22691450
Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks

PubMed Central

Dalman, Mark; Haddad, Joseph; Duan, Zhong-Hui

2017-01-01

The PubMed database offers an extensive set of publication data that is useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval as well as curated gene expression profiles. Genetic interaction databases, including Reactome and Ingenuity Pathway Analysis, offer pathway and experiment data analysis using data curated from these publications and data repositories. We have created a method to generate and analyze consensus networks, inferring potential gene interactions, using large numbers of Bayesian networks generated by data mining publications in the PubMed database. Through the concept of network resolution, these consensus networks can be tailored to represent possible genetic interactions. We designed a set of experiments to confirm that our method is stable across variation in both sample and topological input sizes. Using gene product interactions from the KEGG pathway database and data mining PubMed publication abstracts, we verify that regardless of the network resolution or the inferred consensus network, our method is capable of inferring meaningful gene interactions through consensus Bayesian network generation with multiple, randomized topological orderings. Our method can not only confirm the existence of currently accepted interactions, but also has the potential to hypothesize new ones. We show that our method confirms the existence of known gene interactions such as JAK-STAT-PI3K-AKT-mTOR, infers novel gene interactions such as RAS-Bcl-2 and RAS-AKT, and finds significant pathway-pathway interactions between the JAK-STAT signaling and Cardiac Muscle Contraction KEGG pathways. PMID:29049295
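One common way to build a consensus network from many inferred networks is to keep only the edges that recur across a chosen fraction of them. The sketch below takes that approach; reading "network resolution" as such a frequency threshold is an assumption made here for illustration, not a definition given in the abstract, and the gene labels and networks are toy values.

```python
from collections import Counter

def consensus_edges(networks, resolution):
    """Keep directed edges found in at least `resolution` (0..1) of the networks.

    `networks` is a list of edge collections, each edge a (parent, child) tuple.
    Returns a dict mapping each retained edge to its observed frequency.
    """
    if not networks:
        return {}
    # set(net) ensures an edge is counted at most once per network
    counts = Counter(edge for net in networks for edge in set(net))
    total = len(networks)
    return {edge: n / total for edge, n in counts.items() if n / total >= resolution}

# Toy networks over hypothetical genes G1..G4
toy_networks = [
    {("G1", "G2"), ("G3", "G4")},
    {("G1", "G2")},
    {("G1", "G2"), ("G2", "G4")},
    {("G1", "G2"), ("G3", "G4")},
]
for edge, freq in sorted(consensus_edges(toy_networks, 0.5).items()):
    print(edge, freq)
```

Raising the threshold yields a sparser, higher-confidence network, while lowering it admits more speculative edges.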
Role of the Trichoderma harzianum Endochitinase Gene, ech42, in Mycoparasitism

PubMed Central

Carsolio, Carolina; Benhamou, Nicole; Haran, Shoshan; Cortés, Carlos; Gutiérrez, Ana; Chet, Ilan; Herrera-Estrella, Alfredo

1999-01-01

The role of the Trichoderma harzianum endochitinase (Ech42) in mycoparasitism was studied by genetically manipulating the gene that encodes Ech42, ech42. We constructed several transgenic T. harzianum strains carrying multiple copies of ech42 and the corresponding gene disruptants. The level of extracellular endochitinase activity when T. harzianum was grown under inducing conditions increased up to 42-fold in multicopy strains as compared with the wild type, whereas gene disruptants exhibited practically no activity. The densities of chitin labeling of Rhizoctonia solani cell walls after interactions with gene disruptants were not statistically significantly different from the density of chitin labeling after interactions with the wild type. Finally, no major differences in the efficacies of the strains generated as biocontrol agents against R. solani or Sclerotium rolfsii were observed in greenhouse experiments. PMID:10049844
Engineering Extracellular Expression Systems in Escherichia coli Based on Transcriptome Analysis and Cell Growth State.

PubMed

Gao, Wen; Yin, Jun; Bao, Lichen; Wang, Qun; Hou, Shan; Yue, Yali; Yao, Wenbing; Gao, Xiangdong

2018-05-18

Escherichia coli extracellular expression systems have a number of advantages over other systems, such as lower pyrogen levels and a simple purification process. Various approaches, such as the generation of leaky mutants via chromosomal engineering, have been explored for this expression system. However, extracellular protein yields in leaky mutants are relatively low compared to that in intracellular expression systems and therefore need to be improved. In this work, we describe the construction, characterization, and mechanism of enhanced extracellular expression in Escherichia coli. On the basis of the localizations, functions, and transcription levels of cell envelope proteins, we systematically elucidated the effects of multiple gene deletions on cell growth and extracellular expression using modified CRISPR/Cas9-based genome editing and a FlAsH labeling assay. High extracellular yields of heterologous proteins of different sizes were obtained by screening multiple gene mutations. The enhancement of extracellular secretion was associated with the derepression of translation and translocation. This work utilized universal methods in the design of extracellular expression systems for genes not directly associated with protein synthesis that were used to generate strains with higher protein expression capability. We anticipate that extracellular expression systems may help to shed light on the poorly understood aspects of these secretion processes as well as to further assist in the construction of engineered prokaryotic cells for efficient extracellular production of heterologous proteins.
Translation initiation events on structured eukaryotic mRNAs generate gene expression noise

PubMed Central

Dacheux, Estelle; Malys, Naglis; Meng, Xiang; Ramachandran, Vinoy; Mendes, Pedro

2017-01-01

Gene expression stochasticity plays a major role in biology, creating non-genetic cellular individuality and influencing multiple processes, including differentiation and stress responses. We have addressed the lack of knowledge about posttranscriptional contributions to noise by determining cell-to-cell variations in the abundance of mRNA and reporter protein in yeast. Two types of structural element, a stem–loop and a poly(G) motif, not only inhibit translation initiation when inserted into an mRNA 5′ untranslated region, but also generate noise. The noise-enhancing effect of the stem–loop structure also remains operational when combined with an upstream open reading frame. This has broad significance, since these elements are known to modulate the expression of a diversity of eukaryotic genes. Our findings suggest a mechanism for posttranscriptional noise generation that will contribute to understanding of the generally poor correlation between protein-level stochasticity and transcriptional bursting. We propose that posttranscriptional stochasticity can be linked to cycles of folding/unfolding of a stem–loop structure, or to interconversion between higher-order structural conformations of a G-rich motif, and have created a correspondingly configured computational model that generates fits to the experimental data. Stochastic events occurring during the ribosomal scanning process can therefore feature alongside transcriptional bursting as a source of noise. PMID:28521011
Analysis and functional annotation of expressed sequence tags (ESTs) from multiple tissues of oil palm (Elaeis guineensis Jacq.)

PubMed Central

Ho, Chai-Ling; Kwan, Yen-Yen; Choi, Mei-Chooi; Tee, Sue-Sean; Ng, Wai-Har; Lim, Kok-Ang; Lee, Yang-Ping; Ooi, Siew-Eng; Lee, Weng-Wah; Tee, Jin-Ming; Tan, Siang-Hee; Kulaveerasingam, Harikrishna; Alwee, Sharifah Shahrul Rabiah Syed; Abdullah, Meilina Ong

2007-01-01

Background Oil palm is the second largest source of edible oil, contributing approximately 20% of the world's production of oils and fats. In order to understand the molecular biology involved in in vitro propagation, flowering, efficient utilization of nitrogen sources and root diseases, we have initiated an expressed sequence tag (EST) analysis on oil palm. Results In this study, six cDNA libraries from oil palm zygotic embryos, suspension cells, shoot apical meristems, young flowers, mature flowers and roots, were constructed. We have generated a total of 14537 expressed sequence tags (ESTs) from these libraries, from which 6464 tentative unique contigs (TUCs) and 2129 singletons were obtained. Approximately 6008 of these tentative unique genes (TUGs) have significant matches to the non-redundant protein database, from which 2361 were assigned to one or more Gene Ontology categories. Predominant transcripts and differentially expressed genes were identified in multiple oil palm tissues. Homologues of genes involved in many aspects of flower development were also identified among the EST collection, such as CONSTANS-like, AGAMOUS-like (AGL)2, AGL20, LFY-like, SQUAMOSA, SQUAMOSA binding protein (SBP) etc. The majority of them are the first representatives in oil palm, providing opportunities to explore the cause of epigenetic homeotic flowering abnormality in oil palm, given the importance of flowering in fruit production. The transcript levels of two flowering-related genes, EgSBP and EgSEP, were analysed in the flower tissues of various developmental stages. Gene homologues for enzymes involved in oil biosynthesis, utilization of nitrogen sources, and scavenging of oxygen radicals, were also uncovered among the oil palm ESTs.
Conclusion The EST sequences generated will allow comparative genomic studies between oil palm and other monocotyledonous and dicotyledonous plants, development of gene-targeted markers for the reference genetic map, design and fabrication of DNA array for future studies of oil palm. The outcomes of such studies will contribute to oil palm improvements through the establishment of breeding program using marker-assisted selection, development of diagnostic assays using gene targeted markers, and discovery of candidate genes related to important agronomic traits of oil palm. PMID:17953740

Grains of connectivity: analysis at multiple spatial scales in landscape genetics.

PubMed

Galpern, Paul; Manseau, Micheline; Wilson, Paul

2012-08-01

Landscape genetic analyses are typically conducted at one spatial scale. Considering multiple scales may be essential for identifying landscape features influencing gene flow. We examined landscape connectivity for woodland caribou (Rangifer tarandus caribou) at multiple spatial scales using a new approach based on landscape graphs that creates a Voronoi tessellation of the landscape. To illustrate the potential of the method, we generated five resistance surfaces to explain how landscape pattern may influence gene flow across the range of this population. We tested each resistance surface using a raster at the spatial grain of available landscape data (200 m grid squares). We then used our method to produce up to 127 additional grains for each resistance surface. We applied a causal modelling framework with partial Mantel tests, where evidence of landscape resistance is tested against an alternative hypothesis of isolation-by-distance, and found statistically significant support for landscape resistance to gene flow in 89 of the 507 spatial grains examined. We found evidence that major roads as well as the cumulative effects of natural and anthropogenic disturbance may be contributing to the genetic structure. Using only the original grid surface yielded no evidence for landscape resistance to gene flow. Our results show that using multiple spatial grains can reveal landscape influences on genetic structure that may be overlooked with a single grain, and suggest that coarsening the grain of landcover data may be appropriate for highly mobile species. We discuss how grains of connectivity and related analyses have potential landscape genetic applications in a broad range of systems. © 2012 Blackwell Publishing Ltd.
Recent Advancement of the Molecular Diagnosis in Pediatric Brain Tumor.

PubMed

Bae, Jeong-Mo; Won, Jae-Kyung; Park, Sung-Hye

2018-05-01

Recent discoveries of brain tumor-related genes and fast advances in genomic testing technologies have led to the era of molecular diagnosis of brain tumor. Molecular profiling of brain tumor has become a significant step in the diagnosis, the prediction of prognosis and the treatment of brain tumor. Because traditional molecular testing methods have limitations in time and cost for multiple gene tests, next-generation sequencing technologies are being rapidly introduced into clinical practice. Targeted sequencing panels using these technologies have been developed for brain tumors. In this article, focused on pediatric brain tumor, key discoveries of brain tumor-related genes are reviewed and cancer panels used in the molecular profiling of brain tumor are discussed.
Recent Advancement of the Molecular Diagnosis in Pediatric Brain Tumor

PubMed Central

Bae, Jeong-Mo; Won, Jae-Kyung; Park, Sung-Hye

2018-01-01

Recent discoveries of brain tumor-related genes and fast advances in genomic testing technologies have led to the era of molecular diagnosis of brain tumor. Molecular profiling of brain tumor has become a significant step in the diagnosis, the prediction of prognosis and the treatment of brain tumor. Because traditional molecular testing methods have limitations in time and cost for multiple gene tests, next-generation sequencing technologies are being rapidly introduced into clinical practice. Targeted sequencing panels using these technologies have been developed for brain tumors. In this article, focused on pediatric brain tumor, key discoveries of brain tumor-related genes are reviewed and cancer panels used in the molecular profiling of brain tumor are discussed. PMID:29742887
Peeling skin syndrome associated with novel variant in FLG2 gene.

PubMed

Alfares, Ahmed; Al-Khenaizan, Sultan; Al Mutairi, Fuad

2017-12-01

Peeling skin syndrome is a rare genodermatosis characterized by variably pruritic superficial generalized peeling of the skin, with several genes involved. Until now, little has been known about the association between FLG2 and peeling skin syndrome. We describe multiple family members from a consanguineous Saudi family with peeling skin syndrome. Next-generation sequencing identified a cosegregating novel variant in FLG2, c.632C>G (p.Ser211*), as a likely etiology in this family. Here, we report on the clinical manifestation of a homozygous loss-of-function variant in FLG2 as a disease-causing gene for peeling skin syndrome and expand the dermatology findings. © 2017 Wiley Periodicals, Inc.
Network Biomarkers of Bladder Cancer Based on a Genome-Wide Genetic and Epigenetic Network Derived from Next-Generation Sequencing Data.

PubMed

Li, Cheng-Wei; Chen, Bor-Sen

2016-01-01

Epigenetic and microRNA (miRNA) regulation are associated with carcinogenesis and the development of cancer. By using the available omics data, including those from next-generation sequencing (NGS), genome-wide methylation profiling, candidate integrated genetic and epigenetic network (IGEN) analysis, and drug response genome-wide microarray analysis, we constructed an IGEN system based on three coupling regression models that characterize protein-protein interaction networks (PPINs), gene regulatory networks (GRNs), miRNA regulatory networks (MRNs), and epigenetic regulatory networks (ERNs). By applying a system identification method and principal genome-wide network projection (PGNP) to IGEN analysis, we identified the core network biomarkers to investigate bladder carcinogenic mechanisms and design multiple drug combinations for treating bladder cancer with minimal side-effects. The progression of DNA repair and cell proliferation in stage 1 bladder cancer ultimately results not only in the derepression of miR-200a and miR-200b but also in the regulation of the TNF pathway to metastasis-related genes or proteins, cell proliferation, and DNA repair in stage 4 bladder cancer. We designed a multiple drug combination comprising gefitinib, estradiol, yohimbine, and fulvestrant for treating stage 1 bladder cancer with minimal side-effects, and another multiple drug combination comprising gefitinib, estradiol, chlorpromazine, and LY294002 for treating stage 4 bladder cancer with minimal side-effects.
Principles of gene microarray data analysis.

PubMed

Mocellin, Simone; Rossi, Carlo Riccardo

2007-01-01

The development of several gene expression profiling methods, such as comparative genomic hybridization (CGH), differential display, serial analysis of gene expression (SAGE), and gene microarray, together with the sequencing of the human genome, has provided an opportunity to monitor and investigate the complex cascade of molecular events leading to tumor development and progression. The availability of such large amounts of information has shifted the attention of scientists towards a nonreductionist approach to biological phenomena. High throughput technologies can be used to follow changing patterns of gene expression over time. Among them, gene microarray has become prominent because it is easier to use, does not require large-scale DNA sequencing, and allows for the parallel quantification of thousands of genes from multiple samples. Gene microarray technology is rapidly spreading worldwide and has the potential to drastically change the therapeutic approach to patients affected with tumor. Therefore, it is of paramount importance for both researchers and clinicians to know the principles underlying the analysis of the huge amount of data generated with microarray technology.
Web-Based Phylogenetic Assignment Tool for Analysis of Terminal Restriction Fragment Length Polymorphism Profiles of Microbial Communities

PubMed Central

Kent, Angela D.; Smith, Dan J.; Benson, Barbara J.; Triplett, Eric W.

2003-01-01

Culture-independent DNA fingerprints are commonly used to assess the diversity of a microbial community. However, relating species composition to community profiles produced by community fingerprint methods is not straightforward. Terminal restriction fragment length polymorphism (T-RFLP) is a community fingerprint method in which phylogenetic assignments may be inferred from the terminal restriction fragment (T-RF) sizes through the use of web-based resources that predict T-RF sizes for known bacteria. The process quickly becomes computationally intensive due to the need to analyze profiles produced by multiple restriction digests and the complexity of profiles generated by natural microbial communities. A web-based tool is described here that rapidly generates phylogenetic assignments from submitted community T-RFLP profiles based on a database of fragments produced by known 16S rRNA gene sequences. Users have the option of submitting a customized database generated from unpublished sequences or from a gene other than the 16S rRNA gene. This phylogenetic assignment tool allows users to employ T-RFLP to simultaneously analyze microbial community diversity and species composition. An analysis of the variability of bacterial species composition throughout the water column in a humic lake was carried out to demonstrate the functionality of the phylogenetic assignment tool. This method was validated by comparing the results generated by this program with results from a 16S rRNA gene clone library. PMID:14602639
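The matching step described above (predicting terminal fragment sizes for known sequences, then comparing them with observed T-RF sizes) can be sketched as follows. This is a simplified illustration rather than the tool's actual algorithm: the function names, the size tolerance, and the toy sequences are hypothetical. The only enzyme fact relied on is that MspI recognizes CCGG and cuts after the first C.

```python
def terminal_fragment_length(sequence, site, cut_offset):
    """Length of the 5' terminal fragment after cutting at the first site.

    `cut_offset` is the number of bases of the recognition site that stay
    on the 5' fragment (1 for MspI, which cuts C^CGG).
    Returns None if the sequence has no recognition site.
    """
    position = sequence.find(site)
    if position == -1:
        return None
    return position + cut_offset

def assign_fragments(observed_sizes, database, site, cut_offset, tolerance=1):
    """Map each observed T-RF size to database entries within `tolerance` bases."""
    predicted = {
        name: terminal_fragment_length(seq, site, cut_offset)
        for name, seq in database.items()
    }
    return {
        size: sorted(
            name for name, length in predicted.items()
            if length is not None and abs(length - size) <= tolerance
        )
        for size in observed_sizes
    }

# Toy sequences standing in for labeled 16S rRNA gene amplicons
toy_database = {
    "taxon_A": "AGAGTTTGATCCGGCTCAG",
    "taxon_B": "AGAGTTTGATCATGGCTCAGCCGGA",
    "taxon_C": "AGAGTTTGATTTT",
}
print(assign_fragments([11, 21, 50], toy_database, "CCGG", 1))
```

Running the same comparison for several restriction enzymes and intersecting the candidate lists would narrow the assignments, which reflects the multiple-digest workload the abstract mentions.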
Negative Example Selection for Protein Function Prediction: The NoGO Database

PubMed Central

Youngs, Noah; Penfold-Brown, Duncan; Bonneau, Richard; Shasha, Dennis

2014-01-01

Negative examples – genes that are known not to carry out a given protein function – are rarely recorded in genome and proteome annotation databases, such as the Gene Ontology database. Negative examples are required, however, for several of the most powerful machine learning methods for integrative protein function prediction. Most protein function prediction efforts have relied on a variety of heuristics for the choice of negative examples. Determining the accuracy of methods for negative example prediction is itself a non-trivial task, given that the Open World Assumption as applied to gene annotations rules out many traditional validation metrics. We present a rigorous comparison of these heuristics, utilizing a temporal holdout, and a novel evaluation strategy for negative examples. We add to this comparison several algorithms adapted from Positive-Unlabeled learning scenarios in text-classification, which are the current state of the art methods for generating negative examples in low-density annotation contexts. Lastly, we present two novel algorithms of our own construction, one based on empirical conditional probability, and the other using topic modeling applied to genes and annotations. We demonstrate that our algorithms achieve significantly fewer incorrect negative example predictions than the current state of the art, using multiple benchmarks covering multiple organisms. Our methods may be applied to generate negative examples for any type of method that deals with protein function, and to this end we provide a database of negative examples in several well-studied organisms, for general use (The NoGO database, available at: bonneaulab.bio.nyu.edu/nogo.html). PMID:24922051
A simplified Sanger sequencing method for full genome sequencing of multiple subtypes of human influenza A viruses.

PubMed

Deng, Yi-Mo; Spirason, Natalie; Iannello, Pina; Jelley, Lauren; Lau, Hilda; Barr, Ian G

2015-07-01

Full genome sequencing of influenza A viruses (IAV), including those that arise from annual influenza epidemics, is undertaken to determine if reassorting has occurred or if other pathogenic traits are present. Traditionally IAV sequencing has been biased toward the major surface glycoproteins haemagglutinin and neuraminidase, while the internal genes are often ignored. Despite the development of next generation sequencing (NGS), many laboratories are still reliant on conventional Sanger sequencing to sequence IAV. The objective was to develop a minimal and robust set of primers for Sanger sequencing of the full genome of IAV currently circulating in humans. A set of 13 primer pairs was designed that enabled amplification of the six internal genes of multiple human IAV subtypes including the recent avian influenza A(H7N9) virus from China. Specific primers were designed to amplify the HA and NA genes of each IAV subtype of interest. Each of the primers also incorporated a binding site at its 5'-end for either a forward or reverse M13 primer, such that only two M13 primers were required for all subsequent sequencing reactions. This minimal set of primers was suitable for sequencing the six internal genes of all currently circulating human seasonal influenza A subtypes as well as the avian A(H7N9) viruses that have infected humans in China. This streamlined Sanger sequencing protocol could be used to generate full genome sequence data more rapidly and easily than existing influenza genome sequencing protocols. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
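The M13-tailing idea described above can be illustrated with a short sketch that prepends a universal tail to each gene-specific primer. The tail sequences used here are the widely used M13 forward (-20) and M13 reverse sequences, which is an assumption, since the abstract does not state which M13 variants were used. The gene-specific primer sequences are placeholders, not the published primers.

```python
# Widely used universal sequences; assumed here, not taken from the paper
M13_FORWARD_TAIL = "GTAAAACGACGGCCAGT"
M13_REVERSE_TAIL = "CAGGAAACAGCTATGAC"

def tail_primer_pair(forward_primer, reverse_primer):
    """Prepend M13 tails to the 5' ends of a gene-specific primer pair."""
    return (M13_FORWARD_TAIL + forward_primer,
            M13_REVERSE_TAIL + reverse_primer)

# Placeholder gene-specific primers (hypothetical sequences)
tailed_forward, tailed_reverse = tail_primer_pair(
    "ATGGAGAGAATAAAAGAAC", "CTAATTGATGGCCATCCG"
)
print(tailed_forward)
print(tailed_reverse)
```

Because every amplicon then carries the same two tails, a single pair of M13 sequencing primers can read all products regardless of which gene segment was amplified.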
TumorNext-Lynch-MMR: a comprehensive next generation sequencing assay for the detection of germline and somatic mutations in genes associated with mismatch repair deficiency and Lynch syndrome.

PubMed

Gray, Phillip N; Tsai, Pei; Chen, Daniel; Wu, Sitao; Hoo, Jayne; Mu, Wenbo; Li, Bing; Vuong, Huy; Lu, Hsiao-Mei; Batth, Navanjot; Willett, Sara; Uyeda, Lisa; Shah, Swati; Gau, Chia-Ling; Umali, Monalyn; Espenschied, Carin; Janicek, Mike; Brown, Sandra; Margileth, David; Dobrea, Lavinia; Wagman, Lawrence; Rana, Huma; Hall, Michael J; Ross, Theodora; Terdiman, Jonathan; Cullinane, Carey; Ries, Savita; Totten, Ellen; Elliott, Aaron M

2018-04-17

The current algorithm for Lynch syndrome diagnosis is highly complex, with multiple steps that can result in an extended time to diagnosis while depleting precious tumor specimens. Here we describe the analytical validation of a custom probe-based NGS tumor panel, TumorNext-Lynch-MMR, which generates a comprehensive genetic profile of both germline and somatic mutations that can accelerate and streamline the time to diagnosis and preserve specimen. TumorNext-Lynch-MMR can detect single nucleotide variants, small insertions and deletions in 39 genes that are frequently mutated in Lynch syndrome and colorectal cancer. Moreover, the panel provides microsatellite instability status and detects loss of heterozygosity in the five Lynch genes: MSH2, MSH6, MLH1, PMS2 and EPCAM. Clinical cases are described that highlight the assay's ability to differentiate between somatic and germline mutations, precisely classify variants and resolve discordant cases.
Alternative Splicing of Four Trafficking Genes Regulates Myofiber Structure and Skeletal Muscle Physiology.

PubMed

Giudice, Jimena; Loehr, James A; Rodney, George G; Cooper, Thomas A

2016-11-15

During development, transcriptional and post-transcriptional networks are coordinately regulated to drive organ maturation. Alternative splicing contributes by producing temporal-specific protein isoforms. We previously found that genes undergoing splicing transitions during mouse postnatal heart development are enriched for vesicular trafficking and membrane dynamics functions. Here, we show that adult trafficking isoforms are also expressed in adult skeletal muscle and hypothesize that striated muscle utilizes alternative splicing to generate specific isoforms required for function of adult tissue. We deliver morpholinos into flexor digitorum brevis muscles in adult mice to redirect splicing of four trafficking genes to the fetal isoforms. The splicing switch results in multiple structural and functional defects, including transverse tubule (T-tubule) disruption and dihydropyridine receptor alpha (DHPR) and Ryr1 mislocalization, impairing excitation-contraction coupling, calcium handling, and force generation. The results demonstrate a previously unrecognized role for trafficking functions in adult muscle tissue homeostasis and a specific requirement for the adult splice variants. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.
TumorNext-Lynch-MMR: a comprehensive next generation sequencing assay for the detection of germline and somatic mutations in genes associated with mismatch repair deficiency and Lynch syndrome

PubMed Central

Gray, Phillip N.; Tsai, Pei; Chen, Daniel; Wu, Sitao; Hoo, Jayne; Mu, Wenbo; Li, Bing; Vuong, Huy; Lu, Hsiao-Mei; Batth, Navanjot; Willett, Sara; Uyeda, Lisa; Shah, Swati; Gau, Chia-Ling; Umali, Monalyn; Espenschied, Carin; Janicek, Mike; Brown, Sandra; Margileth, David; Dobrea, Lavinia; Wagman, Lawrence; Rana, Huma; Hall, Michael J.; Ross, Theodora; Terdiman, Jonathan; Cullinane, Carey; Ries, Savita; Totten, Ellen; Elliott, Aaron M.

2018-01-01

The current algorithm for Lynch syndrome diagnosis is highly complex, with multiple steps that can extend the time to diagnosis while depleting precious tumor specimens. Here we describe the analytical validation of a custom probe-based NGS tumor panel, TumorNext-Lynch-MMR, which generates a comprehensive genetic profile of both germline and somatic mutations that can accelerate and streamline the time to diagnosis and preserve specimen. TumorNext-Lynch-MMR can detect single nucleotide variants, small insertions and deletions in 39 genes that are frequently mutated in Lynch syndrome and colorectal cancer. Moreover, the panel provides microsatellite instability status and detects loss of heterozygosity in the five Lynch genes: MSH2, MSH6, MLH1, PMS2 and EPCAM. Clinical cases are described that highlight the assay's ability to differentiate between somatic and germline mutations, precisely classify variants and resolve discordant cases. PMID:29755653
Molecular Approaches to Thyroid Cancer Diagnosis

PubMed Central

Hsiao, Susan J.; Nikiforov, Yuri E.

2014-01-01

Thyroid nodules are common, and the accurate diagnosis of cancer or benign disease is important for the effective clinical management of these patients. Molecular markers are a helpful diagnostic tool, particularly for cytologically indeterminate thyroid nodules. In the past few years, significant progress has been made in developing molecular markers for clinical use in fine needle aspiration (FNA) specimens, including gene mutation panels and gene expression classifiers. With the availability of next generation sequencing technology, gene mutation panels can be expanded to interrogate multiple genes simultaneously and to provide yet more accurate diagnostic information. In addition, recently several new molecular markers in thyroid cancer have been identified that offer diagnostic, prognostic, and therapeutic information that could potentially be of value in guiding individualized management of patients with thyroid nodules. PMID:24829266
Chronic exposure to water pollutant trichloroethylene increased epigenetic drift in CD4(+) T cells.

PubMed

Gilbert, Kathleen M; Blossom, Sarah J; Erickson, Stephen W; Reisfeld, Brad; Zurlinden, Todd J; Broadfoot, Brannon; West, Kirk; Bai, Shasha; Cooney, Craig A

2016-05-01

Autoimmune disease and CD4(+) T-cell alterations are induced in mice exposed to the water pollutant trichloroethylene (TCE). We examined here whether TCE altered gene-specific DNA methylation in CD4(+) T cells as a possible mechanism of immunotoxicity. Naive and effector/memory CD4(+) T cells from mice exposed to TCE (0.5 mg/ml in drinking water) for 40 weeks were examined by bisulfite next-generation DNA sequencing. A probabilistic model calculated from multiple genes showed that TCE decreased methylation control in CD4(+) T cells. Data from individual genes fitted to a quadratic regression model showed that TCE increased gene-specific methylation variance in both CD4 subsets. TCE increased epigenetic drift of specific CpG sites in CD4(+) T cells.
Single-step generation of gene knockout-rescue system in pluripotent stem cells by promoter insertion with CRISPR/Cas9.

PubMed

Matsunaga, Taichi; Yamashita, Jun K

2014-02-07

Specific gene knockout and rescue experiments are powerful tools in developmental and stem cell biology. Nevertheless, the experiments require multiple steps of molecular manipulation for gene knockout and subsequent rescue procedures. Here we report an efficient, single-step strategy to generate a gene knockout-rescue system in pluripotent stem cells by promoter insertion with CRISPR/Cas9 genome editing technology. We inserted a tetracycline-regulated inducible gene promoter (tet-OFF/TRE-CMV) upstream of the endogenous promoter region of the vascular endothelial growth factor receptor 2 (VEGFR2/Flk1) gene, an essential gene for endothelial cell (EC) differentiation, in mouse embryonic stem cells (ESCs) with homologous recombination. Both homo- and hetero-inserted clones were efficiently obtained through a simple selection with a drug-resistant gene. The insertion of the TRE-CMV promoter disrupted endogenous Flk1 expression, resulting in a null mutation in homo-inserted clones. When the inserted TRE-CMV promoter was activated with doxycycline (Dox) depletion, Flk1 expression was sufficiently recovered from the downstream genomic Flk1 gene. Whereas EC differentiation was almost completely perturbed in homo-inserted clones, Flk1 rescue with TRE-CMV promoter activation restored EC appearance, indicating that phenotypic changes in EC differentiation can be successfully reproduced with this knockout-rescue system. Thus, this promoter insertion strategy with CRISPR/Cas9 would be a novel, attractive method for knockout-rescue experiments. Copyright © 2014 Elsevier Inc. All rights reserved.
A deep transcriptomic resource for the copepod crustacean Labidocera madurae: A potential indicator species for assessing near shore ecosystem health

PubMed Central

Christie, Andrew E.; Sommer, Stephanie A.; Cieslak, Matthew C.; Hartline, Daniel K.; Lenz, Petra H.

2017-01-01

Coral reef ecosystems of many sub-tropical and tropical marine coastal environments have suffered significant degradation from anthropogenic sources. Research to inform management strategies that mitigate stressors and promote a healthy ecosystem has focused on the ecology and physiology of coral reefs and associated organisms. Few studies focus on the surrounding pelagic communities, which are equally important to ecosystem function. Zooplankton, often dominated by small crustaceans such as copepods, is an important food source for invertebrates and fishes, especially larval fishes. The reef-associated zooplankton includes a sub-neustonic copepod family that could serve as an indicator species for the community. Here, we describe the generation of a de novo transcriptome for one such copepod, Labidocera madurae, a pontellid from an intensively-studied coral reef ecosystem, Kāne‘ohe Bay, Oahu, Hawai‘i. The transcriptome was assembled using high-throughput sequence data obtained from whole organisms. It comprised 211,002 unique transcripts, including 72,391 with coding regions. It was assessed for quality and completeness using multiple workflows. Bench-marking-universal-single-copy-orthologs (BUSCO) analysis identified transcripts for 88% of expected eukaryotic core proteins. Targeted gene-discovery analyses included searches for transcripts coding full-length “giant” proteins (>4,000 amino acids), proteins and splice variants of voltage-gated sodium channels, and proteins involved in the circadian signaling pathway. Four different reference transcriptomes were generated and compared for the detection of differential gene expression between copepodites and adult females; 6,229 genes were consistently identified as differentially expressed between the two regardless of reference. Automated bioinformatics analyses and targeted manual gene curation suggest that the de novo assembled L. madurae transcriptome is of high quality and completeness. 
This transcriptome provides a new resource for assessing the global physiological status of a planktonic species inhabiting a coral reef ecosystem that is subjected to multiple anthropogenic stressors. The workflows provide a template for generating and assessing transcriptomes in other non-model species. PMID:29065152
A deep transcriptomic resource for the copepod crustacean Labidocera madurae: A potential indicator species for assessing near shore ecosystem health.

PubMed

Roncalli, Vittoria; Christie, Andrew E; Sommer, Stephanie A; Cieslak, Matthew C; Hartline, Daniel K; Lenz, Petra H

2017-01-01

Coral reef ecosystems of many sub-tropical and tropical marine coastal environments have suffered significant degradation from anthropogenic sources. Research to inform management strategies that mitigate stressors and promote a healthy ecosystem has focused on the ecology and physiology of coral reefs and associated organisms. Few studies focus on the surrounding pelagic communities, which are equally important to ecosystem function. Zooplankton, often dominated by small crustaceans such as copepods, is an important food source for invertebrates and fishes, especially larval fishes. The reef-associated zooplankton includes a sub-neustonic copepod family that could serve as an indicator species for the community. Here, we describe the generation of a de novo transcriptome for one such copepod, Labidocera madurae, a pontellid from an intensively-studied coral reef ecosystem, Kāne'ohe Bay, Oahu, Hawai'i. The transcriptome was assembled using high-throughput sequence data obtained from whole organisms. It comprised 211,002 unique transcripts, including 72,391 with coding regions. It was assessed for quality and completeness using multiple workflows. Bench-marking-universal-single-copy-orthologs (BUSCO) analysis identified transcripts for 88% of expected eukaryotic core proteins. Targeted gene-discovery analyses included searches for transcripts coding full-length "giant" proteins (>4,000 amino acids), proteins and splice variants of voltage-gated sodium channels, and proteins involved in the circadian signaling pathway. Four different reference transcriptomes were generated and compared for the detection of differential gene expression between copepodites and adult females; 6,229 genes were consistently identified as differentially expressed between the two regardless of reference. Automated bioinformatics analyses and targeted manual gene curation suggest that the de novo assembled L. madurae transcriptome is of high quality and completeness. 
This transcriptome provides a new resource for assessing the global physiological status of a planktonic species inhabiting a coral reef ecosystem that is subjected to multiple anthropogenic stressors. The workflows provide a template for generating and assessing transcriptomes in other non-model species.
Integrative sparse principal component analysis of gene expression data.

PubMed

Liu, Mengque; Fan, Xinyan; Fang, Kuangnan; Zhang, Qingzhao; Ma, Shuangge

2017-12-01

In the analysis of gene expression data, dimension reduction techniques have been extensively adopted. The most popular one is perhaps the PCA (principal component analysis). To generate more reliable and more interpretable results, the SPCA (sparse PCA) technique has been developed. With the "small sample size, high dimensionality" characteristic of gene expression data, the analysis results generated from a single dataset are often unsatisfactory. Under contexts other than dimension reduction, integrative analysis techniques, which jointly analyze the raw data of multiple independent datasets, have been developed and shown to outperform "classic" meta-analysis and other multidatasets techniques and single-dataset analysis. In this study, we conduct integrative analysis by developing the iSPCA (integrative SPCA) method. iSPCA achieves the selection and estimation of sparse loadings using a group penalty. To take advantage of the similarity across datasets and generate more accurate results, we further impose contrasted penalties. Different penalties are proposed to accommodate different data conditions. Extensive simulations show that iSPCA outperforms the alternatives under a wide spectrum of settings. The analysis of breast cancer and pancreatic cancer data further shows iSPCA's satisfactory performance. © 2017 WILEY PERIODICALS, INC.
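As a rough illustration of how a group penalty can select sparse loadings jointly across datasets, the sketch below applies the standard group soft-thresholding operator to one gene's loadings, one value per dataset. This is not the iSPCA algorithm from the abstract, which also imposes contrasted penalties; the function name, the penalty value `lam`, and the numbers are assumptions made for illustration.

```python
from math import sqrt


def group_soft_threshold(loadings, lam):
    """Shrink one gene's loadings (one per dataset) as a group.

    If the Euclidean norm of the group is at most `lam`, every loading is
    set to zero (the gene is dropped in all datasets); otherwise all
    loadings are scaled by the same factor (1 - lam / norm).
    """
    norm = sqrt(sum(v * v for v in loadings))
    if norm <= lam:
        return [0.0] * len(loadings)
    scale = 1.0 - lam / norm
    return [scale * v for v in loadings]


# Hypothetical loadings of two genes across two datasets.
strong_gene = group_soft_threshold([3.0, 4.0], lam=1.0)   # kept, shrunk by 0.8
weak_gene = group_soft_threshold([0.3, 0.4], lam=1.0)     # removed entirely
print(strong_gene, weak_gene)
```

Because the whole group is kept or zeroed together, a gene is either selected in every dataset or in none, which is the behaviour a group penalty is meant to encourage.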
Characterization of IGH locus breakpoints in multiple myeloma indicates a subset of translocations appear to occur in pregerminal center B cells.

PubMed

Walker, Brian A; Wardell, Christopher P; Johnson, David C; Kaiser, Martin F; Begum, Dil B; Dahir, Nasrin B; Ross, Fiona M; Davies, Faith E; Gonzalez, David; Morgan, Gareth J

2013-04-25

Translocations in myeloma are thought to occur solely in mature B cells in the germinal center through class switch recombination (CSR). We used a targeted capture technique followed by massively parallel sequencing to determine the exact breakpoints in both the immunoglobulin heavy chain (IGH) locus and the partner chromosome in 61 presentation multiple myeloma samples. The majority of samples (62%) have a breakpoint within the switch regions upstream of the IGH constant genes and are generated through CSR in a mature B cell. However, the proportion of CSR translocations is not consistent between cytogenetic subgroups. We find that 100% of t(4;14) are CSR-mediated; however, 21% of t(11;14) and 25% of t(14;20) are generated through DH-JH recombination activation gene-mediated mechanisms, indicating they occur earlier in B-cell development at the pro-B-cell stage in the bone marrow. These 2 groups also generate translocations through receptor revision, as determined by the breakpoints and mutation status of the segments used in 10% and 50% of t(11;14) and t(14;20) samples, respectively. The study indicates that in a significant number of cases the translocation-based etiological events underlying myeloma may arise at the pro-B-cell hematological progenitor cell level, much earlier in B-cell development than was previously thought.
ePlant: Visualizing and Exploring Multiple Levels of Data for Hypothesis Generation in Plant Biology.

PubMed

Waese, Jamie; Fan, Jim; Pasha, Asher; Yu, Hans; Fucile, Geoffrey; Shi, Ruian; Cumming, Matthew; Kelley, Lawrence A; Sternberg, Michael J; Krishnakumar, Vivek; Ferlanti, Erik; Miller, Jason; Town, Chris; Stuerzlinger, Wolfgang; Provart, Nicholas J

2017-08-01

A big challenge in current systems biology research arises when different types of data must be accessed from separate sources and visualized using separate tools. The high cognitive load required to navigate such a workflow is detrimental to hypothesis generation. Accordingly, there is a need for a robust research platform that incorporates all data and provides integrated search, analysis, and visualization features through a single portal. Here, we present ePlant (http://bar.utoronto.ca/eplant), a visual analytic tool for exploring multiple levels of Arabidopsis thaliana data through a zoomable user interface. ePlant connects to several publicly available web services to download genome, proteome, interactome, transcriptome, and 3D molecular structure data for one or more genes or gene products of interest. Data are displayed with a set of visualization tools that are presented using a conceptual hierarchy from big to small, and many of the tools combine information from more than one data type. We describe the development of ePlant in this article and present several examples illustrating its integrative features for hypothesis generation. We also describe the process of deploying ePlant as an "app" on Araport. Building on readily available web services, the code for ePlant is freely available for any other biological species research. © 2017 American Society of Plant Biologists. All rights reserved.

GOEAST: a web-based software toolkit for Gene Ontology enrichment analysis.

PubMed

Zheng, Qi; Wang, Xiu-Jie

2008-07-01

Gene Ontology (GO) analysis has become a commonly used approach for functional studies of large-scale genomic or transcriptomic data. Although many software tools already offer GO-related analysis functions, new tools are still needed to meet the requirements of data generated by newly developed technologies or of advanced analysis purposes. Here, we present the Gene Ontology Enrichment Analysis Software Toolkit (GOEAST), an easy-to-use web-based toolkit that identifies statistically overrepresented GO terms within given gene sets. Compared with available GO analysis tools, GOEAST has the following improved features: (i) GOEAST displays enriched GO terms in graphical format according to their relationships in the hierarchical tree of each GO category (biological process, molecular function and cellular component), and therefore provides a better understanding of the correlations among enriched GO terms; (ii) GOEAST supports analysis of data from various sources (probe or probe set IDs of Affymetrix, Illumina, Agilent or customized microarrays, as well as different gene identifiers) and multiple species (about 60 prokaryote and eukaryote species); (iii) GOEAST uniquely allows cross comparison of the GO enrichment status of multiple experiments to identify functional correlations among them. GOEAST also provides rigorous statistical tests to enhance the reliability of analysis results. GOEAST is freely accessible at http://omicslab.genetics.ac.cn/GOEAST/
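The abstract does not say which statistical test is used to call a GO term overrepresented. A common choice for this kind of enrichment analysis is the one-sided hypergeometric test, sketched below under that assumption. The function name and the gene counts are hypothetical and are not taken from GOEAST.

```python
from math import comb


def hypergeom_enrichment_p(population, annotated, sample, hits):
    """Upper-tail hypergeometric p-value, P(X >= hits).

    population: number of genes in the background set
    annotated:  background genes carrying the GO term
    sample:     size of the gene list under study
    hits:       genes in the list carrying the GO term
    """
    upper = min(annotated, sample)
    total = comb(population, sample)
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(hits, upper + 1)
    )
    return tail / total


# Hypothetical example: 3 of 10 background genes carry the term,
# and both genes in a 2-gene list carry it.
print(hypergeom_enrichment_p(population=10, annotated=3, sample=2, hits=2))
```

In practice the resulting p-values would be computed for every GO term and then corrected for multiple testing, a step this sketch leaves out.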
Rare variants in axonogenesis genes connect three families with sound-color synesthesia.

PubMed

Tilot, Amanda K; Kucera, Katerina S; Vino, Arianna; Asher, Julian E; Baron-Cohen, Simon; Fisher, Simon E

2018-03-20

Synesthesia is a rare nonpathological phenomenon where stimulation of one sense automatically provokes a secondary perception in another. Hypothesized to result from differences in cortical wiring during development, synesthetes show atypical structural and functional neural connectivity, but the underlying molecular mechanisms are unknown. The trait also appears to be more common among people with autism spectrum disorder and savant abilities. Previous linkage studies searching for shared loci of large effect size across multiple families have had limited success. To address the critical lack of candidate genes, we applied whole-exome sequencing to three families with sound-color (auditory-visual) synesthesia affecting multiple relatives across three or more generations. We identified rare genetic variants that fully cosegregate with synesthesia in each family, uncovering 37 genes of interest. Consistent with reports indicating genetic heterogeneity, no variants were shared across families. Gene ontology analyses highlighted six genes (COL4A1, ITGA2, MYO10, ROBO3, SLC9A6, and SLIT2) associated with axonogenesis and expressed during early childhood when synesthetic associations are formed. These results are consistent with neuroimaging-based hypotheses about the role of hyperconnectivity in the etiology of synesthesia and offer a potential entry point into the neurobiology that organizes our sensory experiences. Copyright © 2018 the Author(s). Published by PNAS.
Development of unidentified DNA-specific HIF 1α gene of lizard (Hemidactylus platyurus) which plays a role in tissue regeneration process

NASA Astrophysics Data System (ADS)

Novianti, T.; Sadikin, M.; Widia, S.; Juniantito, V.; Arida, E. A.

2018-03-01

Characterizing a previously unidentified gene is essential for analyzing its role in a biological process. Identifying the as-yet-unidentified DNA sequence of the HIF 1α gene is important for analyzing its contribution to tissue regeneration in the tail of the lizard Hemidactylus platyurus. Bioinformatics and PCR techniques offer a relatively easy way to identify an unidentified gene. The most widely used method is BLAST (Basic Local Alignment Search Tool), which aligns a query with sequences from other organisms. BLAST is an online tool (https://blast.ncbi.nlm.nih.gov/Blast.cgi) that returns similar sequences ranging from the most closely related to the most distantly related species. Gecko japonicus is the species most closely related to H. platyurus. The HIF 1α gene sequence of G. japonicus was compared with those of other species by multiple alignment in Mega7 software. Conserved regions were identified using the Clustal IX method. Primers for the HIF 1α gene were designed with Primer3 software. The HIF 1α gene of the lizard (H. platyurus) was successfully amplified on a real-time PCR machine using the primers designed from Gecko japonicus. The previously unidentified HIF 1α gene of the lizard was thus successfully identified using the multiple alignment method. Expression was analyzed during tail growth on days 1, 3, 5, 7, 10, 13 and 17 after autotomy. Amplification of the HIF 1α gene was tracked by the CT value from the real-time PCR machine, and HIF 1α gene expression was quantified with the Livak formula. The chi-square test gave a significance value of 0.000, meaning that HIF 1α expression differed between growth days.
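The Livak formula named in the abstract is usually understood as the 2^-ΔΔCt method for relative quantification from real-time PCR CT values. A minimal Python sketch follows, assuming a single reference (housekeeping) gene, a calibrator sample, and roughly 100% amplification efficiency. The function name and CT values are illustrative and do not come from the study.

```python
def livak_fold_change(ct_target_sample, ct_ref_sample,
                      ct_target_calibrator, ct_ref_calibrator):
    """Relative expression of a target gene by the 2^-ddCt method.

    Each dCt normalizes the target gene's CT to the reference gene's CT;
    ddCt then compares the sample of interest with the calibrator sample.
    """
    delta_ct_sample = ct_target_sample - ct_ref_sample
    delta_ct_calibrator = ct_target_calibrator - ct_ref_calibrator
    delta_delta_ct = delta_ct_sample - delta_ct_calibrator
    return 2.0 ** (-delta_delta_ct)


# Hypothetical CT values: the target crosses threshold two cycles earlier
# (relative to the reference gene) in the sample than in the calibrator.
fold = livak_fold_change(24.0, 18.0, 26.0, 18.0)
print(fold)  # 4.0
```

A value above 1 indicates higher expression in the sample than in the calibrator, and a value below 1 indicates lower expression.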
ptxD gene in combination with phosphite serves as a highly effective selection system to generate transgenic cotton (Gossypium hirsutum L.).

PubMed

Pandeya, Devendra; Campbell, LeAnne M; Nunes, Eugenia; Lopez-Arredondo, Damar L; Janga, Madhusudhana R; Herrera-Estrella, Luis; Rathore, Keerti S

2017-12-01

This report demonstrates the usefulness of ptxD/phosphite as a selection system that not only provides a highly efficient and simple means to generate transgenic cotton plants, but also helps address many of the concerns related to the use of antibiotic and herbicide resistance genes in the production of transgenic crops. Two of the most popular dominant selectable marker systems for plant transformation are based on either antibiotic or herbicide resistance genes. Due to concerns regarding their safety and in order to stack multiple traits in a single plant, there is a need for alternative selectable marker genes. The ptxD gene, derived from Pseudomonas stutzeri WM88, that confers to cells the ability to convert phosphite (Phi) into orthophosphate (Pi) offers an alternative selectable marker gene as demonstrated for tobacco and maize. Here, we show that the ptxD gene in combination with a protocol based on selection medium containing Phi, as the sole source of phosphorus (P), can serve as an effective and efficient system to select for transformed cells and generate transgenic cotton plants. Fluorescence microscopy examination of the cultures under selection and molecular analyses on the regenerated plants demonstrate the efficacy of the system in recovering cotton transformants following Agrobacterium-mediated transformation. Under the ptxD/Phi selection, an average of 3.43 transgenic events per 100 infected explants were recovered as opposed to only 0.41% recovery when bar/phosphinothricin (PPT) selection was used. The event recovery rates for nptII/kanamycin and hpt/hygromycin systems were 2.88 and 2.47%, respectively. Molecular analysis on regenerated events showed a selection efficiency of ~ 97% under the ptxD/Phi system. Thus, ptxD/Phi has proven to be a very efficient, positive selection system for the generation of transgenic cotton plants with equal or higher transformation efficiencies compared to the commonly used, negative selection systems.
Gene alterations at Drosophila inversion breakpoints provide prima facie evidence for natural selection as an explanation for rapid chromosomal evolution.

PubMed

Guillén, Yolanda; Ruiz, Alfredo

2012-02-01

Chromosomal inversions have been pervasive during the evolution of the genus Drosophila, but there is significant variation between lineages in the rate of rearrangement fixation. D. mojavensis, an ecological specialist adapted to a cactophilic niche under extreme desert conditions, is a chromosomally derived species with ten fixed inversions, five of them not present in any other species. In order to explore the causes of the rapid chromosomal evolution in D. mojavensis, we identified and characterized all breakpoints of seven inversions fixed in chromosome 2, the most dynamic one. One of the inversions presents unequivocal evidence for its generation by ectopic recombination between transposon copies and another two harbor inverted duplications of non-repetitive DNA at the two breakpoints and were likely generated by staggered single-strand breaks and repair by non-homologous end joining. Four out of 14 breakpoints lay in the intergenic region between preexisting duplicated genes, suggesting an adaptive advantage of separating previously tightly linked duplicates. Four out of 14 breakpoints are associated with transposed genes, suggesting these breakpoints are fragile regions. Finally two inversions contain novel genes at their breakpoints and another three show alterations of genes at breakpoints with potential adaptive significance. D. mojavensis chromosomal inversions were generated by multiple mechanisms, an observation that does not provide support for increased mutation rate as explanation for rapid chromosomal evolution. On the other hand, we have found a number of gene alterations at the breakpoints with putative adaptive consequences that directly point to natural selection as the cause of D. mojavensis rapid chromosomal evolution.
Inhibition of adenovirus multiplication by short interfering RNAs directly or indirectly targeting the viral DNA replication machinery.

PubMed

Kneidinger, Doris; Ibrišimović, Mirza; Lion, Thomas; Klein, Reinhard

2012-06-01

Human adenoviruses are a common threat to immunocompromised patients, e.g., HIV-positive individuals or solid-organ and, in particular, allogeneic stem cell transplant recipients. Antiviral drugs have a limited effect on adenoviruses, and existing treatment modalities often fail to prevent fatal outcome. Silencing of viral genes by short interfering RNAs (siRNAs) holds great promise in the treatment of viral infections. The aim of the present study was to identify adenoviral candidate targets for RNA interference-mediated inhibition of adenoviral replication. We investigated the impact of silencing of a set of early, middle, and late viral genes on the replication of adenovirus 5 in vitro. Adenovirus replication was inhibited by siRNAs directed against the adenoviral E1A, DNA polymerase, preterminal protein (pTP), IVa2, hexon, and protease genes. Silencing of early and middle genes was more effective in inhibiting adenovirus multiplication than was silencing of late genes. An siRNA directed against the viral DNA polymerase mRNA decreased viral genome copy numbers and infectious virus progeny by several orders of magnitude. Since silencing of any of the early genes directly or indirectly affected viral DNA synthesis, our data suggest that reducing viral genome copy numbers is a more promising strategy for the treatment of adenoviral infections than is reducing the numbers of proteins necessary for capsid generation. Thus, adenoviral DNA replication was identified as a key target for RNAi-mediated inhibition of adenovirus multiplication. In addition, the E1A transcripts emerged as a second important target, because their knockdown markedly improved the viability of cells at late stages of infection. Copyright © 2012 Elsevier B.V. All rights reserved.
Complete genomic screen in Parkinson disease: evidence for multiple genes.

PubMed

Scott, W K; Nance, M A; Watts, R L; Hubble, J P; Koller, W C; Lyons, K; Pahwa, R; Stern, M B; Colcher, A; Hiner, B C; Jankovic, J; Ondo, W G; Allen, F H; Goetz, C G; Small, G W; Masterman, D; Mastaglia, F; Laing, N G; Stajich, J M; Slotterbeck, B; Booze, M W; Ribble, R C; Rampersaud, E; West, S G; Gibson, R A; Middleton, L T; Roses, A D; Haines, J L; Scott, B L; Vance, J M; Pericak-Vance, M A

2001-11-14

Context: The relative contribution of genes vs environment in idiopathic Parkinson disease (PD) is controversial. Although genetic studies have identified 2 genes in which mutations cause rare single-gene variants of PD and observational studies have suggested a genetic component, twin studies have suggested that little genetic contribution exists in the common forms of PD. Objective: To identify genetic risk factors for idiopathic PD. Design: Genetic linkage study conducted 1995-2000 in which a complete genomic screen (n = 344 markers) was performed in 174 families with multiple individuals diagnosed as having idiopathic PD, identified through probands in 13 clinic populations in the continental United States and Australia. A total of 870 family members were studied: 378 diagnosed as having PD, 379 unaffected by PD, and 113 with unclear status. Main outcome measures: Logarithm of odds (lod) scores generated from parametric and nonparametric genetic linkage analysis. Results: Two-point parametric maximum lod score (MLOD) and multipoint nonparametric lod score (LOD) linkage analysis detected significant evidence for linkage to 5 distinct chromosomal regions: chromosome 6 in the parkin gene (MLOD = 5.07; LOD = 5.47) in families with at least 1 individual with PD onset at younger than 40 years, chromosomes 17q (MLOD = 2.28; LOD = 2.62), 8p (MLOD = 2.01; LOD = 2.22), and 5q (MLOD = 2.39; LOD = 1.50) overall and in families with late-onset PD, and chromosome 9q (MLOD = 1.52; LOD = 2.59) in families with both levodopa-responsive and levodopa-nonresponsive patients. Conclusions: Our data suggest that the parkin gene is important in early-onset PD and that multiple genetic factors may be important in the development of idiopathic late-onset PD.
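To make the lod scores above concrete, the sketch below computes a textbook two-point lod score for the simplest case: phase-known, fully informative meioses, where the likelihood at recombination fraction theta is compared with the likelihood at theta = 0.5 (no linkage). This is a simplification made for illustration. The study's parametric and nonparametric analyses of multiplex families are considerably more involved, and the function names and counts here are hypothetical.

```python
from math import log10


def two_point_lod(recombinants, nonrecombinants, theta):
    """log10 of L(theta) / L(0.5) for phase-known meioses."""
    if not 0.0 < theta <= 0.5:
        raise ValueError("theta must be in (0, 0.5]")
    n = recombinants + nonrecombinants
    log_l_theta = (recombinants * log10(theta)
                   + nonrecombinants * log10(1.0 - theta))
    log_l_null = n * log10(0.5)
    return log_l_theta - log_l_null


def max_lod(recombinants, nonrecombinants, steps=50):
    """Maximum lod over a grid theta = 0.01, 0.02, ..., 0.50."""
    grid = [i / 100 for i in range(1, steps + 1)]
    return max(
        ((theta, two_point_lod(recombinants, nonrecombinants, theta))
         for theta in grid),
        key=lambda pair: pair[1],
    )


# Hypothetical data: 1 recombinant among 10 informative meioses.
best_theta, best_lod = max_lod(1, 9)
print(best_theta, round(best_lod, 3))
```

With these counts the grid maximum falls at theta = 0.1, the observed recombinant fraction, which is where the likelihood for this simple model peaks.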
Transcriptional changes in canine distemper virus-induced demyelinating leukoencephalitis favor a biphasic mode of demyelination.

PubMed

Ulrich, Reiner; Puff, Christina; Wewetzer, Konstantin; Kalkuhl, Arno; Deschl, Ulrich; Baumgärtner, Wolfgang

2014-01-01

Canine distemper virus (CDV)-induced demyelinating leukoencephalitis in dogs (Canis familiaris) is suggested to represent a naturally occurring translational model for subacute sclerosing panencephalitis and multiple sclerosis in humans. The aim of this study was a hypothesis-free microarray analysis of the transcriptional changes within cerebellar specimens of five cases of acute, six cases of subacute demyelinating, and three cases of chronic demyelinating and inflammatory CDV leukoencephalitis as compared to twelve non-infected control dogs. Frozen cerebellar specimens were used for analysis of histopathological changes including demyelination, transcriptional changes employing microarrays, and presence of CDV nucleoprotein RNA and protein using microarrays, RT-qPCR and immunohistochemistry. Microarray analysis revealed 780 differentially expressed probe sets. The dominating change was an up-regulation of genes related to the innate and the humoral immune response, and less distinct the cytotoxic T-cell-mediated immune response in all subtypes of CDV leukoencephalitis as compared to controls. Multiple myelin genes including myelin basic protein and proteolipid protein displayed a selective down-regulation in subacute CDV leukoencephalitis, suggestive of an oligodendrocyte dystrophy. In contrast, a marked up-regulation of multiple immunoglobulin-like expressed sequence tags and the delta polypeptide of the CD3 antigen was observed in chronic CDV leukoencephalitis, in agreement with the hypothesis of an immune-mediated demyelination in the late inflammatory phase of the disease. Analysis of pathways intimately linked to demyelination as determined by morphometry employing correlation-based Gene Set Enrichment Analysis highlighted the pathomechanistic importance of up-regulated genes comprised by the gene ontology terms "viral replication" and "humoral immune response" as well as down-regulated genes functionally related to "metabolite and energy generation".
Animal Models of Lymphangioleiomyomatosis (LAM) and Tuberous Sclerosis Complex (TSC)

PubMed Central

2010-01-01

Animal models of lymphangioleiomyomatosis (LAM) and tuberous sclerosis complex (TSC) are highly desired to enable detailed investigation of the pathogenesis of these diseases. Multiple rats and mice have been generated in which a mutation similar to that occurring in TSC patients is present in an allele of Tsc1 or Tsc2. Unfortunately, these mice do not develop pathologic lesions that match those seen in LAM or TSC. However, these Tsc rodent models have been useful in confirming the two-hit model of tumor development in TSC, and in providing systems in which therapeutic trials (e.g., rapamycin) can be performed. In addition, conditional alleles of both Tsc1 and Tsc2 have provided the opportunity to target loss of these genes to specific tissues and organs, to probe the in vivo function of these genes, and attempt to generate better models. Efforts to generate an authentic LAM model are impeded by a lack of understanding of the cell of origin of this process. However, ongoing studies provide hope that such a model will be generated in the coming years. PMID:20235887
Kcne2 Deletion Creates a Multisystem Syndrome Predisposing to Sudden Cardiac Death

PubMed Central

Hu, Zhaoyang; Kant, Ritu; Anand, Marie; King, Elizabeth C.; Krogh-Madsen, Trine; Christini, David J.; Abbott, Geoffrey W.

2014-01-01

Background Sudden cardiac death (SCD) is the leading global cause of mortality, exhibiting increased incidence in diabetics. Ion channel gene perturbations provide a well-established ventricular arrhythmogenic substrate for SCD. However, most arrhythmia susceptibility genes - including the KCNE2 K+ channel β subunit - are expressed in multiple tissues, suggesting potential multiplex SCD substrates. Methods and Results Using “whole transcript” transcriptomics, we uncovered cardiac angiotensinogen upregulation and remodeling of cardiac angiotensinogen interaction networks in P21 Kcne2−/− mouse pups, and adrenal remodeling consistent with metabolic syndrome in adult Kcne2−/− mice. This led to the discovery that Kcne2 disruption causes multiple acknowledged SCD substrates of extracardiac origin: diabetes, hypercholesterolemia, hyperkalemia, anemia and elevated angiotensin II. Kcne2 deletion was also prerequisite for aging-dependent QT prolongation, ventricular fibrillation and SCD immediately following transient ischemia, and fasting-dependent hypoglycemia, myocardial ischemia and atrioventricular block. Conclusions Disruption of a single, widely expressed arrhythmia susceptibility gene can generate a multisystem syndrome comprising manifold electrical and systemic substrates and triggers of SCD. This paradigm is expected to apply to other arrhythmia susceptibility genes, the majority of which encode ubiquitously expressed ion channel subunits or regulatory proteins. PMID:24403551
Genetic association of impulsivity in young adults: a multivariate study

PubMed Central

Khadka, S; Narayanan, B; Meda, S A; Gelernter, J; Han, S; Sawyer, B; Aslanzadeh, F; Stevens, M C; Hawkins, K A; Anticevic, A; Potenza, M N; Pearlson, G D

2014-01-01

Impulsivity is a heritable, multifaceted construct with clinically relevant links to multiple psychopathologies. We assessed impulsivity in young adult (N~2100) participants in a longitudinal study, using self-report questionnaires and computer-based behavioral tasks. Analysis was restricted to the subset (N=426) who underwent genotyping. Multivariate association between impulsivity measures and single-nucleotide polymorphism data was implemented using parallel independent component analysis (Para-ICA). Pathways associated with multiple genes in components that correlated significantly with impulsivity phenotypes were then identified using a pathway enrichment analysis. Para-ICA revealed two significantly correlated genotype–phenotype component pairs. One impulsivity component included the reward responsiveness subscale and behavioral inhibition scale of the Behavioral-Inhibition System/Behavioral-Activation System scale, and the second impulsivity component included the non-planning subscale of the Barratt Impulsiveness Scale and the Experiential Discounting Task. Pathway analysis identified processes related to neurogenesis, nervous system signal generation/amplification, neurotransmission and immune response. We identified various genes and gene regulatory pathways associated with empirically derived impulsivity components. Our study suggests that gene networks implicated previously in brain development, neurotransmission and immune response are related to impulsive tendencies and behaviors. PMID:25268255
Expression of short hairpin RNAs using the compact architecture of retroviral microRNA genes.

PubMed

Burke, James M; Kincaid, Rodney P; Aloisio, Francesca; Welch, Nicole; Sullivan, Christopher S

2017-09-29

Short hairpin RNAs (shRNAs) are effective in generating stable repression of gene expression. RNA polymerase III (RNAP III) type III promoters (U6 or H1) are typically used to drive shRNA expression. While useful for some knockdown applications, the robust expression of U6/H1-driven shRNAs can induce toxicity and generate heterogeneous small RNAs with undesirable off-target effects. Additionally, typical U6/H1 promoters encompass the majority of the ∼270 base pairs (bp) of vector space required for shRNA expression. This can limit the efficacy and/or number of delivery vector options, particularly when delivery of multiple gene/shRNA combinations is required. Here, we develop a compact shRNA (cshRNA) expression system based on retroviral microRNA (miRNA) gene architecture that uses RNAP III type II promoters. We demonstrate that cshRNAs coded from as little as 100 bps of total coding space can precisely generate small interfering RNAs (siRNAs) that are active in the RNA-induced silencing complex (RISC). We provide an algorithm with a user-friendly interface to design cshRNAs for desired target genes. This cshRNA expression system reduces the coding space required for shRNA expression by >2-fold as compared to the typical U6/H1 promoters, which may facilitate therapeutic RNAi applications where delivery vector space is limiting. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
A novel pathogenic splice acceptor site germline mutation in intron 14 of the APC gene in a Chinese family with familial adenomatous polyposis.

PubMed

Wang, Dan; Liang, Shengyun; Zhang, Zhao; Zhao, Guoru; Hu, Yuan; Liang, Shengran; Zhang, Xipeng; Banerjee, Santasree

2017-03-28

Familial adenomatous polyposis (FAP) is an autosomal dominant precancerous condition, clinically characterized by the presence of multiple colorectal adenomas or polyps. Patients with FAP have a high risk of developing colorectal cancer (CRC) from these colorectal adenomatous polyps, with a mean age at diagnosis of 40 years. Germline mutations of the APC gene cause FAP. Colectomy is recommended for FAP patients with significant polyposis. Here, we present a clinical molecular study of a four-generation Chinese family with FAP. Clinical diagnosis of FAP was made according to the phenotype, family history and medical records. Patients' blood samples were collected and genomic DNA was extracted. To identify the pathogenic mutation underlying the disease phenotype, targeted next-generation sequencing and confirmatory Sanger sequencing were undertaken. Targeted next-generation sequencing identified a novel heterozygous splice-acceptor site mutation [c.1744-1G>A] in intron 14 of the APC gene, which co-segregated with the FAP phenotype in the proband and all affected family members. This mutation is not present in unaffected family members or in normal healthy controls of the same ethnic origin. According to the LOVD database for Chinese colorectal cancer patients, 60% of the previously reported FAP-causing APC gene mutations in the Chinese population are missense mutations. This novel splice-acceptor site mutation causing FAP in this Chinese family expands the germline mutation spectrum of the APC gene in the Chinese population.
Regularized rare variant enrichment analysis for case-control exome sequencing data.

PubMed

Larson, Nicholas B; Schaid, Daniel J

2014-02-01

Rare variants have recently garnered an immense amount of attention in genetic association analysis. However, unlike methods traditionally used for single marker analysis in GWAS, rare variant analysis often requires some method of aggregation, since single marker approaches are poorly powered for typical sequencing study sample sizes. Advancements in sequencing technologies have rendered next-generation sequencing platforms a realistic alternative to traditional genotyping arrays. Exome sequencing in particular not only provides base-level resolution of genetic coding regions, but also a natural paradigm for aggregation via genes and exons. Here, we propose the use of penalized regression in combination with variant aggregation measures to identify rare variant enrichment in exome sequencing data. In contrast to marginal gene-level testing, we simultaneously evaluate the effects of rare variants in multiple genes, focusing on gene-based least absolute shrinkage and selection operator (LASSO) and exon-based sparse group LASSO models. By using gene membership as a grouping variable, the sparse group LASSO can be used as a gene-centric analysis of rare variants while also providing a penalized approach toward identifying specific regions of interest. We apply extensive simulations to evaluate the performance of these approaches with respect to specificity and sensitivity, comparing these results to multiple competing marginal testing methods. Finally, we discuss our findings and outline future research. © 2013 WILEY PERIODICALS, INC.
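The abstract above does not give the authors' implementation, so the following is only a minimal sketch of the general idea: collapse rare variants into one score per gene, then fit all genes jointly under an L1 (LASSO) penalty. It assumes a simple burden score (sum of minor-allele counts per gene) as the aggregation measure and plain proximal gradient descent as the solver. The function names, the toy genotypes and the penalty value are hypothetical, and the sparse group LASSO variant is not shown.

```python
import math

def burden_scores(genotypes, variant_gene, genes):
    """Collapse per-variant minor-allele counts into one burden score per gene."""
    index = {gene: j for j, gene in enumerate(genes)}
    scores = [[0.0] * len(genes) for _ in genotypes]
    for i, row in enumerate(genotypes):
        for count, gene in zip(row, variant_gene):
            scores[i][index[gene]] += count
    return scores

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def soft_threshold(value, threshold):
    """Proximal operator of the L1 penalty: shrink toward zero."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0

def lasso_logistic(X, y, lam, step=0.1, iters=2000):
    """L1-penalized logistic regression by proximal gradient descent.

    The intercept is left unpenalized; every gene coefficient is shrunk.
    """
    n, p = len(X), len(X[0])
    intercept, beta = 0.0, [0.0] * p
    for _ in range(iters):
        grad0, grad = 0.0, [0.0] * p
        for xi, yi in zip(X, y):
            resid = sigmoid(intercept + sum(b * x for b, x in zip(beta, xi))) - yi
            grad0 += resid / n
            for j in range(p):
                grad[j] += resid * xi[j] / n
        intercept -= step * grad0
        beta = [soft_threshold(b - step * g, step * lam) for b, g in zip(beta, grad)]
    return intercept, beta

# Hypothetical toy data: 8 samples, 3 rare variants in 2 genes.
genes = ["GENE_A", "GENE_B"]
variant_gene = ["GENE_A", "GENE_A", "GENE_B"]
genotypes = [
    [1, 0, 1], [0, 1, 0], [1, 0, 1], [0, 0, 0],   # cases
    [0, 0, 1], [0, 0, 0], [0, 0, 1], [0, 0, 0],   # controls
]
status = [1, 1, 1, 1, 0, 0, 0, 0]

X = burden_scores(genotypes, variant_gene, genes)
intercept, beta = lasso_logistic(X, status, lam=0.05)
for gene, coef in zip(genes, beta):
    print(gene, coef)
```

In this toy setting GENE_A carries rare alleles only in cases, so its coefficient is pushed away from zero, while a sufficiently large penalty shrinks every gene coefficient to exactly zero. That exact-zero behaviour is what makes the penalized fit usable as a joint, multi-gene selection step rather than a series of marginal tests.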
PathMAPA: a tool for displaying gene expression and performing statistical tests on metabolic pathways at multiple levels for Arabidopsis.

PubMed

Pan, Deyun; Sun, Ning; Cheung, Kei-Hoi; Guan, Zhong; Ma, Ligeng; Holford, Matthew; Deng, Xingwang; Zhao, Hongyu

2003-11-07

To date, many genomic and pathway-related tools and databases have been developed to analyze microarray data. In published web-based applications to date, however, complex pathways have been displayed with static image files that may not be up-to-date or are time-consuming to rebuild. In addition, gene expression analyses focus on individual probes and genes with little or no consideration of pathways. These approaches reveal little information about pathways that are key to a full understanding of the building blocks of biological systems. Therefore, there is a need to provide useful tools that can generate pathways without manually building images and allow gene expression data to be integrated and analyzed at pathway levels for such experimental organisms as Arabidopsis. We have developed PathMAPA, a web-based application written in Java that can be easily accessed over the Internet. An Oracle database is used to store, query, and manipulate the large amounts of data that are involved. PathMAPA allows its users to (i) upload and populate microarray data into a database; (ii) integrate gene expression with enzymes of the pathways; (iii) generate pathway diagrams without building image files manually; (iv) visualize gene expressions for each pathway at enzyme, locus, and probe levels; and (v) perform statistical tests at pathway, enzyme and gene levels. PathMAPA can be used to examine Arabidopsis thaliana gene expression patterns associated with metabolic pathways. PathMAPA provides two unique features for the gene expression analysis of Arabidopsis thaliana: (i) automatic generation of pathways associated with gene expression and (ii) statistical tests at pathway level. The first feature allows for the periodical updating of genomic data for pathways, while the second feature can provide insight into how treatments affect relevant pathways for the selected experiment(s).
PathMAPA: a tool for displaying gene expression and performing statistical tests on metabolic pathways at multiple levels for Arabidopsis

PubMed Central

Pan, Deyun; Sun, Ning; Cheung, Kei-Hoi; Guan, Zhong; Ma, Ligeng; Holford, Matthew; Deng, Xingwang; Zhao, Hongyu

2003-01-01

Background To date, many genomic and pathway-related tools and databases have been developed to analyze microarray data. In published web-based applications to date, however, complex pathways have been displayed with static image files that may not be up-to-date or are time-consuming to rebuild. In addition, gene expression analyses focus on individual probes and genes with little or no consideration of pathways. These approaches reveal little information about pathways that are key to a full understanding of the building blocks of biological systems. Therefore, there is a need to provide useful tools that can generate pathways without manually building images and allow gene expression data to be integrated and analyzed at pathway levels for such experimental organisms as Arabidopsis. Results We have developed PathMAPA, a web-based application written in Java that can be easily accessed over the Internet. An Oracle database is used to store, query, and manipulate the large amounts of data that are involved. PathMAPA allows its users to (i) upload and populate microarray data into a database; (ii) integrate gene expression with enzymes of the pathways; (iii) generate pathway diagrams without building image files manually; (iv) visualize gene expressions for each pathway at enzyme, locus, and probe levels; and (v) perform statistical tests at pathway, enzyme and gene levels. PathMAPA can be used to examine Arabidopsis thaliana gene expression patterns associated with metabolic pathways. Conclusion PathMAPA provides two unique features for the gene expression analysis of Arabidopsis thaliana: (i) automatic generation of pathways associated with gene expression and (ii) statistical tests at pathway level. The first feature allows for the periodical updating of genomic data for pathways, while the second feature can provide insight into how treatments affect relevant pathways for the selected experiment(s). PMID:14604444
Transgenic elite indica rice plants expressing CryIAc ∂-endotoxin of Bacillus thuringiensis are resistant against yellow stem borer (Scirpophaga incertulas)

PubMed Central

Nayak, Pritilata; Basu, Debabrata; Das, Sampa; Basu, Asitava; Ghosh, Dipankar; Ramakrishnan, Neeliyath A.; Ghosh, Maloy; Sen, Soumitra K.

1997-01-01

Generation of insect-resistant, transgenic crop plants by expression of the insecticidal crystal protein (ICP) gene of Bacillus thuringiensis (Bt) is a standard crop improvement approach. In such cases, adequate expression of the most appropriate ICP against the target insect pest of the crop species is desirable. It is also considered advantageous to generate Bt-transgenics with multiple toxin systems to control rapid development of pest resistance to the ICP. Larvae of yellow stem borer (YSB), Scirpophaga incertulas, a major lepidopteran insect pest of rice, cause massive losses of rice yield. Studies on insect feeding and on the binding properties of ICP to brush border membrane receptors in the midgut of YSB larvae revealed that cryIAb and cryIAc are two individually suitable candidate genes for developing YSB-resistant rice. Programs were undertaken to develop Bt-transgenic rice with these ICP genes independently in a single cultivar. A cryIAc gene was reconstructed and placed under control of the maize ubiquitin 1 promoter, along with the first intron of the maize ubiquitin 1 gene, and the nos terminator. The gene construct was delivered to embryogenic calli of IR64, an elite indica rice cultivar, using the particle bombardment method. Six highly expressive independent transgenic ICP lines were identified. Molecular analyses and insect-feeding assays of two such lines revealed that the transferred synthetic cryIAc gene was expressed stably in the T2 generation of these lines and that the transgenic rice plants were highly toxic to YSB larvae and lessened the damage caused by their feeding. PMID:9122157
How to make stripes: deciphering the transition from non-periodic to periodic patterns in Drosophila segmentation

PubMed Central

Schroeder, Mark D.; Greer, Christina; Gaul, Ulrike

2011-01-01

The generation of metameric body plans is a key process in development. In Drosophila segmentation, periodicity is established rapidly through the complex transcriptional regulation of the pair-rule genes. The ‘primary’ pair-rule genes generate their 7-stripe expression through stripe-specific cis-regulatory elements controlled by the preceding non-periodic maternal and gap gene patterns, whereas ‘secondary’ pair-rule genes are thought to rely on 7-stripe elements that read off the already periodic primary pair-rule patterns. Using a combination of computational and experimental approaches, we have conducted a comprehensive systems-level examination of the regulatory architecture underlying pair-rule stripe formation. We find that runt (run), fushi tarazu (ftz) and odd skipped (odd) establish most of their pattern through stripe-specific elements, arguing for a reclassification of ftz and odd as primary pair-rule genes. In the case of run, we observe long-range cis-regulation across multiple intervening genes. The 7-stripe elements of run, ftz and odd are active concurrently with the stripe-specific elements, indicating that maternal/gap-mediated control and pair-rule gene cross-regulation are closely integrated. Stripe-specific elements fall into three distinct classes based on their principal repressive gap factor input; stripe positions along the gap gradients correlate with the strength of predicted input. The prevalence of cis-elements that generate two stripes and their genomic organization suggest that single-stripe elements arose by splitting and subfunctionalization of ancestral dual-stripe elements. Overall, our study provides a greatly improved understanding of how periodic patterns are established in the Drosophila embryo. PMID:21693522
A framework for list representation, enabling list stabilization through incorporation of gene exchangeabilities.

PubMed

Soneson, Charlotte; Fontes, Magnus

2012-01-01

Analysis of multivariate data sets from, for example, microarray studies frequently results in lists of genes which are associated with some response of interest. The biological interpretation is often complicated by the statistical instability of the obtained gene lists, which may partly be due to the functional redundancy among genes, implying that multiple genes can play exchangeable roles in the cell. In this paper, we use the concept of exchangeability of random variables to model this functional redundancy and thereby account for the instability. We present a flexible framework to incorporate the exchangeability into the representation of lists. The proposed framework supports straightforward comparison between any 2 lists. It can also be used to generate new more stable gene rankings incorporating more information from the experimental data. Using 2 microarray data sets, we show that the proposed method provides more robust gene rankings than existing methods with respect to sampling variations, without compromising the biological significance of the rankings.
High-density array-CGH with targeted NGS unmask multiple noncontiguous minute deletions on chromosome 3p21 in mesothelioma.

PubMed

Yoshikawa, Yoshie; Emi, Mitsuru; Hashimoto-Tamaoki, Tomoko; Ohmuraya, Masaki; Sato, Ayuko; Tsujimura, Tohru; Hasegawa, Seiki; Nakano, Takashi; Nasu, Masaki; Pastorino, Sandra; Szymiczek, Agata; Bononi, Angela; Tanji, Mika; Pagano, Ian; Gaudino, Giovanni; Napolitano, Andrea; Goparaju, Chandra; Pass, Harvey I; Yang, Haining; Carbone, Michele

2016-11-22

We used a custom-made comparative genomic hybridization array (aCGH; average probe interval 254 bp) to screen 33 malignant mesothelioma (MM) biopsies for somatic copy number loss throughout the 3p21 region (10.7 Mb) that harbors 251 genes, including BRCA1 (breast cancer 1)-associated protein 1 (BAP1), the most commonly mutated gene in MM. We identified frequent minute biallelic deletions (<3 kb) in 46 of 251 genes: four were cancer-associated genes: SETD2 (SET domain-containing protein 2) (7 of 33), BAP1 (8 of 33), PBRM1 (polybromo 1) (3 of 33), and SMARCC1 (switch/sucrose nonfermentable- SWI/SNF-related, matrix-associated, actin-dependent regulator of chromatin, subfamily c, member 1) (2 of 33). These four genes were further investigated by targeted next-generation sequencing (tNGS), which revealed sequence-level mutations causing biallelic inactivation. Combined high-density aCGH and tNGS revealed biallelic gene inactivation in SETD2 (9 of 33, 27%), BAP1 (16 of 33, 48%), PBRM1 (5 of 33, 15%), and SMARCC1 (2 of 33, 6%). The incidence of genetic alterations detected is much higher than reported in the literature because minute deletions are not detected by NGS or commercial aCGH. Many of these minute deletions were not contiguous, but rather alternated with segments showing oscillating copy number changes along the 3p21 region. In summary, we found that in MM: (i) multiple minute simultaneous biallelic deletions are frequent in chromosome 3p21, where they occur as distinct events involving multiple genes; (ii) in addition to BAP1, mutations of SETD2, PBRM1, and SMARCC1 are frequent in MM; and (iii) our results suggest that high-density aCGH combined with tNGS provides a more precise estimate of the frequency and types of genes inactivated in human cancer than approaches based exclusively on NGS strategy.
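As a quick arithmetic check on the frequencies quoted above, the following short sketch recomputes the percentages from the reported counts out of 33 biopsies. The counts are taken from the abstract; the variable names are illustrative only.

```python
# Biopsies with biallelic inactivation, out of 33 (counts from the abstract).
TOTAL_BIOPSIES = 33
acgh_only = {"SETD2": 7, "BAP1": 8, "PBRM1": 3, "SMARCC1": 2}
acgh_plus_tngs = {"SETD2": 9, "BAP1": 16, "PBRM1": 5, "SMARCC1": 2}

# Percentage of biopsies affected when both methods are combined.
percent = {
    gene: round(100 * n / TOTAL_BIOPSIES) for gene, n in acgh_plus_tngs.items()
}

# Additional cases found only once sequence-level mutations are included.
gained = {gene: acgh_plus_tngs[gene] - acgh_only[gene] for gene in acgh_only}

for gene in acgh_only:
    print(gene, percent[gene], gained[gene])
```

The rounded values agree with the 27%, 48%, 15% and 6% given in the abstract, and the difference between the two count sets shows how many cases per gene were added by targeted sequencing on top of the minute deletions seen by aCGH.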

Targeted 'next-generation' sequencing in anophthalmia and microphthalmia patients confirms SOX2, OTX2 and FOXE3 mutations.

PubMed

Jimenez, Nelson Lopez; Flannick, Jason; Yahyavi, Mani; Li, Jiang; Bardakjian, Tanya; Tonkin, Leath; Schneider, Adele; Sherr, Elliott H; Slavotinek, Anne M

2011-12-28

Anophthalmia/microphthalmia (A/M) is caused by mutations in several different transcription factors, but mutations in each causative gene are relatively rare, emphasizing the need for a testing approach that screens multiple genes simultaneously. We used next-generation sequencing to screen 15 A/M patients for mutations in 9 pathogenic genes to evaluate this technology for screening in A/M. We used a pooled sequencing design, together with custom single nucleotide polymorphism (SNP) calling software. We verified predicted sequence alterations using Sanger sequencing. We verified three mutations - c.542delC in SOX2, resulting in p.Pro181Argfs*22, p.Glu105X in OTX2 and p.Cys240X in FOXE3. We found several novel sequence alterations and SNPs that were likely to be non-pathogenic - p.Glu42Lys in CRYBA4, p.Val201Met in FOXE3 and p.Asp291Asn in VSX2. Our analysis methodology gave one false positive result comprising a mutation in PAX6 (c.1268A > T, predicting p.X423LeuextX*15) that was not verified by Sanger sequencing. We also failed to detect one 20 base pair (bp) deletion and one 3 bp duplication in SOX2. Our results demonstrated the power of next-generation sequencing with pooled sample groups for the rapid screening of candidate genes for A/M as we were correctly able to identify disease-causing mutations. However, next-generation sequencing was less useful for small, intragenic deletions and duplications. We did not find mutations in 10/15 patients and conclude that there is a need for further gene discovery in A/M.
Targeted 'Next-Generation' sequencing in anophthalmia and microphthalmia patients confirms SOX2, OTX2 and FOXE3 mutations

PubMed Central

2011-01-01

Background Anophthalmia/microphthalmia (A/M) is caused by mutations in several different transcription factors, but mutations in each causative gene are relatively rare, emphasizing the need for a testing approach that screens multiple genes simultaneously. We used next-generation sequencing to screen 15 A/M patients for mutations in 9 pathogenic genes to evaluate this technology for screening in A/M. Methods We used a pooled sequencing design, together with custom single nucleotide polymorphism (SNP) calling software. We verified predicted sequence alterations using Sanger sequencing. Results We verified three mutations - c.542delC in SOX2, resulting in p.Pro181Argfs*22, p.Glu105X in OTX2 and p.Cys240X in FOXE3. We found several novel sequence alterations and SNPs that were likely to be non-pathogenic - p.Glu42Lys in CRYBA4, p.Val201Met in FOXE3 and p.Asp291Asn in VSX2. Our analysis methodology gave one false positive result comprising a mutation in PAX6 (c.1268A > T, predicting p.X423LeuextX*15) that was not verified by Sanger sequencing. We also failed to detect one 20 base pair (bp) deletion and one 3 bp duplication in SOX2. Conclusions Our results demonstrated the power of next-generation sequencing with pooled sample groups for the rapid screening of candidate genes for A/M as we were correctly able to identify disease-causing mutations. However, next-generation sequencing was less useful for small, intragenic deletions and duplications. We did not find mutations in 10/15 patients and conclude that there is a need for further gene discovery in A/M. PMID:22204637
Low gene copy number shows that arbuscular mycorrhizal fungi inherit genetically different nuclei.

PubMed

Hijri, Mohamed; Sanders, Ian R

2005-01-13

Arbuscular mycorrhizal fungi (AMF) are ancient asexually reproducing organisms that form symbioses with the majority of plant species, improving plant nutrition and promoting plant diversity. Little is known about the evolution or organization of the genomes of any eukaryotic symbiont or ancient asexual organism. Direct evidence shows that one AMF species is heterokaryotic; that is, containing populations of genetically different nuclei. It has been suggested, however, that the genetic variation passed from generation to generation in AMF is simply due to multiple chromosome sets (that is, high ploidy). Here we show that previously documented genetic variation in Pol-like sequences, which are passed from generation to generation, cannot be due to either high ploidy or repeated gene duplications. Our results provide the clearest evidence so far for substantial genetic differences among nuclei in AMF. We also show that even AMF with a very large nuclear DNA content are haploid. An underlying principle of evolutionary theory is that an individual passes on one or half of its genome to each of its progeny. The coexistence of a population of many genomes in AMF and their transfer to subsequent generations, therefore, has far-reaching consequences for understanding genome evolution.
Gene expression profiling in liver and testis of rats to characterize the toxicity of triazole fungicides.

PubMed

Tully, Douglas B; Bao, Wenjun; Goetz, Amber K; Blystone, Chad R; Ren, Hongzu; Schmid, Judith E; Strader, Lillian F; Wood, Carmen R; Best, Deborah S; Narotsky, Michael G; Wolf, Douglas C; Rockett, John C; Dix, David J

2006-09-15

Four triazole fungicides were studied using toxicogenomic techniques to identify potential mechanisms of action. Adult male Sprague-Dawley rats were dosed for 14 days by gavage with fluconazole, myclobutanil, propiconazole, or triadimefon. Following exposure, serum was collected for hormone measurements, and liver and testes were collected for histology, enzyme biochemistry, or gene expression profiling. Body and testis weights were unaffected, but liver weights were significantly increased by all four triazoles, and hepatocytes exhibited centrilobular hypertrophy. Myclobutanil exposure increased serum testosterone and decreased sperm motility, but no treatment-related testis histopathology was observed. We hypothesized that gene expression profiles would identify potential mechanisms of toxicity and used DNA microarrays and quantitative real-time PCR (qPCR) to generate profiles. Triazole fungicides are designed to inhibit fungal cytochrome P450 (CYP) 51 enzyme but can also modulate the expression and function of mammalian CYP genes and enzymes. Triazoles affected the expression of numerous CYP genes in rat liver and testis, including multiple Cyp2c and Cyp3a isoforms as well as other xenobiotic metabolizing enzyme (XME) and transporter genes. For some genes, such as Ces2 and Udpgtr2, all four triazoles had similar effects on expression, suggesting possible common mechanisms of action. Many of these CYP, XME and transporter genes are regulated by xeno-sensing nuclear receptors, and hierarchical clustering of CAR/PXR-regulated genes demonstrated the similarities of toxicogenomic responses in liver between all four triazoles and in testis between myclobutanil and triadimefon. Triazoles also affected expression of multiple genes involved in steroid hormone metabolism in the two tissues. Thus, gene expression profiles helped identify possible toxicological mechanisms of the triazole fungicides.
Genomics of Natural Populations: How Differentially Expressed Genes Shape the Evolution of Chromosomal Inversions in Drosophila pseudoobscura

PubMed Central

Fuller, Zachary L.; Haynes, Gwilym D.; Richards, Stephen; Schaeffer, Stephen W.

2016-01-01

Chromosomal rearrangements can shape the structure of genetic variation in the genome directly through alteration of genes at breakpoints or indirectly by holding combinations of genetic variants together due to reduced recombination. The third chromosome of Drosophila pseudoobscura is a model system to test hypotheses about how rearrangements are established in populations because it is polymorphic for >30 gene arrangements that were generated by a series of overlapping inversion mutations. Circumstantial evidence has suggested that these gene arrangements are selected. Despite the expected homogenizing effects of extensive gene flow, the frequencies of arrangements form gradients or clines in nature, which have been stable since the system was first described >80 years ago. Furthermore, multiple arrangements exist at appreciable frequencies across several ecological niches providing the opportunity for heterokaryotypes to form. In this study, we tested whether genes are differentially expressed among chromosome arrangements in first instar larvae, adult females and males. In addition, we asked whether transcriptional patterns in heterokaryotypes are dominant, semidominant, overdominant, or underdominant. We find evidence for a significant abundance of differentially expressed genes across the inverted regions of the third chromosome, including an enrichment of genes involved in sensory perception for males. We find the majority of loci show additivity in heterokaryotypes. Our results suggest that multiple genes have expression differences among arrangements that were either captured by the original inversion mutation or accumulated after it reached polymorphic frequencies, providing a potential source of genetic variation for selection to act upon. These data suggest that the inversions are favored because of their indirect effect of recombination suppression that has held different combinations of differentially expressed genes together in the various gene arrangement backgrounds. PMID:27401754
Cap 'n' collar C regulates genes responsible for imidacloprid resistance in the Colorado potato beetle, Leptinotarsa decemlineata.

PubMed

Gaddelapati, Sharath Chandra; Kalsi, Megha; Roy, Amit; Palli, Subba Reddy

2018-08-01

The Colorado potato beetle (CPB), Leptinotarsa decemlineata developed resistance to imidacloprid after exposure to this insecticide for multiple generations. Our previous studies showed that xenobiotic transcription factor, cap 'n' collar isoform C (CncC) regulates the expression of multiple cytochrome P450 genes, which play essential roles in resistance to plant allelochemicals and insecticides. In this study, we sought to obtain a comprehensive picture of the genes regulated by CncC in imidacloprid-resistant CPB. We performed sequencing of RNA isolated from imidacloprid-resistant CPB treated with dsRNA targeting CncC or gene coding for green fluorescent protein (control). Comparative transcriptome analysis showed that CncC regulated the expression of 1798 genes, out of which 1499 genes were downregulated in CncC knockdown beetles. Interestingly, expression of 79% of imidacloprid induced P450 genes requires CncC. We performed quantitative real-time PCR to verify the reduction in the expression of 20 genes including those coding for detoxification enzymes (P450s, glutathione S-transferases, and esterases) and ABC transporters. The genes coding for ABC transporters are induced in insecticide resistant CPB and require CncC for their expression. Knockdown of genes coding for ABC transporters simultaneously or individually caused an increase in imidacloprid-induced mortality in resistant beetles confirming their contribution to insecticide resistance. These studies identified CncC as a transcription factor involved in regulation of genes responsible for imidacloprid resistance. Small molecule inhibitors of CncC or suppression of CncC by RNAi could provide effective synergists for pest control or management of insecticide resistance. Copyright © 2018 Elsevier Ltd. All rights reserved.
Non-invasive prenatal diagnosis of achondroplasia and thanatophoric dysplasia: next-generation sequencing allows for a safer, more accurate, and comprehensive approach

PubMed Central

Chitty, Lyn S; Mason, Sarah; Barrett, Angela N; McKay, Fiona; Lench, Nicholas; Daley, Rebecca; Jenkins, Lucy A

2015-01-01

Objective: Accurate prenatal diagnosis of genetic conditions can be challenging and usually requires invasive testing. Here, we demonstrate the potential of next-generation sequencing (NGS) for the analysis of cell-free DNA in maternal blood to transform prenatal diagnosis of monogenic disorders. Methods: Analysis of cell-free DNA using a PCR and restriction enzyme digest (PCR–RED) was compared with a novel NGS assay in pregnancies at risk of achondroplasia and thanatophoric dysplasia. Results: PCR–RED was performed in 72 cases and was correct in 88.6%, inconclusive in 7%, with one false negative. NGS was performed in 47 cases and was accurate in 96.2% with no inconclusives. Both approaches were used in 27 cases, with NGS giving the correct result in the two cases inconclusive with PCR–RED. Conclusion: NGS provides an accurate, flexible approach to non-invasive prenatal diagnosis of de novo and paternally inherited mutations. It is more sensitive than PCR–RED and is ideal when screening a gene with multiple potential pathogenic mutations. These findings highlight the value of NGS in the development of non-invasive prenatal diagnosis for other monogenic disorders. What's already known about this topic? Non-invasive prenatal diagnosis (NIPD) using PCR-based methods has been reported for the detection or exclusion of individual paternally inherited or de novo alleles in maternal plasma. What does this study add? NIPD using next-generation sequencing provides an accurate, more sensitive approach which can be used to detect multiple mutations in a single assay and so is ideal when screening a gene with multiple potential pathogenic mutations. Next-generation sequencing thus provides a flexible approach to non-invasive prenatal diagnosis ideal for use in a busy service laboratory. PMID:25728633
Novel genomic findings in multiple myeloma identified through routine diagnostic sequencing.

PubMed

Ryland, Georgina L; Jones, Kate; Chin, Melody; Markham, John; Aydogan, Elle; Kankanige, Yamuna; Caruso, Marisa; Guinto, Jerick; Dickinson, Michael; Prince, H Miles; Yong, Kwee; Blombery, Piers

2018-05-14

Multiple myeloma is a genomically complex haematological malignancy with many genomic alterations recognised as important in diagnosis, prognosis and therapeutic decision making. Here, we provide a summary of genomic findings identified through routine diagnostic next-generation sequencing at our centre. A cohort of 86 patients with multiple myeloma underwent diagnostic sequencing using a custom hybridisation-based panel targeting 104 genes. Sequence variants, genome-wide copy number changes and structural rearrangements were detected using a bioinformatics pipeline developed in-house. At least one mutation was found in 69 (80%) patients. Frequently mutated genes included TP53 (36%), KRAS (22.1%), NRAS (15.1%), FAM46C/DIS3 (8.1%) and TET2/FGFR3 (5.8%), including multiple mutations not previously described in myeloma. Importantly, we observed TP53 mutations in the absence of a 17p deletion in 8% of the cohort, highlighting the need for sequencing-based assessment in addition to cytogenetics to identify these high-risk patients. Multiple novel copy number changes and immunoglobulin heavy chain translocations are also discussed. Our results demonstrate that many clinically relevant genomic findings remain in multiple myeloma which have not yet been identified through large-scale sequencing efforts, and provide important mechanistic insights into plasma cell pathobiology.
Statistical inference of the generation probability of T-cell receptors from sequence repertoires.

PubMed

Murugan, Anand; Mora, Thierry; Walczak, Aleksandra M; Callan, Curtis G

2012-10-02

Stochastic rearrangement of germline V-, D-, and J-genes to create variable coding sequence for certain cell surface receptors is at the origin of immune system diversity. This process, known as "VDJ recombination", is implemented via a series of stochastic molecular events involving gene choices and random nucleotide insertions between, and deletions from, genes. We use large sequence repertoires of the variable CDR3 region of human CD4+ T-cell receptor beta chains to infer the statistical properties of these basic biochemical events. Because any given CDR3 sequence can be produced in multiple ways, the probability distribution of hidden recombination events cannot be inferred directly from the observed sequences; we therefore develop a maximum likelihood inference method to achieve this end. To separate the properties of the molecular rearrangement mechanism from the effects of selection, we focus on nonproductive CDR3 sequences in T-cell DNA. We infer the joint distribution of the various generative events that occur when a new T-cell receptor gene is created. We find a rich picture of correlation (and absence thereof), providing insight into the molecular mechanisms involved. The generative event statistics are consistent between individuals, suggesting a universal biochemical process. Our probabilistic model predicts the generation probability of any specific CDR3 sequence by the primitive recombination process, allowing us to quantify the potential diversity of the T-cell repertoire and to understand why some sequences are shared between individuals. We argue that the use of formal statistical inference methods, of the kind presented in this paper, will be essential for quantitative understanding of the generation and evolution of diversity in the adaptive immune system.
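The central idea in the abstract above, that one observed CDR3 sequence can arise from several hidden recombination scenarios whose probabilities must be summed, can be illustrated with a deliberately tiny model. The sketch below is not the authors' inference method: it omits D genes and parameter fitting, and every gene name, sequence, and probability in it is a hypothetical placeholder chosen only to make the summation concrete.

```python
from itertools import product

# Hypothetical toy parameters (NOT taken from the paper): each gene maps to
# (germline sequence at the junction, probability of choosing that gene).
V_GENES = {"V1": ("TGTGCC", 0.6), "V2": ("TGTGCA", 0.4)}
J_GENES = {"J1": ("GAGTTC", 0.7), "J2": ("CAGTTC", 0.3)}
P_DEL = {0: 0.5, 1: 0.3, 2: 0.2}               # bases deleted from a gene end
P_INS_LEN = {0: 0.4, 1: 0.3, 2: 0.2, 3: 0.1}   # number of inserted bases


def generation_probability(seq):
    """Sum the probabilities of all toy scenarios that produce `seq`.

    A scenario is (V gene, V deletions, J gene, J deletions, insertion).
    Inserted bases are assumed uniform, so a specific insertion of length n
    has probability 0.25 ** n.
    """
    total = 0.0
    for (v_seq, p_v), (j_seq, p_j) in product(V_GENES.values(), J_GENES.values()):
        for (dv, p_dv), (dj, p_dj) in product(P_DEL.items(), P_DEL.items()):
            v_part = v_seq[:len(v_seq) - dv]   # trim the 3' end of V
            j_part = j_seq[dj:]                # trim the 5' end of J
            n_ins = len(seq) - len(v_part) - len(j_part)
            if n_ins not in P_INS_LEN:
                continue                       # insertion length not allowed
            if not (seq.startswith(v_part) and seq.endswith(j_part)):
                continue                       # scenario cannot yield seq
            total += p_v * p_dv * p_j * p_dj * P_INS_LEN[n_ins] * 0.25 ** n_ins
    return total


if __name__ == "__main__":
    print(generation_probability("TGTGCCGAGTTC"))
```

Even in this toy setting the sequence `TGTGCCGAGTTC` is reachable both by joining untrimmed genes and by trimming a base and re-inserting the same base, which is why the hidden events cannot be read directly off the observed sequence.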
Genome-wide network analysis of Wnt signaling in three pediatric cancers

NASA Astrophysics Data System (ADS)

Bao, Ju; Lee, Ho-Jin; Zheng, Jie J.

2013-10-01

Genomic structural alteration is common in pediatric cancers, and analysis of data generated by the Pediatric Cancer Genome Project reveals such tumor-related alterations in many Wnt signaling-associated genes. Most pediatric cancers are thought to arise within developing tissues that undergo substantial expansion during early organ formation, growth and maturation, and Wnt signaling plays an important role in this development. We examined three pediatric tumors--medulloblastoma, early T-cell precursor acute lymphoblastic leukemia, and retinoblastoma--that show multiple genomic structural variations within Wnt signaling pathways. We mathematically modeled this pathway to investigate the effects of cancer-related structural variations on Wnt signaling. Surprisingly, we found that an outcome measure of canonical Wnt signaling was consistently similar in matched cancer cells and normal cells, even in the context of different cancers, different mutations, and different Wnt-related genes. Our results suggest that the cancer cells maintain a normal level of Wnt signaling by developing multiple mutations.
Analyzing multiple data sets by interconnecting RSAT programs via SOAP Web services: an example with ChIP-chip data.

PubMed

Sand, Olivier; Thomas-Chollier, Morgane; Vervisch, Eric; van Helden, Jacques

2008-01-01

This protocol shows how to access the Regulatory Sequence Analysis Tools (RSAT) via a programmatic interface in order to automate the analysis of multiple data sets. We describe the steps for writing a Perl client that connects to the RSAT Web services and implements a workflow to discover putative cis-acting elements in promoters of gene clusters. In the presented example, we apply this workflow to lists of transcription factor target genes resulting from ChIP-chip experiments. For each factor, the protocol predicts the binding motifs by detecting significantly overrepresented hexanucleotides in the target promoters and generates a feature map that displays the positions of putative binding sites along the promoter sequences. This protocol is addressed to bioinformaticians and biologists with programming skills (notions of Perl). Running time is approximately 6 min on the example data set.
Synthetic Biology Platform for Sensing and Integrating Endogenous Transcriptional Inputs in Mammalian Cells.

PubMed

Angelici, Bartolomeo; Mailand, Erik; Haefliger, Benjamin; Benenson, Yaakov

2016-08-30

One of the goals of synthetic biology is to develop programmable artificial gene networks that can transduce multiple endogenous molecular cues to precisely control cell behavior. Realizing this vision requires interfacing natural molecular inputs with synthetic components that generate functional molecular outputs. Interfacing synthetic circuits with endogenous mammalian transcription factors has been particularly difficult. Here, we describe a systematic approach that enables integration and transduction of multiple mammalian transcription factor inputs by a synthetic network. The approach is facilitated by a proportional amplifier sensor based on synergistic positive autoregulation. The circuits efficiently transduce endogenous transcription factor levels into RNAi, transcriptional transactivation, and site-specific recombination. They also enable AND logic between pairs of arbitrary transcription factors. The results establish a framework for developing synthetic gene networks that interface with cellular processes through transcriptional regulators.
Predicting functional divergence in protein evolution by site-specific rate shifts

NASA Technical Reports Server (NTRS)

Gaucher, Eric A.; Gu, Xun; Miyamoto, Michael M.; Benner, Steven A.

2002-01-01

Most modern tools that analyze protein evolution allow individual sites to mutate at constant rates over the history of the protein family. However, Walter Fitch observed in the 1970s that, if a protein changes its function, the mutability of individual sites might also change. This observation is captured in the "non-homogeneous gamma model", which extracts functional information from gene families by examining the different rates at which individual sites evolve. This model has recently been coupled with structural and molecular biology to identify sites that are likely to be involved in changing function within the gene family. Applying this to multiple gene families highlights the widespread divergence of functional behavior among proteins to generate paralogs and orthologs.
Mechanism for DNA transposons to generate introns on genomic scales

PubMed Central

Huff, Jason T.; Zilberman, Daniel; Roy, Scott W.

2017-01-01

Discovered four decades ago, the existence of introns was one of the most unexpected findings in molecular biology [1]. Introns are sequences interrupting genes that must be removed as part of mRNA production. Genome sequencing projects have documented that most eukaryotic genes contain at least one and frequently many introns [2,3]. Comparison of these genomes reveals a history of long evolutionary periods with little intron gain punctuated by episodes of rapid, extensive gain [2,3]. However, no detailed mechanism for such episodic intron generation has been empirically supported on a sufficient scale, despite several proposals [4–8]. Here we show how short non-autonomous DNA transposons independently generated hundreds to thousands of introns in the prasinophyte Micromonas pusilla and the pelagophyte Aureococcus anophagefferens. Each transposon carries one splice site. The other splice site is co-opted from gene sequence duplicated upon transposon insertion, allowing perfect splicing out of RNA. The distributions of sequences that can be co-opted are biased with respect to codons, and phasing of transposon-generated introns is similarly biased. These transposons insert between preexisting nucleosomes, so that multiple nearby insertions generate nucleosome-sized intervening segments. Thus, transposon insertion and sequence co-option may explain the intron phase biases [2] and prevalence of nucleosome-sized exons [9] observed in eukaryotes. Overall, the two independent examples of proliferating elements illustrate a general DNA transposon mechanism plausibly accounting for episodes of rapid, extensive intron gain during eukaryotic evolution [2,3]. PMID:27760113
A Genetic Screen for Mutations Affecting Cell Division in the Arabidopsis thaliana Embryo Identifies Seven Loci Required for Cytokinesis

DOE PAGES

Gillmor, C. Stewart; Roeder, Adrienne H. K.; Sieber, Patrick; ...

2016-01-08

Cytokinesis in plants involves the formation of unique cellular structures such as the phragmoplast and the cell plate, both of which are required to divide the cell after nuclear division. In order to isolate genes that are involved in de novo cell wall formation, we performed a large-scale, microscope-based screen for Arabidopsis mutants that severely impair cytokinesis in the embryo. We recovered 35 mutations that form abnormally enlarged cells with multiple, often polyploid nuclei and incomplete cell walls. These mutants represent seven genes, four of which have previously been implicated in phragmoplast or cell plate function. Mutations in two loci show strongly reduced transmission through the haploid gametophytic generation. Molecular cloning of both corresponding genes reveals that one is represented by hypomorphic alleles of the kinesin-5 gene RADIALLY SWOLLEN 7 (homologous to tobacco kinesin-related protein TKRP125), and that the other gene corresponds to the Arabidopsis FUSED ortholog TWO-IN-ONE (originally identified based on its function in pollen development). No mutations that completely abolish the formation of cross walls in diploid cells were found. Lastly, our results support the idea that cytokinesis in the diploid and haploid generations involves similar mechanisms.
A Genetic Screen for Mutations Affecting Cell Division in the Arabidopsis thaliana Embryo Identifies Seven Loci Required for Cytokinesis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gillmor, C. Stewart; Roeder, Adrienne H. K.; Sieber, Patrick

Cytokinesis in plants involves the formation of unique cellular structures such as the phragmoplast and the cell plate, both of which are required to divide the cell after nuclear division. In order to isolate genes that are involved in de novo cell wall formation, we performed a large-scale, microscope-based screen for Arabidopsis mutants that severely impair cytokinesis in the embryo. We recovered 35 mutations that form abnormally enlarged cells with multiple, often polyploid nuclei and incomplete cell walls. These mutants represent seven genes, four of which have previously been implicated in phragmoplast or cell plate function. Mutations in two loci show strongly reduced transmission through the haploid gametophytic generation. Molecular cloning of both corresponding genes reveals that one is represented by hypomorphic alleles of the kinesin-5 gene RADIALLY SWOLLEN 7 (homologous to tobacco kinesin-related protein TKRP125), and that the other gene corresponds to the Arabidopsis FUSED ortholog TWO-IN-ONE (originally identified based on its function in pollen development). No mutations that completely abolish the formation of cross walls in diploid cells were found. Lastly, our results support the idea that cytokinesis in the diploid and haploid generations involves similar mechanisms.
Multiplexed CRISPR/Cas9 Genome Editing and Gene Regulation Using Csy4 in Saccharomyces cerevisiae.

PubMed

Ferreira, Raphael; Skrekas, Christos; Nielsen, Jens; David, Florian

2018-01-19

Clustered regularly interspaced short palindromic repeats (CRISPR) technology has greatly accelerated the field of strain engineering. However, insufficient efforts have been made toward developing robust multiplexing tools in Saccharomyces cerevisiae. Here, we exploit the RNA processing capacity of the bacterial endoribonuclease Csy4 from Pseudomonas aeruginosa to generate multiple gRNAs from a single transcript for genome editing and gene interference applications in S. cerevisiae. With regard to genome editing, we performed a quadruple deletion of FAA1, FAA4, POX1 and TES1, reaching 96% efficiency out of 24 colonies tested. Then, we used this system to efficiently regulate the transcription of three genes: OLE1, HMG1 and ACS1. Thus, we demonstrate that multiplexed genome editing and gene regulation can be performed in a fast and effective manner using Csy4.
Chronic exposure to water pollutant trichloroethylene increased epigenetic drift in CD4+ T cells

PubMed Central

Gilbert, Kathleen M; Blossom, Sarah J; Erickson, Stephen W; Reisfeld, Brad; Zurlinden, Todd J; Broadfoot, Brannon; West, Kirk; Bai, Shasha; Cooney, Craig A

2016-01-01

Aim: Autoimmune disease and CD4+ T-cell alterations are induced in mice exposed to the water pollutant trichloroethylene (TCE). We examined here whether TCE altered gene-specific DNA methylation in CD4+ T cells as a possible mechanism of immunotoxicity. Materials & methods: Naive and effector/memory CD4+ T cells from mice exposed to TCE (0.5 mg/ml in drinking water) for 40 weeks were examined by bisulfite next-generation DNA sequencing. Results: A probabilistic model calculated from multiple genes showed that TCE decreased methylation control in CD4+ T cells. Data from individual genes fitted to a quadratic regression model showed that TCE increased gene-specific methylation variance in both CD4 subsets. Conclusion: TCE increased epigenetic drift of specific CpG sites in CD4+ T cells. PMID:27092578
NCBI GEO: archive for functional genomics data sets--10 years on.

PubMed

Barrett, Tanya; Troup, Dennis B; Wilhite, Stephen E; Ledoux, Pierre; Evangelista, Carlos; Kim, Irene F; Tomashevsky, Maxim; Marshall, Kimberly A; Phillippy, Katherine H; Sherman, Patti M; Muertter, Rolf N; Holko, Michelle; Ayanbule, Oluwabukunmi; Yefanov, Andrey; Soboleva, Alexandra

2011-01-01

A decade ago, the Gene Expression Omnibus (GEO) database was established at the National Center for Biotechnology Information (NCBI). The original objective of GEO was to serve as a public repository for high-throughput gene expression data generated mostly by microarray technology. However, the research community quickly applied microarrays to non-gene-expression studies, including examination of genome copy number variation and genome-wide profiling of DNA-binding proteins. Because the GEO database was designed with a flexible structure, it was possible to quickly adapt the repository to store these data types. More recently, as the microarray community switches to next-generation sequencing technologies, GEO has again adapted to host these data sets. Today, GEO stores over 20,000 microarray- and sequence-based functional genomics studies, and continues to handle the majority of direct high-throughput data submissions from the research community. Multiple mechanisms are provided to help users effectively search, browse, download and visualize the data at the level of individual genes or entire studies. This paper describes recent database enhancements, including new search and data representation tools, as well as a brief review of how the community uses GEO data. GEO is freely accessible at http://www.ncbi.nlm.nih.gov/geo/.
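Beyond the web interface described above, GEO records can also be queried programmatically through NCBI's general-purpose E-utilities. The sketch below only builds an ESearch request URL against the GEO DataSets database (`db=gds`); it makes no network call. The helper name `geo_search_url` and the example search term are illustrative assumptions, and the record does not say which search tools the paper itself describes.

```python
from urllib.parse import urlencode

# Public NCBI E-utilities ESearch endpoint.
EUTILS_ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def geo_search_url(term, max_records=20):
    """Build an ESearch URL for the GEO DataSets database (db=gds).

    `geo_search_url` is a hypothetical helper written for this sketch.
    """
    params = {
        "db": "gds",             # GEO DataSets Entrez database
        "term": term,            # free-text query
        "retmax": max_records,   # maximum number of IDs to return
        "retmode": "json",       # ask for a JSON response
    }
    return f"{EUTILS_ESEARCH}?{urlencode(params)}"


if __name__ == "__main__":
    print(geo_search_url("copy number variation"))
```

Fetching the resulting URL with any HTTP client should return the matching record identifiers, subject to NCBI's usage guidelines for request rates.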
A Case-by-Case Evolutionary Analysis of Four Imprinted Retrogenes

PubMed Central

McCole, Ruth B; Loughran, Noeleen B; Chahal, Mandeep; Fernandes, Luis P; Roberts, Roland G; Fraternali, Franca; O'Connell, Mary J; Oakey, Rebecca J

2011-01-01

Retroposition is a widespread phenomenon resulting in the generation of new genes that are initially related to a parent gene via very high coding sequence similarity. We examine the evolutionary fate of four retrogenes generated by such an event: mouse Inpp5f_v2, Mcts2, Nap1l5, and U2af1-rs1. These genes are all subject to the epigenetic phenomenon of parental imprinting. We first provide new data on the age of these retrogene insertions. Using codon-based models of sequence evolution, we show these retrogenes have diverse evolutionary trajectories, including divergence from the parent coding sequence under positive selection pressure, purifying selection pressure maintaining parent-retrogene similarity, and neutral evolution. Examination of the expression pattern of retrogenes shows an atypical, broad pattern across multiple tissues. Protein 3D structure modeling reveals that a positively selected residue in U2af1-rs1, not shared by its parent, may influence protein conformation. Our case-by-case analysis of the evolution of four imprinted retrogenes reveals that this interesting class of imprinted genes, while similar in regulation and sequence characteristics, follows widely varied evolutionary paths. PMID:21166792

Dystrophic Cardiomyopathy: Complex Pathobiological Processes to Generate Clinical Phenotype

PubMed Central

Tsuda, Takeshi; Fitzgerald, Kristi K.

2017-01-01

Duchenne muscular dystrophy (DMD), Becker muscular dystrophy (BMD), and X-linked dilated cardiomyopathy (XL-DCM) constitute a unique clinical entity, the dystrophinopathies, which are due to variable mutations in the dystrophin gene. Dilated cardiomyopathy (DCM) is a common complication of dystrophinopathies, but the onset, progression, and severity of heart disease differ among these subgroups. Extensive molecular genetic studies have been conducted to assess genotype-phenotype correlation in DMD, BMD, and XL-DCM to understand the underlying mechanisms of these diseases, but the results are not always conclusive, suggesting the involvement of multiple complex layers of pathological processes that generate the final clinical phenotype. Dystrophin protein is a part of the dystrophin-glycoprotein complex (DGC) that is localized in skeletal muscles, myocardium, smooth muscles, and neuronal tissues. Diversity of cardiac phenotype in dystrophinopathies suggests multiple layers of pathogenetic mechanisms in forming dystrophic cardiomyopathy. In this review article, we review the complex molecular interactions involved in the pathogenesis of dystrophic cardiomyopathy, including primary gene mutations and loss of structural integrity, secondary cellular responses, and certain epigenetic and other factors that modulate gene expression. Involvement of epigenetic gene regulation appears to lead to specific cardiac phenotypes in dystrophic hearts. PMID:29367543
GeneSCF: a real-time based functional enrichment tool with support for multiple organisms.

PubMed

Subhash, Santhilal; Kanduri, Chandrasekhar

2016-09-13

High-throughput technologies such as ChIP-sequencing, RNA-sequencing, DNA sequencing and quantitative metabolomics generate a huge volume of data. Researchers often rely on functional enrichment tools to interpret the biological significance of the affected genes from these high-throughput studies. However, currently available functional enrichment tools need to be updated frequently to adapt to new entries from the functional database repositories. Hence there is a need for a simplified tool that can perform functional enrichment analysis by using updated information directly from source databases such as KEGG, Reactome or Gene Ontology. In this study, we focused on designing a command-line tool called GeneSCF (Gene Set Clustering based on Functional annotations) that can predict the functionally relevant biological information for a set of genes in a real-time updated manner. It is designed to handle information from more than 4000 organisms from freely available prominent functional databases like KEGG, Reactome and Gene Ontology. We successfully employed our tool on two published datasets to predict the biologically relevant functional information. The core features of this tool were tested on Linux machines without the need to install additional dependencies. GeneSCF is more reliable compared to other enrichment tools because of its ability to use reference functional databases in real-time to perform enrichment analysis. It is an easy-to-integrate tool with other pipelines available for downstream analysis of high-throughput data. More importantly, GeneSCF can run multiple gene lists simultaneously on different organisms, thereby saving time for the users. Since the tool is designed to be ready-to-use, there is no need for any complex compilation and installation procedures.
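For readers unfamiliar with what a functional enrichment analysis computes, the following minimal sketch shows a generic over-representation test based on the hypergeometric distribution. It is an illustration of the general technique only; the abstract does not state which statistic GeneSCF uses, and the function name and the example counts are assumptions made for this sketch.

```python
from math import comb


def enrichment_p_value(n_background, n_in_term, n_query, n_overlap):
    """One-sided hypergeometric p-value for over-representation.

    n_background: genes in the annotated background
    n_in_term:    background genes annotated to the functional term
    n_query:      genes in the user's list
    n_overlap:    query genes annotated to the term
    Returns P(X >= n_overlap) when n_query genes are drawn at random.
    """
    upper = min(n_in_term, n_query)
    tail = sum(
        comb(n_in_term, k) * comb(n_background - n_in_term, n_query - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / comb(n_background, n_query)


if __name__ == "__main__":
    # Hypothetical counts: 20,000 background genes, 100 in the term,
    # 200 query genes, 10 of which fall in the term.
    print(enrichment_p_value(20000, 100, 200, 10))
```

In practice such a test is run once per functional term, and the resulting p-values are then corrected for multiple testing.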
De Novo Assembled Wheat Transcriptomes Delineate Differentially Expressed Host Genes in Response to Leaf Rust Infection.

PubMed

Chandra, Saket; Singh, Dharmendra; Pathak, Jyoti; Kumari, Supriya; Kumar, Manish; Poddar, Raju; Balyan, Harindra Singh; Gupta, Puspendra Kumar; Prabhu, Kumble Vinod; Mukhopadhyay, Kunal

2016-01-01

Pathogens like Puccinia triticina, the causal organism for leaf rust, extensively damage wheat production. The interaction at the molecular level between wheat and the pathogen is complex and less explored. The pathogen-induced response was characterized using mock- or pathogen-inoculated near-isogenic wheat lines (with or without seedling leaf rust resistance gene Lr28). Four Serial Analysis of Gene Expression libraries were prepared from mock- and pathogen-inoculated plants and were subjected to Sequencing by Oligonucleotide Ligation and Detection, which generated a total of 165,767,777 reads, each 35 bases long. The reads were processed and multiple k-mers were attempted for de novo transcript assembly; 22 k-mers showed the best results. Altogether 21,345 contigs were generated and functionally characterized by gene ontology annotation, mining for transcription factors and resistance genes. Expression analysis among the four libraries showed extensive alterations in the transcriptome in response to pathogen infection, reflecting reorganizations in major biological processes and metabolic pathways. The role of auxin in determining pathogenesis in susceptible and resistant lines was imperative. The qPCR expression study of four LRR-RLK (Leucine-rich repeat receptor-like protein kinases) genes showed higher expression at 24 h after inoculation with the pathogen. In summary, the conceptual model of induced resistance in wheat contributes insights on defense responses and imparts knowledge of Puccinia triticina-induced defense transcripts in wheat plants.
De Novo Assembled Wheat Transcriptomes Delineate Differentially Expressed Host Genes in Response to Leaf Rust Infection

PubMed Central

Pathak, Jyoti; Kumari, Supriya; Kumar, Manish; Poddar, Raju; Balyan, Harindra Singh; Gupta, Puspendra Kumar; Prabhu, Kumble Vinod; Mukhopadhyay, Kunal

2016-01-01

Pathogens like Puccinia triticina, the causal organism for leaf rust, extensively damage wheat production. The interaction at the molecular level between wheat and the pathogen is complex and less explored. The pathogen-induced response was characterized using mock- or pathogen-inoculated near-isogenic wheat lines (with or without seedling leaf rust resistance gene Lr28). Four Serial Analysis of Gene Expression libraries were prepared from mock- and pathogen-inoculated plants and were subjected to Sequencing by Oligonucleotide Ligation and Detection, which generated a total of 165,767,777 reads, each 35 bases long. The reads were processed and multiple k-mers were attempted for de novo transcript assembly; 22 k-mers showed the best results. Altogether 21,345 contigs were generated and functionally characterized by gene ontology annotation, mining for transcription factors and resistance genes. Expression analysis among the four libraries showed extensive alterations in the transcriptome in response to pathogen infection, reflecting reorganizations in major biological processes and metabolic pathways. The role of auxin in determining pathogenesis in susceptible and resistant lines was imperative. The qPCR expression study of four LRR-RLK (Leucine-rich repeat receptor-like protein kinases) genes showed higher expression at 24 h after inoculation with the pathogen. In summary, the conceptual model of induced resistance in wheat contributes insights on defense responses and imparts knowledge of Puccinia triticina-induced defense transcripts in wheat plants. PMID:26840746
Genomic and Transcriptomic Analyses to Identify Pathways Involved in Nanoparticle Generation in the Ubiquitous Marine Bacterium Alteromonas macleodii Under Elevated Copper Conditions

NASA Astrophysics Data System (ADS)

Cusick, K. D.; Dale, J.; Little, B.; Cockrell, A.; Biffinger, J.

2016-02-01

Alteromonas macleodii is a ubiquitous marine bacterium that clusters by molecular analyses into two ecotypes: surface and deep-water. Our group isolated a marine bacterium from copper coupons that generates nanoparticles (NPs) at elevated copper concentrations. Sequencing of the 16S rRNA gene identified it as an A. macleodii strain. In phylogenetic analyses based on the gyrB gene, it clustered with other surface isolates; however, it formed a unique cluster separate from that of other surface isolates based on rpoB gene sequences. Copper is commonly employed as an antifouling agent on the hulls of ships, and so copper tolerance and NP generation are under investigation in this strain. The overall goals of this study were: (1) to determine if copper tolerance is the result of changes at the genetic or transcriptional level and (2) to identify the genes involved in NP formation. Sub-cultures were established from the initial isolate in which copper concentrations were increased in 0.25 mM increments through multiple generations. These sub-cultures were assayed for NP formation in seawater medium supplemented with 3-4 mM copper. Scanning electron microscopy revealed large aggregates of NPs on the exterior surface of all sub-cultures. Additionally, a portion of the cells in all sub-cultures displayed an elongated morphology in comparison to the wild-type. No NPs were observed in wild-type controls grown without the addition of increased copper. Metagenomic sequencing of natural populations of A. macleodii revealed extreme divergence in several large genomic regions whose content includes genes coding for exopolysaccharide production and metal resistance. High-throughput sequencing is being used to determine whether copper tolerance and NP generation are the result of genetic or transcriptional changes. These results will be extended to natural communities to gain insights into the role of bacterial NPs during conditions of elevated metal concentrations in coastal systems.
Evolutionary origins of a novel host plant detoxification gene in butterflies.

PubMed

Fischer, Hanna M; Wheat, Christopher W; Heckel, David G; Vogel, Heiko

2008-05-01

Chemical interactions between plants and their insect herbivores provide an excellent opportunity to study the evolution of species interactions on a molecular level. Here, we investigate the molecular evolutionary events that gave rise to a novel detoxifying enzyme (nitrile-specifier protein [NSP]) in the butterfly family Pieridae, previously identified as a coevolutionary key innovation. By generating and sequencing expressed sequence tags and genomic libraries, and by screening databases, we found NSP to be a member of an insect-specific gene family, which we characterized and named the NSP-like gene family. Members consist of variable tandem repeats, are gut expressed, and are found across Insecta evolving in a dynamic, ongoing birth-death process. In the Lepidoptera, multiple copies of single-domain major allergen genes are present and originate via tandem duplications. Multiple domain genes are found solely within the brassicaceous-feeding Pieridae butterflies, one of them being NSP and another called major allergen (MA). Analyses suggest that NSP and its paralog MA have a unique single-domain evolutionary origin, being formed by intragenic domain duplication followed by tandem whole-gene duplication. Duplicates subsequently experienced a period of relaxed constraint followed by an increase in constraint, perhaps after neofunctionalization. NSP and its ortholog MA are still experiencing high rates of change, reflecting a dynamic evolution consistent with the known role of NSP in plant-insect interactions. Our results provide direct evidence for the hypothesis that gene duplication is one of the driving forces for speciation and adaptation, showing that both within- and whole-gene tandem duplications are a powerful force underlying evolutionary adaptation.
Integrating multiple immunogenetic data sources for feature extraction and mining somatic hypermutation patterns: the case of "towards analysis" in chronic lymphocytic leukaemia.

PubMed

Kavakiotis, Ioannis; Xochelli, Aliki; Agathangelidis, Andreas; Tsoumakas, Grigorios; Maglaveras, Nicos; Stamatopoulos, Kostas; Hadzidimitriou, Anastasia; Vlahavas, Ioannis; Chouvarda, Ioanna

2016-06-06

Somatic Hypermutation (SHM) refers to the introduction of mutations within rearranged V(D)J genes, a process that increases the diversity of Immunoglobulins (IGs). The analysis of SHM has offered critical insight into the physiology and pathology of B cells, leading to strong prognostication markers for clinical outcome in chronic lymphocytic leukaemia (CLL), the most frequent adult B-cell malignancy. In this paper we present a methodology for integrating multiple immunogenetic and clinicobiological data sources in order to extract features and create high quality datasets for SHM analysis in IG receptors of CLL patients. This dataset is used as the basis for a higher level integration procedure, inspired by social choice theory. This is applied in the Towards Analysis, our attempt to investigate the potential ontogenetic transformation of genes belonging to specific stereotyped CLL subsets towards other genes or gene families, through SHM. The data integration process, followed by feature extraction, resulted in the generation of a dataset containing information about mutations occurring through SHM. The Towards analysis, performed on the integrated dataset by applying voting techniques, revealed the distinct behaviour of subset #201 compared to other subsets with regard to SHM-related movements among gene clans, both in allele-conserved and non-conserved gene areas. With respect to movement between genes, a high percentage of movement towards pseudogenes was found in all CLL subsets. This data integration and feature extraction process can set the basis for exploratory analysis or a fully automated computational data mining approach on many as yet unanswered, clinically relevant biological questions.
BEACON: automated tool for Bacterial GEnome Annotation ComparisON.

PubMed

Kalkatawi, Manal; Alam, Intikhab; Bajic, Vladimir B

2015-08-18

Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACON's utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27%, while the number of genes without any function assignment is reduced. We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/ .
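The idea of generating an extended annotation by combining individual ones can be illustrated with a minimal sketch. This is not BEACON's code or file format: the dictionary layout (gene ID mapped to a putative function, or None when no function was assigned), the "first non-empty function wins" rule, and the names combine_annotations and annotated_count are all assumptions made for illustration.

```python
from typing import Dict, Optional

# Hypothetical annotation format: gene ID -> putative function, or None when
# the annotation method (AM) assigned no function. Not BEACON's real format.
Annotation = Dict[str, Optional[str]]


def combine_annotations(*annotations: Annotation) -> Annotation:
    """Build an extended annotation: per gene, keep the first non-empty function."""
    extended: Annotation = {}
    for annotation in annotations:
        for gene, function in annotation.items():
            # Only fill in genes that are still missing or unannotated.
            if extended.get(gene) is None:
                extended[gene] = function
    return extended


def annotated_count(annotation: Annotation) -> int:
    """Number of genes that carry a putative function."""
    return sum(1 for function in annotation.values() if function)


# Toy annotations of the same three-gene genome from two hypothetical AMs.
am_a = {"g1": "DNA polymerase", "g2": None, "g3": None}
am_b = {"g1": "DNA polymerase III", "g2": "ABC transporter", "g3": None}

extended = combine_annotations(am_a, am_b)
gain = annotated_count(extended) - annotated_count(am_a)
print(extended)
print(f"genes newly annotated relative to am_a: {gain}")
```

In this toy case the combined annotation assigns a function to one additional gene (g2), while g3 remains without any assignment; a real comparison would also need to reconcile conflicting functions such as the two labels for g1.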
Comparative Genomics of Syntrophic Branched-Chain Fatty Acid Degrading Bacteria

PubMed Central

Narihiro, Takashi; Nobu, Masaru K.; Tamaki, Hideyuki; Kamagata, Yoichi; Sekiguchi, Yuji; Liu, Wen-Tso

2016-01-01

The syntrophic degradation of branched-chain fatty acids (BCFAs) such as 2-methylbutyrate and isobutyrate is an essential step in the production of methane from proteins/amino acids in anaerobic ecosystems. While a few syntrophic BCFA-degrading bacteria have been isolated, their metabolic pathways in BCFA and short-chain fatty acid (SCFA) degradation as well as energy conservation systems remain unclear. In an attempt to identify these pathways, we herein performed comparative genomics of three syntrophic bacteria: 2-methylbutyrate-degrading “Syntrophomonas wolfei subsp. methylbutyratica” strain JCM 14075T (=4J5T), isobutyrate-degrading Syntrophothermus lipocalidus strain TGB-C1T, and non-BCFA-metabolizing S. wolfei subsp. wolfei strain GöttingenT. We demonstrated that 4J5 and TGB-C1 both encode multiple genes/gene clusters involved in β-oxidation, as observed in the Göttingen genome, which has multiple copies of genes associated with butyrate degradation. The 4J5 genome possesses phylogenetically distinct β-oxidation genes, which may be involved in 2-methylbutyrate degradation. In addition, these Syntrophomonadaceae strains harbor various hydrogen/formate generation systems (i.e., electron-bifurcating hydrogenase, formate dehydrogenase, and membrane-bound hydrogenase) and energy-conserving electron transport systems, including electron transfer flavoprotein (ETF)-linked acyl-CoA dehydrogenase, ETF-linked iron-sulfur binding reductase, ETF dehydrogenase (FixABCX), and flavin oxidoreductase-heterodisulfide reductase (Flox-Hdr). Unexpectedly, the TGB-C1 genome encodes a nitrogenase complex, which may function as an alternative H2 generation mechanism. These results suggest that the BCFA-degrading syntrophic strains 4J5 and TGB-C1 possess specific β-oxidation-related enzymes for BCFA oxidation as well as appropriate energy conservation systems to perform thermodynamically unfavorable syntrophic metabolism. PMID:27431485
Induction of human cardiomyocyte-like cells from fibroblasts by defined factors.

PubMed

Wada, Rie; Muraoka, Naoto; Inagawa, Kohei; Yamakawa, Hiroyuki; Miyamoto, Kazutaka; Sadahiro, Taketaro; Umei, Tomohiko; Kaneda, Ruri; Suzuki, Tomoyuki; Kamiya, Kaichiro; Tohyama, Shugo; Yuasa, Shinsuke; Kokaji, Kiyokazu; Aeba, Ryo; Yozu, Ryohei; Yamagishi, Hiroyuki; Kitamura, Toshio; Fukuda, Keiichi; Ieda, Masaki

2013-07-30

Heart disease remains a leading cause of death worldwide. Owing to the limited regenerative capacity of heart tissue, cardiac regenerative therapy has emerged as an attractive approach. Direct reprogramming of human cardiac fibroblasts (HCFs) into cardiomyocytes may hold great potential for this purpose. We reported previously that induced cardiomyocyte-like cells (iCMs) can be directly generated from mouse cardiac fibroblasts in vitro and in vivo by transduction of three transcription factors: Gata4, Mef2c, and Tbx5, collectively termed GMT. In the present study, we sought to determine whether human fibroblasts also could be converted to iCMs by defined factors. Our initial finding that GMT was not sufficient for cardiac induction in HCFs prompted us to screen for additional factors to promote cardiac reprogramming by analyzing multiple cardiac-specific gene induction with quantitative RT-PCR. The addition of Mesp1 and Myocd to GMT up-regulated a broader spectrum of cardiac genes in HCFs more efficiently compared with GMT alone. The HCFs and human dermal fibroblasts transduced with GMT, Mesp1, and Myocd (GMTMM) changed the cell morphology from a spindle shape to a rod-like or polygonal shape, expressed multiple cardiac-specific proteins, increased a broad range of cardiac genes and concomitantly suppressed fibroblast genes, and exhibited spontaneous Ca(2+) oscillations. Moreover, the cells matured to exhibit action potentials and contract synchronously in coculture with murine cardiomyocytes. A 5-ethynyl-2'-deoxyuridine assay revealed that the iCMs thus generated do not pass through a mitotic cell state. These findings demonstrate that human fibroblasts can be directly converted to iCMs by defined factors, which may facilitate future applications in regenerative medicine.
The evolutionary landscape of intergenic trans-splicing events in insects

PubMed Central

Kong, Yimeng; Zhou, Hongxia; Yu, Yao; Chen, Longxian; Hao, Pei; Li, Xuan

2015-01-01

To explore the landscape of intergenic trans-splicing events and characterize their functions and evolutionary dynamics, we conduct a mega-data study of a phylogeny containing eight species across five orders of class Insecta, a model system spanning 400 million years of evolution. A total of 1,627 trans-splicing events involving 2,199 genes are identified, accounting for 1.58% of the total genes. Homology analysis reveals that mod(mdg4)-like trans-splicing is the only conserved event that is consistently observed in multiple species across two orders, which represents a unique case of functional diversification involving trans-splicing. Thus, evolutionarily its potential for generating proteins with novel function is not broadly utilized by insects. Furthermore, 146 non-mod trans-spliced transcripts are found to resemble canonical genes from different species. Trans-splicing preserving the function of 'breakup' genes may serve as a general mechanism for relaxing the constraints on gene structure, with profound implications for the evolution of genes and genomes. PMID:26521696
The Architecture of Parent-of-Origin Effects in Mice

PubMed Central

Mott, Richard; Yuan, Wei; Kaisaki, Pamela; Gan, Xiangchao; Cleak, James; Edwards, Andrew; Baud, Amelie; Flint, Jonathan

2014-01-01

Summary: The number of imprinted genes in the mammalian genome is predicted to be small, yet we show here, in a survey of 97 traits measured in outbred mice, that most phenotypes display parent-of-origin effects that are partially confounded with family structure. To address this contradiction, using reciprocal F1 crosses, we investigated the effects of knocking out two nonimprinted candidate genes, Man1a2 and H2-ab1, that reside at nonimprinted loci but that show parent-of-origin effects. We show that expression of multiple genes becomes dysregulated in a sex-, tissue-, and parent-of-origin-dependent manner. We provide evidence that nonimprinted genes can generate parent-of-origin effects by interaction with imprinted loci and deduce that the importance of the number of imprinted genes is secondary to their interactions. We propose that this gene network effect may account for some of the missing heritability seen when comparing sibling-based to population-based studies of the phenotypic effects of genetic variants. PMID:24439386
Oncogenes and tumor suppressors in the molecular pathogenesis of acute promyelocytic leukemia.

PubMed

Pandolfi, P P

2001-04-01

Acute promyelocytic leukemia (APL) is associated with reciprocal chromosomal translocations always involving the retinoic acid receptor alpha (RARalpha) gene on chromosome 17 and variable partner genes (X genes) on distinct chromosomes. RARalpha fuses to the PML gene in the vast majority of APL cases, and in a few cases to the PLZF, NPM, NuMA and Stat5b genes, respectively, leading to the generation of RARalpha-X and X-RARalpha fusion genes. Both fusion proteins can exert oncogenic functions through their ability to interfere with the activities of X and RARalpha proteins. Here, it will be discussed in detail how an extensive biochemical analysis as well as a systematic in vivo genetic approach in the mouse has allowed the definition of the multiple oncogenic activities of PML-RARalpha, and how it has become apparent that this oncoprotein is able to impair RARalpha at the transcription level and the tumor suppressive function of the PML protein.
A condition-specific codon optimization approach for improved heterologous gene expression in Saccharomyces cerevisiae

PubMed Central

2014-01-01

Background: Heterologous gene expression is an important tool for synthetic biology that enables metabolic engineering and the production of non-natural biologics in a variety of host organisms. The translational efficiency of heterologous genes can often be improved by optimizing synonymous codon usage to better match the host organism. However, traditional approaches for optimization neglect to take into account many factors known to influence synonymous codon distributions. Results: Here we define an alternative approach for codon optimization that utilizes systems level information and codon context for the condition under which heterologous genes are being expressed. Furthermore, we utilize a probabilistic algorithm to generate multiple variants of a given gene. We demonstrate improved translational efficiency using this condition-specific codon optimization approach with two heterologous genes, the fluorescent protein-encoding eGFP and the catechol 1,2-dioxygenase gene CatA, expressed in S. cerevisiae. For the latter case, optimization for stationary phase production resulted in nearly 2.9-fold improvements over commercial gene optimization algorithms. Conclusions: Codon optimization is now often a standard tool for protein expression, and while a variety of tools and approaches have been developed, they do not guarantee improved performance for all hosts or applications. Here, we suggest an alternative method for condition-specific codon optimization and demonstrate its utility in Saccharomyces cerevisiae as a proof of concept. However, this technique should be applicable to any organism for which gene expression data can be generated and is thus of potential interest for a variety of applications in metabolic and cellular engineering. PMID:24636000
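To make the notion of a probabilistic algorithm that generates multiple variants of a gene more concrete, the sketch below draws synonymous codons at random according to per-codon weights. It is only an illustration under stated assumptions: the weights in CODON_WEIGHTS are invented placeholders for three amino acids, the function names are hypothetical, and the sketch ignores the codon context that the abstract says the authors' approach takes into account.

```python
import random
from typing import List

# Hypothetical codon weights for three amino acids only. A real table would
# cover all amino acids and be derived from condition-specific expression data.
CODON_WEIGHTS = {
    "M": {"ATG": 1.0},
    "K": {"AAA": 0.42, "AAG": 0.58},
    "F": {"TTT": 0.41, "TTC": 0.59},
}


def sample_variant(protein: str, rng: random.Random) -> str:
    """Draw one synonymous coding sequence by weighted sampling, codon by codon."""
    codons = []
    for amino_acid in protein:
        options = CODON_WEIGHTS[amino_acid]
        codon = rng.choices(list(options), weights=list(options.values()), k=1)[0]
        codons.append(codon)
    return "".join(codons)


def generate_variants(protein: str, n: int, seed: int = 0) -> List[str]:
    """Generate n variants; the seed makes the random draws reproducible."""
    rng = random.Random(seed)
    return [sample_variant(protein, rng) for _ in range(n)]


# Toy peptide; every variant encodes the same amino acids with different codons.
variants = generate_variants("MKFK", n=5)
for variant in variants:
    print(variant)
```

Because each variant is an independent random draw, repeated sampling yields a pool of candidate sequences that all encode the same protein, which could then be screened experimentally or scored by further criteria.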
Differential gene expression in the siphonophore Nanomia bijuga (Cnidaria) assessed with multiple next-generation sequencing workflows.

PubMed

Siebert, Stefan; Robinson, Mark D; Tintori, Sophia C; Goetz, Freya; Helm, Rebecca R; Smith, Stephen A; Shaner, Nathan; Haddock, Steven H D; Dunn, Casey W

2011-01-01

We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. 
Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing.
Differential Gene Expression in the Siphonophore Nanomia bijuga (Cnidaria) Assessed with Multiple Next-Generation Sequencing Workflows

PubMed Central

Siebert, Stefan; Robinson, Mark D.; Tintori, Sophia C.; Goetz, Freya; Helm, Rebecca R.; Smith, Stephen A.; Shaner, Nathan; Haddock, Steven H. D.; Dunn, Casey W.

2011-01-01

We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. 
Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing. PMID:21829563
Uptake, Results, and Outcomes of Germline Multiple-Gene Sequencing After Diagnosis of Breast Cancer.

PubMed

Kurian, Allison W; Ward, Kevin C; Hamilton, Ann S; Deapen, Dennis M; Abrahamse, Paul; Bondarenko, Irina; Li, Yun; Hawley, Sarah T; Morrow, Monica; Jagsi, Reshma; Katz, Steven J

2018-05-10

Low-cost sequencing of multiple genes is increasingly available for cancer risk assessment. Little is known about uptake or outcomes of multiple-gene sequencing after breast cancer diagnosis in community practice. This study examined the effect of multiple-gene sequencing on the experience and treatment outcomes of patients with breast cancer. For this population-based retrospective cohort study, patients with breast cancer diagnosed from January 2013 to December 2015 and accrued from SEER registries across Georgia and in Los Angeles, California, were surveyed (n = 5080, response rate = 70%). Responses were merged with SEER data and results of clinical genetic tests, either BRCA1 and BRCA2 (BRCA1/2) sequencing only or including additional other genes (multiple-gene sequencing), provided by 4 laboratories. The measures were type of testing (multiple-gene sequencing vs BRCA1/2-only sequencing), test results (negative, variant of unknown significance, or pathogenic variant), patient experiences with testing (timing of testing, who discussed results), treatment (strength of patient consideration of, and surgeon recommendation for, prophylactic mastectomy), and prophylactic mastectomy receipt. We defined a patient subgroup with higher pretest risk of carrying a pathogenic variant according to practice guidelines. Among 5026 patients (mean [SD] age, 59.9 [10.7]), 1316 (26.2%) were linked to genetic results from any laboratory. Multiple-gene sequencing increasingly replaced BRCA1/2-only testing over time: in 2013, the rate of multiple-gene sequencing was 25.6% and BRCA1/2-only testing, 74.4%; in 2015 the rate of multiple-gene sequencing was 66.5% and BRCA1/2-only testing, 33.5%. Multiple-gene sequencing was more often ordered by genetic counselors (multiple-gene sequencing, 25.5% and BRCA1/2-only testing, 15.3%) and delayed until after surgery (multiple-gene sequencing, 32.5% and BRCA1/2-only testing, 19.9%).
Multiple-gene sequencing substantially increased rate of detection of any pathogenic variant (multiple-gene sequencing: higher-risk patients, 12%; average-risk patients, 4.2% and BRCA1/2-only testing: higher-risk patients, 7.8%; average-risk patients, 2.2%) and variants of uncertain significance, especially in minorities (multiple-gene sequencing: white patients, 23.7%; black patients, 44.5%; and Asian patients, 50.9% and BRCA1/2-only testing: white patients, 2.2%; black patients, 5.6%; and Asian patients, 0%). Multiple-gene sequencing was not associated with an increase in the rate of prophylactic mastectomy use, which was highest with pathogenic variants in BRCA1/2 (BRCA1/2, 79.0%; other pathogenic variant, 37.6%; variant of uncertain significance, 30.2%; negative, 35.3%). Multiple-gene sequencing rapidly replaced BRCA1/2-only testing for patients with breast cancer in the community and enabled 2-fold higher detection of clinically relevant pathogenic variants without an associated increase in prophylactic mastectomy. However, important targets for improvement in the clinical utility of multiple-gene sequencing include postsurgical delay and racial/ethnic disparity in variants of uncertain significance.
Clustered metallothionein genes are co-regulated in rice and ectopic expression of OsMT1e-P confers multiple abiotic stress tolerance in tobacco via ROS scavenging

PubMed Central

2012-01-01

Background: Metallothioneins (MT) are low molecular weight, cysteine rich metal binding proteins, found across genera and species, but their function(s) in abiotic stress tolerance are not well documented. Results: We have characterized a rice MT gene, OsMT1e-P, isolated from a subtractive library generated from a stressed salinity tolerant rice genotype, Pokkali. Bioinformatics analysis of the rice genome sequence revealed that this gene belongs to a multigenic family, which consists of 13 genes with 15 protein products. OsMT1e-P is located on chromosome XI, away from the majority of other type I genes that are clustered on chromosome XII. Various members of this MT gene cluster showed a tight co-regulation pattern under several abiotic stresses. Sequence analysis revealed the presence of conserved cysteine residues in OsMT1e-P protein. Salinity stress was found to regulate the transcript abundance of OsMT1e-P in a developmental and organ specific manner. Using a transgenic approach, we found a positive correlation between ectopic expression of OsMT1e-P and stress tolerance. Our experiments further suggest ROS scavenging to be the possible mechanism for multiple stress tolerance conferred by OsMT1e-P. Conclusion: We present an overview of MTs, describing their gene structure, genome localization and expression patterns under salinity and development in rice. We have found that ectopic expression of OsMT1e-P enhances tolerance towards multiple abiotic stresses in transgenic tobacco and the resultant plants could survive and set viable seeds under saline conditions. Taken together, the experiments presented here have indicated that ectopic expression of OsMT1e-P protects against oxidative stress primarily through efficient scavenging of reactive oxygen species. PMID:22780875
Clustered metallothionein genes are co-regulated in rice and ectopic expression of OsMT1e-P confers multiple abiotic stress tolerance in tobacco via ROS scavenging.

PubMed

Kumar, Gautam; Kushwaha, Hemant Ritturaj; Panjabi-Sabharwal, Vaishali; Kumari, Sumita; Joshi, Rohit; Karan, Ratna; Mittal, Shweta; Pareek, Sneh L Singla; Pareek, Ashwani

2012-07-10

Metallothioneins (MT) are low molecular weight, cysteine rich metal binding proteins, found across genera and species, but their function(s) in abiotic stress tolerance are not well documented. We have characterized a rice MT gene, OsMT1e-P, isolated from a subtractive library generated from a stressed salinity tolerant rice genotype, Pokkali. Bioinformatics analysis of the rice genome sequence revealed that this gene belongs to a multigenic family, which consists of 13 genes with 15 protein products. OsMT1e-P is located on chromosome XI, away from the majority of other type I genes that are clustered on chromosome XII. Various members of this MT gene cluster showed a tight co-regulation pattern under several abiotic stresses. Sequence analysis revealed the presence of conserved cysteine residues in OsMT1e-P protein. Salinity stress was found to regulate the transcript abundance of OsMT1e-P in a developmental and organ specific manner. Using a transgenic approach, we found a positive correlation between ectopic expression of OsMT1e-P and stress tolerance. Our experiments further suggest ROS scavenging to be the possible mechanism for multiple stress tolerance conferred by OsMT1e-P. We present an overview of MTs, describing their gene structure, genome localization and expression patterns under salinity and development in rice. We have found that ectopic expression of OsMT1e-P enhances tolerance towards multiple abiotic stresses in transgenic tobacco and the resultant plants could survive and set viable seeds under saline conditions. Taken together, the experiments presented here have indicated that ectopic expression of OsMT1e-P protects against oxidative stress primarily through efficient scavenging of reactive oxygen species.
Synergistic effect of amino acids modified on dendrimer surface in gene delivery.

PubMed

Wang, Fei; Wang, Yitong; Wang, Hui; Shao, Naimin; Chen, Yuanyuan; Cheng, Yiyun

2014-11-01

Design of an efficient gene vector based on dendrimer remains a great challenge due to the presence of multiple barriers in gene delivery. Single-functionalization on dendrimer cannot overcome all the barriers. In this study, we synthesized a list of single-, dual- and triple-functionalized dendrimers with arginine, phenylalanine and histidine for gene delivery using a one-pot approach. The three amino acids play different roles in gene delivery: arginine is essential in formation of stable complexes, phenylalanine improves cellular uptake efficacy, and histidine increases pH-buffering capacity and minimizes cytotoxicity of the cationic dendrimer. A combination of these amino acids on dendrimer generates a synergistic effect in gene delivery. The dual- and triple-functionalized dendrimers show minimal cytotoxicity on the transfected NIH 3T3 cells. Using this combination strategy, we can obtain triple-functionalized dendrimers with comparable transfection efficacy to several commercial transfection reagents. Such a combination strategy should be applicable to the design of efficient and biocompatible gene vectors for gene delivery.

Whole exome sequencing identified 1 base pair novel deletion in BCL2-associated athanogene 3 (BAG3) gene associated with severe dilated cardiomyopathy (DCM) requiring heart transplant in multiple family members.

PubMed

Rafiq, Muhammad Arshad; Chaudhry, Ayeshah; Care, Melanie; Spears, Danna A; Morel, Chantal F; Hamilton, Robert M

2017-03-01

Dilated cardiomyopathy (DCM) is characterized by dilation and impaired contraction of the left ventricle or both ventricles. Among hereditary DCM, the genetic causes are heterogeneous, and include mutations in genes encoding cytoskeletal, nucleoskeletal, mitochondrial, and calcium-handling proteins. We report three severely affected males, in a four-generation pedigree, with DCM phenotype who underwent cardiac transplant. Cardiomegaly with marked biventricular dilation and fibrosis were noticeable histopathological findings. The affected males had tested negative on a 46-gene pancardiomyopathy panel. Whole Exome Sequencing (WES) was performed to identify the mutation responsible for the DCM phenotype. A novel heterozygous 1-bp deletion (Chr10:121435979delC; c.913delC) in exon 4 of BAG3 was identified in the three affected males; it results in a frameshift and a premature termination codon (p.Met306-Stop), producing a truncated BAG3 protein lacking the functionally important PXXP and BAG domains. WES data were further utilized to map 10 SNP markers around the discovered mutation to generate a shared disease haplotype in all affected individuals, encompassing 11 Mb on 10q25.3-26.2 harboring BAG3. Finally, genotypes were inferred for the unavailable/deceased individuals in the pedigrees. Here we propose that Chr10:121435979delC in BAG3 is a causal mutation in these subjects. Our and earlier studies indicate that BAG3 mutations are associated with DCM phenotypes. BAG3 should be added to cardiomyopathy gene panels for screening of DCM patients, and patients previously considered gene elusive should undergo sequencing of the BAG3 gene. © 2017 Wiley Periodicals, Inc.
Gene alterations at Drosophila inversion breakpoints provide prima facie evidence for natural selection as an explanation for rapid chromosomal evolution

PubMed Central

2012-01-01

Background: Chromosomal inversions have been pervasive during the evolution of the genus Drosophila, but there is significant variation between lineages in the rate of rearrangement fixation. D. mojavensis, an ecological specialist adapted to a cactophilic niche under extreme desert conditions, is a chromosomally derived species with ten fixed inversions, five of them not present in any other species. Results: In order to explore the causes of the rapid chromosomal evolution in D. mojavensis, we identified and characterized all breakpoints of seven inversions fixed in chromosome 2, the most dynamic one. One of the inversions presents unequivocal evidence for its generation by ectopic recombination between transposon copies and another two harbor inverted duplications of non-repetitive DNA at the two breakpoints and were likely generated by staggered single-strand breaks and repair by non-homologous end joining. Four out of 14 breakpoints lay in the intergenic region between preexisting duplicated genes, suggesting an adaptive advantage of separating previously tightly linked duplicates. Four out of 14 breakpoints are associated with transposed genes, suggesting these breakpoints are fragile regions. Finally, two inversions contain novel genes at their breakpoints and another three show alterations of genes at breakpoints with potential adaptive significance. Conclusions: D. mojavensis chromosomal inversions were generated by multiple mechanisms, an observation that does not provide support for an increased mutation rate as an explanation for rapid chromosomal evolution. On the other hand, we have found a number of gene alterations at the breakpoints with putative adaptive consequences that directly point to natural selection as the cause of D. mojavensis rapid chromosomal evolution. PMID:22296923
LNDriver: identifying driver genes by integrating mutation and expression data based on gene-gene interaction network.

PubMed

Wei, Pi-Jing; Zhang, Di; Xia, Junfeng; Zheng, Chun-Hou

2016-12-23

Cancer is a complex disease which is characterized by the accumulation of genetic alterations during the patient's lifetime. With the development of next-generation sequencing technology, multiple omics data, such as cancer genomic, epigenomic and transcriptomic data, can be measured from each individual. Correspondingly, one of the key challenges is to pinpoint functional driver mutations or pathways, which contribute to tumorigenesis, from millions of functionally neutral passenger mutations. In this paper, in order to identify driver genes effectively, we applied a generalized additive model to mutation profiles to filter genes with long length and constructed a new gene-gene interaction network. Then we integrated the mutation data and expression data into the gene-gene interaction network. Lastly, a greedy algorithm was used to prioritize candidate driver genes from the integrated data. We named the proposed method Length-Net-Driver (LNDriver). Experiments on three TCGA datasets, i.e., head and neck squamous cell carcinoma, kidney renal clear cell carcinoma and thyroid carcinoma, demonstrated that the proposed method was effective. Also, it can identify not only frequently mutated drivers, but also rare candidate driver genes.
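The abstract does not spell out how the greedy prioritization works, so the following is only one plausible reading, sketched as a greedy set cover: each mutated gene is assumed to "explain" a set of expression events it is linked to in the interaction network, and genes are ranked by how many still-unexplained events they cover. The event encoding, the function name greedy_rank, and the toy data are all hypothetical and should not be taken as LNDriver's actual procedure.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical event encoding: (patient ID, gene with outlying expression).
Event = Tuple[str, str]


def greedy_rank(coverage: Dict[str, Set[Event]]) -> List[str]:
    """Rank mutated genes by repeatedly picking the gene that explains the
    most still-unexplained events (greedy set cover)."""
    uncovered: Set[Event] = set().union(*coverage.values()) if coverage else set()
    remaining = dict(coverage)
    ranking: List[str] = []
    while uncovered and remaining:
        # Iterating in sorted order breaks ties alphabetically (deterministic).
        best = max(sorted(remaining), key=lambda g: len(remaining[g] & uncovered))
        if not remaining[best] & uncovered:
            break  # no remaining gene explains anything new
        ranking.append(best)
        uncovered -= remaining.pop(best)
    return ranking


# Toy data: which expression events each mutated gene is linked to.
coverage = {
    "geneA": {("p1", "t1"), ("p2", "t1"), ("p3", "t2")},
    "geneB": {("p1", "t1"), ("p4", "t3")},
    "geneC": {("p2", "t1")},
}
ranking = greedy_rank(coverage)
print(ranking)
```

Under this reading, a rarely mutated gene can still rank highly if it is the only one that explains certain events (geneB here), which is in the spirit of the abstract's remark that rare candidate drivers can be identified; geneC is never selected because geneA already explains its only event.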
Identification of Single- and Multiple-Class Specific Signature Genes from Gene Expression Profiles by Group Marker Index

PubMed Central

Tsai, Yu-Shuen; Aguan, Kripamoy; Pal, Nikhil R.; Chung, I-Fang

2011-01-01

Informative genes from microarray data can be used to construct prediction models and investigate biological mechanisms. Differentially expressed genes, the main targets of most gene selection methods, can be classified as single- and multiple-class specific signature genes. Here, we present a novel gene selection algorithm based on a Group Marker Index (GMI), which is intuitive, of low-computational complexity, and efficient in identification of both types of genes. Most gene selection methods identify only single-class specific signature genes and cannot identify multiple-class specific signature genes easily. Our algorithm can detect de novo certain conditions of multiple-class specificity of a gene and makes use of a novel non-parametric indicator to assess the discrimination ability between classes. Our method is effective even when the sample size is small as well as when the class sizes are significantly different. To compare the effectiveness and robustness we formulate an intuitive template-based method and use four well-known datasets. We demonstrate that our algorithm outperforms the template-based method in difficult cases with unbalanced distribution. Moreover, the multiple-class specific genes are good biomarkers and play important roles in biological pathways. Our literature survey supports that the proposed method identifies unique multiple-class specific marker genes (not reported earlier to be related to cancer) in the Central Nervous System data. It also discovers unique biomarkers indicating the intrinsic difference between subtypes of lung cancer. We also associate the pathway information with the multiple-class specific signature genes and cross-reference to published studies. We find that the identified genes participate in the pathways directly involved in cancer development in leukemia data. Our method offers a promising way to find genes that may be involved in pathways of multiple diseases, and hence opens up the possibility of using an existing drug on other diseases as well as designing a single drug for multiple diseases. PMID:21909426
Emergent biomarker derived from next-generation sequencing to identify pain patients requiring uncommonly high opioid doses

PubMed Central

Kringel, D; Ultsch, A; Zimmermann, M; Jansen, J-P; Ilias, W; Freynhagen, R; Griessinger, N; Kopf, A; Stein, C; Doehring, A; Resch, E; Lötsch, J

2017-01-01

Next-generation sequencing (NGS) provides unrestricted access to the genome, but it produces ‘big data’ exceeding in amount and complexity the classical analytical approaches. We introduce a bioinformatics-based classifying biomarker that uses emergent properties in genetics to separate pain patients requiring extremely high opioid doses from controls. Following precisely calculated selection of the 34 most informative markers in the OPRM1, OPRK1, OPRD1 and SIGMAR1 genes, pattern of genotypes belonging to either patient group could be derived using a k-nearest neighbor (kNN) classifier that provided a diagnostic accuracy of 80.6±4%. This outperformed alternative classifiers such as reportedly functional opioid receptor gene variants or complex biomarkers obtained via multiple regression or decision tree analysis. The accumulation of several genetic variants with only minor functional influences may result in a qualitative consequence affecting complex phenotypes, pointing at emergent properties in genetics. PMID:27139154
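To make the classification step in the abstract above concrete, here is a minimal k-nearest neighbor sketch on genotype vectors. It is an illustration under stated assumptions, not the authors' pipeline: genotypes are assumed to be coded as 0/1/2 (copies of the variant allele), the distance is a simple sum of absolute differences, and the function name `knn_predict`, the class labels and the toy data are hypothetical.

```python
from collections import Counter


def knn_predict(train_x, train_y, query, k=3):
    """Predict the label of `query` by majority vote among its k nearest neighbors.

    train_x: list of genotype tuples (assumed 0/1/2 coding per marker).
    train_y: list of class labels, aligned with train_x.
    """
    # Distance = sum of absolute allele-count differences across markers.
    dists = sorted(
        (sum(abs(a - b) for a, b in zip(x, query)), i)
        for i, x in enumerate(train_x)
    )
    votes = Counter(train_y[i] for _, i in dists[:k])
    return votes.most_common(1)[0][0]


# Toy data: three markers per subject, two made-up groups.
train_x = [(0, 0, 1), (0, 1, 1), (0, 0, 0), (2, 2, 1), (2, 1, 2), (1, 2, 2)]
train_y = ["control"] * 3 + ["high_dose"] * 3

label_a = knn_predict(train_x, train_y, (2, 2, 2))
label_b = knn_predict(train_x, train_y, (0, 0, 1))
```

With an odd k and two classes the vote cannot tie; a real analysis would also need marker selection and cross-validation, which this sketch omits.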
Emergent biomarker derived from next-generation sequencing to identify pain patients requiring uncommonly high opioid doses.

PubMed

Kringel, D; Ultsch, A; Zimmermann, M; Jansen, J-P; Ilias, W; Freynhagen, R; Griessinger, N; Kopf, A; Stein, C; Doehring, A; Resch, E; Lötsch, J

2017-10-01

Next-generation sequencing (NGS) provides unrestricted access to the genome, but it produces 'big data' exceeding in amount and complexity the classical analytical approaches. We introduce a bioinformatics-based classifying biomarker that uses emergent properties in genetics to separate pain patients requiring extremely high opioid doses from controls. Following precisely calculated selection of the 34 most informative markers in the OPRM1, OPRK1, OPRD1 and SIGMAR1 genes, pattern of genotypes belonging to either patient group could be derived using a k-nearest neighbor (kNN) classifier that provided a diagnostic accuracy of 80.6±4%. This outperformed alternative classifiers such as reportedly functional opioid receptor gene variants or complex biomarkers obtained via multiple regression or decision tree analysis. The accumulation of several genetic variants with only minor functional influences may result in a qualitative consequence affecting complex phenotypes, pointing at emergent properties in genetics.
GEMC1 is a critical regulator of multiciliated cell differentiation.

PubMed

Terré, Berta; Piergiovanni, Gabriele; Segura-Bayona, Sandra; Gil-Gómez, Gabriel; Youssef, Sameh A; Attolini, Camille Stephan-Otto; Wilsch-Bräuninger, Michaela; Jung, Carole; Rojas, Ana M; Marjanović, Marko; Knobel, Philip A; Palenzuela, Lluís; López-Rovira, Teresa; Forrow, Stephen; Huttner, Wieland B; Valverde, Miguel A; de Bruin, Alain; Costanzo, Vincenzo; Stracker, Travis H

2016-05-02

The generation of multiciliated cells (MCCs) is required for the proper function of many tissues, including the respiratory tract, brain, and germline. Defects in MCC development have been demonstrated to cause a subclass of mucociliary clearance disorders termed reduced generation of multiple motile cilia (RGMC). To date, only two genes, Multicilin (MCIDAS) and cyclin O (CCNO) have been identified in this disorder in humans. Here, we describe mice lacking GEMC1 (GMNC), a protein with a similar domain organization as Multicilin that has been implicated in DNA replication control. We have found that GEMC1-deficient mice are growth impaired, develop hydrocephaly with a high penetrance, and are infertile, due to defects in the formation of MCCs in the brain, respiratory tract, and germline. Our data demonstrate that GEMC1 is a critical regulator of MCC differentiation and a candidate gene for human RGMC or related disorders. © 2016 The Authors.
Multiple Renal Cyst Development but Not Situs Abnormalities in Transgenic RNAi Mice against Inv::GFP Rescue Gene

PubMed Central

Kamijho, Yuki; Shiozaki, Yayoi; Sakurai, Eiki; Hanaoka, Kazunori; Watanabe, Daisuke

2014-01-01

In this study we generated RNA interference (RNAi)-mediated gene knockdown transgenic mice (transgenic RNAi mice) against the functional Inv gene. Inv mutant mice show consistently reversed internal organs (situs inversus), multiple renal cysts and neonatal lethality. The Inv::GFP-rescue mice, which introduced the Inv::GFP fusion gene, can rescue inv mutant mice phenotypes. This indicates that the Inv::GFP gene is functional in vivo. To analyze the physiological functions of the Inv gene, and to demonstrate the availability of transgenic RNAi mice, we introduced a short hairpin RNA expression vector against GFP mRNA into Inv::GFP-rescue mice and analyzed the gene silencing effects and Inv functions by examining phenotypes. Transgenic RNAi mice with the Inv::GFP-rescue gene (Inv-KD mice) down-regulated Inv::GFP fusion protein and showed hypomorphic phenotypes of inv mutant mice, such as renal cyst development, but not situs abnormalities or postnatal lethality. This indicates that shRNAi-mediated gene silencing systems that target the tag sequence of the fusion gene work properly in vivo, and suggests that a relatively high level of Inv protein is required for kidney development in contrast to left/right axis determination. Inv::GFP protein was significantly down-regulated in the germ cells of Inv-KD mice testis compared with somatic cells, suggesting the existence of a testicular germ cell-specific enhanced RNAi system that regulates germ cell development. The Inv-KD mouse is useful for studying Inv gene functions in adult tissue that are unable to be analyzed in inv mutant mice showing postnatal lethality. In addition, the shRNA-based gene silencing system against the tag sequence of the fusion gene can be utilized as a new technique to regulate gene expression in either in vitro or in vivo experiments. PMID:24586938
Identification and Correction of Additive and Multiplicative Spatial Biases in Experimental High-Throughput Screening.

PubMed

Mazoure, Bogdan; Caraus, Iurie; Nadon, Robert; Makarenkov, Vladimir

2018-06-01

Data generated by high-throughput screening (HTS) technologies are prone to spatial bias. Traditionally, bias correction methods used in HTS assume either a simple additive or, more recently, a simple multiplicative spatial bias model. These models do not, however, always provide an accurate correction of measurements in wells located at the intersection of rows and columns affected by spatial bias. The measurements in these wells depend on the nature of interaction between the involved biases. Here, we propose two novel additive and two novel multiplicative spatial bias models accounting for different types of bias interactions. We describe a statistical procedure that allows for detecting and removing different types of additive and multiplicative spatial biases from multiwell plates. We show how this procedure can be applied by analyzing data generated by the four HTS technologies (homogeneous, microorganism, cell-based, and gene expression HTS), the three high-content screening (HCS) technologies (area, intensity, and cell-count HCS), and the only small-molecule microarray technology available in the ChemBank small-molecule screening database. The proposed methods are included in the AssayCorrector program, implemented in R, and available on CRAN.
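The simple additive and multiplicative models mentioned in the abstract above can be illustrated with a short sketch. This is not the AssayCorrector procedure and does not include the paper's new interaction-aware models; it assumes a plain additive model (measurement = plate level + row effect + column effect + signal), removed by median polish, and treats the multiplicative case by applying the same step on the log scale. The function names and the toy plate are hypothetical.

```python
import math
from statistics import median


def median_polish_residuals(plate, n_iter=10):
    """Remove additive row and column effects; returns residuals centered near 0."""
    resid = [row[:] for row in plate]
    for _ in range(n_iter):
        for row in resid:  # subtract each row's median
            m = median(row)
            for j in range(len(row)):
                row[j] -= m
        for j in range(len(resid[0])):  # subtract each column's median
            m = median(r[j] for r in resid)
            for r in resid:
                r[j] -= m
    return resid


def correct_multiplicative(plate, n_iter=10):
    """Multiplicative bias: polish the logs, return ratios to the fitted value.

    Assumes all measurements are strictly positive.
    """
    logs = [[math.log(v) for v in row] for row in plate]
    return [[math.exp(v) for v in row] for row in median_polish_residuals(logs, n_iter)]


# Toy 3x4 plate: level 100 plus additive row and column biases, one true hit.
row_bias = [0, 5, -3]
col_bias = [0, 2, -1, 4]
plate = [[100 + r + c for c in col_bias] for r in row_bias]
plate[1][2] += 50  # a well with genuine signal

corrected = median_polish_residuals(plate)
```

Medians are used rather than means so that a few strong hits do not distort the estimated row and column effects.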
Integrating Multiple Data Sources for Combinatorial Marker Discovery: A Study in Tumorigenesis.

PubMed

Bandyopadhyay, Sanghamitra; Mallik, Saurav

2018-01-01

Identification of combinatorial markers from multiple data sources is a challenging task in bioinformatics. Here, we propose a novel computational framework for identifying significant combinatorial markers ( s) using both gene expression and methylation data. The gene expression and methylation data are integrated into a single continuous data as well as a (post-discretized) boolean data based on their intrinsic (i.e., inverse) relationship. A novel combined score of methylation and expression data (viz., ) is introduced which is computed on the integrated continuous data for identifying initial non-redundant set of genes. Thereafter, (maximal) frequent closed homogeneous genesets are identified using a well-known biclustering algorithm applied on the integrated boolean data of the determined non-redundant set of genes. A novel sample-based weighted support ( ) is then proposed that is consecutively calculated on the integrated boolean data of the determined non-redundant set of genes in order to identify the non-redundant significant genesets. The top few resulting genesets are identified as potential s. Since our proposed method generates a smaller number of significant non-redundant genesets than those by other popular methods, the method is much faster than the others. Application of the proposed technique on an expression and a methylation data for Uterine tumor or Prostate Carcinoma produces a set of significant combination of markers. We expect that such a combination of markers will produce lower false positives than individual markers.
Multiple PAR and E4BP4 bZIP transcription factors in zebrafish: diverse spatial and temporal expression patterns.

PubMed

Ben-Moshe, Zohar; Vatine, Gad; Alon, Shahar; Tovin, Adi; Mracek, Philipp; Foulkes, Nicholas S; Gothilf, Yoav

2010-09-01

Circadian rhythms of physiology and behavior are generated by an autonomous circadian oscillator that is synchronized daily with the environment, mainly by light input. The PAR subfamily of transcriptional activators and the related E4BP4 repressor belonging to the basic leucine zipper (bZIP) family are clock-controlled genes that are suggested to mediate downstream circadian clock processes and to feedback onto the core oscillator. Here, the authors report the characterization of these genes in the zebrafish, an increasingly important model in the field of chronobiology. Five novel PAR and six novel e4bp4 zebrafish homolog genes were identified using bioinformatic tools and their coding sequences were cloned. Based on their evolutionary relationships, these genes were annotated as ztef2, zhlf1 and zhlf2, zdbp1 and zdbp2, and ze4bp4-1 to -6. The spatial and temporal mRNA expression pattern of each of these factors was characterized in zebrafish embryos in the context of a functional circadian clock and regulation by light. Nine of the factors exhibited augmented and rhythmic expression in the pineal gland, a central clock organ in zebrafish. Moreover, these genes were found to be regulated, to variable extents, by the circadian clock and/or by light. Differential expression patterns of multiple paralogs in zebrafish suggest multiple roles for these factors within the vertebrate circadian clock. This study, in the genetically accessible zebrafish model, lays the foundation for further research regarding the involvement and specific roles of PAR and E4BP4 transcription factors in the vertebrate circadian clock mechanism.
Highly efficient gene transfer using a retroviral vector into murine T cells for preclinical chimeric antigen receptor-expressing T cell therapy

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kusabuka, Hotaka; Fujiwara, Kento; Tokunaga, Yusuke

Adoptive immunotherapy using chimeric antigen receptor-expressing T (CAR-T) cells has attracted attention as an efficacious strategy for cancer treatment. To prove the efficacy and safety of CAR-T cell therapy, the elucidation of immunological mechanisms underlying it in mice is required. Although a retroviral vector (Rv) is mainly used for the introduction of CAR to murine T cells, gene transduction efficiency is generally less than 50%. The low transduction efficiency causes poor precision in the functional analysis of CAR-T cells. We attempted to improve the Rv gene transduction protocol to more efficiently generate functional CAR-T cells by optimizing the period of pre-cultivation and antibody stimulation. In the improved protocol, gene transduction efficiency to murine T cells was more than 90%. In addition, almost all of the prepared murine T cells expressed CAR after puromycin selection. These CAR-T cells had antigen-specific cytotoxic activity and secreted multiple cytokines by antigen stimulation. We believe that our optimized gene transduction protocol for murine T cells contributes to the advancement of T cell biology and development of immunotherapy using genetically engineered T cells. - Highlights: • We established highly efficient gene transduction protocols for murine T cells. • CD8+ CAR-T cells had antigen-specific cytotoxic activity. • CD4+ CAR-T cells secreted multiple cytokines by antigen stimulation. • This finding can contribute to the development of T-cell biology and immunotherapy.
Complexity in the cattle CD94/NKG2 gene families.

PubMed

Birch, James; Ellis, Shirley A

2007-04-01

Natural killer cell responses are controlled to a large extent by the interaction of an array of inhibitory and activating receptors with their ligands. The mostly nonpolymorphic CD94/NKG2 receptors in both humans and mice were shown to recognize a single nonclassical MHC class I molecule in each case. In this paper, we describe the CD94/NKG2 gene family in cattle. NKG2 and CD94 sequences were amplified from cDNA derived from four animals. Four CD94 sequences, ten NKG2A, and three NKG2C sequences were identified in total. In contrast to human, we show that cattle have multiple distinct NKG2A genes, some of which show minor allelic variation. All of the sequences designated NKG2A have two tyrosine-based inhibitory motifs in the cytoplasmic domain and one putative gene has, in addition, a charged residue in the transmembrane domain. NKG2C appears to be essentially monomorphic in cattle. All of the NKG2A sequences are similar apart from NKG2A-01, which, in contrast, shares the majority of its carbohydrate recognition domain with NKG2-C. Most of the genes appear to generate multiple alternatively spliced forms. These findings suggest that the CD94/NKG2A heterodimers in cattle, in contrast to other species, are binding several different ligands. Because NKG2C is not polymorphic, this raises questions as to the combined functional capacity of the CD94/NKG2 gene families in cattle.
pico-PLAZA, a genome database of microbial photosynthetic eukaryotes.

PubMed

Vandepoele, Klaas; Van Bel, Michiel; Richard, Guilhem; Van Landeghem, Sofie; Verhelst, Bram; Moreau, Hervé; Van de Peer, Yves; Grimsley, Nigel; Piganeau, Gwenael

2013-08-01

With the advent of next generation genome sequencing, the number of sequenced algal genomes and transcriptomes is rapidly growing. Although a few genome portals exist to browse individual genome sequences, exploring complete genome information from multiple species for the analysis of user-defined sequences or gene lists remains a major challenge. pico-PLAZA is a web-based resource (http://bioinformatics.psb.ugent.be/pico-plaza/) for algal genomics that combines different data types with intuitive tools to explore genomic diversity, perform integrative evolutionary sequence analysis and study gene functions. Apart from homologous gene families, multiple sequence alignments, phylogenetic trees, Gene Ontology, InterPro and text-mining functional annotations, different interactive viewers are available to study genome organization using gene collinearity and synteny information. Different search functions, documentation pages, export functions and an extensive glossary are available to guide non-expert scientists. To illustrate the versatility of the platform, different case studies are presented demonstrating how pico-PLAZA can be used to functionally characterize large-scale EST/RNA-Seq data sets and to perform environmental genomics. Functional enrichments analysis of 16 Phaeodactylum tricornutum transcriptome libraries offers a molecular view on diatom adaptation to different environments of ecological relevance. Furthermore, we show how complementary genomic data sources can easily be combined to identify marker genes to study the diversity and distribution of algal species, for example in metagenomes, or to quantify intraspecific diversity from environmental strains. © 2013 John Wiley & Sons Ltd and Society for Applied Microbiology.
Molecular and FISH analyses of a 53-kbp intact DNA fragment inserted by biolistics in wheat (Triticum aestivum L.) genome.

PubMed

Partier, A; Gay, G; Tassy, C; Beckert, M; Feuillet, C; Barret, P

2017-10-01

A large, 53-kbp, intact DNA fragment was inserted into the wheat (Triticum aestivum L.) genome. FISH analyses of individual transgenic events revealed multiple insertions of intact fragments. Transferring large intact DNA fragments containing clusters of resistance genes or complete metabolic pathways into the wheat genome remains a challenge. In a previous work, we showed that the use of dephosphorylated cassettes for wheat transformation enabled the production of simple integration patterns. Here, we used the same technology to produce a cassette containing a 44-kb Arabidopsis thaliana BAC, flanked by one selection gene and one reporter gene. This 53-kb linear cassette was integrated in the bread wheat (Triticum aestivum L.) genome by biolistic transformation. Our results showed that transgenic plants harboring the entire cassette were generated. The inheritability of the cassette was demonstrated in the T1 and T2 generation. Surprisingly, FISH analysis performed on T1 progeny of independent events identified double genomic insertions of intact fragments in non-homoeologous positions. Inheritability of these double insertions was demonstrated by FISH analysis of the T1 generation. Relative conclusions that can be drawn from molecular or FISH analysis are discussed along with future prospects of the engineering of large fragments for wheat transformation or genome editing.
Silencing of Endogenous IL-10 in Human Dendritic Cells Leads to the Generation of an Improved CTL Response Against Human Melanoma Associated Antigenic Epitope, MART-1 27−35

PubMed Central

Chhabra, Arvind; Chakraborty, Nityo G.; Mukherji, Bijay

2008-01-01

Dendritic cells (DC) present antigenic epitopes to and activate T cells. They also polarize the ensuing T cell response to a Th1 or Th2 type response, depending on their cytokine production profile. For example, IL-12 producing DC generate a Th1 type T cell response whereas IL-10 producing DC are usually tolerogenic. Different strategies, such as the use of cytokines and anti-cytokine antibodies, dominant negative forms of protein, anti-sense RNA etc., have been employed to influence the cytokine synthetic profile of DC as well as to make DC more immunogenic. Utilizing GFP expressing recombinant adenoviruses in association with lipid-mediated transfection of siRNA, we have silenced the endogenous IL-10 gene in DC. We show that IL-10 gene silenced DC produce more IL-12 and also generate a better cytolytic T cell response against the human melanoma associated epitope, MART-1 27−35, in vitro. We also show that the GFP expressing adenoviral vector can be used to optimize the parameters for siRNA delivery in primary cells and show that RNA interference methodology can efficiently knock down virus encoded genes transcribed at very high multiplicity of infection in DC. PMID:18249038
Reengineering a transmembrane protein to treat muscular dystrophy using exon skipping.

PubMed

Gao, Quan Q; Wyatt, Eugene; Goldstein, Jeff A; LoPresti, Peter; Castillo, Lisa M; Gazda, Alec; Petrossian, Natalie; Earley, Judy U; Hadhazy, Michele; Barefield, David Y; Demonbreun, Alexis R; Bönnemann, Carsten; Wolf, Matthew; McNally, Elizabeth M

2015-11-02

Exon skipping uses antisense oligonucleotides as a treatment for genetic diseases. The antisense oligonucleotides used for exon skipping are designed to bypass premature stop codons in the target RNA and restore reading frame disruption. Exon skipping is currently being tested in humans with dystrophin gene mutations who have Duchenne muscular dystrophy. For Duchenne muscular dystrophy, the rationale for exon skipping derived from observations in patients with naturally occurring dystrophin gene mutations that generated internally deleted but partially functional dystrophin proteins. We have now expanded the potential for exon skipping by testing whether an internal, in-frame truncation of a transmembrane protein γ-sarcoglycan is functional. We generated an internally truncated γ-sarcoglycan protein that we have termed Mini-Gamma by deleting a large portion of the extracellular domain. Mini-Gamma provided functional and pathological benefits to correct the loss of γ-sarcoglycan in a Drosophila model, in heterologous cell expression studies, and in transgenic mice lacking γ-sarcoglycan. We generated a cellular model of human muscle disease and showed that multiple exon skipping could be induced in RNA that encodes a mutant human γ-sarcoglycan. Since Mini-Gamma represents removal of 4 of the 7 coding exons in γ-sarcoglycan, this approach provides a viable strategy to treat the majority of patients with γ-sarcoglycan gene mutations.
Reengineering a transmembrane protein to treat muscular dystrophy using exon skipping

PubMed Central

Gao, Quan Q.; Wyatt, Eugene; Goldstein, Jeff A.; LoPresti, Peter; Castillo, Lisa M.; Gazda, Alec; Petrossian, Natalie; Earley, Judy U.; Hadhazy, Michele; Barefield, David Y.; Demonbreun, Alexis R.; Bönnemann, Carsten; Wolf, Matthew; McNally, Elizabeth M.

2015-01-01

Exon skipping uses antisense oligonucleotides as a treatment for genetic diseases. The antisense oligonucleotides used for exon skipping are designed to bypass premature stop codons in the target RNA and restore reading frame disruption. Exon skipping is currently being tested in humans with dystrophin gene mutations who have Duchenne muscular dystrophy. For Duchenne muscular dystrophy, the rationale for exon skipping derived from observations in patients with naturally occurring dystrophin gene mutations that generated internally deleted but partially functional dystrophin proteins. We have now expanded the potential for exon skipping by testing whether an internal, in-frame truncation of a transmembrane protein γ-sarcoglycan is functional. We generated an internally truncated γ-sarcoglycan protein that we have termed Mini-Gamma by deleting a large portion of the extracellular domain. Mini-Gamma provided functional and pathological benefits to correct the loss of γ-sarcoglycan in a Drosophila model, in heterologous cell expression studies, and in transgenic mice lacking γ-sarcoglycan. We generated a cellular model of human muscle disease and showed that multiple exon skipping could be induced in RNA that encodes a mutant human γ-sarcoglycan. Since Mini-Gamma represents removal of 4 of the 7 coding exons in γ-sarcoglycan, this approach provides a viable strategy to treat the majority of patients with γ-sarcoglycan gene mutations. PMID:26457733
H-2 compatibility requirement for virus-specific T-cell-mediated cytolysis. Evaluation of the role of H-2I region and non-H-2 genes in regulating immune response

PubMed Central

1976-01-01

Lymphocytic choriomeningitis virus (LCMV) and ectromelia virus-specific T-cell-mediated cytotoxicity was assayed in various strain combinations using as targets peritoneal macrophages which have been shown to express Ia antigens. Virus-specific cytotoxicity was found only in H-2K- or D-region compatible combinations. I-region compatibility was not necessary nor alone sufficient for lysis. Six different I-region specificities had no obvious effect on the capacity to generate in vivo specific cytotoxicity (expressed in vitro) associated with Dd. Low LCMV-specific cytotoxic activity generated in DBA/2 mice was caused by the non-H-2 genetic background. This trait was inversely related to the infectious virus dose and recessive. Non-H-2 genes, possibly involved in controlling initial spread and multiplication of virus, seem to be, at least in the examples tested, more important in determining virus-specific cytotoxic T-cell activity in spleens than are Ir genes coded in H-2. PMID:1085331
Use of Partial Least Squares improves the efficacy of removing unwanted variability in differential expression analyses based on RNA-Seq data.

PubMed

Chakraborty, Sutirtha

2018-05-26

RNA-Seq technology has revolutionized the face of gene expression profiling by generating read count data measuring the transcript abundances for each queried gene on multiple experimental subjects. But on the downside, the underlying technical artefacts and hidden biological profiles of the samples generate a wide variety of latent effects that may potentially distort the actual transcript/gene expression signals. Standard normalization techniques fail to correct for these hidden variables and lead to flawed downstream analyses. In this work I demonstrate the use of Partial Least Squares (built as an R package 'SVAPLSseq') to correct for the traces of extraneous variability in RNA-Seq data. A novel and thorough comparative analysis of the PLS based method is presented along with some of the other popularly used approaches for latent variable correction in RNA-Seq. Overall, the method is found to achieve a substantially improved estimation of the hidden effect signatures in the RNA-Seq transcriptome expression landscape compared to other available techniques. Copyright © 2017. Published by Elsevier Inc.

Tissue-Specific Chromatin Modifications at a Multigene Locus Generate Asymmetric Transcriptional Interactions

PubMed Central

Yoo, Eung Jae; Cajiao, Isabela; Kim, Jeong-Seon; Kimura, Atsushi P.; Zhang, Aiwen; Cooke, Nancy E.; Liebhaber, Stephen A.

2006-01-01

Random assortment within mammalian genomes juxtaposes genes with distinct expression profiles. This organization, along with the prevalence of long-range regulatory controls, generates a potential for aberrant transcriptional interactions. The human CD79b/GH locus contains six tightly linked genes with three mutually exclusive tissue specificities and interdigitated control elements. One consequence of this compact organization is that the pituitary cell-specific transcriptional events that activate hGH-N also trigger ectopic activation of CD79b. However, the B-cell-specific events that activate CD79b do not trigger reciprocal activation of hGH-N. Here we utilized DNase I hypersensitive site mapping, chromatin immunoprecipitation, and transgenic models to explore the basis for this asymmetric relationship. The results reveal tissue-specific patterns of chromatin structures and transcriptional controls at the CD79b/GH locus in B cells distinct from those in the pituitary gland and placenta. These three unique transcriptional environments suggest a set of corresponding gene expression pathways and transcriptional interactions that are likely to be found juxtaposed at multiple sites within the eukaryotic genome. PMID:16847312
Targeted therapy according to next generation sequencing-based panel sequencing.

PubMed

Saito, Motonobu; Momma, Tomoyuki; Kono, Koji

2018-04-17

Targeted therapy against actionable gene mutations shows a significantly higher response rate as well as longer survival compared to conventional chemotherapy, and has become a standard therapy for many cancers. Recent progress in next-generation sequencing (NGS) has made it possible to identify a huge number of genetic aberrations. Based on sequencing results, patients are recommended to undergo targeted therapy or immunotherapy. In cases where there are no approved drugs available for the genetic mutations detected in a patient, it is recommended to facilitate registration for clinical trials. For that purpose, an NGS-based sequencing panel that can simultaneously target multiple genes in a single investigation has been used in daily clinical practice. To date, various types of sequencing panels have been developed to investigate genetic aberrations with tumor somatic genome variants (gain-of-function or loss-of-function mutations, high-level copy number alterations, and gene fusions) through comprehensive bioinformatics. Because sequencing panels are efficient and cost-effective, they are quickly being adopted outside the lab, in hospitals and clinics, in order to identify personalized targeted therapy for individual cancer patients.
One-Step and Stepwise Magnification of a BOBBED LETHAL Chromosome in DROSOPHILA MELANOGASTER

PubMed Central

Endow, Sharyn A.; Komma, Donald J.

1986-01-01

Bobbed lethal (bbl) chromosomes carry too few ribosomal genes for homozygous flies to be viable. Reversion of bbl chromosomes to bb or nearly bb+ occurs under magnifying conditions at a low frequency in a single generation. These reversions occur too rapidly to be accounted for by single unequal sister chromatid exchanges and seem unlikely to be due to multiple sister strand exchanges within a given cell lineage. Analysis of several one-step revertants indicates that they are X-Y recombinant chromosomes which probably arise from X-Y recombination at bb. The addition of ribosomal genes from the Y chromosome to the bbl chromosome explains the more rapid reversion of the bbl chromosome than is permitted by single events of unequal sister chromatid exchange. Analysis of stepwise bbl magnified chromosomes, which were selected over a period of 4–9 magnifying generations, shows ribosomal gene patterns that are closely similar to each other. Similarity in rDNA pattern among stepwise magnified products of the same parental chromosome is consistent with reversion by a mechanism of unequal sister strand exchange. PMID:3095184
The Natural History of Class I Primate Alcohol Dehydrogenases Includes Gene Duplication, Gene Loss, and Gene Conversion

PubMed Central

Carrigan, Matthew A.; Uryasev, Oleg; Davis, Ross P.; Zhai, LanMin; Hurley, Thomas D.; Benner, Steven A.

2012-01-01

Background Gene duplication is a source of molecular innovation throughout evolution. However, even with massive amounts of genome sequence data, correlating gene duplication with speciation and other events in natural history can be difficult. This is especially true in its most interesting cases, where rapid and multiple duplications are likely to reflect adaptation to rapidly changing environments and life styles. This may be so for Class I of alcohol dehydrogenases (ADH1s), where multiple duplications occurred in primate lineages in Old and New World monkeys (OWMs and NWMs) and hominoids. Methodology/Principal Findings To build a preferred model for the natural history of ADH1s, we determined the sequences of nine new ADH1 genes, finding for the first time multiple paralogs in various prosimians (lemurs, strepsirhines). Database mining then identified novel ADH1 paralogs in both macaque (an OWM) and marmoset (a NWM). These were used with the previously identified human paralogs to resolve controversies relating to dates of duplication and gene conversion in the ADH1 family. Central to these controversies are differences in the topologies of trees generated from exonic (coding) sequences and intronic sequences. Conclusions/Significance We provide evidence that gene conversions are the primary source of difference, using molecular clock dating of duplications and analyses of microinsertions and deletions (micro-indels). The tree topology inferred from intron sequences appears to more correctly represent the natural history of ADH1s, with the ADH1 paralogs in platyrrhines (NWMs) and catarrhines (OWMs and hominoids) having arisen by duplications shortly predating the divergence of OWMs and NWMs. We also conclude that paralogs in lemurs arose independently. Finally, we identify errors in database interpretation as the source of controversies concerning gene conversion. These analyses provide a model for the natural history of ADH1s that posits four ADH1 paralogs in the ancestor of Catarrhine and Platyrrhine primates, followed by the loss of an ADH1 paralog in the human lineage. PMID:22859968
Feasibility of a workflow for the molecular characterization of single cells by next generation sequencing.

PubMed

Salvianti, Francesca; Rotunno, Giada; Galardi, Francesca; De Luca, Francesca; Pestrin, Marta; Vannucchi, Alessandro Maria; Di Leo, Angelo; Pazzagli, Mario; Pinzani, Pamela

2015-09-01

The purpose of the study was to explore the feasibility of a protocol for the isolation and molecular characterization of single circulating tumor cells (CTCs) from cancer patients using a single-cell next generation sequencing (NGS) approach. To reach this goal we used as a model an artificial sample obtained by spiking a breast cancer cell line (MDA-MB-231) into the blood of a healthy donor. Tumor cells were enriched and enumerated by CellSearch® and subsequently isolated by DEPArray™ to obtain single or pooled pure samples to be submitted to the analysis of the mutational status of multiple genes involved in cancer. Upon whole genome amplification, samples were analysed by NGS on the Ion Torrent PGM™ system (Life Technologies) using the Ion AmpliSeq™ Cancer Hotspot Panel v2 (Life Technologies), designed to investigate genomic "hot spot" regions of 50 oncogenes and tumor suppressor genes. We successfully sequenced five single cells, a pool of 5 cells and DNA from a cellular pellet of the same cell line with a mean depth of the sequencing reaction ranging from 1581 to 3479 reads. We found 27 sequence variants in 18 genes, 15 of which were already reported in the COSMIC or dbSNP databases. We confirmed the presence of two somatic mutations, in the BRAF and TP53 genes, which had already been reported for this cell line, but also found new mutations and single nucleotide polymorphisms. Three variants were common to all the analysed samples, while 18 were present only in a single cell, suggesting a high heterogeneity within the same cell line. This paper presents an optimized workflow for the molecular characterization of multiple genes in single cells by NGS. The described pipeline can be easily transferred to the study of single CTCs from oncologic patients.
Evolution of Daily Gene Co-expression Patterns from Algae to Plants

PubMed Central

de los Reyes, Pedro; Romero-Campero, Francisco J.; Ruiz, M. Teresa; Romero, José M.; Valverde, Federico

2017-01-01

Daily rhythms play a key role in transcriptome regulation in plants and microalgae, orchestrating responses that, among other processes, anticipate light transitions that are essential for their metabolism and development. The recent accumulation of genome-wide transcriptomic data generated under alternating light:dark periods from plants and microalgae has made possible integrative and comparative analyses that could help shed light on the evolution of daily rhythms in the green lineage. In this work, RNA-seq and microarray data generated over 24 h periods in different light regimes from the eudicot Arabidopsis thaliana and the microalgae Chlamydomonas reinhardtii and Ostreococcus tauri have been integrated and analyzed using gene co-expression networks. This analysis revealed a reduction in the size of the daily rhythmic transcriptome from around 90% in Ostreococcus, being heavily influenced by light transitions, to around 40% in Arabidopsis, where a certain independence from light transitions can be observed. A novel Multiple Bidirectional Best Hit (MBBH) algorithm was applied to associate single genes with a family of potential orthologues from evolutionarily distant species. Gene duplication, amplification and divergence of rhythmic expression profiles seem to have played a central role in the evolution of gene families in the green lineage such as Pseudo Response Regulators (PRRs), CONSTANS-Likes (COLs), and DNA-binding with One Finger (DOFs). Gene clustering and functional enrichment have been used to identify groups of genes with similar rhythmic gene expression patterns. The comparison of gene clusters between species based on potential orthologous relationships has unveiled a low to moderate level of conservation of daily rhythmic expression patterns. However, a strikingly high conservation was found for the gene clusters exhibiting their highest and/or lowest expression value during the light transitions. PMID:28751903
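The abstract above does not describe how the MBBH algorithm works, so the following is only a minimal sketch of the classic one-to-one bidirectional (reciprocal) best hit idea that its name builds on, not the authors' method. The function names, the score dictionaries, and the gene labels are all hypothetical and chosen for illustration.

```python
def best_hits(scores):
    """Return the best-scoring subject for each query.

    scores: dict mapping (query, subject) -> similarity score
    (hypothetical input; in practice such scores might come from
    a sequence-similarity search).
    """
    best = {}
    for (query, subject), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b, b_vs_a):
    """Pairs (a, b) where b is a's best hit and a is b's best hit."""
    forward = best_hits(a_vs_b)
    reverse = best_hits(b_vs_a)
    return sorted((a, b) for a, b in forward.items() if reverse.get(b) == a)


# Toy data: a1 and b1 pick each other; a2 picks b2, but b2 prefers a1.
a_vs_b = {("a1", "b1"): 90, ("a1", "b2"): 50, ("a2", "b2"): 70}
b_vs_a = {("b1", "a1"): 88, ("b2", "a1"): 60, ("b2", "a2"): 55}
pairs = reciprocal_best_hits(a_vs_b, b_vs_a)
```

A one-to-one scheme like this leaves a2 unassigned, which suggests why a method that associates a single gene with a family of potential orthologues could be useful across evolutionarily distant species.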
Broad genomic and transcriptional analysis reveals a highly derived genome in dinoflagellate mitochondria

PubMed Central

Jackson, Christopher J; Norman, John E; Schnare, Murray N; Gray, Michael W; Keeling, Patrick J; Waller, Ross F

2007-01-01

Background Dinoflagellates comprise an ecologically significant and diverse eukaryotic phylum that is sister to the phylum containing apicomplexan endoparasites. The mitochondrial genome of apicomplexans is uniquely reduced in gene content and size, encoding only three proteins and two ribosomal RNAs (rRNAs) within a highly compacted 6 kb DNA. Dinoflagellate mitochondrial genomes have been comparatively poorly studied: limited available data suggest some similarities with apicomplexan mitochondrial genomes but an even more radical type of genomic organization. Here, we investigate structure, content and expression of dinoflagellate mitochondrial genomes. Results From two dinoflagellates, Crypthecodinium cohnii and Karlodinium micrum, we generated over 42 kb of mitochondrial genomic data that indicate a reduced gene content paralleling that of mitochondrial genomes in apicomplexans, i.e., only three protein-encoding genes and at least eight conserved components of the highly fragmented large and small subunit rRNAs. Unlike in apicomplexans, dinoflagellate mitochondrial genes occur in multiple copies, often as gene fragments, and in numerous genomic contexts. Analysis of cDNAs suggests several novel aspects of dinoflagellate mitochondrial gene expression. Polycistronic transcripts were found, standard start codons are absent, and oligoadenylation occurs upstream of stop codons, resulting in the absence of termination codons. Transcripts of at least one gene, cox3, are apparently trans-spliced to generate full-length mRNAs. RNA substitutional editing, a process previously identified for mRNAs in dinoflagellate mitochondria, is also implicated in rRNA expression. Conclusion The dinoflagellate mitochondrial genome shares the same gene complement and fragmentation of rRNA genes with its apicomplexan counterpart. However, it also exhibits several unique characteristics. Most notable are the expansion of gene copy numbers and their arrangements within the genome, RNA editing, loss of stop codons, and use of trans-splicing. PMID:17897476
Dissecting DNA repair in adult high grade gliomas for patient stratification in the post-genomic era

PubMed Central

Perry, Christina; Agarwal, Devika; Abdel-Fatah, Tarek M.A.; Lourdusamy, Anbarasu; Grundy, Richard; Auer, Dorothee T.; Walker, David; Lakhani, Ravi; Scott, Ian S.; Chan, Stephen; Ball, Graham; Madhusudan, Srinivasan

2014-01-01

Deregulation of multiple DNA repair pathways may contribute to aggressive biology and therapy resistance in gliomas. We evaluated transcript levels of 157 genes involved in DNA repair in an adult glioblastoma Test set (n=191) and validated in ‘The Cancer Genome Atlas’ (TCGA) cohort (n=508). A DNA repair prognostic index model was generated. Artificial neural network analysis (ANN) was conducted to investigate global gene interactions. Protein expression by immunohistochemistry was conducted in 61 tumours. A fourteen DNA repair gene expression panel was associated with poor survival in Test and TCGA cohorts. A Cox multivariate model revealed APE1, NBN, PMS2, MGMT and PTEN as independently associated with poor prognosis. A DNA repair prognostic index incorporating APE1, NBN, PMS2, MGMT and PTEN stratified patients into three prognostic sub-groups with worsening survival. APE1, NBN, PMS2, MGMT and PTEN also have predictive significance in patients who received chemotherapy and/or radiotherapy. ANN analysis of APE1, NBN, PMS2, MGMT and PTEN revealed interactions with genes involved in transcription, hypoxia and metabolic regulation. At the protein level, low APE1 and low PTEN remain associated with poor prognosis. In conclusion, multiple DNA repair pathways operate to influence biology and clinical outcomes in adult high grade gliomas. PMID:25026297
Regulation of Na(+)/K(+)-ATPase by nuclear respiratory factor 1: implication in the tight coupling of neuronal activity, energy generation, and energy consumption.

PubMed

Johar, Kaid; Priya, Anusha; Wong-Riley, Margaret T T

2012-11-23

NRF-1 regulates mediators of neuronal activity and energy generation. NRF-1 transcriptionally regulates Na(+)/K(+)-ATPase subunits α1 and β1. NRF-1 functionally regulates mediators of energy consumption in neurons. NRF-1 mediates the tight coupling of neuronal activity, energy generation, and energy consumption at the molecular level. Energy generation and energy consumption are tightly coupled to neuronal activity at the cellular level. Na(+)/K(+)-ATPase, a major energy-consuming enzyme, is well expressed in neurons rich in cytochrome c oxidase, an important enzyme of the energy-generating machinery, and glutamatergic receptors that are mediators of neuronal activity. The present study sought to test our hypothesis that the coupling extends to the molecular level, whereby Na(+)/K(+)-ATPase subunits are regulated by the same transcription factor, nuclear respiratory factor 1 (NRF-1), found recently by our laboratory to regulate all cytochrome c oxidase subunit genes and some NMDA and AMPA receptor subunit genes. By means of multiple approaches, including in silico analysis, electrophoretic mobility shift and supershift assays, in vivo chromatin immunoprecipitation, promoter mutational analysis, and real-time quantitative PCR, NRF-1 was found to functionally bind to the promoters of Atp1a1 and Atp1b1 genes but not of the Atp1a3 gene in neurons. The transcripts of Atp1a1 and Atp1b1 subunit genes were up-regulated by KCl and down-regulated by tetrodotoxin. Atp1b1 is positively regulated by NRF-1, and silencing of NRF-1 with small interference RNA blocked the up-regulation of Atp1b1 induced by KCl, whereas overexpression of NRF-1 rescued these transcripts from being suppressed by tetrodotoxin. On the other hand, Atp1a1 is negatively regulated by NRF-1. The binding sites of NRF-1 on Atp1a1 and Atp1b1 are conserved among mice, rats, and humans. Thus, NRF-1 regulates key Na(+)/K(+)-ATPase subunits and plays an important role in mediating the tight coupling between energy consumption, energy generation, and neuronal activity at the molecular level.
Phylogenetic analysis of pectin-related gene families in Physcomitrella patens and nine other plant species yields evolutionary insights into cell walls

PubMed Central

2014-01-01

Background Pectins are acidic sugar-containing polysaccharides that are universally conserved components of the primary cell walls of plants and modulate both tip and diffuse cell growth. However, many of their specific functions and the evolution of the genes responsible for producing and modifying them are incompletely understood. The moss Physcomitrella patens is emerging as a powerful model system for the study of plant cell walls. To identify deeply conserved pectin-related genes in Physcomitrella, we generated phylogenetic trees for 16 pectin-related gene families using sequences from ten plant genomes and analyzed the evolutionary relationships within these families. Results Contrary to our initial hypothesis that a single ancestral gene was present for each pectin-related gene family in the common ancestor of land plants, five of the 16 gene families, including homogalacturonan galacturonosyltransferases, polygalacturonases, pectin methylesterases, homogalacturonan methyltransferases, and pectate lyase-like proteins, show evidence of multiple members in the early land plant that gave rise to the mosses and vascular plants. Seven of the gene families, the UDP-rhamnose synthases, UDP-glucuronic acid epimerases, homogalacturonan galacturonosyltransferase-like proteins, β-1,4-galactan β-1,4-galactosyltransferases, rhamnogalacturonan II xylosyltransferases, and pectin acetylesterases appear to have had a single member in the common ancestor of land plants. We detected no Physcomitrella members in the xylogalacturonan xylosyltransferase, rhamnogalacturonan I arabinosyltransferase, pectin methylesterase inhibitor, or polygalacturonase inhibitor protein families. Conclusions Several gene families related to the production and modification of pectins in plants appear to have multiple members that are conserved as far back as the common ancestor of mosses and vascular plants. The presence of multiple members of these families even before the divergence of other important cell wall-related genes, such as cellulose synthases, suggests a more complex role than previously suspected for pectins in the evolution of land plants. The presence of relatively small pectin-related gene families in Physcomitrella as compared to Arabidopsis makes it an attractive target for analysis of the functions of pectins in cell walls. In contrast, the absence of genes in Physcomitrella for some families suggests that certain pectin modifications, such as homogalacturonan xylosylation, arose later during land plant evolution. PMID:24666997
Finding novel relationships with integrated gene-gene association network analysis of Synechocystis sp. PCC 6803 using species-independent text-mining.

PubMed

Kreula, Sanna M; Kaewphan, Suwisa; Ginter, Filip; Jones, Patrik R

2018-01-01

The increasing move towards open access full-text scientific literature enhances our ability to utilize advanced text-mining methods to construct information-rich networks that no human will be able to grasp simply from 'reading the literature'. The utility of text-mining for well-studied species is obvious, though the utility for less studied species, or those with no prior track-record at all, is not clear. Here we present a concept for how advanced text-mining can be used to create information-rich networks even for less well studied species and apply it to generate an open-access gene-gene association network resource for Synechocystis sp. PCC 6803, a representative model organism for cyanobacteria and first case-study for the methodology. By merging the text-mining network with networks generated from species-specific experimental data, network integration was used to enhance the accuracy of predicting novel interactions that are biologically relevant. A rule-based algorithm (filter) was constructed in order to automate the search for novel candidate genes with a high degree of likely association to known target genes by (1) ignoring established relationships from the existing literature, as they are already 'known', and (2) demanding multiple independent lines of evidence for every novel and potentially relevant relationship. Using selected case studies, we demonstrate the utility of the network resource and filter to (i) discover novel candidate associations between different genes or proteins in the network, and (ii) rapidly evaluate the potential role of any one particular gene or protein. The full network is provided as an open-source resource.
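The two rules of the filter described above can be illustrated with a short sketch. This is not the authors' implementation: the tuple layout, the source labels, the gene names, and the default threshold of two independent sources are all assumptions made for illustration.

```python
from collections import defaultdict


def novel_candidates(evidence, known_pairs, min_sources=2):
    """Apply the two filter rules to a list of evidence records.

    evidence: iterable of (gene_a, gene_b, source) tuples (hypothetical layout).
    known_pairs: gene pairs already established in the literature.
    min_sources: assumed threshold for "multiple independent" evidence.
    """
    known = {frozenset(pair) for pair in known_pairs}
    sources = defaultdict(set)
    for gene_a, gene_b, source in evidence:
        pair = frozenset((gene_a, gene_b))  # unordered: A-B equals B-A
        if gene_a == gene_b or pair in known:
            continue  # rule 1: ignore relationships that are already known
        sources[pair].add(source)
    # rule 2: keep only pairs supported by enough distinct sources
    return {
        tuple(sorted(pair)): sorted(found)
        for pair, found in sources.items()
        if len(found) >= min_sources
    }


evidence = [
    ("geneA", "geneB", "text_mining"),
    ("geneB", "geneA", "coexpression"),
    ("geneA", "geneC", "text_mining"),   # only one source, so dropped
    ("geneC", "geneD", "text_mining"),
    ("geneC", "geneD", "coexpression"),  # well supported but already known
]
candidates = novel_candidates(evidence, known_pairs=[("geneC", "geneD")])
```

Counting distinct sources rather than raw records is a deliberate choice here: several records from the same source would not count as independent support.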
Transcriptome analysis of woodland strawberry (Fragaria vesca) response to the infection by Strawberry vein banding virus (SVBV).

PubMed

Chen, Jing; Zhang, Hanping; Feng, Mingfeng; Zuo, Dengpan; Hu, Yahui; Jiang, Tong

2016-07-13

Woodland strawberry (Fragaria vesca) infected with Strawberry vein banding virus (SVBV) exhibits chlorotic symptoms along the leaf veins. However, little is known about the molecular mechanism of strawberry disease caused by SVBV. We performed a next-generation sequencing (RNA-Seq) study to identify gene expression changes induced by SVBV in woodland strawberry using mock-inoculated plants as a control. Using RNA-Seq, we identified 36,850 unigenes, of which 517 were differentially expressed in the virus-infected plants (DEGs). The unigenes were annotated and classified with Gene Ontology (GO), Clusters of Orthologous Groups (COG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses. The KEGG pathway analysis of these genes suggested that strawberry disease caused by SVBV may affect multiple processes including pigment metabolism, photosynthesis and plant-pathogen interactions. Our research provides comprehensive transcriptome information regarding SVBV infection in strawberry.
Generation of Myostatin Gene-Edited Channel Catfish (Ictalurus punctatus) via Zygote Injection of CRISPR/Cas9 System.

PubMed

Khalil, Karim; Elayat, Medhat; Khalifa, Elsayed; Daghash, Samer; Elaswad, Ahmed; Miller, Michael; Abdelrahman, Hisham; Ye, Zhi; Odin, Ramjie; Drescher, David; Vo, Khoi; Gosh, Kamal; Bugg, William; Robinson, Dalton; Dunham, Rex

2017-08-04

The myostatin (MSTN) gene is important because of its role in regulation of skeletal muscle growth in all vertebrates. In this study, CRISPR/Cas9 was utilized to successfully target the channel catfish, Ictalurus punctatus, muscle suppressor gene MSTN. CRISPR/Cas9 induced high rates (88-100%) of mutagenesis in the target protein-encoding sites of MSTN. MSTN-edited fry had more muscle cells (p < 0.001) than controls, and the mean body weight of gene-edited fry increased by 29.7%. The nucleic acid alignment of the mutated sequences against the wild-type sequence revealed multiple insertions and deletions. These results demonstrate that CRISPR/Cas9 is a highly efficient tool for editing the channel catfish genome, and open the way to channel catfish genetic enhancement and functional genomics. This approach may produce growth-enhanced channel catfish and increase productivity.
Mouse forward genetics in the study of the peripheral nervous system and human peripheral neuropathy

PubMed Central

Douglas, Darlene S.; Popko, Brian

2009-01-01

Forward genetics, the phenotype-driven approach to investigating gene identity and function, has a long history in mouse genetics. Random mutations in the mouse transcend bias about gene function and provide avenues towards unique discoveries. The study of the peripheral nervous system is no exception; from historical strains such as the trembler mouse, which led to the identification of PMP22 as a human disease gene causing multiple forms of peripheral neuropathy, to the more recent identification of the claw paw and sprawling mutations, forward genetics has long been a tool for probing the physiology, pathogenesis, and genetics of the PNS. Even as spontaneous and mutagenized mice continue to enable the identification of novel genes, provide allelic series for detailed functional studies, and generate models useful for clinical research, new methods, such as the piggyBac transposon, are being developed to further harness the power of forward genetics. PMID:18481175
Sequence of a second gene encoding bovine submaxillary mucin: implication for mucin heterogeneity and cloning.

PubMed

Jiang, W; Woitach, J T; Gupta, D; Bhavanandan, V P

1998-10-20

Secreted epithelial mucins are extremely large and heterogeneous glycoproteins. We report the 5 kilobase DNA sequence of a second gene, BSM2, which encodes bovine submaxillary mucin. The determined nucleotide and deduced amino acid sequences of BSM2 are 95.2% and 92.2% identical, respectively, to those of the previously described BSM1 gene isolated from the same cow. Further, the five predicted protein domains of the two genes are 100%, 94%, 93%, 77%, and 88% identical. Based on the above results, we propose that expression of multiple homologous core proteins from a single animal is a factor in generating diversity of saccharides in mucins and in providing resistance of the molecules to proteolysis. In addition, this work raises several important issues in mucin cloning such as assembling sequences from seemingly overlapping clones and deducing consensus sequences for nearly identical tandem repeats. Copyright 1998 Academic Press.
Multiple Cytochrome P450 genes: their constitutive overexpression and permethrin induction in insecticide resistant mosquitoes, Culex quinquefasciatus.

PubMed

Liu, Nannan; Li, Ting; Reid, William R; Yang, Ting; Zhang, Lee

2011-01-01

Four cytochrome P450 cDNAs, CYP6AA7, CYP9J40, CYP9J34, and CYP9M10, were isolated from mosquitoes, Culex quinquefasciatus. The P450 gene expression and induction by permethrin were compared for three different mosquito populations bearing different resistance phenotypes, ranging from susceptible (S-Lab), through intermediate (HAmCq(G0), the field parental population) to highly resistant (HAmCq(G8), the 8th generation of permethrin-selected offspring of HAmCq(G0)). A strong correlation was found for P450 gene expression with the levels of resistance and following permethrin selection at the larval stage of mosquitoes, with the highest expression levels identified in HAmCq(G8), suggesting the importance of CYP6AA7, CYP9J40, CYP9J34, and CYP9M10 in the permethrin resistance of larval mosquitoes. Only CYP6AA7 showed a significant overexpression in HAmCq(G8) adult mosquitoes. Other P450 genes had similar expression levels among the mosquito populations tested, suggesting different P450 genes may be involved in the response to insecticide pressure in different developmental stages. The expression of CYP6AA7, CYP9J34, and CYP9M10 was further induced by permethrin in resistant mosquitoes. Taken together, these results indicate that multiple P450 genes are up-regulated in insecticide resistant mosquitoes through both constitutive overexpression and induction mechanisms, thus increasing the overall expression levels of P450 genes.
Metabolomic analysis reveals key metabolites related to the rapid adaptation of Saccharomyces cerevisiae to multiple inhibitors of furfural, acetic acid, and phenol.

PubMed

Wang, Xin; Li, Bing-Zhi; Ding, Ming-Zhu; Zhang, Wei-Wen; Yuan, Ying-Jin

2013-03-01

During hydrolysis of lignocellulosic biomass, a broad range of inhibitors are generated, which interfere with yeast growth and bioethanol production. In order to improve strain tolerance to multiple inhibitors (acetic acid, furfural, and phenol, three representative lignocellulose-derived inhibitors) and uncover the underlying tolerance mechanism, an adaptation experiment was performed in which the industrial Saccharomyces cerevisiae was cultivated repeatedly in a medium containing multiple inhibitors. The adaptation occurred quickly, accompanied by a distinct increase in growth rate, glucose utilization rate, furfural metabolism rate, and ethanol yield, after only the first transfer. A similar rapid adaptation was also observed for the lab strains BY4742 and BY4743. Metabolomic analysis was employed to investigate the responses of the industrial S. cerevisiae to three inhibitors during the adaptation. The results showed that higher levels of 2-furoic acid, 2,3-butanediol, intermediates in the glycolytic pathway, and amino acids derived from glycolysis were found in the adapted strains, suggesting that enhanced metabolic activity in these pathways may relate to resistance against inhibitors. Additionally, through single-gene knockouts, several genes related to alanine metabolism, GABA shunt, and glycerol metabolism were verified to be crucial for the resistance to multiple inhibitors. This study provides new insights into the tolerance mechanism against multiple inhibitors, and guides the improvement of tolerant ethanologenic yeast strains for lignocellulose-bioethanol fermentation.
Reverse genetics in high throughput: rapid generation of complete negative strand RNA virus cDNA clones and recombinant viruses thereof.

PubMed

Nolden, T; Pfaff, F; Nemitz, S; Freuling, C M; Höper, D; Müller, T; Finke, Stefan

2016-04-05

Reverse genetics approaches are indispensable tools for proof of concepts in virus replication and pathogenesis. For negative strand RNA viruses (NSVs) the limited number of infectious cDNA clones represents a bottleneck as clones are often generated from cell culture adapted or attenuated viruses, with limited potential for pathogenesis research. We developed a system in which cDNA copies of complete NSV genomes were directly cloned into reverse genetics vectors by linear-to-linear RedE/T recombination. Rapid cloning of multiple rabies virus (RABV) full length genomes and identification of clones identical to field virus consensus sequence confirmed the approach's reliability. Recombinant viruses were recovered from field virus cDNA clones. Similar growth kinetics of parental and recombinant viruses, preservation of field virus characters in cell type specific replication and virulence in the mouse model were confirmed. Reduced titers after reporter gene insertion indicated that the low level of field virus replication is affected by gene insertions. The flexibility of the strategy was demonstrated by cloning multiple copies of an orthobunyavirus L genome segment. This important step in reverse genetics technology development opens novel avenues for the analysis of virus variability combined with phenotypical characterization of recombinant viruses at a clonal level.
Capture-based next-generation sequencing reveals multiple actionable mutations in cancer patients failed in traditional testing.

PubMed

Xie, Jing; Lu, Xiongxiong; Wu, Xue; Lin, Xiaoyi; Zhang, Chao; Huang, Xiaofang; Chang, Zhili; Wang, Xinjing; Wen, Chenlei; Tang, Xiaomei; Shi, Minmin; Zhan, Qian; Chen, Hao; Deng, Xiaxing; Peng, Chenghong; Li, Hongwei; Fang, Yuan; Shao, Yang; Shen, Baiyong

2016-05-01

Targeted therapies including monoclonal antibodies and small molecule inhibitors have dramatically changed the treatment of cancer over the past 10 years. Their therapeutic advantages are greater tumor specificity and fewer side effects. For precisely tailoring available targeted therapies to each individual or a subset of cancer patients, next-generation sequencing (NGS) has been utilized as a promising diagnostic tool with its advantages of accuracy, sensitivity, and high throughput. We developed and validated an NGS-based cancer genomic diagnosis targeting 115 prognosis- and therapeutics-relevant genes on multiple specimens including blood, tumor tissue, and body fluid from 10 patients with different cancer types. The sequencing data were then analyzed by clinically applicable analytical pipelines developed in house. We assessed the analytical sensitivity, specificity, and accuracy of the NGS-based molecular diagnosis. Our analytical pipelines were also capable of detecting base substitutions, indels, and gene copy number variations (CNVs). For instance, several actionable mutations of EGFR, PIK3CA, TP53, and KRAS were detected, indicating drug susceptibility and resistance in the cases of lung cancer. Our study has shown that NGS-based molecular diagnosis is more sensitive and comprehensive in detecting genomic alterations in cancer, and supports a direct clinical use for guiding targeted therapy.
Multiple spinal nerve enlargement and SOS1 mutation: Further evidence of overlap between neurofibromatosis type 1 and Noonan phenotype.

PubMed

Santoro, C; Giugliano, T; Melone, M A B; Cirillo, M; Schettino, C; Bernardo, P; Cirillo, G; Perrotta, S; Piluso, G

2018-01-01

Neurofibromatosis type 1 (NF1) has long been considered a well-defined, recognizable monogenic disorder, with neurofibromas constituting a pathognomonic sign. This dogma has been challenged by recent descriptions of patients with enlarged nerves or paraspinal tumors, suggesting that neurogenic tumors and hypertrophic neuropathy may be a complication of Noonan syndrome with multiple lentigines (NSML) or RASopathy phenotype. We describe a 15-year-old boy, whose mother previously received clinical diagnosis of NF1 due to presence of bilateral cervical and lumbar spinal lesions resembling plexiform neurofibromas and features suggestive of NS. NF1 molecular analysis was negative in the mother. The boy presented with Noonan features, multiple lentigines and pectus excavatum. Next-generation sequencing analysis of all RASopathy genes identified p.Ser548Arg missense mutation in SOS1 in the boy, confirmed in his mother. Brain and spinal magnetic resonance imaging scans were negative in the boy. No heart involvement or deafness was observed in proband or mother. This is the first report of a SOS1 mutation associated with hypertrophic neuropathy resembling plexiform neurofibromas, a rare complication in Noonan phenotypes with mutations in RASopathy genes. Our results highlight the overlap between RASopathies, suggesting that NF1 diagnostic criteria need rethinking. Genetic analysis of RASopathy genes should be considered when diagnosis is uncertain. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

Optimization of techniques for multiple platform testing in small, precious samples such as human chorionic villus sampling.

PubMed

Pisarska, Margareta D; Akhlaghpour, Marzieh; Lee, Bora; Barlow, Gillian M; Xu, Ning; Wang, Erica T; Mackey, Aaron J; Farber, Charles R; Rich, Stephen S; Rotter, Jerome I; Chen, Yii-der I; Goodarzi, Mark O; Guller, Seth; Williams, John

2016-11-01

Multiple testing to understand global changes in gene expression based on genetic and epigenetic modifications is evolving. Chorionic villi, obtained for prenatal testing, are limited, but can be used to understand ongoing human pregnancies. However, optimal storage, processing and utilization of CVS for multiple platform testing have not been established. Leftover CVS samples were flash-frozen or preserved in RNAlater. Modifications to standard isolation kits were performed to isolate quality DNA and RNA from samples as small as 2-5 mg. RNAlater samples had significantly higher RNA yields and quality and were successfully used in microarray and RNA-sequencing (RNA-seq). RNA-seq libraries generated using 200 versus 800 ng of RNA showed similar biological coefficients of variation. RNAlater samples had lower DNA yields and quality, which improved by heating the elution buffer to 70 °C. Purification of DNA was not necessary for bisulfite-conversion and genome-wide methylation profiling. CVS cells were propagated and continue to express genes found in freshly isolated chorionic villi. CVS samples preserved in RNAlater are superior. Our optimized techniques provide specimens for genetic, epigenetic and gene expression studies from a single small sample which can be used to develop diagnostics and treatments using a systems biology approach in the prenatal period. © 2016 John Wiley & Sons, Ltd.
Insights into TREM2 biology by network analysis of human brain gene expression data

PubMed Central

Forabosco, Paola; Ramasamy, Adaikalavan; Trabzuni, Daniah; Walker, Robert; Smith, Colin; Bras, Jose; Levine, Adam P.; Hardy, John; Pocock, Jennifer M.; Guerreiro, Rita; Weale, Michael E.; Ryten, Mina

2013-01-01

Rare variants in TREM2 cause susceptibility to late-onset Alzheimer's disease. Here we use microarray-based expression data generated from 101 neuropathologically normal individuals and covering 10 brain regions, including the hippocampus, to understand TREM2 biology in human brain. Using network analysis, we detect a highly preserved TREM2-containing module in human brain, show that it relates to microglia, and demonstrate that TREM2 is a hub gene in 5 brain regions, including the hippocampus, suggesting that it can drive module function. Using enrichment analysis we show significant overrepresentation of genes implicated in the adaptive and innate immune system. Inspection of genes with the highest connectivity to TREM2 suggests that it plays a key role in mediating changes in the microglial cytoskeleton necessary not only for phagocytosis, but also migration. Most importantly, we show that the TREM2-containing module is significantly enriched for genes genetically implicated in Alzheimer's disease, multiple sclerosis, and motor neuron disease, implying that these diseases share common pathways centered on microglia and that among the genes identified are possible new disease-relevant genes. PMID:23855984
Construction of an infectious clone of canine herpesvirus genome as a bacterial artificial chromosome.

PubMed

Arii, Jun; Hushur, Orkash; Kato, Kentaro; Kawaguchi, Yasushi; Tohya, Yukinobu; Akashi, Hiroomi

2006-04-01

Canine herpesvirus (CHV) is an attractive candidate not only for use as a recombinant vaccine to protect dogs from a variety of canine pathogens but also as a viral vector for gene therapy in domestic animals. However, developments in this area have been impeded by the complicated techniques used for eukaryotic homologous recombination. To overcome these problems, we used bacterial artificial chromosomes (BACs) to generate infectious BACs. Our findings may be summarized as follows: (i) the CHV genome (pCHV/BAC), in which a BAC flanked by loxP sites was inserted into the thymidine kinase gene, was maintained in Escherichia coli; (ii) transfection of pCHV/BAC into A-72 cells resulted in the production of infectious virus; (iii) the BAC vector sequence was almost perfectly excisable from the genome of the reconstituted virus CHV/BAC by co-infection with CHV/BAC and a recombinant adenovirus that expressed the Cre recombinase; and (iv) a recombinant virus in which the glycoprotein C gene was deleted was generated by lambda recombination followed by Flp recombination, which resulted in a reduction in viral titer compared with that of the wild-type virus. The infectious clone pCHV/BAC is useful for the modification of the CHV genome using bacterial genetics, and CHV/BAC should have multiple applications in the rapid generation of genetically engineered CHV recombinants and the development of CHV vectors for vaccination and gene therapy in domestic animals.
A CRISPR view of development

PubMed Central

Harrison, Melissa M.; Jenkins, Brian V.; O’Connor-Giles, Kate M.

2014-01-01

The CRISPR (clustered regularly interspaced short palindromic repeat)–Cas9 (CRISPR-associated nuclease 9) system is poised to transform developmental biology by providing a simple, efficient method to precisely manipulate the genome of virtually any developing organism. This RNA-guided nuclease (RGN)-based approach already has been effectively used to induce targeted mutations in multiple genes simultaneously, create conditional alleles, and generate endogenously tagged proteins. Illustrating the adaptability of RGNs, the genomes of >20 different plant and animal species as well as multiple cell lines and primary cells have been successfully modified. Here we review the current and potential uses of RGNs to investigate genome function during development. PMID:25184674
Genome Enabled Discovery of Carbon Sequestration Genes in Poplar

DOE Office of Scientific and Technical Information (OSTI.GOV)

Filichkin, Sergei; Etherington, Elizabeth; Ma, Caiping

2007-02-22

The goals of the S.H. Strauss laboratory portion of 'Genome-enabled discovery of carbon sequestration genes in poplar' are (1) to explore the functions of candidate genes using Populus transformation by inserting genes provided by Oak Ridge National Laboratory (ORNL) and the University of Florida (UF) into poplar; (2) to expand the poplar transformation toolkit by developing transformation methods for important genotypes; and (3) to allow induced expression, and efficient gene suppression, in roots and other tissues. As part of the transformation improvement effort, OSU developed transformation protocols for Populus trichocarpa 'Nisqually-1' clone and an early flowering P. alba clone, 6K10. Complete descriptions of the transformation systems were published (Ma et al. 2004, Meilan et al. 2004). Twenty-one 'Nisqually-1' and 622 6K10 transgenic plants were generated. To identify root-predominant promoters, a set of three promoters were tested for their tissue-specific expression patterns in poplar and in Arabidopsis as a model system. A novel gene, ET304, was identified by analyzing a collection of poplar enhancer trap lines generated at OSU (Filichkin et al. 2006a, 2006b). Other promoters include the pGgMT1 root-predominant promoter from Casuarina glauca and the pAtPIN2 promoter from Arabidopsis root-specific PIN2 gene. OSU tested two induction systems, alcohol- and estrogen-inducible, in multiple poplar transgenics. Ethanol proved to be the more efficient when tested in tissue culture and greenhouse conditions. Two estrogen-inducible systems were evaluated in transgenic Populus, neither of which functioned reliably in tissue culture conditions. GATEWAY-compatible plant binary vectors were designed to compare the silencing efficiency of homologous (direct) RNAi vs. heterologous (transitive) RNAi inverted repeats.
A set of genes was targeted for post-transcriptional silencing in the model Arabidopsis system; these include the floral meristem identity gene (APETALA1 or AP1), auxin response factor gene (ETTIN), the gene encoding transcriptional factor of WD40 family (TRANSPARENT TESTA GLABRA1 or TTG1), and the auxin efflux carrier (PIN-FORMED2 or PIN2) gene. More than 220 transgenic lines of the 1st, 2nd and 3rd generations were analyzed for RNAi suppression phenotypes (Filichkin et al., manuscript submitted). A total of 108 constructs were supplied by ORNL, UF and OSU and used to generate over 1,881 PCR verified transgenic Populus and over 300 PCR verified transgenic Arabidopsis events. The Populus transgenics alone required Agrobacterium co-cultivations of 124,406 explants.
Combinatorial Strategies for Improving Multiple-Stress Resistance in Industrially Relevant Escherichia coli Strains

PubMed Central

Herrgård, Markus J.

2014-01-01

High-cell-density fermentation for industrial production of chemicals can impose numerous stresses on cells due to high substrate, product, and by-product concentrations; high osmolarity; reactive oxygen species; and elevated temperatures. There is a need to develop platform strains of industrial microorganisms that are more tolerant toward these typical processing conditions. In this study, the growth of six industrially relevant strains of Escherichia coli was characterized under eight stress conditions representative of fed-batch fermentation, and strains W and BL21(DE3) were selected as platforms for transposon (Tn) mutagenesis due to favorable resistance characteristics. Selection experiments, followed by either targeted or genome-wide next-generation-sequencing-based Tn insertion site determination, were performed to identify mutants with improved growth properties under a subset of three stress conditions and two combinations of individual stresses. A subset of the identified loss-of-function mutants were selected for a combinatorial approach, where strains with combinations of two and three gene deletions were systematically constructed and tested for single and multistress resistance. These approaches allowed identification of (i) strain-background-specific stress resistance phenotypes, (ii) novel gene deletion mutants in E. coli that confer single and multistress resistance in a strain-background-dependent manner, and (iii) synergistic effects of multiple gene deletions that confer improved resistance over single deletions. The results of this study underscore the suboptimality and strain-specific variability of the genetic network regulating growth under stressful conditions and suggest that further exploration of the combinatorial gene deletion space in multiple strain backgrounds is needed for optimizing strains for microbial bioprocessing applications. PMID:25085490
DNA Translator and Aligner: HyperCard utilities to aid phylogenetic analysis of molecules.

PubMed

Eernisse, D J

1992-04-01

DNA Translator and Aligner are molecular phylogenetics HyperCard stacks for Macintosh computers. They manipulate sequence data to provide graphical gene mapping, conversions, translations and manual multiple-sequence alignment editing. DNA Translator is able to convert documented GenBank or EMBL sequences into linearized, rescalable gene maps whose gene sequences are extractable by clicking on the corresponding map button or by selection from a scrolling list. Provided gene maps, complete with extractable sequences, consist of nine metazoan, one yeast, and one ciliate mitochondrial DNAs and three green plant chloroplast DNAs. Single or multiple sequences can be manipulated to aid in phylogenetic analysis. Sequences can be translated between nucleic acids and proteins in either direction with flexible support of alternate genetic codes and ambiguous nucleotide symbols. Multiple aligned sequence output from diverse sources can be converted to Nexus, Hennig86 or PHYLIP format for subsequent phylogenetic analysis. Input or output alignments can be examined with Aligner, a convenient accessory stack included in the DNA Translator package. Aligner is an editor for the manual alignment of up to 100 sequences that toggles between display of matched characters and normal unmatched sequences. DNA Translator also generates graphic displays of amino acid coding and codon usage frequency relative to all other, or only synonymous, codons for approximately 70 select organism-organelle combinations. Codon usage data is compatible with spreadsheet or UWGCG formats for incorporation of additional molecules of interest. The complete package is available via anonymous ftp and is free for non-commercial uses.
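The translation step described in this abstract (nucleic acid to protein, with alternate genetic codes and ambiguous nucleotide symbols) can be illustrated with a minimal Python sketch. This is not DNA Translator's HyperCard code; the function name `translate` and the table names are hypothetical, and only two codes are shown: the standard code and the vertebrate mitochondrial code, which differs from it at four codons. A codon containing an ambiguity symbol is translated only when every possible expansion gives the same amino acid; otherwise it is reported as `X`.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD_AAS)}

# Vertebrate mitochondrial code: four codons differ from the standard code.
VERT_MITO = {**STANDARD, "AGA": "*", "AGG": "*", "ATA": "M", "TGA": "W"}

# IUPAC nucleotide symbols and the bases each one can stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T", "U": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def translate(dna, code=STANDARD):
    """Translate a DNA string in frame 1; trailing partial codons are ignored."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        # Expand each symbol to its possible bases (unknown symbols act like N).
        options = [IUPAC.get(base, "ACGT") for base in codon]
        amino_acids = {code["".join(c)] for c in product(*options)}
        # Emit an amino acid only if all expansions agree; otherwise 'X'.
        protein.append(amino_acids.pop() if len(amino_acids) == 1 else "X")
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGTGAAGA"))             # standard code
    print(translate("ATGTGAAGA", VERT_MITO))  # vertebrate mitochondrial code
    print(translate("GCNATN"))                # ambiguous third positions
```

Passing the code table as a plain dictionary keeps the sketch easy to extend: another genetic code is just a copy of `STANDARD` with the differing codons overridden.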
Turning publicly available gene expression data into discoveries using gene set context analysis.

PubMed

Ji, Zhicheng; Vokes, Steven A; Dang, Chi V; Ji, Hongkai

2016-01-08

Gene Set Context Analysis (GSCA) is an open source software package to help researchers use massive amounts of publicly available gene expression data (PED) to make discoveries. Users can interactively visualize and explore gene and gene set activities in 25,000+ consistently normalized human and mouse gene expression samples representing diverse biological contexts (e.g. different cells, tissues and disease types, etc.). By providing one or multiple genes or gene sets as input and specifying a gene set activity pattern of interest, users can query the expression compendium to systematically identify biological contexts associated with the specified gene set activity pattern. In this way, researchers with new gene sets from their own experiments may discover previously unknown contexts of gene set functions and hence increase the value of their experiments. GSCA has a graphical user interface (GUI). The GUI makes the analysis convenient and customizable. Analysis results can be conveniently exported as publication quality figures and tables. GSCA is available at https://github.com/zji90/GSCA. This software significantly lowers the bar for biomedical investigators to use PED in their daily research for generating and screening hypotheses, which was previously difficult because of the complexity, heterogeneity and size of the data. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Integrative Functional Genomics for Systems Genetics in GeneWeaver.org.

PubMed

Bubier, Jason A; Langston, Michael A; Baker, Erich J; Chesler, Elissa J

2017-01-01

The abundance of existing functional genomics studies permits an integrative approach to interpreting and resolving the results of diverse systems genetics studies. However, a major challenge lies in assembling and harmonizing heterogeneous data sets across species for facile comparison to the positional candidate genes and coexpression networks that come from systems genetic studies. GeneWeaver is an online database and suite of tools at www.geneweaver.org that allows for fast aggregation and analysis of gene set-centric data. GeneWeaver contains curated experimental data together with resource-level data such as GO annotations, MP annotations, and KEGG pathways, along with persistent stores of user entered data sets. These can be entered directly into GeneWeaver or transferred from widely used resources such as GeneNetwork.org. Data are analyzed using statistical tools and advanced graph algorithms to discover new relations, prioritize candidate genes, and generate function hypotheses. Here we use GeneWeaver to find genes common to multiple gene sets, prioritize candidate genes from a quantitative trait locus, and characterize a set of differentially expressed genes. Coupling a large multispecies repository of curated and empirical functional genomics data to fast computational tools allows for the rapid integrative analysis of heterogeneous data for interpreting and extrapolating systems genetics results.
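The first use case mentioned above, finding genes common to multiple gene sets, reduces to counting how many sets each gene appears in. The Python sketch below illustrates that idea only; it does not use or reproduce GeneWeaver's tools or API, and the function name, set names, and gene identifiers are all made up for illustration.

```python
from collections import Counter


def genes_in_at_least(gene_sets, min_sets):
    """Return, in sorted order, the genes present in at least `min_sets` sets.

    `gene_sets` maps a set name to an iterable of gene identifiers.
    """
    # Deduplicate within each set so a gene counts once per set.
    counts = Counter(gene for genes in gene_sets.values() for gene in set(genes))
    return sorted(gene for gene, n in counts.items() if n >= min_sets)


# Hypothetical input: three gene sets from different kinds of study.
example_sets = {
    "qtl_candidates": ["GeneA", "GeneB", "GeneC"],
    "differential_expression": ["GeneB", "GeneC", "GeneD"],
    "literature_curated": ["GeneB", "GeneE", "GeneE"],
}

if __name__ == "__main__":
    print(genes_in_at_least(example_sets, 2))  # genes shared by two or more sets
    print(genes_in_at_least(example_sets, 3))  # genes shared by all three sets
```

Ranking genes by this count is one simple way to prioritize candidates; a real cross-species analysis would first need to map identifiers to a common namespace, which this sketch assumes has already been done.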
International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons

PubMed Central

Olson, Nathan D.; Lund, Steven P.; Zook, Justin M.; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S.; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B.

2015-01-01

This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030
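To make the idea of a variant copy ratio concrete, the following Python sketch estimates how many copies of a multi-copy gene carry a variant base at one position, given read counts. It is a simplified maximum likelihood illustration under stated assumptions, not the Bayesian or maximum likelihood procedures developed in the study: reads are assumed to be sampled equally from all copies, a single symmetric per-base error rate is assumed, and the function name, the default error rate, and the copy number used in the example are hypothetical.

```python
import math


def ml_variant_copies(variant_reads, total_reads, n_copies, error_rate=0.01):
    """Return the number of gene copies (0..n_copies) carrying the variant
    that maximizes a binomial likelihood of the observed read counts."""
    if not 0 < error_rate < 0.5:
        raise ValueError("error_rate must be between 0 and 0.5 (exclusive)")
    best_k, best_loglik = 0, -math.inf
    for k in range(n_copies + 1):
        frac = k / n_copies
        # Probability that a read shows the variant base: a true variant read
        # correctly, or a non-variant copy misread as the variant.
        p = frac * (1 - error_rate) + (1 - frac) * error_rate
        loglik = (variant_reads * math.log(p)
                  + (total_reads - variant_reads) * math.log(1 - p))
        if loglik > best_loglik:
            best_k, best_loglik = k, loglik
    return best_k


if __name__ == "__main__":
    # Illustrative numbers only: 290 of 1000 reads show the variant base,
    # and the gene is assumed to be present in seven copies.
    k = ml_variant_copies(variant_reads=290, total_reads=1000, n_copies=7)
    print(f"estimated variant copy ratio: {k}:{7 - k}")
```

Restricting the estimate to whole numbers of copies reflects the fact that the underlying ratio can only take a few discrete values, which is also why low read depth can make neighbouring ratios hard to distinguish.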
OncoSimulR: genetic simulation with arbitrary epistasis and mutator genes in asexual populations.

PubMed

Diaz-Uriarte, Ramon

2017-06-15

OncoSimulR implements forward-time genetic simulations of biallelic loci in asexual populations with special focus on cancer progression. Fitness can be defined as an arbitrary function of genetic interactions between multiple genes or modules of genes, including epistasis, restrictions in the order of accumulation of mutations, and order effects. Mutation rates can differ among genes, and can be affected by (anti)mutator genes. Also available are sampling from simulations (including single-cell sampling), plotting the genealogical relationships of clones and generating and plotting fitness landscapes. Implemented in R and C++, freely available from BioConductor for Linux, Mac and Windows under the GNU GPL license. Version 2.5.9 or higher available from: http://www.bioconductor.org/packages/devel/bioc/html/OncoSimulR.html. GitHub repository at: https://github.com/rdiaz02/OncoSimul. Contact: ramon.diaz@iib.uam.es. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
Disruption of Hox9,10,11 function results in cellular level lineage infidelity in the kidney.

PubMed

Drake, Keri A; Adam, Mike; Mahoney, Robert; Potter, S Steven

2018-04-20

Hox genes are important regulators of development. The 39 mammalian Hox genes have considerable functional overlap, greatly confounding their study. In this report, we generated mice with multiple combinations of paralogous and flanking Abd-B Hox gene mutations to investigate functional redundancies in kidney development. The resulting mice developed a number of kidney abnormalities, including hypoplasia, agenesis, and severe cysts, with distinct Hox functions observed in early metanephric kidney formation and nephron progenitor maintenance. Most surprising, however, was that extensive removal of Hox shared function in these kidneys resulted in cellular level lineage infidelity. Strikingly, mutant nephron tubules consisted of intermixed cells with proximal tubule, loop of Henle, and collecting duct identities, with some single cells expressing markers associated with more than one nephron segment. These results indicate that Hox genes are required for proper lineage selection/maintenance and full repression of genes involved in cell fate restriction in the developing kidney.
Muscle-specific CRISPR/Cas9 dystrophin gene editing ameliorates pathophysiology in a mouse model for Duchenne muscular dystrophy

PubMed Central

Bengtsson, Niclas E.; Hall, John K.; Odom, Guy L.; Phelps, Michael P.; Andrus, Colin R.; Hawkins, R. David; Hauschka, Stephen D.; Chamberlain, Joel R.; Chamberlain, Jeffrey S.

2017-01-01

Gene replacement therapies utilizing adeno-associated viral (AAV) vectors hold great promise for treating Duchenne muscular dystrophy (DMD). A related approach uses AAV vectors to edit specific regions of the DMD gene using CRISPR/Cas9. Here we develop multiple approaches for editing the mutation in dystrophic mdx4cv mice using single and dual AAV vector delivery of a muscle-specific Cas9 cassette together with single-guide RNA cassettes and, in one approach, a dystrophin homology region to fully correct the mutation. Muscle-restricted Cas9 expression enables direct editing of the mutation, multi-exon deletion or complete gene correction via homologous recombination in myogenic cells. Treated muscles express dystrophin in up to 70% of the myogenic area and increased force generation following intramuscular delivery. Furthermore, systemic administration of the vectors results in widespread expression of dystrophin in both skeletal and cardiac muscles. Our results demonstrate that AAV-mediated muscle-specific gene editing has significant potential for therapy of neuromuscular disorders. PMID:28195574
Gene therapies that restore dystrophin expression for the treatment of Duchenne muscular dystrophy

PubMed Central

Robinson-Hamm, Jacqueline N.; Gersbach, Charles A.

2016-01-01

Duchenne muscular dystrophy is one of the most common inherited genetic diseases and is caused by mutations to the DMD gene that encodes the dystrophin protein. Recent advances in genome editing and gene therapy offer hope for the development of potential therapeutics. Truncated versions of the DMD gene can be delivered to the affected tissues with viral vectors and show promising results in a variety of animal models. Genome editing with the CRISPR/Cas9 system has recently been used to restore dystrophin expression by deleting one or more exons of the DMD gene in patient cells and in a mouse model that led to functional improvement of muscle strength. Exon skipping with oligonucleotides has been successful in several animal models and evaluated in multiple clinical trials. Next-generation oligonucleotide formulations offer significant promise to build on these results. All these approaches to restoring dystrophin expression are encouraging, but many hurdles remain. This review summarizes the current state of these technologies and outlines considerations for their future development. PMID:27542949
Genetics and evolution of Yersinia pseudotuberculosis O-specific polysaccharides: a novel pattern of O-antigen diversity

PubMed Central

Kenyon, Johanna J.; Cunneen, Monica M.

2017-01-01

O-antigen polysaccharide is a major immunogenic feature of the lipopolysaccharide of Gram-negative bacteria, and most species produce a large variety of forms that differ substantially from one another. There are 18 known O-antigen forms in the Yersinia pseudotuberculosis complex, which are typical in being composed of multiple copies of a short oligosaccharide called an O unit. The O-antigen gene clusters are located between the hemH and gsk genes, and are atypical as 15 of them are closely related, each having one of five downstream gene modules for alternative main-chain synthesis, and one of seven upstream modules for alternative side-branch sugar synthesis. As a result, many of the genes are in more than one gene cluster. The gene order in each module is such that, in general, the earlier a gene product functions in O-unit synthesis, the closer the gene is to the 5′ end for side-branch modules or the 3′ end for main-chain modules. We propose a model whereby natural selection could generate the observed pattern in gene order, a pattern that has also been observed in other species. PMID:28364730
Targeted next generation sequencing of parotid gland cancer uncovers genetic heterogeneity.

PubMed

Grünewald, Inga; Vollbrecht, Claudia; Meinrath, Jeannine; Meyer, Moritz F; Heukamp, Lukas C; Drebber, Uta; Quaas, Alexander; Beutner, Dirk; Hüttenbrink, Karl-Bernd; Wardelmann, Eva; Hartmann, Wolfgang; Büttner, Reinhard; Odenthal, Margarete; Stenner, Markus

2015-07-20

Salivary gland cancer represents a heterogeneous group of malignant tumors. Due to their low incidence and the existence of multiple morphologically defined subtypes, these tumors are still poorly understood with regard to their molecular pathogenesis and therapeutically relevant genetic alterations. Performing a systematic and comprehensive study covering 13 subtypes of salivary gland cancer, next generation sequencing was done on 84 tissue samples of parotid gland cancer using multiplex PCR for enrichment of cancer related gene loci covering hotspots of 46 cancer genes. Mutations were identified in 22 different genes. The most frequent alterations affected TP53, followed by RAS genes, PIK3CA, SMAD4 and members of the ERB family. HRAS mutations accounted for more than 90% of RAS mutations, occurring especially in epithelial-myoepithelial carcinomas and salivary duct carcinomas. Additional mutations in PIK3CA also affected particularly epithelial-myoepithelial carcinomas and salivary duct carcinomas, occurring simultaneously with HRAS mutations in almost all cases, pointing to an unknown and therapeutically relevant molecular constellation. Interestingly, 14% of tumors revealed mutations in surface growth factor receptor genes including ALK, HER2, ERBB4, FGFR, cMET and RET, which might prove to be targetable by new therapeutic agents. 6% of tumors revealed mutations in SMAD4. In summary, our data provide novel insight into the fundamental molecular heterogeneity of salivary gland cancer, relevant in terms of tumor classification and the establishment of targeted therapeutic concepts.
Production of α1,3-galactosyltransferase and cytidine monophosphate-N-acetylneuraminic acid hydroxylase gene double-deficient pigs by CRISPR/Cas9 and handmade cloning.

PubMed

Gao, Hanchao; Zhao, Chengjiang; Xiang, Xi; Li, Yong; Zhao, Yanli; Li, Zesong; Pan, Dengke; Dai, Yifan; Hara, Hidetaka; Cooper, David K C; Cai, Zhiming; Mou, Lisha

2017-02-16

Gene-knockout pigs hold great promise as a solution to the shortage of organs from donor animals for xenotransplantation. Several groups have generated gene-knockout pigs via clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated 9 (Cas9) and somatic cell nuclear transfer (SCNT). Herein, we adopted a simple and micromanipulator-free method, handmade cloning (HMC) instead of SCNT, to generate double gene-knockout pigs. First, we applied the CRISPR/Cas9 system to target α1,3-galactosyltransferase (GGTA1) and cytidine monophosphate-N-acetylneuraminic acid hydroxylase (CMAH) genes simultaneously in porcine fetal fibroblast cells (PFFs), which were derived from wild-type Chinese domestic miniature Wuzhishan pigs. Cell colonies were obtained by screening and were identified by Surveyor assay and sequencing. Next, we chose the GGTA1/CMAH double-knockout (DKO) cells for HMC to produce piglets. As a result, we obtained 11 live bi-allelic GGTA1/CMAH DKO piglets with the identical phenotype. Compared to cells from GGTA1-knockout pigs, human antibody binding and antibody-mediated complement-dependent cytotoxicity were significantly reduced in cells from GGTA1/CMAH DKO pigs, which demonstrated that our pigs would exhibit reduced humoral rejection in xenotransplantation. These data suggested that the combination of CRISPR/Cas9 and HMC technology provided an efficient and new strategy for producing pigs with multiple genetic modifications.
The Emerging Role of Zinc in the Pathogenesis of Multiple Sclerosis.

PubMed

Choi, Bo Young; Jung, Jong Won; Suh, Sang Won

2017-09-28

Our lab has previously demonstrated that multiple sclerosis-induced spinal cord white matter damage and motor deficits are mediated by the pathological disruption of zinc homeostasis. Abnormal vesicular zinc release and intracellular zinc accumulation may mediate several steps in the pathophysiological processes of multiple sclerosis (MS), such as matrix metallopeptidase 9 (MMP-9) activation, blood-brain barrier (BBB) disruption, and subsequent immune cell infiltration from peripheral systems. Oral administration of a zinc chelator decreased BBB disruption, immune cell infiltration, and spinal white matter myelin destruction. Therefore, we hypothesized that zinc released into the extracellular space during MS progression is involved in destruction of the myelin sheath in spinal cord white matter and in generation of motor deficits. To confirm our previous study, we employed zinc transporter 3 (ZnT3) knockout mice to test whether vesicular zinc depletion shows protective effects on multiple sclerosis-induced white matter damage and motor deficits. ZnT3 gene deletion profoundly reduced the daily clinical score of experimental autoimmune encephalomyelitis (EAE) by suppression of inflammation and demyelination in the spinal cord. ZnT3 gene deletion also remarkably inhibited formation of multiple sclerosis-associated aberrant synaptic zinc patches, MMP-9 activation, and BBB disruption. These two studies strongly support our hypothesis that zinc release from presynaptic terminals may be involved in multiple sclerosis pathogenesis. Further studies will no doubt continue to add mechanistic detail to this process and, with luck, clarify how these observations may lead to development of novel therapeutic approaches for the treatment of multiple sclerosis.
Pms2 and uracil-DNA glycosylases act jointly in the mismatch repair pathway to generate Ig gene mutations at A-T base pairs.

PubMed

Girelli Zubani, Giulia; Zivojnovic, Marija; De Smet, Annie; Albagli-Curiel, Olivier; Huetz, François; Weill, Jean-Claude; Reynaud, Claude-Agnès; Storck, Sébastien

2017-04-03

During somatic hypermutation (SHM) of immunoglobulin genes, uracils introduced by activation-induced cytidine deaminase are processed by uracil-DNA glycosylase (UNG) and mismatch repair (MMR) pathways to generate mutations at G-C and A-T base pairs, respectively. Paradoxically, the MMR-nicking complex Pms2/Mlh1 is apparently dispensable for A-T mutagenesis. Thus, how detection of U:G mismatches is translated into the single-strand nick required for error-prone synthesis is an open question. One model proposed that UNG could cooperate with MMR by excising a second uracil in the vicinity of the U:G mismatch, but it failed to explain the low impact of UNG inactivation on A-T mutagenesis. In this study, we show that uracils generated in the G1 phase in B cells can generate equal proportions of A-T and G-C mutations, which suggests that UNG and MMR can operate within the same time frame during SHM. Furthermore, we show that Ung−/−Pms2−/− mice display a 50% reduction in mutations at A-T base pairs and that most remaining mutations at A-T bases depend on two additional uracil glycosylases, thymine-DNA glycosylase and SMUG1. These results demonstrate that Pms2/Mlh1 and multiple uracil glycosylases act jointly, each one with a distinct strand bias, to enlarge the immunoglobulin gene mutation spectrum from G-C to A-T bases. © 2017 Girelli Zubani et al.
Pms2 and uracil-DNA glycosylases act jointly in the mismatch repair pathway to generate Ig gene mutations at A-T base pairs

PubMed Central

De Smet, Annie; Albagli-Curiel, Olivier; Huetz, François; Weill, Jean-Claude

2017-01-01

During somatic hypermutation (SHM) of immunoglobulin genes, uracils introduced by activation-induced cytidine deaminase are processed by uracil-DNA glycosylase (UNG) and mismatch repair (MMR) pathways to generate mutations at G-C and A-T base pairs, respectively. Paradoxically, the MMR-nicking complex Pms2/Mlh1 is apparently dispensable for A-T mutagenesis. Thus, how detection of U:G mismatches is translated into the single-strand nick required for error-prone synthesis is an open question. One model proposed that UNG could cooperate with MMR by excising a second uracil in the vicinity of the U:G mismatch, but it failed to explain the low impact of UNG inactivation on A-T mutagenesis. In this study, we show that uracils generated in the G1 phase in B cells can generate equal proportions of A-T and G-C mutations, which suggests that UNG and MMR can operate within the same time frame during SHM. Furthermore, we show that Ung−/−Pms2−/− mice display a 50% reduction in mutations at A-T base pairs and that most remaining mutations at A-T bases depend on two additional uracil glycosylases, thymine-DNA glycosylase and SMUG1. These results demonstrate that Pms2/Mlh1 and multiple uracil glycosylases act jointly, each one with a distinct strand bias, to enlarge the immunoglobulin gene mutation spectrum from G-C to A-T bases. PMID:28283534

Transcriptome and methylome profiling reveals relics of genome dominance in the mesopolyploid Brassica oleracea

PubMed Central

2014-01-01

Background Brassica oleracea is a valuable vegetable species that has contributed to human health and nutrition for hundreds of years and comprises multiple distinct cultivar groups with diverse morphological and phytochemical attributes. In addition to this phenotypic wealth, B. oleracea offers unique insights into polyploid evolution, as it results from multiple ancestral polyploidy events and a final Brassiceae-specific triplication event. Further, B. oleracea represents one of the diploid genomes that formed the economically important allopolyploid oilseed, Brassica napus. A deeper understanding of B. oleracea genome architecture provides a foundation for crop improvement strategies throughout the Brassica genus. Results We generate an assembly representing 75% of the predicted B. oleracea genome using a hybrid Illumina/Roche 454 approach. Two dense genetic maps are generated to anchor almost 92% of the assembled scaffolds to nine pseudo-chromosomes. Over 50,000 genes are annotated and 40% of the genome predicted to be repetitive, thus contributing to the increased genome size of B. oleracea compared to its close relative B. rapa. A snapshot of both the leaf transcriptome and methylome allows comparisons to be made across the triplicated sub-genomes, which resulted from the most recent Brassiceae-specific polyploidy event. Conclusions Differential expression of the triplicated syntelogs and cytosine methylation levels across the sub-genomes suggest residual marks of the genome dominance that led to the current genome architecture. Although cytosine methylation does not correlate with individual gene dominance, the independent methylation patterns of triplicated copies suggest epigenetic mechanisms play a role in the functional diversification of duplicate genes. PMID:24916971
Reliable pre-eclampsia pathways based on multiple independent microarray data sets.

PubMed

Kawasaki, Kaoru; Kondoh, Eiji; Chigusa, Yoshitsugu; Ujita, Mari; Murakami, Ryusuke; Mogami, Haruta; Brown, J B; Okuno, Yasushi; Konishi, Ikuo

2015-02-01

Pre-eclampsia is a multifactorial disorder characterized by heterogeneous clinical manifestations. Gene expression profiling studies of preeclamptic placentas have provided different and even opposite results, partly due to data compromised by various experimental artefacts. Here we aimed to identify reliable pre-eclampsia-specific pathways using multiple independent microarray data sets. Gene expression data of control and preeclamptic placentas were obtained from Gene Expression Omnibus. Single-sample gene-set enrichment analysis was performed to generate gene-set activation scores of 9707 pathways obtained from the Molecular Signatures Database. Candidate pathways were identified by t-test-based screening using the data sets GSE10588, GSE14722 and GSE25906. Additionally, recursive feature elimination was applied to arrive at a further reduced set of pathways. To assess the validity of the pre-eclampsia pathways, a statistically-validated protocol was executed using five data sets, including two other independent validation data sets, GSE30186 and GSE44711. Quantitative real-time PCR was performed for genes in a panel of potential pre-eclampsia pathways using placentas of 20 women with normal or severe preeclamptic singleton pregnancies (n = 10, respectively). A panel of ten pathways was found to discriminate women with pre-eclampsia from controls with high accuracy. Among these were pathways not previously associated with pre-eclampsia, such as the GABA receptor pathway, as well as pathways that have already been linked to pre-eclampsia, such as the glutathione and CDKN1C pathways. mRNA expression of GABRA3 (GABA receptor pathway), GCLC and GCLM (glutathione metabolic pathway), and CDKN1C was significantly reduced in the preeclamptic placentas. In conclusion, ten accurate and reliable pre-eclampsia pathways were identified based on multiple independent microarray data sets. A pathway-based classification may be a worthwhile approach to elucidate the pathogenesis of pre-eclampsia. © The Author 2014. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Generation of blue chrysanthemums by anthocyanin B-ring hydroxylation and glucosylation and its coloration mechanism.

PubMed

Noda, Naonobu; Yoshioka, Satoshi; Kishimoto, Sanae; Nakayama, Masayoshi; Douzono, Mitsuru; Tanaka, Yoshikazu; Aida, Ryutaro

2017-07-01

Various colored cultivars of ornamental flowers have been bred by hybridization and mutation breeding; however, the generation of blue flowers for major cut flower plants, such as roses, chrysanthemums, and carnations, has not been achieved by conventional breeding or genetic engineering. Most blue-hued flowers contain delphinidin-based anthocyanins; therefore, delphinidin-producing carnation, rose, and chrysanthemum flowers have been generated by overexpression of the gene encoding flavonoid 3',5'-hydroxylase (F3'5'H), the key enzyme for delphinidin biosynthesis. Even so, the flowers are purple/violet rather than blue. To generate true blue flowers, blue pigments, such as polyacylated anthocyanins and metal complexes, must be introduced by metabolic engineering; however, introducing and controlling multiple transgenes in plants are complicated processes. We succeeded in generating blue chrysanthemum flowers by introduction of butterfly pea UDP (uridine diphosphate)-glucose:anthocyanin 3',5'- O -glucosyltransferase gene, in addition to the expression of the Canterbury bells F3'5'H . Newly synthesized 3',5'-diglucosylated delphinidin-based anthocyanins exhibited a violet color under the weakly acidic pH conditions of flower petal juice and showed a blue color only through intermolecular association, termed "copigmentation," with flavone glucosides in planta. Thus, we achieved the development of blue color by a two-step modification of the anthocyanin structure. This simple method is a promising approach to generate blue flowers in various ornamental plants by metabolic engineering.
GeneNetFinder2: Improved Inference of Dynamic Gene Regulatory Relations with Multiple Regulators.

PubMed

Han, Kyungsook; Lee, Jeonghoon

2016-01-01

A gene involved in complex regulatory interactions may have multiple regulators since gene expression in such interactions is often controlled by more than one gene. Another thing that makes gene regulatory interactions complicated is that regulatory interactions are not static, but change over time during the cell cycle. Most research so far has focused on identifying gene regulatory relations between individual genes in a particular stage of the cell cycle. In this study we developed a method for identifying dynamic gene regulations of several types from the time-series gene expression data. The method can find gene regulations with multiple regulators that work in combination or individually as well as those with single regulators. The method has been implemented as the second version of GeneNetFinder (hereafter called GeneNetFinder2) and tested on several gene expression datasets. Experimental results with gene expression data revealed the existence of genes that are not regulated by individual genes but rather by a combination of several genes. Such gene regulatory relations cannot be found by conventional methods. Our method finds such regulatory relations as well as those with multiple, independent regulators or single regulators, and represents gene regulatory relations as a dynamic network in which different gene regulatory relations are shown in different stages of the cell cycle. GeneNetFinder2 is available at http://bclab.inha.ac.kr/GeneNetFinder and will be useful for modeling dynamic gene regulations with multiple regulators.
Using Drosophila melanogaster as a Model for Genotoxic Chemical Mutational Studies with a New Program, SnpSift

PubMed Central

Cingolani, Pablo; Patel, Viral M.; Coon, Melissa; Nguyen, Tung; Land, Susan J.; Ruden, Douglas M.; Lu, Xiangyi

2012-01-01

This paper describes a new program, SnpSift, for filtering differential DNA sequence variants between two or more experimental genomes after genotoxic chemical exposure. Here, we illustrate how SnpSift can be used to identify candidate phenotype-relevant variants including single nucleotide polymorphisms, multiple nucleotide polymorphisms, insertions, and deletions (InDels) in mutant strains isolated from genome-wide chemical mutagenesis of Drosophila melanogaster. First, the genomes of two independently isolated mutant fly strains that are allelic for a novel recessive male-sterile locus generated by genotoxic chemical exposure were sequenced using the Illumina next-generation DNA sequencer to obtain 20- to 29-fold coverage of the euchromatic sequences. The sequencing reads were processed and variants were called using standard bioinformatic tools. Next, SnpEff was used to annotate all sequence variants and their potential mutational effects on associated genes. Then, SnpSift was used to filter and select differential variants that potentially disrupt a common gene in the two allelic mutant strains. The potential causative DNA lesions were partially validated by capillary sequencing of polymerase chain reaction-amplified DNA in the genetic interval as defined by meiotic mapping and deletions that remove defined regions of the chromosome. Of the five candidate genes located in the genetic interval, the Pka-like gene CG12069 was found to carry a separate premature stop codon mutation in each of the two allelic mutants, whereas the other four candidate genes within the interval have wild-type sequences. The Pka-like gene is therefore a strong candidate gene for the male-sterile locus. These results demonstrate that combining SnpEff and SnpSift can expedite the identification of candidate phenotype-causative mutations in chemically mutagenized Drosophila strains. This technique can also be used to characterize the variety of mutations generated by genotoxic chemicals. PMID:22435069
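The filtering logic described in this abstract (keep variants absent from the background strain, then look for a gene disrupted in both allelic mutants within the mapped interval) can be sketched in a few lines of Python. This is only an illustration of the idea, not SnpSift's interface: the tuple layout, the effect labels in HIGH_IMPACT, the function names, and all example coordinates are assumptions made here, and only the gene name CG12069 comes from the abstract.

```python
# Hypothetical sketch of differential variant filtering; not the SnpSift API.
# A variant is a tuple: (chrom, pos, ref, alt, gene, effect).

# Effect labels treated as gene-disrupting (illustrative choice).
HIGH_IMPACT = {"stop_gained", "frameshift_variant"}


def disrupted_genes(variants, background):
    """Genes carrying a high-impact variant that is absent from the background.

    background is a set of (chrom, pos, ref, alt) tuples seen in the
    unmutagenized parental strain.
    """
    return {
        v[4]
        for v in variants
        if v[5] in HIGH_IMPACT and v[:4] not in background
    }


def common_candidates(strain_a, strain_b, background, interval_genes):
    """Genes disrupted in both mutant strains and located in the mapped interval."""
    hits_a = disrupted_genes(strain_a, background)
    hits_b = disrupted_genes(strain_b, background)
    return hits_a & hits_b & set(interval_genes)


if __name__ == "__main__":
    # All coordinates and the gene "geneX" are made up for the example.
    background = {("2R", 100, "A", "G")}
    strain_a = [
        ("2R", 100, "A", "G", "geneX", "stop_gained"),    # parental, ignored
        ("2R", 500, "C", "T", "CG12069", "stop_gained"),
    ]
    strain_b = [
        ("2R", 620, "G", "A", "CG12069", "stop_gained"),  # different lesion, same gene
        ("2R", 900, "T", "C", "geneX", "missense_variant"),
    ]
    print(common_candidates(strain_a, strain_b, background, ["CG12069", "geneX"]))
```

Matching on the gene rather than on the exact variant is the design choice that matters here: two independently isolated alleles are expected to carry different lesions in the same gene.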
Detection of Epistasis for Flowering Time Using Bayesian Multilocus Estimation in a Barley MAGIC Population

PubMed Central

Mathew, Boby; Léon, Jens; Sannemann, Wiebke; Sillanpää, Mikko J.

2018-01-01

Gene-by-gene interactions, also known as epistasis, regulate many complex traits in different species. With the availability of low-cost genotyping it is now possible to study epistasis on a genome-wide scale. However, identifying genome-wide epistasis is a high-dimensional multiple regression problem and needs the application of dimensionality reduction techniques. Flowering Time (FT) in crops is a complex trait that is known to be influenced by many interacting genes and pathways in various crops. In this study, we successfully apply Sure Independence Screening (SIS) for dimensionality reduction to identify two-way and three-way epistasis for the FT trait in a Multiparent Advanced Generation Inter-Cross (MAGIC) barley population using the Bayesian multilocus model. The MAGIC barley population was generated from intercrossing among eight parental lines and thus, offered greater genetic diversity to detect higher-order epistatic interactions. Our results suggest that SIS is an efficient dimensionality reduction approach to detect high-order interactions in a Bayesian multilocus model. We also observe that many of our findings (genomic regions with main or higher-order epistatic effects) overlap with known candidate genes that have been already reported in barley and closely related species for the FT trait. PMID:29254994
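The screening step mentioned in this abstract can be illustrated with a small Python sketch. It assumes the simplest form of Sure Independence Screening: rank every candidate predictor by the absolute value of its marginal Pearson correlation with the phenotype and keep the top few. Two-way interaction candidates are built here as element-wise products of marker columns. The function names and toy data are hypothetical, and the Bayesian multilocus model that the study fits after screening is not shown.

```python
# Minimal sketch of marginal-correlation screening (SIS-style); illustrative only.
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def pairwise_interactions(markers):
    """Build two-way interaction columns as products of marker columns."""
    return {
        f"{a}*{b}": [u * v for u, v in zip(markers[a], markers[b])]
        for a, b in combinations(markers, 2)
    }


def sis_screen(columns, y, keep):
    """Return the names of the `keep` columns with the largest |correlation| with y."""
    scores = {name: abs(pearson(col, y)) for name, col in columns.items()}
    return sorted(scores, key=scores.get, reverse=True)[:keep]


if __name__ == "__main__":
    # Toy 0/1 genotypes for three hypothetical markers in eight lines.
    markers = {
        "m1": [0, 0, 1, 1, 0, 1, 1, 0],
        "m2": [0, 1, 0, 1, 1, 1, 0, 0],
        "m3": [1, 0, 1, 0, 1, 0, 0, 1],
    }
    # Phenotype driven purely by the m1-by-m2 interaction.
    y = [float(a * b) for a, b in zip(markers["m1"], markers["m2"])]
    candidates = {**markers, **pairwise_interactions(markers)}
    print(sis_screen(candidates, y, keep=2))
```

Because the number of interaction terms grows quadratically (or cubically for three-way terms) with the number of markers, a cheap marginal filter of this kind is what makes a subsequent joint model tractable.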
Acute multi-sgRNA knockdown of KEOPS complex genes reproduces the microcephaly phenotype of the stable knockout zebrafish model.

PubMed

Jobst-Schwan, Tilman; Schmidt, Johanna Magdalena; Schneider, Ronen; Hoogstraten, Charlotte A; Ullmann, Jeremy F P; Schapiro, David; Majmundar, Amar J; Kolb, Amy; Eddy, Kaitlyn; Shril, Shirlee; Braun, Daniela A; Poduri, Annapurna; Hildebrandt, Friedhelm

2018-01-01

Until recently, morpholino oligonucleotides have been widely employed in zebrafish as an acute and efficient loss-of-function assay. However, off-target effects and reproducibility issues when compared to stable knockout lines have compromised their further use. Here we employed an acute CRISPR/Cas approach using multiple single guide RNAs targeting simultaneously different positions in two exemplar genes (osgep or tprkb) to increase the likelihood of generating mutations on both alleles in the injected F0 generation and to achieve a similar effect as morpholinos but with the reproducibility of stable lines. This multi single guide RNA approach resulted in median likelihoods for at least one mutation on each allele of >99% and sgRNA specific insertion/deletion profiles as revealed by deep-sequencing. Immunoblot showed a significant reduction for Osgep and Tprkb proteins. For both genes, the acute multi-sgRNA knockout recapitulated the microcephaly phenotype and reduction in survival that we observed previously in stable knockout lines, though milder in the acute multi-sgRNA knockout. Finally, we quantify the degree of mutagenesis by deep sequencing, and provide a mathematical model to quantitate the chance for a biallelic loss-of-function mutation. Our findings can be generalized to acute and stable CRISPR/Cas targeting for any zebrafish gene of interest.
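The abstract refers to a mathematical model for the chance of a biallelic mutation but does not give it. As a rough illustration of why several guides raise that chance, the Python sketch below assumes that each sgRNA mutates a given allele independently with its own efficiency and that the two alleles behave independently. These independence assumptions, the function names, and the example efficiencies are introduced here and are not taken from the study.

```python
# Simplified independence model for multi-sgRNA targeting; illustrative only.
from math import prod


def p_allele_mutated(efficiencies):
    """Probability that one allele carries at least one mutation.

    efficiencies: per-sgRNA probabilities of mutating a given allele.
    """
    return 1.0 - prod(1.0 - p for p in efficiencies)


def p_biallelic(efficiencies):
    """Probability that both alleles each carry at least one mutation."""
    return p_allele_mutated(efficiencies) ** 2


if __name__ == "__main__":
    effs = [0.6, 0.7, 0.8, 0.9]  # hypothetical per-guide efficiencies
    for k in range(1, len(effs) + 1):
        print(k, "guides:", round(p_biallelic(effs[:k]), 4))
```

Note that this counts any mutation. A loss-of-function estimate would additionally need the fraction of indels that disrupt the reading frame, which the sketch leaves out.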
The orphan estrogen-related receptor alpha and metabolic regulation: new frontiers.

PubMed

Ranhotra, Harmit S

2015-01-01

Metabolic homeostasis during long-term adaptation in animals is primarily achieved by controlling the expression of metabolic genes by a plethora of cellular transcription factors. The nuclear receptor (NR) superfamily in eukaryotes is an assembly of diverse receptors working as transcriptional regulators of multiple genes. The orphan estrogen-related receptor alpha (ERRα) is one such receptor of the NR superfamily with significant influence on numerous metabolic and other genes. Although it is presently unknown which endogenous hormones or ligands activate ERRα, it regulates a host of genes whose products participate in various metabolic pathways. Studies over the years show new and interesting data that add to the growing knowledge on ERRα and metabolic regulation. For instance, novel findings indicate the existence of an mTOR/ERRα regulatory axis and also that ERRα controls PGC-1α expression, both of which potentially have a significant impact on cellular metabolism. Data show that ERRα exerts its metabolic control by regulating the expression of SIRT5, which influences oxygen consumption and ATP generation. Moreover, ERRα has a role in creatine and lactate uptake in skeletal muscle, which is important for energy generation and contraction. This review is focused on the new insights gained into ERRα regulation of metabolism, networks and pathways that have important consequences in maintaining metabolic homeostasis, including development of cancer.
An Optimal Bahadur-Efficient Method in Detection of Sparse Signals with Applications to Pathway Analysis in Sequencing Association Studies.

PubMed

Dai, Hongying; Wu, Guodong; Wu, Michael; Zhi, Degui

2016-01-01

Next-generation sequencing data pose a severe curse of dimensionality, complicating traditional "single marker-single trait" analysis. We propose a two-stage combined p-value method for pathway analysis. The first stage is at the gene level, where we integrate effects within a gene using the Sequence Kernel Association Test (SKAT). The second stage is at the pathway level, where we perform a correlated Lancaster procedure to detect joint effects from multiple genes within a pathway. We show that the Lancaster procedure is optimal in Bahadur efficiency among all combined p-value methods. The Bahadur efficiency,[Formula: see text], compares sample sizes among different statistical tests when signals become sparse in sequencing data, i.e. ε →0. The optimal Bahadur efficiency ensures that the Lancaster procedure asymptotically requires a minimal sample size to detect sparse signals ([Formula: see text]). The Lancaster procedure can also be applied to meta-analysis. Extensive empirical assessments of exome sequencing data show that the proposed method outperforms Gene Set Enrichment Analysis (GSEA). We applied the competitive Lancaster procedure to meta-analysis data generated by the Global Lipids Genetics Consortium to identify pathways significantly associated with high-density lipoprotein cholesterol, low-density lipoprotein cholesterol, triglycerides, and total cholesterol.
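To make the pathway-level stage concrete, here is a minimal Python sketch of combining gene-level p-values. Lancaster's procedure transforms each p-value into a chi-square quantile with its own weight as degrees of freedom and sums the results. The sketch implements only the special case in which every weight equals 2, which is Fisher's method, because the chi-square survival function then has a closed form. It also assumes independent p-values, so it is not the correlated Lancaster procedure proposed in the paper. Function names and example values are hypothetical.

```python
# Fisher's method: the equal-weight (df = 2 per test) special case of
# Lancaster's combined p-value procedure, assuming independent tests.
from math import exp, log


def chi2_sf_even(x, df):
    """Survival function P(X > x) for a chi-square variable with even df.

    Uses the closed form exp(-x/2) * sum_{j < df/2} (x/2)^j / j!.
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("df must be a positive even integer")
    half = x / 2.0
    term = 1.0
    total = 1.0
    for j in range(1, df // 2):
        term *= half / j
        total += term
    return exp(-half) * total


def fisher_combine(p_values):
    """Return (statistic, combined p-value) for independent p-values in (0, 1]."""
    stat = -2.0 * sum(log(p) for p in p_values)
    return stat, chi2_sf_even(stat, 2 * len(p_values))


if __name__ == "__main__":
    gene_level_p = [0.04, 0.20, 0.01, 0.65]  # hypothetical gene-level p-values
    stat, p_pathway = fisher_combine(gene_level_p)
    print(f"statistic = {stat:.3f}, combined p = {p_pathway:.4g}")
```

Unequal weights (for example, weighting genes by size) would require a general chi-square quantile function, which is why the sketch stops at the equal-weight case.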
NCBI GEO: archive for functional genomics data sets—10 years on

PubMed Central

Barrett, Tanya; Troup, Dennis B.; Wilhite, Stephen E.; Ledoux, Pierre; Evangelista, Carlos; Kim, Irene F.; Tomashevsky, Maxim; Marshall, Kimberly A.; Phillippy, Katherine H.; Sherman, Patti M.; Muertter, Rolf N.; Holko, Michelle; Ayanbule, Oluwabukunmi; Yefanov, Andrey; Soboleva, Alexandra

2011-01-01

A decade ago, the Gene Expression Omnibus (GEO) database was established at the National Center for Biotechnology Information (NCBI). The original objective of GEO was to serve as a public repository for high-throughput gene expression data generated mostly by microarray technology. However, the research community quickly applied microarrays to non-gene-expression studies, including examination of genome copy number variation and genome-wide profiling of DNA-binding proteins. Because the GEO database was designed with a flexible structure, it was possible to quickly adapt the repository to store these data types. More recently, as the microarray community switches to next-generation sequencing technologies, GEO has again adapted to host these data sets. Today, GEO stores over 20 000 microarray- and sequence-based functional genomics studies, and continues to handle the majority of direct high-throughput data submissions from the research community. Multiple mechanisms are provided to help users effectively search, browse, download and visualize the data at the level of individual genes or entire studies. This paper describes recent database enhancements, including new search and data representation tools, as well as a brief review of how the community uses GEO data. GEO is freely accessible at http://www.ncbi.nlm.nih.gov/geo/. PMID:21097893
Tools for neuroanatomy and neurogenetics in Drosophila

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pfeiffer, Barret D.; Jenett, Arnim; Hammonds, Ann S.

2008-08-11

We demonstrate the feasibility of generating thousands of transgenic Drosophila melanogaster lines in which the expression of an exogenous gene is reproducibly directed to distinct small subsets of cells in the adult brain. We expect the expression patterns produced by the collection of 5,000 lines that we are currently generating to encompass all neurons in the brain in a variety of intersecting patterns. Overlapping 3-kb DNA fragments from the flanking noncoding and intronic regions of genes thought to have patterned expression in the adult brain were inserted into a defined genomic location by site-specific recombination. These fragments were then assayed for their ability to function as transcriptional enhancers in conjunction with a synthetic core promoter designed to work with a wide variety of enhancer types. An analysis of 44 fragments from four genes found that >80% drive expression patterns in the brain; the observed patterns were, on average, comprised of <100 cells. Our results suggest that the D. melanogaster genome contains >50,000 enhancers and that multiple enhancers drive distinct subsets of expression of a gene in each tissue and developmental stage. We expect that these lines will be valuable tools for neuroanatomy as well as for the elucidation of neuronal circuits and information flow in the fly brain.
A Next-Generation Sequencing Strategy for Evaluating the Most Common Genetic Abnormalities in Multiple Myeloma.

PubMed

Jiménez, Cristina; Jara-Acevedo, María; Corchete, Luis A; Castillo, David; Ordóñez, Gonzalo R; Sarasquete, María E; Puig, Noemí; Martínez-López, Joaquín; Prieto-Conde, María I; García-Álvarez, María; Chillón, María C; Balanzategui, Ana; Alcoceba, Miguel; Oriol, Albert; Rosiñol, Laura; Palomera, Luis; Teruel, Ana I; Lahuerta, Juan J; Bladé, Joan; Mateos, María V; Orfão, Alberto; San Miguel, Jesús F; González, Marcos; Gutiérrez, Norma C; García-Sanz, Ramón

2017-01-01

Identification and characterization of genetic alterations are essential for diagnosis of multiple myeloma and may guide therapeutic decisions. Currently, genomic analysis of myeloma to cover the diverse range of alterations with prognostic impact requires fluorescence in situ hybridization (FISH), single nucleotide polymorphism arrays, and sequencing techniques, which are costly and labor intensive and require large numbers of plasma cells. To overcome these limitations, we designed a targeted-capture next-generation sequencing approach for one-step identification of IGH translocations, V(D)J clonal rearrangements, the IgH isotype, and somatic mutations to rapidly identify risk groups and specific targetable molecular lesions. Forty-eight newly diagnosed myeloma patients were tested with the panel, which included IGH and six genes that are recurrently mutated in myeloma: NRAS, KRAS, HRAS, TP53, MYC, and BRAF. We identified 14 of 17 IGH translocations previously detected by FISH and three confirmed translocations not detected by FISH, with the additional advantage of breakpoint identification, which can be used as a target for evaluating minimal residual disease. IgH subclass and V(D)J rearrangements were identified in 77% and 65% of patients, respectively. Mutation analysis revealed the presence of missense protein-coding alterations in at least one of the evaluating genes in 16 of 48 patients (33%). This method may represent a time- and cost-effective diagnostic method for the molecular characterization of multiple myeloma. Copyright © 2017 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Care delivery considerations for widespread and equitable implementation of inherited cancer predisposition testing

PubMed Central

Cragun, Deborah; Kinney, Anita Y; Pal, Tuya

2017-01-01

Introduction DNA sequencing advances through next-generation sequencing (NGS) and several practice-changing events have led to shifting paradigms for inherited cancer predisposition testing. These changes necessitated a means by which to maximize health benefits without unnecessarily inflating healthcare costs and exacerbating health disparities. Areas covered NGS-based tests encompass multi-gene panel tests, whole exome sequencing, and whole genome sequencing, all of which test for multiple genes simultaneously, compared to prior sequencing practices through which testing was performed sequentially for one or two genes. Taking an ecological approach, this article synthesizes the current literature to consider the broad impact of these advances at the individual patient, interpersonal, organizational, community and policy levels. Furthermore, the authors describe how multi-level factors that impact genetic testing and follow-up care reveal great potential to widen existing health disparities if these issues are not addressed. Expert Commentary As we consider ways to maximize patient benefit from testing in a cost-effective manner, it is important to consider perspectives from multiple levels. This information is needed to guide the development of interventions such that the promise of genomic testing may be realized by all populations, regardless of race, ethnicity and ability to pay. PMID:27910721
Optimized inducible shRNA and CRISPR/Cas9 platforms for in vitro studies of human development using hPSCs.

PubMed

Bertero, Alessandro; Pawlowski, Matthias; Ortmann, Daniel; Snijders, Kirsten; Yiangou, Loukia; Cardoso de Brito, Miguel; Brown, Stephanie; Bernard, William G; Cooper, James D; Giacomelli, Elisa; Gambardella, Laure; Hannan, Nicholas R F; Iyer, Dharini; Sampaziotis, Fotios; Serrano, Felipe; Zonneveld, Mariëlle C F; Sinha, Sanjay; Kotter, Mark; Vallier, Ludovic

2016-12-01

Inducible loss of gene function experiments are necessary to uncover mechanisms underlying development, physiology and disease. However, current methods are complex, lack robustness and do not work in multiple cell types. Here we address these limitations by developing single-step optimized inducible gene knockdown or knockout (sOPTiKD or sOPTiKO) platforms. These are based on genetic engineering of human genomic safe harbors combined with an improved tetracycline-inducible system and CRISPR/Cas9 technology. We exemplify the efficacy of these methods in human pluripotent stem cells (hPSCs), and show that generation of sOPTiKD/KO hPSCs is simple, rapid and allows tightly controlled individual or multiplexed gene knockdown or knockout in hPSCs and in a wide variety of differentiated cells. Finally, we illustrate the general applicability of this approach by investigating the function of transcription factors (OCT4 and T), cell cycle regulators (cyclin D family members) and epigenetic modifiers (DPY30). Overall, sOPTiKD and sOPTiKO provide a unique opportunity for functional analyses in multiple cell types relevant for the study of human development. © 2016. Published by The Company of Biologists Ltd.
Feeding-Related Traits Are Affected by Dosage of the foraging Gene in Drosophila melanogaster

PubMed Central

Allen, Aaron M.; Anreiter, Ina; Neville, Megan C.; Sokolowski, Marla B.

2017-01-01

Nutrient acquisition and energy storage are critical parts of achieving metabolic homeostasis. The foraging gene in Drosophila melanogaster has previously been implicated in multiple feeding-related and metabolic traits. Before foraging’s functions can be further dissected, we need a precise genetic null mutant to definitively map its amorphic phenotypes. We used homologous recombination to precisely delete foraging, generating the for0 null allele, and used recombineering to reintegrate a full copy of the gene, generating the {forBAC} rescue allele. We show that a total loss of foraging expression in larvae results in reduced larval path length and food intake behavior, while conversely showing an increase in triglyceride levels. Furthermore, varying foraging gene dosage demonstrates a linear dose-response on these phenotypes in relation to foraging gene expression levels. These experiments have unequivocally proven a causal, dose-dependent relationship between the foraging gene and its pleiotropic influence on these feeding-related traits. Our analysis of foraging’s transcription start sites, termination sites, and splicing patterns using rapid amplification of cDNA ends (RACE) and full-length cDNA sequencing, revealed four independent promoters, pr1–4, that produce 21 transcripts with nine distinct open reading frames (ORFs). The use of alternative promoters and alternative splicing at the foraging locus creates diversity and flexibility in the regulation of gene expression, and ultimately function. Future studies will exploit these genetic tools to precisely dissect the isoform- and tissue-specific requirements of foraging’s functions and shed light on the genetic control of feeding-related traits involved in energy homeostasis. PMID:28007892
Penicillin production in industrial strain Penicillium chrysogenum P2niaD18 is not dependent on the copy number of biosynthesis genes.

PubMed

Ziemons, Sandra; Koutsantas, Katerina; Becker, Kordula; Dahlmann, Tim; Kück, Ulrich

2017-02-16

Multi-copy gene integration into microbial genomes is a conventional tool for obtaining improved gene expression. For Penicillium chrysogenum, the fungal producer of the beta-lactam antibiotic penicillin, many production strains carry multiple copies of the penicillin biosynthesis gene cluster. This discovery led to the generally accepted view that high penicillin titers are the result of multiple copies of penicillin genes. Here we investigated strain P2niaD18, a production line that carries only two copies of the penicillin gene cluster. We performed pulsed-field gel electrophoresis (PFGE), quantitative RT-PCR (qRT-PCR), and penicillin bioassays to investigate production, deletion and overexpression strains generated in the P. chrysogenum P2niaD18 background, in order to determine the copy number of the penicillin biosynthesis gene cluster and to study the expression of one penicillin biosynthesis gene and the penicillin titer. Analysis of production and recombinant strains showed that the enhanced penicillin titer did not depend on the copy number of the penicillin gene cluster. Our assumption was strengthened by results with a penicillin null strain lacking pcbC, which encodes isopenicillin N synthase. Reintroduction of one or two copies of the cluster into the pcbC deletion strain restored high transcriptional expression of the pcbC gene, but recombinant strains showed no significantly different penicillin titer compared to parental strains. Here we present a molecular genetic analysis of production and recombinant strains in the P2niaD18 background carrying different copy numbers of the penicillin biosynthesis gene cluster. Our analysis shows that the enhanced penicillin titer does not strictly depend on the copy number of the cluster. Based on these overall findings, we hypothesize that instead, complex regulatory mechanisms are prominently implicated in increased penicillin biosynthesis in production strains.
The Cancer Target Discovery and Development Network Dashboard Allows Users to Search for Interesting Data and Results | Office of Cancer Genomics

Cancer.gov

The CTD2 Dashboard hosts analyzed data and other evidence generated by the CTD2 Network. It is a web interface for the research community to browse and search CTD2 Network data related to genes, proteins, and compounds from individual CTD2 Centers, or explore observations across multiple Centers.
Temperature Sensitivity Conferred by ligA Alleles from Psychrophilic Bacteria upon Substitution in Mesophilic Bacteria and a Yeast Species

PubMed Central

Pankowski, Jarosław A.; Puckett, Stephanie M.

2016-01-01

We have assembled a collection of 13 psychrophilic ligA alleles that can serve as genetic elements for engineering mesophiles to a temperature-sensitive (TS) phenotype. When these ligA alleles were substituted into Francisella novicida, they conferred a TS phenotype with restrictive temperatures between 33 and 39°C. When the F. novicida ligA hybrid strains were plated above their restrictive temperatures, eight of them generated temperature-resistant variants. For two alleles, the mutations that led to temperature resistance clustered near the 5′ end of the gene, and the mutations increased the predicted strength of the ribosome binding site at least 3-fold. Four F. novicida ligA hybrid strains generated no temperature-resistant variants at a detectable level. These results suggest that multiple mutations are needed to create temperature-resistant variants of these ligA gene products. One ligA allele was isolated from a Colwellia species that has a maximal growth temperature of 12°C, and this allele supported growth of F. novicida only as a hybrid between the psychrophilic and the F. novicida ligA genes. However, the full psychrophilic gene alone supported the growth of Salmonella enterica, imparting a restrictive temperature of 27°C. We also tested two ligA alleles from two Pseudoalteromonas strains for their ability to support the viability of a Saccharomyces cerevisiae strain that lacked its essential gene, CDC9, encoding an ATP-dependent DNA ligase. In both cases, the psychrophilic bacterial alleles supported yeast viability and their expression generated TS phenotypes. This collection of ligA alleles should be useful in engineering bacteria, and possibly eukaryotic microbes, to predictable TS phenotypes. PMID:26773080
Dual control of pcdh8l/PCNS expression and function in Xenopus laevis neural crest cells by adam13/33 via the transcription factors tfap2α and arid3a.

PubMed

Khedgikar, Vikram; Abbruzzese, Genevieve; Mathavan, Ketan; Szydlo, Hannah; Cousin, Helene; Alfandari, Dominique

2017-08-22

Adam13/33 is a cell surface metalloprotease critical for cranial neural crest (CNC) cell migration. It can cleave multiple substrates including itself, fibronectin, ephrinB, cadherin-11, pcdh8 and pcdh8l (this work). Cleavage of cadherin-11 produces an extracellular fragment that promotes CNC migration. In addition, the adam13 cytoplasmic domain is cleaved by gamma secretase, translocates into the nucleus and regulates multiple genes. Here, we show that adam13 interacts with the arid3a/dril1/Bright transcription factor. This interaction promotes a proteolytic cleavage of arid3a and its translocation to the nucleus where it regulates another transcription factor: tfap2α. Tfap2α in turn activates multiple genes including the protocadherin pcdh8l (PCNS). The proteolytic activity of adam13 is critical for the release of arid3a from the plasma membrane while the cytoplasmic domain appears critical for the cleavage of arid3a. In addition to this transcriptional control of pcdh8l, adam13 cleaves pcdh8l generating an extracellular fragment that also regulates cell migration.
Dual control of pcdh8l/PCNS expression and function in Xenopus laevis neural crest cells by adam13/33 via the transcription factors tfap2α and arid3a

PubMed Central

Khedgikar, Vikram; Abbruzzese, Genevieve; Mathavan, Ketan; Szydlo, Hannah; Cousin, Helene

2017-01-01

Adam13/33 is a cell surface metalloprotease critical for cranial neural crest (CNC) cell migration. It can cleave multiple substrates including itself, fibronectin, ephrinB, cadherin-11, pcdh8 and pcdh8l (this work). Cleavage of cadherin-11 produces an extracellular fragment that promotes CNC migration. In addition, the adam13 cytoplasmic domain is cleaved by gamma secretase, translocates into the nucleus and regulates multiple genes. Here, we show that adam13 interacts with the arid3a/dril1/Bright transcription factor. This interaction promotes a proteolytic cleavage of arid3a and its translocation to the nucleus where it regulates another transcription factor: tfap2α. Tfap2α in turn activates multiple genes including the protocadherin pcdh8l (PCNS). The proteolytic activity of adam13 is critical for the release of arid3a from the plasma membrane while the cytoplasmic domain appears critical for the cleavage of arid3a. In addition to this transcriptional control of pcdh8l, adam13 cleaves pcdh8l generating an extracellular fragment that also regulates cell migration. PMID:28829038

Principal network analysis: identification of subnetworks representing major dynamics using gene expression data

PubMed Central

Kim, Yongsoo; Kim, Taek-Kyun; Kim, Yungu; Yoo, Jiho; You, Sungyong; Lee, Inyoul; Carlson, George; Hood, Leroy; Choi, Seungjin; Hwang, Daehee

2011-01-01

Motivation: Systems biology attempts to describe complex systems behaviors in terms of dynamic operations of biological networks. However, there is a lack of tools that can effectively decode complex network dynamics over multiple conditions. Results: We present principal network analysis (PNA) that can automatically capture major dynamic activation patterns over multiple conditions and then generate protein and metabolic subnetworks for the captured patterns. We first demonstrated the utility of this method by applying it to a synthetic dataset. The results showed that PNA correctly captured the subnetworks representing dynamics in the data. We further applied PNA to two time-course gene expression profiles collected from (i) MCF7 cells after treatments of HRG at multiple doses and (ii) brain samples of four strains of mice infected with two prion strains. The resulting subnetworks and their interactions revealed network dynamics associated with HRG dose-dependent regulation of cell proliferation and differentiation and early PrPSc accumulation during prion infection. Availability: The web-based software is available at: http://sbm.postech.ac.kr/pna. Contact: dhhwang@postech.ac.kr; seungjin@postech.ac.kr Supplementary information: Supplementary data are available at Bioinformatics online. PMID:21193522
10-Oxo-trans-11-octadecenoic acid generated from linoleic acid by a gut lactic acid bacterium Lactobacillus plantarum is cytoprotective against oxidative stress

DOE Office of Scientific and Technical Information (OSTI.GOV)

Furumoto, Hidehiro; Nanthirudjanar, Tharnath; Kume, Toshiaki

Oxidative stress is a well-known cause of multiple diseases. The nuclear factor erythroid 2-related factor 2 (Nrf2)-antioxidant response element (ARE) pathway plays a central role in cellular antioxidative responses. In this study, we investigated the effects of novel fatty acid metabolite derivatives of linoleic acid generated by the gut lactic acid bacteria Lactobacillus plantarum on the Nrf2-ARE pathway. 10-Oxo-trans-11-octadecenoic acid (KetoC) protected HepG2 cells from cytotoxicity induced by hydrogen peroxide. KetoC also significantly increased cellular Nrf2 protein levels, ARE-dependent transcription, and the gene expression of antioxidative enzymes such as heme oxygenase-1 (HO-1), glutamate-cysteine ligase modifier subunit (GCLM), and NAD(P)H:quinone oxidoreductase 1 (NQO1) in HepG2 cells. Additionally, a single oral dose administration of KetoC also increased antioxidative gene expression and protein levels of Nrf2 and HO-1 in mouse organs. Since other fatty acid metabolites and linoleic acid did not affect cellular antioxidative responses, the cytoprotective effect of KetoC may be because of its α,β-unsaturated carbonyl moiety. Collectively, our data suggested that KetoC activated the Nrf2-ARE pathway to enhance cellular antioxidative responses in vitro and in vivo, which further suggests that KetoC may prevent multiple diseases induced by oxidative stress. - Highlights: • We evaluated the effect of modified fatty acids generated by Lactobacillus plantarum. • 10-Oxo-trans-11-octadecenoic acid (KetoC) protected cells from oxidative stress. • KetoC activated the Nrf2-ARE pathway to promote antioxidative gene expression. • KetoC promoted the expression of antioxidative enzymes in mice organs. • The cytoprotective effect of KetoC was because of α,β-unsaturated carbonyl moiety.
Development of an expression plasmid and its use in genetic manipulation of Lingzhi or Reishi medicinal mushroom, Ganoderma lucidum (higher Basidiomycetes).

PubMed

Yu, Xuya; Ji, Sen-Lin; He, Yi-Long; Ren, Meng-Fei; Xu, Jun-Wei

2014-01-01

We report the construction of a plasmid, pJW-EXP, designed for the expression of homologous and heterologous genes in Ganoderma lucidum. pJW-EXP was generated from the plasmid pMD19-T by inserting the G. lucidum glyceraldehyde-3-phosphate dehydrogenase gene promoter, the G. lucidum iron-sulfur protein subunit of succinate dehydrogenase gene terminator and the homologous carboxin-resistance gene as selection marker. This expression plasmid can be efficiently transformed into Ganoderma through polyethylene glycol-mediated protoplast transformation. Southern blot analysis showed that most of the integrated DNA appeared as multiple copies in the genome. The applicability of the constructed plasmid was tested by expression of the truncated G. lucidum 3-hydroxy-3-methylglutaryl coenzyme A reductase (HMGR) gene that encodes the catalytic domain of HMGR. Overexpression of the truncated HMGR gene, which is a key gene in the biosynthetic pathway of the antitumor compounds, ganoderic acids, increased the transcription of the HMGR gene and enhanced ganoderic acid accumulation. pJW-EXP can serve as a useful tool in the genetic improvement and metabolic engineering of Ganoderma.
The Chloroplast atpA Gene Cluster in Chlamydomonas reinhardtii

PubMed Central

Drapier, Dominique; Suzuki, Hideki; Levy, Haim; Rimbault, Blandine; Kindle, Karen L.; Stern, David B.; Wollman, Francis-André

1998-01-01

Most chloroplast genes in vascular plants are organized into polycistronic transcription units, which generate a complex pattern of mono-, di-, and polycistronic transcripts. In contrast, most Chlamydomonas reinhardtii chloroplast transcripts characterized to date have been monocistronic. This paper describes the atpA gene cluster in the C. reinhardtii chloroplast genome, which includes the atpA, psbI, cemA, and atpH genes, encoding the α-subunit of the coupling-factor-1 (CF1) ATP synthase, a small photosystem II polypeptide, a chloroplast envelope membrane protein, and subunit III of the CF0 ATP synthase, respectively. We show that promoters precede the atpA, psbI, and atpH genes, but not the cemA gene, and that cemA mRNA is present only as part of di-, tri-, or tetracistronic transcripts. Deletions introduced into the gene cluster reveal, first, that CF1-α can be translated from di- or polycistronic transcripts, and, second, that substantial reductions in mRNA quantity have minimal effects on protein synthesis rates. We suggest that posttranscriptional mRNA processing is common in C. reinhardtii chloroplasts, permitting the expression of multiple genes from a single promoter. PMID:9625716
Gene Ontology Consortium: going forward

PubMed Central

2015-01-01

The Gene Ontology (GO; http://www.geneontology.org) is a community-based bioinformatics resource that supplies information about gene product function using ontologies to represent biological knowledge. Here we describe improvements and expansions to several branches of the ontology, as well as updates that have allowed us to more efficiently disseminate the GO and capture feedback from the research community. The Gene Ontology Consortium (GOC) has expanded areas of the ontology such as cilia-related terms, cell-cycle terms and multicellular organism processes. We have also implemented new tools for generating ontology terms based on a set of logical rules making use of templates, and we have made efforts to increase our use of logical definitions. The GOC has a new and improved web site summarizing new developments and documentation, serving as a portal to GO data. Users can perform GO enrichment analysis, and search the GO for terms, annotations to gene products, and associated metadata across multiple species using the all-new AmiGO 2 browser. We encourage and welcome the input of the research community in all biological areas in our continued effort to improve the Gene Ontology. PMID:25428369
Exome sequencing of a large family identifies potential candidate genes contributing risk to bipolar disorder.

PubMed

Zhang, Tianxiao; Hou, Liping; Chen, David T; McMahon, Francis J; Wang, Jen-Chyong; Rice, John P

2018-03-01

Bipolar disorder is a mental illness with lifetime prevalence of about 1%. Previous genetic studies have identified multiple chromosomal linkage regions and candidate genes that might be associated with bipolar disorder. The present study aimed to identify potential susceptibility variants for bipolar disorder using 6 related case samples from a four-generation family. A combination of exome sequencing and linkage analysis was performed to identify potential susceptibility variants for bipolar disorder. Our study identified a list of five potential candidate genes for bipolar disorder. Among these five genes, GRID1 (Glutamate Receptor Delta-1 Subunit), which was previously reported to be associated with several psychiatric disorders and brain related traits, is particularly interesting. Variants with functional significance in this gene were identified from two cousins in our bipolar disorder pedigree. Our findings suggest a potential role for these genes and the related rare variants in the onset and development of bipolar disorder in this one family. Additional research is needed to replicate these findings and evaluate their patho-biological significance. Copyright © 2017 Elsevier B.V. All rights reserved.
5S rRNA Promoter for Guide RNA Expression Enabled Highly Efficient CRISPR/Cas9 Genome Editing in Aspergillus niger.

PubMed

Zheng, Xiaomei; Zheng, Ping; Zhang, Kun; Cairns, Timothy C; Meyer, Vera; Sun, Jibin; Ma, Yanhe

2018-04-30

The CRISPR/Cas9 system is a revolutionary genome editing tool. However, in eukaryotes, search and optimization of a suitable promoter for guide RNA expression is a significant technical challenge. Here we used the industrially important fungus, Aspergillus niger, to demonstrate that the 5S rRNA gene, which is both highly conserved and efficiently expressed in eukaryotes, can be used as a guide RNA promoter. The gene editing system was established with 100% rates of precision gene modifications among dozens of transformants using short (40-bp) homologous donor DNA. This system was also applicable for generation of designer chromosomes, as evidenced by deletion of a 48 kb gene cluster required for biosynthesis of the mycotoxin fumonisin B1. Moreover, this system also facilitated simultaneous mutagenesis of multiple genes in A. niger. We anticipate that the use of the 5S rRNA gene as guide RNA promoter can broadly be applied for engineering highly efficient eukaryotic CRISPR/Cas9 toolkits. Additionally, the system reported here will enable development of designer chromosomes in model and industrially important fungi.
Gene expression analysis of flax seed development

PubMed Central

2011-01-01

Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages), seed coats (globular and torpedo stages) and endosperm (pooled globular to torpedo stages) and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST) (GenBank accessions LIBEST_026995 to LIBEST_027011) were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152) had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development.
Conclusions We have developed a foundational database of expressed sequences and collection of plasmid clones that comprise even low-expressed genes such as those encoding transcription factors. This has allowed us to delineate the spatio-temporal aspects of gene expression underlying the biosynthesis of a number of important seed constituents in flax. Flax belongs to a taxonomic group of diverse plants and the large sequence database will allow for evolutionary studies as well. PMID:21529361
Action of multiple intra-QTL genes concerted around a co-localized transcription factor underpins a large effect QTL

PubMed Central

Dixit, Shalabh; Kumar Biswal, Akshaya; Min, Aye; Henry, Amelia; Oane, Rowena H.; Raorane, Manish L.; Longkumer, Toshisangba; Pabuayon, Isaiah M.; Mutte, Sumanth K.; Vardarajan, Adithi R.; Miro, Berta; Govindan, Ganesan; Albano-Enriquez, Blesilda; Pueffeld, Mandy; Sreenivasulu, Nese; Slamet-Loedin, Inez; Sundarvelpandian, Kalaipandian; Tsai, Yuan-Ching; Raghuvanshi, Saurabh; Hsing, Yue-Ie C.; Kumar, Arvind; Kohli, Ajay

2015-01-01

Sub-QTLs and multiple intra-QTL genes are hypothesized to underpin large-effect QTLs. Known QTLs over gene families, biosynthetic pathways or certain traits represent functional gene-clusters of genes of the same gene ontology (GO). Gene-clusters containing genes of different GO have not been elaborated, except in silico as coexpressed genes within QTLs. Here we demonstrate the requirement of multiple intra-QTL genes for the full impact of QTL qDTY12.1 on rice yield under drought. Multiple lines of evidence are presented for the need of the transcription factor ‘no apical meristem’ (OsNAM12.1) and its co-localized target genes of separate GO categories for qDTY12.1 function, raising a regulon-like model of genetic architecture. The molecular underpinnings of qDTY12.1 support its effectiveness in further improving a drought tolerant genotype and for its validity in multiple genotypes/ecosystems/environments. Resolving the combinatorial value of OsNAM12.1 with individual intra-QTL genes notwithstanding, identification and analyses of qDTY12.1 has fast-tracked rice improvement towards food security. PMID:26507552
Properties of a herpes simplex virus multiple immediate-early gene-deleted recombinant as a vaccine vector

DOE Office of Scientific and Technical Information (OSTI.GOV)

Watanabe, Daisuke; Brockman, Mark A.; Ndung'u, Thumbi

2007-01-20

Herpes simplex virus (HSV) recombinants induce durable immune responses in rhesus macaques and mice and have induced partial protection in rhesus macaques against mucosal challenge with virulent simian immunodeficiency virus (SIV). In this study, we evaluated the properties of a new generation HSV vaccine vector, an HSV-1 multiple immediate-early (IE) gene deletion mutant virus, d106, which contains deletions in the ICP4, ICP27, ICP22, and ICP47 genes. Because several of the HSV IE genes have been implicated in immune evasion, inactivation of the genes encoding these proteins was expected to result in enhanced immunogenicity. The d106 virus expresses few HSV gene products and shows minimal cytopathic effect in cultured cells. When d106 was inoculated into mice, viral DNA accumulated at high levels in draining lymph nodes, consistent with an ability to transduce dendritic cells and activate their maturation and movement to lymph nodes. A d106 recombinant expressing Escherichia coli β-galactosidase induced durable β-gal-specific IgG and CD8+ T cell responses in naive and HSV-immune mice. Finally, d106-based recombinants have been constructed that express simian immunodeficiency virus (SIV) gag, env, or a rev-tat-nef fusion protein for several days in cultured cells. Thus, d106 shows many of the properties desirable in a vaccine vector: limited expression of HSV gene products and cytopathogenicity, high level expression of transgenes, ability to induce durable immune responses, and an ability to transduce dendritic cells and induce their maturation and migration to lymph nodes.
Investigation of terpene diversification across multiple sequenced plant genomes

PubMed Central

Boutanaev, Alexander M.; Moses, Tessa; Zi, Jiachen; Nelson, David R.; Mugford, Sam T.; Peters, Reuben J.; Osbourn, Anne

2015-01-01

Plants produce an array of specialized metabolites, including chemicals that are important as medicines, flavors, fragrances, pigments and insecticides. The vast majority of this metabolic diversity is untapped. Here we take a systematic approach toward dissecting genetic components of plant specialized metabolism. Focusing on the terpenes, the largest class of plant natural products, we investigate the basis of terpene diversity through analysis of multiple sequenced plant genomes. The primary drivers of terpene diversification are terpenoid synthase (TS) “signature” enzymes (which generate scaffold diversity), and cytochromes P450 (CYPs), which modify and further diversify these scaffolds, so paving the way for further downstream modifications. Our systematic search of sequenced plant genomes for all TS and CYP genes reveals that distinct TS/CYP gene pairs are found together far more commonly than would be expected by chance, and that certain TS/CYP pairings predominate, providing signals for key events that are likely to have shaped terpene diversity. We recover TS/CYP gene pairs for previously characterized terpene metabolic gene clusters and demonstrate new functional pairing of TSs and CYPs within previously uncharacterized clusters. Unexpectedly, we find evidence for different mechanisms of pathway assembly in eudicots and monocots; in the former, microsyntenic blocks of TS/CYP gene pairs duplicate and provide templates for the evolution of new pathways, whereas in the latter, new pathways arise by mixing and matching of individual TS and CYP genes through dynamic genome rearrangements. This is, to our knowledge, the first documented observation of the unique pattern of TS and CYP assembly in eudicots and monocots. PMID:25502595
Functionally Structured Genomes in Lactobacillus kunkeei Colonizing the Honey Crop and Food Products of Honeybees and Stingless Bees

PubMed Central

Tamarit, Daniel; Ellegaard, Kirsten M.; Wikander, Johan; Olofsson, Tobias; Vásquez, Alejandra; Andersson, Siv G.E.

2015-01-01

Lactobacillus kunkeei is the most abundant bacterial species in the honey crop and food products of honeybees. The 16S rRNA genes of strains isolated from different bee species are nearly identical in sequence and therefore inadequate as markers for studies of coevolutionary patterns. Here, we have compared the 1.5 Mb genomes of ten L. kunkeei strains isolated from all recognized Apis species and another two strains from Meliponini species. A gene flux analysis, including previously sequenced Lactobacillus species as outgroups, indicated the influence of reductive evolution. The genome architecture is unique in that vertically inherited core genes are located near the terminus of replication, whereas genes for secreted proteins and putative host-adaptive traits are located near the origin of replication. We suggest that these features have resulted from a genome-wide loss of genes, with integrations of novel genes mostly occurring in regions flanking the origin of replication. The phylogenetic analyses showed that the bacterial topology was incongruent with the host topology, and that strains of the same microcluster have recombined frequently across the host species barriers, arguing against codiversification. Multiple genotypes were recovered in the individual hosts and transfers of mobile elements could be demonstrated for strains isolated from the same host species. Unlike other bacteria with small genomes, short generation times and multiple rRNA operons suggest that L. kunkeei evolves under selection for rapid growth in its natural growth habitat. The results provide an extended framework for reductive genome evolution and functional genome organization in bacteria. PMID:25953738
Comprehensive Analysis of Interaction Networks of Telomerase Reverse Transcriptase with Multiple Bioinformatic Approaches: Deep Mining the Potential Functions of Telomere and Telomerase.

PubMed

Hou, Chunyu; Wang, Fei; Liu, Xuewen; Chang, Guangming; Wang, Feng; Geng, Xin

2017-08-01

Telomerase reverse transcriptase (TERT) is the protein component of telomerase complex. Evidence has accumulated showing that the nontelomeric functions of TERT are independent of telomere elongation. However, the mechanisms governing the interaction between TERT and its target genes are not clearly revealed. The biological functions of TERT are not fully elucidated and have thus far been underestimated. To further explore these functions, we investigated TERT interaction networks using multiple bioinformatic databases, including BioGRID, STRING, DAVID, GeneCards, GeneMANIA, PANTHER, miRWalk, mirTarBase, miRNet, miRDB, and TargetScan. In addition, network diagrams were built using Cytoscape software. As competing endogenous RNAs (ceRNAs) are endogenous transcripts that compete for the binding of microRNAs (miRNAs) by using shared miRNA recognition elements, they are involved in creating widespread regulatory networks. Therefore, the ceRNA regulatory networks of TERT were also investigated in this study. Interestingly, we found that the three genes PABPC1, SLC7A11, and TP53 were present in both TERT interaction networks and ceRNAs target genes. It was predicted that TERT might play nontelomeric roles in the generation or development of some rare diseases, such as Rift Valley fever and dyscalculia. Thus, our data will help to decipher the interaction networks of TERT and reveal the unknown functions of telomerase in cancer and aging-related diseases.
Multiple homologous genes knockout (KO) by CRISPR/Cas9 system in rabbit.

PubMed

Liu, Huan; Sui, Tingting; Liu, Di; Liu, Tingjun; Chen, Mao; Deng, Jichao; Xu, Yuanyuan; Li, Zhanjun

2018-03-20

The CRISPR/Cas9 system is a highly efficient and convenient genome editing tool, which has been widely used for single or multiple gene mutation in a variety of organisms. Disruption of multiple homologous genes, which have similar DNA sequences and gene function, is required for the study of the desired phenotype. In this study, to test whether the CRISPR/Cas9 system works on the mutation of multiple homologous genes, a single guide RNA (sgRNA) targeting three fucosyltransferase-encoding genes (FUT1, FUT2 and SEC1) was designed. As expected, triple gene mutation of FUT1, FUT2 and SEC1 could be achieved simultaneously via a sgRNA mediated CRISPR/Cas9 system. Besides, significantly reduced serum fucosyltransferase enzyme activity was also determined in those triple gene mutation rabbits. Thus, we provide the first evidence that multiple homologous genes knockout (KO) could be achieved efficiently by a sgRNA mediated CRISPR/Cas9 system in mammals, which could facilitate the genotype to phenotype studies of homologous genes in future. Copyright © 2018 Elsevier B.V. All rights reserved.
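The idea of one sgRNA covering several homologous genes can be illustrated with a minimal sketch. It is not the authors' design procedure: the function names, the toy sequences, the exact-match requirement, and the restriction to the forward strand with an NGG PAM are all assumptions made for illustration, and real guide design would also weigh mismatches and off-target sites.

```python
def protospacers(seq, length=20):
    """Return all `length`-nt sites that are directly followed by an NGG PAM
    on the forward strand of `seq` (reverse strand ignored for brevity)."""
    seq = seq.upper()
    sites = set()
    for i in range(len(seq) - length - 2):
        # PAM occupies positions i+length (N), i+length+1 (G), i+length+2 (G)
        if seq[i + length + 1:i + length + 3] == "GG":
            sites.add(seq[i:i + length])
    return sites


def shared_protospacers(seqs, length=20):
    """Return the sites present, with a PAM, in every sequence of `seqs`."""
    common = None
    for s in seqs:
        sites = protospacers(s, length)
        common = sites if common is None else common & sites
    return common or set()


# Toy sequences only; these are NOT the real FUT1, FUT2 or SEC1 sequences.
conserved = "ACGTACGTTGCATGCAACGT"          # hypothetical shared 20-nt site
toy_genes = [
    "TTTAAA" + conserved + "TGG" + "CATCAT",
    "CCATAT" + conserved + "AGG" + "TTTAAT",
    "ATATAT" + conserved + "CGG" + "ATTATA",
]
shared = shared_protospacers(toy_genes)
print(shared)
```

Intersecting per-gene site sets keeps the sketch short; a site survives only if every gene carries it next to a PAM.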
Next Generation Models for Storage and Representation of Microbial Biological Annotation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Quest, Daniel J; Land, Miriam L; Brettin, Thomas S

2010-01-01

Background Traditional genome annotation systems were developed in a very different computing era, one where the World Wide Web was just emerging. Consequently, these systems are built as centralized black boxes focused on generating high quality annotation submissions to GenBank/EMBL supported by expert manual curation. The exponential growth of sequence data drives a growing need for increasingly higher quality and automatically generated annotation. Typical annotation pipelines utilize traditional database technologies, clustered computing resources, Perl, C, and UNIX file systems to process raw sequence data, identify genes, and predict and categorize gene function. These technologies tightly couple the annotation software system to hardware and third party software (e.g. relational database systems and schemas). This makes annotation systems hard to reproduce, inflexible to modification over time, difficult to assess, difficult to partition across multiple geographic sites, and difficult to understand for those who are not domain experts. These systems are not readily open to scrutiny and therefore not scientifically tractable. The advent of Semantic Web standards such as Resource Description Framework (RDF) and OWL Web Ontology Language (OWL) enables us to construct systems that address these challenges in a new comprehensive way. Results Here, we develop a framework for linking traditional data to OWL-based ontologies in genome annotation. We show how data standards can decouple hardware and third party software tools from annotation pipelines, thereby making annotation pipelines easier to reproduce and assess. An illustrative example shows how TURTLE (Terse RDF Triple Language) can be used as a human readable, but also semantically-aware, equivalent to GenBank/EMBL files.
Conclusions The power of this approach lies in its ability to assemble annotation data from multiple databases across multiple locations into a representation that is understandable to researchers. In this way, all researchers, experimental and computational, will more easily understand the informatics processes constructing genome annotation and ultimately be able to help improve the systems that produce them.
"Mini-array" transcriptional analysis of the Listeria monocytogenes lecithinase operon as a class project: A student investigative molecular biology laboratory experience*.

PubMed

Christensen, Douglas; Jovic, Marko

2006-05-01

This report describes a molecular biotechnology-based laboratory curriculum developed to accompany an undergraduate genetics course. During the course of a semester, students researched the pathogen, developed a research question, designed experiments, and performed transcriptional analysis of a set of genes that confer virulence to the food-borne pathogen, Listeria monocytogenes. Gene fragments were amplified via PCR and utilized in "mini-arrays," a dot-blot-based format suitable for the simultaneous transcriptional analysis of multiple genes. The project provides exposure to a wide range of molecular techniques and can be easily modified for variations in class size. Data are generated at various steps of the process, allowing for student interpretation, troubleshooting, and assessment opportunities. Copyright © 2006 International Union of Biochemistry and Molecular Biology, Inc.
Genome-wide mapping in a house mouse hybrid zone reveals hybrid sterility loci and Dobzhansky-Muller interactions.

PubMed

Turner, Leslie M; Harr, Bettina

2014-12-09

Mapping hybrid defects in contact zones between incipient species can identify genomic regions contributing to reproductive isolation and reveal genetic mechanisms of speciation. The house mouse features a rare combination of sophisticated genetic tools and natural hybrid zones between subspecies. Male hybrids often show reduced fertility, a common reproductive barrier between incipient species. Laboratory crosses have identified sterility loci, but each encompasses hundreds of genes. We map genetic determinants of testis weight and testis gene expression using offspring of mice captured in a hybrid zone between M. musculus musculus and M. m. domesticus. Many generations of admixture enable high-resolution mapping of loci contributing to these sterility-related phenotypes. We identify complex interactions among sterility loci, suggesting multiple, non-independent genetic incompatibilities contribute to barriers to gene flow in the hybrid zone.
The potential application and challenge of powerful CRISPR/Cas9 system in cardiovascular research.

PubMed

Li, Yangxin; Song, Yao-Hua; Liu, Bin; Yu, Xi-Yong

2017-01-15

CRISPR/Cas9 is a precision-guided munition found in bacteria to fight against invading viruses. This technology has enormous potential applications, including altering genes in both somatic and germ cells, as well as generating knockout animals. Compared to other gene editing techniques such as zinc finger nucleases and TALENS, CRISPR/Cas9 is much easier to use and highly efficient. Importantly, the multiplex capacity of this technology allows multiple genes to be edited simultaneously. CRISPR/Cas9 also has the potential to prevent and cure human diseases. In this review, we wish to highlight some key points regarding the future prospect of using CRISPR/Cas9 as a powerful tool for cardiovascular research, and as a novel therapeutic strategy to treat cardiovascular diseases. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Finding novel relationships with integrated gene-gene association network analysis of Synechocystis sp. PCC 6803 using species-independent text-mining

PubMed Central

Kreula, Sanna M.; Kaewphan, Suwisa; Ginter, Filip

2018-01-01

The increasing move towards open access full-text scientific literature enhances our ability to utilize advanced text-mining methods to construct information-rich networks that no human will be able to grasp simply from ‘reading the literature’. The utility of text-mining for well-studied species is obvious though the utility for less studied species, or those with no prior track-record at all, is not clear. Here we present a concept for how advanced text-mining can be used to create information-rich networks even for less well studied species and apply it to generate an open-access gene-gene association network resource for Synechocystis sp. PCC 6803, a representative model organism for cyanobacteria and first case-study for the methodology. By merging the text-mining network with networks generated from species-specific experimental data, network integration was used to enhance the accuracy of predicting novel interactions that are biologically relevant. A rule-based algorithm (filter) was constructed in order to automate the search for novel candidate genes with a high degree of likely association to known target genes by (1) ignoring established relationships from the existing literature, as they are already ‘known’, and (2) demanding multiple independent evidences for every novel and potentially relevant relationship. Using selected case studies, we demonstrate the utility of the network resource and filter to (i) discover novel candidate associations between different genes or proteins in the network, and (ii) rapidly evaluate the potential role of any one particular gene or protein. The full network is provided as an open-source resource. PMID:29844966
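The two rules of the filter described in this abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name, the edge format `(gene_a, gene_b, source)`, the threshold `min_sources=2`, and the gene and source labels are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def filter_novel_candidates(edges, known_pairs, min_sources=2):
    """Return gene pairs that are not already known and that are
    supported by at least `min_sources` distinct evidence sources.

    edges       -- iterable of (gene_a, gene_b, source) tuples (assumed format)
    known_pairs -- set of frozensets, each holding two gene names
    """
    sources = defaultdict(set)
    for gene_a, gene_b, source in edges:
        pair = frozenset((gene_a, gene_b))  # undirected: A-B equals B-A
        # Rule 1: skip self-links and relationships already in the literature.
        if gene_a == gene_b or pair in known_pairs:
            continue
        sources[pair].add(source)
    # Rule 2: keep only pairs backed by several independent sources.
    return {pair: found for pair, found in sources.items()
            if len(found) >= min_sources}


# Hypothetical example data.
edges = [
    ("geneA", "geneB", "textmining"),
    ("geneB", "geneA", "coexpression"),
    ("geneA", "geneC", "textmining"),
    ("geneC", "geneD", "textmining"),
    ("geneC", "geneD", "coexpression"),
]
known = {frozenset(("geneC", "geneD"))}
candidates = filter_novel_candidates(edges, known)
```

Here only the geneA/geneB pair survives: geneA/geneC has a single source, and geneC/geneD is excluded as already known.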
Adolescent mouse takes on an active transcriptomic expression during postnatal cerebral development.

PubMed

Xu, Wei; Xin, Chengqi; Lin, Qiang; Ding, Feng; Gong, Wei; Zhou, Yuanyuan; Yu, Jun; Cui, Peng; Hu, Songnian

2014-06-01

Postnatal cerebral development is a complicated biological process precisely controlled by multiple genes. To understand the molecular mechanism of cerebral development, we compared the dynamics of the mouse cerebrum transcriptome through three developmental stages using high-throughput RNA-seq. Three libraries were generated from the mouse cerebrum at infancy, adolescence and adulthood, respectively. Consequently, 44,557,729 (infancy), 59,257,530 (adolescence) and 72,729,636 (adulthood) reads were produced, which were assembled into 15,344, 16,048 and 15,775 genes, respectively. We found that the overall gene expression level increased from infancy to adolescence and decreased later on upon reaching adulthood. The adolescent cerebrum has the most active gene expression, with expression of a large number of regulatory genes up-regulated and some crucial pathways activated. Transcription factor (TF) analysis suggested dynamics similar to those of the expression profiling, especially for TFs functioning in neurogenesis differentiation, oligodendrocyte lineage determination and circadian rhythm regulation. Moreover, our data revealed a drastic increase in myelin basic protein (MBP)-coding gene expression in adolescence and adulthood, suggesting that brain myelin may be generated from mouse adolescence onward. In addition, differential gene expression analysis indicated the activation of the rhythmic pathway, suggesting the function of rhythmic movement since adolescence. Furthermore, during infancy and adolescence, gene expression related to axon repulsion and attraction showed opposite trends, indicating that axon repulsion was activated after birth, while axon attraction might be activated at the embryonic stage and declined during postnatal development. Our results may shed light on the molecular mechanism underlying the postnatal development of the mammalian cerebrum. Copyright © 2014. Production and hosting by Elsevier Ltd.

Genetic variation and gene expression across multiple tissues and developmental stages in a nonhuman primate.

PubMed

Jasinska, Anna J; Zelaya, Ivette; Service, Susan K; Peterson, Christine B; Cantor, Rita M; Choi, Oi-Wa; DeYoung, Joseph; Eskin, Eleazar; Fairbanks, Lynn A; Fears, Scott; Furterer, Allison E; Huang, Yu S; Ramensky, Vasily; Schmitt, Christopher A; Svardal, Hannes; Jorgensen, Matthew J; Kaplan, Jay R; Villar, Diego; Aken, Bronwen L; Flicek, Paul; Nag, Rishi; Wong, Emily S; Blangero, John; Dyer, Thomas D; Bogomolov, Marina; Benjamini, Yoav; Weinstock, George M; Dewar, Ken; Sabatti, Chiara; Wilson, Richard K; Jentsch, J David; Warren, Wesley; Coppola, Giovanni; Woods, Roger P; Freimer, Nelson B

2017-12-01

By analyzing multitissue gene expression and genome-wide genetic variation data in samples from a vervet monkey pedigree, we generated a transcriptome resource and produced the first catalog of expression quantitative trait loci (eQTLs) in a nonhuman primate model. This catalog contains more genome-wide significant eQTLs per sample than comparable human resources and identifies sex- and age-related expression patterns. Findings include a master regulatory locus that likely has a role in immune function and a locus regulating hippocampal long noncoding RNAs (lncRNAs), whose expression correlates with hippocampal volume. This resource will facilitate genetic investigation of quantitative traits, including brain and behavioral phenotypes relevant to neuropsychiatric disorders.
SNPitty: An Intuitive Web Application for Interactive B-Allele Frequency and Copy Number Visualization of Next-Generation Sequencing Data.

PubMed

van Riet, Job; Krol, Niels M G; Atmodimedjo, Peggy N; Brosens, Erwin; van IJcken, Wilfred F J; Jansen, Maurice P H M; Martens, John W M; Looijenga, Leendert H; Jenster, Guido; Dubbink, Hendrikus J; Dinjens, Winand N M; van de Werken, Harmen J G

2018-03-01

Exploration and visualization of next-generation sequencing data are crucial for clinical diagnostics. Software allowing simultaneous visualization of multiple regions of interest coupled with dynamic heuristic filtering of genetic aberrations is, however, lacking. Therefore, the authors developed the web application SNPitty, which allows interactive visualization and interrogation of variant call format files by using B-allele frequencies of single-nucleotide polymorphisms and single-nucleotide variants, coverage metrics, and copy number analysis results. SNPitty displays variant alleles and allelic imbalances with a focus on loss of heterozygosity and copy number variation using genome-wide heterozygous markers and somatic mutations. In addition, SNPitty is capable of generating predefined reports that summarize and highlight disease-specific targets of interest. SNPitty was validated for diagnostic interpretation of somatic events by showcasing a serial dilution series of glioma tissue. Additionally, SNPitty is demonstrated in four cancer-related scenarios encountered in daily clinical practice and on whole-exome sequencing data of peripheral blood from a Down syndrome patient. SNPitty allows detection of loss of heterozygosity, chromosomal and gene amplifications, homozygous or heterozygous deletions, somatic mutations, or any combination thereof in regions or genes of interest. Furthermore, SNPitty can be used to distinguish molecular relationships between multiple tumors from a single patient. On the basis of these data, the authors demonstrate that SNPitty is robust and user friendly in a wide range of diagnostic scenarios. Copyright © 2018 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
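For readers unfamiliar with the B-allele frequency (BAF) used above, a minimal Python sketch of the usual definition follows: the fraction of reads at a site that carry the alternate (B) allele. This is not SNPitty code; the function names and the `tolerance` of 0.1 around the expected heterozygous value of 0.5 are assumptions made purely for illustration.

```python
def b_allele_frequency(ref_depth, alt_depth):
    """BAF = alt reads / (ref reads + alt reads); None if the site is uncovered."""
    total = ref_depth + alt_depth
    if total == 0:
        return None
    return alt_depth / total


def flag_allelic_imbalance(baf, tolerance=0.1):
    """Flag a germline-heterozygous site whose BAF departs from 0.5.

    The tolerance is a hypothetical cut-off, not a validated diagnostic value.
    """
    return abs(baf - 0.5) > tolerance


balanced = b_allele_frequency(50, 50)   # 0.5, as expected for a heterozygous site
skewed = b_allele_frequency(90, 10)     # 0.1, consistent with loss of one allele
```

A run of heterozygous markers whose BAF shifts away from 0.5 in the same region is the kind of pattern that suggests loss of heterozygosity or a copy number change; a single skewed site is not sufficient evidence on its own.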
Viral MicroRNAs Repress the Cholesterol Pathway, and 25-Hydroxycholesterol Inhibits Infection.

PubMed

Serquiña, Anna K P; Kambach, Diane M; Sarker, Ontara; Ziegelbauer, Joseph M

2017-07-11

From various screens, we found that Kaposi's sarcoma-associated herpesvirus (KSHV) viral microRNAs (miRNAs) target several enzymes in the mevalonate/cholesterol pathway. 3-Hydroxy-3-methylglutaryl-coenzyme A (CoA) synthase 1 (HMGCS1), 3-hydroxy-3-methylglutaryl-CoA reductase (HMGCR [a rate-limiting step in the mevalonate pathway]), and farnesyl-diphosphate farnesyltransferase 1 (FDFT1 [a committed step in the cholesterol branch]) are repressed by multiple KSHV miRNAs. Transfection of viral miRNA mimics in primary endothelial cells (human umbilical vein endothelial cells [HUVECs]) is sufficient to reduce intracellular cholesterol levels; however, small interfering RNAs (siRNAs) targeting only HMGCS1 did not reduce cholesterol levels. This suggests that multiple targets are needed to perturb this tightly regulated pathway. We also report here that cholesterol levels were decreased in de novo-infected HUVECs after 7 days. This reduction is at least partially due to viral miRNAs, since the mutant form of KSHV lacking 10 of the 12 miRNA genes had increased cholesterol compared to wild-type infections. We hypothesized that KSHV is downregulating cholesterol to suppress the antiviral response by a modified form of cholesterol, 25-hydroxycholesterol (25HC). We found that the cholesterol 25-hydroxylase (CH25H) gene, which is responsible for generating 25HC, had increased expression in de novo-infected HUVECs but was strongly suppressed in long-term latently infected cell lines. We found that 25HC inhibits KSHV infection when added exogenously prior to de novo infection. In conclusion, we found that multiple KSHV viral miRNAs target enzymes in the mevalonate pathway to modulate cholesterol in infected cells during latency. This repression of cholesterol levels could potentially be beneficial to viral infection by decreasing the levels of 25HC.
IMPORTANCE A subset of viruses express unique microRNAs (miRNAs), which act like cellular miRNAs to generally repress host gene expression. A cancer virus, Kaposi's sarcoma-associated herpesvirus (KSHV, or human herpesvirus 8 [HHV-8]), encodes multiple miRNAs that repress gene expression of multiple enzymes that are important for cholesterol synthesis. In cells with these viral miRNAs or with natural infection, cholesterol levels are reduced, indicating these viral miRNAs decrease cholesterol levels. A modified form of cholesterol, 25-hydroxycholesterol, is generated directly from cholesterol. Addition of 25-hydroxycholesterol to primary cells inhibited KSHV infection of cells, suggesting that viral miRNAs may decrease cholesterol levels to decrease the concentration of 25-hydroxycholesterol and to promote infection. These results suggest a new virus-host relationship and indicate a previously unidentified viral strategy to lower cholesterol levels. Copyright © 2017 Serquiña et al.
Circadian abnormalities in mouse models of Smith-Magenis syndrome: evidence for involvement of RAI1.

PubMed

Lacaria, Melanie; Gu, Wenli; Lupski, James R

2013-07-01

Smith-Magenis syndrome (SMS; OMIM 182290) is a genomic disorder characterized by multiple congenital anomalies, intellectual disability, behavioral abnormalities, and disordered sleep resulting from an ~3.7 Mb deletion copy number variant (CNV) on chromosome 17p11.2 or from point mutations in the gene RAI1. The reciprocal duplication of this region results in another genomic disorder, Potocki-Lupski syndrome (PTLS; OMIM 610883), characterized by autism, intellectual disability, and congenital anomalies. We previously used chromosome-engineering and gene targeting to generate mouse models for PTLS (Dp(11)17/+), and SMS due to either deletion CNV or gene knock-out (Df(11)17-2/+ and Rai1(+/-), respectively), and we observed phenotypes in these mouse models consistent with their associated human syndromes. To investigate the contribution of individual genes to the circadian phenotypes observed in SMS, we now report the analysis of free-running period lengths in Rai1(+/-) and Df(11)17-2/+ mice, as well as in mice deficient for another known circadian gene mapping within the commonly deleted/duplicated region, Dexras1, and we compare these results to those previously observed in Dp(11)17/+ mice. Reduced free-running period lengths were seen in Df(11)17-2/+, Rai1(+/-), and Dexras1(-/-), but not Dexras1(+/-) mice, suggesting that Rai1 may be the primary gene underlying the circadian defects in SMS. However, we cannot rule out the possibility that cis effects between multiple haploinsufficient genes in the SMS critical interval (e.g., RAI1 and DEXRAS1) either exacerbate the circadian phenotypes observed in SMS patients with deletions or increase their penetrance in certain environments. This study also confirms a previous report of abnormal circadian function in Dexras1(-/-) mice. Copyright © 2013 Wiley Periodicals, Inc.
Differential DNA methylation profile of key genes in malignant prostate epithelial cells transformed by inorganic arsenic or cadmium.

PubMed

Pelch, Katherine E; Tokar, Erik J; Merrick, B Alex; Waalkes, Michael P

2015-08-01

Previous work shows altered methylation patterns in inorganic arsenic (iAs)- or cadmium (Cd)-transformed epithelial cells. Here, the methylation status near the transcriptional start site was assessed in the normal human prostate epithelial cell line (RWPE-1) that was malignantly transformed by 10 μM Cd for 11 weeks (CTPE) or 5 μM iAs for 29 weeks (CAsE-PE), at which time cells showed multiple markers of acquired cancer phenotype. Next generation sequencing of the transcriptome of CAsE-PE cells identified multiple dysregulated genes. Of the most highly dysregulated genes, five genes that can be relevant to the carcinogenic process (S100P, HYAL1, NTM, NES, ALDH1A1) were chosen for an in-depth analysis of the DNA methylation profile. DNA was isolated and bisulfite converted, and combined bisulfite restriction analysis was used to identify differentially methylated CpG sites, which were confirmed with bisulfite sequencing. Four of the five genes showed differential methylation in transformants relative to control cells that was inversely related to altered gene expression. Increased expression of HYAL1 (>25-fold) and S100P (>40-fold) in transformants was correlated with hypomethylation near the transcriptional start site. Decreased expression of NES (>15-fold) and NTM (>1000-fold) in transformants was correlated with hypermethylation near the transcriptional start site. ALDH1A1 was differentially expressed in transformed cells but was not differentially methylated relative to control. In conclusion, altered gene expression observed in Cd and iAs transformed cells may result from altered DNA methylation status. Published by Elsevier Inc.
Sex genes for genomic analysis in human brain: internal controls for comparison of probe level data extraction.

PubMed Central

Galfalvy, Hanga C; Erraji-Benchekroun, Loubna; Smyrniotopoulos, Peggy; Pavlidis, Paul; Ellis, Steven P; Mann, J John; Sibille, Etienne; Arango, Victoria

2003-01-01

Background: Genomic studies of complex tissues pose unique analytical challenges for assessment of data quality, performance of statistical methods used for data extraction, and detection of differentially expressed genes. Ideally, to assess the accuracy of gene expression analysis methods, one needs a set of genes which are known to be differentially expressed in the samples and which can be used as a "gold standard". We introduce the idea of using sex-chromosome genes as an alternative to spiked-in control genes or simulations for assessment of microarray data and analysis methods. Results: Expression of sex-chromosome genes was used as a true internal biological control to compare alternate probe-level data extraction algorithms (Microarray Suite 5.0 [MAS5.0], Model Based Expression Index [MBEI] and Robust Multi-array Average [RMA]), to assess microarray data quality and to establish some statistical guidelines for analyzing large-scale gene expression. These approaches were implemented on a large new dataset of human brain samples. RMA-generated gene expression values were markedly less variable and more reliable than MAS5.0 and MBEI-derived values. A statistical technique controlling the false discovery rate was applied to adjust for multiple testing, as an alternative to the Bonferroni method, and showed no evidence of false negative results. Fourteen probesets, representing nine Y- and two X-chromosome linked genes, displayed significant sex differences in brain prefrontal cortex gene expression. Conclusion: In this study, we have demonstrated the use of sex genes as true biological internal controls for genomic analysis of complex tissues, and suggested analytical guidelines for testing alternate oligonucleotide microarray data extraction protocols and for adjusting multiple statistical analysis of differentially expressed genes.
Our results also provided evidence for sex differences in gene expression in the brain prefrontal cortex, supporting the notion of a putative direct role of sex-chromosome genes in differentiation and maintenance of sexual dimorphism of the central nervous system. Importantly, these analytical approaches are applicable to all microarray studies that include male and female human or animal subjects. PMID:12962547
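The contrast between Bonferroni correction and false discovery rate (FDR) control mentioned in this abstract can be sketched in Python. The abstract does not name the FDR procedure used, so the Benjamini-Hochberg step-up procedure below is an assumption, chosen because it is the most widely used FDR method; the function names and the example p-values are hypothetical.

```python
def bonferroni(pvalues, alpha=0.05):
    """Reject H0 for each test whose p-value is at most alpha / m."""
    m = len(pvalues)
    return [p <= alpha / m for p in pvalues]


def benjamini_hochberg(pvalues, alpha=0.05):
    """Benjamini-Hochberg step-up procedure.

    Sort the p-values, find the largest rank k with p_(k) <= (k / m) * alpha,
    and reject the k smallest p-values. Returns booleans in input order.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    max_rank = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= rank / m * alpha:
            max_rank = rank
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= max_rank:
            rejected[idx] = True
    return rejected


# Hypothetical p-values for five probesets.
pvals = [0.5, 0.004, 0.035, 0.015, 0.025]
strict = bonferroni(pvals)            # only the smallest p-value passes
relaxed = benjamini_hochberg(pvals)   # four of the five pass
```

On this toy input Bonferroni keeps one probeset while Benjamini-Hochberg keeps four, which illustrates why an FDR-controlling method tends to produce fewer false negatives when many genes are tested at once.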
Sex genes for genomic analysis in human brain: internal controls for comparison of probe level data extraction.

PubMed

Galfalvy, Hanga C; Erraji-Benchekroun, Loubna; Smyrniotopoulos, Peggy; Pavlidis, Paul; Ellis, Steven P; Mann, J John; Sibille, Etienne; Arango, Victoria

2003-09-08

Genomic studies of complex tissues pose unique analytical challenges for assessment of data quality, performance of statistical methods used for data extraction, and detection of differentially expressed genes. Ideally, to assess the accuracy of gene expression analysis methods, one needs a set of genes which are known to be differentially expressed in the samples and which can be used as a "gold standard". We introduce the idea of using sex-chromosome genes as an alternative to spiked-in control genes or simulations for assessment of microarray data and analysis methods. Expression of sex-chromosome genes was used as a true internal biological control to compare alternate probe-level data extraction algorithms (Microarray Suite 5.0 [MAS5.0], Model Based Expression Index [MBEI] and Robust Multi-array Average [RMA]), to assess microarray data quality and to establish some statistical guidelines for analyzing large-scale gene expression. These approaches were implemented on a large new dataset of human brain samples. RMA-generated gene expression values were markedly less variable and more reliable than MAS5.0 and MBEI-derived values. A statistical technique controlling the false discovery rate was applied to adjust for multiple testing, as an alternative to the Bonferroni method, and showed no evidence of false negative results. Fourteen probesets, representing nine Y- and two X-chromosome linked genes, displayed significant sex differences in brain prefrontal cortex gene expression. In this study, we have demonstrated the use of sex genes as true biological internal controls for genomic analysis of complex tissues, and suggested analytical guidelines for testing alternate oligonucleotide microarray data extraction protocols and for adjusting multiple statistical analysis of differentially expressed genes.
Our results also provided evidence for sex differences in gene expression in the brain prefrontal cortex, supporting the notion of a putative direct role of sex-chromosome genes in differentiation and maintenance of sexual dimorphism of the central nervous system. Importantly, these analytical approaches are applicable to all microarray studies that include male and female human or animal subjects.
Systematic gene tagging using CRISPR/Cas9 in human stem cells to illuminate cell organization

PubMed Central

Roberts, Brock; Haupt, Amanda; Tucker, Andrew; Grancharova, Tanya; Arakaki, Joy; Fuqua, Margaret A.; Nelson, Angelique; Hookway, Caroline; Ludmann, Susan A.; Mueller, Irina A.; Yang, Ruian; Horwitz, Rick; Rafelski, Susanne M.; Gunawardane, Ruwanthi N.

2017-01-01

We present a CRISPR/Cas9 genome-editing strategy to systematically tag endogenous proteins with fluorescent tags in human induced pluripotent stem cells (hiPSC). To date, we have generated multiple hiPSC lines with monoallelic green fluorescent protein tags labeling 10 proteins representing major cellular structures. The tagged proteins include alpha tubulin, beta actin, desmoplakin, fibrillarin, nuclear lamin B1, nonmuscle myosin heavy chain IIB, paxillin, Sec61 beta, tight junction protein ZO1, and Tom20. Our genome-editing methodology using Cas9/crRNA ribonuclear protein and donor plasmid coelectroporation, followed by fluorescence-based enrichment of edited cells, typically resulted in <0.1–4% homology-directed repair (HDR). Twenty-five percent of clones generated from each edited population were precisely edited. Furthermore, 92% (36/39) of expanded clonal lines displayed robust morphology, genomic stability, expression and localization of the tagged protein to the appropriate subcellular structure, pluripotency-marker expression, and multilineage differentiation. It is our conclusion that, if cell lines are confirmed to harbor an appropriate gene edit, pluripotency, differentiation potential, and genomic stability are typically maintained during the clonal line–generation process. The data described here reveal general trends that emerged from this systematic gene-tagging approach. Final clonal lines corresponding to each of the 10 cellular structures are now available to the research community. PMID:28814507
Maize transformation technology development for commercial event generation.

PubMed

Que, Qiudeng; Elumalai, Sivamani; Li, Xianggan; Zhong, Heng; Nalapalli, Samson; Schweiner, Michael; Fei, Xiaoyin; Nuccio, Michael; Kelliher, Timothy; Gu, Weining; Chen, Zhongying; Chilton, Mary-Dell M

2014-01-01

Maize is an important food and feed crop in many countries. It is also one of the most important target crops for the application of biotechnology. Currently, there are more biotech traits available on the market in maize than in any other crop. Generation of transgenic events is a crucial step in the development of biotech traits. For commercial applications, a high throughput transformation system producing a large number of high quality events in an elite genetic background is highly desirable. There has been tremendous progress in Agrobacterium-mediated maize transformation since the publication of the Ishida et al. (1996) paper and the technology has been widely adopted for transgenic event production by many labs around the world. We will review general efforts in establishing efficient maize transformation technologies useful for transgenic event production in trait research and development. The review will also discuss transformation systems used for generating commercial maize trait events currently on the market. As the number of traits is increasing steadily and two or more modes of action are used to control key pests, new tools are needed to efficiently transform vectors containing multiple trait genes. We will review general guidelines for assembling binary vectors for commercial transformation. Approaches to increase transformation efficiency and gene expression of large gene stack vectors will be discussed. Finally, recent studies of targeted genome modification and transgene insertion using different site-directed nuclease technologies will be reviewed.
DNA polymerase θ contributes to the generation of C/G mutations during somatic hypermutation of Ig genes

PubMed Central

Masuda, Keiji; Ouchida, Rika; Takeuchi, Arata; Saito, Takashi; Koseki, Haruhiko; Kawamura, Kiyoko; Tagawa, Masatoshi; Tokuhisa, Takeshi; Azuma, Takachika; O-Wang, Jiyang

2005-01-01

Somatic hypermutation of Ig variable region genes is initiated by activation-induced cytidine deaminase; however, the activity of multiple DNA polymerases is required to ultimately introduce mutations. DNA polymerase η (Polη) has been implicated in mutations at A/T, but polymerases involved in C/G mutations have not been identified. We have generated mutant mice expressing DNA polymerase θ (Polθ) specifically devoid of polymerase activity. Compared with WT mice, Polq-inactive (Polq, the gene encoding Polθ) mice exhibited a reduced level of serum IgM and IgG1. The mutant mice mounted relatively normal primary and secondary immune responses to a T-dependent antigen, but the production of high-affinity specific antibodies was partially impaired. Analysis of the JH4 intronic sequences revealed a slight reduction in the overall mutation frequency in Polq-inactive mice. Remarkably, although mutations at A/T were unaffected, mutations at C/G were significantly decreased, indicating an important, albeit not exclusive, role for Polθ activity. The reduction of C/G mutations was particularly focused on the intrinsic somatic hypermutation hotspots and both transitions and transversions were similarly reduced. These findings, together with the recent observation that Polθ efficiently catalyzes the bypass of abasic sites, lead us to propose that Polθ introduces mutations at C/G by replicating over abasic sites generated via uracil-DNA glycosylase. PMID:16172387
Maize transformation technology development for commercial event generation

PubMed Central

Que, Qiudeng; Elumalai, Sivamani; Li, Xianggan; Zhong, Heng; Nalapalli, Samson; Schweiner, Michael; Fei, Xiaoyin; Nuccio, Michael; Kelliher, Timothy; Gu, Weining; Chen, Zhongying; Chilton, Mary-Dell M.

2014-01-01

Maize is an important food and feed crop in many countries. It is also one of the most important target crops for the application of biotechnology. Currently, there are more biotech traits available on the market in maize than in any other crop. Generation of transgenic events is a crucial step in the development of biotech traits. For commercial applications, a high throughput transformation system producing a large number of high quality events in an elite genetic background is highly desirable. There has been tremendous progress in Agrobacterium-mediated maize transformation since the publication of the Ishida et al. (1996) paper and the technology has been widely adopted for transgenic event production by many labs around the world. We will review general efforts in establishing efficient maize transformation technologies useful for transgenic event production in trait research and development. The review will also discuss transformation systems used for generating commercial maize trait events currently on the market. As the number of traits is increasing steadily and two or more modes of action are used to control key pests, new tools are needed to efficiently transform vectors containing multiple trait genes. We will review general guidelines for assembling binary vectors for commercial transformation. Approaches to increase transformation efficiency and gene expression of large gene stack vectors will be discussed. Finally, recent studies of targeted genome modification and transgene insertion using different site-directed nuclease technologies will be reviewed. PMID:25140170
A curated catalog of canine and equine keratin genes

PubMed Central

Pujar, Shashikant; McGarvey, Kelly M.; Welle, Monika; Galichet, Arnaud; Müller, Eliane J.; Pruitt, Kim D.; Leeb, Tosso

2017-01-01

Keratins represent a large protein family with essential structural and functional roles in epithelial cells of skin, hair follicles, and other organs. During evolution the genes encoding keratins have undergone multiple rounds of duplication and humans have two clusters with a total of 55 functional keratin genes in their genomes. Due to the high similarity between different keratin paralogs and species-specific differences in gene content, the currently available keratin gene annotation in species with draft genome assemblies such as dog and horse is still imperfect. We compared the National Center for Biotechnology Information (NCBI) (dog annotation release 103, horse annotation release 101) and Ensembl (release 87) gene predictions for the canine and equine keratin gene clusters to RNA-seq data that were generated from adult skin of five dogs and two horses and from adult hair follicle tissue of one dog. Taking into consideration the knowledge on the conserved exon/intron structure of keratin genes, we annotated 61 putatively functional keratin genes in both the dog and horse, respectively. Subsequently, curators in the RefSeq group at NCBI reviewed their annotation of keratin genes in the dog and horse genomes (Annotation Release 104 and Annotation Release 102, respectively) and updated annotation and gene nomenclature of several keratin genes. The updates are now available in the NCBI Gene database (https://www.ncbi.nlm.nih.gov/gene). PMID:28846680
mCAL: A New Approach for Versatile Multiplex Action of Cas9 Using One sgRNA and Loci Flanked by a Programmed Target Sequence.

PubMed

Finnigan, Gregory C; Thorner, Jeremy

2016-07-07

Genome editing exploiting CRISPR/Cas9 has been adopted widely in academia and in the biotechnology industry to manipulate DNA sequences in diverse organisms. Molecular engineering of Cas9 itself and its guide RNA, and the strategies for using them, have increased efficiency, optimized specificity, reduced inappropriate off-target effects, and introduced modifications for performing other functions (transcriptional regulation, high-resolution imaging, protein recruitment, and high-throughput screening). Moreover, Cas9 has the ability to multiplex, i.e., to act at different genomic targets within the same nucleus. Currently, however, introducing concurrent changes at multiple loci involves: (i) identification of appropriate genomic sites, especially the availability of suitable PAM sequences; (ii) the design, construction, and expression of multiple sgRNAs directed against those sites; (iii) potential difficulties in altering essential genes; and (iv) lingering concerns about "off-target" effects. We have devised a new approach that circumvents these drawbacks, as we demonstrate here using the yeast Saccharomyces cerevisiae. First, any gene(s) of interest are flanked upstream and downstream with a single unique target sequence that does not normally exist in the genome. Thereafter, expression of one sgRNA and cotransformation with appropriate PCR fragments permits concomitant Cas9-mediated alteration of multiple genes (both essential and nonessential). The system we developed also allows for maintenance of the integrated, inducible Cas9-expression cassette or its simultaneous scarless excision. Our scheme, dubbed mCAL for "Multiplexing of Cas9 at Artificial Loci", can be applied to any organism in which the CRISPR/Cas9 methodology is currently being utilized.
In principle, it can be applied to install synthetic sequences into the genome, to generate genomic libraries, and to program strains or cell lines so that they can be conveniently (and repeatedly) manipulated at multiple loci with extremely high efficiency. Copyright © 2016 Finnigan and Thorner.
Integrated computational biology analysis to evaluate target genes for chronic myelogenous leukemia.

PubMed

Zheng, Yu; Wang, Yu-Ping; Cao, Hongbao; Chen, Qiusheng; Zhang, Xi

2018-06-05

Although hundreds of genes have been linked to chronic myelogenous leukemia (CML), many of the results lack reproducibility. In the present study, data across multiple modalities were integrated to evaluate 579 CML candidate genes, including literature-based CML-gene relation data, Gene Expression Omnibus RNA expression data and pathway-based gene-gene interaction data. The expression data included samples from 76 patients with CML and 73 healthy controls. For each target gene, four metrics were proposed and tested with case/control classification. The effectiveness of the four metrics presented was demonstrated by the high classification accuracy (94.63%; P<2×10⁻⁴). Cross metric analysis suggested nine top candidate genes for CML: Epidermal growth factor receptor, tumor protein p53, catenin β 1, janus kinase 2, tumor necrosis factor, abelson murine leukemia viral oncogene homolog 1, vascular endothelial growth factor A, B-cell lymphoma 2 and proto-oncogene tyrosine-protein kinase. In addition, 145 CML candidate pathways enriched with 485 out of 579 genes were identified (P<8.2×10⁻¹¹; q=0.005). In conclusion, weighted genetic networks generated using computational biology may be complementary to biological experiments for the evaluation of known or novel CML target genes.
A new link between stress response and nucleolar function during pollen development in Arabidopsis mediated by AtREN1 protein.

PubMed

Reňák, David; Gibalová, Antónia; Solcová, Katarzyna; Honys, David

2014-03-01

Heat shock transcription factors (Hsfs) are involved in multiple aspects of stress response and plant growth. However, their role during male gametophyte development is largely unknown, although the generative phase is the most sensitive and critical period in the plant life cycle. Based on a wide screen of T-DNA mutant lines, we identified the atren1 mutation (restricted to nucleolus1) in the early male gametophytic gene At1g77570, which has the closest homology to the HSFA5 gene, a member of the heat shock transcription factor (HSF) gene family. The mutation causes multiple defects in male gametophyte development in both structure and function. Because the mutation disrupts an early acting (AtREN1) gene, these pollen phenotype abnormalities appear from the bicellular pollen stage to pollen maturation. Moreover, the consequent progamic phase is compromised as well, as documented by pollen germination defects and limited transmission via the male gametophyte. In addition, atren1/- plants are defective in heat stress (HS) response and produce a notably higher proportion of aberrant pollen grains. The AtREN1 protein is targeted specifically to the nucleolus, which, together with the increased size of the nucleolus in atren1 pollen, suggests that it is likely to be involved in ribosomal RNA biogenesis or other nucleolar functions. © 2013 John Wiley & Sons Ltd.
Digital encoding of cellular mRNAs enabling precise and absolute gene expression measurement by single-molecule counting.

PubMed

Fu, Glenn K; Wilhelmy, Julie; Stern, David; Fan, H Christina; Fodor, Stephen P A

2014-03-18

We present a new approach for the sensitive detection and accurate quantitation of messenger ribonucleic acid (mRNA) gene transcripts in single cells. First, the entire population of mRNAs is encoded with molecular barcodes during reverse transcription. After amplification of the gene targets of interest, molecular barcodes are counted by sequencing or scored on a simple hybridization detector to reveal the number of molecules in the starting sample. Since absolute quantities are measured, calibration to standards is unnecessary, and many of the relative quantitation challenges such as polymerase chain reaction (PCR) bias are avoided. We apply the method to gene expression analysis of minute sample quantities and demonstrate precise measurements with sensitivity down to sub single-cell levels. The method is an easy, single-tube, end point assay utilizing standard thermal cyclers and PCR reagents. Accurate and precise measurements are obtained without any need for cycle-to-cycle intensity-based real-time monitoring or physical partitioning into multiple reactions (e.g., digital PCR). Further, since all mRNA molecules are encoded with molecular barcodes, amplification can be used to generate more material for multiple measurements and technical replicates can be carried out on limited samples. The method is particularly useful for small sample quantities, such as single-cell experiments. Digital encoding of cellular content preserves true abundance levels and overcomes distortions introduced by amplification.
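The counting step described in this abstract can be illustrated with a minimal Python sketch. It assumes each sequenced read has already been reduced to a (gene, barcode) pair; the function name, the input layout and the example reads are hypothetical, and the sketch ignores barcode sequencing errors and barcode collisions, which a real pipeline would need to handle.

```python
from collections import defaultdict

def count_molecules(reads):
    """Count distinct molecular barcodes observed per gene.

    `reads` is an iterable of (gene, barcode) pairs, one per sequenced read.
    Reads that share both gene and barcode are treated as amplification
    copies of a single starting molecule, so they are counted once.
    """
    barcodes_by_gene = defaultdict(set)
    for gene, barcode in reads:
        barcodes_by_gene[gene].add(barcode)
    return {gene: len(barcodes) for gene, barcodes in barcodes_by_gene.items()}

# Hypothetical reads: GAPDH has two distinct barcodes, ACTB has one barcode
# that was amplified three times.
reads = [
    ("GAPDH", "ACGT"), ("GAPDH", "ACGT"), ("GAPDH", "TTAG"),
    ("ACTB", "CCGA"), ("ACTB", "CCGA"), ("ACTB", "CCGA"),
]
molecule_counts = count_molecules(reads)
```

Because duplicates collapse into one set entry, the result depends on the number of starting molecules rather than on how strongly each was amplified, which is the property the abstract highlights.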
Multiple zebrafish atoh1 genes specify a diversity of neuronal types in the zebrafish cerebellum.

PubMed

Kidwell, Chelsea U; Su, Chen-Ying; Hibi, Masahiko; Moens, Cecilia B

2018-06-01

A single Atoh1 basic-helix-loop-helix transcription factor specifies multiple neuron types in the mammalian cerebellum and anterior hindbrain. The zebrafish genome encodes three paralogous atoh1 genes whose functions in cerebellum and anterior hindbrain development we explore here. With use of a transgenic reporter, we report that zebrafish atoh1c-expressing cells are organized in two distinct domains that are separated both by space and developmental time. An early isthmic expression domain gives rise to an extracerebellar population in rhombomere 1 and an upper rhombic lip domain gives rise to granule cell progenitors that migrate to populate all four granule cell territories of the fish cerebellum. Using genetic mutants we find that of the three zebrafish atoh1 paralogs, atoh1c and atoh1a are required for the full complement of granule neurons. Surprisingly, the two genes are expressed in non-overlapping granule cell progenitor populations, indicating that fish use duplicate atoh1 genes to generate granule cell diversity that is not detected in mammals. Finally, live imaging of granule cell migration in wildtype and atoh1c mutant embryos reveals that while atoh1c is not required for granule cell specification per se, it is required for granule cells to delaminate and migrate away from the rhombic lip. Copyright © 2018 Elsevier Inc. All rights reserved.
Stable expression of mtlD gene imparts multiple stress tolerance in finger millet.

PubMed

Hema, Ramanna; Vemanna, Ramu S; Sreeramulu, Shivakumar; Reddy, Chandrasekhara P; Senthil-Kumar, Muthappa; Udayakumar, Makarla

2014-01-01

Finger millet is susceptible to abiotic stresses, especially drought and salinity stress, in the field during seed germination and early stages of seedling development. Therefore developing stress tolerant finger millet plants combating drought, salinity and associated oxidative stress in these two growth stages is important. Cellular protection through osmotic adjustment and efficient free radical scavenging ability during abiotic stress are important components of stress tolerance mechanisms in plants. Mannitol, an osmolyte, is known to scavenge hydroxyl radicals generated during various abiotic stresses and thereby minimize stress damage in several plant species. In this study transgenic finger millet plants expressing the mannitol biosynthetic pathway gene from bacteria, mannitol-1-phosphate dehydrogenase (mtlD), were developed through Agrobacterium tumefaciens-mediated genetic transformation. mtlD gene integration in the putative transgenic plants was confirmed by Southern blot. Further, performance of transgenic finger millet under drought, salinity and oxidative stress was studied at plant level in T1 generation and in T1 and T2 generation seedlings. Results from these experiments showed that transgenic finger millet had better growth under drought and salinity stress compared to wild-type. At plant level, transgenic plants showed better osmotic adjustment and chlorophyll retention under drought stress compared to the wild-type. However, the overall increase in stress tolerance of transgenics for the three stresses, especially for oxidative stress, was only marginal compared to other mtlD gene expressing plant species reported in the literature. Moreover, the Agrobacterium-mediated genetic transformation protocol developed for finger millet in this study can be used to introduce diverse traits of agronomic importance in finger millet.
Molecular diagnosis of maturity-onset diabetes of the young (MODY) in Turkish children by using targeted next-generation sequencing.

PubMed

Anık, Ahmet; Çatlı, Gönül; Abacı, Ayhan; Sarı, Erkan; Yeşilkaya, Ediz; Korkmaz, Hüseyin Anıl; Demir, Korcan; Altıncık, Ayça; Tuhan, Hale Ünver; Kızıldağ, Sefa; Özkan, Behzat; Ceylaner, Serdar; Böber, Ece

2015-11-01

To perform molecular analysis of pediatric maturity onset diabetes of the young (MODY) patients by next-generation sequencing, which enables simultaneous analysis of multiple genes in a single test, to determine the genetic etiology of a group of Turkish children clinically diagnosed as MODY, and to assess genotype-phenotype relationship. Forty-two children diagnosed with MODY and their parents were enrolled in the study. Clinical and laboratory characteristics of the patients at the time of diagnosis were obtained from hospital records. Molecular analyses of GCK, HNF1A, HNF4A, HNF1B, PDX1, NEUROD1, KLF11, CEL, PAX4, INS, and BLK genes were performed on genomic DNA by using next-generation sequencing. Pathogenicity for novel mutations was assessed by bioinformatics prediction software programs and segregation analyses. A mutation in MODY genes was identified in 12 (29%) of the cases. GCK mutations were detected in eight cases, and HNF1B, HNF1A, PDX1, and BLK mutations in the others. We identified five novel missense mutations - three in GCK (p.Val338Met, p.Cys252Ser, and p.Val86Ala), one in HNF1A (p.Cys241Ter), and one in PDX1 (p.Gly55Asp), which we believe to be pathogenic. The results of this study showed that mutations in the GCK gene are the leading cause of MODY in our population. Moreover, genetic diagnosis could be made in 29% of Turkish patients, and five novel mutations were identified.
Identification of mutated driver pathways in cancer using a multi-objective optimization model.

PubMed

Zheng, Chun-Hou; Yang, Wu; Chong, Yan-Wen; Xia, Jun-Feng

2016-05-01

New-generation high-throughput technologies, including next-generation sequencing technology, have been extensively applied to solve biological problems. As a result, large cancer genomics projects such as the Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium are producing large amounts of rich and diverse data in multiple cancer types. The identification of mutated driver genes and driver pathways from these data is a significant challenge. Genome aberrations in cancer cells can be divided into two types: random 'passenger mutations' and functional 'driver mutations'. In this paper, we introduced a Multi-objective Optimization model based on a Genetic Algorithm (MOGA) to solve the maximum weight submatrix problem, which can be employed to identify driver genes and driver pathways promoting cancer proliferation. The maximum weight submatrix problem defined to find mutated driver pathways is based on two specific properties, i.e., high coverage and high exclusivity. The multi-objective optimization model can adjust the trade-off between high coverage and high exclusivity. We proposed an integrative model by combining gene expression data and mutation data to improve the performance of the MOGA algorithm in a biological context. Copyright © 2016 Elsevier Ltd. All rights reserved.
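The two properties named in this abstract, coverage and exclusivity, can be made concrete with a short Python sketch. The abstract does not give the scoring formula, so the weight used here (covered samples minus overlap) is an assumption: it is one commonly used form of the maximum weight submatrix objective, not necessarily the MOGA objective. The function names and the mutation data are hypothetical.

```python
def coverage_and_overlap(mutations, gene_set):
    """Return (coverage, overlap) for a candidate gene set.

    `mutations` maps gene -> set of sample ids carrying a mutation in it.
    Coverage is the number of samples mutated in at least one gene of the
    set; overlap counts every extra mutation a covered sample carries, so
    it is zero when the genes are perfectly mutually exclusive.
    """
    covered = set()
    total_mutations = 0
    for gene in gene_set:
        samples = mutations.get(gene, set())
        covered |= samples
        total_mutations += len(samples)
    coverage = len(covered)
    return coverage, total_mutations - coverage

def submatrix_weight(mutations, gene_set):
    """Assumed single-number score: reward coverage, penalize overlap."""
    coverage, overlap = coverage_and_overlap(mutations, gene_set)
    return coverage - overlap

# Hypothetical mutation matrix over samples 1..5.
mutations = {"A": {1, 2, 3}, "B": {4, 5}, "C": {3, 4}}
exclusive_pair = coverage_and_overlap(mutations, ["A", "B"])    # no shared samples
overlapping_pair = coverage_and_overlap(mutations, ["A", "C"])  # sample 3 is shared
```

A multi-objective search would keep coverage and overlap as separate objectives instead of collapsing them into one weight, which is what allows the trade-off between the two to be adjusted.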

A Modular Lentiviral and Retroviral Construction System to Rapidly Generate Vectors for Gene Expression and Gene Knockdown In Vitro and In Vivo

PubMed Central

Geiling, Benjamin; Vandal, Guillaume; Posner, Ada R.; de Bruyns, Angeline; Dutchak, Kendall L.; Garnett, Samantha; Dankort, David

2013-01-01

The ability to express exogenous cDNAs while suppressing endogenous genes via RNAi represents an extremely powerful research tool with the most efficient non-transient approach being accomplished through stable viral vector integration. Unfortunately, since traditional restriction enzyme based methods for constructing such vectors are sequence dependent, their construction is often difficult and not amenable to mass production. Here we describe a non-sequence dependent Gateway recombination cloning system for the rapid production of novel lentiviral (pLEG) and retroviral (pREG) vectors. Using this system to recombine 3 or 4 modular plasmid components it is possible to generate viral vectors expressing cDNAs with or without inhibitory RNAs (shRNAmirs). In addition, we demonstrate a method to rapidly produce and triage novel shRNAmirs for use with this system. Once strong candidate shRNAmirs have been identified they may be linked together in tandem to knock down expression of multiple targets simultaneously or to improve the knockdown of a single target. Here we demonstrate that these recombinant vectors are able to express cDNA and effectively knock down protein expression using both cell culture and animal model systems. PMID:24146852
PsRobot: a web-based plant small RNA meta-analysis toolbox.

PubMed

Wu, Hua-Jun; Ma, Ying-Ke; Chen, Tong; Wang, Meng; Wang, Xiu-Jie

2012-07-01

Small RNAs (smRNAs) in plants, mainly microRNAs and small interfering RNAs, play important roles in both transcriptional and post-transcriptional gene regulation. The broad application of high-throughput sequencing technology has made routine generation of bulk smRNA sequences in laboratories possible and has thus significantly increased the need for batch analysis tools. PsRobot is a web-based easy-to-use tool dedicated to the identification of smRNAs with stem-loop shaped precursors (such as microRNAs and short hairpin RNAs) and their target genes/transcripts. It performs fast analysis to identify smRNAs with stem-loop shaped precursors among batch input data and predicts their targets using a modified Smith-Waterman algorithm. PsRobot integrates the expression data of smRNAs in major plant smRNA biogenesis gene mutants and smRNA-associated protein complexes to give clues to the smRNA generation and functional processes. Besides improved specificity, the reliability of smRNA target prediction results can also be evaluated by mRNA cleavage (degradome) data. The cross-species conservation statuses and the multiplicity of smRNA target sites are also provided. PsRobot is freely accessible at http://omicslab.genetics.ac.cn/psRobot/.
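For readers unfamiliar with the alignment method mentioned in this abstract, the following Python sketch computes a standard Smith-Waterman local alignment score. It is the textbook algorithm with a linear gap penalty, not the modified version used by PsRobot, whose details the abstract does not give; the scoring parameters and the function name are illustrative assumptions.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between sequences a and b.

    Textbook Smith-Waterman with a linear gap penalty. Only the previous
    row of the dynamic-programming matrix is kept, since the score (and
    not the alignment itself) is all that is returned.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A cell never drops below zero: that is what makes the
            # alignment local rather than global.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best

identical = smith_waterman_score("ACGU", "ACGU")
unrelated = smith_waterman_score("AAAA", "UUUU")
```

A target predictor would typically score a small RNA against the reverse complement of each candidate transcript; that step is omitted here to keep the sketch focused on the recurrence.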
Identification of a feather β-keratin gene exclusively expressed in pennaceous barbule cells of contour feathers in chicken.

PubMed

Kowata, Kinue; Nakaoka, Minori; Nishio, Kaori; Fukao, Ayaka; Satoh, Akira; Ogoshi, Maho; Takahashi, Sumio; Tsudzuki, Masaoki; Takeuchi, Sakae

2014-05-25

Feathers are elaborate skin appendages shared by birds and theropod dinosaurs that have hierarchical branching of the rachis, barbs, and barbules. Feather filaments consist of β-keratins encoded by multiple genes, most of which are located in tandem arrays on chromosomes 2, 25, and 27 in chicken. The expansion of the genes is thought to have contributed to feather evolution; however, it is unclear how the individual genes are involved in feather formation. The aim of the present study was to identify feather keratin genes involved in the formation of barbules. Using a combination of microarray analysis, reverse-transcription polymerase chain reaction, and in situ hybridization, we found an uncharacterized keratin gene on chromosome 7 that was expressed specifically in barbule cells in regenerating chicken feathers. We have named the gene barbule specific keratin 1 (BlSK1). The BlSK1 gene structure was similar to the gene structure of previously characterized feather keratin genes, and consisted of a non-coding leader exon, an intron, and an exon with an open reading frame (ORF). The ORF was predicted to encode a 98 aa long protein, which shared 59% identity with feather keratin B. Orthologs of BlSK1 were found in the genomes of other avian species, including turkey, duck, zebra finch, and flycatcher, in regions that shared synteny with chromosome 7 of chicken. Interestingly, BlSK1 was expressed in feather follicles that generated pennaceous barbules but not in follicles that generated plumulaceous barbules. These results suggested that the composition of feather keratins probably varies depending on the structure of the feather filaments and, that individual feather keratin genes may be involved in building different portions and/or types of feathers in chicken. Copyright © 2014 Elsevier B.V. All rights reserved.
Implementation and assessment of a yeast orphan gene research project; involving undergraduates in authentic research experiences and progressing our understanding of uncharacterized open reading frames

PubMed Central

Bowling, Bethany V.; Schultheis, Patrick J.

2015-01-01

Saccharomyces cerevisiae was the first eukaryotic organism to be sequenced; however, little progress has been made in recent years in furthering our understanding of all open reading frames (ORFs). From October 2012 to May 2015 the number of verified ORFs has only risen from 75.31% to 78% while the number of uncharacterized ORFs has decreased from 12.8% to 11% (representing more than 700 genes still left in this category) [http://www.yeastgenome.org/genomesnapshot]. Course-based research has been shown to increase student learning while providing experience with real scientific investigation; however, implementation in large, multi-section courses presents many challenges. This study sought to test the feasibility and effectiveness of incorporating authentic research into a core genetics course with multiple instructors to increase student learning and progress our understanding of uncharacterized ORFs. We generated a module-based annotation toolkit and utilized easily accessible bioinformatics tools to predict gene function for uncharacterized ORFs within the Saccharomyces Genome Database (SGD). Students were each assigned an uncharacterized ORF which they annotated using contemporary comparative genomics methodologies including multiple sequence alignment, conserved domain identification, signal peptide prediction and cellular localization algorithms. Student learning outcomes were measured by quizzes, project reports and presentations, as well as a post-project questionnaire. Our results indicate the authentic research experience had positive impacts on students' perception of their learning and their confidence to conduct future research. Furthermore, we believe that creation of an online repository and adoption and/or adaptation of this project across multiple researchers and institutions could speed the process of gene function prediction. PMID:26460164
Implementation and assessment of a yeast orphan gene research project: involving undergraduates in authentic research experiences and progressing our understanding of uncharacterized open reading frames.

PubMed

Bowling, Bethany V; Schultheis, Patrick J; Strome, Erin D

2016-02-01

Saccharomyces cerevisiae was the first eukaryotic organism to be sequenced; however, little progress has been made in recent years in furthering our understanding of all open reading frames (ORFs). From October 2012 to May 2015 the number of verified ORFs had only risen from 75.31% to 78%, while the number of uncharacterized ORFs had decreased from 12.8% to 11% (representing > 700 genes still left in this category; http://www.yeastgenome.org/genomesnapshot). Course-based research has been shown to increase student learning while providing experience with real scientific investigation; however, implementation in large, multi-section courses presents many challenges. This study sought to test the feasibility and effectiveness of incorporating authentic research into a core genetics course, with multiple instructors, to increase student learning and progress our understanding of uncharacterized ORFs. We generated a module-based annotation toolkit and utilized easily accessible bioinformatics tools to predict gene function for uncharacterized ORFs within the Saccharomyces Genome Database (SGD). Students were each assigned an uncharacterized ORF, which they annotated using contemporary comparative genomics methodologies, including multiple sequence alignment, conserved domain identification, signal peptide prediction and cellular localization algorithms. Student learning outcomes were measured by quizzes, project reports and presentations, as well as a post-project questionnaire. Our results indicate that the authentic research experience had positive impacts on students' perception of their learning and their confidence to conduct future research. Furthermore, we believe that creation of an online repository and adoption and/or adaptation of this project across multiple researchers and institutions could speed the process of gene function prediction. Copyright © 2015 John Wiley & Sons, Ltd.
Myeloid transformation of plasma cell myeloma: molecular evidence of clonal evolution revealed by next generation sequencing.

PubMed

Gralewski, Jonathon H; Post, Ginell R; van Rhee, Frits; Yuan, Youzhong

2018-02-20

Plasma cell myeloma (PCM) is a neoplasm of terminally differentiated B lymphocytes with molecular heterogeneity. Although therapy-related myeloid neoplasms are common in plasma cell myeloma patients after chemotherapy, transdifferentiation of plasma cell myeloma into myeloid neoplasms has not been reported in the literature. Here we report a very rare case of myeloid neoplasm transformed from plasma cell myeloma. A 60-year-old man with a history of plasma cell myeloma with IGH-MAF gene rearrangement and RAS/RAF mutations developed multiple soft tissue lesions one year following melphalan-based chemotherapy and autologous stem cell transplant. Morphological and immunohistochemical characterization of the extramedullary disease demonstrated that the tumor cells were derived from the monocyte-macrophage lineage. Next generation sequencing (NGS) studies detected similar clonal aberrations in the diagnostic plasma cell population and post-therapy neoplastic cells, including IGH-MAF rearrangement, multiple genetic mutations in RAS signaling pathway proteins, and loss of tumor suppressor genes. Molecular genetic analysis also revealed unique genomic alterations in the transformed tumor cells, including gain of NF1 and loss of TRAF3. To our knowledge, this is the first case of myeloid sarcoma transdifferentiated from plasma cell neoplasm. Our findings in this unique case suggest clonal evolution of plasma cell myeloma to myeloid neoplasm and the potential roles of abnormal RAS/RAF signaling pathway in lineage switch or transdifferentiation.
Joint Analysis of Strain and Parent-of-Origin Effects for Recombinant Inbred Intercrosses Generated from Multiparent Populations with the Collaborative Cross as an Example.

PubMed

Liu, Yanyan; Xiong, Sican; Sun, Wei; Zou, Fei

2018-02-02

Multiparent populations (MPP) have become popular resources for complex trait mapping because of their wider allelic diversity and larger population size compared with traditional two-way recombinant inbred (RI) strains. In mice, the collaborative cross (CC) is one of the most popular MPP and is derived from eight genetically diverse inbred founder strains. The strategy of generating RI intercrosses (RIX) from MPP in general and from the CC in particular can produce a large number of completely reproducible heterozygote genomes that better represent the (outbred) human population. Since both maternal and paternal haplotypes of each RIX are readily available, RIX is a powerful resource for studying both standing genetic and epigenetic variations of complex traits, in particular, the parent-of-origin (PoO) effects, which are important contributors to many complex traits. Furthermore, most complex traits are affected by more than one gene, so multiple quantitative trait locus mapping could be more advantageous. In this paper, for MPP-RIX data but taking CC-RIX as a working example, we propose a general Bayesian variable selection procedure to simultaneously search for multiple genes with founder allelic effects and PoO effects. The proposed model respects the complex relationship among RIX samples, and the performance of the proposed method is examined by extensive simulations. Copyright © 2018 Liu et al.
Mesenchymal stem cells: biological characteristics and potential clinical applications.

PubMed

Kassem, Moustapha

2004-01-01

Mesenchymal stem cells (MSC) are clonogenic, non-hematopoietic stem cells present in the bone marrow and are able to differentiate into multiple mesoderm-type cell lineages, for example, osteoblasts, chondrocytes and endothelial cells, and also non-mesoderm-type lineages, for example, neuronal-like cells. Several methods are currently available for isolation of the MSC based on their physical and physico-chemical characteristics, for example, adherence to plastics or other extracellular matrix components. Because of the ease of their isolation and their extensive differentiation potential, MSC are among the first stem cell types to be introduced in the clinic. Several studies have demonstrated the possible use of MSC in systemic transplantation for systemic diseases, local implantation for local tissue defects, as a vehicle for genes in gene therapy protocols or to generate transplantable tissues and organs in tissue engineering protocols. Before their widespread use in therapy, methods allowing the generation of large numbers of cells without affecting their differentiation potential as well as technologies that overcome immunological rejection (in the case of allogeneic transplantation) must be developed.
Cellular resolution maps of X-chromosome inactivation: implications for neural development, function, and disease

PubMed Central

Wu, Hao; Luo, Junjie; Yu, Huimin; Rattner, Amir; Mo, Alisa; Wang, Yanshu; Smallwood, Philip M.; Erlanger, Bracha; Wheelan, Sarah J.; Nathans, Jeremy

2014-01-01

Female eutherian mammals use X-chromosome inactivation (XCI) to epigenetically regulate gene expression from ~4% of genes. To quantitatively map the topography of XCI for defined cell types at single cell resolution, we have generated female mice that carry X-linked, Cre-activated, and nuclear-localized fluorescent reporters – GFP on one X-chromosome and tdTomato on the other. Using these reporters in combination with different Cre drivers we have defined the topographies of XCI mosaicism for multiple CNS cell types and of retinal vascular dysfunction in a model of Norrie Disease. Depending on cell type, fluctuations in the XCI mosaic are observed over a wide range of spatial scales, from neighboring cells to left vs. right sides of the body. These data imply a major role for XCI in generating female-specific, genetically directed, stochastic diversity in eutherian mammals on spatial scales that would be predicted to affect CNS function within and between individuals. PMID:24411735
In vivo genome-wide analysis of multiple tissues identifies gene regulatory networks, novel functions and downstream regulatory genes for Bapx1 and its co-regulation with Sox9 in the mammalian vertebral column.

PubMed

Chatterjee, Sumantra; Sivakamasundari, V; Yap, Sook Peng; Kraus, Petra; Kumar, Vibhor; Xing, Xing; Lim, Siew Lan; Sng, Joel; Prabhakar, Shyam; Lufkin, Thomas

2014-12-05

Vertebrate organogenesis is a highly complex process involving sequential cascades of transcription factor activation or repression. Interestingly a single developmental control gene can occasionally be essential for the morphogenesis and differentiation of tissues and organs arising from vastly disparate embryological lineages. Here we elucidated the role of the mammalian homeobox gene Bapx1 during the embryogenesis of five distinct organs at E12.5 - vertebral column, spleen, gut, forelimb and hindlimb - using expression profiling of sorted wildtype and mutant cells combined with genome wide binding site analysis. Furthermore we analyzed the development of the vertebral column at the molecular level by combining transcriptional profiling and genome wide binding data for Bapx1 with similarly generated data sets for Sox9 to assemble a detailed gene regulatory network revealing genes previously not reported to be controlled by either of these two transcription factors. The gene regulatory network appears to control cell fate decisions and morphogenesis in the vertebral column along with the prevention of premature chondrocyte differentiation thus providing a detailed molecular view of vertebral column development.
Heterogeneous Stock Rat: A Unique Animal Model for Mapping Genes Influencing Bone Fragility

PubMed Central

Alam, Imranul; Koller, Daniel L.; Sun, Qiwei; Roeder, Ryan K.; Cañete, Toni; Blázquez, Gloria; López-Aumatell, Regina; Martínez-Membrives, Esther; Vicens-Costa, Elia; Mont, Carme; Díaz, Sira; Tobeña, Adolf; Fernández-Teruel, Alberto; Whitley, Adam; Strid, Pernilla; Diez, Margarita; Johannesson, Martina; Flint, Jonathan; Econs, Michael J.; Turner, Charles H.; Foroud, Tatiana

2011-01-01

Previously, we demonstrated that skeletal mass, structure and biomechanical properties vary considerably among 11 different inbred rat strains. Subsequently, we performed quantitative trait loci (QTL) analysis in 4 inbred rat strains (F344, LEW, COP and DA) for different bone phenotypes and identified several candidate genes influencing various bone traits. The standard approach to narrowing QTL intervals down to a few candidate genes typically employs the generation of congenic lines, which is time consuming and often not successful. A potential alternative approach is to use a highly genetically informative animal model resource capable of delivering very high-resolution gene mapping such as Heterogeneous stock (HS) rat. HS rat was derived from eight inbred progenitors: ACI/N, BN/SsN, BUF/N, F344/N, M520/N, MR/N, WKY/N and WN/N. The genetic recombination pattern generated across 50 generations in these rats has been shown to deliver ultra-high even gene-level resolution for complex genetic studies. The purpose of this study is to investigate the usefulness of the HS rat model for fine mapping and identification of genes underlying bone fragility phenotypes. We compared bone geometry, density and strength phenotypes at multiple skeletal sites in HS rats with those obtained from 5 of the 8 progenitor inbred strains. In addition, we estimated the heritability for different bone phenotypes in these rats and employed principal component analysis to explore relationships among bone phenotypes in the HS rats. Our study demonstrates that significant variability exists for different skeletal phenotypes in HS rats compared with their inbred progenitors. In addition, we estimated high heritability for several bone phenotypes and biologically interpretable factors explaining significant overall variability, suggesting that the HS rat model could be a unique genetic resource for rapid and efficient discovery of the genetic determinants of bone fragility. PMID:21334473
Heterogeneous stock rat: a unique animal model for mapping genes influencing bone fragility.

PubMed

Alam, Imranul; Koller, Daniel L; Sun, Qiwei; Roeder, Ryan K; Cañete, Toni; Blázquez, Gloria; López-Aumatell, Regina; Martínez-Membrives, Esther; Vicens-Costa, Elia; Mont, Carme; Díaz, Sira; Tobeña, Adolf; Fernández-Teruel, Alberto; Whitley, Adam; Strid, Pernilla; Diez, Margarita; Johannesson, Martina; Flint, Jonathan; Econs, Michael J; Turner, Charles H; Foroud, Tatiana

2011-05-01

Previously, we demonstrated that skeletal mass, structure and biomechanical properties vary considerably among 11 different inbred rat strains. Subsequently, we performed quantitative trait loci (QTL) analysis in four inbred rat strains (F344, LEW, COP and DA) for different bone phenotypes and identified several candidate genes influencing various bone traits. The standard approach to narrowing QTL intervals down to a few candidate genes typically employs the generation of congenic lines, which is time consuming and often not successful. A potential alternative approach is to use a highly genetically informative animal model resource capable of delivering very high resolution gene mapping such as Heterogeneous stock (HS) rat. HS rat was derived from eight inbred progenitors: ACI/N, BN/SsN, BUF/N, F344/N, M520/N, MR/N, WKY/N and WN/N. The genetic recombination pattern generated across 50 generations in these rats has been shown to deliver ultra-high even gene-level resolution for complex genetic studies. The purpose of this study is to investigate the usefulness of the HS rat model for fine mapping and identification of genes underlying bone fragility phenotypes. We compared bone geometry, density and strength phenotypes at multiple skeletal sites in HS rats with those obtained from five of the eight progenitor inbred strains. In addition, we estimated the heritability for different bone phenotypes in these rats and employed principal component analysis to explore relationships among bone phenotypes in the HS rats. Our study demonstrates that significant variability exists for different skeletal phenotypes in HS rats compared with their inbred progenitors. In addition, we estimated high heritability for several bone phenotypes and biologically interpretable factors explaining significant overall variability, suggesting that the HS rat model could be a unique genetic resource for rapid and efficient discovery of the genetic determinants of bone fragility. 
Copyright © 2010 Elsevier Inc. All rights reserved.
Characterizing mutation-expression network relationships in multiple cancers.

PubMed

Ghazanfar, Shila; Yang, Jean Yee Hwa

2016-08-01

Data made available through large cancer consortia like The Cancer Genome Atlas make for a rich source of information to be studied across and between cancers. In recent years, network approaches have been applied to such data in uncovering the complex interrelationships between mutational and expression profiles, but lack direct testing for expression changes via mutation. In this pan-cancer study we analyze mutation and gene expression information in an integrative manner by considering the networks generated by testing for differences in expression in direct association with specific mutations. We relate our findings among the 19 cancers examined to identify commonalities and differences as well as their characteristics. Using somatic mutation and gene expression information across 19 cancers, we generated mutation-expression networks per cancer. On evaluation we found that our generated networks were significantly enriched for known cancer-related genes, such as skin cutaneous melanoma (p<0.01 using Network of Cancer Genes 4.0). Our framework identified that while different cancers contained commonly mutated genes, there was little concordance between associated gene expression changes among cancers. Comparison between cancers showed a greater overlap of network nodes for cancers with higher overall non-silent mutation load, compared to those with a lower overall non-silent mutation load. This study offers a framework that explores network information through co-analysis of somatic mutations and gene expression profiles. Our pan-cancer application of this approach suggests that while mutations are frequently common among cancer types, the impact they have on the surrounding networks via gene expression changes varies. Despite this finding, there are some cancers for which mutation-associated network behaviour appears to be similar: suggesting a potential framework for uncovering related cancers for which similar therapeutic strategies may be applicable. 
Our framework for understanding relationships among cancers has been integrated into an interactive R Shiny application, PAn Cancer Mutation Expression Networks (PACMEN), containing dynamic and static network visualization of the mutation-expression networks. PACMEN also features tools for further examination of network topology characteristics among cancers. Copyright © 2016 Elsevier Ltd. All rights reserved.
Genome-wide generation and use of informative intron-spanning and intron-length polymorphism markers for high-throughput genetic analysis in rice

PubMed Central

Badoni, Saurabh; Das, Sweta; Sayal, Yogesh K.; Gopalakrishnan, S.; Singh, Ashok K.; Rao, Atmakuri R.; Agarwal, Pinky; Parida, Swarup K.; Tyagi, Akhilesh K.

2016-01-01

We developed genome-wide 84634 ISM (intron-spanning marker) and 16510 InDel-fragment length polymorphism-based ILP (intron-length polymorphism) markers from genes physically mapped on 12 rice chromosomes. These genic markers revealed much higher amplification-efficiency (80%) and polymorphic-potential (66%) among rice accessions even by a cost-effective agarose gel-based assay. A wider level of functional molecular diversity (17–79%) and well-defined precise admixed genetic structure was assayed by 3052 genome-wide markers in a structured population of indica, japonica, aromatic and wild rice. Six major grain weight QTLs (11.9–21.6% phenotypic variation explained) were mapped on five rice chromosomes of a high-density (inter-marker distance: 0.98 cM) genetic linkage map (IR 64 x Sonasal) anchored with 2785 known/candidate gene-derived ISM and ILP markers. The designing of multiple ISM and ILP markers (2 to 4 markers/gene) in an individual gene will broaden the user-preference to select suitable primer combination for efficient assaying of functional allelic variation/diversity and realistic estimation of differential gene expression profiles among rice accessions. The genomic information generated in our study is made publicly accessible through a user-friendly web-resource, “Oryza ISM-ILP marker” database. The known/candidate gene-derived ISM and ILP markers can be enormously deployed to identify functionally relevant trait-associated molecular tags by optimal-resource expenses, leading towards genomics-assisted crop improvement in rice. PMID:27032371
Long-read whole genome sequencing and comparative analysis of six strains of the human pathogen Orientia tsutsugamushi.

PubMed

Batty, Elizabeth M; Chaemchuen, Suwittra; Blacksell, Stuart; Richards, Allen L; Paris, Daniel; Bowden, Rory; Chan, Caroline; Lachumanan, Ramkumar; Day, Nicholas; Donnelly, Peter; Chen, Swaine; Salje, Jeanne

2018-06-01

Orientia tsutsugamushi is a clinically important but neglected obligate intracellular bacterial pathogen of the Rickettsiaceae family that causes the potentially life-threatening human disease scrub typhus. In contrast to the genome reduction seen in many obligate intracellular bacteria, early genetic studies of Orientia have revealed one of the most repetitive bacterial genomes sequenced to date. The dramatic expansion of mobile elements has hampered efforts to generate complete genome sequences using short read sequencing methodologies, and consequently there have been few studies of the comparative genomics of this neglected species. We report new high-quality genomes of O. tsutsugamushi, generated using PacBio single molecule long read sequencing, for six strains: Karp, Kato, Gilliam, TA686, UT76 and UT176. In comparative genomics analyses of these strains together with existing reference genomes from Ikeda and Boryong strains, we identify a relatively small core genome of 657 genes, grouped into core gene islands and separated by repeat regions, and use the core genes to infer the first whole-genome phylogeny of Orientia. Complete assemblies of multiple Orientia genomes verify initial suggestions that these are remarkable organisms. They have larger genomes compared with most other Rickettsiaceae, with widespread amplification of repeat elements and massive chromosomal rearrangements between strains. At the gene level, Orientia has a relatively small set of universally conserved genes, similar to other obligate intracellular bacteria, and the relative expansion in genome size can be accounted for by gene duplication and repeat amplification. Our study demonstrates the utility of long read sequencing to investigate complex bacterial genomes and characterise genomic variation.
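The core-genome idea in the abstract above (genes shared by every strain) can be illustrated with a small sketch. This is only an illustration under stated assumptions: it presumes that genes have already been grouped into ortholog clusters with shared identifiers, the function name `core_genome` is hypothetical, and the gene identifiers in the usage note are invented. It is not the authors' comparative-genomics pipeline.

```python
from typing import Dict, Iterable, Set


def core_genome(gene_sets: Dict[str, Iterable[str]]) -> Set[str]:
    """Return the gene identifiers present in every strain.

    gene_sets maps a strain name to the ortholog-cluster identifiers
    found in that strain's genome (assumed to be precomputed).
    """
    if not gene_sets:
        return set()
    strains = iter(gene_sets.values())
    core = set(next(strains))
    for genes in strains:
        # Keep only genes that are also present in this strain.
        core &= set(genes)
    return core


# Toy input: strain names come from the abstract, gene IDs are invented.
toy = {
    "Karp": ["g1", "g2", "g3", "g7"],
    "Kato": ["g1", "g2", "g4"],
    "Gilliam": ["g1", "g2", "g3"],
}
shared = core_genome(toy)
```

With real data the hard part is the ortholog clustering that produces the identifiers, which is especially delicate in a repeat-rich genome such as Orientia; the intersection itself is the easy final step.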
Identification of Fitness Determinants during Energy-Limited Growth Arrest in Pseudomonas aeruginosa

PubMed Central

Basta, David W.; Bergkessel, Megan

2017-01-01

ABSTRACT Microbial growth arrest can be triggered by diverse factors, one of which is energy limitation due to scarcity of electron donors or acceptors. Genes that govern fitness during energy-limited growth arrest and the extent to which they overlap between different types of energy limitation are poorly defined. In this study, we exploited the fact that Pseudomonas aeruginosa can remain viable over several weeks when limited for organic carbon (pyruvate) as an electron donor or oxygen as an electron acceptor. ATP values were reduced under both types of limitation, yet more severely in the absence of oxygen. Using transposon-insertion sequencing (Tn-seq), we identified fitness determinants in these two energy-limited states. Multiple genes encoding general functions like transcriptional regulation and energy generation were required for fitness during carbon or oxygen limitation, yet many specific genes, and thus specific activities, differed in their relevance between these states. For instance, the global regulator RpoS was required during both types of energy limitation, while other global regulators such as DksA and LasR were required only during carbon or oxygen limitation, respectively. Similarly, certain ribosomal and tRNA modifications were specifically required during oxygen limitation. We validated fitness defects during energy limitation using independently generated mutants of genes detected in our screen. Mutants in distinct functional categories exhibited different fitness dynamics: regulatory genes generally manifested a phenotype early, whereas genes involved in cell wall metabolism were required later. Together, these results provide a new window into how P. aeruginosa survives growth arrest. PMID:29184024
The Chlamydomonas Dhc1 gene encodes a dynein heavy chain subunit required for assembly of the I1 inner arm complex.

PubMed Central

Myster, S H; Knott, J A; O'Toole, E; Porter, M E

1997-01-01

Multiple members of the dynein heavy chain (Dhc) gene family have been recovered in several organisms, but the relationships between these sequences and the Dhc isoforms that they encode are largely unknown. To identify Dhc loci and determine the specific functions of the individual Dhc isoforms, we have screened a collection of motility mutants generated by insertional mutagenesis in Chlamydomonas. In this report, we characterize one strain, pf9-3, in which the insertion event was accompanied by a deletion of approximately 13 kb of genomic DNA within the transcription unit of the Dhc1 gene. Northern blot analysis confirms that pf9-3 is a null mutation. Biochemical and structural studies of isolated axonemes demonstrate that the pf9-3 mutant fails to assemble the I1 inner arm complex, a two-headed dynein isoform composed of two Dhcs (1 alpha and 1 beta) and three intermediate chains. To determine if the Dhc1 gene product corresponds to one of the Dhcs of the I1 complex, antibodies were generated against a Dhc1-specific peptide sequence. Immunoblot analysis reveals that the Dhc1 gene encodes the 1 alpha Dhc subunit. These studies thus identify the first inner arm Dhc locus to be described in any organism and further demonstrate that the 1 alpha Dhc subunit plays an essential role in the assembly of the I1 inner arm complex. PMID:9247642
A versatile toolkit for high throughput functional genomics with Trichoderma reesei

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schuster, Andre; Bruno, Kenneth S.; Collett, James R.

2012-01-02

The ascomycete fungus, Trichoderma reesei (anamorph of Hypocrea jecorina), represents a biotechnological workhorse and is currently one of the most proficient cellulase producers. While strain improvement was traditionally accomplished by random mutagenesis, a detailed understanding of cellulase regulation can only be gained using recombinant technologies. RESULTS: Aiming at high efficiency and high throughput methods, we present here a construction kit for gene knock out in T. reesei. We provide a primer database for gene deletion using the pyr4, amdS and hph selection markers. For high throughput generation of gene knock outs, we constructed vectors using yeast mediated recombination and then transformed a T. reesei strain deficient in non-homologous end joining (NHEJ) by spore electroporation. This NHEJ-defect was subsequently removed by crossing of mutants with a sexually competent strain derived from the parental strain, QM9414. CONCLUSIONS: Using this strategy and the materials provided, high throughput gene deletion in T. reesei becomes feasible. Moreover, with the application of sexual development, the NHEJ-defect can be removed efficiently and without the need for additional selection markers. The same advantages apply for the construction of multiple mutants by crossing of strains with different gene deletions, which is now possible with considerably less hands-on time and minimal screening effort compared to a transformation approach. Consequently, this toolkit can considerably boost research towards efficient exploitation of the resources of T. reesei for cellulase expression and hence second generation biofuel production.
Dynamic evolution of the GnRH receptor gene family in vertebrates.

PubMed

Williams, Barry L; Akazome, Yasuhisa; Oka, Yoshitaka; Eisthen, Heather L

2014-10-25

Elucidating the mechanisms underlying coevolution of ligands and receptors is an important challenge in molecular evolutionary biology. Peptide hormones and their receptors are excellent models for such efforts, given the relative ease of examining evolutionary changes in genes encoding for both molecules. Most vertebrates possess multiple genes for both the decapeptide gonadotropin releasing hormone (GnRH) and for the GnRH receptor. The evolutionary history of the receptor family, including ancestral copy number and timing of duplications and deletions, has been the subject of controversy. We report here for the first time sequences of three distinct GnRH receptor genes in salamanders (axolotls, Ambystoma mexicanum), which are orthologous to three GnRH receptors from ranid frogs. To understand the origin of these genes within the larger evolutionary context of the gene family, we performed phylogenetic analyses and probabilistic protein homology searches of GnRH receptor genes in vertebrates and their near relatives. Our analyses revealed four points that alter previous views about the evolution of the GnRH receptor gene family. First, the "mammalian" pituitary type GnRH receptor, which is the sole GnRH receptor in humans and previously presumed to be highly derived because it lacks the cytoplasmic C-terminal domain typical of most G-protein coupled receptors, is actually an ancient gene that originated in the common ancestor of jawed vertebrates (Gnathostomata). Second, unlike previous studies, we classify vertebrate GnRH receptors into five subfamilies. Third, the order of subfamily origins is the inverse of previous proposed models. Fourth, the number of GnRH receptor genes has been dynamic in vertebrates and their ancestors, with multiple duplications and losses. Our results provide a novel evolutionary framework for generating hypotheses concerning the functional importance of structural characteristics of vertebrate GnRH receptors. 
We show that five subfamilies of vertebrate GnRH receptors evolved early in the vertebrate phylogeny, followed by several independent instances of gene loss. Chief among cases of gene loss are humans, best described as degenerate with respect to GnRH receptors because we retain only a single, ancient gene.
The molecular and mathematical basis of Waddington's epigenetic landscape: a framework for post-Darwinian biology?

PubMed

Huang, Sui

2012-02-01

The Neo-Darwinian concept of natural selection is plausible when one assumes a straightforward causation of phenotype by genotype. However, such simple 1:1 mapping must now give place to the modern concepts of gene regulatory networks and gene expression noise. Both can, in the absence of genetic mutations, jointly generate a diversity of inheritable randomly occupied phenotypic states that could also serve as a substrate for natural selection. This form of epigenetic dynamics challenges Neo-Darwinism. It needs to incorporate the non-linear, stochastic dynamics of gene networks. A first step is to consider the mathematical correspondence between gene regulatory networks and Waddington's metaphoric 'epigenetic landscape', which actually represents the quasi-potential function of global network dynamics. It explains the coexistence of multiple stable phenotypes within one genotype. The landscape's topography with its attractors is shaped by evolution through mutational re-wiring of regulatory interactions - offering a link between genetic mutation and sudden, broad evolutionary changes. Copyright © 2012 WILEY Periodicals, Inc.

Gene Expression Profiling in Fish Toxicology: A Review.

PubMed

Kumar, Girish; Denslow, Nancy D

In this review, we present an overview of transcriptomic responses to chemical exposures in a variety of fish species. We discuss the use of several molecular approaches for measuring gene expression, such as northern blotting, differential display reverse transcription-polymerase chain reaction (DDRT-PCR), suppression subtractive hybridization (SSH), real time quantitative PCR (RT-qPCR), microarrays, and next-generation sequencing (NGS). These techniques have mainly been used to measure the toxic effects of single compounds or simple mixtures in laboratory conditions. In addition, only a few studies have examined the biological significance of differentially expressed gene sets following chemical exposure. Therefore, future studies should focus more on field conditions, using a multidisciplinary approach (genomics, proteomics and metabolomics) to understand the synergistic effects of multiple environmental stressors and to determine the functional significance of differentially expressed genes. Nevertheless, recent developments in NGS technologies and decreasing costs of sequencing hold the promise of uncovering the complexity of anthropogenic impacts and biological effects in wild fish populations.
Relating genes to function: identifying enriched transcription factors using the ENCODE ChIP-Seq significance tool.

PubMed

Auerbach, Raymond K; Chen, Bin; Butte, Atul J

2013-08-01

Biological analysis has shifted from identifying genes and transcripts to mapping these genes and transcripts to biological functions. The ENCODE Project has generated hundreds of ChIP-Seq experiments spanning multiple transcription factors and cell lines for public use, but tools for a biomedical scientist to analyze these data are either non-existent or tailored to narrow biological questions. We present the ENCODE ChIP-Seq Significance Tool, a flexible web application leveraging public ENCODE data to identify enriched transcription factors in a gene or transcript list for comparative analyses. The ENCODE ChIP-Seq Significance Tool is written in JavaScript on the client side and has been tested on Google Chrome, Apple Safari and Mozilla Firefox browsers. Server-side scripts are written in PHP and leverage R and a MySQL database. The tool is available at http://encodeqt.stanford.edu. Contact: abutte@stanford.edu. Supplementary material is available at Bioinformatics online.
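Identifying "enriched" transcription factors in a gene list is commonly framed as an over-representation test. The sketch below shows one standard way to do that, a one-sided hypergeometric test, written with the Python standard library only. The abstract above does not state which statistic the tool applies, so the choice of test, the function name `hypergeom_enrichment_p`, and the numbers in the usage lines are all assumptions made for illustration.

```python
from math import comb


def hypergeom_enrichment_p(population: int, targets_in_population: int,
                           sample: int, targets_in_sample: int) -> float:
    """One-sided hypergeometric p-value, P(X >= targets_in_sample).

    population             number of genes in the background set
    targets_in_population  background genes bound by the factor
    sample                 size of the user's gene list
    targets_in_sample      genes in the list bound by the factor
    """
    upper = min(sample, targets_in_population)
    total = comb(population, sample)
    # Sum the probability of seeing this many bound genes or more.
    tail = sum(
        comb(targets_in_population, k)
        * comb(population - targets_in_population, sample - k)
        for k in range(targets_in_sample, upper + 1)
    )
    return tail / total


# Toy numbers (invented): 20,000 background genes, 1,500 bound by a factor,
# a 200-gene list of which 40 are bound.
p_value = hypergeom_enrichment_p(20000, 1500, 200, 40)
```

When many transcription factors are tested against the same list, the resulting p-values would normally be corrected for multiple testing before any factor is called enriched.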
GENEASE: Real time bioinformatics tool for multi-omics and disease ontology exploration, analysis and visualization.

PubMed

Ghandikota, Sudhir; Hershey, Gurjit K Khurana; Mersha, Tesfaye B

2018-03-24

Advances in high-throughput sequencing technologies have made it possible to generate multiple omics data at an unprecedented rate and scale. The accumulation of these omics data far outpaces the rate at which biologists can mine them and generate new hypotheses to test experimentally. There is an urgent need to develop a myriad of powerful tools to efficiently and effectively search and filter these resources to address specific post-GWAS functional genomics questions. However, to date, these resources are scattered across several databases and often lack a unified portal for data annotation and analytics. In addition, existing tools to analyze and visualize these databases are highly fragmented, requiring researchers to access multiple applications and perform manual interventions for each gene or variant in an ad hoc fashion until all the questions are answered. In this study, we present GENEASE, a web-based one-stop bioinformatics tool designed to not only query and explore multi-omics and phenotype databases (e.g., GTEx, ClinVar, dbGaP, GWAS Catalog, ENCODE, Roadmap Epigenomics, KEGG, Reactome, Gene and Phenotype Ontology) in a single web interface but also to perform seamless post genome-wide association downstream functional and overlap analysis for non-coding regulatory variants. GENEASE accesses over 50 different databases in the public domain, including model organism-specific databases, to facilitate gene/variant and disease exploration, enrichment and overlap analysis in real time. It is a user-friendly tool with a point-and-click interface containing links for support information, including a user manual and examples. GENEASE can be accessed freely at http://research.cchmc.org/mershalab/genease_new/login.html. Contact: Tesfaye.Mersha@cchmc.org, Sudhir.Ghandikota@cchmc.org. Supplementary data are available at Bioinformatics online.
Fabrication of multi-well chips for spheroid cultures and implantable constructs through rapid prototyping techniques.

PubMed

Lopa, Silvia; Piraino, Francesco; Kemp, Raymond J; Di Caro, Clelia; Lovati, Arianna B; Di Giancamillo, Alessia; Moroni, Lorenzo; Peretti, Giuseppe M; Rasponi, Marco; Moretti, Matteo

2015-07-01

Three-dimensional (3D) culture models are widely used in basic and translational research. In this study, to generate and culture multiple 3D cell spheroids, we exploited laser ablation and replica molding for the fabrication of polydimethylsiloxane (PDMS) multi-well chips, which were validated using articular chondrocytes (ACs). Multi-well AC spheroids were comparable or superior to standard spheroids, as revealed by glycosaminoglycan and type-II collagen deposition. Moreover, the use of our multi-well chips significantly reduced the operation time for cell seeding and medium refresh. Exploiting a similar approach, we used clinical-grade fibrin to generate implantable multi-well constructs allowing for the precise distribution of multiple cell types. Multi-well fibrin constructs were seeded with ACs generating high cell density regions, as shown by histology and cell fluorescent staining. Multi-well constructs were compared to standard constructs with homogeneously distributed ACs. After 7 days in vitro, expression of SOX9, ACAN, COL2A1, and COMP was increased in both constructs, with multi-well constructs expressing significantly higher levels of chondrogenic genes than standard constructs. After 5 weeks in vivo, we found that despite a dramatic size reduction, the cell distribution pattern was maintained and glycosaminoglycan content per wet weight was significantly increased with respect to pre-implantation samples. In conclusion, multi-well chips for the generation and culture of multiple cell spheroids can be fabricated by low-cost rapid prototyping techniques. Furthermore, these techniques can be used to generate implantable constructs with defined architecture and controlled cell distribution, allowing for in vitro and in vivo investigation of cell interactions in a 3D environment. © 2015 Wiley Periodicals, Inc.
BM-Map: Bayesian Mapping of Multireads for Next-Generation Sequencing Data

PubMed Central

Ji, Yuan; Xu, Yanxun; Zhang, Qiong; Tsui, Kam-Wah; Yuan, Yuan; Norris, Clift; Liang, Shoudan; Liang, Han

2011-01-01

Summary: Next-generation sequencing (NGS) technology generates millions of short reads, which provide valuable information for various aspects of cellular activities and biological functions. A key step in NGS applications (e.g., RNA-Seq) is to map short reads to correct genomic locations within the source genome. While most reads are mapped to a unique location, a significant proportion of reads align to multiple genomic locations with equal or similar numbers of mismatches; these are called multireads. The ambiguity in mapping the multireads may lead to bias in downstream analyses. Currently, most practitioners discard the multireads in their analysis, resulting in a loss of valuable information, especially for genes with similar sequences. To refine the read mapping, we develop a Bayesian model that computes the posterior probability of mapping a multiread to each competing location. The probabilities are used for downstream analyses, such as the quantification of gene expression. We show through simulation studies and RNA-Seq analysis of real-life data that the Bayesian method yields better mapping than the current leading methods. We provide a C++ program for download that is being packaged into user-friendly software. PMID:21517792
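To make the idea of a posterior probability over competing locations concrete, here is a deliberately simplified sketch. It assumes a single uniform per-base error rate and independent mismatches, so the likelihood of a location depends only on its mismatch count; the function name `multiread_posteriors`, the default error rate, and the optional priors are hypothetical. The published model is richer than this, and the sketch is not a reimplementation of it.

```python
from typing import List, Optional, Sequence


def multiread_posteriors(mismatches: Sequence[int],
                         read_length: int,
                         error_rate: float = 0.01,
                         priors: Optional[Sequence[float]] = None) -> List[float]:
    """Posterior probability that a multiread originates from each location.

    mismatches[i] is the number of mismatches when the read is aligned to
    candidate location i. Under the simplifying assumptions stated above,
    likelihood_i = error_rate**m * (1 - error_rate)**(read_length - m).
    """
    if priors is None:
        priors = [1.0] * len(mismatches)  # uniform prior over locations
    weights = [
        prior * (error_rate ** m) * ((1.0 - error_rate) ** (read_length - m))
        for prior, m in zip(priors, mismatches)
    ]
    total = sum(weights)
    # Normalise so the posteriors over the candidate locations sum to 1.
    return [w / total for w in weights]


# A 50-base read with a perfect match at one location and one mismatch at another.
posteriors = multiread_posteriors([0, 1], read_length=50)
```

Rather than discarding the read, a downstream expression estimate could then credit each gene with a fractional count equal to the posterior of its location.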
Discovery of ALK-PTPN3 gene fusion from human non-small cell lung carcinoma cell line using next generation RNA sequencing.

PubMed

Jung, Yeonjoo; Kim, Pora; Jung, Yeonhwa; Keum, Juhee; Kim, Soon-Nam; Choi, Yong Soo; Do, In-Gu; Lee, Jinseon; Choi, So-Jung; Kim, Sujin; Lee, Jong-Eun; Kim, Jhingook; Lee, Sanghyuk; Kim, Jaesang

2012-06-01

An increasing number of chromosomal aberrations is being identified in solid tumors providing novel biomarkers for various types of cancer and new insights into the mechanisms of carcinogenesis. We applied next generation sequencing technique to analyze the transcriptome of the non-small cell lung carcinoma (NSCLC) cell line H2228 and discovered a fusion transcript composed of multiple exons of ALK (anaplastic lymphoma receptor tyrosine kinase) and PTPN3 (protein tyrosine phosphatase, nonreceptor Type 3). Detailed analysis of the genomic structure revealed that a portion of genomic region encompassing Exons 10 and 11 of ALK has been translocated into the intronic region between Exons 2 and 3 of PTPN3. The key net result appears to be the null mutation of one allele of PTPN3, a gene with tumor suppressor activity. Consistently, ectopic expression of PTPN3 in NSCLC cell lines led to inhibition of colony formation. Our study confirms the utility of next generation sequencing as a tool for the discovery of somatic mutations and has led to the identification of a novel mutation in NSCLC that may be of diagnostic, prognostic, and therapeutic importance. Copyright © 2012 Wiley Periodicals, Inc.
Cardiogenic Genes Expressed in Cardiac Fibroblasts Contribute to Heart Development and Repair

PubMed Central

Furtado, Milena B.; Costa, Mauro W.; Pranoto, Edward Adi; Salimova, Ekaterina; Pinto, Alex; Lam, Nicholas T.; Park, Anthony; Snider, Paige; Chandran, Anjana; Harvey, Richard P.; Boyd, Richard; Conway, Simon J.; Pearson, James; Kaye, David M.; Rosenthal, Nadia A.

2014-01-01

Rationale: Cardiac fibroblasts are critical to proper heart function through multiple interactions with the myocardial compartment, but appreciation of their contribution has suffered from incomplete characterization and lack of cell-specific markers. Objective: To generate an unbiased comparative gene expression profile of the cardiac fibroblast pool, identify and characterize the role of key genes in cardiac fibroblast function, and determine their contribution to myocardial development and regeneration. Methods and Results: High-throughput cell surface and intracellular profiling of cardiac and tail fibroblasts identified canonical MSC and a surprising number of cardiogenic genes, some expressed at higher levels than in whole heart. Whilst genetically marked fibroblasts contributed heterogeneously to interstitial but not cardiomyocyte compartments in infarcted hearts, fibroblast-restricted depletion of one highly expressed cardiogenic marker, Tbx20, caused marked myocardial dysmorphology and perturbations in scar formation upon myocardial infarction. Conclusions: The surprising transcriptional identity of cardiac fibroblasts, the adoption of cardiogenic gene programs and direct contribution to cardiac development and repair provoke alternative interpretations for studies on more specialized cardiac progenitors, offering a novel perspective for reinterpreting cardiac regenerative therapies. PMID:24650916
Abscisic-acid-dependent basic leucine zipper (bZIP) transcription factors in plant abiotic stress.

PubMed

Banerjee, Aditya; Roychoudhury, Aryadeep

2017-01-01

One of the major causes of significant crop loss throughout the world is the myriad of environmental stresses including drought, salinity, cold, heavy metal toxicity, and ultraviolet-B (UV-B) rays. Plants, as sessile organisms, have evolved various effective mechanisms that enable them to withstand this plethora of stresses. Most such regulatory mechanisms follow the abscisic-acid (ABA)-dependent pathway. In this review, we have primarily focussed on the basic leucine zipper (bZIP) transcription factors (TFs) activated by the ABA-mediated signalosome. Upon perception of ABA by specialized receptors, the signal is transduced via various groups of Ser/Thr kinases, which phosphorylate the bZIP TFs. Following this post-translational modification, the TFs are activated so that they bind to specific cis-acting sequences called abscisic-acid-responsive elements (ABREs) or GC-rich coupling elements (CE), thereby influencing the expression of their downstream target genes. Several in silico techniques have been adopted so far to predict the structural features, recognize the regulatory modification sites, perform phylogenetic analyses, and facilitate genome-wide surveys of TFs under multiple stresses. Current investigations on the epigenetic regulation that controls greater accessibility of the inducible regions of DNA of the target gene to the bZIP TFs exclusively under stress situations, along with the evolved stress memory responses via genomic imprinting mechanisms, have been highlighted. The potential of overexpressing bZIP TFs, either in a homologous or in a heterologous background, to generate transgenic plants tolerant to various abiotic stressors has also been addressed by various groups. The present review will provide a coherent documentation on the functional characterization and regulation of bZIP TFs under multiple environmental stresses, with the major goal of generating multiple-stress-tolerant plant cultivars in the near future.
A pipeline for the de novo assembly of the Themira biloba (Sepsidae: Diptera) transcriptome using a multiple k-mer length approach.

PubMed

Melicher, Dacotah; Torson, Alex S; Dworkin, Ian; Bowsher, Julia H

2014-03-12

The Sepsidae family of flies is a model for investigating how sexual selection shapes courtship and sexual dimorphism in a comparative framework. However, like many non-model systems, there are few molecular resources available. Large-scale sequencing and assembly have not been performed in any sepsid, and the lack of a closely related genome makes investigation of gene expression challenging. Our goal was to develop an automated pipeline for de novo transcriptome assembly, and to use that pipeline to assemble and analyze the transcriptome of the sepsid Themira biloba. Our bioinformatics pipeline uses cloud computing services to assemble and analyze the transcriptome with off-site data management, processing, and backup. It uses a multiple k-mer length approach combined with a second meta-assembly to extend transcripts and recover more bases of transcript sequences than standard single k-mer assembly. We used 454 sequencing to generate 1.48 million reads from cDNA generated from embryo, larva, and pupae of T. biloba and assembled a transcriptome consisting of 24,495 contigs. Annotation identified 16,705 transcripts, including those involved in embryogenesis and limb patterning. We assembled transcriptomes from an additional three non-model organisms to demonstrate that our pipeline assembled a higher-quality transcriptome than single k-mer approaches across multiple species. The pipeline we have developed for assembly and analysis increases contig length, recovers unique transcripts, and assembles more base pairs than other methods through the use of a meta-assembly. The T. biloba transcriptome is a critical resource for performing large-scale RNA-Seq investigations of gene expression patterns, and is the first transcriptome sequenced in this Dipteran family.
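The multiple k-mer strategy described above produces one contig set per k value, which then has to be combined. The toy sketch below shows only the simplest part of such a combination step: pooling the contig sets and dropping exact duplicates and contigs fully contained in a longer one. It is a stand-in written for illustration, not the meta-assembly used by the authors; the function name `merge_contig_sets` and the example sequences are hypothetical, and reverse complements and partial overlaps are ignored.

```python
from typing import Dict, List


def merge_contig_sets(assemblies: Dict[int, List[str]]) -> List[str]:
    """Pool contigs from several single-k assemblies into one reduced set.

    assemblies maps a k-mer length to the contigs assembled with that k.
    Exact duplicates and contigs contained in a longer contig are removed.
    """
    unique = {contig for contigs in assemblies.values() for contig in contigs}
    # Longest first, so each contig is checked only against longer (or equal) ones.
    ordered = sorted(unique, key=lambda c: (-len(c), c))
    kept: List[str] = []
    for contig in ordered:
        if not any(contig in longer for longer in kept):
            kept.append(contig)
    return kept


# Invented contigs from two hypothetical assemblies (k = 21 and k = 31).
toy_assemblies = {
    21: ["ACGTACGT", "TTGA"],
    31: ["ACGTACGTAA", "GTAC"],
}
merged = merge_contig_sets(toy_assemblies)
```

A real meta-assembly would additionally join contigs that overlap only partially, which is where the gain in transcript length reported in the abstract would come from.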
Heme as a danger molecule in pathogen recognition.

PubMed

Wegiel, Barbara; Hauser, Carl J; Otterbein, Leo E

2015-12-01

Appropriate control of redox mechanisms is critical for an effective innate immune response, which employs multiple cell types, receptors and molecules that recognize danger signals when they reach the host. Recognition of pathogen-associated pattern molecules (PAMPs) is a fundamental host survival mechanism for efficient elimination of invading pathogens and resolution of the infection and inflammation. In addition to PAMPs, eukaryotic cells contain a plethora of intracellular molecules that are normally secured within the confines of the plasma membrane, but if liberated and encountered in the extracellular milieu can provoke rapid cell activation. These are known as Alarmins or Danger-Associated Molecular Patterns (DAMPs) and can be released actively by cells or passively as a result of sterile cellular injury after trauma, ischemia, or toxin-induced cell rupture. Both PAMPs and DAMPs are recognized by a series of cognate receptors that increase the generation of free radicals and activate specific signaling pathways that result in regulation of a variety of stress-response, redox-sensitive genes. Multiple mediators released as cells die, including but not limited to ATP, hydrogen peroxide, heme, formyl peptides, DNA or mitochondria, provide the second signal to amplify immune responses. In this review, we will focus on how sterile and infective stimuli activate the stress response gene heme oxygenase-1 (Hmox1, HO-1), a master gene critical to an appropriate host response that is now recognized as one with enormous therapeutic potential. HO-1 gene expression is regulated in large part by redox-sensitive proteins including but not limited to nrf2. Both PAMPs and DAMPs increase the activation of nrf2 and HO-1. Heme is a powerful pro-oxidant and as such should be qualified as a DAMP. With its degradation by HO-1, a molecule of carbon monoxide (CO) is generated that in turn serves as a bioactive signaling molecule.
PAMPs such as bacterial endotoxin activate HO-1, and the CO that is generated diffuses into the extracellular milieu where it interacts with bacteria, altering their behavior to increase production of ATP, which then functions as a second signal danger molecule. This two-hit cycle scenario results in efficient and effective activation of host leukocytes to attack and clear bacteria in part via enhanced reactive oxygen species generation. We discuss this intimate communication that occurs between host and bacteria and how these molecules serve as critical regulators of the acute inflammatory response, the overall redox status of the cell, and survival of the host. Copyright © 2015 Elsevier Inc. All rights reserved.
Unprecedented high-resolution view of bacterial operon architecture revealed by RNA sequencing.

PubMed

Conway, Tyrrell; Creecy, James P; Maddox, Scott M; Grissom, Joe E; Conkle, Trevor L; Shadid, Tyler M; Teramoto, Jun; San Miguel, Phillip; Shimada, Tomohiro; Ishihama, Akira; Mori, Hirotada; Wanner, Barry L

2014-07-08

We analyzed the transcriptome of Escherichia coli K-12 by strand-specific RNA sequencing at single-nucleotide resolution during steady-state (logarithmic-phase) growth and upon entry into stationary phase in glucose minimal medium. To generate high-resolution transcriptome maps, we developed an organizational schema which showed that in practice only three features are required to define operon architecture: the promoter, terminator, and deep RNA sequence read coverage. We precisely annotated 2,122 promoters and 1,774 terminators, defining 1,510 operons with an average of 1.98 genes per operon. Our analyses revealed an unprecedented view of E. coli operon architecture. A large proportion (36%) of operons are complex with internal promoters or terminators that generate multiple transcription units. For 43% of operons, we observed differential expression of polycistronic genes, despite being in the same operons, indicating that E. coli operon architecture allows fine-tuning of gene expression. We found that 276 of 370 convergent operons terminate inefficiently, generating complementary 3' transcript ends which overlap on average by 286 nucleotides, and 136 of 388 divergent operons have promoters arranged such that their 5' ends overlap on average by 168 nucleotides. We found 89 antisense transcripts of 397-nucleotide average length, 7 unannotated transcripts within intergenic regions, and 18 sense transcripts that completely overlap operons on the opposite strand. Of 519 overlapping transcripts, 75% correspond to sequences that are highly conserved in E. coli (>50 genomes). Our data extend recent studies showing unexpected transcriptome complexity in several bacteria and suggest that antisense RNA regulation is widespread. Importance: We precisely mapped the 5' and 3' ends of RNA transcripts across the E. coli K-12 genome by using a single-nucleotide analytical approach. Our resulting high-resolution transcriptome maps show that ca. one-third of E. coli operons are complex, with internal promoters and terminators generating multiple transcription units and allowing differential gene expression within these operons. We discovered extensive antisense transcription that results from more than 500 operons, which fully overlap or extensively overlap adjacent divergent or convergent operons. The genomic regions corresponding to these antisense transcripts are highly conserved in E. coli (including Shigella species), although it remains to be proven whether or not they are functional. Our observations of features unearthed by single-nucleotide transcriptome mapping suggest that deeper layers of transcriptional regulation in bacteria are likely to be revealed in the future. Copyright © 2014 Conway et al.
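The counts quoted in the abstract above can be turned into proportions with a few lines of arithmetic. The sketch below is illustrative only: the variable and function names are assumptions introduced here, and the inputs are the rounded figures from the abstract, so the derived values are approximate.

```python
# Counts quoted in the abstract above (names are illustrative, not the authors').
OPERONS = 1510
MEAN_GENES_PER_OPERON = 1.98
CONVERGENT_TOTAL, CONVERGENT_INEFFICIENT = 370, 276
DIVERGENT_TOTAL, DIVERGENT_OVERLAPPING = 388, 136


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage."""
    return 100.0 * part / whole


# Approximate number of genes assigned to operons (mean is rounded in the source).
genes_in_operons = OPERONS * MEAN_GENES_PER_OPERON

convergent_pct = percent(CONVERGENT_INEFFICIENT, CONVERGENT_TOTAL)
divergent_pct = percent(DIVERGENT_OVERLAPPING, DIVERGENT_TOTAL)

print(f"genes in operons: about {genes_in_operons:.0f}")
print(f"convergent operons terminating inefficiently: {convergent_pct:.1f}%")
print(f"divergent operons with overlapping 5' ends: {divergent_pct:.1f}%")
```

Read this way, roughly three quarters of the convergent operons and about one third of the divergent operons produce overlapping transcripts.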
Single cell genomics of uncultured marine alveolates shows paraphyly of basal dinoflagellates.

PubMed

Strassert, Jürgen F H; Karnkowska, Anna; Hehenberger, Elisabeth; Del Campo, Javier; Kolisko, Martin; Okamoto, Noriko; Burki, Fabien; Janouškovec, Jan; Poirier, Camille; Leonard, Guy; Hallam, Steven J; Richards, Thomas A; Worden, Alexandra Z; Santoro, Alyson E; Keeling, Patrick J

2018-01-01

Marine alveolates (MALVs) are diverse and widespread early-branching dinoflagellates, but most knowledge of the group comes from a few cultured species that are generally not abundant in natural samples, or from diversity analyses of PCR-based environmental SSU rRNA gene sequences. To more broadly examine MALV genomes, we generated single cell genome sequences from seven individually isolated cells. Genes expected of heterotrophic eukaryotes were found, with interesting exceptions such as the presence of proteorhodopsin and vacuolar H+-pyrophosphatase. Phylogenetic analysis of concatenated SSU and LSU rRNA gene sequences provided strong support for the paraphyly of MALV lineages. Dinoflagellate viral nucleoproteins were found only in MALV groups that branched as sister to dinokaryotes. Our findings indicate that multiple independent origins of several characteristics early in dinoflagellate evolution, such as a parasitic lifestyle, underlie the environmental diversity of MALVs, and suggest they have more varied trophic modes than previously thought.
Genome-wide mapping in a house mouse hybrid zone reveals hybrid sterility loci and Dobzhansky-Muller interactions

PubMed Central

Turner, Leslie M; Harr, Bettina

2014-01-01

Mapping hybrid defects in contact zones between incipient species can identify genomic regions contributing to reproductive isolation and reveal genetic mechanisms of speciation. The house mouse features a rare combination of sophisticated genetic tools and natural hybrid zones between subspecies. Male hybrids often show reduced fertility, a common reproductive barrier between incipient species. Laboratory crosses have identified sterility loci, but each encompasses hundreds of genes. We map genetic determinants of testis weight and testis gene expression using offspring of mice captured in a hybrid zone between M. musculus musculus and M. m. domesticus. Many generations of admixture enable high-resolution mapping of loci contributing to these sterility-related phenotypes. We identify complex interactions among sterility loci, suggesting multiple, non-independent genetic incompatibilities contribute to barriers to gene flow in the hybrid zone. DOI: http://dx.doi.org/10.7554/eLife.02504.001 PMID:25487987
Mutations of RNA splicing factors in hematological malignancies.

PubMed

Shukla, Girish C; Singh, Jagjit

2017-11-28

Systematic large-scale cancer genomic studies have produced numerous significant findings. These studies have not only revealed new cancer-promoting genes, but they also have identified cancer-promoting functions of previously known "housekeeping" genes. These studies have identified numerous mutations in genes which play a fundamental role in nuclear precursor mRNA splicing. Somatic mutations and copy number variation in many of the splicing factors which participate in the formation of multiple spliceosomal complexes appear to play a role in many cancers and in particular in myelodysplastic syndromes (MDS). Mutated proteins seem to interfere with the recognition of the authentic splice sites (SS), leading to utilization of suboptimal alternative splice sites and generating aberrantly spliced mRNA isoforms. This short review focuses on the function of the splice factors involved in the formation of splicing complexes and on potential mechanisms which affect recognition and usage of the authentic splice site. Copyright © 2017 Elsevier B.V. All rights reserved.
Hox Genes: Choreographers in Neural Development, Architects of Circuit Organization

PubMed Central

Philippidou, Polyxeni; Dasen, Jeremy S.

2013-01-01

Summary: The neural circuits governing vital behaviors, such as respiration and locomotion, are comprised of discrete neuronal populations residing within the brainstem and spinal cord. Work over the past decade has provided a fairly comprehensive understanding of the developmental pathways that determine the identity of major neuronal classes within the neural tube. However, the steps through which neurons acquire the subtype diversities necessary for their incorporation into a particular circuit are still poorly defined. Studies on the specification of motor neurons indicate that the large family of Hox transcription factors has a key role in generating the subtypes required for selective muscle innervation. There is also emerging evidence that Hox genes function in multiple neuronal classes to shape synaptic specificity during development, suggesting a broader role in circuit assembly. This review highlights the functions and mechanisms of Hox gene networks, and their multifaceted roles during neuronal specification and connectivity. PMID:24094100
“Guest list” or “Black list”? Heritable Small RNAs as Immunogenic Memories

PubMed Central

Rechavi, Oded

2016-01-01

Small RNA-mediated gene silencing plays a pivotal role in genome immunity by recognizing and eliminating viruses and transposons which otherwise may colonize the genome. However, this can be challenging since individual genomic parasites are highly diverse, and employ multiple immune evasion techniques. In this review, I discuss a new theory proposing that the integrity of the germline is maintained by transgenerationally-transmitted RNA “memories” that record ancestral gene expression patterns, and delineate “Self” from “Foreign” sequences. To maintain such recollection two tactics are employed in parallel: “black listing” of invading nucleic acids, and “guest listing” of endogenous genes. Studies in a number of organisms have shown that this memorization is used by the next generation small RNAs to act as “Inherited Vaccines” that ambush invading elements, or as “Inherited Licenses” that grant the transcription of autogenous sequences. PMID:24231398
A role for SR proteins in plant stress responses.

PubMed

Duque, Paula

2011-01-01

Members of the SR (serine/arginine-rich) protein gene family are key players in the regulation of alternative splicing, an important means of generating proteome diversity and regulating gene expression. In plants, marked changes in alternative splicing are induced by a wide variety of abiotic stresses, suggesting a role for this highly versatile gene regulation mechanism in the response to environmental cues. In support of this notion, the expression of plant SR proteins is stress-regulated at multiple levels, with environmental signals controlling their own alternative splicing patterns, phosphorylation status and subcellular distribution. Most importantly, functional links between these RNA-binding proteins and plant stress tolerance are beginning to emerge, including a role in the regulation of abscisic acid (ABA) signaling. Future identification of the physiological mRNA targets of plant SR proteins holds much promise for the elucidation of the molecular mechanisms underlying their role in the response to abiotic stress.
PMID:21258207
Single molecule targeted sequencing for cancer gene mutation detection.

PubMed

Gao, Yan; Deng, Liwei; Yan, Qin; Gao, Yongqian; Wu, Zengding; Cai, Jinsen; Ji, Daorui; Li, Gailing; Wu, Ping; Jin, Huan; Zhao, Luyang; Liu, Song; Ge, Liangjin; Deem, Michael W; He, Jiankui

2016-05-19

With the rapid decline in the cost of sequencing, it is now affordable to examine multiple genes in a single disease-targeted clinical test using next generation sequencing. Current targeted sequencing methods require a separate step of targeted capture enrichment during sample preparation before sequencing. Although fast sample preparation methods are available on the market, the library preparation process is still relatively complicated for physicians to use routinely. Here, we introduced an amplification-free Single Molecule Targeted Sequencing (SMTS) technology, which combined targeted capture and sequencing in one step. We demonstrated that this technology can detect low-frequency mutations using an artificially synthesized DNA sample. SMTS has several potential advantages, including simple sample preparation and the absence of biases and errors introduced by PCR. SMTS has the potential to be an easy and quick sequencing technology for clinical diagnosis such as cancer gene mutation detection, infectious disease detection, inherited condition screening and noninvasive prenatal diagnosis.
Acute multi-sgRNA knockdown of KEOPS complex genes reproduces the microcephaly phenotype of the stable knockout zebrafish model

PubMed Central

Schneider, Ronen; Hoogstraten, Charlotte A.; Schapiro, David; Majmundar, Amar J.; Kolb, Amy; Eddy, Kaitlyn; Shril, Shirlee; Braun, Daniela A.; Poduri, Annapurna

2018-01-01

Until recently, morpholino oligonucleotides have been widely employed in zebrafish as an acute and efficient loss-of-function assay. However, off-target effects and reproducibility issues when compared to stable knockout lines have compromised their further use. Here we employed an acute CRISPR/Cas approach using multiple single guide RNAs that simultaneously target different positions in two exemplar genes (osgep or tprkb) to increase the likelihood of generating mutations on both alleles in the injected F0 generation and to achieve an effect similar to that of morpholinos but with the reproducibility of stable lines. This multi single guide RNA approach resulted in median likelihoods for at least one mutation on each allele of >99% and sgRNA-specific insertion/deletion profiles as revealed by deep sequencing. Immunoblot showed a significant reduction for Osgep and Tprkb proteins. For both genes, the acute multi-sgRNA knockout recapitulated the microcephaly phenotype and reduction in survival that we observed previously in stable knockout lines, though milder in the acute multi-sgRNA knockout. Finally, we quantify the degree of mutagenesis by deep sequencing, and provide a mathematical model to quantitate the chance for a biallelic loss-of-function mutation. Our findings can be generalized to acute and stable CRISPR/Cas targeting for any zebrafish gene of interest. PMID:29346415
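The abstract above mentions a mathematical model for the chance of a biallelic mutation but does not give it. A minimal sketch of how such a probability could be estimated is shown below. It rests on assumptions that are not stated in the source: each sgRNA is taken to mutate a given allele independently with its own efficiency, and the two alleles are treated as independent. The function names and the example efficiencies are hypothetical, and this is not presented as the authors' model.

```python
from math import prod


def p_allele_mutated(efficiencies):
    """Probability that one allele carries at least one mutation.

    Assumes (hypothetically) that each sgRNA acts independently with the
    given per-allele mutation efficiency.
    """
    return 1.0 - prod(1.0 - p for p in efficiencies)


def p_biallelic(efficiencies):
    """Probability that both alleles carry at least one mutation,
    assuming the two alleles are hit independently."""
    return p_allele_mutated(efficiencies) ** 2


# Hypothetical per-sgRNA efficiencies, chosen only for illustration.
single_guide = [0.7]
four_guides = [0.7, 0.7, 0.7, 0.7]

print(f"one sgRNA:   {p_biallelic(single_guide):.4f}")
print(f"four sgRNAs: {p_biallelic(four_guides):.4f}")
```

Under these assumptions, adding guides raises the biallelic probability quickly, which is consistent with the rationale for multi-sgRNA injection described in the abstract. A mutation is not necessarily a loss-of-function mutation, so a fuller model would also weight each event by its chance of disrupting the protein.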
High-resolution phylogenetic microbial community profiling

DOE PAGES

Singer, Esther; Bushnell, Brian; Coleman-Derr, Devin; ...

2016-02-09

Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structures at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake's water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential.
Germ-line variants identified by next generation sequencing in a panel of estrogen and cancer associated genes correlate with poor clinical outcome in Lynch syndrome patients.

PubMed

Jóri, Balazs; Kamps, Rick; Xanthoulea, Sofia; Delvoux, Bert; Blok, Marinus J; Van de Vijver, Koen K; de Koning, Bart; Oei, Felicia Trups; Tops, Carli M; Speel, Ernst Jm; Kruitwagen, Roy F; Gomez-Garcia, Encarna B; Romano, Andrea

2015-12-01

The risk of developing colorectal and endometrial cancers among subjects testing positive for a pathogenic Lynch syndrome mutation varies, making risk prediction difficult. Genetic risk modifiers alter the risk conferred by inherited Lynch syndrome mutations, and their identification can improve genetic counseling. We aimed to identify rare genetic modifiers of the risk of Lynch syndrome endometrial cancer. A family-based approach was used to assess the presence of genetic risk modifiers among 35 Lynch syndrome mutation carriers having either a poor clinical phenotype (early age of endometrial cancer diagnosis or multiple cancers) or a neutral clinical phenotype. Putative genetic risk modifiers were identified by Next Generation Sequencing among a panel of 154 genes involved in endometrial physiology and carcinogenesis. A simple pipeline, based on an allele frequency lower than 0.001 and on predicted non-conservative amino-acid substitutions, returned 54 variants that were considered putative risk modifiers. The presence of two or more risk-modifying variants in women carrying a pathogenic Lynch syndrome mutation was associated with a poor clinical phenotype. A gene panel is proposed that comprises genes that can carry variants with putative modifying effects on the risk of Lynch syndrome endometrial cancer. Validation in further studies is warranted before considering the possible use of this tool in genetic counseling.
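The two-criterion filter described in the abstract above (allele frequency below 0.001 plus a predicted non-conservative amino-acid substitution) can be sketched as follows. Only the 0.001 threshold and the "two or more variants" rule come from the abstract. Everything else is an assumption made for illustration: the `Variant` record, the carrier and gene labels, and in particular the physicochemical grouping used to decide what counts as non-conservative, since the abstract does not say which predictor was used.

```python
from collections import Counter
from dataclasses import dataclass

# Hypothetical amino-acid grouping; a substitution that changes group is
# treated here as non-conservative. The source does not specify its predictor.
AA_CLASS = {
    **dict.fromkeys("GAVLIMPFW", "nonpolar"),
    **dict.fromkeys("STCYNQ", "polar"),
    **dict.fromkeys("DE", "acidic"),
    **dict.fromkeys("KRH", "basic"),
}
MAX_ALLELE_FREQUENCY = 0.001  # threshold quoted in the abstract


@dataclass(frozen=True)
class Variant:
    carrier: str             # hypothetical carrier label
    gene: str                # hypothetical gene label
    allele_frequency: float
    ref_aa: str              # one-letter reference amino acid
    alt_aa: str              # one-letter substituted amino acid


def is_putative_modifier(v: Variant) -> bool:
    """Apply both filters: rare and non-conservative."""
    rare = v.allele_frequency < MAX_ALLELE_FREQUENCY
    non_conservative = AA_CLASS[v.ref_aa] != AA_CLASS[v.alt_aa]
    return rare and non_conservative


def carriers_with_multiple_modifiers(variants, minimum=2):
    """Carriers holding at least `minimum` variants that pass the filter."""
    counts = Counter(v.carrier for v in variants if is_putative_modifier(v))
    return sorted(c for c, n in counts.items() if n >= minimum)


# Made-up records, for illustration only.
variants = [
    Variant("carrier_1", "GENE_A", 0.0004, "R", "W"),  # rare, basic to nonpolar
    Variant("carrier_1", "GENE_B", 0.0002, "D", "V"),  # rare, acidic to nonpolar
    Variant("carrier_2", "GENE_A", 0.0500, "R", "W"),  # too common
    Variant("carrier_2", "GENE_C", 0.0001, "L", "I"),  # rare but conservative
]
print(carriers_with_multiple_modifiers(variants))
```

In this toy data set only `carrier_1` holds two variants that pass both filters, which mirrors the "two or more risk-modifying variants" criterion the abstract associates with a poor clinical phenotype.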
A double-strand break can trigger immunoglobulin gene conversion

PubMed Central

Bastianello, Giulia; Arakawa, Hiroshi

2017-01-01

All three B cell-specific activities of the immunoglobulin (Ig) gene re-modeling system—gene conversion, somatic hypermutation and class switch recombination—require activation-induced deaminase (AID). AID-induced DNA lesions must be further processed and dissected into different DNA recombination pathways. In order to characterize potential intermediates for Ig gene conversion, we inserted an I-SceI recognition site into the complementarity determining region 1 (CDR1) of the Ig light chain locus of the AID knockout DT40 cell line, and conditionally expressed I-SceI endonuclease. Here, we show that a double-strand break (DSB) in CDR1 is sufficient to trigger Ig gene conversion in the absence of AID. The pattern and pseudogene usage of DSB-induced gene conversion were comparable to those of AID-induced gene conversion; surprisingly, sometimes a single DSB induced multiple gene conversion events. These constitute direct evidence that a DSB in the V region can be an intermediate for gene conversion. The fate of the DNA lesion downstream of a DSB had more flexibility than that of AID, suggesting two alternative models: (i) DSBs during the physiological gene conversion are in the minority compared to single-strand breaks (SSBs), which are frequently generated following DNA deamination, or (ii) the physiological gene conversion is mediated by a tightly regulated DSB that is locally protected from non-homologous end joining (NHEJ) or other non-homologous DNA recombination machineries. PMID:27701075
Colloquium paper: uniquely human evolution of sialic acid genetics and biology.

PubMed

Varki, Ajit

2010-05-11

Darwinian evolution of humans from our common ancestors with nonhuman primates involved many gene-environment interactions at the population level, and the resulting human-specific genetic changes must contribute to the "Human Condition." Recent data indicate that the biology of sialic acids (which directly involves less than 60 genes) shows more than 10 uniquely human genetic changes in comparison with our closest evolutionary relatives. Known outcomes are tissue-specific changes in abundant cell-surface glycans, changes in specificity and/or expression of multiple proteins that recognize these glycans, and novel pathogen regimes. Specific events include Alu-mediated inactivation of the CMAH gene, resulting in loss of synthesis of the Sia N-glycolylneuraminic acid (Neu5Gc) and increase in expression of the precursor N-acetylneuraminic acid (Neu5Ac); increased expression of alpha2-6-linked Sias (likely because of changed expression of ST6GALI); and multiple changes in SIGLEC genes encoding Sia-recognizing Ig-like lectins (Siglecs). The last includes binding specificity changes (in Siglecs -5, -7, -9, -11, and -12); expression pattern changes (in Siglecs -1, -5, -6, and -11); gene conversion (SIGLEC11); and deletion or pseudogenization (SIGLEC13, SIGLEC14, and SIGLEC16). A nongenetic outcome of the CMAH mutation is human metabolic incorporation of foreign dietary Neu5Gc, in the face of circulating anti-Neu5Gc antibodies, generating a novel "xeno-auto-antigen" situation. Taken together, these data suggest that both the genes associated with Sia biology and the related impacts of the environment comprise a relative "hot spot" of genetic and physiological changes in human evolution, with implications for uniquely human features both in health and disease.
Parallel and convergent evolution of the dim-light vision gene RH1 in bats (Order: Chiroptera).

PubMed

Shen, Yong-Yi; Liu, Jie; Irwin, David M; Zhang, Ya-Ping

2010-01-21

Rhodopsin, encoded by the gene Rhodopsin (RH1), is extremely sensitive to light, and is responsible for dim-light vision. Bats are nocturnal mammals that inhabit poor light environments. Megabats (Old-World fruit bats) generally have well-developed eyes, while microbats (insectivorous bats) have developed echolocation and in general have degraded eyes; however, dramatic differences in the eyes, and in reliance on vision, exist in this group. In this study, we examined the rod opsin gene (RH1), and compared its evolution to that of two cone opsin genes (SWS1 and M/LWS). While phylogenetic reconstruction with the cone opsin genes SWS1 and M/LWS generated a species tree in accord with expectations, the RH1 gene tree united Pteropodidae (Old-World fruit bats) and Yangochiroptera, with very high bootstrap values, suggesting the possibility of convergent evolution. The hypothesis of convergent evolution was further supported when nonsynonymous sites or amino acid sequences were used to construct phylogenies. Reconstructed RH1 sequences at internal nodes of the bat species phylogeny showed that: (1) Old-World fruit bats share an amino acid change (S270G) with the tomb bat; (2) Miniopterus share two amino acid changes (V104I, M183L) with Rhinolophoidea; (3) the amino acid replacement I123V occurred independently on four branches, and the replacements L99M, L266V and I286V occurred each on two branches. The multiple parallel amino acid replacements that occurred in the evolution of bat RH1 suggest the possibility of multiple convergences of their ecological specialization (i.e., various photic environments) during adaptation for the nocturnal lifestyle, and suggest that further attention is needed on the study of the ecology and behavior of bats.
PMID:20098620
Causal relationship between the AHSG gene and BMD through fetuin-A and BMI: multiple mediation analysis.

PubMed

Sritara, C; Thakkinstian, A; Ongphiphadhanakul, B; Chailurkit, L; Chanprasertyothin, S; Ratanachaiwong, W; Vathesatogkit, P; Sritara, P

2014-05-01

Using mediation analysis, a causal relationship between the AHSG gene and bone mineral density (BMD) through fetuin-A and body mass index (BMI) mediators was suggested. Fetuin-A, a multifunctional protein of hepatic origin, is associated with bone mineral density. It is unclear if this association is causal. This study aimed at clarification of this issue. A cross-sectional study was conducted among 1,741 healthy workers from the Electricity Generating Authority of Thailand (EGAT) cohort. The alpha-2-Heremans-Schmid glycoprotein (AHSG) rs2248690 gene was genotyped. Three mediation models were constructed using seemingly unrelated regression analysis. First, the ln[fetuin-A] group was regressed on the AHSG gene. Second, the BMI group was regressed on the AHSG gene and the ln[fetuin-A] group. Finally, the BMD model was constructed by fitting BMD on two mediators (ln[fetuin-A] and BMI) and the independent AHSG variable. All three analyses were adjusted for confounders. The prevalence of the minor T allele for the AHSG locus was 15.2%. The AHSG locus was highly related to serum fetuin-A levels (P < 0.001). Multiple mediation analyses showed that AHSG was significantly associated with BMD through the ln[fetuin-A] and BMI pathway, with beta coefficients of 0.0060 (95% CI 0.0038, 0.0083) and 0.0030 (95% CI 0.0020, 0.0045) at the total hip and lumbar spine, respectively. About 27.3 and 26.0% of total genetic effects on hip and spine BMD, respectively, were explained by the mediation effects of fetuin-A and BMI. Our study suggested evidence of a causal relationship between the AHSG gene and BMD through fetuin-A and BMI mediators.
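The abstract above reports both the mediated (indirect) effects and the share of the total genetic effect they explain, so the implied total effects can be back-calculated. The sketch below does only that arithmetic. The function name is an assumption introduced here, the inputs are the rounded values quoted in the abstract, and the outputs are therefore approximate and are not figures reported by the authors.

```python
def implied_total_effect(indirect_effect: float, proportion_mediated: float) -> float:
    """Back-calculate a total effect from an indirect effect and the
    proportion mediated, using proportion = indirect / total."""
    if not 0.0 < proportion_mediated <= 1.0:
        raise ValueError("proportion_mediated must be in (0, 1]")
    return indirect_effect / proportion_mediated


# Values quoted in the abstract: (indirect beta, proportion mediated).
reported = {
    "total hip": (0.0060, 0.273),
    "lumbar spine": (0.0030, 0.260),
}

implied = {
    site: implied_total_effect(beta, share)
    for site, (beta, share) in reported.items()
}

for site, total in implied.items():
    print(f"{site}: implied total effect of about {total:.4f}")
```

The same relation can be read the other way: dividing each indirect coefficient by the implied total recovers the 27.3% and 26.0% mediated shares given in the abstract.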
Flightless I (Drosophila) homolog facilitates chromatin accessibility of the estrogen receptor α target genes in MCF-7 breast cancer cells

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jeong, Kwang Won, E-mail: kwjeong@gachon.ac.kr

2014-04-04

Highlights: • H3K4me3 and Pol II binding at TFF1 promoter were reduced in FLII-depleted MCF-7 cells. • FLII is required for chromatin accessibility of the enhancer of ERalpha target genes. • Depletion of FLII causes inhibition of proliferation of MCF-7 cells. - Abstract: The coordinated activities of multiple protein complexes are essential to the remodeling of chromatin structure and for the recruitment of RNA polymerase II (Pol II) to the promoter in order to facilitate the initiation of transcription in nuclear receptor-mediated gene expression. Flightless I (Drosophila) homolog (FLII), a nuclear receptor coactivator, is associated with the SWI/SNF-chromatin remodeling complex during estrogen receptor (ER)α-mediated transcription. However, the function of FLII in estrogen-induced chromatin opening has not been fully explored. Here, we show that FLII plays a critical role in establishing active histone modification marks and generating the open chromatin structure of ERα target genes. We observed that the enhancer regions of ERα target genes are heavily occupied by FLII, and histone H3K4me3 and Pol II binding induced by estrogen are decreased in FLII-depleted MCF-7 cells. Furthermore, formaldehyde-assisted isolation of regulatory elements (FAIRE)-quantitative polymerase chain reaction (qPCR) experiments showed that depletion of FLII resulted in reduced chromatin accessibility of multiple ERα target genes. These data suggest FLII as a key regulator of ERα-mediated transcription through its role in regulating chromatin accessibility for the binding of RNA Polymerase II and possibly other transcriptional coactivators.
Deconstructing mammalian reproduction: using knockouts to define fertility pathways.

PubMed

Roy, Angshumoy; Matzuk, Martin M

2006-02-01

Reproduction is the sine qua non for the propagation of species and continuation of life. It is a complex biological process that is regulated by multiple factors during the reproductive life of an organism. Over the past decade, the molecular mechanisms regulating reproduction in mammals have been rapidly unraveled by the study of a vast number of mouse gene knockouts with impaired fertility. The use of reverse genetics to generate null mutants in mice through targeted disruption of specific genes has enabled researchers to identify essential regulators of spermatogenesis and oogenesis in vivo and model human disorders affecting reproduction. This review focuses on the merits, utility, and the variations of the knockout technology in studies of reproduction in mammals.
Genetic variation and gene expression across multiple tissues and developmental stages in a non-human primate

PubMed Central

Jasinska, Anna J.; Zelaya, Ivette; Service, Susan K.; Peterson, Christine B.; Cantor, Rita M.; Choi, Oi-Wa; DeYoung, Joseph; Eskin, Eleazar; Fairbanks, Lynn A.; Fears, Scott; Furterer, Allison E.; Huang, Yu S.; Ramensky, Vasily; Schmitt, Christopher A.; Svardal, Hannes; Jorgensen, Matthew J.; Kaplan, Jay R.; Villar, Diego; Aken, Bronwen L.; Flicek, Paul; Nag, Rishi; Wong, Emily S.; Blangero, John; Dyer, Thomas D.; Bogomolov, Marina; Benjamini, Yoav; Weinstock, George M.; Dewar, Ken; Sabatti, Chiara; Wilson, Richard K.; Jentsch, J. David; Warren, Wesley; Coppola, Giovanni; Woods, Roger P.; Freimer, Nelson B.

2017-01-01

By analyzing multi-tissue gene expression and genome-wide genetic variation data in samples from a vervet monkey pedigree, we generated a transcriptome resource and produced the first catalogue of expression quantitative trait loci (eQTLs) in a non-human primate model. This catalogue contains more genome-wide significant eQTLs, per sample, than comparable human resources, and reveals sex and age-related expression patterns. Findings include a master regulatory locus that likely plays a role in immune function, and a locus regulating hippocampal long non-coding RNAs (lncRNAs), whose expression correlates with hippocampal volume. This resource will facilitate genetic investigation of quantitative traits, including brain and behavioral phenotypes relevant to neuropsychiatric disorders. PMID:29083405
Analyses of transcriptome sequences reveal multiple ancient large-scale duplication events in the ancestor of Sphagnopsida (Bryophyta).

PubMed

Devos, Nicolas; Szövényi, Péter; Weston, David J; Rothfels, Carl J; Johnson, Matthew G; Shaw, A Jonathan

2016-07-01

The goal of this research was to investigate whether there has been a whole-genome duplication (WGD) in the ancestry of Sphagnum (peatmoss) or the class Sphagnopsida, and to determine if the timing of any such duplication(s) and patterns of paralog retention could help explain the rapid radiation and current ecological dominance of peatmosses. RNA sequencing (RNA-seq) data were generated for nine taxa in Sphagnopsida (Bryophyta). Analyses of frequency plots for synonymous substitutions per synonymous site (Ks) between paralogous gene pairs and reconciliation of 578 gene trees were conducted to assess evidence of large-scale or genome-wide duplication events in each transcriptome. Both Ks frequency plots and gene tree-based analyses indicate multiple duplication events in the history of the Sphagnopsida. The most recent WGD event predates divergence of Sphagnum from the two other genera of Sphagnopsida. Duplicate retention is highly variable across species, which might be best explained by local adaptation. Our analyses indicate that the last WGD could have been an important factor underlying the diversification of peatmosses and facilitated their rise to ecological dominance in peatlands. The timing of the duplication events and their significance in the evolutionary history of peat mosses are discussed. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
Expression of hybrid fusion protein (Cry1Ac::ASAL) in transgenic rice plants imparts resistance against multiple insect pests.

PubMed

Boddupally, Dayakar; Tamirisa, Srinath; Gundra, Sivakrishna Rao; Vudem, Dashavantha Reddy; Khareedu, Venkateswara Rao

2018-05-31

To evolve rice varieties resistant to different groups of insect pests a fusion gene, comprising DI and DII domains of Bt Cry1Ac and carbohydrate binding domain of garlic lectin (ASAL), was constructed. Transgenic rice lines were generated and evaluated to assess the efficacy of Cry1Ac::ASAL fusion protein against three major pests, viz., yellow stem borer (YSB), leaf folder (LF) and brown planthopper (BPH). Molecular analyses of transgenic plants revealed stable integration and expression of the fusion gene. In planta insect bioassays on transgenics disclosed enhanced levels of resistance compared to the control plants. High insect mortality of YSB, LF and BPH was observed on transgenics compared to that of control plants. Furthermore, honeydew assays revealed significant decreases in the feeding ability of BPH on transgenic plants as compared to the controls. Ligand blot analysis, using BPH insects fed on cry1Ac::asal transgenic rice plants, revealed a modified receptor protein-binding pattern owing to its ability to bind to additional receptors in insects. The overall results authenticate that Cry1Ac::ASAL protein is endowed with remarkable entomotoxic effects against major lepidopteran and hemipteran insects. As such, the fusion gene appears promising and can be introduced into various other crops to control multiple insect pests.
Cell of origin associated classification of B-cell malignancies by gene signatures of the normal B-cell hierarchy.

PubMed

Johnsen, Hans Erik; Bergkvist, Kim Steve; Schmitz, Alexander; Kjeldsen, Malene Krag; Hansen, Steen Møller; Gaihede, Michael; Nørgaard, Martin Agge; Bæch, John; Grønholdt, Marie-Louise; Jensen, Frank Svendsen; Johansen, Preben; Bødker, Julie Støve; Bøgsted, Martin; Dybkær, Karen

2014-06-01

Recent findings have suggested biological classification of B-cell malignancies as exemplified by the "activated B-cell-like" (ABC), the "germinal-center B-cell-like" (GCB) and primary mediastinal B-cell lymphoma (PMBL) subtypes of diffuse large B-cell lymphoma and "recurrent translocation and cyclin D" (TC) classification of multiple myeloma. Biological classification of B-cell derived cancers may be refined by a direct and systematic strategy where identification and characterization of normal B-cell differentiation subsets are used to define the cancer cell of origin phenotype. Here we propose a strategy combining multiparametric flow cytometry, global gene expression profiling and biostatistical modeling to generate B-cell subset specific gene signatures from sorted normal human immature, naive, germinal centrocytes and centroblasts, post-germinal memory B-cells, plasmablasts and plasma cells from available lymphoid tissues including lymph nodes, tonsils, thymus, peripheral blood and bone marrow. This strategy will provide an accurate image of the stage of differentiation, which prospectively can be used to classify any B-cell malignancy and eventually purify tumor cells. This report briefly describes the current models of the normal B-cell subset differentiation in multiple tissues and the pathogenesis of malignancies originating from the normal germinal B-cell hierarchy.
Vaccines for leishmaniasis in the fore coming 25 years.

PubMed

Palatnik-de-Sousa, Clarisa B

2008-03-25

Human vaccination against leishmaniasis using live Leishmania was practiced in the Middle East and Russia (1941-1980). First-generation vaccines, composed of killed parasites, induce low efficacies (54%) and have been tested in Phase III trials in humans and dogs in Asia and South America since 1940. Second-generation vaccines using live genetically modified parasites, bacteria or viruses containing Leishmania genes, or recombinant or native fractions have been known since the 1990s. Due to the loss of PAMPs, adjuvants were used with the purified antigens, increasing their vaccine efficacies to 82% in Phase III dog trials. Recombinant second-generation vaccines and third-generation DNA vaccines showed average values of parasite load reduction of 68% and 59% in laboratory animal models, respectively, but their success in field trials had not yet been reported. This review is focused on vaccine candidates that show any efficacy against leishmaniasis and that are already in trials at different phases. Studies in experimental models have nevertheless generated considerable interest in recent years. The promising candidates may find a place in the forthcoming years. Among them, most probably, are the multiple-gene DNA vaccines, which are stable and do not require cold-chain transportation. In the meantime, second-generation vaccines with native antigens and effective adjuvants are likely to be licensed and used in public health control programs in the coming 25 years. To date, only three vaccines have been licensed for use: one live vaccine for humans in Uzbekistan, one killed vaccine for human immunotherapy in Brazil and a second-generation vaccine for dog prophylaxis in Brazil.
JRmGRN: Joint reconstruction of multiple gene regulatory networks with common hub genes using data from multiple tissues or conditions.

PubMed

Deng, Wenping; Zhang, Kui; Liu, Sanzhen; Zhao, Patrick; Xu, Shizhong; Wei, Hairong

2018-04-30

Joint reconstruction of multiple gene regulatory networks (GRNs) using gene expression data from multiple tissues/conditions is very important for understanding common and tissue/condition-specific regulation. However, there are currently no computational models and methods available for directly constructing such multiple GRNs that not only share some common hub genes but also possess tissue/condition-specific regulatory edges. In this paper, we propose a new Gaussian graphical model for joint reconstruction of multiple gene regulatory networks (JRmGRN), which highlights hub genes, using gene expression data from several tissues/conditions. Under the Gaussian graphical model framework, the JRmGRN method constructs the GRNs by maximizing a penalized log-likelihood function. We formulated it as a convex optimization problem, and then solved it with an alternating direction method of multipliers (ADMM) algorithm. The performance of JRmGRN was first evaluated with synthetic data and the results showed that JRmGRN outperformed several other methods for reconstruction of GRNs. We also applied our method to real Arabidopsis thaliana RNA-seq data from two light regime conditions in comparison with other methods, and both common hub genes and some condition-specific hub genes were identified with higher accuracy and precision. JRmGRN is available as an R program from: https://github.com/wenpingd. Contact: hairong@mtu.edu. Proof of theorem, derivation of algorithm and supplementary data are available at Bioinformatics online.
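To make the phrase "maximizing a penalized log likelihood function" concrete, the generic form of a joint penalized Gaussian log-likelihood over K tissues/conditions can be written as below. This is a sketch of the general Gaussian graphical model setting only, not JRmGRN's exact objective: the abstract does not state the penalty, so P is left unspecified, and the symbols (Θ^(k) for the precision matrix, S^(k) for the sample covariance, n_k for the sample size of condition k) are notation assumed here for illustration.

```latex
\[
\max_{\Theta^{(1)},\dots,\Theta^{(K)} \succ 0}\;
\sum_{k=1}^{K} n_k \left[ \log\det \Theta^{(k)}
  - \operatorname{tr}\!\left( S^{(k)} \Theta^{(k)} \right) \right]
\;-\; P\!\left( \Theta^{(1)},\dots,\Theta^{(K)} \right)
\]
```

In this setting, a nonzero off-diagonal entry of Θ^(k) corresponds to an edge in the network for condition k. The choice of P determines which structure (for example sparsity, or hub genes shared across conditions) is encouraged.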
Derived variants at six genes explain nearly half of size reduction in dog breeds.

PubMed

Rimbault, Maud; Beale, Holly C; Schoenebeck, Jeffrey J; Hoopes, Barbara C; Allen, Jeremy J; Kilroy-Glynn, Paul; Wayne, Robert K; Sutter, Nathan B; Ostrander, Elaine A

2013-12-01

Selective breeding of dogs by humans has generated extraordinary diversity in body size. A number of multibreed analyses have been undertaken to identify the genetic basis of this diversity. We analyzed four loci discovered in a previous genome-wide association study that used 60,968 SNPs to identify size-associated genomic intervals, which were too large to assign causative roles to genes. First, we performed fine-mapping to define critical intervals that included the candidate genes GHR, HMGA2, SMAD2, and STC2, identifying five highly associated markers at the four loci. We hypothesize that three of the variants are likely to be causative. We then genotyped each marker, together with previously reported size-associated variants in the IGF1 and IGF1R genes, on a panel of 500 domestic dogs from 93 breeds, and identified the ancestral allele by genotyping the same markers on 30 wild canids. We observed that the derived alleles at all markers correlated with reduced body size, and smaller dogs are more likely to carry derived alleles at multiple markers. However, breeds are not generally fixed at all markers; multiple combinations of genotypes are found within most breeds. Finally, we show that 46%-52.5% of the variance in body size of dog breeds can be explained by seven markers in proximity to exceptional candidate genes. Among breeds with standard weights <41 kg (90 lb), the genotypes accounted for 64.3% of variance in weight. This work advances our understanding of mammalian growth by describing genetic contributions to canine size determination in non-giant dog breeds.
MicroRNA-20a is essential for normal embryogenesis by targeting vsx1 mRNA in fish

PubMed Central

Sun, Lei; Li, Heng; Xu, Xiaofeng; Xiao, Guanxiu; Luo, Chen

2015-01-01

MicroRNAs are major post-transcriptional regulators of gene expression and have essential roles in diverse developmental processes. In vertebrates, some regulatory genes play different roles at different developmental stages. These genes are initially transcribed in a wide embryonic region but become restricted to distinct cell types at subsequent stages of development. Therefore, post-transcriptional regulation is required for the transition from one developmental stage to the next and the establishment of different cell identities. However, the post-transcriptional regulation of many multifunctional genes during development remains unknown. Here we show that miR-20a can target the mRNA of vsx1, a multifunctional gene, at the 3′-UTR and inhibit protein expression in both goldfish and zebrafish. The expression of miR-20a is initiated ubiquitously at late gastrula stage and exhibits a tissue-specific pattern in the developing retina. Inhibition of vsx1 3′-UTR mediated protein expression occurs when and where miR-20a is expressed. Decoying miR-20a resulted in severely impaired head, eye and trunk formation in association with excessive generation of vsx1-marked neurons in the spinal cord and defects of somites in the mesoderm region. These results demonstrate that miR-20a is essential for normal embryogenesis by restricting Vsx1 expression in goldfish and zebrafish, and that post-transcriptional regulation is an essential mechanism for Vsx1 playing different roles in diverse developmental processes. PMID:25833418
[Epigenetic inheritance and its possible role in the evolution of plant species].

PubMed

Lavrov, S A; Mavrodiev, E V

2003-01-01

As is now clear, the level of gene expression in eukaryotes is determined mainly by chromatin composition. The chromatin structure of a particular gene (a complex item, which includes nucleosome positioning, histone modifications and non-histone chromatin proteins) can be modified externally and can be inherited mitotically and meiotically. Changes in chromatin structure are the basis of so-called epigenetic inheritance, which occurs without modification of the DNA sequence. One of the most striking examples of epigenetic inheritance in plants is epimutations: alleles of some genes that are stable over many generations and do not differ in primary DNA structure. The molecular basis of epimutations seems to be DNA methylation. Epimutations may be widely distributed in nature and affect some basic morphological features that have systematic significance. The possibility of inheritance of acquired epigenetic modifications leads us to reconsider the idea of multiple independent origins of some plant forms (or ecotypes) under the action of similar external conditions. Different populations of the same species may in this case be unrelated and have no common ancestor. A species should be considered as an invariant of multiple ways of origin. The wide distribution of polyploids amongst higher plants suggests an effective mechanism of repression of multicopy genes. Each allopolyploidisation event is followed by repression of a random set of parent genes via changes in their chromatin structure. As a result, within the same hybrid formula, different stable combinations of epigenetically controlled features of the parent species may arise. These combinations may be classified as different species of other taxa.
Multi-variant study of obesity risk genes in African Americans: The Jackson Heart Study.

PubMed

Liu, Shijian; Wilson, James G; Jiang, Fan; Griswold, Michael; Correa, Adolfo; Mei, Hao

2016-11-30

Genome-wide association studies (GWAS) have been successful in identifying obesity risk genes by single-variant association analysis. For this study, we designed a stepwise analysis strategy and aimed to identify multi-variant effects on obesity risk among candidate genes. Our analyses focused on 2137 African American participants with body mass index measured in the Jackson Heart Study and 657 common single nucleotide polymorphisms (SNPs) genotyped at 8 GWAS-identified obesity risk genes. Single-variant association tests showed that no SNPs reached significance after multiple testing adjustment. The subsequent gene-gene interaction analysis, which focused on SNPs with unadjusted p-value<0.10, identified 6 significant multi-variant associations. Logistic regression showed that SNPs in these associations did not have significant linear interactions; examination of genetic risk scores showed that 4 multi-variant associations had significant additive effects of risk SNPs; and haplotype association tests showed that all multi-variant associations contained one or several combinations of particular alleles or haplotypes associated with increased obesity risk. Our study showed that obesity risk genes generated multi-variant effects, which can be additive or non-linear interactions, and that multi-variant study is an important supplement to existing GWAS for understanding genetic effects of obesity risk genes. Copyright © 2016 Elsevier B.V. All rights reserved.
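The additive effect examined through a genetic risk score can be illustrated with a minimal sketch. It assumes the common convention that each SNP is coded as the number of risk alleles carried (0, 1 or 2) and that the score is their (optionally weighted) sum; the function name, the weights and the example genotypes are hypothetical and are not taken from the study.

```python
from typing import Optional, Sequence


def genetic_risk_score(
    risk_allele_counts: Sequence[int],
    weights: Optional[Sequence[float]] = None,
) -> float:
    """Sum risk-allele counts (0, 1 or 2 per SNP), optionally weighted.

    With no weights this is the unweighted score: the total number of
    risk alleles an individual carries across the chosen SNPs.
    """
    if weights is None:
        weights = [1.0] * len(risk_allele_counts)
    if len(weights) != len(risk_allele_counts):
        raise ValueError("one weight is required per SNP")
    for count in risk_allele_counts:
        if count not in (0, 1, 2):
            raise ValueError("risk-allele counts must be 0, 1 or 2")
    return sum(w * c for w, c in zip(weights, risk_allele_counts))


# Hypothetical individual genotyped at three SNPs.
unweighted = genetic_risk_score([0, 1, 2])
weighted = genetic_risk_score([0, 1, 2], weights=[0.5, 1.0, 0.25])
```

An additive model treats each extra risk allele as contributing independently to the score, which is what distinguishes it from the non-linear interactions the abstract also mentions.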

Use of deep whole-genome sequencing data to identify structure risk variants in breast cancer susceptibility genes.

PubMed

Guo, Xingyi; Shi, Jiajun; Cai, Qiuyin; Shu, Xiao-Ou; He, Jing; Wen, Wanqing; Allen, Jamie; Pharoah, Paul; Dunning, Alison; Hunter, David J; Kraft, Peter; Easton, Douglas F; Zheng, Wei; Long, Jirong

2018-03-01

Functional disruptions of susceptibility genes by large genomic structure variant (SV) deletions in germlines are known to be associated with cancer risk. However, few studies have been conducted to systematically search for SV deletions in breast cancer susceptibility genes. We analysed deep (> 30x) whole-genome sequencing (WGS) data generated in blood samples from 128 breast cancer patients of Asian and European descent with either a strong family history of breast cancer or early cancer onset disease. To identify SV deletions in known or suspected breast cancer susceptibility genes, we used multiple SV calling tools including Genome STRiP, Delly, Manta, BreakDancer and Pindel. SV deletions were detected by at least three of these bioinformatics tools in five genes. Specifically, we identified heterozygous deletions covering a fraction of the coding regions of BRCA1 (with approximately 80kb in two patients), and TP53 genes (with ∼1.6 kb in two patients), and of intronic regions (∼1 kb) of the PALB2 (one patient), PTEN (three patients) and RAD51C genes (one patient). We confirmed the presence of these deletions using real-time quantitative PCR (qPCR). Our study identified novel SV deletions in breast cancer susceptibility genes and the identification of such SV deletions may improve clinical testing.
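The "detected by at least three of these bioinformatics tools" criterion can be sketched as a simple consensus filter. The abstract does not say how calls from different tools were matched, so the 50% reciprocal-overlap rule used here is an assumption, and the tool labels and coordinates are toy values rather than data from the study.

```python
from typing import Dict, List, Tuple

Deletion = Tuple[str, int, int]  # (chromosome, start, end), end exclusive


def reciprocal_overlap(a: Deletion, b: Deletion) -> float:
    """Return the smaller of the two overlap fractions (0.0 if disjoint)."""
    if a[0] != b[0]:
        return 0.0
    shared = min(a[2], b[2]) - max(a[1], b[1])
    if shared <= 0:
        return 0.0
    return min(shared / (a[2] - a[1]), shared / (b[2] - b[1]))


def supporting_tools(
    candidate: Deletion,
    calls_by_tool: Dict[str, List[Deletion]],
    min_overlap: float = 0.5,  # assumed matching rule, not from the abstract
) -> List[str]:
    """List the tools that report a deletion matching the candidate."""
    return sorted(
        tool
        for tool, calls in calls_by_tool.items()
        if any(reciprocal_overlap(candidate, c) >= min_overlap for c in calls)
    )


def is_consensus(candidate, calls_by_tool, min_tools=3, min_overlap=0.5):
    """True when at least `min_tools` callers support the candidate."""
    return len(supporting_tools(candidate, calls_by_tool, min_overlap)) >= min_tools


# Toy calls from five hypothetical callers.
calls = {
    "tool_a": [("chr17", 1000, 2600)],
    "tool_b": [("chr17", 1100, 2700)],
    "tool_c": [("chr17", 900, 2500)],
    "tool_d": [("chr17", 2500, 5000)],  # barely overlaps: not a match
    "tool_e": [("chr13", 1000, 2600)],  # different chromosome
}
candidate = ("chr17", 1000, 2600)
support = supporting_tools(candidate, calls)
```

Requiring agreement between several callers trades sensitivity for specificity, which fits a study that then confirmed each retained deletion by qPCR.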
Mutational Analysis of Extranodal NK/T-Cell Lymphoma Using Targeted Sequencing with a Comprehensive Cancer Panel.

PubMed

Choi, Seungkyu; Go, Jai Hyang; Kim, Eun Kyung; Lee, Hojung; Lee, Won Mi; Cho, Chun-Sung; Han, Kyudong

2016-09-01

Extranodal natural killer (NK)/T-cell lymphoma, nasal type (NKTCL), is a malignant disorder of cytotoxic lymphocytes of NK or T cells. It is an aggressive neoplasm with a very poor prognosis. Although extranodal NKTCL reportedly has a strong association with Epstein-Barr virus, the molecular pathogenesis of NKTCL has been unexplored. The recent technological advancements in next-generation sequencing (NGS) have made DNA sequencing cost- and time-effective, with more reliable results. Using the Ion Proton Comprehensive Cancer Panel, we sequenced 409 cancer-related genes to identify somatic mutations in five NKTCL tissue samples. The sequencing analysis detected 25 mutations in 21 genes. Among them, KMT2D, a histone modification-related gene, was the most frequently mutated gene (four of the five cases). This result was consistent with recent NGS studies that have suggested KMT2D as a novel driver gene in NKTCL. Mutations were also found in ARID1A, a chromatin remodeling gene, and TP53, which also recurred in recent NGS studies. We also found mutations in 18 novel candidate genes, with molecular functions that were potentially implicated in cancer development. We suggest that these genes may result in multiple oncogenic events and may be used as potential bio-markers of NKTCL in the future.
A novel frameshift mutation of CHD7 in a Japanese patient with CHARGE syndrome

PubMed Central

Kohmoto, Tomohiro; Shono, Miki; Naruto, Takuya; Watanabe, Miki; Suga, Ken-ichi; Nakagawa, Ryuji; Kagami, Shoji; Masuda, Kiyoshi; Imoto, Issei

2016-01-01

CHARGE syndrome is a rare autosomal dominant developmental disorder involving multiple organs. CHD7 is a major causative gene of CHARGE syndrome. We performed targeted-exome sequencing using a next-generation sequencer for molecular diagnosis of a 4-month-old male patient who was clinically suspected to have CHARGE syndrome, and report a novel monoallelic mutation in CHD7, NM_017780.3(CHD7_v001):c.2966del causing a reading frameshift [p.(Cys989Serfs*3)]. PMID:27081570
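The variant names c.2966del and p.(Cys989Serfs*3) are linked by simple arithmetic: coding-DNA positions are numbered from 1 at the A of the start codon, so each codon spans three consecutive positions. The helper below is an illustrative sketch (its name is hypothetical) that maps a coding position to its codon number and to the base's position within that codon.

```python
from typing import Tuple


def codon_of_coding_position(position: int) -> Tuple[int, int]:
    """Map a 1-based coding-DNA position to (codon number, base within codon).

    Codon 1 covers positions 1-3, codon 2 covers positions 4-6, and so on.
    """
    if position < 1:
        raise ValueError("coding positions start at 1")
    codon = (position - 1) // 3 + 1
    base_in_codon = (position - 1) % 3 + 1
    return codon, base_in_codon


codon, base = codon_of_coding_position(2966)  # -> (989, 2)
```

Position 2966 is the second base of codon 989, which matches the Cys989 in the protein-level name. Deleting a single nucleotide shifts the reading frame because one is not a multiple of three. In HGVS notation the "fs*3" suffix means the shifted frame reaches a stop codon at the third position, counting the first changed amino acid as position 1.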
A novel frameshift mutation of CHD7 in a Japanese patient with CHARGE syndrome.

PubMed

Kohmoto, Tomohiro; Shono, Miki; Naruto, Takuya; Watanabe, Miki; Suga, Ken-Ichi; Nakagawa, Ryuji; Kagami, Shoji; Masuda, Kiyoshi; Imoto, Issei

2016-01-01

CHARGE syndrome is a rare autosomal dominant developmental disorder involving multiple organs. CHD7 is a major causative gene of CHARGE syndrome. We performed targeted-exome sequencing using a next-generation sequencer for molecular diagnosis of a 4-month-old male patient who was clinically suspected to have CHARGE syndrome, and report a novel monoallelic mutation in CHD7, NM_017780.3(CHD7_v001):c.2966del causing a reading frameshift [p.(Cys989Serfs*3)].
GO-based functional dissimilarity of gene sets.

PubMed

Díaz-Díaz, Norberto; Aguilar-Ruiz, Jesús S

2011-09-01

The Gene Ontology (GO) provides a controlled vocabulary for describing the functions of genes and can be used to evaluate the functional coherence of gene sets. Many functional coherence measures consider each pair of gene functions in a set and produce an output based on all pairwise distances. A single gene can encode multiple proteins that may differ in function. For each functionality, other proteins that exhibit the same activity may also participate. Therefore, identifying the most common function for all of the genes involved in a biological process is important in evaluating the functional similarity of groups of genes, and a quantification of functional coherence can help to clarify the role of a group of genes working together. To implement this approach to functional assessment, we present GFD (GO-based Functional Dissimilarity), a novel dissimilarity measure for evaluating groups of genes based on the most relevant functions of the whole set. The measure assigns a numerical value to the gene set for each of the three GO sub-ontologies. Results show that GFD performs robustly when applied to gene sets of known functionality (extracted from KEGG). It performs particularly well on randomly generated gene sets. An ROC analysis reveals that the performance of GFD in evaluating the functional dissimilarity of gene sets is very satisfactory. A comparative analysis against other functional measures, such as GS2 and those presented by Resnik and Wang, also demonstrates the robustness of GFD.
Multiple schwannomatosis caused by the recently described INI1 gene--molecular pathology, and implications for prognosis.

PubMed

Brennan, Paul M; Barlow, Antonio; Geraghty, Alistair; Summers, David; Fitzpatrick, Michael M

2011-06-01

The most common genetic predisposition to multiple schwannoma growth is mutation of the neurofibromatosis type 2 gene. We describe a patient with multiple schwannomas and mutation in the recently described INI1 gene, which also predisposes to the disease. We explore the implications for prognosis and outcome.
Gene-for-genes interactions between cotton R genes and Xanthomonas campestris pv. malvacearum avr genes.

PubMed

De Feyter, R; Yang, Y; Gabriel, D W

1993-01-01

Six plasmid-borne avirulence (avr) genes were previously cloned from strain XcmH of the cotton pathogen, Xanthomonas campestris pv. malvacearum. We have now localized all six avr genes on the cloned fragments by subcloning and Tn5-gusA insertional mutagenesis. None of these avr genes appeared to exhibit exclusively gene-for-gene patterns of interactions with cotton R genes, and avrB4 was demonstrated to confer avr gene-for-R genes (plural) avirulence to X. c. pv. malvacearum on congenic cotton lines carrying either of two different resistance loci, B1 or B4. Furthermore, the B1 locus appeared to confer R gene-for-avr genes resistance to cotton against isogenic X. c. pv. malvacearum strains carrying any one of three avr genes: avrB4, avrb6, or avrB102. Restriction enzyme, Southern blot hybridization, and DNA sequence analyses showed that the XcmH avr genes are all highly similar to each other, to avrBs3 and avrBsP from the pepper pathogen X. c. pv. vesicatoria, and to the host-specific virulence gene pthA from the citrus pathogen X. citri. The XcmH avr genes differed primarily in the multiplicity of a tandemly repeated 102-base pair motif within the central portions of the genes, repeated from 14 to 23 times in members of this gene family. The complete nucleotide sequence of avrb6 revealed that it is 97% identical in DNA sequence to avrB4, avrBs3, avrBsP, and pthA and that 62-bp inverted terminal repeats mark the boundaries of homology between avrb6 and all members of this Xanthomonas virulence/avirulence gene family sequenced to date. The terminal 38 bp of both inverted repeats are highly similar to the 38-bp consensus terminal sequence of the Tn3 family of transposons. Up to 11 members of the avr gene family appear to be present in North American strains of X. c. pv. malvacearum, including XcmH. The high level of homology observed among these avr genes and their presence in multiple copies may explain the gene-for-genes interactions and also the observed high frequencies (10^-3 to 10^-4 per locus) of X. c. pv. malvacearum race change mutations. Five spontaneous race change mutants of XcmH suffered avr locus deletions, strongly indicating intergenic recombination as the primary mechanism for generating new races in X. c. pv. malvacearum.
Enhancing knowledge discovery from cancer genomics data with Galaxy

PubMed Central

Albuquerque, Marco A.; Grande, Bruno M.; Ritch, Elie J.; Pararajalingam, Prasath; Jessa, Selin; Krzywinski, Martin; Grewal, Jasleen K.; Shah, Sohrab P.; Boutros, Paul C.

2017-01-01

Abstract The field of cancer genomics has demonstrated the power of massively parallel sequencing techniques to inform on the genes and specific alterations that drive tumor onset and progression. Although large comprehensive sequence data sets continue to be made increasingly available, data analysis remains an ongoing challenge, particularly for laboratories lacking dedicated resources and bioinformatics expertise. To address this, we have produced a collection of Galaxy tools that represent many popular algorithms for detecting somatic genetic alterations from cancer genome and exome data. We developed new methods for parallelization of these tools within Galaxy to accelerate runtime and have demonstrated their usability and summarized their runtimes on multiple cloud service providers. Some tools represent extensions or refinement of existing toolkits to yield visualizations suited to cohort-wide cancer genomic analysis. For example, we present Oncocircos and Oncoprintplus, which generate data-rich summaries of exome-derived somatic mutation. Workflows that integrate these to achieve data integration and visualizations are demonstrated on a cohort of 96 diffuse large B-cell lymphomas and enabled the discovery of multiple candidate lymphoma-related genes. Our toolkit is available from our GitHub repository as Galaxy tool and dependency definitions and has been deployed using virtualization on multiple platforms including Docker. PMID:28327945
Enhancing knowledge discovery from cancer genomics data with Galaxy.

PubMed

Albuquerque, Marco A; Grande, Bruno M; Ritch, Elie J; Pararajalingam, Prasath; Jessa, Selin; Krzywinski, Martin; Grewal, Jasleen K; Shah, Sohrab P; Boutros, Paul C; Morin, Ryan D

2017-05-01

The field of cancer genomics has demonstrated the power of massively parallel sequencing techniques to inform on the genes and specific alterations that drive tumor onset and progression. Although large comprehensive sequence data sets continue to be made increasingly available, data analysis remains an ongoing challenge, particularly for laboratories lacking dedicated resources and bioinformatics expertise. To address this, we have produced a collection of Galaxy tools that represent many popular algorithms for detecting somatic genetic alterations from cancer genome and exome data. We developed new methods for parallelization of these tools within Galaxy to accelerate runtime and have demonstrated their usability and summarized their runtimes on multiple cloud service providers. Some tools represent extensions or refinement of existing toolkits to yield visualizations suited to cohort-wide cancer genomic analysis. For example, we present Oncocircos and Oncoprintplus, which generate data-rich summaries of exome-derived somatic mutation. Workflows that integrate these to achieve data integration and visualizations are demonstrated on a cohort of 96 diffuse large B-cell lymphomas and enabled the discovery of multiple candidate lymphoma-related genes. Our toolkit is available from our GitHub repository as Galaxy tool and dependency definitions and has been deployed using virtualization on multiple platforms including Docker. © The Author 2017. Published by Oxford University Press.
GeneSilico protein structure prediction meta-server.

PubMed

Kurowski, Michal A; Bujnicki, Janusz M

2003-07-01

Rigorous assessments of protein structure prediction have demonstrated that fold recognition methods can identify remote similarities between proteins when standard sequence search methods fail. It has been shown that the accuracy of predictions is improved when refined multiple sequence alignments are used instead of single sequences and if different methods are combined to generate a consensus model. There are several meta-servers available that integrate protein structure predictions performed by various methods, but they do not allow for submission of user-defined multiple sequence alignments and they seldom offer confidentiality of the results. We developed a novel WWW gateway for protein structure prediction, which combines the useful features of other meta-servers available, but with much greater flexibility of the input. The user may submit an amino acid sequence or a multiple sequence alignment to a set of methods for primary, secondary and tertiary structure prediction. Fold-recognition results (target-template alignments) are converted into full-atom 3D models and the quality of these models is uniformly assessed. A consensus between different FR methods is also inferred. The results are conveniently presented on-line on a single web page over a secure, password-protected connection. The GeneSilico protein structure prediction meta-server is freely available for academic users at http://genesilico.pl/meta.
GeneSilico protein structure prediction meta-server

PubMed Central

Kurowski, Michal A.; Bujnicki, Janusz M.

2003-01-01

Rigorous assessments of protein structure prediction have demonstrated that fold recognition methods can identify remote similarities between proteins when standard sequence search methods fail. It has been shown that the accuracy of predictions is improved when refined multiple sequence alignments are used instead of single sequences and if different methods are combined to generate a consensus model. There are several meta-servers available that integrate protein structure predictions performed by various methods, but they do not allow for submission of user-defined multiple sequence alignments and they seldom offer confidentiality of the results. We developed a novel WWW gateway for protein structure prediction, which combines the useful features of other meta-servers available, but with much greater flexibility of the input. The user may submit an amino acid sequence or a multiple sequence alignment to a set of methods for primary, secondary and tertiary structure prediction. Fold-recognition results (target-template alignments) are converted into full-atom 3D models and the quality of these models is uniformly assessed. A consensus between different FR methods is also inferred. The results are conveniently presented on-line on a single web page over a secure, password-protected connection. The GeneSilico protein structure prediction meta-server is freely available for academic users at http://genesilico.pl/meta. PMID:12824313
Establishment of spatial pattern.

PubMed

Slack, Jonathan

2014-01-01

An overview and perspective are presented of mechanisms for the development of spatial pattern in animal embryos. It is intended both for new entrants to developmental biology and for specialists in other fields, with only a basic knowledge of animal life cycles being required. The first event of pattern formation is normally the localization of a cytoplasmic determinant in the egg, either during oogenesis or post-fertilization. Following cleavage to a multicellular stage, some cells contain the determinant and others do not. The determinant confers a specific developmental pathway on the cells that contain it, often making them the source of the first extracellular signal, or inducing factor. Inducing factors often form concentration gradients to which cells respond by up- or downregulating genes at various concentration thresholds. This enables an initial situation consisting of two cell states (with or without the determinant) to generate a multistate pattern. Multiple rounds of gradient signaling, interspersed with phases of morphogenetic movements, can generate a complex pattern using a small number of signals and responding genes. Development proceeds in a hierarchical manner, with broad body subdivisions being specified initially, and becoming successively subdivided to give individual organs and tissues composed of multiple cell types in a characteristic arrangement. Double gradient models can account for embryonic regulation, whereby a similarly proportioned body pattern is formed following removal of material. Processes that are involved at the later stages include the formation of repeating structures by the combination of an oscillator with a gradient, and the formation of tissues with one cell type scattered in a background of another through a process called lateral inhibition. This set of processes makes up a 'developmental toolkit' which can be deployed in various sequences and combinations to generate a very wide variety of structures and cell types. © 2014 Wiley Periodicals, Inc.
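The threshold idea in this abstract, where a single graded signal yields more than two cell states, can be illustrated with a minimal Python sketch. It is not taken from the article: the exponential gradient shape, the decay length, the two threshold values, and all function names are assumptions chosen only for illustration.

```python
import math

def morphogen_concentration(x, c0=1.0, decay_length=0.2):
    """Concentration at relative position x (0 = source, 1 = far end).

    An exponential decay is assumed here purely for illustration.
    """
    return c0 * math.exp(-x / decay_length)

def cell_state(conc, thresholds=(0.5, 0.1)):
    """Return how many thresholds the local concentration reaches.

    With two thresholds a cell ends up in state 2, 1, or 0.
    """
    return sum(conc >= t for t in thresholds)

def pattern(n_cells=10):
    """States of a row of evenly spaced cells reading the same gradient."""
    positions = [i / (n_cells - 1) for i in range(n_cells)]
    return [cell_state(morphogen_concentration(x)) for x in positions]

print(pattern(10))
```

Under these assumed parameters the row of cells splits into three contiguous zones, with the highest state nearest the source. Adding a third threshold would add a fourth zone without requiring any new signal.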
Gene Ontology Consortium: going forward.

PubMed

2015-01-01

The Gene Ontology (GO; http://www.geneontology.org) is a community-based bioinformatics resource that supplies information about gene product function using ontologies to represent biological knowledge. Here we describe improvements and expansions to several branches of the ontology, as well as updates that have allowed us to more efficiently disseminate the GO and capture feedback from the research community. The Gene Ontology Consortium (GOC) has expanded areas of the ontology such as cilia-related terms, cell-cycle terms and multicellular organism processes. We have also implemented new tools for generating ontology terms based on a set of logical rules making use of templates, and we have made efforts to increase our use of logical definitions. The GOC has a new and improved web site summarizing new developments and documentation, serving as a portal to GO data. Users can perform GO enrichment analysis, and search the GO for terms, annotations to gene products, and associated metadata across multiple species using the all-new AmiGO 2 browser. We encourage and welcome the input of the research community in all biological areas in our continued effort to improve the Gene Ontology. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Computational Tools and Algorithms for Designing Customized Synthetic Genes

PubMed Central

Gould, Nathan; Hendy, Oliver; Papamichail, Dimitris

2014-01-01

Advances in DNA synthesis have enabled the construction of artificial genes, gene circuits, and genomes of bacterial scale. Freedom in de novo design of synthetic constructs provides significant power in studying the impact of mutations in sequence features, and verifying hypotheses on the functional information that is encoded in nucleic and amino acids. To aid this goal, a large number of software tools of variable sophistication have been implemented, enabling the design of synthetic genes for sequence optimization based on rationally defined properties. The first generation of tools dealt predominantly with singular objectives such as codon usage optimization and unique restriction site incorporation. Recent years have seen the emergence of sequence design tools that aim to evolve sequences toward combinations of objectives. The design of optimal protein-coding sequences adhering to multiple objectives is computationally hard, and most tools rely on heuristics to sample the vast sequence design space. In this review, we study some of the algorithmic issues behind gene optimization and the approaches that different tools have adopted to redesign genes and optimize desired coding features. We utilize test cases to demonstrate the efficiency of each approach, as well as identify their strengths and limitations. PMID:25340050
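A minimal Python sketch of the kind of heuristic this abstract describes: a greedy design that combines two objectives, preferring high-weight codons while avoiding a restriction site. It does not reproduce any tool from the review. The codon weights are made-up illustrative values rather than measured codon usage, the table covers only four amino acids, and `design_gene` is a hypothetical name. The codon assignments themselves follow the standard genetic code, and GAATTC is the EcoRI recognition site.

```python
# Toy synonymous-codon table. The weights are invented for illustration and
# do not represent the codon usage of any real organism.
CODON_WEIGHTS = {
    "M": {"ATG": 1.0},
    "E": {"GAA": 0.7, "GAG": 0.3},
    "F": {"TTC": 0.6, "TTT": 0.4},
    "K": {"AAA": 0.7, "AAG": 0.3},
}

def design_gene(protein, forbidden_sites=("GAATTC",)):
    """Greedily back-translate a protein into DNA.

    For each residue, take the highest-weight codon that does not create a
    forbidden site in the sequence built so far; fall back to the next codon
    otherwise. Being greedy, this can fail or be suboptimal where a global
    search would succeed.
    """
    dna = ""
    for aa in protein:
        weights = CODON_WEIGHTS[aa]
        ranked = sorted(weights, key=weights.get, reverse=True)
        for codon in ranked:
            candidate = dna + codon
            if not any(site in candidate for site in forbidden_sites):
                dna = candidate
                break
        else:
            raise ValueError(f"no admissible codon for residue {aa!r}")
    return dna

print(design_gene("MEF"))                      # site-avoiding design
print(design_gene("MEF", forbidden_sites=()))  # codon preference only
```

For the peptide MEF, the preferred codons GAA and TTC would together form GAATTC, so the sketch falls back to TTT for phenylalanine. This shows on a very small scale how objectives can conflict and why such tools resort to heuristics.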
Heart morphogenesis gene regulatory networks revealed by temporal expression analysis.

PubMed

Hill, Jonathon T; Demarest, Bradley; Gorsi, Bushra; Smith, Megan; Yost, H Joseph

2017-10-01

During embryogenesis the heart forms as a linear tube that then undergoes multiple simultaneous morphogenetic events to obtain its mature shape. To understand the gene regulatory networks (GRNs) driving this phase of heart development, during which many congenital heart disease malformations likely arise, we conducted an RNA-seq timecourse in zebrafish from 30 hpf to 72 hpf and identified 5861 genes with altered expression. We clustered the genes by temporal expression pattern, identified transcription factor binding motifs enriched in each cluster, and generated a model GRN for the major gene batteries in heart morphogenesis. This approach predicted hundreds of regulatory interactions and found batteries enriched in specific cell and tissue types, indicating that the approach can be used to narrow the search for novel genetic markers and regulatory interactions. Subsequent analyses confirmed the GRN using two mutants, Tbx5 and nkx2-5, and identified sets of duplicated zebrafish genes that do not show temporal subfunctionalization. This dataset provides an essential resource for future studies on the genetic/epigenetic pathways implicated in congenital heart defects and the mechanisms of cardiac transcriptional regulation. © 2017. Published by The Company of Biologists Ltd.
Diverse Antibiotic Resistance Genes in Dairy Cow Manure

PubMed Central

Wichmann, Fabienne; Udikovic-Kolic, Nikolina; Andrew, Sheila; Handelsman, Jo

2014-01-01

Application of manure from antibiotic-treated animals to crops facilitates the dissemination of antibiotic resistance determinants into the environment. However, our knowledge of the identity, diversity, and patterns of distribution of these antibiotic resistance determinants remains limited. We used a new combination of methods to examine the resistome of dairy cow manure, a common soil amendment. Metagenomic libraries constructed with DNA extracted from manure were screened for resistance to beta-lactams, phenicols, aminoglycosides, and tetracyclines. Functional screening of fosmid and small-insert libraries identified 80 different antibiotic resistance genes whose deduced protein sequences were on average 50 to 60% identical to sequences deposited in GenBank. The resistance genes were frequently found in clusters and originated from a taxonomically diverse set of species, suggesting that some microorganisms in manure harbor multiple resistance genes. Furthermore, amid the great genetic diversity in manure, we discovered a novel clade of chloramphenicol acetyltransferases. Our study combined functional metagenomics with third-generation PacBio sequencing to significantly extend the roster of functional antibiotic resistance genes found in animal gut bacteria, providing a particularly broad resource for understanding the origins and dispersal of antibiotic resistance genes in agriculture and clinical settings. PMID:24757214
DOE Office of Scientific and Technical Information (OSTI.GOV)

Kim, Young June; Ahn, Kwang Sung; Kim, Minjeong

Highlights: • ATM gene-targeted pigs were produced by somatic cell nuclear transfer. • A novel large animal model for ataxia telangiectasia was developed. • The new model may provide an alternative to the mouse model. Abstract: Ataxia telangiectasia (A-T) is an autosomal recessive disorder associated with pleiotropic phenotypes, including progressive cerebellar degeneration, gonad atrophy, and growth retardation. Even though A-T is known to be caused by mutations in the Ataxia telangiectasia mutated (ATM) gene, the correlation between abnormal cellular physiology caused by ATM mutations and the multiple symptoms of A-T disease has not been clearly determined. None of the existing ATM mouse models properly reflects the extent to which neurological degeneration occurs in humans. In an attempt to provide a large animal model for A-T, we produced gene-targeted pigs with mutations in the ATM gene by somatic cell nuclear transfer. The disrupted allele in the ATM gene of cloned piglets was confirmed via PCR and Southern blot analysis. The ATM gene-targeted pigs generated in the present study may provide an alternative to the current mouse model for the study of mechanisms underlying A-T disorder and for the development of new therapies.
Transcriptome Profiling of Human FoxP3+ Regulatory T Cells

PubMed Central

Bhairavabhotla, Ravikiran; Kim, Yong C.; Glass, Deborah D.; Escobar, Thelma M.; Patel, Mira C.; Zahr, Rami; Nguyen, Cuong K.; Kilaru, Gokhul K.; Muljo, Stefan A.; Shevach, Ethan M.

2015-01-01

The major goal of this study was to perform an in-depth characterization of the “gene signature” of human FoxP3+ T regulatory cells (Tregs). Highly purified Tregs and T conventional cells (Tconvs) from multiple healthy donors (HD), either freshly explanted or activated in vitro, were analyzed via RNA sequencing (RNA-seq), and gene expression changes were validated using the nCounter system. Additionally, we analyzed microRNA (miRNA) expression using TaqMan low-density arrays. Our results confirm previous studies demonstrating selective gene expression of FoxP3, IKZF2, and CTLA4 in Tregs. Notably, a number of as yet uncharacterized genes (RTKN2, LAYN, UTS2, CSF2RB, TRIB1, F5, CECAM4, CD70, ENC1 and NKG7) were identified and validated as being differentially expressed in human Tregs. We further characterize the functional roles of RTKN2 and LAYN in in vitro human Treg suppression assays by knocking them down in Tregs and overexpressing them in Tconvs. In order to facilitate a better understanding of the human Treg gene expression signature, we have generated from our results a hypothetical interactome of genes and miRNAs in Tregs and Tconvs. PMID:26686412
Effects of a petunia scaffold/matrix attachment region on copy number dependency and stability of transgene expression in Nicotiana tabacum.

PubMed

Dietz-Pfeilstetter, Antje; Arndt, Nicola; Manske, Ulrike

2016-04-01

Transgenes in genetically modified plants are often not reliably expressed during development or in subsequent generations. Transcriptional gene silencing (TGS) as well as post-transcriptional gene silencing (PTGS) have been shown to occur in transgenic plants depending on integration pattern, copy number and integration site. In an effort to reduce position effects, to prevent read-through transcription and to provide a more accessible chromatin structure, a P35S-β-glucuronidase (P35S-gus) transgene flanked by a scaffold/matrix attachment region from petunia (Petun-SAR) was introduced into Nicotiana tabacum plants by Agrobacterium tumefaciens-mediated transformation. It was found that Petun-SAR mediates enhanced expression and copy number dependency up to 2 gene copies, but did not prevent gene silencing in transformants with multiple and rearranged gene copies. However, in contrast to the non-SAR transformants where silencing was irreversible and proceeded during long-term vegetative propagation and in progeny plants, gus expression in Petun-SAR plants was re-established in the course of development. Gene silencing was not necessarily accompanied by DNA methylation, while the gus transgene could still be expressed despite considerable CG methylation within the coding region.
Dizeez: An Online Game for Human Gene-Disease Annotation

PubMed Central

Loguercio, Salvatore; Good, Benjamin M.; Su, Andrew I.

2013-01-01

Structured gene annotations are a foundation upon which many bioinformatics and statistical analyses are built. However, the structured annotations available in public databases are a sparse representation of biological knowledge as a whole. The rate of biomedical data generation is such that centralized biocuration efforts struggle to keep up. New models for gene annotation need to be explored that increase the pace at which we are able to structure biomedical knowledge. Recently, online games have emerged as an effective way to recruit, engage and organize large numbers of volunteers to help address difficult biological challenges. For example, games have been successfully developed for protein folding (Foldit), multiple sequence alignment (Phylo) and RNA structure design (EteRNA). Here we present Dizeez, a simple online game built with the purpose of structuring knowledge of gene-disease associations. Preliminary results from game play online and at scientific conferences suggest that Dizeez is producing valid gene-disease annotations not yet present in any public database. These early results provide a basic proof of principle that online games can be successfully applied to the challenge of gene annotation. Dizeez is available at http://genegames.org. PMID:23951102
