Upregulating endogenous genes by an RNA-programmable artificial transactivator
Fimiani, Cristina; Goina, Elisa; Mallamaci, Antonello
2015-01-01
To promote expression of endogenous genes ad libitum, we developed a novel, programmable transcription factor prototype. Kept together via an MS2 coat protein/RNA interface, it includes a fixed, polypeptidic transactivating domain and a variable RNA domain that recognizes the desired gene. Thanks to this device, we specifically upregulated five genes, in cell lines and primary cultures of murine pallial precursors. Gene upregulation was small, however sufficient to robustly inhibit neuronal differentiation. The transactivator interacted with target gene chromatin via its RNA cofactor. Its activity was restricted to cells in which the target gene is normally transcribed. Our device might be useful for specific applications. However for this purpose, it will require an improvement of its transactivation power as well as a better characterization of its target specificity and mechanism of action. PMID:26152305
Gene transfer of Hodgkin cell lines via multivalent anti-CD30 scFv displaying bacteriophage.
Chung, Yoon-Suk A; Sabel, Katja; Krönke, Martin; Klimka, Alexander
2008-04-16
The display of binding ligands, such as recombinant antibody fragments, on the surface of filamentous phage makes it possible to specifically attach these phage particles to target cells. After uptake of the phage, their internal single-stranded DNA is processed by the host cell, which allows transient expression of an encoded eukaryotic gene cassette. This opens the possibility to use bacteriophage as vectors for targeted gene therapy, although the transduction efficiency is very low. Here we demonstrate the display of an anti-CD30 single chain variable fragment fused to the major coat protein pVIII on the surface of bacteriophage. These phage particles showed an improved binding and transduction efficiency of CD30 positive Hodgkin-lymphoma cells, compared to bacteriophage with the anti-CD30 single chain variable fragment fused to the minor coat protein pIII. We can conclude from the results that the postulated multivalency of the anti-CD30-pVIII displaying bacteriophage combined with disseminated display of the anti-CD30 scFv on the whole particle surface is responsible for the improved gene transfer rate. These results mark an important step towards the use of phage particles as a cheap and safe gene transfer vehicle for the gene delivery of the desired target cells via their specific surface receptors.
Chen, Eric C H; Morin, Emmanuelle; Beaudet, Denis; Noel, Jessica; Yildirir, Gokalp; Ndikumana, Steve; Charron, Philippe; St-Onge, Camille; Giorgi, John; Krüger, Manuela; Marton, Timea; Ropars, Jeanne; Grigoriev, Igor V; Hainaut, Matthieu; Henrissat, Bernard; Roux, Christophe; Martin, Francis; Corradi, Nicolas
2018-01-22
Arbuscular mycorrhizal fungi (AMF) are known to improve plant fitness through the establishment of mycorrhizal symbioses. Genetic and phenotypic variations among closely related AMF isolates can significantly affect plant growth, but the genomic changes underlying this variability are unclear. To address this issue, we improved the genome assembly and gene annotation of the model strain Rhizophagus irregularis DAOM197198, and compared its gene content with five isolates of R. irregularis sampled in the same field. All isolates harbor striking genome variations, with large numbers of isolate-specific genes, gene family expansions, and evidence of interisolate genetic exchange. The observed variability affects all gene ontology terms and PFAM protein domains, as well as putative mycorrhiza-induced small secreted effector-like proteins and other symbiosis differentially expressed genes. High variability is also found in active transposable elements. Overall, these findings indicate a substantial divergence in the functioning capacity of isolates harvested from the same field, and thus their genetic potential for adaptation to biotic and abiotic changes. Our data also provide a first glimpse into the genome diversity that resides within natural populations of these symbionts, and open avenues for future analyses of plant-AMF interactions that link AMF genome variation with plant phenotype and fitness. © 2018 The Authors. New Phytologist © 2018 New Phytologist Trust.
Disease-modifying genetic factors in cystic fibrosis.
Marson, Fernando A L
2018-05-01
To compile data from the past 10 years regarding the role of modifying genes in cystic fibrosis (CF). CF is a model disease for understanding of the action of modifying genes. Although it is a monogenic (CFTR) autosomal recessive disease, CF presents with wide phenotypic variability. In CF, variability occurs with different intensity among patients by each organ, being organ-specific, resulting from the mutual interaction of environmental and genetic factors, including CFTR mutations and various other genes, most of which are associated with inflammatory processes. In individuals, using precision medicine, gene modification studies have revealed individualized responses to drugs depending on particular CFTR mutations and modifying genes, most of which are alternative ion channels. Studies of modifying genes in CF allow: understanding of clinical variability among patients with the same CFTR genotype; evaluation of precision medicine; understanding of environmental and genetic effects at the organ level; understanding the involvement of genetic variants in inflammatory responses; improvements in genetic counseling; understanding the involvement of genetic variants in inflammatory responses in lung diseases, such as asthma; and understanding the individuality of the person with the disease.
The AGT Gene M235T Polymorphism and Response of Power-Related Variables to Aerobic Training
Aleksandra, Zarębska; Zbigniew, Jastrzębski; Waldemar, Moska; Agata, Leońska-Duniec; Mariusz, Kaczmarczyk; Marek, Sawczuk; Agnieszka, Maciejewska-Skrendo; Piotr, Żmijewski; Krzysztof, Ficek; Grzegorz, Trybek; Ewelina, Lulińska-Kuklik; Semenova, Ekaterina A.; Ahmetov, Ildus I.; Paweł, Cięszczyk
2016-01-01
The C allele of the M235T (rs699) polymorphism of the AGT gene correlates with higher levels of angiotensin II and has been associated with power and strength sport performance. The aim of the study was to investigate whether or not selected power-related variables and their response to a 12-week program of aerobic dance training are modulated by the AGT M235T genotype in healthy participants. Two hundred and one Polish Caucasian women aged 21 ± 1 years met the inclusion criteria and were included in the study. All women completed a 12-week program of low and high impact aerobics. Wingate peak power and total work capacity, 5 m, 10 m, and 30 m running times and jump height and jump power were determined before and after the training programme. All power-related variables improved significantly in response to aerobic dance training. We found a significant association between the M235T polymorphism and jump-based variables (squat jump (SJ) height, p = 0.005; SJ power, p = 0.015; countermovement jump height, p = 0.025; average of 10 countermovement jumps with arm swing (ACMJ) height, p = 0.001; ACMJ power, p = 0.035). Specifically, greater improvements were observed in the C allele carriers in comparison with TT homozygotes. In conclusion, aerobic dance, one of the most commonly practiced adult fitness activities in the world, provides sufficient training stimuli for augmenting the explosive strength necessary to increase vertical jump performance. The AGT gene M235T polymorphism seems to be not only a candidate gene variant for power/strength related phenotypes, but also a genetic marker for predicting response to training. Key points Aerobic dance provides sufficient training stimuli for the improvement of explosive power. The AGT gene M235T polymorphism is associated with individual variation in the change of power-related phenotypes in response to aerobic dance training. The C allele carriers of the AGT gene M235T polymorphism show greater improvements of jump-based variables in comparison with TT homozygotes. PMID:27928207
Genetic Influence on Slope Variability in a Childhood Reflexive Attention Task.
Lundwall, Rebecca A; Watkins, Jeffrey K
2015-01-01
Individuals are not perfectly consistent, and interindividual variability is a common feature in all varieties of human behavior. Some individuals respond more variably than others, however, and this difference may be important to understanding how the brain works. In this paper, we explore genetic contributions to response time (RT) slope variability on a reflexive attention task. We are interested in such variability because we believe it is an important part of the overall picture of attention that, if understood, has the potential to improve intervention for those with attentional deficits. Genetic association studies are valuable in discovering biological pathways of variability and several studies have found such associations with a sustained attention task. Here, we expand our knowledge to include a reflexive attention task. We ask whether specific candidate genes are associated with interindividual variability on a childhood reflexive attention task in 9-16 year olds. The genetic makers considered are on 11 genes: APOE, BDNF, CHRNA4, COMT, DRD4, HTR4, IGF2, MAOA, SLC5A7, SLC6A3, and SNAP25. We find significant associations with variability with markers on nine and we discuss the results in terms of neurotransmitters associated with each gene and the characteristics of the associated measures from the reflexive attention task.
Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun
2013-01-01
The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of transcriptome. Many different human tissue/cell transcriptome datasets coming from RNA-Seq technology are available on public data resource. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is higher related to physiological condition rather than tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in gene expression level and variation, we firstly apply improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (a HK gene is specific in cancer group, while not in normal group) are expressed higher and more variable in cancer condition than in normal condition. Cancer-associated HK genes prefer to AT-rich genes, and they are enriched in cell cycle regulation related functions and constitute some cancer signatures. The expression of large genes is also avoided in cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867
Improving RNA-Seq expression estimation by modeling isoform- and exon-specific read sequencing rate.
Liu, Xuejun; Shi, Xinxin; Chen, Chunlin; Zhang, Li
2015-10-16
The high-throughput sequencing technology, RNA-Seq, has been widely used to quantify gene and isoform expression in the study of transcriptome in recent years. Accurate expression measurement from the millions or billions of short generated reads is obstructed by difficulties. One is ambiguous mapping of reads to reference transcriptome caused by alternative splicing. This increases the uncertainty in estimating isoform expression. The other is non-uniformity of read distribution along the reference transcriptome due to positional, sequencing, mappability and other undiscovered sources of biases. This violates the uniform assumption of read distribution for many expression calculation approaches, such as the direct RPKM calculation and Poisson-based models. Many methods have been proposed to address these difficulties. Some approaches employ latent variable models to discover the underlying pattern of read sequencing. However, most of these methods make bias correction based on surrounding sequence contents and share the bias models by all genes. They therefore cannot estimate gene- and isoform-specific biases as revealed by recent studies. We propose a latent variable model, NLDMseq, to estimate gene and isoform expression. Our method adopts latent variables to model the unknown isoforms, from which reads originate, and the underlying percentage of multiple spliced variants. The isoform- and exon-specific read sequencing biases are modeled to account for the non-uniformity of read distribution, and are identified by utilizing the replicate information of multiple lanes of a single library run. We employ simulation and real data to verify the performance of our method in terms of accuracy in the calculation of gene and isoform expression. Results show that NLDMseq obtains competitive gene and isoform expression compared to popular alternatives. Finally, the proposed method is applied to the detection of differential expression (DE) to show its usefulness in the downstream analysis. The proposed NLDMseq method provides an approach to accurately estimate gene and isoform expression from RNA-Seq data by modeling the isoform- and exon-specific read sequencing biases. It makes use of a latent variable model to discover the hidden pattern of read sequencing. We have shown that it works well in both simulations and real datasets, and has competitive performance compared to popular methods. The method has been implemented as a freely available software which can be found at https://github.com/PUGEA/NLDMseq.
Tsafa, Effrosyni; Al-Bahrani, Mariam; Bentayebi, Kaoutar; Przystal, Justyna; Suwan, Keittisak; Hajitou, Amin
2016-08-09
Gene therapy has long been regarded as a promising treatment for cancer. However, cancer gene therapy is still facing the challenge of targeting gene delivery vectors specifically to tumors when administered via clinically acceptable non-invasive systemic routes (i.e. intravenous). The bacteria virus, bacteriophage (phage), represents a new generation of promising vectors in systemic gene delivery since their targeting can be achieved through phage capsid display ligands, which enable them to home to specific tumor receptors without the need to ablate any native eukaryotic tropism. We have previously reported a tumor specific bacteriophage vector named adeno-associated virus/phage, or AAVP, in which gene expression is under a recombinant human rAAV2 virus genome targeted to tumors via a ligand-directed phage capsid. However, cancer gene therapy with this tumor-targeted vector achieved variable outcomes ranging from tumor regression to no effect in both experimental and natural preclinical models. Herein, we hypothesized that combining the natural dietary genistein, with proven anticancer activity, would improve bacteriophage anticancer safe therapy. We show that combination treatment with genistein and AAVP increased targeted cancer cell killing by AAVP carrying the gene for Herpes simplex virus thymidine kinase (HSVtk) in 2D tissue cultures and 3D tumor spheroids. We found this increased tumor cell killing was associated with enhanced AAVP-mediated gene expression. Next, we established that genistein protects AAVP against proteasome degradation and enhances vector genome accumulation in the nucleus. Combination of genistein and phage-guided virotherapy is a safe and promising strategy that should be considered in anticancer therapy with AAVP.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Edward DeLong
2011-10-07
Our overarching goals in this project were to: Develop and improve high-throughput sequencing methods and analytical approaches for quantitative analyses of microbial gene expression at the Hawaii Ocean Time Series Station and the Bermuda Atlantic Time Series Station; Conduct field analyses following gene expression patterns in picoplankton microbial communities in general, and Prochlorococcus flow sorted from that community, as they respond to different environmental variables (light, macronutrients, dissolved organic carbon), that are predicted to influence activity, productivity, and carbon cycling; Use the expression analyses of flow sorted Prochlorococcus to identify horizontally transferred genes and gene products, in particular those thatmore » are located in genomic islands and likely to confer habitat-specific fitness advantages; Use the microbial community gene expression data that we generate to gain insights, and test hypotheses, about the variability, genomic context, activity and function of as yet uncharacterized gene products, that appear highly expressed in the environment. We achieved the above goals, and even more over the course of the project. This includes a number of novel methodological developments, as well as the standardization of microbial community gene expression analyses in both field surveys, and experimental modalities. The availability of these methods, tools and approaches is changing current practice in microbial community analyses.« less
Chau, John H; Rahfeldt, Wolfgang A; Olmstead, Richard G
2018-03-01
Targeted sequence capture can be used to efficiently gather sequence data for large numbers of loci, such as single-copy nuclear loci. Most published studies in plants have used taxon-specific locus sets developed individually for a clade using multiple genomic and transcriptomic resources. General locus sets can also be developed from loci that have been identified as single-copy and have orthologs in large clades of plants. We identify and compare a taxon-specific locus set and three general locus sets (conserved ortholog set [COSII], shared single-copy nuclear [APVO SSC] genes, and pentatricopeptide repeat [PPR] genes) for targeted sequence capture in Buddleja (Scrophulariaceae) and outgroups. We evaluate their performance in terms of assembly success, sequence variability, and resolution and support of inferred phylogenetic trees. The taxon-specific locus set had the most target loci. Assembly success was high for all locus sets in Buddleja samples. For outgroups, general locus sets had greater assembly success. Taxon-specific and PPR loci had the highest average variability. The taxon-specific data set produced the best-supported tree, but all data sets showed improved resolution over previous non-sequence capture data sets. General locus sets can be a useful source of sequence capture targets, especially if multiple genomic resources are not available for a taxon.
Cong, Le; Zhou, Ruhong; Kuo, Yu-chi; Cunniff, Margaret; Zhang, Feng
2012-01-01
Transcription activator-like effectors (TALE) are sequence-specific DNA binding proteins that harbor modular, repetitive DNA binding domains. TALEs have enabled the creation of customizable designer transcriptional factors and sequence-specific nucleases for genome engineering. Here we report two improvements of the TALE toolbox for achieving efficient activation and repression of endogenous gene expression in mammalian cells. We show that the naturally occurring repeat variable diresidue (RVD) Asn-His (NH) has high biological activity and specificity for guanine, a highly prevalent base in mammalian genomes. We also report an effective TALE transcriptional repressor architecture for targeted inhibition of transcription in mammalian cells. These findings will improve the precision and effectiveness of genome engineering that can be achieved using TALEs. PMID:22828628
Tsafa, Effrosyni; Al-Bahrani, Mariam; Bentayebi, Kaoutar; Przystal, Justyna; Suwan, Keittisak; Hajitou, Amin
2016-01-01
Gene therapy has long been regarded as a promising treatment for cancer. However, cancer gene therapy is still facing the challenge of targeting gene delivery vectors specifically to tumors when administered via clinically acceptable non-invasive systemic routes (i.e. intravenous). The bacteria virus, bacteriophage (phage), represents a new generation of promising vectors in systemic gene delivery since their targeting can be achieved through phage capsid display ligands, which enable them to home to specific tumor receptors without the need to ablate any native eukaryotic tropism. We have previously reported a tumor specific bacteriophage vector named adeno-associated virus/phage, or AAVP, in which gene expression is under a recombinant human rAAV2 virus genome targeted to tumors via a ligand-directed phage capsid. However, cancer gene therapy with this tumor-targeted vector achieved variable outcomes ranging from tumor regression to no effect in both experimental and natural preclinical models. Herein, we hypothesized that combining the natural dietary genistein, with proven anticancer activity, would improve bacteriophage anticancer safe therapy. We show that combination treatment with genistein and AAVP increased targeted cancer cell killing by AAVP carrying the gene for Herpes simplex virus thymidine kinase (HSVtk) in 2D tissue cultures and 3D tumor spheroids. We found this increased tumor cell killing was associated with enhanced AAVP-mediated gene expression. Next, we established that genistein protects AAVP against proteasome degradation and enhances vector genome accumulation in the nucleus. Combination of genistein and phage-guided virotherapy is a safe and promising strategy that should be considered in anticancer therapy with AAVP. PMID:27437775
The AGT Gene M235T Polymorphism and Response of Power-Related Variables to Aerobic Training.
Aleksandra, Zarębska; Zbigniew, Jastrzębski; Waldemar, Moska; Agata, Leońska-Duniec; Mariusz, Kaczmarczyk; Marek, Sawczuk; Agnieszka, Maciejewska-Skrendo; Piotr, Żmijewski; Krzysztof, Ficek; Grzegorz, Trybek; Ewelina, Lulińska-Kuklik; Semenova, Ekaterina A; Ahmetov, Ildus I; Paweł, Cięszczyk
2016-12-01
The C allele of the M235T (rs699) polymorphism of the AGT gene correlates with higher levels of angiotensin II and has been associated with power and strength sport performance. The aim of the study was to investigate whether or not selected power-related variables and their response to a 12-week program of aerobic dance training are modulated by the AGT M235T genotype in healthy participants. Two hundred and one Polish Caucasian women aged 21 ± 1 years met the inclusion criteria and were included in the study. All women completed a 12-week program of low and high impact aerobics. Wingate peak power and total work capacity, 5 m, 10 m, and 30 m running times and jump height and jump power were determined before and after the training programme. All power-related variables improved significantly in response to aerobic dance training. We found a significant association between the M235T polymorphism and jump-based variables (squat jump (SJ) height, p = 0.005; SJ power, p = 0.015; countermovement jump height, p = 0.025; average of 10 countermovement jumps with arm swing (ACMJ) height, p = 0.001; ACMJ power, p = 0.035). Specifically, greater improvements were observed in the C allele carriers in comparison with TT homozygotes. In conclusion, aerobic dance, one of the most commonly practiced adult fitness activities in the world, provides sufficient training stimuli for augmenting the explosive strength necessary to increase vertical jump performance. The AGT gene M235T polymorphism seems to be not only a candidate gene variant for power/strength related phenotypes, but also a genetic marker for predicting response to training.
Parker, Jennifer K.; Havird, Justin C.
2012-01-01
Isolates of the plant pathogen Xylella fastidiosa are genetically very similar, but studies on their biological traits have indicated differences in virulence and infection symptomatology. Taxonomic analyses have identified several subspecies, and phylogenetic analyses of housekeeping genes have shown broad host-based genetic differences; however, results are still inconclusive for genetic differentiation of isolates within subspecies. This study employs multilocus sequence analysis of environmentally mediated genes (MLSA-E; genes influenced by environmental factors) to investigate X. fastidiosa relationships and differentiate isolates with low genetic variability. Potential environmentally mediated genes, including host colonization and survival genes related to infection establishment, were identified a priori. The ratio of the rate of nonsynonymous substitutions to the rate of synonymous substitutions (dN/dS) was calculated to select genes that may be under increased positive selection compared to previously studied housekeeping genes. Nine genes were sequenced from 54 X. fastidiosa isolates infecting different host plants across the United States. Results of maximum likelihood (ML) and Bayesian phylogenetic (BP) analyses are in agreement with known X. fastidiosa subspecies clades but show novel within-subspecies differentiation, including geographic differentiation, and provide additional information regarding host-based isolate variation and specificity. dN/dS ratios of environmentally mediated genes, though <1 due to high sequence similarity, are significantly greater than housekeeping gene dN/dS ratios and correlate with increased sequence variability. MLSA-E can more precisely resolve relationships between closely related bacterial strains with low genetic variability, such as X. fastidiosa isolates. Discovering the genetic relationships between X. fastidiosa isolates will provide new insights into the epidemiology of populations of X. fastidiosa, allowing improved disease management in economically important crops. PMID:22194287
Parker, Jennifer K; Havird, Justin C; De La Fuente, Leonardo
2012-03-01
Isolates of the plant pathogen Xylella fastidiosa are genetically very similar, but studies on their biological traits have indicated differences in virulence and infection symptomatology. Taxonomic analyses have identified several subspecies, and phylogenetic analyses of housekeeping genes have shown broad host-based genetic differences; however, results are still inconclusive for genetic differentiation of isolates within subspecies. This study employs multilocus sequence analysis of environmentally mediated genes (MLSA-E; genes influenced by environmental factors) to investigate X. fastidiosa relationships and differentiate isolates with low genetic variability. Potential environmentally mediated genes, including host colonization and survival genes related to infection establishment, were identified a priori. The ratio of the rate of nonsynonymous substitutions to the rate of synonymous substitutions (dN/dS) was calculated to select genes that may be under increased positive selection compared to previously studied housekeeping genes. Nine genes were sequenced from 54 X. fastidiosa isolates infecting different host plants across the United States. Results of maximum likelihood (ML) and Bayesian phylogenetic (BP) analyses are in agreement with known X. fastidiosa subspecies clades but show novel within-subspecies differentiation, including geographic differentiation, and provide additional information regarding host-based isolate variation and specificity. dN/dS ratios of environmentally mediated genes, though <1 due to high sequence similarity, are significantly greater than housekeeping gene dN/dS ratios and correlate with increased sequence variability. MLSA-E can more precisely resolve relationships between closely related bacterial strains with low genetic variability, such as X. fastidiosa isolates. Discovering the genetic relationships between X. fastidiosa isolates will provide new insights into the epidemiology of populations of X. fastidiosa, allowing improved disease management in economically important crops.
Rancourt, Rebecca C; Schellong, Karen; Tzschentke, Barbara; Henrich, Wolfgang; Plagemann, Andreas
2018-06-01
Increased availability and improved sequence annotation of the chicken ( Gallus gallus f. domestica ) genome have sparked interest in the bird as a model system to investigate translational embryonic development and health/disease outcomes. However, the epigenetics of this bird genome remain unclear. The aim of this study was to determine the levels of gene expression and DNA methylation at the proopiomelanocortin ( POMC ) gene in the hypothalamus of 3-week-old chickens. POMC is a key player in the control of the stress response, food intake, and metabolism. DNA methylation of the promoter, CpG island, and gene body regions of POMC were measured. Our data illustrate the pattern, variability, and functionality of DNA methylation for POMC expression in the chicken. Our findings show correlation of methylation pattern and gene expression along with sex-specific differences in POMC . Overall, these novel data highlight the promising potential of the chicken as a model and also the need for breeders and researchers to consider sex ratios in their studies.
Removing technical variability in RNA-seq data using conditional quantile normalization.
Hansen, Kasper D; Irizarry, Rafael A; Wu, Zhijin
2012-04-01
The ability to measure gene expression on a genome-wide scale is one of the most promising accomplishments in molecular biology. Microarrays, the technology that first permitted this, were riddled with problems due to unwanted sources of variability. Many of these problems are now mitigated, after a decade's worth of statistical methodology development. The recently developed RNA sequencing (RNA-seq) technology has generated much excitement in part due to claims of reduced variability in comparison to microarrays. However, we show that RNA-seq data demonstrate unwanted and obscuring variability similar to what was first observed in microarrays. In particular, we find guanine-cytosine content (GC-content) has a strong sample-specific effect on gene expression measurements that, if left uncorrected, leads to false positives in downstream results. We also report on commonly observed data distortions that demonstrate the need for data normalization. Here, we describe a statistical methodology that improves precision by 42% without loss of accuracy. Our resulting conditional quantile normalization algorithm combines robust generalized regression to remove systematic bias introduced by deterministic features such as GC-content and quantile normalization to correct for global distortions.
Technical variables in high-throughput miRNA expression profiling: much work remains to be done.
Nelson, Peter T; Wang, Wang-Xia; Wilfred, Bernard R; Tang, Guiliang
2008-11-01
MicroRNA (miRNA) gene expression profiling has provided important insights into plant and animal biology. However, there has not been ample published work about pitfalls associated with technical parameters in miRNA gene expression profiling. One source of pertinent information about technical variables in gene expression profiling is the separate and more well-established literature regarding mRNA expression profiling. However, many aspects of miRNA biochemistry are unique. For example, the cellular processing and compartmentation of miRNAs, the differential stability of specific miRNAs, and aspects of global miRNA expression regulation require specific consideration. Additional possible sources of systematic bias in miRNA expression studies include the differential impact of pre-analytical variables, substrate specificity of nucleic acid processing enzymes used in labeling and amplification, and issues regarding new miRNA discovery and annotation. We conclude that greater focus on technical parameters is required to bolster the validity, reliability, and cultural credibility of miRNA gene expression profiling studies.
2011-01-01
Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made. Results Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region). Conclusion The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions. PMID:21834981
Epi-fingerprinting and epi-interventions for improved crop production and food quality
Rodríguez López, Carlos M.; Wilkinson, Mike J.
2015-01-01
Increasing crop production at a time of rapid climate change represents the greatest challenge facing contemporary agricultural research. Our understanding of the genetic control of yield derives from controlled field experiments designed to minimize environmental variance. In spite of these efforts there is substantial residual variability among plants attributable to Genotype × Environment interactions. Recent advances in the field of epigenetics have revealed a plethora of gene control mechanisms that could account for much of this unassigned variation. These systems act as a regulatory interface between the perception of the environment and associated alterations in gene expression. Direct intervention of epigenetic control systems hold the enticing promise of creating new sources of variability that could enhance crop performance. Equally, understanding the relationship between various epigenetic states and responses of the crop to specific aspects of the growing environment (epigenetic fingerprinting) could allow for a more tailored approach to plant agronomy. In this review, we explore the many ways in which epigenetic interventions and epigenetic fingerprinting can be deployed for the improvement of crop production and quality. PMID:26097484
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kadam, Suhas; Abril, Alejandra; Dhanapal, Arun P.
Waterlogging is a significant environmental constraint to crop production, and a better understanding of plant responses is critical for the improvement of crop tolerance to waterlogged soils. Aquaporins (AQPs) are a class of channel-forming proteins that play an important role in water transport in plants. Our study aimed to examine the regulation of AQP genes under waterlogging stress and to characterize the genetic variability of AQP genes in sorghum (Sorghum bicolor). Transcriptional profiling of AQP genes in response to waterlogging stress in nodal root tips and nodal root basal regions of two tolerant and two sensitive sorghum genotypes at 18more » and 96 h after waterlogging stress imposition revealed significant gene-specific pattern with regard to genotype, root tissue sample, and time point. For some tissue sample and time point combinations, PIP2-6, PIP2-7, TIP2-2, TIP4-4, and TIP5-1 expression was differentially regulated in tolerant compared to sensitive genotypes. The differential response of these AQP genes suggests that they may play a tissue specific role in mitigating waterlogging stress. Genetic analysis of sorghum revealed that AQP genes were clustered into the same four subfamilies as in maize (Zea mays) and rice (Oryza sativa) and that residues determining the AQP channel specificity were largely conserved across species. Single nucleotide polymorphism (SNP) data from 50 sorghum accessions were used to build an AQP gene-based phylogeny of the haplotypes. Phylogenetic analysis based on single nucleotide polymorphisms of sorghum AQP genes placed the tolerant and sensitive genotypes used for the expression study in distinct groups. Expression analyses suggested that selected AQPs may play a pivotal role in sorghum tolerance to water logging stress. Furthermore experimentation is needed to verify their role and to leverage phylogenetic analyses and AQP expression data to improve water logging tolerance in sorghum.« less
Kadam, Suhas; Abril, Alejandra; Dhanapal, Arun P.; ...
2017-05-30
Waterlogging is a significant environmental constraint to crop production, and a better understanding of plant responses is critical for the improvement of crop tolerance to waterlogged soils. Aquaporins (AQPs) are a class of channel-forming proteins that play an important role in water transport in plants. Our study aimed to examine the regulation of AQP genes under waterlogging stress and to characterize the genetic variability of AQP genes in sorghum (Sorghum bicolor). Transcriptional profiling of AQP genes in response to waterlogging stress in nodal root tips and nodal root basal regions of two tolerant and two sensitive sorghum genotypes at 18more » and 96 h after waterlogging stress imposition revealed significant gene-specific pattern with regard to genotype, root tissue sample, and time point. For some tissue sample and time point combinations, PIP2-6, PIP2-7, TIP2-2, TIP4-4, and TIP5-1 expression was differentially regulated in tolerant compared to sensitive genotypes. The differential response of these AQP genes suggests that they may play a tissue specific role in mitigating waterlogging stress. Genetic analysis of sorghum revealed that AQP genes were clustered into the same four subfamilies as in maize (Zea mays) and rice (Oryza sativa) and that residues determining the AQP channel specificity were largely conserved across species. Single nucleotide polymorphism (SNP) data from 50 sorghum accessions were used to build an AQP gene-based phylogeny of the haplotypes. Phylogenetic analysis based on single nucleotide polymorphisms of sorghum AQP genes placed the tolerant and sensitive genotypes used for the expression study in distinct groups. Expression analyses suggested that selected AQPs may play a pivotal role in sorghum tolerance to water logging stress. Furthermore experimentation is needed to verify their role and to leverage phylogenetic analyses and AQP expression data to improve water logging tolerance in sorghum.« less
Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data.
Racle, Julien; de Jonge, Kaat; Baumgaertner, Petra; Speiser, Daniel E; Gfeller, David
2017-11-13
Immune cells infiltrating tumors can have important impact on tumor progression and response to therapy. We present an efficient algorithm to simultaneously estimate the fraction of cancer and immune cell types from bulk tumor gene expression data. Our method integrates novel gene expression profiles from each major non-malignant cell type found in tumors, renormalization based on cell-type-specific mRNA content, and the ability to consider uncharacterized and possibly highly variable cell types. Feasibility is demonstrated by validation with flow cytometry, immunohistochemistry and single-cell RNA-Seq analyses of human melanoma and colorectal tumor specimens. Altogether, our work not only improves accuracy but also broadens the scope of absolute cell fraction predictions from tumor gene expression data, and provides a unique novel experimental benchmark for immunogenomics analyses in cancer research (http://epic.gfellerlab.org).
Gascuel, Quentin; Diretto, Gianfranco; Monforte, Antonio J.; Fortes, Ana M.; Granell, Antonio
2017-01-01
Improving fruit quality has become a major goal in plant breeding. Direct approaches to tackling fruit quality traits specifically linked to consumer preferences and environmental friendliness, such as improved flavor, nutraceutical compounds, and sustainability, have slowly been added to a breeder priority list that already includes traits like productivity, efficiency, and, especially, pest and disease control. Breeders already use molecular genetic tools to improve fruit quality although most advances have been made in producer and industrial quality standards. Furthermore, progress has largely been limited to simple agronomic traits easy-to-observe, whereas the vast majority of quality attributes, specifically those relating to flavor and nutrition, are complex and have mostly been neglected. Fortunately, wild germplasm, which is used for resistance against/tolerance of environmental stresses (including pathogens), is still available and harbors significant genetic variation for taste and health-promoting traits. Similarly, heirloom/traditional varieties could be used to identify which genes contribute to flavor and health quality and, at the same time, serve as a good source of the best alleles for organoleptic quality improvement. Grape (Vitis vinifera L.) and tomato (Solanum lycopersicum L.) produce fleshy, berry-type fruits, among the most consumed in the world. Both have undergone important domestication and selection processes, that have dramatically reduced their genetic variability, and strongly standardized fruit traits. Moreover, more and more consumers are asking for sustainable production, incompatible with the wide range of chemical inputs. In the present paper, we review the genetic resources available to tomato/grape breeders, and the recent technological progresses that facilitate the identification of genes/alleles of interest within the natural or generated variability gene pool. These technologies include omics, high-throughput phenotyping/phenomics, and biotech approaches. Our review also covers a range of technologies used to transfer to tomato and grape those alleles considered of interest for fruit quality. These include traditional breeding, TILLING (Targeting Induced Local Lesions in Genomes), genetic engineering, or NPBT (New Plant Breeding Technologies). Altogether, the combined exploitation of genetic variability and innovative biotechnological tools may facilitate breeders to improve fruit quality tacking more into account the consumer standards and the needs to move forward into more sustainable farming practices. PMID:28553296
Anderson, Ashley K.; Ohler, Uwe; Wassarman, David A.
2012-01-01
To investigate the importance of core promoter elements for tissue-specific transcription of RNA polymerase II genes, we examined testis-specific transcription in Drosophila melanogaster. Bioinformatic analyses of core promoter sequences from 190 genes that are specifically expressed in testes identified a 10 bp A/T-rich motif that is identical to the translational control element (TCE). The TCE functions in the 5′ untranslated region of Mst(3)CGP mRNAs to repress translation, and it also functions in a heterologous gene to regulate transcription. We found that among genes with focused initiation patterns, the TCE is significantly enriched in core promoters of genes that are specifically expressed in testes but not in core promoters of genes that are specifically expressed in other tissues. The TCE is variably located in core promoters and is conserved in melanogaster subgroup species, but conservation dramatically drops in more distant species. In transgenic flies, short (300–400 bp) genomic regions containing a TCE directed testis-specific transcription of a reporter gene. Mutation of the TCE significantly reduced but did not abolish reporter gene transcription indicating that the TCE is important but not essential for transcription activation. Finally, mutation of testis-specific TFIID (tTFIID) subunits significantly reduced the transcription of a subset of endogenous TCE-containing but not TCE-lacking genes, suggesting that tTFIID activity is limited to TCE-containing genes but that tTFIID is not an obligatory regulator of TCE-containing genes. Thus, the TCE is a core promoter element in a subset of genes that are specifically expressed in testes. Furthermore, the TCE regulates transcription in the context of short genomic regions, from variable locations in the core promoter, and both dependently and independently of tTFIID. These findings set the stage for determining the mechanism by which the TCE regulates testis-specific transcription and understanding the dual role of the TCE in translational and transcriptional regulation. PMID:22984601
Katzenberger, Rebeccah J; Rach, Elizabeth A; Anderson, Ashley K; Ohler, Uwe; Wassarman, David A
2012-01-01
To investigate the importance of core promoter elements for tissue-specific transcription of RNA polymerase II genes, we examined testis-specific transcription in Drosophila melanogaster. Bioinformatic analyses of core promoter sequences from 190 genes that are specifically expressed in testes identified a 10 bp A/T-rich motif that is identical to the translational control element (TCE). The TCE functions in the 5' untranslated region of Mst(3)CGP mRNAs to repress translation, and it also functions in a heterologous gene to regulate transcription. We found that among genes with focused initiation patterns, the TCE is significantly enriched in core promoters of genes that are specifically expressed in testes but not in core promoters of genes that are specifically expressed in other tissues. The TCE is variably located in core promoters and is conserved in melanogaster subgroup species, but conservation dramatically drops in more distant species. In transgenic flies, short (300-400 bp) genomic regions containing a TCE directed testis-specific transcription of a reporter gene. Mutation of the TCE significantly reduced but did not abolish reporter gene transcription indicating that the TCE is important but not essential for transcription activation. Finally, mutation of testis-specific TFIID (tTFIID) subunits significantly reduced the transcription of a subset of endogenous TCE-containing but not TCE-lacking genes, suggesting that tTFIID activity is limited to TCE-containing genes but that tTFIID is not an obligatory regulator of TCE-containing genes. Thus, the TCE is a core promoter element in a subset of genes that are specifically expressed in testes. Furthermore, the TCE regulates transcription in the context of short genomic regions, from variable locations in the core promoter, and both dependently and independently of tTFIID. These findings set the stage for determining the mechanism by which the TCE regulates testis-specific transcription and understanding the dual role of the TCE in translational and transcriptional regulation.
Gross, G; Snel, J; Boekhorst, J; Smits, M A; Kleerebezem, M
2010-03-01
Recently, we have identified the mannose-specific adhesin encoding gene (msa) of Lactobacillus plantarum. In the current study, structure and function of this potentially probiotic effector gene were further investigated, exploring genetic diversity of msa in L. plantarum in relation to mannose adhesion capacity. The results demonstrate that there is considerable variation in quantitative in vitro mannose adhesion capacity, which is paralleled by msa gene sequence variation. The msa genes of different L. plantarum strains encode proteins with variable domain composition. Construction of L. plantarum 299v mutant strains revealed that the msa gene product is the key-protein for mannose adhesion, also in a strain with high mannose adhering capacity. However, no straightforward correlation between adhesion capacity and domain composition of Msa in L. plantarum could be identified. Nevertheless, differences in Msa sequences in combination with variable genetic background of specific bacterial strains appears to determine mannose adhesion capacity and potentially affects probiotic properties. These findings exemplify the strain-specificity of probiotic characteristics and illustrate the need for careful and molecular selection of new candidate probiotics.
Mori, Masayuki; Higuchi, Keiichi; Sakurai, Akihiro; Tabara, Yasuharu; Miki, Tetsuro; Nose, Hiroshi
2009-01-01
Habitual exercise training, including a high-intensity interval walking programme, improves cardiorespiratory fitness and alleviates lifestyle-related diseases, such as obesity, hypertension and dyslipidaemia. However, the extent of improvement has been shown to differ substantially among individuals for various exercise regimens. A body of literature has demonstrated that gene polymorphisms could account for the inter-individual variability in the improvement of risk factors for lifestyle-related diseases following exercise training. However, the fractions of the variability explained by the polymorphisms are small (∼5%). Also, it is likely that the effects of gene polymorphisms differ with exercise regimens and subject characteristics. These observations suggest the necessity for further studies to exhaustively identify such gene polymorphisms. More importantly, the physiological and molecular genetic mechanisms by which gene polymorphisms interact with exercise to influence the improvements of risk factors for lifestyle-related diseases differentially remain to be clarified. A better understanding of these issues should lead to more effective integration of exercise to optimize the treatment and management of individuals with lifestyle-related diseases. PMID:19736300
Unifying measures of gene function and evolution.
Wolf, Yuri I; Carmel, Liran; Koonin, Eugene V
2006-06-22
Recent genome analyses revealed intriguing correlations between variables characterizing the functioning of a gene, such as expression level (EL), connectivity of genetic and protein-protein interaction networks, and knockout effect, and variables describing gene evolution, such as sequence evolution rate (ER) and propensity for gene loss. Typically, variables within each of these classes are positively correlated, e.g. products of highly expressed genes also have a propensity to be involved in many protein-protein interactions, whereas variables between classes are negatively correlated, e.g. highly expressed genes, on average, evolve slower than weakly expressed genes. Here, we describe principal component (PC) analysis of seven genome-related variables and propose biological interpretations for the first three PCs. The first PC reflects a gene's 'importance', or the 'status' of a gene in the genomic community, with positive contributions from knockout lethality, EL, number of protein-protein interaction partners and the number of paralogues, and negative contributions from sequence ER and gene loss propensity. The next two PCs define a plane that seems to reflect the functional and evolutionary plasticity of a gene. Specifically, PC2 can be interpreted as a gene's 'adaptability' whereby genes with high adaptability readily duplicate, have many genetic interaction partners and tend to be non-essential. PC3 also might reflect the role of a gene in organismal adaptation albeit with a negative rather than a positive contribution of genetic interactions; we provisionally designate this PC 'reactivity'. The interpretation of PC2 and PC3 as measures of a gene's plasticity is compatible with the observation that genes with high values of these PCs tend to be expressed in a condition- or tissue-specific manner. Functional classes of genes substantially vary in status, adaptability and reactivity, with the highest status characteristic of the translation system and cytoskeletal proteins, highest adaptability seen in cellular processes and signalling genes, and top reactivity characteristic of metabolic enzymes.
True, Lawrence D
2014-03-01
Paralleling the growth of ever more cost efficient methods to sequence the whole genome in minute fragments of tissue has been the identification of increasingly numerous molecular abnormalities in cancers--mutations, amplifications, insertions and deletions of genes, and patterns of differential gene expression, i.e., overexpression of growth factors and underexpression of tumor suppressor genes. These abnormalities can be translated into assays to be used in clinical decision making. In general terms, the result of such an assay is subject to a large number of variables regarding the characteristics of the available sample, particularities of the used assay, and the interpretation of the results. This review discusses the effects of these variables on assays of tissue-based biomarkers, classified by macromolecule--DNA, RNA (including micro RNA, messenger RNA, long noncoding RNA, protein, and phosphoprotein). Since the majority of clinically applicable biomarkers are immunohistochemically detectable proteins this review focuses on protein biomarkers. However, the principles outlined are mostly applicable to any other analyte. A variety of preanalytical variables impacts on the results obtained, including analyte stability (which is different for different analytes, i.e., DNA, RNA, or protein), period of warm and of cold ischemia, fixation time, tissue processing, sample storage time, and storage conditions. In addition, assay variables play an important role, including reagent specificity (notably but not uniquely an issue concerning antibodies used in immunohistochemistry), technical components of the assay, quantitation, and assay interpretation. Finally, appropriateness of an assay for clinical application is an important issue. Reference is made to publicly available guidelines to improve on biomarker development in general and requirements for clinical use in particular. Strategic goals are formulated in order to improve on the quality of biomarker reporting, including issues of analyte quality, experimental detail, assay efficiency and precision, and assay appropriateness.
Promoter architecture dictates cell-to-cell variability in gene expression.
Jones, Daniel L; Brewster, Robert C; Phillips, Rob
2014-12-19
Variability in gene expression among genetically identical cells has emerged as a central preoccupation in the study of gene regulation; however, a divide exists between the predictions of molecular models of prokaryotic transcriptional regulation and genome-wide experimental studies suggesting that this variability is indifferent to the underlying regulatory architecture. We constructed a set of promoters in Escherichia coli in which promoter strength, transcription factor binding strength, and transcription factor copy numbers are systematically varied, and used messenger RNA (mRNA) fluorescence in situ hybridization to observe how these changes affected variability in gene expression. Our parameter-free models predicted the observed variability; hence, the molecular details of transcription dictate variability in mRNA expression, and transcriptional noise is specifically tunable and thus represents an evolutionarily accessible phenotypic parameter. Copyright © 2014, American Association for the Advancement of Science.
Improved Sparse Multi-Class SVM and Its Application for Gene Selection in Cancer Classification
Huang, Lingkang; Zhang, Hao Helen; Zeng, Zhao-Bang; Bushel, Pierre R.
2013-01-01
Background Microarray techniques provide promising tools for cancer diagnosis using gene expression profiles. However, molecular diagnosis based on high-throughput platforms presents great challenges due to the overwhelming number of variables versus the small sample size and the complex nature of multi-type tumors. Support vector machines (SVMs) have shown superior performance in cancer classification due to their ability to handle high dimensional low sample size data. The multi-class SVM algorithm of Crammer and Singer provides a natural framework for multi-class learning. Despite its effective performance, the procedure utilizes all variables without selection. In this paper, we propose to improve the procedure by imposing shrinkage penalties in learning to enforce solution sparsity. Results The original multi-class SVM of Crammer and Singer is effective for multi-class classification but does not conduct variable selection. We improved the method by introducing soft-thresholding type penalties to incorporate variable selection into multi-class classification for high dimensional data. The new methods were applied to simulated data and two cancer gene expression data sets. The results demonstrate that the new methods can select a small number of genes for building accurate multi-class classification rules. Furthermore, the important genes selected by the methods overlap significantly, suggesting general agreement among different variable selection schemes. Conclusions High accuracy and sparsity make the new methods attractive for cancer diagnostics with gene expression data and defining targets of therapeutic intervention. Availability: The source MATLAB code are available from http://math.arizona.edu/~hzhang/software.html. PMID:23966761
Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data
Racle, Julien; de Jonge, Kaat; Baumgaertner, Petra; Speiser, Daniel E
2017-01-01
Immune cells infiltrating tumors can have important impact on tumor progression and response to therapy. We present an efficient algorithm to simultaneously estimate the fraction of cancer and immune cell types from bulk tumor gene expression data. Our method integrates novel gene expression profiles from each major non-malignant cell type found in tumors, renormalization based on cell-type-specific mRNA content, and the ability to consider uncharacterized and possibly highly variable cell types. Feasibility is demonstrated by validation with flow cytometry, immunohistochemistry and single-cell RNA-Seq analyses of human melanoma and colorectal tumor specimens. Altogether, our work not only improves accuracy but also broadens the scope of absolute cell fraction predictions from tumor gene expression data, and provides a unique novel experimental benchmark for immunogenomics analyses in cancer research (http://epic.gfellerlab.org). PMID:29130882
Yancopoulos, G D; Blackwell, T K; Suh, H; Hood, L; Alt, F W
1986-01-31
We have recently proposed that a common recombinase performs all of the many variable region gene assembly events in B and T cells, and that the specificity of these joining events is mediated by regulating the "accessibility" of the involved gene segments. To test this possibility, we have introduced "accessible" T cell receptor (TCR) variable region gene segments into a pre-B cell line capable of recombining endogenous and transfected immunoglobulin (Ig) variable region gene segments. Although the corresponding "inaccessible" endogenous TCR gene segments do not rearrange in this line or in B cells in general, the introduced TCR gene segments join very frequently and, in fact, closely resemble introduced Ig gene segments in their recombination characteristics. These observations suggest a new role for conventional Ig transcriptional enhancers--recombinational enhancement. Our studies provide insight into additional aspects of the joining mechanism such as N region insertion, aberrant joining, and recombination-recognition sequence requirements for joining.
Gottlieb, Assaf; Daneshjou, Roxana; DeGorter, Marianne; Bourgeois, Stephane; Svensson, Peter J; Wadelius, Mia; Deloukas, Panos; Montgomery, Stephen B; Altman, Russ B
2017-11-24
Genome-wide association studies are useful for discovering genotype-phenotype associations but are limited because they require large cohorts to identify a signal, which can be population-specific. Mapping genetic variation to genes improves power and allows the effects of both protein-coding variation as well as variation in expression to be combined into "gene level" effects. Previous work has shown that warfarin dose can be predicted using information from genetic variation that affects protein-coding regions. Here, we introduce a method that improves dose prediction by integrating tissue-specific gene expression. In particular, we use drug pathways and expression quantitative trait loci knowledge to impute gene expression-on the assumption that differential expression of key pathway genes may impact dose requirement. We focus on 116 genes from the pharmacokinetic and pharmacodynamic pathways of warfarin within training and validation sets comprising both European and African-descent individuals. We build gene-tissue signatures associated with warfarin dose in a cohort-specific manner and identify a signature of 11 gene-tissue pairs that significantly augments the International Warfarin Pharmacogenetics Consortium dosage-prediction algorithm in both populations. Our results demonstrate that imputed expression can improve dose prediction and bridge population-specific compositions. MATLAB code is available at https://github.com/assafgo/warfarin-cohort.
Familial cancer syndromes and clusters.
Birch, J M
1994-07-01
The study of rare families in which a variety of cancers occur, usually at an early age and with patterns consistent with a common hereditary mechanism, has contributed much to our understanding of the process of carcinogenesis. So far, genes identified as having a role in cancer predisposition in these families have also been important in the histogenesis of sporadic cancers. In the two most clearly defined cancer family syndromes, the Li-Fraumeni syndrome and Lynch syndrome II, the genes involved predispose to diverse but specific constellations of cancers. Genes associated with site-specific familial cancer clusters may also give rise to increased susceptibility to other cancers, and site-specific clusters may represent one end of a spectrum. A consistent feature of familial cancer syndromes is the variable expression within and between families. A challenge for the future will be to determine other factors which may interact with the principal genes involved, giving rise to this variability.
Herrgård, Markus J.
2014-01-01
High-cell-density fermentation for industrial production of chemicals can impose numerous stresses on cells due to high substrate, product, and by-product concentrations; high osmolarity; reactive oxygen species; and elevated temperatures. There is a need to develop platform strains of industrial microorganisms that are more tolerant toward these typical processing conditions. In this study, the growth of six industrially relevant strains of Escherichia coli was characterized under eight stress conditions representative of fed-batch fermentation, and strains W and BL21(DE3) were selected as platforms for transposon (Tn) mutagenesis due to favorable resistance characteristics. Selection experiments, followed by either targeted or genome-wide next-generation-sequencing-based Tn insertion site determination, were performed to identify mutants with improved growth properties under a subset of three stress conditions and two combinations of individual stresses. A subset of the identified loss-of-function mutants were selected for a combinatorial approach, where strains with combinations of two and three gene deletions were systematically constructed and tested for single and multistress resistance. These approaches allowed identification of (i) strain-background-specific stress resistance phenotypes, (ii) novel gene deletion mutants in E. coli that confer single and multistress resistance in a strain-background-dependent manner, and (iii) synergistic effects of multiple gene deletions that confer improved resistance over single deletions. The results of this study underscore the suboptimality and strain-specific variability of the genetic network regulating growth under stressful conditions and suggest that further exploration of the combinatorial gene deletion space in multiple strain backgrounds is needed for optimizing strains for microbial bioprocessing applications. PMID:25085490
Boone, Deborah R; Leek, Jeanna M; Falduto, Michael T; Torres, Karen E O; Sell, Stacy L; Parsley, Margaret A; Cowart, Jeremy C; Uchida, Tatsuo; Micci, Maria-Adelaide; DeWitt, Douglas S; Prough, Donald S; Hellmich, Helen L
2017-01-01
Virally mediated RNA interference (RNAi) to knock down injury-induced genes could improve functional outcome after traumatic brain injury (TBI); however, little is known about the consequences of gene knockdown on downstream cell signaling pathways and how RNAi influences neurodegeneration and behavior. Here, we assessed the effects of adeno-associated virus (AAV) siRNA vectors that target two genes with opposing roles in TBI pathogenesis: the allegedly detrimental neuronal nitric oxide synthase (nNOS) and the potentially protective glutathione peroxidase 1 (GPx-1). In rat hippocampal progenitor cells, three siRNAs that target different regions of each gene (nNOS, GPx-1) effectively knocked down gene expression. However, in vivo, in our rat model of fluid percussion brain injury, the consequences of AAV-siRNA were variable. One nNOS siRNA vector significantly reduced the number of degenerating hippocampal neurons and showed a tendency to improve working memory. GPx-1 siRNA treatment did not alter TBI-induced neurodegeneration or working memory deficits. Nevertheless, microarray analysis of laser captured, virus-infected neurons showed that knockdown of nNOS or GPx-1 was specific and had broad effects on downstream genes. Since nNOS knockdown only modestly ameliorated TBI-induced working memory deficits, despite widespread genomic changes, manipulating expression levels of single genes may not be sufficient to alter functional outcome after TBI.
Comprehensive analyses of tissue-specific networks with implications to psychiatric diseases
Lin, Guan Ning; Corominas, Roser; Nam, Hyun-Jun; Urresti, Jorge; Iakoucheva, Lilia M.
2017-01-01
Recent advances in genome sequencing and “omics” technologies are opening new opportunities for improving diagnosis and treatment of human diseases. The precision medicine initiative in particular aims at developing individualized treatment options that take into account individual variability in genes and environment of each person. Systems biology approaches that group genes, transcripts and proteins into functionally meaningful networks will play crucial role in the future of personalized medicine. They will allow comparison of healthy and disease-affected tissues and organs from the same individual, as well as between healthy and disease-afflicted individuals. However, the field faces a multitude of challenges ranging from data integration to statistical and combinatorial issues in data analyses. This chapter describes computational approaches developed by us and the others to tackle challenges in tissue-specific network analyses, with the main focus on psychiatric diseases. PMID:28849569
Saini, Jasmine; Hershberg, Uri
2015-01-01
The exceptional ability of B cells to diversify through somatic mutation and improve affinity of the repertoire towards the antigens is the cornerstone of adaptive immunity. Somatic mutation is not evenly distributed and exhibits certain micro-sequence specificities. We show here that the combination of somatic mutation targeting and the codon usage in human B cell receptor (BCR) Variable (V) genes create expected patterns of mutation and post mutation changes that are focused on their complementarity determining regions (CDR). T cell V genes are also skewed in targeting mutations but to a lesser extent and are lacking the codon usage bias observed in BCRs. This suggests that the observed skew in T cell receptors is due to their amino acid usage, which is similar to that of BCRs. The mutation targeting and the codon bias allow B cell CDRs to diversify by specifically accumulating nonconservative changes. We counted the distribution of mutations to CDR in 4 different human datasets. In all four cases we found that the number of actual mutations in the CDR correlated significantly with the V gene mutation biases to the CDR predicted by our models. Finally, it appears that the mutation bias in V genes indeed relates to their long-term survival in actual human repertoires. We observed that resting repertoires of B cells overexpressed V genes that were especially biased towards focused mutation and change in the CDR. This bias in V gene usage was somewhat relaxed at the height of the immune response to a vaccine, presumably because of the need for a wider diversity in a primary response. However, older patients did not retain this flexibility and were biased towards using only highly skewed V genes at all stages of their response. PMID:25660968
Saini, Jasmine; Hershberg, Uri
2015-05-01
The exceptional ability of B cells to diversify through somatic mutation and improve affinity of the repertoire toward the antigens is the cornerstone of adaptive immunity. Somatic mutation is not evenly distributed and exhibits certain micro-sequence specificities. We show here that the combination of somatic mutation targeting and the codon usage in human B cell receptor (BCR) Variable (V) genes create expected patterns of mutation and post mutation changes that are focused on their complementarity determining regions (CDR). T cell V genes are also skewed in targeting mutations but to a lesser extent and are lacking the codon usage bias observed in BCRs. This suggests that the observed skew in T cell receptors is due to their amino acid usage, which is similar to that of BCRs. The mutation targeting and the codon bias allow B cell CDRs to diversify by specifically accumulating nonconservative changes. We counted the distribution of mutations to CDR in 4 different human datasets. In all four cases we found that the number of actual mutations in the CDR correlated significantly with the V gene mutation biases to the CDR predicted by our models. Finally, it appears that the mutation bias in V genes indeed relates to their long-term survival in actual human repertoires. We observed that resting repertoires of B cells overexpressed V genes that were especially biased toward focused mutation and change in the CDR. This bias in V gene usage was somewhat relaxed at the height of the immune response to a vaccine, presumably because of the need for a wider diversity in a primary response. However, older patients did not retain this flexibility and were biased toward using only highly skewed V genes at all stages of their response. Copyright © 2015 Elsevier Ltd. All rights reserved.
Li, Ziyi; Safo, Sandra E; Long, Qi
2017-07-11
Sparse principal component analysis (PCA) is a popular tool for dimensionality reduction, pattern recognition, and visualization of high dimensional data. It has been recognized that complex biological mechanisms occur through concerted relationships of multiple genes working in networks that are often represented by graphs. Recent work has shown that incorporating such biological information improves feature selection and prediction performance in regression analysis, but there has been limited work on extending this approach to PCA. In this article, we propose two new sparse PCA methods called Fused and Grouped sparse PCA that enable incorporation of prior biological information in variable selection. Our simulation studies suggest that, compared to existing sparse PCA methods, the proposed methods achieve higher sensitivity and specificity when the graph structure is correctly specified, and are fairly robust to misspecified graph structures. Application to a glioblastoma gene expression dataset identified pathways that are suggested in the literature to be related with glioblastoma. The proposed sparse PCA methods Fused and Grouped sparse PCA can effectively incorporate prior biological information in variable selection, leading to improved feature selection and more interpretable principal component loadings and potentially providing insights on molecular underpinnings of complex diseases.
Shafie, Suraiya M.; Barria von-Bischhoffshausen, Fernando R.; Bateman, J. Bronwyn
2006-01-01
PURPOSE To document intrafamilial and interocular phenotypic variability of autosomal dominant cataract (ADC). DESIGN Prospective observational case series. METHODS We performed ophthalmologic examination in four Chilean ADC families. RESULTS The families exhibited variability with respect to morphology, location with the lens, color and density of cataracts among affected members. We documented asymmetry between eyes in the morphology, location within the lens, color and density of cataracts, and a variable rate of progression. CONCLUSIONS The cataracts in these families exhibit wide intrafamilial and interocular phenotypic variability, supporting the premise that the mutated genes are expressed differentially in individuals and between eyes; other genes or environmental factors may be the bases for this variability. Marked progression among some family members underscores the variable clinical course of a common mutation within a family. Like retinitis pigmentosa, classification of ADC will be most useful if based on the gene and specific mutation. PMID:16564818
Improved prediction of biochemical recurrence after radical prostatectomy by genetic polymorphisms.
Morote, Juan; Del Amo, Jokin; Borque, Angel; Ars, Elisabet; Hernández, Carlos; Herranz, Felipe; Arruza, Antonio; Llarena, Roberto; Planas, Jacques; Viso, María J; Palou, Joan; Raventós, Carles X; Tejedor, Diego; Artieda, Marta; Simón, Laureano; Martínez, Antonio; Rioja, Luis A
2010-08-01
Single nucleotide polymorphisms are inherited genetic variations that can predispose or protect individuals against clinical events. We hypothesized that single nucleotide polymorphism profiling may improve the prediction of biochemical recurrence after radical prostatectomy. We performed a retrospective, multi-institutional study of 703 patients treated with radical prostatectomy for clinically localized prostate cancer who had at least 5 years of followup after surgery. All patients were genotyped for 83 prostate cancer related single nucleotide polymorphisms using a low density oligonucleotide microarray. Baseline clinicopathological variables and single nucleotide polymorphisms were analyzed to predict biochemical recurrence within 5 years using stepwise logistic regression. Discrimination was measured by ROC curve AUC, specificity, sensitivity, predictive values, net reclassification improvement and integrated discrimination index. The overall biochemical recurrence rate was 35%. The model with the best fit combined 8 covariates, including the 5 clinicopathological variables prostate specific antigen, Gleason score, pathological stage, lymph node involvement and margin status, and 3 single nucleotide polymorphisms at the KLK2, SULT1A1 and TLR4 genes. Model predictive power was defined by 80% positive predictive value, 74% negative predictive value and an AUC of 0.78. The model based on clinicopathological variables plus single nucleotide polymorphisms showed significant improvement over the model without single nucleotide polymorphisms, as indicated by 23.3% net reclassification improvement (p = 0.003), integrated discrimination index (p <0.001) and likelihood ratio test (p <0.001). Internal validation proved model robustness (bootstrap corrected AUC 0.78, range 0.74 to 0.82). The calibration plot showed close agreement between biochemical recurrence observed and predicted probabilities. Predicting biochemical recurrence after radical prostatectomy based on clinicopathological data can be significantly improved by including patient genetic information. Copyright (c) 2010 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
Roles of the plasticity regions of Helicobacter pylori in gastroduodenal pathogenesis.
Yamaoka, Yoshio
2008-05-01
Putative virulence genes of Helicobacter pylori are generally classified into three categories: strain-specific genes, phase-variable genes and genes with variable structures/genotypes. Among these, there has recently been considerable interest in strain-specific genes found outside of the cag pathogenicity island, especially genes in the plasticity regions. Nearly half of the strain-specific genes of H. pylori are located in the plasticity regions in strains 26695 and J99. Strain HPAG1, however, seems to lack a typical plasticity region; instead it has 43 HPAG1-specific genes which are either undetectable or incompletely represented in the genomes of strains 26695 and J99. Recent studies showed that certain genes or combination of genes in this region may play important roles in the pathogenesis of H. pylori-associated gastroduodenal diseases. Most previous studies have focused on the plasticity region in strain J99 (jhp0914-jhp0961) and the jhp0947 gene and the duodenal ulcer promoting (dupA) gene are good candidate markers for gastroduodenal diseases although there are some paradoxical findings. The jhp0947 gene is reported to be associated with an increased risk of both duodenal ulcers and gastric cancers, whereas the dupA gene, which encompasses jhp0917 and jhp0918, is reported to be associated with an increased risk of duodenal ulcers and protection against gastric cancers. In addition, recent studies showed that approximately 10-30 % of clinical isolates possess a 16.3 kb type IV secretion apparatus (tfs3) in the plasticity region. Studies on the plasticity region have only just begun, and further investigation is necessary to elucidate the roles of genes in this region in gastroduodenal pathogenesis.
Roles of the plasticity regions of Helicobacter pylori in gastroduodenal pathogenesis
Yamaoka, Yoshio
2010-01-01
Putative virulence genes of Helicobacter pylori are generally classified into three categories: strain-specific genes, phase-variable genes and genes with variable structures/genotypes. Among these, there has recently been considerable interest in strain-specific genes found outside of the cag pathogenicity island, especially genes in the plasticity regions. Nearly half of the strain-specific genes of H. pylori are located in the plasticity regions in strains 26695 and J99. Strain HPAG1, however, seems to lack a typical plasticity region; instead it has 43 HPAG1-specific genes which are either undetectable or incompletely represented in the genomes of strains 26695 and J99. Recent studies showed that certain genes or combination of genes in this region may play important roles in the pathogenesis of H. pylori-associated gastroduodenal diseases. Most previous studies have focused on the plasticity region in strain J99 (jhp0914–jhp0961) and the jhp0947 gene and the duodenal ulcer promoting (dupA) gene are good candidate markers for gastroduodenal diseases although there are some paradoxical findings. The jhp0947 gene is reported to be associated with an increased risk of both duodenal ulcers and gastric cancers, whereas the dupA gene, which encompasses jhp0917 and jhp0918, is reported to be associated with an increased risk of duodenal ulcers and protection against gastric cancers. In addition, recent studies showed that approximately 10–30% of clinical isolates possess a 16.3 kb type IV secretion apparatus (tfs3) in the plasticity region. Studies on the plasticity region have only just begun, and further investigation is necessary to elucidate the roles of genes in this region in gastroduodenal pathogenesis. PMID:18436586
Reynolds, Chandra A; Gatz, Margaret; Christensen, Kaare; Christiansen, Lene; Dahl Aslan, Anna K; Kaprio, Jaakko; Korhonen, Tellervo; Kremen, William S; Krueger, Robert; McGue, Matt; Neiderhiser, Jenae M; Pedersen, Nancy L
2016-01-01
Despite emerging interest in gene-environment interaction (GxE) effects, there is a dearth of studies evaluating its potential relevance apart from specific hypothesized environments and biometrical variance trends. Using a monozygotic within-pair approach, we evaluated evidence of G×E for body mass index (BMI), depressive symptoms, and cognition (verbal, spatial, attention, working memory, perceptual speed) in twin studies from four countries. We also evaluated whether APOE is a 'variability gene' across these measures and whether it partly represents the 'G' in G×E effects. In all three domains, G×E effects were pervasive across country and gender, with small-to-moderate effects. Age-cohort trends were generally stable for BMI and depressive symptoms; however, they were variable-with both increasing and decreasing age-cohort trends-for different cognitive measures. Results also suggested that APOE may represent a 'variability gene' for depressive symptoms and spatial reasoning, but not for BMI or other cognitive measures. Hence, additional genes are salient beyond APOE.
Gene Circuit Analysis of the Terminal Gap Gene huckebein
Ashyraliyev, Maksat; Siggens, Ken; Janssens, Hilde; Blom, Joke; Akam, Michael; Jaeger, Johannes
2009-01-01
The early embryo of Drosophila melanogaster provides a powerful model system to study the role of genes in pattern formation. The gap gene network constitutes the first zygotic regulatory tier in the hierarchy of the segmentation genes involved in specifying the position of body segments. Here, we use an integrative, systems-level approach to investigate the regulatory effect of the terminal gap gene huckebein (hkb) on gap gene expression. We present quantitative expression data for the Hkb protein, which enable us to include hkb in gap gene circuit models. Gap gene circuits are mathematical models of gene networks used as computational tools to extract regulatory information from spatial expression data. This is achieved by fitting the model to gap gene expression patterns, in order to obtain estimates for regulatory parameters which predict a specific network topology. We show how considering variability in the data combined with analysis of parameter determinability significantly improves the biological relevance and consistency of the approach. Our models are in agreement with earlier results, which they extend in two important respects: First, we show that Hkb is involved in the regulation of the posterior hunchback (hb) domain, but does not have any other essential function. Specifically, Hkb is required for the anterior shift in the posterior border of this domain, which is now reproduced correctly in our models. Second, gap gene circuits presented here are able to reproduce mutants of terminal gap genes, while previously published models were unable to reproduce any null mutants correctly. As a consequence, our models now capture the expression dynamics of all posterior gap genes and some variational properties of the system correctly. This is an important step towards a better, quantitative understanding of the developmental and evolutionary dynamics of the gap gene network. PMID:19876378
Gene circuit analysis of the terminal gap gene huckebein.
Ashyraliyev, Maksat; Siggens, Ken; Janssens, Hilde; Blom, Joke; Akam, Michael; Jaeger, Johannes
2009-10-01
The early embryo of Drosophila melanogaster provides a powerful model system to study the role of genes in pattern formation. The gap gene network constitutes the first zygotic regulatory tier in the hierarchy of the segmentation genes involved in specifying the position of body segments. Here, we use an integrative, systems-level approach to investigate the regulatory effect of the terminal gap gene huckebein (hkb) on gap gene expression. We present quantitative expression data for the Hkb protein, which enable us to include hkb in gap gene circuit models. Gap gene circuits are mathematical models of gene networks used as computational tools to extract regulatory information from spatial expression data. This is achieved by fitting the model to gap gene expression patterns, in order to obtain estimates for regulatory parameters which predict a specific network topology. We show how considering variability in the data combined with analysis of parameter determinability significantly improves the biological relevance and consistency of the approach. Our models are in agreement with earlier results, which they extend in two important respects: First, we show that Hkb is involved in the regulation of the posterior hunchback (hb) domain, but does not have any other essential function. Specifically, Hkb is required for the anterior shift in the posterior border of this domain, which is now reproduced correctly in our models. Second, gap gene circuits presented here are able to reproduce mutants of terminal gap genes, while previously published models were unable to reproduce any null mutants correctly. As a consequence, our models now capture the expression dynamics of all posterior gap genes and some variational properties of the system correctly. This is an important step towards a better, quantitative understanding of the developmental and evolutionary dynamics of the gap gene network.
Gene Expression Signatures Based on Variability can Robustly Predict Tumor Progression and Prognosis
Dinalankara, Wikum; Bravo, Héctor Corrada
2015-01-01
Gene expression signatures are commonly used to create cancer prognosis and diagnosis methods, yet only a small number of them are successfully deployed in the clinic since many fail to replicate performance on subsequent validation. A primary reason for this lack of reproducibility is the fact that these signatures attempt to model the highly variable and unstable genomic behavior of cancer. Our group recently introduced gene expression anti-profiles as a robust methodology to derive gene expression signatures based on the observation that while gene expression measurements are highly heterogeneous across tumors of a specific cancer type relative to the normal tissue, their degree of deviation from normal tissue expression in specific genes involved in tissue differentiation is a stable tumor mark that is reproducible across experiments and cancer types. Here we show that constructing gene expression signatures based on variability and the anti-profile approach yields classifiers capable of successfully distinguishing benign growths from cancerous growths based on deviation from normal expression. We then show that this same approach generates stable and reproducible signatures that predict probability of relapse and survival based on tumor gene expression. These results suggest that using the anti-profile framework for the discovery of genomic signatures is an avenue leading to the development of reproducible signatures suitable for adoption in clinical settings. PMID:26078586
Lessons learned: Optimization of a murine small bowel resection model
Taylor, Janice A.; Martin, Colin A.; Nair, Rajalakshmi; Guo, Jun; Erwin, Christopher R.; Warner, Brad W.
2008-01-01
Background/Purpose Central to the use of murine models of disease is the ability to derive reproducible data. The purpose of this study was to determine factors contributing to variability in our murine model of small bowel resection (SBR). Methods Male C57Bl/6 mice were randomized to sham or 50% SBR. The effect of housing type (pathogen-free versus standard housing), nutrition (reconstituted powder versus tube feeding formulation), and correlates of intestinal morphology with gene expression changes were investigated Multiple linear regression modeling or one-way ANOVA was used for data analysis. Results Pathogen-free mice had significantly shorter ileal villi at baseline and demonstrated greater villus growth after SBR compared to mice housed in standard rooms. Food type did not affect adaptation. Gene expression changes were more consistent and significant in isolated crypt cells that demonstrated adaptive growth when compared with crypts that did not deepen after SBR. Conclusion Maintenance of mice in pathogen-free conditions and restricting gene expression analysis to individual animals exhibiting morphologic adaptation enhances sensitivity and specificity of data derived from this model. These refinements will minimize experimental variability and lead to improved understanding of the complex process of intestinal adaptation. PMID:18558176
Pacifico, D; Alma, A; Bagnoli, B; Foissac, X; Pasquini, G; Tessitori, M; Marzachì, C
2009-06-01
Bois noir phytoplasma (BNp), widespread in wine-producing areas of Europe and endemic in France and Italy, is classified in the 16SrXII-A subgroup, whose members are referred to as Stolbur phytoplasmas. The 16S rDNA gene of Stolbur phytoplasma shows low variability, and few non-ribosomal genes are available as markers to assess variation among isolates. We used the Stolbur-specific stol-1H10 gene, encoding a putative membrane-exposed protein, to investigate genetic diversity of French and Italian BNp isolates from plants and insects. Amplification of stol-1H10 from infected grapevines, weeds, and Hyalesthes obsoletus produced fragments of three sizes, and restriction fragment length polymorphism analysis divided these amplicons further into 12 profiles (V1 to V12). French BNp isolates were more variable than Italian ones, and different profiles were present in infected grapevines from France and Italy. Isolate V3, most abundant among Italian affected grapes but present among French ones, was found in one Urtica dioica sample and in all H. obsoletus collected on this species. Four Italian-specific profiles were represented among infected Convolvulus arvensis, the most frequent of which (V12) was also detected in H. obsoletus collected on this species. Most of the variability in the stol-1H10 sequence was associated with type II on the tuf gene.
Quantifying Intrinsic and Extrinsic Variability in Stochastic Gene Expression Models
Singh, Abhyudai; Soltani, Mohammad
2013-01-01
Genetically identical cell populations exhibit considerable intercellular variation in the level of a given protein or mRNA. Both intrinsic and extrinsic sources of noise drive this variability in gene expression. More specifically, extrinsic noise is the expression variability that arises from cell-to-cell differences in cell-specific factors such as enzyme levels, cell size and cell cycle stage. In contrast, intrinsic noise is the expression variability that is not accounted for by extrinsic noise, and typically arises from the inherent stochastic nature of biochemical processes. Two-color reporter experiments are employed to decompose expression variability into its intrinsic and extrinsic noise components. Analytical formulas for intrinsic and extrinsic noise are derived for a class of stochastic gene expression models, where variations in cell-specific factors cause fluctuations in model parameters, in particular, transcription and/or translation rate fluctuations. Assuming mRNA production occurs in random bursts, transcription rate is represented by either the burst frequency (how often the bursts occur) or the burst size (number of mRNAs produced in each burst). Our analysis shows that fluctuations in the transcription burst frequency enhance extrinsic noise but do not affect the intrinsic noise. On the contrary, fluctuations in the transcription burst size or mRNA translation rate dramatically increase both intrinsic and extrinsic noise components. Interestingly, simultaneous fluctuations in transcription and translation rates arising from randomness in ATP abundance can decrease intrinsic noise measured in a two-color reporter assay. Finally, we discuss how these formulas can be combined with single-cell gene expression data from two-color reporter experiments for estimating model parameters. PMID:24391934
Quantifying intrinsic and extrinsic variability in stochastic gene expression models.
Singh, Abhyudai; Soltani, Mohammad
2013-01-01
Genetically identical cell populations exhibit considerable intercellular variation in the level of a given protein or mRNA. Both intrinsic and extrinsic sources of noise drive this variability in gene expression. More specifically, extrinsic noise is the expression variability that arises from cell-to-cell differences in cell-specific factors such as enzyme levels, cell size and cell cycle stage. In contrast, intrinsic noise is the expression variability that is not accounted for by extrinsic noise, and typically arises from the inherent stochastic nature of biochemical processes. Two-color reporter experiments are employed to decompose expression variability into its intrinsic and extrinsic noise components. Analytical formulas for intrinsic and extrinsic noise are derived for a class of stochastic gene expression models, where variations in cell-specific factors cause fluctuations in model parameters, in particular, transcription and/or translation rate fluctuations. Assuming mRNA production occurs in random bursts, transcription rate is represented by either the burst frequency (how often the bursts occur) or the burst size (number of mRNAs produced in each burst). Our analysis shows that fluctuations in the transcription burst frequency enhance extrinsic noise but do not affect the intrinsic noise. On the contrary, fluctuations in the transcription burst size or mRNA translation rate dramatically increase both intrinsic and extrinsic noise components. Interestingly, simultaneous fluctuations in transcription and translation rates arising from randomness in ATP abundance can decrease intrinsic noise measured in a two-color reporter assay. Finally, we discuss how these formulas can be combined with single-cell gene expression data from two-color reporter experiments for estimating model parameters.
Barbour, Elie K; Saade, Maya F; Sleiman, Fawwak T; Hamadeh, Shady K; Mouneimne, Youssef; Kassaifi, Zeina; Kayali, Ghazi; Harakeh, Steve; Jaber, Lina S; Shaib, Houssam A
2012-10-01
The purpose of this research is to optimize quantitatively the amplification of specific sperm genes in reference genomically characterized Saanen goat and to evaluate the standardized protocols applicability on sperms of uncharacterized genome of rural goats reared under subtropical environment for inclusion in future selection programs. The optimization of the protocols in Saanen sperms included three production genes (growth hormone (GH) exons 2, 3, and 4, αS1-casein (CSN1S1), and α-lactalbumin) and two health genes (MHC class II DRB and prion (PrP)). The optimization was based on varying the primers concentrations and the inclusion of a PCR cosolvent (Triton X). The impact of the studied variables on statistically significant increase in the yield of amplicons was noticed in four out of five (80%) optimized protocols, namely in those related to GH, CSN1S1, α-lactalbumin, and PrP genes (P < 0.05). There was no significant difference in the yield of amplicons related to MHC class II DRB gene, regardless of the variables used (P > 0.05). The applicability of the optimized protocols of Saanen sperm genes on amplification of uncharacterized rural goat sperms revealed a 100% success in tested individuals for amplification of GH, CSN1S1, α-lactalbumin, and MHC class II DRB genes and a 75% success for the PrP gene. The significant success in applicability of the Saanen quantitatively optimized protocols to other uncharacterized genome of rural goats allows for their inclusion in future selection, targeting the sustainability of this farming system in a subtropical environment and the improvement of the farmers livelihood.
Matus, José Tomás; Aquea, Felipe; Espinoza, Carmen; Vega, Andrea; Cavallini, Erika; Dal Santo, Silvia; Cañón, Paola; Rodríguez-Hoces de la Guardia, Amparo; Serrano, Jennifer; Tornielli, Giovanni Battista; Arce-Johnson, Patricio
2014-01-01
The RESPONSIVE TO DEHYDRATION 22 (RD22) gene is a molecular link between abscisic acid (ABA) signalling and abiotic stress responses. Its expression has been used as a reliable ABA early response marker. In Arabidopsis, the single copy RD22 gene possesses a BURP domain also located at the C-terminus of USP embryonic proteins and the beta subunit of polygalacturonases. In grapevine, a RD22 gene has been identified but putative paralogs are also found in the grape genome, possibly forming a large RD22 family in this species. In this work, we searched for annotations containing BURP domains in the Vitis vinifera genome. Nineteen proteins were defined by a comparative analysis between the two genome predictions and RNA-Seq data. These sequences were compared to other plant BURPs identified in previous genome surveys allowing us to reconceive group classifications based on phylogenetic relationships and protein motif occurrence. We observed a lineage-specific evolution of the RD22 family, with the biggest expansion in grapevine and poplar. In contrast, rice, sorghum and maize presented highly expanded monocot-specific groups. The Vitis RD22 group may have expanded from segmental duplications as most of its members are confined to a region in chromosome 4. The inspection of transcriptomic data revealed variable expression of BURP genes in vegetative and reproductive organs. Many genes were induced in specific tissues or by abiotic and biotic stresses. Three RD22 genes were further studied showing that they responded oppositely to ABA and to stress conditions. Our results show that the inclusion of RNA-Seq data is essential while describing gene families and improving gene annotations. Robust phylogenetic analyses including all BURP members from other sequenced species helped us redefine previous relationships that were erroneously established. This work provides additional evidence for RD22 genes serving as marker genes for different organs or stresses in grapevine.
Coalitional game theory as a promising approach to identify candidate autism genes.
Gupta, Anika; Sun, Min Woo; Paskov, Kelley Marie; Stockham, Nate Tyler; Jung, Jae-Yoon; Wall, Dennis Paul
2018-01-01
Despite mounting evidence for the strong role of genetics in the phenotypic manifestation of Autism Spectrum Disorder (ASD), the specific genes responsible for the variable forms of ASD remain undefined. ASD may be best explained by a combinatorial genetic model with varying epistatic interactions across many small effect mutations. Coalitional or cooperative game theory is a technique that studies the combined effects of groups of players, known as coalitions, seeking to identify players who tend to improve the performance--the relationship to a specific disease phenotype--of any coalition they join. This method has been previously shown to boost biologically informative signal in gene expression data but to-date has not been applied to the search for cooperative mutations among putative ASD genes. We describe our approach to highlight genes relevant to ASD using coalitional game theory on alteration data of 1,965 fully sequenced genomes from 756 multiplex families. Alterations were encoded into binary matrices for ASD (case) and unaffected (control) samples, indicating likely gene-disrupting, inherited mutations in altered genes. To determine individual gene contributions given an ASD phenotype, a "player" metric, referred to as the Shapley value, was calculated for each gene in the case and control cohorts. Sixty seven genes were found to have significantly elevated player scores and likely represent significant contributors to the genetic coordination underlying ASD. Using network and cross-study analysis, we found that these genes are involved in biological pathways known to be affected in the autism cases and that a subset directly interact with several genes known to have strong associations to autism. These findings suggest that coalitional game theory can be applied to large-scale genomic data to identify hidden yet influential players in complex polygenic disorders such as autism.
Cerruti Mainardi, Paola
2006-01-01
The Cri du Chat syndrome (CdCS) is a genetic disease resulting from a deletion of variable size occurring on the short arm of chromosome 5 (5p-). The incidence ranges from 1:15,000 to 1:50,000 live-born infants. The main clinical features are a high-pitched monochromatic cry, microcephaly, broad nasal bridge, epicanthal folds, micrognathia, abnormal dermatoglyphics, and severe psychomotor and mental retardation. Malformations, although not very frequent, may be present: cardiac, neurological and renal abnormalities, preauricular tags, syndactyly, hypospadias, and cryptorchidism. Molecular cytogenetic analysis has allowed a cytogenetic and phenotypic map of 5p to be defined, even if results from the studies reported up to now are not completely in agreement. Genotype-phenotype correlation studies showed a clinical and cytogenetic variability. The identification of phenotypic subsets associated with a specific size and type of deletion is of diagnostic and prognostic relevance. Specific growth and psychomotor development charts have been established. Two genes, Semaphorin F (SEMAF) and δ-catenin (CTNND2), which have been mapped to the "critical regions", are potentially involved in cerebral development and their deletion may be associated with mental retardation in CdCS patients. Deletion of the telomerase reverse transcriptase (hTERT) gene, localised to 5p15.33, could contribute to the phenotypic changes in CdCS. The critical regions were recently refined by using array comparative genomic hybridisation. The cat-like cry critical region was further narrowed using quantitative polymerase chain reaction (PCR) and three candidate genes were characterised in this region. The diagnosis is based on typical clinical manifestations. Karyotype analysis and, in doubtful cases, FISH analysis will confirm the diagnosis. There is no specific therapy for CdCS but early rehabilitative and educational interventions improve the prognosis and considerable progress has been made in the social adjustment of CdCS patients. PMID:16953888
Tian, Xin; Xin, Mingyuan; Luo, Jian; Liu, Mingyao; Jiang, Zhenran
2017-02-01
The selection of relevant genes for breast cancer metastasis is critical for the treatment and prognosis of cancer patients. Although much effort has been devoted to the gene selection procedures by use of different statistical analysis methods or computational techniques, the interpretation of the variables in the resulting survival models has been limited so far. This article proposes a new Random Forest (RF)-based algorithm to identify important variables highly related with breast cancer metastasis, which is based on the important scores of two variable selection algorithms, including the mean decrease Gini (MDG) criteria of Random Forest and the GeneRank algorithm with protein-protein interaction (PPI) information. The new gene selection algorithm can be called PPIRF. The improved prediction accuracy fully illustrated the reliability and high interpretability of gene list selected by the PPIRF approach.
Rodríguez-Blanco, Arturo; Lemos, Manuel L; Osorio, Carlos R
2016-08-01
Integrating conjugative elements (ICEs) of the SXT/R391 family have been identified in fish-isolated bacterial strains collected from marine aquaculture environments of the northwestern Iberian Peninsula. Here we analysed the variable regions of two ICEs, one preliminarily characterised in a previous study (ICEVscSpa3) and one newly identified (ICEPspSpa1). Bacterial strains harboring these ICEs were phylogenetically assigned to Vibrio scophthalmi and Pseudoalteromonas sp., thus constituting the first evidence of SXT/R391-like ICEs in the genus Pseudoalteromonas to date. Variable DNA regions, which confer element-specific properties to ICEs of this family, were characterised. Interestingly, the two ICEs contained 29 genes not found in variable DNA insertions of previously described ICEs. Most notably, variable gene content for ICEVscSpa3 showed similarity to genes potentially involved in housekeeping functions of replication, nucleotide metabolism and transcription. For these genes, closest homologues were found clustered in the genome of Pseudomonas psychrotolerans L19, suggesting a transfer as a block to ICEVscSpa3. Genes encoding antibiotic resistance, restriction modification systems and toxin/antitoxin systems were absent from hotspots of ICEVscSpa3. In contrast, the variable gene content of ICEPspSpa1 included genes involved in restriction/modification functions in two different hotspots and genes related to ICE maintenance. The present study unveils a relatively large number of novel genes in SXT/R391-ICEs, and demonstrates the major role of ICE elements as contributors to horizontal gene transfer.
Site-Specific Fat-1 Knock-In Enables Significant Decrease of n-6PUFAs/n-3PUFAs Ratio in Pigs
Li, Mengjing; Ouyang, Hongsheng; Yuan, Hongming; Li, Jianing; Xie, Zicong; Wang, Kankan; Yu, Tingting; Liu, Minghao; Chen, Xue; Tang, Xiaochun; Jiao, Huping; Pang, Daxin
2018-01-01
The fat-1 gene from Caenorhabditis elegans encodes a fatty acid desaturase which was widely studied due to its beneficial function of converting n-6 polyunsaturated fatty acids (n-6PUFAs) to n-3 polyunsaturated fatty acids (n-3PUFAs). To date, many fat-1 transgenic animals have been generated to study disease pathogenesis or improve meat quality. However, all of them were generated using a random integration method with variable transgene expression levels and the introduction of selectable marker genes often raise biosafety concern. To this end, we aimed to generate marker-free fat-1 transgenic pigs in a site-specific manner. The Rosa26 locus, first found in mouse embryonic stem cells, has become one of the most common sites for inserting transgenes due to its safe and ubiquitous expression. In our study, the fat-1 gene was inserted into porcine Rosa 26 (pRosa26) locus via Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/CRISPR-associated 9 (Cas9) system. The Southern blot analysis of our knock-in pigs indicated a single copy of the fat-1 gene at the pRosa26 locus. Furthermore, this single-copy fat-1 gene supported satisfactory expression in a variety of tissues in F1 generation pigs. Importantly, the gas chromatography analysis indicated that these fat-1 knock-in pigs exhibited a significant increase in the level of n-3PUFAs, leading to an obvious decrease in the n-6PUFAs/n-3PUFAs ratio from 9.36 to 2.12 (***P < 0.0001). Altogether, our fat-1 knock-in pigs hold great promise for improving the nutritional value of pork and serving as an animal model to investigate therapeutic effects of n-3PUFAs on various diseases. PMID:29563188
Barcoding and species recognition of opportunistic pathogens in Ochroconis and Verruconis.
Samerpitak, Kittipan; Gerrits van den Ende, Bert H G; Stielow, J Benjamin; Menken, Steph B J; de Hoog, G Sybren
2016-02-01
The genera Ochroconis and Verruconis (Sympoventuriaceae, Venturiales) have remarkably high molecular diversity despite relatively high degrees of phenotypic similarity. Tree topologies, inter-specific and intra-specific heterogeneities, barcoding gaps and reciprocal monophyly of all currently known species were analyzed. It was concluded that all currently used genes viz. SSU, ITS, LSU, ACT1, BT2, and TEF1 were unable to reach all 'gold standard' criteria of barcoding markers. They could nevertheless be used for reasonably reliable identification of species, because the markers, although variable, were associated with large inter-specific heterogeneity. Of the coding protein-genes, ACT1 revealed highest potentiality as barcoding marker in mostly all parts of the investigated sequence. SSU, LSU, ITS, and ACT1 yielded consistent monophyly in all investigated species, but only SSU and LSU generated clear barcoding gaps. For phylogeny, LSU was an informative marker, suitable to reconstruct gene-trees showing correct phylogenetic relationships. Cryptic species were revealed especially in complexes with very high intra-specific variability. When all these complexes will be taxonomically resolved, ACT1 will probably appear to be the most reliable barcoding gene for Ochroconis and Verruconis. Copyright © 2015 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Inoue, Kimiko; Oikawa, Mami; Kamimura, Satoshi; Ogonuki, Narumi; Nakamura, Toshinobu; Nakano, Toru; Abe, Kuniya; Ogura, Atsuo
2015-01-01
Although mammalian cloning by somatic cell nuclear transfer (SCNT) has been established in various species, the low developmental efficiency has hampered its practical applications. Treatment of SCNT-derived embryos with histone deacetylase (HDAC) inhibitors can improve their development, but the underlying mechanism is still unclear. To address this question, we analysed gene expression profiles of SCNT-derived 2-cell mouse embryos treated with trichostatin A (TSA), a potent HDAC inhibitor that is best used for mouse cloning. Unexpectedly, TSA had no effect on the numbers of aberrantly expressed genes or the overall gene expression pattern in the embryos. However, in-depth investigation by gene ontology and functional analyses revealed that TSA treatment specifically improved the expression of a small subset of genes encoding transcription factors and their regulatory factors, suggesting their positive involvement in de novo RNA synthesis. Indeed, introduction of one of such transcription factors, Spi-C, into the embryos at least partially mimicked the TSA-induced improvement in embryonic development by activating gene networks associated with transcriptional regulation. Thus, the effects of TSA treatment on embryonic gene expression did not seem to be stochastic, but more specific than expected, targeting genes that direct development and trigger zygotic genome activation at the 2-cell stage. PMID:25974394
Sabir, Jamal S. M.; Jansen, Robert K.; Arasappan, Dhivya; Calderon, Virginie; Noutahi, Emmanuel; Zheng, Chunfang; Park, Seongjun; Sabir, Meshaal J.; Baeshen, Mohammed N.; Hajrah, Nahid H.; Khiyami, Mohammad A.; Baeshen, Nabih A.; Obaid, Abdullah Y.; Al-Malki, Abdulrahman L.; Sankoff, David; El-Mabrouk, Nadia; Ruhlman, Tracey A.
2016-01-01
Alkaloid accumulation in plants is activated in response to stress, is limited in distribution and specific alkaloid repertoires are variable across taxa. Rauvolfioideae (Apocynaceae, Gentianales) represents a major center of structural expansion in the monoterpenoid indole alkaloids (MIAs) yielding thousands of unique molecules including highly valuable chemotherapeutics. The paucity of genome-level data for Apocynaceae precludes a deeper understanding of MIA pathway evolution hindering the elucidation of remaining pathway enzymes and the improvement of MIA availability in planta or in vitro. We sequenced the nuclear genome of Rhazya stricta (Apocynaceae, Rauvolfioideae) and present this high quality assembly in comparison with that of coffee (Rubiaceae, Coffea canephora, Gentianales) and others to investigate the evolution of genome-scale features. The annotated Rhazya genome was used to develop the community resource, RhaCyc, a metabolic pathway database. Gene family trees were constructed to identify homologs of MIA pathway genes and to examine their evolutionary history. We found that, unlike Coffea, the Rhazya lineage has experienced many structural rearrangements. Gene tree analyses suggest recent, lineage-specific expansion and diversification among homologs encoding MIA pathway genes in Gentianales and provide candidate sequences with the potential to close gaps in characterized pathways and support prospecting for new MIA production avenues. PMID:27653669
Sabir, Jamal S M; Jansen, Robert K; Arasappan, Dhivya; Calderon, Virginie; Noutahi, Emmanuel; Zheng, Chunfang; Park, Seongjun; Sabir, Meshaal J; Baeshen, Mohammed N; Hajrah, Nahid H; Khiyami, Mohammad A; Baeshen, Nabih A; Obaid, Abdullah Y; Al-Malki, Abdulrahman L; Sankoff, David; El-Mabrouk, Nadia; Ruhlman, Tracey A
2016-09-22
Alkaloid accumulation in plants is activated in response to stress, is limited in distribution and specific alkaloid repertoires are variable across taxa. Rauvolfioideae (Apocynaceae, Gentianales) represents a major center of structural expansion in the monoterpenoid indole alkaloids (MIAs) yielding thousands of unique molecules including highly valuable chemotherapeutics. The paucity of genome-level data for Apocynaceae precludes a deeper understanding of MIA pathway evolution hindering the elucidation of remaining pathway enzymes and the improvement of MIA availability in planta or in vitro. We sequenced the nuclear genome of Rhazya stricta (Apocynaceae, Rauvolfioideae) and present this high quality assembly in comparison with that of coffee (Rubiaceae, Coffea canephora, Gentianales) and others to investigate the evolution of genome-scale features. The annotated Rhazya genome was used to develop the community resource, RhaCyc, a metabolic pathway database. Gene family trees were constructed to identify homologs of MIA pathway genes and to examine their evolutionary history. We found that, unlike Coffea, the Rhazya lineage has experienced many structural rearrangements. Gene tree analyses suggest recent, lineage-specific expansion and diversification among homologs encoding MIA pathway genes in Gentianales and provide candidate sequences with the potential to close gaps in characterized pathways and support prospecting for new MIA production avenues.
Sánchez-Herrera, K; Sandoval, H; Mouniee, D; Ramírez-Durán, N; Bergeron, E; Boiron, P; Sánchez-Saucedo, N; Rodríguez-Nava, V
2017-09-01
Currently for bacterial identification and classification the rrs gene encoding 16S rRNA is used as a reference method for the analysis of strains of the genus Nocardia. However, it does not have enough polymorphism to differentiate them at the species level. This fact makes it necessary to search for molecular targets that can provide better identification. The sod A gene (encoding the enzyme superoxide dismutase) has had good results in identifying species of other Actinomycetes. In this study the sod A gene is proposed for the identification and differentiation at the species level of the genus Nocardia. We used 41 type species of various collections; a 386 bp fragment of the sod A gene was amplified and sequenced, and a phylogenetic analysis was performed comparing the genes rrs (1171 bp), hsp 65 (401 bp), sec A1 (494 bp), gyr B (1195 bp) and rpo B (401 bp). The sequences were aligned using the Clustal X program. Evolutionary trees according to the neighbour-joining method were created with the programs Phylo_win and MEGA 6. The specific variability of the sod A genus of the genus Nocardia was analysed. A high phylogenetic resolution, significant genetic variability, and specificity and reliability were observed for the differentiation of the isolates at the species level. The polymorphism observed in the sod A gene sequence contains variable regions that allow the discrimination of closely related Nocardia species. The clear specificity, despite its small size, proves to be of great advantage for use in taxonomic studies and clinical diagnosis of the genus Nocardia.
Morata, Jordi; Puigdomènech, Pere
2017-02-08
Cucurbitaceae species contain a significantly lower number of genes coding for proteins with similarity to plant resistance genes belonging to the NBS-LRR family than other plant species of similar genome size. A large proportion of these genes are organized in clusters that appear to be hotspots of variability. The genomes of the Cucurbitaceae species measured until now are intermediate in size (between 350 and 450 Mb) and they apparently have not undergone any genome duplications beside those at the origin of eudicots. The cluster containing the largest number of NBS-LRR genes has previously been analyzed in melon and related species and showed a high degree of interspecific and intraspecific variability. It was of interest to study whether similar behavior occurred in other cluster of the same family of genes. The cluster of NBS-LRR genes located in melon chromosome 9 was analyzed and compared with the syntenic regions in other cucurbit genomes. This is the second cluster in number within this species and it contains nine sequences with a NBS-LRR annotation including two genes, Fom1 and Prv, providing resistance against Fusarium and Ppapaya ring-spot virus (PRSV). The variability within the melon species appears to consist essentially of single nucleotide polymorphisms. Clusters of similar genes are present in the syntenic regions of the two species of Cucurbitaceae that were sequenced, cucumber and watermelon. Most of the genes in the syntenic clusters can be aligned between species and a hypothesis of generation of the cluster is proposed. The number of genes in the watermelon cluster is similar to that in melon while a higher number of genes (12) is present in cucumber, a species with a smaller genome than melon. After comparing genome resequencing data of 115 cucumber varieties, deletion of a group of genes is observed in a group of varieties of Indian origin. Clusters of genes coding for NBS-LRR proteins in cucurbits appear to have specific variability in different regions of the genome and between different species. This observation is in favour of considering that the adaptation of plant species to changing environments is based upon the variability that may occur at any location in the genome and that has been produced by specific mechanisms of sequence variation acting on plant genomes. This information could be useful both to understand the evolution of species and for plant breeding.
Transforming RNA-Seq data to improve the performance of prognostic gene signatures.
Zwiener, Isabella; Frisch, Barbara; Binder, Harald
2014-01-01
Gene expression measurements have successfully been used for building prognostic signatures, i.e for identifying a short list of important genes that can predict patient outcome. Mostly microarray measurements have been considered, and there is little advice available for building multivariable risk prediction models from RNA-Seq data. We specifically consider penalized regression techniques, such as the lasso and componentwise boosting, which can simultaneously consider all measurements and provide both, multivariable regression models for prediction and automated variable selection. However, they might be affected by the typical skewness, mean-variance-dependency or extreme values of RNA-Seq covariates and therefore could benefit from transformations of the latter. In an analytical part, we highlight preferential selection of covariates with large variances, which is problematic due to the mean-variance dependency of RNA-Seq data. In a simulation study, we compare different transformations of RNA-Seq data for potentially improving detection of important genes. Specifically, we consider standardization, the log transformation, a variance-stabilizing transformation, the Box-Cox transformation, and rank-based transformations. In addition, the prediction performance for real data from patients with kidney cancer and acute myeloid leukemia is considered. We show that signature size, identification performance, and prediction performance critically depend on the choice of a suitable transformation. Rank-based transformations perform well in all scenarios and can even outperform complex variance-stabilizing approaches. Generally, the results illustrate that the distribution and potential transformations of RNA-Seq data need to be considered as a critical step when building risk prediction models by penalized regression techniques.
Transforming RNA-Seq Data to Improve the Performance of Prognostic Gene Signatures
Zwiener, Isabella; Frisch, Barbara; Binder, Harald
2014-01-01
Gene expression measurements have successfully been used for building prognostic signatures, i.e for identifying a short list of important genes that can predict patient outcome. Mostly microarray measurements have been considered, and there is little advice available for building multivariable risk prediction models from RNA-Seq data. We specifically consider penalized regression techniques, such as the lasso and componentwise boosting, which can simultaneously consider all measurements and provide both, multivariable regression models for prediction and automated variable selection. However, they might be affected by the typical skewness, mean-variance-dependency or extreme values of RNA-Seq covariates and therefore could benefit from transformations of the latter. In an analytical part, we highlight preferential selection of covariates with large variances, which is problematic due to the mean-variance dependency of RNA-Seq data. In a simulation study, we compare different transformations of RNA-Seq data for potentially improving detection of important genes. Specifically, we consider standardization, the log transformation, a variance-stabilizing transformation, the Box-Cox transformation, and rank-based transformations. In addition, the prediction performance for real data from patients with kidney cancer and acute myeloid leukemia is considered. We show that signature size, identification performance, and prediction performance critically depend on the choice of a suitable transformation. Rank-based transformations perform well in all scenarios and can even outperform complex variance-stabilizing approaches. Generally, the results illustrate that the distribution and potential transformations of RNA-Seq data need to be considered as a critical step when building risk prediction models by penalized regression techniques. PMID:24416353
Evolution of Alternative Adaptive Immune Systems in Vertebrates.
Boehm, Thomas; Hirano, Masayuki; Holland, Stephen J; Das, Sabyasachi; Schorpp, Michael; Cooper, Max D
2018-04-26
Adaptive immunity in jawless fishes is based on antigen recognition by three types of variable lymphocyte receptors (VLRs) composed of variable leucine-rich repeats, which are differentially expressed by two T-like lymphocyte lineages and one B-like lymphocyte lineage. The T-like cells express either VLRAs or VLRCs of yet undefined antigen specificity, whereas the VLRB antibodies secreted by B-like cells bind proteinaceous and carbohydrate antigens. The incomplete VLR germline genes are assembled into functional units by a gene conversion-like mechanism that employs flanking variable leucine-rich repeat sequences as templates in association with lineage-specific expression of cytidine deaminases. B-like cells develop in the hematopoietic typhlosole and kidneys, whereas T-like cells develop in the thymoid, a thymus-equivalent region at the gill fold tips. Thus, the dichotomy between T-like and B-like cells and the presence of dedicated lymphopoietic tissues emerge as ancestral vertebrate features, whereas the somatic diversification of structurally distinct antigen receptor genes evolved independently in jawless and jawed vertebrates.
Combining Genotype, Phenotype, and Environment to Infer Potential Candidate Genes.
Talbot, Benoit; Chen, Ting-Wen; Zimmerman, Shawna; Joost, Stéphane; Eckert, Andrew J; Crow, Taylor M; Semizer-Cuming, Devrim; Seshadri, Chitra; Manel, Stéphanie
2017-03-01
Population genomic analysis can be an important tool in understanding local adaptation. Identification of potential adaptive loci in such analyses is usually based on the survey of a large genomic dataset in combination with environmental variables. Phenotypic data are less commonly incorporated into such studies, although combining a genome scan analysis with a phenotypic trait analysis can greatly improve the insights obtained from each analysis individually. Here, we aimed to identify loci potentially involved in adaptation to climate in 283 Loblolly pine (Pinus taeda) samples from throughout the species' range in the southeastern United States. We analyzed associations between phenotypic, molecular, and environmental variables from datasets of 3082 single nucleotide polymorphism (SNP) loci and 3 categories of phenotypic traits (gene expression, metabolites, and whole-plant traits). We found only 6 SNP loci that displayed potential signals of local adaptation. Five of the 6 identified SNPs are linked to gene expression traits for lignin development, and 1 is linked with whole-plant traits. We subsequently compared the 6 candidate genes with environmental variables and found a high correlation in only 3 of them (R2 > 0.2). Our study highlights the need for a combination of genotypes, phenotypes, and environmental variables, and for an appropriate sampling scheme and study design, to improve confidence in the identification of potential candidate genes. © The American Genetic Association 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
BASiCS: Bayesian Analysis of Single-Cell Sequencing Data
Vallejos, Catalina A.; Marioni, John C.; Richardson, Sylvia
2015-01-01
Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell’s lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated to high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable supports the efficacy of our approach. PMID:26107944
BASiCS: Bayesian Analysis of Single-Cell Sequencing Data.
Vallejos, Catalina A; Marioni, John C; Richardson, Sylvia
2015-06-01
Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell's lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated to high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable supports the efficacy of our approach.
Therapeutic Gene Editing Safety and Specificity.
Lux, Christopher T; Scharenberg, Andrew M
2017-10-01
Therapeutic gene editing is significant for medical advancement. Safety is intricately linked to the specificity of the editing tools used to cut at precise genomic targets. Improvements can be achieved by thoughtful design of nucleases and repair templates, analysis of off-target editing, and careful utilization of viral vectors. Advancements in DNA repair mechanisms and development of new generations of tools improve targeting of specific sequences while minimizing risks. It is important to plot a safe course for future clinical trials. This article reviews safety and specificity for therapeutic gene editing to spur dialogue and advancement. Copyright © 2017 Elsevier Inc. All rights reserved.
Pounds, Stan; Cheng, Cheng; Cao, Xueyuan; Crews, Kristine R; Plunkett, William; Gandhi, Varsha; Rubnitz, Jeffrey; Ribeiro, Raul C; Downing, James R; Lamba, Jatinder
2009-08-15
In some applications, prior biological knowledge can be used to define a specific pattern of association of multiple endpoint variables with a genomic variable that is biologically most interesting. However, to our knowledge, there is no statistical procedure designed to detect specific patterns of association with multiple endpoint variables. Projection onto the most interesting statistical evidence (PROMISE) is proposed as a general procedure to identify genomic variables that exhibit a specific biologically interesting pattern of association with multiple endpoint variables. Biological knowledge of the endpoint variables is used to define a vector that represents the biologically most interesting values for statistics that characterize the associations of the endpoint variables with a genomic variable. A test statistic is defined as the dot-product of the vector of the observed association statistics and the vector of the most interesting values of the association statistics. By definition, this test statistic is proportional to the length of the projection of the observed vector of correlations onto the vector of most interesting associations. Statistical significance is determined via permutation. In simulation studies and an example application, PROMISE shows greater statistical power to identify genes with the interesting pattern of associations than classical multivariate procedures, individual endpoint analyses or listing genes that have the pattern of interest and are significant in more than one individual endpoint analysis. Documented R routines are freely available from www.stjuderesearch.org/depts/biostats and will soon be available as a Bioconductor package from www.bioconductor.org.
Rare Genetic Forms of Obesity: Clinical Approach and Current Treatments in 2016
Huvenne, Hélène; Dubern, Béatrice; Clément, Karine; Poitou, Christine
2016-01-01
Obesity results from a synergistic relationship between genes and the environment. The phenotypic expression of genetic factors involved in obesity is variable, allowing to distinguish several clinical pictures of obesity. Monogenic obesity is described as rare and severe early-onset obesity with abnormal feeding behavior and endocrine disorders. This is mainly due to autosomal recessive mutations in genes of the leptin-melanocortin pathway which plays a key role in the hypothalamic control of food intake. Melanocortin 4 receptor(MC4R)-linked obesity is characterized by the variable severity of obesity and no notable additional phenotypes. Mutations in the MC4R gene are involved in 2-3% of obese children and adults; the majority of these are heterozygous. Syndromic obesity is associated with mental retardation, dysmorphic features, and organ-specific developmental abnormalities. Additional genes participating in the development of hypothalamus and central nervous system have been regularly identified. But to date, not all involved genes have been identified so far. New diagnostic tools, such as whole-exome sequencing, will probably help to identify other genes. Managing these patients is challenging. Indeed, specific treatments are available only for specific types of monogenic obesity, such as leptin deficiency. Data on bariatric surgery are limited and controversial. New molecules acting on the leptin-melanocortin pathway are currently being developed. PMID:27241181
Moriano-Gutierrez, A; Colomer-Revuelta, J; Sanjuan, J; Carot-Sierra, J M
2017-01-01
A great deal of research has addressed problems in the correct acquisition of language, but with few overall conclusions. The reasons for this lie in the individual variability, the existence of different measures for assessing language and the fact that a complex network of genetic and environmental factors are involved in its development. To review the environmental and genetic variables that have been studied to date, in order to gain a better under-standing of the causes of specific language impairment and create new evidence that can help in the development of screening systems for the early detection of these disorders. The environmental variables related with poorer early child language development include male gender, low level of education of the mother, familial history of problems with language or psychiatric problems, perinatal problems and health problems in early childhood. Bilingualism seems to be a protective factor. Temperament and language are related. Within the genetic factors there are several specific genes associated with language, two of which have a greater influence on its physiological acquisition: FOXP2 and CNTNAP2. The other genes that are most related with specific language disorders are ATP2C2, CMIP, ROBO2, ZNF277 and NOP9. The key to comprehending the development of specific language disorders lies in reaching an understanding of the true role played by genes in the ontogenesis, in the regulation of the different developmental processes, and how this role is modulated by the environment.
State Space Model with hidden variables for reconstruction of gene regulatory networks.
Wu, Xi; Li, Peng; Wang, Nan; Gong, Ping; Perkins, Edward J; Deng, Youping; Zhang, Chaoyang
2011-01-01
State Space Model (SSM) is a relatively new approach to inferring gene regulatory networks. It requires less computational time than Dynamic Bayesian Networks (DBN). There are two types of variables in the linear SSM, observed variables and hidden variables. SSM uses an iterative method, namely Expectation-Maximization, to infer regulatory relationships from microarray datasets. The hidden variables cannot be directly observed from experiments. How to determine the number of hidden variables has a significant impact on the accuracy of network inference. In this study, we used SSM to infer Gene regulatory networks (GRNs) from synthetic time series datasets, investigated Bayesian Information Criterion (BIC) and Principle Component Analysis (PCA) approaches to determining the number of hidden variables in SSM, and evaluated the performance of SSM in comparison with DBN. True GRNs and synthetic gene expression datasets were generated using GeneNetWeaver. Both DBN and linear SSM were used to infer GRNs from the synthetic datasets. The inferred networks were compared with the true networks. Our results show that inference precision varied with the number of hidden variables. For some regulatory networks, the inference precision of DBN was higher but SSM performed better in other cases. Although the overall performance of the two approaches is compatible, SSM is much faster and capable of inferring much larger networks than DBN. This study provides useful information in handling the hidden variables and improving the inference precision.
Chakraborty, Sutirtha
2018-05-26
RNA-Seq technology has revolutionized the face of gene expression profiling by generating read count data measuring the transcript abundances for each queried gene on multiple experimental subjects. But on the downside, the underlying technical artefacts and hidden biological profiles of the samples generate a wide variety of latent effects that may potentially distort the actual transcript/gene expression signals. Standard normalization techniques fail to correct for these hidden variables and lead to flawed downstream analyses. In this work I demonstrate the use of Partial Least Squares (built as an R package 'SVAPLSseq') to correct for the traces of extraneous variability in RNA-Seq data. A novel and thorough comparative analysis of the PLS based method is presented along with some of the other popularly used approaches for latent variable correction in RNA-Seq. Overall, the method is found to achieve a substantially improved estimation of the hidden effect signatures in the RNA-Seq transcriptome expression landscape compared to other available techniques. Copyright © 2017. Published by Elsevier Inc.
Regional differentiation among populations of the Diamondback terrapin (Malaclemys terrapin)
Hart, Kristen M.; Hunter, Margaret E.; King, Tim L.
2014-01-01
The Diamondback terrapin (Malaclemys terrapin) is a brackish-water turtle species whose populations have been fragmented due to anthropogenic activity such as development of coastal habitat and entrapment in commercial blue crab (Callinectes sapidus) fishing gear. Genetic analyses can improve conservation efforts for the long-term protection of the species. We used microsatellite DNA analysis to investigate levels of gene flow among and genetic variability within 21 geographically separate collections of the species distributed from Massachusetts to Texas. Quantified levels of genetic variability (allelic diversity, genotypic frequencies, and heterozygosity) revealed three zones of genetic discontinuity, resulting in four discrete populations: Northeast Atlantic, Coastal Mid-Atlantic, Florida and Texas/Louisiana. The average number of alleles and expected heterozygosity for the four genetic clusters were NA = 6.54 and HE = 0.050, respectively. However, the geographic boundaries of the populations did not correspond to accepted terrapin subspecies limits. Our results illuminate not only the need to sample terrapins in additional sites, specifically in the southeast, but also the necessity for allowing uninterrupted gene flow among population groupings to preserve current levels of genetic diversity.
DiLeo, Michelle F; Siu, Jenna C; Rhodes, Matthew K; López-Villalobos, Adriana; Redwine, Angela; Ksiazek, Kelly; Dyer, Rodney J
2014-08-01
Pollen-mediated gene flow is a major driver of spatial genetic structure in plant populations. Both individual plant characteristics and site-specific features of the landscape can modify the perceived attractiveness of plants to their pollinators and thus play an important role in shaping spatial genetic variation. Most studies of landscape-level genetic connectivity in plants have focused on the effects of interindividual distance using spatial and increasingly ecological separation, yet have not incorporated individual plant characteristics or other at-site ecological variables. Using spatially explicit simulations, we first tested the extent to which the inclusion of at-site variables influencing local pollination success improved the statistical characterization of genetic connectivity based upon examination of pollen pool genetic structure. The addition of at-site characteristics provided better models than those that only considered interindividual spatial distance (e.g. IBD). Models parameterized using conditional genetic covariance (e.g. population graphs) also outperformed those assuming panmixia. In a natural population of Cornus florida L. (Cornaceae), we showed that the addition of at-site characteristics (clumping of primary canopy opening above each maternal tree and maternal tree floral output) provided significantly better models describing gene flow than models including only between-site spatial (IBD) and ecological (isolation by resistance) variables. Overall, our results show that including interindividual and local ecological variation greatly aids in characterizing landscape-level measures of contemporary gene flow. © 2014 John Wiley & Sons Ltd.
Ibáñez, Clara; Pérez-Torrado, Roberto; Morard, Miguel; Toft, Christina; Barrio, Eladio; Querol, Amparo
2017-09-18
Transcriptome analyses play a central role in unraveling the complexity of gene expression regulation in Saccharomyces cerevisiae. This species, one of the most important microorganisms for humans given its industrial applications, shows an astonishing degree of genetic and phenotypic variability among different strains adapted to specific environments. In order to gain novel insights into the Saccharomyces cerevisiae biology of strains adapted to different fermentative environments, we analyzed the whole transcriptome of three strains isolated from wine, flor wine or mezcal fermentations. An RNA-seq transcriptome comparison of the different yeasts in the samples obtained during synthetic must fermentation highlighted the differences observed in the genes that encode mannoproteins, and in those involved in aroma, sugar transport, glycerol and alcohol metabolism, which are important under alcoholic fermentation conditions. These differences were also observed in the physiology of the strains after mannoprotein and aroma determinations. This study offers an essential foundation for understanding how gene expression variations contribute to the fermentation differences of the strains adapted to unequal fermentative environments. Such knowledge is crucial to make improvements in fermentation processes and to define targets for the genetic improvement or selection of wine yeasts. Copyright © 2017 Elsevier B.V. All rights reserved.
Röder, Christoph; König, Helmut; Fröhlich, Jürgen
2007-09-01
Sequencing of the complete 26S rRNA genes of all Dekkera/Brettanomyces species colonizing different beverages revealed the potential for a specific primer and probe design to support diagnostic PCR approaches and FISH. By analysis of the complete 26S rRNA genes of all five currently known Dekkera/Brettanomyces species (Dekkera bruxellensis, D. anomala, Brettanomyces custersianus, B. nanus and B. naardenensis), several regions with high nucleotide sequence variability yet distinct from the D1/D2 domains were identified. FISH species-specific probes targeting the 26S rRNA gene's most variable regions were designed. Accessibility of probe targets for hybridization was facilitated by the construction of partially complementary 'side'-labeled probes, based on secondary structure models of the rRNA sequences. The specificity and routine applicability of the FISH-based method for yeast identification were tested by analyzing different wine isolates. Investigation of the prevalence of Dekkera/Brettanomyces yeasts in the German viticultural regions Wonnegau, Nierstein and Bingen (Rhinehesse, Rhineland-Palatinate) resulted in the isolation of 37 D. bruxellensis strains from 291 wine samples.
Church, Sheri A; Livingstone, Kevin; Lai, Zhao; Kozik, Alexander; Knapp, Steven J; Michelmore, Richard W; Rieseberg, Loren H
2007-02-01
Using likelihood-based variable selection models, we determined if positive selection was acting on 523 EST sequence pairs from two lineages of sunflower and lettuce. Variable rate models are generally not used for comparisons of sequence pairs due to the limited information and the inaccuracy of estimates of specific substitution rates. However, previous studies have shown that the likelihood ratio test (LRT) is reliable for detecting positive selection, even with low numbers of sequences. These analyses identified 56 genes that show a signature of selection, of which 75% were not identified by simpler models that average selection across codons. Subsequent mapping studies in sunflower show four of five of the positively selected genes identified by these methods mapped to domestication QTLs. We discuss the validity and limitations of using variable rate models for comparisons of sequence pairs, as well as the limitations of using ESTs for identification of positively selected genes.
X chromosome regulation: diverse patterns in development, tissues and disease
Deng, Xinxian; Berletch, Joel B.; Nguyen, Di K.; Disteche, Christine M.
2014-01-01
Genes on the mammalian X chromosome are present in one copy in males and two copies in females. The complex mechanisms that regulate the X chromosome lead to evolutionary and physiological variability in gene expression between species, the sexes, individuals, developmental stages, tissues and cell types. In early development, delayed and incomplete X chromosome inactivation (XCI) in some species causes variability in gene expression. Additional diversity stems from escape from XCI and from mosaicism or XCI skewing in females. This causes sex-specific differences that manifest as differential gene expression and associated phenotypes. Furthermore, the complexity and diversity of X dosage regulation affect the severity of diseases caused by X-linked mutations. PMID:24733023
Pounds, Stan; Cheng, Cheng; Cao, Xueyuan; Crews, Kristine R.; Plunkett, William; Gandhi, Varsha; Rubnitz, Jeffrey; Ribeiro, Raul C.; Downing, James R.; Lamba, Jatinder
2009-01-01
Motivation: In some applications, prior biological knowledge can be used to define a specific pattern of association of multiple endpoint variables with a genomic variable that is biologically most interesting. However, to our knowledge, there is no statistical procedure designed to detect specific patterns of association with multiple endpoint variables. Results: Projection onto the most interesting statistical evidence (PROMISE) is proposed as a general procedure to identify genomic variables that exhibit a specific biologically interesting pattern of association with multiple endpoint variables. Biological knowledge of the endpoint variables is used to define a vector that represents the biologically most interesting values for statistics that characterize the associations of the endpoint variables with a genomic variable. A test statistic is defined as the dot-product of the vector of the observed association statistics and the vector of the most interesting values of the association statistics. By definition, this test statistic is proportional to the length of the projection of the observed vector of correlations onto the vector of most interesting associations. Statistical significance is determined via permutation. In simulation studies and an example application, PROMISE shows greater statistical power to identify genes with the interesting pattern of associations than classical multivariate procedures, individual endpoint analyses or listing genes that have the pattern of interest and are significant in more than one individual endpoint analysis. Availability: Documented R routines are freely available from www.stjuderesearch.org/depts/biostats and will soon be available as a Bioconductor package from www.bioconductor.org. Contact: stanley.pounds@stjude.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19528086
Chaouachi, Maher; El Malki, Redouane; Berard, Aurélie; Romaniuk, Marcel; Laval, Valérie; Brunel, Dominique; Bertheau, Yves
2008-03-26
The labeling of products containing genetically modified organisms (GMO) is linked to their quantification since a threshold for the presence of fortuitous GMOs in food has been established. This threshold is calculated from a combination of two absolute quantification values: one for the specific GMO target and the second for an endogenous reference gene specific to the taxon. Thus, the development of reliable methods to quantify GMOs using endogenous reference genes in complex matrixes such as food and feed is needed. Plant identification can be difficult in the case of closely related taxa, which moreover are subject to introgression events. Based on the homology of beta-fructosidase sequences obtained from public databases, two couples of consensus primers were designed for the detection, quantification, and differentiation of four Solanaceae: potato (Solanum tuberosum), tomato (Solanum lycopersicum), pepper (Capsicum annuum), and eggplant (Solanum melongena). Sequence variability was studied first using lines and cultivars (intraspecies sequence variability), then using taxa involved in gene introgressions, and finally, using taxonomically close taxa (interspecies sequence variability). This study allowed us to design four highly specific TaqMan-MGB probes. A duplex real time PCR assay was developed for simultaneous quantification of tomato and potato. For eggplant and pepper, only simplex real time PCR tests were developed. The results demonstrated the high specificity and sensitivity of the assays. We therefore conclude that beta-fructosidase can be used as an endogenous reference gene for GMO analysis.
Dose response relationship in anti-stress gene regulatory networks.
Zhang, Qiang; Andersen, Melvin E
2007-03-02
To maintain a stable intracellular environment, cells utilize complex and specialized defense systems against a variety of external perturbations, such as electrophilic stress, heat shock, and hypoxia, etc. Irrespective of the type of stress, many adaptive mechanisms contributing to cellular homeostasis appear to operate through gene regulatory networks that are organized into negative feedback loops. In general, the degree of deviation of the controlled variables, such as electrophiles, misfolded proteins, and O2, is first detected by specialized sensor molecules, then the signal is transduced to specific transcription factors. Transcription factors can regulate the expression of a suite of anti-stress genes, many of which encode enzymes functioning to counteract the perturbed variables. The objective of this study was to explore, using control theory and computational approaches, the theoretical basis that underlies the steady-state dose response relationship between cellular stressors and intracellular biochemical species (controlled variables, transcription factors, and gene products) in these gene regulatory networks. Our work indicated that the shape of dose response curves (linear, superlinear, or sublinear) depends on changes in the specific values of local response coefficients (gains) distributed in the feedback loop. Multimerization of anti-stress enzymes and transcription factors into homodimers, homotrimers, or even higher-order multimers, play a significant role in maintaining robust homeostasis. Moreover, our simulation noted that dose response curves for the controlled variables can transition sequentially through four distinct phases as stressor level increases: initial superlinear with lesser control, superlinear more highly controlled, linear uncontrolled, and sublinear catastrophic. Each phase relies on specific gain-changing events that come into play as stressor level increases. The low-dose region is intrinsically nonlinear, and depending on the level of local gains, presence of gain-changing events, and degree of feedforward gene activation, this region can appear as superlinear, sublinear, or even J-shaped. The general dose response transition proposed here was further examined in a complex anti-electrophilic stress pathway, which involves multiple genes, enzymes, and metabolic reactions. This work would help biologists and especially toxicologists to better assess and predict the cellular impact brought about by biological stressors.
2014-01-01
Background Heterologous gene expression is an important tool for synthetic biology that enables metabolic engineering and the production of non-natural biologics in a variety of host organisms. The translational efficiency of heterologous genes can often be improved by optimizing synonymous codon usage to better match the host organism. However, traditional approaches for optimization neglect to take into account many factors known to influence synonymous codon distributions. Results Here we define an alternative approach for codon optimization that utilizes systems level information and codon context for the condition under which heterologous genes are being expressed. Furthermore, we utilize a probabilistic algorithm to generate multiple variants of a given gene. We demonstrate improved translational efficiency using this condition-specific codon optimization approach with two heterologous genes, the fluorescent protein-encoding eGFP and the catechol 1,2-dioxygenase gene CatA, expressed in S. cerevisiae. For the latter case, optimization for stationary phase production resulted in nearly 2.9-fold improvements over commercial gene optimization algorithms. Conclusions Codon optimization is now often a standard tool for protein expression, and while a variety of tools and approaches have been developed, they do not guarantee improved performance for all hosts of applications. Here, we suggest an alternative method for condition-specific codon optimization and demonstrate its utility in Saccharomyces cerevisiae as a proof of concept. However, this technique should be applicable to any organism for which gene expression data can be generated and is thus of potential interest for a variety of applications in metabolic and cellular engineering. PMID:24636000
Variable sexually dimorphic gene expression in laboratory strains of Drosophila melanogaster.
Baker, Dean A; Meadows, Lisa A; Wang, Jing; Dow, Julian At; Russell, Steven
2007-12-10
Wild-type laboratory strains of model organisms are typically kept in isolation for many years, with the action of genetic drift and selection on mutational variation causing lineages to diverge with time. Natural populations from which such strains are established, show that gender-specific interactions in particular drive many aspects of sequence level and transcriptional level variation. Here, our goal was to identify genes that display transcriptional variation between laboratory strains of Drosophila melanogaster, and to explore evidence of gender-biased interactions underlying that variability. Transcriptional variation among the laboratory genotypes studied occurs more frequently in males than in females. Qualitative differences are also apparent to suggest that genes within particular functional classes disproportionately display variation in gene expression. Our analysis indicates that genes with reproductive functions are most often divergent between genotypes in both sexes, however a large proportion of female variation can also be attributed to genes without expression in the ovaries. The present study clearly shows that transcriptional variation between common laboratory strains of Drosophila can differ dramatically due to sexual dimorphism. Much of this variation reflects sex-specific challenges associated with divergent physiological trade-offs, morphology and regulatory pathways operating within males and females.
Gene-diet interactions and aging in C. elegans
Yen, Chia An; Curran, Sean P.
2016-01-01
Diet is the most variable aspect of life history, as most individuals have a large diversity of food choices, varying in the type and amount that they ingest. In the short-term, diet can affect metabolism and energy levels. However, in the long run, the net deficiency or excess of calories from diet can influence the progression and severity of age-related diseases. An old and yet still debated question is: how do specific dietary choices impact health- and lifespan? It is clear that genetics can play a critical role — perhaps just as important as diet choices. For example, poor diet in combination with genetic susceptibility can lead to metabolic disorders, such as obesity and type 2 diabetes. Recent work in Caenorhabditis elegans has identified the existence of diet-gene pairs, where the consequence of mutating a specific gene is only realized on specific diets. Many core metabolic pathways are conserved from worm to human. Although only a handful of these diet-gene pairs has been characterized, there are potentially hundreds, if not thousands, of such interactions, which may explain the variability in the rates of aging in humans and the incidence and severity of age-related diseases. PMID:26924670
Genetic variability of milk fatty acids.
Arnould, V M-R; Soyeurt, H
2009-01-01
The milk fatty acid (FA) profile is far from the optimal fat composition in regards to human health. The natural sources of variation, such as feeding or genetics, could be used to increase the concentrations of unsaturated fatty acids. The impact of feeding is well described. However, genetic effects on the milk FA composition begin to be extensively studied. This paper summarizes the available information about the genetic variability of FAs. The greatest breed differences in FA composition are observed between Holstein and Jersey milk. Milk fat of the latter breed contains higher concentrations of saturated FAs, especially short-chain FAs. The variation of the delta-9 desaturase activity estimated from specific FA ratios could explain partly these breed differences. The choice of a specific breed seems to be a possibility to improve the nutritional quality of milk fat. Generally, the proportions of FAs in milk are more heritable than the proportions of these same FAs in fat. Heritability estimates range from 0.00 to 0.54. The presence of some single nucleotide polymorphisms could explain partly the observed individual genetic variability. The polymorphisms detected on SCD1 and DGAT1 genes influence the milk FA composition. The SCD1 V allele increases the unsaturation of C16 and C18. The DGAT1 A allele is related to the unsaturation of C18. So, a combination of the molecular and quantitative approaches should be used to develop tools helping farmers in the selection of their animals to improve the nutritional quality of the produced milk fat.
Deep sequencing reveals cell-type-specific patterns of single-cell transcriptome variation.
Dueck, Hannah; Khaladkar, Mugdha; Kim, Tae Kyung; Spaethling, Jennifer M; Francis, Chantal; Suresh, Sangita; Fisher, Stephen A; Seale, Patrick; Beck, Sheryl G; Bartfai, Tamas; Kuhn, Bernhard; Eberwine, James; Kim, Junhyong
2015-06-09
Differentiation of metazoan cells requires execution of different gene expression programs but recent single-cell transcriptome profiling has revealed considerable variation within cells of seeming identical phenotype. This brings into question the relationship between transcriptome states and cell phenotypes. Additionally, single-cell transcriptomics presents unique analysis challenges that need to be addressed to answer this question. We present high quality deep read-depth single-cell RNA sequencing for 91 cells from five mouse tissues and 18 cells from two rat tissues, along with 30 control samples of bulk RNA diluted to single-cell levels. We find that transcriptomes differ globally across tissues with regard to the number of genes expressed, the average expression patterns, and within-cell-type variation patterns. We develop methods to filter genes for reliable quantification and to calibrate biological variation. All cell types include genes with high variability in expression, in a tissue-specific manner. We also find evidence that single-cell variability of neuronal genes in mice is correlated with that in rats consistent with the hypothesis that levels of variation may be conserved. Single-cell RNA-sequencing data provide a unique view of transcriptome function; however, careful analysis is required in order to use single-cell RNA-sequencing measurements for this purpose. Technical variation must be considered in single-cell RNA-sequencing studies of expression variation. For a subset of genes, biological variability within each cell type appears to be regulated in order to perform dynamic functions, rather than solely molecular noise.
Ecker, Simone; Chen, Lu; Pancaldi, Vera; Bagger, Frederik O; Fernández, José María; Carrillo de Santa Pau, Enrique; Juan, David; Mann, Alice L; Watt, Stephen; Casale, Francesco Paolo; Sidiropoulos, Nikos; Rapin, Nicolas; Merkel, Angelika; Stunnenberg, Hendrik G; Stegle, Oliver; Frontini, Mattia; Downes, Kate; Pastinen, Tomi; Kuijpers, Taco W; Rico, Daniel; Valencia, Alfonso; Beck, Stephan; Soranzo, Nicole; Paul, Dirk S
2017-01-26
A healthy immune system requires immune cells that adapt rapidly to environmental challenges. This phenotypic plasticity can be mediated by transcriptional and epigenetic variability. We apply a novel analytical approach to measure and compare transcriptional and epigenetic variability genome-wide across CD14 + CD16 - monocytes, CD66b + CD16 + neutrophils, and CD4 + CD45RA + naïve T cells from the same 125 healthy individuals. We discover substantially increased variability in neutrophils compared to monocytes and T cells. In neutrophils, genes with hypervariable expression are found to be implicated in key immune pathways and are associated with cellular properties and environmental exposure. We also observe increased sex-specific gene expression differences in neutrophils. Neutrophil-specific DNA methylation hypervariable sites are enriched at dynamic chromatin regions and active enhancers. Our data highlight the importance of transcriptional and epigenetic variability for the key role of neutrophils as the first responders to inflammatory stimuli. We provide a resource to enable further functional studies into the plasticity of immune cells, which can be accessed from: http://blueprint-dev.bioinfo.cnio.es/WP10/hypervariability .
Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse
Hillier, LaDeana W.; Zody, Michael C.; Goldstein, Steve; She, Xinwe; Bult, Carol J.; Agarwala, Richa; Cherry, Joshua L.; DiCuccio, Michael; Hlavina, Wratko; Kapustin, Yuri; Meric, Peter; Maglott, Donna; Birtle, Zoë; Marques, Ana C.; Graves, Tina; Zhou, Shiguo; Teague, Brian; Potamousis, Konstantinos; Churas, Christopher; Place, Michael; Herschleb, Jill; Runnheim, Ron; Forrest, Daniel; Amos-Landgraf, James; Schwartz, David C.; Cheng, Ze; Lindblad-Toh, Kerstin; Eichler, Evan E.; Ponting, Chris P.
2009-01-01
The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non–protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not. PMID:19468303
Crocco, Paolina; Barale, Roberto; Rose, Giuseppina; Rizzato, Cosmeri; Santoro, Aurelia; De Rango, Francesco; Carrai, Maura; Fogar, Paola; Monti, Daniela; Biondi, Fiammetta; Bucci, Laura; Ostan, Rita; Tallaro, Federica; Montesanto, Alberto; Zambon, Carlo-Federico; Franceschi, Claudio; Canzian, Federico; Passarino, Giuseppe; Campa, Daniele
2015-06-01
Leukocyte telomere length (LTL) has been observed to be hereditable and correlated with longevity. However, contrasting results have been reported in different populations on the value of LTL heritability and on how biology of telomeres influences longevity. We investigated whether the variability of genes correlated to telomere maintenance is associated with telomere length and affects longevity in a population from Southern Italy (20-106 years). For this purpose we analyzed thirty-one polymorphisms in eight telomerase-associated genes of which twelve in the genes coding for the core enzyme (TERT and TERC) and the remaining in genes coding for components of the telomerase complex (TERF1, TERF2, TERF2IP, TNKS, TNKS2 and TEP1). We did not observe (after correcting for multiple testing) statistically significant associations between SNPs and LTL, possibly suggesting a low genetic influence of the variability of these genes on LTL in the elderly. On the other hand, we found that the variability of genes encoding for TERF1 and TNKS2, not directly involved in LTL, but important for keeping the integrity of the structure, shows a significant association with longevity. This suggests that the maintenance of these chromosomal structures may be critically important for preventing, or delaying, senescence and aging. Such a correlation was not observed in a population from northern Italy that we used as an independent replication set. This discrepancy is in line with previous reports regarding both the population specificity of results on telomere biology and the differences of aging in northern and southern Italy.
Farra, Rossella; Musiani, Francesco; Perrone, Francesca; Čemažar, Maja; Kamenšek, Urška; Tonon, Federica; Abrami, Michela; Ručigaj, Aleš; Grassi, Mario; Pozzato, Gabriele; Bonazza, Deborah; Zanconati, Fabrizio; Forte, Giancarlo; El Boustani, Maguie; Scarabel, Lucia; Garziera, Marica; Russo Spena, Concetta; De Stefano, Lucia; Salis, Barbara; Toffoli, Giuseppe; Rizzolio, Flavio; Grassi, Gabriele; Dapas, Barbara
2018-03-28
Despite the advances in anticancer therapies, their effectiveness for many human tumors is still far from being optimal. Significant improvements in treatment efficacy can come from the enhancement of drug specificity. This goal may be achieved by combining the use of therapeutic molecules with tumor specific effects and delivery carriers with tumor targeting ability. In this regard, nucleic acid-based drug (NABD) and particularly small interfering RNAs (siRNAs), are attractive molecules due to the possibility to be engineered to target specific tumor genes. On the other hand, polymeric-based delivery systems are emerging as versatile carriers to generate tumor-targeted delivery systems. Here we will focus on the most recent findings in the selection of siRNA/polymeric targeted delivery systems for hepatocellular carcinoma (HCC), a human tumor for which currently available therapeutic approaches are poorly effective. In addition, we will discuss the most attracting and, in our opinion, promising siRNA-polymer combinations for HCC in relation to the biological features of HCC tissue. Attention will be also put on the mathematical description of the mechanisms ruling siRNA-carrier delivery, this being an important aspect to improve effectiveness reducing the experimental work.
Molecular diagnostics for human leptospirosis.
Waggoner, Jesse J; Pinsky, Benjamin A
2016-10-01
The definitive diagnosis of leptospirosis, which results from infection with spirochetes of the genus Leptospira, currently relies on the use of culture, serological testing (microscopic agglutination testing), and molecular detection. The purpose of this review is to describe new molecular diagnostics for Leptospira and discuss advancements in the use of available methods. Efforts have been focused on improving the clinical sensitivity of Leptospira detection using molecular methods. In this review, we describe a reoptimized pathogenic species-specific real-time PCR (targeting lipL32) that has demonstrated improved sensitivity, findings by two groups that real-time reverse-transcription PCR assays targeting the 16S rrs gene can improve detection, and two new loop-mediated amplification techniques. Quantitation of leptospiremia, detection in different specimen types, and the complementary roles played by molecular detection and microscopic agglutination testing will be discussed. Finally, a protocol for Leptospira strain subtyping using variable number tandem repeat targets and high-resolution melting will be described. Molecular diagnostics have an established role for the diagnosis of leptospirosis and provide an actionable diagnosis in the acute setting. The use of real-time reverse-transcription PCR for testing serum/plasma and cerebrospinal fluid, when available, may improve the detection of Leptospira without decreasing clinical specificity.
Improved high-dimensional prediction with Random Forests by the use of co-data.
Te Beest, Dennis E; Mes, Steven W; Wilting, Saskia M; Brakenhoff, Ruud H; van de Wiel, Mark A
2017-12-28
Prediction in high dimensional settings is difficult due to the large number of variables relative to the sample size. We demonstrate how auxiliary 'co-data' can be used to improve the performance of a Random Forest in such a setting. Co-data are incorporated in the Random Forest by replacing the uniform sampling probabilities that are used to draw candidate variables by co-data moderated sampling probabilities. Co-data here are defined as any type information that is available on the variables of the primary data, but does not use its response labels. These moderated sampling probabilities are, inspired by empirical Bayes, learned from the data at hand. We demonstrate the co-data moderated Random Forest (CoRF) with two examples. In the first example we aim to predict the presence of a lymph node metastasis with gene expression data. We demonstrate how a set of external p-values, a gene signature, and the correlation between gene expression and DNA copy number can improve the predictive performance. In the second example we demonstrate how the prediction of cervical (pre-)cancer with methylation data can be improved by including the location of the probe relative to the known CpG islands, the number of CpG sites targeted by a probe, and a set of p-values from a related study. The proposed method is able to utilize auxiliary co-data to improve the performance of a Random Forest.
Parker, Craig T.; Gilbert, Michel; Yuki, Nobuhiro; Endtz, Hubert P.; Mandrell, Robert E.
2008-01-01
The lipooligosaccharide (LOS) biosynthesis region is one of the more variable genomic regions between strains of Campylobacter jejuni. Indeed, eight classes of LOS biosynthesis loci have been established previously based on gene content and organization. In this study, we characterize additional classes of LOS biosynthesis loci and analyze various mechanisms that result in changes to LOS structures. To gain further insights into the genomic diversity of C. jejuni LOS biosynthesis region, we sequenced the LOS biosynthesis loci of 15 strains that possessed gene content that was distinct from the eight classes. This analysis identified 11 new classes of LOS loci that exhibited examples of deletions and insertions of genes and cassettes of genes found in other LOS classes or capsular biosynthesis loci leading to mosaic LOS loci. The sequence analysis also revealed both missense mutations leading to “allelic” glycosyltransferases and phase-variable and non-phase-variable gene inactivation by the deletion or insertion of bases. Specifically, we demonstrated that gene inactivation is an important mechanism for altering the LOS structures of strains possessing the same class of LOS biosynthesis locus. Together, these observations suggest that LOS biosynthesis region is a hotspot for genetic exchange and variability, often leading to changes in the LOS produced. PMID:18556784
Trinh, Alice T; Ball, Bret G; Weber, Erin; Gallaher, Timothy K; Gluzman-Poltorak, Zoya; Anderson, French; Basile, Lena A
2009-12-30
Murine retroviral vectors have been used in several hundred gene therapy clinical trials, but have fallen out of favor for a number of reasons. One issue is that gene expression from viral or internal promoters is highly variable and essentially unregulated. Moreover, with retroviral vectors, gene expression is usually silenced over time. Mammalian genes, in contrast, are characterized by highly regulated, precise levels of expression in both a temporal and a cell-specific manner. To ascertain if recapitulation of endogenous adenosine deaminase (ADA) expression can be achieved in a vector construct we created a new series of Moloney murine leukemia virus (MuLV) based retroviral vector that carry human regulatory elements including combinations of the ADA promoter, the ADA locus control region (LCR), ADA introns and human polyadenylation sequences in a self-inactivating vector backbone. A MuLV-based retroviral vector with a self-inactivating (SIN) backbone, the phosphoglycerate kinase promoter (PGK) and the enhanced green fluorescent protein (eGFP), as a reporter gene, was generated. Subsequent vectors were constructed from this basic vector by deletion or addition of certain elements. The added elements that were assessed are the human ADA promoter, human ADA locus control region (LCR), introns 7, 8, and 11 from the human ADA gene, and human growth hormone polyadenylation signal. Retroviral vector particles were produced by transient three-plasmid transfection of 293T cells. Retroviral vectors encoding eGFP were titered by transducing 293A cells, and then the proportion of GFP-positive cells was determined using fluorescence-activated cell sorting (FACS). Non T-cell and T-cell lines were transduced at a multiplicity of infection (MOI) of 0.1 and the yield of eGFP transgene expression was evaluated by FACS analysis using mean fluorescent intensity (MFI) detection. Vectors that contained the ADA LCR were preferentially expressed in T-cell lines. Further improvements in T-cell specific gene expression were observed with the incorporation of additional cis-regulatory elements, such as a human polyadenylation signal and intron 7 from the human ADA gene. These studies suggest that the combination of an authentically regulated ADA gene in a murine retroviral vector, together with additional locus-specific regulatory refinements, will yield a vector with a safer profile and greater efficacy in terms of high-level, therapeutic, regulated gene expression for the treatment of ADA-deficient severe combined immunodeficiency.
Zhang, Wenping; Yue, Bisong; Wang, Xiaofang; Zhang, Xiuyue; Xie, Zhong; Liu, Nonglin; Fu, Wenyuan; Yuan, Yaohua; Chen, Daqing; Fu, Danghua; Zhao, Bo; Yin, Yuzhong; Yan, Xiahui; Wang, Xinjing; Zhang, Rongying; Liu, Jie; Li, Maoping; Tang, Yao; Hou, Rong; Zhang, Zhihe
2011-10-01
In order to investigate the mitochondrial genome of Panthera tigris amoyensis, two South China tigers (P25 and P27) were analyzed following 15 cymt-specific primer sets. The entire mtDNA sequence was found to be 16,957 bp and 17,001 bp long for P25 and P27 respectively, and this difference in length between P25 and P27 occurred in the number of tandem repeats in the RS-3 segment of the control region. The structural characteristics of complete P. t. amoyensis mitochondrial genomes were also highly similar to those of P. uncia. Additionally, the rate of point mutation was only 0.3% and a total of 59 variable sites between P25 and P27 were found. Out of the 59 variable sites, 6 were located in 6 different tRNA genes, 6 in the 2 rRNA genes, 7 in non-coding regions (one located between tRNA-Asn and tRNA-Tyr and six in the D-loop), and 40 in 10 protein-coding genes. COI held the largest amount of variable sites (9 sites) and Cytb contained the highest variable rate (0.7%) in the complete sequences. Moreover, out of the 40 variable sites located in 10 protein-coding genes, 12 sites were nonsynonymous.
Regulatory genes and their roles for improvement of antibiotic biosynthesis in Streptomyces.
Lu, Fengjuan; Hou, Yanyan; Zhang, Heming; Chu, Yiwen; Xia, Haiyang; Tian, Yongqiang
2017-08-01
The numerous secondary metabolites in Streptomyces spp. are crucial for various applications. For example, cephamycin C is used as an antibiotic, and avermectin is used as an insecticide. Specifically, antibiotic yield is closely related to many factors, such as the external environment, nutrition (including nitrogen and carbon sources), biosynthetic efficiency and the regulatory mechanisms in producing strains. There are various types of regulatory genes that work in different ways, such as pleiotropic (or global) regulatory genes, cluster-situated regulators, which are also called pathway-specific regulatory genes, and many other regulators. The study of regulatory genes that influence antibiotic biosynthesis in Streptomyces spp. not only provides a theoretical basis for antibiotic biosynthesis in Streptomyces but also helps to increase the yield of antibiotics via molecular manipulation of these regulatory genes. Currently, more and more emphasis is being placed on the regulatory genes of antibiotic biosynthetic gene clusters in Streptomyces spp., and many studies on these genes have been performed to improve the yield of antibiotics in Streptomyces. This paper lists many antibiotic biosynthesis regulatory genes in Streptomyces spp. and focuses on frequently investigated regulatory genes that are involved in pathway-specific regulation and pleiotropic regulation and their applications in genetic engineering.
Roschmann, E; Wienker, T F; Gerok, W; Volk, B A
1993-12-01
Genetic susceptibility of celiac disease is primarily associated with a particular combination of and HLA-DQA1/DQB1 gene; however, this does not fully account for the genetic predisposition. Therefore, the aim of this study was to examine whether T-cell receptor (TCR) genes may be susceptibility genes in celiac disease. HLA class II typing was performed by polymerase chain reaction amplification in combination with sequence-specific oligonucleotide hybridization. TCR alpha (TCRA), TCR gamma (TCRG), and TCR beta (TCRB) loci were investigated by restriction fragment length polymorphism analysis. Allelic frequencies of TCRA, TCRG, and TCRB variable genes were compared between patients with celiac disease (n = 53) and control patients (n = 67), and relative risk (RR) estimates were calculated. The RR was 1.67 for allele C1 at TCRA1, 3.35 for allele D2 at TCRA2, 1.66 for allele B2 at TCRG, and 1.35 for allele B at TCRB, showing no significant association. Additionally, linkage analysis was performed in 23 families. The logarithm of odd scores for celiac disease vs. the TCR variable genes at TCRA, TCRG, and TCRB showed no significant linkage. These data suggest that the analyzed TCR variable gene segments V alpha 1.2, V gamma 11, and V beta 8 do not play a major role in susceptibility to celiac disease.
Ovsyannikova, Inna G; Schaid, Daniel J; Larrabee, Beth R; Haralambieva, Iana H; Kennedy, Richard B; Poland, Gregory A
2017-01-01
Human antibody response to measles vaccine is highly variable in the population. Host genes contribute to inter-individual antibody response variation. The killer cell immunoglobulin-like receptors (KIR) are recognized to interact with HLA molecules and possibly influence humoral immune response to viral antigens. To expand on and improve our previous work with HLA genes, and to explore the genetic contribution of KIR genes to the inter-individual variability in measles vaccine-induced antibody responses, we performed a large population-based study in 2,506 healthy immunized subjects (ages 11 to 41 years) to identify HLA and KIR associations with measles vaccine-induced neutralizing antibodies. After correcting for the large number of statistical tests of allele effects on measles-specific neutralizing antibody titers, no statistically significant associations were found for either HLA or KIR loci. However, suggestive associations worthy of follow-up in other cohorts include B*57:01, DQB1*06:02, and DRB1*15:05 alleles. Specifically, the B*57:01 allele (1,040 mIU/mL; p = 0.0002) was suggestive of an association with lower measles antibody titer. In contrast, the DQB1*06:02 (1,349 mIU/mL; p = 0.0004) and DRB1*15:05 (2,547 mIU/mL; p = 0.0004) alleles were suggestive of an association with higher measles antibodies. Notably, the associations with KIR genotypes were strongly nonsignificant, suggesting that KIR loci in terms of copy number and haplotypes are not likely to play a major role in antibody response to measles vaccination. These findings refine our knowledge of the role of HLA and KIR alleles in measles vaccine-induced immunity.
Adaptation to climate through flowering phenology: a case study in Medicago truncatula.
Burgarella, Concetta; Chantret, Nathalie; Gay, Laurène; Prosperi, Jean-Marie; Bonhomme, Maxime; Tiffin, Peter; Young, Nevin D; Ronfort, Joelle
2016-07-01
Local climatic conditions likely constitute an important selective pressure on genes underlying important fitness-related traits such as flowering time, and in many species, flowering phenology and climatic gradients strongly covary. To test whether climate shapes the genetic variation on flowering time genes and to identify candidate flowering genes involved in the adaptation to environmental heterogeneity, we used a large Medicago truncatula core collection to examine the association between nucleotide polymorphisms at 224 candidate genes and both climate variables and flowering phenotypes. Unlike genome-wide studies, candidate gene approaches are expected to enrich for the number of meaningful trait associations because they specifically target genes that are known to affect the trait of interest. We found that flowering time mediates adaptation to climatic conditions mainly by variation at genes located upstream in the flowering pathways, close to the environmental stimuli. Variables related to the annual precipitation regime reflected selective constraints on flowering time genes better than the other variables tested (temperature, altitude, latitude or longitude). By comparing phenotype and climate associations, we identified 12 flowering genes as the most promising candidates responsible for phenological adaptation to climate. Four of these genes were located in the known flowering time QTL region on chromosome 7. However, climate and flowering associations also highlighted largely distinct gene sets, suggesting different genetic architectures for adaptation to climate and flowering onset. © 2016 John Wiley & Sons Ltd.
Marck, Christian; Kachouri-Lafond, Rym; Lafontaine, Ingrid; Westhof, Eric; Dujon, Bernard; Grosjean, Henri
2006-01-01
We present the first comprehensive analysis of RNA polymerase III (Pol III) transcribed genes in ten yeast genomes. This set includes all tRNA genes (tDNA) and genes coding for SNR6 (U6), SNR52, SCR1 and RPR1 RNA in the nine hemiascomycetes Saccharomyces cerevisiae, Saccharomyces castellii, Candida glabrata, Kluyveromyces waltii, Kluyveromyces lactis, Eremothecium gossypii, Debaryomyces hansenii, Candida albicans, Yarrowia lipolytica and the archiascomycete Schizosaccharomyces pombe. We systematically analysed sequence specificities of tRNA genes, polymorphism, variability of introns, gene redundancy and gene clustering. Analysis of decoding strategies showed that yeasts close to S.cerevisiae use bacterial decoding rules to read the Leu CUN and Arg CGN codons, in contrast to all other known Eukaryotes. In D.hansenii and C.albicans, we identified a novel tDNA-Leu (AAG), reading the Leu CUU/CUC/CUA codons with an unusual G at position 32. A systematic ‘p-distance tree’ using the 60 variable positions of the tRNA molecule revealed that most tDNAs cluster into amino acid-specific sub-trees, suggesting that, within hemiascomycetes, orthologous tDNAs are more closely related than paralogs. We finally determined the bipartite A- and B-box sequences recognized by TFIIIC. These minimal sequences are nearly conserved throughout hemiascomycetes and were satisfactorily retrieved at appropriate locations in other Pol III genes. PMID:16600899
Guellouz, Asma; Valerio-Lepiniec, Marie; Urvoas, Agathe; Chevrel, Anne; Graille, Marc; Fourati-Kammoun, Zaineb; Desmadril, Michel; van Tilbeurgh, Herman; Minard, Philippe
2013-01-01
We previously designed a new family of artificial proteins named αRep based on a subgroup of thermostable helicoidal HEAT-like repeats. We have now assembled a large optimized αRep library. In this library, the side chains at each variable position are not fully randomized but instead encoded by a distribution of codons based on the natural frequency of side chains of the natural repeats family. The library construction is based on a polymerization of micro-genes and therefore results in a distribution of proteins with a variable number of repeats. We improved the library construction process using a "filtration" procedure to retain only fully coding modules that were recombined to recreate sequence diversity. The final library named Lib2.1 contains 1.7×10(9) independent clones. Here, we used phage display to select, from the previously described library or from the new library, new specific αRep proteins binding to four different non-related predefined protein targets. Specific binders were selected in each case. The results show that binders with various sizes are selected including relatively long sequences, with up to 7 repeats. ITC-measured affinities vary with Kd values ranging from micromolar to nanomolar ranges. The formation of complexes is associated with a significant thermal stabilization of the bound target protein. The crystal structures of two complexes between αRep and their cognate targets were solved and show that the new interfaces are established by the variable surfaces of the repeated modules, as well by the variable N-cap residues. These results suggest that αRep library is a new and versatile source of tight and specific binding proteins with favorable biophysical properties.
Chevrel, Anne; Graille, Marc; Fourati-Kammoun, Zaineb; Desmadril, Michel; van Tilbeurgh, Herman; Minard, Philippe
2013-01-01
We previously designed a new family of artificial proteins named αRep based on a subgroup of thermostable helicoidal HEAT-like repeats. We have now assembled a large optimized αRep library. In this library, the side chains at each variable position are not fully randomized but instead encoded by a distribution of codons based on the natural frequency of side chains of the natural repeats family. The library construction is based on a polymerization of micro-genes and therefore results in a distribution of proteins with a variable number of repeats. We improved the library construction process using a “filtration” procedure to retain only fully coding modules that were recombined to recreate sequence diversity. The final library named Lib2.1 contains 1.7×109 independent clones. Here, we used phage display to select, from the previously described library or from the new library, new specific αRep proteins binding to four different non-related predefined protein targets. Specific binders were selected in each case. The results show that binders with various sizes are selected including relatively long sequences, with up to 7 repeats. ITC-measured affinities vary with Kd values ranging from micromolar to nanomolar ranges. The formation of complexes is associated with a significant thermal stabilization of the bound target protein. The crystal structures of two complexes between αRep and their cognate targets were solved and show that the new interfaces are established by the variable surfaces of the repeated modules, as well by the variable N-cap residues. These results suggest that αRep library is a new and versatile source of tight and specific binding proteins with favorable biophysical properties. PMID:24014183
Fischer, Wolfgang; Breithaupt, Ute; Kern, Beate; Smith, Stella I; Spicher, Carolin; Haas, Rainer
2014-04-27
The human gastric pathogen Helicobacter pylori is a paradigm for chronic bacterial infections. Its persistence in the stomach mucosa is facilitated by several mechanisms of immune evasion and immune modulation, but also by an unusual genetic variability which might account for the capability to adapt to changing environmental conditions during long-term colonization. This variability is reflected by the fact that almost each infected individual is colonized by a genetically unique strain. Strain-specific genes are dispersed throughout the genome, but clusters of genes organized as genomic islands may also collectively be present or absent. We have comparatively analysed such clusters, which are commonly termed plasticity zones, in a high number of H. pylori strains of varying geographical origin. We show that these regions contain fixed gene sets, rather than being true regions of genome plasticity, but two different types and several subtypes with partly diverging gene content can be distinguished. Their genetic diversity is incongruent with variations in the rest of the genome, suggesting that they are subject to horizontal gene transfer within H. pylori populations. We identified 40 distinct integration sites in 45 genome sequences, with a conserved heptanucleotide motif that seems to be the minimal requirement for integration. The significant number of possible integration sites, together with the requirement for a short conserved integration motif and the high level of gene conservation, indicates that these elements are best described as integrating conjugative elements (ICEs) with an intermediate integration site specificity.
Ludeña, Bertha; Chabrillange, Nathalie; Aberlenc-Bertossi, Frédérique; Adam, Hélène; Tregear, James W.; Pintaud, Jean-Christophe
2011-01-01
Background and Aims Molecular phylogenetic studies of palms (Arecaceae) have not yet provided a fully resolved phylogeny of the family. There is a need to increase the current set of markers to resolve difficult groups such as the Neotropical subtribe Bactridinae (Arecoideae: Cocoseae). We propose the use of two single-copy nuclear genes as valuable tools for palm phylogenetics. Methods New primers were developed for the amplification of the AGAMOUS 1 (AG1) and PHYTOCHROME B (PHYB) genes. For the AGAMOUS gene, the paralogue 1 of Elaeis guineensis (EgAG1) was targeted. The region amplified contained coding sequences between the MIKC K and C MADS-box domains. For the PHYB gene, exon 1 (partial sequence) was first amplified in palm species using published degenerate primers for Poaceae, and then specific palm primers were designed. The two gene portions were sequenced in 22 species of palms representing all genera of Bactridinae, with emphasis on Astrocaryum and Hexopetion, the status of the latter genus still being debated. Key Results The new primers designed allow consistent amplification and high-quality sequencing within the palm family. The two loci studied produced more variability than chloroplast loci and equally or less variability than PRK, RPBII and ITS nuclear markers. The phylogenetic structure obtained with AG1 and PHYB genes provides new insights into intergeneric relationships within the Bactridinae and the intrageneric structure of Astrocaryum. The Hexopetion clade was recovered as monophyletic with both markers and was weakly supported as sister to Astrocaryum sensu stricto in the combined analysis. The rare Astrocaryum minus formed a species complex with Astrocaryum gynacanthum. Moreover, both AG1 and PHYB contain a microsatellite that could have further uses in species delimitation and population genetics. Conclusions AG1 and PHYB provide additional phylogenetic information within the palm family, and should prove useful in combination with other genes to improve the resolution of palm phylogenies. PMID:21828068
Site-Specific Fat-1 Knock-In Enables Significant Decrease of n-6PUFAs/n-3PUFAs Ratio in Pigs.
Li, Mengjing; Ouyang, Hongsheng; Yuan, Hongming; Li, Jianing; Xie, Zicong; Wang, Kankan; Yu, Tingting; Liu, Minghao; Chen, Xue; Tang, Xiaochun; Jiao, Huping; Pang, Daxin
2018-05-04
The fat-1 gene from Caenorhabditis elegans encodes a fatty acid desaturase which was widely studied due to its beneficial function of converting n-6 polyunsaturated fatty acids (n-6PUFAs) to n-3 polyunsaturated fatty acids (n-3PUFAs). To date, many fat-1 transgenic animals have been generated to study disease pathogenesis or improve meat quality. However, all of them were generated using a random integration method with variable transgene expression levels and the introduction of selectable marker genes often raise biosafety concern. To this end, we aimed to generate marker-free fat-1 transgenic pigs in a site-specific manner. The Rosa26 locus, first found in mouse embryonic stem cells, has become one of the most common sites for inserting transgenes due to its safe and ubiquitous expression. In our study, the fat-1 gene was inserted into porcine Rosa 26 (pRosa26) locus via Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/CRISPR-associated 9 (Cas9) system. The Southern blot analysis of our knock-in pigs indicated a single copy of the fat-1 gene at the pRosa26 locus. Furthermore, this single-copy fat-1 gene supported satisfactory expression in a variety of tissues in F1 generation pigs. Importantly, the gas chromatography analysis indicated that these fat-1 knock-in pigs exhibited a significant increase in the level of n-3PUFAs, leading to an obvious decrease in the n-6PUFAs/n-3PUFAs ratio from 9.36 to 2.12 (*** P < 0.0001). Altogether, our fat-1 knock-in pigs hold great promise for improving the nutritional value of pork and serving as an animal model to investigate therapeutic effects of n-3PUFAs on various diseases. Copyright © 2018 Li et al.
Kazemian, Majid; Zhu, Qiyun; Halfon, Marc S.; Sinha, Saurabh
2011-01-01
Despite recent advances in experimental approaches for identifying transcriptional cis-regulatory modules (CRMs, ‘enhancers’), direct empirical discovery of CRMs for all genes in all cell types and environmental conditions is likely to remain an elusive goal. Effective methods for computational CRM discovery are thus a critically needed complement to empirical approaches. However, existing computational methods that search for clusters of putative binding sites are ineffective if the relevant TFs and/or their binding specificities are unknown. Here, we provide a significantly improved method for ‘motif-blind’ CRM discovery that does not depend on knowledge or accurate prediction of TF-binding motifs and is effective when limited knowledge of functional CRMs is available to ‘supervise’ the search. We propose a new statistical method, based on ‘Interpolated Markov Models’, for motif-blind, genome-wide CRM discovery. It captures the statistical profile of variable length words in known CRMs of a regulatory network and finds candidate CRMs that match this profile. The method also uses orthologs of the known CRMs from closely related genomes. We perform in silico evaluation of predicted CRMs by assessing whether their neighboring genes are enriched for the expected expression patterns. This assessment uses a novel statistical test that extends the widely used Hypergeometric test of gene set enrichment to account for variability in intergenic lengths. We find that the new CRM prediction method is superior to existing methods. Finally, we experimentally validate 12 new CRM predictions by examining their regulatory activity in vivo in Drosophila; 10 of the tested CRMs were found to be functional, while 6 of the top 7 predictions showed the expected activity patterns. We make our program available as downloadable source code, and as a plugin for a genome browser installed on our servers. PMID:21821659
Mechanisms of gap gene expression canalization in the Drosophila blastoderm.
Gursky, Vitaly V; Panok, Lena; Myasnikova, Ekaterina M; Manu; Samsonova, Maria G; Reinitz, John; Samsonov, Alexander M
2011-01-01
Extensive variation in early gap gene expression in the Drosophila blastoderm is reduced over time because of gap gene cross regulation. This phenomenon is a manifestation of canalization, the ability of an organism to produce a consistent phenotype despite variations in genotype or environment. The canalization of gap gene expression can be understood as arising from the actions of attractors in the gap gene dynamical system. In order to better understand the processes of developmental robustness and canalization in the early Drosophila embryo, we investigated the dynamical effects of varying spatial profiles of Bicoid protein concentration on the formation of the expression border of the gap gene hunchback. At several positions on the anterior-posterior axis of the embryo, we analyzed attractors and their basins of attraction in a dynamical model describing expression of four gap genes with the Bicoid concentration profile accounted as a given input in the model equations. This model was tested against a family of Bicoid gradients obtained from individual embryos. These gradients were normalized by two independent methods, which are based on distinct biological hypotheses and provide different magnitudes for Bicoid spatial variability. We showed how the border formation is dictated by the biological initial conditions (the concentration gradient of maternal Hunchback protein) being attracted to specific attracting sets in a local vicinity of the border. Different types of these attracting sets (point attractors or one dimensional attracting manifolds) define several possible mechanisms of border formation. The hunchback border formation is associated with intersection of the spatial gradient of the maternal Hunchback protein and a boundary between the attraction basins of two different point attractors. We demonstrated how the positional variability for hunchback is related to the corresponding variability of the basin boundaries. The observed reduction in variability of the hunchback gene expression can be accounted for by specific geometrical properties of the basin boundaries. We clarified the mechanisms of gap gene expression canalization in early Drosophila embryos. These mechanisms were specified in the case of hunchback in well defined terms of the dynamical system theory.
Centanni, T M; Pantazis, D; Truong, D T; Gruen, J R; Gabrieli, J D E; Hogan, T P
2018-05-26
Individuals with dyslexia exhibit increased brainstem variability in response to sound. It is unknown as to whether increased variability extends to neocortical regions associated with audition and reading, extends to visual stimuli, and whether increased variability characterizes all children with dyslexia or, instead, a specific subset of children. We evaluated the consistency of stimulus-evoked neural responses in children with (N = 20) or without dyslexia (N = 12) as measured by magnetoencephalography (MEG). Approximately half of the children with dyslexia had significantly higher levels of variability in cortical responses to both auditory and visual stimuli in multiple nodes of the reading network. There was a significant and positive relationship between the number of risk alleles at rs6935076 in the dyslexia-susceptibility gene KIAA0319 and the degree of neural variability in primary auditory cortex across all participants. This gene has been linked with neural variability in rodents and in typical readers. These findings indicate that unstable representations of auditory and visual stimuli in auditory and other reading-related neocortical regions are present in a subset of children with dyslexia and support the link between the gene KIAA0319 and the auditory neural variability across children with or without dyslexia. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Chen, Zhiyuan; Hagen, Darren E.; Elsik, Christine G.; Ji, Tieming; Morris, Collin James; Moon, Laura Emily; Rivera, Rocío Melissa
2015-01-01
Embryos generated with the use of assisted reproductive technologies (ART) can develop overgrowth syndromes. In ruminants, the condition is referred to as large offspring syndrome (LOS) and exhibits variable phenotypic abnormalities including overgrowth, enlarged tongue, and abdominal wall defects. These characteristics recapitulate those observed in the human loss-of-imprinting (LOI) overgrowth syndrome Beckwith–Wiedemann (BWS). We have recently shown LOI at the KCNQ1 locus in LOS, the most common epimutation in BWS. Although the first case of ART-induced LOS was reported in 1995, studies have not yet determined the extent of LOI in this condition. Here, we determined allele-specific expression of imprinted genes previously identified in human and/or mouse in day ∼105 Bos taurus indicus × Bos taurus taurus F1 hybrid control and LOS fetuses using RNAseq. Our analysis allowed us to determine the monoallelic expression of 20 genes in tissues of control fetuses. LOS fetuses displayed variable LOI compared with controls. Biallelic expression of imprinted genes in LOS was associated with tissue-specific hypomethylation of the normally methylated parental allele. In addition, a positive correlation was observed between body weight and the number of biallelically expressed imprinted genes in LOS fetuses. Furthermore, not only was there loss of allele-specific expression of imprinted genes in LOS, but also differential transcript amounts of these genes between control and overgrown fetuses. In summary, we characterized previously unidentified imprinted genes in bovines and identified misregulation of imprinting at multiple loci in LOS. We concluded that LOS is a multilocus LOI syndrome, as is BWS. PMID:25825726
Chen, Zhiyuan; Hagen, Darren E; Elsik, Christine G; Ji, Tieming; Morris, Collin James; Moon, Laura Emily; Rivera, Rocío Melissa
2015-04-14
Embryos generated with the use of assisted reproductive technologies (ART) can develop overgrowth syndromes. In ruminants, the condition is referred to as large offspring syndrome (LOS) and exhibits variable phenotypic abnormalities including overgrowth, enlarged tongue, and abdominal wall defects. These characteristics recapitulate those observed in the human loss-of-imprinting (LOI) overgrowth syndrome Beckwith-Wiedemann (BWS). We have recently shown LOI at the KCNQ1 locus in LOS, the most common epimutation in BWS. Although the first case of ART-induced LOS was reported in 1995, studies have not yet determined the extent of LOI in this condition. Here, we determined allele-specific expression of imprinted genes previously identified in human and/or mouse in day ∼105 Bos taurus indicus × Bos taurus taurus F1 hybrid control and LOS fetuses using RNAseq. Our analysis allowed us to determine the monoallelic expression of 20 genes in tissues of control fetuses. LOS fetuses displayed variable LOI compared with controls. Biallelic expression of imprinted genes in LOS was associated with tissue-specific hypomethylation of the normally methylated parental allele. In addition, a positive correlation was observed between body weight and the number of biallelically expressed imprinted genes in LOS fetuses. Furthermore, not only was there loss of allele-specific expression of imprinted genes in LOS, but also differential transcript amounts of these genes between control and overgrown fetuses. In summary, we characterized previously unidentified imprinted genes in bovines and identified misregulation of imprinting at multiple loci in LOS. We concluded that LOS is a multilocus LOI syndrome, as is BWS.
Lambers Heerspink, Hiddo J; Oberbauer, Rainer; Perco, Paul; Heinzel, Andreas; Heinze, Georg; Mayer, Gert; Mayer, Bernd
2015-08-01
Diabetic kidney disease (DKD) is a complex, multifactorial disease and is associated with a high risk of renal and cardiovascular morbidity and mortality. Clinical practice guidelines for diabetes recommend essentially identical treatments for all patients without taking into account how the individual responds to the instituted therapy. Yet, individuals vary widely in how they respond to medications and therefore optimal therapy differs between individuals. Understanding the underlying molecular mechanisms of variability in drug response will help tailor optimal therapy. Polymorphisms in genes related to drug pharmacokinetics have been used to explore mechanisms of response variability in DKD, but with limited success. The complex interaction between genetic make-up and environmental factors on the abundance of proteins and metabolites renders pharmacogenomics alone insufficient to fully capture response variability. A complementary approach is to attribute drug response variability to individual variability in underlying molecular mechanisms involved in the progression of disease. The interplay of different processes (e.g. inflammation, fibrosis, angiogenesis, oxidative stress) appears to drive disease progression, but the individual contribution of each process varies. Drugs at the other hand address specific targets and thereby interfere in certain disease-associated processes. At this level, biomarkers may help to gain insight into which specific pathophysiological processes are involved in an individual followed by a rational assessment whether a specific drug's mode of action indeed targets the relevant process at hand. This article describes the conceptual background and data-driven workflow developed by the SysKid consortium aimed at improving characterization of the molecular mechanisms underlying DKD at the interference of the molecular impact of individual drugs in order to tailor optimal therapy to individual patients. © The Author 2015. Published by Oxford University Press on behalf of ERA-EDTA. All rights reserved.
Zambon, Carlo-Federico; Prayer-Galetti, Tommaso; Basso, Daniela; Padoan, Andrea; Rossi, Elisa; Secco, Silvia; Pelloso, Michela; Fogar, Paola; Navaglia, Filippo; Moz, Stefania; Zattoni, Filiberto; Plebani, Mario
2012-10-01
Of serum prostate specific antigen variability 40% depends on inherited factors. We ascertained whether the knowledge of KLK3 genetics would enhance prostate specific antigen diagnostic performance in patients with clinical suspicion of prostate cancer. We studied 1,058 men who consecutively underwent prostate biopsy for clinical suspicion of prostate cancer. At histology prostate cancer was present in 401 cases and absent in 657. Serum total prostate specific antigen and the free-to-total prostate specific antigen ratio were determined. Four polymorphisms of the KLK3 gene (rs2569733, rs2739448, rs925013 and rs2735839) and 1 polymorphism of the SRD5A2 gene (rs523349) were studied. The influence of genetics on prostate specific antigen variability was evaluated by multivariate linear regression analysis. The performance of total prostate specific antigen and the free-to-total prostate specific antigen ratio alone or combined with a genetically based patient classification were defined by ROC curve analyses. For prostate cancer diagnosis the free-to-total prostate specific antigen ratio index alone (cutoff 11%) was superior to total prostate specific antigen (cutoff 4 ng/ml) and to free-to-total prostate specific antigen ratio reflex testing (positive predictive value 61%, 43% and 54%, respectively). Prostate specific antigen correlated with KLK3 genetics (rs2735839 polymorphism p = 0.001, and rs2569733, rs2739448 and rs925013 haplotype combination p = 0.003). In patients with different KLK3 genetics 2 optimal free-to-total prostate specific antigen ratio cutoffs (11% and 14.5%) were found. For free-to-total prostate specific antigen ratio values between 11% and 14.5% the prostate cancer probability ranged from 30.0% to 47.4% according to patient genetics. The free-to-total prostate specific antigen ratio is superior to total prostate specific antigen for prostate cancer diagnosis, independent of total prostate specific antigen results. Free-to-total prostate specific antigen ratio findings below 11% are positively associated with prostate cancer and those above 14.5% are negatively associated with prostate cancer, while the interpretation of those between 11% and 14.5% is improved by patient KLK3 genetic analysis. Copyright © 2012 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
Variability of Creatine Metabolism Genes in Children with Autism Spectrum Disorder.
Cameron, Jessie M; Levandovskiy, Valeriy; Roberts, Wendy; Anagnostou, Evdokia; Scherer, Stephen; Loh, Alvin; Schulze, Andreas
2017-07-31
Creatine deficiency syndrome (CDS) comprises three separate enzyme deficiencies with overlapping clinical presentations: arginine:glycine amidinotransferase ( GATM gene, glycine amidinotransferase), guanidinoacetate methyltransferase ( GAMT gene), and creatine transporter deficiency ( SLC6A8 gene, solute carrier family 6 member 8). CDS presents with developmental delays/regression, intellectual disability, speech and language impairment, autistic behaviour, epileptic seizures, treatment-refractory epilepsy, and extrapyramidal movement disorders; symptoms that are also evident in children with autism. The objective of the study was to test the hypothesis that genetic variability in creatine metabolism genes is associated with autism. We sequenced GATM , GAMT and SLC6A8 genes in 166 patients with autism (coding sequence, introns and adjacent untranslated regions). A total of 29, 16 and 25 variants were identified in each gene, respectively. Four variants were novel in GATM , and 5 in SLC6A8 (not present in the 1000 Genomes, Exome Sequencing Project (ESP) or Exome Aggregation Consortium (ExAC) databases). A single variant in each gene was identified as non-synonymous, and computationally predicted to be potentially damaging. Nine variants in GATM were shown to have a lower minor allele frequency (MAF) in the autism population than in the 1000 Genomes database, specifically in the East Asian population (Fisher's exact test). Two variants also had lower MAFs in the European population. In summary, there were no apparent associations of variants in GAMT and SLC6A8 genes with autism. The data implying there could be a lower association of some specific GATM gene variants with autism is an observation that would need to be corroborated in a larger group of autism patients, and with sub-populations of Asian ethnicities. Overall, our findings suggest that the genetic variability of creatine synthesis/transport is unlikely to play a part in the pathogenesis of autism spectrum disorder (ASD) in children.
USDA-ARS?s Scientific Manuscript database
Genetic factors, specifically the VKORC1 and GGCX genes, have been shown to contribute to the interindividual variability in response to the vitamin K-antagonist, warfarin, which influences the dose required to achieve the desired anticoagulation response. These differences in warfarin sensitivity ...
Sotelo, Pablo H.; Collazo, Noberto; Zuñiga, Roberto; Gutiérrez-González, Matías; Catalán, Diego; Ribeiro, Carolina Hager; Aguillón, Juan Carlos; Molina, María Carmen
2012-01-01
Phage display library technology is a common method to produce human antibodies. In this technique, the immunoglobulin variable regions are displayed in a bacteriophage in a way that each filamentous virus displays the product of a single antibody gene on its surface. From the collection of different phages, it is possible to isolate the virus that recognizes specific targets. The most common form in which to display antibody variable regions in the phage is the single chain variable fragment format (scFv), which requires assembly of the heavy and light immunoglobulin variable regions in a single gene. In this work, we describe a simple and efficient method for the assembly of immunoglobulin heavy and light chain variable regions in a scFv format. This procedure involves a two-step reaction: (1) DNA amplification to produce the single strand form of the heavy or light chain gene required for the fusion; and (2) mixture of both single strand products followed by an assembly reaction to construct a complete scFv gene. Using this method, we produced 6-fold more scFv encoding DNA than the commonly used splicing by overlap extension PCR (SOE-PCR) approach. The scFv gene produced by this method also proved to be efficient in generating a diverse scFv phage display library. From this scFv library, we obtained phages that bound several non-related antigens, including recombinant proteins and rotavirus particles. PMID:22692130
A Genome-Wide Landscape of Retrocopies in Primate Genomes.
Navarro, Fábio C P; Galante, Pedro A F
2015-07-29
Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old Word Monkeys, including humans), but a surprising large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and no expression correlation to their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Zera, Anthony J; Vellichirammal, Neetha Nanoth; Brisson, Jennifer A
2018-04-12
The functional basis of life history adaptation is a key topic of research in life history evolution. Studies of wing-polymorphism in the cricket Gryllus firmus have played a prominent role in this field. However, prior in-depth investigations of morph specialization have primarily focused on a single hormone, juvenile hormone, and a single aspect of intermediary metabolism, the fatty-acid biosynthetic component of lipid metabolism. Moreover, the role of diurnal variation in life history adaptation in G. firmus has been understudied, as is the case for organisms in general. Here, we identify genes whose expression differs consistently between the morphs independent of time-of-day during early adulthood, as well as genes that exhibit a strong pattern of morph-specific diurnal expression. We find strong, consistent, morph-specific differences in the expression of genes involved in endocrine regulation, carbohydrate and lipid metabolism, and immunity - in particular, in the expression of an insulin-like-peptide precursor gene and genes involved in triglyceride production. We also find that the flight-capable morph exhibited a substantially greater number of genes exhibiting diurnal change in gene expression compared with the flightless morph, correlated with the greater circadian change in the hemolymph juvenile titer in the dispersing morph. In fact, diurnal differences in expression within the dispersing morph at different times of the day were significantly greater in magnitude than differences between dispersing and flightless morphs at the same time-of-day. These results provide important baseline information regarding the potential role of variable gene expression on life history specialization in morphs of G. firmus, and the first information on genetically-variable, diurnal change in gene expression, associated with a key life history polymorphism. These results also suggest the existence of prominent morph-specific circadian differences in gene expression in G. firmus, possibly caused by the morph-specific circadian rhythm in the juvenile hormone titer. Copyright © 2018 Elsevier Ltd. All rights reserved.
Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data
2013-01-01
Background Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. The accurate predictions of tissue-specific gene targets could provide useful information for biomarker development and drug target identification. Results In this study, we have developed a machine learning approach for predicting the human tissue-specific genes using microarray expression data. The lists of known tissue-specific genes for different tissues were collected from UniProt database, and the expression data retrieved from the previously compiled dataset according to the lists were used for input vector encoding. Random Forests (RFs) and Support Vector Machines (SVMs) were used to construct accurate classifiers. The RF classifiers were found to outperform SVM models for tissue-specific gene prediction. The results suggest that the candidate genes for brain or liver specific expression can provide valuable information for further experimental studies. Our approach was also applied for identifying tissue-selective gene targets for different types of tissues. Conclusions A machine learning approach has been developed for accurately identifying the candidate genes for tissue specific/selective expression. The approach provides an efficient way to select some interesting genes for developing new biomedical markers and improve our knowledge of tissue-specific expression. PMID:23369200
Robinson, Joshua F; Theunissen, Peter T; van Dartel, Dorien A M; Pennings, Jeroen L; Faustman, Elaine M; Piersma, Aldert H
2011-09-01
Toxicogenomic evaluations may improve toxicity prediction of in vitro-based developmental models, such as whole embryo culture (WEC) and embryonic stem cells (ESC), by providing a robust mechanistic marker which can be linked with responses associated with developmental toxicity in vivo. While promising in theory, toxicogenomic comparisons between in vivo and in vitro models are complex due to inherent differences in model characteristics and experimental design. Determining factors which influence these global comparisons are critical in the identification of reliable mechanistic-based markers of developmental toxicity. In this study, we compared available toxicogenomic data assessing the impact of the known teratogen, methylmercury (MeHg) across a diverse set of in vitro and in vivo models to investigate the impact of experimental variables (i.e. model, dose, time) on our comparative assessments. We evaluated common and unique aspects at both the functional (Gene Ontology) and gene level of MeHg-induced response. At the functional level, we observed stronger similarity in MeHg-response between mouse embryos exposed in utero (2 studies), ESC, and WEC as compared to liver, brain and mouse embryonic fibroblast MeHg studies. These findings were strongly correlated to the presence of a MeHg-induced developmentally related gene signature. In addition, we identified specific MeHg-induced gene expression alterations associated with developmental signaling and heart development across WEC, ESC and in vivo systems. However, the significance of overlap between studies was highly dependent on traditional experimental variables (i.e. dose, time). In summary, we identify promising examples of unique gene expression responses which show in vitro-in vivo similarities supporting the relevance of in vitro developmental models for predicting in vivo developmental toxicity. Copyright © 2011 Elsevier Inc. All rights reserved.
Chung, Tai-Chun; Jones, Charles H; Gollakota, Akhila; Kamal Ahmadi, Mahmoud; Rane, Snehal; Zhang, Guojian; Pfeifer, Blaine A
2015-05-04
Bactofection offers a gene delivery option particularly useful in the context of immune modulation. The bacterial host naturally attracts recognition and cellular uptake by antigen presenting cells (APCs) as the initial step in triggering an immune response. Moreover, depending on the bacterial vector, molecular biology tools are available to influence and/or overcome additional steps and barriers to effective antigen presentation. In this work, molecular engineering was applied using Escherichia coli as a bactofection vector. In particular, the bacteriophage ΦX174 lysis E (LyE) gene was designed for variable expression across strains containing different levels of lysteriolysin O (LLO). The objective was to generate a bacterial vector with improved attenuation and delivery characteristics. The resulting strains exhibited enhanced gene and protein release and inducible cellular death. In addition, the new vectors demonstrated improved gene delivery and cytotoxicity profiles to RAW264.7 macrophage APCs.
An Independent Filter for Gene Set Testing Based on Spectral Enrichment.
Frost, H Robert; Li, Zhigang; Asselbergs, Folkert W; Moore, Jason H
2015-01-01
Gene set testing has become an indispensable tool for the analysis of high-dimensional genomic data. An important motivation for testing gene sets, rather than individual genomic variables, is to improve statistical power by reducing the number of tested hypotheses. Given the dramatic growth in common gene set collections, however, testing is often performed with nearly as many gene sets as underlying genomic variables. To address the challenge to statistical power posed by large gene set collections, we have developed spectral gene set filtering (SGSF), a novel technique for independent filtering of gene set collections prior to gene set testing. The SGSF method uses as a filter statistic the p-value measuring the statistical significance of the association between each gene set and the sample principal components (PCs), taking into account the significance of the associated eigenvalues. Because this filter statistic is independent of standard gene set test statistics under the null hypothesis but dependent under the alternative, the proportion of enriched gene sets is increased without impacting the type I error rate. As shown using simulated and real gene expression data, the SGSF algorithm accurately filters gene sets unrelated to the experimental outcome resulting in significantly increased gene set testing power.
Akimoto, Yuki; Yugi, Katsuyuki; Uda, Shinsuke; Kudo, Takamasa; Komori, Yasunori; Kubota, Hiroyuki; Kuroda, Shinya
2013-01-01
Cells use common signaling molecules for the selective control of downstream gene expression and cell-fate decisions. The relationship between signaling molecules and downstream gene expression and cellular phenotypes is a multiple-input and multiple-output (MIMO) system and is difficult to understand due to its complexity. For example, it has been reported that, in PC12 cells, different types of growth factors activate MAP kinases (MAPKs) including ERK, JNK, and p38, and CREB, for selective protein expression of immediate early genes (IEGs) such as c-FOS, c-JUN, EGR1, JUNB, and FOSB, leading to cell differentiation, proliferation and cell death; however, how multiple-inputs such as MAPKs and CREB regulate multiple-outputs such as expression of the IEGs and cellular phenotypes remains unclear. To address this issue, we employed a statistical method called partial least squares (PLS) regression, which involves a reduction of the dimensionality of the inputs and outputs into latent variables and a linear regression between these latent variables. We measured 1,200 data points for MAPKs and CREB as the inputs and 1,900 data points for IEGs and cellular phenotypes as the outputs, and we constructed the PLS model from these data. The PLS model highlighted the complexity of the MIMO system and growth factor-specific input-output relationships of cell-fate decisions in PC12 cells. Furthermore, to reduce the complexity, we applied a backward elimination method to the PLS regression, in which 60 input variables were reduced to 5 variables, including the phosphorylation of ERK at 10 min, CREB at 5 min and 60 min, AKT at 5 min and JNK at 30 min. The simple PLS model with only 5 input variables demonstrated a predictive ability comparable to that of the full PLS model. The 5 input variables effectively extracted the growth factor-specific simple relationships within the MIMO system in cell-fate decisions in PC12 cells.
Karan, Ratna; DeLeon, Teresa; Biradar, Hanamareddy; Subudhi, Prasanta K.
2012-01-01
Background Salinity is a major environmental factor limiting productivity of crop plants including rice in which wide range of natural variability exists. Although recent evidences implicate epigenetic mechanisms for modulating the gene expression in plants under environmental stresses, epigenetic changes and their functional consequences under salinity stress in rice are underexplored. DNA methylation is one of the epigenetic mechanisms regulating gene expression in plant’s responses to environmental stresses. Better understanding of epigenetic regulation of plant growth and response to environmental stresses may create novel heritable variation for crop improvement. Methodology/Principal Findings Methylation sensitive amplification polymorphism (MSAP) technique was used to assess the effect of salt stress on extent and patterns of DNA methylation in four genotypes of rice differing in the degree of salinity tolerance. Overall, the amount of DNA methylation was more in shoot compared to root and the contribution of fully methylated loci was always more than hemi-methylated loci. Sequencing of ten randomly selected MSAP fragments indicated gene-body specific DNA methylation of retrotransposons, stress responsive genes, and chromatin modification genes, distributed on different rice chromosomes. Bisulphite sequencing and quantitative RT-PCR analysis of selected MSAP loci showed that cytosine methylation changes under salinity as well as gene expression varied with genotypes and tissue types irrespective of the level of salinity tolerance of rice genotypes. Conclusions/Significance The gene body methylation may have an important role in regulating gene expression in organ and genotype specific manner under salinity stress. Association between salt tolerance and methylation changes observed in some cases suggested that many methylation changes are not “directed”. The natural genetic variation for salt tolerance observed in rice germplasm may be independent of the extent and pattern of DNA methylation which may have been induced by abiotic stress followed by accumulation through the natural selection process. PMID:22761959
Severino, Patricia; Alvares, Adriana M; Michaluart, Pedro; Okamoto, Oswaldo K; Nunes, Fabio D; Moreira-Filho, Carlos A; Tajara, Eloiza H
2008-01-01
Background Oral squamous cell carcinoma (OSCC) is a frequent neoplasm, which is usually aggressive and has unpredictable biological behavior and unfavorable prognosis. The comprehension of the molecular basis of this variability should lead to the development of targeted therapies as well as to improvements in specificity and sensitivity of diagnosis. Results Samples of primary OSCCs and their corresponding surgical margins were obtained from male patients during surgery and their gene expression profiles were screened using whole-genome microarray technology. Hierarchical clustering and Principal Components Analysis were used for data visualization and One-way Analysis of Variance was used to identify differentially expressed genes. Samples clustered mostly according to disease subsite, suggesting molecular heterogeneity within tumor stages. In order to corroborate our results, two publicly available datasets of microarray experiments were assessed. We found significant molecular differences between OSCC anatomic subsites concerning groups of genes presently or potentially important for drug development, including mRNA processing, cytoskeleton organization and biogenesis, metabolic process, cell cycle and apoptosis. Conclusion Our results corroborate literature data on molecular heterogeneity of OSCCs. Differences between disease subsites and among samples belonging to the same TNM class highlight the importance of gene expression-based classification and challenge the development of targeted therapies. PMID:19014556
Multiple-input multiple-output causal strategies for gene selection.
Bontempi, Gianluca; Haibe-Kains, Benjamin; Desmedt, Christine; Sotiriou, Christos; Quackenbush, John
2011-11-25
Traditional strategies for selecting variables in high dimensional classification problems aim to find sets of maximally relevant variables able to explain the target variations. If these techniques may be effective in generalization accuracy they often do not reveal direct causes. The latter is essentially related to the fact that high correlation (or relevance) does not imply causation. In this study, we show how to efficiently incorporate causal information into gene selection by moving from a single-input single-output to a multiple-input multiple-output setting. We show in synthetic case study that a better prioritization of causal variables can be obtained by considering a relevance score which incorporates a causal term. In addition we show, in a meta-analysis study of six publicly available breast cancer microarray datasets, that the improvement occurs also in terms of accuracy. The biological interpretation of the results confirms the potential of a causal approach to gene selection. Integrating causal information into gene selection algorithms is effective both in terms of prediction accuracy and biological interpretation.
Tettelin, Hervé; Masignani, Vega; Cieslewicz, Michael J.; Donati, Claudio; Medini, Duccio; Ward, Naomi L.; Angiuoli, Samuel V.; Crabtree, Jonathan; Jones, Amanda L.; Durkin, A. Scott; DeBoy, Robert T.; Davidsen, Tanja M.; Mora, Marirosa; Scarselli, Maria; Margarit y Ros, Immaculada; Peterson, Jeremy D.; Hauser, Christopher R.; Sundaram, Jaideep P.; Nelson, William C.; Madupu, Ramana; Brinkac, Lauren M.; Dodson, Robert J.; Rosovitz, Mary J.; Sullivan, Steven A.; Daugherty, Sean C.; Haft, Daniel H.; Selengut, Jeremy; Gwinn, Michelle L.; Zhou, Liwei; Zafar, Nikhat; Khouri, Hoda; Radune, Diana; Dimitrov, George; Watkins, Kisha; O'Connor, Kevin J. B.; Smith, Shannon; Utterback, Teresa R.; White, Owen; Rubens, Craig E.; Grandi, Guido; Madoff, Lawrence C.; Kasper, Dennis L.; Telford, John L.; Wessels, Michael R.; Rappuoli, Rino; Fraser, Claire M.
2005-01-01
The development of efficient and inexpensive genome sequencing methods has revolutionized the study of human bacterial pathogens and improved vaccine design. Unfortunately, the sequence of a single genome does not reflect how genetic variability drives pathogenesis within a bacterial species and also limits genome-wide screens for vaccine candidates or for antimicrobial targets. We have generated the genomic sequence of six strains representing the five major disease-causing serotypes of Streptococcus agalactiae, the main cause of neonatal infection in humans. Analysis of these genomes and those available in databases showed that the S. agalactiae species can be described by a pan-genome consisting of a core genome shared by all isolates, accounting for ≈80% of any single genome, plus a dispensable genome consisting of partially shared and strain-specific genes. Mathematical extrapolation of the data suggests that the gene reservoir available for inclusion in the S. agalactiae pan-genome is vast and that unique genes will continue to be identified even after sequencing hundreds of genomes. PMID:16172379
Dzialo, Magdalena; Szopa, Jan; Czuj, Tadeusz; Zuk, Magdalena
2017-01-01
Chalcone synthase (CHS) has been recognized as an essential enzyme in the phenylpropanoid biosynthesis pathway. Apart from the leading role in the production of phenolic compounds with many valuable biological activities beneficial to biomedicine, CHS is well appreciated in science. Genetic engineering greatly facilitates expanding knowledge on the function and genetics of CHS in plants. The CHS gene is one of the most intensively studied genes in flax. In our study, we investigated engineering of the CHS gene through genetic and epigenetic approaches. Considering the numerous restrictions concerning the application of genetically modified (GM) crops, the main purpose of this research was optimization of the plant's modulation via epigenetics. In our study, plants modified through two methods were compared: a widely popular agrotransformation and a relatively recent oligodeoxynucleotide (ODN) strategy. It was recently highlighted that the ODN technique can be a rapid and time-serving antecedent in quick analysis of gene function before taking vector-mediated transformation. In order to understand the molecular background of epigenetic variation in more detail and evaluate the use of ODNs as a tool for predictable and stable gene engineering, we concentrated on the integration of gene expression and gene-body methylation. The treatment of flax with a series of short oligonucleotides homologous to a different part of CHS gene isoforms revealed that those directed to regulatory gene regions (5′- and 3′-UTR) activated gene expression, directed to non-coding region (introns) caused gen activity reduction, while those homologous to a coding region may have a variable influence on its activity. Gene expression changes were accompanied by changes in its methylation status. However, only certain (CCGG) motifs along the gene sequence were affected. The analyzed DNA motifs of the CHS flax gene are more accessible for methylation when located within a CpG island. The methylation motifs also led to rearrangement of the nucleosome location. The obtained results suggest high specificity of ODN action and establish a potential valuable alternative for improvement of crops. PMID:28555142
Dzialo, Magdalena; Szopa, Jan; Czuj, Tadeusz; Zuk, Magdalena
2017-01-01
Chalcone synthase (CHS) has been recognized as an essential enzyme in the phenylpropanoid biosynthesis pathway. Apart from the leading role in the production of phenolic compounds with many valuable biological activities beneficial to biomedicine, CHS is well appreciated in science. Genetic engineering greatly facilitates expanding knowledge on the function and genetics of CHS in plants. The CHS gene is one of the most intensively studied genes in flax. In our study, we investigated engineering of the CHS gene through genetic and epigenetic approaches. Considering the numerous restrictions concerning the application of genetically modified (GM) crops, the main purpose of this research was optimization of the plant's modulation via epigenetics. In our study, plants modified through two methods were compared: a widely popular agrotransformation and a relatively recent oligodeoxynucleotide (ODN) strategy. It was recently highlighted that the ODN technique can be a rapid and time-serving antecedent in quick analysis of gene function before taking vector-mediated transformation. In order to understand the molecular background of epigenetic variation in more detail and evaluate the use of ODNs as a tool for predictable and stable gene engineering, we concentrated on the integration of gene expression and gene-body methylation. The treatment of flax with a series of short oligonucleotides homologous to a different part of CHS gene isoforms revealed that those directed to regulatory gene regions (5'- and 3'-UTR) activated gene expression, directed to non-coding region (introns) caused gen activity reduction, while those homologous to a coding region may have a variable influence on its activity. Gene expression changes were accompanied by changes in its methylation status. However, only certain (CCGG) motifs along the gene sequence were affected. The analyzed DNA motifs of the CHS flax gene are more accessible for methylation when located within a CpG island. The methylation motifs also led to rearrangement of the nucleosome location. The obtained results suggest high specificity of ODN action and establish a potential valuable alternative for improvement of crops.
Strong motion deficits in dyslexia associated with DCDC2 gene alteration.
Cicchini, Guido Marco; Marino, Cecilia; Mascheretti, Sara; Perani, Daniela; Morrone, Maria Concetta
2015-05-27
Dyslexia is a specific impairment in reading that affects 1 in 10 people. Previous studies have failed to isolate a single cause of the disorder, but several candidate genes have been reported. We measured motion perception in two groups of dyslexics, with and without a deletion within the DCDC2 gene, a risk gene for dyslexia. We found impairment for motion particularly strong at high spatial frequencies in the population carrying the deletion. The data suggest that deficits in motion processing occur in a specific genotype, rather than the entire dyslexia population, contributing to the large variability in impairment of motion thresholds in dyslexia reported in the literature. Copyright © 2015 the authors 0270-6474/15/358059-06$15.00/0.
Santini, A C; Magalhães, J T; Cascardo, J C M; Corrêa, R X
2016-04-28
Chromobacterium violaceum is a free-living Gram-negative bacillus usually found in the water and soil in tropical regions, which causes infections in humans. Chromobacteriosis is characterized by rapid dissemination and high mortality. The aim of this study was to detect the genetic variability among C. violaceum type strain ATCC 12472, and seven isolates from the environment and one from a pulmonary secretion from a chromobacteriosis patient from Ilhéus, Bahia. The molecular characterization of all samples was performed by polymerase chain reaction (PCR) sequencing and 16S rDNA analysis. Primers specific for two ATCC 12472 pathogenicity genes, hilA and yscD, as well as random amplified polymorphic DNA (RAPD), were used for PCR amplification and comparative sequencing of the products. For a more specific approach, the PCR products of 16S rDNA were digested with restriction enzymes. Seven of the samples, including type-strain ATCC 12472, were amplified by the hilA primers; these were subsequently sequenced. Gene yscD was amplified only in type-strain ATCC 12472. MspI and AluI digestion revealed 16S rDNA polymorphisms. This data allowed the generation of a dendogram for each analysis. The isolates of C. violaceum have variability in random genomic regions demonstrated by RAPD. Also, these isolates have variability in pathogenicity genes, as demonstrated by sequencing and restriction enzyme digestion.
Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic
Yebra, Gonzalo; Hodcroft, Emma B.; Ragonnet-Cronin, Manon L.; Pillay, Deenan; Brown, Andrew J. Leigh; Fraser, Christophe; Kellam, Paul; de Oliveira, Tulio; Dennis, Ann; Hoppe, Anne; Kityo, Cissy; Frampton, Dan; Ssemwanga, Deogratius; Tanser, Frank; Keshani, Jagoda; Lingappa, Jairam; Herbeck, Joshua; Wawer, Maria; Essex, Max; Cohen, Myron S.; Paton, Nicholas; Ratmann, Oliver; Kaleebu, Pontiano; Hayes, Richard; Fidler, Sarah; Quinn, Thomas; Novitsky, Vladimir; Haywards, Andrew; Nastouli, Eleni; Morris, Steven; Clark, Duncan; Kozlakidis, Zisis
2016-01-01
HIV molecular epidemiology studies analyse viral pol gene sequences due to their availability, but whole genome sequencing allows to use other genes. We aimed to determine what gene(s) provide(s) the best approximation to the real phylogeny by analysing a simulated epidemic (created as part of the PANGEA_HIV project) with a known transmission tree. We sub-sampled a simulated dataset of 4662 sequences into different combinations of genes (gag-pol-env, gag-pol, gag, pol, env and partial pol) and sampling depths (100%, 60%, 20% and 5%), generating 100 replicates for each case. We built maximum-likelihood trees for each combination using RAxML (GTR + Γ), and compared their topologies to the corresponding true tree’s using CompareTree. The accuracy of the trees was significantly proportional to the length of the sequences used, with the gag-pol-env datasets showing the best performance and gag and partial pol sequences showing the worst. The lowest sampling depths (20% and 5%) greatly reduced the accuracy of tree reconstruction and showed high variability among replicates, especially when using the shortest gene datasets. In conclusion, using longer sequences derived from nearly whole genomes will improve the reliability of phylogenetic reconstruction. With low sample coverage, results can be highly variable, particularly when based on short sequences. PMID:28008945
Yebra, Gonzalo; Hodcroft, Emma B; Ragonnet-Cronin, Manon L; Pillay, Deenan; Brown, Andrew J Leigh
2016-12-23
HIV molecular epidemiology studies analyse viral pol gene sequences due to their availability, but whole genome sequencing allows to use other genes. We aimed to determine what gene(s) provide(s) the best approximation to the real phylogeny by analysing a simulated epidemic (created as part of the PANGEA_HIV project) with a known transmission tree. We sub-sampled a simulated dataset of 4662 sequences into different combinations of genes (gag-pol-env, gag-pol, gag, pol, env and partial pol) and sampling depths (100%, 60%, 20% and 5%), generating 100 replicates for each case. We built maximum-likelihood trees for each combination using RAxML (GTR + Γ), and compared their topologies to the corresponding true tree's using CompareTree. The accuracy of the trees was significantly proportional to the length of the sequences used, with the gag-pol-env datasets showing the best performance and gag and partial pol sequences showing the worst. The lowest sampling depths (20% and 5%) greatly reduced the accuracy of tree reconstruction and showed high variability among replicates, especially when using the shortest gene datasets. In conclusion, using longer sequences derived from nearly whole genomes will improve the reliability of phylogenetic reconstruction. With low sample coverage, results can be highly variable, particularly when based on short sequences.
Kanthak, Magdalena K.; Chen, Frances S.; Kumsta, Robert; Hill, LaBarron K.; Thayer, Julian F.; Heinrichs, Markus
2017-01-01
A large body of empirical research has demonstrated stress-buffering effects of social support. However, recent studies suggest that genetic variation of the oxytocin system (specifically, a common single nucleotide polymorphism, rs53576, of the oxytocin receptor gene) modulates the efficacy of social support. The timing and neurobiological basis of this genetic modulation were investigated using a standardized, laboratory-based psychological stress procedure (Trier Social Stress Test for Groups, TSST-G). To index potential stress buffering effects of social support mediated by the oxytocin system, heart rate variability (HRV) was obtained before and during the TSST-G from 40 healthy participants. Results indicate that social support is associated with higher HRV only in G allele carriers. Specifically, social support increased heart rate variability during direct social interaction and only in individuals with at least one copy of the G allele of rs53576. These findings support the idea that the stress-attenuating effects of social support are modulated by genetic variation of the oxytocin system. PMID:26903384
von Dassow, Peter; John, Uwe; Ogata, Hiroyuki; Probert, Ian; Bendif, El Mahdi; Kegel, Jessica U; Audic, Stéphane; Wincker, Patrick; Da Silva, Corinne; Claverie, Jean-Michel; Doney, Scott; Glover, David M; Flores, Daniella Mella; Herrera, Yeritza; Lescot, Magali; Garet-Delmas, Marie-José; de Vargas, Colomban
2015-06-01
Emiliania huxleyi is the most abundant calcifying plankton in modern oceans with substantial intraspecific genome variability and a biphasic life cycle involving sexual alternation between calcified 2N and flagellated 1N cells. We show that high genome content variability in Emiliania relates to erosion of 1N-specific genes and loss of the ability to form flagellated cells. Analysis of 185 E. huxleyi strains isolated from world oceans suggests that loss of flagella occurred independently in lineages inhabiting oligotrophic open oceans over short evolutionary timescales. This environmentally linked physiogenomic change suggests life cycling is not advantageous in very large/diluted populations experiencing low biotic pressure and low ecological variability. Gene loss did not appear to reflect pressure for genome streamlining in oligotrophic oceans as previously observed in picoplankton. Life-cycle modifications might be common in plankton and cause major functional variability to be hidden from traditional taxonomic or molecular markers.
Ding, Xiaotao; Jiang, Yuping; He, Lizhong; Zhou, Qiang; Yu, Jizhu; Hui, Dafeng; Huang, Danfeng
2016-01-01
To investigate the physiological responses of plants to high root-zone temperature (HT, 35 °C) stress mitigated by exogenous glutathione (GSH), cucumber (Cucumis sativus L.) seedlings were exposed to HT with or without GSH treatment for 4 days and following with 4 days of recovery. Plant physiological variables, growth, and gene expression related to antioxidant enzymes and Calvin cycle were quantified. The results showed that HT significantly decreased GSH content, the ratio of reduced to oxidized glutathione (GSH/GSSG), chlorophyll content, photosynthesis and related gene expression, shoot height, stem diameter, as well as dry weight. The exogenous GSH treatment clearly lessened the HT stress by increasing the above variables. Meanwhile, HT significantly increased soluble protein content, proline and malondialdehyde (MDA) content as well as O2•− production rate, the gene expression and activities of antioxidant enzymes. The GSH treatment remarkably improved soluble protein content, proline content, antioxidant enzymes activities, and antioxidant enzymes related gene expression, and reduced the MDA content and O2•− production rate compared to no GSH treatment in the HT condition. Our results suggest that exogenous GSH enhances cucumber seedling tolerance of HT stress by modulating the photosynthesis, antioxidant and osmolytes systems to improve physiological adaptation. PMID:27752105
Opazo, Juan C; Zavala, Kattina; Krall, Paola; Arias, Rodrigo A
2017-01-01
Understanding the processes that give rise to genomic variability in extant species is an active area of research within evolutionary biology. With the availability of whole genome sequences, it is possible to quantify different forms of variability such as variation in gene copy number, which has been described as an important source of genetic variability and in consequence of phenotypic variability. Most of the research on this topic has been focused on understanding the biological significance of gene duplication, and less attention has been given to the evolutionary role of gene loss. Gremlin 2 is a member of the DAN gene family and plays a significant role in tooth development by blocking the ligand-signaling pathway of BMP2 and BMP4. The goal of this study was to investigate the evolutionary history of gremlin 2 in cetartiodactyl mammals, a group that possesses highly divergent teeth morphology. Results from our analyses indicate that gremlin 2 has experienced a mixture of gene loss, gene duplication, and rate acceleration. Although the last common ancestor of cetartiodactyls possessed a single gene copy, pigs and camels are the only cetartiodactyl groups that have retained gremlin 2. According to the phyletic distribution of this gene and synteny analyses, we propose that gremlin 2 was lost in the common ancestor of ruminants and cetaceans between 56.3 and 63.5 million years ago as a product of a chromosomal rearrangement. Our analyses also indicate that the rate of evolution of gremlin 2 has been accelerated in the two groups that have retained this gene. Additionally, the lack of this gene could explain the high diversity of teeth among cetartiodactyl mammals; specifically, the presence of this gene could act as a biological constraint. Thus, our results support the notions that gene loss is a way to increase phenotypic diversity and that gremlin 2 is a dispensable gene, at least in cetartiodactyl mammals.
Goodhardt, M; Babinet, C; Lutfalla, G; Kallenbach, S; Cavelier, P; Rougeon, F
1989-01-01
We have produced transgenic mice which synthesize chimeric mouse-rabbit immunoglobulin (Ig) kappa light chains following in vivo recombination of an injected unrearranged kappa gene. The exogenous gene construct contained a mouse germ-line kappa variable (V kappa) gene segment, the mouse germ-line joining (J kappa) locus including the enhancer, and the rabbit b9 constant (C kappa) region. A high level of V-J recombination of the kappa transgene was observed in spleen of the transgenic mice. Surprisingly, a particularly high degree of variability in the exact site of recombination and the presence of non germ-line encoded nucleotides (N-regions) were found at the V-J junction of the rearranged kappa transgene. Furthermore, unlike endogenous kappa genes, rearrangement of the exogenous gene occurred in T-cells of the transgenic mice. These results show that additional sequences, other than the heptamer-nonamer signal sequences and the promoter and enhancer elements, are required to obtain stage- and lineage- specific regulation of Ig kappa light chain gene rearrangement in vivo. Images PMID:2508061
Primer sets for cloning the human repertoire of T cell Receptor Variable regions.
Boria, Ilenia; Cotella, Diego; Dianzani, Irma; Santoro, Claudio; Sblattero, Daniele
2008-08-29
Amplification and cloning of naïve T cell Receptor (TR) repertoires or antigen-specific TR is crucial to shape immune response and to develop immuno-based therapies. TR variable (V) regions are encoded by several genes that recombine during T cell development. The cloning of expressed genes as large diverse libraries from natural sources relies upon the availability of primers able to amplify as many V genes as possible. Here, we present a list of primers computationally designed on all functional TR V and J genes listed in the IMGT, the ImMunoGeneTics information system. The list consists of unambiguous or degenerate primers suitable to theoretically amplify and clone the entire TR repertoire. We show that it is possible to selectively amplify and clone expressed TR V genes in one single RT-PCR step and from as little as 1000 cells. This new primer set will facilitate the creation of more diverse TR libraries than has been possible using currently available primer sets.
Genetics of Obsessive-Compulsive Disorder and Related Disorders
Browne, Heidi A.; Gair, Shannon L.; Scharf, Jeremiah M.; Grice, Dorothy E.
2014-01-01
Synopsis Twin and family studies support a significant genetic contribution to obsessive-compulsive disorder (OCD) and related disorders such as chronic tic disorders, trichotillomania, skin picking disorder, body dysmorphic disorder, and hoarding disorder. Recently, population-based studies and novel laboratory-based methods have confirmed substantial heritability in OCD. Genome-wide association studies and candidate gene association studies have provided information on specific genes that may be involved in the pathobiology of OCD and also of related disorders, particularly chronic tic disorders, though these genes each contribute only a small portion of the total genetic risk and a substantial portion of the specific genetic risk profile in OCD is still unknown. Nevertheless, there are some examples of genes for which perturbations produce OCD-like phenotypes in animal model systems, allowing a laboratory platform for investigating the pathobiology of --- and new treatments for --- OCD and related disorders. Future work promises to continue to clarify the specific genes involved in risk for OCD as well as their interaction with environmental variables. PMID:25150565
A PCR primer bank for quantitative gene expression analysis.
Wang, Xiaowei; Seed, Brian
2003-12-15
Although gene expression profiling by microarray analysis is a useful tool for assessing global levels of transcriptional activity, variability associated with the data sets usually requires that observed differences be validated by some other method, such as real-time quantitative polymerase chain reaction (real-time PCR). However, non-specific amplification of non-target genes is frequently observed in the latter, confounding the analysis in approximately 40% of real-time PCR attempts when primer-specific labels are not used. Here we present an experimentally validated algorithm for the identification of transcript-specific PCR primers on a genomic scale that can be applied to real-time PCR with sequence-independent detection methods. An online database, PrimerBank, has been created for researchers to retrieve primer information for their genes of interest. PrimerBank currently contains 147 404 primers encompassing most known human and mouse genes. The primer design algorithm has been tested by conventional and real-time PCR for a subset of 112 primer pairs with a success rate of 98.2%.
eap Gene as novel target for specific identification of Staphylococcus aureus.
Hussain, Muzaffar; von Eiff, Christof; Sinha, Bhanu; Joost, Insa; Herrmann, Mathias; Peters, Georg; Becker, Karsten
2008-02-01
The cell surface-associated extracellular adherence protein (Eap) mediates adherence of Staphylococcus aureus to host extracellular matrix components and inhibits inflammation, wound healing, and angiogenesis. A well-characterized collection of S. aureus and non-S. aureus staphylococcal isolates (n = 813) was tested for the presence of the Eap-encoding gene (eap) by PCR to investigate the use of the eap gene as a specific diagnostic tool for identification of S. aureus. Whereas all 597 S. aureus isolates were eap positive, this gene was not detectable in 216 non-S. aureus staphylococcal isolates comprising 47 different species and subspecies of coagulase-negative staphylococci and non-S. aureus coagulase-positive or coagulase-variable staphylococci. Furthermore, non-S. aureus isolates did not express Eap homologs, as verified on the transcriptional and protein levels. Based on these data, the sensitivity and specificity of the newly developed PCR targeting the eap gene were both 100%. Thus, the unique occurrence of Eap in S. aureus offers a promising tool particularly suitable for molecular diagnostics of this pathogen.
2011-01-01
Background Because biotechnological uses of bacteriophage gene products as alternatives to conventional antibiotics will require a thorough understanding of their genomic context, we sequenced and analyzed the genomes of four closely related phages isolated from Clostridium perfringens, an important agricultural and human pathogen. Results Phage whole-genome tetra-nucleotide signatures and proteomic tree topologies correlated closely with host phylogeny. Comparisons of our phage genomes to 26 others revealed three shared COGs; of particular interest within this core genome was an endolysin (PF01520, an N-acetylmuramoyl-L-alanine amidase) and a holin (PF04531). Comparative analyses of the evolutionary history and genomic context of these common phage proteins revealed two important results: 1) strongly significant host-specific sequence variation within the endolysin, and 2) a protein domain architecture apparently unique to our phage genomes in which the endolysin is located upstream of its associated holin. Endolysin sequences from our phages were one of two very distinct genotypes distinguished by variability within the putative enzymatically-active domain. The shared or core genome was comprised of genes with multiple sequence types belonging to five pfam families, and genes belonging to 12 pfam families, including the holin genes, which were nearly identical. Conclusions Significant genomic diversity exists even among closely-related bacteriophages. Holins and endolysins represent conserved functions across divergent phage genomes and, as we demonstrate here, endolysins can have significant variability and host-specificity even among closely-related genomes. Endolysins in our phage genomes may be subject to different selective pressures than the rest of the genome. These findings may have important implications for potential biotechnological applications of phage gene products. PMID:21631945
Godec, Jernej; Tan, Yan; Liberzon, Arthur; Tamayo, Pablo; Bhattacharya, Sanchita; Butte, Atul J; Mesirov, Jill P; Haining, W Nicholas
2016-01-19
Gene-expression profiling has become a mainstay in immunology, but subtle changes in gene networks related to biological processes are hard to discern when comparing various datasets. For instance, conservation of the transcriptional response to sepsis in mouse models and human disease remains controversial. To improve transcriptional analysis in immunology, we created ImmuneSigDB: a manually annotated compendium of ∼5,000 gene-sets from diverse cell states, experimental manipulations, and genetic perturbations in immunology. Analysis using ImmuneSigDB identified signatures induced in activated myeloid cells and differentiating lymphocytes that were highly conserved between humans and mice. Sepsis triggered conserved patterns of gene expression in humans and mouse models. However, we also identified species-specific biological processes in the sepsis transcriptional response: although both species upregulated phagocytosis-related genes, a mitosis signature was specific to humans. ImmuneSigDB enables granular analysis of transcriptomic data to improve biological understanding of immune processes of the human and mouse immune systems. Copyright © 2016 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Yu, Yuan; Tong, Qi; Li, Zhongxia; Tian, Jinhai; Wang, Yizhi; Su, Feng; Wang, Yongsheng; Liu, Jun; Zhang, Yong
2014-02-01
PhiC31 integrase-mediated gene delivery has been extensively used in gene therapy and animal transgenesis. However, random integration events are observed in phiC31-mediated integration in different types of mammalian cells; as a result, the efficiencies of pseudo attP site integration and evaluation of site-specific integration are compromised. To improve this system, we used an attB-TK fusion gene as a negative selection marker, thereby eliminating random integration during phiC31-mediated transfection. We also excised the selection system and plasmid bacterial backbone by using two other site-specific recombinases, Cre and Dre. Thus, we generated clean transgenic bovine fetal fibroblast cells free of selectable marker and plasmid bacterial backbone. These clean cells were used as donor nuclei for somatic cell nuclear transfer (SCNT), indicating a similar developmental competence of SCNT embryos to that of non-transgenic cells. Therefore, the present gene delivery system facilitated the development of gene therapy and agricultural biotechnology.
Pellegrini, Silvia; Palumbo, Sara; Iofrida, Caterina; Melissari, Erika; Rota, Giuseppina; Mariotti, Veronica; Anastasio, Teresa; Manfrinati, Andrea; Rumiati, Rino; Lotto, Lorella; Sarlo, Michela; Pietrini, Pietro
2017-01-01
Moral behavior has been a key topic of debate for philosophy and psychology for a long time. In recent years, thanks to the development of novel methodologies in cognitive sciences, the question of how we make moral choices has expanded to the study of neurobiological correlates that subtend the mental processes involved in moral behavior. For instance, in vivo brain imaging studies have shown that distinct patterns of brain neural activity, associated with emotional response and cognitive processes, are involved in moral judgment. Moreover, while it is well-known that responses to the same moral dilemmas differ across individuals, to what extent this variability may be rooted in genetics still remains to be understood. As dopamine is a key modulator of neural processes underlying executive functions, we questioned whether genetic polymorphisms associated with decision-making and dopaminergic neurotransmission modulation would contribute to the observed variability in moral judgment. To this aim, we genotyped five genetic variants of the dopaminergic pathway [rs1800955 in the dopamine receptor D4 (DRD4) gene, DRD4 48 bp variable number of tandem repeat (VNTR), solute carrier family 6 member 3 (SLC6A3) 40 bp VNTR, rs4680 in the catechol-O-methyl transferase (COMT) gene, and rs1800497 in the ankyrin repeat and kinase domain containing 1 (ANKK1) gene] in 200 subjects, who were requested to answer 56 moral dilemmas. As these variants are all located in genes belonging to the dopaminergic pathway, they were combined in multilocus genetic profiles for the association analysis. While no individual variant showed any significant effects on moral dilemma responses, the multilocus genetic profile analysis revealed a significant gender-specific influence on human moral acceptability. Specifically, those genotype combinations that improve dopaminergic signaling selectively increased moral acceptability in females, by making their responses to moral dilemmas more similar to those provided by males. As females usually give more emotionally-based answers and engage the “emotional brain” more than males, our results, though preliminary and therefore in need of replication in independent samples, suggest that this increase in dopamine availability enhances the cognitive and reduces the emotional components of moral decision-making in females, thus favoring a more rationally-driven decision process. PMID:28900390
Bacillus subtilis genome diversity.
Earl, Ashlee M; Losick, Richard; Kolter, Roberto
2007-02-01
Microarray-based comparative genomic hybridization (M-CGH) is a powerful method for rapidly identifying regions of genome diversity among closely related organisms. We used M-CGH to examine the genome diversity of 17 strains belonging to the nonpathogenic species Bacillus subtilis. Our M-CGH results indicate that there is considerable genetic heterogeneity among members of this species; nearly one-third of Bsu168-specific genes exhibited variability, as measured by the microarray hybridization intensities. The variable loci include those encoding proteins involved in antibiotic production, cell wall synthesis, sporulation, and germination. The diversity in these genes may reflect this organism's ability to survive in diverse natural settings.
Mitochondria and the non-genetic origins of cell-to-cell variability: More is different.
Guantes, Raúl; Díaz-Colunga, Juan; Iborra, Francisco J
2016-01-01
Gene expression activity is heterogeneous in a population of isogenic cells. Identifying the molecular basis of this variability will improve our understanding of phenomena like tumor resistance to drugs, virus infection, or cell fate choice. The complexity of the molecular steps and machines involved in transcription and translation could introduce sources of randomness at many levels, but a common constraint to most of these processes is its energy dependence. In eukaryotic cells, most of this energy is provided by mitochondria. A clonal population of cells may show a large variability in the number and functionality of mitochondria. Here, we discuss how differences in the mitochondrial content of each cell contribute to heterogeneity in gene products. Changes in the amount of mitochondria can also entail drastic alterations of a cell's gene expression program, which ultimately leads to phenotypic diversity. Also watch the Video Abstract. © 2015 WILEY Periodicals, Inc.
Erickson, Keesha E; Otoupal, Peter B; Chatterjee, Anushree
2017-01-01
Antibiotic-resistant bacteria are an increasingly serious public health concern, as strains emerge that demonstrate resistance to almost all available treatments. One factor that contributes to the crisis is the adaptive ability of bacteria, which exhibit remarkable phenotypic and gene expression heterogeneity in order to gain a survival advantage in damaging environments. This high degree of variability in gene expression across biological populations makes it a challenging task to identify key regulators of bacterial adaptation. Here, we research the regulation of adaptive resistance by investigating transcriptome profiles of Escherichia coli upon adaptation to disparate toxins, including antibiotics and biofuels. We locate potential target genes via conventional gene expression analysis as well as using a new analysis technique examining differential gene expression variability. By investigating trends across the diverse adaptation conditions, we identify a focused set of genes with conserved behavior, including those involved in cell motility, metabolism, membrane structure, and transport, and several genes of unknown function. To validate the biological relevance of the observed changes, we synthetically perturb gene expression using clustered regularly interspaced short palindromic repeat (CRISPR)-dCas9. Manipulation of select genes in combination with antibiotic treatment promotes adaptive resistance as demonstrated by an increased degree of antibiotic tolerance and heterogeneity in MICs. We study the mechanisms by which identified genes influence adaptation and find that select differentially variable genes have the potential to impact metabolic rates, mutation rates, and motility. Overall, this work provides evidence for a complex nongenetic response, encompassing shifts in gene expression and gene expression variability, which underlies adaptive resistance. IMPORTANCE Even initially sensitive bacteria can rapidly thwart antibiotic treatment through stress response processes known as adaptive resistance. Adaptive resistance fosters transient tolerance increases and the emergence of mutations conferring heritable drug resistance. In order to extend the applicable lifetime of new antibiotics, we must seek to hinder the occurrence of bacterial adaptive resistance; however, the regulation of adaptation is difficult to identify due to immense heterogeneity emerging during evolution. This study specifically seeks to generate heterogeneity by adapting bacteria to different stresses and then examines gene expression trends across the disparate populations in order to pinpoint key genes and pathways associated with adaptive resistance. The targets identified here may eventually inform strategies for impeding adaptive resistance and prolonging the effectiveness of antibiotic treatment.
Glew, Michelle D.; Marenda, Marc; Rosengarten, Renate; Citti, Christine
2002-01-01
The ruminant pathogen Mycoplasma agalactiae possesses a family of abundantly expressed variable surface lipoproteins called Vpmas. Phenotypic switches between Vpma members have previously been correlated with DNA rearrangements within a locus of vpma genes and are proposed to play an important role in disease pathogenesis. In this study, six vpma genes were characterized in the M. agalactiae type strain PG2. All vpma genes clustered within an 8-kb region and shared highly conserved 5′ untranslated regions, lipoprotein signal sequences, and short N-terminal sequences. Analyses of the vpma loci from consecutive clonal isolates showed that vpma DNA rearrangements were site specific and that cleavage and strand exchange occurred within a minimal region of 21 bp located within the 5′ untranslated region of all vpma genes. This process controlled expression of vpma genes by effectively linking the open reading frame (ORF) of a silent gene to a unique active promoter sequence within the locus. An ORF (xer1) immediately adjacent to one end of the vpma locus did not undergo rearrangement and had significant homology to a distinct subset of genes belonging to the λ integrase family of site-specific xer recombinases. It is proposed that xer1 codes for a site-specific recombinase that is not involved in chromosome dimer resolution but rather is responsible for the observed vpma-specific recombination in M. agalactiae. PMID:12374833
2009-01-01
Background Murine retroviral vectors have been used in several hundred gene therapy clinical trials, but have fallen out of favor for a number of reasons. One issue is that gene expression from viral or internal promoters is highly variable and essentially unregulated. Moreover, with retroviral vectors, gene expression is usually silenced over time. Mammalian genes, in contrast, are characterized by highly regulated, precise levels of expression in both a temporal and a cell-specific manner. To ascertain if recapitulation of endogenous adenosine deaminase (ADA) expression can be achieved in a vector construct we created a new series of Moloney murine leukemia virus (MuLV) based retroviral vector that carry human regulatory elements including combinations of the ADA promoter, the ADA locus control region (LCR), ADA introns and human polyadenylation sequences in a self-inactivating vector backbone. Methods A MuLV-based retroviral vector with a self-inactivating (SIN) backbone, the phosphoglycerate kinase promoter (PGK) and the enhanced green fluorescent protein (eGFP), as a reporter gene, was generated. Subsequent vectors were constructed from this basic vector by deletion or addition of certain elements. The added elements that were assessed are the human ADA promoter, human ADA locus control region (LCR), introns 7, 8, and 11 from the human ADA gene, and human growth hormone polyadenylation signal. Retroviral vector particles were produced by transient three-plasmid transfection of 293T cells. Retroviral vectors encoding eGFP were titered by transducing 293A cells, and then the proportion of GFP-positive cells was determined using fluorescence-activated cell sorting (FACS). Non T-cell and T-cell lines were transduced at a multiplicity of infection (MOI) of 0.1 and the yield of eGFP transgene expression was evaluated by FACS analysis using mean fluorescent intensity (MFI) detection. Results Vectors that contained the ADA LCR were preferentially expressed in T-cell lines. Further improvements in T-cell specific gene expression were observed with the incorporation of additional cis-regulatory elements, such as a human polyadenylation signal and intron 7 from the human ADA gene. Conclusion These studies suggest that the combination of an authentically regulated ADA gene in a murine retroviral vector, together with additional locus-specific regulatory refinements, will yield a vector with a safer profile and greater efficacy in terms of high-level, therapeutic, regulated gene expression for the treatment of ADA-deficient severe combined immunodeficiency. PMID:20042112
Sayegh, Camil E.; Demaries, Sandra L.; Iacampo, Sandra; Ratcliffe, Michael J. H.
1999-01-01
Immunoglobulin gene rearrangement in avian B cell precursors generates surface Ig receptors of limited diversity. It has been proposed that specificities encoded by these receptors play a critical role in B lineage development by recognizing endogenous ligands within the bursa of Fabricius. To address this issue directly we have introduced a truncated surface IgM, lacking variable region domains, into developing B precursors by retroviral gene transfer in vivo. Cells expressing this truncated receptor lack endogenous surface IgM, and the low level of endogenous Ig rearrangements that have occurred within this population of cells has not been selected for having a productive reading frame. Such cells proliferate rapidly within bursal epithelial buds of normal morphology. In addition, despite reduced levels of endogenous light chain rearrangement, those light chain rearrangements that have occurred have undergone variable region diversification by gene conversion. Therefore, although surface expression of an Ig receptor is required for bursal colonization and the induction of gene conversion, the specificity encoded by the prediversified receptor is irrelevant and, consequently, there is no obligate ligand for V(D)J-encoded determinants of prediversified avian cell surface IgM receptor. PMID:10485907
Prediction of essential proteins based on gene expression programming.
Zhong, Jiancheng; Wang, Jianxin; Peng, Wei; Zhang, Zhen; Pan, Yi
2013-01-01
Essential proteins are indispensable for cell survive. Identifying essential proteins is very important for improving our understanding the way of a cell working. There are various types of features related to the essentiality of proteins. Many methods have been proposed to combine some of them to predict essential proteins. However, it is still a big challenge for designing an effective method to predict them by integrating different features, and explaining how these selected features decide the essentiality of protein. Gene expression programming (GEP) is a learning algorithm and what it learns specifically is about relationships between variables in sets of data and then builds models to explain these relationships. In this work, we propose a GEP-based method to predict essential protein by combing some biological features and topological features. We carry out experiments on S. cerevisiae data. The experimental results show that the our method achieves better prediction performance than those methods using individual features. Moreover, our method outperforms some machine learning methods and performs as well as a method which is obtained by combining the outputs of eight machine learning methods. The accuracy of predicting essential proteins can been improved by using GEP method to combine some topological features and biological features.
Singh, Urvashi B.; Pandey, Pooja; Mehta, Girija; Bhatnagar, Anuj K.; Mohan, Anant; Goyal, Vinay; Ahuja, Vineet; Ramachandran, Ranjani; Sachdeva, Kuldeep S.; Samantaray, Jyotish C.
2016-01-01
Background Newer molecular diagnostics have brought paradigm shift in early diagnosis of tuberculosis [TB]. WHO recommended use of GeneXpert MTB/RIF [Xpert] for Extra-pulmonary [EP] TB; critics have since questioned its efficiency. Methods The present study was designed to assess the performance of GeneXpert in 761 extra-pulmonary and 384 pulmonary specimens from patients clinically suspected of TB and compare with Phenotypic, Genotypic and Composite reference standards [CRS]. Results Comparison of GeneXpert results to CRS, demonstrated sensitivity of 100% and 90.68%, specificity of 100% and 99.62% for pulmonary and extra-pulmonary samples. On comparison with culture, sensitivity for Rifampicin [Rif] resistance detection was 87.5% and 81.82% respectively, while specificity was 100% for both pulmonary and extra-pulmonary TB. On comparison to sequencing of rpoB gene [Rif resistance determining region, RRDR], sensitivity was respectively 93.33% and 90% while specificity was 100% in both pulmonary and extra-pulmonary TB. GeneXpert assay missed 533CCG mutation in one sputum and dual mutation [517 & 519] in one pus sample, detected by sequencing. Sequencing picked dual mutation [529, 530] in a sputum sample sensitive to Rif, demonstrating, not all RRDR mutations lead to resistance. Conclusions Current study reports observations in a patient care setting in a high burden region, from a large collection of pulmonary and extra-pulmonary samples and puts to rest questions regarding sensitivity, specificity, detection of infrequent mutations and mutations responsible for low-level Rif resistance by GeneXpert. Improvements in the assay could offer further improvement in sensitivity of detection in different patient samples; nevertheless it may be difficult to improve sensitivity of Rif resistance detection if only one gene is targeted. Assay specificity was high both for TB detection and Rif resistance detection. Despite a few misses, the assay offers major boost to early diagnosis of TB and MDR-TB, in difficult to diagnose pauci-bacillary TB. PMID:26894283
Sexually divergent induction of microglial-associated neuroinflammation with hippocampal aging.
Mangold, Colleen A; Wronowski, Benjamin; Du, Mei; Masser, Dustin R; Hadad, Niran; Bixler, Georgina V; Brucklacher, Robert M; Ford, Matthew M; Sonntag, William E; Freeman, Willard M
2017-07-21
The necessity of including both males and females in molecular neuroscience research is now well understood. However, there is relatively limited basic biological data on brain sex differences across the lifespan despite the differences in age-related neurological dysfunction and disease between males and females. Whole genome gene expression of young (3 months), adult (12 months), and old (24 months) male and female C57BL6 mice hippocampus was analyzed. Subsequent bioinformatic analyses and confirmations of age-related changes and sex differences in hippocampal gene and protein expression were performed. Males and females demonstrate both common expression changes with aging and marked sex differences in the nature and magnitude of the aging responses. Age-related hippocampal induction of neuroinflammatory gene expression was sexually divergent and enriched for microglia-specific genes such as complement pathway components. Sexually divergent C1q protein expression was confirmed by immunoblotting and immunohistochemistry. Similar patterns of cortical sexually divergent gene expression were also evident. Additionally, inter-animal gene expression variability increased with aging in males, but not females. These findings demonstrate sexually divergent neuroinflammation with aging that may contribute to sex differences in age-related neurological diseases such as stroke and Alzheimer's, specifically in the complement system. The increased expression variability in males suggests a loss of fidelity in gene expression regulation with aging. These findings reveal a central role of sex in the transcriptomic response of the hippocampus to aging that warrants further, in depth, investigations.
Colonizing the world in spite of reduced MHC variation
Gangoso, L.; Alcaide, M.; Grande, J.M.; Muñoz, J.; Talbot, Sandra L.; Sonsthagen, Sarah A.; Sage, Kevin; Figuerola, J.
2012-01-01
Reduced immune gene diversity is thought to negatively affect the capacity of organisms to adapt to pathogen challenges, which represent a major force in natural selection. Genes of the Major Histocompatibility Complex (MHC) are the most widely invoked adaptive loci in conservation biology, and have become the most popular genetic markers to investigate pathogen-host interactions in vertebrates. Although MHC genes are the most polymorphic genes described in the vertebrate genome, the extent to which MHC diversity determines the long-term persistence of populations is, unclear and often debated, as recent studies have documented the occurrence of natural populations thriving even after a depletion of MHC diversity caused by genetic drift. Here, we show that some phylogenetically related species belonging to the Falco genus (Aves: Falconidae) present a dramatically low MHC variability that has not precluded, nevertheless, the successful colonization of almost all existing regions and habitats worldwide. We found evidence for two remarkably different patterns of MHC variation within the genus. While kestrels show a high MHC variation according to the general theory, falcons exhibit an ancestrally low intra- and inter-specific MHC allelic diversity. We provide compelling evidence that this pattern is not caused by the degeneration of functional genes into pseudogenes, the inadvertent analyses of paralogous MHC genes, or the devastating action of genetic drift. Instead, our results strongly support the idea of an evolutionary transition driven and maintained by natural selection from primarily highly variable towards low polymorphic, but functional and expressed, MHC genes with species-specific pathogen-recognition capabilities.
Cerón, J; Ortíz, A; Quintero, R; Güereca, L; Bravo, A
1995-01-01
In this paper we describe a PCR strategy that can be used to rapidly identify Bacillus thuringiensis strains that harbor any of the known cryI or cryIII genes. Four general PCR primers which amplify DNA fragments from the known cryI or cryIII genes were selected from conserved regions. Once a strain was identified as an organism that contains a particular type of cry gene, it could be easily characterized by performing additional PCR with specific cryI and cryIII primers selected from variable regions. The method described in this paper can be used to identify the 10 different cryI genes and the five different cryIII genes. One feature of this screening method is that each cry gene is expected to produce a PCR product having a precise molecular weight. The genes which produce PCR products having different sizes probably represent strains that harbor a potentially novel cry gene. Finally, we present evidence that novel crystal genes can be identified by the method described in this paper. PMID:8526493
Differential gene expression patterns in the autogamous plant Hordeum euclaston (Poaceae).
Georg-Kraemer, J E; Ferreira, C A S; Cavalli, S S
2011-02-22
Sib-seedlings of 95 strains of the strictly autogamous grass Hordeum euclaston were analyzed by horizontal polyacrylamide gel electrophoresis for four isoenzyme systems at a specific ontogenetic stage. We found differences in the activity of some genes among individuals of this species. Hence, an ontogenetic analysis was carried out to investigate 12 strains at five ontogenetic stages, to determine the patterns of expression of these genes during development. The differences in the presence versus absence of certain isoenzyme bands may be due to differential regulatory activation in response to environmental differences, as all plants showed the same structural genes, although these genes were active in different tissues and/or times of development. These results indicate the importance of differential gene activation in the metabolic phenotype variability of this strictly autogamous, highly homozygous species. The same structural alleles for isoenzymes showed the active form of the enzymes (phenotypic expression) to be present in different tissues and/or stages of development. Differential isoenzyme gene activation was shown to be directly responsible for the enzymatic variability (metabolic phenotype) presented by the plants, which seem to possess almost no heterozygosis.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Young, M; Craft, D
Purpose: To develop an efficient, pathway-based classification system using network biology statistics to assist in patient-specific response predictions to radiation and drug therapies across multiple cancer types. Methods: We developed PICS (Pathway Informed Classification System), a novel two-step cancer classification algorithm. In PICS, a matrix m of mRNA expression values for a patient cohort is collapsed into a matrix p of biological pathways. The entries of p, which we term pathway scores, are obtained from either principal component analysis (PCA), normal tissue centroid (NTC), or gene expression deviation (GED). The pathway score matrix is clustered using both k-means and hierarchicalmore » clustering, and a clustering is judged by how well it groups patients into distinct survival classes. The most effective pathway scoring/clustering combination, per clustering p-value, thus generates various ‘signatures’ for conventional and functional cancer classification. Results: PICS successfully regularized large dimension gene data, separated normal and cancerous tissues, and clustered a large patient cohort spanning six cancer types. Furthermore, PICS clustered patient cohorts into distinct, statistically-significant survival groups. For a suboptimally-debulked ovarian cancer set, the pathway-classified Kaplan-Meier survival curve (p = .00127) showed significant improvement over that of a prior gene expression-classified study (p = .0179). For a pancreatic cancer set, the pathway-classified Kaplan-Meier survival curve (p = .00141) showed significant improvement over that of a prior gene expression-classified study (p = .04). Pathway-based classification confirmed biomarkers for the pyrimidine, WNT-signaling, glycerophosphoglycerol, beta-alanine, and panthothenic acid pathways for ovarian cancer. Despite its robust nature, PICS requires significantly less run time than current pathway scoring methods. Conclusion: This work validates the PICS method to improve cancer classification using biological pathways. Patients are classified with greater specificity and physiological relevance as compared to current gene-specific approaches. Focus now moves to utilizing PICS for pan-cancer patient-specific treatment response prediction.« less
Sato, M; Figueiredo, ML; Burton, JB; Johnson, M; Chen, M; Powell, R; Gambhir, SS; Carey, M; Wu, L
2009-01-01
Effective treatment for recurrent, disseminated prostate cancer is notably limited. We have developed adenoviral vectors with a prostate-specific two-step transcriptional amplification (TSTA) system that would express therapeutic genes at a robust level to target metastatic disease. The TSTA system employs the prostate-specific antigen (PSA) promoter/enhancer to drive a potent synthetic activator, which in turn activates the expression of the therapeutic gene. In this study, we explored different configurations of this bipartite system and discovered that physical separation of the two TSTA components into E1 and E3 regions of adenovirus was able to enhance androgen regulation and cell-discriminatory expression. The TSTA vectors that express imaging reporter genes were assessed by noninvasive imaging technologies in animal models. The improved selectivity of the E1E3 configured vector was reflected in silenced ectopic expression in the lung. Significantly, the enhanced specificity of the E1E3 vector enabled the detection of lung metastasis of prostate cancer. An E1E3 TSTA vector that expresses the herpes simplex virus thymidine kinase gene can effectively direct positron emission tomography (PET) imaging of the tumor. The prostate-targeted gene delivery vectors with robust and cell-specific expression capability will advance the development of safe and effective imaging guided therapy for recurrent metastatic stages of prostate cancer. PMID:18305574
Chakravorty, S; Sarkar, S; Gachhui, R
2015-01-01
The Acetobacteraceae family of the class Alpha Proteobacteria is comprised of high sugar and acid tolerant bacteria. The Acetic Acid Bacteria are the economically most significant group of this family because of its association with food products like vinegar, wine etc. Acetobacteraceae are often hard to culture in laboratory conditions and they also maintain very low abundances in their natural habitats. Thus identification of the organisms in such environments is greatly dependent on modern tools of molecular biology which require a thorough knowledge of specific conserved gene sequences that may act as primers and or probes. Moreover unconserved domains in genes also become markers for differentiating closely related genera. In bacteria, the 16S rRNA gene is an ideal candidate for such conserved and variable domains. In order to study the conserved and variable domains of the 16S rRNA gene of Acetic Acid Bacteria and the Acetobacteraceae family, sequences from publicly available databases were aligned and compared. Near complete sequences of the gene were also obtained from Kombucha tea biofilm, a known Acetobacteraceae family habitat, in order to corroborate the domains obtained from the alignment studies. The study indicated that the degree of conservation in the gene is significantly higher among the Acetic Acid Bacteria than the whole Acetobacteraceae family. Moreover it was also observed that the previously described hypervariable regions V1, V3, V5, V6 and V7 were more or less conserved in the family and the spans of the variable regions are quite distinct as well.
Evans, Perry; Avey, Stefan; Kong, Yong; Krauthammer, Michael
2013-09-01
A common goal of tumor sequencing projects is finding genes whose mutations are selected for during tumor development. This is accomplished by choosing genes that have more non-synonymous mutations than expected from an estimated background mutation frequency. While this background frequency is unknown, it can be estimated using both the observed synonymous mutation frequency and the non-synonymous to synonymous mutation ratio. The synonymous mutation frequency can be determined across all genes or in a gene-specific manner. This choice introduces an interesting trade-off. A gene-specific frequency adjusts for an underlying mutation bias, but is difficult to estimate given missing synonymous mutation counts. Using a genome-wide synonymous frequency is more robust, but is less suited for adjusting biases. Studying four evaluation criteria for identifying genes with high non-synonymous mutation burden (reflecting preferential selection of expressed genes, genes with mutations in conserved bases, genes with many protein interactions, and genes that show loss of heterozygosity), we find that the gene-specific synonymous frequency is superior in the gene expression and protein interaction tests. In conclusion, the use of the gene-specific synonymous mutation frequency is well suited for assessing a gene's non-synonymous mutation burden.
Hudson, Quanah J.; Seidl, Christine I.M.; Kulinski, Tomasz M.; Huang, Ru; Warczok, Katarzyna E.; Bittner, Romana; Bartolomei, Marisa S.; Barlow, Denise P.
2011-01-01
A subset of imprinted genes in the mouse have been reported to show imprinted expression that is restricted to the placenta, a short-lived extra-embryonic organ. Notably these so-called 'placental-specific' imprinted genes are expressed from both parental alleles in embryo and adult tissues. The placenta is an embryonic-derived organ that is closely associated with maternal tissue and as a consequence, maternal contamination can be mistaken for maternal-specific imprinted expression. The complexity of the placenta, which arises from multiple embryonic lineages, poses additional problems in accurately assessing allele-specific repressive epigenetic modifications in genes that also show lineage-specific silencing in this organ. These problems require that extra evidence be obtained to support the imprinted status of genes whose imprinted expression is restricted to the placenta. We show here that the extra-embryonic visceral yolk sac (VYS), a nutritive membrane surrounding the developing embryo, shows a similar 'extra-embryonic-lineage-specific' pattern of imprinted expression. We present an improved enzymatic technique for separating the bilaminar VYS and show that this pattern of imprinted expression is restricted to the endoderm layer. Finally, we show that VYS 'extra-embryonic-lineage-specific' imprinted expression is regulated by DNA methylation in a similar manner as shown for genes showing multi-lineage imprinted expression in extra-embryonic, embryonic and adult tissues. These results show that the VYS is an improved model for studying the epigenetic mechanisms regulating extra-embryonic-lineage-specific imprinted expression. PMID:21354127
Pasture-feeding of Charolais steers influences skeletal muscle metabolism and gene expression.
Cassar-Malek, I; Jurie, C; Bernard, C; Barnola, I; Micol, D; Hocquette, J-F
2009-10-01
Extensive beef production systems on pasture are promoted to improve animal welfare and beef quality. This study aimed to compare the influence on muscle characteristics of two management approaches representative of intensive and extensive production systems. One group of 6 Charolais steers was fed maize-silage indoors and another group of 6 Charolais steers grazed on pasture. Activities of enzymes representative of glycolytic and oxidative (Isocitrate dehydrogenase [ICDH], citrate synthase [CS], hydroxyacyl-CoA dehydrogenase [HAD]) muscle metabolism were assessed in Rectus abdominis (RA) and Semitendinosus (ST) muscles. Activities of oxidative enzymes ICDH, CS and HAD were higher in muscles from grazing animals demonstrating a plasticity of muscle metabolism according to the production and feeding system. Gene expression profiling in RA and ST muscles was performed on both production groups using a multi-tissue bovine cDNA repertoire. Variance analysis showed an effect of the muscle type and of the production system on gene expression (P<0.001). A list of the 212 most variable genes according to the production system was established, of which 149 genes corresponded to identified genes. They were classified according to their gene function annotation mainly in the "protein metabolism and modification", "signal transduction", "cell cycle", "developmental processes" and "muscle contraction" biological processes. Selenoprotein W was found to be underexpressed in pasture-fed animals and could be proposed as a putative gene marker of the grass-based system. In conclusion, enzyme-specific adaptations and gene expression modifications were observed in response to the production system and some of them could be candidates for grazing or grass-feeding traceability.
Miflin, Ben J; Habash, Dimah Z
2002-04-01
This short review outlines the central role of glutamine synthetase (GS) in plant nitrogen metabolism and discusses some possibilities for crop improvement. GS functions as the major assimilatory enzyme for ammonia produced from N fixation, and nitrate or ammonia nutrition. It also reassimilates ammonia released as a result of photorespiration and the breakdown of proteins and nitrogen transport compounds. GS is distributed in different subcellular locations (chloroplast and cytoplasm) and in different tissues and organs. This distribution probably changes as a function of the development of the tissue, for example, GS1 appears to play a key role in leaf senescence. The enzyme is the product of multiple genes with complex promoters that ensure the expression of the genes in an organ- and tissue-specific manner and in response to a number of environmental variables affecting the nutritional status of the cell. GS activity is also regulated post-translationally in a manner that involves 14-3-3 proteins and phosphorylation. GS and plant nitrogen metabolism is best viewed as a complex matrix continually changing during the development cycle of plants. Along with GS, a number of other enzymes play key roles in maintaining the balance of carbon and nitrogen. It is proposed that one of these is glutamate dehydrogenase (GDH). There is considerable evidence for a GDH shunt to return the carbon in amino acids back into reactions of carbon metabolism and the tri-carboxylic acid cycle. Results with transgenic plants containing transferred GS genes suggest that there may be ways in which it is possible to improve the efficiency with which crop plants use nitrogen. Marker-assisted breeding may also bring about such improvements.
The influence of antibody fragment format on phage display based affinity maturation of IgG
Steinwand, Miriam; Droste, Patrick; Frenzel, Andrè; Hust, Michael; Dübel, Stefan; Schirrmann, Thomas
2014-01-01
Today, most approved therapeutic antibodies are provided as immunoglobulin G (IgG), whereas small recombinant antibody formats are required for in vitro antibody generation and engineering during drug development. Particularly, single chain (sc) antibody fragments like scFv or scFab are well suited for phage display and bacterial expression, but some have been found to lose affinity during conversion into IgG. In this study, we compared the influence of the antibody format on affinity maturation of the CD30-specific scFv antibody fragment SH313-F9, with the overall objective being improvement of the IgG. The variable genes of SH313-F9 were randomly mutated and then cloned into libraries encoding different recombinant antibody formats, including scFv, Fab, scFabΔC, and FabΔC. All tested antibody formats except Fab allowed functional phage display of the parental antibody SH313-F9, and the corresponding mutated antibody gene libraries allowed isolation of candidates with enhanced CD30 binding. Moreover, scFv and scFabΔC antibody variants retained improved antigen binding after subcloning into the single gene encoded IgG-like formats scFv-Fc or scIgG, but lost affinity after conversion into IgGs. Only affinity maturation using the Fab-like FabΔC format, which does not contain the carboxy terminal cysteines, allowed successful selection of molecules with improved binding that was retained after conversion to IgG. Thus, affinity maturation of IgGs is dependent on the antibody format employed for selection and screening. In this study, only FabΔC resulted in the efficient selection of IgG candidates with higher affinity by combination of Fab-like conformation and improved phage display compared with Fab. PMID:24262918
Gbadegesin, M A; Beeching, J R
2011-06-07
Cassava can be cultivated on impoverished soils with minimum inputs, and its storage roots are a staple food for millions in Africa. However, these roots are low in bioavailable nutrients and in protein content, contain cyanogenic glycosides, and suffer from a very short post-harvest shelf-life, and the plant is susceptible to viral and bacterial diseases prevalent in Africa. The demand for improvement of cassava with respect to these traits comes from both farmers and national agricultural institutions. Genetic improvement of cassava cultivars by molecular biology techniques requires the availability of appropriate genes, a system to introduce these genes into cassava, and the use of suitable gene promoters. Cassava root-specific promoter for auxin-repressed protein was isolated using the gene walking approach, starting with a cDNA sequence. In silico analysis of promoter sequences revealed putative cis-acting regulatory elements, including root-specific elements, which may be required for gene expression in vascular tissues. Research on the activities of this promoter is continuing, with the development of plant expression cassettes for transformation into major African elite lines and farmers' preferred cassava cultivars to enable testing of tissue-specific expression patterns in the field.
Puglia, Meghan H.; Lillard, Travis S.; Morris, James P.; Connelly, Jessica J.
2015-01-01
In humans, the neuropeptide oxytocin plays a critical role in social and emotional behavior. The actions of this molecule are dependent on a protein that acts as its receptor, which is encoded by the oxytocin receptor gene (OXTR). DNA methylation of OXTR, an epigenetic modification, directly influences gene transcription and is variable in humans. However, the impact of this variability on specific social behaviors is unknown. We hypothesized that variability in OXTR methylation impacts social perceptual processes often linked with oxytocin, such as perception of facial emotions. Using an imaging epigenetic approach, we established a relationship between OXTR methylation and neural activity in response to emotional face processing. Specifically, high levels of OXTR methylation were associated with greater amounts of activity in regions associated with face and emotion processing including amygdala, fusiform, and insula. Importantly, we found that these higher levels of OXTR methylation were also associated with decreased functional coupling of amygdala with regions involved in affect appraisal and emotion regulation. These data indicate that the human endogenous oxytocin system is involved in attenuation of the fear response, corroborating research implicating intranasal oxytocin in the same processes. Our findings highlight the importance of including epigenetic mechanisms in the description of the endogenous oxytocin system and further support a central role for oxytocin in social cognition. This approach linking epigenetic variability with neural endophenotypes may broadly explain individual differences in phenotype including susceptibility or resilience to disease. PMID:25675509
QuASAR-MPRA: accurate allele-specific analysis for massively parallel reporter assays.
Kalita, Cynthia A; Moyerbrailean, Gregory A; Brown, Christopher; Wen, Xiaoquan; Luca, Francesca; Pique-Regi, Roger
2018-03-01
The majority of the human genome is composed of non-coding regions containing regulatory elements such as enhancers, which are crucial for controlling gene expression. Many variants associated with complex traits are in these regions, and may disrupt gene regulatory sequences. Consequently, it is important to not only identify true enhancers but also to test if a variant within an enhancer affects gene regulation. Recently, allele-specific analysis in high-throughput reporter assays, such as massively parallel reporter assays (MPRAs), have been used to functionally validate non-coding variants. However, we are still missing high-quality and robust data analysis tools for these datasets. We have further developed our method for allele-specific analysis QuASAR (quantitative allele-specific analysis of reads) to analyze allele-specific signals in barcoded read counts data from MPRA. Using this approach, we can take into account the uncertainty on the original plasmid proportions, over-dispersion, and sequencing errors. The provided allelic skew estimate and its standard error also simplifies meta-analysis of replicate experiments. Additionally, we show that a beta-binomial distribution better models the variability present in the allelic imbalance of these synthetic reporters and results in a test that is statistically well calibrated under the null. Applying this approach to the MPRA data, we found 602 SNPs with significant (false discovery rate 10%) allele-specific regulatory function in LCLs. We also show that we can combine MPRA with QuASAR estimates to validate existing experimental and computational annotations of regulatory variants. Our study shows that with appropriate data analysis tools, we can improve the power to detect allelic effects in high-throughput reporter assays. http://github.com/piquelab/QuASAR/tree/master/mpra. fluca@wayne.edu or rpique@wayne.edu. Supplementary data are available online at Bioinformatics. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Highly specific targeting of the TMPRSS2/ERG fusion gene using liposomal nanovectors
Shao, Longjiang; Tekedereli, Ibrahim; Wang, Jianghua; Yuca, Erkan; Tsang, Susan; Sood, Anil; Lopez-Berestein, Gabriel; Ozpolat, Bulent; Ittmann, Michael
2012-01-01
Purpose The TMPRSS2/ERG (T/E) fusion gene is present in half of all prostate cancer (PCa) tumors. Fusion of the oncogenic ERG gene with the androgen-regulated TMPRSS2 gene promoter results in expression of fusion mRNAs in PCa cells. The junction of theTMPRSS2 and ERG derived portions of the fusion mRNA constitutes a cancer specific target in cells containing the T/E fusion gene. Targeting the most common alternatively spliced fusion gene mRNA junctional isoforms in vivo using siRNAs in liposomal nanovectors may potentially be a novel, low toxicity treatment for PCa. Experimental Design We designed and optimized siRNAs targeting the two most common T/E fusion gene mRNA junctional isoforms (Type III or Type VI). Specificity of siRNAs was assessed by transient co-transfection in vitro. To test their ability to inhibit growth of PCa cells expressing these fusion gene isoforms in vivo, specific siRNAs in liposomal nanovectors were used to treat mice bearing orthotopic or subcutaneous xenograft tumors expressing the targeted fusion isoforms. Results The targeting siRNAs were both potent and highly specific in vitro. In vivo they significantly inhibited tumor growth. The degree of growth inhibition was variable and was correlated with the extent of fusion gene knockdown. The growth inhibition was associated with marked inhibition of angiogenesis and, to a lesser degree, proliferation and a marked increase in apoptosis of tumor cells. No toxicity was observed. Conclusions Targeting the T/E fusion junction in vivo with specific siRNAs delivered via liposomal nanovectors is a promising therapy for men with PCa. PMID:23052253
Highly specific targeting of the TMPRSS2/ERG fusion gene using liposomal nanovectors.
Shao, Longjiang; Tekedereli, Ibrahim; Wang, Jianghua; Yuca, Erkan; Tsang, Susan; Sood, Anil; Lopez-Berestein, Gabriel; Ozpolat, Bulent; Ittmann, Michael
2012-12-15
The TMPRSS2/ERG (T/E) fusion gene is present in half of all prostate cancer tumors. Fusion of the oncogenic ERG gene with the androgen-regulated TMPRSS2 gene promoter results in expression of fusion mRNAs in prostate cancer cells. The junction of theTMPRSS2- and ERG-derived portions of the fusion mRNA constitutes a cancer-specific target in cells containing the T/E fusion gene. Targeting the most common alternatively spliced fusion gene mRNA junctional isoforms in vivo using siRNAs in liposomal nanovectors may potentially be a novel, low-toxicity treatment for prostate cancer. We designed and optimized siRNAs targeting the two most common T/E fusion gene mRNA junctional isoforms (type III or type VI). Specificity of siRNAs was assessed by transient co-transfection in vitro. To test their ability to inhibit growth of prostate cancer cells expressing these fusion gene isoforms in vivo, specific siRNAs in liposomal nanovectors were used to treat mice bearing orthotopic or subcutaneous xenograft tumors expressing the targeted fusion isoforms. The targeting siRNAs were both potent and highly specific in vitro. In vivo they significantly inhibited tumor growth. The degree of growth inhibition was variable and was correlated with the extent of fusion gene knockdown. The growth inhibition was associated with marked inhibition of angiogenesis and, to a lesser degree, proliferation and a marked increase in apoptosis of tumor cells. No toxicity was observed. Targeting the T/E fusion junction in vivo with specific siRNAs delivered via liposomal nanovectors is a promising therapy for men with prostate cancer. ©2012 AACR.
The genome of melon (Cucumis melo L.)
Garcia-Mas, Jordi; Benjak, Andrej; Sanseverino, Walter; Bourgeois, Michael; Mir, Gisela; González, Víctor M.; Hénaff, Elizabeth; Câmara, Francisco; Cozzuto, Luca; Lowy, Ernesto; Alioto, Tyler; Capella-Gutiérrez, Salvador; Blanca, Jose; Cañizares, Joaquín; Ziarsolo, Pello; Gonzalez-Ibeas, Daniel; Rodríguez-Moreno, Luis; Droege, Marcus; Du, Lei; Alvarez-Tejado, Miguel; Lorente-Galdos, Belen; Melé, Marta; Yang, Luming; Weng, Yiqun; Navarro, Arcadi; Marques-Bonet, Tomas; Aranda, Miguel A.; Nuez, Fernando; Picó, Belén; Gabaldón, Toni; Roma, Guglielmo; Guigó, Roderic; Casacuberta, Josep M.; Arús, Pere; Puigdomènech, Pere
2012-01-01
We report the genome sequence of melon, an important horticultural crop worldwide. We assembled 375 Mb of the double-haploid line DHL92, representing 83.3% of the estimated melon genome. We predicted 27,427 protein-coding genes, which we analyzed by reconstructing 22,218 phylogenetic trees, allowing mapping of the orthology and paralogy relationships of sequenced plant genomes. We observed the absence of recent whole-genome duplications in the melon lineage since the ancient eudicot triplication, and our data suggest that transposon amplification may in part explain the increased size of the melon genome compared with the close relative cucumber. A low number of nucleotide-binding site–leucine-rich repeat disease resistance genes were annotated, suggesting the existence of specific defense mechanisms in this species. The DHL92 genome was compared with that of its parental lines allowing the quantification of sequence variability in the species. The use of the genome sequence in future investigations will facilitate the understanding of evolution of cucurbits and the improvement of breeding strategies. PMID:22753475
Peters, Derek T.; Henderson, Christopher A.; Warren, Curtis R.; Friesen, Max; Xia, Fang; Becker, Caroline E.; Musunuru, Kiran; Cowan, Chad A.
2016-01-01
ABSTRACT Hepatocyte-like cells (HLCs) are derived from human pluripotent stem cells (hPSCs) in vitro, but differentiation protocols commonly give rise to a heterogeneous mixture of cells. This variability confounds the evaluation of in vitro functional assays performed using HLCs. Increased differentiation efficiency and more accurate approximation of the in vivo hepatocyte gene expression profile would improve the utility of hPSCs. Towards this goal, we demonstrate the purification of a subpopulation of functional HLCs using the hepatocyte surface marker asialoglycoprotein receptor 1 (ASGR1). We analyzed the expression profile of ASGR1-positive cells by microarray, and tested their ability to perform mature hepatocyte functions (albumin and urea secretion, cytochrome activity). By these measures, ASGR1-positive HLCs are enriched for the gene expression profile and functional characteristics of primary hepatocytes compared with unsorted HLCs. We have demonstrated that ASGR1-positive sorting isolates a functional subpopulation of HLCs from among the heterogeneous cellular population produced by directed differentiation. PMID:27143754
Plankton networks driving carbon export in the oligotrophic ocean
NASA Astrophysics Data System (ADS)
Guidi, L.; Chaffron, S.; Bittner, L.; Eveillard, D.; Raes, J.; Karsenti, E.; Bowler, C.; Gorsky, G.
2016-02-01
The biological carbon pump is the process by which CO2 is transformed to organic carbon via photosynthesis that sinks to the deep ocean as particles where it is sequestered. While the intensity of the pump correlates with plankton community composition, the underlying ecosystem structure and interactions driving the process remain largely uncharacterised. Here we use environmental and metagenomic data gathered during the Tara Oceans expedition to improve our understanding of the underlying processes. We show that specific plankton communities correlate with carbon export and highlight unexpected and overlooked taxa such as Radiolaria, alveolate parasites, as well as Synechococcus and their phages, as lineages most strongly associated with carbon export in the subtropical oligotrophic ocean. Additionally, we show that the relative abundance of just a few bacterial and viral genes can predict most of the variability in carbon export in these regions. Together these results help elucidate ecosystem drivers of the biological carbon pump and present a case study for scaling from genes-to-ecosystems.
Gadermaier, Elisabeth; Flicker, Sabine; Lupinek, Christian; Steinberger, Peter; Valenta, Rudolf
2013-04-01
Affinity and clonality of allergen-specific IgE antibodies are important determinants for the magnitude of IgE-mediated allergic inflammation. We sought to analyze the contribution of heavy and light chains of human allergen-specific IgE antibodies for allergen specificity and to test whether promiscuous pairing of heavy and light chains with different allergen specificity allows binding and might affect affinity. Ten IgE Fabs specific for 3 non-cross-reactive major timothy grass pollen allergens (Phl p 1, Phl p 2, and Phl p 5) obtained by means of combinatorial cloning from patients with grass pollen allergy were used to construct stable recombinant single chain variable fragments (ScFvs) representing the original Fabs and shuffled ScFvs in which heavy chains were recombined with light chains from IgE Fabs with specificity for other allergens by using the pCANTAB 5 E expression system. Possible ancestor genes for the heavy chain and light chain variable region-encoding genes were determined by using sequence comparison with the ImMunoGeneTics database, and their chromosomal locations were determined. Recombinant ScFvs were tested for allergen specificity and epitope recognition by means of direct and sandwich ELISA, and affinity by using surface plasmon resonance experiments. The shuffling experiments demonstrate that promiscuous pairing of heavy and light chains is possible and maintains allergen specificity, which is mainly determined by the heavy chains. ScFvs consisting of different heavy and light chains exhibited different affinities and even epitope specificity for the corresponding allergen. Our results indicate that allergen specificity of allergen-specific IgE is mainly determined by the heavy chains. Different heavy and light chain pairings in allergen-specific IgE antibodies affect affinity and epitope specificity and thus might influence clinical reactivity to allergens. Copyright © 2012 American Academy of Allergy, Asthma & Immunology. Published by Mosby, Inc. All rights reserved.
Han, Yuedong; Haun, Yi; Deng, Jinlan; Gao, Feng; Pan, Bifeng; Cui, Daxiang
2006-01-01
Fabricating a single-chain variable fragment specific for human seminoprotein is very important in antibody-directed enzyme prodrug therapy and NMR imaging for prostate cancer. Here a single-chain Fv specific for gamma-seminoprotein was expressed by RTS. Its activity and the efficiency of entry into prostate cancer cells are investigated by immunoprecipitation and Western blotting and immunofluorescent staining, as well as entry of conjugated magnetic beads into cells. Results showed that ScFv peptides specific for gamma-seminoprotein were successfully prepared, which can bind with the prostate cells specifically and can bring magnetic beads into prostate cancer cells within 15 min, the amount of magnetic beads inside prostate cancer cells increased as the culture time prolonged. ScFv-conjugated magnetic beads did not enter into control cells. In conclusion, the ScFv peptide against human gamma-seminoprotein with biological activity was successfully fabricated, which can take magnetic beads to prostate cancer cells specifically and not to the control cells. This ScFv peptide against human gamma-seminoprotein should be useful in improving the detection and therapy of prostate cancer at early stages and NMR imaging.
Ceapa, Corina; Davids, Mark; Ritari, Jarmo; Lambert, Jolanda; Wels, Michiel; Douillard, François P.; Smokvina, Tamara; de Vos, Willem M.; Knol, Jan; Kleerebezem, Michiel
2016-01-01
Lactobacillus rhamnosus is a diverse Gram-positive species with strains isolated from different ecological niches. Here, we report the genome sequence analysis of 40 diverse strains of L. rhamnosus and their genomic comparison, with a focus on the variable genome. Genomic comparison of 40 L. rhamnosus strains discriminated the conserved genes (core genome) and regions of plasticity involving frequent rearrangements and horizontal transfer (variome). The L. rhamnosus core genome encompasses 2,164 genes, out of 4,711 genes in total (the pan-genome). The accessory genome is dominated by genes encoding carbohydrate transport and metabolism, extracellular polysaccharides (EPS) biosynthesis, bacteriocin production, pili production, the cas system, and the associated clustered regularly interspaced short palindromic repeat (CRISPR) loci, and more than 100 transporter functions and mobile genetic elements like phages, plasmid genes, and transposons. A clade distribution based on amino acid differences between core (shared) proteins matched with the clade distribution obtained from the presence–absence of variable genes. The phylogenetic and variome tree overlap indicated that frequent events of gene acquisition and loss dominated the evolutionary segregation of the strains within this species, which is paralleled by evolutionary diversification of core gene functions. The CRISPR-Cas system could have contributed to this evolutionary segregation. Lactobacillus rhamnosus strains contain the genetic and metabolic machinery with strain-specific gene functions required to adapt to a large range of environments. A remarkable congruency of the evolutionary relatedness of the strains’ core and variome functions, possibly favoring interspecies genetic exchanges, underlines the importance of gene-acquisition and loss within the L. rhamnosus strain diversification. PMID:27358423
Trefzer, Axel; Jungmann, Volker; Molnár, István; Botejue, Ajit; Buckel, Dagmar; Frey, Gerhard; Hill, D. Steven; Jörg, Mario; Ligon, James M.; Mason, Dylan; Moore, David; Pachlatko, J. Paul; Richardson, Toby H.; Spangenberg, Petra; Wall, Mark A.; Zirkle, Ross; Stege, Justin T.
2007-01-01
Discovery of the CYP107Z subfamily of cytochrome P450 oxidases (CYPs) led to an alternative biocatalytic synthesis of 4″-oxo-avermectin, a key intermediate for the commercial production of the semisynthetic insecticide emamectin. However, under industrial process conditions, these wild-type CYPs showed lower yields due to side product formation. Molecular evolution employing GeneReassembly was used to improve the regiospecificity of these enzymes by a combination of random mutagenesis, protein structure-guided site-directed mutagenesis, and recombination of multiple natural and synthetic CYP107Z gene fragments. To assess the specificity of CYP mutants, a miniaturized, whole-cell biocatalytic reaction system that allowed high-throughput screening of large numbers of variants was developed. In an iterative process consisting of four successive rounds of GeneReassembly evolution, enzyme variants with significantly improved specificity for the production of 4″-oxo-avermectin were identified; these variants could be employed for a more economical industrial biocatalytic process to manufacture emamectin. PMID:17483257
Synthetic spike-in standards for high-throughput 16S rRNA gene amplicon sequencing
Tourlousse, Dieter M.; Yoshiike, Satowa; Ohashi, Akiko; Matsukura, Satoko; Noda, Naohiro
2017-01-01
Abstract High-throughput sequencing of 16S rRNA gene amplicons (16S-seq) has become a widely deployed method for profiling complex microbial communities but technical pitfalls related to data reliability and quantification remain to be fully addressed. In this work, we have developed and implemented a set of synthetic 16S rRNA genes to serve as universal spike-in standards for 16S-seq experiments. The spike-ins represent full-length 16S rRNA genes containing artificial variable regions with negligible identity to known nucleotide sequences, permitting unambiguous identification of spike-in sequences in 16S-seq read data from any microbiome sample. Using defined mock communities and environmental microbiota, we characterized the performance of the spike-in standards and demonstrated their utility for evaluating data quality on a per-sample basis. Further, we showed that staggered spike-in mixtures added at the point of DNA extraction enable concurrent estimation of absolute microbial abundances suitable for comparative analysis. Results also underscored that template-specific Illumina sequencing artifacts may lead to biases in the perceived abundance of certain taxa. Taken together, the spike-in standards represent a novel bioanalytical tool that can substantially improve 16S-seq-based microbiome studies by enabling comprehensive quality control along with absolute quantification. PMID:27980100
Targeted polymeric nanoparticles for cancer gene therapy
Kim, Jayoung; Wilson, David R.; Zamboni, Camila G.; Green, Jordan J.
2015-01-01
In this article, advances in designing polymeric nanoparticles for targeted cancer gene therapy are reviewed. Characterization and evaluation of biomaterials, targeting ligands, and transcriptional elements are each discussed. Advances in biomaterials have driven improvements to nanoparticle stability and tissue targeting, conjugation of ligands to the surface of polymeric nanoparticles enable binding to specific cancer cells, and the design of transcriptional elements has enabled selective DNA expression specific to the cancer cells. Together, these features have improved the performance of polymeric nanoparticles as targeted non-viral gene delivery vectors to treat cancer. As polymeric nanoparticles can be designed to be biodegradable, non-toxic, and to have reduced immunogenicity and tumorigenicity compared to viral platforms, they have significant potential for clinical use. Results of polymeric gene therapy in clinical trials and future directions for the engineering of nanoparticle systems for targeted cancer gene therapy are also presented. PMID:26061296
Frech, Christian; Chen, Nansheng
2011-01-01
Genes underlying important phenotypic differences between Plasmodium species, the causative agents of malaria, are frequently found in only a subset of species and cluster at dynamically evolving subtelomeric regions of chromosomes. We hypothesized that chromosome-internal regions of Plasmodium genomes harbour additional species subset-specific genes that underlie differences in human pathogenicity, human-to-human transmissibility, and human virulence. We combined sequence similarity searches with synteny block analyses to identify species subset-specific genes in chromosome-internal regions of six published Plasmodium genomes, including Plasmodium falciparum, Plasmodium vivax, Plasmodium knowlesi, Plasmodium yoelii, Plasmodium berghei, and Plasmodium chabaudi. To improve comparative analysis, we first revised incorrectly annotated gene models using homology-based gene finders and examined putative subset-specific genes within syntenic contexts. Confirmed subset-specific genes were then analyzed for their role in biological pathways and examined for molecular functions using publicly available databases. We identified 16 genes that are well conserved in the three primate parasites but not found in rodent parasites, including three key enzymes of the thiamine (vitamin B1) biosynthesis pathway. Thirteen genes were found to be present in both human parasites but absent in the monkey parasite P. knowlesi, including genes specifically upregulated in sporozoites or gametocytes that could be linked to parasite transmission success between humans. Furthermore, we propose 15 chromosome-internal P. falciparum-specific genes as new candidate genes underlying increased human virulence and detected a currently uncharacterized cluster of P. vivax-specific genes on chromosome 6 likely involved in erythrocyte invasion. In conclusion, Plasmodium species harbour many chromosome-internal differences in the form of protein-coding genes, some of which are potentially linked to human disease and thus promising leads for future laboratory research. PMID:22215999
Dickey, C; Santella, R M; Hattis, D; Tang, D; Hsu, Y; Cooper, T; Young, T L; Perera, F P
1997-10-01
Biomarkers such as DNA adducts have significant potential to improve quantitative risk assessment by characterizing individual differences in metabolism of genotoxins and DNA repair and accounting for some of the factors that could affect interindividual variation in cancer risk. Inherent uncertainty in laboratory measurements and within-person variability of DNA adduct levels over time are putatively unrelated to cancer risk and should be subtracted from observed variation to better estimate interindividual variability of response to carcinogen exposure. A total of 41 volunteers, both smokers and nonsmokers, were asked to provide a peripheral blood sample every 3 weeks for several months in order to specifically assess intraindividual variability of polycyclic aromatic hydrocarbon (PAH)-DNA adduct levels. The intraindividual variance in PAH-DNA adduct levels, together with measurement uncertainty (laboratory variability and unaccounted for differences in exposure), constituted roughly 30% of the overall variance. An estimated 70% of the total variance was contributed by interindividual variability and is probably representative of the true biologic variability of response to carcinogenic exposure in lymphocytes. The estimated interindividual variability in DNA damage after subtracting intraindividual variability and measurement uncertainty was 24-fold. Inter-individual variance was higher (52-fold) in persons who constitutively lack the Glutathione S-Transferase M1 (GSTM1) gene which is important in the detoxification pathway of PAH. Risk assessment models that do not consider the variability of susceptibility to DNA damage following carcinogen exposure may underestimate risks to the general population, especially for those people who are most vulnerable.
Primer sets for cloning the human repertoire of T cell Receptor Variable regions
Boria, Ilenia; Cotella, Diego; Dianzani, Irma; Santoro, Claudio; Sblattero, Daniele
2008-01-01
Background Amplification and cloning of naïve T cell Receptor (TR) repertoires or antigen-specific TR is crucial to shape immune response and to develop immuno-based therapies. TR variable (V) regions are encoded by several genes that recombine during T cell development. The cloning of expressed genes as large diverse libraries from natural sources relies upon the availability of primers able to amplify as many V genes as possible. Results Here, we present a list of primers computationally designed on all functional TR V and J genes listed in the IMGT®, the ImMunoGeneTics information system®. The list consists of unambiguous or degenerate primers suitable to theoretically amplify and clone the entire TR repertoire. We show that it is possible to selectively amplify and clone expressed TR V genes in one single RT-PCR step and from as little as 1000 cells. Conclusion This new primer set will facilitate the creation of more diverse TR libraries than has been possible using currently available primer sets. PMID:18759974
Systems Biophysics of Gene Expression
Vilar, Jose M.G.; Saiz, Leonor
2013-01-01
Gene expression is a process central to any form of life. It involves multiple temporal and functional scales that extend from specific protein-DNA interactions to the coordinated regulation of multiple genes in response to intracellular and extracellular changes. This diversity in scales poses fundamental challenges to the use of traditional approaches to fully understand even the simplest gene expression systems. Recent advances in computational systems biophysics have provided promising avenues to reliably integrate the molecular detail of biophysical process into the system behavior. Here, we review recent advances in the description of gene regulation as a system of biophysical processes that extend from specific protein-DNA interactions to the combinatorial assembly of nucleoprotein complexes. There is now basic mechanistic understanding on how promoters controlled by multiple, local and distal, DNA binding sites for transcription factors can actively control transcriptional noise, cell-to-cell variability, and other properties of gene regulation, including precision and flexibility of the transcriptional responses. PMID:23790365
Hoppe, U C; Marbán, E; Johns, D C
2001-04-24
The long QT syndrome (LQTS) is a heritable disorder that predisposes to sudden cardiac death. LQTS is caused by mutations in ion channel genes including HERG and KCNE1, but the precise mechanisms remain unclear. To clarify this situation we injected adenoviral vectors expressing wild-type or LQT mutants of HERG and KCNE1 into guinea pig myocardium. End points at 48-72 h included electrophysiology in isolated myocytes and electrocardiography in vivo. HERG increased the rapid component, I(Kr), of the delayed rectifier current, thereby accelerating repolarization, increasing refractoriness, and diminishing beat-to-beat action potential variability. Conversely, HERG-G628S suppressed I(Kr) without significantly delaying repolarization. Nevertheless, HERG-G628S abbreviated refractoriness and increased beat-to-beat variability, leading to early afterdepolarizations (EADs). KCNE1 increased the slow component of the delayed rectifier, I(Ks), without clear phenotypic sequelae. In contrast, KCNE1-D76N suppressed I(Ks) and markedly slowed repolarization, leading to frequent EADs and electrocardiographic QT prolongation. Thus, the two genes predispose to sudden death by distinct mechanisms: the KCNE1 mutant flagrantly undermines cardiac repolarization, and HERG-G628S subtly facilitates the genesis and propagation of premature beats. Our ability to produce electrocardiographic long QT in vivo with a clinical KCNE1 mutation demonstrates the utility of somatic gene transfer in creating genotype-specific disease models.
Noise in gene expression is coupled to growth rate.
Keren, Leeat; van Dijk, David; Weingarten-Gabbay, Shira; Davidi, Dan; Jona, Ghil; Weinberger, Adina; Milo, Ron; Segal, Eran
2015-12-01
Genetically identical cells exposed to the same environment display variability in gene expression (noise), with important consequences for the fidelity of cellular regulation and biological function. Although population average gene expression is tightly coupled to growth rate, the effects of changes in environmental conditions on expression variability are not known. Here, we measure the single-cell expression distributions of approximately 900 Saccharomyces cerevisiae promoters across four environmental conditions using flow cytometry, and find that gene expression noise is tightly coupled to the environment and is generally higher at lower growth rates. Nutrient-poor conditions, which support lower growth rates, display elevated levels of noise for most promoters, regardless of their specific expression values. We present a simple model of noise in expression that results from having an asynchronous population, with cells at different cell-cycle stages, and with different partitioning of the cells between the stages at different growth rates. This model predicts non-monotonic global changes in noise at different growth rates as well as overall higher variability in expression for cell-cycle-regulated genes in all conditions. The consistency between this model and our data, as well as with noise measurements of cells growing in a chemostat at well-defined growth rates, suggests that cell-cycle heterogeneity is a major contributor to gene expression noise. Finally, we identify gene and promoter features that play a role in gene expression noise across conditions. Our results show the existence of growth-related global changes in gene expression noise and suggest their potential phenotypic implications. © 2015 Keren et al.; Published by Cold Spring Harbor Laboratory Press.
Noise in gene expression is coupled to growth rate
Keren, Leeat; van Dijk, David; Weingarten-Gabbay, Shira; Davidi, Dan; Jona, Ghil; Weinberger, Adina; Milo, Ron; Segal, Eran
2015-01-01
Genetically identical cells exposed to the same environment display variability in gene expression (noise), with important consequences for the fidelity of cellular regulation and biological function. Although population average gene expression is tightly coupled to growth rate, the effects of changes in environmental conditions on expression variability are not known. Here, we measure the single-cell expression distributions of approximately 900 Saccharomyces cerevisiae promoters across four environmental conditions using flow cytometry, and find that gene expression noise is tightly coupled to the environment and is generally higher at lower growth rates. Nutrient-poor conditions, which support lower growth rates, display elevated levels of noise for most promoters, regardless of their specific expression values. We present a simple model of noise in expression that results from having an asynchronous population, with cells at different cell-cycle stages, and with different partitioning of the cells between the stages at different growth rates. This model predicts non-monotonic global changes in noise at different growth rates as well as overall higher variability in expression for cell-cycle–regulated genes in all conditions. The consistency between this model and our data, as well as with noise measurements of cells growing in a chemostat at well-defined growth rates, suggests that cell-cycle heterogeneity is a major contributor to gene expression noise. Finally, we identify gene and promoter features that play a role in gene expression noise across conditions. Our results show the existence of growth-related global changes in gene expression noise and suggest their potential phenotypic implications. PMID:26355006
Frequency of Fanconi anemia in Brazil and efficacy of screening for the FANCA 3788-3790del mutation.
Magdalena, N; Pilonetto, D V; Bitencourt, M A; Pereira, N F; Ribeiro, R C; Jeng, M; Pasquini, R
2005-05-01
Fanconi anemia (FA) is an autosomal recessive genetic disease characterized by progressive bone marrow failure, susceptibility to cancer and multiple congenital anomalies. There is important clinical variability among patients and the knowledge of factors which might predict outcome would greatly help the decision making regarding the choices of treatment and the appropriate time to start it. Future studies of the possible correlation between specific mutations with specific clinical presentations will provide the answer to one of these factors. At our Center we standardized a rapid and precise screening test using a mismatch PCR assay for a specific mutation (3788-3790del in exon 38 of gene FANCA) in Brazilian FA patients. We present the results obtained after screening 80 non-consanguineous FA patients referred from all regions of Brazil with a clinical diagnosis of FA supported by cellular hypersensitivity to diepoxybutane. We were able to detect the 3788-3790del allele in 24 of the 80 (30%) FA patients studied. Thirteen of the 80 (16.25%) were homozygotes and 11 of the 80 (13.75%) were compound heterozygotes, thus confirming the high frequency of the FANCA 3788-3790del mutation in Brazilian FA patients. The identification of patients with specific mutations in the FA genes may lead to a better clinical description of this condition, also providing data for genotype-phenotype correlations, to a better understanding of the interaction of this specific mutation with other mutations in compound heterozygote patients, and ultimately to the right choices of treatment for each patient with improvement of the prognosis on future studies.
Functional analysis and transcriptional output of the Göttingen minipig genome.
Heckel, Tobias; Schmucki, Roland; Berrera, Marco; Ringshandl, Stephan; Badi, Laura; Steiner, Guido; Ravon, Morgane; Küng, Erich; Kuhn, Bernd; Kratochwil, Nicole A; Schmitt, Georg; Kiialainen, Anna; Nowaczyk, Corinne; Daff, Hamina; Khan, Azinwi Phina; Lekolool, Isaac; Pelle, Roger; Okoth, Edward; Bishop, Richard; Daubenberger, Claudia; Ebeling, Martin; Certa, Ulrich
2015-11-14
In the past decade the Göttingen minipig has gained increasing recognition as animal model in pharmaceutical and safety research because it recapitulates many aspects of human physiology and metabolism. Genome-based comparison of drug targets together with quantitative tissue expression analysis allows rational prediction of pharmacology and cross-reactivity of human drugs in animal models thereby improving drug attrition which is an important challenge in the process of drug development. Here we present a new chromosome level based version of the Göttingen minipig genome together with a comparative transcriptional analysis of tissues with pharmaceutical relevance as basis for translational research. We relied on mapping and assembly of WGS (whole-genome-shotgun sequencing) derived reads to the reference genome of the Duroc pig and predict 19,228 human orthologous protein-coding genes. Genome-based prediction of the sequence of human drug targets enables the prediction of drug cross-reactivity based on conservation of binding sites. We further support the finding that the genome of Sus scrofa contains about ten-times less pseudogenized genes compared to other vertebrates. Among the functional human orthologs of these minipig pseudogenes we found HEPN1, a putative tumor suppressor gene. The genomes of Sus scrofa, the Tibetan boar, the African Bushpig, and the Warthog show sequence conservation of all inactivating HEPN1 mutations suggesting disruption before the evolutionary split of these pig species. We identify 133 Sus scrofa specific, conserved long non-coding RNAs (lncRNAs) in the minipig genome and show that these transcripts are highly conserved in the African pigs and the Tibetan boar suggesting functional significance. Using a new minipig specific microarray we show high conservation of gene expression signatures in 13 tissues with biomedical relevance between humans and adult minipigs. We underline this relationship for minipig and human liver where we could demonstrate similar expression levels for most phase I drug-metabolizing enzymes. Higher expression levels and metabolic activities were found for FMO1, AKR/CRs and for phase II drug metabolizing enzymes in minipig as compared to human. The variability of gene expression in equivalent human and minipig tissues is considerably higher in minipig organs, which is important for study design in case a human target belongs to this variable category in the minipig. The first analysis of gene expression in multiple tissues during development from young to adult shows that the majority of transcriptional programs are concluded four weeks after birth. This finding is in line with the advanced state of human postnatal organ development at comparative age categories and further supports the minipig as model for pediatric drug safety studies. Genome based assessment of sequence conservation combined with gene expression data in several tissues improves the translational value of the minipig for human drug development. The genome and gene expression data presented here are important resources for researchers using the minipig as model for biomedical research or commercial breeding. Potential impact of our data for comparative genomics, translational research, and experimental medicine are discussed.
Kia, Azadeh; Yata, Teerapong; Hajji, Nabil; Hajitou, Amin
2013-10-22
Bacteriophage (phage), viruses that infect bacteria only, have become promising vectors for targeted systemic delivery of genes to cancer, although, with poor efficiency. We previously designed an improved phage vector by incorporating cis genetic elements of adeno-associated virus (AAV). This novel AAV/phage hybrid (AAVP) specifically targeted systemic delivery of therapeutic genes into tumors. To advance the AAVP vector, we recently introduced the stress-inducible Grp78 tumor specific promoter and found that this dual tumor-targeted AAVP provides persistent gene expression, over time, in cancer cells compared to silenced gene expression from the CMV promoter in the parental AAVP. Herein, we investigated the effect of histone deacetylation and DNA methylation on AAVP-mediated gene expression in cancer cells and explored the effect of cell confluence state on AAVP gene expression efficacy. Using a combination of AAVP expressing the GFP reporter gene, flow cytometry, inhibitors of histone deacetylation, and DNA methylation, we have demonstrated that histone deacetylation and DNA methylation are associated with silencing of gene expression from the CMV promoter in the parental AAVP. Importantly, inhibitors of histone deacetylases boost gene expression in cancer cells from the Grp78 promoter in the dual tumor-targeted AAVP. However, cell confluence had no effect on AAVP-guided gene expression. Our findings prove that combination of histone deacetylase inhibitor drugs with the Grp78 promoter is an effective approach to improve AAVP-mediated gene expression in cancer cells and should be considered for AAVP-based clinical cancer gene therapy.
Kazemian, Majid; Zhu, Qiyun; Halfon, Marc S; Sinha, Saurabh
2011-12-01
Despite recent advances in experimental approaches for identifying transcriptional cis-regulatory modules (CRMs, 'enhancers'), direct empirical discovery of CRMs for all genes in all cell types and environmental conditions is likely to remain an elusive goal. Effective methods for computational CRM discovery are thus a critically needed complement to empirical approaches. However, existing computational methods that search for clusters of putative binding sites are ineffective if the relevant TFs and/or their binding specificities are unknown. Here, we provide a significantly improved method for 'motif-blind' CRM discovery that does not depend on knowledge or accurate prediction of TF-binding motifs and is effective when limited knowledge of functional CRMs is available to 'supervise' the search. We propose a new statistical method, based on 'Interpolated Markov Models', for motif-blind, genome-wide CRM discovery. It captures the statistical profile of variable length words in known CRMs of a regulatory network and finds candidate CRMs that match this profile. The method also uses orthologs of the known CRMs from closely related genomes. We perform in silico evaluation of predicted CRMs by assessing whether their neighboring genes are enriched for the expected expression patterns. This assessment uses a novel statistical test that extends the widely used Hypergeometric test of gene set enrichment to account for variability in intergenic lengths. We find that the new CRM prediction method is superior to existing methods. Finally, we experimentally validate 12 new CRM predictions by examining their regulatory activity in vivo in Drosophila; 10 of the tested CRMs were found to be functional, while 6 of the top 7 predictions showed the expected activity patterns. We make our program available as downloadable source code, and as a plugin for a genome browser installed on our servers. © The Author(s) 2011. Published by Oxford University Press.
Gerrits, Martin F; Ghosh, Sujoy; Kavaslar, Nihan; Hill, Benjamin; Tour, Anastasia; Seifert, Erin L; Beauchamp, Brittany; Gorman, Shelby; Stuart, Joan; Dent, Robert; McPherson, Ruth; Harper, Mary-Ellen
2010-08-01
Inter-individual variability in weight gain and loss under energy surfeit and deficit conditions, respectively, are well recognized but poorly understood phenomena. We documented weight loss variability in an intensively supervised clinical weight loss program and assessed skeletal muscle gene expression and phenotypic characteristics related to variable response to a 900 kcal regimen. Matched pairs of healthy, diet-compliant, obese diet-sensitive (ODS) and diet-resistant (ODR) subjects were defined as those in the highest and lowest quintiles for weight loss rate. Physical activity energy expenditure was minimal and comparable. Following program completion and weight stabilization, skeletal muscle biopsies were obtained. Gene expression analysis of rectus femoris and vastus lateralis indicated upregulation of genes and gene sets involved in oxidative phosphorylation and glucose and fatty acid metabolism in ODS compared with ODR. In vastus lateralis, there was a higher proportion of oxidative (type I) fibers in ODS compared with ODR women and lean controls, fiber hypertrophy in ODS compared with ODR women and lean controls, and lower succinate dehydrogenase in oxidative and oxidative-glycolytic fibers in all obese compared with lean subjects. Intramuscular lipid content was generally higher in obese versus lean, and specifically higher in ODS vs. lean women. Altogether, our findings demonstrate differences in muscle gene expression and fiber composition related to clinical weight loss success.
Gerrits, Martin F.; Ghosh, Sujoy; Kavaslar, Nihan; Hill, Benjamin; Tour, Anastasia; Seifert, Erin L.; Beauchamp, Brittany; Gorman, Shelby; Stuart, Joan; Dent, Robert; McPherson, Ruth; Harper, Mary-Ellen
2010-01-01
Inter-individual variability in weight gain and loss under energy surfeit and deficit conditions, respectively, are well recognized but poorly understood phenomena. We documented weight loss variability in an intensively supervised clinical weight loss program and assessed skeletal muscle gene expression and phenotypic characteristics related to variable response to a 900 kcal regimen. Matched pairs of healthy, diet-compliant, obese diet-sensitive (ODS) and diet-resistant (ODR) subjects were defined as those in the highest and lowest quintiles for weight loss rate. Physical activity energy expenditure was minimal and comparable. Following program completion and weight stabilization, skeletal muscle biopsies were obtained. Gene expression analysis of rectus femoris and vastus lateralis indicated upregulation of genes and gene sets involved in oxidative phosphorylation and glucose and fatty acid metabolism in ODS compared with ODR. In vastus lateralis, there was a higher proportion of oxidative (type I) fibers in ODS compared with ODR women and lean controls, fiber hypertrophy in ODS compared with ODR women and lean controls, and lower succinate dehydrogenase in oxidative and oxidative-glycolytic fibers in all obese compared with lean subjects. Intramuscular lipid content was generally higher in obese versus lean, and specifically higher in ODS vs. lean women. Altogether, our findings demonstrate differences in muscle gene expression and fiber composition related to clinical weight loss success. PMID:20332421
Connallon, Tim; Clark, Andrew G
2010-12-01
Sex-biased genes--genes that are differentially expressed within males and females--are nonrandomly distributed across animal genomes, with sex chromosomes and autosomes often carrying markedly different concentrations of male- and female-biased genes. These linkage patterns are often gene- and lineage-dependent, differing between functional genetic categories and between species. Although sex-specific selection is often hypothesized to shape the evolution of sex-linked and autosomal gene content, population genetics theory has yet to account for many of the gene- and lineage-specific idiosyncrasies emerging from the empirical literature. With the goal of improving the connection between evolutionary theory and a rapidly growing body of genome-wide empirical studies, we extend previous population genetics theory of sex-specific selection by developing and analyzing a biologically informed model that incorporates sex linkage, pleiotropy, recombination, and epistasis, factors that are likely to vary between genes and between species. Our results demonstrate that sex-specific selection and sex-specific recombination rates can generate, and are compatible with, the gene- and species-specific linkage patterns reported in the genomics literature. The theory suggests that sexual selection may strongly influence the architectures of animal genomes, as well as the chromosomal distribution of fixed substitutions underlying sexually dimorphic traits. © 2010 The Author(s). Evolution© 2010 The Society for the Study of Evolution.
Kanagawa, N; Yanagawa, T; Nakagawa, T; Okada, N; Nakagawa, S
2013-01-01
Angiogenesis is required for normal physiologic processes, but it is also involved in tumor growth, progression and metastasis. Here, we report the development of an immune-based antiangiogenic strategy based on the generation of T lymphocytes that possess killing specificity for cells expressing vascular endothelial growth factor receptor 2 (VEGFR2). To target VEGFR2-expressing cells, we engineered cytotoxic T lymphocyte (CTL) expressing chimeric T-cell receptors (cTCR-CTL) comprised of a single-chain variable fragment (scFv) against VEGFR2 linked to an intracellular signaling sequence derived from the CD3ζ chain of the TCR and CD28 by retroviral gene transduction methods. The cTCR-CTL exhibited efficient killing specificity against VEGFR2 and a tumor-targeting function in vitro and in vivo. Reflecting such abilities, we confirmed that the cTCR-CTL strongly inhibited the growth of a variety of syngeneic tumors after adoptive transfer into tumor-bearing mice without consequent damage to normal tissue. In addition, CTL expressing both cTCR and tumor-specific TCR induced complete tumor regression due to enhanced tumor infiltration by the CTL and long-term antigen-specific function. These findings provide evidence that the tumor vessel-injuring ability improved the antitumor effect of CTLs in adoptive immunotherapy for a broad range of cancers by inducing immune-mediated destruction of the tumor neovasculature.
Santos, André S; Ramos, Rommel T; Silva, Artur; Hirata, Raphael; Mattos-Guaraldi, Ana L; Meyer, Roberto; Azevedo, Vasco; Felicori, Liza; Pacheco, Luis G C
2018-05-11
Biochemical tests are traditionally used for bacterial identification at the species level in clinical microbiology laboratories. While biochemical profiles are generally efficient for the identification of the most important corynebacterial pathogen Corynebacterium diphtheriae, their ability to differentiate between biovars of this bacterium is still controversial. Besides, the unambiguous identification of emerging human pathogenic species of the genus Corynebacterium may be hampered by highly variable biochemical profiles commonly reported for these species, including Corynebacterium striatum, Corynebacterium amycolatum, Corynebacterium minutissimum, and Corynebacterium xerosis. In order to identify the genomic basis contributing for the biochemical variabilities observed in phenotypic identification methods of these bacteria, we combined a comprehensive literature review with a bioinformatics approach based on reconstruction of six specific biochemical reactions/pathways in 33 recently released whole genome sequences. We used data retrieved from curated databases (MetaCyc, PathoSystems Resource Integration Center (PATRIC), The SEED, TransportDB, UniProtKB) associated with homology searches by BLAST and profile Hidden Markov Models (HMMs) to detect enzymes participating in the various pathways and performed ab initio protein structure modeling and molecular docking to confirm specific results. We found a differential distribution among the various strains of genes that code for some important enzymes, such as beta-phosphoglucomutase and fructokinase, and also for individual components of carbohydrate transport systems, including the fructose-specific phosphoenolpyruvate-dependent sugar phosphotransferase (PTS) and the ribose-specific ATP-binging cassette (ABC) transporter. Horizontal gene transfer plays a role in the biochemical variability of the isolates, as some genes needed for sucrose fermentation were seen to be present in genomic islands. Noteworthy, using profile HMMs, we identified an enzyme with putative alpha-1,6-glycosidase activity only in some specific strains of C. diphtheriae and this may aid to understanding of the differential abilities to utilize glycogen and starch between the biovars.
Zimmerman, Carl-Ulrich R; Rosengarten, Renate; Spergser, Joachim
2013-01-01
Phase variation of two loci (‘mba locus’ and ‘UU172 phase-variable element’) in Ureaplasma parvum serovar 3 has been suggested as result of site-specific DNA inversion occurring at short inverted repeats. Three potential tyrosine recombinases (RipX, XerC, and CodV encoded by the genes UU145, UU222, and UU529) have been annotated in the genome of U. parvum serovar 3, which could be mediators in the proposed recombination event. We document that only orthologs of the gene xerC are present in all strains that show phase variation in the two loci. We demonstrate in vitro binding of recombinant maltose-binding protein fusions of XerC to the inverted repeats of the phase-variable loci, of RipX to a direct repeat that flanks a 20-kbp region, which has been proposed as putative pathogenicity island, and of CodV to a putative dif site. Co-transformation of the model organism Mycoplasma pneumoniae M129 with both the ‘mba locus’ and the recombinase gene xerC behind an active promoter region resulted in DNA inversion in the ‘mba locus’. Results suggest that XerC of U. parvum serovar 3 is a mediator in the proposed DNA inversion event of the two phase-variable loci. PMID:23305333
pelB gene in isolates of Colletotrichum gloeosporioides from several hosts.
Medeiros, L V; Maciel, D B; Medeiros, V V; Houllou Kido, L M; Oliveira, N T
2010-04-13
Colletotrichum gloeosporioides is an important pathogen for a great number of economically important crops. During the necrotrophic phase of infection by Colletotrichum spp, the degradative enzymes of plant cell walls, such as pectate lyase, clearly increase. A gene pelB that expresses a pectate lyase was identified in isolates of C. gloeosporioides in avocado pathogens. Various molecular studies have identified a kind of specialization of C. gloeosporioides isolates with specific hosts; however, there have been no studies of this gene in isolates from hosts other than avocado. The same is true for other species of Colletotrichum. We examined genetic variability in order to design primers that would amplify pelB gene fragments and compared the products of this amplification in C. gloeosporioides isolates from different hosts. Genetic variability was assessed using ISSR primers; the resultant data were grouped based on the UPGMA clustering method. Primers for the pelB gene were designed from selected GenBank sequences using the Primer 3 program at an annealing temperature of 60 degrees C and product amplification of nearly 600 bp. The ISSR primers were efficient in demonstrating the genetic variability of the Colletotrichum isolates and in distinguishing C. gloeosporioides, C. acutatum and C. sublineolum species. The gene pelB was found in C. gloeosporioides, C. acutatum and C. sublineolum. Amplified restriction fragments using MspI did not reveal differences in pelB gene structure in isolates from the three different host species that we investigated.
Li, Xiaohong; Brock, Guy N; Rouchka, Eric C; Cooper, Nigel G F; Wu, Dongfeng; O'Toole, Timothy E; Gill, Ryan S; Eteleeb, Abdallah M; O'Brien, Liz; Rai, Shesh N
2017-01-01
Normalization is an essential step with considerable impact on high-throughput RNA sequencing (RNA-seq) data analysis. Although there are numerous methods for read count normalization, it remains a challenge to choose an optimal method due to multiple factors contributing to read count variability that affects the overall sensitivity and specificity. In order to properly determine the most appropriate normalization methods, it is critical to compare the performance and shortcomings of a representative set of normalization routines based on different dataset characteristics. Therefore, we set out to evaluate the performance of the commonly used methods (DESeq, TMM-edgeR, FPKM-CuffDiff, TC, Med UQ and FQ) and two new methods we propose: Med-pgQ2 and UQ-pgQ2 (per-gene normalization after per-sample median or upper-quartile global scaling). Our per-gene normalization approach allows for comparisons between conditions based on similar count levels. Using the benchmark Microarray Quality Control Project (MAQC) and simulated datasets, we performed differential gene expression analysis to evaluate these methods. When evaluating MAQC2 with two replicates, we observed that Med-pgQ2 and UQ-pgQ2 achieved a slightly higher area under the Receiver Operating Characteristic Curve (AUC), a specificity rate > 85%, the detection power > 92% and an actual false discovery rate (FDR) under 0.06 given the nominal FDR (≤0.05). Although the top commonly used methods (DESeq and TMM-edgeR) yield a higher power (>93%) for MAQC2 data, they trade off with a reduced specificity (<70%) and a slightly higher actual FDR than our proposed methods. In addition, the results from an analysis based on the qualitative characteristics of sample distribution for MAQC2 and human breast cancer datasets show that only our gene-wise normalization methods corrected data skewed towards lower read counts. However, when we evaluated MAQC3 with less variation in five replicates, all methods performed similarly. Thus, our proposed Med-pgQ2 and UQ-pgQ2 methods perform slightly better for differential gene analysis of RNA-seq data skewed towards lowly expressed read counts with high variation by improving specificity while maintaining a good detection power with a control of the nominal FDR level.
Li, Xiaohong; Brock, Guy N.; Rouchka, Eric C.; Cooper, Nigel G. F.; Wu, Dongfeng; O’Toole, Timothy E.; Gill, Ryan S.; Eteleeb, Abdallah M.; O’Brien, Liz
2017-01-01
Normalization is an essential step with considerable impact on high-throughput RNA sequencing (RNA-seq) data analysis. Although there are numerous methods for read count normalization, it remains a challenge to choose an optimal method due to multiple factors contributing to read count variability that affects the overall sensitivity and specificity. In order to properly determine the most appropriate normalization methods, it is critical to compare the performance and shortcomings of a representative set of normalization routines based on different dataset characteristics. Therefore, we set out to evaluate the performance of the commonly used methods (DESeq, TMM-edgeR, FPKM-CuffDiff, TC, Med UQ and FQ) and two new methods we propose: Med-pgQ2 and UQ-pgQ2 (per-gene normalization after per-sample median or upper-quartile global scaling). Our per-gene normalization approach allows for comparisons between conditions based on similar count levels. Using the benchmark Microarray Quality Control Project (MAQC) and simulated datasets, we performed differential gene expression analysis to evaluate these methods. When evaluating MAQC2 with two replicates, we observed that Med-pgQ2 and UQ-pgQ2 achieved a slightly higher area under the Receiver Operating Characteristic Curve (AUC), a specificity rate > 85%, the detection power > 92% and an actual false discovery rate (FDR) under 0.06 given the nominal FDR (≤0.05). Although the top commonly used methods (DESeq and TMM-edgeR) yield a higher power (>93%) for MAQC2 data, they trade off with a reduced specificity (<70%) and a slightly higher actual FDR than our proposed methods. In addition, the results from an analysis based on the qualitative characteristics of sample distribution for MAQC2 and human breast cancer datasets show that only our gene-wise normalization methods corrected data skewed towards lower read counts. However, when we evaluated MAQC3 with less variation in five replicates, all methods performed similarly. Thus, our proposed Med-pgQ2 and UQ-pgQ2 methods perform slightly better for differential gene analysis of RNA-seq data skewed towards lowly expressed read counts with high variation by improving specificity while maintaining a good detection power with a control of the nominal FDR level. PMID:28459823
Cramer, Dina; Serrano, Luis; Schaefer, Martin H
2016-11-10
Copy number alterations (CNAs) in cancer patients show a large variability in their number, length and position, but the sources of this variability are not known. CNA number and length are linked to patient survival, suggesting clinical relevance. We have identified genes that tend to be mutated in samples that have few or many CNAs, which we term CONIM genes (COpy Number Instability Modulators). CONIM proteins cluster into a densely connected subnetwork of physical interactions and many of them are epigenetic modifiers. Therefore, we investigated how the epigenome of the tissue-of-origin influences the position of CNA breakpoints and the properties of the resulting CNAs. We found that the presence of heterochromatin in the tissue-of-origin contributes to the recurrence and length of CNAs in the respective cancer type.
Mutant phenotypes for thousands of bacterial genes of unknown function
Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan; ...
2018-05-16
One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because theymore » are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.« less
Mutant phenotypes for thousands of bacterial genes of unknown function
DOE Office of Scientific and Technical Information (OSTI.GOV)
Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan
One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because theymore » are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.« less
Structure and variation of the mitochondrial genome of fishes.
Satoh, Takashi P; Miya, Masaki; Mabuchi, Kohji; Nishida, Mutsumi
2016-09-07
The mitochondrial (mt) genome has been used as an effective tool for phylogenetic and population genetic analyses in vertebrates. However, the structure and variability of the vertebrate mt genome are not well understood. A potential strategy for improving our understanding is to conduct a comprehensive comparative study of large mt genome data. The aim of this study was to characterize the structure and variability of the fish mt genome through comparative analysis of large datasets. An analysis of the secondary structure of proteins for 250 fish species (248 ray-finned and 2 cartilaginous fishes) illustrated that cytochrome c oxidase subunits (COI, COII, and COIII) and a cytochrome bc1 complex subunit (Cyt b) had substantial amino acid conservation. Among the four proteins, COI was the most conserved, as more than half of all amino acid sites were invariable among the 250 species. Our models identified 43 and 58 stems within 12S rRNA and 16S rRNA, respectively, with larger numbers than proposed previously for vertebrates. The models also identified 149 and 319 invariable sites in 12S rRNA and 16S rRNA, respectively, in all fishes. In particular, the present result verified that a region corresponding to the peptidyl transferase center in prokaryotic 23S rRNA, which is homologous to mt 16S rRNA, is also conserved in fish mt 16S rRNA. Concerning the gene order, we found 35 variations (in 32 families) that deviated from the common gene order in vertebrates. These gene rearrangements were mostly observed in the area spanning the ND5 gene to the control region as well as two tRNA gene cluster regions (IQM and WANCY regions). Although many of such gene rearrangements were unique to a specific taxon, some were shared polyphyletically between distantly related species. Through a large-scale comparative analysis of 250 fish species mt genomes, we elucidated various structural aspects of the fish mt genome and the encoded genes. The present results will be important for understanding functions of the mt genome and developing programs for nucleotide sequence analysis. This study demonstrated the significance of extensive comparisons for understanding the structure of the mt genome.
Kerkhofs, Johan; Geris, Liesbet
2015-01-01
Boolean models have been instrumental in predicting general features of gene networks and more recently also as explorative tools in specific biological applications. In this study we introduce a basic quantitative and a limited time resolution to a discrete (Boolean) framework. Quantitative resolution is improved through the employ of normalized variables in unison with an additive approach. Increased time resolution stems from the introduction of two distinct priority classes. Through the implementation of a previously published chondrocyte network and T helper cell network, we show that this addition of quantitative and time resolution broadens the scope of biological behaviour that can be captured by the models. Specifically, the quantitative resolution readily allows models to discern qualitative differences in dosage response to growth factors. The limited time resolution, in turn, can influence the reachability of attractors, delineating the likely long term system behaviour. Importantly, the information required for implementation of these features, such as the nature of an interaction, is typically obtainable from the literature. Nonetheless, a trade-off is always present between additional computational cost of this approach and the likelihood of extending the model’s scope. Indeed, in some cases the inclusion of these features does not yield additional insight. This framework, incorporating increased and readily available time and semi-quantitative resolution, can help in substantiating the litmus test of dynamics for gene networks, firstly by excluding unlikely dynamics and secondly by refining falsifiable predictions on qualitative behaviour. PMID:26067297
Grünwald, Geoffrey K; Vetter, Alexandra; Klutz, Kathrin; Willhauck, Michael J; Schwenk, Nathalie; Senekowitsch-Schmidtke, Reingard; Schwaiger, Markus; Zach, Christian; Wagner, Ernst; Göke, Burkhard; Holm, Per S; Ogris, Manfred; Spitzweg, Christine
2013-01-01
We recently demonstrated tumor-selective iodide uptake and therapeutic efficacy of combined radiovirotherapy after systemic delivery of the theranostic sodium iodide symporter (NIS) gene using a dendrimer-coated adenovirus. To further improve shielding and targeting we physically coated replication-selective adenoviruses carrying the hNIS gene with a conjugate consisting of cationic poly(amidoamine) (PAMAM) dendrimer linked to the peptidic, epidermal growth factor receptor (EGFR)-specific ligand GE11. In vitro experiments demonstrated coxsackie-adenovirus receptor-independent but EGFR-specific transduction efficiency. Systemic injection of the uncoated adenovirus in a liver cancer xenograft mouse model led to high levels of NIS expression in the liver due to hepatic sequestration, which were significantly reduced after coating as demonstrated by 123I-scintigraphy. Reduction of adenovirus liver pooling resulted in decreased hepatotoxicity and increased transduction efficiency in peripheral xenograft tumors. 124I-PET-imaging confirmed EGFR-specificity by significantly lower tumoral radioiodine accumulation after pretreatment with the EGFR-specific antibody cetuximab. A significantly enhanced oncolytic effect was observed following systemic application of dendrimer-coated adenovirus that was further increased by additional treatment with a therapeutic dose of 131I. These results demonstrate restricted virus tropism and tumor-selective retargeting after systemic application of coated, EGFR-targeted adenoviruses therefore representing a promising strategy for improved systemic adenoviral NIS gene therapy. PMID:24193032
DOE Office of Scientific and Technical Information (OSTI.GOV)
Spreitzer, Robert Joseph
Ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) catalyzes the rate-limiting step of CO 2 fixation in photosynthesis. However, it is a slow enzyme, and O 2 competes with CO 2 at the active site. Oxygenation initiates the photorespiratory pathway, which also results in the loss of CO 2. If carboxylation could be increased or oxygenation decreased, an increase in net CO 2 fixation would be realized. Because Rubisco provides the primary means by which carbon enters all life on earth, there is much interest in engineering Rubisco to increase the production of food and renewable energy. Rubisco is located in the chloroplasts of plants,more » and it is comprised of two subunits. Much is known about the chloroplast-gene-encoded large subunit (rbcL gene), which contains the active site, but much less is known about the role of the nuclear-gene-encoded small subunit in Rubisco function (rbcS gene). Both subunits are coded by multiple genes in plants, which makes genetic engineering difficult. In the eukaryotic, green alga Chlamydomonas reinhardtii, it has been possible to eliminate all the Rubisco genes. These Rubisco-less mutants can be maintained by providing acetate as an alternative carbon source. In this project, focus has been placed on determining whether the small subunit might be a better genetic-engineering target for improving Rubisco. Analysis of a variable-loop structure (βA-βB loop) of the small subunit by genetic selection, directed mutagenesis, and construction of chimeras has shown that the small subunit can influence CO 2/O 2 specificity. X-ray crystal structures of engineered chimeric-loop enzymes have indicated that additional residues and regions of the small subunit may also contribute to Rubisco function. Structural dynamics of the small-subunit carboxyl terminus was also investigated. Alanine-scanning mutagenesis of the most-conserved small-subunit residues has identified a possible structural pathway between the small-subunit βA-βB loop and alpha-helix 8 of the large-subunit α/β-barrel active site. Hybrid enzymes were also created comprised of plant small subunits and Chlamydomonas large subunits, and these enzymes have increases in CO 2/O 2 specificity, further indicating that small subunits may be the key for ultimately engineering an improved Rubisco enzyme.« less
NASA Astrophysics Data System (ADS)
Donde, Oscar Omondi; Tian, Cuicui; Xiao, Bangding
2017-11-01
The presence of feacal-derived pathogens in water is responsible for several infectious diseases and deaths worldwide. As a solution, sources of fecal pollution in waters must be accurately assessed, properly determined and strictly controlled. However, the exercise has remained challenging due to the existing overlapping characteristics by different members of faecal coliform bacteria and the inadequacy of information pertaining to the contribution of seasonality and weather condition on tracking the possible sources of pollution. There are continued efforts to improve the Faecal Contamination Source Tracking (FCST) techniques such as Microbial Source Tracking (MST). This study aimed to make contribution to MST by evaluating the efficacy of combining site specific quantification of faecal contamination indicator bacteria and detection of DNA markers while accounting for seasonality and weather conditions' effects in tracking the major sources of faecal contamination in a freshwater system (Donghu Lake, China). The results showed that the use of cyd gene in addition to lacZ and uidA genes differentiates E. coli from other closely related faecal bacteria. The use of selective media increases the pollution source tracking accuracy. BSA addition boosts PCR detection and increases FCST efficiency. Seasonality and weather variability also influence the detection limit for DNA markers.
Sadee, Wolfgang
2013-09-01
Pharmacogenetic biomarker tests include mostly specific single gene-drug pairs, capable of accounting for a portion of interindividual variability in drug response and toxicity. However, multiple genes are likely to contribute, either acting independently or epistatically, with the CYP2C9-VKORC1-warfarin test panel, an example of a clinically used gene-gene-dug interaction. I discuss here further instances of gene-gene-drug interactions, including a proposed dynamic effect on statin therapy by genetic variants in both a transporter (SLCO1B1) and a metabolizing enzyme (CYP3A4) in liver cells, the main target site where statins block cholesterol synthesis. These examples set a conceptual framework for developing diagnostic panels involving multiple gene-drug combinations. Copyright © 2013 Wiley Periodicals, Inc.
Erskine, Robert M; Williams, Alun G; Jones, David A; Stewart, Claire E; Degens, Hans
2012-04-01
The protein tyrosine kinase-2 (PTK2) gene encodes focal adhesion kinase, a structural protein involved in lateral transmission of muscle fiber force. We investigated whether single-nucleotide polymorphisms (SNPs) of the PTK2 gene were associated with various indexes of human skeletal muscle strength and the interindividual variability in the strength responses to resistance training. We determined unilateral knee extension single repetition maximum (1-RM), maximum isometric voluntary contraction (MVC) knee joint torque, and quadriceps femoris muscle specific force (maximum force per unit physiological cross-sectional area) before and after 9 wk of knee extension resistance training in 51 untrained young men. All participants were genotyped for the PTK2 intronic rs7843014 A/C and 3'-untranslated region (UTR) rs7460 A/T SNPs. There were no genotype associations with baseline measures or posttraining changes in 1-RM or MVC. Although the training-induced increase in specific force was similar for all PTK2 genotypes, baseline specific force was higher in PTK2 rs7843014 AA and rs7460 TT homozygotes than in the respective rs7843014 C- (P = 0.016) and rs7460 A-allele (P = 0.009) carriers. These associations between muscle specific force and PTK2 SNPs suggest that interindividual differences exist in the way force is transmitted from the muscle fibers to the tendon. Therefore, our results demonstrate for the first time the impact of genetic variation on the intrinsic strength of human skeletal muscle.
Zhou, Bangjun; Zeng, Lirong
2017-01-01
Virus-induced gene silencing (VIGS) has been used in many plant species as an attractive post transcriptional gene silencing (PTGS) method for studying gene function either individually or at large-scale in a high-throughput manner. However, the specificity and efficiency for knocking down members of a highly homologous gene family have remained to date a significant challenge in VIGS due to silencing of off-targets. Here we present an improved method for the selection and evaluation of gene fragments used for VIGS to specifically and efficiently knock down members of a highly homologous gene family. Using this method, we knocked down twelve and four members, respectively of group III of the gene family encoding ubiquitin-conjugating enzymes (E2) in Nicotiana benthamiana . Assays using these VIGS-treated plants revealed that the group III E2s are essential for plant development, plant immunity-associated reactive oxygen species (ROS) production, expression of the gene NbRbohB that is required for ROS production, and suppression of immunity-associated programmed cell death (PCD) by AvrPtoB, an effector protein of the bacterial pathogen Pseudomons syringae . Moreover, functional redundancy for plant development and ROS production was found to exist among members of group III E2s. We have found that employment of a gene fragment as short as approximately 70 base pairs (bp) that contains at least three mismatched nucleotides to other genes within any 21-bp sequences prevents silencing of off-target(s) in VIGS. This improved approach in the selection and evaluation of gene fragments allows for specific and efficient knocking down of highly homologous members of a gene family. Using this approach, we implicated N. benthamiana group III E2s in plant development, immunity-associated ROS production, and suppression of multiple immunity-associated PCD by AvrPtoB. We also unraveled functional redundancy among group III members in their requirement for plant development and plant immunity-associated ROS production.
Pereira, S; Lavado, N; Nogueira, L; Lopez, M; Abreu, J; Silva, H
2014-10-01
Orthodontic-induced external apical root resorption (EARR) is a complex phenotype determined by poorly defined mechanical and patient intrinsic factors. The aim of this work was to construct a multifactorial integrative model, including clinical and genetic susceptibility factors, to analyze the risk of developing this common orthodontic complication. This retrospective study included 195 orthodontic patients. Using a multiple-linear regression model, where the dependent variable was the maximum% of root resorption (%EARRmax) for each patient, we assessed the contribution of nine clinical variables and four polymorphisms of genes involved in bone and tooth root remodeling (rs1718119 from P2RX7, rs1143634 from IL1B, rs3102735 from TNFRSF11B, encoding OPG, and rs1805034 from TNFRSF11A, encoding RANK). Clinical and genetic variables explained 30% of%EARRmax variability. The variables with the most significant unique contribution to the model were: gender (P < 0.05), treatment duration (P < 0.001), premolar extractions (P < 0.01), Hyrax appliance (P < 0.001) and GG genotype of rs1718119 from P2RX7 gene (P < 0.01). Age, overjet, tongue thrust, skeletal class II and the other polymorphisms made minor contributions. This study highlights the P2RX7 gene as a possible factor of susceptibility to EARR. A more extensive genetic profile may improve this model. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Kurushima, Jun; Ike, Yasuyoshi; Tomita, Haruyoshi
2016-09-01
Bacteriocin 41 (Bac41) is the plasmid-encoded bacteriocin produced by the opportunistic pathogen Enterococcus faecalis Its genetic determinant consists of bacL1 (effector), bacL2 (regulator), bacA (effector), and bacI (immunity). The secreted effectors BacL1 and BacA coordinate to induce the lytic cell death of E. faecalis Meanwhile, the immunity factor BacI provides self-resistance to the Bac41 producer, E. faecalis, against the action of BacL1 and BacA. In this study, we demonstrated that more than half of the 327 clinical strains of E. faecalis screened had functional Bac41 genes. Analysis of the genetic structure of the Bac41 genes in the DNA sequences of the E. faecalis strains revealed that the Bac41-like genes consist of a relatively conserved region and a variable region located downstream from bacA Based on similarities in the variable region, the Bac41-like genes could be classified into type I, type IIa, and type IIb. Interestingly, the distinct Bac41 types had specific immunity factors for self-resistance, BacI1 or BacI2, and did not show cross-immunity to the other type of effector. We also demonstrated experimentally that the specificity of the immunity was determined by the combination of the C-terminal region of BacA and the presence of the unique BacI1 or BacI2 factor. These observations suggested that Bac41-like bacteriocin genes are extensively disseminated among E. faecalis strains in the clinical environment and can be grouped into at least three types. It was also indicated that the partial diversity results in specificity of self-resistance which may offer these strains a competitive advantage. Bacteriocins are antibacterial effectors produced by bacteria. In general, a bacteriocin-coding gene is accompanied by a cognate immunity gene that confers self-resistance on the bacteriocin-producing bacterium itself. We demonstrated that one of the bacteriocins, Bac41, is disseminated among E. faecalis clinical strains and the Bac41 subtypes with partial diversity. The Bac41-like bacteriocins were found to be classified into type I, type IIa, and type IIb by variation of the cognate immunity factors. The antibacterial activity of the respective effectors was specifically inhibited by the immunity factor from the same type of Bac41 but not the other types. This specificity of effector-immunity pairs suggests that bacteriocin genes might have evolved to change the immunity specificity to acquire an advantage in interbacterial competition. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Palhiere, Isabelle; Brochard, Mickaël; Moazami-Goudarzi, Katayoun; Laloë, Denis; Amigues, Yves; Bed'hom, Bertrand; Neuts, Étienne; Leymarie, Cyril; Pantano, Thais; Cribiu, Edmond Paul; Bibé, Bernard; Verrier, Étienne
2008-01-01
Effective selection on the PrP gene has been implemented since October 2001 in all French sheep breeds. After four years, the ARR "resistant" allele frequency increased by about 35% in young males. The aim of this study was to evaluate the impact of this strong selection on genetic variability. It is focussed on four French sheep breeds and based on the comparison of two groups of 94 animals within each breed: the first group of animals was born before the selection began, and the second, 3–4 years later. Genetic variability was assessed using genealogical and molecular data (29 microsatellite markers). The expected loss of genetic variability on the PrP gene was confirmed. Moreover, among the five markers located in the PrP region, only the three closest ones were affected. The evolution of the number of alleles, heterozygote deficiency within population, expected heterozygosity and the Reynolds distances agreed with the criteria from pedigree and pointed out that neutral genetic variability was not much affected. This trend depended on breed, i.e. on their initial states (population size, PrP frequencies) and on the selection strategies for improving scrapie resistance while carrying out selection for production traits. PMID:18990357
Breeding and Genetics Symposium: networks and pathways to guide genomic selection.
Snelling, W M; Cushman, R A; Keele, J W; Maltecca, C; Thomas, M G; Fortes, M R S; Reverter, A
2013-02-01
Many traits affecting profitability and sustainability of meat, milk, and fiber production are polygenic, with no single gene having an overwhelming influence on observed variation. No knowledge of the specific genes controlling these traits has been needed to make substantial improvement through selection. Significant gains have been made through phenotypic selection enhanced by pedigree relationships and continually improving statistical methodology. Genomic selection, recently enabled by assays for dense SNP located throughout the genome, promises to increase selection accuracy and accelerate genetic improvement by emphasizing the SNP most strongly correlated to phenotype although the genes and sequence variants affecting phenotype remain largely unknown. These genomic predictions theoretically rely on linkage disequilibrium (LD) between genotyped SNP and unknown functional variants, but familial linkage may increase effectiveness when predicting individuals related to those in the training data. Genomic selection with functional SNP genotypes should be less reliant on LD patterns shared by training and target populations, possibly allowing robust prediction across unrelated populations. Although the specific variants causing polygenic variation may never be known with certainty, a number of tools and resources can be used to identify those most likely to affect phenotype. Associations of dense SNP genotypes with phenotype provide a 1-dimensional approach for identifying genes affecting specific traits; in contrast, associations with multiple traits allow defining networks of genes interacting to affect correlated traits. Such networks are especially compelling when corroborated by existing functional annotation and established molecular pathways. The SNP occurring within network genes, obtained from public databases or derived from genome and transcriptome sequences, may be classified according to expected effects on gene products. As illustrated by functionally informed genomic predictions being more accurate than naive whole-genome predictions of beef tenderness, coupling evidence from livestock genotypes, phenotypes, gene expression, and genomic variants with existing knowledge of gene functions and interactions may provide greater insight into the genes and genomic mechanisms affecting polygenic traits and facilitate functional genomic selection for economically important traits.
Heun, Yvonn; Hildebrand, Staffan; Heidsieck, Alexandra; Gleich, Bernhard; Anton, Martina; Pircher, Joachim; Ribeiro, Andrea; Mykhaylyk, Olga; Eberbeck, Dietmar; Wenzel, Daniela; Pfeifer, Alexander; Woernle, Markus; Krötz, Florian; Pohl, Ulrich; Mannell, Hanna
2017-01-01
In the field of vascular gene therapy, targeting systems are promising advancements to improve site-specificity of gene delivery. Here, we studied whether incorporation of magnetic nanoparticles (MNP) with different magnetic properties into ultrasound sensitive microbubbles may represent an efficient way to enable gene targeting in the vascular system after systemic application. Thus, we associated novel silicon oxide-coated magnetic nanoparticle containing microbubbles (SO-Mag MMB) with lentiviral particles carrying therapeutic genes and determined their physico-chemical as well as biological properties compared to MMB coated with polyethylenimine-coated magnetic nanoparticles (PEI-Mag MMB). While there were no differences between both MMB types concerning size and lentivirus binding, SO-Mag MMB exhibited superior characteristics regarding magnetic moment, magnetizability as well as transduction efficiency under static and flow conditions in vitro . Focal disruption of lentiviral SO-Mag MMB by ultrasound within isolated vessels exposed to an external magnetic field decisively improved localized VEGF expression in aortic endothelium ex vivo and enhanced the angiogenic response. Using the same system in vivo , we achieved a highly effective, site-specific lentiviral transgene expression in microvessels of the mouse dorsal skin after arterial injection. Thus, we established a novel lentiviral MMB technique, which has great potential towards site-directed vascular gene therapy.
Nikiforova, Marina N; Mercurio, Stephanie; Wald, Abigail I; Barbi de Moura, Michelle; Callenberg, Keith; Santana-Santos, Lucas; Gooding, William E; Yip, Linwah; Ferris, Robert L; Nikiforov, Yuri E
2018-04-15
Molecular tests have clinical utility for thyroid nodules with indeterminate fine-needle aspiration (FNA) cytology, although their performance requires further improvement. This study evaluated the analytical performance of the newly created ThyroSeq v3 test. ThyroSeq v3 is a DNA- and RNA-based next-generation sequencing assay that analyzes 112 genes for a variety of genetic alterations, including point mutations, insertions/deletions, gene fusions, copy number alterations, and abnormal gene expression, and it uses a genomic classifier (GC) to separate malignant lesions from benign lesions. It was validated in 238 tissue samples and 175 FNA samples with known surgical follow-up. Analytical performance studies were conducted. In the training tissue set of samples, ThyroSeq GC detected more than 100 genetic alterations, including BRAF, RAS, TERT, and DICER1 mutations, NTRK1/3, BRAF, and RET fusions, 22q loss, and gene expression alterations. GC cutoffs were established to distinguish cancer from benign nodules with 93.9% sensitivity, 89.4% specificity, and 92.1% accuracy. This correctly classified most papillary, follicular, and Hurthle cell lesions, medullary thyroid carcinomas, and parathyroid lesions. In the FNA validation set, the GC sensitivity was 98.0%, the specificity was 81.8%, and the accuracy was 90.9%. Analytical accuracy studies demonstrated a minimal required nucleic acid input of 2.5 ng, a 12% minimal acceptable tumor content, and reproducible test results under variable stress conditions. The ThyroSeq v3 GC analyzes 5 different classes of molecular alterations and provides high accuracy for detecting all common types of thyroid cancer and parathyroid lesions. The analytical sensitivity, specificity, and robustness of the test have been successfully validated and indicate its suitability for clinical use. Cancer 2018;124:1682-90. © 2018 American Cancer Society. © 2018 American Cancer Society.
Damberg, M; Garpenstrand, H; Alfredsson, J; Ekblom, J; Forslund, K; Rylander, G; Oreland, L
2000-03-01
Transcription factor AP-2beta is implicated in playing an important role during embryonic development of different parts of the brain, eg, midbrain, hindbrain, spinal cord, dorsal and cranial root ganglia.1,2 The gene encoding AP-2beta contains a polymorphic region which includes a tetranucleotide repeat of [CAAA] four or five times, located in intron 2 between nucleotides 12593 and 12612.3 Since the midbrain contains structures important for variables such as mood and personality, we have investigated if the AP-2beta genotype is associated with personality traits estimated by the Karolinska Scales of Personality (KSP). Identification of transcription factor genes as candidate genes in psychiatric disorders is a novel approach to further elucidate the genetic factors that, together with environmental factors, are involved in the expression of specific psychiatric phenotypes. The AP-2beta genotype and KSP scores were determined for 137 Caucasian volunteers (73 females and 64 males). The personality traits muscular tension, guilt, somatic anxiety, psychastenia and indirect aggression were significantly associated with the specific AP-2beta genotype, albeit with significant difference between genders. Based on this result the human AP-2beta gene seems to be an important candidate gene for personality disorders. Moreover, the present results suggest that the structure of the intron 2 region of the AP-2beta gene is one factor that contributes to development of the constitutional component of specific personality traits.
LoParo, Devon; Johansson, Ada; Walum, Hasse; Westberg, Lars; Santtila, Pekka; Waldman, Irwin
2016-07-01
Naturalistic studies of gene-environment interactions (G X E) have been plagued by several limitations, including difficulty isolating specific environmental risk factors from other correlated aspects of the environment, gene-environment correlation (rGE ), and the use of a single genetic variant to represent the influence of a gene. We present results from 235 Finnish young men in two lab studies of aggression and alcohol challenge that attempt to redress these limitations of the extant G X E literature. Specifically, we use a latent variable modeling approach in an attempt to more fully account for genetic variation across the oxytocin receptor gene (OXTR) and to robustly test its main effects on aggression and its interaction with alcohol exposure. We also modeled aggression as a latent variable comprising various indices, including the average and maximum levels of aggression, the earliest trial on which aggression was expressed, and the proportion of trials on which the minimum and maximum levels of aggression were expressed. The best fitting model for the genetic variation across OXTR included six factors derived from an exploratory factor analysis, roughly corresponding to six haplotype blocks. Aggression levels were higher on trials in which participants were administered alcohol, won, or were provoked. There was a significant main effect of OXTR on aggression across studies after controlling for covariates. The interaction of OXTR and alcohol was also significant across studies, such that OXTR had stronger effects on aggression in the alcohol administration condition. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.
Gassó, Patricia; Rodríguez, Natalia; Blázquez, Ana; Monteagudo, Ana; Boloc, Daniel; Plana, Maria Teresa; Lafuente, Amalia; Lázaro, Luisa; Arnaiz, Joan Albert; Mas, Sergi
2017-04-03
The serotonin 1B receptor (5-HT 1B ) is important to both the pathogenesis of major depressive disorder and the antidepressant effects of selective serotonin reuptake inhibitors. Although fluoxetine has been shown to be effective and safe in children and adolescents, not all patients experience a proper clinical response, which has led to further study into the main factors involved in this inter-individual variability. Our aim was to study the effect of epigenetic and genetic factors that could affect 5-hydroxytryptamine receptor 1B (HTR1B) gene expression, and thereby response to fluoxetine. A total of 83 children and adolescents were clinically assessed 12weeks after of initiating an antidepressant treatment with fluoxetine for the first time. We evaluated the influence of single nucleotide polymorphisms (SNPs) specifically located in transcription factor binding sites (TFBSs) on their clinical improvement. A combined genetic analysis considering the significant SNPs together with the functional variant rs130058 previously associated in our population was also performed. Moreover, we assessed, for the first time in the literature, whether methylation levels of the HTR1B promoter region could be associated with the pharmacological response. Two, rs9361233 and rs9361235, were significantly associated with clinical improvement after treatment with fluoxetine. The heterozygous genotype combination analysis showed a negative correlation with clinical improvement. The lowest improvement was experienced by patients who were heterozygous for all three SNPs. Moreover, a negative correlation was found between clinical improvement and the average methylation level of the HTR1B promoter. These results give new evidence for the role of epigenetic and genetic factors which could modulate HTR1B expression in the pharmacological response to antidepressants. Copyright © 2016 Elsevier Inc. All rights reserved.
Why weight? Modelling sample and observational level variability improves power in RNA-seq analyses
Liu, Ruijie; Holik, Aliaksei Z.; Su, Shian; Jansz, Natasha; Chen, Kelan; Leong, Huei San; Blewitt, Marnie E.; Asselin-Labat, Marie-Liesse; Smyth, Gordon K.; Ritchie, Matthew E.
2015-01-01
Variations in sample quality are frequently encountered in small RNA-sequencing experiments, and pose a major challenge in a differential expression analysis. Removal of high variation samples reduces noise, but at a cost of reducing power, thus limiting our ability to detect biologically meaningful changes. Similarly, retaining these samples in the analysis may not reveal any statistically significant changes due to the higher noise level. A compromise is to use all available data, but to down-weight the observations from more variable samples. We describe a statistical approach that facilitates this by modelling heterogeneity at both the sample and observational levels as part of the differential expression analysis. At the sample level this is achieved by fitting a log-linear variance model that includes common sample-specific or group-specific parameters that are shared between genes. The estimated sample variance factors are then converted to weights and combined with observational level weights obtained from the mean–variance relationship of the log-counts-per-million using ‘voom’. A comprehensive analysis involving both simulations and experimental RNA-sequencing data demonstrates that this strategy leads to a universally more powerful analysis and fewer false discoveries when compared to conventional approaches. This methodology has wide application and is implemented in the open-source ‘limma’ package. PMID:25925576
Snitkin, Evan S; Dudley, Aimée M; Janse, Daniel M; Wong, Kaisheen; Church, George M; Segrè, Daniel
2008-01-01
Background Understanding the response of complex biochemical networks to genetic perturbations and environmental variability is a fundamental challenge in biology. Integration of high-throughput experimental assays and genome-scale computational methods is likely to produce insight otherwise unreachable, but specific examples of such integration have only begun to be explored. Results In this study, we measured growth phenotypes of 465 Saccharomyces cerevisiae gene deletion mutants under 16 metabolically relevant conditions and integrated them with the corresponding flux balance model predictions. We first used discordance between experimental results and model predictions to guide a stage of experimental refinement, which resulted in a significant improvement in the quality of the experimental data. Next, we used discordance still present in the refined experimental data to assess the reliability of yeast metabolism models under different conditions. In addition to estimating predictive capacity based on growth phenotypes, we sought to explain these discordances by examining predicted flux distributions visualized through a new, freely available platform. This analysis led to insight into the glycerol utilization pathway and the potential effects of metabolic shortcuts on model results. Finally, we used model predictions and experimental data to discriminate between alternative raffinose catabolism routes. Conclusions Our study demonstrates how a new level of integration between high throughput measurements and flux balance model predictions can improve understanding of both experimental and computational results. The added value of a joint analysis is a more reliable platform for specific testing of biological hypotheses, such as the catabolic routes of different carbon sources. PMID:18808699
Krinitsyna, A A; Mel'nikova, N V; Belenikin, M S; Poltronieri, P; Santino, A; Kudriavtseva, A V; Savilova, A M; Speranskaia, A S
2013-01-01
Kunitz-type proteinase inhibitor proteins of group A (KPI-A) are involved in the protection of potato plants from pathogens and pests. Although sequences of large number of the KPI-A genes from different species of cultivated potato (Solanum tuberosum subsp. tuberosum) and a few genes from tomato (Solanum lycopersicum) are known to date, information about the allelic diversity of these genes in other species of the genus Solanum is lacking. In our work, the consensus sequences of the KPI-A genes were established in two species of subgenus Potatoe sect. Petota (Solanum tuberosum subsp. andigenum--5 genes and Solanum stoloniferum--2 genes) and in the subgenus Solanum (Solanum nigrum--5 genes) by amplification, cloning, sequencing and subsequent analysis. The determined sequences of KPI-A genes were 97-100% identical to known sequences of the cultivated potato of sect. Petota (cultivated potato Solanum tuberosum subsp. tuberosum) and sect. Etuberosum (S. palustre). The interspecific variability of these genes did not exceed the intraspecific variability for all studied species except Solanum lycopersicum. The distribution of highly variable and conserved sequences in the mature protein-encoding regions was uniform for all investigated KPI-A genes. However, our attempts to amplify the homologous genes using the same primers and the genomes of Solanum dulcamarum, Solanum lycopersicum and Mandragora officinarum resulted in no product formation. Phylogenetic analysis of KPI-A diversity showed that the sequences of the S. lycopersicum form independent cluster, whereas KPI-A of S. nigrum and species of sect. Etuberosum and sect. Petota are closely related and do not form species-specific subclasters. Although Solanum nigrum is resistant to all known races of economically one of the most important diseases of solanaceous plants oomycete Phytophthora infestans aminoacid sequences encoding by KPI-A genes from its genome have nearly or absolutely no differences to the same from genomes of cultivated potatoes involved by P. infestans.
Variable neighborhood search for reverse engineering of gene regulatory networks.
Nicholson, Charles; Goodwin, Leslie; Clark, Corey
2017-01-01
A new search heuristic, Divided Neighborhood Exploration Search, designed to be used with inference algorithms such as Bayesian networks to improve on the reverse engineering of gene regulatory networks is presented. The approach systematically moves through the search space to find topologies representative of gene regulatory networks that are more likely to explain microarray data. In empirical testing it is demonstrated that the novel method is superior to the widely employed greedy search techniques in both the quality of the inferred networks and computational time. Copyright © 2016 Elsevier Inc. All rights reserved.
Holloway, Andrew J; Oshlack, Alicia; Diyagama, Dileepa S; Bowtell, David DL; Smyth, Gordon K
2006-01-01
Background Concerns are often raised about the accuracy of microarray technologies and the degree of cross-platform agreement, but there are yet no methods which can unambiguously evaluate precision and sensitivity for these technologies on a whole-array basis. Results A methodology is described for evaluating the precision and sensitivity of whole-genome gene expression technologies such as microarrays. The method consists of an easy-to-construct titration series of RNA samples and an associated statistical analysis using non-linear regression. The method evaluates the precision and responsiveness of each microarray platform on a whole-array basis, i.e., using all the probes, without the need to match probes across platforms. An experiment is conducted to assess and compare four widely used microarray platforms. All four platforms are shown to have satisfactory precision but the commercial platforms are superior for resolving differential expression for genes at lower expression levels. The effective precision of the two-color platforms is improved by allowing for probe-specific dye-effects in the statistical model. The methodology is used to compare three data extraction algorithms for the Affymetrix platforms, demonstrating poor performance for the commonly used proprietary algorithm relative to the other algorithms. For probes which can be matched across platforms, the cross-platform variability is decomposed into within-platform and between-platform components, showing that platform disagreement is almost entirely systematic rather than due to measurement variability. Conclusion The results demonstrate good precision and sensitivity for all the platforms, but highlight the need for improved probe annotation. They quantify the extent to which cross-platform measures can be expected to be less accurate than within-platform comparisons for predicting disease progression or outcome. PMID:17118209
2011-01-01
Background Streptococcus is an economically important genus as a number of species belonging to this genus are human and animal pathogens. The genus has been divided into different groups based on 16S rRNA gene sequence similarity. The variability observed among the members of these groups is low and it is difficult to distinguish them. The present study was taken up to explore 16S rRNA gene sequence to develop methods that can be used for preliminary identification and can supplement the existing methods for identification of clinically-relevant isolates of the genus Streptococcus. Methods 16S rRNA gene sequences belonging to the isolates of S. dysgalactiae, S. equi, S. pyogenes, S. agalactiae, S. bovis, S. gallolyticus, S. mutans, S. sobrinus, S. mitis, S. pneumoniae, S. thermophilus and S. anginosus were analyzed with the purpose to define genetic variability within each species to generate a phylogenetic framework, to identify species-specific signatures and in-silico restriction enzyme analysis. Results The framework based analysis was used to segregate Streptococcus spp. previously identified upto genus level. This segregation was validated using species-specific signatures and in-silico restriction enzyme analysis. 43 uncharacterized Streptococcus spp. could be identified using this approach. Conclusions The markers generated exploring 16S rRNA gene sequences provided useful tool that can be further used for identification of different species of the genus Streptococcus. PMID:21702978
Correlation between Hox code and vertebral morphology in archosaurs.
Böhmer, Christine; Rauhut, Oliver W M; Wörheide, Gert
2015-07-07
The relationship between developmental genes and phenotypic variation is of central interest in evolutionary biology. An excellent example is the role of Hox genes in the anteroposterior regionalization of the vertebral column in vertebrates. Archosaurs (crocodiles, dinosaurs including birds) are highly variable both in vertebral morphology and number. Nevertheless, functionally equivalent Hox genes are active in the axial skeleton during embryonic development, indicating that the morphological variation across taxa is likely owing to modifications in the pattern of Hox gene expression. By using geometric morphometrics, we demonstrate a correlation between vertebral Hox code and quantifiable vertebral morphology in modern archosaurs, in which the boundaries between morphological subgroups of vertebrae can be linked to anterior Hox gene expression boundaries. Our findings reveal homologous units of cervical vertebrae in modern archosaurs, each with their specific Hox gene pattern, enabling us to trace these homologies in the extinct sauropodomorph dinosaurs, a group with highly variable vertebral counts. Based on the quantifiable vertebral morphology, this allows us to infer the underlying genetic mechanisms in vertebral evolution in fossils, which represents not only an important case study, but will lead to a better understanding of the origin of morphological disparity in recent archosaur vertebral columns.
Correlation between Hox code and vertebral morphology in archosaurs
Böhmer, Christine; Rauhut, Oliver W. M.; Wörheide, Gert
2015-01-01
The relationship between developmental genes and phenotypic variation is of central interest in evolutionary biology. An excellent example is the role of Hox genes in the anteroposterior regionalization of the vertebral column in vertebrates. Archosaurs (crocodiles, dinosaurs including birds) are highly variable both in vertebral morphology and number. Nevertheless, functionally equivalent Hox genes are active in the axial skeleton during embryonic development, indicating that the morphological variation across taxa is likely owing to modifications in the pattern of Hox gene expression. By using geometric morphometrics, we demonstrate a correlation between vertebral Hox code and quantifiable vertebral morphology in modern archosaurs, in which the boundaries between morphological subgroups of vertebrae can be linked to anterior Hox gene expression boundaries. Our findings reveal homologous units of cervical vertebrae in modern archosaurs, each with their specific Hox gene pattern, enabling us to trace these homologies in the extinct sauropodomorph dinosaurs, a group with highly variable vertebral counts. Based on the quantifiable vertebral morphology, this allows us to infer the underlying genetic mechanisms in vertebral evolution in fossils, which represents not only an important case study, but will lead to a better understanding of the origin of morphological disparity in recent archosaur vertebral columns. PMID:26085583
DOE Office of Scientific and Technical Information (OSTI.GOV)
Demaison, C.; Chastagner, P.; Theze, J.
1994-01-18
Monoclonal anti-DNA antibodies bearing a lupus nephritis-associated idiotype were derived from five patients with systemic lupus erythematosus (SLE). Genes encoding their heavy (H)-chain variable (V[sub H]) regions were cloned and sequenced. When compared with their closest V[sub h] germ-line gene relatives, these sequences exhibit a number of silent (S) and replacement (R) substitutions. The ratios of R/S mutations were much higher in the complementarity-determining regions (CDRs) of the antibodies than in the framework regions. Molecular amplification of genomic V[sub H] genes and Southern hybridization with somatic CDR2-specific oligonucleotide probes showed that the configuration of the V[sub H] genes corresponding tomore » V[sub H] sequences in the nephritogenic antibodies is not present in the patient's own germ-line DNA, implying that the B-cell clones underwent somatic mutation in vivo. These findings, together with the characteristics of the diversity and junctional gene elements utilized to form the antibody, indicate that these autoantibodies have been driven through somatic selection processes reminiscent of those that govern antibody responses triggered by exogenous stimuli.« less
A Nested-Splicing by Overlap Extension PCR Improves Specificity of this Standard Method.
Karkhane, Ali Asghar; Yakhchali, Bagher; Rastgar Jazii, Ferdous; Bambai, Bijan; Aminzadeh, Saeed; Rahimi, Fatemeh
2015-06-01
Splicing by overlap extension (SOE) PCR is used to create mutation in the coding sequence of an enzyme in order to study the role of specific residues in protein's structure and function. We introduced a nested-SOE-PCR (N -SOE-PCR) in order to increase the specificity and generating mutations in a gene by SOE-PCR. Genomic DNA from Bacillus thermocatenulatus was extracted. Nested PCR was used to amplify B. thermocatenulatus lipase gene variants, namely wild type and mutant, using gene specific and mutagenic specific primers, followed by cloning in a suitable vector. Briefly in N-SOE-PCR method, instead of two pairs of primers, three pairs of primers are used to amplify a mutagenic fragment. Moreover, the first and second PCR products are slightly longer than PCR products in a conventional SOE. PCR products obtained from the first round of PCR are used for the second PCR by applying the nested and mutated primers. Following to the purification of the amplified fragments, they will be subject of the further purification and will be used as template to perform the third round of PCR using gene specific primers. In the end, the products will be cloned into a suitable vector for subsequent application. In comparison to the conventional SOE-PCR, the improved method (i.e. N-SOE-PCR) increases the yield and specificity of the products. In addition, the proposed method shows a large reduction in the non-specific products. By applying two more primers in the conventional SOE, the specificity of the method will be improved. This would be in part due to annealing of the primers further inside the amplicon that increases both the efficiency and a better attachment of the primers. Positioning of the primer far from both ends of an amplicon leads to an enhanced binding as well as increased affinity in the third round of amplification in SOE.
Gadala-Maria, Daniel; Yaari, Gur; Uduman, Mohamed; Kleinstein, Steven H
2015-02-24
Individual variation in germline and expressed B-cell immunoglobulin (Ig) repertoires has been associated with aging, disease susceptibility, and differential response to infection and vaccination. Repertoire properties can now be studied at large-scale through next-generation sequencing of rearranged Ig genes. Accurate analysis of these repertoire-sequencing (Rep-Seq) data requires identifying the germline variable (V), diversity (D), and joining (J) gene segments used by each Ig sequence. Current V(D)J assignment methods work by aligning sequences to a database of known germline V(D)J segment alleles. However, existing databases are likely to be incomplete and novel polymorphisms are hard to differentiate from the frequent occurrence of somatic hypermutations in Ig sequences. Here we develop a Tool for Ig Genotype Elucidation via Rep-Seq (TIgGER). TIgGER analyzes mutation patterns in Rep-Seq data to identify novel V segment alleles, and also constructs a personalized germline database containing the specific set of alleles carried by a subject. This information is then used to improve the initial V segment assignments from existing tools, like IMGT/HighV-QUEST. The application of TIgGER to Rep-Seq data from seven subjects identified 11 novel V segment alleles, including at least one in every subject examined. These novel alleles constituted 13% of the total number of unique alleles in these subjects, and impacted 3% of V(D)J segment assignments. These results reinforce the highly polymorphic nature of human Ig V genes, and suggest that many novel alleles remain to be discovered. The integration of TIgGER into Rep-Seq processing pipelines will increase the accuracy of V segment assignments, thus improving B-cell repertoire analyses.
NASA Astrophysics Data System (ADS)
Haskel-Ittah, Michal; Yarden, Anat
2017-12-01
Previous studies have shown that students often ignore molecular mechanisms when describing genetic phenomena. Specifically, students tend to directly link genes to their encoded traits, ignoring the role of proteins as mediators in this process. We tested the ability of 10th grade students to connect genes to traits through proteins, using concept maps and reasoning questions. The context of this study was a computational learning environment developed specifically to foster this ability. This environment presents proteins as the mechanism-mediating genetic phenomena. We found that students' ability to connect genes, proteins, and traits, or to reason using this connection, was initially poor. However, significant improvement was obtained when using the learning environment. Our results suggest that visual representations of proteins' functions in the context of a specific trait contributed to this improvement. One significant aspect of these results is the indication that 10th graders are capable of accurately describing genetic phenomena and their underlying mechanisms, a task that has been shown to raise difficulties, even in higher grades of high school.
The Transcriptome of the Reference Potato Genome Solanum tuberosum Group Phureja Clone DM1-3 516R44
Massa, Alicia N.; Childs, Kevin L.; Lin, Haining; Bryan, Glenn J.; Giuliano, Giovanni; Buell, C. Robin
2011-01-01
Advances in molecular breeding in potato have been limited by its complex biological system, which includes vegetative propagation, autotetraploidy, and extreme heterozygosity. The availability of the potato genome and accompanying gene complement with corresponding gene structure, location, and functional annotation are powerful resources for understanding this complex plant and advancing molecular breeding efforts. Here, we report a reference for the potato transcriptome using 32 tissues and growth conditions from the doubled monoploid Solanum tuberosum Group Phureja clone DM1-3 516R44 for which a genome sequence is available. Analysis of greater than 550 million RNA-Seq reads permitted the detection and quantification of expression levels of over 22,000 genes. Hierarchical clustering and principal component analyses captured the biological variability that accounts for gene expression differences among tissues suggesting tissue-specific gene expression, and genes with tissue or condition restricted expression. Using gene co-expression network analysis, we identified 18 gene modules that represent tissue-specific transcriptional networks of major potato organs and developmental stages. This information provides a powerful resource for potato research as well as studies on other members of the Solanaceae family. PMID:22046362
Shakoor, Nadia; Nair, Ramesh; Crasta, Oswald; Morris, Geoffrey; Feltus, Alex; Kresovich, Stephen
2014-01-23
Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specific probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e.g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community.
2014-01-01
Background Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. Results This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specific probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e.g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Conclusions Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community. PMID:24456189
Smith, Stephen A; Moore, Michael J; Brown, Joseph W; Yang, Ya
2015-08-05
The use of transcriptomic and genomic datasets for phylogenetic reconstruction has become increasingly common as researchers attempt to resolve recalcitrant nodes with increasing amounts of data. The large size and complexity of these datasets introduce significant phylogenetic noise and conflict into subsequent analyses. The sources of conflict may include hybridization, incomplete lineage sorting, or horizontal gene transfer, and may vary across the phylogeny. For phylogenetic analysis, this noise and conflict has been accommodated in one of several ways: by binning gene regions into subsets to isolate consistent phylogenetic signal; by using gene-tree methods for reconstruction, where conflict is presumed to be explained by incomplete lineage sorting (ILS); or through concatenation, where noise is presumed to be the dominant source of conflict. The results provided herein emphasize that analysis of individual homologous gene regions can greatly improve our understanding of the underlying conflict within these datasets. Here we examined two published transcriptomic datasets, the angiosperm group Caryophyllales and the aculeate Hymenoptera, for the presence of conflict, concordance, and gene duplications in individual homologs across the phylogeny. We found significant conflict throughout the phylogeny in both datasets and in particular along the backbone. While some nodes in each phylogeny showed patterns of conflict similar to what might be expected with ILS alone, the backbone nodes also exhibited low levels of phylogenetic signal. In addition, certain nodes, especially in the Caryophyllales, had highly elevated levels of strongly supported conflict that cannot be explained by ILS alone. This study demonstrates that phylogenetic signal is highly variable in phylogenomic data sampled across related species and poses challenges when conducting species tree analyses on large genomic and transcriptomic datasets. Further insight into the conflict and processes underlying these complex datasets is necessary to improve and develop adequate models for sequence analysis and downstream applications. To aid this effort, we developed the open source software phyparts ( https://bitbucket.org/blackrim/phyparts ), which calculates unique, conflicting, and concordant bipartitions, maps gene duplications, and outputs summary statistics such as internode certainy (ICA) scores and node-specific counts of gene duplications.
Carbonari, Damiano; Chiarella, Pieranna; Mansi, Antonella; Pigini, Daniela; Iavicoli, Sergio; Tranfo, Giovanna
2016-01-01
Benzene is a ubiquitous occupational and environmental pollutant. Improved industrial hygiene allowed airborne concentrations close to the environmental context (1-1000 µg/m(3)). Conversely, new limits for benzene levels in urban air were set (5 µg/m(3)). The biomonitoring of exposure to such low benzene concentrations are performed measuring specific and sensitive biomarkers such as S-phenylmercapturic acid, trans, trans-muconic acid and urinary benzene: many studies referred high variability in the levels of these biomarkers, suggesting the involvement of polymorphic metabolic genes in the individual susceptibility to benzene toxicity. We reviewed the influence of metabolic polymorphisms on the biomarkers levels of benzene exposure and effect, in order to understand the real impact of benzene exposure on subjects with increased susceptibility.
Modeling Bi-modality Improves Characterization of Cell Cycle on Gene Expression in Single Cells
Danaher, Patrick; Finak, Greg; Krouse, Michael; Wang, Alice; Webster, Philippa; Beechem, Joseph; Gottardo, Raphael
2014-01-01
Advances in high-throughput, single cell gene expression are allowing interrogation of cell heterogeneity. However, there is concern that the cell cycle phase of a cell might bias characterizations of gene expression at the single-cell level. We assess the effect of cell cycle phase on gene expression in single cells by measuring 333 genes in 930 cells across three phases and three cell lines. We determine each cell's phase non-invasively without chemical arrest and use it as a covariate in tests of differential expression. We observe bi-modal gene expression, a previously-described phenomenon, wherein the expression of otherwise abundant genes is either strongly positive, or undetectable within individual cells. This bi-modality is likely both biologically and technically driven. Irrespective of its source, we show that it should be modeled to draw accurate inferences from single cell expression experiments. To this end, we propose a semi-continuous modeling framework based on the generalized linear model, and use it to characterize genes with consistent cell cycle effects across three cell lines. Our new computational framework improves the detection of previously characterized cell-cycle genes compared to approaches that do not account for the bi-modality of single-cell data. We use our semi-continuous modelling framework to estimate single cell gene co-expression networks. These networks suggest that in addition to having phase-dependent shifts in expression (when averaged over many cells), some, but not all, canonical cell cycle genes tend to be co-expressed in groups in single cells. We estimate the amount of single cell expression variability attributable to the cell cycle. We find that the cell cycle explains only 5%–17% of expression variability, suggesting that the cell cycle will not tend to be a large nuisance factor in analysis of the single cell transcriptome. PMID:25032992
Centurion-Lara, Arturo; Giacani, Lorenzo; Godornes, Charmie; Molini, Barbara J.; Brinck Reid, Tara; Lukehart, Sheila A.
2013-01-01
Background The pathogenic non-cultivable treponemes include three subspecies of Treponema pallidum (pallidum, pertenue, endemicum), T. carateum, T. paraluiscuniculi, and the unclassified Fribourg-Blanc treponeme (Simian isolate). These treponemes are morphologically indistinguishable and antigenically and genetically highly similar, yet cross-immunity is variable or non-existent. Although all of these organisms cause chronic, multistage skin and systemic disease, they have historically been classified by mode of transmission, clinical presentations and host ranges. Whole genome studies underscore the high degree of sequence identity among species, subspecies and strains, pinpointing a limited number of genomic regions for variation. Many of these “hot spots” include members of the tpr gene family, composed of 12 paralogs encoding candidate virulence factors. We hypothesize that the distinct clinical presentations, host specificity, and variable cross-immunity might reside on virulence factors such as the tpr genes. Methodology/Principal Findings Sequence analysis of 11 tpr loci (excluding tprK) from 12 strains demonstrated an impressive heterogeneity, including SNPs, indels, chimeric genes, truncated gene products and large deletions. Comparative analyses of sequences and 3D models of predicted proteins in Subfamily I highlight the striking co-localization of discrete variable regions with predicted surface-exposed loops. A hallmark of Subfamily II is the presence of chimeric genes in the tprG and J loci. Diversity in Subfamily III is limited to tprA and tprL. Conclusions/Significance An impressive sequence variability was found in tpr sequences among the Treponema isolates examined in this study, with most of the variation being consistent within subspecies or species, or between syphilis vs. non-syphilis strains. Variability was seen in the pallidum subspecies, which can be divided into 5 genogroups. These findings support a genetic basis for the classification of these organisms into their respective subspecies and species. Future functional studies will determine whether the identified genetic differences relate to cross-immunity, clinical differences, or host ranges. PMID:23696912
Bergsveinson, Jordyn; Ziola, Barry
2017-12-01
Beer-spoilage-related lactic acid bacteria (BSR LAB) belong to multiple genera and species; however, beer-spoilage capacity is isolate-specific and partially acquired via horizontal gene transfer within the brewing environment. Thus, the extent to which genus-, species-, or environment- (i.e., brewery-) level genetic variability influences beer-spoilage phenotype is unknown. Publicly available Lactobacillus brevis genomes were analyzed via BlAst Diagnostic Gene findEr (BADGE) for BSR genes and assessed for pangenomic relationships. Also analyzed were functional coding capacities of plasmids of LAB inhabiting extreme niche environments. Considerable genetic variation was observed in L. brevis isolated from clinical samples, whereas 16 candidate genes distinguish BSR and non-BSR L. brevis genomes. These genes are related to nutrient scavenging of gluconate or pentoses, mannose, and metabolism of pectin. BSR L. brevis isolates also have higher average nucleotide identity and stronger pangenome association with one another, though isolation source (i.e., specific brewery) also appears to influence the plasmid coding capacity of BSR LAB. Finally, it is shown that niche-specific adaptation and phenotype are plasmid-encoded for both BSR and non-BSR LAB. The ultimate combination of plasmid-encoded genes dictates the ability of L. brevis to survive in the most extreme beer environment, namely, gassed (i.e., pressurized) beer.
Leighton, Philip A; Schusser, Benjamin; Yi, Henry; Glanville, Jacob; Harriman, William
2015-01-01
Chicken immune responses to human proteins are often more robust than rodent responses because of the phylogenetic relationship between the different species. For discovery of a diverse panel of unique therapeutic antibody candidates, chickens therefore represent an attractive host for human-derived targets. Recent advances in monoclonal antibody technology, specifically new methods for the molecular cloning of antibody genes directly from primary B cells, has ushered in a new era of generating monoclonal antibodies from non-traditional host animals that were previously inaccessible through hybridoma technology. However, such monoclonals still require post-discovery humanization in order to be developed as therapeutics. To obviate the need for humanization, a modified strain of chickens could be engineered to express a human-sequence immunoglobulin variable region repertoire. Here, human variable genes introduced into the chicken immunoglobulin loci through gene targeting were evaluated for their ability to be recognized and diversified by the native chicken recombination machinery that is present in the B-lineage cell line DT40. After expansion in culture the DT40 population accumulated genetic mutants that were detected via deep sequencing. Bioinformatic analysis revealed that the human targeted constructs are performing as expected in the cell culture system, and provide a measure of confidence that they will be functional in transgenic animals.
Synthetic spike-in standards for high-throughput 16S rRNA gene amplicon sequencing.
Tourlousse, Dieter M; Yoshiike, Satowa; Ohashi, Akiko; Matsukura, Satoko; Noda, Naohiro; Sekiguchi, Yuji
2017-02-28
High-throughput sequencing of 16S rRNA gene amplicons (16S-seq) has become a widely deployed method for profiling complex microbial communities but technical pitfalls related to data reliability and quantification remain to be fully addressed. In this work, we have developed and implemented a set of synthetic 16S rRNA genes to serve as universal spike-in standards for 16S-seq experiments. The spike-ins represent full-length 16S rRNA genes containing artificial variable regions with negligible identity to known nucleotide sequences, permitting unambiguous identification of spike-in sequences in 16S-seq read data from any microbiome sample. Using defined mock communities and environmental microbiota, we characterized the performance of the spike-in standards and demonstrated their utility for evaluating data quality on a per-sample basis. Further, we showed that staggered spike-in mixtures added at the point of DNA extraction enable concurrent estimation of absolute microbial abundances suitable for comparative analysis. Results also underscored that template-specific Illumina sequencing artifacts may lead to biases in the perceived abundance of certain taxa. Taken together, the spike-in standards represent a novel bioanalytical tool that can substantially improve 16S-seq-based microbiome studies by enabling comprehensive quality control along with absolute quantification. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Schmidt, Florian; Gasparoni, Nina; Gasparoni, Gilles; Gianmoena, Kathrin; Cadenas, Cristina; Polansky, Julia K.; Ebert, Peter; Nordström, Karl; Barann, Matthias; Sinha, Anupam; Fröhler, Sebastian; Xiong, Jieyi; Dehghani Amirabad, Azim; Behjati Ardakani, Fatemeh; Hutter, Barbara; Zipprich, Gideon; Felder, Bärbel; Eils, Jürgen; Brors, Benedikt; Chen, Wei; Hengstler, Jan G.; Hamann, Alf; Lengauer, Thomas; Rosenstiel, Philip; Walter, Jörn; Schulz, Marcel H.
2017-01-01
The binding and contribution of transcription factors (TF) to cell specific gene expression is often deduced from open-chromatin measurements to avoid costly TF ChIP-seq assays. Thus, it is important to develop computational methods for accurate TF binding prediction in open-chromatin regions (OCRs). Here, we report a novel segmentation-based method, TEPIC, to predict TF binding by combining sets of OCRs with position weight matrices. TEPIC can be applied to various open-chromatin data, e.g. DNaseI-seq and NOMe-seq. Additionally, Histone-Marks (HMs) can be used to identify candidate TF binding sites. TEPIC computes TF affinities and uses open-chromatin/HM signal intensity as quantitative measures of TF binding strength. Using machine learning, we find low affinity binding sites to improve our ability to explain gene expression variability compared to the standard presence/absence classification of binding sites. Further, we show that both footprints and peaks capture essential TF binding events and lead to a good prediction performance. In our application, gene-based scores computed by TEPIC with one open-chromatin assay nearly reach the quality of several TF ChIP-seq data sets. Finally, these scores correctly predict known transcriptional regulators as illustrated by the application to novel DNaseI-seq and NOMe-seq data for primary human hepatocytes and CD4+ T-cells, respectively. PMID:27899623
Autism genetics: searching for specificity and convergence
2012-01-01
Advances in genetics and genomics have improved our understanding of autism spectrum disorders. As many genes have been implicated, we look to points of convergence among these genes across biological systems to better understand and treat these disorders. PMID:22849751
Rincon, Melvin Y; Sarcar, Shilpita; Danso-Abeam, Dina; Keyaerts, Marleen; Matrai, Janka; Samara-Kuko, Ermira; Acosta-Sanchez, Abel; Athanasopoulos, Takis; Dickson, George; Lahoutte, Tony; De Bleser, Pieter; VandenDriessche, Thierry; Chuah, Marinee K
2015-01-01
Gene therapy is a promising emerging therapeutic modality for the treatment of cardiovascular diseases and hereditary diseases that afflict the heart. Hence, there is a need to develop robust cardiac-specific expression modules that allow for stable expression of the gene of interest in cardiomyocytes. We therefore explored a new approach based on a genome-wide bioinformatics strategy that revealed novel cardiac-specific cis-acting regulatory modules (CS-CRMs). These transcriptional modules contained evolutionary-conserved clusters of putative transcription factor binding sites that correspond to a "molecular signature" associated with robust gene expression in the heart. We then validated these CS-CRMs in vivo using an adeno-associated viral vector serotype 9 that drives a reporter gene from a quintessential cardiac-specific α-myosin heavy chain promoter. Most de novo designed CS-CRMs resulted in a >10-fold increase in cardiac gene expression. The most robust CRMs enhanced cardiac-specific transcription 70- to 100-fold. Expression was sustained and restricted to cardiomyocytes. We then combined the most potent CS-CRM4 with a synthetic heart and muscle-specific promoter (SPc5-12) and obtained a significant 20-fold increase in cardiac gene expression compared to the cytomegalovirus promoter. This study underscores the potential of rational vector design to improve the robustness of cardiac gene therapy.
Tsumura, Y; Uchiyama, K; Moriguchi, Y; Ueno, S; Ihara-Ujino, T
2012-12-01
Local adaptation is important in evolutionary processes and speciation. We used multiple tests to identify several candidate genes that may be involved in local adaptation from 1026 loci in 14 natural populations of Cryptomeria japonica, the most economically important forestry tree in Japan. We also studied the relationships between genotypes and environmental variables to obtain information on the selective pressures acting on individual populations. Outlier loci were mapped onto a linkage map, and the positions of loci associated with specific environmental variables are considered. The outlier loci were not randomly distributed on the linkage map; linkage group 11 was identified as a genomic island of divergence. Three loci in this region were also associated with environmental variables such as mean annual temperature, daily maximum temperature, maximum snow depth, and so on. Outlier loci identified with high significance levels will be essential for conservation purposes and for future work on molecular breeding.
Hu, Wei; Hou, Xiaowan; Huang, Chao; Yan, Yan; Tie, Weiwei; Ding, Zehong; Wei, Yunxie; Liu, Juhua; Miao, Hongxia; Lu, Zhiwei; Li, Meiying; Xu, Biyu; Jin, Zhiqiang
2015-01-01
Aquaporins (AQPs) function to selectively control the flow of water and other small molecules through biological membranes, playing crucial roles in various biological processes. However, little information is available on the AQP gene family in bananas. In this study, we identified 47 banana AQP genes based on the banana genome sequence. Evolutionary analysis of AQPs from banana, Arabidopsis, poplar, and rice indicated that banana AQPs (MaAQPs) were clustered into four subfamilies. Conserved motif analysis showed that all banana AQPs contained the typical AQP-like or major intrinsic protein (MIP) domain. Gene structure analysis suggested the majority of MaAQPs had two to four introns with a highly specific number and length for each subfamily. Expression analysis of MaAQP genes during fruit development and postharvest ripening showed that some MaAQP genes exhibited high expression levels during these stages, indicating the involvement of MaAQP genes in banana fruit development and ripening. Additionally, some MaAQP genes showed strong induction after stress treatment and therefore, may represent potential candidates for improving banana resistance to abiotic stress. Taken together, this study identified some excellent tissue-specific, fruit development- and ripening-dependent, and abiotic stress-responsive candidate MaAQP genes, which could lay a solid foundation for genetic improvement of banana cultivars. PMID:26307965
Carvalho, Benilton S.; Bilevicius, Elizabeth; Alvim, Marina K. M.; Lopes-Cendes, Iscia
2017-01-01
Mesial temporal lobe epilepsy is the most common form of adult epilepsy in surgical series. Currently, the only characteristic used to predict poor response to clinical treatment in this syndrome is the presence of hippocampal sclerosis. Single nucleotide polymorphisms (SNPs) located in genes encoding drug transporter and metabolism proteins could influence response to therapy. Therefore, we aimed to evaluate whether combining information from clinical variables as well as SNPs in candidate genes could improve the accuracy of predicting response to drug therapy in patients with mesial temporal lobe epilepsy. For this, we divided 237 patients into two groups: 75 responsive and 162 refractory to antiepileptic drug therapy. We genotyped 119 SNPs in ABCB1, ABCC2, CYP1A1, CYP1A2, CYP1B1, CYP2C9, CYP2C19, CYP2D6, CYP2E1, CYP3A4, and CYP3A5 genes. We used 98 additional SNPs to evaluate population stratification. We assessed a first scenario using only clinical variables and a second one including SNP information. The random forests algorithm combined with leave-one-out cross-validation was used to identify the best predictive model in each scenario and compared their accuracies using the area under the curve statistic. Additionally, we built a variable importance plot to present the set of most relevant predictors on the best model. The selected best model included the presence of hippocampal sclerosis and 56 SNPs. Furthermore, including SNPs in the model improved accuracy from 0.4568 to 0.8177. Our findings suggest that adding genetic information provided by SNPs, located on drug transport and metabolism genes, can improve the accuracy for predicting which patients with mesial temporal lobe epilepsy are likely to be refractory to drug treatment, making it possible to identify patients who may benefit from epilepsy surgery sooner. PMID:28052106
Guo, Jin-Ying; Hu, Kun-Le; Bi, Chang-Hao; Li, Qing-Yan; Zhang, Xue-Li
2018-05-11
Glycerol, which is an inevitable by-product of biodiesel production, is an ideal carbon source for the production of carotenoids due to its low price, good availability and chemically reduced status, which results in a low requirement for additional reducing equivalents. In this study, an alternative carbon-utilization pathway was constructed in Escherichia coli to enable more efficient β-carotene production from glycerol. An aldehyde reductase gene (alrd) and an aldehyde dehydrogenase gene (aldH) from Ralstonia eutropha H16 were integrated into the E. coli chromosome to form a novel glycerol-utilization pathway. The β-carotene specific production value was increased by 50% after the introduction of alrd and aldH. It was found that the glycerol kinase gene (garK), alrd and aldH were the bottleneck of the alternative glycerol metabolic pathway, and modulation of garK gene with an mRS library further increased the β-carotene specific production value by 13%. Finally, co-modulation of genes in the introduced aldH-alrd operon led to 86% more of β-carotene specific production value than that of the strain without the alternative glycerol-utilization pathway and the glycerol-utilization rate was also increased. In this work, β-carotene production of E. coli was significantly improved by constructing and optimizing an alternative glycerol-utilization pathway. This strategy can potentially be used to improve the production of other isoprenoids using glycerol as a cheap and abundant substrate, and therefore has industrial relevance.
Suzuki, Motoshi; Toyoda, Naoya; Takagi, Shin
2014-01-01
Methods for turning on/off gene expression at the experimenter’s discretion would be useful for various biological studies. Recently, we reported on a novel microscope system utilizing an infrared laser-evoked gene operator (IR-LEGO) designed for inducing heat shock response efficiently in targeted single cells in living organisms without cell damage, thereby driving expression of a transgene under the control of a heat shock promoter. Although the original IR-LEGO can be successfully used for gene induction, several limitations hinder its wider application. Here, using the nematode Caenorhabditis elegans (C. elegans) as a subject, we have made improvements in IR-LEGO. For better spatial control of heating, a pulsed irradiation method using an optical chopper was introduced. As a result, single cells of C. elegans embryos as early as the 2-cell stage and single neurons in ganglia can be induced to express genes selectively. In addition, the introduction of site-specific recombination systems to IR-LEGO enables the induction of gene expression controlled by constitutive and cell type-specific promoters. The strategies adopted here will be useful for future applications of IR-LEGO to other organisms. PMID:24465705
Gene expression variability in human hepatic drug metabolizing enzymes and transporters.
Yang, Lun; Price, Elvin T; Chang, Ching-Wei; Li, Yan; Huang, Ying; Guo, Li-Wu; Guo, Yongli; Kaput, Jim; Shi, Leming; Ning, Baitang
2013-01-01
Interindividual variability in the expression of drug-metabolizing enzymes and transporters (DMETs) in human liver may contribute to interindividual differences in drug efficacy and adverse reactions. Published studies that analyzed variability in the expression of DMET genes were limited by sample sizes and the number of genes profiled. We systematically analyzed the expression of 374 DMETs from a microarray data set consisting of gene expression profiles derived from 427 human liver samples. The standard deviation of interindividual expression for DMET genes was much higher than that for non-DMET genes. The 20 DMET genes with the largest variability in the expression provided examples of the interindividual variation. Gene expression data were also analyzed using network analysis methods, which delineates the similarities of biological functionalities and regulation mechanisms for these highly variable DMET genes. Expression variability of human hepatic DMET genes may affect drug-gene interactions and disease susceptibility, with concomitant clinical implications.
2012-01-01
Background We explore the benefits of applying a new proportional hazard model to analyze survival of breast cancer patients. As a parametric model, the hypertabastic survival model offers a closer fit to experimental data than Cox regression, and furthermore provides explicit survival and hazard functions which can be used as additional tools in the survival analysis. In addition, one of our main concerns is utilization of multiple gene expression variables. Our analysis treats the important issue of interaction of different gene signatures in the survival analysis. Methods The hypertabastic proportional hazards model was applied in survival analysis of breast cancer patients. This model was compared, using statistical measures of goodness of fit, with models based on the semi-parametric Cox proportional hazards model and the parametric log-logistic and Weibull models. The explicit functions for hazard and survival were then used to analyze the dynamic behavior of hazard and survival functions. Results The hypertabastic model provided the best fit among all the models considered. Use of multiple gene expression variables also provided a considerable improvement in the goodness of fit of the model, as compared to use of only one. By utilizing the explicit survival and hazard functions provided by the model, we were able to determine the magnitude of the maximum rate of increase in hazard, and the maximum rate of decrease in survival, as well as the times when these occurred. We explore the influence of each gene expression variable on these extrema. Furthermore, in the cases of continuous gene expression variables, represented by a measure of correlation, we were able to investigate the dynamics with respect to changes in gene expression. Conclusions We observed that use of three different gene signatures in the model provided a greater combined effect and allowed us to assess the relative importance of each in determination of outcome in this data set. These results point to the potential to combine gene signatures to a greater effect in cases where each gene signature represents some distinct aspect of the cancer biology. Furthermore we conclude that the hypertabastic survival models can be an effective survival analysis tool for breast cancer patients. PMID:23241496
Structural polymorphism at LCR and its role in beta-globin gene regulation.
Kukreti, Shrikant; Kaur, Harpreet; Kaushik, Mahima; Bansal, Aparna; Saxena, Sarika; Kaushik, Shikha; Kukreti, Ritushree
2010-09-01
Information on the secondary structures and conformational manifestations of eukaryotic DNA and their biological significance with reference to gene regulation and expression is limited. The human beta-globin gene Locus Control Region (LCR), a dominant regulator of globin gene expression, is a contiguous piece of DNA with five tissue-specific DNase I-hypersensitive sites (HSs). Since these HSs have a high density of transcription factor binding sites, structural interdependencies between HSs and different promoters may directly or indirectly regulate LCR functions. Mutations and SNPs may stabilize or destabilize the local secondary structures, affecting the gene expression by changes in the protein-DNA recognition patterns. Various palindromic or quasi-palindromic segments within LCR, could cause structural polymorphism and geometrical switching of DNA. This emphasizes the importance of understanding of the sequence-dependent variations of the DNA structure. Such structural motifs might act as regulatory elements. The local conformational variability of a DNA segment or action of a DNA specific protein is key to create and maintain active chromatin domains and affect transcription of various tissue specific beta-globin genes. We, summarize here the current status of beta-globin LCR structure and function. Further structural studies at molecular level and functional genomics might solve the regulatory puzzles that control the beta-globin gene locus. Copyright (c) 2010 Elsevier Masson SAS. All rights reserved.
Highly Effective Serodiagnosis for Chagas' Disease ▿
Hernández, Pilar; Heimann, Michael; Riera, Cristina; Solano, Marco; Santalla, José; Luquetti, Alejandro O.; Beck, Ewald
2010-01-01
Many proteins of Trypanosoma cruzi, the causative agent of Chagas' disease, contain characteristic arrays of highly repetitive immunogenic amino acid motifs. Diagnostic tests using these motifs in monomeric or dimeric form have proven to provide markedly improved specificity compared to conventional tests based on crude parasite extracts. However, in many cases the available tests still suffer from limited sensitivity. In this study we produced stable synthetic genes with maximal codon variability for the four diagnostic antigens, B13, CRA, TcD, and TcE, each containing between three and nine identical amino acid repeats. These genes were combined by linker sequences encoding short proline-rich peptides, giving rise to a 24-kDa fusion protein which was used as a novel diagnostic antigen in an enzyme-linked immunosorbent assay setup. Validation of the assay with a large number of well-characterized patient sera from Bolivia and Brazil revealed excellent diagnostic performance. The high sensitivity of the new test may allow future studies to use blood collected by finger prick and dried on filter paper, thus dramatically reducing the costs and effort for the detection of T. cruzi infection. PMID:20668136
Ahmadi, Mahmoud Kamal; Pfeifer, Blaine A
2016-11-01
Biosynthesis of complex natural products like polyketides and nonribosomal peptides using Escherichia coli as a heterologous host provides an opportunity to access these molecules. The value in doing so stems from the fact that many compounds hold some therapeutic or other beneficial property and their original production hosts are intractable for a variety of reasons. In this work, metabolic engineering and induction variable optimization were used to increase production of the polyketide-nonribosomal peptide compound yersiniabactin, a siderophore that has been utilized to selectively remove metals from various solid and aqueous samples. Specifically, several precursor substrate support pathways were altered through gene expression and exogenous supplementation in order to boost production of the final compound. The gene expression induction process was also analyzed to identify the temperatures and inducer concentrations resulting in highest final production levels. When combined, yersiniabactin production was extended to ∼175 mg L -1 . © 2016 American Institute of Chemical Engineers Biotechnol. Prog., 32:1412-1417, 2016. © 2016 American Institute of Chemical Engineers.
Peters, Derek T; Henderson, Christopher A; Warren, Curtis R; Friesen, Max; Xia, Fang; Becker, Caroline E; Musunuru, Kiran; Cowan, Chad A
2016-05-01
Hepatocyte-like cells (HLCs) are derived from human pluripotent stem cells (hPSCs) in vitro, but differentiation protocols commonly give rise to a heterogeneous mixture of cells. This variability confounds the evaluation of in vitro functional assays performed using HLCs. Increased differentiation efficiency and more accurate approximation of the in vivo hepatocyte gene expression profile would improve the utility of hPSCs. Towards this goal, we demonstrate the purification of a subpopulation of functional HLCs using the hepatocyte surface marker asialoglycoprotein receptor 1 (ASGR1). We analyzed the expression profile of ASGR1-positive cells by microarray, and tested their ability to perform mature hepatocyte functions (albumin and urea secretion, cytochrome activity). By these measures, ASGR1-positive HLCs are enriched for the gene expression profile and functional characteristics of primary hepatocytes compared with unsorted HLCs. We have demonstrated that ASGR1-positive sorting isolates a functional subpopulation of HLCs from among the heterogeneous cellular population produced by directed differentiation. © 2016. Published by The Company of Biologists Ltd.
Reis-Cunha, João Luís; Valdivia, Hugo O; Bartholomeu, Daniella Castanheira
2018-02-01
Trypanosomatids are a group of kinetoplastid parasites including some of great public health importance, causing debilitating and life-long lasting diseases that affect more than 24 million people worldwide. Among the trypanosomatids, Trypanosoma cruzi, Trypanosoma brucei and species from the Leishmania genus are the most well studied parasites, due to their high prevalence in human infections. These parasites have an extreme genomic and phenotypic variability, with a massive expansion in the copy number of species-specific multigene families enrolled in host-parasite interactions that mediate cellular invasion and immune evasion processes. As most trypanosomatids are heteroxenous, and therefore their lifecycles involve the transition between different hosts, these parasites have developed several strategies to ensure a rapid adaptation to changing environments. Among these strategies, a rapid shift in the repertoire of expressed genes, genetic variability and genome plasticity are key mechanisms. Trypanosomatid genomes are organized into large directional gene clusters that are transcribed polycistronically, where genes derived from the same polycistron may have very distinct mRNA levels. This particular mode of transcription implies that the control of gene expression operates mainly at post-transcriptional level. In this sense, gene duplications/losses were already associated with changes in mRNA levels in these parasites. Gene duplications also allow the generation of sequence variability, as the newly formed copy can diverge without loss of function of the original copy. Recently, aneuploidies have been shown to occur in several Leishmania species and T. cruzi strains. Although aneuploidies are usually associated with debilitating phenotypes in superior eukaryotes, recent data shows that it could also provide increased fitness in stress conditions and generate drug resistance in unicellular eukaryotes. In this review, we will focus on gene and chromosomal copy number variations and their relevance to the evolution of trypanosomatid parasites.
Genetic engineering of microbial pesticides
Bruce C. Carlton
1985-01-01
Recent advances in genetics and molecular biology make possible the cloning and genetic manipulation of genes for insecticidal activities from natural insect pathogens. Using recombinant DNA methods and site-directed mutagenesis of specific gene regions, production of new and improved biorationals should be possible.
Liccardo, Raffaella; De Rosa, Marina; Duraturo, Francesca
2018-01-01
Lynch syndrome is an autosomal dominant syndrome that can be subdivided into Lynch syndrome I, or site-specific colonic cancer, and Lynch syndrome II, or extracolonic cancers, particularly carcinomas of the stomach, endometrium, biliary and pancreatic systems, and urinary tract. Lynch syndrome is associated with point mutations and large rearrangements in DNA MisMatch Repair ( MMR ) genes. This syndrome shows a variable phenotypic expression in people who carry pathogenetic mutations. So far, a correlation in genotype-phenotype has not been definitely established. In this study, we describe 2 Lynch syndrome cases presenting with the same genotype but different phenotypes and discuss possible reasons for this.
Takeuchi, Takeshi; Koyanagi, Ryo; Gyoja, Fuki; Kanda, Miyuki; Hisata, Kanako; Fujie, Manabu; Goto, Hiroki; Yamasaki, Shinichi; Nagai, Kiyohito; Morino, Yoshiaki; Miyamoto, Hiroshi; Endo, Kazuyoshi; Endo, Hirotoshi; Nagasawa, Hiromichi; Kinoshita, Shigeharu; Asakawa, Shuichi; Watabe, Shugo; Satoh, Noriyuki; Kawashima, Takeshi
2016-01-01
Bivalve molluscs have flourished in marine environments, and many species constitute important aquatic resources. Recently, whole genome sequences from two bivalves, the pearl oyster, Pinctada fucata, and the Pacific oyster, Crassostrea gigas, have been decoded, making it possible to compare genomic sequences among molluscs, and to explore general and lineage-specific genetic features and trends in bivalves. In order to improve the quality of sequence data for these purposes, we have updated the entire P. fucata genome assembly. We present a new genome assembly of the pearl oyster, Pinctada fucata (version 2.0). To update the assembly, we conducted additional sequencing, obtaining accumulated sequence data amounting to 193× the P. fucata genome. Sequence redundancy in contigs that was caused by heterozygosity was removed in silico, which significantly improved subsequent scaffolding. Gene model version 2.0 was generated with the aid of manual gene annotations supplied by the P. fucata research community. Comparison of mollusc and other bilaterian genomes shows that gene arrangements of Hox, ParaHox, and Wnt clusters in the P. fucata genome are similar to those of other molluscs. Like the Pacific oyster, P. fucata possesses many genes involved in environmental responses and in immune defense. Phylogenetic analyses of heat shock protein70 and C1q domain-containing protein families indicate that extensive expansion of genes occurred independently in each lineage. Several gene duplication events prior to the split between the pearl oyster and the Pacific oyster are also evident. In addition, a number of tandem duplications of genes that encode shell matrix proteins are also well characterized in the P. fucata genome. Both the Pinctada and Crassostrea lineages have expanded specific gene families in a lineage-specific manner. Frequent duplication of genes responsible for shell formation in the P. fucata genome explains the diversity of mollusc shell structures. These duplications reveal dynamic genome evolution to forge the complex physiology that enables bivalves to employ a sessile lifestyle in the intertidal zone.
Gutierrez, Alejandro; Kentsis, Alex
2018-03-01
Advances in the classification of acute leukaemias have led to improved outcomes for a substantial fraction of patients. However, chemotherapy resistance remains a major problem for specific subsets of acute leukaemias. Here, we propose that a molecularly distinct subtype of acute leukaemia with shared myeloid and T cell lymphoblastic features, which we term acute myeloid/T-lymphoblastic leukaemia (AMTL), is divided across 3 diagnostic categories owing to variable expression of markers deemed to be defining of myeloid and T-lymphoid lineages, such as myeloperoxidase and CD3. This proposed diagnostic group is supported by (i) retained myeloid differentiation potential during early T cell lymphoid development, (ii) recognition that some cases of acute myeloid leukaemia (AML) harbour hallmarks of T cell development, such as T-cell receptor gene rearrangements and (iii) common gene mutations in subsets of AML and T cell acute lymphoblastic leukaemia (T-ALL), including WT1, PHF6, RUNX1 and BCL11B. This proposed diagnostic entity overlaps with early T cell precursor (ETP) T-ALL and T cell/myeloid mixed phenotype acute leukaemias (MPALs), and also includes a subset of leukaemias currently classified as AML with features of T-lymphoblastic development. The proposed classification of AMTL as a distinct entity would enable more precise prospective diagnosis and permit the development of improved therapies for patients whose treatment is inadequate with current approaches. © 2018 John Wiley & Sons Ltd.
Lee, Nelson; Gatton, Michelle L.; Pelecanos, Anita; Bubb, Martin; Gonzalez, Iveth; Bell, David; Cheng, Qin
2012-01-01
Rapid diagnostic tests (RDTs) represent important tools to diagnose malaria infection. To improve understanding of the variable performance of RDTs that detect the major target in Plasmodium falciparum, namely, histidine-rich protein 2 (HRP2), and to inform the design of better tests, we undertook detailed mapping of the epitopes recognized by eight HRP-specific monoclonal antibodies (MAbs). To investigate the geographic skewing of this polymorphic protein, we analyzed the distribution of these epitopes in parasites from geographically diverse areas. To identify an ideal amino acid motif for a MAb to target in HRP2 and in the related protein HRP3, we used a purpose-designed script to perform bioinformatic analysis of 448 distinct gene sequences from pfhrp2 and from 99 sequences from the closely related gene pfhrp3. The frequency and distribution of these motifs were also compared to the MAb epitopes. Heat stability testing of MAbs immobilized on nitrocellulose membranes was also performed. Results of these experiments enabled the identification of MAbs with the most desirable characteristics for inclusion in RDTs, including copy number and coverage of target epitopes, geographic skewing, heat stability, and match with the most abundant amino acid motifs identified. This study therefore informs the selection of MAbs to include in malaria RDTs as well as in the generation of improved MAbs that should improve the performance of HRP-detecting malaria RDTs. PMID:22259210
Liu, Jia; Shui, Sai-Lan
2016-12-28
The advent of site-specific nucleases, particularly CRISPR/Cas9, provides researchers with the unprecedented ability to manipulate genomic sequences. These nucleases are used to create model cell lines, engineer metabolic pathways, produce transgenic animals and plants, perform genome-wide functional screen and, most importantly, treat human diseases that are difficult to tackle by traditional medications. Considerable efforts have been devoted to improving the efficiency and specificity of nucleases for clinical applications. However, safe and efficient delivery methods remain the major obstacle for therapeutic gene editing. In this review, we summarize the recent progress on nuclease delivery methods, highlight their impact on the outcomes of gene editing and discuss the potential of different delivery approaches for therapeutic gene editing. Copyright © 2016 Elsevier B.V. All rights reserved.
Levran, Orna; Randesi, Matthew; Peles, Einat; Correa da Rosa, Joel; Ott, Jurg; Rotrosen, John; Adelson, Miriam; Kreek, Mary Jeanne
2016-06-01
This study was designed to determine whether polymorphisms in acetylcholine receptors contribute to opioid dependence and/or cocaine dependence. The sample (n = 1860) was divided by drug and ancestry, and 55 polymorphisms (nine genes) were analyzed. Of the 20 SNPs that showed nominally significant associations, the association of the African-specific CHRM4 SNP rs2229163 (Asn417=) with cocaine dependence survived correction for multiple testing (Pcorrected = 0.047). CHRM4 is located in a region of strong linkage disequilibrium on chromosome 11 that includes genes associated with schizophrenia. CHRM4 SNP rs2229163 is in strong linkage disequilibrium with several African-specific SNPs in DGKZ and AMBRA1. Cholinergic receptors' variants may contribute to drug addiction and have a potential role as pharmacogenetic markers.
Eo, JungWoo; Lee, Hee-Eun; Nam, Gyu-Hwi; Kwon, Yun-Jeong; Choi, Yuri; Choi, Bong-Hwan; Huh, Jae-Won; Kim, Minkyu; Lee, Sang-Eun; Seo, Bohyun; Kim, Heui-Soo
2016-04-15
The monoamine oxidase A (MAOA) gene is an important candidate gene for human behavior that encodes an enzyme regulating the metabolism of key neurotransmitters. The regulatory mechanisms of the MAOA gene in dogs are yet to be elucidated. We measured MAOA gene transcription and analyzed the VNTR genotype and methylation status of the gene promoter region in different dog breeds to determine whether MAOA expression is correlated with the MAOA genotype or epigenetic modification in dogs. We found brain-specific expression of the MAOA gene and different transcription levels in different dog breeds including Beagle, Sapsaree, and German shepherd, and also a robust association of the DNA methylation of the gene promoter with mRNA levels. However, the 90 bp tandem repeats that we observed near the transcription start site were not variable, indicating no correlation with canine MAOA activity. These results show that differential DNA methylation in the MAOA promoter region may affect gene expression by modulating promoter activity. Moreover, the distinctive patterns of MAOA expression and DNA methylation may be involved in breed-specific or individual behavioral characteristics, such as aggression, because behavioral phenotypes are related to different physiological and neuroendocrine responses. Copyright © 2016 Elsevier B.V. All rights reserved.
Bussmann, Bianca M.; Horn, Susanne; Sieg, Michael; Jassoy, Christian
2015-01-01
The diversity of virus-specific antibodies and of B cells among different individuals is unknown. Using single-cell cloning of antibody genes, we generated recombinant human monoclonal antibodies from influenza nucleoprotein-specific memory B cells in four adult humans with and without preceding influenza vaccination. We examined the diversity of the antibody repertoires and found that NP-specific B cells used numerous immunoglobulin genes. The heavy chains (HCs) originated from 26 and the kappa light chains (LCs) from 19 different germ line genes. Matching HC and LC chains gave rise to 43 genetically distinct antibodies that bound influenza NP. The median lengths of the CDR3 of the HC, kappa and lambda LC were 14, 9 and 11 amino acids, respectively. We identified changes at 13.6% of the amino acid positions in the V gene of the antibody heavy chain, at 8.4 % in the kappa and at 10.6 % in the lambda V gene. We identified somatic insertions or deletions in 8.1% of the variable genes. We also found several small groups of clonal relatives that were highly diversified. Our findings demonstrate broadly diverse memory B cell repertoires for the influenza nucleoprotein. We found extensive variation within individuals with a high number of point mutations, insertions, and deletions, and extensive clonal diversification. Thus, structurally conserved proteins can elicit broadly diverse and highly mutated B-cell responses. PMID:26086076
DOE Office of Scientific and Technical Information (OSTI.GOV)
He, Jintang; Sun, Xuefei; Shi, Tujin
2014-10-01
Fusions between the transmembrane protease serine 2 (TMPRSS2) and ETS related gene (ERG) represent one of the most specific biomarkers that define a distinct molecular subtype of prostate cancer. The studies on TMPRSS2-ERG gene fusions have seldom been performed at the protein level, primarily due to the lack of high-quality antibodies or an antibody-independent method that is sufficiently sensitive for detecting the truncated ERG protein products resulting from TMPRSS2-ERG gene fusions and alternative splicing. Herein, we applied a recently developed PRISM (high-pressure high-resolution separations with intelligent selection and multiplexing)-SRM (selected reaction monitoring) strategy for quantifying ERG protein in prostate cancermore » cell lines and tumors. The highly sensitive PRISM-SRM assays led to confident detection of 6 unique ERG peptides in either the TMPRSS2-ERG positive cell lines or tissues but not in the negative controls, indicating that ERG protein expression is highly correlated with TMPRSS2-ERG gene rearrangements. Significantly, our results demonstrated for the first time that at least two groups of ERG protein isoforms were simultaneously expressed at variable levels in TMPRSS2-ERG positive samples as evidenced by concomitant detection of two mutually exclusive peptides. Three peptides shared across almost all fusion protein products were determined to be the most abundant peptides, and hence can be used as “signature” peptides for detecting ERG overexpression resulting from TMPRSS2-ERG gene fusion. These PRISM-SRM assays provide valuable tools for studying TMPRSS2-ERG gene fusion protein products, thus improving our understanding of the role of TMPRSS2-ERG gene fusion in the biology of prostate cancer.« less
Molecular Structure-Based Large-Scale Prediction of Chemical-Induced Gene Expression Changes.
Liu, Ruifeng; AbdulHameed, Mohamed Diwan M; Wallqvist, Anders
2017-09-25
The quantitative structure-activity relationship (QSAR) approach has been used to model a wide range of chemical-induced biological responses. However, it had not been utilized to model chemical-induced genomewide gene expression changes until very recently, owing to the complexity of training and evaluating a very large number of models. To address this issue, we examined the performance of a variable nearest neighbor (v-NN) method that uses information on near neighbors conforming to the principle that similar structures have similar activities. Using a data set of gene expression signatures of 13 150 compounds derived from cell-based measurements in the NIH Library of Integrated Network-based Cellular Signatures program, we were able to make predictions for 62% of the compounds in a 10-fold cross validation test, with a correlation coefficient of 0.61 between the predicted and experimentally derived signatures-a reproducibility rivaling that of high-throughput gene expression measurements. To evaluate the utility of the predicted gene expression signatures, we compared the predicted and experimentally derived signatures in their ability to identify drugs known to cause specific liver, kidney, and heart injuries. Overall, the predicted and experimentally derived signatures had similar receiver operating characteristics, whose areas under the curve ranged from 0.71 to 0.77 and 0.70 to 0.73, respectively, across the three organ injury models. However, detailed analyses of enrichment curves indicate that signatures predicted from multiple near neighbors outperformed those derived from experiments, suggesting that averaging information from near neighbors may help improve the signal from gene expression measurements. Our results demonstrate that the v-NN method can serve as a practical approach for modeling large-scale, genomewide, chemical-induced, gene expression changes.
NASA Astrophysics Data System (ADS)
Streets, Aaron M.; Cao, Chen; Zhang, Xiannian; Huang, Yanyi
2016-03-01
Phenotype classification of single cells reveals biological variation that is masked in ensemble measurement. This heterogeneity is found in gene and protein expression as well as in cell morphology. Many techniques are available to probe phenotypic heterogeneity at the single cell level, for example quantitative imaging and single-cell RNA sequencing, but it is difficult to perform multiple assays on the same single cell. In order to directly track correlation between morphology and gene expression at the single cell level, we developed a microfluidic platform for quantitative coherent Raman imaging and immediate RNA sequencing (RNA-Seq) of single cells. With this device we actively sort and trap cells for analysis with stimulated Raman scattering microscopy (SRS). The cells are then processed in parallel pipelines for lysis, and preparation of cDNA for high-throughput transcriptome sequencing. SRS microscopy offers three-dimensional imaging with chemical specificity for quantitative analysis of protein and lipid distribution in single cells. Meanwhile, the microfluidic platform facilitates single-cell manipulation, minimizes contamination, and furthermore, provides improved RNA-Seq detection sensitivity and measurement precision, which is necessary for differentiating biological variability from technical noise. By combining coherent Raman microscopy with RNA sequencing, we can better understand the relationship between cellular morphology and gene expression at the single-cell level.
Negro-Demontel, María Luciana; Saccardo, Paolo; Giacomini, Cecilia; Yáñez-Muñoz, Rafael Joaquín; Ferrer-Miralles, Neus; Vazquez, Esther; Villaverde, Antonio; Peluffo, Hugo
2014-01-01
Traumatic brain injury (TBI) remains as one of the leading causes of mortality and morbidity worldwide and there are no effective treatments currently available. Gene therapy applications have emerged as important alternatives for the treatment of diverse nervous system injuries. New strategies are evolving with the notion that each particular pathological condition may require a specific vector. Moreover, the lack of detailed comparative studies between different vectors under similar conditions hampers the selection of an ideal vector for a given pathological condition. The potential use of lentiviral vectors versus several modular protein-based nanovectors was compared using a controlled cortical impact model of TBI under the same gene therapy conditions. We show that variables such as protein/DNA ratio, incubation volume, and presence of serum or chloroquine in the transfection medium impact on both nanovector formation and transfection efficiency in vitro. While lentiviral vectors showed GFP protein 1 day after TBI and increased expression at 14 days, nanovectors showed stable and lower GFP transgene expression from 1 to 14 days. No toxicity after TBI by any of the vectors was observed as determined by resulting levels of IL-1β or using neurological sticky tape test. In fact, both vector types induced functional improvement per se. PMID:26015985
Paul, Catherine J; Twine, Susan M; Tam, Kevin J; Mullen, James A; Kelly, John F; Austin, John W; Logan, Susan M
2007-05-01
Strains of Clostridium botulinum are traditionally identified by botulinum neurotoxin type; however, identification of an additional target for typing would improve differentiation. Isolation of flagellar filaments and analysis by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) showed that C. botulinum produced multiple flagellin proteins. Nano-liquid chromatography-tandem mass spectrometry (nLC-MS/MS) analysis of in-gel tryptic digests identified peptides in all flagellin bands that matched two homologous tandem flagellin genes identified in the C. botulinum Hall A genome. Designated flaA1 and flaA2, these open reading frames encode the major structural flagellins of C. botulinum. Colony PCR and sequencing of flaA1/A2 variable regions classified 80 environmental and clinical strains into group I or group II and clustered isolates into 12 flagellar types. Flagellar type was distinct from neurotoxin type, and epidemiologically related isolates clustered together. Sequencing a larger PCR product, obtained during amplification of flaA1/A2 from type E strain Bennett identified a second flagellin gene, flaB. LC-MS analysis confirmed that flaB encoded a large type E-specific flagellin protein, and the predicted molecular mass for FlaB matched that observed by SDS-PAGE. In contrast, the molecular mass of FlaA was 2 to 12 kDa larger than the mass predicted by the flaA1/A2 sequence of a given strain, suggesting that FlaA is posttranslationally modified. While identification of FlaB, and the observation by SDS-PAGE of different masses of the FlaA proteins, showed the flagellin proteins of C. botulinum to be diverse, the presence of the flaA1/A2 gene in all strains examined facilitates single locus sequence typing of C. botulinum using the flagellin variable region.
Measuring semantic similarities by combining gene ontology annotations and gene co-function networks
Peng, Jiajie; Uygun, Sahra; Kim, Taehyong; ...
2015-02-14
Background: Gene Ontology (GO) has been used widely to study functional relationships between genes. The current semantic similarity measures rely only on GO annotations and GO structure. This limits the power of GO-based similarity because of the limited proportion of genes that are annotated to GO in most organisms. Results: We introduce a novel approach called NETSIM (network-based similarity measure) that incorporates information from gene co-function networks in addition to using the GO structure and annotations. Using metabolic reaction maps of yeast, Arabidopsis, and human, we demonstrate that NETSIM can improve the accuracy of GO term similarities. We also demonstratemore » that NETSIM works well even for genomes with sparser gene annotation data. We applied NETSIM on large Arabidopsis gene families such as cytochrome P450 monooxygenases to group the members functionally and show that this grouping could facilitate functional characterization of genes in these families. Conclusions: Using NETSIM as an example, we demonstrated that the performance of a semantic similarity measure could be significantly improved after incorporating genome-specific information. NETSIM incorporates both GO annotations and gene co-function network data as a priori knowledge in the model. Therefore, functional similarities of GO terms that are not explicitly encoded in GO but are relevant in a taxon-specific manner become measurable when GO annotations are limited.« less
Nepolean, Thirunavukkarsau; Kaul, Jyoti; Mukri, Ganapati; Mittal, Shikha
2018-01-01
Breeding science has immensely contributed to the global food security. Several varieties and hybrids in different food crops including maize have been released through conventional breeding. The ever growing population, decreasing agricultural land, lowering water table, changing climate, and other variables pose tremendous challenge to the researchers to improve the production and productivity of food crops. Drought is one of the major problems to sustain and improve the productivity of food crops including maize in tropical and subtropical production systems. With advent of novel genomics and breeding tools, the way of doing breeding has been tremendously changed in the last two decades. Drought tolerance is a combination of several component traits with a quantitative mode of inheritance. Rapid DNA and RNA sequencing tools and high-throughput SNP genotyping techniques, trait mapping, functional characterization, genomic selection, rapid generation advancement, and other tools are now available to understand the genetics of drought tolerance and to accelerate the breeding cycle. Informatics play complementary role by managing the big-data generated from the large-scale genomics and breeding experiments. Genome editing is the latest technique to alter specific genes to improve the trait expression. Integration of novel genomics, next-generation breeding, and informatics tools will accelerate the stress breeding process and increase the genetic gain under different production systems. PMID:29696027
Familial aggregation of focal seizure semiology in the Epilepsy Phenome/Genome Project.
Tobochnik, Steven; Fahlstrom, Robyn; Shain, Catherine; Winawer, Melodie R
2017-07-04
To improve phenotype definition in genetic studies of epilepsy, we assessed the familial aggregation of focal seizure types and of specific seizure symptoms within the focal epilepsies in families from the Epilepsy Phenome/Genome Project. We studied 302 individuals with nonacquired focal epilepsy from 149 families. Familial aggregation was assessed by logistic regression analysis of relatives' traits (dependent variable) by probands' traits (independent variable), estimating the odds ratio for each symptom in a relative given presence vs absence of the symptom in the proband. In families containing multiple individuals with nonacquired focal epilepsy, we found significant evidence for familial aggregation of ictal motor, autonomic, psychic, and aphasic symptoms. Within these categories, ictal whole body posturing, diaphoresis, dyspnea, fear/anxiety, and déjà vu/jamais vu showed significant familial aggregation. Focal seizure type aggregated as well, including complex partial, simple partial, and secondarily generalized tonic-clonic seizures. Our results provide insight into genotype-phenotype correlation in the nonacquired focal epilepsies and a framework for identifying subgroups of patients likely to share susceptibility genes. © 2017 American Academy of Neurology.
Risk Classification with an Adaptive Naive Bayes Kernel Machine Model.
Minnier, Jessica; Yuan, Ming; Liu, Jun S; Cai, Tianxi
2015-04-22
Genetic studies of complex traits have uncovered only a small number of risk markers explaining a small fraction of heritability and adding little improvement to disease risk prediction. Standard single marker methods may lack power in selecting informative markers or estimating effects. Most existing methods also typically do not account for non-linearity. Identifying markers with weak signals and estimating their joint effects among many non-informative markers remains challenging. One potential approach is to group markers based on biological knowledge such as gene structure. If markers in a group tend to have similar effects, proper usage of the group structure could improve power and efficiency in estimation. We propose a two-stage method relating markers to disease risk by taking advantage of known gene-set structures. Imposing a naive bayes kernel machine (KM) model, we estimate gene-set specific risk models that relate each gene-set to the outcome in stage I. The KM framework efficiently models potentially non-linear effects of predictors without requiring explicit specification of functional forms. In stage II, we aggregate information across gene-sets via a regularization procedure. Estimation and computational efficiency is further improved with kernel principle component analysis. Asymptotic results for model estimation and gene set selection are derived and numerical studies suggest that the proposed procedure could outperform existing procedures for constructing genetic risk models.
PreCisIon: PREdiction of CIS-regulatory elements improved by gene's positION.
Elati, Mohamed; Nicolle, Rémy; Junier, Ivan; Fernández, David; Fekih, Rim; Font, Julio; Képès, François
2013-02-01
Conventional approaches to predict transcriptional regulatory interactions usually rely on the definition of a shared motif sequence on the target genes of a transcription factor (TF). These efforts have been frustrated by the limited availability and accuracy of TF binding site motifs, usually represented as position-specific scoring matrices, which may match large numbers of sites and produce an unreliable list of target genes. To improve the prediction of binding sites, we propose to additionally use the unrelated knowledge of the genome layout. Indeed, it has been shown that co-regulated genes tend to be either neighbors or periodically spaced along the whole chromosome. This study demonstrates that respective gene positioning carries significant information. This novel type of information is combined with traditional sequence information by a machine learning algorithm called PreCisIon. To optimize this combination, PreCisIon builds a strong gene target classifier by adaptively combining weak classifiers based on either local binding sequence or global gene position. This strategy generically paves the way to the optimized incorporation of any future advances in gene target prediction based on local sequence, genome layout or on novel criteria. With the current state of the art, PreCisIon consistently improves methods based on sequence information only. This is shown by implementing a cross-validation analysis of the 20 major TFs from two phylogenetically remote model organisms. For Bacillus subtilis and Escherichia coli, respectively, PreCisIon achieves on average an area under the receiver operating characteristic curve of 70 and 60%, a sensitivity of 80 and 70% and a specificity of 60 and 56%. The newly predicted gene targets are demonstrated to be functionally consistent with previously known targets, as assessed by analysis of Gene Ontology enrichment or of the relevant literature and databases.
Oxytocin and Opioid Receptor Gene Polymorphisms Associated with Greeting Behavior in Dogs.
Kubinyi, Enikő; Bence, Melinda; Koller, Dora; Wan, Michele; Pergel, Eniko; Ronai, Zsolt; Sasvari-Szekely, Maria; Miklósi, Ádám
2017-01-01
Meeting humans is an everyday experience for most companion dogs, and their behavior in these situations and its genetic background is of major interest. Previous research in our laboratory reported that in German shepherd dogs the lack of G allele, and in Border collies the lack of A allele, of the oxytocin receptor gene (OXTR) 19208A/G single nucleotide polymorphism (SNP) was linked to increased friendliness, which suggests that although broad traits are affected by genetic variability, the specific links between alleles and behavioral variables might be breed-specific. In the current study, we found that Siberian huskies with the A allele approached a friendly unfamiliar woman less frequently in a greeting test, which indicates that certain polymorphisms are related to human directed behavior, but that the relationship patterns between polymorphisms and behavioral phenotypes differ between populations. This finding was further supported by our next investigation. According to primate studies, endogenous opioid peptide (e.g., endorphins) receptor genes have also been implicated in social relationships. Therefore, we examined the rs21912990 of the OPRM1 gene. Firstly, we found that the allele frequencies of Siberian huskies and gray wolves were similar, but differed from that of Border collies and German shepherd dogs, which might reflect their genetic relationship. Secondly, we detected significant associations between the OPRM1 SNP and greeting behavior among German shepherd dogs and a trend in Border collies, but we could not detect an association in Siberian huskies. Although our results with OXTR and OPRM1 gene variants should be regarded as preliminary due to the relatively low sample size, they suggest that (1) OXTR and OPRM1 gene variants in dogs affect human-directed social behavior and (2) their effects differ between breeds.
Oxytocin and Opioid Receptor Gene Polymorphisms Associated with Greeting Behavior in Dogs
Kubinyi, Enikő; Bence, Melinda; Koller, Dora; Wan, Michele; Pergel, Eniko; Ronai, Zsolt; Sasvari-Szekely, Maria; Miklósi, Ádám
2017-01-01
Meeting humans is an everyday experience for most companion dogs, and their behavior in these situations and its genetic background is of major interest. Previous research in our laboratory reported that in German shepherd dogs the lack of G allele, and in Border collies the lack of A allele, of the oxytocin receptor gene (OXTR) 19208A/G single nucleotide polymorphism (SNP) was linked to increased friendliness, which suggests that although broad traits are affected by genetic variability, the specific links between alleles and behavioral variables might be breed-specific. In the current study, we found that Siberian huskies with the A allele approached a friendly unfamiliar woman less frequently in a greeting test, which indicates that certain polymorphisms are related to human directed behavior, but that the relationship patterns between polymorphisms and behavioral phenotypes differ between populations. This finding was further supported by our next investigation. According to primate studies, endogenous opioid peptide (e.g., endorphins) receptor genes have also been implicated in social relationships. Therefore, we examined the rs21912990 of the OPRM1 gene. Firstly, we found that the allele frequencies of Siberian huskies and gray wolves were similar, but differed from that of Border collies and German shepherd dogs, which might reflect their genetic relationship. Secondly, we detected significant associations between the OPRM1 SNP and greeting behavior among German shepherd dogs and a trend in Border collies, but we could not detect an association in Siberian huskies. Although our results with OXTR and OPRM1 gene variants should be regarded as preliminary due to the relatively low sample size, they suggest that (1) OXTR and OPRM1 gene variants in dogs affect human-directed social behavior and (2) their effects differ between breeds. PMID:28936190
Long, Qin; Yue, Fang; Liu, Ruochen; Song, Shuiqing; Li, Xianbi; Ding, Bo; Yan, Xingying; Pei, Yan
2018-05-11
Cotton fibers are the most important natural raw material used in textile industries world-wide. Fiber length, strength, and fineness are the three major traits which determine the quality and economic value of cotton. It is known that exogenous application of phosphatidylinositols (PtdIns), important structural phospholipids, can promote cotton fiber elongation. Here, we sought to increase the in planta production of PtdIns to improve fiber traits. Transgenic cotton plants were generated in which the expression of a cotton phosphatidylinositol synthase gene (i.e., GhPIS) was controlled by the fiber-specific SCFP promoter element, resulting in the specific up-regulation of GhPIS during cotton fiber development. We demonstrate that PtdIns content was significantly enhanced in transgenic cotton fibers and the elevated level of PtdIns stimulated the expression of genes involved in PtdIns phosphorylation as well as promoting lignin/lignin-like phenolic biosynthesis. Fiber length, strength and fineness were also improved in the transgenic plants as compared to the wild-type cotton, with no loss in overall fiber yield. Our data indicate that fiber-specific up-regulation of PtdIns synthesis is a promising strategy for cotton fiber quality improvement.
Lessons learned from the dog genome.
Wayne, Robert K; Ostrander, Elaine A
2007-11-01
Extensive genetic resources and a high-quality genome sequence position the dog as an important model species for understanding genome evolution, population genetics and genes underlying complex phenotypic traits. Newly developed genomic resources have expanded our understanding of canine evolutionary history and dog origins. Domestication involved genetic contributions from multiple populations of gray wolves probably through backcrossing. More recently, the advent of controlled breeding practices has segregated genetic variability into distinct dog breeds that possess specific phenotypic traits. Consequently, genome-wide association and selective sweep scans now allow the discovery of genes underlying breed-specific characteristics. The dog is finally emerging as a novel resource for studying the genetic basis of complex traits, including behavior.
Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation
Faria, José P.; Davis, James J.; Edirisinghe, Janaka N.; Taylor, Ronald C.; Weisenhorn, Pamela; Olson, Robert D.; Stevens, Rick L.; Rocha, Miguel; Rocha, Isabel; Best, Aaron A.; DeJongh, Matthew; Tintle, Nathan L.; Parrello, Bruce; Overbeek, Ross; Henry, Christopher S.
2016-01-01
Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. An important step toward meeting the challenge of understanding gene function and regulation is the identification of sets of genes that are always co-expressed. These gene sets, Atomic Regulons (ARs), represent fundamental units of function within a cell and could be used to associate genes of unknown function with cellular processes and to enable rational genetic engineering of cellular systems. Here, we describe an approach for inferring ARs that leverages large-scale expression data sets, gene context, and functional relationships among genes. We computed ARs for Escherichia coli based on 907 gene expression experiments and compared our results with gene clusters produced by two prevalent data-driven methods: Hierarchical clustering and k-means clustering. We compared ARs and purely data-driven gene clusters to the curated set of regulatory interactions for E. coli found in RegulonDB, showing that ARs are more consistent with gold standard regulons than are data-driven gene clusters. We further examined the consistency of ARs and data-driven gene clusters in the context of gene interactions predicted by Context Likelihood of Relatedness (CLR) analysis, finding that the ARs show better agreement with CLR predicted interactions. We determined the impact of increasing amounts of expression data on AR construction and find that while more data improve ARs, it is not necessary to use the full set of gene expression experiments available for E. coli to produce high quality ARs. In order to explore the conservation of co-regulated gene sets across different organisms, we computed ARs for Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus, each of which represents increasing degrees of phylogenetic distance from E. coli. Comparison of the organism-specific ARs showed that the consistency of AR gene membership correlates with phylogenetic distance, but there is clear variability in the regulatory networks of closely related organisms. As large scale expression data sets become increasingly common for model and non-model organisms, comparative analyses of atomic regulons will provide valuable insights into fundamental regulatory modules used across the bacterial domain. PMID:27933038
Poddar, Sushmita; Loh, Pei She; Ooi, Zi Hao; Osman, Farhana; Eul, Joachim; Patzel, Volker
2018-06-01
Spliceosome-mediated RNA trans-splicing enables correction or labeling of pre-mRNA, but therapeutic applications are hampered by issues related to the activity and target specificity of trans-splicing RNA (tsRNA). We employed computational RNA structure design to improve both on-target activity and specificity of tsRNA in a herpes simplex virus thymidine kinase/ganciclovir suicide gene therapy approach targeting alpha fetoprotein (AFP), a marker of hepatocellular carcinoma (HCC) or human papillomavirus type 16 (HPV-16) pre-mRNA. While unstructured, mismatched target binding domains significantly improved 3' exon replacement (3'ER), 5' exon replacement (5'ER) correlated with the thermodynamic stability of the tsRNA 3' end. Alternative on-target trans-splicing was found to be a prevalent event. The specificity of trans-splicing with the intended target splice site was improved 10-fold by designing tsRNA that harbors secondary target binding domains shielding alternative on-target and blinding off-target splicing events. Such rationally designed suicide RNAs efficiently triggered death of HPV-16-transduced or hepatoblastoma-derived human tissue culture cells without evidence for off-target cell killing. Highest cell death activities were observed with novel dual-targeting tsRNAs programmed for trans-splicing toward AFP and a second HCC pre-mRNA biomarker. Our observations suggest trans-splicing represents a promising approach to suicide gene therapy. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
Arm-specific dynamics of chromosome evolution in malaria mosquitoes
2011-01-01
Background The malaria mosquito species of subgenus Cellia have rich inversion polymorphisms that correlate with environmental variables. Polymorphic inversions tend to cluster on the chromosomal arms 2R and 2L but not on X, 3R and 3L in Anopheles gambiae and homologous arms in other species. However, it is unknown whether polymorphic inversions on homologous chromosomal arms of distantly related species from subgenus Cellia nonrandomly share similar sets of genes. It is also unclear if the evolutionary breakage of inversion-poor chromosomal arms is under constraints. Results To gain a better understanding of the arm-specific differences in the rates of genome rearrangements, we compared gene orders and established syntenic relationships among Anopheles gambiae, Anopheles funestus, and Anopheles stephensi. We provided evidence that polymorphic inversions on the 2R arms in these three species nonrandomly captured similar sets of genes. This nonrandom distribution of genes was not only a result of preservation of ancestral gene order but also an outcome of extensive reshuffling of gene orders that created new combinations of homologous genes within independently originated polymorphic inversions. The statistical analysis of distribution of conserved gene orders demonstrated that the autosomal arms differ in their tolerance to generating evolutionary breakpoints. The fastest evolving 2R autosomal arm was enriched with gene blocks conserved between only a pair of species. In contrast, all identified syntenic blocks were preserved on the slowly evolving 3R arm of An. gambiae and on the homologous arms of An. funestus and An. stephensi. Conclusions Our results suggest that natural selection favors specific gene combinations within polymorphic inversions when distant species are exposed to similar environmental pressures. This knowledge could be useful for the discovery of genes responsible for an association of inversion polymorphisms with phenotypic variations in multiple species. Our data support the chromosomal arm specificity in rates of gene order disruption during mosquito evolution. We conclude that the distribution of breakpoint regions is evolutionary conserved on slowly evolving arms and tends to be lineage-specific on rapidly evolving arms. PMID:21473772
Simons, Johannes WIM
2009-01-01
Background We have previously shown that deviations from the average transcription profile of a group of functionally related genes are not only heritable, but also demonstrate specific patterns associated with age, gender and differentiation, thereby implicating genome-wide nuclear programming as the cause. To determine whether these results could be reproduced, a different micro-array database (obtained from two types of muscle tissue, derived from 81 human donors aged between 16 to 89 years) was studied. Results This new database also revealed the existence of age, gender and tissue-specific features in a small group of functionally related genes. In order to further analyze this phenomenon, a method was developed for quantifying the contribution of different factors to the variability in gene expression, and for generating a database limited to residual values reflecting constitutional differences between individuals. These constitutional differences, presumably epigenetic in origin, contribute to about 50% of the observed residual variance which is connected with a network of interrelated changes in gene expression with some genes displaying a decrease or increase in residual variation with age. Conclusion Epigenetic variation in gene expression without a clear concomitant relation to gene function appears to be a widespread phenomenon. This variation is connected with interactions between genes, is gender and tissue specific and is related to cellular aging. This finding, together with the method developed for analysis, might contribute to the elucidation of the role of nuclear programming in differentiation, aging and carcinogenesis Reviewers This article was reviewed by Thiago M. Venancio (nominated by Aravind Iyer), Hua Li (nominated by Arcady Mushegian) and Arcady Mushegian and J.P.de Magelhaes (nominated by G. Church). PMID:19796384
Álvarez, Isabel; Pérez-Pardal, Lucía; Traoré, Amadou; Fernández, Iván; Goyache, Félix
2016-08-01
A panel of 81 Asian, African and European cattle (Bos taurus and B. indicus) was analysed for the whole sequence of the CXCR4 gene (3844bp), a strong candidate for cattle trypanotolerance. Thirty-one polymorphic sites identified gave 31 different haplotypes. Neutrality tests rejected the hypothesis of either positive or purifying selection. Bayesian phylogenetic tree showed differentiation of haplotypes into two clades gathering genetic variability predating domestication. Related with clades definition, linkage disequilibrium analyses suggested the existence of one only linkage block on the CXCR4 gene. Two tag SNPs identified on exon 2 captured 50% of variability. Whatever the analysis carried out, no clear separation between cattle groups was identified. Most haplotypes identified in West African taurine cattle were also found in European cattle and in Asian and West African zebu. West African taurine samples did not carry unique variants on the CXCR4 gene sequence. The current analysis failed in identifying a causal mutation on the CXCR4 gene underlying a previously reported QTL for cattle trypanotolerance on BTA2. Copyright © 2016 Elsevier B.V. All rights reserved.
Knowles, D P; Cheevers, W P; McGuire, T C; Brassfield, A L; Harwood, W G; Stem, T A
1991-11-01
To define the structure of the caprine arthritis-encephalitis virus (CAEV) env gene and characterize genetic changes which occur during antigenic variation, we sequenced the env genes of CAEV-63 and CAEV-Co, two antigenic variants of CAEV defined by serum neutralization. The deduced primary translation product of the CAEV env gene consists of a 60- to 80-amino-acid signal peptide followed by an amino-terminal surface protein (SU) and a carboxy-terminal transmembrane protein (TM) separated by an Arg-Lys-Lys-Arg cleavage site. The signal peptide cleavage site was verified by amino-terminal amino acid sequencing of native CAEV-63 SU. In addition, immunoprecipitation of [35S]methionine-labeled CAEV-63 proteins by sera from goats immunized with recombinant vaccinia virus expressing the CAEV-63 env gene confirmed that antibodies induced by env-encoded recombinant proteins react specifically with native virion SU and TM. The env genes of CAEV-63 and CAEV-Co encode 28 conserved cysteines and 25 conserved potential N-linked glycosylation sites. Nucleotide sequence variability results in 62 amino acid changes and one deletion within the SU and 34 amino acid changes within the TM.
Knowles, D P; Cheevers, W P; McGuire, T C; Brassfield, A L; Harwood, W G; Stem, T A
1991-01-01
To define the structure of the caprine arthritis-encephalitis virus (CAEV) env gene and characterize genetic changes which occur during antigenic variation, we sequenced the env genes of CAEV-63 and CAEV-Co, two antigenic variants of CAEV defined by serum neutralization. The deduced primary translation product of the CAEV env gene consists of a 60- to 80-amino-acid signal peptide followed by an amino-terminal surface protein (SU) and a carboxy-terminal transmembrane protein (TM) separated by an Arg-Lys-Lys-Arg cleavage site. The signal peptide cleavage site was verified by amino-terminal amino acid sequencing of native CAEV-63 SU. In addition, immunoprecipitation of [35S]methionine-labeled CAEV-63 proteins by sera from goats immunized with recombinant vaccinia virus expressing the CAEV-63 env gene confirmed that antibodies induced by env-encoded recombinant proteins react specifically with native virion SU and TM. The env genes of CAEV-63 and CAEV-Co encode 28 conserved cysteines and 25 conserved potential N-linked glycosylation sites. Nucleotide sequence variability results in 62 amino acid changes and one deletion within the SU and 34 amino acid changes within the TM. Images PMID:1656067
Maslov, D A; Hollar, L; Haghighat, P; Nawathean, P
1998-06-01
Maxicircle molecules of kDNA in several isolates of Phytomonas were detected by hybridization with the 12S rRNA gene probe from Leishmania tarentolae. The estimated size of maxicircles is isolate-specific and varies from 27 to 36 kb. Fully edited and polyadenylated mRNA for kinetoplast-encoded ribosomal protein S12 (RPS12) was found in the steady-state kinetoplast RNA isolated from Phytomonas serpens strain 1G. Two minicircles (1.45 kb) from this strain were also sequenced. Each minicircle contains two 120 bp conserved regions positioned 180 degrees apart, a region enriched with G and T bases and a variable region. One minicircle encodes a gRNA for the first block of editing of RPSl2 mRNA, and the other encodes a gRNA with unknown function. A gRNA gene for the second block of RPSl2 was found on a minicircle sequenced previously. On each minicircle, a gRNA gene is located in the variable region in a similar position and orientation with respect to the conserved regions.
Transcriptomics of cortical gray matter thickness decline during normal aging
Kochunov, P; Charlesworth, J; Winkler, A; Hong, LE; Nichols, T; Curran, JE; Sprooten, E; Jahanshad, N; Thompson, PM; Johnson, MP; Kent, JW; Landman, BA; Mitchell, B; Cole, SA; Dyer, TD; Moses, EK; Goring, HHH; Almasy, L; Duggirala, R; Olvera, RL; Glahn, DC; Blangero, J
2013-01-01
Introduction We performed a whole-transcriptome correlation analysis, followed by the pathway enrichment and testing of innate immune response pathways analyses to evaluate the hypothesis that transcriptional activity can predict cortical gray matter thickness (GMT) variability during normal cerebral aging Methods Transcriptome and GMT data were availabe for 379 individuals (age range=28–85) community-dwelling members of large extended Mexican-American families. Collection of transcriptome data preceded that of neuroimaging data by 17 years. Genome-wide gene transcriptome data consisted of 20,413 heritable lymphocytes-based transcripts. GMT measurements were performed from high-resolution (isotropic 800µm) T1-weighted MRI. Transcriptome-wide and pathway enrichment analysis was used to classify genes correlated with GMT. Transcripts for sixty genes from seven innate immune pathways were tested as specific predictors of GMT variability. Results Transcripts for eight genes (IGFBP3, LRRN3, CRIP2, SCD, IDS, TCF4, GATA3, HN1) passed the transcriptome-wide significance threshold. Four orthogonal factors extracted from this set predicted 31.9% of the variability in the whole-brain and between 23.4 and 35% of regional GMT measurements. Pathway enrichment analysis identified six functional categories including cellular proliferation, aggregation, differentiation, viral infection, and metabolism. The integrin signaling pathway was significantly (p<10−6) enriched with GMT. Finally, three innate immune pathways (complement signaling, toll-receptors and scavenger and immunoglobulins) were significantly associated with GMT. Conclusion Expression activity for the genes that regulate cellular proliferation, adhesion, differentiation and inflammation can explain a significant proportion of individual variability in cortical GMT. Our findings suggest that normal cerebral aging is the product of a progressive decline in regenerative capacity and increased neuroinflammation. PMID:23707588
Transcriptomics of cortical gray matter thickness decline during normal aging.
Kochunov, P; Charlesworth, J; Winkler, A; Hong, L E; Nichols, T E; Curran, J E; Sprooten, E; Jahanshad, N; Thompson, P M; Johnson, M P; Kent, J W; Landman, B A; Mitchell, B; Cole, S A; Dyer, T D; Moses, E K; Goring, H H H; Almasy, L; Duggirala, R; Olvera, R L; Glahn, D C; Blangero, J
2013-11-15
We performed a whole-transcriptome correlation analysis, followed by the pathway enrichment and testing of innate immune response pathway analyses to evaluate the hypothesis that transcriptional activity can predict cortical gray matter thickness (GMT) variability during normal cerebral aging. Transcriptome and GMT data were available for 379 individuals (age range=28-85) community-dwelling members of large extended Mexican American families. Collection of transcriptome data preceded that of neuroimaging data by 17 years. Genome-wide gene transcriptome data consisted of 20,413 heritable lymphocytes-based transcripts. GMT measurements were performed from high-resolution (isotropic 800 μm) T1-weighted MRI. Transcriptome-wide and pathway enrichment analysis was used to classify genes correlated with GMT. Transcripts for sixty genes from seven innate immune pathways were tested as specific predictors of GMT variability. Transcripts for eight genes (IGFBP3, LRRN3, CRIP2, SCD, IDS, TCF4, GATA3, and HN1) passed the transcriptome-wide significance threshold. Four orthogonal factors extracted from this set predicted 31.9% of the variability in the whole-brain and between 23.4 and 35% of regional GMT measurements. Pathway enrichment analysis identified six functional categories including cellular proliferation, aggregation, differentiation, viral infection, and metabolism. The integrin signaling pathway was significantly (p<10(-6)) enriched with GMT. Finally, three innate immune pathways (complement signaling, toll-receptors and scavenger and immunoglobulins) were significantly associated with GMT. Expression activity for the genes that regulate cellular proliferation, adhesion, differentiation and inflammation can explain a significant proportion of individual variability in cortical GMT. Our findings suggest that normal cerebral aging is the product of a progressive decline in regenerative capacity and increased neuroinflammation. Copyright © 2013 Elsevier Inc. All rights reserved.
Geric Stare, Barbara; Fouville, Didier; Širca, Saša; Gallot, Aurore; Urek, Gregor; Grenier, Eric
2011-02-01
While pectate lyases are major parasitism factors in plant-parasitic nematodes, there is little information on the variability of these genes within species and their utility as pathotype or host range molecular markers. We have analysed polymorphisms of pectate lyase 2 (pel-2) gene, which degrades the unesterified polygalacturonate (pectate) of the host cell-wall, in the genus Globodera. Molecular variability of the pel-2 gene and the predicted protein was evaluated in populations of G. rostochiensis, G. pallida, G. "mexicana" and G. tabacum. Seventy eight pel-2 sequences were obtained and aligned. Point mutations were observed at 373 positions, 57% of these affect the coding part of the gene and produce 129 aa replacements. The observed polymorphism does not correlate either to the pathotypes proposed in potato cyst nematodes (PCN) or the subspecies described in tobacco cyst nematodes. The trees reveal a topology different from the admitted species topology as G. rostochiensis and G. pallida sequences are more similar to each other than to G. tabacum. Species-specific sites, potentially applicable for identification, and sites distinguishing PCN from tobacco cyst nematodes, were identified. As both G. rostochiensis and G. pallida display the same host range, but distinct from G. tabacum, which cannot parasitize potato plants, it is tempting to speculate that pel-2 genes polymorphism may be implicated in this adaptation, a view supported by the fact that no active pectate lyase 2 was found in G. "mexicana", a close relative of G. pallida that is unable to develop on cultivated potato varieties.
Toward an Integration of Cognitive and Genetic Models of Risk for Depression
Gibb, Brandon E.; Beevers, Christopher G.; McGeary, John E.
2012-01-01
There is growing interest in integrating cognitive and genetic models of depression risk. We review two ways in which these models can be meaningfully integrated. First, information-processing biases may represent intermediate phenotypes for specific genetic influences. These genetic influences may represent main effects on specific cognitive processes or may moderate the impact of environmental influences on information-processing biases. Second, cognitive and genetic influences may combine to increase reactivity to environmental stressors, increasing risk for depression in a gene × cognition × environment model of risk. There is now growing support for both of these ways of integrating cognitive and genetic models of depression risk. Specifically, there is support for genetic influences on information-processing biases, particularly the link between 5-HTTLPR and attentional biases, from both genetic association and gene × environment (G × E) studies. There is also initial support for gene × cognition × environment models of risk in which specific genetic influences contribute to increased reactivity to environmental influences. We review this research and discuss important areas of future research, particularly the need for larger samples that allow for a broader examination of genetic and epigenetic influences as well as the combined influence of variability across a number of genes. PMID:22920216
[New perspectives on molecular and genic therapies in Down syndrome].
Delabar, Jean Maurice
2010-04-01
Trisomy 21 was first described as a syndrome in the middle of the nineteenth century and associated to a chromosomic anomaly one hundred years later: the most salient feature of this syndrome is a mental retardation of variable intensity. Molecular mapping and DNA sequencing have allowed identifying the gene content of chromosome 21. Molecular quantitative analyses indicated that trisomy is inducing an overexpression for a large part of the triplicated genes and deregulates also pathways involving non HSA21 genes. Together with the physiological description of murine models overexpressing orthologous genes, these data have allowed to elaborate hypotheses on the cause of cognitive impairment. From these hypotheses and using murine models it is now possible to assess the efficiency of various therapeutic strategies. This paper reviews these new perspectives starting from the strategies targeting the level of HSA21 RNAs or HSA21 proteins; then it describes methods targeting activities either of proteins involved in cell cycle pathways or of proteins controlling the synaptic plasticity. It is promising that strategies targeting specific genes or specific pathways are already giving positive results.
Expression of Selenoprotein Genes Is Affected by Obesity of Pigs Fed a High-Fat Diet123
Zhao, Hua; Li, Ke; Tang, Jia-Yong; Zhou, Ji-Chang; Wang, Kang-Ning; Xia, Xin-Jie; Lei, Xin Gen
2015-01-01
Background: Relations of the 25 mammalian selenoprotein genes with obesity and the associated inflammation remain unclear. Objective: This study explored impacts of high-fat diet-induced obesity on inflammation and expressions of selenoprotein and obesity-related genes in 10 tissues of pigs. Methods: Plasma and 10 tissues were collected from pigs (n = 10) fed a corn-soy–based control diet or that diet containing 3–7% lard from weanling to finishing (180 d). Plasma concentrations (n = 8) of cytokines and thyroid hormones and tissue mRNA abundance (n = 4) of 25 selenoprotein genes and 16 obesity-related genes were compared between the pigs fed the control and high-fat diets. Stepwise regression was applied to analyze correlations among all these measures, including the previously reported body physical and plasma biochemical variables. Results: The high-fat diet elevated (P < 0.05) plasma concentrations of tumor necrosis factor α, interleukin-6, leptin, and leptin receptor by 29–42% and affected (P < 0.05–0.1) tissue mRNA levels of the selenoprotein and obesity-related genes in 3 patterns. Specifically, the high-fat diet up-regulated 12 selenoprotein genes in 6 tissues, down-regulated 13 selenoprotein genes in 7 tissues, and exerted no effect on 5 genes in any tissue. Body weights and plasma triglyceride concentrations of pigs showed the strongest regressions to tissue mRNA abundances of selenoprotein and obesity-related genes. Among the selenoprotein genes, selenoprotein V and I were ranked as the strongest independent variables for the regression of phenotypic and plasma measures. Meanwhile, agouti signaling protein, adiponectin, and resistin genes represented the strongest independent variables of the obesity-related genes for the regression of tissue selenoprotein mRNA. Conclusions: The high-fat diet induced inflammation in pigs and affected their gene expression of selenoproteins associated with thioredoxin and oxidoreductase systems, local tissue thyroid hormone activity, endoplasmic reticulum protein degradation, and phosphorylation of lipids. This porcine model may be used to study interactive mechanisms between excess fat intake and selenoprotein function. PMID:25972525
Expression of Selenoprotein Genes Is Affected by Obesity of Pigs Fed a High-Fat Diet.
Zhao, Hua; Li, Ke; Tang, Jia-Yong; Zhou, Ji-Chang; Wang, Kang-Ning; Xia, Xin-Jie; Lei, Xin Gen
2015-07-01
Relations of the 25 mammalian selenoprotein genes with obesity and the associated inflammation remain unclear. This study explored impacts of high-fat diet-induced obesity on inflammation and expressions of selenoprotein and obesity-related genes in 10 tissues of pigs. Plasma and 10 tissues were collected from pigs (n = 10) fed a corn-soy-based control diet or that diet containing 3-7% lard from weanling to finishing (180 d). Plasma concentrations (n = 8) of cytokines and thyroid hormones and tissue mRNA abundance (n = 4) of 25 selenoprotein genes and 16 obesity-related genes were compared between the pigs fed the control and high-fat diets. Stepwise regression was applied to analyze correlations among all these measures, including the previously reported body physical and plasma biochemical variables. The high-fat diet elevated (P < 0.05) plasma concentrations of tumor necrosis factor α, interleukin-6, leptin, and leptin receptor by 29-42% and affected (P < 0.05-0.1) tissue mRNA levels of the selenoprotein and obesity-related genes in 3 patterns. Specifically, the high-fat diet up-regulated 12 selenoprotein genes in 6 tissues, down-regulated 13 selenoprotein genes in 7 tissues, and exerted no effect on 5 genes in any tissue. Body weights and plasma triglyceride concentrations of pigs showed the strongest regressions to tissue mRNA abundances of selenoprotein and obesity-related genes. Among the selenoprotein genes, selenoprotein V and I were ranked as the strongest independent variables for the regression of phenotypic and plasma measures. Meanwhile, agouti signaling protein, adiponectin, and resistin genes represented the strongest independent variables of the obesity-related genes for the regression of tissue selenoprotein mRNA. The high-fat diet induced inflammation in pigs and affected their gene expression of selenoproteins associated with thioredoxin and oxidoreductase systems, local tissue thyroid hormone activity, endoplasmic reticulum protein degradation, and phosphorylation of lipids. This porcine model may be used to study interactive mechanisms between excess fat intake and selenoprotein function. © 2015 American Society for Nutrition.
Vandenbon, Alexis; Dinh, Viet H.; Mikami, Norihisa; Kitagawa, Yohko; Teraguchi, Shunsuke; Ohkura, Naganari; Sakaguchi, Shimon
2016-01-01
High-throughput gene expression data are one of the primary resources for exploring complex intracellular dynamics in modern biology. The integration of large amounts of public data may allow us to examine general dynamical relationships between regulators and target genes. However, obstacles for such analyses are study-specific biases or batch effects in the original data. Here we present Immuno-Navigator, a batch-corrected gene expression and coexpression database for 24 cell types of the mouse immune system. We systematically removed batch effects from the underlying gene expression data and showed that this removal considerably improved the consistency between inferred correlations and prior knowledge. The data revealed widespread cell type-specific correlation of expression. Integrated analysis tools allow users to use this correlation of expression for the generation of hypotheses about biological networks and candidate regulators in specific cell types. We show several applications of Immuno-Navigator as examples. In one application we successfully predicted known regulators of importance in naturally occurring Treg cells from their expression correlation with a set of Treg-specific genes. For one high-scoring gene, integrin β8 (Itgb8), we confirmed an association between Itgb8 expression in forkhead box P3 (Foxp3)-positive T cells and Treg-specific epigenetic remodeling. Our results also suggest that the regulation of Treg-specific genes within Treg cells is relatively independent of Foxp3 expression, supporting recent results pointing to a Foxp3-independent component in the development of Treg cells. PMID:27078110
González, Víctor M; Aventín, Núria; Centeno, Emilio; Puigdomènech, Pere
2014-12-17
Plant NBS-LRR -resistance genes tend to be found in clusters, which have been shown to be hot spots of genome variability. In melon, half of the 81 predicted NBS-LRR genes group in nine clusters, and a 1 Mb region on linkage group V contains the highest density of R-genes and presence/absence gene polymorphisms found in the melon genome. This region is known to contain the locus of Vat, an agronomically important gene that confers resistance to aphids. However, the presence of duplications makes the sequencing and annotation of R-gene clusters difficult, usually resulting in multi-gapped sequences with higher than average errors. A 1-Mb sequence that contains the largest NBS-LRR gene cluster found in melon was improved using a strategy that combines Illumina paired-end mapping and PCR-based gap closing. Unknown sequence was decreased by 70% while about 3,000 SNPs and small indels were corrected. As a result, the annotations of 18 of a total of 23 NBS-LRR genes found in this region were modified, including additional coding sequences, amino acid changes, correction of splicing boundaries, or fussion of ORFs in common transcription units. A phylogeny analysis of the R-genes and their comparison with syntenic sequences in other cucurbits point to a pattern of local gene amplifications since the diversification of cucurbits from other families, and through speciation within the family. A candidate Vat gene is proposed based on the sequence similarity between a reported Vat gene from a Korean melon cultivar and a sequence fragment previously absent in the unrefined sequence. A sequence refinement strategy allowed substantial improvement of a 1 Mb fragment of the melon genome and the re-annotation of the largest cluster of NBS-LRR gene homologues found in melon. Analysis of the cluster revealed that resistance genes have been produced by sequence duplication in adjacent genome locations since the divergence of cucurbits from other close families, and through the process of speciation within the family a candidate Vat gene was also identified using sequence previously unavailable, which demonstrates the advantages of genome assembly refinements when analyzing complex regions such as those containing clusters of highly similar genes.
RNAi screening comes of age: improved techniques and complementary approaches
Mohr, Stephanie E.; Smith, Jennifer A.; Shamu, Caroline E.; Neumüller, Ralph A.; Perrimon, Norbert
2014-01-01
Gene silencing through sequence-specific targeting of mRNAs by RNAi has enabled genome-wide functional screens in cultured cells and in vivo in model organisms. These screens have resulted in the identification of new cellular pathways and potential drug targets. Considerable progress has been made to improve the quality of RNAi screen data through the development of new experimental and bioinformatics approaches. The recent availability of genome-editing strategies, such as the CRISPR (clustered regularly interspaced short palindromic repeats)-Cas9 system, when combined with RNAi, could lead to further improvements in screen data quality and follow-up experiments, thus promoting our understanding of gene function and gene regulatory networks. PMID:25145850
oPOSSUM: integrated tools for analysis of regulatory motif over-representation
Ho Sui, Shannan J.; Fulton, Debra L.; Arenillas, David J.; Kwon, Andrew T.; Wasserman, Wyeth W.
2007-01-01
The identification of over-represented transcription factor binding sites from sets of co-expressed genes provides insights into the mechanisms of regulation for diverse biological contexts. oPOSSUM, an internet-based system for such studies of regulation, has been improved and expanded in this new release. New features include a worm-specific version for investigating binding sites conserved between Caenorhabditis elegans and C. briggsae, as well as a yeast-specific version for the analysis of co-expressed sets of Saccharomyces cerevisiae genes. The human and mouse applications feature improvements in ortholog mapping, sequence alignments and the delineation of multiple alternative promoters. oPOSSUM2, introduced for the analysis of over-represented combinations of motifs in human and mouse genes, has been integrated with the original oPOSSUM system. Analysis using user-defined background gene sets is now supported. The transcription factor binding site models have been updated to include new profiles from the JASPAR database. oPOSSUM is available at http://www.cisreg.ca/oPOSSUM/ PMID:17576675
The role of transcriptome resilience in resistance of corals to bleaching.
Seneca, Francois O; Palumbi, Stephen R
2015-04-01
Wild populations increasingly experience extreme conditions as climate change amplifies environmental variability. How individuals respond to environmental extremes determines the impact of climate change overall. The variability of response from individual to individual can represent the opportunity for natural selection to occur as a result of extreme conditions. Here, we experimentally replicated the natural exposure to extreme temperatures of the reef lagoon at Ofu Island (American Samoa), where corals can experience severe heat stress during midday low tide. We investigated the bleaching and transcriptome response of 20 Acropora hyacinthus colonies 5 and 20 h after exposure to control (29 °C) or heated (35 °C) conditions. We found a highly dynamic transcriptome response: 27% of the coral transcriptome was significantly regulated 1 h postheat exposure. Yet 15 h later, when heat-induced coral bleaching became apparent, only 12% of the transcriptome was differentially regulated. A large proportion of responsive genes at the first time point returned to control levels, others remained differentially expressed over time, while an entirely different subset of genes was successively regulated at the second time point. However, a noteworthy variability in gene expression was observed among individual coral colonies. Among the genes of which expression lingered over time, fast return to normal levels was associated with low bleaching. Colonies that maintained higher expression levels of these genes bleached severely. Return to normal levels of gene expression after stress has been termed transcriptome resilience, and in the case of some specific genes may signal the physiological health and response ability of individuals to environmental stress. © 2015 John Wiley & Sons Ltd.
Raherison, Elie S M; Giguère, Isabelle; Caron, Sébastien; Lamara, Mebarek; MacKay, John J
2015-07-01
Transcript profiling has shown the molecular bases of several biological processes in plants but few studies have developed an understanding of overall transcriptome variation. We investigated transcriptome structure in white spruce (Picea glauca), aiming to delineate its modular organization and associated functional and evolutionary attributes. Microarray analyses were used to: identify and functionally characterize groups of co-expressed genes; investigate expressional and functional diversity of vascular tissue preferential genes which were conserved among Picea species, and identify expression networks underlying wood formation. We classified 22 857 genes as variable (79%; 22 coexpression groups) or invariant (21%) by profiling across several vegetative tissues. Modular organization and complex transcriptome restructuring among vascular tissue preferential genes was revealed by their assignment to coexpression groups with partially overlapping profiles and partially distinct functions. Integrated analyses of tissue-based and temporally variable profiles identified secondary xylem gene networks, showed their remodelling over a growing season and identified PgNAC-7 (no apical meristerm (NAM), Arabidopsis transcription activation factor (ATAF) and cup-shaped cotyledon (CUC) transcription factor 007 in Picea glauca) as a major hub gene specific to earlywood formation. Reference profiling identified comprehensive, statistically robust coexpressed groups, revealing that modular organization underpins the evolutionary conservation of the transcriptome structure. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Hybrid Nanomaterial Complexes for Advanced Phage-guided Gene Delivery
Yata, Teerapong; Lee, Koon-Yang; Dharakul, Tararaj; Songsivilai, Sirirurg; Bismarck, Alexander; Mintz, Paul J; Hajitou, Amin
2014-01-01
Developing nanomaterials that are effective, safe, and selective for gene transfer applications is challenging. Bacteriophages (phage), viruses that infect bacteria only, have shown promise for targeted gene transfer applications. Unfortunately, limited progress has been achieved in improving their potential to overcome mammalian cellular barriers. We hypothesized that chemical modification of the bacteriophage capsid could be applied to improve targeted gene delivery by phage vectors into mammalian cells. Here, we introduce a novel hybrid system consisting of two classes of nanomaterial systems, cationic polymers and M13 bacteriophage virus particles genetically engineered to display a tumor-targeting ligand and carry a transgene cassette. We demonstrate that the phage complex with cationic polymers generates positively charged phage and large aggregates that show enhanced cell surface attachment, buffering capacity, and improved transgene expression while retaining cell type specificity. Moreover, phage/polymer complexes carrying a therapeutic gene achieve greater cancer cell killing than phage alone. This new class of hybrid nanomaterial platform can advance targeted gene delivery applications by bacteriophage. PMID:25118171
Sequences of heavy and light chain variable regions from four bovine immunoglobulins.
Armour, K L; Tempest, P R; Fawcett, P H; Fernie, M L; King, S I; White, P; Taylor, G; Harris, W J
1994-12-01
Oligodeoxyribonucleotide primers based on the 5' ends of bovine IgG1/2 and lambda constant (C) region genes, together with primers encoding conserved amino acids at the N-terminus of mature variable (V) regions from other species, have been used in cDNA and polymerase chain reactions (PCRs) to amplify heavy and light chain V region cDNA from bovine heterohybridomas. The amino acid sequences of VH and V lambda from four bovine immunoglobulins of different specificities are presented.
González, Carolina; Yanquepe, María; Cardenas, Juan Pablo; Valdes, Jorge; Quatrini, Raquel; Holmes, David S; Dopson, Mark
2014-11-01
Acidophilic microorganisms inhabit low pH environments such as acid mine drainage that is generated when sulfide minerals are exposed to air. The genome sequence of the psychrotolerant Acidithiobacillus ferrivorans SS3 was compared to a metagenome from a low temperature acidic stream dominated by an A. ferrivorans-like strain. Stretches of genomic DNA characterized by few matches to the metagenome, termed 'metagenomic islands', encoded genes associated with metal efflux and pH homeostasis. The metagenomic islands were enriched in mobile elements such as phage proteins, transposases, integrases and in one case, predicted to be flanked by truncated tRNAs. Cus gene clusters predicted to be involved in copper efflux and further Cus-like RND systems were predicted to be located in metagenomic islands and therefore, constitute part of the flexible gene complement of the species. Phylogenetic analysis of Cus clusters showed both lineage specificity within the Acidithiobacillus genus as well as niche specificity associated with an acidic environment. The metagenomic islands also contained a predicted copper efflux P-type ATPase system and a polyphosphate kinase potentially involved in polyphosphate mediated copper resistance. This study identifies genetic variability of low temperature acidophiles that likely reflects metal resistance selective pressures in the copper rich environment. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Tolmachov, Oleg E
2015-01-01
Gene delivery in vivo that is tightly focused on the intended target cells is essential to maximize the benefits of gene therapy and to reduce unwanted side-effects. Cell surface markers are immediately available for probing by therapeutic gene vectors and are often used to direct gene transfer with these vectors to specific target cell populations. However, it is not unusual for the choice of available extra-cellular markers to be too scarce to provide a reliable definition of the desired therapeutically relevant set of target cells. Therefore, interrogation of intra-cellular determinants of cell-specificity, such as tissue-specific transcription factors, can be vital in order to provide detailed cell-guiding information to gene vector particles. An important improvement in cell-specific gene delivery can be achieved through auto-buildup in vector homing efficiency using intelligent 'self-focusing' of swarms of vector particles on target cells. Vector self-focusing was previously suggested to rely on the release of diffusible chemo-attractants after a successful target-specific hit by 'scout' vector particles. I hypothesize that intelligent self-focusing behaviour of swarms of cell-targeted therapeutic gene vectors can be accomplished without the employment of difficult-to-use diffusible chemo-attractants, instead relying on the intra-swarm signalling through cells expressing a non-diffusible extra-cellular receptor for the gene vectors. In the proposed model, cell-guiding information is gathered by the 'scout' gene vector particles, which: (1) attach to a variety of cells via a weakly binding (low affinity) receptor; (2) successfully facilitate gene transfer into these cells; (3) query intra-cellular determinants of cell-specificity with their transgene expression control elements and (4) direct the cell-specific biosynthesis of a vector-encoded strongly binding (high affinity) cell-surface receptor. Free members of the vector swarm loaded with therapeutic cargo are then attracted to and internalized into the intended target cells via the expressed cognate strongly binding extra-cellular receptor, causing escalation of gene transfer into these cells and increasing the copy number of the therapeutic gene expression modules. Such self-focusing swarms of gene vectors can be either homogeneous, with 'scout' and 'therapeutic' members of the swarm being structurally identical, or, alternatively, heterogeneous (split), with 'scout' and 'therapeutic' members of the swarm being structurally specialized. It is hoped that the proposed self-focusing cell-targeted gene vector swarms with receptor-mediated intra-swarm signalling could be particularly effective in 'top-up' gene delivery scenarios, achieving high-level and sustained expression of therapeutic transgenes that are prone to shut-down through degradation and silencing. Crucially, in contrast to low-precision 'general location' vector guidance by diffusible chemo-attractants, ear-marking non-diffusible receptors can provide high-accuracy targeting of therapeutic vector particles to the specific cell, which has undergone a 'successful cell-specific hit' by a 'scout' vector particle. Opportunities for cell targeting could be expanded, since in the proposed model of self-focusing it could be possible to probe a broad selection of intra-cellular determinants of cell-specificity and not just to rely exclusively on extra-cellular markers of cell-specificity. By employing such self-focusing gene vectors for the improvement of cell-targeted delivery of therapeutic genes, e.g., in cancer therapy or gene addition therapy of recessive genetic diseases, it could be possible to broaden a leeway for the reduction of the vector load and, consequently, to minimize undesired vector cytotoxicity, immune reactions, and the risk of inadvertent genetic modification of germline cells in genetic treatment in vivo. Copyright © 2014 Elsevier B.V. All rights reserved.
The road ahead: working towards effective clinical translation of myocardial gene therapies
Katz, Michael G; Fargnoli, Anthony S; Williams, Richard D; Bridges, Charles R
2014-01-01
During the last two decades the fields of molecular and cellular cardiology, and more recently molecular cardiac surgery, have developed rapidly. The concept of delivering cDNA encoding a therapeutic gene to cardiomyocytes using a vector system with substantial cardiac tropism, allowing for long-term expression of a therapeutic protein, has moved from hypothesis to bench to clinical application. However, the clinical results to date are still disappointing. The ideal gene transfer method should be explored in clinically relevant animal models of heart disease to evaluate the relative roles of specific molecular pathways in disease pathogenesis, helping to validate the potential targets for therapeutic intervention. Successful clinical cardiovascular gene therapy also requires the use of nonimmunogenic cardiotropic vectors capable of expressing the requisite amount of therapeutic protein in vivo and in situ. Depending on the desired application either regional or global myocardial gene delivery is required. Cardiac-specific delivery techniques incorporating mapping technologies for regional delivery and highly efficient methodologies for global delivery should improve the precision and specificity of gene transfer to the areas of interest and minimize collateral organ gene expression. PMID:24341816
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kolker, Eugene
Our project focused primarily on analysis of different types of data produced by global high-throughput technologies, data integration of gene annotation, and gene and protein expression information, as well as on getting a better functional annotation of Shewanella genes. Specifically, four of our numerous major activities and achievements include the development of: statistical models for identification and expression proteomics, superior to currently available approaches (including our own earlier ones); approaches to improve gene annotations on the whole-organism scale; standards for annotation, transcriptomics and proteomics approaches; and generalized approaches for data integration of gene annotation, gene and protein expression information.
Thomas, Austen C; Jarman, Simon N; Haman, Katherine H; Trites, Andrew W; Deagle, Bruce E
2014-08-01
Ecologists are increasingly interested in quantifying consumer diets based on food DNA in dietary samples and high-throughput sequencing of marker genes. It is tempting to assume that food DNA sequence proportions recovered from diet samples are representative of consumer's diet proportions, despite the fact that captive feeding studies do not support that assumption. Here, we examine the idea of sequencing control materials of known composition along with dietary samples in order to correct for technical biases introduced during amplicon sequencing and biological biases such as variable gene copy number. Using the Ion Torrent PGM(©) , we sequenced prey DNA amplified from scats of captive harbour seals (Phoca vitulina) fed a constant diet including three fish species in known proportions. Alongside, we sequenced a prey tissue mix matching the seals' diet to generate tissue correction factors (TCFs). TCFs improved the diet estimates (based on sequence proportions) for all species and reduced the average estimate error from 28 ± 15% (uncorrected) to 14 ± 9% (TCF-corrected). The experimental design also allowed us to infer the magnitude of prey-specific digestion biases and calculate digestion correction factors (DCFs). The DCFs were compared with possible proxies for differential digestion (e.g. fish protein%, fish lipid%) revealing a strong relationship between the DCFs and percent lipid of the fish prey, suggesting prey-specific corrections based on lipid content would produce accurate diet estimates in this study system. These findings demonstrate the value of parallel sequencing of food tissue mixtures in diet studies and offer new directions for future research in quantitative DNA diet analysis. © 2013 John Wiley & Sons Ltd.
Kraker, Jessica; Viswanathan, Shiv Kumar; Knöll, Ralph; Sadayappan, Sakthivel
2016-01-01
The South Asian population, numbered at 1.8 billion, is estimated to comprise around 20% of the global population and 1% of the American population, and has one of the highest rates of cardiovascular disease. While South Asians show increased classical risk factors for developing heart failure, the role of population-specific genetic risk factors has not yet been examined for this group. Hypertrophic cardiomyopathy (HCM) is one of the major cardiac genetic disorders among South Asians, leading to contractile dysfunction, heart failure, and sudden cardiac death. This disease displays autosomal dominant inheritance, and it is associated with a large number of variants in both sarcomeric and non-sarcomeric proteins. The South Asians, a population with large ethnic diversity, potentially carries region-specific polymorphisms. There is high variability in disease penetrance and phenotypic expression of variants associated with HCM. Thus, extensive studies are required to decipher pathogenicity and the physiological mechanisms of these variants, as well as the contribution of modifier genes and environmental factors to disease phenotypes. Conducting genotype-phenotype correlation studies will lead to improved understanding of HCM and, consequently, improved treatment options for this high-risk population. The objective of this review is to report the history of cardiovascular disease and HCM in South Asians, present previously published pathogenic variants, and introduce current efforts to study HCM using induced pluripotent stem cell-derived cardiomyocytes, next-generation sequencing, and gene editing technologies. The authors ultimately hope that this review will stimulate further research, drive novel discoveries, and contribute to the development of personalized medicine with the aim of expanding therapeutic strategies for HCM. PMID:27840609
Binder, Harald; Porzelius, Christine; Schumacher, Martin
2011-03-01
Analysis of molecular data promises identification of biomarkers for improving prognostic models, thus potentially enabling better patient management. For identifying such biomarkers, risk prediction models can be employed that link high-dimensional molecular covariate data to a clinical endpoint. In low-dimensional settings, a multitude of statistical techniques already exists for building such models, e.g. allowing for variable selection or for quantifying the added value of a new biomarker. We provide an overview of techniques for regularized estimation that transfer this toward high-dimensional settings, with a focus on models for time-to-event endpoints. Techniques for incorporating specific covariate structure are discussed, as well as techniques for dealing with more complex endpoints. Employing gene expression data from patients with diffuse large B-cell lymphoma, some typical modeling issues from low-dimensional settings are illustrated in a high-dimensional application. First, the performance of classical stepwise regression is compared to stage-wise regression, as implemented by a component-wise likelihood-based boosting approach. A second issues arises, when artificially transforming the response into a binary variable. The effects of the resulting loss of efficiency and potential bias in a high-dimensional setting are illustrated, and a link to competing risks models is provided. Finally, we discuss conditions for adequately quantifying the added value of high-dimensional gene expression measurements, both at the stage of model fitting and when performing evaluation. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
NASA Technical Reports Server (NTRS)
Moyer, E. L.; Al-Shayeb, B.; Baer, L. A.; Ronca, A. E.
2016-01-01
Exposure to stress in the womb shapes neurobiological and physiological outcomes of offspring in later life, including body weight regulation and metabolic profiles. Our previous work utilizing a centrifugation-induced hyper-gravity demonstrated significantly increased (8-15%) body mass in male, but not female, rats exposed throughout gestation to chronic 2-g from conception to birth. We reported a similar outcome in adult offspring exposed throughout gestation to Unpredictable Variable Prenatal Stress (UVPS). Here we examine gene expression changes and the plasma of animals treated with our UVPS model to identify a potential role for prenatal stress in this hypergravity programming effect. Specifically we focused on appetite control and energy expenditure pathways in prenatally stressed adult (90-day-old) male Sprague-Dawley rats.
Seed, Kimberley D.; Faruque, Shah M.; Mekalanos, John J.; Calderwood, Stephen B.; Qadri, Firdausi; Camilli, Andrew
2012-01-01
The Vibrio cholerae lipopolysaccharide O1 antigen is a major target of bacteriophages and the human immune system and is of critical importance for vaccine design. We used an O1-specific lytic bacteriophage as a tool to probe the capacity of V. cholerae to alter its O1 antigen and identified a novel mechanism by which this organism can modulate O antigen expression and exhibit intra-strain heterogeneity. We identified two phase variable genes required for O1 antigen biosynthesis, manA and wbeL. manA resides outside of the previously recognized O1 antigen biosynthetic locus, and encodes for a phosphomannose isomerase critical for the initial step in O1 antigen biosynthesis. We determined that manA and wbeL phase variants are attenuated for virulence, providing functional evidence to further support the critical role of the O1 antigen for infectivity. We provide the first report of phase variation modulating O1 antigen expression in V. cholerae, and show that the maintenance of these phase variable loci is an important means by which this facultative pathogen can generate the diverse subpopulations of cells needed for infecting the host intestinal tract and for escaping predation by an O1-specific phage. PMID:23028317
Lee, Donald W; Khavrutskii, Ilja V; Wallqvist, Anders; Bavari, Sina; Cooper, Christopher L; Chaudhury, Sidhartha
2016-01-01
The somatic diversity of antigen-recognizing B-cell receptors (BCRs) arises from Variable (V), Diversity (D), and Joining (J) (VDJ) recombination and somatic hypermutation (SHM) during B-cell development and affinity maturation. The VDJ junction of the BCR heavy chain forms the highly variable complementarity determining region 3 (CDR3), which plays a critical role in antigen specificity and binding affinity. Tracking the selection and mutation of the CDR3 can be useful in characterizing humoral responses to infection and vaccination. Although tens to hundreds of thousands of unique BCR genes within an expressed B-cell repertoire can now be resolved with high-throughput sequencing, tracking SHMs is still challenging because existing annotation methods are often limited by poor annotation coverage, inconsistent SHM identification across the VDJ junction, or lack of B-cell lineage data. Here, we present B-cell repertoire inductive lineage and immunosequence annotator (BRILIA), an algorithm that leverages repertoire-wide sequencing data to globally improve the VDJ annotation coverage, lineage tree assembly, and SHM identification. On benchmark tests against simulated human and mouse BCR repertoires, BRILIA correctly annotated germline and clonally expanded sequences with 94 and 70% accuracy, respectively, and it has a 90% SHM-positive prediction rate in the CDR3 of heavily mutated sequences; these are substantial improvements over existing methods. We used BRILIA to process BCR sequences obtained from splenic germinal center B cells extracted from C57BL/6 mice. BRILIA returned robust B-cell lineage trees and yielded SHM patterns that are consistent across the VDJ junction and agree with known biological mechanisms of SHM. By contrast, existing BCR annotation tools, which do not account for repertoire-wide clonal relationships, systematically underestimated both the size of clonally related B-cell clusters and yielded inconsistent SHM frequencies. We demonstrate BRILIA's utility in B-cell repertoire studies related to VDJ gene usage, mechanisms for adenosine mutations, and SHM hot spot motifs. Furthermore, we show that the complete gene usage annotation and SHM identification across the entire CDR3 are essential for studying the B-cell affinity maturation process through immunosequencing methods.
Lee, Kyu Ha; Tadesse, Mahlet G; Baccarelli, Andrea A; Schwartz, Joel; Coull, Brent A
2017-03-01
The analysis of multiple outcomes is becoming increasingly common in modern biomedical studies. It is well-known that joint statistical models for multiple outcomes are more flexible and more powerful than fitting a separate model for each outcome; they yield more powerful tests of exposure or treatment effects by taking into account the dependence among outcomes and pooling evidence across outcomes. It is, however, unlikely that all outcomes are related to the same subset of covariates. Therefore, there is interest in identifying exposures or treatments associated with particular outcomes, which we term outcome-specific variable selection. In this work, we propose a variable selection approach for multivariate normal responses that incorporates not only information on the mean model, but also information on the variance-covariance structure of the outcomes. The approach effectively leverages evidence from all correlated outcomes to estimate the effect of a particular covariate on a given outcome. To implement this strategy, we develop a Bayesian method that builds a multivariate prior for the variable selection indicators based on the variance-covariance of the outcomes. We show via simulation that the proposed variable selection strategy can boost power to detect subtle effects without increasing the probability of false discoveries. We apply the approach to the Normative Aging Study (NAS) epigenetic data and identify a subset of five genes in the asthma pathway for which gene-specific DNA methylations are associated with exposures to either black carbon, a marker of traffic pollution, or sulfate, a marker of particles generated by power plants. © 2016, The International Biometric Society.
An RNA-Seq-based reference transcriptome for Citrus.
Terol, Javier; Tadeo, Francisco; Ventimilla, Daniel; Talon, Manuel
2016-03-01
Previous RNA-Seq studies in citrus have been focused on physiological processes relevant to fruit quality and productivity of the major species, especially sweet orange. Less attention has been paid to vegetative or reproductive tissues, while most Citrus species have never been analysed. In this work, we characterized the transcriptome of vegetative and reproductive tissues from 12 Citrus species from all main phylogenetic groups. Our aims were to acquire a complete view of the citrus transcriptome landscape, to improve previous functional annotations and to obtain genetic markers associated with genes of agronomic interest. 28 samples were used for RNA-Seq analysis, obtained from 12 Citrus species: C. medica, C. aurantifolia, C. limon, C. bergamia, C. clementina, C. deliciosa, C. reshni, C. maxima, C. paradisi, C. aurantium, C. sinensis and Poncirus trifoliata. Four different organs were analysed: root, phloem, leaf and flower. A total of 3421 million Illumina reads were produced and mapped against the reference C. clementina genome sequence. Transcript discovery pipeline revealed 3326 new genes, the number of genes with alternative splicing was increased to 19,739, and a total of 73,797 transcripts were identified. Differential expression studies between the four tissues showed that gene expression is overall related to the physiological function of the specific organs above any other variable. Variants discovery analysis revealed the presence of indels and SNPs in genes associated with fruit quality and productivity. Pivotal pathways in citrus such as those of flavonoids, flavonols, ethylene and auxin were also analysed in detail. © 2015 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.
UGT2B17 and SULT1A1 gene copy number variation (CNV) detection by LabChip microfluidic technology.
Gaedigk, Andrea; Gaedigk, Roger; Leeder, J Steven
2010-05-01
Gene copy number variations (CNVs) are increasingly recognized to play important roles in the expression of genes and hence on their respective enzymatic activities. This has been demonstrated for a number of drug metabolizing genes, such as UDP-glucuronosyltransferases 2B17 (UGT2B17) and sulfotransferase 1A1 (SULT1A1), which are subject to genetic heterogeneity, including CNV. Quantitative assays to assess gene copy number are therefore becoming an integral part of accurate genotype assessment and phenotype prediction. In this study, we evaluated a microfluidics-based system, the Bio-Rad Experion system, to determine the power and utility of this platform to detect UGT2B17 and SULT1A1 CNV in DNA samples derived from blood and tissue. UGT2B17 is known to present with 0, 1 or 2 and SULT1A1 with up to 5 gene copies. Distinct clustering (p<0.001) into copy number groups was achieved for both genes. DNA samples derived from blood exhibited less inter-run variability compared to DNA samples obtained from liver tissue. This variability may be caused by tissue-specific PCR inhibitors as it could be overcome by using DNA from another tissue, or after the DNA had undergone whole genome amplification. This method produced results comparable to those reported for other quantitative test platforms.
Genetic Variability of 27 Traits in a Core Collection of Flax (Linum usitatissimum L.)
You, Frank M.; Jia, Gaofeng; Xiao, Jin; Duguid, Scott D.; Rashid, Khalid Y.; Booker, Helen M.; Cloutier, Sylvie
2017-01-01
Assessment of genetic variability of plant core germplasm is needed for efficient germplasm utilization in breeding improvement. A total of 391 accessions of a flax core collection, which preserves the variation present in the world collection of 3,378 accessions maintained by Plant Gene Resources of Canada (PGRC) and represents a broad range of geographical origins, different improvement statuses and two morphotypes, was evaluated in field trials in up to 8 year-location environments for 10 agronomic, eight seed quality, six fiber and three disease resistance traits. The large phenotypic variation in this subset was explained by morphotypes (22%), geographical origins (11%), and other variance components (67%). Both divergence and similarity between two basic morphotypes, namely oil or linseed and fiber types, were observed, whereby linseed accessions had greater thousand seed weight, seeds m−2, oil content, branching capability and resistance to powdery mildew while fiber accessions had greater straw weight, plant height, protein content and resistance to pasmo and fusarium wilt diseases, but they had similar performance in many traits and some of them shared common characteristics of fiber and linseed types. Weak geographical patterns within either fiber or linseed accessions were confirmed, but specific trait performance was identified in East Asia for fiber type, and South Asia and North America for linseed type. Relatively high broad-sense heritability was obtained for seed quality traits, followed by agronomic traits and resistance to powdery mildew and fusarium wilt. Diverse phenotypic and genetic variability in the flax core collection constitutes a useful resource for breeding. PMID:28993783
Genetic Variability of 27 Traits in a Core Collection of Flax (Linum usitatissimum L.).
You, Frank M; Jia, Gaofeng; Xiao, Jin; Duguid, Scott D; Rashid, Khalid Y; Booker, Helen M; Cloutier, Sylvie
2017-01-01
Assessment of genetic variability of plant core germplasm is needed for efficient germplasm utilization in breeding improvement. A total of 391 accessions of a flax core collection, which preserves the variation present in the world collection of 3,378 accessions maintained by Plant Gene Resources of Canada (PGRC) and represents a broad range of geographical origins, different improvement statuses and two morphotypes, was evaluated in field trials in up to 8 year-location environments for 10 agronomic, eight seed quality, six fiber and three disease resistance traits. The large phenotypic variation in this subset was explained by morphotypes (22%), geographical origins (11%), and other variance components (67%). Both divergence and similarity between two basic morphotypes, namely oil or linseed and fiber types, were observed, whereby linseed accessions had greater thousand seed weight, seeds m -2 , oil content, branching capability and resistance to powdery mildew while fiber accessions had greater straw weight, plant height, protein content and resistance to pasmo and fusarium wilt diseases, but they had similar performance in many traits and some of them shared common characteristics of fiber and linseed types. Weak geographical patterns within either fiber or linseed accessions were confirmed, but specific trait performance was identified in East Asia for fiber type, and South Asia and North America for linseed type. Relatively high broad-sense heritability was obtained for seed quality traits, followed by agronomic traits and resistance to powdery mildew and fusarium wilt. Diverse phenotypic and genetic variability in the flax core collection constitutes a useful resource for breeding.
Lu, Haiwei; Viswanath, Venkatesh; Ma, Cathleen; ...
2015-11-13
Overexpression of genes that modify gibberellin (GA) metabolism and signaling have been previously shown to produce trees with improved biomass production but highly disturbed development. In order to examine if more subtle types of genetic modification of GA could improve growth rate and modify tree architecture, we transformed a model poplar genotype (Populus tremula × P. alba) with eight genes, including two cisgenes (intact copies of native genes), four intragenes (modified copies of native genes), and two transgenes (from sexually incompatible species), and studied their effects under greenhouse and field conditions. In the greenhouse, four out of the eight testedmore » genes produced a significant and often striking improvement of stem volume, and two constructs significantly modified the proportion of root or shoot biomass. Characterization of GA concentrations in the cisgenic population that had an additional copy of a poplar GA20-oxidase gene showed elevated concentrations of 13-hydroxylated GAs compared to wild-type poplars. In the field, we observed growth improvement for three of the six tested constructs, but it was significantly greater for only one of the constructs, a pRGL:GA20-oxidase intragene. The greenhouse and field responses were highly variable, possibly to due to cross-talk among the GA pathway and other stress response pathways, or due to interactions between the cisgenes and intragenes with highly similar endogenes. Our results indicate that extensive field trials, similar to those required for conventional breeding, will be critical to evaluating the value and pleiotropic effects of GA-modifying genes.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lu, Haiwei; Viswanath, Venkatesh; Ma, Cathleen
Overexpression of genes that modify gibberellin (GA) metabolism and signaling have been previously shown to produce trees with improved biomass production but highly disturbed development. In order to examine if more subtle types of genetic modification of GA could improve growth rate and modify tree architecture, we transformed a model poplar genotype (Populus tremula × P. alba) with eight genes, including two cisgenes (intact copies of native genes), four intragenes (modified copies of native genes), and two transgenes (from sexually incompatible species), and studied their effects under greenhouse and field conditions. In the greenhouse, four out of the eight testedmore » genes produced a significant and often striking improvement of stem volume, and two constructs significantly modified the proportion of root or shoot biomass. Characterization of GA concentrations in the cisgenic population that had an additional copy of a poplar GA20-oxidase gene showed elevated concentrations of 13-hydroxylated GAs compared to wild-type poplars. In the field, we observed growth improvement for three of the six tested constructs, but it was significantly greater for only one of the constructs, a pRGL:GA20-oxidase intragene. The greenhouse and field responses were highly variable, possibly to due to cross-talk among the GA pathway and other stress response pathways, or due to interactions between the cisgenes and intragenes with highly similar endogenes. Our results indicate that extensive field trials, similar to those required for conventional breeding, will be critical to evaluating the value and pleiotropic effects of GA-modifying genes.« less
Pazhamala, Lekha T.; Purohit, Shilp; Saxena, Rachit K.; Garg, Vanika; Krishnamurthy, L.; Verdier, Jerome
2017-01-01
Abstract Pigeonpea (Cajanus cajan) is an important grain legume of the semi-arid tropics, mainly used for its protein rich seeds. To link the genome sequence information with agronomic traits resulting from specific developmental processes, a Cajanus cajan gene expression atlas (CcGEA) was developed using the Asha genotype. Thirty tissues/organs representing developmental stages from germination to senescence were used to generate 590.84 million paired-end RNA-Seq data. The CcGEA revealed a compendium of 28 793 genes with differential, specific, spatio-temporal and constitutive expression during various stages of development in different tissues. As an example to demonstrate the application of the CcGEA, a network of 28 flower-related genes analysed for cis-regulatory elements and splicing variants has been identified. In addition, expression analysis of these candidate genes in male sterile and male fertile genotypes suggested their critical role in normal pollen development leading to seed formation. Gene network analysis also identified two regulatory genes, a pollen-specific SF3 and a sucrose–proton symporter, that could have implications for improvement of agronomic traits such as seed production and yield. In conclusion, the CcGEA provides a valuable resource for pigeonpea to identify candidate genes involved in specific developmental processes and to understand the well-orchestrated growth and developmental process in this resilient crop. PMID:28338822
Li, Zibo; Heng, Jianfu; Yan, Jinhua; Guo, Xinwu; Tang, Lili; Chen, Ming; Peng, Limin; Wu, Yepeng; Wang, Shouman; Xiao, Zhi; Deng, Zhongping; Dai, Lizhong; Wang, Jun
2016-11-01
Gene-specific methylation and expression have shown biological and clinical importance for breast cancer diagnosis and prognosis. Integrated analysis of gene methylation and gene expression may identify genes associated with biology mechanism and clinical outcome of breast cancer and aid in clinical management. Using high-throughput microfluidic quantitative PCR, we analyzed the expression profiles of 48 candidate genes in 96 Chinese breast cancer patients and investigated their correlation with gene methylation and associations with breast cancer clinical parameters. Breast cancer-specific gene expression alternation was found in 25 genes with significant expression difference between paired tumor and normal tissues. A total of 9 genes (CCND2, EGFR, GSTP1, PGR, PTGS2, RECK, SOX17, TNFRSF10D, and WIF1) showed significant negative correlation between methylation and gene expression, which were validated in the TCGA database. Total 23 genes (ACADL, APC, BRCA2, CADM1, CAV1, CCND2, CST6, EGFR, ESR2, GSTP1, ICAM5, NPY, PGR, PTGS2, RECK, RUNX3, SFRP1, SOX17, SYK, TGFBR2, TNFRSF10D, WIF1, and WRN) annotated with potential TFBSs in the promoter regions showed negative correlation between methylation and expression. In logistics regression analysis, 31 of the 48 genes showed improved performance in disease prediction with combination of methylation and expression coefficient. Our results demonstrated the complex correlation and the possible regulatory mechanisms between DNA methylation and gene expression. Integration analysis of methylation and expression of candidate genes could improve performance in breast cancer prediction. These findings would contribute to molecular characterization and identification of biomarkers for potential clinical applications.
Why weight? Modelling sample and observational level variability improves power in RNA-seq analyses.
Liu, Ruijie; Holik, Aliaksei Z; Su, Shian; Jansz, Natasha; Chen, Kelan; Leong, Huei San; Blewitt, Marnie E; Asselin-Labat, Marie-Liesse; Smyth, Gordon K; Ritchie, Matthew E
2015-09-03
Variations in sample quality are frequently encountered in small RNA-sequencing experiments, and pose a major challenge in a differential expression analysis. Removal of high variation samples reduces noise, but at a cost of reducing power, thus limiting our ability to detect biologically meaningful changes. Similarly, retaining these samples in the analysis may not reveal any statistically significant changes due to the higher noise level. A compromise is to use all available data, but to down-weight the observations from more variable samples. We describe a statistical approach that facilitates this by modelling heterogeneity at both the sample and observational levels as part of the differential expression analysis. At the sample level this is achieved by fitting a log-linear variance model that includes common sample-specific or group-specific parameters that are shared between genes. The estimated sample variance factors are then converted to weights and combined with observational level weights obtained from the mean-variance relationship of the log-counts-per-million using 'voom'. A comprehensive analysis involving both simulations and experimental RNA-sequencing data demonstrates that this strategy leads to a universally more powerful analysis and fewer false discoveries when compared to conventional approaches. This methodology has wide application and is implemented in the open-source 'limma' package. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Sandeep, I Sriram; Das, Suryasnata; Nasim, Noohi; Mishra, Antaryami; Acharya, Laxmikanta; Joshi, Raj Kumar; Nayak, Sanghamitra; Mohanty, Sujata
2017-09-01
Curcuma longa L., accumulates substantial amount of curcumin and essential oil. Little is known about the differential expression of curcumin synthase (CURS) gene and consequent curcumin content variations at different agroclimatic zones. The present study aimed to evaluate the effect of climate, soil and harvesting phase on expression of CURS gene for curcumin yield in two high yielding turmeric cultivars. Expression of CURS gene at different experimental zones as well as at different harvesting phase was studied through transcriptional analysis by qRT-PCR. Curcumin varied from 1.5 to 5% and 1.4-5% in Surama and Roma respectively. The expression of CURS also varied from 0.402 to 5.584 fold in Surama and 0.856-5.217 fold in Roma. Difference in curcumin content at a particular zone varied among different harvesting period from 3.95 to 4.31% in Surama and 3.57-3.83% in Roma. Expression of CURS gene was also effected by harvesting time of the rhizome which varied from 7.389 to 16.882 fold in Surama and 4.41-8.342 fold in Roma. The CURS gene expression was found regardless of variations in curcumin content at different experimental zones. This may be due to the effects of soil and environmental variables. Expression was positively correlated with curcumin content with different harvesting time at a particular zone. This find indicates effect of soil and environment on molecular and biochemical dynamics of curcumin biosynthesis and could be useful in genetic improvement of turmeric. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Zhu, Genfa; Yang, Fengxi; Shi, Shanshan; Li, Dongmei; Wang, Zhen; Liu, Hailin; Huang, Dan; Wang, Caiyun
2015-01-01
The highly variable leaf color of Cymbidium sinense significantly improves its horticultural and economic value, and makes it highly desirable in the flower markets in China and Southeast Asia. However, little is understood about the molecular mechanism underlying leaf-color variations. In this study, we found the content of photosynthetic pigments, especially chlorophyll degradation metabolite in the leaf-color mutants is distinguished significantly from that in the wild type of Cymbidium sinense 'Dharma'. To further determine the candidate genes controlling leaf-color variations, we first sequenced the global transcriptome using 454 pyrosequencing. More than 0.7 million expressed sequence tags (ESTs) with an average read length of 445.9 bp were generated and assembled into 103,295 isotigs representing 68,460 genes. Of these isotigs, 43,433 were significantly aligned to known proteins in the public database, of which 29,299 could be categorized into 42 functional groups in the gene ontology system, 10,079 classified into 23 functional classifications in the clusters of orthologous groups system, and 23,092 assigned to 139 clusters of specific metabolic pathways in the Kyoto Encyclopedia of Genes and Genomes. Among these annotations, 95 isotigs were designated as involved in chlorophyll metabolism. On this basis, we identified 16 key enzyme-encoding genes in the chlorophyll metabolism pathway, the full length cDNAs and expressions of which were further confirmed. Expression pattern indicated that the key enzyme-encoding genes for chlorophyll degradation were more highly expressed in the leaf color mutants, as was consistent with their lower chlorophyll contents. This study is the first to supply an informative 454 EST dataset for Cymbidium sinense 'Dharma' and to identify original leaf color-associated genes, which provide important resources to facilitate gene discovery for molecular breeding, marketable trait discovery, and investigating various biological process in this species.
Shi, Shanshan; Li, Dongmei; Wang, Zhen; Liu, Hailin; Huang, Dan; Wang, Caiyun
2015-01-01
The highly variable leaf color of Cymbidium sinense significantly improves its horticultural and economic value, and makes it highly desirable in the flower markets in China and Southeast Asia. However, little is understood about the molecular mechanism underlying leaf-color variations. In this study, we found the content of photosynthetic pigments, especially chlorophyll degradation metabolite in the leaf-color mutants is distinguished significantly from that in the wild type of Cymbidium sinense 'Dharma'. To further determine the candidate genes controlling leaf-color variations, we first sequenced the global transcriptome using 454 pyrosequencing. More than 0.7 million expressed sequence tags (ESTs) with an average read length of 445.9 bp were generated and assembled into 103,295 isotigs representing 68,460 genes. Of these isotigs, 43,433 were significantly aligned to known proteins in the public database, of which 29,299 could be categorized into 42 functional groups in the gene ontology system, 10,079 classified into 23 functional classifications in the clusters of orthologous groups system, and 23,092 assigned to 139 clusters of specific metabolic pathways in the Kyoto Encyclopedia of Genes and Genomes. Among these annotations, 95 isotigs were designated as involved in chlorophyll metabolism. On this basis, we identified 16 key enzyme-encoding genes in the chlorophyll metabolism pathway, the full length cDNAs and expressions of which were further confirmed. Expression pattern indicated that the key enzyme-encoding genes for chlorophyll degradation were more highly expressed in the leaf color mutants, as was consistent with their lower chlorophyll contents. This study is the first to supply an informative 454 EST dataset for Cymbidium sinense 'Dharma' and to identify original leaf color-associated genes, which provide important resources to facilitate gene discovery for molecular breeding, marketable trait discovery, and investigating various biological process in this species. PMID:26042676
Schmidt, Florian; Gasparoni, Nina; Gasparoni, Gilles; Gianmoena, Kathrin; Cadenas, Cristina; Polansky, Julia K; Ebert, Peter; Nordström, Karl; Barann, Matthias; Sinha, Anupam; Fröhler, Sebastian; Xiong, Jieyi; Dehghani Amirabad, Azim; Behjati Ardakani, Fatemeh; Hutter, Barbara; Zipprich, Gideon; Felder, Bärbel; Eils, Jürgen; Brors, Benedikt; Chen, Wei; Hengstler, Jan G; Hamann, Alf; Lengauer, Thomas; Rosenstiel, Philip; Walter, Jörn; Schulz, Marcel H
2017-01-09
The binding and contribution of transcription factors (TF) to cell specific gene expression is often deduced from open-chromatin measurements to avoid costly TF ChIP-seq assays. Thus, it is important to develop computational methods for accurate TF binding prediction in open-chromatin regions (OCRs). Here, we report a novel segmentation-based method, TEPIC, to predict TF binding by combining sets of OCRs with position weight matrices. TEPIC can be applied to various open-chromatin data, e.g. DNaseI-seq and NOMe-seq. Additionally, Histone-Marks (HMs) can be used to identify candidate TF binding sites. TEPIC computes TF affinities and uses open-chromatin/HM signal intensity as quantitative measures of TF binding strength. Using machine learning, we find low affinity binding sites to improve our ability to explain gene expression variability compared to the standard presence/absence classification of binding sites. Further, we show that both footprints and peaks capture essential TF binding events and lead to a good prediction performance. In our application, gene-based scores computed by TEPIC with one open-chromatin assay nearly reach the quality of several TF ChIP-seq data sets. Finally, these scores correctly predict known transcriptional regulators as illustrated by the application to novel DNaseI-seq and NOMe-seq data for primary human hepatocytes and CD4+ T-cells, respectively. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Mans, Ben J; Pienaar, Ronel; Latif, Abdalla A; Potgieter, Fred T
2011-05-01
Sequence variation within the 18S SSU rRNA V4 hyper-variable region can affect the accuracy of real-time hybridization probe-based diagnostics for the detection of Theileria spp. infections. This is relevant for assays that use non-specific primers, such as the real-time hybridization assay for T. parva (Sibeko et al. 2008). To assess the effect of sequence variation on this test, the Theileria 18S gene from 62 buffalo and 49 cattle samples was cloned and ∼1000 clones sequenced. Twenty-six genotypes were detected which included known and novel genotypes for the T. buffeli, T. mutans, T. taurotragi and T. velifera clades. A novel genotype related to T. sp. (sable) was also detected in 1 bovine sample. Theileria genotypic diversity was higher in buffalo compared to cattle. Polymorphism within the T. parva hyper-variable region was confirmed by aberrant real-time melting peaks and supported by sequencing of the S5 ribosomal gene. Analysis of the S5 gene suggests that this gene can be a marker for species differentiation. T. parva, T. sp. (buffalo) and T. sp. (bougasvlei) remain the only genotypes amplified by the primer set of the hybridization assay. Therefore, the 18S sequence diversity observed does not seem to affect the current real-time hybridization assay for T. parva.
Fava, Natália M. N.; Soares, Rodrigo M.; Scalia, Luana A. M.; Kalapothakis, Evanguedes; Pena, Isabella F.; Vieira, Carlos U.; Faria, Elaine S. M.; Cunha, Maria J.; Couto, Talles R.; Cury, Márcia Cristina
2013-01-01
Giardia duodenalis is a small intestinal protozoan parasite of several terrestrial vertebrates. This work aims to assess the genotypic variability of Giardia duodenalis isolates from cattle, sheep and pigs in the Southeast of Brazil, by comparing the standard characterization between glutamate dehydrogenase (gdh) and triose phosphate isomerase (tpi) primers. Fecal samples from the three groups of animals were analyzed using the zinc sulphate centrifugal flotation technique. Out of 59 positive samples, 30 were from cattle, 26 from sheep and 3 from pigs. Cyst pellets were stored and submitted to PCR and nested-PCR reactions with gdh and tpi primers. Fragment amplification of gdh and tpi genes was observed in 25 (42.4%) and 36 (61.0%) samples, respectively. Regarding the sequencing, 24 sequences were obtained with gdh and 20 with tpi. For both genes, there was a prevalence of E specific species assemblage, although some isolates have been identified as A and B, by the tpi sequencing. This has also shown a larger number of heterogeneous sequences, which have been attribute to mixed infections between assemblages B and E. The largest variability of inter-assemblage associated to the frequency of heterogeneity provided by tpi sequencing reinforces the polymorphic nature of this gene and makes it an excellent target for studies on molecular epidemiology. PMID:24308010
Perera, Piyumali K; Gasser, Robin B; Jabbar, Abdul
2015-03-01
Oriental theileriosis is a tick-borne, protozoan disease of cattle caused by one or more genotypes of Theileria orientalis complex. In this study, we assessed sequence variability in a region of the 23kDa piroplasm membrane protein (p23) gene within and among three T. orientalis genotypes (designated buffeli, chitose and ikeda) in south-eastern Australia. Genomic DNA (n=100) was extracted from blood of infected cattle from various locations endemic for oriental theileriosis and tested by polymerase chain reaction (PCR)-coupled mutation scanning (single-strand conformation polymorphism (SSCP)) and targeted sequencing analysis. Eight distinct sequences represented all DNA samples, and three genotypes were found: buffeli (n=3), chitose (3) and ikeda (2). Nucleotide pairwise comparisons among these eight sequences revealed considerably higher variability among the genotypes (6.6-11.7%) than within them (0-1.9%), indicating that the p23 gene region allows the accurate identification of T. orientalis genotypes. In the future, we will combine this gene with other molecular markers to study the genetic structure of T. orientalis populations in Australasia, which will pave the way to establish a highly sensitive and specific PCR-based assay for genotypic diagnosis of infection and for assessing levels of parasitaemia in cattle. Copyright © 2014 Elsevier GmbH. All rights reserved.
Oleksandr Skyba; Daniel Cullen; Carl J. Douglas; Shawn D. Mansfield
2016-01-01
Identification of the specific genes and enzymes involved in the fungal degradation of lignocellulosic biomass derived from feedstocks with various compositions is essential to the development of improved bioenergy processes. In order to elucidate the effect of substrate composition on gene expression in wood-rotting fungi, we employed microarrays based on the...
Mission Advantages of Constant Power, Variable Isp Electrostatic Thrusters
NASA Technical Reports Server (NTRS)
Oleson, Steven R.
2000-01-01
Electric propulsion has moved from station-keeping capability for spacecraft to primary propulsion with the advent of both the Deep Space One asteroid flyby and geosynchronous spacecraft orbit insertion. In both cases notably more payload was delivered than would have been possible with chemical propulsion. To provide even greater improvements electrostatic thruster performance could be varied in specific impulse, but kept at constant power to provide better payload or trip time performance for different mission phases. Such variable specific impulse mission applications include geosynchronous and low earth orbit spacecraft stationkeeping and orbit insertion, geosynchronous reusable tug missions, and interplanetary probes. The application of variable specific impulse devices is shown to add from 5 to 15% payload for these missions. The challenges to building such devices include variable voltage power supplies and extending fuel throughput capabilities across the specific impulse range.
2014-01-01
Background The rhizome, the original stem of land plants, enables species to invade new territory and is a critical component of perenniality, especially in grasses. Red rice (Oryza longistaminata) is a perennial wild rice species with many valuable traits that could be used to improve cultivated rice cultivars, including rhizomatousness, disease resistance and drought tolerance. Despite these features, little is known about the molecular mechanisms that contribute to rhizome growth, development and function in this plant. Results We used an integrated approach to compare the transcriptome, proteome and metabolome of the rhizome to other tissues of red rice. 116 Gb of transcriptome sequence was obtained from various tissues and used to identify rhizome-specific and preferentially expressed genes, including transcription factors and hormone metabolism and stress response-related genes. Proteomics and metabolomics approaches identified 41 proteins and more than 100 primary metabolites and plant hormones with rhizome preferential accumulation. Of particular interest was the identification of a large number of gene transcripts from Magnaportha oryzae, the fungus that causes rice blast disease in cultivated rice, even though the red rice plants showed no sign of disease. Conclusions A significant set of genes, proteins and metabolites appear to be specifically or preferentially expressed in the rhizome of O. longistaminata. The presence of M. oryzae gene transcripts at a high level in apparently healthy plants suggests that red rice is resistant to this pathogen, and may be able to provide genes to cultivated rice that will enable resistance to rice blast disease. PMID:24521476
USDA-ARS?s Scientific Manuscript database
Common bean (Phaseolus vulgaris L.) is able to fix atmospheric nitrogen (N2) through symbiotic nitrogen fixation (SNF). Effective utilization of existing variability for SNF in common bean for genetic improvement requires an understanding of underlying genes and molecular mechanisms. The utility of ...
Dru, P.; Bras, F.; Dezelee, S.; Gay, P.; Petitjean, A. M.; Pierre-Deneubourg, A.; Teninges, D.; Contamine, D.
1993-01-01
The ref(2)P gene of Drosophila melanogaster was identified by the discovery of two alleles, P(o) and P(p), respectively, permissive and restrictive for sigma rhabdovirus multiplication. A surprising variability of this gene was first noticed by the observation of size differences between the transcripts of permissive and restrictive alleles. In this paper, another restrictive allele, P(n), clearly distinct from P(p), is described: it exhibits a weaker antiviral effect than P(p) and differs from P(p) by its molecular structure. Five types of alleles were distinguished on the basis of their molecular structure, as revealed by S1 nuclease analysis of 17 D. melanogaster strains; three alleles were permissive and two restrictive. Comparison of the sequences of four haplotypes revealed numerous point mutations, two deletions (21 and 24 bp) and a complex event involving a 3-bp deletion, all affected the coding region. The unusual variability of the ref(2)P locus was confirmed by the high ratio of amino acid replacements to synonymous mutations (7:1), as compared to that of other genes, such as the Adh (2:42). Nevertheless, nucleotide sequence comparison with the Drosophila erecta ref(2)P gene shows that selective pressures are exerted to maintain the existence of a functional protein. The effects of this high variability on the ref(2)P protein are discussed in relation to its specific antiviral properties and to its function in D. melanogaster, where it is required for male fertility. PMID:8462852
[Progress of gene editing technologies and prospect in traditional Chinese medicine].
Ma, Yan-Yan; Li, Jing-Zhe; Gao, Er-Ning; Qian, Dan; Zhong, Ju-Ying; Liu, Chang-Zhen
2017-01-01
Gene editing is a kind of technologies that makes precise modification to the genome. It can be used to knock out/in and replace the specific DNA fragment, and make accurate gene editing on the genome level. The essence of the technique is the DNA sequence change with use of non homologous end link repair and homologous recombination repair, combined with specific DNA target recognition and endonuclease.This technology has wide range of development prospects and high application value in terms of scientific research, agriculture, medical treatment and other fields. In the field of gene therapy, gene editing technology has achieved cross-time success in cancers such as leukemia, genetic disorders such as hemophilia, thalassemia, multiple muscle nutritional disorders and retrovirus associated infectious diseases such as AIDS and other diseases. The preparation work for new experimental methods and animal models combined with gene editing technology is under rapid development and improvement. Laboratories around the world have also applied gene editing technique in prevention of malaria, organ transplantation, biological pharmaceuticals, agricultural breeding improvement, resurrection of extinct species, and other research areas. This paper summarizes the application and development status of gene editing technique in the above fields, and also preliminarily explores the potential application prospect of the technology in the field of traditional Chinese medicine, and discusses the present controversy and thoughts. Copyright© by the Chinese Pharmaceutical Association.
FOXM1 promotes the progression of prostate cancer by regulating PSA gene transcription.
Liu, Youhong; Liu, Yijun; Yuan, Bowen; Yin, Linglong; Peng, Yuchong; Yu, Xiaohui; Zhou, Weibing; Gong, Zhicheng; Liu, Jianye; He, Leye; Li, Xiong
2017-03-07
Androgen/AR is the primary contributor to prostate cancer (PCa) progression by regulating Prostate Specific Antigen (PSA) gene transcription. The disease inevitably evolves to androgen-independent (AI) status. Other mechanisms by which PSA is regulated and develops to AI have not yet been fully determined. FOXM1 is a cell proliferation-specific transcription factor highly expressed in PCa cells compared to non-malignant prostate epithelial cells, suggesting that the aberrant overexpression of FOXM1 contributes to PCa development. In addition to regulating AR gene transcription and cell cycle-regulatory genes, FOXM1 selectively regulates the gene transcription of KLK2 and PSA, typical androgen responsive genes. Screening the potential FOXM1-binding sites by ChIP-PCR, we found that FOXM1 directly binds to the FHK binding motifs in the PSA promoter/enhancer regions. AI C4-2 cells have more FOXM1 binding sites than androgen dependent LNCaP cells. The depletion of FOXM1 by small molecular inhibitors significantly improves the suppression of PSA gene transcription by the anti-AR agent Cadosax. This is the first report showing that FOXM1 promotes PCa progression by regulating PSA gene transcription, particularly in AI PCa cells. The combination of anti-AR agents and FOXM1 inhibitors has the potential to greatly improve therapy for late-stage PCa patients by suppressing PSA levels.
Variability of Actinobacteria, a minor component of rumen microflora.
Suľák, M; Sikorová, L; Jankuvová, J; Javorský, P; Pristaš, P
2012-07-01
Actinobacteria (Actinomycetes) are a significant and interesting group of gram-positive bacteria. They are regular, though infrequent, members of the microbial life in the rumen and represent up to 3 % of total rumen bacteria; there is considerable lack of information about ecology and biology of rumen actinobacteria. During the characterization of variability of rumen treponemas using non-cultivation approach, we also noted the variability of rumen actinobacteria. By using Treponema-specific primers a specific 16S rRNA gene library was prepared from cow and sheep rumen total DNA. About 10 % of recombinant clones contained actinobacteria-like sequences. Phylogenetic analyses of 11 clones obtained showed the high variability of actinobacteria in the ruminant digestive system. While some sequences are nearly identical to known sequences of actinobacteria, we detected completely new clusters of actinobacteria-like sequences, representing probably new, as yet undiscovered, group of rumen Actinobacteria. Further research will be necessary for understanding their nature and functions in the rumen.
New Developments in CRISPR Technology: Improvements in Specificity and Efficiency.
Safari, Fatemeh; Farajnia, Safar; Ghasemi, Younes; Zarghami, Nosratollah
2017-01-01
RNA-guided endonuclease as a versatile genome editing technology opened new windows in various fields of biology. The simplicity of this revolutionary technique provides a promising future for its application in a broad range of approaches from functional annotation of genes to diseases, to genetic manipulation and gene therapy. Besides the site-specific activity of Cas9 endonuclease, the unintended cleavage known as off-target effect is still a major challenge for this genome editing technique. Various strategies have been developed to resolve this bottleneck including development of new softwares for designing optimized guide RNA (gRNA), engineering Cas9 enzyme, improvement in off-target detection assays, etc. Results: This review dedicated to discuss on methods that have been used for optimizing Cas9, specificity with the aim of improving this technology for therapeutic applications. In addition, the applications and novel breakthroughs in the field of CRISPR technology will be described. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Joo, Jihoon E; Hiden, Ursula; Lassance, Luciana; Gordon, Lavinia; Martino, David J; Desoye, Gernot; Saffery, Richard
2013-07-15
The endothelial compartment, comprising arterial, venous and lymphatic cell types, is established prenatally in association with rapid phenotypic and functional changes. The molecular mechanisms underpinning this process in utero have yet to be fully elucidated. The aim of this study was to investigate the potential for DNA methylation to act as a driver of the specific gene expression profiles of arterial and venous endothelial cells. Placenta-derived venous and arterial endothelial cells were collected at birth prior to culturing. DNA methylation was measured at >450,000 CpG sites in parallel with expression measurements taken from 25,000 annotated genes. A consistent set of genomic loci was found to show coordinate differential methylation between the arterial and venous cell types. This included many loci previously not investigated in relation to endothelial function. An inverse relationship was observed between gene expression and promoter methylation levels for a limited subset of genes implicated in endothelial function, including NOS3, encoding endothelial Nitric Oxide Synthase. Endothelial cells derived from the placental vasculature at birth contain widespread methylation of key regulatory genes. These are candidates involved in the specification of different endothelial cell types and represent potential target genes for environmentally mediated epigenetic disruption in utero in association with cardiovascular disease risk later in life.
Webster, Gordon; Embley, T Martin; Prosser, James I
2002-01-01
The impact of soil management practices on ammonia oxidizer diversity and spatial heterogeneity was determined in improved (addition of N fertilizer), unimproved (no additions), and semi-improved (intermediate management) grassland pastures at the Sourhope Research Station in Scotland. Ammonia oxidizer diversity within each grassland soil was assessed by PCR amplification of microbial community DNA with both ammonia oxidizer-specific, 16S rRNA gene (rDNA) and functional, amoA, gene primers. PCR products were analysed by denaturing gradient gel electrophoresis, phylogenetic analysis of partial 16S rDNA and amoA sequences, and hybridization with ammonia oxidizer-specific oligonucleotide probes. Ammonia oxidizer populations in unimproved soils were more diverse than those in improved soils and were dominated by organisms representing Nitrosospira clusters 1 and 3 and Nitrosomonas cluster 7 (closely related phylogenetically to Nitrosomonas europaea). Improved soils were only dominated by Nitrosospira cluster 3 and Nitrosomonas cluster 7. These differences were also reflected in functional gene (amoA) diversity, with amoA gene sequences of both Nitrosomonas and Nitrosospira species detected. Replicate 0.5-g samples of unimproved soil demonstrated significant spatial heterogeneity in 16S rDNA-defined ammonia oxidizer clusters, which was reflected in heterogeneity in ammonium concentration and pH. Heterogeneity in soil characteristics and ammonia oxidizer diversity were lower in improved soils. The results therefore demonstrate significant effects of soil management on diversity and heterogeneity of ammonia oxidizer populations that are related to similar changes in relevant soil characteristics.
Luo, Ming; Gilbert, Brian; Ayliffe, Michael
2016-07-01
Mutagenesis continues to play an essential role for understanding plant gene function and, in some instances, provides an opportunity for plant improvement. The development of gene editing technologies such as TALENs and zinc fingers has revolutionised the targeted mutation specificity that can now be achieved. The CRISPR/Cas9 system is the most recent addition to gene editing technologies and arguably the simplest requiring only two components; a small guide RNA molecule (sgRNA) and Cas9 endonuclease protein which complex to recognise and cleave a specific 20 bp target site present in a genome. Target specificity is determined by complementary base pairing between the sgRNA and target site sequence enabling highly specific, targeted mutation to be readily engineered. Upon target site cleavage, error-prone endogenous repair mechanisms produce small insertion/deletions at the target site usually resulting in loss of gene function. CRISPR/Cas9 gene editing has been rapidly adopted in plants and successfully undertaken in numerous species including major crop species. Its applications are not restricted to mutagenesis and target site cleavage can be exploited to promote sequence insertion or replacement by recombination. The multiple applications of this technology in plants are described.
Kumar, Manu; Choi, Ju-Young; Kumari, Nisha; Pareek, Ashwani; Kim, Seong-Ryong
2015-01-01
Salinity is one of the important abiotic factors for any crop management in irrigated as well as rainfed areas, which leads to poor harvests. This yield reduction in salt affected soils can be overcome by improving salt tolerance in crops or by soil reclamation. Salty soils can be reclaimed by leaching the salt or by cultivation of salt tolerance crops. Salt tolerance is a quantitative trait controlled by several genes. Poor knowledge about mechanism of its inheritance makes slow progress in its introgression into target crops. Brassica is known to be a good reclamation crop. Inter and intra specific variation within Brassica species shows potential of molecular breeding to raise salinity tolerant genotypes. Among the various molecular markers, SSR markers are getting high attention, since they are randomly sparsed, highly variable and show co-dominant inheritance. Furthermore, as sequencing techniques are improving and softwares to find SSR markers are being developed, SSR markers technology is also evolving rapidly. Comparative SSR marker studies targeting Arabidopsis thaliana and Brassica species which lie in the same family will further aid in studying the salt tolerance related QTLs and subsequent identification of the "candidate genes" and finding out the origin of important QTLs. Although, there are a few reports on molecular breeding for improving salt tolerance using molecular markers in Brassica species, usage of SSR markers has a big potential to improve salt tolerance in Brassica crops. In order to obtain best harvests, role of SSR marker driven breeding approaches play important role and it has been discussed in this review especially for the introgression of salt tolerance traits in crops.
Muscatelli, F; Abrous, D N; Massacrier, A; Boccaccio, I; Le Moal, M; Cau, P; Cremer, H
2000-12-12
Prader-Willi syndrome (PWS) is a complex neurogenetic disorder with considerable clinical variability that is thought in large part to be the result of a hypothalamic defect. PWS results from the absence of paternal expression of imprinted genes localized in the 15q11-q13 region; however, none of the characterized genes has so far been shown to be involved in the etiology of PWS. Here, we provide a detailed investigation of a mouse model deficient for NECDIN: Linked to the mutation, a neonatal lethality of variable penetrance is observed. Viable NECDIN: mutants show a reduction in both oxytocin-producing and luteinizing hormone-releasing hormone (LHRH)-producing neurons in hypothalamus. This represents the first evidence of a hypothalamic deficiency in a mouse model of PWS. NECDIN:-deficient mice also display increased skin scraping activity in the open field test and improved spatial learning and memory in the Morris water maze. The latter features are reminiscent of the skin picking and improved spatial memory that are characteristics of the PWS phenotype. These striking parallels in hypothalamic structure, emotional and cognitive-related behaviors strongly suggest that NECDIN is responsible for at least a subset of the multiple clinical manifestations of PWS.
Bruggeman, Jan Willem; Koster, Jan; Lodder, Paul; Repping, Sjoerd; Hamer, Geert
2018-06-15
Cancer cells have been found to frequently express genes that are normally restricted to the testis, often referred to as cancer/testis (CT) antigens or genes. Because germ cell-specific antigens are not recognized as "self" by the innate immune system, CT-genes have previously been suggested as ideal candidate targets for cancer therapy. The use of CT-genes in cancer therapy has thus far been unsuccessful, most likely because their identification has relied on gene expression in whole testis, including the testicular somatic cells, precluding the detection of true germ cell-specific genes. By comparing the transcriptomes of micro-dissected germ cell subtypes, representing the main developmental stages of human spermatogenesis, with the publicly accessible transcriptomes of 2617 samples from 49 different healthy somatic tissues and 9232 samples from 33 tumor types, we here discover hundreds of true germ cell-specific cancer expressed genes. Strikingly, we found these germ cell cancer genes (GC-genes) to be widely expressed in all analyzed tumors. Many GC-genes appeared to be involved in processes that are likely to actively promote tumor viability, proliferation and metastasis. Targeting these true GC-genes thus has the potential to inhibit tumor growth with infertility being the only possible side effect. Moreover, we identified a subset of GC-genes that are not expressed in spermatogonial stem cells. Targeting of this GC-gene subset is predicted to only lead to temporary infertility, as untargeted spermatogonial stem cells can recover spermatogenesis after treatment. Our GC-gene dataset enables improved understanding of tumor biology and provides multiple novel targets for cancer treatment.
Xu, Man K.; Gaysina, Darya; Tsonaka, Roula; Morin, Alexandre J. S.; Croudace, Tim J.; Barnett, Jennifer H.; Houwing-Duistermaat, Jeanine; Richards, Marcus; Jones, Peter B.
2017-01-01
Very few molecular genetic studies of personality traits have used longitudinal phenotypic data, therefore molecular basis for developmental change and stability of personality remains to be explored. We examined the role of the monoamine oxidase A gene (MAOA) on extraversion and neuroticism from adolescence to adulthood, using modern latent variable methods. A sample of 1,160 male and 1,180 female participants with complete genotyping data was drawn from a British national birth cohort, the MRC National Survey of Health and Development (NSHD). The predictor variable was based on a latent variable representing genetic variations of the MAOA gene measured by three SNPs (rs3788862, rs5906957, and rs979606). Latent phenotype variables were constructed using psychometric methods to represent cross-sectional and longitudinal phenotypes of extraversion and neuroticism measured at ages 16 and 26. In males, the MAOA genetic latent variable (AAG) was associated with lower extraversion score at age 16 (β = −0.167; CI: −0.289, −0.045; p = 0.007, FDRp = 0.042), as well as greater increase in extraversion score from 16 to 26 years (β = 0.197; CI: 0.067, 0.328; p = 0.003, FDRp = 0.036). No genetic association was found for neuroticism after adjustment for multiple testing. Although, we did not find statistically significant associations after multiple testing correction in females, this result needs to be interpreted with caution due to issues related to x-inactivation in females. The latent variable method is an effective way of modeling phenotype- and genetic-based variances and may therefore improve the methodology of molecular genetic studies of complex psychological traits. PMID:29075213
Xu, Man K; Gaysina, Darya; Tsonaka, Roula; Morin, Alexandre J S; Croudace, Tim J; Barnett, Jennifer H; Houwing-Duistermaat, Jeanine; Richards, Marcus; Jones, Peter B
2017-01-01
Very few molecular genetic studies of personality traits have used longitudinal phenotypic data, therefore molecular basis for developmental change and stability of personality remains to be explored. We examined the role of the monoamine oxidase A gene ( MAOA ) on extraversion and neuroticism from adolescence to adulthood, using modern latent variable methods. A sample of 1,160 male and 1,180 female participants with complete genotyping data was drawn from a British national birth cohort, the MRC National Survey of Health and Development (NSHD). The predictor variable was based on a latent variable representing genetic variations of the MAOA gene measured by three SNPs (rs3788862, rs5906957, and rs979606). Latent phenotype variables were constructed using psychometric methods to represent cross-sectional and longitudinal phenotypes of extraversion and neuroticism measured at ages 16 and 26. In males, the MAOA genetic latent variable (AAG) was associated with lower extraversion score at age 16 (β = -0.167; CI: -0.289, -0.045; p = 0.007, FDRp = 0.042), as well as greater increase in extraversion score from 16 to 26 years (β = 0.197; CI: 0.067, 0.328; p = 0.003, FDRp = 0.036). No genetic association was found for neuroticism after adjustment for multiple testing. Although, we did not find statistically significant associations after multiple testing correction in females, this result needs to be interpreted with caution due to issues related to x-inactivation in females. The latent variable method is an effective way of modeling phenotype- and genetic-based variances and may therefore improve the methodology of molecular genetic studies of complex psychological traits.
Application of DNA Machineries for the Barcode Patterned Detection of Genes or Proteins.
Zhou, Zhixin; Luo, Guofeng; Wulf, Verena; Willner, Itamar
2018-06-05
The study introduces an analytical platform for the detection of genes or aptamer-ligand complexes by nucleic acid barcode patterns generated by DNA machineries. The DNA machineries consist of nucleic acid scaffolds that include specific recognition sites for the different genes or aptamer-ligand analytes. The binding of the analytes to the scaffolds initiate, in the presence of the nucleotide mixture, a cyclic polymerization/nicking machinery that yields displaced strands of variable lengths. The electrophoretic separation of the resulting strands provides barcode patterns for the specific detection of the different analytes. Mixtures of DNA machineries that yield, upon sensing of different genes (or aptamer ligands), one-, two-, or three-band barcode patterns are described. The combination of nucleic acid scaffolds acting, in the presence of polymerase/nicking enzyme and nucleotide mixture, as DNA machineries, that generate multiband barcode patterns provide an analytical platform for the detection of an individual gene out of many possible genes. The diversity of genes (or other analytes) that can be analyzed by the DNA machineries and the barcode patterned imaging is given by the Pascal's triangle. As a proof-of-concept, the detection of one of six genes, that is, TP53, Werner syndrome, Tay-Sachs normal gene, BRCA1, Tay-Sachs mutant gene, and cystic fibrosis disorder gene by six two-band barcode patterns is demonstrated. The advantages and limitations of the detection of analytes by polymerase/nicking DNA machineries that yield barcode patterns as imaging readout signals are discussed.
Fast and robust group-wise eQTL mapping using sparse graphical models.
Cheng, Wei; Shi, Yu; Zhang, Xiang; Wang, Wei
2015-01-16
Genome-wide expression quantitative trait loci (eQTL) studies have emerged as a powerful tool to understand the genetic basis of gene expression and complex traits. The traditional eQTL methods focus on testing the associations between individual single-nucleotide polymorphisms (SNPs) and gene expression traits. A major drawback of this approach is that it cannot model the joint effect of a set of SNPs on a set of genes, which may correspond to hidden biological pathways. We introduce a new approach to identify novel group-wise associations between sets of SNPs and sets of genes. Such associations are captured by hidden variables connecting SNPs and genes. Our model is a linear-Gaussian model and uses two types of hidden variables. One captures the set associations between SNPs and genes, and the other captures confounders. We develop an efficient optimization procedure which makes this approach suitable for large scale studies. Extensive experimental evaluations on both simulated and real datasets demonstrate that the proposed methods can effectively capture both individual and group-wise signals that cannot be identified by the state-of-the-art eQTL mapping methods. Considering group-wise associations significantly improves the accuracy of eQTL mapping, and the successful multi-layer regression model opens a new approach to understand how multiple SNPs interact with each other to jointly affect the expression level of a group of genes.
Dryselius, Rikard; Izutsu, Kaori; Honda, Takeshi; Iida, Tetsuya
2008-01-01
Background Replication of bacterial chromosomes increases copy numbers of genes located near origins of replication relative to genes located near termini. Such differential gene dosage depends on replication rate, doubling time and chromosome size. Although little explored, differential gene dosage may influence both gene expression and location. For vibrios, a diverse family of fast growing gammaproteobacteria, gene dosage may be particularly important as they harbor two chromosomes of different size. Results Here we examined replication dynamics and gene dosage effects for the separate chromosomes of three Vibrio species. We also investigated locations for specific gene types within the genome. The results showed consistently larger gene dosage differences for the large chromosome which also initiated replication long before the small. Accordingly, large chromosome gene expression levels were generally higher and showed an influence from gene dosage. This was reflected by a higher abundance of growth essential and growth contributing genes of which many locate near the origin of replication. In contrast, small chromosome gene expression levels were low and appeared independent of gene dosage. Also, species specific genes are highly abundant and an over-representation of genes involved in transcription could explain its gene dosage independent expression. Conclusion Here we establish a link between replication dynamics and differential gene dosage on one hand and gene expression levels and the location of specific gene types on the other. For vibrios, this relationship appears connected to a polarisation of genetic content between its chromosomes, which may both contribute to and be enhanced by an improved adaptive capacity. PMID:19032792
Cellular and molecular mechanisms of HIV-1 integration targeting.
Engelman, Alan N; Singh, Parmit K
2018-07-01
Integration is central to HIV-1 replication and helps mold the reservoir of cells that persists in AIDS patients. HIV-1 interacts with specific cellular factors to target integration to interior regions of transcriptionally active genes within gene-dense regions of chromatin. The viral capsid interacts with several proteins that are additionally implicated in virus nuclear import, including cleavage and polyadenylation specificity factor 6, to suppress integration into heterochromatin. The viral integrase protein interacts with transcriptional co-activator lens epithelium-derived growth factor p75 to principally position integration within gene bodies. The integrase additionally senses target DNA distortion and nucleotide sequence to help fine-tune the specific phosphodiester bonds that are cleaved at integration sites. Research into virus-host interactions that underlie HIV-1 integration targeting has aided the development of a novel class of integrase inhibitors and may help to improve the safety of viral-based gene therapy vectors.
Deonovic, Benjamin; Wang, Yunhao; Weirather, Jason; Wang, Xiu-Jie; Au, Kin Fai
2017-01-01
Abstract Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from laborious experiments or extra genome/family trio data. In addition, there is a lack of methods for gene isoform level ASE analysis. We developed a tool, IDP-ASE, for full ASE analysis. By innovative integration of Third Generation Sequencing (TGS) long reads with Second Generation Sequencing (SGS) short reads, the accuracy of haplotyping and ASE quantification at the gene and gene isoform level was greatly improved as demonstrated by the gold standard data GM12878 data and semi-simulation data. In addition to methodology development, applications of IDP-ASE to human embryonic stem cells and breast cancer cells indicate that the imbalance of ASE and non-uniformity of gene isoform ASE is widespread, including tumorigenesis relevant genes and pluripotency markers. These results show that gene isoform expression and allele-specific expression cooperate to provide high diversity and complexity of gene regulation and expression, highlighting the importance of studying ASE at the gene isoform level. Our study provides a robust bioinformatics solution to understand ASE using RNA sequencing data only. PMID:27899656
Biotype-specific tcpA genes in Vibrio cholerae.
Iredell, J R; Manning, P A
1994-08-01
The tcpA gene, encoding the structural subunit of the toxin-coregulated pilus, has been isolated from a variety of clinical isolates of Vibrio cholerae, and the nucleotide sequence determined. Strict biotype-specific conservation within both the coding and putative regulatory regions was observed, with important differences between the El Tor and classical biotypes. V. cholerae O139 Bengal strains appear to have El Tor-type tcpA genes. Environmental O1 and non-O1 isolates have sequences that bind an El Tor-specific tcpA DNA probe and that are weakly and variably amplified by tcpA-specific polymerase chain reaction primers, under conditions of reduced stringency. The data presented allow the selection of primer pairs to help distinguish between clinical and environmental isolates, and to distinguish El Tor (and Bengal) biotypes from classical biotypes of V. cholerae. While the role of TcpA in cholera vaccine preparations remains unclear, the data strongly suggest that TcpA-containing vaccines directed at O1 strains need include only the two forms of TcpA, and that such vaccines directed at (O139) Bengal strains should include the TcpA of El Tor biotype.
Ren, Xiangkui; Feng, Yakai; Guo, Jintang; Wang, Haixia; Li, Qian; Yang, Jing; Hao, Xuefang; Lv, Juan; Ma, Nan; Li, Wenzhong
2015-08-07
Surface modification and endothelialization of vascular biomaterials are common approaches that are used to both resist the nonspecific adhesion of proteins and improve the hemocompatibility and long-term patency of artificial vascular grafts. Surface modification of vascular grafts using hydrophilic poly(ethylene glycol), zwitterionic polymers, heparin or other bioactive molecules can efficiently enhance hemocompatibility, and consequently prevent thrombosis on artificial vascular grafts. However, these modified surfaces may be excessively hydrophilic, which limits initial vascular endothelial cell adhesion and formation of a confluent endothelial lining. Therefore, the improvement of endothelialization on these grafts by chemical modification with specific peptides and genes is now arousing more and more interest. Several active peptides, such as RGD, CAG, REDV and YIGSR, can be specifically recognized by endothelial cells. Consequently, graft surfaces that are modified by these peptides can exhibit targeting selectivity for the adhesion of endothelial cells, and genes can be delivered by targeting carriers to specific tissues to enhance the promotion and regeneration of blood vessels. These methods could effectively accelerate selective endothelial cell recruitment and functional endothelialization. In this review, recent developments in the surface modification and endothelialization of biomaterials in vascular tissue engineering are summarized. Both gene engineering and targeting ligand immobilization are promising methods to improve the clinical outcome of artificial vascular grafts.
Cell-type specific features of circular RNA expression.
Salzman, Julia; Chen, Raymond E; Olsen, Mari N; Wang, Peter L; Brown, Patrick O
2013-01-01
Thousands of loci in the human and mouse genomes give rise to circular RNA transcripts; at many of these loci, the predominant RNA isoform is a circle. Using an improved computational approach for circular RNA identification, we found widespread circular RNA expression in Drosophila melanogaster and estimate that in humans, circular RNA may account for 1% as many molecules as poly(A) RNA. Analysis of data from the ENCODE consortium revealed that the repertoire of genes expressing circular RNA, the ratio of circular to linear transcripts for each gene, and even the pattern of splice isoforms of circular RNAs from each gene were cell-type specific. These results suggest that biogenesis of circular RNA is an integral, conserved, and regulated feature of the gene expression program.
Machado, Elisabete; Ferreira, Joana; Novais, Ângela; Peixe, Luísa; Cantón, Rafael; Baquero, Fernando; Coque, Teresa M.
2007-01-01
The variable presence of integrons among extended-spectrum beta-lactamase (ESBL)-producing Enterobacteriaceae species (0 to 66%) is described. Association between blaESBL and integrons occurred when these are linked to specific ESBL-type genes (In60 bearing ISCR1 and blaCTX-M-9) or when ESBL genes were superimposed onto selected plasmids carrying integrons. Some integrons were identical to those found during decades worldwide, illustrating the preservation of the genetic elements carrying them. PMID:17404002
How Does the Scientific Community Contribute to Gene Ontology?
Lovering, Ruth C
2017-01-01
Collaborations between the scientific community and members of the Gene Ontology (GO) Consortium have led to an increase in the number and specificity of GO terms, as well as increasing the number of GO annotations. A variety of approaches have been taken to encourage research scientists to contribute to the GO, but the success of these approaches has been variable. This chapter reviews both the successes and failures of engaging the scientific community in GO development and annotation, as well as, providing motivation and advice to encourage individual researchers to contribute to GO.
Selective DNA demethylation by fusion of TDG with a sequence-specific DNA-binding domain
Gregory, David J.; Mikhaylova, Lyudmila; Fedulov, Alexey V.
2012-01-01
Our ability to selectively manipulate gene expression by epigenetic means is limited, as there is no approach for targeted reactivation of epigenetically silenced genes, in contrast to what is available for selective gene silencing. We aimed to develop a tool for selective transcriptional activation by DNA demethylation. Here we present evidence that direct targeting of thymine-DNA-glycosylase (TDG) to specific sequences in the DNA can result in local DNA demethylation at potential regulatory sequences and lead to enhanced gene induction. When TDG was fused to a well-characterized DNA-binding domain [the Rel-homology domain (RHD) of NFκB], we observed decreased DNA methylation and increased transcriptional response to unrelated stimulus of inducible nitric oxide synthase (NOS2). The effect was not seen for control genes lacking either RHD-binding sites or high levels of methylation, nor in control mock-transduced cells. Specific reactivation of epigenetically silenced genes may thus be achievable by this approach, which provides a broadly useful strategy to further our exploration of biological mechanisms and to improve control over the epigenome. PMID:22419066
Recombinant Rp1 genes confer necrotic or nonspecific resistance phenotypes.
Smith, Shavannor M; Steinau, Martin; Trick, Harold N; Hulbert, Scot H
2010-06-01
Genes at the Rp1 rust resistance locus of maize confer race-specific resistance to the common rust fungus Puccinia sorghi. Three variant genes with nonspecific effects (HRp1 -Kr1N, -D*21 and -MD*19) were found to be generated by intragenic crossing over within the LRR region. The LRR region of most NBS-LRR encoding genes is quite variable and codes for one of the regions in resistance gene proteins that controls specificity. Sequence comparisons demonstrated that the Rp1-Kr1N recombinant gene was identical to the N-terminus of the rp1-kp2 gene and C-terminus of another gene from its HRp1-K grandparent. The Rp1-D*21 recombinant gene consists of the N-terminus of the rp1-dp2 gene and C-terminus of the Rp1-D gene from the parental haplotype. Similarly, a recombinant gene from the Rp1-MD*19 haplotype has the N-terminus of an rp1 gene from the HRp1-M parent and C-terminus of the rp1-D19 gene from the HRp1-D parent. The recombinant Rp1 -Kr1N, -D*21 and -MD*19 genes activated defense responses in the absence of their AVR proteins triggering HR (hypersensitive response) in the absence of the pathogen. The results indicate that the frequent intragenic recombination events that occur in the Rp1 gene cluster not only recombine the genes into novel haplotypes, but also create genes with nonspecific effects. Some of these may contribute to nonspecific quantitative resistance but others have severe consequences for the fitness of the plant.
NASA Astrophysics Data System (ADS)
Corcoran, Martin M.; Phad, Ganesh E.; Bernat, Néstor Vázquez; Stahl-Hennig, Christiane; Sumida, Noriyuki; Persson, Mats A. A.; Martin, Marcel; Hedestam, Gunilla B. Karlsson
2016-12-01
Comprehensive knowledge of immunoglobulin genetics is required to advance our understanding of B cell biology. Validated immunoglobulin variable (V) gene databases are close to completion only for human and mouse. We present a novel computational approach, IgDiscover, that identifies germline V genes from expressed repertoires to a specificity of 100%. IgDiscover uses a cluster identification process to produce candidate sequences that, once filtered, results in individualized germline V gene databases. IgDiscover was tested in multiple species, validated by genomic cloning and cross library comparisons and produces comprehensive gene databases even where limited genomic sequence is available. IgDiscover analysis of the allelic content of the Indian and Chinese-origin rhesus macaques reveals high levels of immunoglobulin gene diversity in this species. Further, we describe a novel human IGHV3-21 allele and confirm significant gene differences between Balb/c and C57BL6 mouse strains, demonstrating the power of IgDiscover as a germline V gene discovery tool.
Corcoran, Martin M.; Phad, Ganesh E.; Bernat, Néstor Vázquez; Stahl-Hennig, Christiane; Sumida, Noriyuki; Persson, Mats A.A.; Martin, Marcel; Hedestam, Gunilla B. Karlsson
2016-01-01
Comprehensive knowledge of immunoglobulin genetics is required to advance our understanding of B cell biology. Validated immunoglobulin variable (V) gene databases are close to completion only for human and mouse. We present a novel computational approach, IgDiscover, that identifies germline V genes from expressed repertoires to a specificity of 100%. IgDiscover uses a cluster identification process to produce candidate sequences that, once filtered, results in individualized germline V gene databases. IgDiscover was tested in multiple species, validated by genomic cloning and cross library comparisons and produces comprehensive gene databases even where limited genomic sequence is available. IgDiscover analysis of the allelic content of the Indian and Chinese-origin rhesus macaques reveals high levels of immunoglobulin gene diversity in this species. Further, we describe a novel human IGHV3-21 allele and confirm significant gene differences between Balb/c and C57BL6 mouse strains, demonstrating the power of IgDiscover as a germline V gene discovery tool. PMID:27995928
Fuentes, Lisa de las; Schwander, Karen; Cupples, L. Adrienne; Rao, D. C.
2015-01-01
Background Genetic variation accounts for approximately 30% of blood pressure (BP) variability but most of that variability hasn't been attributed to specific variants. Interactions between genes and BP-associated factors may explain some ‘missing heritability.’ Cigarette smoking increases BP after short-term exposure and decreases BP with longer exposure. Gene-smoking interactions have discovered novel BP loci, but the contribution of smoking status and intensity to gene discovery is unknown. Methods We analyzed gene-smoking intensity interactions for association with systolic BP (SBP) in three subgroups from the Framingham Heart Study: current smokers only (N = 1,057), current and former smokers (‘ever smokers’, N = 3,374), and all subjects (N = 6,710). We used three smoking intensity variables defined at cutoffs of 10, 15, and 20 cigarettes per day (CPD). We evaluated the 1 degree-of-freedom (df) interaction and 2df joint test using generalized estimating equations. Results Analysis of current smokers using a CPD cutoff of 10 produced two loci associated with SBP. The rs9399633 minor allele was associated with increased SBP (5 mmHg) in heavy smokers (CPD>10) but decreased SBP (7 mmHg) in light smokers (CPD≤10). The rs11717948 minor allele was associated with decreased SBP (8 mmHg) in light smokers but decreased SBP (2 mmHg) in heavy smokers. Across all nine analyses, 19 additional loci reached p < 1×10−6. Discussion Analysis of current smokers may have the highest power to detect gene-smoking interactions, despite the reduced sample size. Associations of loci near SASH1 and KLHL6/KLHL24 with SBP may be modulated by tobacco smoking. PMID:25940791
Basson, Jacob; Sung, Yun Ju; Fuentes, Lisa de Las; Schwander, Karen; Cupples, L Adrienne; Rao, D C
2015-09-01
Genetic variation accounts for approximately 30% of blood pressure (BP) variability but most of that variability has not been attributed to specific variants. Interactions between genes and BP-associated factors may explain some "missing heritability." Cigarette smoking increases BP after short-term exposure and decreases BP with longer exposure. Gene-smoking interactions have discovered novel BP loci, but the contribution of smoking status and intensity to gene discovery is unknown. We analyzed gene-smoking intensity interactions for association with systolic BP (SBP) in three subgroups from the Framingham Heart Study: current smokers only (N = 1,057), current and former smokers ("ever smokers," N = 3,374), and all subjects (N = 6,710). We used three smoking intensity variables defined at cutoffs of 10, 15, and 20 cigarettes per day (CPD). We evaluated the 1 degree-of-freedom (df) interaction and 2df joint test using generalized estimating equations. Analysis of current smokers using a CPD cutoff of 10 produced two loci associated with SBP. The rs9399633 minor allele was associated with increased SBP (5 mmHg) in heavy smokers (CPD > 10) but decreased SBP (7 mmHg) in light smokers (CPD ≤ 10). The rs11717948 minor allele was associated with decreased SBP (8 mmHg) in light smokers but decreased SBP (2 mmHg) in heavy smokers. Across all nine analyses, 19 additional loci reached P < 1 × 10(-6). Analysis of current smokers may have the highest power to detect gene-smoking interactions, despite the reduced sample size. Associations of loci near SASH1 and KLHL6/KLHL24 with SBP may be modulated by tobacco smoking. © 2015 WILEY PERIODICALS, INC.
Lee, Wan Sin; Gudimella, Ranganath; Wong, Gwo Rong; Tammi, Martti Tapani; Khalid, Norzulaani; Harikrishna, Jennifer Ann
2015-01-01
Physiological responses to stress are controlled by expression of a large number of genes, many of which are regulated by microRNAs. Since most banana cultivars are salt-sensitive, improved understanding of genetic regulation of salt induced stress responses in banana can support future crop management and improvement in the face of increasing soil salinity related to irrigation and climate change. In this study we focused on determining miRNA and their targets that respond to NaCl exposure and used transcriptome sequencing of RNA and small RNA from control and NaCl-treated banana roots to assemble a cultivar-specific reference transcriptome and identify orthologous and Musa-specific miRNA responding to salinity. We observed that, banana roots responded to salinity stress with changes in expression for a large number of genes (9.5% of 31,390 expressed unigenes) and reduction in levels of many miRNA, including several novel miRNA and banana-specific miRNA-target pairs. Banana roots expressed a unique set of orthologous and Musa-specific miRNAs of which 59 respond to salt stress in a dose-dependent manner. Gene expression patterns of miRNA compared with those of their predicted mRNA targets indicated that a majority of the differentially expressed miRNAs were down-regulated in response to increased salinity, allowing increased expression of targets involved in diverse biological processes including stress signaling, stress defence, transport, cellular homeostasis, metabolism and other stress-related functions. This study may contribute to the understanding of gene regulation and abiotic stress response of roots and the high-throughput sequencing data sets generated may serve as important resources related to salt tolerance traits for functional genomic studies and genetic improvement in banana. PMID:25993649
Lee, Wan Sin; Gudimella, Ranganath; Wong, Gwo Rong; Tammi, Martti Tapani; Khalid, Norzulaani; Harikrishna, Jennifer Ann
2015-01-01
Physiological responses to stress are controlled by expression of a large number of genes, many of which are regulated by microRNAs. Since most banana cultivars are salt-sensitive, improved understanding of genetic regulation of salt induced stress responses in banana can support future crop management and improvement in the face of increasing soil salinity related to irrigation and climate change. In this study we focused on determining miRNA and their targets that respond to NaCl exposure and used transcriptome sequencing of RNA and small RNA from control and NaCl-treated banana roots to assemble a cultivar-specific reference transcriptome and identify orthologous and Musa-specific miRNA responding to salinity. We observed that, banana roots responded to salinity stress with changes in expression for a large number of genes (9.5% of 31,390 expressed unigenes) and reduction in levels of many miRNA, including several novel miRNA and banana-specific miRNA-target pairs. Banana roots expressed a unique set of orthologous and Musa-specific miRNAs of which 59 respond to salt stress in a dose-dependent manner. Gene expression patterns of miRNA compared with those of their predicted mRNA targets indicated that a majority of the differentially expressed miRNAs were down-regulated in response to increased salinity, allowing increased expression of targets involved in diverse biological processes including stress signaling, stress defence, transport, cellular homeostasis, metabolism and other stress-related functions. This study may contribute to the understanding of gene regulation and abiotic stress response of roots and the high-throughput sequencing data sets generated may serve as important resources related to salt tolerance traits for functional genomic studies and genetic improvement in banana.
Detecting regulatory gene-environment interactions with unmeasured environmental factors.
Fusi, Nicoló; Lippert, Christoph; Borgwardt, Karsten; Lawrence, Neil D; Stegle, Oliver
2013-06-01
Genomic studies have revealed a substantial heritable component of the transcriptional state of the cell. To fully understand the genetic regulation of gene expression variability, it is important to study the effect of genotype in the context of external factors such as alternative environmental conditions. In model systems, explicit environmental perturbations have been considered for this purpose, allowing to directly test for environment-specific genetic effects. However, such experiments are limited to species that can be profiled in controlled environments, hampering their use in important systems such as human. Moreover, even in seemingly tightly regulated experimental conditions, subtle environmental perturbations cannot be ruled out, and hence unknown environmental influences are frequent. Here, we propose a model-based approach to simultaneously infer unmeasured environmental factors from gene expression profiles and use them in genetic analyses, identifying environment-specific associations between polymorphic loci and individual gene expression traits. In extensive simulation studies, we show that our method is able to accurately reconstruct environmental factors and their interactions with genotype in a variety of settings. We further illustrate the use of our model in a real-world dataset in which one environmental factor has been explicitly experimentally controlled. Our method is able to accurately reconstruct the true underlying environmental factor even if it is not given as an input, allowing to detect genuine genotype-environment interactions. In addition to the known environmental factor, we find unmeasured factors involved in novel genotype-environment interactions. Our results suggest that interactions with both known and unknown environmental factors significantly contribute to gene expression variability. and implementation: Software available at http://pmbio.github.io/envGPLVM/. Supplementary data are available at Bioinformatics online.
HIV promoter integration site primarily modulates transcriptional burst size rather than frequency.
Skupsky, Ron; Burnett, John C; Foley, Jonathan E; Schaffer, David V; Arkin, Adam P
2010-09-30
Mammalian gene expression patterns, and their variability across populations of cells, are regulated by factors specific to each gene in concert with its surrounding cellular and genomic environment. Lentiviruses such as HIV integrate their genomes into semi-random genomic locations in the cells they infect, and the resulting viral gene expression provides a natural system to dissect the contributions of genomic environment to transcriptional regulation. Previously, we showed that expression heterogeneity and its modulation by specific host factors at HIV integration sites are key determinants of infected-cell fate and a possible source of latent infections. Here, we assess the integration context dependence of expression heterogeneity from diverse single integrations of a HIV-promoter/GFP-reporter cassette in Jurkat T-cells. Systematically fitting a stochastic model of gene expression to our data reveals an underlying transcriptional dynamic, by which multiple transcripts are produced during short, infrequent bursts, that quantitatively accounts for the wide, highly skewed protein expression distributions observed in each of our clonal cell populations. Interestingly, we find that the size of transcriptional bursts is the primary systematic covariate over integration sites, varying from a few to tens of transcripts across integration sites, and correlating well with mean expression. In contrast, burst frequencies are scattered about a typical value of several per cell-division time and demonstrate little correlation with the clonal means. This pattern of modulation generates consistently noisy distributions over the sampled integration positions, with large expression variability relative to the mean maintained even for the most productive integrations, and could contribute to specifying heterogeneous, integration-site-dependent viral production patterns in HIV-infected cells. Genomic environment thus emerges as a significant control parameter for gene expression variation that may contribute to structuring mammalian genomes, as well as be exploited for survival by integrating viruses.
Junick, Jana
2012-01-01
Quantitative real-time PCR assays targeting the groEL gene for the specific enumeration of 12 human fecal Bifidobacterium species were developed. The housekeeping gene groEL (HSP60 in eukaryotes) was used as a discriminative marker for the differentiation of Bifidobacterium adolescentis, B. angulatum, B. animalis, B. bifidum, B. breve, B. catenulatum, B. dentium, B. gallicum, B. longum, B. pseudocatenulatum, B. pseudolongum, and B. thermophilum. The bifidobacterial chromosome contains a single copy of the groEL gene, allowing the determination of the cell number by quantification of the groEL copy number. Real-time PCR assays were validated by comparing fecal samples spiked with known numbers of a given Bifidobacterium species. Independent of the Bifidobacterium species tested, the proportion of groEL copies recovered from fecal samples spiked with 5 to 9 log10 cells/g feces was approximately 50%. The quantification limit was 5 to 6 log10 groEL copies/g feces. The interassay variability was less than 10%, and variability between different DNA extractions was less than 23%. The method developed was applied to fecal samples from healthy adults and full-term breast-fed infants. Bifidobacterial diversity in both adults and infants was low, with mostly ≤3 Bifidobacterium species and B. longum frequently detected. The predominant species in infant and adult fecal samples were B. breve and B. adolescentis, respectively. It was possible to distinguish B. catenulatum and B. pseudocatenulatum. We conclude that the groEL gene is a suitable molecular marker for the specific and accurate quantification of human fecal Bifidobacterium species by real-time PCR. PMID:22307308
Computational Selection of Transcriptomics Experiments Improves Guilt-by-Association Analyses
Bhat, Prajwal; Yang, Haixuan; Bögre, László; Devoto, Alessandra; Paccanaro, Alberto
2012-01-01
The Guilt-by-Association (GBA) principle, according to which genes with similar expression profiles are functionally associated, is widely applied for functional analyses using large heterogeneous collections of transcriptomics data. However, the use of such large collections could hamper GBA functional analysis for genes whose expression is condition specific. In these cases a smaller set of condition related experiments should instead be used, but identifying such functionally relevant experiments from large collections based on literature knowledge alone is an impractical task. We begin this paper by analyzing, both from a mathematical and a biological point of view, why only condition specific experiments should be used in GBA functional analysis. We are able to show that this phenomenon is independent of the functional categorization scheme and of the organisms being analyzed. We then present a semi-supervised algorithm that can select functionally relevant experiments from large collections of transcriptomics experiments. Our algorithm is able to select experiments relevant to a given GO term, MIPS FunCat term or even KEGG pathways. We extensively test our algorithm on large dataset collections for yeast and Arabidopsis. We demonstrate that: using the selected experiments there is a statistically significant improvement in correlation between genes in the functional category of interest; the selected experiments improve GBA-based gene function prediction; the effectiveness of the selected experiments increases with annotation specificity; our algorithm can be successfully applied to GBA-based pathway reconstruction. Importantly, the set of experiments selected by the algorithm reflects the existing literature knowledge about the experiments. [A MATLAB implementation of the algorithm and all the data used in this paper can be downloaded from the paper website: http://www.paccanarolab.org/papers/CorrGene/]. PMID:22879875
A Century of Shope Papillomavirus in Museum Rabbit Specimens
Escudero Duch, Clara; Williams, Richard A. J.; Timm, Robert M.; Perez-Tris, Javier; Benitez, Laura
2015-01-01
Sylvilagus floridanus Papillomavirus (SfPV) causes growth of large horn-like tumors on rabbits. SfPV was described in cottontail rabbits (probably Sylvilagus floridanus) from Kansas and Iowa by Richard Shope in 1933, and detected in S. audubonii in 2011. It is known almost exclusively from the US Midwest. We explored the University of Kansas Natural History Museum for historical museum specimens infected with SfPV, using molecular techniques, to assess if additional wild species host SfPV, and whether SfPV occurs throughout the host range, or just in the Midwest. Secondary aims were to detect distinct strains, and evidence for strain spatio-temporal specificity. We found 20 of 1395 rabbits in the KU collection SfPV symptomatic. Three of 17 lagomorph species (S. nuttallii, and the two known hosts) were symptomatic, while Brachylagus, Lepus and eight additional Sylvilagus species were not. 13 symptomatic individuals were positive by molecular testing, including the first S. nuttallii detection. Prevalence of symptomatic individuals was significantly higher in Sylvilagus (1.8%) than Lepus. Half of these specimens came from Kansas, though new molecular detections were obtained from Jalisco—Mexico’s first—and Nebraska, Nevada, New Mexico, and Texas, USA. We document the oldest lab-confirmed case (Kansas, 1915), pre-dating Shope’s first case. SfPV amplification was possible from 63.2% of symptomatic museum specimens. Using multiple methodologies, rolling circle amplification and, multiple isothermal displacement amplification in addition to PCR, greatly improved detection rates. Short sequences were obtained from six individuals for two genes. L1 gene sequences were identical to all previously detected sequences; E7 gene sequences, were more variable, yielding five distinct SfPV1 strains that differing by less than 2% from strains circulating in the Midwest and Mexico, between 1915 and 2005. Our results do not clarify whether strains are host species specific, though they are consistent with SfPV specificity to genus Sylvilagus. PMID:26147570
Wang, Jack P.; Matthews, Megan L.; Williams, Cranos M.; ...
2018-04-20
A multi-omics quantitative integrative analysis of lignin biosynthesis can advance the strategic engineering of wood for timber, pulp, and biofuels. Lignin is polymerized from three monomers (monolignols) produced by a grid-like pathway. The pathway in wood formation of Populus trichocarpa has at least 21 genes, encoding enzymes that mediate 37 reactions on 24 metabolites, leading to lignin and affecting wood properties. We perturb these 21 pathway genes and integrate transcriptomic, proteomic, fluxomic and phenomic data from 221 lines selected from ~2000 transgenics (6-month-old). The integrative analysis estimates how changing expression of pathway gene or gene combination affects protein abundance, metabolic-flux,more » metabolite concentrations, and 25 wood traits, including lignin, tree-growth, density, strength, and saccharification. The analysis then predicts improvements in any of these 25 traits individually or in combinations, through engineering expression of specific monolignol genes. The analysis may lead to greater understanding of other pathways for improved growth and adaptation.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Jack P.; Matthews, Megan L.; Williams, Cranos M.
A multi-omics quantitative integrative analysis of lignin biosynthesis can advance the strategic engineering of wood for timber, pulp, and biofuels. Lignin is polymerized from three monomers (monolignols) produced by a grid-like pathway. The pathway in wood formation of Populus trichocarpa has at least 21 genes, encoding enzymes that mediate 37 reactions on 24 metabolites, leading to lignin and affecting wood properties. We perturb these 21 pathway genes and integrate transcriptomic, proteomic, fluxomic and phenomic data from 221 lines selected from ~2000 transgenics (6-month-old). The integrative analysis estimates how changing expression of pathway gene or gene combination affects protein abundance, metabolic-flux,more » metabolite concentrations, and 25 wood traits, including lignin, tree-growth, density, strength, and saccharification. The analysis then predicts improvements in any of these 25 traits individually or in combinations, through engineering expression of specific monolignol genes. The analysis may lead to greater understanding of other pathways for improved growth and adaptation.« less
Wang, Jack P; Matthews, Megan L; Williams, Cranos M; Shi, Rui; Yang, Chenmin; Tunlaya-Anukit, Sermsawat; Chen, Hsi-Chuan; Li, Quanzi; Liu, Jie; Lin, Chien-Yuan; Naik, Punith; Sun, Ying-Hsuan; Loziuk, Philip L; Yeh, Ting-Feng; Kim, Hoon; Gjersing, Erica; Shollenberger, Todd; Shuford, Christopher M; Song, Jina; Miller, Zachary; Huang, Yung-Yun; Edmunds, Charles W; Liu, Baoguang; Sun, Yi; Lin, Ying-Chung Jimmy; Li, Wei; Chen, Hao; Peszlen, Ilona; Ducoste, Joel J; Ralph, John; Chang, Hou-Min; Muddiman, David C; Davis, Mark F; Smith, Chris; Isik, Fikret; Sederoff, Ronald; Chiang, Vincent L
2018-04-20
A multi-omics quantitative integrative analysis of lignin biosynthesis can advance the strategic engineering of wood for timber, pulp, and biofuels. Lignin is polymerized from three monomers (monolignols) produced by a grid-like pathway. The pathway in wood formation of Populus trichocarpa has at least 21 genes, encoding enzymes that mediate 37 reactions on 24 metabolites, leading to lignin and affecting wood properties. We perturb these 21 pathway genes and integrate transcriptomic, proteomic, fluxomic and phenomic data from 221 lines selected from ~2000 transgenics (6-month-old). The integrative analysis estimates how changing expression of pathway gene or gene combination affects protein abundance, metabolic-flux, metabolite concentrations, and 25 wood traits, including lignin, tree-growth, density, strength, and saccharification. The analysis then predicts improvements in any of these 25 traits individually or in combinations, through engineering expression of specific monolignol genes. The analysis may lead to greater understanding of other pathways for improved growth and adaptation.
Detection, Characterization, and Typing of Shiga Toxin-Producing Escherichia coli.
Parsons, Brendon D; Zelyas, Nathan; Berenger, Byron M; Chui, Linda
2016-01-01
Shiga toxin-producing Escherichia coli (STEC) are responsible for gastrointestinal diseases reported in numerous outbreaks around the world. Given the public health importance of STEC, effective detection, characterization and typing is critical to any medical laboratory system. While non-O157 serotypes account for the majority of STEC infections, frontline microbiology laboratories may only screen for STEC using O157-specific agar-based methods. As a result, non-O157 STEC infections are significantly under-reported. This review discusses recent advances on the detection, characterization and typing of STEC with emphasis on work performed at the Alberta Provincial Laboratory for Public Health (ProvLab). Candidates for the detection of all STEC serotypes include chromogenic agars, enzyme immunoassays (EIA) and quantitative real time polymerase chain reaction (qPCR). Culture methods allow further characterization of isolates, whereas qPCR provides the greatest sensitivity and specificity, followed by EIA. The virulence gene profiles using PCR arrays and stx gene subtypes can subsequently be determined. Different non-O157 serotypes exhibit markedly different virulence gene profiles and a greater prevalence of stx1 than stx2 subtypes compared to O157:H7 isolates. Finally, recent innovations in whole genome sequencing (WGS) have allowed it to emerge as a candidate for the characterization and typing of STEC in diagnostic surveillance isolates. Methods of whole genome analysis such as single nucleotide polymorphisms and k-mer analysis are concordant with epidemiological data and standard typing methods, such as pulsed-field gel electrophoresis and multiple-locus variable number tandem repeat analysis while offering additional strain differentiation. Together these findings highlight improved strategies for STEC detection using currently available systems and the development of novel approaches for future surveillance.
Detection, Characterization, and Typing of Shiga Toxin-Producing Escherichia coli
Parsons, Brendon D.; Zelyas, Nathan; Berenger, Byron M.; Chui, Linda
2016-01-01
Shiga toxin-producing Escherichia coli (STEC) are responsible for gastrointestinal diseases reported in numerous outbreaks around the world. Given the public health importance of STEC, effective detection, characterization and typing is critical to any medical laboratory system. While non-O157 serotypes account for the majority of STEC infections, frontline microbiology laboratories may only screen for STEC using O157-specific agar-based methods. As a result, non-O157 STEC infections are significantly under-reported. This review discusses recent advances on the detection, characterization and typing of STEC with emphasis on work performed at the Alberta Provincial Laboratory for Public Health (ProvLab). Candidates for the detection of all STEC serotypes include chromogenic agars, enzyme immunoassays (EIA) and quantitative real time polymerase chain reaction (qPCR). Culture methods allow further characterization of isolates, whereas qPCR provides the greatest sensitivity and specificity, followed by EIA. The virulence gene profiles using PCR arrays and stx gene subtypes can subsequently be determined. Different non-O157 serotypes exhibit markedly different virulence gene profiles and a greater prevalence of stx1 than stx2 subtypes compared to O157:H7 isolates. Finally, recent innovations in whole genome sequencing (WGS) have allowed it to emerge as a candidate for the characterization and typing of STEC in diagnostic surveillance isolates. Methods of whole genome analysis such as single nucleotide polymorphisms and k-mer analysis are concordant with epidemiological data and standard typing methods, such as pulsed-field gel electrophoresis and multiple-locus variable number tandem repeat analysis while offering additional strain differentiation. Together these findings highlight improved strategies for STEC detection using currently available systems and the development of novel approaches for future surveillance. PMID:27148176
The CRISPR-Cas9 system has revolutionized gene editing both at single genes and in multiplexed loss-of-function screens, thus enabling precise genome-scale identification of genes essential for proliferation and survival of cancer cells. However, previous studies have reported that a gene-independent antiproliferative effect of Cas9-mediated DNA cleavage confounds such measurement of genetic dependency, thereby leading to false-positive results in copy number-amplified regions.
Voigt, Emily A; Haralambieva, Iana H; Larrabee, Beth L; Kennedy, Richard B; Ovsyannikova, Inna G; Schaid, Daniel J; Poland, Gregory A
2018-01-30
Rubella vaccination induces widely variable immune responses in vaccine recipients. While rubella vaccination is effective at inducing immunity to rubella infection in most subjects, up to 5% of individuals do not achieve or maintain long-term protective immunity. To expand upon our previous work identifying genetic polymorphisms that are associated with these interindividual differences in humoral immunity to rubella virus, we performed a genome-wide association study in a large cohort of 1843 subjects to discover single-nucleotide polymorphisms (SNPs) associated with rubella virus-specific cellular immune responses. We identified SNPs in the Wilms tumor protein gene (WT1) that were significantly associated (P < 5 × 10-8) with interindividual variations in rubella-specific interleukin 6 secretion from subjects' peripheral blood mononuclear cells postvaccination. No SNPs were found to be significantly associated with variations in rubella-specific interferon-γ secretion. Our findings demonstrate that genetic polymorphisms in the WT1 gene in subjects of European ancestry are associated with interindividual differences in rubella virus-specific cellular immunity after measles-mumps-rubella II vaccination. © The Author(s) 2017. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail: journals.permissions@oup.com.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kilbane, J.J. II
1993-12-31
IGT has developed a microbial culture of Rhodococcus rhodochrous, designated as IGTS8, that is capable of specifically cleaving carbon-sulfur bonds in a range of organosulfur model compounds and is capable of removing organic sulfur from coal and petroleum without significantly sacrificing the calorific value of the fuel. Although IGTS8 possesses the ability to specifically remove organic sulfur from coal, a major research need is to develop improved strains of microorganisms that possess higher levels of desulfurization activity and therefore will permit more favorable biodesulfurization process conditions: faster rates, more complete removal, and smaller reactor size. Strain improvement is the singlemore » most important aspect to the development of a practical coal biodesulfurization process and accordingly is the focus of research in this project. During the past year, significant progress was made toward improving the biodesulfurization capabilities of Rhodococcus Rhodochrous IGTS8. The main objective was to identify and characterize strong promoters of IGTS8. The DNA sequencing of the promoter region and chloramphenicol resistance gene of pRF2, as well as six mutant promoters, was determined. The 16S structural gene of IGTS8 was isolated and used to identify the putative promoter of this gene. Four promoter probe vectors were constructed and are currently being used to analyze the strength of Rhodococcus promoters: from the IGTS8 genome, mutants of promoters from the chloramphenicol resistance gene of pRF2, the promoter from the 16S RNA gene, and various strong inducible promoters.« less
Plevova, Karla; Francova, Hana Skuhrova; Burckova, Katerina; Brychtova, Yvona; Doubek, Michael; Pavlova, Sarka; Malcikova, Jitka; Mayer, Jiri; Tichy, Boris; Pospisilova, Sarka
2014-01-01
In chronic lymphocytic leukemia, usually a monoclonal disease, multiple productive immunoglobulin heavy chain gene rearrangements are identified sporadically. Prognostication of such cases based on immunoglobulin heavy variable gene mutational status can be problematic, especially if the different rearrangements have discordant mutational status. To gain insight into the possible biological mechanisms underlying the origin of the multiple rearrangements, we performed a comprehensive immunogenetic and immunophenotypic characterization of 31 cases with the multiple rearrangements identified in a cohort of 1147 patients with chronic lymphocytic leukemia. For the majority of cases (25/31), we provide evidence of the co-existence of at least two B lymphocyte clones with a chronic lymphocytic leukemia phenotype. We also identified clonal drifts in serial samples, likely driven by selection forces. More specifically, higher immunoglobulin variable gene identity to germline and longer complementarity determining region 3 were preferred in persistent or newly appearing clones, a phenomenon more pronounced in patients with stereotyped B-cell receptors. Finally, we report that other factors, such as TP53 gene defects and therapy administration, influence clonal selection. Our findings are relevant to clonal evolution in the context of antigen stimulation and transition of monoclonal B-cell lymphocytosis to chronic lymphocytic leukemia. PMID:24038023
NASA Astrophysics Data System (ADS)
Vislova, A.; Aylward, F.; Sosa, O.; DeLong, E.
2016-02-01
Previous work has revealed diel periodicity of gene expression in key metabolic pathways in both autotrophic and heterotrophic microbes in the surface ocean. In this study, we investigated patterns of diel periodicity of gene expression in depth profiles (25, 75, 125 and 250 meters). We postulated that microbial diel transcriptional signals would be increasingly dampened with depth, and that the timing of peak expression of specific transcripts would be shifted in time between depths, in accordance with depth-dependent diel light variability. Bacterioplankton were sampled from four depths every four hours at station ALOHA (22° 45' N 158° W) over 2 days. RNA was extracted from cells preserved on filters, converted to cDNA, and sequenced on the Illumina platform. Surprisingly, harmonic regression analysis revealed an increasing proportion of genes with diel periodic expression patterns with increasing depth between 25- 125 meters. At 250 meters, the proportion of genes exhibiting diel expression patterns decreased an order of magnitude compared to the photic zone. Community composition, functional gene categories, and diel patterns of gene expression were significantly different between the photic zone and 250 meter samples. The signals driving diel periodic gene expression in microbes at 250 meters is under further investigation. These data are now beginning provide a better understanding of the tempo and mode of microbial dynamics among specific taxa, throughout the ocean's interior.
Francy, Donna S.; Stelzer, Erin A.; Duris, Joseph W.; Brady, Amie M.G.; Harrison, John H.; Johnson, Heather E.; Ware, Michael W.
2013-01-01
Predictive models, based on environmental and water quality variables, have been used to improve the timeliness and accuracy of recreational water quality assessments, but their effectiveness has not been studied in inland waters. Sampling at eight inland recreational lakes in Ohio was done in order to investigate using predictive models for Escherichia coli and to understand the links between E. coli concentrations, predictive variables, and pathogens. Based upon results from 21 beach sites, models were developed for 13 sites, and the most predictive variables were rainfall, wind direction and speed, turbidity, and water temperature. Models were not developed at sites where the E. coli standard was seldom exceeded. Models were validated at nine sites during an independent year. At three sites, the model resulted in increased correct responses, sensitivities, and specificities compared to use of the previous day's E. coli concentration (the current method). Drought conditions during the validation year precluded being able to adequately assess model performance at most of the other sites. Cryptosporidium, adenovirus, eaeA (E. coli), ipaH (Shigella), and spvC (Salmonella) were found in at least 20% of samples collected for pathogens at five sites. The presence or absence of the three bacterial genes was related to some of the model variables but was not consistently related to E. coli concentrations. Predictive models were not effective at all inland lake sites; however, their use at two lakes with high swimmer densities will provide better estimates of public health risk than current methods and will be a valuable resource for beach managers and the public.
Francy, Donna S; Stelzer, Erin A; Duris, Joseph W; Brady, Amie M G; Harrison, John H; Johnson, Heather E; Ware, Michael W
2013-03-01
Predictive models, based on environmental and water quality variables, have been used to improve the timeliness and accuracy of recreational water quality assessments, but their effectiveness has not been studied in inland waters. Sampling at eight inland recreational lakes in Ohio was done in order to investigate using predictive models for Escherichia coli and to understand the links between E. coli concentrations, predictive variables, and pathogens. Based upon results from 21 beach sites, models were developed for 13 sites, and the most predictive variables were rainfall, wind direction and speed, turbidity, and water temperature. Models were not developed at sites where the E. coli standard was seldom exceeded. Models were validated at nine sites during an independent year. At three sites, the model resulted in increased correct responses, sensitivities, and specificities compared to use of the previous day's E. coli concentration (the current method). Drought conditions during the validation year precluded being able to adequately assess model performance at most of the other sites. Cryptosporidium, adenovirus, eaeA (E. coli), ipaH (Shigella), and spvC (Salmonella) were found in at least 20% of samples collected for pathogens at five sites. The presence or absence of the three bacterial genes was related to some of the model variables but was not consistently related to E. coli concentrations. Predictive models were not effective at all inland lake sites; however, their use at two lakes with high swimmer densities will provide better estimates of public health risk than current methods and will be a valuable resource for beach managers and the public.
Ander, Bradley P.; Zhang, Xiaoshuai; Xue, Fuzhong; Sharp, Frank R.; Yang, Xiaowei
2013-01-01
The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses face difficulty to incorporate correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with ‘large p, small n’ problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point view of systems biology, iBVS enables user to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilitic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed. PMID:23844055
Peng, Bin; Zhu, Dianwen; Ander, Bradley P; Zhang, Xiaoshuai; Xue, Fuzhong; Sharp, Frank R; Yang, Xiaowei
2013-01-01
The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses face difficulty to incorporate correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with 'large p, small n' problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point view of systems biology, iBVS enables user to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilitic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed.
Haney, Robert A.; Clarke, Thomas H.; Gadgil, Rujuta; Fitzpatrick, Ryan; Hayashi, Cheryl Y.; Ayoub, Nadia A.; Garb, Jessica E.
2016-01-01
Gene duplication and positive selection can be important determinants of the evolution of venom, a protein-rich secretion used in prey capture and defense. In a typical model of venom evolution, gene duplicates switch to venom gland expression and change function under the action of positive selection, which together with further duplication produces large gene families encoding diverse toxins. Although these processes have been demonstrated for individual toxin families, high-throughput multitissue sequencing of closely related venomous species can provide insights into evolutionary dynamics at the scale of the entire venom gland transcriptome. By assembling and analyzing multitissue transcriptomes from the Western black widow spider and two closely related species with distinct venom toxicity phenotypes, we do not find that gene duplication and duplicate retention is greater in gene families with venom gland biased expression in comparison with broadly expressed families. Positive selection has acted on some venom toxin families, but does not appear to be in excess for families with venom gland biased expression. Moreover, we find 309 distinct gene families that have single transcripts with venom gland biased expression, suggesting that the switching of genes to venom gland expression in numerous unrelated gene families has been a dominant mode of evolution. We also find ample variation in protein sequences of venom gland–specific transcripts, lineage-specific family sizes, and ortholog expression among species. This variation might contribute to the variable venom toxicity of these species. PMID:26733576
Geffroy, V; Sicard, D; de Oliveira, J C; Sévignac, M; Cohen, S; Gepts, P; Neema, C; Langin, T; Dron, M
1999-09-01
The recent cloning of plant resistance (R) genes and the sequencing of resistance gene clusters have shed light on the molecular evolution of R genes. However, up to now, no attempt has been made to correlate this molecular evolution with the host-pathogen coevolution process at the population level. Cross-inoculations were carried out between 26 strains of the fungal pathogen Colletotrichum lindemuthianum and 48 Phaseolus vulgaris plants collected in the three centers of diversity of the host species. A high level of diversity for resistance against the pathogen was revealed. Most of the resistance specificities were overcome in sympatric situations, indicating an adaptation of the pathogen to the local host. In contrast, plants were generally resistant to allopatric strains, suggesting that R genes that were efficient against exotic strains but had been overcome locally were maintained in the plant genome. These results indicated that coevolution processes between the two protagonists led to a differentiation for resistance in the three centers of diversity of the host. To improve our understanding of the molecular evolution of these different specificities, a recombinant inbred (RI) population derived from two representative genotypes of the Andean (JaloEEP558) and Mesoamerican (BAT93) gene pools was used to map anthracnose specificities. A gene cluster comprising both Andean (Co-y; Co-z) and Mesoamerican (Co-9) host resistance specificities was identified, suggesting that this locus existed prior to the separation of the two major gene pools of P. vulgaris. Molecular analysis revealed a high level of complexity at this locus. It harbors 11 restriction fragment length polymorphisms when R gene analog (RGA) clones are used. The relationship between the coevolution process and diversification of resistance specificities at resistance gene clusters is discussed.
Brauburger, Kristina; Boehmann, Yannik; Tsuda, Yoshimi; Hoenen, Thomas; Olejnik, Judith; Schümann, Michael; Ebihara, Hideki
2014-01-01
ABSTRACT Ebola virus (EBOV) belongs to the group of nonsegmented negative-sense RNA viruses. The seven EBOV genes are separated by variable gene borders, including short (4- or 5-nucleotide) intergenic regions (IRs), a single long (144-nucleotide) IR, and gene overlaps, where the neighboring gene end and start signals share five conserved nucleotides. The unique structure of the gene overlaps and the presence of a single long IR are conserved among all filoviruses. Here, we sought to determine the impact of the EBOV gene borders during viral transcription. We show that readthrough mRNA synthesis occurs in EBOV-infected cells irrespective of the structure of the gene border, indicating that the gene overlaps do not promote recognition of the gene end signal. However, two consecutive gene end signals at the VP24 gene might improve termination at the VP24-L gene border, ensuring efficient L gene expression. We further demonstrate that the long IR is not essential for but regulates transcription reinitiation in a length-dependent but sequence-independent manner. Mutational analysis of bicistronic minigenomes and recombinant EBOVs showed no direct correlation between IR length and reinitiation rates but demonstrated that specific IR lengths not found naturally in filoviruses profoundly inhibit downstream gene expression. Intriguingly, although truncation of the 144-nucleotide-long IR to 5 nucleotides did not substantially affect EBOV transcription, it led to a significant reduction of viral growth. IMPORTANCE Our current understanding of EBOV transcription regulation is limited due to the requirement for high-containment conditions to study this highly pathogenic virus. EBOV is thought to share many mechanistic features with well-analyzed prototype nonsegmented negative-sense RNA viruses. A single polymerase entry site at the 3′ end of the genome determines that transcription of the genes is mainly controlled by gene order and cis-acting signals found at the gene borders. Here, we examined the regulatory role of the structurally unique EBOV gene borders during viral transcription. Our data suggest that transcriptional regulation in EBOV is highly complex and differs from that in prototype viruses and further the understanding of this most fundamental process in the filovirus replication cycle. Moreover, our results with recombinant EBOVs suggest a novel role of the long IR found in all filovirus genomes during the viral replication cycle. PMID:25142600
Brauburger, Kristina; Boehmann, Yannik; Tsuda, Yoshimi; Hoenen, Thomas; Olejnik, Judith; Schümann, Michael; Ebihara, Hideki; Mühlberger, Elke
2014-11-01
Ebola virus (EBOV) belongs to the group of nonsegmented negative-sense RNA viruses. The seven EBOV genes are separated by variable gene borders, including short (4- or 5-nucleotide) intergenic regions (IRs), a single long (144-nucleotide) IR, and gene overlaps, where the neighboring gene end and start signals share five conserved nucleotides. The unique structure of the gene overlaps and the presence of a single long IR are conserved among all filoviruses. Here, we sought to determine the impact of the EBOV gene borders during viral transcription. We show that readthrough mRNA synthesis occurs in EBOV-infected cells irrespective of the structure of the gene border, indicating that the gene overlaps do not promote recognition of the gene end signal. However, two consecutive gene end signals at the VP24 gene might improve termination at the VP24-L gene border, ensuring efficient L gene expression. We further demonstrate that the long IR is not essential for but regulates transcription reinitiation in a length-dependent but sequence-independent manner. Mutational analysis of bicistronic minigenomes and recombinant EBOVs showed no direct correlation between IR length and reinitiation rates but demonstrated that specific IR lengths not found naturally in filoviruses profoundly inhibit downstream gene expression. Intriguingly, although truncation of the 144-nucleotide-long IR to 5 nucleotides did not substantially affect EBOV transcription, it led to a significant reduction of viral growth. Our current understanding of EBOV transcription regulation is limited due to the requirement for high-containment conditions to study this highly pathogenic virus. EBOV is thought to share many mechanistic features with well-analyzed prototype nonsegmented negative-sense RNA viruses. A single polymerase entry site at the 3' end of the genome determines that transcription of the genes is mainly controlled by gene order and cis-acting signals found at the gene borders. Here, we examined the regulatory role of the structurally unique EBOV gene borders during viral transcription. Our data suggest that transcriptional regulation in EBOV is highly complex and differs from that in prototype viruses and further the understanding of this most fundamental process in the filovirus replication cycle. Moreover, our results with recombinant EBOVs suggest a novel role of the long IR found in all filovirus genomes during the viral replication cycle. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Harkey, Michael A; Villagran, Alexandra M; Venkataraman, Gopalakrishnan M; Leisenring, Wendy M; Hullar, Meredith A J; Torok-Storb, Beverly J
2017-08-01
OBJECTIVE To determine whether specific alleles of candidate genes of the major histocompatibility complex (MHC) and innate immune system were associated with gastric dilatation-volvulus (GDV) in Great Danes. ANIMALS 42 healthy Great Danes (control group) and 39 Great Danes with ≥ 1 GDV episode. PROCEDURES Variable regions of the 2 most polymorphic MHC genes (DLA88 and DRB1) were amplified and sequenced from the dogs in each group. Similarly, regions of 3 genes associated with the innate immune system (TLR5, NOD2, and ATG16L1), which have been linked to inflammatory bowel disease, were amplified and sequenced. Alleles were evaluated for associations with GDV, controlling for age and dog family. RESULTS Specific alleles of genes DLA88, DRB1, and TLR5 were significantly associated with GDV. One allele of each gene had an OR > 2 in the unadjusted univariate analyses and retained a hazard ratio > 2 after controlling for temperament, age, and familial association in the multivariate analysis. CONCLUSIONS AND CLINICAL RELEVANCE The 3 GDV-associated alleles identified in this study may serve as diagnostic markers for identification of Great Danes at risk for GDV. Additional research is needed to determine whether other dog breeds have the same genetic associations. These findings also provided a new target for research into the etiology of, and potential treatments for, GDV in dogs.
Yue, Zongliang; Zheng, Qi; Neylon, Michael T; Yoo, Minjae; Shin, Jimin; Zhao, Zhiying; Tan, Aik Choon
2018-01-01
Abstract Integrative Gene-set, Network and Pathway Analysis (GNPA) is a powerful data analysis approach developed to help interpret high-throughput omics data. In PAGER 1.0, we demonstrated that researchers can gain unbiased and reproducible biological insights with the introduction of PAGs (Pathways, Annotated-lists and Gene-signatures) as the basic data representation elements. In PAGER 2.0, we improve the utility of integrative GNPA by significantly expanding the coverage of PAGs and PAG-to-PAG relationships in the database, defining a new metric to quantify PAG data qualities, and developing new software features to simplify online integrative GNPA. Specifically, we included 84 282 PAGs spanning 24 different data sources that cover human diseases, published gene-expression signatures, drug–gene, miRNA–gene interactions, pathways and tissue-specific gene expressions. We introduced a new normalized Cohesion Coefficient (nCoCo) score to assess the biological relevance of genes inside a PAG, and RP-score to rank genes and assign gene-specific weights inside a PAG. The companion web interface contains numerous features to help users query and navigate the database content. The database content can be freely downloaded and is compatible with third-party Gene Set Enrichment Analysis tools. We expect PAGER 2.0 to become a major resource in integrative GNPA. PAGER 2.0 is available at http://discovery.informatics.uab.edu/PAGER/. PMID:29126216
Palmer, Jeffrey D.; Adams, Keith L.; Cho, Yangrae; Parkinson, Christopher L.; Qiu, Yin-Long; Song, Keming
2000-01-01
We summarize our recent studies showing that angiosperm mitochondrial (mt) genomes have experienced remarkably high rates of gene loss and concomitant transfer to the nucleus and of intron acquisition by horizontal transfer. Moreover, we find substantial lineage-specific variation in rates of these structural mutations and also point mutations. These findings mostly arise from a Southern blot survey of gene and intron distribution in 281 diverse angiosperms. These blots reveal numerous losses of mt ribosomal protein genes but, with one exception, only rare loss of respiratory genes. Some lineages of angiosperms have kept all of their mt ribosomal protein genes whereas others have lost most of them. These many losses appear to reflect remarkably high (and variable) rates of functional transfer of mt ribosomal protein genes to the nucleus in angiosperms. The recent transfer of cox2 to the nucleus in legumes provides both an example of interorganellar gene transfer in action and a starting point for discussion of the roles of mechanistic and selective forces in determining the distribution of genetic labor between organellar and nuclear genomes. Plant mt genomes also acquire sequences by horizontal transfer. A striking example of this is a homing group I intron in the mt cox1 gene. This extraordinarily invasive mobile element has probably been acquired over 1,000 times separately during angiosperm evolution via a recent wave of cross-species horizontal transfers. Finally, whereas all previously examined angiosperm mtDNAs have low rates of synonymous substitutions, mtDNAs of two distantly related angiosperms have highly accelerated substitution rates. PMID:10860957
Sautto, Giuseppe; Mancini, Nicasio; Solforosi, Laura; Diotti, Roberta A; Clementi, Massimo; Burioni, Roberto
2012-01-01
The association between hepatitis C virus (HCV) infection and type II mixed cryoglobulinemia (MCII) is well established, but the role played by distinct HCV proteins and by specific components of the anti-HCV humoral immune response remains to be clearly defined. It is widely accepted that HCV drives the expansion of few B-cell clones expressing a restricted pool of selected immunoglobulin variable (IgV) gene subfamilies frequently endowed with rheumatoid factor (RF) activity. Moreover, the same IgV subfamilies are frequently observed in HCV-transformed malignant B-cell clones occasionally complicating MCII. In this paper, we analyze both the humoral and viral counterparts at the basis of cryoglobulins production in HCV-induced MCII, with particular attention reserved to the single IgV subfamilies most frequently involved.
Sautto, Giuseppe; Mancini, Nicasio; Solforosi, Laura; Diotti, Roberta A.; Clementi, Massimo; Burioni, Roberto
2012-01-01
The association between hepatitis C virus (HCV) infection and type II mixed cryoglobulinemia (MCII) is well established, but the role played by distinct HCV proteins and by specific components of the anti-HCV humoral immune response remains to be clearly defined. It is widely accepted that HCV drives the expansion of few B-cell clones expressing a restricted pool of selected immunoglobulin variable (IgV) gene subfamilies frequently endowed with rheumatoid factor (RF) activity. Moreover, the same IgV subfamilies are frequently observed in HCV-transformed malignant B-cell clones occasionally complicating MCII. In this paper, we analyze both the humoral and viral counterparts at the basis of cryoglobulins production in HCV-induced MCII, with particular attention reserved to the single IgV subfamilies most frequently involved. PMID:22690241
Long, Nguyen Phuoc; Jung, Kyung Hee; Yoon, Sang Jun; Anh, Nguyen Hoang; Nghi, Tran Diem; Kang, Yun Pyo; Yan, Hong Hua; Min, Jung Eun; Hong, Soon-Sun; Kwon, Sung Won
2017-12-12
Although many outstanding achievements in the management of cervical cancer (CxCa) have obtained, it still imposes a major burden which has prompted scientists to discover and validate new CxCa biomarkers to improve the diagnostic and prognostic assessment of CxCa. In this study, eight different gene expression data sets containing 202 cancer, 115 cervical intraepithelial neoplasia (CIN), and 105 normal samples were utilized for an integrative systems biology assessment in a multi-stage carcinogenesis manner. Deep learning-based diagnostic models were established based on the genetic panels of intrinsic genes of cervical carcinogenesis as well as on the unbiased variable selection approach. Survival analysis was also conducted to explore the potential biomarker candidates for prognostic assessment. Our results showed that cell cycle, RNA transport, mRNA surveillance, and one carbon pool by folate were the key regulatory mechanisms involved in the initiation, progression, and metastasis of CxCa. Various genetic panels combined with machine learning algorithms successfully differentiated CxCa from CIN and normalcy in cross-study normalized data sets. In particular, the 168-gene deep learning model for the differentiation of cancer from normalcy achieved an externally validated accuracy of 97.96% (99.01% sensitivity and 95.65% specificity). Survival analysis revealed that ZNF281 and EPHB6 were the two most promising prognostic genetic markers for CxCa among others. Our findings open new opportunities to enhance current understanding of the characteristics of CxCa pathobiology. In addition, the combination of transcriptomics-based signatures and deep learning classification may become an important approach to improve CxCa diagnosis and management in clinical practice.
Long, Nguyen Phuoc; Jung, Kyung Hee; Yoon, Sang Jun; Anh, Nguyen Hoang; Nghi, Tran Diem; Kang, Yun Pyo; Yan, Hong Hua; Min, Jung Eun; Hong, Soon-Sun; Kwon, Sung Won
2017-01-01
Although many outstanding achievements in the management of cervical cancer (CxCa) have obtained, it still imposes a major burden which has prompted scientists to discover and validate new CxCa biomarkers to improve the diagnostic and prognostic assessment of CxCa. In this study, eight different gene expression data sets containing 202 cancer, 115 cervical intraepithelial neoplasia (CIN), and 105 normal samples were utilized for an integrative systems biology assessment in a multi-stage carcinogenesis manner. Deep learning-based diagnostic models were established based on the genetic panels of intrinsic genes of cervical carcinogenesis as well as on the unbiased variable selection approach. Survival analysis was also conducted to explore the potential biomarker candidates for prognostic assessment. Our results showed that cell cycle, RNA transport, mRNA surveillance, and one carbon pool by folate were the key regulatory mechanisms involved in the initiation, progression, and metastasis of CxCa. Various genetic panels combined with machine learning algorithms successfully differentiated CxCa from CIN and normalcy in cross-study normalized data sets. In particular, the 168-gene deep learning model for the differentiation of cancer from normalcy achieved an externally validated accuracy of 97.96% (99.01% sensitivity and 95.65% specificity). Survival analysis revealed that ZNF281 and EPHB6 were the two most promising prognostic genetic markers for CxCa among others. Our findings open new opportunities to enhance current understanding of the characteristics of CxCa pathobiology. In addition, the combination of transcriptomics-based signatures and deep learning classification may become an important approach to improve CxCa diagnosis and management in clinical practice. PMID:29312619
Detection of aberrant methylation of a six-gene panel in serum DNA for diagnosis of breast cancer
Li, Junnan; Li, Xiaobo; Wang, Dong; Su, Yonghui; Niu, Ming; Zhong, Zhenbin; Wang, Ji; Zhang, Xianyu; Kang, Wenli; Pang, Da
2016-01-01
Detection of breast cancer at an early stage is the key for successful treatment and improvement of outcome. However the limitations of mammography are well recognized, especially for those women with premenopausal breast cancer. Novel approaches to breast cancer screening are necessary, especially in the developing world where mammography is not feasible. In this study, we examined the promoter methylation of six genes (SFN, P16, hMLH1, HOXD13, PCDHGB7 and RASSF1a) in circulating free DNA (cfDNA) extracted from serum. We used a high-throughput DNA methylation assay (MethyLight) to examine serum from 749 cases including breast cancer patients, patients with benign breast diseases and healthy women. The six-gene methylation panel test achieved 79.6% and 82.4% sensitivity with a specificity of 72.4% and 78.1% in diagnosis of breast cancer when compared with healthy and benign disease controls, respectively. Moreover, the methylation panel positive group showed significant differences in the following independent variables: (a) involvement of family history of tumors; (b) a low proliferative index, ki-67; (c) high ratios in luminal subtypes. Additionally the panel also complemented some breast cancer cases which were neglected by mammography or ultrasound. These data suggest that epigenetic markers in serum have potential for diagnosis of breast cancer. PMID:26918343
Larsen, Peter E; Cseke, Leland J; Miller, R Michael; Collart, Frank R
2014-10-21
Rising atmospheric levels of carbon dioxide and ozone will impact productivity and carbon sequestration in forest ecosystems. The scale of this process and the potential economic consequences provide an incentive for the development of models to predict the types and rates of ecosystem responses and feedbacks that result from and influence of climate change. In this paper, we use phenotypic and molecular data derived from the Aspen Free Air CO2 Enrichment site (Aspen-FACE) to evaluate modeling approaches for ecosystem responses to changing conditions. At FACE, it was observed that different aspen clones exhibit clone-specific responses to elevated atmospheric levels of carbon dioxide and ozone. To identify the molecular basis for these observations, we used artificial neural networks (ANN) to examine above and below-ground community phenotype responses to elevated carbon dioxide, elevated ozone and gene expression profiles. The aspen community models generated using this approach identified specific genes and subnetworks of genes associated with variable sensitivities for aspen clones. The ANN model also predicts specific co-regulated gene clusters associated with differential sensitivity to elevated carbon dioxide and ozone in aspen species. The results suggest ANN is an effective approach to predict relevant gene expression changes resulting from environmental perturbation and provides useful information for the rational design of future biological experiments. Copyright © 2014 Elsevier Ltd. All rights reserved.
Diverse hematological phenotypes of β-thalassemia carriers.
Luo, Hong-Yuan; Chui, David H K
2016-03-01
Most β-thalassemia carriers have mild anemia, low mean corpuscular volume and mean corpuscular hemoglobin, and elevated hemoglobin α2 (HbA2 ). However, there is considerable variability resulting from coinheritance with α- and/or δ-globin gene mutations, dominant inheritance of β-thalassemia mutations, highly unstable variant globin chains, large deletions removing part or all of the β-globin gene cluster, loss of heterozygosity of the β-globin gene cluster during development, or concomitant erythroid enzyme or membrane protein abnormalities. Recognition of the specific abnormality and correct diagnosis can allay anxiety and unnecessary investigation, help formulate treatment programs, and deliver appropriate genetic and family counseling. © 2016 New York Academy of Sciences.
Quéméneur, Marianne; Heinrich-Salmeron, Audrey; Muller, Daniel; Lièvremont, Didier; Jauzein, Michel; Bertin, Philippe N; Garrido, Francis; Joulian, Catherine
2008-07-01
A new primer set was designed to specifically amplify ca. 1,100 bp of aoxB genes encoding the As(III) oxidase catalytic subunit from taxonomically diverse aerobic As(III)-oxidizing bacteria. Comparative analysis of AoxB protein sequences showed variable conservation levels and highlighted the conservation of essential amino acids and structural motifs. AoxB phylogeny of pure strains showed well-discriminated taxonomic groups and was similar to 16S rRNA phylogeny. Alphaproteobacteria-, Betaproteobacteria-, and Gammaproteobacteria-related sequences were retrieved from environmental surveys, demonstrating their prevalence in mesophilic As-contaminated soils. Our study underlines the usefulness of the aoxB gene as a functional marker of aerobic As(III) oxidizers.
Pazhamala, Lekha T; Purohit, Shilp; Saxena, Rachit K; Garg, Vanika; Krishnamurthy, L; Verdier, Jerome; Varshney, Rajeev K
2017-04-01
Pigeonpea (Cajanus cajan) is an important grain legume of the semi-arid tropics, mainly used for its protein rich seeds. To link the genome sequence information with agronomic traits resulting from specific developmental processes, a Cajanus cajan gene expression atlas (CcGEA) was developed using the Asha genotype. Thirty tissues/organs representing developmental stages from germination to senescence were used to generate 590.84 million paired-end RNA-Seq data. The CcGEA revealed a compendium of 28 793 genes with differential, specific, spatio-temporal and constitutive expression during various stages of development in different tissues. As an example to demonstrate the application of the CcGEA, a network of 28 flower-related genes analysed for cis-regulatory elements and splicing variants has been identified. In addition, expression analysis of these candidate genes in male sterile and male fertile genotypes suggested their critical role in normal pollen development leading to seed formation. Gene network analysis also identified two regulatory genes, a pollen-specific SF3 and a sucrose-proton symporter, that could have implications for improvement of agronomic traits such as seed production and yield. In conclusion, the CcGEA provides a valuable resource for pigeonpea to identify candidate genes involved in specific developmental processes and to understand the well-orchestrated growth and developmental process in this resilient crop. © The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology.
USDA-ARS?s Scientific Manuscript database
In recent years, the scientific literature has become replete with examples of the improvement of abiotic stress tolerance by overexpression of specific genes. Few studies, however, have evaluated transgenic plants under field conditions or the impact of overexpression on non-target traits. We pre...
Quantitative genetic analysis of agronomic and morphological traits in sorghum, Sorghum bicolor
Mohammed, Riyazaddin; Are, Ashok K.; Bhavanasi, Ramaiah; Munghate, Rajendra S.; Kavi Kishor, Polavarapu B.; Sharma, Hari C.
2015-01-01
The productivity in sorghum is low, owing to various biotic and abiotic constraints. Combining insect resistance with desirable agronomic and morphological traits is important to increase sorghum productivity. Therefore, it is important to understand the variability for various agronomic traits, their heritabilities and nature of gene action to develop appropriate strategies for crop improvement. Therefore, a full diallel set of 10 parents and their 90 crosses including reciprocals were evaluated in replicated trials during the 2013–14 rainy and postrainy seasons. The crosses between the parents with early- and late-flowering flowered early, indicating dominance of earliness for anthesis in the test material used. Association between the shoot fly resistance, morphological, and agronomic traits suggested complex interactions between shoot fly resistance and morphological traits. Significance of the mean sum of squares for GCA (general combining ability) and SCA (specific combining ability) of all the studied traits suggested the importance of both additive and non-additive components in inheritance of these traits. The GCA/SCA, and the predictability ratios indicated predominance of additive gene effects for majority of the traits studied. High broad-sense and narrow-sense heritability estimates were observed for most of the morphological and agronomic traits. The significance of reciprocal combining ability effects for days to 50% flowering, plant height and 100 seed weight, suggested maternal effects for inheritance of these traits. Plant height and grain yield across seasons, days to 50% flowering, inflorescence exsertion, and panicle shape in the postrainy season showed greater specific combining ability variance, indicating the predominance of non-additive type of gene action/epistatic interactions in controlling the expression of these traits. Additive gene action in the rainy season, and dominance in the postrainy season for days to 50% flowering and plant height suggested G X E interactions for these traits. PMID:26579183
Hodgetts, Jennifer; Boonham, Neil; Mumford, Rick; Dickinson, Matthew
2009-01-01
Primers and probes based on the 23S rRNA gene have been utilized to design a range of real-time PCR assays for routine phytoplasma diagnostics. These assays have been authenticated as phytoplasma specific and shown to be at least as sensitive as nested PCR. A universal assay to detect all phytoplasmas has been developed, along with a multiplex assay to discriminate 16SrI group phytoplasmas from members of all of the other 16Sr groups. Assays for the 16SrII, 16SrIV, and 16SrXII groups have also been developed to confirm that the 23S rRNA gene can be used to design group-specific assays. PMID:19270148
Genetic Algorithms Applied to Multi-Objective Aerodynamic Shape Optimization
NASA Technical Reports Server (NTRS)
Holst, Terry L.
2004-01-01
A genetic algorithm approach suitable for solving multi-objective optimization problems is described and evaluated using a series of aerodynamic shape optimization problems. Several new features including two variations of a binning selection algorithm and a gene-space transformation procedure are included. The genetic algorithm is suitable for finding pareto optimal solutions in search spaces that are defined by any number of genes and that contain any number of local extrema. A new masking array capability is included allowing any gene or gene subset to be eliminated as decision variables from the design space. This allows determination of the effect of a single gene or gene subset on the pareto optimal solution. Results indicate that the genetic algorithm optimization approach is flexible in application and reliable. The binning selection algorithms generally provide pareto front quality enhancements and moderate convergence efficiency improvements for most of the problems solved.
Genetic Algorithms Applied to Multi-Objective Aerodynamic Shape Optimization
NASA Technical Reports Server (NTRS)
Holst, Terry L.
2005-01-01
A genetic algorithm approach suitable for solving multi-objective problems is described and evaluated using a series of aerodynamic shape optimization problems. Several new features including two variations of a binning selection algorithm and a gene-space transformation procedure are included. The genetic algorithm is suitable for finding Pareto optimal solutions in search spaces that are defined by any number of genes and that contain any number of local extrema. A new masking array capability is included allowing any gene or gene subset to be eliminated as decision variables from the design space. This allows determination of the effect of a single gene or gene subset on the Pareto optimal solution. Results indicate that the genetic algorithm optimization approach is flexible in application and reliable. The binning selection algorithms generally provide Pareto front quality enhancements and moderate convergence efficiency improvements for most of the problems solved.
Norman, Paul J.; Parham, Peter
2012-01-01
Pinnipeds, marine carnivores, diverged from terrestrial carnivores ~45 million years ago, before their adaptation to marine environments. This lifestyle change exposed pinnipeds to different microbiota and pathogens, with probable impact on their MHC class I genes. Investigating this question, genomic sequences were determined for 71 MHC class I variants: 27 from harbor seal and 44 from gray seal. These variants form three MHC class I gene lineages, one comprising a pseudogene. The second, a candidate nonclassical MHC class I gene, comprises a nonpolymorphic transcribed gene related to dog DLA-79 and giant panda Aime-1906. The third is the diversity lineage, which includes 62 of the 71 seal MHC class I variants. All are transcribed, and they minimally represent six harbor and 12 gray seal MHC class I genes. Besides species-specific differences in gene number, seal MHC class I haplotypes exhibit gene content variation and allelic polymorphism. Patterns of sequence variation, and of positions for positively selected sites, indicate the diversity lineage genes are the seals’ classical MHC class I genes. Evidence that expansion of diversity lineage genes began before gray and harbor seals diverged is the presence in both species of two distinctive sublineages of diversity lineage genes. Pointing to further expansion following the divergence are the presence of species-specific genes and greater MHC class I diversity in gray seals than harbor seals. The elaboration of a complex variable family of classical MHC class I genes in pinnipeds contrasts with the single, highly polymorphic classical MHC class I gene of dog and giant panda, terrestrial carnivores. PMID:23001684
A New Chicken Genome Assembly Provides Insight into Avian Genome Structure.
Warren, Wesley C; Hillier, LaDeana W; Tomlinson, Chad; Minx, Patrick; Kremitzki, Milinn; Graves, Tina; Markovic, Chris; Bouk, Nathan; Pruitt, Kim D; Thibaud-Nissen, Francoise; Schneider, Valerie; Mansour, Tamer A; Brown, C Titus; Zimin, Aleksey; Hawken, Rachel; Abrahamsen, Mitch; Pyrkosz, Alexis B; Morisson, Mireille; Fillon, Valerie; Vignal, Alain; Chow, William; Howe, Kerstin; Fulton, Janet E; Miller, Marcia M; Lovell, Peter; Mello, Claudio V; Wirthlin, Morgan; Mason, Andrew S; Kuo, Richard; Burt, David W; Dodgson, Jerry B; Cheng, Hans H
2017-01-05
The importance of the Gallus gallus (chicken) as a model organism and agricultural animal merits a continuation of sequence assembly improvement efforts. We present a new version of the chicken genome assembly (Gallus_gallus-5.0; GCA_000002315.3), built from combined long single molecule sequencing technology, finished BACs, and improved physical maps. In overall assembled bases, we see a gain of 183 Mb, including 16.4 Mb in placed chromosomes with a corresponding gain in the percentage of intact repeat elements characterized. Of the 1.21 Gb genome, we include three previously missing autosomes, GGA30, 31, and 33, and improve sequence contig length 10-fold over the previous Gallus_gallus-4.0. Despite the significant base representation improvements made, 138 Mb of sequence is not yet located to chromosomes. When annotated for gene content, Gallus_gallus-5.0 shows an increase of 4679 annotated genes (2768 noncoding and 1911 protein-coding) over those in Gallus_gallus-4.0. We also revisited the question of what genes are missing in the avian lineage, as assessed by the highest quality avian genome assembly to date, and found that a large fraction of the original set of missing genes are still absent in sequenced bird species. Finally, our new data support a detailed map of MHC-B, encompassing two segments: one with a highly stable gene copy number and another in which the gene copy number is highly variable. The chicken model has been a critical resource for many other fields of study, and this new reference assembly will substantially further these efforts. Copyright © 2017 Warren et al.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gihring, Thomas; Green, Stefan; Schadt, Christopher Warren
2011-01-01
Technologies for massively parallel sequencing are revolutionizing microbial ecology and are vastly increasing the scale of ribosomal RNA (rRNA) gene studies. Although pyrosequencing has increased the breadth and depth of possible rRNA gene sampling, one drawback is that the number of reads obtained per sample is difficult to control. Pyrosequencing libraries typically vary widely in the number of sequences per sample, even within individual studies, and there is a need to revisit the behaviour of richness estimators and diversity indices with variable gene sequence library sizes. Multiple reports and review papers have demonstrated the bias in non-parametric richness estimators (e.g.more » Chao1 and ACE) and diversity indices when using clone libraries. However, we found that biased community comparisons are accumulating in the literature. Here we demonstrate the effects of sample size on Chao1, ACE, CatchAll, Shannon, Chao-Shen and Simpson's estimations specifically using pyrosequencing libraries. The need to equalize the number of reads being compared across libraries is reiterated, and investigators are directed towards available tools for making unbiased diversity comparisons.« less
Plankton networks driving carbon export in the oligotrophic ocean
Larhlimi, Abdelhalim; Roux, Simon; Darzi, Youssef; Audic, Stephane; Berline, Léo; Brum, Jennifer; Coelho, Luis Pedro; Espinoza, Julio Cesar Ignacio; Malviya, Shruti; Sunagawa, Shinichi; Dimier, Céline; Kandels-Lewis, Stefanie; Picheral, Marc; Poulain, Julie; Searson, Sarah; Stemmann, Lars; Not, Fabrice; Hingamp, Pascal; Speich, Sabrina; Follows, Mick; Karp-Boss, Lee; Boss, Emmanuel; Ogata, Hiroyuki; Pesant, Stephane; Weissenbach, Jean; Wincker, Patrick; Acinas, Silvia G.; Bork, Peer; de Vargas, Colomban; Iudicone, Daniele; Sullivan, Matthew B.; Raes, Jeroen; Karsenti, Eric; Bowler, Chris; Gorsky, Gabriel
2015-01-01
The biological carbon pump is the process by which CO2 is transformed to organic carbon via photosynthesis, exported through sinking particles, and finally sequestered in the deep ocean. While the intensity of the pump correlates with plankton community composition, the underlying ecosystem structure driving the process remains largely uncharacterised. Here we use environmental and metagenomic data gathered during the Tara Oceans expedition to improve our understanding of carbon export in the oligotrophic ocean. We show that specific plankton communities, from the surface and deep chlorophyll maximum, correlate with carbon export at 150 m and highlight unexpected taxa such as Radiolaria, alveolate parasites, as well as Synechococcus and their phages, as lineages most strongly associated with carbon export in the subtropical, nutrient-depleted, oligotrophic ocean. Additionally, we show that the relative abundance of just a few bacterial and viral genes can predict most of the variability in carbon export in these regions. PMID:26863193
Plankton networks driving carbon export in the oligotrophic ocean
NASA Astrophysics Data System (ADS)
2016-04-01
The biological carbon pump is the process by which CO2 is transformed to organic carbon via photosynthesis, exported through sinking particles, and finally sequestered in the deep ocean. While the intensity of the pump correlates with plankton community composition, the underlying ecosystem structure driving the process remains largely uncharacterized. Here we use environmental and metagenomic data gathered during the Tara Oceans expedition to improve our understanding of carbon export in the oligotrophic ocean. We show that specific plankton communities, from the surface and deep chlorophyll maximum, correlate with carbon export at 150 m and highlight unexpected taxa such as Radiolaria and alveolate parasites, as well as Synechococcus and their phages, as lineages most strongly associated with carbon export in the subtropical, nutrient-depleted, oligotrophic ocean. Additionally, we show that the relative abundance of a few bacterial and viral genes can predict a significant fraction of the variability in carbon export in these regions.
Zhang, Xu; Zhang, Wei
2016-06-01
Cytosine modification on DNA is variable among individuals, which could correlate with gene expression variation. The effect of cytosine modification on interindividual transcript isoform variation (TIV), however, remains unclear. In this study, we assessed the extent of cytosine modification-specific TIV in lymphoblastoid cell lines (LCLs) derived from unrelated individuals of European and African descent. Our study detected cytosine modification-specific TIVs for 17% of the analyzed genes at a 5% false discovery rate. Forty-five percent of the TIV-associated cytosine modifications correlated with the overall gene expression levels as well, with the corresponding CpG sites overrepresented in transcript initiation sites, transcription factor binding sites, and distinct histone modification peaks, suggesting that alternative isoform transcription underlies the TIVs. Our analysis also revealed 33% of the TIV-associated cytosine modifications that affected specific exons, with the corresponding CpG sites overrepresented in exon/intron junctions, splicing branching points, and transcript termination sites, implying that the TIVs are attributable to alternative splicing or transcription termination. Genetic and epigenetic regulation of TIV shared target preference but exerted independent effects on 61% of the common exon targets. Cytosine modification-specific TIVs detected from LCLs were differentially enriched in those detected from various tissues in The Cancer Genome Atlas, indicating their developmental dependency. Genes containing cytosine modification-specific TIVs were enriched in pathways of cancers and metabolic disorders. Our study demonstrated a prominent effect of cytosine modification variation on the transcript isoform spectrum over gross transcript abundance and revealed epigenetic contributions to diseases that were mediated through cytosine modification-specific TIV. Copyright © 2016 by the Genetics Society of America.
Biasogram: Visualization of Confounding Technical Bias in Gene Expression Data
Krzystanek, Marcin; Szallasi, Zoltan; Eklund, Aron C.
2013-01-01
Gene expression profiles of clinical cohorts can be used to identify genes that are correlated with a clinical variable of interest such as patient outcome or response to a particular drug. However, expression measurements are susceptible to technical bias caused by variation in extraneous factors such as RNA quality and array hybridization conditions. If such technical bias is correlated with the clinical variable of interest, the likelihood of identifying false positive genes is increased. Here we describe a method to visualize an expression matrix as a projection of all genes onto a plane defined by a clinical variable and a technical nuisance variable. The resulting plot indicates the extent to which each gene is correlated with the clinical variable or the technical variable. We demonstrate this method by applying it to three clinical trial microarray data sets, one of which identified genes that may have been driven by a confounding technical variable. This approach can be used as a quality control step to identify data sets that are likely to yield false positive results. PMID:23613961
Fischetto, Giuseppe; Bermon, Stéphane
2013-10-01
During the last 2 decades, progress in deciphering the human gene map as well as the discovery of specific defective genes encoding particular proteins in some serious human diseases have resulted in attempts to treat sick patients with gene therapy. There has been considerable focus on human recombinant proteins which were gene-engineered and produced in vitro (insulin, growth hormone, insulin-like growth factor-1, erythropoietin). Unfortunately, these substances and methods also became improper tools for unscrupulous athletes. Biomedical research has focused on the possible direct insertion of gene material into the body, in order to replace some defective genes in vivo and/or to promote long-lasting endogenous synthesis of deficient proteins. Theoretically, diabetes, anaemia, muscular dystrophies, immune deficiency, cardiovascular diseases and numerous other illnesses could benefit from such innovative biomedical research, though much work remains to be done. Considering recent findings linking specific genotypes and physical performance, it is tempting to submit the young athletic population to genetic screening or, alternatively, to artificial gene expression modulation. Much research is already being conducted in order to achieve a safe transfer of genetic material to humans. This is of critical importance since uncontrolled production of the specifically coded protein, with serious secondary adverse effects (polycythaemia, acute cardiovascular problems, cancer, etc.), could occur. Other unpredictable reactions (immunogenicity of vectors or DNA-vector complex, autoimmune anaemia, production of wild genetic material) also remain possible at the individual level. Some new substances (myostatin blockers or anti-myostatin antibodies), although not gene material, might represent a useful and well-tolerated treatment to prevent progression of muscular dystrophies. Similarly, other molecules, in the roles of gene or metabolic activators [5-aminoimidazole-4-carboxamide 1-β-D-ribofuranoside (AICAR), GW1516], might concomitantly improve endurance exercise capacity in ischaemic conditions but also in normal conditions. Undoubtedly, some athletes will attempt to take advantage of these new molecules to increase strength or endurance. Antidoping laboratories are improving detection methods. These are based both on direct identification of new substances or their metabolites and on indirect evaluation of changes in gene, protein or metabolite patterns (genomics, proteomics or metabolomics).
Schlump, Jan-Ulrich; Stein, Anja; Hehr, Ute; Karen, Tanja; Möller-Hartmann, Claudia; Elcioglu, Nursel H; Bogdanova, Nadja; Woike, Hartmut Fritz; Lohmann, Dietmar R; Felderhoff-Mueser, Ursula; Linz, Annette; Wieczorek, Dagmar
2012-11-01
Treacher Collins syndrome (TCS) is the most common and well-known mandibulofacial dysostosis caused by mutations in at least three genes involved in pre-rRNA transcription, the TCOF1, POLR1D and POLR1C genes. We present a severely affected male individual with TCS with a heterozygous de novo frameshift mutation within the TCOF1 gene (c.790_791delAG,p.Ser264GlnfsX7) and compare the clinical findings with three previously unpublished, milder affected individuals from two families with the same mutation. We elucidate typical clinical features of TCS and its clinical implications for the paediatrician and mandibulofacial surgeon, especially in severely affected individuals and give a short review of the literature. The clinical data of these three families illustrate that the phenotype associated with this specific mutation has a wide intra- and interfamilial variability, which confirms that variable expressivity in carriers of TCOF1 mutations is not a simple consequence of the mutation but might be modified by the combination of genetic, environmental and stochastic factors. Being such a highly complex disease treatment of individuals with TCS should be tailored to the specific needs of each individual, preferably by a multidisciplinary team consisting of paediatricians, craniofacial surgeons and geneticists.
Ventura, Marco; Canchaya, Carlos; Meylan, Valèrie; Klaenhammer, Todd R.; Zink, Ralf
2003-01-01
We analyzed the tuf gene, encoding elongation factor Tu, from 33 strains representing 17 Lactobacillus species and 8 Bifidobacterium species. The tuf sequences were aligned and used to infer phylogenesis among species of lactobacilli and bifidobacteria. We demonstrated that the synonymous substitution affecting this gene renders elongation factor Tu a reliable molecular clock for investigating evolutionary distances of lactobacilli and bifidobacteria. In fact, the phylogeny generated by these tuf sequences is consistent with that derived from 16S rRNA analysis. The investigation of a multiple alignment of tuf sequences revealed regions conserved among strains belonging to the same species but distinct from those of other species. PCR primers complementary to these regions allowed species-specific identification of closely related species, such as Lactobacillus casei group members. These tuf gene-based assays developed in this study provide an alternative to present methods for the identification for lactic acid bacterial species. Since a variable number of tuf genes have been described for bacteria, the presence of multiple genes was examined. Southern analysis revealed one tuf gene in the genomes of lactobacilli and bifidobacteria, but the tuf gene was arranged differently in the genomes of these two taxa. Our results revealed that the tuf gene in bifidobacteria is flanked by the same gene constellation as the str operon, as originally reported for Escherichia coli. In contrast, bioinformatic and transcriptional analyses of the DNA region flanking the tuf gene in four Lactobacillus species indicated the same four-gene unit and suggested a novel tuf operon specific for the genus Lactobacillus. PMID:14602655
Asgeirsdóttir, Sigridur A; Talman, Eduard G; de Graaf, Inge A; Kamps, Jan A A M; Satchell, Simon C; Mathieson, Peter W; Ruiters, Marcel H J; Molema, Grietje
2010-01-25
Applications of small-interfering RNA (siRNA) call for specific and efficient delivery of siRNA into particular cell types. We developed a novel, non-viral targeting system to deliver siRNA specifically into inflammation-activated endothelial cells. This was achieved by conjugating the cationic amphiphilic lipid SAINT to antibodies recognizing the inflammatory cell adhesion molecule E-selectin. These anti-E-selectin-SAINT lipoplexes (SAINTarg) maintained antigen recognition capacity of the parental antibody in vitro, and ex vivo in human kidney tissue slices subjected to inflammatory conditions. Regular SAINT mediated transfection resulted in efficient gene silencing in human microvascular endothelial cells (HMEC-1) and conditionally immortalized glomerular endothelial cells (ciGEnC). However, primary human umbilical vein endothelial cells (HUVEC) transfected poorly, a phenomenon that we could quantitatively correlate with a cell-type specific capacity to facilitate siRNA uptake. Importantly, SAINTarg increased siRNA uptake and transfection specificity for activated endothelial cells. Transfection with SAINTarg delivered significantly more siRNA into activated HUVEC, compared to transfection with non-targeted SAINT. The enhanced uptake of siRNA was corroborated by improved silencing of both gene- and protein expression of VE-cadherin in activated HUVEC, indicating that SAINTarg delivered functionally active siRNA into endothelial cells. The obtained results demonstrate a successful design of a small nucleotide carrier system with improved and specific siRNA delivery into otherwise difficult-to-transfect primary endothelial cells, which in addition reduced considerably the amount of siRNA needed for gene silencing. Copyright 2009 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shakoor, N; Nair, R; Crasta, O
2014-01-23
Background: Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. Results: This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specificmore » probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e. g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Conclusions: Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community.« less
Preliminary definition of improvement in juvenile arthritis.
Giannini, E H; Ruperto, N; Ravelli, A; Lovell, D J; Felson, D T; Martini, A
1997-07-01
To identify a core set of outcome variables for the assessment of children with juvenile arthritis (JA), to use the core set to develop a definition of improvement to determine whether individual patients demonstrate clinically important improvement, and to promote this definition as a single efficacy measure in JA clinical trials. A core set of outcome variables was established using a combination of statistical and consensus formation techniques. Variables in the core set consisted of 1) physician global assessment of disease activity; 2) parent/patient assessment of overall well-being; 3) functional ability; 4) number of joints with active arthritis; 5) number of joints with limited range of motion; and 6) erythrocyte sedimentation rate. To establish a definition of improvement using this core set, 21 pediatric rheumatologists from 14 countries met, and, using consensus formation techniques, scored each of 72 patient profiles as improved or not improved. Using the physicians' consensus as the gold standard, the chi-square, sensitivity, and specificity were calculated for each of 240 possible definitions of improvement. Definitions with sensitivity or specificity of <80% were eliminated. The ability of the remaining definitions to discriminate between the effects of active agent and those of placebo, using actual trial data, was then observed. Each definition was also ranked for face validity, and the sum of the ranks was then multiplied by the kappa statistic. The definition of improvement with the highest final score was as follows: at least 30% improvement from baseline in 3 of any 6 variables in the core set, with no more than 1 of the remaining variables worsening by >30%. The second highest scoring definition was closely related to the first; the third highest was similar to the Paulus criteria used in adult rheumatoid arthritis trials, except with different variables. This indicates convergent validity of the process used. We propose a definition of improvement for JA. Use of a uniform definition will help standardize the conduct and reporting of clinical trials, and should help practitioners decide if a child with JA has responded adequately to therapy. We are in the process of prospectively validating this definition and several others that scored highly.
O'Mara, Leigh A.; Gangadhara, Sailaja; McQuoid, Monica; Zhang, Xiugen; Zheng, Rui; Gill, Kiran; Verma, Meena; Yu, Tianwei; Johnson, Brent; Li, Bing; Derdeyn, Cynthia A.; Ibegbu, Chris; Altman, John D.; Hunter, Eric; Feinberg, Mark B.
2012-01-01
Modified vaccinia virus Ankara (MVA) is a safe, attenuated orthopoxvirus that is being developed as a vaccine vector but has demonstrated limited immunogenicity in several early-phase clinical trials. Our objective was to rationally improve the immunogenicity of MVA-based HIV/AIDS vaccines via the targeted deletion of specific poxvirus immune-modulatory genes. Vaccines expressing codon-optimized HIV subtype C consensus Env and Gag antigens were generated from MVA vector backbones that (i) harbor simultaneous deletions of four viral immune-modulatory genes, encoding an interleukin-18 (IL-18) binding protein, an IL-1β receptor, a dominant negative Toll/IL-1 signaling adapter, and CC-chemokine binding protein (MVAΔ4-HIV); (ii) harbor a deletion of an additional (fifth) viral gene, encoding uracil-DNA glycosylase (MVAΔ5-HIV); or (iii) represent the parental MVA backbone as a control (MVA-HIV). We performed head-to-head comparisons of the cellular and humoral immune responses that were elicited by these vectors during homologous prime-boost immunization regimens utilizing either high-dose (2 × 108 PFU) or low-dose (1 × 107 PFU) intramuscular immunization of rhesus macaques. At all time points, a majority of the HIV-specific T cell responses, elicited by all vectors, were directed against Env, rather than Gag, determinants, as previously observed with other vector systems. Both modified vectors elicited up to 6-fold-higher frequencies of HIV-specific CD8 and CD4 T cell responses and up to 25-fold-higher titers of Env (gp120)-specific binding (nonneutralizing) antibody responses that were relatively transient in nature. While the correlates of protection against HIV infection remain incompletely defined, our results indicate that the rational deletion of specific genes from MVA vectors can positively alter their cellular and humoral immunogenicity profiles in nonhuman primates. PMID:22973033
NETWORK ASSISTED ANALYSIS TO REVEAL THE GENETIC BASIS OF AUTISM1
Liu, Li; Lei, Jing; Roeder, Kathryn
2016-01-01
While studies show that autism is highly heritable, the nature of the genetic basis of this disorder remains illusive. Based on the idea that highly correlated genes are functionally interrelated and more likely to affect risk, we develop a novel statistical tool to find more potentially autism risk genes by combining the genetic association scores with gene co-expression in specific brain regions and periods of development. The gene dependence network is estimated using a novel partial neighborhood selection (PNS) algorithm, where node specific properties are incorporated into network estimation for improved statistical and computational efficiency. Then we adopt a hidden Markov random field (HMRF) model to combine the estimated network and the genetic association scores in a systematic manner. The proposed modeling framework can be naturally extended to incorporate additional structural information concerning the dependence between genes. Using currently available genetic association data from whole exome sequencing studies and brain gene expression levels, the proposed algorithm successfully identified 333 genes that plausibly affect autism risk. PMID:27134692
Origins of extrinsic variability in eukaryotic gene expression
NASA Astrophysics Data System (ADS)
Volfson, Dmitri; Marciniak, Jennifer; Blake, William J.; Ostroff, Natalie; Tsimring, Lev S.; Hasty, Jeff
2006-02-01
Variable gene expression within a clonal population of cells has been implicated in a number of important processes including mutation and evolution, determination of cell fates and the development of genetic disease. Recent studies have demonstrated that a significant component of expression variability arises from extrinsic factors thought to influence multiple genes simultaneously, yet the biological origins of this extrinsic variability have received little attention. Here we combine computational modelling with fluorescence data generated from multiple promoter-gene inserts in Saccharomyces cerevisiae to identify two major sources of extrinsic variability. One unavoidable source arising from the coupling of gene expression with population dynamics leads to a ubiquitous lower limit for expression variability. A second source, which is modelled as originating from a common upstream transcription factor, exemplifies how regulatory networks can convert noise in upstream regulator expression into extrinsic noise at the output of a target gene. Our results highlight the importance of the interplay of gene regulatory networks with population heterogeneity for understanding the origins of cellular diversity.
Origins of extrinsic variability in eukaryotic gene expression
NASA Astrophysics Data System (ADS)
Volfson, Dmitri; Marciniak, Jennifer; Blake, William J.; Ostroff, Natalie; Tsimring, Lev S.; Hasty, Jeff
2006-03-01
Variable gene expression within a clonal population of cells has been implicated in a number of important processes including mutation and evolution, determination of cell fates and the development of genetic disease. Recent studies have demonstrated that a significant component of expression variability arises from extrinsic factors thought to influence multiple genes in concert, yet the biological origins of this extrinsic variability have received little attention. Here we combine computational modeling with fluorescence data generated from multiple promoter-gene inserts in Saccharomyces cerevisiae to identify two major sources of extrinsic variability. One unavoidable source arising from the coupling of gene expression with population dynamics leads to a ubiquitous noise floor in expression variability. A second source which is modeled as originating from a common upstream transcription factor exemplifies how regulatory networks can convert noise in upstream regulator expression into extrinsic noise at the output of a target gene. Our results highlight the importance of the interplay of gene regulatory networks with population heterogeneity for understanding the origins of cellular diversity.
Foggetti, Giorgia; Raimondi, Ivan; Campomenosi, Paola; Menichini, Paola
2014-01-01
TP63 is a member of the TP53 gene family that encodes for up to ten different TA and ΔN isoforms through alternative promoter usage and alternative splicing. Besides being a master regulator of gene expression for squamous epithelial proliferation, differentiation and maintenance, P63, through differential expression of its isoforms, plays important roles in tumorigenesis. All P63 isoforms share an immunoglobulin-like folded DNA binding domain responsible for binding to sequence-specific response elements (REs), whose overall consensus sequence is similar to that of the canonical p53 RE. Using a defined assay in yeast, where P63 isoforms and RE sequences are the only variables, and gene expression assays in human cell lines, we demonstrated that human TA- and ΔN-P63α proteins exhibited differences in transactivation specificity not observed with the corresponding P73 or P53 protein isoforms. These differences 1) were dependent on specific features of the RE sequence, 2) could be related to intrinsic differences in their oligomeric state and cooperative DNA binding, and 3) appeared to be conserved in evolution. Since genotoxic stress can change relative ratio of TA- and ΔN-P63α protein levels, the different transactivation specificity of each P63 isoform could potentially influence cellular responses to specific stresses. PMID:24926492
Variables and Strategies in Development of Therapeutic Post-Transcriptional Gene Silencing Agents
Sullivan, Jack M.; Yau, Edwin H.; Kolniak, Tiffany A.; Sheflin, Lowell G.; Taggart, R. Thomas; Abdelmaksoud, Heba E.
2011-01-01
Post-transcriptional gene silencing (PTGS) agents such as ribozymes, RNAi and antisense have substantial potential for gene therapy of human retinal degenerations. These technologies are used to knockdown a specific target RNA and its cognate protein. The disease target mRNA may be a mutant mRNA causing an autosomal dominant retinal degeneration or a normal mRNA that is overexpressed in certain diseases. All PTGS technologies depend upon the initial critical annealing event of the PTGS ligand to the target RNA. This event requires that the PTGS agent is in a conformational state able to support hybridization and that the target have a large and accessible single-stranded platform to allow rapid annealing, although such platforms are rare. We address the biocomplexity that currently limits PTGS therapeutic development with particular emphasis on biophysical variables that influence cellular performance. We address the different strategies that can be used for development of PTGS agents intended for therapeutic translation. These issues apply generally to the development of PTGS agents for retinal, ocular, or systemic diseases. This review should assist the interested reader to rapidly appreciate critical variables in PTGS development and facilitate initial design and testing of such agents against new targets of clinical interest. PMID:21785698
Jangam, Sujit R; Hayter, Gary; Dunn, Timothy C
2018-02-01
Glycemic variability refers to oscillations in blood glucose within a day and differences in blood glucose at the same time on different days. Glycemic variability is linked to hypoglycemia and hyperglycemia. The relationship among these three important metrics is examined here, specifically to show how reduction in both hypo- and hyperglycemia risk is dependent on changes in variability. To understand the importance of glycemic variability in the simultaneous reduction of hypoglycemia and hyperglycemia risk, we introduce the glycemic risk plot-estimated HbA1c % (eA1c) vs. minutes below 70 mg/dl (MB70) with constant variability contours for predicting post-intervention risks in the absence of a change in glycemic variability. The glycemic risk plot illustrates that individuals who do not reduce glycemic variability improve one of the two metrics (hypoglycemia risk or hyperglycemia risk) at the cost of the other. It is important to reduce variability to improve both risks. These results were confirmed by data collected in a randomized controlled trial consisting of individuals with type 1 and type 2 diabetes on insulin therapy. For type 1, a total of 28 individuals out of 35 (80%) showed improvement in at least one of the risks (hypo and/or hyper) during the 100-day course of the study. Seven individuals (20%) showed improvement in both. Similar data were observed for type 2 where a total of 36 individuals out of 43 (84%) showed improvement in at least one risk and 8 individuals (19%) showed improvement in both. All individuals in the study who showed improvement in both hypoglycemia and hyperglycemia risk also showed a reduction in variability. Therapy changes intended to improve an individual's hypoglycemia or hyperglycemia risk often result in the reduction of one risk at the expense of another. It is important to improve glucose variability to reduce both risks or at least maintain one risk while reducing the other. Abbott Diabetes Care.
Catecholaminergic systems in stress: structural and molecular genetic approaches.
Kvetnansky, Richard; Sabban, Esther L; Palkovits, Miklos
2009-04-01
Stressful stimuli evoke complex endocrine, autonomic, and behavioral responses that are extremely variable and specific depending on the type and nature of the stressors. We first provide a short overview of physiology, biochemistry, and molecular genetics of sympatho-adrenomedullary, sympatho-neural, and brain catecholaminergic systems. Important processes of catecholamine biosynthesis, storage, release, secretion, uptake, reuptake, degradation, and transporters in acutely or chronically stressed organisms are described. We emphasize the structural variability of catecholamine systems and the molecular genetics of enzymes involved in biosynthesis and degradation of catecholamines and transporters. Characterization of enzyme gene promoters, transcriptional and posttranscriptional mechanisms, transcription factors, gene expression and protein translation, as well as different phases of stress-activated transcription and quantitative determination of mRNA levels in stressed organisms are discussed. Data from catecholamine enzyme gene knockout mice are shown. Interaction of catecholaminergic systems with other neurotransmitter and hormonal systems are discussed. We describe the effects of homotypic and heterotypic stressors, adaptation and maladaptation of the organism, and the specificity of stressors (physical, emotional, metabolic, etc.) on activation of catecholaminergic systems at all levels from plasma catecholamines to gene expression of catecholamine enzymes. We also discuss cross-adaptation and the effect of novel heterotypic stressors on organisms adapted to long-term monotypic stressors. The extra-adrenal nonneuronal adrenergic system is described. Stress-related central neuronal regulatory circuits and central organization of responses to various stressors are presented with selected examples of regulatory molecular mechanisms. Data summarized here indicate that catecholaminergic systems are activated in different ways following exposure to distinct stressful stimuli.
Gene set analysis using variance component tests.
Huang, Yen-Tsung; Lin, Xihong
2013-06-28
Gene set analyses have become increasingly important in genomic research, as many complex diseases are contributed jointly by alterations of numerous genes. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). We develop a gene set analyses method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and global test in both simulation and a diabetes microarray data.
Partial least squares based identification of Duchenne muscular dystrophy specific genes.
An, Hui-bo; Zheng, Hua-cheng; Zhang, Li; Ma, Lin; Liu, Zheng-yan
2013-11-01
Large-scale parallel gene expression analysis has provided a greater ease for investigating the underlying mechanisms of Duchenne muscular dystrophy (DMD). Previous studies typically implemented variance/regression analysis, which would be fundamentally flawed when unaccounted sources of variability in the arrays existed. Here we aim to identify genes that contribute to the pathology of DMD using partial least squares (PLS) based analysis. We carried out PLS-based analysis with two datasets downloaded from the Gene Expression Omnibus (GEO) database to identify genes contributing to the pathology of DMD. Except for the genes related to inflammation, muscle regeneration and extracellular matrix (ECM) modeling, we found some genes with high fold change, which have not been identified by previous studies, such as SRPX, GPNMB, SAT1, and LYZ. In addition, downregulation of the fatty acid metabolism pathway was found, which may be related to the progressive muscle wasting process. Our results provide a better understanding for the downstream mechanisms of DMD.
Sun, Dongbo; Shi, Hongyan; Chen, Jianfei; Shi, Da; Zhu, Qinghe; Zhang, Hong; Liu, Shengwang; Wang, Yunfeng; Qiu, Huaji; Feng, Li
2012-06-01
Porcine aminopeptidase N (pAPN) is a common cellular receptor for swine transmissible gastroenteritis virus (TGEV) and porcine epidemic diarrhea virus (PEDV). To investigate single-chain fragment variable (scFv) repertoire against pAPN, the genes encoding the immunoglobulin light chain variable region (VL) and heavy chain variable region (VH) were amplified by reverse transcript polymerase chain reaction (RT-PCR) using a series of degenerate primers from the spleen of BABL/c mice immunized with native pAPN. The VL and VH amplicons were combined randomly by a 12 amino acid flexible linker by splicing by overlap extension PCR (SOE-PCR), which produced the scFv gene repertoire. After ligation of the scFv gene repertoire into the T7Select10-3b vector, a mouse scFv phage library specific for pAPN was produced through in vitro packaging. The primary scFv library against pAPN contained 2.0×10(7) recombinant phage clones, and the titer of the amplified library was 3.6×10(9)pfu/mL. BstNI restriction analysis and DNA sequencing revealed that 28 phage clones from the primary pAPN scFv library showed excellent diversity. The effectiveness of the scFv library against pAPN was verified further by phage ELISA using the recombinant protein of the pAPN C subunit as coating antigen. The construction and evaluation of a murine scFv library against the common receptor pAPN of porcine coronaviruses TGEV and PEDV using the T7 phage display system are described. Copyright © 2012 Elsevier B.V. All rights reserved.
Short and long-term genome stability analysis of prokaryotic genomes.
Brilli, Matteo; Liò, Pietro; Lacroix, Vincent; Sagot, Marie-France
2013-05-08
Gene organization dynamics is actively studied because it provides useful evolutionary information, makes functional annotation easier and often enables to characterize pathogens. There is therefore a strong interest in understanding the variability of this trait and the possible correlations with life-style. Two kinds of events affect genome organization: on one hand translocations and recombinations change the relative position of genes shared by two genomes (i.e. the backbone gene order); on the other, insertions and deletions leave the backbone gene order unchanged but they alter the gene neighborhoods by breaking the syntenic regions. A complete picture about genome organization evolution therefore requires to account for both kinds of events. We developed an approach where we model chromosomes as graphs on which we compute different stability estimators; we consider genome rearrangements as well as the effect of gene insertions and deletions. In a first part of the paper, we fit a measure of backbone gene order conservation (hereinafter called backbone stability) against phylogenetic distance for over 3000 genome comparisons, improving existing models for the divergence in time of backbone stability. Intra- and inter-specific comparisons were treated separately to focus on different time-scales. The use of multiple genomes of a same species allowed to identify genomes with diverging gene order with respect to their conspecific. The inter-species analysis indicates that pathogens are more often unstable with respect to non-pathogens. In a second part of the text, we show that in pathogens, gene content dynamics (insertions and deletions) have a much more dramatic effect on genome organization stability than backbone rearrangements. In this work, we studied genome organization divergence taking into account the contribution of both genome order rearrangements and genome content dynamics. By studying species with multiple sequenced genomes available, we were able to explore genome organization stability at different time-scales and to find significant differences for pathogen and non-pathogen species. The output of our framework also allows to identify the conserved gene clusters and/or partial occurrences thereof, making possible to explore how gene clusters assembled during evolution.
Production, characteristics and applications of the cell-bound phytase of Pichia anomala.
Vohra, Ashima; Kaur, Parvinder; Satyanarayana, T
2011-01-01
Among several yeasts isolated from dried flowers of Woodfordia fruticosa, Pichia anomala produced a high titre of cell-bound phytase. The optimization of fermentation variables led to formulation of media and selection of cultural variables that supported enhanced phytase production. The enzyme productivity was very high in fed batch fermentation in air-lift fermentor as compared to that in stirred tank fermentor. Amelioration in the cell-bound phytase activity was observed when yeast cells were permeabilized with Triton-X-100. The enzyme is thermostable and acid stable with broad substrate specificity, the characteristics that are desirable for enzymes to be used in the animal feed industry. The phytase-encoding gene was cloned and sequenced. The 3D structure of the enzyme was proposed by comparative modeling using phytase of Debaryomyces occidentalis (50% sequence identity) as template. When broiler chicks, and fresh water and marine fishes were fed with the feed supplemented with yeast biomass containing phytase, improvement in growth and phosphorus retention, and decrease in the excretion of phosphorus in the faeces were recorded. The cell-bound phytase of P. anomala could effectively dephytinize wheat flour and soymilk.
Dobbyn, Amanda; Huckins, Laura M; Boocock, James; Sloofman, Laura G; Glicksberg, Benjamin S; Giambartolomei, Claudia; Hoffman, Gabriel E; Perumal, Thanneer M; Girdhar, Kiran; Jiang, Yan; Raj, Towfique; Ruderfer, Douglas M; Kramer, Robin S; Pinto, Dalila; Akbarian, Schahram; Roussos, Panos; Domenici, Enrico; Devlin, Bernie; Sklar, Pamela; Stahl, Eli A; Sieberts, Solveig K
2018-06-07
Causal genes and variants within genome-wide association study (GWAS) loci can be identified by integrating GWAS statistics with expression quantitative trait loci (eQTL) and determining which variants underlie both GWAS and eQTL signals. Most analyses, however, consider only the marginal eQTL signal, rather than dissect this signal into multiple conditionally independent signals for each gene. Here we show that analyzing conditional eQTL signatures, which could be important under specific cellular or temporal contexts, leads to improved fine mapping of GWAS associations. Using genotypes and gene expression levels from post-mortem human brain samples (n = 467) reported by the CommonMind Consortium (CMC), we find that conditional eQTL are widespread; 63% of genes with primary eQTL also have conditional eQTL. In addition, genomic features associated with conditional eQTL are consistent with context-specific (e.g., tissue-, cell type-, or developmental time point-specific) regulation of gene expression. Integrating the 2014 Psychiatric Genomics Consortium schizophrenia (SCZ) GWAS and CMC primary and conditional eQTL data reveals 40 loci with strong evidence for co-localization (posterior probability > 0.8), including six loci with co-localization of conditional eQTL. Our co-localization analyses support previously reported genes, identify novel genes associated with schizophrenia risk, and provide specific hypotheses for their functional follow-up. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Combining Evidence of Preferential Gene-Tissue Relationships from Multiple Sources
Guo, Jing; Hammar, Mårten; Öberg, Lisa; Padmanabhuni, Shanmukha S.; Bjäreland, Marcus; Dalevi, Daniel
2013-01-01
An important challenge in drug discovery and disease prognosis is to predict genes that are preferentially expressed in one or a few tissues, i.e. showing a considerably higher expression in one tissue(s) compared to the others. Although several data sources and methods have been published explicitly for this purpose, they often disagree and it is not evident how to retrieve these genes and how to distinguish true biological findings from those that are due to choice-of-method and/or experimental settings. In this work we have developed a computational approach that combines results from multiple methods and datasets with the aim to eliminate method/study-specific biases and to improve the predictability of preferentially expressed human genes. A rule-based score is used to merge and assign support to the results. Five sets of genes with known tissue specificity were used for parameter pruning and cross-validation. In total we identify 3434 tissue-specific genes. We compare the genes of highest scores with the public databases: PaGenBase (microarray), TiGER (EST) and HPA (protein expression data). The results have 85% overlap to PaGenBase, 71% to TiGER and only 28% to HPA. 99% of our predictions have support from at least one of these databases. Our approach also performs better than any of the databases on identifying drug targets and biomarkers with known tissue-specificity. PMID:23950964
Gender-specific responses to climate variability in a semi-arid ecosystem in northern Benin.
Dah-Gbeto, Afiavi P; Villamor, Grace B
2016-12-01
Highly erratic rainfall patterns in northern Benin complicate the ability of rural farmers to engage in subsistence agriculture. This research explores gender-specific responses to climate variability in the context of agrarian Benin through a household survey (n = 260) and an experimental gaming exercise among a subset of the survey respondents. Although men and women from the sample population are equally aware of climate variability and share similar coping strategies, their specific land-use strategies, preferences, and motivations are distinct. Over the long term, these differences would likely lead to dissimilar coping strategies and vulnerability to the effects of climate change. Examination of gender-specific land-use responses to climate change and anticipatory learning can enhance efforts to improve adaptability and resilience among rural subsistence farmers.
Aging Shapes the Population-Mean and -Dispersion of Gene Expression in Human Brains
Brinkmeyer-Langford, Candice L.; Guan, Jinting; Ji, Guoli; Cai, James J.
2016-01-01
Human aging is associated with cognitive decline and an increased risk of neurodegenerative disease. Our objective for this study was to evaluate potential relationships between age and variation in gene expression across different regions of the brain. We analyzed the Genotype-Tissue Expression (GTEx) data from 54 to 101 tissue samples across 13 brain regions in post-mortem donors of European descent aged between 20 and 70 years at death. After accounting for the effects of covariates and hidden confounding factors, we identified 1446 protein-coding genes whose expression in one or more brain regions is correlated with chronological age at a false discovery rate of 5%. These genes are involved in various biological processes including apoptosis, mRNA splicing, amino acid biosynthesis, and neurotransmitter transport. The distribution of these genes among brain regions is uneven, suggesting variable regional responses to aging. We also found that the aging response of many genes, e.g., TP37 and C1QA, depends on individuals' genotypic backgrounds. Finally, using dispersion-specific analysis, we identified genes such as IL7R, MS4A4E, and TERF1/TERF2 whose expressions are differentially dispersed by aging, i.e., variances differ between age groups. Our results demonstrate that age-related gene expression is brain region-specific, genotype-dependent, and associated with both mean and dispersion changes. Our findings provide a foundation for more sophisticated gene expression modeling in the studies of age-related neurodegenerative diseases. PMID:27536236
Database of cattle candidate genes and genetic markers for milk production and mastitis
Ogorevc, J; Kunej, T; Razpet, A; Dovc, P
2009-01-01
A cattle database of candidate genes and genetic markers for milk production and mastitis has been developed to provide an integrated research tool incorporating different types of information supporting a genomic approach to study lactation, udder development and health. The database contains 943 genes and genetic markers involved in mammary gland development and function, representing candidates for further functional studies. The candidate loci were drawn on a genetic map to reveal positional overlaps. For identification of candidate loci, data from seven different research approaches were exploited: (i) gene knockouts or transgenes in mice that result in specific phenotypes associated with mammary gland (143 loci); (ii) cattle QTL for milk production (344) and mastitis related traits (71); (iii) loci with sequence variations that show specific allele-phenotype interactions associated with milk production (24) or mastitis (10) in cattle; (iv) genes with expression profiles associated with milk production (207) or mastitis (107) in cattle or mouse; (v) cattle milk protein genes that exist in different genetic variants (9); (vi) miRNAs expressed in bovine mammary gland (32) and (vii) epigenetically regulated cattle genes associated with mammary gland function (1). Fourty-four genes found by multiple independent analyses were suggested as the most promising candidates and were further in silico analysed for expression levels in lactating mammary gland, genetic variability and top biological functions in functional networks. A miRNA target search for mammary gland expressed miRNAs identified 359 putative binding sites in 3′UTRs of candidate genes. PMID:19508288
Katikireddi, S. Vittal; Valles, Sean A
2015-01-01
The categorization of variables can stigmatize populations, which is ethically problematic and threatens the central purpose of public health: to improve population health and reduce health inequities. How social variables (e.g., behavioral risks for HIV) are categorized can reinforce stigma and cause unintended harms to the populations practitioners and researchers strive to serve. Although debates about the validity or ethical consequences of epidemiological variables are familiar for specific variables (e.g., ethnicity), these issues apply more widely. We argue that these tensions and debates regarding epidemiological variables should be analyzed simultaneously as ethical and epistemic challenges. We describe a framework derived from the philosophy of science that may be usefully applied to public health, and we illustrate its application. PMID:25393193
Khan, Arif O.; Aldahmesh, Mohammed A.; Alkuraya, Fowzan S.
2015-01-01
Purpose: To assess for phenotype-genotype correlations in families with recessive pediatric cataract and identified gene mutations. Methods: Retrospective review (2004 through 2013) of 26 Saudi Arabian apparently nonsyndromic pediatric cataract families referred to one of the authors (A.O.K.) and for which recessive gene mutations were identified. Results: Fifteen different homozygous recessive gene mutations were identified in the 26 consanguineous families; two genes and five families are novel to this study. Ten families had a founder CRYBB1 deletion (all with bilateral central pulverulent cataract), two had the same missense mutation in CRYAB (both with bilateral juvenile cataract with marked variable expressivity), and two had different mutations in FYCO1 (both with bilateral posterior capsular abnormality). The remaining 12 families each had mutations in 12 different genes (CRYAA, CRYBA1, AKR1E2, AGK, BFSP2, CYP27A1, CYP51A1, EPHA2, GCNT2, LONP1, RNLS, WDR87) with unique phenotypes noted for CYP27A1 (bilateral juvenile fleck with anterior and/or posterior capsular cataract and later cerebrotendinous xanthomatosis), EPHA2 (bilateral anterior persistent fetal vasculature), and BFSP2 (bilateral flecklike with cloudy cortex). Potential carrier signs were documented for several families. Conclusions: In this recessive pediatric cataract case series most identified genes are noncrystallin. Recessive pediatric cataract phenotypes are generally nonspecific, but some notable phenotypes are distinct and associated with specific gene mutations. Marked variable expressivity can occur from a recessive missense CRYAB mutation. Genetic analysis of apparently isolated pediatric cataract can sometimes uncover mutations in a syndromic gene. Some gene mutations seem to be associated with apparent heterozygous carrier signs. PMID:26622071
Antonescu, Cristina R; Viale, Agnes; Sarran, Lisa; Tschernyavsky, Sylvia J; Gonen, Mithat; Segal, Neil H; Maki, Robert G; Socci, Nicholas D; DeMatteo, Ronald P; Besmer, Peter
2004-05-15
Gastrointestinal stromal tumors (GISTs) are specific KIT expressing and KIT-signaling driven mesenchymal tumors of the human digestive tract, many of which have KIT-activating mutations. Previous studies have found a relatively homogeneous gene expression profile in GIST, as compared with other histological types of sarcomas. Transcriptional heterogeneity within clinically or molecularly defined subsets of GISTs has not been previously reported. We tested the hypothesis that the gene expression profile in GISTs might be related to KIT genotype and possibly to other clinicopathological factors. An HG-U133A Affymetrix chip (22,000 genes) platform was used to determine the variability of gene expression in 28 KIT-expressing GIST samples from 24 patients. A control group of six intra-abdominal leiomyosarcomas was also included for comparison. Statistical analyses (t tests) were performed to identify discriminatory gene lists among various GIST subgroups. The levels of expression of various GIST subsets were also linked to a modified version of the growth factor/KIT signaling pathway to analyze differences at various steps in signal transduction. Genes involved in KIT signaling were differentially expressed among wild-type and mutant GISTs. High gene expression of potential drug targets, such as VEGF, MCSF, and BCL2 in the wild-type group, and Mesothelin in exon 9 GISTs were found. There was a striking difference in gene expression between stomach and small bowel GISTs. This finding was validated in four separate tumors, two gastric and two intestinal, from a patient with familial GIST with a germ-line KIT W557R substitution. GISTs have heterogeneous gene expression depending on KIT genotype and tumor location, which is seen at both the genomic level and the KIT signaling pathway in particular. These findings may explain their variable clinical behavior and response to therapy.
Zhang, Han; Rokas, Antonis; Slot, Jason C
2012-01-01
Dermatophyte fungi of the family Arthrodermataceae (Eurotiomycetes) colonize keratinized tissue, such as skin, frequently causing superficial mycoses in humans and other mammals, reptiles, and birds. Competition with native microflora likely underlies the propensity of these dermatophytes to produce a diversity of antibiotics and compounds for scavenging iron, which is extremely scarce, as well as the presence of an unusually large number of putative secondary metabolism gene clusters, most of which contain non-ribosomal peptide synthetases (NRPS), in their genomes. To better understand the historical origins and diversification of NRPS-containing gene clusters we examined the evolution of a variable locus (VL) that exists in one of three alternative conformations among the genomes of seven dermatophyte species. The first conformation of the VL (termed VLA) contains only 539 base pairs of sequence and lacks protein-coding genes, whereas the other two conformations (termed VLB and VLC) span 36 Kb and 27 Kb and contain 12 and 10 genes, respectively. Interestingly, both VLB and VLC appear to contain distinct secondary metabolism gene clusters; VLB contains a NRPS gene as well as four porphyrin metabolism genes never found to be physically linked in the genomes of 128 other fungal species, whereas VLC also contains a NRPS gene as well as several others typically found associated with secondary metabolism gene clusters. Phylogenetic evidence suggests that the VL locus was present in the ancestor of all seven species achieving its present distribution through subsequent differential losses or retentions of specific conformations. We propose that the existence of variable loci, similar to the one we studied, in fungal genomes could potentially explain the dramatic differences in secondary metabolic diversity between closely related species of filamentous fungi, and contribute to host adaptation and the generation of metabolic diversity.
Cell-Type Specific Features of Circular RNA Expression
Salzman, Julia; Chen, Raymond E.; Olsen, Mari N.; Wang, Peter L.; Brown, Patrick O.
2013-01-01
Thousands of loci in the human and mouse genomes give rise to circular RNA transcripts; at many of these loci, the predominant RNA isoform is a circle. Using an improved computational approach for circular RNA identification, we found widespread circular RNA expression in Drosophila melanogaster and estimate that in humans, circular RNA may account for 1% as many molecules as poly(A) RNA. Analysis of data from the ENCODE consortium revealed that the repertoire of genes expressing circular RNA, the ratio of circular to linear transcripts for each gene, and even the pattern of splice isoforms of circular RNAs from each gene were cell-type specific. These results suggest that biogenesis of circular RNA is an integral, conserved, and regulated feature of the gene expression program. PMID:24039610
The genome sequence of taurine cattle: a window to ruminant biology and evolution.
Elsik, Christine G; Tellam, Ross L; Worley, Kim C; Gibbs, Richard A; Muzny, Donna M; Weinstock, George M; Adelson, David L; Eichler, Evan E; Elnitski, Laura; Guigó, Roderic; Hamernik, Debora L; Kappes, Steve M; Lewin, Harris A; Lynn, David J; Nicholas, Frank W; Reymond, Alexandre; Rijnkels, Monique; Skow, Loren C; Zdobnov, Evgeny M; Schook, Lawrence; Womack, James; Alioto, Tyler; Antonarakis, Stylianos E; Astashyn, Alex; Chapple, Charles E; Chen, Hsiu-Chuan; Chrast, Jacqueline; Câmara, Francisco; Ermolaeva, Olga; Henrichsen, Charlotte N; Hlavina, Wratko; Kapustin, Yuri; Kiryutin, Boris; Kitts, Paul; Kokocinski, Felix; Landrum, Melissa; Maglott, Donna; Pruitt, Kim; Sapojnikov, Victor; Searle, Stephen M; Solovyev, Victor; Souvorov, Alexandre; Ucla, Catherine; Wyss, Carine; Anzola, Juan M; Gerlach, Daniel; Elhaik, Eran; Graur, Dan; Reese, Justin T; Edgar, Robert C; McEwan, John C; Payne, Gemma M; Raison, Joy M; Junier, Thomas; Kriventseva, Evgenia V; Eyras, Eduardo; Plass, Mireya; Donthu, Ravikiran; Larkin, Denis M; Reecy, James; Yang, Mary Q; Chen, Lin; Cheng, Ze; Chitko-McKown, Carol G; Liu, George E; Matukumalli, Lakshmi K; Song, Jiuzhou; Zhu, Bin; Bradley, Daniel G; Brinkman, Fiona S L; Lau, Lilian P L; Whiteside, Matthew D; Walker, Angela; Wheeler, Thomas T; Casey, Theresa; German, J Bruce; Lemay, Danielle G; Maqbool, Nauman J; Molenaar, Adrian J; Seo, Seongwon; Stothard, Paul; Baldwin, Cynthia L; Baxter, Rebecca; Brinkmeyer-Langford, Candice L; Brown, Wendy C; Childers, Christopher P; Connelley, Timothy; Ellis, Shirley A; Fritz, Krista; Glass, Elizabeth J; Herzig, Carolyn T A; Iivanainen, Antti; Lahmers, Kevin K; Bennett, Anna K; Dickens, C Michael; Gilbert, James G R; Hagen, Darren E; Salih, Hanni; Aerts, Jan; Caetano, Alexandre R; Dalrymple, Brian; Garcia, Jose Fernando; Gill, Clare A; Hiendleder, Stefan G; Memili, Erdogan; Spurlock, Diane; Williams, John L; Alexander, Lee; Brownstein, Michael J; Guan, Leluo; Holt, Robert A; Jones, Steven J M; Marra, Marco A; Moore, Richard; Moore, Stephen S; Roberts, Andy; Taniguchi, Masaaki; Waterman, Richard C; Chacko, Joseph; Chandrabose, Mimi M; Cree, Andy; Dao, Marvin Diep; Dinh, Huyen H; Gabisi, Ramatu Ayiesha; Hines, Sandra; Hume, Jennifer; Jhangiani, Shalini N; Joshi, Vandita; Kovar, Christie L; Lewis, Lora R; Liu, Yih-Shin; Lopez, John; Morgan, Margaret B; Nguyen, Ngoc Bich; Okwuonu, Geoffrey O; Ruiz, San Juana; Santibanez, Jireh; Wright, Rita A; Buhay, Christian; Ding, Yan; Dugan-Rocha, Shannon; Herdandez, Judith; Holder, Michael; Sabo, Aniko; Egan, Amy; Goodell, Jason; Wilczek-Boney, Katarzyna; Fowler, Gerald R; Hitchens, Matthew Edward; Lozado, Ryan J; Moen, Charles; Steffen, David; Warren, James T; Zhang, Jingkun; Chiu, Readman; Schein, Jacqueline E; Durbin, K James; Havlak, Paul; Jiang, Huaiyang; Liu, Yue; Qin, Xiang; Ren, Yanru; Shen, Yufeng; Song, Henry; Bell, Stephanie Nicole; Davis, Clay; Johnson, Angela Jolivet; Lee, Sandra; Nazareth, Lynne V; Patel, Bella Mayurkumar; Pu, Ling-Ling; Vattathil, Selina; Williams, Rex Lee; Curry, Stacey; Hamilton, Cerissa; Sodergren, Erica; Wheeler, David A; Barris, Wes; Bennett, Gary L; Eggen, André; Green, Ronnie D; Harhay, Gregory P; Hobbs, Matthew; Jann, Oliver; Keele, John W; Kent, Matthew P; Lien, Sigbjørn; McKay, Stephanie D; McWilliam, Sean; Ratnakumar, Abhirami; Schnabel, Robert D; Smith, Timothy; Snelling, Warren M; Sonstegard, Tad S; Stone, Roger T; Sugimoto, Yoshikazu; Takasuga, Akiko; Taylor, Jeremy F; Van Tassell, Curtis P; Macneil, Michael D; Abatepaulo, Antonio R R; Abbey, Colette A; Ahola, Virpi; Almeida, Iassudara G; Amadio, Ariel F; Anatriello, Elen; Bahadue, Suria M; Biase, Fernando H; Boldt, Clayton R; Carroll, Jeffery A; Carvalho, Wanessa A; Cervelatti, Eliane P; Chacko, Elsa; Chapin, Jennifer E; Cheng, Ye; Choi, Jungwoo; Colley, Adam J; de Campos, Tatiana A; De Donato, Marcos; Santos, Isabel K F de Miranda; de Oliveira, Carlo J F; Deobald, Heather; Devinoy, Eve; Donohue, Kaitlin E; Dovc, Peter; Eberlein, Annett; Fitzsimmons, Carolyn J; Franzin, Alessandra M; Garcia, Gustavo R; Genini, Sem; Gladney, Cody J; Grant, Jason R; Greaser, Marion L; Green, Jonathan A; Hadsell, Darryl L; Hakimov, Hatam A; Halgren, Rob; Harrow, Jennifer L; Hart, Elizabeth A; Hastings, Nicola; Hernandez, Marta; Hu, Zhi-Liang; Ingham, Aaron; Iso-Touru, Terhi; Jamis, Catherine; Jensen, Kirsty; Kapetis, Dimos; Kerr, Tovah; Khalil, Sari S; Khatib, Hasan; Kolbehdari, Davood; Kumar, Charu G; Kumar, Dinesh; Leach, Richard; Lee, Justin C-M; Li, Changxi; Logan, Krystin M; Malinverni, Roberto; Marques, Elisa; Martin, William F; Martins, Natalia F; Maruyama, Sandra R; Mazza, Raffaele; McLean, Kim L; Medrano, Juan F; Moreno, Barbara T; Moré, Daniela D; Muntean, Carl T; Nandakumar, Hari P; Nogueira, Marcelo F G; Olsaker, Ingrid; Pant, Sameer D; Panzitta, Francesca; Pastor, Rosemeire C P; Poli, Mario A; Poslusny, Nathan; Rachagani, Satyanarayana; Ranganathan, Shoba; Razpet, Andrej; Riggs, Penny K; Rincon, Gonzalo; Rodriguez-Osorio, Nelida; Rodriguez-Zas, Sandra L; Romero, Natasha E; Rosenwald, Anne; Sando, Lillian; Schmutz, Sheila M; Shen, Libing; Sherman, Laura; Southey, Bruce R; Lutzow, Ylva Strandberg; Sweedler, Jonathan V; Tammen, Imke; Telugu, Bhanu Prakash V L; Urbanski, Jennifer M; Utsunomiya, Yuri T; Verschoor, Chris P; Waardenberg, Ashley J; Wang, Zhiquan; Ward, Robert; Weikard, Rosemarie; Welsh, Thomas H; White, Stephen N; Wilming, Laurens G; Wunderlich, Kris R; Yang, Jianqi; Zhao, Feng-Qi
2009-04-24
To understand the biology and evolution of ruminants, the cattle genome was sequenced to about sevenfold coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1217 are absent or undetected in noneutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides a resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production.
Smith, Cody J.; Olszewski, Adam M.; Mauro, Steven A.
2009-01-01
Shiga toxin (Stx) genes produce proteins that are pathogenic to humans, leading to severe gastrointestinal illness. This work focuses on examining the abundance and distribution of stx genes in relation to common microbial indicators in beach water and streams in the vicinity of Presque Isle State Park in Erie, PA. By use of quantitative PCR, the relative abundance levels of stx DNA in over 700 samples in the sampling area were determined. The results demonstrate that the abundance and distribution of stx genes are variable and do not correlate with the abundance of Escherichia coli bacteria, enterococci, or viral particles. These results suggest that microbial indicators of water quality are not adequate in predicting the occurrence of organisms that harbor stx genes and highlight the need for standardized pathogen-specific detection protocols for waters utilized for recreational swimming. PMID:19011065
Is Each Light-Harvesting Complex Protein Important for Plant Fitness?1[w
Ganeteg, Ulrika; Külheim, Carsten; Andersson, Jenny; Jansson, Stefan
2004-01-01
Many of the photosynthetic genes are conserved among all higher plants, indicating that there is strong selective pressure to maintain the genes of each protein. However, mutants of these genes often lack visible growth phenotypes, suggesting that they are important only under certain conditions or have overlapping functions. To assess the importance of specific genes encoding the light-harvesting complex (LHC) proteins for the survival of the plant in the natural environment, we have combined two different scientific traditions by using an ecological fitness assay on a set of genetically modified Arabidopsis plants with differing LHC protein contents. The fitness of all of the LHC-deficient plants was reduced in some of the growth environments, supporting the hypothesis that each of the genes has been conserved because they provide ecological flexibility, which is of great adaptive value given the highly variable conditions encountered in nature. PMID:14730076
Smoking and diabetes. Epigenetics involvement in osseointegration.
Razzouk, Sleiman; Sarkis, Rami
2013-03-01
Bone quality is a poorly defined parameter for successful implant placement, which largely depends upon many environmental and genetic factors unique to every individual. Smoking and diabetes are among the environmental factors that most impact osseointegration. However, there is an inter-individual variability of bone response in smokers and diabetic patients. Recent data on gene-environment interactions highlight the major role of epigenetic changes to induce a specific phenotype. Histone acetylation and DNA methylation are the main events that occur and modulate the gene expression. In this paper, we emphasize the impact of epigenetics on diabetes and smoking and describe their significance in bone healing. Also, we underscore the importance of adopting a new approach in clinical management for implant placement by customizing the treatment according to the patient's specific characteristics.
2011-01-01
Background Major Histocompatibility Complex (MHC) genes are central to vertebrate immune response and are believed to be under balancing selection by pathogens. This hypothesis has been supported by observations of extremely high polymorphism, elevated nonsynonymous to synonymous base pair substitution rates and trans-species polymorphisms at these loci. In equids, the organization and variability of this gene family has been described, however the full extent of diversity and selection is unknown. As selection is not expected to act uniformly on a functional gene, maximum likelihood codon-based models of selection that allow heterogeneity in selection across codon positions can be valuable for examining MHC gene evolution and the molecular basis for species adaptations. Results We investigated the evolution of two class II MHC genes of the Equine Lymphocyte Antigen (ELA), DRA and DQA, in the genus Equus with the addition of novel alleles identified in plains zebra (E. quagga, formerly E. burchelli). We found that both genes exhibited a high degree of polymorphism and inter-specific sharing of allele lineages. To our knowledge, DRA allelic diversity was discovered to be higher than has ever been observed in vertebrates. Evidence was also found to support a duplication of the DQA locus. Selection analyses, evaluated in terms of relative rates of nonsynonymous to synonymous mutations (dN/dS) averaged over the gene region, indicated that the majority of codon sites were conserved and under purifying selection (dN
Siezen, Roland J.; Bayjanov, Jumamurat R.; Felis, Giovanna E.; van der Sijde, Marijke R.; Starrenburg, Marjo; Molenaar, Douwe; Wels, Michiel; van Hijum, Sacha A. F. T.; van Hylckama Vlieg, Johan E. T.
2011-01-01
Summary Lactococcus lactis produces lactic acid and is widely used in the manufacturing of various fermented dairy products. However, the species is also frequently isolated from non‐dairy niches, such as fermented plant material. Recently, these non‐dairy strains have gained increasing interest, as they have been described to possess flavour‐forming activities that are rarely found in dairy isolates and have diverse metabolic properties. We performed an extensive whole‐genome diversity analysis on 39 L. lactis strains, isolated from dairy and plant sources. Comparative genome hybridization analysis with multi‐strain microarrays was used to assess presence or absence of genes and gene clusters in these strains, relative to all L. lactis sequences in public databases, whereby chromosomal and plasmid‐encoded genes were computationally analysed separately. Nearly 3900 chromosomal orthologous groups (chrOGs) were defined on basis of four sequenced chromosomes of L. lactis strains (IL1403, KF147, SK11, MG1363). Of these, 1268 chrOGs are present in at least 35 strains and represent the presently known core genome of L. lactis, and 72 chrOGs appear to be unique for L. lactis. Nearly 600 and 400 chrOGs were found to be specific for either the subspecies lactis or subspecies cremoris respectively. Strain variability was found in presence or absence of gene clusters related to growth on plant substrates, such as genes involved in the consumption of arabinose, xylan, α‐galactosides and galacturonate. Further niche‐specific differences were found in gene clusters for exopolysaccharides biosynthesis, stress response (iron transport, osmotolerance) and bacterial defence mechanisms (nisin biosynthesis). Strain variability of functions encoded on known plasmids included proteolysis, lactose fermentation, citrate uptake, metal ion resistance and exopolysaccharides biosynthesis. The present study supports the view of L. lactis as a species with a very flexible genome. PMID:21338475
Diverse Cis-Regulatory Mechanisms Contribute to Expression Evolution of Tandem Gene Duplicates
Baudouin-Gonzalez, Luís; Santos, Marília A; Tempesta, Camille; Sucena, Élio; Roch, Fernando; Tanaka, Kohtaro
2017-01-01
Abstract Pairs of duplicated genes generally display a combination of conserved expression patterns inherited from their unduplicated ancestor and newly acquired domains. However, how the cis-regulatory architecture of duplicated loci evolves to produce these expression patterns is poorly understood. We have directly examined the gene-regulatory evolution of two tandem duplicates, the Drosophila Ly6 genes CG9336 and CG9338, which arose at the base of the drosophilids between 40 and 60 Ma. Comparing the expression patterns of the two paralogs in four Drosophila species with that of the unduplicated ortholog in the tephritid Ceratitis capitata, we show that they diverged from each other as well as from the unduplicated ortholog. Moreover, the expression divergence appears to have occurred close to the duplication event and also more recently in a lineage-specific manner. The comparison of the tissue-specific cis-regulatory modules (CRMs) controlling the paralog expression in the four Drosophila species indicates that diverse cis-regulatory mechanisms, including the novel tissue-specific enhancers, differential inactivation, and enhancer sharing, contributed to the expression evolution. Our analysis also reveals a surprisingly variable cis-regulatory architecture, in which the CRMs driving conserved expression domains change in number, location, and specificity. Altogether, this study provides a detailed historical account that uncovers a highly dynamic picture of how the paralog expression patterns and their underlying cis-regulatory landscape evolve. We argue that our findings will encourage studying cis-regulatory evolution at the whole-locus level to understand how interactions between enhancers and other regulatory levels shape the evolution of gene expression. PMID:28961967
Eleid, Mackram F; Caracciolo, Giuseppe; Cho, Eun Joo; Scott, Robert L; Steidley, D Eric; Wilansky, Susan; Arabia, Francisco A; Khandheria, Bijoy K; Sengupta, Partho P
2010-10-01
The aim of this study was to explore the temporal evolution of left ventricular (LV) mechanics in relation to clinical variables and genetic expression profiles implicated in cardiac allograft function. Considerable uncertainty exists regarding the range and determinants of variability in LV systolic performance in transplanted hearts (TXH). Fifty-one patients (mean age 53 ± 12 years; 37 men) underwent serial assessment of echocardiograms, cardiac catheterization, gene expression profiles, and endomyocardial biopsy data within 2 weeks and at 3, 6, 12, and 24 months after transplantation. Two-dimensional speckle-tracking data were compared between patients with TXH and 37 controls (including 12 post-coronary artery bypass patients). Post-transplantation mortality and hospitalizations were recorded with a median follow-up period of 944 days. Global longitudinal strain (LS) and radial strain remained attenuated in patients with TXH at all time points (p < 0.001 and p = 0.005), independent of clinical rejection episodes. Failure to improve global LS at 3 months (≥ 1 SD) was associated with higher incidence of death and cardiac events (hazard ratio: 5.92; 95% confidence interval: 1.96 to 17.91; p = 0.049). Multivariate analysis revealed gene expression score as the only independent predictor of global LS (R(2) = 0.53, p = 0.005), with SEMA7A gene expression having the highest correlation with global LS (r = -0.84, p < 0.001). Speckle tracking-derived LV strains are helpful in estimating the burden of LV dysfunction in patients with TXH that evolves independent of biopsy-detected cellular rejection. Failure to improve global LS at 3 months after transplantation is associated with a higher incidence of death and cardiac events. Serial changes in LV mechanics correlate with peripheral blood gene expression profiles and may affect the clinical assessment of long-term prognosis in patients with TXH. Copyright © 2010 American College of Cardiology Foundation. Published by Elsevier Inc. All rights reserved.
Dong, Yufeng; Jin, Xi; Tang, Qiaoling; Zhang, Xin; Yang, Jiangtao; Liu, Xiaojing; Cai, Junfeng; Zhang, Xiaobing; Wang, Xujing; Wang, Zhixing
2017-01-01
Glyphosate is a widely used herbicide, due to its broad spectrum, low cost, low toxicity, high efficiency, and non-selective characteristics. Rice farmers rarely use glyphosate as a herbicide, because the crop is sensitive to this chemical. The development of transgenic glyphosate-tolerant rice could greatly improve the economics of rice production. Here, we transformed the Pseudomonas fluorescens G2 5-enolpyruvyl shikimate-3-phosphate synthase (EPSPS) gene G2-EPSPS, which conferred tolerance to glyphosate herbicide into a widely used japonica rice cultivar, Zhonghua 11 (ZH11), to develop two highly glyphosate-tolerant transgenic rice lines, G2-6 and G2-7, with one exogenous gene integration. Seed germination tests and glyphosate-tolerance assays of plants grown in a greenhouse showed that the two transgenic lines could greatly improve glyphosate-tolerance compared with the wild-type; The glyphosate-tolerance field test indicated that both transgenic lines could grow at concentrations of 20,000 ppm glyphosate, which is more than 20-times the recommended concentration in the field. Isolation of the flanking sequence of transgenic rice G2-6 indicated that the 5′-terminal of T-DNA was inserted into chromosome 8 of the rice genome. An event-specific PCR test system was established and the limit of detection of the primers reached five copies. Overall, the G2-EPSPS gene significantly improved glyphosate-tolerance in transgenic rice; furthermore, it is a useful candidate gene for the future development of commercial transgenic rice. PMID:28611804
Sardi, Maria; Rovinskiy, Nikolay; Zhang, Yaoping; ...
2016-07-22
We report a major obstacle to sustainable lignocellulosic biofuel production is microbe inhibition by the combinatorial stresses in pretreated plant hydrolysate. Chemical biomass pretreatment releases a suite of toxins that interact with other stressors, including high osmolarity and temperature, which together can have poorly understood synergistic effects on cells. Improving tolerance in industrial strains has been hindered, in part because the mechanisms of tolerance reported in the literature often fail to recapitulate in other strain backgrounds. Here, we explored and then exploited variations in stress tolerance, toxin-induced transcriptomic responses, and fitness effects of gene overexpression in different Saccharomyces cerevisiae (yeast)more » strains to identify genes and processes linked to tolerance of hydrolysate stressors. Using six different S. cerevisiae strains that together maximized phenotypic and genetic diversity, first we explored transcriptomic differences between resistant and sensitive strains to identify common and strain-specific responses. This comparative analysis implicated primary cellular targets of hydrolysate toxins, secondary effects of defective defense strategies, and mechanisms of tolerance. Dissecting the responses to individual hydrolysate components across strains pointed to synergistic interactions between osmolarity, pH, hydrolysate toxins, and nutrient composition. By characterizing the effects of high-copy gene overexpression in three different strains, we revealed the breadth of the background-specific effects of gene fitness contributions in synthetic hydrolysate. Lastly, our approach identified new genes for engineering improved stress tolerance in diverse strains while illuminating the effects of genetic background on molecular mechanisms.« less
Influence of rol genes in floriculture.
Casanova, Eva; Trillas, Maria Isabel; Moysset, Lluïsa; Vainstein, Alexander
2005-01-01
Traditionally, new traits have been introduced into ornamental plants through classical breeding. However, genetic engineering now enables specific alterations of single traits in already successful varieties. New or improved varieties of floricultural crops can be obtained by acting on floral traits, such as color, shape or fragrance, on vase life in cut-flower species, and on rooting potential or overall plant morphology. Overexpression of the rol genes of the Ri plasmid of Agrobacterium rhizogenes in plants alters several of the plant's developmental processes and affects their architecture. Both A. rhizogenes- and rol-transgenic plants display the "hairy-root phenotype", although specific differences are found between species and between transgenic lines. In general, these plants show a dwarfed phenotype, reduced apical dominance, smaller, wrinkled leaves, increased rooting, altered flowering and reduced fertility. Among the rol genes, termed rolA, B, C and D, rolC has been the most widely studied because its effects are the most advantageous in terms of improving ornamental and horticultural traits. In addition to the dwarfness and the increase in lateral shoots that lead to a bushy phenotype, rolC-plants display more, smaller flowers, and advanced flowering; surprisingly, these plants may have better rooting capacity and they show almost no undesirable traits. rolD, the least studied among the rol genes, offers promising applications due to its promotion of flowering. Although the biochemical functions of rol genes remain poorly understood, they are useful tools for improving ornamental flowers, as their expression in transgenic plants yields many beneficial traits.
Johnston, Christopher; Douarre, Pierre E; Soulimane, Tewfik; Pletzer, Daniel; Weingart, Helge; MacSharry, John; Coffey, Aidan; Sleator, Roy D; O'Mahony, Jim
2013-06-01
Subunit and DNA-based vaccines against Mycobacterium avium ssp. paratuberculosis (MAP) attempt to overcome inherent issues associated with whole-cell formulations. However, these vaccines can be hampered by poor expression of recombinant antigens from a number of disparate hosts. The high G+C content of MAP invariably leads to a codon bias throughout gene expression. To investigate if the codon bias affects recombinant MAP antigen expression, the open reading frame of a MAP-specific antigen MptD (MAP3733c) was codon optimised for expression against a Lactobacillus salivarius host. Of the total 209 codons which constitute MAP3733c, 172 were modified resulting in a reduced G+C content from 61% for the native gene to 32.7% for the modified form. Both genes were placed under the transcriptional control of the PnisA promoter; allowing controlled heterologous expression in L. salivarius. Expression was monitored using fluorescence microscopy and microplate fluorometry via GFP tags translationally fused to the C-termini of the two MptD genes. A > 37-fold increase in expression was observed for the codon-optimised MAP3733synth variant over the native gene. Due to the low cost and improved expression achieved, codon optimisation significantly improves the potential of L. salivarius as an oral vaccine stratagem against Johne's disease. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Poxvirus Host Range Genes and Virus–Host Spectrum: A Critical Review
Oliveira, Graziele Pereira; Rodrigues, Rodrigo Araújo Lima; Lima, Maurício Teixeira; Drumond, Betânia Paiva; Abrahão, Jônatas Santos
2017-01-01
The Poxviridae family is comprised of double-stranded DNA viruses belonging to nucleocytoplasmic large DNA viruses (NCLDV). Among the NCLDV, poxviruses exhibit the widest known host range, which is likely observed because this viral family has been more heavily investigated. However, relative to each member of the Poxviridae family, the spectrum of the host is variable, where certain viruses can infect a large range of hosts, while others are restricted to only one host species. It has been suggested that the variability in host spectrum among poxviruses is linked with the presence or absence of some host range genes. Would it be possible to extrapolate the restriction of viral replication in a specific cell lineage to an animal, a far more complex organism? In this study, we compare and discuss the relationship between the host range of poxvirus species and the abundance/diversity of host range genes. We analyzed the sequences of 38 previously identified and putative homologs of poxvirus host range genes, and updated these data with deposited sequences of new poxvirus genomes. Overall, the term host range genes might not be the most appropriate for these genes, since no correlation between them and the viruses’ host spectrum was observed, and a change in nomenclature should be considered. Finally, we analyzed the evolutionary history of these genes, and reaffirmed the occurrence of horizontal gene transfer (HGT) for certain elements, as previously suggested. Considering the data presented in this study, it is not possible to associate the diversity of host range factors with the amount of hosts of known poxviruses, and this traditional nomenclature creates misunderstandings. PMID:29112165
de la Bastide, Paul Y; Leung, Wai Lam; Hintz, William E
2015-01-01
The ITS region of the rDNA gene was compared for Saprolegnia spp. in order to improve our understanding of nucleotide sequence variability within and between species of this genus, determine species composition in Canadian fin fish aquaculture facilities, and to assess the utility of ITS sequence variability in genetic marker development. From a collection of more than 400 field isolates, ITS region nucleotide sequences were studied and it was determined that there was sufficient consistent inter-specific variation to support the designation of species identity based on ITS sequence data. This non-subjective approach to species identification does not rely upon transient morphological features. Phylogenetic analyses comparing our ITS sequences and species designations with data from previous studies generally supported the clade scheme of Diéguez-Uribeondo et al. (2007) and found agreement with the molecular taxonomic cluster system of Sandoval-Sierra et al. (2014). Our Canadian ITS sequence collection will thus contribute to the public database and assist the clarification of Saprolegnia spp. taxonomy. The analysis of ITS region sequence variability facilitated genus- and species-level identification of unknown samples from aquaculture facilities and provided useful information on species composition. A unique ITS-RFLP for the identification of S. parasitica was also described. Copyright © 2014 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Iben, James R.; Maraia, Richard J.
2012-01-01
tRNA genes are interspersed throughout eukaryotic DNA, contributing to genome architecture and evolution in addition to translation of the transcriptome. Codon use correlates with tRNA gene copy number in noncomplex organisms including yeasts. Synonymous codons impact translation with various outcomes, dependent on relative tRNA abundances. Availability of whole-genome sequences allowed us to examine tRNA gene copy number variation (tgCNV) and codon use in four Schizosaccharomyces species and Saccharomyces cerevisiae. tRNA gene numbers vary from 171 to 322 in the four Schizosaccharomyces despite very high similarity in other features of their genomes. In addition, we performed whole-genome sequencing of several related laboratory strains of Schizosaccharomyces pombe and found tgCNV at a cluster of tRNA genes. We examined for the first time effects of wobble rules on correlation of tRNA gene number and codon use and showed improvement for S. cerevisiae and three of the Schizosaccharomyces species. In contrast, correlation in Schizosaccharomyces japonicus is poor due to markedly divergent tRNA gene content, and much worsened by the wobble rules. In japonicus, some tRNA iso-acceptor genes are absent and others are greatly reduced relative to the other yeasts, while genes for synonymous wobble iso-acceptors are amplified, indicating wobble use not apparent in any other eukaryote. We identified a subset of japonicus-specific wobbles that improves correlation of codon use and tRNA gene content in japonicus. We conclude that tgCNV is high among Schizo species and occurs in related laboratory strains of S. pombe (and expectedly other species), and tRNAome-codon analyses can provide insight into species-specific wobble decoding. PMID:22586155
Roth, Justin C.; Ismail, Mourad; Reese, Jane S.; Lingas, Karen T.; Ferrari, Giuliana; Gerson, Stanton L.
2012-01-01
The P140K point mutant of MGMT allows robust hematopoietic stem cell (HSC) enrichment in vivo. Thus, dual-gene vectors that couple MGMT and therapeutic gene expression have allowed enrichment of gene-corrected HSCs in animal models. However, expression levels from dual-gene vectors are often reduced for one or both genes. Further, it may be desirable to express selection and therapeutic genes at distinct stages of cell differentiation. In this regard, we evaluated whether hematopoietic cells could be efficiently cotransduced using low MOIs of two separate single-gene lentiviruses, including MGMT for dual-positive cell enrichment. Cotransduction efficiencies were evaluated using a range of MGMT : GFP virus ratios, MOIs, and selection stringencies in vitro. Cotransduction was optimal when equal proportions of each virus were used, but low MGMT : GFP virus ratios resulted in the highest proportion of dual-positive cells after selection. This strategy was then evaluated in murine models for in vivo selection of HSCs cotransduced with a ubiquitous MGMT expression vector and an erythroid-specific GFP vector. Although the MGMT and GFP expression percentages were variable among engrafted recipients, drug selection enriched MGMT-positive leukocyte and GFP-positive erythroid cell populations. These data demonstrate cotransduction as a mean to rapidly enrich and evaluate therapeutic lentivectors in vivo. PMID:22888445
Máximo, Wesley P. F.; Zanetti, Ronald; Paiva, Luciano V.
2018-01-01
Although several ant species are important targets for the development of molecular control strategies, only a few studies focus on identifying and validating reference genes for quantitative reverse transcription polymerase chain reaction (RT-qPCR) data normalization. We provide here an extensive study to identify and validate suitable reference genes for gene expression analysis in the ant Atta sexdens, a threatening agricultural pest in South America. The optimal number of reference genes varies according to each sample and the result generated by RefFinder differed about which is the most suitable reference gene. Results suggest that the RPS16, NADH and SDHB genes were the best reference genes in the sample pool according to stability values. The SNF7 gene expression pattern was stable in all evaluated sample set. In contrast, when using less stable reference genes for normalization a large variability in SNF7 gene expression was recorded. There is no universal reference gene suitable for all conditions under analysis, since these genes can also participate in different cellular functions, thus requiring a systematic validation of possible reference genes for each specific condition. The choice of reference genes on SNF7 gene normalization confirmed that unstable reference genes might drastically change the expression profile analysis of target candidate genes. PMID:29419794
Al-Qurayshi, Zaid; Deniwar, Ahmed; Thethi, Tina; Mallik, Tilak; Srivastav, Sudesh; Murad, Fadi; Bhatia, Parisha; Moroz, Krzysztof; Sholl, Andrew B; Kandil, Emad
2017-04-01
It is crucial for clinicians to know the malignancy prevalence within each indeterminate cytologic category to estimate the performance of the gene expression classifier (GEC). To examine the variability in the performance of the GEC. This retrospective cohort study of patients with Bethesda category III and IV thyroid nodules used single-institution data from January 1, 2013, through February 29, 2016. Expected negative predictive value (NPV) was calculated by adopting published sensitivity and specificity. Observed NPV was calculated based on the true-negative rate. Outcomes were compared with pooled data from 11 studies published January 1, 2010, to January 31, 2016. A total of 145 patients with 154 thyroid nodules were included in the study (mean [SD] age, 56.0 [16.2] years; 106 females [73.1%]). Malignancy prevalence was 45%. On the basis of this prevalence, the expected NPV is 85% and the observed NPV is 69%. If the prevalence is assumed to be 25%, the expected NPV would be 94%, whereas the observed NPV would be 85%. Pooled data analysis of 11 studies comprising 1303 participants revealed a malignancy prevalence of 31% (95% CI, 29%-34%) and a pooled NPV of 92% (95% CI, 87%-96%). In this study, variability in the performance of the GEC was not solely a function of malignancy prevalence and may have been attributable to intrinsic variability of the test sensitivity and specificity. The utility of the GEC in practice is elusive because of this variability. A better definition of the GEC's intrinsic properties is needed.
Transcriptional Analysis of In vivo Plasmodium yoelii Liver Stage Gene Expression
2005-04-26
reaction was added to a PCR master mix with one of several oligonucleotide primer pairs (Table S6). The primers were designed and the specific conditions ...Koonin EV. Using the COG database to improve gene recognition in complete genomes. Genetica 2000;108:9–17. 26] Florens L, Washburn MP, Raine JD, et al
Dekeyser, S; Beclin, E; Descamps, D
2011-04-01
The closed system PCR for the rapid detection of vanA and vanB genes (Xpert vanA/vanB Cepheid(®)) was evaluated in our laboratory, to improve the rapidity of the response and thus the management of patients and isolation measures during two GRE outbreaks. From March to December2009, 565 samples were analysed by PCR associated to bacterial culture initially for all samples for 2months (n = 75), and thereafter for PCR-positive samples only. In this study, sensitivity and negative predictive values of the PCR were 100%. Specificity was evaluated in the presence and absence of outbreak: 69.3 and 76.8% respectively. The variability of false positive rates between units were lower in nonepidemic than during epidemic phase. The global false positive rate was 23.9%. This easy-to-use technology provides rapid results… four samples are tested in 1h versus 72h for culture. Despite its reagent cost, it represents an important hospital diagnostic tool: improvement of the management of cohorting areas and patient transfer between units, adaptation of isolation measures and treatments. However, culture remains necessary to confirm any positive result obtained by PCR and for epidemiological surveillance. Copyright © 2010 Elsevier Masson SAS. All rights reserved.
Whole genome sequencing revealed host adaptation-focused genomic plasticity of pathogenic Leptospira
Xu, Yinghua; Zhu, Yongzhang; Wang, Yuezhu; Chang, Yung-Fu; Zhang, Ying; Jiang, Xiugao; Zhuang, Xuran; Zhu, Yongqiang; Zhang, Jinlong; Zeng, Lingbing; Yang, Minjun; Li, Shijun; Wang, Shengyue; Ye, Qiang; Xin, Xiaofang; Zhao, Guoping; Zheng, Huajun; Guo, Xiaokui; Wang, Junzhi
2016-01-01
Leptospirosis, caused by pathogenic Leptospira spp., has recently been recognized as an emerging infectious disease worldwide. Despite its severity and global importance, knowledge about the molecular pathogenesis and virulence evolution of Leptospira spp. remains limited. Here we sequenced and analyzed 102 isolates representing global sources. A high genomic variability were observed among different Leptospira species, which was attributed to massive gene gain and loss events allowing for adaptation to specific niche conditions and changing host environments. Horizontal gene transfer and gene duplication allowed the stepwise acquisition of virulence factors in pathogenic Leptospira evolved from a recent common ancestor. More importantly, the abundant expansion of specific virulence-related protein families, such as metalloproteases-associated paralogs, were exclusively identified in pathogenic species, reflecting the importance of these protein families in the pathogenesis of leptospirosis. Our observations also indicated that positive selection played a crucial role on this bacteria adaptation to hosts. These novel findings may lead to greater understanding of the global diversity and virulence evolution of Leptospira spp. PMID:26833181
Ogah, Danlami Moses; Iannaccone, Marco; Erhardt, Georg; Di Stasio, Liliana; Cosenza, Gianfranco
2018-01-01
Oxytocin is a neurohypophysial peptide linked to a wide range of biological functions, including milk ejection, temperament and reproduction. Aims of the present study were a) the characterization of the OXT (Oxytocin-neurophysin I) gene and its regulatory regions in Old and New world camelids; b) the investigation of the genetic diversity and the discovery of markers potentially affecting the gene regulation. On average, the gene extends over 814 bp, ranging between 825 bp in dromedary, 811 bp in Bactrian and 810 bp in llama and alpaca. Such difference in size is due to a duplication event of 21 bp in dromedary. The main regulatory elements, including the composite hormone response elements (CHREs), were identified in the promoter, whereas the presence of mature microRNAs binding sequences in the 3’UTR improves the knowledge on the factors putatively involved in the OXT gene regulation, although their specific biological effect needs to be still elucidated. The sequencing of genomic DNA allowed the identification of 17 intraspecific polymorphisms and 69 nucleotide differences among the four species. One of these (MF464535:g.622C>G) is responsible, in alpaca, for the loss of a consensus sequence for the transcription factor SP1. Furthermore, the same SNP falls within a CpG island and it creates a new methylation site, thus opening future possibilities of investigation to verify the influence of the novel allelic variant in the OXT gene regulation. A PCR-RFLP method was setup for the genotyping and the frequency of the allele C was 0.93 in a population of 71 alpacas. The obtained data clarify the structure of OXT gene in domestic camelids and add knowledge to the genetic variability of a genomic region, which has received little investigation so far. These findings open the opportunity for new investigations, including association studies with productive and reproductive traits. PMID:29608621
Pauciullo, Alfredo; Ogah, Danlami Moses; Iannaccone, Marco; Erhardt, Georg; Di Stasio, Liliana; Cosenza, Gianfranco
2018-01-01
Oxytocin is a neurohypophysial peptide linked to a wide range of biological functions, including milk ejection, temperament and reproduction. Aims of the present study were a) the characterization of the OXT (Oxytocin-neurophysin I) gene and its regulatory regions in Old and New world camelids; b) the investigation of the genetic diversity and the discovery of markers potentially affecting the gene regulation. On average, the gene extends over 814 bp, ranging between 825 bp in dromedary, 811 bp in Bactrian and 810 bp in llama and alpaca. Such difference in size is due to a duplication event of 21 bp in dromedary. The main regulatory elements, including the composite hormone response elements (CHREs), were identified in the promoter, whereas the presence of mature microRNAs binding sequences in the 3'UTR improves the knowledge on the factors putatively involved in the OXT gene regulation, although their specific biological effect needs to be still elucidated. The sequencing of genomic DNA allowed the identification of 17 intraspecific polymorphisms and 69 nucleotide differences among the four species. One of these (MF464535:g.622C>G) is responsible, in alpaca, for the loss of a consensus sequence for the transcription factor SP1. Furthermore, the same SNP falls within a CpG island and it creates a new methylation site, thus opening future possibilities of investigation to verify the influence of the novel allelic variant in the OXT gene regulation. A PCR-RFLP method was setup for the genotyping and the frequency of the allele C was 0.93 in a population of 71 alpacas. The obtained data clarify the structure of OXT gene in domestic camelids and add knowledge to the genetic variability of a genomic region, which has received little investigation so far. These findings open the opportunity for new investigations, including association studies with productive and reproductive traits.
Wang, Hongwei; Xin, Haibo; Yang, Xiaohong; Yan, Jianbing; Li, Jiansheng; Tran, Lam-Son Phan; Shinozaki, Kazuo; Yamaguchi-Shinozaki, Kazuko; Qin, Feng
2013-01-01
The worldwide production of maize (Zea mays L.) is frequently impacted by water scarcity and as a result, increased drought tolerance is a priority target in maize breeding programs. While DREB transcription factors have been demonstrated to play a central role in desiccation tolerance, whether or not natural sequence variations in these genes are associated with the phenotypic variability of this trait is largely unknown. In the present study, eighteen ZmDREB genes present in the maize B73 genome were cloned and systematically analyzed to determine their phylogenetic relationship, synteny with rice, maize and sorghum genomes; pattern of drought-responsive gene expression, and protein transactivation activity. Importantly, the association between the nucleic acid variation of each ZmDREB gene with drought tolerance was evaluated using a diverse population of maize consisting of 368 varieties from tropical and temperate regions. A significant association between the genetic variation of ZmDREB2.7 and drought tolerance at seedling stage was identified. Further analysis found that the DNA polymorphisms in the promoter region of ZmDREB2.7, but not the protein coding region itself, was associated with different levels of drought tolerance among maize varieties, likely due to distinct patterns of gene expression in response to drought stress. In vitro, protein-DNA binding assay demonstrated that ZmDREB2.7 protein could specifically interact with the target DNA sequences. The transgenic Arabidopsis overexpressing ZmDREB2.7 displayed enhanced tolerance to drought stress. Moreover, a favorable allele of ZmDREB2.7, identified in the drought-tolerant maize varieties, was effective in imparting plant tolerance to drought stress. Based upon these findings, we conclude that natural variation in the promoter of ZmDREB2.7 contributes to maize drought tolerance, and that the gene and its favorable allele may be an important genetic resource for the genetic improvement of drought tolerance in maize. PMID:24086146
Genome editing for crop improvement: Challenges and opportunities
Abdallah, Naglaa A; Prakash, Channapatna S; McHughen, Alan G
2015-01-01
ABSTRACT Genome or gene editing includes several new techniques to help scientists precisely modify genome sequences. The techniques also enables us to alter the regulation of gene expression patterns in a pre-determined region and facilitates novel insights into the functional genomics of an organism. Emergence of genome editing has brought considerable excitement especially among agricultural scientists because of its simplicity, precision and power as it offers new opportunities to develop improved crop varieties with clear-cut addition of valuable traits or removal of undesirable traits. Research is underway to improve crop varieties with higher yields, strengthen stress tolerance, disease and pest resistance, decrease input costs, and increase nutritional value. Genome editing encompasses a wide variety of tools using either a site-specific recombinase (SSR) or a site-specific nuclease (SSN) system. Both systems require recognition of a known sequence. The SSN system generates single or double strand DNA breaks and activates endogenous DNA repair pathways. SSR technology, such as Cre/loxP and Flp/FRT mediated systems, are able to knockdown or knock-in genes in the genome of eukaryotes, depending on the orientation of the specific sites (loxP, FLP, etc.) flanking the target site. There are 4 main classes of SSN developed to cleave genomic sequences, mega-nucleases (homing endonuclease), zinc finger nucleases (ZFNs), transcriptional activator-like effector nucleases (TALENs), and the CRISPR/Cas nuclease system (clustered regularly interspaced short palindromic repeat/CRISPR-associated protein). The recombinase mediated genome engineering depends on recombinase (sub-) family and target-site and induces high frequencies of homologous recombination. Improving crops with gene editing provides a range of options: by altering only a few nucleotides from billions found in the genomes of living cells, altering the full allele or by inserting a new gene in a targeted region of the genome. Due to its precision, gene editing is more precise than either conventional crop breeding methods or standard genetic engineering methods. Thus this technology is a very powerful tool that can be used toward securing the world's food supply. In addition to improving the nutritional value of crops, it is the most effective way to produce crops that can resist pests and thrive in tough climates. There are 3 types of modifications produced by genome editing; Type I includes altering a few nucleotides, Type II involves replacing an allele with a pre-existing one and Type III allows for the insertion of new gene(s) in predetermined regions in the genome. Because most genome-editing techniques can leave behind traces of DNA alterations evident in a small number of nucleotides, crops created through gene editing could avoid the stringent regulation procedures commonly associated with GM crop development. For this reason many scientists believe plants improved with the more precise gene editing techniques will be more acceptable to the public than transgenic plants. With genome editing comes the promise of new crops being developed more rapidly with a very low risk of off-target effects. It can be performed in any laboratory with any crop, even those that have complex genomes and are not easily bred using conventional methods. PMID:26930114
Teo, Min Yuen; Bambury, Richard M; Zabor, Emily C; Jordan, Emmet; Al-Ahmadie, Hikmat; Boyd, Mariel E; Bouvier, Nancy; Mullane, Stephanie A; Cha, Eugene K; Roper, Nitin; Ostrovnaya, Irina; Hyman, David M; Bochner, Bernard H; Arcila, Maria E; Solit, David B; Berger, Michael F; Bajorin, Dean F; Bellmunt, Joaquim; Iyer, Gopakumar; Rosenberg, Jonathan E
2017-07-15
Purpose: Platinum-based chemotherapy remains the standard treatment for advanced urothelial carcinoma by inducing DNA damage. We hypothesize that somatic alterations in DNA damage response and repair (DDR) genes are associated with improved sensitivity to platinum-based chemotherapy. Experimental Design: Patients with diagnosis of locally advanced and metastatic urothelial carcinoma treated with platinum-based chemotherapy who had exon sequencing with the Memorial Sloan Kettering-Integrated Mutation Profiling of Actionable Cancer Targets (MSK-IMPACT) assay were identified. Patients were dichotomized based on the presence/absence of alterations in a panel of 34 DDR genes. DDR alteration status was correlated with clinical outcomes and disease features. Results: One hundred patients were identified, of which 47 harbored alterations in DDR genes. Patients with DDR alterations had improved progression-free survival (9.3 vs. 6.0 months, log-rank P = 0.007) and overall survival (23.7 vs. 13.0 months, log-rank P = 0.006). DDR alterations were also associated with higher number mutations and copy-number alterations. A trend toward positive correlation between DDR status and nodal metastases and inverse correlation with visceral metastases were observed. Different DDR pathways also suggested variable impact on clinical outcomes. Conclusions: Somatic DDR alteration is associated with improved clinical outcomes in platinum-treated patients with advanced urothelial carcinoma. Once validated, it can improve patient selection for clinical practice and future study enrollment. Clin Cancer Res; 23(14); 3610-8. ©2017 AACR . ©2017 American Association for Cancer Research.
Fernández-Hidalgo, N; Ferreria-González, I; Marsal, J R; Ribera, A; Aznar, M L; de Alarcón, A; García-Cabrera, E; Gálvez-Acebal, J; Sánchez-Espín, G; Reguera-Iglesias, J M; De La Torre-Lima, J; Lomas, J M; Hidalgo-Tenorio, C; Vallejo, N; Miranda, B; Santos-Ortega, A; Castro, M A; Tornos, P; García-Dorado, D; Almirante, B
2018-03-03
To simplify and optimize the ability of EuroSCORE I and II to predict early mortality after surgery for infective endocarditis (IE). Multicentre retrospective study (n = 775). Simplified scores, eliminating irrelevant variables, and new specific scores, adding specific IE variables, were created. The performance of the original, recalibrated and specific EuroSCOREs was assessed by Brier score, C-statistic and calibration plot in bootstrap samples. The Net Reclassification Index was quantified. Recalibrated scores including age, previous cardiac surgery, critical preoperative state, New York Heart Association >I, and emergent surgery (EuroSCORE I and II); renal failure and pulmonary hypertension (EuroSCORE I); and urgent surgery (EuroSCORE II) performed better than the original EuroSCOREs (Brier original and recalibrated: EuroSCORE I: 0.1770 and 0.1667; EuroSCORE II: 0.2307 and 0.1680). Performance improved with the addition of fistula, staphylococci and mitral location (EuroSCORE I and II) (Brier specific: EuroSCORE I 0.1587, EuroSCORE II 0.1592). Discrimination improved in specific models (C-statistic original, recalibrated and specific: EuroSCORE I: 0.7340, 0.7471 and 0.7728; EuroSCORE II: 0.7442, 0.7423 and 0.7700). Calibration improved in both EuroSCORE I models (intercept 0.295, slope 0.829 (original); intercept -0.094, slope 0.888 (recalibrated); intercept -0.059, slope 0.925 (specific)) but only in specific EuroSCORE II model (intercept 2.554, slope 1.114 (original); intercept -0.260, slope 0.703 (recalibrated); intercept -0.053, slope 0.930 (specific)). Net Reclassification Index was 5.1% and 20.3% for the specific EuroSCORE I and II CONCLUSIONS: The use of simplified EuroSCORE I and EuroSCORE II models in IE with the addition of specific variables may lead to simpler and more accurate models. Copyright © 2018 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
Evolutionary Trails of Plant Group II Pyridoxal Phosphate-Dependent Decarboxylase Genes.
Kumar, Rahul
2016-01-01
Type II pyridoxal phosphate-dependent decarboxylase (PLP_deC) enzymes play important metabolic roles during nitrogen metabolism. Recent evolutionary profiling of these genes revealed a sharp expansion of histidine decarboxylase genes in the members of Solanaceae family. In spite of the high sequence homology shared by PLP_deC orthologs, these enzymes display remarkable differences in their substrate specificities. Currently, limited information is available on the gene repertoires and substrate specificities of PLP_deCs which renders their precise annotation challenging and offers technical challenges in the immediate identification and biochemical characterization of their full gene complements in plants. Herein, we explored their evolutionary trails in a comprehensive manner by taking advantage of high-throughput data accessibility and computational approaches. We discussed the premise that has enabled an improved reconstruction of their evolutionary lineage and evaluated the factors offering constraints in their rapid functional characterization, till date. We envisage that the synthesized information herein would act as a catalyst for the rapid exploration of their biochemical specificity and physiological roles in more plant species.
Meyers, Robin M; Bryan, Jordan G; McFarland, James M; Weir, Barbara A; Sizemore, Ann E; Xu, Han; Dharia, Neekesh V; Montgomery, Phillip G; Cowley, Glenn S; Pantel, Sasha; Goodale, Amy; Lee, Yenarae; Ali, Levi D; Jiang, Guozhi; Lubonja, Rakela; Harrington, William F; Strickland, Matthew; Wu, Ting; Hawes, Derek C; Zhivich, Victor A; Wyatt, Meghan R; Kalani, Zohra; Chang, Jaime J; Okamoto, Michael; Stegmaier, Kimberly; Golub, Todd R; Boehm, Jesse S; Vazquez, Francisca; Root, David E; Hahn, William C; Tsherniak, Aviad
2017-12-01
The CRISPR-Cas9 system has revolutionized gene editing both at single genes and in multiplexed loss-of-function screens, thus enabling precise genome-scale identification of genes essential for proliferation and survival of cancer cells. However, previous studies have reported that a gene-independent antiproliferative effect of Cas9-mediated DNA cleavage confounds such measurement of genetic dependency, thereby leading to false-positive results in copy number-amplified regions. We developed CERES, a computational method to estimate gene-dependency levels from CRISPR-Cas9 essentiality screens while accounting for the copy number-specific effect. In our efforts to define a cancer dependency map, we performed genome-scale CRISPR-Cas9 essentiality screens across 342 cancer cell lines and applied CERES to this data set. We found that CERES decreased false-positive results and estimated sgRNA activity for both this data set and previously published screens performed with different sgRNA libraries. We further demonstrate the utility of this collection of screens, after CERES correction, for identifying cancer-type-specific vulnerabilities.
Meyers, Robin M.; Bryan, Jordan G.; McFarland, James M.; Weir, Barbara A.; Sizemore, Ann E.; Xu, Han; Dharia, Neekesh V.; Montgomery, Phillip G.; Cowley, Glenn S.; Pantel, Sasha; Goodale, Amy; Lee, Yenarae; Ali, Levi D.; Jiang, Guozhi; Lubonja, Rakela; Harrington, William F.; Strickland, Matthew; Wu, Ting; Hawes, Derek C.; Zhivich, Victor A.; Wyatt, Meghan R.; Kalani, Zohra; Chang, Jaime J.; Okamoto, Michael; Stegmaier, Kimberly; Golub, Todd R.; Boehm, Jesse S.; Vazquez, Francisca; Root, David E.; Hahn, William C.; Tsherniak, Aviad
2017-01-01
The CRISPR-Cas9 system has revolutionized gene editing both on single genes and in multiplexed loss-of-function screens, enabling precise genome-scale identification of genes essential to proliferation and survival of cancer cells1,2. However, previous studies reported that a gene-independent anti-proliferative effect of Cas9-mediated DNA cleavage confounds such measurement of genetic dependency, leading to false positive results in copy number amplified regions3,4. We developed CERES, a computational method to estimate gene dependency levels from CRISPR-Cas9 essentiality screens while accounting for the copy-number-specific effect. As part of our efforts to define a cancer dependency map, we performed genome-scale CRISPR-Cas9 essentiality screens across 342 cancer cell lines and applied CERES to this dataset. We found that CERES reduced false positive results and estimated sgRNA activity for both this dataset and previously published screens performed with different sgRNA libraries. Here, we demonstrate the utility of this collection of screens, upon CERES correction, in revealing cancer-type-specific vulnerabilities. PMID:29083409
The Genetics of Pulmonary Arterial Hypertension
Austin, Eric D.; Loyd, James E.
2014-01-01
Pulmonary arterial hypertension (PAH) is a progressive and fatal disease for which there is an ever-expanding body of genetic and related pathophysiological information on disease pathogenesis. A number of germline gene mutations have now been described, including mutations in the gene coding bone morphogenic protein receptor type 2 (BMPR2) and related genes. Recent advanced gene sequencing methods have facilitated the discovery of additional genes with mutations among those with and without familial forms of PAH (CAV1, KCNK3, EIF2AK4). The reduced penetrance, variable expressivity, and female predominance of PAH suggest that genetic, genomic and other factors modify disease expression. These multi-faceted variations are an active area of investigation in the field, including but not limited to common genetic variants and epigenetic processes, and may provide novel opportunities for pharmacologic intervention in the near future. They also highlight the need for a systems-oriented multi-level approach to incorporate the multitude of biologic variations now associated with PAH. Ultimately, improved understanding provides the opportunity for improved patient and family counseling about this devastating disease, but do require in depth understanding of the genetic factors relevant to PAH. PMID:24951767
Comprehensive genetic dissection of wood properties in a widely-grown tropical tree: Eucalyptus
2011-01-01
Background Eucalyptus is an important genus in industrial plantations throughout the world and is grown for use as timber, pulp, paper and charcoal. Several breeding programmes have been launched worldwide to concomitantly improve growth performance and wood properties (WPs). In this study, an interspecific cross between Eucalyptus urophylla and E. grandis was used to identify major genomic regions (Quantitative Trait Loci, QTL) controlling the variability of WPs. Results Linkage maps were generated for both parent species. A total of 117 QTLs were detected for a series of wood and end-use related traits, including chemical, technological, physical, mechanical and anatomical properties. The QTLs were mainly clustered into five linkage groups. In terms of distribution of QTL effects, our result agrees with the typical L-shape reported in most QTL studies, i.e. most WP QTLs had limited effects and only a few (13) had major effects (phenotypic variance explained > 15%). The co-locations of QTLs for different WPs as well as QTLs and candidate genes are discussed in terms of phenotypic correlations between traits, and of the function of the candidate genes. The major wood property QTL harbours a gene encoding a Cinnamoyl CoA reductase (CCR), a structural enzyme of the monolignol-specific biosynthesis pathway. Conclusions Given the number of traits analysed, this study provides a comprehensive understanding of the genetic architecture of wood properties in this Eucalyptus full-sib pedigree. At the dawn of Eucalyptus genome sequence, it will provide a framework to identify the nature of genes underlying these important quantitative traits. PMID:21651758
Van Loo, Peter; Aerts, Stein; Thienpont, Bernard; De Moor, Bart; Moreau, Yves; Marynen, Peter
2008-01-01
We present ModuleMiner, a novel algorithm for computationally detecting cis-regulatory modules (CRMs) in a set of co-expressed genes. ModuleMiner outperforms other methods for CRM detection on benchmark data, and successfully detects CRMs in tissue-specific microarray clusters and in embryonic development gene sets. Interestingly, CRM predictions for differentiated tissues exhibit strong enrichment close to the transcription start site, whereas CRM predictions for embryonic development gene sets are depleted in this region. PMID:18394174
[The application of genome editing in identification of plant gene function and crop breeding].
Zhou, Xiang-chun; Xing, Yong-zhong
2016-03-01
Plant genome can be modified via current biotechnology with high specificity and excellent efficiency. Zinc finger nucleases (ZFN), transcription activator-like effector nucleases (TALEN) and clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated 9 (Cas9) system are the key engineered nucleases used in the genome editing. Genome editing techniques enable gene targeted mutagenesis, gene knock-out, gene insertion or replacement at the target sites during the endogenous DNA repair process, including non-homologous end joining (NHEJ) and homologous recombination (HR), triggered by the induction of DNA double-strand break (DSB). Genome editing has been successfully applied in the genome modification of diverse plant species, such as Arabidopsis thaliana, Oryza sativa, and Nicotiana tabacum. In this review, we summarize the application of genome editing in identification of plant gene function and crop breeding. Moreover, we also discuss the improving points of genome editing in crop precision genetic improvement for further study.
Compeer, Ewoud B; Janssen, Willemijn; van Royen-Kerkhof, Annet; van Gijn, Marielle; van Montfrans, Joris M; Boes, Marianne
2015-05-10
Common Variable Immunodeficiency (CVID) is the most prevalent primary antibody deficiency, and characterized by defective generation of high-affinity antibodies. Patients have therefore increased risk to recurrent infections of the respiratory and intestinal tract. Development of high-affinity antigen-specific antibodies involves two key actions of B-cell receptors (BCR): transmembrane signaling through BCR-complexes to induce B-cell differentiation and proliferation, and BCR-mediated antigen internalization for class-II MHC-mediated presentation to acquire antigen-specific CD4(+) T-cell help.We identified a variant (L3P) in the B-lymphoid tyrosine kinase (BLK) gene of 2 related CVID-patients, which was absent in healthy relatives. BLK belongs to the Src-kinases family and involved in BCR-signaling. Here, we sought to clarify BLK function in healthy human B-cells and its association to CVID.BLK expression was comparable in patient and healthy B-cells. Functional analysis of L3P-BLK showed reduced BCR crosslinking-induced Syk phosphorylation and proliferation, in both primary B-cells and B-LCLs. B-cells expressing L3P-BLK showed accelerated destruction of BCR-internalized antigen and reduced ability to elicit CD40L-expression on antigen-specific CD4(+) T-cells.In conclusion, we found a novel BLK gene variant in CVID-patients that causes suppressed B-cell proliferation and reduced ability of B-cells to elicit antigen-specific CD4(+) T-cell responses. Both these mechanisms may contribute to hypogammaglobulinemia in CVID-patients.
Zhou, Zhan; Zou, Yangyun; Liu, Gangbiao; Zhou, Jingqi; Wu, Jingcheng; Zhao, Shimin; Su, Zhixi; Gu, Xun
2017-08-29
Human genes exhibit different effects on fitness in cancer and normal cells. Here, we present an evolutionary approach to measure the selection pressure on human genes, using the well-known ratio of the nonsynonymous to synonymous substitution rate in both cancer genomes ( C N / C S ) and normal populations ( p N / p S ). A new mutation-profile-based method that adopts sample-specific mutation rate profiles instead of conventional substitution models was developed. We found that cancer-specific selection pressure is quite different from the selection pressure at the species and population levels. Both the relaxation of purifying selection on passenger mutations and the positive selection of driver mutations may contribute to the increased C N / C S values of human genes in cancer genomes compared with the p N / p S values in human populations. The C N / C S values also contribute to the improved classification of cancer genes and a better understanding of the onco-functionalization of cancer genes during oncogenesis. The use of our computational pipeline to identify cancer-specific positively and negatively selected genes may provide useful information for understanding the evolution of cancers and identifying possible targets for therapeutic intervention.
Zhou, Xionghui; Liu, Juan
2014-01-01
Although many methods have been proposed to reconstruct gene regulatory network, most of them, when applied in the sample-based data, can not reveal the gene regulatory relations underlying the phenotypic change (e.g. normal versus cancer). In this paper, we adopt phenotype as a variable when constructing the gene regulatory network, while former researches either neglected it or only used it to select the differentially expressed genes as the inputs to construct the gene regulatory network. To be specific, we integrate phenotype information with gene expression data to identify the gene dependency pairs by using the method of conditional mutual information. A gene dependency pair (A,B) means that the influence of gene A on the phenotype depends on gene B. All identified gene dependency pairs constitute a directed network underlying the phenotype, namely gene dependency network. By this way, we have constructed gene dependency network of breast cancer from gene expression data along with two different phenotype states (metastasis and non-metastasis). Moreover, we have found the network scale free, indicating that its hub genes with high out-degrees may play critical roles in the network. After functional investigation, these hub genes are found to be biologically significant and specially related to breast cancer, which suggests that our gene dependency network is meaningful. The validity has also been justified by literature investigation. From the network, we have selected 43 discriminative hubs as signature to build the classification model for distinguishing the distant metastasis risks of breast cancer patients, and the result outperforms those classification models with published signatures. In conclusion, we have proposed a promising way to construct the gene regulatory network by using sample-based data, which has been shown to be effective and accurate in uncovering the hidden mechanism of the biological process and identifying the gene signature for phenotypic change.
Alcohol-related Genes Show an Enrichment of Associations with a Persistent Externalizing Factor
Ashenhurst, James R.; Harden, K. Paige; Corbin, William R.; Fromme, Kim
2016-01-01
Research using twins has found that much of the variability in externalizing phenotypes – including alcohol and drug use, impulsive personality traits, risky sex and property crime – is explained by genetic factors. Nevertheless, identification of specific genes and variants associated with these traits has proven to be difficult, likely because individual differences in externalizing are explained by many genes of small individual effect. Moreover, twin research indicates that heritable variance in externalizing behaviors is mostly shared across the externalizing spectrum rather than specific to any behavior. We use a longitudinal, “deep phenotyping” approach to model a general externalizing factor reflecting persistent engagement in a variety of socially problematic behaviors measured at eleven assessment occasions spanning early adulthood (ages 18 to 28). In an ancestrally homogenous sample of non-Hispanic Whites (N = 337), we then tested for enrichment of associations between the persistent externalizing factor and a set of 3,281 polymorphisms within 104 genes that were previously identified as associated with alcohol-use behaviors. Next we tested for enrichment among domain-specific factors (e.g., property crime) composed of residual variance not accounted for by the common factor. Significance was determined relative to bootstrapped empirical thresholds derived from permutations of phenotypic data. Results indicated significant enrichment of genetic associations for persistent externalizing, but not for domain-specific factors. Consistent with twin research findings, these results suggest that genetic variants are broadly associated with externalizing behaviors rather than unique to specific behaviors. General Scientific Summary This study shows that variation in 104 genes is associated with socially problematic “externalizing” behavior, including substance misuse, property crime, risky sex, and aspects of impulsive personality. Importantly, this association was with the common variation across these behaviors rather than with the variation unique to any given behavior. The manuscript demonstrates a potentially advantageous technique for relating sets of hypothesized genes to complex traits or behaviors. PMID:27505405
Transposable element islands facilitate adaptation to novel environments in an invasive species
Schrader, Lukas; Kim, Jay W.; Ence, Daniel; Zimin, Aleksey; Klein, Antonia; Wyschetzki, Katharina; Weichselgartner, Tobias; Kemena, Carsten; Stökl, Johannes; Schultner, Eva; Wurm, Yannick; Smith, Christopher D.; Yandell, Mark; Heinze, Jürgen; Gadau, Jürgen; Oettler, Jan
2014-01-01
Adaptation requires genetic variation, but founder populations are generally genetically depleted. Here we sequence two populations of an inbred ant that diverge in phenotype to determine how variability is generated. Cardiocondyla obscurior has the smallest of the sequenced ant genomes and its structure suggests a fundamental role of transposable elements (TEs) in adaptive evolution. Accumulations of TEs (TE islands) comprising 7.18% of the genome evolve faster than other regions with regard to single-nucleotide variants, gene/exon duplications and deletions and gene homology. A non-random distribution of gene families, larvae/adult specific gene expression and signs of differential methylation in TE islands indicate intragenomic differences in regulation, evolutionary rates and coalescent effective population size. Our study reveals a tripartite interplay between TEs, life history and adaptation in an invasive species. PMID:25510865
Molecular basis of length polymorphism in the human zeta-globin gene complex.
Goodbourn, S E; Higgs, D R; Clegg, J B; Weatherall, D J
1983-01-01
The length polymorphism between the human zeta-globin gene and its pseudogene is caused by an allele-specific variation in the copy number of a tandemly repeating 36-base-pair sequence. This sequence is related to a tandemly repeated 14-base-pair sequence in the 5' flanking region of the human insulin gene, which is known to cause length polymorphism, and to a repetitive sequence in intervening sequence (IVS) 1 of the pseudo-zeta-globin gene. Evidence is presented that the latter is also of variable length, probably because of differences in the copy number of the tandem repeat. The homology between the three length polymorphisms may be an indication of the presence of a more widespread group of related sequences in the human genome, which might be useful for generalized linkage studies. PMID:6308667
Quantitative Real-Time Legionella PCR for Environmental Water Samples: Data Interpretation
Joly, Philippe; Falconnet, Pierre-Alain; André, Janine; Weill, Nicole; Reyrolle, Monique; Vandenesch, François; Maurin, Max; Etienne, Jerome; Jarraud, Sophie
2006-01-01
Quantitative Legionella PCRs targeting the 16S rRNA gene (specific for the genus Legionella) and the mip gene (specific for the species Legionella pneumophila) were applied to a total of 223 hot water system samples (131 in one laboratory and 92 in another laboratory) and 37 cooling tower samples (all in the same laboratory). The PCR results were compared with those of conventional culture. 16S rRNA gene PCR results were nonquantifiable for 2.8% of cooling tower samples and up to 39.1% of hot water system samples, and this was highly predictive of Legionella CFU counts below 250/liter. PCR cutoff values for identifying hot water system samples containing >103 CFU/liter legionellae were determined separately in each laboratory. The cutoffs differed widely between the laboratories and had sensitivities from 87.7 to 92.9% and specificities from 77.3 to 96.5%. The best specificity was obtained with mip PCR. PCR cutoffs could not be determined for cooling tower samples, as the results were highly variable and often high for culture-negative samples. Thus, quantitative Legionella PCR appears to be applicable to samples from hot water systems, but the positivity cutoff has to be determined in each laboratory. PMID:16597985
Leveraging Gene-Environment Interactions and Endotypes for Asthma Gene Discovery
Bønnelykke, Klaus; Ober, Carole
2016-01-01
Asthma is a heterogeneous clinical syndrome that includes subtypes of disease with different underlying causes and disease mechanisms. Asthma is caused by a complex interaction between genes and environmental exposures; early-life exposures in particular play an important role. Asthma is also heritable, and a number of susceptibility variants have been discovered in genome-wide association studies, although the known risk alleles explain only a small proportion of the heritability. In this review, we present evidence supporting the hypothesis that focusing on more specific asthma phenotypes, such as childhood asthma with severe exacerbations, and on relevant exposures that are involved in gene-environment interactions (GEIs), such as rhinovirus infections, will improve detection of asthma genes and our understanding of the underlying mechanisms. We will discuss the challenges of considering GEIs and the advantages of studying responses to asthma-associated exposures in clinical birth cohorts, as well as in cell models of GEIs, to dissect the context-specific nature of genotypic risks, to prioritize variants in genome-wide association studies, and to identify pathways involved in pathogenesis in subgroups of patients. We propose that such approaches, in spite of their many challenges, present great opportunities for better understanding of asthma pathogenesis and heterogeneity and, ultimately, for improving prevention and treatment of disease. PMID:26947980
Nelson-Coffey, S Katherine; Fritz, Megan M; Lyubomirsky, Sonja; Cole, Steve W
2017-07-01
Prosocial behavior is linked to longevity, but few studies have experimentally manipulated prosocial behavior to identify the causal mechanisms underlying this association. One possible mediating pathway involves changes in gene expression that may subsequently influence disease development or resistance. In the current study, we examined changes in a leukocyte gene expression profile known as the Conserved Transcriptional Response to Adversity (CTRA) in 159 adults who were randomly assigned for 4 weeks to engage in prosocial behavior directed towards specific others, prosocial behavior directed towards the world in general, self-focused kindness, or a neutral control task. Those randomized to prosocial behavior towards specific others demonstrated improvements (i.e., reductions) in leukocyte expression of CTRA indicator genes. No significant changes in CTRA gene expression were observed in the other 3 conditions. These findings suggest that prosocial behavior can causally impact leukocyte gene expression profiles in ways that might potentially help explain the previously observed health advantages associated with social ties. Copyright © 2017 Elsevier Ltd. All rights reserved.
USDA-ARS?s Scientific Manuscript database
Developmental ethanol exposure is able to induce Fetal Alcohol Spectrum Disorder (FASD) phenotypes in Japanese rice fish (Oryzias latipes). This study investigated possible differential expression of cannabinoid receptor (cnr) mRNAs during Japanese rice fish embryogenesis and variability to ethanol-...
Janek, Daniela; Zipperer, Alexander; Kulik, Andreas; Krismer, Bernhard; Peschel, Andreas
2016-01-01
The human nasal microbiota is highly variable and dynamic often enclosing major pathogens such as Staphylococcus aureus. The potential roles of bacteriocins or other mechanisms allowing certain bacterial clones to prevail in this nutrient-poor habitat have hardly been studied. Of 89 nasal Staphylococcus isolates, unexpectedly, the vast majority (84%) was found to produce antimicrobial substances in particular under habitat-specific stress conditions, such as iron limitation or exposure to hydrogen peroxide. Activity spectra were generally narrow but highly variable with activities against certain nasal members of the Actinobacteria, Proteobacteria, Firmicutes, or several groups of bacteria. Staphylococcus species and many other Firmicutes were insusceptible to most of the compounds. A representative bacteriocin was identified as a nukacin-related peptide whose inactivation reduced the capacity of the producer Staphylococcus epidermidis IVK45 to limit growth of other nasal bacteria. Of note, the bacteriocin genes were found on mobile genetic elements exhibiting signs of extensive horizontal gene transfer and rearrangements. Thus, continuously evolving bacteriocins appear to govern bacterial competition in the human nose and specific bacteriocins may become important agents for eradication of notorious opportunistic pathogens from human microbiota. PMID:27490492
Rizzo, Milena; Evangelista, Monica; Simili, Marcella; Mariani, Laura; Pitto, Letizia; Rainaldi, Giuseppe
2011-01-01
The life span (Hayflick limit) of primary mouse embryo fibroblasts (MEF) in culture is variable but it is still unclear if the escape of the Hayflick limit is also variable. To address this point MEF were expanded every fifteen days (6T15) instead of every three days (6T3) until they became immortal. With this protocol MEF lifespan was extended and immortalization accordingly delayed. By testing a panel of genes (p19ARF, p16, p21) and miRNAs (miR-20a, miR-21, miR-28, miR-290) related to primary MEF senescence, a switch of p21 from up to down regulation, the down regulation of specific miRNAs as well as a massive shift from diploidy to hyperdiploidy were observed in coincidence with the resumption of cell proliferation. Collectively, these data indicate that the inactivation of genes and miRNAs, important in controlling cell proliferation, might be determinant for the escape from the Hayflick limit. In support of this hypothesis was the finding that some of the down regulated miRNAs transfected in immortalized MEF inhibited cell proliferation thus displaying a tumor suppressor-like activity. PMID:21765199
Rizzo, Milena; Evangelista, Monica; Simili, Marcella; Mariani, Laura; Pitto, Letizia; Rainaldi, Giuseppe
2011-07-01
The life span (Hayflick limit) of primary mouse embryo fibroblasts (MEF) in culture is variable but it is still unclear if the escape of the Hayflick limit is also variable. To address this point MEF were expanded every fifteen days (6T15) instead of every three days (6T3) until they became immortal. With this protocol MEF lifespan was extended and immortalization accordingly delayed. By testing a panel of genes (p19ARF, p16, p21) and miRNAs (miR-20a, miR-21, miR-28, miR-290) related to primary MEF senescence, a switch of p21 from up to down regulation, the down regulation of specific miRNAs as well as a massive shift from diploidy to hyperdiploidy were observed in coincidence with the resumption of cell proliferation. Collectively, these data indicate that the inactivation of genes and miRNAs, important in controlling cell proliferation, might be determinant for the escape from the Hayflick limit. In support of this hypothesis was the finding that some of the down regulated miRNAs transfected in immortalized MEF inhibited cell proliferation thus displaying a tumor suppressor-like activity.
Wang, Xiao; Gu, Jinghua; Hilakivi-Clarke, Leena; Clarke, Robert; Xuan, Jianhua
2017-01-15
The advent of high-throughput DNA methylation profiling techniques has enabled the possibility of accurate identification of differentially methylated genes for cancer research. The large number of measured loci facilitates whole genome methylation study, yet posing great challenges for differential methylation detection due to the high variability in tumor samples. We have developed a novel probabilistic approach, D: ifferential M: ethylation detection using a hierarchical B: ayesian model exploiting L: ocal D: ependency (DM-BLD), to detect differentially methylated genes based on a Bayesian framework. The DM-BLD approach features a joint model to capture both the local dependency of measured loci and the dependency of methylation change in samples. Specifically, the local dependency is modeled by Leroux conditional autoregressive structure; the dependency of methylation changes is modeled by a discrete Markov random field. A hierarchical Bayesian model is developed to fully take into account the local dependency for differential analysis, in which differential states are embedded as hidden variables. Simulation studies demonstrate that DM-BLD outperforms existing methods for differential methylation detection, particularly when the methylation change is moderate and the variability of methylation in samples is high. DM-BLD has been applied to breast cancer data to identify important methylated genes (such as polycomb target genes and genes involved in transcription factor activity) associated with breast cancer recurrence. A Matlab package of DM-BLD is available at http://www.cbil.ece.vt.edu/software.htm CONTACT: Xuan@vt.eduSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kipps, T.J.; Fong, S.; Tomhave, E.
Malignant B lymphocytes from several patients with chronic lymphocytic leukemia (CLL) were examined for reactivity with murine monoclonal antibody 17.109. This antibody, prepared against the rheumatoid factor (RF) paraprotein Sie, recognizes a cross reactive idiotype on 48% of human IgM RF paraproteins, but does not react with IgM paraproteins without RF activity or substantially with normal pooled immunoglobulin. The 17.109-reactive idiotype is a marker for a kappa III variable-region gene, designated V/sub kappa/RF, that is conserved in outbred human populations. In a limited study of 31 CLL patients, the leukemic cells from 5 of 20 patients with kappa light chain-expressingmore » CLL were recognized by the 17.109 monoclonal antibody. Despite having malignant cells specifically reactive with this antibody, patients with 17.109-positive CLL did not have elevated serum levels of circulating antibody bearing 17.109-reactive determinants. Total RNAs isolated from the CLL B lymphocytes, or from hybridomas produced by fusing the CLL cells with the WI-L2-729-HF/sub 2/ cell line, were fractionated electrophoretically and examined by blot hybridization. Under stringent hybridization conditions capable of discerning a single base-pair mismatch, RNA from the 17.109-idiotype-positive CLL cells hybridized to synthetic oligonucleotide probes corresponding to framework and complementary-determining regions in the V/sub kappa/RF gene. The high frequency of the 17.109-associated idiotype and the V/sub kappa/RF gene in CLL suggests that the disease may arise from B lymphocytes that express a restricted set of inherited immunoglobulin variable-region genes with little or no somatic mutation.« less
Association study of ERβ, AR, and CYP19A1 genes and MtF transsexualism.
Fernández, Rosa; Esteva, Isabel; Gómez-Gil, Esther; Rumbo, Teresa; Almaraz, Mari Cruz; Roda, Ester; Haro-Mora, Juan-Jesús; Guillamón, Antonio; Pásaro, Eduardo
2014-12-01
The etiology of male-to-female (MtF) transsexualism is unknown. Both genetic and neurological factors may play an important role. To investigate the possible influence of the genetic factor on the etiology of MtF transsexualism. We carried out a cytogenetic and molecular analysis in 442 MtFs and 473 healthy, age- and geographical origin-matched XY control males. The karyotype was investigated by G-banding and by high-density array in the transsexual group. The molecular analysis involved three tandem variable regions of genes estrogen receptor β (ERβ) (CA tandem repeats in intron 5), androgen receptor (AR) (CAG tandem repeats in exon 1), and CYP19A1 (TTTA tandem repeats in intron 4). The allele and genotype frequencies, after division into short and long alleles, were obtained. We investigated the association between genotype and transsexualism by performing a molecular analysis of three variable regions of genes ERβ, AR, and CYP19A1 in 915 individuals (442 MtFs and 473 control males). Most MtFs showed an unremarkable 46,XY karyotype (97.96%). No specific chromosome aberration was associated with MtF transsexualism, and prevalence of aneuploidy (2.04%) was slightly higher than in the general population. Molecular analyses showed no significant difference in allelic or genotypic distribution of the genes examined between MtFs and controls. Moreover, molecular findings presented no evidence of an association between the sex hormone-related genes (ERβ, AR, and CYP19A1) and MtF transsexualism. The study suggests that the analysis of karyotype provides limited information in these subjects. Variable regions analyzed from ERβ, AR, and CYP19A1 are not associated with MtF transsexualism. Nevertheless, this does not exclude other polymorphic regions not analyzed. © 2014 International Society for Sexual Medicine.
Zahra, Zamani; Reza, Razavi Mohammad; Mehdi, Assmar; Sedigheh, Sadeghi; Fatemeh, Pourfallah; Nikoo, Nasoohi; Ashraf, Sheibani; Mohammad, Raisi
2007-02-01
Plasmodiumfalciparum merozoite surface protein-1 (MSP-1) shows extensive antigenic diversity. This is due to the presence of seven variable blocks, five semi-conserved and also five conserved blocks. The variable blocks in the MSP-1 gene are principally dimorphic, displaying either K1 or MAD20 type; except for the block 2 region which is represented by three alleles, an RO33 type in addition to the other two. Allelic diversity is reported to be generated by intra-genic recombination between the variable blocks. A study of allelic variation of MSP-1 gene in Plasmodium falciparum was carried out in the southern province of Sistan Baluchistan in Iran in 2001-2003. Samples were obtained from 30 febrile patients and DNA was extracted and association types between blocks 2 and 6 was identified on each block using specific primers and compared with those from Vietnam, Brazil and Africa. The association types obtained, were similar though less in number than the ones from Vietnam, but more than those from Africa and Brazil.
Abramenko, Iryna; Bilous, Nadia; Chumak, Anatoliy; Davidova, Ekaterina; Kryachok, Iryna; Martina, Zoya; Nechaev, Stanislav; Dyagil, Iryna; Bazyka, Dmytriy; Bebeshko, Vladimir
2008-04-01
Clinical data and immunoglobulin variable heavy chain (IgVH) gene configuration were analyzed in 47 CLL patients, exposed to ionizing radiation (IR) due to Chernobyl NPP accident, and 141 non-exposed patients. Clean-up workers of the second quarter of 1986 (n=19) were picked out as separate group with the highest number of unmutated cases (94.4%), increased usage of IgVH1-69 (33.3%) and IgVH3-21 (16.7%) genes, high frequency of secondary solid tumors (6 cases) and Richter transformation (4 cases). These preliminary data suggest that CLL in the most suffered contingent due to Chernobyl NPP accident might have some specific features.