Sample records for gene-specific variability improves

  1. Upregulating endogenous genes by an RNA-programmable artificial transactivator

    PubMed Central

    Fimiani, Cristina; Goina, Elisa; Mallamaci, Antonello

    2015-01-01

    To promote expression of endogenous genes ad libitum, we developed a novel, programmable transcription factor prototype. Kept together via an MS2 coat protein/RNA interface, it includes a fixed, polypeptidic transactivating domain and a variable RNA domain that recognizes the desired gene. Thanks to this device, we specifically upregulated five genes, in cell lines and primary cultures of murine pallial precursors. Gene upregulation was small, however sufficient to robustly inhibit neuronal differentiation. The transactivator interacted with target gene chromatin via its RNA cofactor. Its activity was restricted to cells in which the target gene is normally transcribed. Our device might be useful for specific applications. However for this purpose, it will require an improvement of its transactivation power as well as a better characterization of its target specificity and mechanism of action. PMID:26152305

  2. Gene transfer of Hodgkin cell lines via multivalent anti-CD30 scFv displaying bacteriophage.

    PubMed

    Chung, Yoon-Suk A; Sabel, Katja; Krönke, Martin; Klimka, Alexander

    2008-04-16

    The display of binding ligands, such as recombinant antibody fragments, on the surface of filamentous phage makes it possible to specifically attach these phage particles to target cells. After uptake of the phage, their internal single-stranded DNA is processed by the host cell, which allows transient expression of an encoded eukaryotic gene cassette. This opens the possibility to use bacteriophage as vectors for targeted gene therapy, although the transduction efficiency is very low. Here we demonstrate the display of an anti-CD30 single chain variable fragment fused to the major coat protein pVIII on the surface of bacteriophage. These phage particles showed an improved binding and transduction efficiency of CD30 positive Hodgkin-lymphoma cells, compared to bacteriophage with the anti-CD30 single chain variable fragment fused to the minor coat protein pIII. We can conclude from the results that the postulated multivalency of the anti-CD30-pVIII displaying bacteriophage combined with disseminated display of the anti-CD30 scFv on the whole particle surface is responsible for the improved gene transfer rate. These results mark an important step towards the use of phage particles as a cheap and safe gene transfer vehicle for the gene delivery of the desired target cells via their specific surface receptors.

  3. High intraspecific genome diversity in the model arbuscular mycorrhizal symbiont Rhizophagus irregularis.

    PubMed

    Chen, Eric C H; Morin, Emmanuelle; Beaudet, Denis; Noel, Jessica; Yildirir, Gokalp; Ndikumana, Steve; Charron, Philippe; St-Onge, Camille; Giorgi, John; Krüger, Manuela; Marton, Timea; Ropars, Jeanne; Grigoriev, Igor V; Hainaut, Matthieu; Henrissat, Bernard; Roux, Christophe; Martin, Francis; Corradi, Nicolas

    2018-01-22

    Arbuscular mycorrhizal fungi (AMF) are known to improve plant fitness through the establishment of mycorrhizal symbioses. Genetic and phenotypic variations among closely related AMF isolates can significantly affect plant growth, but the genomic changes underlying this variability are unclear. To address this issue, we improved the genome assembly and gene annotation of the model strain Rhizophagus irregularis DAOM197198, and compared its gene content with five isolates of R. irregularis sampled in the same field. All isolates harbor striking genome variations, with large numbers of isolate-specific genes, gene family expansions, and evidence of interisolate genetic exchange. The observed variability affects all gene ontology terms and PFAM protein domains, as well as putative mycorrhiza-induced small secreted effector-like proteins and other symbiosis differentially expressed genes. High variability is also found in active transposable elements. Overall, these findings indicate a substantial divergence in the functioning capacity of isolates harvested from the same field, and thus their genetic potential for adaptation to biotic and abiotic changes. Our data also provide a first glimpse into the genome diversity that resides within natural populations of these symbionts, and open avenues for future analyses of plant-AMF interactions that link AMF genome variation with plant phenotype and fitness. © 2018 The Authors. New Phytologist © 2018 New Phytologist Trust.

  4. Disease-modifying genetic factors in cystic fibrosis.

    PubMed

    Marson, Fernando A L

    2018-05-01

    To compile data from the past 10 years regarding the role of modifying genes in cystic fibrosis (CF). CF is a model disease for understanding of the action of modifying genes. Although it is a monogenic (CFTR) autosomal recessive disease, CF presents with wide phenotypic variability. In CF, variability occurs with different intensity among patients by each organ, being organ-specific, resulting from the mutual interaction of environmental and genetic factors, including CFTR mutations and various other genes, most of which are associated with inflammatory processes. In individuals, using precision medicine, gene modification studies have revealed individualized responses to drugs depending on particular CFTR mutations and modifying genes, most of which are alternative ion channels. Studies of modifying genes in CF allow: understanding of clinical variability among patients with the same CFTR genotype; evaluation of precision medicine; understanding of environmental and genetic effects at the organ level; understanding the involvement of genetic variants in inflammatory responses; improvements in genetic counseling; understanding the involvement of genetic variants in inflammatory responses in lung diseases, such as asthma; and understanding the individuality of the person with the disease.

  5. The AGT Gene M235T Polymorphism and Response of Power-Related Variables to Aerobic Training

    PubMed Central

    Aleksandra, Zarębska; Zbigniew, Jastrzębski; Waldemar, Moska; Agata, Leońska-Duniec; Mariusz, Kaczmarczyk; Marek, Sawczuk; Agnieszka, Maciejewska-Skrendo; Piotr, Żmijewski; Krzysztof, Ficek; Grzegorz, Trybek; Ewelina, Lulińska-Kuklik; Semenova, Ekaterina A.; Ahmetov, Ildus I.; Paweł, Cięszczyk

    2016-01-01

    The C allele of the M235T (rs699) polymorphism of the AGT gene correlates with higher levels of angiotensin II and has been associated with power and strength sport performance. The aim of the study was to investigate whether or not selected power-related variables and their response to a 12-week program of aerobic dance training are modulated by the AGT M235T genotype in healthy participants. Two hundred and one Polish Caucasian women aged 21 ± 1 years met the inclusion criteria and were included in the study. All women completed a 12-week program of low and high impact aerobics. Wingate peak power and total work capacity, 5 m, 10 m, and 30 m running times and jump height and jump power were determined before and after the training programme. All power-related variables improved significantly in response to aerobic dance training. We found a significant association between the M235T polymorphism and jump-based variables (squat jump (SJ) height, p = 0.005; SJ power, p = 0.015; countermovement jump height, p = 0.025; average of 10 countermovement jumps with arm swing (ACMJ) height, p = 0.001; ACMJ power, p = 0.035). Specifically, greater improvements were observed in the C allele carriers in comparison with TT homozygotes. In conclusion, aerobic dance, one of the most commonly practiced adult fitness activities in the world, provides sufficient training stimuli for augmenting the explosive strength necessary to increase vertical jump performance. The AGT gene M235T polymorphism seems to be not only a candidate gene variant for power/strength related phenotypes, but also a genetic marker for predicting response to training. Key points Aerobic dance provides sufficient training stimuli for the improvement of explosive power. The AGT gene M235T polymorphism is associated with individual variation in the change of power-related phenotypes in response to aerobic dance training. The C allele carriers of the AGT gene M235T polymorphism show greater improvements of jump-based variables in comparison with TT homozygotes. PMID:27928207

  6. Genetic Influence on Slope Variability in a Childhood Reflexive Attention Task.

    PubMed

    Lundwall, Rebecca A; Watkins, Jeffrey K

    2015-01-01

    Individuals are not perfectly consistent, and interindividual variability is a common feature in all varieties of human behavior. Some individuals respond more variably than others, however, and this difference may be important to understanding how the brain works. In this paper, we explore genetic contributions to response time (RT) slope variability on a reflexive attention task. We are interested in such variability because we believe it is an important part of the overall picture of attention that, if understood, has the potential to improve intervention for those with attentional deficits. Genetic association studies are valuable in discovering biological pathways of variability and several studies have found such associations with a sustained attention task. Here, we expand our knowledge to include a reflexive attention task. We ask whether specific candidate genes are associated with interindividual variability on a childhood reflexive attention task in 9-16 year olds. The genetic makers considered are on 11 genes: APOE, BDNF, CHRNA4, COMT, DRD4, HTR4, IGF2, MAOA, SLC5A7, SLC6A3, and SNAP25. We find significant associations with variability with markers on nine and we discuss the results in terms of neurotransmitters associated with each gene and the characteristics of the associated measures from the reflexive attention task.

  7. Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis

    PubMed Central

    Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun

    2013-01-01

    The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of transcriptome. Many different human tissue/cell transcriptome datasets coming from RNA-Seq technology are available on public data resource. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is higher related to physiological condition rather than tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in gene expression level and variation, we firstly apply improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (a HK gene is specific in cancer group, while not in normal group) are expressed higher and more variable in cancer condition than in normal condition. Cancer-associated HK genes prefer to AT-rich genes, and they are enriched in cell cycle regulation related functions and constitute some cancer signatures. The expression of large genes is also avoided in cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867

  8. Improving RNA-Seq expression estimation by modeling isoform- and exon-specific read sequencing rate.

    PubMed

    Liu, Xuejun; Shi, Xinxin; Chen, Chunlin; Zhang, Li

    2015-10-16

    The high-throughput sequencing technology, RNA-Seq, has been widely used to quantify gene and isoform expression in the study of transcriptome in recent years. Accurate expression measurement from the millions or billions of short generated reads is obstructed by difficulties. One is ambiguous mapping of reads to reference transcriptome caused by alternative splicing. This increases the uncertainty in estimating isoform expression. The other is non-uniformity of read distribution along the reference transcriptome due to positional, sequencing, mappability and other undiscovered sources of biases. This violates the uniform assumption of read distribution for many expression calculation approaches, such as the direct RPKM calculation and Poisson-based models. Many methods have been proposed to address these difficulties. Some approaches employ latent variable models to discover the underlying pattern of read sequencing. However, most of these methods make bias correction based on surrounding sequence contents and share the bias models by all genes. They therefore cannot estimate gene- and isoform-specific biases as revealed by recent studies. We propose a latent variable model, NLDMseq, to estimate gene and isoform expression. Our method adopts latent variables to model the unknown isoforms, from which reads originate, and the underlying percentage of multiple spliced variants. The isoform- and exon-specific read sequencing biases are modeled to account for the non-uniformity of read distribution, and are identified by utilizing the replicate information of multiple lanes of a single library run. We employ simulation and real data to verify the performance of our method in terms of accuracy in the calculation of gene and isoform expression. Results show that NLDMseq obtains competitive gene and isoform expression compared to popular alternatives. Finally, the proposed method is applied to the detection of differential expression (DE) to show its usefulness in the downstream analysis. The proposed NLDMseq method provides an approach to accurately estimate gene and isoform expression from RNA-Seq data by modeling the isoform- and exon-specific read sequencing biases. It makes use of a latent variable model to discover the hidden pattern of read sequencing. We have shown that it works well in both simulations and real datasets, and has competitive performance compared to popular methods. The method has been implemented as a freely available software which can be found at https://github.com/PUGEA/NLDMseq.

  9. The natural dietary genistein boosts bacteriophage-mediated cancer cell killing by improving phage-targeted tumor cell transduction.

    PubMed

    Tsafa, Effrosyni; Al-Bahrani, Mariam; Bentayebi, Kaoutar; Przystal, Justyna; Suwan, Keittisak; Hajitou, Amin

    2016-08-09

    Gene therapy has long been regarded as a promising treatment for cancer. However, cancer gene therapy is still facing the challenge of targeting gene delivery vectors specifically to tumors when administered via clinically acceptable non-invasive systemic routes (i.e. intravenous). The bacteria virus, bacteriophage (phage), represents a new generation of promising vectors in systemic gene delivery since their targeting can be achieved through phage capsid display ligands, which enable them to home to specific tumor receptors without the need to ablate any native eukaryotic tropism. We have previously reported a tumor specific bacteriophage vector named adeno-associated virus/phage, or AAVP, in which gene expression is under a recombinant human rAAV2 virus genome targeted to tumors via a ligand-directed phage capsid. However, cancer gene therapy with this tumor-targeted vector achieved variable outcomes ranging from tumor regression to no effect in both experimental and natural preclinical models. Herein, we hypothesized that combining the natural dietary genistein, with proven anticancer activity, would improve bacteriophage anticancer safe therapy. We show that combination treatment with genistein and AAVP increased targeted cancer cell killing by AAVP carrying the gene for Herpes simplex virus thymidine kinase (HSVtk) in 2D tissue cultures and 3D tumor spheroids. We found this increased tumor cell killing was associated with enhanced AAVP-mediated gene expression. Next, we established that genistein protects AAVP against proteasome degradation and enhances vector genome accumulation in the nucleus. Combination of genistein and phage-guided virotherapy is a safe and promising strategy that should be considered in anticancer therapy with AAVP.

  10. Final technical report

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Edward DeLong

    2011-10-07

    Our overarching goals in this project were to: Develop and improve high-throughput sequencing methods and analytical approaches for quantitative analyses of microbial gene expression at the Hawaii Ocean Time Series Station and the Bermuda Atlantic Time Series Station; Conduct field analyses following gene expression patterns in picoplankton microbial communities in general, and Prochlorococcus flow sorted from that community, as they respond to different environmental variables (light, macronutrients, dissolved organic carbon), that are predicted to influence activity, productivity, and carbon cycling; Use the expression analyses of flow sorted Prochlorococcus to identify horizontally transferred genes and gene products, in particular those thatmore » are located in genomic islands and likely to confer habitat-specific fitness advantages; Use the microbial community gene expression data that we generate to gain insights, and test hypotheses, about the variability, genomic context, activity and function of as yet uncharacterized gene products, that appear highly expressed in the environment. We achieved the above goals, and even more over the course of the project. This includes a number of novel methodological developments, as well as the standardization of microbial community gene expression analyses in both field surveys, and experimental modalities. The availability of these methods, tools and approaches is changing current practice in microbial community analyses.« less

  11. Comparison of taxon-specific versus general locus sets for targeted sequence capture in plant phylogenomics.

    PubMed

    Chau, John H; Rahfeldt, Wolfgang A; Olmstead, Richard G

    2018-03-01

    Targeted sequence capture can be used to efficiently gather sequence data for large numbers of loci, such as single-copy nuclear loci. Most published studies in plants have used taxon-specific locus sets developed individually for a clade using multiple genomic and transcriptomic resources. General locus sets can also be developed from loci that have been identified as single-copy and have orthologs in large clades of plants. We identify and compare a taxon-specific locus set and three general locus sets (conserved ortholog set [COSII], shared single-copy nuclear [APVO SSC] genes, and pentatricopeptide repeat [PPR] genes) for targeted sequence capture in Buddleja (Scrophulariaceae) and outgroups. We evaluate their performance in terms of assembly success, sequence variability, and resolution and support of inferred phylogenetic trees. The taxon-specific locus set had the most target loci. Assembly success was high for all locus sets in Buddleja samples. For outgroups, general locus sets had greater assembly success. Taxon-specific and PPR loci had the highest average variability. The taxon-specific data set produced the best-supported tree, but all data sets showed improved resolution over previous non-sequence capture data sets. General locus sets can be a useful source of sequence capture targets, especially if multiple genomic resources are not available for a taxon.

  12. Comprehensive Interrogation of Natural TALE DNA Binding Modules and Transcriptional Repressor Domains

    PubMed Central

    Cong, Le; Zhou, Ruhong; Kuo, Yu-chi; Cunniff, Margaret; Zhang, Feng

    2012-01-01

    Transcription activator-like effectors (TALE) are sequence-specific DNA binding proteins that harbor modular, repetitive DNA binding domains. TALEs have enabled the creation of customizable designer transcriptional factors and sequence-specific nucleases for genome engineering. Here we report two improvements of the TALE toolbox for achieving efficient activation and repression of endogenous gene expression in mammalian cells. We show that the naturally occurring repeat variable diresidue (RVD) Asn-His (NH) has high biological activity and specificity for guanine, a highly prevalent base in mammalian genomes. We also report an effective TALE transcriptional repressor architecture for targeted inhibition of transcription in mammalian cells. These findings will improve the precision and effectiveness of genome engineering that can be achieved using TALEs. PMID:22828628

  13. The natural dietary genistein boosts bacteriophage-mediated cancer cell killing by improving phage-targeted tumor cell transduction

    PubMed Central

    Tsafa, Effrosyni; Al-Bahrani, Mariam; Bentayebi, Kaoutar; Przystal, Justyna; Suwan, Keittisak; Hajitou, Amin

    2016-01-01

    Gene therapy has long been regarded as a promising treatment for cancer. However, cancer gene therapy is still facing the challenge of targeting gene delivery vectors specifically to tumors when administered via clinically acceptable non-invasive systemic routes (i.e. intravenous). The bacteria virus, bacteriophage (phage), represents a new generation of promising vectors in systemic gene delivery since their targeting can be achieved through phage capsid display ligands, which enable them to home to specific tumor receptors without the need to ablate any native eukaryotic tropism. We have previously reported a tumor specific bacteriophage vector named adeno-associated virus/phage, or AAVP, in which gene expression is under a recombinant human rAAV2 virus genome targeted to tumors via a ligand-directed phage capsid. However, cancer gene therapy with this tumor-targeted vector achieved variable outcomes ranging from tumor regression to no effect in both experimental and natural preclinical models. Herein, we hypothesized that combining the natural dietary genistein, with proven anticancer activity, would improve bacteriophage anticancer safe therapy. We show that combination treatment with genistein and AAVP increased targeted cancer cell killing by AAVP carrying the gene for Herpes simplex virus thymidine kinase (HSVtk) in 2D tissue cultures and 3D tumor spheroids. We found this increased tumor cell killing was associated with enhanced AAVP-mediated gene expression. Next, we established that genistein protects AAVP against proteasome degradation and enhances vector genome accumulation in the nucleus. Combination of genistein and phage-guided virotherapy is a safe and promising strategy that should be considered in anticancer therapy with AAVP. PMID:27437775

  14. The AGT Gene M235T Polymorphism and Response of Power-Related Variables to Aerobic Training.

    PubMed

    Aleksandra, Zarębska; Zbigniew, Jastrzębski; Waldemar, Moska; Agata, Leońska-Duniec; Mariusz, Kaczmarczyk; Marek, Sawczuk; Agnieszka, Maciejewska-Skrendo; Piotr, Żmijewski; Krzysztof, Ficek; Grzegorz, Trybek; Ewelina, Lulińska-Kuklik; Semenova, Ekaterina A; Ahmetov, Ildus I; Paweł, Cięszczyk

    2016-12-01

    The C allele of the M235T (rs699) polymorphism of the AGT gene correlates with higher levels of angiotensin II and has been associated with power and strength sport performance. The aim of the study was to investigate whether or not selected power-related variables and their response to a 12-week program of aerobic dance training are modulated by the AGT M235T genotype in healthy participants. Two hundred and one Polish Caucasian women aged 21 ± 1 years met the inclusion criteria and were included in the study. All women completed a 12-week program of low and high impact aerobics. Wingate peak power and total work capacity, 5 m, 10 m, and 30 m running times and jump height and jump power were determined before and after the training programme. All power-related variables improved significantly in response to aerobic dance training. We found a significant association between the M235T polymorphism and jump-based variables (squat jump (SJ) height, p = 0.005; SJ power, p = 0.015; countermovement jump height, p = 0.025; average of 10 countermovement jumps with arm swing (ACMJ) height, p = 0.001; ACMJ power, p = 0.035). Specifically, greater improvements were observed in the C allele carriers in comparison with TT homozygotes. In conclusion, aerobic dance, one of the most commonly practiced adult fitness activities in the world, provides sufficient training stimuli for augmenting the explosive strength necessary to increase vertical jump performance. The AGT gene M235T polymorphism seems to be not only a candidate gene variant for power/strength related phenotypes, but also a genetic marker for predicting response to training.

  15. Differentiation of Xylella fastidiosa Strains via Multilocus Sequence Analysis of Environmentally Mediated Genes (MLSA-E)

    PubMed Central

    Parker, Jennifer K.; Havird, Justin C.

    2012-01-01

    Isolates of the plant pathogen Xylella fastidiosa are genetically very similar, but studies on their biological traits have indicated differences in virulence and infection symptomatology. Taxonomic analyses have identified several subspecies, and phylogenetic analyses of housekeeping genes have shown broad host-based genetic differences; however, results are still inconclusive for genetic differentiation of isolates within subspecies. This study employs multilocus sequence analysis of environmentally mediated genes (MLSA-E; genes influenced by environmental factors) to investigate X. fastidiosa relationships and differentiate isolates with low genetic variability. Potential environmentally mediated genes, including host colonization and survival genes related to infection establishment, were identified a priori. The ratio of the rate of nonsynonymous substitutions to the rate of synonymous substitutions (dN/dS) was calculated to select genes that may be under increased positive selection compared to previously studied housekeeping genes. Nine genes were sequenced from 54 X. fastidiosa isolates infecting different host plants across the United States. Results of maximum likelihood (ML) and Bayesian phylogenetic (BP) analyses are in agreement with known X. fastidiosa subspecies clades but show novel within-subspecies differentiation, including geographic differentiation, and provide additional information regarding host-based isolate variation and specificity. dN/dS ratios of environmentally mediated genes, though <1 due to high sequence similarity, are significantly greater than housekeeping gene dN/dS ratios and correlate with increased sequence variability. MLSA-E can more precisely resolve relationships between closely related bacterial strains with low genetic variability, such as X. fastidiosa isolates. Discovering the genetic relationships between X. fastidiosa isolates will provide new insights into the epidemiology of populations of X. fastidiosa, allowing improved disease management in economically important crops. PMID:22194287

  16. Differentiation of Xylella fastidiosa strains via multilocus sequence analysis of environmentally mediated genes (MLSA-E).

    PubMed

    Parker, Jennifer K; Havird, Justin C; De La Fuente, Leonardo

    2012-03-01

    Isolates of the plant pathogen Xylella fastidiosa are genetically very similar, but studies on their biological traits have indicated differences in virulence and infection symptomatology. Taxonomic analyses have identified several subspecies, and phylogenetic analyses of housekeeping genes have shown broad host-based genetic differences; however, results are still inconclusive for genetic differentiation of isolates within subspecies. This study employs multilocus sequence analysis of environmentally mediated genes (MLSA-E; genes influenced by environmental factors) to investigate X. fastidiosa relationships and differentiate isolates with low genetic variability. Potential environmentally mediated genes, including host colonization and survival genes related to infection establishment, were identified a priori. The ratio of the rate of nonsynonymous substitutions to the rate of synonymous substitutions (dN/dS) was calculated to select genes that may be under increased positive selection compared to previously studied housekeeping genes. Nine genes were sequenced from 54 X. fastidiosa isolates infecting different host plants across the United States. Results of maximum likelihood (ML) and Bayesian phylogenetic (BP) analyses are in agreement with known X. fastidiosa subspecies clades but show novel within-subspecies differentiation, including geographic differentiation, and provide additional information regarding host-based isolate variation and specificity. dN/dS ratios of environmentally mediated genes, though <1 due to high sequence similarity, are significantly greater than housekeeping gene dN/dS ratios and correlate with increased sequence variability. MLSA-E can more precisely resolve relationships between closely related bacterial strains with low genetic variability, such as X. fastidiosa isolates. Discovering the genetic relationships between X. fastidiosa isolates will provide new insights into the epidemiology of populations of X. fastidiosa, allowing improved disease management in economically important crops.

  17. DNA methylation and expression of proopiomelanocortin (POMC) gene in the hypothalamus of three-week-old chickens show sex-specific differences.

    PubMed

    Rancourt, Rebecca C; Schellong, Karen; Tzschentke, Barbara; Henrich, Wolfgang; Plagemann, Andreas

    2018-06-01

    Increased availability and improved sequence annotation of the chicken ( Gallus gallus f.  domestica ) genome have sparked interest in the bird as a model system to investigate translational embryonic development and health/disease outcomes. However, the epigenetics of this bird genome remain unclear. The aim of this study was to determine the levels of gene expression and DNA methylation at the proopiomelanocortin ( POMC ) gene in the hypothalamus of 3-week-old chickens. POMC is a key player in the control of the stress response, food intake, and metabolism. DNA methylation of the promoter, CpG island, and gene body regions of POMC were measured. Our data illustrate the pattern, variability, and functionality of DNA methylation for POMC expression in the chicken. Our findings show correlation of methylation pattern and gene expression along with sex-specific differences in POMC . Overall, these novel data highlight the promising potential of the chicken as a model and also the need for breeders and researchers to consider sex ratios in their studies.

  18. Removing technical variability in RNA-seq data using conditional quantile normalization.

    PubMed

    Hansen, Kasper D; Irizarry, Rafael A; Wu, Zhijin

    2012-04-01

    The ability to measure gene expression on a genome-wide scale is one of the most promising accomplishments in molecular biology. Microarrays, the technology that first permitted this, were riddled with problems due to unwanted sources of variability. Many of these problems are now mitigated, after a decade's worth of statistical methodology development. The recently developed RNA sequencing (RNA-seq) technology has generated much excitement in part due to claims of reduced variability in comparison to microarrays. However, we show that RNA-seq data demonstrate unwanted and obscuring variability similar to what was first observed in microarrays. In particular, we find guanine-cytosine content (GC-content) has a strong sample-specific effect on gene expression measurements that, if left uncorrected, leads to false positives in downstream results. We also report on commonly observed data distortions that demonstrate the need for data normalization. Here, we describe a statistical methodology that improves precision by 42% without loss of accuracy. Our resulting conditional quantile normalization algorithm combines robust generalized regression to remove systematic bias introduced by deterministic features such as GC-content and quantile normalization to correct for global distortions.

  19. Technical variables in high-throughput miRNA expression profiling: much work remains to be done.

    PubMed

    Nelson, Peter T; Wang, Wang-Xia; Wilfred, Bernard R; Tang, Guiliang

    2008-11-01

    MicroRNA (miRNA) gene expression profiling has provided important insights into plant and animal biology. However, there has not been ample published work about pitfalls associated with technical parameters in miRNA gene expression profiling. One source of pertinent information about technical variables in gene expression profiling is the separate and more well-established literature regarding mRNA expression profiling. However, many aspects of miRNA biochemistry are unique. For example, the cellular processing and compartmentation of miRNAs, the differential stability of specific miRNAs, and aspects of global miRNA expression regulation require specific consideration. Additional possible sources of systematic bias in miRNA expression studies include the differential impact of pre-analytical variables, substrate specificity of nucleic acid processing enzymes used in labeling and amplification, and issues regarding new miRNA discovery and annotation. We conclude that greater focus on technical parameters is required to bolster the validity, reliability, and cultural credibility of miRNA gene expression profiling studies.

  20. Quantification of histone modification ChIP-seq enrichment for data mining and machine learning applications

    PubMed Central

    2011-01-01

    Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made. Results Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region). Conclusion The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions. PMID:21834981

  1. Epi-fingerprinting and epi-interventions for improved crop production and food quality

    PubMed Central

    Rodríguez López, Carlos M.; Wilkinson, Mike J.

    2015-01-01

    Increasing crop production at a time of rapid climate change represents the greatest challenge facing contemporary agricultural research. Our understanding of the genetic control of yield derives from controlled field experiments designed to minimize environmental variance. In spite of these efforts there is substantial residual variability among plants attributable to Genotype × Environment interactions. Recent advances in the field of epigenetics have revealed a plethora of gene control mechanisms that could account for much of this unassigned variation. These systems act as a regulatory interface between the perception of the environment and associated alterations in gene expression. Direct intervention of epigenetic control systems hold the enticing promise of creating new sources of variability that could enhance crop performance. Equally, understanding the relationship between various epigenetic states and responses of the crop to specific aspects of the growing environment (epigenetic fingerprinting) could allow for a more tailored approach to plant agronomy. In this review, we explore the many ways in which epigenetic interventions and epigenetic fingerprinting can be deployed for the improvement of crop production and quality. PMID:26097484

  2. Characterization and Regulation of Aquaporin Genes of Sorghum [Sorghum bicolor (L.) Moench] in Response to Waterlogging Stress

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kadam, Suhas; Abril, Alejandra; Dhanapal, Arun P.

    Waterlogging is a significant environmental constraint to crop production, and a better understanding of plant responses is critical for the improvement of crop tolerance to waterlogged soils. Aquaporins (AQPs) are a class of channel-forming proteins that play an important role in water transport in plants. Our study aimed to examine the regulation of AQP genes under waterlogging stress and to characterize the genetic variability of AQP genes in sorghum (Sorghum bicolor). Transcriptional profiling of AQP genes in response to waterlogging stress in nodal root tips and nodal root basal regions of two tolerant and two sensitive sorghum genotypes at 18more » and 96 h after waterlogging stress imposition revealed significant gene-specific pattern with regard to genotype, root tissue sample, and time point. For some tissue sample and time point combinations, PIP2-6, PIP2-7, TIP2-2, TIP4-4, and TIP5-1 expression was differentially regulated in tolerant compared to sensitive genotypes. The differential response of these AQP genes suggests that they may play a tissue specific role in mitigating waterlogging stress. Genetic analysis of sorghum revealed that AQP genes were clustered into the same four subfamilies as in maize (Zea mays) and rice (Oryza sativa) and that residues determining the AQP channel specificity were largely conserved across species. Single nucleotide polymorphism (SNP) data from 50 sorghum accessions were used to build an AQP gene-based phylogeny of the haplotypes. Phylogenetic analysis based on single nucleotide polymorphisms of sorghum AQP genes placed the tolerant and sensitive genotypes used for the expression study in distinct groups. Expression analyses suggested that selected AQPs may play a pivotal role in sorghum tolerance to water logging stress. Furthermore experimentation is needed to verify their role and to leverage phylogenetic analyses and AQP expression data to improve water logging tolerance in sorghum.« less

  3. Characterization and Regulation of Aquaporin Genes of Sorghum [Sorghum bicolor (L.) Moench] in Response to Waterlogging Stress

    DOE PAGES

    Kadam, Suhas; Abril, Alejandra; Dhanapal, Arun P.; ...

    2017-05-30

    Waterlogging is a significant environmental constraint to crop production, and a better understanding of plant responses is critical for the improvement of crop tolerance to waterlogged soils. Aquaporins (AQPs) are a class of channel-forming proteins that play an important role in water transport in plants. Our study aimed to examine the regulation of AQP genes under waterlogging stress and to characterize the genetic variability of AQP genes in sorghum (Sorghum bicolor). Transcriptional profiling of AQP genes in response to waterlogging stress in nodal root tips and nodal root basal regions of two tolerant and two sensitive sorghum genotypes at 18more » and 96 h after waterlogging stress imposition revealed significant gene-specific pattern with regard to genotype, root tissue sample, and time point. For some tissue sample and time point combinations, PIP2-6, PIP2-7, TIP2-2, TIP4-4, and TIP5-1 expression was differentially regulated in tolerant compared to sensitive genotypes. The differential response of these AQP genes suggests that they may play a tissue specific role in mitigating waterlogging stress. Genetic analysis of sorghum revealed that AQP genes were clustered into the same four subfamilies as in maize (Zea mays) and rice (Oryza sativa) and that residues determining the AQP channel specificity were largely conserved across species. Single nucleotide polymorphism (SNP) data from 50 sorghum accessions were used to build an AQP gene-based phylogeny of the haplotypes. Phylogenetic analysis based on single nucleotide polymorphisms of sorghum AQP genes placed the tolerant and sensitive genotypes used for the expression study in distinct groups. Expression analyses suggested that selected AQPs may play a pivotal role in sorghum tolerance to water logging stress. Furthermore experimentation is needed to verify their role and to leverage phylogenetic analyses and AQP expression data to improve water logging tolerance in sorghum.« less

  4. Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data.

    PubMed

    Racle, Julien; de Jonge, Kaat; Baumgaertner, Petra; Speiser, Daniel E; Gfeller, David

    2017-11-13

    Immune cells infiltrating tumors can have important impact on tumor progression and response to therapy. We present an efficient algorithm to simultaneously estimate the fraction of cancer and immune cell types from bulk tumor gene expression data. Our method integrates novel gene expression profiles from each major non-malignant cell type found in tumors, renormalization based on cell-type-specific mRNA content, and the ability to consider uncharacterized and possibly highly variable cell types. Feasibility is demonstrated by validation with flow cytometry, immunohistochemistry and single-cell RNA-Seq analyses of human melanoma and colorectal tumor specimens. Altogether, our work not only improves accuracy but also broadens the scope of absolute cell fraction predictions from tumor gene expression data, and provides a unique novel experimental benchmark for immunogenomics analyses in cancer research (http://epic.gfellerlab.org).

  5. Use of Natural Diversity and Biotechnology to Increase the Quality and Nutritional Content of Tomato and Grape

    PubMed Central

    Gascuel, Quentin; Diretto, Gianfranco; Monforte, Antonio J.; Fortes, Ana M.; Granell, Antonio

    2017-01-01

    Improving fruit quality has become a major goal in plant breeding. Direct approaches to tackling fruit quality traits specifically linked to consumer preferences and environmental friendliness, such as improved flavor, nutraceutical compounds, and sustainability, have slowly been added to a breeder priority list that already includes traits like productivity, efficiency, and, especially, pest and disease control. Breeders already use molecular genetic tools to improve fruit quality although most advances have been made in producer and industrial quality standards. Furthermore, progress has largely been limited to simple agronomic traits easy-to-observe, whereas the vast majority of quality attributes, specifically those relating to flavor and nutrition, are complex and have mostly been neglected. Fortunately, wild germplasm, which is used for resistance against/tolerance of environmental stresses (including pathogens), is still available and harbors significant genetic variation for taste and health-promoting traits. Similarly, heirloom/traditional varieties could be used to identify which genes contribute to flavor and health quality and, at the same time, serve as a good source of the best alleles for organoleptic quality improvement. Grape (Vitis vinifera L.) and tomato (Solanum lycopersicum L.) produce fleshy, berry-type fruits, among the most consumed in the world. Both have undergone important domestication and selection processes, that have dramatically reduced their genetic variability, and strongly standardized fruit traits. Moreover, more and more consumers are asking for sustainable production, incompatible with the wide range of chemical inputs. In the present paper, we review the genetic resources available to tomato/grape breeders, and the recent technological progresses that facilitate the identification of genes/alleles of interest within the natural or generated variability gene pool. These technologies include omics, high-throughput phenotyping/phenomics, and biotech approaches. Our review also covers a range of technologies used to transfer to tomato and grape those alleles considered of interest for fruit quality. These include traditional breeding, TILLING (Targeting Induced Local Lesions in Genomes), genetic engineering, or NPBT (New Plant Breeding Technologies). Altogether, the combined exploitation of genetic variability and innovative biotechnological tools may facilitate breeders to improve fruit quality tacking more into account the consumer standards and the needs to move forward into more sustainable farming practices. PMID:28553296

  6. The Drosophila Translational Control Element (TCE) Is Required for High-Level Transcription of Many Genes That Are Specifically Expressed in Testes

    PubMed Central

    Anderson, Ashley K.; Ohler, Uwe; Wassarman, David A.

    2012-01-01

    To investigate the importance of core promoter elements for tissue-specific transcription of RNA polymerase II genes, we examined testis-specific transcription in Drosophila melanogaster. Bioinformatic analyses of core promoter sequences from 190 genes that are specifically expressed in testes identified a 10 bp A/T-rich motif that is identical to the translational control element (TCE). The TCE functions in the 5′ untranslated region of Mst(3)CGP mRNAs to repress translation, and it also functions in a heterologous gene to regulate transcription. We found that among genes with focused initiation patterns, the TCE is significantly enriched in core promoters of genes that are specifically expressed in testes but not in core promoters of genes that are specifically expressed in other tissues. The TCE is variably located in core promoters and is conserved in melanogaster subgroup species, but conservation dramatically drops in more distant species. In transgenic flies, short (300–400 bp) genomic regions containing a TCE directed testis-specific transcription of a reporter gene. Mutation of the TCE significantly reduced but did not abolish reporter gene transcription indicating that the TCE is important but not essential for transcription activation. Finally, mutation of testis-specific TFIID (tTFIID) subunits significantly reduced the transcription of a subset of endogenous TCE-containing but not TCE-lacking genes, suggesting that tTFIID activity is limited to TCE-containing genes but that tTFIID is not an obligatory regulator of TCE-containing genes. Thus, the TCE is a core promoter element in a subset of genes that are specifically expressed in testes. Furthermore, the TCE regulates transcription in the context of short genomic regions, from variable locations in the core promoter, and both dependently and independently of tTFIID. These findings set the stage for determining the mechanism by which the TCE regulates testis-specific transcription and understanding the dual role of the TCE in translational and transcriptional regulation. PMID:22984601

  7. The Drosophila Translational Control Element (TCE) is required for high-level transcription of many genes that are specifically expressed in testes.

    PubMed

    Katzenberger, Rebeccah J; Rach, Elizabeth A; Anderson, Ashley K; Ohler, Uwe; Wassarman, David A

    2012-01-01

    To investigate the importance of core promoter elements for tissue-specific transcription of RNA polymerase II genes, we examined testis-specific transcription in Drosophila melanogaster. Bioinformatic analyses of core promoter sequences from 190 genes that are specifically expressed in testes identified a 10 bp A/T-rich motif that is identical to the translational control element (TCE). The TCE functions in the 5' untranslated region of Mst(3)CGP mRNAs to repress translation, and it also functions in a heterologous gene to regulate transcription. We found that among genes with focused initiation patterns, the TCE is significantly enriched in core promoters of genes that are specifically expressed in testes but not in core promoters of genes that are specifically expressed in other tissues. The TCE is variably located in core promoters and is conserved in melanogaster subgroup species, but conservation dramatically drops in more distant species. In transgenic flies, short (300-400 bp) genomic regions containing a TCE directed testis-specific transcription of a reporter gene. Mutation of the TCE significantly reduced but did not abolish reporter gene transcription indicating that the TCE is important but not essential for transcription activation. Finally, mutation of testis-specific TFIID (tTFIID) subunits significantly reduced the transcription of a subset of endogenous TCE-containing but not TCE-lacking genes, suggesting that tTFIID activity is limited to TCE-containing genes but that tTFIID is not an obligatory regulator of TCE-containing genes. Thus, the TCE is a core promoter element in a subset of genes that are specifically expressed in testes. Furthermore, the TCE regulates transcription in the context of short genomic regions, from variable locations in the core promoter, and both dependently and independently of tTFIID. These findings set the stage for determining the mechanism by which the TCE regulates testis-specific transcription and understanding the dual role of the TCE in translational and transcriptional regulation.

  8. Biodiversity of mannose-specific adhesion in Lactobacillus plantarum revisited: strain-specific domain composition of the mannose-adhesin.

    PubMed

    Gross, G; Snel, J; Boekhorst, J; Smits, M A; Kleerebezem, M

    2010-03-01

    Recently, we have identified the mannose-specific adhesin encoding gene (msa) of Lactobacillus plantarum. In the current study, structure and function of this potentially probiotic effector gene were further investigated, exploring genetic diversity of msa in L. plantarum in relation to mannose adhesion capacity. The results demonstrate that there is considerable variation in quantitative in vitro mannose adhesion capacity, which is paralleled by msa gene sequence variation. The msa genes of different L. plantarum strains encode proteins with variable domain composition. Construction of L. plantarum 299v mutant strains revealed that the msa gene product is the key-protein for mannose adhesion, also in a strain with high mannose adhering capacity. However, no straightforward correlation between adhesion capacity and domain composition of Msa in L. plantarum could be identified. Nevertheless, differences in Msa sequences in combination with variable genetic background of specific bacterial strains appears to determine mannose adhesion capacity and potentially affects probiotic properties. These findings exemplify the strain-specificity of probiotic characteristics and illustrate the need for careful and molecular selection of new candidate probiotics.

  9. Genetic basis of inter-individual variability in the effects of exercise on the alleviation of lifestyle-related diseases

    PubMed Central

    Mori, Masayuki; Higuchi, Keiichi; Sakurai, Akihiro; Tabara, Yasuharu; Miki, Tetsuro; Nose, Hiroshi

    2009-01-01

    Habitual exercise training, including a high-intensity interval walking programme, improves cardiorespiratory fitness and alleviates lifestyle-related diseases, such as obesity, hypertension and dyslipidaemia. However, the extent of improvement has been shown to differ substantially among individuals for various exercise regimens. A body of literature has demonstrated that gene polymorphisms could account for the inter-individual variability in the improvement of risk factors for lifestyle-related diseases following exercise training. However, the fractions of the variability explained by the polymorphisms are small (∼5%). Also, it is likely that the effects of gene polymorphisms differ with exercise regimens and subject characteristics. These observations suggest the necessity for further studies to exhaustively identify such gene polymorphisms. More importantly, the physiological and molecular genetic mechanisms by which gene polymorphisms interact with exercise to influence the improvements of risk factors for lifestyle-related diseases differentially remain to be clarified. A better understanding of these issues should lead to more effective integration of exercise to optimize the treatment and management of individuals with lifestyle-related diseases. PMID:19736300

  10. Unifying measures of gene function and evolution.

    PubMed

    Wolf, Yuri I; Carmel, Liran; Koonin, Eugene V

    2006-06-22

    Recent genome analyses revealed intriguing correlations between variables characterizing the functioning of a gene, such as expression level (EL), connectivity of genetic and protein-protein interaction networks, and knockout effect, and variables describing gene evolution, such as sequence evolution rate (ER) and propensity for gene loss. Typically, variables within each of these classes are positively correlated, e.g. products of highly expressed genes also have a propensity to be involved in many protein-protein interactions, whereas variables between classes are negatively correlated, e.g. highly expressed genes, on average, evolve slower than weakly expressed genes. Here, we describe principal component (PC) analysis of seven genome-related variables and propose biological interpretations for the first three PCs. The first PC reflects a gene's 'importance', or the 'status' of a gene in the genomic community, with positive contributions from knockout lethality, EL, number of protein-protein interaction partners and the number of paralogues, and negative contributions from sequence ER and gene loss propensity. The next two PCs define a plane that seems to reflect the functional and evolutionary plasticity of a gene. Specifically, PC2 can be interpreted as a gene's 'adaptability' whereby genes with high adaptability readily duplicate, have many genetic interaction partners and tend to be non-essential. PC3 also might reflect the role of a gene in organismal adaptation albeit with a negative rather than a positive contribution of genetic interactions; we provisionally designate this PC 'reactivity'. The interpretation of PC2 and PC3 as measures of a gene's plasticity is compatible with the observation that genes with high values of these PCs tend to be expressed in a condition- or tissue-specific manner. Functional classes of genes substantially vary in status, adaptability and reactivity, with the highest status characteristic of the translation system and cytoskeletal proteins, highest adaptability seen in cellular processes and signalling genes, and top reactivity characteristic of metabolic enzymes.

  11. Methodological requirements for valid tissue-based biomarker studies that can be used in clinical practice.

    PubMed

    True, Lawrence D

    2014-03-01

    Paralleling the growth of ever more cost efficient methods to sequence the whole genome in minute fragments of tissue has been the identification of increasingly numerous molecular abnormalities in cancers--mutations, amplifications, insertions and deletions of genes, and patterns of differential gene expression, i.e., overexpression of growth factors and underexpression of tumor suppressor genes. These abnormalities can be translated into assays to be used in clinical decision making. In general terms, the result of such an assay is subject to a large number of variables regarding the characteristics of the available sample, particularities of the used assay, and the interpretation of the results. This review discusses the effects of these variables on assays of tissue-based biomarkers, classified by macromolecule--DNA, RNA (including micro RNA, messenger RNA, long noncoding RNA, protein, and phosphoprotein). Since the majority of clinically applicable biomarkers are immunohistochemically detectable proteins this review focuses on protein biomarkers. However, the principles outlined are mostly applicable to any other analyte. A variety of preanalytical variables impacts on the results obtained, including analyte stability (which is different for different analytes, i.e., DNA, RNA, or protein), period of warm and of cold ischemia, fixation time, tissue processing, sample storage time, and storage conditions. In addition, assay variables play an important role, including reagent specificity (notably but not uniquely an issue concerning antibodies used in immunohistochemistry), technical components of the assay, quantitation, and assay interpretation. Finally, appropriateness of an assay for clinical application is an important issue. Reference is made to publicly available guidelines to improve on biomarker development in general and requirements for clinical use in particular. Strategic goals are formulated in order to improve on the quality of biomarker reporting, including issues of analyte quality, experimental detail, assay efficiency and precision, and assay appropriateness.

  12. Promoter architecture dictates cell-to-cell variability in gene expression.

    PubMed

    Jones, Daniel L; Brewster, Robert C; Phillips, Rob

    2014-12-19

    Variability in gene expression among genetically identical cells has emerged as a central preoccupation in the study of gene regulation; however, a divide exists between the predictions of molecular models of prokaryotic transcriptional regulation and genome-wide experimental studies suggesting that this variability is indifferent to the underlying regulatory architecture. We constructed a set of promoters in Escherichia coli in which promoter strength, transcription factor binding strength, and transcription factor copy numbers are systematically varied, and used messenger RNA (mRNA) fluorescence in situ hybridization to observe how these changes affected variability in gene expression. Our parameter-free models predicted the observed variability; hence, the molecular details of transcription dictate variability in mRNA expression, and transcriptional noise is specifically tunable and thus represents an evolutionarily accessible phenotypic parameter. Copyright © 2014, American Association for the Advancement of Science.

  13. Improved Sparse Multi-Class SVM and Its Application for Gene Selection in Cancer Classification

    PubMed Central

    Huang, Lingkang; Zhang, Hao Helen; Zeng, Zhao-Bang; Bushel, Pierre R.

    2013-01-01

    Background Microarray techniques provide promising tools for cancer diagnosis using gene expression profiles. However, molecular diagnosis based on high-throughput platforms presents great challenges due to the overwhelming number of variables versus the small sample size and the complex nature of multi-type tumors. Support vector machines (SVMs) have shown superior performance in cancer classification due to their ability to handle high dimensional low sample size data. The multi-class SVM algorithm of Crammer and Singer provides a natural framework for multi-class learning. Despite its effective performance, the procedure utilizes all variables without selection. In this paper, we propose to improve the procedure by imposing shrinkage penalties in learning to enforce solution sparsity. Results The original multi-class SVM of Crammer and Singer is effective for multi-class classification but does not conduct variable selection. We improved the method by introducing soft-thresholding type penalties to incorporate variable selection into multi-class classification for high dimensional data. The new methods were applied to simulated data and two cancer gene expression data sets. The results demonstrate that the new methods can select a small number of genes for building accurate multi-class classification rules. Furthermore, the important genes selected by the methods overlap significantly, suggesting general agreement among different variable selection schemes. Conclusions High accuracy and sparsity make the new methods attractive for cancer diagnostics with gene expression data and defining targets of therapeutic intervention. Availability: The source MATLAB code are available from http://math.arizona.edu/~hzhang/software.html. PMID:23966761

  14. Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data

    PubMed Central

    Racle, Julien; de Jonge, Kaat; Baumgaertner, Petra; Speiser, Daniel E

    2017-01-01

    Immune cells infiltrating tumors can have important impact on tumor progression and response to therapy. We present an efficient algorithm to simultaneously estimate the fraction of cancer and immune cell types from bulk tumor gene expression data. Our method integrates novel gene expression profiles from each major non-malignant cell type found in tumors, renormalization based on cell-type-specific mRNA content, and the ability to consider uncharacterized and possibly highly variable cell types. Feasibility is demonstrated by validation with flow cytometry, immunohistochemistry and single-cell RNA-Seq analyses of human melanoma and colorectal tumor specimens. Altogether, our work not only improves accuracy but also broadens the scope of absolute cell fraction predictions from tumor gene expression data, and provides a unique novel experimental benchmark for immunogenomics analyses in cancer research (http://epic.gfellerlab.org). PMID:29130882

  15. Introduced T cell receptor variable region gene segments recombine in pre-B cells: evidence that B and T cells use a common recombinase.

    PubMed

    Yancopoulos, G D; Blackwell, T K; Suh, H; Hood, L; Alt, F W

    1986-01-31

    We have recently proposed that a common recombinase performs all of the many variable region gene assembly events in B and T cells, and that the specificity of these joining events is mediated by regulating the "accessibility" of the involved gene segments. To test this possibility, we have introduced "accessible" T cell receptor (TCR) variable region gene segments into a pre-B cell line capable of recombining endogenous and transfected immunoglobulin (Ig) variable region gene segments. Although the corresponding "inaccessible" endogenous TCR gene segments do not rearrange in this line or in B cells in general, the introduced TCR gene segments join very frequently and, in fact, closely resemble introduced Ig gene segments in their recombination characteristics. These observations suggest a new role for conventional Ig transcriptional enhancers--recombinational enhancement. Our studies provide insight into additional aspects of the joining mechanism such as N region insertion, aberrant joining, and recombination-recognition sequence requirements for joining.

  16. Cohort-specific imputation of gene expression improves prediction of warfarin dose for African Americans.

    PubMed

    Gottlieb, Assaf; Daneshjou, Roxana; DeGorter, Marianne; Bourgeois, Stephane; Svensson, Peter J; Wadelius, Mia; Deloukas, Panos; Montgomery, Stephen B; Altman, Russ B

    2017-11-24

    Genome-wide association studies are useful for discovering genotype-phenotype associations but are limited because they require large cohorts to identify a signal, which can be population-specific. Mapping genetic variation to genes improves power and allows the effects of both protein-coding variation as well as variation in expression to be combined into "gene level" effects. Previous work has shown that warfarin dose can be predicted using information from genetic variation that affects protein-coding regions. Here, we introduce a method that improves dose prediction by integrating tissue-specific gene expression. In particular, we use drug pathways and expression quantitative trait loci knowledge to impute gene expression-on the assumption that differential expression of key pathway genes may impact dose requirement. We focus on 116 genes from the pharmacokinetic and pharmacodynamic pathways of warfarin within training and validation sets comprising both European and African-descent individuals. We build gene-tissue signatures associated with warfarin dose in a cohort-specific manner and identify a signature of 11 gene-tissue pairs that significantly augments the International Warfarin Pharmacogenetics Consortium dosage-prediction algorithm in both populations. Our results demonstrate that imputed expression can improve dose prediction and bridge population-specific compositions. MATLAB code is available at https://github.com/assafgo/warfarin-cohort.

  17. Familial cancer syndromes and clusters.

    PubMed

    Birch, J M

    1994-07-01

    The study of rare families in which a variety of cancers occur, usually at an early age and with patterns consistent with a common hereditary mechanism, has contributed much to our understanding of the process of carcinogenesis. So far, genes identified as having a role in cancer predisposition in these families have also been important in the histogenesis of sporadic cancers. In the two most clearly defined cancer family syndromes, the Li-Fraumeni syndrome and Lynch syndrome II, the genes involved predispose to diverse but specific constellations of cancers. Genes associated with site-specific familial cancer clusters may also give rise to increased susceptibility to other cancers, and site-specific clusters may represent one end of a spectrum. A consistent feature of familial cancer syndromes is the variable expression within and between families. A challenge for the future will be to determine other factors which may interact with the principal genes involved, giving rise to this variability.

  18. Combinatorial Strategies for Improving Multiple-Stress Resistance in Industrially Relevant Escherichia coli Strains

    PubMed Central

    Herrgård, Markus J.

    2014-01-01

    High-cell-density fermentation for industrial production of chemicals can impose numerous stresses on cells due to high substrate, product, and by-product concentrations; high osmolarity; reactive oxygen species; and elevated temperatures. There is a need to develop platform strains of industrial microorganisms that are more tolerant toward these typical processing conditions. In this study, the growth of six industrially relevant strains of Escherichia coli was characterized under eight stress conditions representative of fed-batch fermentation, and strains W and BL21(DE3) were selected as platforms for transposon (Tn) mutagenesis due to favorable resistance characteristics. Selection experiments, followed by either targeted or genome-wide next-generation-sequencing-based Tn insertion site determination, were performed to identify mutants with improved growth properties under a subset of three stress conditions and two combinations of individual stresses. A subset of the identified loss-of-function mutants were selected for a combinatorial approach, where strains with combinations of two and three gene deletions were systematically constructed and tested for single and multistress resistance. These approaches allowed identification of (i) strain-background-specific stress resistance phenotypes, (ii) novel gene deletion mutants in E. coli that confer single and multistress resistance in a strain-background-dependent manner, and (iii) synergistic effects of multiple gene deletions that confer improved resistance over single deletions. The results of this study underscore the suboptimality and strain-specific variability of the genetic network regulating growth under stressful conditions and suggest that further exploration of the combinatorial gene deletion space in multiple strain backgrounds is needed for optimizing strains for microbial bioprocessing applications. PMID:25085490

  19. Effects of AAV-mediated knockdown of nNOS and GPx-1 gene expression in rat hippocampus after traumatic brain injury.

    PubMed

    Boone, Deborah R; Leek, Jeanna M; Falduto, Michael T; Torres, Karen E O; Sell, Stacy L; Parsley, Margaret A; Cowart, Jeremy C; Uchida, Tatsuo; Micci, Maria-Adelaide; DeWitt, Douglas S; Prough, Donald S; Hellmich, Helen L

    2017-01-01

    Virally mediated RNA interference (RNAi) to knock down injury-induced genes could improve functional outcome after traumatic brain injury (TBI); however, little is known about the consequences of gene knockdown on downstream cell signaling pathways and how RNAi influences neurodegeneration and behavior. Here, we assessed the effects of adeno-associated virus (AAV) siRNA vectors that target two genes with opposing roles in TBI pathogenesis: the allegedly detrimental neuronal nitric oxide synthase (nNOS) and the potentially protective glutathione peroxidase 1 (GPx-1). In rat hippocampal progenitor cells, three siRNAs that target different regions of each gene (nNOS, GPx-1) effectively knocked down gene expression. However, in vivo, in our rat model of fluid percussion brain injury, the consequences of AAV-siRNA were variable. One nNOS siRNA vector significantly reduced the number of degenerating hippocampal neurons and showed a tendency to improve working memory. GPx-1 siRNA treatment did not alter TBI-induced neurodegeneration or working memory deficits. Nevertheless, microarray analysis of laser captured, virus-infected neurons showed that knockdown of nNOS or GPx-1 was specific and had broad effects on downstream genes. Since nNOS knockdown only modestly ameliorated TBI-induced working memory deficits, despite widespread genomic changes, manipulating expression levels of single genes may not be sufficient to alter functional outcome after TBI.

  20. Comprehensive analyses of tissue-specific networks with implications to psychiatric diseases

    PubMed Central

    Lin, Guan Ning; Corominas, Roser; Nam, Hyun-Jun; Urresti, Jorge; Iakoucheva, Lilia M.

    2017-01-01

    Recent advances in genome sequencing and “omics” technologies are opening new opportunities for improving diagnosis and treatment of human diseases. The precision medicine initiative in particular aims at developing individualized treatment options that take into account individual variability in genes and environment of each person. Systems biology approaches that group genes, transcripts and proteins into functionally meaningful networks will play crucial role in the future of personalized medicine. They will allow comparison of healthy and disease-affected tissues and organs from the same individual, as well as between healthy and disease-afflicted individuals. However, the field faces a multitude of challenges ranging from data integration to statistical and combinatorial issues in data analyses. This chapter describes computational approaches developed by us and the others to tackle challenges in tissue-specific network analyses, with the main focus on psychiatric diseases. PMID:28849569

  1. B cell Variable genes have evolved their codon usage to focus the targeted patterns of somatic mutation on the complementarity determining regions

    PubMed Central

    Saini, Jasmine; Hershberg, Uri

    2015-01-01

    The exceptional ability of B cells to diversify through somatic mutation and improve affinity of the repertoire towards the antigens is the cornerstone of adaptive immunity. Somatic mutation is not evenly distributed and exhibits certain micro-sequence specificities. We show here that the combination of somatic mutation targeting and the codon usage in human B cell receptor (BCR) Variable (V) genes create expected patterns of mutation and post mutation changes that are focused on their complementarity determining regions (CDR). T cell V genes are also skewed in targeting mutations but to a lesser extent and are lacking the codon usage bias observed in BCRs. This suggests that the observed skew in T cell receptors is due to their amino acid usage, which is similar to that of BCRs. The mutation targeting and the codon bias allow B cell CDRs to diversify by specifically accumulating nonconservative changes. We counted the distribution of mutations to CDR in 4 different human datasets. In all four cases we found that the number of actual mutations in the CDR correlated significantly with the V gene mutation biases to the CDR predicted by our models. Finally, it appears that the mutation bias in V genes indeed relates to their long-term survival in actual human repertoires. We observed that resting repertoires of B cells overexpressed V genes that were especially biased towards focused mutation and change in the CDR. This bias in V gene usage was somewhat relaxed at the height of the immune response to a vaccine, presumably because of the need for a wider diversity in a primary response. However, older patients did not retain this flexibility and were biased towards using only highly skewed V genes at all stages of their response. PMID:25660968

  2. B cell variable genes have evolved their codon usage to focus the targeted patterns of somatic mutation on the complementarity determining regions.

    PubMed

    Saini, Jasmine; Hershberg, Uri

    2015-05-01

    The exceptional ability of B cells to diversify through somatic mutation and improve affinity of the repertoire toward the antigens is the cornerstone of adaptive immunity. Somatic mutation is not evenly distributed and exhibits certain micro-sequence specificities. We show here that the combination of somatic mutation targeting and the codon usage in human B cell receptor (BCR) Variable (V) genes create expected patterns of mutation and post mutation changes that are focused on their complementarity determining regions (CDR). T cell V genes are also skewed in targeting mutations but to a lesser extent and are lacking the codon usage bias observed in BCRs. This suggests that the observed skew in T cell receptors is due to their amino acid usage, which is similar to that of BCRs. The mutation targeting and the codon bias allow B cell CDRs to diversify by specifically accumulating nonconservative changes. We counted the distribution of mutations to CDR in 4 different human datasets. In all four cases we found that the number of actual mutations in the CDR correlated significantly with the V gene mutation biases to the CDR predicted by our models. Finally, it appears that the mutation bias in V genes indeed relates to their long-term survival in actual human repertoires. We observed that resting repertoires of B cells overexpressed V genes that were especially biased toward focused mutation and change in the CDR. This bias in V gene usage was somewhat relaxed at the height of the immune response to a vaccine, presumably because of the need for a wider diversity in a primary response. However, older patients did not retain this flexibility and were biased toward using only highly skewed V genes at all stages of their response. Copyright © 2015 Elsevier Ltd. All rights reserved.

  3. Incorporating biological information in sparse principal component analysis with application to genomic data.

    PubMed

    Li, Ziyi; Safo, Sandra E; Long, Qi

    2017-07-11

    Sparse principal component analysis (PCA) is a popular tool for dimensionality reduction, pattern recognition, and visualization of high dimensional data. It has been recognized that complex biological mechanisms occur through concerted relationships of multiple genes working in networks that are often represented by graphs. Recent work has shown that incorporating such biological information improves feature selection and prediction performance in regression analysis, but there has been limited work on extending this approach to PCA. In this article, we propose two new sparse PCA methods called Fused and Grouped sparse PCA that enable incorporation of prior biological information in variable selection. Our simulation studies suggest that, compared to existing sparse PCA methods, the proposed methods achieve higher sensitivity and specificity when the graph structure is correctly specified, and are fairly robust to misspecified graph structures. Application to a glioblastoma gene expression dataset identified pathways that are suggested in the literature to be related with glioblastoma. The proposed sparse PCA methods Fused and Grouped sparse PCA can effectively incorporate prior biological information in variable selection, leading to improved feature selection and more interpretable principal component loadings and potentially providing insights on molecular underpinnings of complex diseases.

  4. Autosomal Dominant Cataract: Intrafamilial Phenotypic Variability, Interocular Asymmetry, and Variable Progression in Four Chilean Families

    PubMed Central

    Shafie, Suraiya M.; Barria von-Bischhoffshausen, Fernando R.; Bateman, J. Bronwyn

    2006-01-01

    PURPOSE To document intrafamilial and interocular phenotypic variability of autosomal dominant cataract (ADC). DESIGN Prospective observational case series. METHODS We performed ophthalmologic examination in four Chilean ADC families. RESULTS The families exhibited variability with respect to morphology, location with the lens, color and density of cataracts among affected members. We documented asymmetry between eyes in the morphology, location within the lens, color and density of cataracts, and a variable rate of progression. CONCLUSIONS The cataracts in these families exhibit wide intrafamilial and interocular phenotypic variability, supporting the premise that the mutated genes are expressed differentially in individuals and between eyes; other genes or environmental factors may be the bases for this variability. Marked progression among some family members underscores the variable clinical course of a common mutation within a family. Like retinitis pigmentosa, classification of ADC will be most useful if based on the gene and specific mutation. PMID:16564818

  5. Improved prediction of biochemical recurrence after radical prostatectomy by genetic polymorphisms.

    PubMed

    Morote, Juan; Del Amo, Jokin; Borque, Angel; Ars, Elisabet; Hernández, Carlos; Herranz, Felipe; Arruza, Antonio; Llarena, Roberto; Planas, Jacques; Viso, María J; Palou, Joan; Raventós, Carles X; Tejedor, Diego; Artieda, Marta; Simón, Laureano; Martínez, Antonio; Rioja, Luis A

    2010-08-01

    Single nucleotide polymorphisms are inherited genetic variations that can predispose or protect individuals against clinical events. We hypothesized that single nucleotide polymorphism profiling may improve the prediction of biochemical recurrence after radical prostatectomy. We performed a retrospective, multi-institutional study of 703 patients treated with radical prostatectomy for clinically localized prostate cancer who had at least 5 years of followup after surgery. All patients were genotyped for 83 prostate cancer related single nucleotide polymorphisms using a low density oligonucleotide microarray. Baseline clinicopathological variables and single nucleotide polymorphisms were analyzed to predict biochemical recurrence within 5 years using stepwise logistic regression. Discrimination was measured by ROC curve AUC, specificity, sensitivity, predictive values, net reclassification improvement and integrated discrimination index. The overall biochemical recurrence rate was 35%. The model with the best fit combined 8 covariates, including the 5 clinicopathological variables prostate specific antigen, Gleason score, pathological stage, lymph node involvement and margin status, and 3 single nucleotide polymorphisms at the KLK2, SULT1A1 and TLR4 genes. Model predictive power was defined by 80% positive predictive value, 74% negative predictive value and an AUC of 0.78. The model based on clinicopathological variables plus single nucleotide polymorphisms showed significant improvement over the model without single nucleotide polymorphisms, as indicated by 23.3% net reclassification improvement (p = 0.003), integrated discrimination index (p <0.001) and likelihood ratio test (p <0.001). Internal validation proved model robustness (bootstrap corrected AUC 0.78, range 0.74 to 0.82). The calibration plot showed close agreement between biochemical recurrence observed and predicted probabilities. Predicting biochemical recurrence after radical prostatectomy based on clinicopathological data can be significantly improved by including patient genetic information. Copyright (c) 2010 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.

  6. Roles of the plasticity regions of Helicobacter pylori in gastroduodenal pathogenesis.

    PubMed

    Yamaoka, Yoshio

    2008-05-01

    Putative virulence genes of Helicobacter pylori are generally classified into three categories: strain-specific genes, phase-variable genes and genes with variable structures/genotypes. Among these, there has recently been considerable interest in strain-specific genes found outside of the cag pathogenicity island, especially genes in the plasticity regions. Nearly half of the strain-specific genes of H. pylori are located in the plasticity regions in strains 26695 and J99. Strain HPAG1, however, seems to lack a typical plasticity region; instead it has 43 HPAG1-specific genes which are either undetectable or incompletely represented in the genomes of strains 26695 and J99. Recent studies showed that certain genes or combination of genes in this region may play important roles in the pathogenesis of H. pylori-associated gastroduodenal diseases. Most previous studies have focused on the plasticity region in strain J99 (jhp0914-jhp0961) and the jhp0947 gene and the duodenal ulcer promoting (dupA) gene are good candidate markers for gastroduodenal diseases although there are some paradoxical findings. The jhp0947 gene is reported to be associated with an increased risk of both duodenal ulcers and gastric cancers, whereas the dupA gene, which encompasses jhp0917 and jhp0918, is reported to be associated with an increased risk of duodenal ulcers and protection against gastric cancers. In addition, recent studies showed that approximately 10-30 % of clinical isolates possess a 16.3 kb type IV secretion apparatus (tfs3) in the plasticity region. Studies on the plasticity region have only just begun, and further investigation is necessary to elucidate the roles of genes in this region in gastroduodenal pathogenesis.

  7. Roles of the plasticity regions of Helicobacter pylori in gastroduodenal pathogenesis

    PubMed Central

    Yamaoka, Yoshio

    2010-01-01

    Putative virulence genes of Helicobacter pylori are generally classified into three categories: strain-specific genes, phase-variable genes and genes with variable structures/genotypes. Among these, there has recently been considerable interest in strain-specific genes found outside of the cag pathogenicity island, especially genes in the plasticity regions. Nearly half of the strain-specific genes of H. pylori are located in the plasticity regions in strains 26695 and J99. Strain HPAG1, however, seems to lack a typical plasticity region; instead it has 43 HPAG1-specific genes which are either undetectable or incompletely represented in the genomes of strains 26695 and J99. Recent studies showed that certain genes or combination of genes in this region may play important roles in the pathogenesis of H. pylori-associated gastroduodenal diseases. Most previous studies have focused on the plasticity region in strain J99 (jhp0914–jhp0961) and the jhp0947 gene and the duodenal ulcer promoting (dupA) gene are good candidate markers for gastroduodenal diseases although there are some paradoxical findings. The jhp0947 gene is reported to be associated with an increased risk of both duodenal ulcers and gastric cancers, whereas the dupA gene, which encompasses jhp0917 and jhp0918, is reported to be associated with an increased risk of duodenal ulcers and protection against gastric cancers. In addition, recent studies showed that approximately 10–30% of clinical isolates possess a 16.3 kb type IV secretion apparatus (tfs3) in the plasticity region. Studies on the plasticity region have only just begun, and further investigation is necessary to elucidate the roles of genes in this region in gastroduodenal pathogenesis. PMID:18436586

  8. Gene-Environment Interplay in Physical, Psychological, and Cognitive Domains in Mid to Late Adulthood: Is APOE a Variability Gene?

    PubMed

    Reynolds, Chandra A; Gatz, Margaret; Christensen, Kaare; Christiansen, Lene; Dahl Aslan, Anna K; Kaprio, Jaakko; Korhonen, Tellervo; Kremen, William S; Krueger, Robert; McGue, Matt; Neiderhiser, Jenae M; Pedersen, Nancy L

    2016-01-01

    Despite emerging interest in gene-environment interaction (GxE) effects, there is a dearth of studies evaluating its potential relevance apart from specific hypothesized environments and biometrical variance trends. Using a monozygotic within-pair approach, we evaluated evidence of G×E for body mass index (BMI), depressive symptoms, and cognition (verbal, spatial, attention, working memory, perceptual speed) in twin studies from four countries. We also evaluated whether APOE is a 'variability gene' across these measures and whether it partly represents the 'G' in G×E effects. In all three domains, G×E effects were pervasive across country and gender, with small-to-moderate effects. Age-cohort trends were generally stable for BMI and depressive symptoms; however, they were variable-with both increasing and decreasing age-cohort trends-for different cognitive measures. Results also suggested that APOE may represent a 'variability gene' for depressive symptoms and spatial reasoning, but not for BMI or other cognitive measures. Hence, additional genes are salient beyond APOE.

  9. Gene Circuit Analysis of the Terminal Gap Gene huckebein

    PubMed Central

    Ashyraliyev, Maksat; Siggens, Ken; Janssens, Hilde; Blom, Joke; Akam, Michael; Jaeger, Johannes

    2009-01-01

    The early embryo of Drosophila melanogaster provides a powerful model system to study the role of genes in pattern formation. The gap gene network constitutes the first zygotic regulatory tier in the hierarchy of the segmentation genes involved in specifying the position of body segments. Here, we use an integrative, systems-level approach to investigate the regulatory effect of the terminal gap gene huckebein (hkb) on gap gene expression. We present quantitative expression data for the Hkb protein, which enable us to include hkb in gap gene circuit models. Gap gene circuits are mathematical models of gene networks used as computational tools to extract regulatory information from spatial expression data. This is achieved by fitting the model to gap gene expression patterns, in order to obtain estimates for regulatory parameters which predict a specific network topology. We show how considering variability in the data combined with analysis of parameter determinability significantly improves the biological relevance and consistency of the approach. Our models are in agreement with earlier results, which they extend in two important respects: First, we show that Hkb is involved in the regulation of the posterior hunchback (hb) domain, but does not have any other essential function. Specifically, Hkb is required for the anterior shift in the posterior border of this domain, which is now reproduced correctly in our models. Second, gap gene circuits presented here are able to reproduce mutants of terminal gap genes, while previously published models were unable to reproduce any null mutants correctly. As a consequence, our models now capture the expression dynamics of all posterior gap genes and some variational properties of the system correctly. This is an important step towards a better, quantitative understanding of the developmental and evolutionary dynamics of the gap gene network. PMID:19876378

  10. Gene circuit analysis of the terminal gap gene huckebein.

    PubMed

    Ashyraliyev, Maksat; Siggens, Ken; Janssens, Hilde; Blom, Joke; Akam, Michael; Jaeger, Johannes

    2009-10-01

    The early embryo of Drosophila melanogaster provides a powerful model system to study the role of genes in pattern formation. The gap gene network constitutes the first zygotic regulatory tier in the hierarchy of the segmentation genes involved in specifying the position of body segments. Here, we use an integrative, systems-level approach to investigate the regulatory effect of the terminal gap gene huckebein (hkb) on gap gene expression. We present quantitative expression data for the Hkb protein, which enable us to include hkb in gap gene circuit models. Gap gene circuits are mathematical models of gene networks used as computational tools to extract regulatory information from spatial expression data. This is achieved by fitting the model to gap gene expression patterns, in order to obtain estimates for regulatory parameters which predict a specific network topology. We show how considering variability in the data combined with analysis of parameter determinability significantly improves the biological relevance and consistency of the approach. Our models are in agreement with earlier results, which they extend in two important respects: First, we show that Hkb is involved in the regulation of the posterior hunchback (hb) domain, but does not have any other essential function. Specifically, Hkb is required for the anterior shift in the posterior border of this domain, which is now reproduced correctly in our models. Second, gap gene circuits presented here are able to reproduce mutants of terminal gap genes, while previously published models were unable to reproduce any null mutants correctly. As a consequence, our models now capture the expression dynamics of all posterior gap genes and some variational properties of the system correctly. This is an important step towards a better, quantitative understanding of the developmental and evolutionary dynamics of the gap gene network.

  11. Gene Expression Signatures Based on Variability can Robustly Predict Tumor Progression and Prognosis

    PubMed Central

    Dinalankara, Wikum; Bravo, Héctor Corrada

    2015-01-01

    Gene expression signatures are commonly used to create cancer prognosis and diagnosis methods, yet only a small number of them are successfully deployed in the clinic since many fail to replicate performance on subsequent validation. A primary reason for this lack of reproducibility is the fact that these signatures attempt to model the highly variable and unstable genomic behavior of cancer. Our group recently introduced gene expression anti-profiles as a robust methodology to derive gene expression signatures based on the observation that while gene expression measurements are highly heterogeneous across tumors of a specific cancer type relative to the normal tissue, their degree of deviation from normal tissue expression in specific genes involved in tissue differentiation is a stable tumor mark that is reproducible across experiments and cancer types. Here we show that constructing gene expression signatures based on variability and the anti-profile approach yields classifiers capable of successfully distinguishing benign growths from cancerous growths based on deviation from normal expression. We then show that this same approach generates stable and reproducible signatures that predict probability of relapse and survival based on tumor gene expression. These results suggest that using the anti-profile framework for the discovery of genomic signatures is an avenue leading to the development of reproducible signatures suitable for adoption in clinical settings. PMID:26078586

  12. Lessons learned: Optimization of a murine small bowel resection model

    PubMed Central

    Taylor, Janice A.; Martin, Colin A.; Nair, Rajalakshmi; Guo, Jun; Erwin, Christopher R.; Warner, Brad W.

    2008-01-01

    Background/Purpose Central to the use of murine models of disease is the ability to derive reproducible data. The purpose of this study was to determine factors contributing to variability in our murine model of small bowel resection (SBR). Methods Male C57Bl/6 mice were randomized to sham or 50% SBR. The effect of housing type (pathogen-free versus standard housing), nutrition (reconstituted powder versus tube feeding formulation), and correlates of intestinal morphology with gene expression changes were investigated Multiple linear regression modeling or one-way ANOVA was used for data analysis. Results Pathogen-free mice had significantly shorter ileal villi at baseline and demonstrated greater villus growth after SBR compared to mice housed in standard rooms. Food type did not affect adaptation. Gene expression changes were more consistent and significant in isolated crypt cells that demonstrated adaptive growth when compared with crypts that did not deepen after SBR. Conclusion Maintenance of mice in pathogen-free conditions and restricting gene expression analysis to individual animals exhibiting morphologic adaptation enhances sensitivity and specificity of data derived from this model. These refinements will minimize experimental variability and lead to improved understanding of the complex process of intestinal adaptation. PMID:18558176

  13. Characterization of Bois noir isolates by restriction fragment length polymorphism of a Stolbur-specific putative membrane protein gene.

    PubMed

    Pacifico, D; Alma, A; Bagnoli, B; Foissac, X; Pasquini, G; Tessitori, M; Marzachì, C

    2009-06-01

    Bois noir phytoplasma (BNp), widespread in wine-producing areas of Europe and endemic in France and Italy, is classified in the 16SrXII-A subgroup, whose members are referred to as Stolbur phytoplasmas. The 16S rDNA gene of Stolbur phytoplasma shows low variability, and few non-ribosomal genes are available as markers to assess variation among isolates. We used the Stolbur-specific stol-1H10 gene, encoding a putative membrane-exposed protein, to investigate genetic diversity of French and Italian BNp isolates from plants and insects. Amplification of stol-1H10 from infected grapevines, weeds, and Hyalesthes obsoletus produced fragments of three sizes, and restriction fragment length polymorphism analysis divided these amplicons further into 12 profiles (V1 to V12). French BNp isolates were more variable than Italian ones, and different profiles were present in infected grapevines from France and Italy. Isolate V3, most abundant among Italian affected grapes but present among French ones, was found in one Urtica dioica sample and in all H. obsoletus collected on this species. Four Italian-specific profiles were represented among infected Convolvulus arvensis, the most frequent of which (V12) was also detected in H. obsoletus collected on this species. Most of the variability in the stol-1H10 sequence was associated with type II on the tuf gene.

  14. Quantifying Intrinsic and Extrinsic Variability in Stochastic Gene Expression Models

    PubMed Central

    Singh, Abhyudai; Soltani, Mohammad

    2013-01-01

    Genetically identical cell populations exhibit considerable intercellular variation in the level of a given protein or mRNA. Both intrinsic and extrinsic sources of noise drive this variability in gene expression. More specifically, extrinsic noise is the expression variability that arises from cell-to-cell differences in cell-specific factors such as enzyme levels, cell size and cell cycle stage. In contrast, intrinsic noise is the expression variability that is not accounted for by extrinsic noise, and typically arises from the inherent stochastic nature of biochemical processes. Two-color reporter experiments are employed to decompose expression variability into its intrinsic and extrinsic noise components. Analytical formulas for intrinsic and extrinsic noise are derived for a class of stochastic gene expression models, where variations in cell-specific factors cause fluctuations in model parameters, in particular, transcription and/or translation rate fluctuations. Assuming mRNA production occurs in random bursts, transcription rate is represented by either the burst frequency (how often the bursts occur) or the burst size (number of mRNAs produced in each burst). Our analysis shows that fluctuations in the transcription burst frequency enhance extrinsic noise but do not affect the intrinsic noise. On the contrary, fluctuations in the transcription burst size or mRNA translation rate dramatically increase both intrinsic and extrinsic noise components. Interestingly, simultaneous fluctuations in transcription and translation rates arising from randomness in ATP abundance can decrease intrinsic noise measured in a two-color reporter assay. Finally, we discuss how these formulas can be combined with single-cell gene expression data from two-color reporter experiments for estimating model parameters. PMID:24391934

  15. Quantifying intrinsic and extrinsic variability in stochastic gene expression models.

    PubMed

    Singh, Abhyudai; Soltani, Mohammad

    2013-01-01

    Genetically identical cell populations exhibit considerable intercellular variation in the level of a given protein or mRNA. Both intrinsic and extrinsic sources of noise drive this variability in gene expression. More specifically, extrinsic noise is the expression variability that arises from cell-to-cell differences in cell-specific factors such as enzyme levels, cell size and cell cycle stage. In contrast, intrinsic noise is the expression variability that is not accounted for by extrinsic noise, and typically arises from the inherent stochastic nature of biochemical processes. Two-color reporter experiments are employed to decompose expression variability into its intrinsic and extrinsic noise components. Analytical formulas for intrinsic and extrinsic noise are derived for a class of stochastic gene expression models, where variations in cell-specific factors cause fluctuations in model parameters, in particular, transcription and/or translation rate fluctuations. Assuming mRNA production occurs in random bursts, transcription rate is represented by either the burst frequency (how often the bursts occur) or the burst size (number of mRNAs produced in each burst). Our analysis shows that fluctuations in the transcription burst frequency enhance extrinsic noise but do not affect the intrinsic noise. On the contrary, fluctuations in the transcription burst size or mRNA translation rate dramatically increase both intrinsic and extrinsic noise components. Interestingly, simultaneous fluctuations in transcription and translation rates arising from randomness in ATP abundance can decrease intrinsic noise measured in a two-color reporter assay. Finally, we discuss how these formulas can be combined with single-cell gene expression data from two-color reporter experiments for estimating model parameters.

  16. Optimization of Saanen sperm genes amplification: evaluation of standardized protocols in genetically uncharacterized rural goats reared under a subtropical environment.

    PubMed

    Barbour, Elie K; Saade, Maya F; Sleiman, Fawwak T; Hamadeh, Shady K; Mouneimne, Youssef; Kassaifi, Zeina; Kayali, Ghazi; Harakeh, Steve; Jaber, Lina S; Shaib, Houssam A

    2012-10-01

    The purpose of this research is to optimize quantitatively the amplification of specific sperm genes in reference genomically characterized Saanen goat and to evaluate the standardized protocols applicability on sperms of uncharacterized genome of rural goats reared under subtropical environment for inclusion in future selection programs. The optimization of the protocols in Saanen sperms included three production genes (growth hormone (GH) exons 2, 3, and 4, αS1-casein (CSN1S1), and α-lactalbumin) and two health genes (MHC class II DRB and prion (PrP)). The optimization was based on varying the primers concentrations and the inclusion of a PCR cosolvent (Triton X). The impact of the studied variables on statistically significant increase in the yield of amplicons was noticed in four out of five (80%) optimized protocols, namely in those related to GH, CSN1S1, α-lactalbumin, and PrP genes (P < 0.05). There was no significant difference in the yield of amplicons related to MHC class II DRB gene, regardless of the variables used (P > 0.05). The applicability of the optimized protocols of Saanen sperm genes on amplification of uncharacterized rural goat sperms revealed a 100% success in tested individuals for amplification of GH, CSN1S1, α-lactalbumin, and MHC class II DRB genes and a 75% success for the PrP gene. The significant success in applicability of the Saanen quantitatively optimized protocols to other uncharacterized genome of rural goats allows for their inclusion in future selection, targeting the sustainability of this farming system in a subtropical environment and the improvement of the farmers livelihood.

  17. Inspection of the grapevine BURP superfamily highlights an expansion of RD22 genes with distinctive expression features in berry development and ABA-mediated stress responses.

    PubMed

    Matus, José Tomás; Aquea, Felipe; Espinoza, Carmen; Vega, Andrea; Cavallini, Erika; Dal Santo, Silvia; Cañón, Paola; Rodríguez-Hoces de la Guardia, Amparo; Serrano, Jennifer; Tornielli, Giovanni Battista; Arce-Johnson, Patricio

    2014-01-01

    The RESPONSIVE TO DEHYDRATION 22 (RD22) gene is a molecular link between abscisic acid (ABA) signalling and abiotic stress responses. Its expression has been used as a reliable ABA early response marker. In Arabidopsis, the single copy RD22 gene possesses a BURP domain also located at the C-terminus of USP embryonic proteins and the beta subunit of polygalacturonases. In grapevine, a RD22 gene has been identified but putative paralogs are also found in the grape genome, possibly forming a large RD22 family in this species. In this work, we searched for annotations containing BURP domains in the Vitis vinifera genome. Nineteen proteins were defined by a comparative analysis between the two genome predictions and RNA-Seq data. These sequences were compared to other plant BURPs identified in previous genome surveys allowing us to reconceive group classifications based on phylogenetic relationships and protein motif occurrence. We observed a lineage-specific evolution of the RD22 family, with the biggest expansion in grapevine and poplar. In contrast, rice, sorghum and maize presented highly expanded monocot-specific groups. The Vitis RD22 group may have expanded from segmental duplications as most of its members are confined to a region in chromosome 4. The inspection of transcriptomic data revealed variable expression of BURP genes in vegetative and reproductive organs. Many genes were induced in specific tissues or by abiotic and biotic stresses. Three RD22 genes were further studied showing that they responded oppositely to ABA and to stress conditions. Our results show that the inclusion of RNA-Seq data is essential while describing gene families and improving gene annotations. Robust phylogenetic analyses including all BURP members from other sequenced species helped us redefine previous relationships that were erroneously established. This work provides additional evidence for RD22 genes serving as marker genes for different organs or stresses in grapevine.

  18. Coalitional game theory as a promising approach to identify candidate autism genes.

    PubMed

    Gupta, Anika; Sun, Min Woo; Paskov, Kelley Marie; Stockham, Nate Tyler; Jung, Jae-Yoon; Wall, Dennis Paul

    2018-01-01

    Despite mounting evidence for the strong role of genetics in the phenotypic manifestation of Autism Spectrum Disorder (ASD), the specific genes responsible for the variable forms of ASD remain undefined. ASD may be best explained by a combinatorial genetic model with varying epistatic interactions across many small effect mutations. Coalitional or cooperative game theory is a technique that studies the combined effects of groups of players, known as coalitions, seeking to identify players who tend to improve the performance--the relationship to a specific disease phenotype--of any coalition they join. This method has been previously shown to boost biologically informative signal in gene expression data but to-date has not been applied to the search for cooperative mutations among putative ASD genes. We describe our approach to highlight genes relevant to ASD using coalitional game theory on alteration data of 1,965 fully sequenced genomes from 756 multiplex families. Alterations were encoded into binary matrices for ASD (case) and unaffected (control) samples, indicating likely gene-disrupting, inherited mutations in altered genes. To determine individual gene contributions given an ASD phenotype, a "player" metric, referred to as the Shapley value, was calculated for each gene in the case and control cohorts. Sixty seven genes were found to have significantly elevated player scores and likely represent significant contributors to the genetic coordination underlying ASD. Using network and cross-study analysis, we found that these genes are involved in biological pathways known to be affected in the autism cases and that a subset directly interact with several genes known to have strong associations to autism. These findings suggest that coalitional game theory can be applied to large-scale genomic data to identify hidden yet influential players in complex polygenic disorders such as autism.

  19. Cri du Chat syndrome

    PubMed Central

    Cerruti Mainardi, Paola

    2006-01-01

    The Cri du Chat syndrome (CdCS) is a genetic disease resulting from a deletion of variable size occurring on the short arm of chromosome 5 (5p-). The incidence ranges from 1:15,000 to 1:50,000 live-born infants. The main clinical features are a high-pitched monochromatic cry, microcephaly, broad nasal bridge, epicanthal folds, micrognathia, abnormal dermatoglyphics, and severe psychomotor and mental retardation. Malformations, although not very frequent, may be present: cardiac, neurological and renal abnormalities, preauricular tags, syndactyly, hypospadias, and cryptorchidism. Molecular cytogenetic analysis has allowed a cytogenetic and phenotypic map of 5p to be defined, even if results from the studies reported up to now are not completely in agreement. Genotype-phenotype correlation studies showed a clinical and cytogenetic variability. The identification of phenotypic subsets associated with a specific size and type of deletion is of diagnostic and prognostic relevance. Specific growth and psychomotor development charts have been established. Two genes, Semaphorin F (SEMAF) and δ-catenin (CTNND2), which have been mapped to the "critical regions", are potentially involved in cerebral development and their deletion may be associated with mental retardation in CdCS patients. Deletion of the telomerase reverse transcriptase (hTERT) gene, localised to 5p15.33, could contribute to the phenotypic changes in CdCS. The critical regions were recently refined by using array comparative genomic hybridisation. The cat-like cry critical region was further narrowed using quantitative polymerase chain reaction (PCR) and three candidate genes were characterised in this region. The diagnosis is based on typical clinical manifestations. Karyotype analysis and, in doubtful cases, FISH analysis will confirm the diagnosis. There is no specific therapy for CdCS but early rehabilitative and educational interventions improve the prognosis and considerable progress has been made in the social adjustment of CdCS patients. PMID:16953888

  20. Identification of Genes Involved in Breast Cancer Metastasis by Integrating Protein-Protein Interaction Information with Expression Data.

    PubMed

    Tian, Xin; Xin, Mingyuan; Luo, Jian; Liu, Mingyao; Jiang, Zhenran

    2017-02-01

    The selection of relevant genes for breast cancer metastasis is critical for the treatment and prognosis of cancer patients. Although much effort has been devoted to the gene selection procedures by use of different statistical analysis methods or computational techniques, the interpretation of the variables in the resulting survival models has been limited so far. This article proposes a new Random Forest (RF)-based algorithm to identify important variables highly related with breast cancer metastasis, which is based on the important scores of two variable selection algorithms, including the mean decrease Gini (MDG) criteria of Random Forest and the GeneRank algorithm with protein-protein interaction (PPI) information. The new gene selection algorithm can be called PPIRF. The improved prediction accuracy fully illustrated the reliability and high interpretability of gene list selected by the PPIRF approach.

  1. Unveiling the pan-genome of the SXT/R391 family of ICEs: molecular characterisation of new variable regions of SXT/R391-like ICEs detected in Pseudoalteromonas sp. and Vibrio scophthalmi.

    PubMed

    Rodríguez-Blanco, Arturo; Lemos, Manuel L; Osorio, Carlos R

    2016-08-01

    Integrating conjugative elements (ICEs) of the SXT/R391 family have been identified in fish-isolated bacterial strains collected from marine aquaculture environments of the northwestern Iberian Peninsula. Here we analysed the variable regions of two ICEs, one preliminarily characterised in a previous study (ICEVscSpa3) and one newly identified (ICEPspSpa1). Bacterial strains harboring these ICEs were phylogenetically assigned to Vibrio scophthalmi and Pseudoalteromonas sp., thus constituting the first evidence of SXT/R391-like ICEs in the genus Pseudoalteromonas to date. Variable DNA regions, which confer element-specific properties to ICEs of this family, were characterised. Interestingly, the two ICEs contained 29 genes not found in variable DNA insertions of previously described ICEs. Most notably, variable gene content for ICEVscSpa3 showed similarity to genes potentially involved in housekeeping functions of replication, nucleotide metabolism and transcription. For these genes, closest homologues were found clustered in the genome of Pseudomonas psychrotolerans L19, suggesting a transfer as a block to ICEVscSpa3. Genes encoding antibiotic resistance, restriction modification systems and toxin/antitoxin systems were absent from hotspots of ICEVscSpa3. In contrast, the variable gene content of ICEPspSpa1 included genes involved in restriction/modification functions in two different hotspots and genes related to ICE maintenance. The present study unveils a relatively large number of novel genes in SXT/R391-ICEs, and demonstrates the major role of ICE elements as contributors to horizontal gene transfer.

  2. Site-Specific Fat-1 Knock-In Enables Significant Decrease of n-6PUFAs/n-3PUFAs Ratio in Pigs

    PubMed Central

    Li, Mengjing; Ouyang, Hongsheng; Yuan, Hongming; Li, Jianing; Xie, Zicong; Wang, Kankan; Yu, Tingting; Liu, Minghao; Chen, Xue; Tang, Xiaochun; Jiao, Huping; Pang, Daxin

    2018-01-01

    The fat-1 gene from Caenorhabditis elegans encodes a fatty acid desaturase which was widely studied due to its beneficial function of converting n-6 polyunsaturated fatty acids (n-6PUFAs) to n-3 polyunsaturated fatty acids (n-3PUFAs). To date, many fat-1 transgenic animals have been generated to study disease pathogenesis or improve meat quality. However, all of them were generated using a random integration method with variable transgene expression levels and the introduction of selectable marker genes often raise biosafety concern. To this end, we aimed to generate marker-free fat-1 transgenic pigs in a site-specific manner. The Rosa26 locus, first found in mouse embryonic stem cells, has become one of the most common sites for inserting transgenes due to its safe and ubiquitous expression. In our study, the fat-1 gene was inserted into porcine Rosa 26 (pRosa26) locus via Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/CRISPR-associated 9 (Cas9) system. The Southern blot analysis of our knock-in pigs indicated a single copy of the fat-1 gene at the pRosa26 locus. Furthermore, this single-copy fat-1 gene supported satisfactory expression in a variety of tissues in F1 generation pigs. Importantly, the gas chromatography analysis indicated that these fat-1 knock-in pigs exhibited a significant increase in the level of n-3PUFAs, leading to an obvious decrease in the n-6PUFAs/n-3PUFAs ratio from 9.36 to 2.12 (***P < 0.0001). Altogether, our fat-1 knock-in pigs hold great promise for improving the nutritional value of pork and serving as an animal model to investigate therapeutic effects of n-3PUFAs on various diseases. PMID:29563188

  3. Barcoding and species recognition of opportunistic pathogens in Ochroconis and Verruconis.

    PubMed

    Samerpitak, Kittipan; Gerrits van den Ende, Bert H G; Stielow, J Benjamin; Menken, Steph B J; de Hoog, G Sybren

    2016-02-01

    The genera Ochroconis and Verruconis (Sympoventuriaceae, Venturiales) have remarkably high molecular diversity despite relatively high degrees of phenotypic similarity. Tree topologies, inter-specific and intra-specific heterogeneities, barcoding gaps and reciprocal monophyly of all currently known species were analyzed. It was concluded that all currently used genes viz. SSU, ITS, LSU, ACT1, BT2, and TEF1 were unable to reach all 'gold standard' criteria of barcoding markers. They could nevertheless be used for reasonably reliable identification of species, because the markers, although variable, were associated with large inter-specific heterogeneity. Of the coding protein-genes, ACT1 revealed highest potentiality as barcoding marker in mostly all parts of the investigated sequence. SSU, LSU, ITS, and ACT1 yielded consistent monophyly in all investigated species, but only SSU and LSU generated clear barcoding gaps. For phylogeny, LSU was an informative marker, suitable to reconstruct gene-trees showing correct phylogenetic relationships. Cryptic species were revealed especially in complexes with very high intra-specific variability. When all these complexes will be taxonomically resolved, ACT1 will probably appear to be the most reliable barcoding gene for Ochroconis and Verruconis. Copyright © 2015 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.

  4. Trichostatin A specifically improves the aberrant expression of transcription factor genes in embryos produced by somatic cell nuclear transfer

    PubMed Central

    Inoue, Kimiko; Oikawa, Mami; Kamimura, Satoshi; Ogonuki, Narumi; Nakamura, Toshinobu; Nakano, Toru; Abe, Kuniya; Ogura, Atsuo

    2015-01-01

    Although mammalian cloning by somatic cell nuclear transfer (SCNT) has been established in various species, the low developmental efficiency has hampered its practical applications. Treatment of SCNT-derived embryos with histone deacetylase (HDAC) inhibitors can improve their development, but the underlying mechanism is still unclear. To address this question, we analysed gene expression profiles of SCNT-derived 2-cell mouse embryos treated with trichostatin A (TSA), a potent HDAC inhibitor that is best used for mouse cloning. Unexpectedly, TSA had no effect on the numbers of aberrantly expressed genes or the overall gene expression pattern in the embryos. However, in-depth investigation by gene ontology and functional analyses revealed that TSA treatment specifically improved the expression of a small subset of genes encoding transcription factors and their regulatory factors, suggesting their positive involvement in de novo RNA synthesis. Indeed, introduction of one of such transcription factors, Spi-C, into the embryos at least partially mimicked the TSA-induced improvement in embryonic development by activating gene networks associated with transcriptional regulation. Thus, the effects of TSA treatment on embryonic gene expression did not seem to be stochastic, but more specific than expected, targeting genes that direct development and trigger zygotic genome activation at the 2-cell stage. PMID:25974394

  5. The nuclear genome of Rhazya stricta and the evolution of alkaloid diversity in a medically relevant clade of Apocynaceae

    PubMed Central

    Sabir, Jamal S. M.; Jansen, Robert K.; Arasappan, Dhivya; Calderon, Virginie; Noutahi, Emmanuel; Zheng, Chunfang; Park, Seongjun; Sabir, Meshaal J.; Baeshen, Mohammed N.; Hajrah, Nahid H.; Khiyami, Mohammad A.; Baeshen, Nabih A.; Obaid, Abdullah Y.; Al-Malki, Abdulrahman L.; Sankoff, David; El-Mabrouk, Nadia; Ruhlman, Tracey A.

    2016-01-01

    Alkaloid accumulation in plants is activated in response to stress, is limited in distribution and specific alkaloid repertoires are variable across taxa. Rauvolfioideae (Apocynaceae, Gentianales) represents a major center of structural expansion in the monoterpenoid indole alkaloids (MIAs) yielding thousands of unique molecules including highly valuable chemotherapeutics. The paucity of genome-level data for Apocynaceae precludes a deeper understanding of MIA pathway evolution hindering the elucidation of remaining pathway enzymes and the improvement of MIA availability in planta or in vitro. We sequenced the nuclear genome of Rhazya stricta (Apocynaceae, Rauvolfioideae) and present this high quality assembly in comparison with that of coffee (Rubiaceae, Coffea canephora, Gentianales) and others to investigate the evolution of genome-scale features. The annotated Rhazya genome was used to develop the community resource, RhaCyc, a metabolic pathway database. Gene family trees were constructed to identify homologs of MIA pathway genes and to examine their evolutionary history. We found that, unlike Coffea, the Rhazya lineage has experienced many structural rearrangements. Gene tree analyses suggest recent, lineage-specific expansion and diversification among homologs encoding MIA pathway genes in Gentianales and provide candidate sequences with the potential to close gaps in characterized pathways and support prospecting for new MIA production avenues. PMID:27653669

  6. The nuclear genome of Rhazya stricta and the evolution of alkaloid diversity in a medically relevant clade of Apocynaceae.

    PubMed

    Sabir, Jamal S M; Jansen, Robert K; Arasappan, Dhivya; Calderon, Virginie; Noutahi, Emmanuel; Zheng, Chunfang; Park, Seongjun; Sabir, Meshaal J; Baeshen, Mohammed N; Hajrah, Nahid H; Khiyami, Mohammad A; Baeshen, Nabih A; Obaid, Abdullah Y; Al-Malki, Abdulrahman L; Sankoff, David; El-Mabrouk, Nadia; Ruhlman, Tracey A

    2016-09-22

    Alkaloid accumulation in plants is activated in response to stress, is limited in distribution and specific alkaloid repertoires are variable across taxa. Rauvolfioideae (Apocynaceae, Gentianales) represents a major center of structural expansion in the monoterpenoid indole alkaloids (MIAs) yielding thousands of unique molecules including highly valuable chemotherapeutics. The paucity of genome-level data for Apocynaceae precludes a deeper understanding of MIA pathway evolution hindering the elucidation of remaining pathway enzymes and the improvement of MIA availability in planta or in vitro. We sequenced the nuclear genome of Rhazya stricta (Apocynaceae, Rauvolfioideae) and present this high quality assembly in comparison with that of coffee (Rubiaceae, Coffea canephora, Gentianales) and others to investigate the evolution of genome-scale features. The annotated Rhazya genome was used to develop the community resource, RhaCyc, a metabolic pathway database. Gene family trees were constructed to identify homologs of MIA pathway genes and to examine their evolutionary history. We found that, unlike Coffea, the Rhazya lineage has experienced many structural rearrangements. Gene tree analyses suggest recent, lineage-specific expansion and diversification among homologs encoding MIA pathway genes in Gentianales and provide candidate sequences with the potential to close gaps in characterized pathways and support prospecting for new MIA production avenues.

  7. Molecular identification of Nocardia species using the sodA gene: Identificación molecular de especies de Nocardia utilizando el gen sodA.

    PubMed

    Sánchez-Herrera, K; Sandoval, H; Mouniee, D; Ramírez-Durán, N; Bergeron, E; Boiron, P; Sánchez-Saucedo, N; Rodríguez-Nava, V

    2017-09-01

    Currently for bacterial identification and classification the rrs gene encoding 16S rRNA is used as a reference method for the analysis of strains of the genus Nocardia. However, it does not have enough polymorphism to differentiate them at the species level. This fact makes it necessary to search for molecular targets that can provide better identification. The sod A gene (encoding the enzyme superoxide dismutase) has had good results in identifying species of other Actinomycetes. In this study the sod A gene is proposed for the identification and differentiation at the species level of the genus Nocardia. We used 41 type species of various collections; a 386 bp fragment of the sod A gene was amplified and sequenced, and a phylogenetic analysis was performed comparing the genes rrs (1171 bp), hsp 65 (401 bp), sec A1 (494 bp), gyr B (1195 bp) and rpo B (401 bp). The sequences were aligned using the Clustal X program. Evolutionary trees according to the neighbour-joining method were created with the programs Phylo_win and MEGA 6. The specific variability of the sod A genus of the genus Nocardia was analysed. A high phylogenetic resolution, significant genetic variability, and specificity and reliability were observed for the differentiation of the isolates at the species level. The polymorphism observed in the sod A gene sequence contains variable regions that allow the discrimination of closely related Nocardia species. The clear specificity, despite its small size, proves to be of great advantage for use in taxonomic studies and clinical diagnosis of the genus Nocardia.

  8. Variability among Cucurbitaceae species (melon, cucumber and watermelon) in a genomic region containing a cluster of NBS-LRR genes.

    PubMed

    Morata, Jordi; Puigdomènech, Pere

    2017-02-08

    Cucurbitaceae species contain a significantly lower number of genes coding for proteins with similarity to plant resistance genes belonging to the NBS-LRR family than other plant species of similar genome size. A large proportion of these genes are organized in clusters that appear to be hotspots of variability. The genomes of the Cucurbitaceae species measured until now are intermediate in size (between 350 and 450 Mb) and they apparently have not undergone any genome duplications beside those at the origin of eudicots. The cluster containing the largest number of NBS-LRR genes has previously been analyzed in melon and related species and showed a high degree of interspecific and intraspecific variability. It was of interest to study whether similar behavior occurred in other cluster of the same family of genes. The cluster of NBS-LRR genes located in melon chromosome 9 was analyzed and compared with the syntenic regions in other cucurbit genomes. This is the second cluster in number within this species and it contains nine sequences with a NBS-LRR annotation including two genes, Fom1 and Prv, providing resistance against Fusarium and Ppapaya ring-spot virus (PRSV). The variability within the melon species appears to consist essentially of single nucleotide polymorphisms. Clusters of similar genes are present in the syntenic regions of the two species of Cucurbitaceae that were sequenced, cucumber and watermelon. Most of the genes in the syntenic clusters can be aligned between species and a hypothesis of generation of the cluster is proposed. The number of genes in the watermelon cluster is similar to that in melon while a higher number of genes (12) is present in cucumber, a species with a smaller genome than melon. After comparing genome resequencing data of 115 cucumber varieties, deletion of a group of genes is observed in a group of varieties of Indian origin. Clusters of genes coding for NBS-LRR proteins in cucurbits appear to have specific variability in different regions of the genome and between different species. This observation is in favour of considering that the adaptation of plant species to changing environments is based upon the variability that may occur at any location in the genome and that has been produced by specific mechanisms of sequence variation acting on plant genomes. This information could be useful both to understand the evolution of species and for plant breeding.

  9. Transforming RNA-Seq data to improve the performance of prognostic gene signatures.

    PubMed

    Zwiener, Isabella; Frisch, Barbara; Binder, Harald

    2014-01-01

    Gene expression measurements have successfully been used for building prognostic signatures, i.e for identifying a short list of important genes that can predict patient outcome. Mostly microarray measurements have been considered, and there is little advice available for building multivariable risk prediction models from RNA-Seq data. We specifically consider penalized regression techniques, such as the lasso and componentwise boosting, which can simultaneously consider all measurements and provide both, multivariable regression models for prediction and automated variable selection. However, they might be affected by the typical skewness, mean-variance-dependency or extreme values of RNA-Seq covariates and therefore could benefit from transformations of the latter. In an analytical part, we highlight preferential selection of covariates with large variances, which is problematic due to the mean-variance dependency of RNA-Seq data. In a simulation study, we compare different transformations of RNA-Seq data for potentially improving detection of important genes. Specifically, we consider standardization, the log transformation, a variance-stabilizing transformation, the Box-Cox transformation, and rank-based transformations. In addition, the prediction performance for real data from patients with kidney cancer and acute myeloid leukemia is considered. We show that signature size, identification performance, and prediction performance critically depend on the choice of a suitable transformation. Rank-based transformations perform well in all scenarios and can even outperform complex variance-stabilizing approaches. Generally, the results illustrate that the distribution and potential transformations of RNA-Seq data need to be considered as a critical step when building risk prediction models by penalized regression techniques.

  10. Transforming RNA-Seq Data to Improve the Performance of Prognostic Gene Signatures

    PubMed Central

    Zwiener, Isabella; Frisch, Barbara; Binder, Harald

    2014-01-01

    Gene expression measurements have successfully been used for building prognostic signatures, i.e for identifying a short list of important genes that can predict patient outcome. Mostly microarray measurements have been considered, and there is little advice available for building multivariable risk prediction models from RNA-Seq data. We specifically consider penalized regression techniques, such as the lasso and componentwise boosting, which can simultaneously consider all measurements and provide both, multivariable regression models for prediction and automated variable selection. However, they might be affected by the typical skewness, mean-variance-dependency or extreme values of RNA-Seq covariates and therefore could benefit from transformations of the latter. In an analytical part, we highlight preferential selection of covariates with large variances, which is problematic due to the mean-variance dependency of RNA-Seq data. In a simulation study, we compare different transformations of RNA-Seq data for potentially improving detection of important genes. Specifically, we consider standardization, the log transformation, a variance-stabilizing transformation, the Box-Cox transformation, and rank-based transformations. In addition, the prediction performance for real data from patients with kidney cancer and acute myeloid leukemia is considered. We show that signature size, identification performance, and prediction performance critically depend on the choice of a suitable transformation. Rank-based transformations perform well in all scenarios and can even outperform complex variance-stabilizing approaches. Generally, the results illustrate that the distribution and potential transformations of RNA-Seq data need to be considered as a critical step when building risk prediction models by penalized regression techniques. PMID:24416353

  11. Evolution of Alternative Adaptive Immune Systems in Vertebrates.

    PubMed

    Boehm, Thomas; Hirano, Masayuki; Holland, Stephen J; Das, Sabyasachi; Schorpp, Michael; Cooper, Max D

    2018-04-26

    Adaptive immunity in jawless fishes is based on antigen recognition by three types of variable lymphocyte receptors (VLRs) composed of variable leucine-rich repeats, which are differentially expressed by two T-like lymphocyte lineages and one B-like lymphocyte lineage. The T-like cells express either VLRAs or VLRCs of yet undefined antigen specificity, whereas the VLRB antibodies secreted by B-like cells bind proteinaceous and carbohydrate antigens. The incomplete VLR germline genes are assembled into functional units by a gene conversion-like mechanism that employs flanking variable leucine-rich repeat sequences as templates in association with lineage-specific expression of cytidine deaminases. B-like cells develop in the hematopoietic typhlosole and kidneys, whereas T-like cells develop in the thymoid, a thymus-equivalent region at the gill fold tips. Thus, the dichotomy between T-like and B-like cells and the presence of dedicated lymphopoietic tissues emerge as ancestral vertebrate features, whereas the somatic diversification of structurally distinct antigen receptor genes evolved independently in jawless and jawed vertebrates.

  12. Combining Genotype, Phenotype, and Environment to Infer Potential Candidate Genes.

    PubMed

    Talbot, Benoit; Chen, Ting-Wen; Zimmerman, Shawna; Joost, Stéphane; Eckert, Andrew J; Crow, Taylor M; Semizer-Cuming, Devrim; Seshadri, Chitra; Manel, Stéphanie

    2017-03-01

    Population genomic analysis can be an important tool in understanding local adaptation. Identification of potential adaptive loci in such analyses is usually based on the survey of a large genomic dataset in combination with environmental variables. Phenotypic data are less commonly incorporated into such studies, although combining a genome scan analysis with a phenotypic trait analysis can greatly improve the insights obtained from each analysis individually. Here, we aimed to identify loci potentially involved in adaptation to climate in 283 Loblolly pine (Pinus taeda) samples from throughout the species' range in the southeastern United States. We analyzed associations between phenotypic, molecular, and environmental variables from datasets of 3082 single nucleotide polymorphism (SNP) loci and 3 categories of phenotypic traits (gene expression, metabolites, and whole-plant traits). We found only 6 SNP loci that displayed potential signals of local adaptation. Five of the 6 identified SNPs are linked to gene expression traits for lignin development, and 1 is linked with whole-plant traits. We subsequently compared the 6 candidate genes with environmental variables and found a high correlation in only 3 of them (R2 > 0.2). Our study highlights the need for a combination of genotypes, phenotypes, and environmental variables, and for an appropriate sampling scheme and study design, to improve confidence in the identification of potential candidate genes. © The American Genetic Association 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  13. BASiCS: Bayesian Analysis of Single-Cell Sequencing Data

    PubMed Central

    Vallejos, Catalina A.; Marioni, John C.; Richardson, Sylvia

    2015-01-01

    Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell’s lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated to high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable supports the efficacy of our approach. PMID:26107944

  14. BASiCS: Bayesian Analysis of Single-Cell Sequencing Data.

    PubMed

    Vallejos, Catalina A; Marioni, John C; Richardson, Sylvia

    2015-06-01

    Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell's lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated to high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable supports the efficacy of our approach.

  15. Therapeutic Gene Editing Safety and Specificity.

    PubMed

    Lux, Christopher T; Scharenberg, Andrew M

    2017-10-01

    Therapeutic gene editing is significant for medical advancement. Safety is intricately linked to the specificity of the editing tools used to cut at precise genomic targets. Improvements can be achieved by thoughtful design of nucleases and repair templates, analysis of off-target editing, and careful utilization of viral vectors. Advancements in DNA repair mechanisms and development of new generations of tools improve targeting of specific sequences while minimizing risks. It is important to plot a safe course for future clinical trials. This article reviews safety and specificity for therapeutic gene editing to spur dialogue and advancement. Copyright © 2017 Elsevier Inc. All rights reserved.

  16. PROMISE: a tool to identify genomic features with a specific biologically interesting pattern of associations with multiple endpoint variables.

    PubMed

    Pounds, Stan; Cheng, Cheng; Cao, Xueyuan; Crews, Kristine R; Plunkett, William; Gandhi, Varsha; Rubnitz, Jeffrey; Ribeiro, Raul C; Downing, James R; Lamba, Jatinder

    2009-08-15

    In some applications, prior biological knowledge can be used to define a specific pattern of association of multiple endpoint variables with a genomic variable that is biologically most interesting. However, to our knowledge, there is no statistical procedure designed to detect specific patterns of association with multiple endpoint variables. Projection onto the most interesting statistical evidence (PROMISE) is proposed as a general procedure to identify genomic variables that exhibit a specific biologically interesting pattern of association with multiple endpoint variables. Biological knowledge of the endpoint variables is used to define a vector that represents the biologically most interesting values for statistics that characterize the associations of the endpoint variables with a genomic variable. A test statistic is defined as the dot-product of the vector of the observed association statistics and the vector of the most interesting values of the association statistics. By definition, this test statistic is proportional to the length of the projection of the observed vector of correlations onto the vector of most interesting associations. Statistical significance is determined via permutation. In simulation studies and an example application, PROMISE shows greater statistical power to identify genes with the interesting pattern of associations than classical multivariate procedures, individual endpoint analyses or listing genes that have the pattern of interest and are significant in more than one individual endpoint analysis. Documented R routines are freely available from www.stjuderesearch.org/depts/biostats and will soon be available as a Bioconductor package from www.bioconductor.org.

  17. Rare Genetic Forms of Obesity: Clinical Approach and Current Treatments in 2016

    PubMed Central

    Huvenne, Hélène; Dubern, Béatrice; Clément, Karine; Poitou, Christine

    2016-01-01

    Obesity results from a synergistic relationship between genes and the environment. The phenotypic expression of genetic factors involved in obesity is variable, allowing to distinguish several clinical pictures of obesity. Monogenic obesity is described as rare and severe early-onset obesity with abnormal feeding behavior and endocrine disorders. This is mainly due to autosomal recessive mutations in genes of the leptin-melanocortin pathway which plays a key role in the hypothalamic control of food intake. Melanocortin 4 receptor(MC4R)-linked obesity is characterized by the variable severity of obesity and no notable additional phenotypes. Mutations in the MC4R gene are involved in 2-3% of obese children and adults; the majority of these are heterozygous. Syndromic obesity is associated with mental retardation, dysmorphic features, and organ-specific developmental abnormalities. Additional genes participating in the development of hypothalamus and central nervous system have been regularly identified. But to date, not all involved genes have been identified so far. New diagnostic tools, such as whole-exome sequencing, will probably help to identify other genes. Managing these patients is challenging. Indeed, specific treatments are available only for specific types of monogenic obesity, such as leptin deficiency. Data on bariatric surgery are limited and controversial. New molecules acting on the leptin-melanocortin pathway are currently being developed. PMID:27241181

  18. [Environmental and genetic variables related with alterations in language acquisition in early childhood].

    PubMed

    Moriano-Gutierrez, A; Colomer-Revuelta, J; Sanjuan, J; Carot-Sierra, J M

    2017-01-01

    A great deal of research has addressed problems in the correct acquisition of language, but with few overall conclusions. The reasons for this lie in the individual variability, the existence of different measures for assessing language and the fact that a complex network of genetic and environmental factors are involved in its development. To review the environmental and genetic variables that have been studied to date, in order to gain a better under-standing of the causes of specific language impairment and create new evidence that can help in the development of screening systems for the early detection of these disorders. The environmental variables related with poorer early child language development include male gender, low level of education of the mother, familial history of problems with language or psychiatric problems, perinatal problems and health problems in early childhood. Bilingualism seems to be a protective factor. Temperament and language are related. Within the genetic factors there are several specific genes associated with language, two of which have a greater influence on its physiological acquisition: FOXP2 and CNTNAP2. The other genes that are most related with specific language disorders are ATP2C2, CMIP, ROBO2, ZNF277 and NOP9. The key to comprehending the development of specific language disorders lies in reaching an understanding of the true role played by genes in the ontogenesis, in the regulation of the different developmental processes, and how this role is modulated by the environment.

  19. State Space Model with hidden variables for reconstruction of gene regulatory networks.

    PubMed

    Wu, Xi; Li, Peng; Wang, Nan; Gong, Ping; Perkins, Edward J; Deng, Youping; Zhang, Chaoyang

    2011-01-01

    State Space Model (SSM) is a relatively new approach to inferring gene regulatory networks. It requires less computational time than Dynamic Bayesian Networks (DBN). There are two types of variables in the linear SSM, observed variables and hidden variables. SSM uses an iterative method, namely Expectation-Maximization, to infer regulatory relationships from microarray datasets. The hidden variables cannot be directly observed from experiments. How to determine the number of hidden variables has a significant impact on the accuracy of network inference. In this study, we used SSM to infer Gene regulatory networks (GRNs) from synthetic time series datasets, investigated Bayesian Information Criterion (BIC) and Principle Component Analysis (PCA) approaches to determining the number of hidden variables in SSM, and evaluated the performance of SSM in comparison with DBN. True GRNs and synthetic gene expression datasets were generated using GeneNetWeaver. Both DBN and linear SSM were used to infer GRNs from the synthetic datasets. The inferred networks were compared with the true networks. Our results show that inference precision varied with the number of hidden variables. For some regulatory networks, the inference precision of DBN was higher but SSM performed better in other cases. Although the overall performance of the two approaches is compatible, SSM is much faster and capable of inferring much larger networks than DBN. This study provides useful information in handling the hidden variables and improving the inference precision.

  20. Use of Partial Least Squares improves the efficacy of removing unwanted variability in differential expression analyses based on RNA-Seq data.

    PubMed

    Chakraborty, Sutirtha

    2018-05-26

    RNA-Seq technology has revolutionized the face of gene expression profiling by generating read count data measuring the transcript abundances for each queried gene on multiple experimental subjects. But on the downside, the underlying technical artefacts and hidden biological profiles of the samples generate a wide variety of latent effects that may potentially distort the actual transcript/gene expression signals. Standard normalization techniques fail to correct for these hidden variables and lead to flawed downstream analyses. In this work I demonstrate the use of Partial Least Squares (built as an R package 'SVAPLSseq') to correct for the traces of extraneous variability in RNA-Seq data. A novel and thorough comparative analysis of the PLS based method is presented along with some of the other popularly used approaches for latent variable correction in RNA-Seq. Overall, the method is found to achieve a substantially improved estimation of the hidden effect signatures in the RNA-Seq transcriptome expression landscape compared to other available techniques. Copyright © 2017. Published by Elsevier Inc.

  1. Regional differentiation among populations of the Diamondback terrapin (Malaclemys terrapin)

    USGS Publications Warehouse

    Hart, Kristen M.; Hunter, Margaret E.; King, Tim L.

    2014-01-01

    The Diamondback terrapin (Malaclemys terrapin) is a brackish-water turtle species whose populations have been fragmented due to anthropogenic activity such as development of coastal habitat and entrapment in commercial blue crab (Callinectes sapidus) fishing gear. Genetic analyses can improve conservation efforts for the long-term protection of the species. We used microsatellite DNA analysis to investigate levels of gene flow among and genetic variability within 21 geographically separate collections of the species distributed from Massachusetts to Texas. Quantified levels of genetic variability (allelic diversity, genotypic frequencies, and heterozygosity) revealed three zones of genetic discontinuity, resulting in four discrete populations: Northeast Atlantic, Coastal Mid-Atlantic, Florida and Texas/Louisiana. The average number of alleles and expected heterozygosity for the four genetic clusters were NA = 6.54 and HE = 0.050, respectively. However, the geographic boundaries of the populations did not correspond to accepted terrapin subspecies limits. Our results illuminate not only the need to sample terrapins in additional sites, specifically in the southeast, but also the necessity for allowing uninterrupted gene flow among population groupings to preserve current levels of genetic diversity.

  2. The gravity of pollination: integrating at-site features into spatial analysis of contemporary pollen movement.

    PubMed

    DiLeo, Michelle F; Siu, Jenna C; Rhodes, Matthew K; López-Villalobos, Adriana; Redwine, Angela; Ksiazek, Kelly; Dyer, Rodney J

    2014-08-01

    Pollen-mediated gene flow is a major driver of spatial genetic structure in plant populations. Both individual plant characteristics and site-specific features of the landscape can modify the perceived attractiveness of plants to their pollinators and thus play an important role in shaping spatial genetic variation. Most studies of landscape-level genetic connectivity in plants have focused on the effects of interindividual distance using spatial and increasingly ecological separation, yet have not incorporated individual plant characteristics or other at-site ecological variables. Using spatially explicit simulations, we first tested the extent to which the inclusion of at-site variables influencing local pollination success improved the statistical characterization of genetic connectivity based upon examination of pollen pool genetic structure. The addition of at-site characteristics provided better models than those that only considered interindividual spatial distance (e.g. IBD). Models parameterized using conditional genetic covariance (e.g. population graphs) also outperformed those assuming panmixia. In a natural population of Cornus florida L. (Cornaceae), we showed that the addition of at-site characteristics (clumping of primary canopy opening above each maternal tree and maternal tree floral output) provided significantly better models describing gene flow than models including only between-site spatial (IBD) and ecological (isolation by resistance) variables. Overall, our results show that including interindividual and local ecological variation greatly aids in characterizing landscape-level measures of contemporary gene flow. © 2014 John Wiley & Sons Ltd.

  3. RNAseq-based transcriptome comparison of Saccharomyces cerevisiae strains isolated from diverse fermentative environments.

    PubMed

    Ibáñez, Clara; Pérez-Torrado, Roberto; Morard, Miguel; Toft, Christina; Barrio, Eladio; Querol, Amparo

    2017-09-18

    Transcriptome analyses play a central role in unraveling the complexity of gene expression regulation in Saccharomyces cerevisiae. This species, one of the most important microorganisms for humans given its industrial applications, shows an astonishing degree of genetic and phenotypic variability among different strains adapted to specific environments. In order to gain novel insights into the Saccharomyces cerevisiae biology of strains adapted to different fermentative environments, we analyzed the whole transcriptome of three strains isolated from wine, flor wine or mezcal fermentations. An RNA-seq transcriptome comparison of the different yeasts in the samples obtained during synthetic must fermentation highlighted the differences observed in the genes that encode mannoproteins, and in those involved in aroma, sugar transport, glycerol and alcohol metabolism, which are important under alcoholic fermentation conditions. These differences were also observed in the physiology of the strains after mannoprotein and aroma determinations. This study offers an essential foundation for understanding how gene expression variations contribute to the fermentation differences of the strains adapted to unequal fermentative environments. Such knowledge is crucial to make improvements in fermentation processes and to define targets for the genetic improvement or selection of wine yeasts. Copyright © 2017 Elsevier B.V. All rights reserved.

  4. Species-specific identification of Dekkera/Brettanomyces yeasts by fluorescently labeled DNA probes targeting the 26S rRNA.

    PubMed

    Röder, Christoph; König, Helmut; Fröhlich, Jürgen

    2007-09-01

    Sequencing of the complete 26S rRNA genes of all Dekkera/Brettanomyces species colonizing different beverages revealed the potential for a specific primer and probe design to support diagnostic PCR approaches and FISH. By analysis of the complete 26S rRNA genes of all five currently known Dekkera/Brettanomyces species (Dekkera bruxellensis, D. anomala, Brettanomyces custersianus, B. nanus and B. naardenensis), several regions with high nucleotide sequence variability yet distinct from the D1/D2 domains were identified. FISH species-specific probes targeting the 26S rRNA gene's most variable regions were designed. Accessibility of probe targets for hybridization was facilitated by the construction of partially complementary 'side'-labeled probes, based on secondary structure models of the rRNA sequences. The specificity and routine applicability of the FISH-based method for yeast identification were tested by analyzing different wine isolates. Investigation of the prevalence of Dekkera/Brettanomyces yeasts in the German viticultural regions Wonnegau, Nierstein and Bingen (Rhinehesse, Rhineland-Palatinate) resulted in the isolation of 37 D. bruxellensis strains from 291 wine samples.

  5. Using variable rate models to identify genes under selection in sequence pairs: their validity and limitations for EST sequences.

    PubMed

    Church, Sheri A; Livingstone, Kevin; Lai, Zhao; Kozik, Alexander; Knapp, Steven J; Michelmore, Richard W; Rieseberg, Loren H

    2007-02-01

    Using likelihood-based variable selection models, we determined if positive selection was acting on 523 EST sequence pairs from two lineages of sunflower and lettuce. Variable rate models are generally not used for comparisons of sequence pairs due to the limited information and the inaccuracy of estimates of specific substitution rates. However, previous studies have shown that the likelihood ratio test (LRT) is reliable for detecting positive selection, even with low numbers of sequences. These analyses identified 56 genes that show a signature of selection, of which 75% were not identified by simpler models that average selection across codons. Subsequent mapping studies in sunflower show four of five of the positively selected genes identified by these methods mapped to domestication QTLs. We discuss the validity and limitations of using variable rate models for comparisons of sequence pairs, as well as the limitations of using ESTs for identification of positively selected genes.

  6. X chromosome regulation: diverse patterns in development, tissues and disease

    PubMed Central

    Deng, Xinxian; Berletch, Joel B.; Nguyen, Di K.; Disteche, Christine M.

    2014-01-01

    Genes on the mammalian X chromosome are present in one copy in males and two copies in females. The complex mechanisms that regulate the X chromosome lead to evolutionary and physiological variability in gene expression between species, the sexes, individuals, developmental stages, tissues and cell types. In early development, delayed and incomplete X chromosome inactivation (XCI) in some species causes variability in gene expression. Additional diversity stems from escape from XCI and from mosaicism or XCI skewing in females. This causes sex-specific differences that manifest as differential gene expression and associated phenotypes. Furthermore, the complexity and diversity of X dosage regulation affect the severity of diseases caused by X-linked mutations. PMID:24733023

  7. PROMISE: a tool to identify genomic features with a specific biologically interesting pattern of associations with multiple endpoint variables

    PubMed Central

    Pounds, Stan; Cheng, Cheng; Cao, Xueyuan; Crews, Kristine R.; Plunkett, William; Gandhi, Varsha; Rubnitz, Jeffrey; Ribeiro, Raul C.; Downing, James R.; Lamba, Jatinder

    2009-01-01

    Motivation: In some applications, prior biological knowledge can be used to define a specific pattern of association of multiple endpoint variables with a genomic variable that is biologically most interesting. However, to our knowledge, there is no statistical procedure designed to detect specific patterns of association with multiple endpoint variables. Results: Projection onto the most interesting statistical evidence (PROMISE) is proposed as a general procedure to identify genomic variables that exhibit a specific biologically interesting pattern of association with multiple endpoint variables. Biological knowledge of the endpoint variables is used to define a vector that represents the biologically most interesting values for statistics that characterize the associations of the endpoint variables with a genomic variable. A test statistic is defined as the dot-product of the vector of the observed association statistics and the vector of the most interesting values of the association statistics. By definition, this test statistic is proportional to the length of the projection of the observed vector of correlations onto the vector of most interesting associations. Statistical significance is determined via permutation. In simulation studies and an example application, PROMISE shows greater statistical power to identify genes with the interesting pattern of associations than classical multivariate procedures, individual endpoint analyses or listing genes that have the pattern of interest and are significant in more than one individual endpoint analysis. Availability: Documented R routines are freely available from www.stjuderesearch.org/depts/biostats and will soon be available as a Bioconductor package from www.bioconductor.org. Contact: stanley.pounds@stjude.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19528086

  8. Development of a real-time PCR method for the differential detection and quantification of four solanaceae in GMO analysis: potato (Solanum tuberosum), tomato (Solanum lycopersicum), eggplant (Solanum melongena), and pepper (Capsicum annuum).

    PubMed

    Chaouachi, Maher; El Malki, Redouane; Berard, Aurélie; Romaniuk, Marcel; Laval, Valérie; Brunel, Dominique; Bertheau, Yves

    2008-03-26

    The labeling of products containing genetically modified organisms (GMO) is linked to their quantification since a threshold for the presence of fortuitous GMOs in food has been established. This threshold is calculated from a combination of two absolute quantification values: one for the specific GMO target and the second for an endogenous reference gene specific to the taxon. Thus, the development of reliable methods to quantify GMOs using endogenous reference genes in complex matrixes such as food and feed is needed. Plant identification can be difficult in the case of closely related taxa, which moreover are subject to introgression events. Based on the homology of beta-fructosidase sequences obtained from public databases, two couples of consensus primers were designed for the detection, quantification, and differentiation of four Solanaceae: potato (Solanum tuberosum), tomato (Solanum lycopersicum), pepper (Capsicum annuum), and eggplant (Solanum melongena). Sequence variability was studied first using lines and cultivars (intraspecies sequence variability), then using taxa involved in gene introgressions, and finally, using taxonomically close taxa (interspecies sequence variability). This study allowed us to design four highly specific TaqMan-MGB probes. A duplex real time PCR assay was developed for simultaneous quantification of tomato and potato. For eggplant and pepper, only simplex real time PCR tests were developed. The results demonstrated the high specificity and sensitivity of the assays. We therefore conclude that beta-fructosidase can be used as an endogenous reference gene for GMO analysis.

  9. Dose response relationship in anti-stress gene regulatory networks.

    PubMed

    Zhang, Qiang; Andersen, Melvin E

    2007-03-02

    To maintain a stable intracellular environment, cells utilize complex and specialized defense systems against a variety of external perturbations, such as electrophilic stress, heat shock, and hypoxia, etc. Irrespective of the type of stress, many adaptive mechanisms contributing to cellular homeostasis appear to operate through gene regulatory networks that are organized into negative feedback loops. In general, the degree of deviation of the controlled variables, such as electrophiles, misfolded proteins, and O2, is first detected by specialized sensor molecules, then the signal is transduced to specific transcription factors. Transcription factors can regulate the expression of a suite of anti-stress genes, many of which encode enzymes functioning to counteract the perturbed variables. The objective of this study was to explore, using control theory and computational approaches, the theoretical basis that underlies the steady-state dose response relationship between cellular stressors and intracellular biochemical species (controlled variables, transcription factors, and gene products) in these gene regulatory networks. Our work indicated that the shape of dose response curves (linear, superlinear, or sublinear) depends on changes in the specific values of local response coefficients (gains) distributed in the feedback loop. Multimerization of anti-stress enzymes and transcription factors into homodimers, homotrimers, or even higher-order multimers, play a significant role in maintaining robust homeostasis. Moreover, our simulation noted that dose response curves for the controlled variables can transition sequentially through four distinct phases as stressor level increases: initial superlinear with lesser control, superlinear more highly controlled, linear uncontrolled, and sublinear catastrophic. Each phase relies on specific gain-changing events that come into play as stressor level increases. The low-dose region is intrinsically nonlinear, and depending on the level of local gains, presence of gain-changing events, and degree of feedforward gene activation, this region can appear as superlinear, sublinear, or even J-shaped. The general dose response transition proposed here was further examined in a complex anti-electrophilic stress pathway, which involves multiple genes, enzymes, and metabolic reactions. This work would help biologists and especially toxicologists to better assess and predict the cellular impact brought about by biological stressors.

  10. A condition-specific codon optimization approach for improved heterologous gene expression in Saccharomyces cerevisiae

    PubMed Central

    2014-01-01

    Background Heterologous gene expression is an important tool for synthetic biology that enables metabolic engineering and the production of non-natural biologics in a variety of host organisms. The translational efficiency of heterologous genes can often be improved by optimizing synonymous codon usage to better match the host organism. However, traditional approaches for optimization neglect to take into account many factors known to influence synonymous codon distributions. Results Here we define an alternative approach for codon optimization that utilizes systems level information and codon context for the condition under which heterologous genes are being expressed. Furthermore, we utilize a probabilistic algorithm to generate multiple variants of a given gene. We demonstrate improved translational efficiency using this condition-specific codon optimization approach with two heterologous genes, the fluorescent protein-encoding eGFP and the catechol 1,2-dioxygenase gene CatA, expressed in S. cerevisiae. For the latter case, optimization for stationary phase production resulted in nearly 2.9-fold improvements over commercial gene optimization algorithms. Conclusions Codon optimization is now often a standard tool for protein expression, and while a variety of tools and approaches have been developed, they do not guarantee improved performance for all hosts of applications. Here, we suggest an alternative method for condition-specific codon optimization and demonstrate its utility in Saccharomyces cerevisiae as a proof of concept. However, this technique should be applicable to any organism for which gene expression data can be generated and is thus of potential interest for a variety of applications in metabolic and cellular engineering. PMID:24636000

  11. Variable sexually dimorphic gene expression in laboratory strains of Drosophila melanogaster.

    PubMed

    Baker, Dean A; Meadows, Lisa A; Wang, Jing; Dow, Julian At; Russell, Steven

    2007-12-10

    Wild-type laboratory strains of model organisms are typically kept in isolation for many years, with the action of genetic drift and selection on mutational variation causing lineages to diverge with time. Natural populations from which such strains are established, show that gender-specific interactions in particular drive many aspects of sequence level and transcriptional level variation. Here, our goal was to identify genes that display transcriptional variation between laboratory strains of Drosophila melanogaster, and to explore evidence of gender-biased interactions underlying that variability. Transcriptional variation among the laboratory genotypes studied occurs more frequently in males than in females. Qualitative differences are also apparent to suggest that genes within particular functional classes disproportionately display variation in gene expression. Our analysis indicates that genes with reproductive functions are most often divergent between genotypes in both sexes, however a large proportion of female variation can also be attributed to genes without expression in the ovaries. The present study clearly shows that transcriptional variation between common laboratory strains of Drosophila can differ dramatically due to sexual dimorphism. Much of this variation reflects sex-specific challenges associated with divergent physiological trade-offs, morphology and regulatory pathways operating within males and females.

  12. Gene-diet interactions and aging in C. elegans

    PubMed Central

    Yen, Chia An; Curran, Sean P.

    2016-01-01

    Diet is the most variable aspect of life history, as most individuals have a large diversity of food choices, varying in the type and amount that they ingest. In the short-term, diet can affect metabolism and energy levels. However, in the long run, the net deficiency or excess of calories from diet can influence the progression and severity of age-related diseases. An old and yet still debated question is: how do specific dietary choices impact health- and lifespan? It is clear that genetics can play a critical role — perhaps just as important as diet choices. For example, poor diet in combination with genetic susceptibility can lead to metabolic disorders, such as obesity and type 2 diabetes. Recent work in Caenorhabditis elegans has identified the existence of diet-gene pairs, where the consequence of mutating a specific gene is only realized on specific diets. Many core metabolic pathways are conserved from worm to human. Although only a handful of these diet-gene pairs has been characterized, there are potentially hundreds, if not thousands, of such interactions, which may explain the variability in the rates of aging in humans and the incidence and severity of age-related diseases. PMID:26924670

  13. Genetic variability of milk fatty acids.

    PubMed

    Arnould, V M-R; Soyeurt, H

    2009-01-01

    The milk fatty acid (FA) profile is far from the optimal fat composition in regards to human health. The natural sources of variation, such as feeding or genetics, could be used to increase the concentrations of unsaturated fatty acids. The impact of feeding is well described. However, genetic effects on the milk FA composition begin to be extensively studied. This paper summarizes the available information about the genetic variability of FAs. The greatest breed differences in FA composition are observed between Holstein and Jersey milk. Milk fat of the latter breed contains higher concentrations of saturated FAs, especially short-chain FAs. The variation of the delta-9 desaturase activity estimated from specific FA ratios could explain partly these breed differences. The choice of a specific breed seems to be a possibility to improve the nutritional quality of milk fat. Generally, the proportions of FAs in milk are more heritable than the proportions of these same FAs in fat. Heritability estimates range from 0.00 to 0.54. The presence of some single nucleotide polymorphisms could explain partly the observed individual genetic variability. The polymorphisms detected on SCD1 and DGAT1 genes influence the milk FA composition. The SCD1 V allele increases the unsaturation of C16 and C18. The DGAT1 A allele is related to the unsaturation of C18. So, a combination of the molecular and quantitative approaches should be used to develop tools helping farmers in the selection of their animals to improve the nutritional quality of the produced milk fat.

  14. Deep sequencing reveals cell-type-specific patterns of single-cell transcriptome variation.

    PubMed

    Dueck, Hannah; Khaladkar, Mugdha; Kim, Tae Kyung; Spaethling, Jennifer M; Francis, Chantal; Suresh, Sangita; Fisher, Stephen A; Seale, Patrick; Beck, Sheryl G; Bartfai, Tamas; Kuhn, Bernhard; Eberwine, James; Kim, Junhyong

    2015-06-09

    Differentiation of metazoan cells requires execution of different gene expression programs but recent single-cell transcriptome profiling has revealed considerable variation within cells of seeming identical phenotype. This brings into question the relationship between transcriptome states and cell phenotypes. Additionally, single-cell transcriptomics presents unique analysis challenges that need to be addressed to answer this question. We present high quality deep read-depth single-cell RNA sequencing for 91 cells from five mouse tissues and 18 cells from two rat tissues, along with 30 control samples of bulk RNA diluted to single-cell levels. We find that transcriptomes differ globally across tissues with regard to the number of genes expressed, the average expression patterns, and within-cell-type variation patterns. We develop methods to filter genes for reliable quantification and to calibrate biological variation. All cell types include genes with high variability in expression, in a tissue-specific manner. We also find evidence that single-cell variability of neuronal genes in mice is correlated with that in rats consistent with the hypothesis that levels of variation may be conserved. Single-cell RNA-sequencing data provide a unique view of transcriptome function; however, careful analysis is required in order to use single-cell RNA-sequencing measurements for this purpose. Technical variation must be considered in single-cell RNA-sequencing studies of expression variation. For a subset of genes, biological variability within each cell type appears to be regulated in order to perform dynamic functions, rather than solely molecular noise.

  15. Genome-wide analysis of differential transcriptional and epigenetic variability across human immune cell types.

    PubMed

    Ecker, Simone; Chen, Lu; Pancaldi, Vera; Bagger, Frederik O; Fernández, José María; Carrillo de Santa Pau, Enrique; Juan, David; Mann, Alice L; Watt, Stephen; Casale, Francesco Paolo; Sidiropoulos, Nikos; Rapin, Nicolas; Merkel, Angelika; Stunnenberg, Hendrik G; Stegle, Oliver; Frontini, Mattia; Downes, Kate; Pastinen, Tomi; Kuijpers, Taco W; Rico, Daniel; Valencia, Alfonso; Beck, Stephan; Soranzo, Nicole; Paul, Dirk S

    2017-01-26

    A healthy immune system requires immune cells that adapt rapidly to environmental challenges. This phenotypic plasticity can be mediated by transcriptional and epigenetic variability. We apply a novel analytical approach to measure and compare transcriptional and epigenetic variability genome-wide across CD14 + CD16 - monocytes, CD66b + CD16 + neutrophils, and CD4 + CD45RA + naïve T cells from the same 125 healthy individuals. We discover substantially increased variability in neutrophils compared to monocytes and T cells. In neutrophils, genes with hypervariable expression are found to be implicated in key immune pathways and are associated with cellular properties and environmental exposure. We also observe increased sex-specific gene expression differences in neutrophils. Neutrophil-specific DNA methylation hypervariable sites are enriched at dynamic chromatin regions and active enhancers. Our data highlight the importance of transcriptional and epigenetic variability for the key role of neutrophils as the first responders to inflammatory stimuli. We provide a resource to enable further functional studies into the plasticity of immune cells, which can be accessed from: http://blueprint-dev.bioinfo.cnio.es/WP10/hypervariability .

  16. Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse

    PubMed Central

    Hillier, LaDeana W.; Zody, Michael C.; Goldstein, Steve; She, Xinwe; Bult, Carol J.; Agarwala, Richa; Cherry, Joshua L.; DiCuccio, Michael; Hlavina, Wratko; Kapustin, Yuri; Meric, Peter; Maglott, Donna; Birtle, Zoë; Marques, Ana C.; Graves, Tina; Zhou, Shiguo; Teague, Brian; Potamousis, Konstantinos; Churas, Christopher; Place, Michael; Herschleb, Jill; Runnheim, Ron; Forrest, Daniel; Amos-Landgraf, James; Schwartz, David C.; Cheng, Ze; Lindblad-Toh, Kerstin; Eichler, Evan E.; Ponting, Chris P.

    2009-01-01

    The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non–protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not. PMID:19468303

  17. Population-specific association of genes for telomere-associated proteins with longevity in an Italian population.

    PubMed

    Crocco, Paolina; Barale, Roberto; Rose, Giuseppina; Rizzato, Cosmeri; Santoro, Aurelia; De Rango, Francesco; Carrai, Maura; Fogar, Paola; Monti, Daniela; Biondi, Fiammetta; Bucci, Laura; Ostan, Rita; Tallaro, Federica; Montesanto, Alberto; Zambon, Carlo-Federico; Franceschi, Claudio; Canzian, Federico; Passarino, Giuseppe; Campa, Daniele

    2015-06-01

    Leukocyte telomere length (LTL) has been observed to be hereditable and correlated with longevity. However, contrasting results have been reported in different populations on the value of LTL heritability and on how biology of telomeres influences longevity. We investigated whether the variability of genes correlated to telomere maintenance is associated with telomere length and affects longevity in a population from Southern Italy (20-106 years). For this purpose we analyzed thirty-one polymorphisms in eight telomerase-associated genes of which twelve in the genes coding for the core enzyme (TERT and TERC) and the remaining in genes coding for components of the telomerase complex (TERF1, TERF2, TERF2IP, TNKS, TNKS2 and TEP1). We did not observe (after correcting for multiple testing) statistically significant associations between SNPs and LTL, possibly suggesting a low genetic influence of the variability of these genes on LTL in the elderly. On the other hand, we found that the variability of genes encoding for TERF1 and TNKS2, not directly involved in LTL, but important for keeping the integrity of the structure, shows a significant association with longevity. This suggests that the maintenance of these chromosomal structures may be critically important for preventing, or delaying, senescence and aging. Such a correlation was not observed in a population from northern Italy that we used as an independent replication set. This discrepancy is in line with previous reports regarding both the population specificity of results on telomere biology and the differences of aging in northern and southern Italy.

  18. Polymer-Mediated Delivery of siRNAs to Hepatocellular Carcinoma: Variables Affecting Specificity and Effectiveness.

    PubMed

    Farra, Rossella; Musiani, Francesco; Perrone, Francesca; Čemažar, Maja; Kamenšek, Urška; Tonon, Federica; Abrami, Michela; Ručigaj, Aleš; Grassi, Mario; Pozzato, Gabriele; Bonazza, Deborah; Zanconati, Fabrizio; Forte, Giancarlo; El Boustani, Maguie; Scarabel, Lucia; Garziera, Marica; Russo Spena, Concetta; De Stefano, Lucia; Salis, Barbara; Toffoli, Giuseppe; Rizzolio, Flavio; Grassi, Gabriele; Dapas, Barbara

    2018-03-28

    Despite the advances in anticancer therapies, their effectiveness for many human tumors is still far from being optimal. Significant improvements in treatment efficacy can come from the enhancement of drug specificity. This goal may be achieved by combining the use of therapeutic molecules with tumor specific effects and delivery carriers with tumor targeting ability. In this regard, nucleic acid-based drug (NABD) and particularly small interfering RNAs (siRNAs), are attractive molecules due to the possibility to be engineered to target specific tumor genes. On the other hand, polymeric-based delivery systems are emerging as versatile carriers to generate tumor-targeted delivery systems. Here we will focus on the most recent findings in the selection of siRNA/polymeric targeted delivery systems for hepatocellular carcinoma (HCC), a human tumor for which currently available therapeutic approaches are poorly effective. In addition, we will discuss the most attracting and, in our opinion, promising siRNA-polymer combinations for HCC in relation to the biological features of HCC tissue. Attention will be also put on the mathematical description of the mechanisms ruling siRNA-carrier delivery, this being an important aspect to improve effectiveness reducing the experimental work.

  19. Molecular diagnostics for human leptospirosis.

    PubMed

    Waggoner, Jesse J; Pinsky, Benjamin A

    2016-10-01

    The definitive diagnosis of leptospirosis, which results from infection with spirochetes of the genus Leptospira, currently relies on the use of culture, serological testing (microscopic agglutination testing), and molecular detection. The purpose of this review is to describe new molecular diagnostics for Leptospira and discuss advancements in the use of available methods. Efforts have been focused on improving the clinical sensitivity of Leptospira detection using molecular methods. In this review, we describe a reoptimized pathogenic species-specific real-time PCR (targeting lipL32) that has demonstrated improved sensitivity, findings by two groups that real-time reverse-transcription PCR assays targeting the 16S rrs gene can improve detection, and two new loop-mediated amplification techniques. Quantitation of leptospiremia, detection in different specimen types, and the complementary roles played by molecular detection and microscopic agglutination testing will be discussed. Finally, a protocol for Leptospira strain subtyping using variable number tandem repeat targets and high-resolution melting will be described. Molecular diagnostics have an established role for the diagnosis of leptospirosis and provide an actionable diagnosis in the acute setting. The use of real-time reverse-transcription PCR for testing serum/plasma and cerebrospinal fluid, when available, may improve the detection of Leptospira without decreasing clinical specificity.

  20. Improved high-dimensional prediction with Random Forests by the use of co-data.

    PubMed

    Te Beest, Dennis E; Mes, Steven W; Wilting, Saskia M; Brakenhoff, Ruud H; van de Wiel, Mark A

    2017-12-28

    Prediction in high dimensional settings is difficult due to the large number of variables relative to the sample size. We demonstrate how auxiliary 'co-data' can be used to improve the performance of a Random Forest in such a setting. Co-data are incorporated in the Random Forest by replacing the uniform sampling probabilities that are used to draw candidate variables by co-data moderated sampling probabilities. Co-data here are defined as any type information that is available on the variables of the primary data, but does not use its response labels. These moderated sampling probabilities are, inspired by empirical Bayes, learned from the data at hand. We demonstrate the co-data moderated Random Forest (CoRF) with two examples. In the first example we aim to predict the presence of a lymph node metastasis with gene expression data. We demonstrate how a set of external p-values, a gene signature, and the correlation between gene expression and DNA copy number can improve the predictive performance. In the second example we demonstrate how the prediction of cervical (pre-)cancer with methylation data can be improved by including the location of the probe relative to the known CpG islands, the number of CpG sites targeted by a probe, and a set of p-values from a related study. The proposed method is able to utilize auxiliary co-data to improve the performance of a Random Forest.

  1. Characterization of Lipooligosaccharide-Biosynthetic Loci of Campylobacter jejuni Reveals New Lipooligosaccharide Classes: Evidence of Mosaic Organizations▿ †

    PubMed Central

    Parker, Craig T.; Gilbert, Michel; Yuki, Nobuhiro; Endtz, Hubert P.; Mandrell, Robert E.

    2008-01-01

    The lipooligosaccharide (LOS) biosynthesis region is one of the more variable genomic regions between strains of Campylobacter jejuni. Indeed, eight classes of LOS biosynthesis loci have been established previously based on gene content and organization. In this study, we characterize additional classes of LOS biosynthesis loci and analyze various mechanisms that result in changes to LOS structures. To gain further insights into the genomic diversity of C. jejuni LOS biosynthesis region, we sequenced the LOS biosynthesis loci of 15 strains that possessed gene content that was distinct from the eight classes. This analysis identified 11 new classes of LOS loci that exhibited examples of deletions and insertions of genes and cassettes of genes found in other LOS classes or capsular biosynthesis loci leading to mosaic LOS loci. The sequence analysis also revealed both missense mutations leading to “allelic” glycosyltransferases and phase-variable and non-phase-variable gene inactivation by the deletion or insertion of bases. Specifically, we demonstrated that gene inactivation is an important mechanism for altering the LOS structures of strains possessing the same class of LOS biosynthesis locus. Together, these observations suggest that LOS biosynthesis region is a hotspot for genetic exchange and variability, often leading to changes in the LOS produced. PMID:18556784

  2. Retroviral vectors encoding ADA regulatory locus control region provide enhanced T-cell-specific transgene expression.

    PubMed

    Trinh, Alice T; Ball, Bret G; Weber, Erin; Gallaher, Timothy K; Gluzman-Poltorak, Zoya; Anderson, French; Basile, Lena A

    2009-12-30

    Murine retroviral vectors have been used in several hundred gene therapy clinical trials, but have fallen out of favor for a number of reasons. One issue is that gene expression from viral or internal promoters is highly variable and essentially unregulated. Moreover, with retroviral vectors, gene expression is usually silenced over time. Mammalian genes, in contrast, are characterized by highly regulated, precise levels of expression in both a temporal and a cell-specific manner. To ascertain if recapitulation of endogenous adenosine deaminase (ADA) expression can be achieved in a vector construct we created a new series of Moloney murine leukemia virus (MuLV) based retroviral vector that carry human regulatory elements including combinations of the ADA promoter, the ADA locus control region (LCR), ADA introns and human polyadenylation sequences in a self-inactivating vector backbone. A MuLV-based retroviral vector with a self-inactivating (SIN) backbone, the phosphoglycerate kinase promoter (PGK) and the enhanced green fluorescent protein (eGFP), as a reporter gene, was generated. Subsequent vectors were constructed from this basic vector by deletion or addition of certain elements. The added elements that were assessed are the human ADA promoter, human ADA locus control region (LCR), introns 7, 8, and 11 from the human ADA gene, and human growth hormone polyadenylation signal. Retroviral vector particles were produced by transient three-plasmid transfection of 293T cells. Retroviral vectors encoding eGFP were titered by transducing 293A cells, and then the proportion of GFP-positive cells was determined using fluorescence-activated cell sorting (FACS). Non T-cell and T-cell lines were transduced at a multiplicity of infection (MOI) of 0.1 and the yield of eGFP transgene expression was evaluated by FACS analysis using mean fluorescent intensity (MFI) detection. Vectors that contained the ADA LCR were preferentially expressed in T-cell lines. Further improvements in T-cell specific gene expression were observed with the incorporation of additional cis-regulatory elements, such as a human polyadenylation signal and intron 7 from the human ADA gene. These studies suggest that the combination of an authentically regulated ADA gene in a murine retroviral vector, together with additional locus-specific regulatory refinements, will yield a vector with a safer profile and greater efficacy in terms of high-level, therapeutic, regulated gene expression for the treatment of ADA-deficient severe combined immunodeficiency.

  3. Analysis of variable sites between two complete South China tiger (Panthera tigris amoyensis) mitochondrial genomes.

    PubMed

    Zhang, Wenping; Yue, Bisong; Wang, Xiaofang; Zhang, Xiuyue; Xie, Zhong; Liu, Nonglin; Fu, Wenyuan; Yuan, Yaohua; Chen, Daqing; Fu, Danghua; Zhao, Bo; Yin, Yuzhong; Yan, Xiahui; Wang, Xinjing; Zhang, Rongying; Liu, Jie; Li, Maoping; Tang, Yao; Hou, Rong; Zhang, Zhihe

    2011-10-01

    In order to investigate the mitochondrial genome of Panthera tigris amoyensis, two South China tigers (P25 and P27) were analyzed following 15 cymt-specific primer sets. The entire mtDNA sequence was found to be 16,957 bp and 17,001 bp long for P25 and P27 respectively, and this difference in length between P25 and P27 occurred in the number of tandem repeats in the RS-3 segment of the control region. The structural characteristics of complete P. t. amoyensis mitochondrial genomes were also highly similar to those of P. uncia. Additionally, the rate of point mutation was only 0.3% and a total of 59 variable sites between P25 and P27 were found. Out of the 59 variable sites, 6 were located in 6 different tRNA genes, 6 in the 2 rRNA genes, 7 in non-coding regions (one located between tRNA-Asn and tRNA-Tyr and six in the D-loop), and 40 in 10 protein-coding genes. COI held the largest amount of variable sites (9 sites) and Cytb contained the highest variable rate (0.7%) in the complete sequences. Moreover, out of the 40 variable sites located in 10 protein-coding genes, 12 sites were nonsynonymous.

  4. Regulatory genes and their roles for improvement of antibiotic biosynthesis in Streptomyces.

    PubMed

    Lu, Fengjuan; Hou, Yanyan; Zhang, Heming; Chu, Yiwen; Xia, Haiyang; Tian, Yongqiang

    2017-08-01

    The numerous secondary metabolites in Streptomyces spp. are crucial for various applications. For example, cephamycin C is used as an antibiotic, and avermectin is used as an insecticide. Specifically, antibiotic yield is closely related to many factors, such as the external environment, nutrition (including nitrogen and carbon sources), biosynthetic efficiency and the regulatory mechanisms in producing strains. There are various types of regulatory genes that work in different ways, such as pleiotropic (or global) regulatory genes, cluster-situated regulators, which are also called pathway-specific regulatory genes, and many other regulators. The study of regulatory genes that influence antibiotic biosynthesis in Streptomyces spp. not only provides a theoretical basis for antibiotic biosynthesis in Streptomyces but also helps to increase the yield of antibiotics via molecular manipulation of these regulatory genes. Currently, more and more emphasis is being placed on the regulatory genes of antibiotic biosynthetic gene clusters in Streptomyces spp., and many studies on these genes have been performed to improve the yield of antibiotics in Streptomyces. This paper lists many antibiotic biosynthesis regulatory genes in Streptomyces spp. and focuses on frequently investigated regulatory genes that are involved in pathway-specific regulation and pleiotropic regulation and their applications in genetic engineering.

  5. T-cell receptor variable genes and genetic susceptibility to celiac disease: an association and linkage study.

    PubMed

    Roschmann, E; Wienker, T F; Gerok, W; Volk, B A

    1993-12-01

    Genetic susceptibility of celiac disease is primarily associated with a particular combination of and HLA-DQA1/DQB1 gene; however, this does not fully account for the genetic predisposition. Therefore, the aim of this study was to examine whether T-cell receptor (TCR) genes may be susceptibility genes in celiac disease. HLA class II typing was performed by polymerase chain reaction amplification in combination with sequence-specific oligonucleotide hybridization. TCR alpha (TCRA), TCR gamma (TCRG), and TCR beta (TCRB) loci were investigated by restriction fragment length polymorphism analysis. Allelic frequencies of TCRA, TCRG, and TCRB variable genes were compared between patients with celiac disease (n = 53) and control patients (n = 67), and relative risk (RR) estimates were calculated. The RR was 1.67 for allele C1 at TCRA1, 3.35 for allele D2 at TCRA2, 1.66 for allele B2 at TCRG, and 1.35 for allele B at TCRB, showing no significant association. Additionally, linkage analysis was performed in 23 families. The logarithm of odd scores for celiac disease vs. the TCR variable genes at TCRA, TCRG, and TCRB showed no significant linkage. These data suggest that the analyzed TCR variable gene segments V alpha 1.2, V gamma 11, and V beta 8 do not play a major role in susceptibility to celiac disease.

  6. A large population-based association study between HLA and KIR genotypes and measles vaccine antibody responses.

    PubMed

    Ovsyannikova, Inna G; Schaid, Daniel J; Larrabee, Beth R; Haralambieva, Iana H; Kennedy, Richard B; Poland, Gregory A

    2017-01-01

    Human antibody response to measles vaccine is highly variable in the population. Host genes contribute to inter-individual antibody response variation. The killer cell immunoglobulin-like receptors (KIR) are recognized to interact with HLA molecules and possibly influence humoral immune response to viral antigens. To expand on and improve our previous work with HLA genes, and to explore the genetic contribution of KIR genes to the inter-individual variability in measles vaccine-induced antibody responses, we performed a large population-based study in 2,506 healthy immunized subjects (ages 11 to 41 years) to identify HLA and KIR associations with measles vaccine-induced neutralizing antibodies. After correcting for the large number of statistical tests of allele effects on measles-specific neutralizing antibody titers, no statistically significant associations were found for either HLA or KIR loci. However, suggestive associations worthy of follow-up in other cohorts include B*57:01, DQB1*06:02, and DRB1*15:05 alleles. Specifically, the B*57:01 allele (1,040 mIU/mL; p = 0.0002) was suggestive of an association with lower measles antibody titer. In contrast, the DQB1*06:02 (1,349 mIU/mL; p = 0.0004) and DRB1*15:05 (2,547 mIU/mL; p = 0.0004) alleles were suggestive of an association with higher measles antibodies. Notably, the associations with KIR genotypes were strongly nonsignificant, suggesting that KIR loci in terms of copy number and haplotypes are not likely to play a major role in antibody response to measles vaccination. These findings refine our knowledge of the role of HLA and KIR alleles in measles vaccine-induced immunity.

  7. Adaptation to climate through flowering phenology: a case study in Medicago truncatula.

    PubMed

    Burgarella, Concetta; Chantret, Nathalie; Gay, Laurène; Prosperi, Jean-Marie; Bonhomme, Maxime; Tiffin, Peter; Young, Nevin D; Ronfort, Joelle

    2016-07-01

    Local climatic conditions likely constitute an important selective pressure on genes underlying important fitness-related traits such as flowering time, and in many species, flowering phenology and climatic gradients strongly covary. To test whether climate shapes the genetic variation on flowering time genes and to identify candidate flowering genes involved in the adaptation to environmental heterogeneity, we used a large Medicago truncatula core collection to examine the association between nucleotide polymorphisms at 224 candidate genes and both climate variables and flowering phenotypes. Unlike genome-wide studies, candidate gene approaches are expected to enrich for the number of meaningful trait associations because they specifically target genes that are known to affect the trait of interest. We found that flowering time mediates adaptation to climatic conditions mainly by variation at genes located upstream in the flowering pathways, close to the environmental stimuli. Variables related to the annual precipitation regime reflected selective constraints on flowering time genes better than the other variables tested (temperature, altitude, latitude or longitude). By comparing phenotype and climate associations, we identified 12 flowering genes as the most promising candidates responsible for phenological adaptation to climate. Four of these genes were located in the known flowering time QTL region on chromosome 7. However, climate and flowering associations also highlighted largely distinct gene sets, suggesting different genetic architectures for adaptation to climate and flowering onset. © 2016 John Wiley & Sons Ltd.

  8. The RNA polymerase III-dependent family of genes in hemiascomycetes: comparative RNomics, decoding strategies, transcription and evolutionary implications

    PubMed Central

    Marck, Christian; Kachouri-Lafond, Rym; Lafontaine, Ingrid; Westhof, Eric; Dujon, Bernard; Grosjean, Henri

    2006-01-01

    We present the first comprehensive analysis of RNA polymerase III (Pol III) transcribed genes in ten yeast genomes. This set includes all tRNA genes (tDNA) and genes coding for SNR6 (U6), SNR52, SCR1 and RPR1 RNA in the nine hemiascomycetes Saccharomyces cerevisiae, Saccharomyces castellii, Candida glabrata, Kluyveromyces waltii, Kluyveromyces lactis, Eremothecium gossypii, Debaryomyces hansenii, Candida albicans, Yarrowia lipolytica and the archiascomycete Schizosaccharomyces pombe. We systematically analysed sequence specificities of tRNA genes, polymorphism, variability of introns, gene redundancy and gene clustering. Analysis of decoding strategies showed that yeasts close to S.cerevisiae use bacterial decoding rules to read the Leu CUN and Arg CGN codons, in contrast to all other known Eukaryotes. In D.hansenii and C.albicans, we identified a novel tDNA-Leu (AAG), reading the Leu CUU/CUC/CUA codons with an unusual G at position 32. A systematic ‘p-distance tree’ using the 60 variable positions of the tRNA molecule revealed that most tDNAs cluster into amino acid-specific sub-trees, suggesting that, within hemiascomycetes, orthologous tDNAs are more closely related than paralogs. We finally determined the bipartite A- and B-box sequences recognized by TFIIIC. These minimal sequences are nearly conserved throughout hemiascomycetes and were satisfactorily retrieved at appropriate locations in other Pol III genes. PMID:16600899

  9. Selection of specific protein binders for pre-defined targets from an optimized library of artificial helicoidal repeat proteins (alphaRep).

    PubMed

    Guellouz, Asma; Valerio-Lepiniec, Marie; Urvoas, Agathe; Chevrel, Anne; Graille, Marc; Fourati-Kammoun, Zaineb; Desmadril, Michel; van Tilbeurgh, Herman; Minard, Philippe

    2013-01-01

    We previously designed a new family of artificial proteins named αRep based on a subgroup of thermostable helicoidal HEAT-like repeats. We have now assembled a large optimized αRep library. In this library, the side chains at each variable position are not fully randomized but instead encoded by a distribution of codons based on the natural frequency of side chains of the natural repeats family. The library construction is based on a polymerization of micro-genes and therefore results in a distribution of proteins with a variable number of repeats. We improved the library construction process using a "filtration" procedure to retain only fully coding modules that were recombined to recreate sequence diversity. The final library named Lib2.1 contains 1.7×10(9) independent clones. Here, we used phage display to select, from the previously described library or from the new library, new specific αRep proteins binding to four different non-related predefined protein targets. Specific binders were selected in each case. The results show that binders with various sizes are selected including relatively long sequences, with up to 7 repeats. ITC-measured affinities vary with Kd values ranging from micromolar to nanomolar ranges. The formation of complexes is associated with a significant thermal stabilization of the bound target protein. The crystal structures of two complexes between αRep and their cognate targets were solved and show that the new interfaces are established by the variable surfaces of the repeated modules, as well by the variable N-cap residues. These results suggest that αRep library is a new and versatile source of tight and specific binding proteins with favorable biophysical properties.

  10. Selection of Specific Protein Binders for Pre-Defined Targets from an Optimized Library of Artificial Helicoidal Repeat Proteins (alphaRep)

    PubMed Central

    Chevrel, Anne; Graille, Marc; Fourati-Kammoun, Zaineb; Desmadril, Michel; van Tilbeurgh, Herman; Minard, Philippe

    2013-01-01

    We previously designed a new family of artificial proteins named αRep based on a subgroup of thermostable helicoidal HEAT-like repeats. We have now assembled a large optimized αRep library. In this library, the side chains at each variable position are not fully randomized but instead encoded by a distribution of codons based on the natural frequency of side chains of the natural repeats family. The library construction is based on a polymerization of micro-genes and therefore results in a distribution of proteins with a variable number of repeats. We improved the library construction process using a “filtration” procedure to retain only fully coding modules that were recombined to recreate sequence diversity. The final library named Lib2.1 contains 1.7×109 independent clones. Here, we used phage display to select, from the previously described library or from the new library, new specific αRep proteins binding to four different non-related predefined protein targets. Specific binders were selected in each case. The results show that binders with various sizes are selected including relatively long sequences, with up to 7 repeats. ITC-measured affinities vary with Kd values ranging from micromolar to nanomolar ranges. The formation of complexes is associated with a significant thermal stabilization of the bound target protein. The crystal structures of two complexes between αRep and their cognate targets were solved and show that the new interfaces are established by the variable surfaces of the repeated modules, as well by the variable N-cap residues. These results suggest that αRep library is a new and versatile source of tight and specific binding proteins with favorable biophysical properties. PMID:24014183

  11. A comprehensive analysis of Helicobacter pylori plasticity zones reveals that they are integrating conjugative elements with intermediate integration specificity.

    PubMed

    Fischer, Wolfgang; Breithaupt, Ute; Kern, Beate; Smith, Stella I; Spicher, Carolin; Haas, Rainer

    2014-04-27

    The human gastric pathogen Helicobacter pylori is a paradigm for chronic bacterial infections. Its persistence in the stomach mucosa is facilitated by several mechanisms of immune evasion and immune modulation, but also by an unusual genetic variability which might account for the capability to adapt to changing environmental conditions during long-term colonization. This variability is reflected by the fact that almost each infected individual is colonized by a genetically unique strain. Strain-specific genes are dispersed throughout the genome, but clusters of genes organized as genomic islands may also collectively be present or absent. We have comparatively analysed such clusters, which are commonly termed plasticity zones, in a high number of H. pylori strains of varying geographical origin. We show that these regions contain fixed gene sets, rather than being true regions of genome plasticity, but two different types and several subtypes with partly diverging gene content can be distinguished. Their genetic diversity is incongruent with variations in the rest of the genome, suggesting that they are subject to horizontal gene transfer within H. pylori populations. We identified 40 distinct integration sites in 45 genome sequences, with a conserved heptanucleotide motif that seems to be the minimal requirement for integration. The significant number of possible integration sites, together with the requirement for a short conserved integration motif and the high level of gene conservation, indicates that these elements are best described as integrating conjugative elements (ICEs) with an intermediate integration site specificity.

  12. Phylogenetic utility of the nuclear genes AGAMOUS 1 and PHYTOCHROME B in palms (Arecaceae): an example within Bactridinae

    PubMed Central

    Ludeña, Bertha; Chabrillange, Nathalie; Aberlenc-Bertossi, Frédérique; Adam, Hélène; Tregear, James W.; Pintaud, Jean-Christophe

    2011-01-01

    Background and Aims Molecular phylogenetic studies of palms (Arecaceae) have not yet provided a fully resolved phylogeny of the family. There is a need to increase the current set of markers to resolve difficult groups such as the Neotropical subtribe Bactridinae (Arecoideae: Cocoseae). We propose the use of two single-copy nuclear genes as valuable tools for palm phylogenetics. Methods New primers were developed for the amplification of the AGAMOUS 1 (AG1) and PHYTOCHROME B (PHYB) genes. For the AGAMOUS gene, the paralogue 1 of Elaeis guineensis (EgAG1) was targeted. The region amplified contained coding sequences between the MIKC K and C MADS-box domains. For the PHYB gene, exon 1 (partial sequence) was first amplified in palm species using published degenerate primers for Poaceae, and then specific palm primers were designed. The two gene portions were sequenced in 22 species of palms representing all genera of Bactridinae, with emphasis on Astrocaryum and Hexopetion, the status of the latter genus still being debated. Key Results The new primers designed allow consistent amplification and high-quality sequencing within the palm family. The two loci studied produced more variability than chloroplast loci and equally or less variability than PRK, RPBII and ITS nuclear markers. The phylogenetic structure obtained with AG1 and PHYB genes provides new insights into intergeneric relationships within the Bactridinae and the intrageneric structure of Astrocaryum. The Hexopetion clade was recovered as monophyletic with both markers and was weakly supported as sister to Astrocaryum sensu stricto in the combined analysis. The rare Astrocaryum minus formed a species complex with Astrocaryum gynacanthum. Moreover, both AG1 and PHYB contain a microsatellite that could have further uses in species delimitation and population genetics. Conclusions AG1 and PHYB provide additional phylogenetic information within the palm family, and should prove useful in combination with other genes to improve the resolution of palm phylogenies. PMID:21828068

  13. Site-Specific Fat-1 Knock-In Enables Significant Decrease of n-6PUFAs/n-3PUFAs Ratio in Pigs.

    PubMed

    Li, Mengjing; Ouyang, Hongsheng; Yuan, Hongming; Li, Jianing; Xie, Zicong; Wang, Kankan; Yu, Tingting; Liu, Minghao; Chen, Xue; Tang, Xiaochun; Jiao, Huping; Pang, Daxin

    2018-05-04

    The fat-1 gene from Caenorhabditis elegans encodes a fatty acid desaturase which was widely studied due to its beneficial function of converting n-6 polyunsaturated fatty acids (n-6PUFAs) to n-3 polyunsaturated fatty acids (n-3PUFAs). To date, many fat-1 transgenic animals have been generated to study disease pathogenesis or improve meat quality. However, all of them were generated using a random integration method with variable transgene expression levels and the introduction of selectable marker genes often raise biosafety concern. To this end, we aimed to generate marker-free fat-1 transgenic pigs in a site-specific manner. The Rosa26 locus, first found in mouse embryonic stem cells, has become one of the most common sites for inserting transgenes due to its safe and ubiquitous expression. In our study, the fat-1 gene was inserted into porcine Rosa 26 (pRosa26) locus via Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/CRISPR-associated 9 (Cas9) system. The Southern blot analysis of our knock-in pigs indicated a single copy of the fat-1 gene at the pRosa26 locus. Furthermore, this single-copy fat-1 gene supported satisfactory expression in a variety of tissues in F1 generation pigs. Importantly, the gas chromatography analysis indicated that these fat-1 knock-in pigs exhibited a significant increase in the level of n-3PUFAs, leading to an obvious decrease in the n-6PUFAs/n-3PUFAs ratio from 9.36 to 2.12 (*** P < 0.0001). Altogether, our fat-1 knock-in pigs hold great promise for improving the nutritional value of pork and serving as an animal model to investigate therapeutic effects of n-3PUFAs on various diseases. Copyright © 2018 Li et al.

  14. Improved accuracy of supervised CRM discovery with interpolated Markov models and cross-species comparison

    PubMed Central

    Kazemian, Majid; Zhu, Qiyun; Halfon, Marc S.; Sinha, Saurabh

    2011-01-01

    Despite recent advances in experimental approaches for identifying transcriptional cis-regulatory modules (CRMs, ‘enhancers’), direct empirical discovery of CRMs for all genes in all cell types and environmental conditions is likely to remain an elusive goal. Effective methods for computational CRM discovery are thus a critically needed complement to empirical approaches. However, existing computational methods that search for clusters of putative binding sites are ineffective if the relevant TFs and/or their binding specificities are unknown. Here, we provide a significantly improved method for ‘motif-blind’ CRM discovery that does not depend on knowledge or accurate prediction of TF-binding motifs and is effective when limited knowledge of functional CRMs is available to ‘supervise’ the search. We propose a new statistical method, based on ‘Interpolated Markov Models’, for motif-blind, genome-wide CRM discovery. It captures the statistical profile of variable length words in known CRMs of a regulatory network and finds candidate CRMs that match this profile. The method also uses orthologs of the known CRMs from closely related genomes. We perform in silico evaluation of predicted CRMs by assessing whether their neighboring genes are enriched for the expected expression patterns. This assessment uses a novel statistical test that extends the widely used Hypergeometric test of gene set enrichment to account for variability in intergenic lengths. We find that the new CRM prediction method is superior to existing methods. Finally, we experimentally validate 12 new CRM predictions by examining their regulatory activity in vivo in Drosophila; 10 of the tested CRMs were found to be functional, while 6 of the top 7 predictions showed the expected activity patterns. We make our program available as downloadable source code, and as a plugin for a genome browser installed on our servers. PMID:21821659

  15. Mechanisms of gap gene expression canalization in the Drosophila blastoderm.

    PubMed

    Gursky, Vitaly V; Panok, Lena; Myasnikova, Ekaterina M; Manu; Samsonova, Maria G; Reinitz, John; Samsonov, Alexander M

    2011-01-01

    Extensive variation in early gap gene expression in the Drosophila blastoderm is reduced over time because of gap gene cross regulation. This phenomenon is a manifestation of canalization, the ability of an organism to produce a consistent phenotype despite variations in genotype or environment. The canalization of gap gene expression can be understood as arising from the actions of attractors in the gap gene dynamical system. In order to better understand the processes of developmental robustness and canalization in the early Drosophila embryo, we investigated the dynamical effects of varying spatial profiles of Bicoid protein concentration on the formation of the expression border of the gap gene hunchback. At several positions on the anterior-posterior axis of the embryo, we analyzed attractors and their basins of attraction in a dynamical model describing expression of four gap genes with the Bicoid concentration profile accounted as a given input in the model equations. This model was tested against a family of Bicoid gradients obtained from individual embryos. These gradients were normalized by two independent methods, which are based on distinct biological hypotheses and provide different magnitudes for Bicoid spatial variability. We showed how the border formation is dictated by the biological initial conditions (the concentration gradient of maternal Hunchback protein) being attracted to specific attracting sets in a local vicinity of the border. Different types of these attracting sets (point attractors or one dimensional attracting manifolds) define several possible mechanisms of border formation. The hunchback border formation is associated with intersection of the spatial gradient of the maternal Hunchback protein and a boundary between the attraction basins of two different point attractors. We demonstrated how the positional variability for hunchback is related to the corresponding variability of the basin boundaries. The observed reduction in variability of the hunchback gene expression can be accounted for by specific geometrical properties of the basin boundaries. We clarified the mechanisms of gap gene expression canalization in early Drosophila embryos. These mechanisms were specified in the case of hunchback in well defined terms of the dynamical system theory.

  16. Increased variability of stimulus-driven cortical responses is associated with genetic variability in children with and without dyslexia.

    PubMed

    Centanni, T M; Pantazis, D; Truong, D T; Gruen, J R; Gabrieli, J D E; Hogan, T P

    2018-05-26

    Individuals with dyslexia exhibit increased brainstem variability in response to sound. It is unknown as to whether increased variability extends to neocortical regions associated with audition and reading, extends to visual stimuli, and whether increased variability characterizes all children with dyslexia or, instead, a specific subset of children. We evaluated the consistency of stimulus-evoked neural responses in children with (N = 20) or without dyslexia (N = 12) as measured by magnetoencephalography (MEG). Approximately half of the children with dyslexia had significantly higher levels of variability in cortical responses to both auditory and visual stimuli in multiple nodes of the reading network. There was a significant and positive relationship between the number of risk alleles at rs6935076 in the dyslexia-susceptibility gene KIAA0319 and the degree of neural variability in primary auditory cortex across all participants. This gene has been linked with neural variability in rodents and in typical readers. These findings indicate that unstable representations of auditory and visual stimuli in auditory and other reading-related neocortical regions are present in a subset of children with dyslexia and support the link between the gene KIAA0319 and the auditory neural variability across children with or without dyslexia. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.

  17. Characterization of global loss of imprinting in fetal overgrowth syndrome induced by assisted reproduction

    PubMed Central

    Chen, Zhiyuan; Hagen, Darren E.; Elsik, Christine G.; Ji, Tieming; Morris, Collin James; Moon, Laura Emily; Rivera, Rocío Melissa

    2015-01-01

    Embryos generated with the use of assisted reproductive technologies (ART) can develop overgrowth syndromes. In ruminants, the condition is referred to as large offspring syndrome (LOS) and exhibits variable phenotypic abnormalities including overgrowth, enlarged tongue, and abdominal wall defects. These characteristics recapitulate those observed in the human loss-of-imprinting (LOI) overgrowth syndrome Beckwith–Wiedemann (BWS). We have recently shown LOI at the KCNQ1 locus in LOS, the most common epimutation in BWS. Although the first case of ART-induced LOS was reported in 1995, studies have not yet determined the extent of LOI in this condition. Here, we determined allele-specific expression of imprinted genes previously identified in human and/or mouse in day ∼105 Bos taurus indicus × Bos taurus taurus F1 hybrid control and LOS fetuses using RNAseq. Our analysis allowed us to determine the monoallelic expression of 20 genes in tissues of control fetuses. LOS fetuses displayed variable LOI compared with controls. Biallelic expression of imprinted genes in LOS was associated with tissue-specific hypomethylation of the normally methylated parental allele. In addition, a positive correlation was observed between body weight and the number of biallelically expressed imprinted genes in LOS fetuses. Furthermore, not only was there loss of allele-specific expression of imprinted genes in LOS, but also differential transcript amounts of these genes between control and overgrown fetuses. In summary, we characterized previously unidentified imprinted genes in bovines and identified misregulation of imprinting at multiple loci in LOS. We concluded that LOS is a multilocus LOI syndrome, as is BWS. PMID:25825726

  18. Characterization of global loss of imprinting in fetal overgrowth syndrome induced by assisted reproduction.

    PubMed

    Chen, Zhiyuan; Hagen, Darren E; Elsik, Christine G; Ji, Tieming; Morris, Collin James; Moon, Laura Emily; Rivera, Rocío Melissa

    2015-04-14

    Embryos generated with the use of assisted reproductive technologies (ART) can develop overgrowth syndromes. In ruminants, the condition is referred to as large offspring syndrome (LOS) and exhibits variable phenotypic abnormalities including overgrowth, enlarged tongue, and abdominal wall defects. These characteristics recapitulate those observed in the human loss-of-imprinting (LOI) overgrowth syndrome Beckwith-Wiedemann (BWS). We have recently shown LOI at the KCNQ1 locus in LOS, the most common epimutation in BWS. Although the first case of ART-induced LOS was reported in 1995, studies have not yet determined the extent of LOI in this condition. Here, we determined allele-specific expression of imprinted genes previously identified in human and/or mouse in day ∼105 Bos taurus indicus × Bos taurus taurus F1 hybrid control and LOS fetuses using RNAseq. Our analysis allowed us to determine the monoallelic expression of 20 genes in tissues of control fetuses. LOS fetuses displayed variable LOI compared with controls. Biallelic expression of imprinted genes in LOS was associated with tissue-specific hypomethylation of the normally methylated parental allele. In addition, a positive correlation was observed between body weight and the number of biallelically expressed imprinted genes in LOS fetuses. Furthermore, not only was there loss of allele-specific expression of imprinted genes in LOS, but also differential transcript amounts of these genes between control and overgrown fetuses. In summary, we characterized previously unidentified imprinted genes in bovines and identified misregulation of imprinting at multiple loci in LOS. We concluded that LOS is a multilocus LOI syndrome, as is BWS.

  19. Drugs meeting the molecular basis of diabetic kidney disease: bridging from molecular mechanism to personalized medicine.

    PubMed

    Lambers Heerspink, Hiddo J; Oberbauer, Rainer; Perco, Paul; Heinzel, Andreas; Heinze, Georg; Mayer, Gert; Mayer, Bernd

    2015-08-01

    Diabetic kidney disease (DKD) is a complex, multifactorial disease and is associated with a high risk of renal and cardiovascular morbidity and mortality. Clinical practice guidelines for diabetes recommend essentially identical treatments for all patients without taking into account how the individual responds to the instituted therapy. Yet, individuals vary widely in how they respond to medications and therefore optimal therapy differs between individuals. Understanding the underlying molecular mechanisms of variability in drug response will help tailor optimal therapy. Polymorphisms in genes related to drug pharmacokinetics have been used to explore mechanisms of response variability in DKD, but with limited success. The complex interaction between genetic make-up and environmental factors on the abundance of proteins and metabolites renders pharmacogenomics alone insufficient to fully capture response variability. A complementary approach is to attribute drug response variability to individual variability in underlying molecular mechanisms involved in the progression of disease. The interplay of different processes (e.g. inflammation, fibrosis, angiogenesis, oxidative stress) appears to drive disease progression, but the individual contribution of each process varies. Drugs at the other hand address specific targets and thereby interfere in certain disease-associated processes. At this level, biomarkers may help to gain insight into which specific pathophysiological processes are involved in an individual followed by a rational assessment whether a specific drug's mode of action indeed targets the relevant process at hand. This article describes the conceptual background and data-driven workflow developed by the SysKid consortium aimed at improving characterization of the molecular mechanisms underlying DKD at the interference of the molecular impact of individual drugs in order to tailor optimal therapy to individual patients. © The Author 2015. Published by Oxford University Press on behalf of ERA-EDTA. All rights reserved.

  20. Effectiveness of the combined evaluation of KLK3 genetics and free-to-total prostate specific antigen ratio for prostate cancer diagnosis.

    PubMed

    Zambon, Carlo-Federico; Prayer-Galetti, Tommaso; Basso, Daniela; Padoan, Andrea; Rossi, Elisa; Secco, Silvia; Pelloso, Michela; Fogar, Paola; Navaglia, Filippo; Moz, Stefania; Zattoni, Filiberto; Plebani, Mario

    2012-10-01

    Of serum prostate specific antigen variability 40% depends on inherited factors. We ascertained whether the knowledge of KLK3 genetics would enhance prostate specific antigen diagnostic performance in patients with clinical suspicion of prostate cancer. We studied 1,058 men who consecutively underwent prostate biopsy for clinical suspicion of prostate cancer. At histology prostate cancer was present in 401 cases and absent in 657. Serum total prostate specific antigen and the free-to-total prostate specific antigen ratio were determined. Four polymorphisms of the KLK3 gene (rs2569733, rs2739448, rs925013 and rs2735839) and 1 polymorphism of the SRD5A2 gene (rs523349) were studied. The influence of genetics on prostate specific antigen variability was evaluated by multivariate linear regression analysis. The performance of total prostate specific antigen and the free-to-total prostate specific antigen ratio alone or combined with a genetically based patient classification were defined by ROC curve analyses. For prostate cancer diagnosis the free-to-total prostate specific antigen ratio index alone (cutoff 11%) was superior to total prostate specific antigen (cutoff 4 ng/ml) and to free-to-total prostate specific antigen ratio reflex testing (positive predictive value 61%, 43% and 54%, respectively). Prostate specific antigen correlated with KLK3 genetics (rs2735839 polymorphism p = 0.001, and rs2569733, rs2739448 and rs925013 haplotype combination p = 0.003). In patients with different KLK3 genetics 2 optimal free-to-total prostate specific antigen ratio cutoffs (11% and 14.5%) were found. For free-to-total prostate specific antigen ratio values between 11% and 14.5% the prostate cancer probability ranged from 30.0% to 47.4% according to patient genetics. The free-to-total prostate specific antigen ratio is superior to total prostate specific antigen for prostate cancer diagnosis, independent of total prostate specific antigen results. Free-to-total prostate specific antigen ratio findings below 11% are positively associated with prostate cancer and those above 14.5% are negatively associated with prostate cancer, while the interpretation of those between 11% and 14.5% is improved by patient KLK3 genetic analysis. Copyright © 2012 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.

  1. Variability of Creatine Metabolism Genes in Children with Autism Spectrum Disorder.

    PubMed

    Cameron, Jessie M; Levandovskiy, Valeriy; Roberts, Wendy; Anagnostou, Evdokia; Scherer, Stephen; Loh, Alvin; Schulze, Andreas

    2017-07-31

    Creatine deficiency syndrome (CDS) comprises three separate enzyme deficiencies with overlapping clinical presentations: arginine:glycine amidinotransferase ( GATM gene, glycine amidinotransferase), guanidinoacetate methyltransferase ( GAMT gene), and creatine transporter deficiency ( SLC6A8 gene, solute carrier family 6 member 8). CDS presents with developmental delays/regression, intellectual disability, speech and language impairment, autistic behaviour, epileptic seizures, treatment-refractory epilepsy, and extrapyramidal movement disorders; symptoms that are also evident in children with autism. The objective of the study was to test the hypothesis that genetic variability in creatine metabolism genes is associated with autism. We sequenced GATM , GAMT and SLC6A8 genes in 166 patients with autism (coding sequence, introns and adjacent untranslated regions). A total of 29, 16 and 25 variants were identified in each gene, respectively. Four variants were novel in GATM , and 5 in SLC6A8 (not present in the 1000 Genomes, Exome Sequencing Project (ESP) or Exome Aggregation Consortium (ExAC) databases). A single variant in each gene was identified as non-synonymous, and computationally predicted to be potentially damaging. Nine variants in GATM were shown to have a lower minor allele frequency (MAF) in the autism population than in the 1000 Genomes database, specifically in the East Asian population (Fisher's exact test). Two variants also had lower MAFs in the European population. In summary, there were no apparent associations of variants in GAMT and SLC6A8 genes with autism. The data implying there could be a lower association of some specific GATM gene variants with autism is an observation that would need to be corroborated in a larger group of autism patients, and with sub-populations of Asian ethnicities. Overall, our findings suggest that the genetic variability of creatine synthesis/transport is unlikely to play a part in the pathogenesis of autism spectrum disorder (ASD) in children.

  2. Association of sequence variations in vitamin K epoxide reductase and gamma-glutamyl carboxylas genes with biochemical measures of vitamin K status

    USDA-ARS?s Scientific Manuscript database

    Genetic factors, specifically the VKORC1 and GGCX genes, have been shown to contribute to the interindividual variability in response to the vitamin K-antagonist, warfarin, which influences the dose required to achieve the desired anticoagulation response. These differences in warfarin sensitivity ...

  3. An efficient method for variable region assembly in the construction of scFv phage display libraries using independent strand amplification

    PubMed Central

    Sotelo, Pablo H.; Collazo, Noberto; Zuñiga, Roberto; Gutiérrez-González, Matías; Catalán, Diego; Ribeiro, Carolina Hager; Aguillón, Juan Carlos; Molina, María Carmen

    2012-01-01

    Phage display library technology is a common method to produce human antibodies. In this technique, the immunoglobulin variable regions are displayed in a bacteriophage in a way that each filamentous virus displays the product of a single antibody gene on its surface. From the collection of different phages, it is possible to isolate the virus that recognizes specific targets. The most common form in which to display antibody variable regions in the phage is the single chain variable fragment format (scFv), which requires assembly of the heavy and light immunoglobulin variable regions in a single gene. In this work, we describe a simple and efficient method for the assembly of immunoglobulin heavy and light chain variable regions in a scFv format. This procedure involves a two-step reaction: (1) DNA amplification to produce the single strand form of the heavy or light chain gene required for the fusion; and (2) mixture of both single strand products followed by an assembly reaction to construct a complete scFv gene. Using this method, we produced 6-fold more scFv encoding DNA than the commonly used splicing by overlap extension PCR (SOE-PCR) approach. The scFv gene produced by this method also proved to be efficient in generating a diverse scFv phage display library. From this scFv library, we obtained phages that bound several non-related antigens, including recombinant proteins and rotavirus particles. PMID:22692130

  4. A Genome-Wide Landscape of Retrocopies in Primate Genomes.

    PubMed

    Navarro, Fábio C P; Galante, Pedro A F

    2015-07-29

    Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old Word Monkeys, including humans), but a surprising large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and no expression correlation to their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  5. Diurnal and developmental differences in gene expression between adult dispersing and flightless morphs of the wing polymorphic cricket, Gryllus firmus: Implications for life-history evolution.

    PubMed

    Zera, Anthony J; Vellichirammal, Neetha Nanoth; Brisson, Jennifer A

    2018-04-12

    The functional basis of life history adaptation is a key topic of research in life history evolution. Studies of wing-polymorphism in the cricket Gryllus firmus have played a prominent role in this field. However, prior in-depth investigations of morph specialization have primarily focused on a single hormone, juvenile hormone, and a single aspect of intermediary metabolism, the fatty-acid biosynthetic component of lipid metabolism. Moreover, the role of diurnal variation in life history adaptation in G. firmus has been understudied, as is the case for organisms in general. Here, we identify genes whose expression differs consistently between the morphs independent of time-of-day during early adulthood, as well as genes that exhibit a strong pattern of morph-specific diurnal expression. We find strong, consistent, morph-specific differences in the expression of genes involved in endocrine regulation, carbohydrate and lipid metabolism, and immunity - in particular, in the expression of an insulin-like-peptide precursor gene and genes involved in triglyceride production. We also find that the flight-capable morph exhibited a substantially greater number of genes exhibiting diurnal change in gene expression compared with the flightless morph, correlated with the greater circadian change in the hemolymph juvenile titer in the dispersing morph. In fact, diurnal differences in expression within the dispersing morph at different times of the day were significantly greater in magnitude than differences between dispersing and flightless morphs at the same time-of-day. These results provide important baseline information regarding the potential role of variable gene expression on life history specialization in morphs of G. firmus, and the first information on genetically-variable, diurnal change in gene expression, associated with a key life history polymorphism. These results also suggest the existence of prominent morph-specific circadian differences in gene expression in G. firmus, possibly caused by the morph-specific circadian rhythm in the juvenile hormone titer. Copyright © 2018 Elsevier Ltd. All rights reserved.

  6. Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data

    PubMed Central

    2013-01-01

    Background Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. The accurate predictions of tissue-specific gene targets could provide useful information for biomarker development and drug target identification. Results In this study, we have developed a machine learning approach for predicting the human tissue-specific genes using microarray expression data. The lists of known tissue-specific genes for different tissues were collected from UniProt database, and the expression data retrieved from the previously compiled dataset according to the lists were used for input vector encoding. Random Forests (RFs) and Support Vector Machines (SVMs) were used to construct accurate classifiers. The RF classifiers were found to outperform SVM models for tissue-specific gene prediction. The results suggest that the candidate genes for brain or liver specific expression can provide valuable information for further experimental studies. Our approach was also applied for identifying tissue-selective gene targets for different types of tissues. Conclusions A machine learning approach has been developed for accurately identifying the candidate genes for tissue specific/selective expression. The approach provides an efficient way to select some interesting genes for developing new biomedical markers and improve our knowledge of tissue-specific expression. PMID:23369200

  7. Comparison of MeHg-induced toxicogenomic responses across in vivo and in vitro models used in developmental toxicology.

    PubMed

    Robinson, Joshua F; Theunissen, Peter T; van Dartel, Dorien A M; Pennings, Jeroen L; Faustman, Elaine M; Piersma, Aldert H

    2011-09-01

    Toxicogenomic evaluations may improve toxicity prediction of in vitro-based developmental models, such as whole embryo culture (WEC) and embryonic stem cells (ESC), by providing a robust mechanistic marker which can be linked with responses associated with developmental toxicity in vivo. While promising in theory, toxicogenomic comparisons between in vivo and in vitro models are complex due to inherent differences in model characteristics and experimental design. Determining factors which influence these global comparisons are critical in the identification of reliable mechanistic-based markers of developmental toxicity. In this study, we compared available toxicogenomic data assessing the impact of the known teratogen, methylmercury (MeHg) across a diverse set of in vitro and in vivo models to investigate the impact of experimental variables (i.e. model, dose, time) on our comparative assessments. We evaluated common and unique aspects at both the functional (Gene Ontology) and gene level of MeHg-induced response. At the functional level, we observed stronger similarity in MeHg-response between mouse embryos exposed in utero (2 studies), ESC, and WEC as compared to liver, brain and mouse embryonic fibroblast MeHg studies. These findings were strongly correlated to the presence of a MeHg-induced developmentally related gene signature. In addition, we identified specific MeHg-induced gene expression alterations associated with developmental signaling and heart development across WEC, ESC and in vivo systems. However, the significance of overlap between studies was highly dependent on traditional experimental variables (i.e. dose, time). In summary, we identify promising examples of unique gene expression responses which show in vitro-in vivo similarities supporting the relevance of in vitro developmental models for predicting in vivo developmental toxicity. Copyright © 2011 Elsevier Inc. All rights reserved.

  8. Improved Escherichia coli Bactofection and Cytotoxicity by Heterologous Expression of Bacteriophage ΦX174 Lysis Gene E.

    PubMed

    Chung, Tai-Chun; Jones, Charles H; Gollakota, Akhila; Kamal Ahmadi, Mahmoud; Rane, Snehal; Zhang, Guojian; Pfeifer, Blaine A

    2015-05-04

    Bactofection offers a gene delivery option particularly useful in the context of immune modulation. The bacterial host naturally attracts recognition and cellular uptake by antigen presenting cells (APCs) as the initial step in triggering an immune response. Moreover, depending on the bacterial vector, molecular biology tools are available to influence and/or overcome additional steps and barriers to effective antigen presentation. In this work, molecular engineering was applied using Escherichia coli as a bactofection vector. In particular, the bacteriophage ΦX174 lysis E (LyE) gene was designed for variable expression across strains containing different levels of lysteriolysin O (LLO). The objective was to generate a bacterial vector with improved attenuation and delivery characteristics. The resulting strains exhibited enhanced gene and protein release and inducible cellular death. In addition, the new vectors demonstrated improved gene delivery and cytotoxicity profiles to RAW264.7 macrophage APCs.

  9. An Independent Filter for Gene Set Testing Based on Spectral Enrichment.

    PubMed

    Frost, H Robert; Li, Zhigang; Asselbergs, Folkert W; Moore, Jason H

    2015-01-01

    Gene set testing has become an indispensable tool for the analysis of high-dimensional genomic data. An important motivation for testing gene sets, rather than individual genomic variables, is to improve statistical power by reducing the number of tested hypotheses. Given the dramatic growth in common gene set collections, however, testing is often performed with nearly as many gene sets as underlying genomic variables. To address the challenge to statistical power posed by large gene set collections, we have developed spectral gene set filtering (SGSF), a novel technique for independent filtering of gene set collections prior to gene set testing. The SGSF method uses as a filter statistic the p-value measuring the statistical significance of the association between each gene set and the sample principal components (PCs), taking into account the significance of the associated eigenvalues. Because this filter statistic is independent of standard gene set test statistics under the null hypothesis but dependent under the alternative, the proportion of enriched gene sets is increased without impacting the type I error rate. As shown using simulated and real gene expression data, the SGSF algorithm accurately filters gene sets unrelated to the experimental outcome resulting in significantly increased gene set testing power.

  10. The extraction of simple relationships in growth factor-specific multiple-input and multiple-output systems in cell-fate decisions by backward elimination PLS regression.

    PubMed

    Akimoto, Yuki; Yugi, Katsuyuki; Uda, Shinsuke; Kudo, Takamasa; Komori, Yasunori; Kubota, Hiroyuki; Kuroda, Shinya

    2013-01-01

    Cells use common signaling molecules for the selective control of downstream gene expression and cell-fate decisions. The relationship between signaling molecules and downstream gene expression and cellular phenotypes is a multiple-input and multiple-output (MIMO) system and is difficult to understand due to its complexity. For example, it has been reported that, in PC12 cells, different types of growth factors activate MAP kinases (MAPKs) including ERK, JNK, and p38, and CREB, for selective protein expression of immediate early genes (IEGs) such as c-FOS, c-JUN, EGR1, JUNB, and FOSB, leading to cell differentiation, proliferation and cell death; however, how multiple-inputs such as MAPKs and CREB regulate multiple-outputs such as expression of the IEGs and cellular phenotypes remains unclear. To address this issue, we employed a statistical method called partial least squares (PLS) regression, which involves a reduction of the dimensionality of the inputs and outputs into latent variables and a linear regression between these latent variables. We measured 1,200 data points for MAPKs and CREB as the inputs and 1,900 data points for IEGs and cellular phenotypes as the outputs, and we constructed the PLS model from these data. The PLS model highlighted the complexity of the MIMO system and growth factor-specific input-output relationships of cell-fate decisions in PC12 cells. Furthermore, to reduce the complexity, we applied a backward elimination method to the PLS regression, in which 60 input variables were reduced to 5 variables, including the phosphorylation of ERK at 10 min, CREB at 5 min and 60 min, AKT at 5 min and JNK at 30 min. The simple PLS model with only 5 input variables demonstrated a predictive ability comparable to that of the full PLS model. The 5 input variables effectively extracted the growth factor-specific simple relationships within the MIMO system in cell-fate decisions in PC12 cells.

  11. Salt Stress Induced Variation in DNA Methylation Pattern and Its Influence on Gene Expression in Contrasting Rice Genotypes

    PubMed Central

    Karan, Ratna; DeLeon, Teresa; Biradar, Hanamareddy; Subudhi, Prasanta K.

    2012-01-01

    Background Salinity is a major environmental factor limiting productivity of crop plants including rice in which wide range of natural variability exists. Although recent evidences implicate epigenetic mechanisms for modulating the gene expression in plants under environmental stresses, epigenetic changes and their functional consequences under salinity stress in rice are underexplored. DNA methylation is one of the epigenetic mechanisms regulating gene expression in plant’s responses to environmental stresses. Better understanding of epigenetic regulation of plant growth and response to environmental stresses may create novel heritable variation for crop improvement. Methodology/Principal Findings Methylation sensitive amplification polymorphism (MSAP) technique was used to assess the effect of salt stress on extent and patterns of DNA methylation in four genotypes of rice differing in the degree of salinity tolerance. Overall, the amount of DNA methylation was more in shoot compared to root and the contribution of fully methylated loci was always more than hemi-methylated loci. Sequencing of ten randomly selected MSAP fragments indicated gene-body specific DNA methylation of retrotransposons, stress responsive genes, and chromatin modification genes, distributed on different rice chromosomes. Bisulphite sequencing and quantitative RT-PCR analysis of selected MSAP loci showed that cytosine methylation changes under salinity as well as gene expression varied with genotypes and tissue types irrespective of the level of salinity tolerance of rice genotypes. Conclusions/Significance The gene body methylation may have an important role in regulating gene expression in organ and genotype specific manner under salinity stress. Association between salt tolerance and methylation changes observed in some cases suggested that many methylation changes are not “directed”. The natural genetic variation for salt tolerance observed in rice germplasm may be independent of the extent and pattern of DNA methylation which may have been induced by abiotic stress followed by accumulation through the natural selection process. PMID:22761959

  12. Global gene expression profiling of oral cavity cancers suggests molecular heterogeneity within anatomic subsites

    PubMed Central

    Severino, Patricia; Alvares, Adriana M; Michaluart, Pedro; Okamoto, Oswaldo K; Nunes, Fabio D; Moreira-Filho, Carlos A; Tajara, Eloiza H

    2008-01-01

    Background Oral squamous cell carcinoma (OSCC) is a frequent neoplasm, which is usually aggressive and has unpredictable biological behavior and unfavorable prognosis. The comprehension of the molecular basis of this variability should lead to the development of targeted therapies as well as to improvements in specificity and sensitivity of diagnosis. Results Samples of primary OSCCs and their corresponding surgical margins were obtained from male patients during surgery and their gene expression profiles were screened using whole-genome microarray technology. Hierarchical clustering and Principal Components Analysis were used for data visualization and One-way Analysis of Variance was used to identify differentially expressed genes. Samples clustered mostly according to disease subsite, suggesting molecular heterogeneity within tumor stages. In order to corroborate our results, two publicly available datasets of microarray experiments were assessed. We found significant molecular differences between OSCC anatomic subsites concerning groups of genes presently or potentially important for drug development, including mRNA processing, cytoskeleton organization and biogenesis, metabolic process, cell cycle and apoptosis. Conclusion Our results corroborate literature data on molecular heterogeneity of OSCCs. Differences between disease subsites and among samples belonging to the same TNM class highlight the importance of gene expression-based classification and challenge the development of targeted therapies. PMID:19014556

  13. Multiple-input multiple-output causal strategies for gene selection.

    PubMed

    Bontempi, Gianluca; Haibe-Kains, Benjamin; Desmedt, Christine; Sotiriou, Christos; Quackenbush, John

    2011-11-25

    Traditional strategies for selecting variables in high dimensional classification problems aim to find sets of maximally relevant variables able to explain the target variations. If these techniques may be effective in generalization accuracy they often do not reveal direct causes. The latter is essentially related to the fact that high correlation (or relevance) does not imply causation. In this study, we show how to efficiently incorporate causal information into gene selection by moving from a single-input single-output to a multiple-input multiple-output setting. We show in synthetic case study that a better prioritization of causal variables can be obtained by considering a relevance score which incorporates a causal term. In addition we show, in a meta-analysis study of six publicly available breast cancer microarray datasets, that the improvement occurs also in terms of accuracy. The biological interpretation of the results confirms the potential of a causal approach to gene selection. Integrating causal information into gene selection algorithms is effective both in terms of prediction accuracy and biological interpretation.

  14. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: Implications for the microbial “pan-genome”

    PubMed Central

    Tettelin, Hervé; Masignani, Vega; Cieslewicz, Michael J.; Donati, Claudio; Medini, Duccio; Ward, Naomi L.; Angiuoli, Samuel V.; Crabtree, Jonathan; Jones, Amanda L.; Durkin, A. Scott; DeBoy, Robert T.; Davidsen, Tanja M.; Mora, Marirosa; Scarselli, Maria; Margarit y Ros, Immaculada; Peterson, Jeremy D.; Hauser, Christopher R.; Sundaram, Jaideep P.; Nelson, William C.; Madupu, Ramana; Brinkac, Lauren M.; Dodson, Robert J.; Rosovitz, Mary J.; Sullivan, Steven A.; Daugherty, Sean C.; Haft, Daniel H.; Selengut, Jeremy; Gwinn, Michelle L.; Zhou, Liwei; Zafar, Nikhat; Khouri, Hoda; Radune, Diana; Dimitrov, George; Watkins, Kisha; O'Connor, Kevin J. B.; Smith, Shannon; Utterback, Teresa R.; White, Owen; Rubens, Craig E.; Grandi, Guido; Madoff, Lawrence C.; Kasper, Dennis L.; Telford, John L.; Wessels, Michael R.; Rappuoli, Rino; Fraser, Claire M.

    2005-01-01

    The development of efficient and inexpensive genome sequencing methods has revolutionized the study of human bacterial pathogens and improved vaccine design. Unfortunately, the sequence of a single genome does not reflect how genetic variability drives pathogenesis within a bacterial species and also limits genome-wide screens for vaccine candidates or for antimicrobial targets. We have generated the genomic sequence of six strains representing the five major disease-causing serotypes of Streptococcus agalactiae, the main cause of neonatal infection in humans. Analysis of these genomes and those available in databases showed that the S. agalactiae species can be described by a pan-genome consisting of a core genome shared by all isolates, accounting for ≈80% of any single genome, plus a dispensable genome consisting of partially shared and strain-specific genes. Mathematical extrapolation of the data suggests that the gene reservoir available for inclusion in the S. agalactiae pan-genome is vast and that unique genes will continue to be identified even after sequencing hundreds of genomes. PMID:16172379

  15. Oligodeoxynucleotides Can Transiently Up- and Downregulate CHS Gene Expression in Flax by Changing DNA Methylation in a Sequence-Specific Manner

    PubMed Central

    Dzialo, Magdalena; Szopa, Jan; Czuj, Tadeusz; Zuk, Magdalena

    2017-01-01

    Chalcone synthase (CHS) has been recognized as an essential enzyme in the phenylpropanoid biosynthesis pathway. Apart from the leading role in the production of phenolic compounds with many valuable biological activities beneficial to biomedicine, CHS is well appreciated in science. Genetic engineering greatly facilitates expanding knowledge on the function and genetics of CHS in plants. The CHS gene is one of the most intensively studied genes in flax. In our study, we investigated engineering of the CHS gene through genetic and epigenetic approaches. Considering the numerous restrictions concerning the application of genetically modified (GM) crops, the main purpose of this research was optimization of the plant's modulation via epigenetics. In our study, plants modified through two methods were compared: a widely popular agrotransformation and a relatively recent oligodeoxynucleotide (ODN) strategy. It was recently highlighted that the ODN technique can be a rapid and time-serving antecedent in quick analysis of gene function before taking vector-mediated transformation. In order to understand the molecular background of epigenetic variation in more detail and evaluate the use of ODNs as a tool for predictable and stable gene engineering, we concentrated on the integration of gene expression and gene-body methylation. The treatment of flax with a series of short oligonucleotides homologous to a different part of CHS gene isoforms revealed that those directed to regulatory gene regions (5′- and 3′-UTR) activated gene expression, directed to non-coding region (introns) caused gen activity reduction, while those homologous to a coding region may have a variable influence on its activity. Gene expression changes were accompanied by changes in its methylation status. However, only certain (CCGG) motifs along the gene sequence were affected. The analyzed DNA motifs of the CHS flax gene are more accessible for methylation when located within a CpG island. The methylation motifs also led to rearrangement of the nucleosome location. The obtained results suggest high specificity of ODN action and establish a potential valuable alternative for improvement of crops. PMID:28555142

  16. Oligodeoxynucleotides Can Transiently Up- and Downregulate CHS Gene Expression in Flax by Changing DNA Methylation in a Sequence-Specific Manner.

    PubMed

    Dzialo, Magdalena; Szopa, Jan; Czuj, Tadeusz; Zuk, Magdalena

    2017-01-01

    Chalcone synthase (CHS) has been recognized as an essential enzyme in the phenylpropanoid biosynthesis pathway. Apart from the leading role in the production of phenolic compounds with many valuable biological activities beneficial to biomedicine, CHS is well appreciated in science. Genetic engineering greatly facilitates expanding knowledge on the function and genetics of CHS in plants. The CHS gene is one of the most intensively studied genes in flax. In our study, we investigated engineering of the CHS gene through genetic and epigenetic approaches. Considering the numerous restrictions concerning the application of genetically modified (GM) crops, the main purpose of this research was optimization of the plant's modulation via epigenetics. In our study, plants modified through two methods were compared: a widely popular agrotransformation and a relatively recent oligodeoxynucleotide (ODN) strategy. It was recently highlighted that the ODN technique can be a rapid and time-serving antecedent in quick analysis of gene function before taking vector-mediated transformation. In order to understand the molecular background of epigenetic variation in more detail and evaluate the use of ODNs as a tool for predictable and stable gene engineering, we concentrated on the integration of gene expression and gene-body methylation. The treatment of flax with a series of short oligonucleotides homologous to a different part of CHS gene isoforms revealed that those directed to regulatory gene regions (5'- and 3'-UTR) activated gene expression, directed to non-coding region (introns) caused gen activity reduction, while those homologous to a coding region may have a variable influence on its activity. Gene expression changes were accompanied by changes in its methylation status. However, only certain (CCGG) motifs along the gene sequence were affected. The analyzed DNA motifs of the CHS flax gene are more accessible for methylation when located within a CpG island. The methylation motifs also led to rearrangement of the nucleosome location. The obtained results suggest high specificity of ODN action and establish a potential valuable alternative for improvement of crops.

  17. Strong motion deficits in dyslexia associated with DCDC2 gene alteration.

    PubMed

    Cicchini, Guido Marco; Marino, Cecilia; Mascheretti, Sara; Perani, Daniela; Morrone, Maria Concetta

    2015-05-27

    Dyslexia is a specific impairment in reading that affects 1 in 10 people. Previous studies have failed to isolate a single cause of the disorder, but several candidate genes have been reported. We measured motion perception in two groups of dyslexics, with and without a deletion within the DCDC2 gene, a risk gene for dyslexia. We found impairment for motion particularly strong at high spatial frequencies in the population carrying the deletion. The data suggest that deficits in motion processing occur in a specific genotype, rather than the entire dyslexia population, contributing to the large variability in impairment of motion thresholds in dyslexia reported in the literature. Copyright © 2015 the authors 0270-6474/15/358059-06$15.00/0.

  18. Genetic variability in isolates of Chromobacterium violaceum from pulmonary secretion, water, and soil.

    PubMed

    Santini, A C; Magalhães, J T; Cascardo, J C M; Corrêa, R X

    2016-04-28

    Chromobacterium violaceum is a free-living Gram-negative bacillus usually found in the water and soil in tropical regions, which causes infections in humans. Chromobacteriosis is characterized by rapid dissemination and high mortality. The aim of this study was to detect the genetic variability among C. violaceum type strain ATCC 12472, and seven isolates from the environment and one from a pulmonary secretion from a chromobacteriosis patient from Ilhéus, Bahia. The molecular characterization of all samples was performed by polymerase chain reaction (PCR) sequencing and 16S rDNA analysis. Primers specific for two ATCC 12472 pathogenicity genes, hilA and yscD, as well as random amplified polymorphic DNA (RAPD), were used for PCR amplification and comparative sequencing of the products. For a more specific approach, the PCR products of 16S rDNA were digested with restriction enzymes. Seven of the samples, including type-strain ATCC 12472, were amplified by the hilA primers; these were subsequently sequenced. Gene yscD was amplified only in type-strain ATCC 12472. MspI and AluI digestion revealed 16S rDNA polymorphisms. This data allowed the generation of a dendogram for each analysis. The isolates of C. violaceum have variability in random genomic regions demonstrated by RAPD. Also, these isolates have variability in pathogenicity genes, as demonstrated by sequencing and restriction enzyme digestion.

  19. Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic

    PubMed Central

    Yebra, Gonzalo; Hodcroft, Emma B.; Ragonnet-Cronin, Manon L.; Pillay, Deenan; Brown, Andrew J. Leigh; Fraser, Christophe; Kellam, Paul; de Oliveira, Tulio; Dennis, Ann; Hoppe, Anne; Kityo, Cissy; Frampton, Dan; Ssemwanga, Deogratius; Tanser, Frank; Keshani, Jagoda; Lingappa, Jairam; Herbeck, Joshua; Wawer, Maria; Essex, Max; Cohen, Myron S.; Paton, Nicholas; Ratmann, Oliver; Kaleebu, Pontiano; Hayes, Richard; Fidler, Sarah; Quinn, Thomas; Novitsky, Vladimir; Haywards, Andrew; Nastouli, Eleni; Morris, Steven; Clark, Duncan; Kozlakidis, Zisis

    2016-01-01

    HIV molecular epidemiology studies analyse viral pol gene sequences due to their availability, but whole genome sequencing allows to use other genes. We aimed to determine what gene(s) provide(s) the best approximation to the real phylogeny by analysing a simulated epidemic (created as part of the PANGEA_HIV project) with a known transmission tree. We sub-sampled a simulated dataset of 4662 sequences into different combinations of genes (gag-pol-env, gag-pol, gag, pol, env and partial pol) and sampling depths (100%, 60%, 20% and 5%), generating 100 replicates for each case. We built maximum-likelihood trees for each combination using RAxML (GTR + Γ), and compared their topologies to the corresponding true tree’s using CompareTree. The accuracy of the trees was significantly proportional to the length of the sequences used, with the gag-pol-env datasets showing the best performance and gag and partial pol sequences showing the worst. The lowest sampling depths (20% and 5%) greatly reduced the accuracy of tree reconstruction and showed high variability among replicates, especially when using the shortest gene datasets. In conclusion, using longer sequences derived from nearly whole genomes will improve the reliability of phylogenetic reconstruction. With low sample coverage, results can be highly variable, particularly when based on short sequences. PMID:28008945

  20. Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic.

    PubMed

    Yebra, Gonzalo; Hodcroft, Emma B; Ragonnet-Cronin, Manon L; Pillay, Deenan; Brown, Andrew J Leigh

    2016-12-23

    HIV molecular epidemiology studies analyse viral pol gene sequences due to their availability, but whole genome sequencing allows to use other genes. We aimed to determine what gene(s) provide(s) the best approximation to the real phylogeny by analysing a simulated epidemic (created as part of the PANGEA_HIV project) with a known transmission tree. We sub-sampled a simulated dataset of 4662 sequences into different combinations of genes (gag-pol-env, gag-pol, gag, pol, env and partial pol) and sampling depths (100%, 60%, 20% and 5%), generating 100 replicates for each case. We built maximum-likelihood trees for each combination using RAxML (GTR + Γ), and compared their topologies to the corresponding true tree's using CompareTree. The accuracy of the trees was significantly proportional to the length of the sequences used, with the gag-pol-env datasets showing the best performance and gag and partial pol sequences showing the worst. The lowest sampling depths (20% and 5%) greatly reduced the accuracy of tree reconstruction and showed high variability among replicates, especially when using the shortest gene datasets. In conclusion, using longer sequences derived from nearly whole genomes will improve the reliability of phylogenetic reconstruction. With low sample coverage, results can be highly variable, particularly when based on short sequences.

  1. Oxytocin receptor gene polymorphism modulates the effects of social support on heart rate variability

    PubMed Central

    Kanthak, Magdalena K.; Chen, Frances S.; Kumsta, Robert; Hill, LaBarron K.; Thayer, Julian F.; Heinrichs, Markus

    2017-01-01

    A large body of empirical research has demonstrated stress-buffering effects of social support. However, recent studies suggest that genetic variation of the oxytocin system (specifically, a common single nucleotide polymorphism, rs53576, of the oxytocin receptor gene) modulates the efficacy of social support. The timing and neurobiological basis of this genetic modulation were investigated using a standardized, laboratory-based psychological stress procedure (Trier Social Stress Test for Groups, TSST-G). To index potential stress buffering effects of social support mediated by the oxytocin system, heart rate variability (HRV) was obtained before and during the TSST-G from 40 healthy participants. Results indicate that social support is associated with higher HRV only in G allele carriers. Specifically, social support increased heart rate variability during direct social interaction and only in individuals with at least one copy of the G allele of rs53576. These findings support the idea that the stress-attenuating effects of social support are modulated by genetic variation of the oxytocin system. PMID:26903384

  2. Life-cycle modification in open oceans accounts for genome variability in a cosmopolitan phytoplankton.

    PubMed

    von Dassow, Peter; John, Uwe; Ogata, Hiroyuki; Probert, Ian; Bendif, El Mahdi; Kegel, Jessica U; Audic, Stéphane; Wincker, Patrick; Da Silva, Corinne; Claverie, Jean-Michel; Doney, Scott; Glover, David M; Flores, Daniella Mella; Herrera, Yeritza; Lescot, Magali; Garet-Delmas, Marie-José; de Vargas, Colomban

    2015-06-01

    Emiliania huxleyi is the most abundant calcifying plankton in modern oceans with substantial intraspecific genome variability and a biphasic life cycle involving sexual alternation between calcified 2N and flagellated 1N cells. We show that high genome content variability in Emiliania relates to erosion of 1N-specific genes and loss of the ability to form flagellated cells. Analysis of 185 E. huxleyi strains isolated from world oceans suggests that loss of flagella occurred independently in lineages inhabiting oligotrophic open oceans over short evolutionary timescales. This environmentally linked physiogenomic change suggests life cycling is not advantageous in very large/diluted populations experiencing low biotic pressure and low ecological variability. Gene loss did not appear to reflect pressure for genome streamlining in oligotrophic oceans as previously observed in picoplankton. Life-cycle modifications might be common in plankton and cause major functional variability to be hidden from traditional taxonomic or molecular markers.

  3. Exogenous glutathione improves high root-zone temperature tolerance by modulating photosynthesis, antioxidant and osmolytes systems in cucumber seedlings

    PubMed Central

    Ding, Xiaotao; Jiang, Yuping; He, Lizhong; Zhou, Qiang; Yu, Jizhu; Hui, Dafeng; Huang, Danfeng

    2016-01-01

    To investigate the physiological responses of plants to high root-zone temperature (HT, 35 °C) stress mitigated by exogenous glutathione (GSH), cucumber (Cucumis sativus L.) seedlings were exposed to HT with or without GSH treatment for 4 days and following with 4 days of recovery. Plant physiological variables, growth, and gene expression related to antioxidant enzymes and Calvin cycle were quantified. The results showed that HT significantly decreased GSH content, the ratio of reduced to oxidized glutathione (GSH/GSSG), chlorophyll content, photosynthesis and related gene expression, shoot height, stem diameter, as well as dry weight. The exogenous GSH treatment clearly lessened the HT stress by increasing the above variables. Meanwhile, HT significantly increased soluble protein content, proline and malondialdehyde (MDA) content as well as O2•− production rate, the gene expression and activities of antioxidant enzymes. The GSH treatment remarkably improved soluble protein content, proline content, antioxidant enzymes activities, and antioxidant enzymes related gene expression, and reduced the MDA content and O2•− production rate compared to no GSH treatment in the HT condition. Our results suggest that exogenous GSH enhances cucumber seedling tolerance of HT stress by modulating the photosynthesis, antioxidant and osmolytes systems to improve physiological adaptation. PMID:27752105

  4. Evolution of gremlin 2 in cetartiodactyl mammals: gene loss coincides with lack of upper jaw incisors in ruminants.

    PubMed

    Opazo, Juan C; Zavala, Kattina; Krall, Paola; Arias, Rodrigo A

    2017-01-01

    Understanding the processes that give rise to genomic variability in extant species is an active area of research within evolutionary biology. With the availability of whole genome sequences, it is possible to quantify different forms of variability such as variation in gene copy number, which has been described as an important source of genetic variability and in consequence of phenotypic variability. Most of the research on this topic has been focused on understanding the biological significance of gene duplication, and less attention has been given to the evolutionary role of gene loss. Gremlin 2 is a member of the DAN gene family and plays a significant role in tooth development by blocking the ligand-signaling pathway of BMP2 and BMP4. The goal of this study was to investigate the evolutionary history of gremlin 2 in cetartiodactyl mammals, a group that possesses highly divergent teeth morphology. Results from our analyses indicate that gremlin 2 has experienced a mixture of gene loss, gene duplication, and rate acceleration. Although the last common ancestor of cetartiodactyls possessed a single gene copy, pigs and camels are the only cetartiodactyl groups that have retained gremlin 2. According to the phyletic distribution of this gene and synteny analyses, we propose that gremlin 2 was lost in the common ancestor of ruminants and cetaceans between 56.3 and 63.5 million years ago as a product of a chromosomal rearrangement. Our analyses also indicate that the rate of evolution of gremlin 2 has been accelerated in the two groups that have retained this gene. Additionally, the lack of this gene could explain the high diversity of teeth among cetartiodactyl mammals; specifically, the presence of this gene could act as a biological constraint. Thus, our results support the notions that gene loss is a way to increase phenotypic diversity and that gremlin 2 is a dispensable gene, at least in cetartiodactyl mammals.

  5. Immunoglobulin kappa light chain gene promoter and enhancer are not responsible for B-cell restricted gene rearrangement.

    PubMed Central

    Goodhardt, M; Babinet, C; Lutfalla, G; Kallenbach, S; Cavelier, P; Rougeon, F

    1989-01-01

    We have produced transgenic mice which synthesize chimeric mouse-rabbit immunoglobulin (Ig) kappa light chains following in vivo recombination of an injected unrearranged kappa gene. The exogenous gene construct contained a mouse germ-line kappa variable (V kappa) gene segment, the mouse germ-line joining (J kappa) locus including the enhancer, and the rabbit b9 constant (C kappa) region. A high level of V-J recombination of the kappa transgene was observed in spleen of the transgenic mice. Surprisingly, a particularly high degree of variability in the exact site of recombination and the presence of non germ-line encoded nucleotides (N-regions) were found at the V-J junction of the rearranged kappa transgene. Furthermore, unlike endogenous kappa genes, rearrangement of the exogenous gene occurred in T-cells of the transgenic mice. These results show that additional sequences, other than the heptamer-nonamer signal sequences and the promoter and enhancer elements, are required to obtain stage- and lineage- specific regulation of Ig kappa light chain gene rearrangement in vivo. Images PMID:2508061

  6. Primer sets for cloning the human repertoire of T cell Receptor Variable regions.

    PubMed

    Boria, Ilenia; Cotella, Diego; Dianzani, Irma; Santoro, Claudio; Sblattero, Daniele

    2008-08-29

    Amplification and cloning of naïve T cell Receptor (TR) repertoires or antigen-specific TR is crucial to shape immune response and to develop immuno-based therapies. TR variable (V) regions are encoded by several genes that recombine during T cell development. The cloning of expressed genes as large diverse libraries from natural sources relies upon the availability of primers able to amplify as many V genes as possible. Here, we present a list of primers computationally designed on all functional TR V and J genes listed in the IMGT, the ImMunoGeneTics information system. The list consists of unambiguous or degenerate primers suitable to theoretically amplify and clone the entire TR repertoire. We show that it is possible to selectively amplify and clone expressed TR V genes in one single RT-PCR step and from as little as 1000 cells. This new primer set will facilitate the creation of more diverse TR libraries than has been possible using currently available primer sets.

  7. Genetics of Obsessive-Compulsive Disorder and Related Disorders

    PubMed Central

    Browne, Heidi A.; Gair, Shannon L.; Scharf, Jeremiah M.; Grice, Dorothy E.

    2014-01-01

    Synopsis Twin and family studies support a significant genetic contribution to obsessive-compulsive disorder (OCD) and related disorders such as chronic tic disorders, trichotillomania, skin picking disorder, body dysmorphic disorder, and hoarding disorder. Recently, population-based studies and novel laboratory-based methods have confirmed substantial heritability in OCD. Genome-wide association studies and candidate gene association studies have provided information on specific genes that may be involved in the pathobiology of OCD and also of related disorders, particularly chronic tic disorders, though these genes each contribute only a small portion of the total genetic risk and a substantial portion of the specific genetic risk profile in OCD is still unknown. Nevertheless, there are some examples of genes for which perturbations produce OCD-like phenotypes in animal model systems, allowing a laboratory platform for investigating the pathobiology of --- and new treatments for --- OCD and related disorders. Future work promises to continue to clarify the specific genes involved in risk for OCD as well as their interaction with environmental variables. PMID:25150565

  8. A PCR primer bank for quantitative gene expression analysis.

    PubMed

    Wang, Xiaowei; Seed, Brian

    2003-12-15

    Although gene expression profiling by microarray analysis is a useful tool for assessing global levels of transcriptional activity, variability associated with the data sets usually requires that observed differences be validated by some other method, such as real-time quantitative polymerase chain reaction (real-time PCR). However, non-specific amplification of non-target genes is frequently observed in the latter, confounding the analysis in approximately 40% of real-time PCR attempts when primer-specific labels are not used. Here we present an experimentally validated algorithm for the identification of transcript-specific PCR primers on a genomic scale that can be applied to real-time PCR with sequence-independent detection methods. An online database, PrimerBank, has been created for researchers to retrieve primer information for their genes of interest. PrimerBank currently contains 147 404 primers encompassing most known human and mouse genes. The primer design algorithm has been tested by conventional and real-time PCR for a subset of 112 primer pairs with a success rate of 98.2%.

  9. eap Gene as novel target for specific identification of Staphylococcus aureus.

    PubMed

    Hussain, Muzaffar; von Eiff, Christof; Sinha, Bhanu; Joost, Insa; Herrmann, Mathias; Peters, Georg; Becker, Karsten

    2008-02-01

    The cell surface-associated extracellular adherence protein (Eap) mediates adherence of Staphylococcus aureus to host extracellular matrix components and inhibits inflammation, wound healing, and angiogenesis. A well-characterized collection of S. aureus and non-S. aureus staphylococcal isolates (n = 813) was tested for the presence of the Eap-encoding gene (eap) by PCR to investigate the use of the eap gene as a specific diagnostic tool for identification of S. aureus. Whereas all 597 S. aureus isolates were eap positive, this gene was not detectable in 216 non-S. aureus staphylococcal isolates comprising 47 different species and subspecies of coagulase-negative staphylococci and non-S. aureus coagulase-positive or coagulase-variable staphylococci. Furthermore, non-S. aureus isolates did not express Eap homologs, as verified on the transcriptional and protein levels. Based on these data, the sensitivity and specificity of the newly developed PCR targeting the eap gene were both 100%. Thus, the unique occurrence of Eap in S. aureus offers a promising tool particularly suitable for molecular diagnostics of this pathogen.

  10. Comparative genomics of four closely related Clostridium perfringens bacteriophages reveals variable evolution among core genes with therapeutic potential

    PubMed Central

    2011-01-01

    Background Because biotechnological uses of bacteriophage gene products as alternatives to conventional antibiotics will require a thorough understanding of their genomic context, we sequenced and analyzed the genomes of four closely related phages isolated from Clostridium perfringens, an important agricultural and human pathogen. Results Phage whole-genome tetra-nucleotide signatures and proteomic tree topologies correlated closely with host phylogeny. Comparisons of our phage genomes to 26 others revealed three shared COGs; of particular interest within this core genome was an endolysin (PF01520, an N-acetylmuramoyl-L-alanine amidase) and a holin (PF04531). Comparative analyses of the evolutionary history and genomic context of these common phage proteins revealed two important results: 1) strongly significant host-specific sequence variation within the endolysin, and 2) a protein domain architecture apparently unique to our phage genomes in which the endolysin is located upstream of its associated holin. Endolysin sequences from our phages were one of two very distinct genotypes distinguished by variability within the putative enzymatically-active domain. The shared or core genome was comprised of genes with multiple sequence types belonging to five pfam families, and genes belonging to 12 pfam families, including the holin genes, which were nearly identical. Conclusions Significant genomic diversity exists even among closely-related bacteriophages. Holins and endolysins represent conserved functions across divergent phage genomes and, as we demonstrate here, endolysins can have significant variability and host-specificity even among closely-related genomes. Endolysins in our phage genomes may be subject to different selective pressures than the rest of the genome. These findings may have important implications for potential biotechnological applications of phage gene products. PMID:21631945

  11. Compendium of Immune Signatures Identifies Conserved and Species-Specific Biology in Response to Inflammation.

    PubMed

    Godec, Jernej; Tan, Yan; Liberzon, Arthur; Tamayo, Pablo; Bhattacharya, Sanchita; Butte, Atul J; Mesirov, Jill P; Haining, W Nicholas

    2016-01-19

    Gene-expression profiling has become a mainstay in immunology, but subtle changes in gene networks related to biological processes are hard to discern when comparing various datasets. For instance, conservation of the transcriptional response to sepsis in mouse models and human disease remains controversial. To improve transcriptional analysis in immunology, we created ImmuneSigDB: a manually annotated compendium of ∼5,000 gene-sets from diverse cell states, experimental manipulations, and genetic perturbations in immunology. Analysis using ImmuneSigDB identified signatures induced in activated myeloid cells and differentiating lymphocytes that were highly conserved between humans and mice. Sepsis triggered conserved patterns of gene expression in humans and mouse models. However, we also identified species-specific biological processes in the sepsis transcriptional response: although both species upregulated phagocytosis-related genes, a mitosis signature was specific to humans. ImmuneSigDB enables granular analysis of transcriptomic data to improve biological understanding of immune processes of the human and mouse immune systems. Copyright © 2016 Elsevier Inc. All rights reserved.

  12. Improved site-specific recombinase-based method to produce selectable marker- and vector-backbone-free transgenic cells

    NASA Astrophysics Data System (ADS)

    Yu, Yuan; Tong, Qi; Li, Zhongxia; Tian, Jinhai; Wang, Yizhi; Su, Feng; Wang, Yongsheng; Liu, Jun; Zhang, Yong

    2014-02-01

    PhiC31 integrase-mediated gene delivery has been extensively used in gene therapy and animal transgenesis. However, random integration events are observed in phiC31-mediated integration in different types of mammalian cells; as a result, the efficiencies of pseudo attP site integration and evaluation of site-specific integration are compromised. To improve this system, we used an attB-TK fusion gene as a negative selection marker, thereby eliminating random integration during phiC31-mediated transfection. We also excised the selection system and plasmid bacterial backbone by using two other site-specific recombinases, Cre and Dre. Thus, we generated clean transgenic bovine fetal fibroblast cells free of selectable marker and plasmid bacterial backbone. These clean cells were used as donor nuclei for somatic cell nuclear transfer (SCNT), indicating a similar developmental competence of SCNT embryos to that of non-transgenic cells. Therefore, the present gene delivery system facilitated the development of gene therapy and agricultural biotechnology.

  13. Genetically-Driven Enhancement of Dopaminergic Transmission Affects Moral Acceptability in Females but Not in Males: A Pilot Study

    PubMed Central

    Pellegrini, Silvia; Palumbo, Sara; Iofrida, Caterina; Melissari, Erika; Rota, Giuseppina; Mariotti, Veronica; Anastasio, Teresa; Manfrinati, Andrea; Rumiati, Rino; Lotto, Lorella; Sarlo, Michela; Pietrini, Pietro

    2017-01-01

    Moral behavior has been a key topic of debate for philosophy and psychology for a long time. In recent years, thanks to the development of novel methodologies in cognitive sciences, the question of how we make moral choices has expanded to the study of neurobiological correlates that subtend the mental processes involved in moral behavior. For instance, in vivo brain imaging studies have shown that distinct patterns of brain neural activity, associated with emotional response and cognitive processes, are involved in moral judgment. Moreover, while it is well-known that responses to the same moral dilemmas differ across individuals, to what extent this variability may be rooted in genetics still remains to be understood. As dopamine is a key modulator of neural processes underlying executive functions, we questioned whether genetic polymorphisms associated with decision-making and dopaminergic neurotransmission modulation would contribute to the observed variability in moral judgment. To this aim, we genotyped five genetic variants of the dopaminergic pathway [rs1800955 in the dopamine receptor D4 (DRD4) gene, DRD4 48 bp variable number of tandem repeat (VNTR), solute carrier family 6 member 3 (SLC6A3) 40 bp VNTR, rs4680 in the catechol-O-methyl transferase (COMT) gene, and rs1800497 in the ankyrin repeat and kinase domain containing 1 (ANKK1) gene] in 200 subjects, who were requested to answer 56 moral dilemmas. As these variants are all located in genes belonging to the dopaminergic pathway, they were combined in multilocus genetic profiles for the association analysis. While no individual variant showed any significant effects on moral dilemma responses, the multilocus genetic profile analysis revealed a significant gender-specific influence on human moral acceptability. Specifically, those genotype combinations that improve dopaminergic signaling selectively increased moral acceptability in females, by making their responses to moral dilemmas more similar to those provided by males. As females usually give more emotionally-based answers and engage the “emotional brain” more than males, our results, though preliminary and therefore in need of replication in independent samples, suggest that this increase in dopamine availability enhances the cognitive and reduces the emotional components of moral decision-making in females, thus favoring a more rationally-driven decision process. PMID:28900390

  14. Bacillus subtilis genome diversity.

    PubMed

    Earl, Ashlee M; Losick, Richard; Kolter, Roberto

    2007-02-01

    Microarray-based comparative genomic hybridization (M-CGH) is a powerful method for rapidly identifying regions of genome diversity among closely related organisms. We used M-CGH to examine the genome diversity of 17 strains belonging to the nonpathogenic species Bacillus subtilis. Our M-CGH results indicate that there is considerable genetic heterogeneity among members of this species; nearly one-third of Bsu168-specific genes exhibited variability, as measured by the microarray hybridization intensities. The variable loci include those encoding proteins involved in antibiotic production, cell wall synthesis, sporulation, and germination. The diversity in these genes may reflect this organism's ability to survive in diverse natural settings.

  15. Mitochondria and the non-genetic origins of cell-to-cell variability: More is different.

    PubMed

    Guantes, Raúl; Díaz-Colunga, Juan; Iborra, Francisco J

    2016-01-01

    Gene expression activity is heterogeneous in a population of isogenic cells. Identifying the molecular basis of this variability will improve our understanding of phenomena like tumor resistance to drugs, virus infection, or cell fate choice. The complexity of the molecular steps and machines involved in transcription and translation could introduce sources of randomness at many levels, but a common constraint to most of these processes is its energy dependence. In eukaryotic cells, most of this energy is provided by mitochondria. A clonal population of cells may show a large variability in the number and functionality of mitochondria. Here, we discuss how differences in the mitochondrial content of each cell contribute to heterogeneity in gene products. Changes in the amount of mitochondria can also entail drastic alterations of a cell's gene expression program, which ultimately leads to phenotypic diversity. Also watch the Video Abstract. © 2015 WILEY Periodicals, Inc.

  16. Transcriptome-Level Signatures in Gene Expression and Gene Expression Variability during Bacterial Adaptive Evolution.

    PubMed

    Erickson, Keesha E; Otoupal, Peter B; Chatterjee, Anushree

    2017-01-01

    Antibiotic-resistant bacteria are an increasingly serious public health concern, as strains emerge that demonstrate resistance to almost all available treatments. One factor that contributes to the crisis is the adaptive ability of bacteria, which exhibit remarkable phenotypic and gene expression heterogeneity in order to gain a survival advantage in damaging environments. This high degree of variability in gene expression across biological populations makes it a challenging task to identify key regulators of bacterial adaptation. Here, we research the regulation of adaptive resistance by investigating transcriptome profiles of Escherichia coli upon adaptation to disparate toxins, including antibiotics and biofuels. We locate potential target genes via conventional gene expression analysis as well as using a new analysis technique examining differential gene expression variability. By investigating trends across the diverse adaptation conditions, we identify a focused set of genes with conserved behavior, including those involved in cell motility, metabolism, membrane structure, and transport, and several genes of unknown function. To validate the biological relevance of the observed changes, we synthetically perturb gene expression using clustered regularly interspaced short palindromic repeat (CRISPR)-dCas9. Manipulation of select genes in combination with antibiotic treatment promotes adaptive resistance as demonstrated by an increased degree of antibiotic tolerance and heterogeneity in MICs. We study the mechanisms by which identified genes influence adaptation and find that select differentially variable genes have the potential to impact metabolic rates, mutation rates, and motility. Overall, this work provides evidence for a complex nongenetic response, encompassing shifts in gene expression and gene expression variability, which underlies adaptive resistance. IMPORTANCE Even initially sensitive bacteria can rapidly thwart antibiotic treatment through stress response processes known as adaptive resistance. Adaptive resistance fosters transient tolerance increases and the emergence of mutations conferring heritable drug resistance. In order to extend the applicable lifetime of new antibiotics, we must seek to hinder the occurrence of bacterial adaptive resistance; however, the regulation of adaptation is difficult to identify due to immense heterogeneity emerging during evolution. This study specifically seeks to generate heterogeneity by adapting bacteria to different stresses and then examines gene expression trends across the disparate populations in order to pinpoint key genes and pathways associated with adaptive resistance. The targets identified here may eventually inform strategies for impeding adaptive resistance and prolonging the effectiveness of antibiotic treatment.

  17. Surface Diversity in Mycoplasma agalactiae Is Driven by Site-Specific DNA Inversions within the vpma Multigene Locus

    PubMed Central

    Glew, Michelle D.; Marenda, Marc; Rosengarten, Renate; Citti, Christine

    2002-01-01

    The ruminant pathogen Mycoplasma agalactiae possesses a family of abundantly expressed variable surface lipoproteins called Vpmas. Phenotypic switches between Vpma members have previously been correlated with DNA rearrangements within a locus of vpma genes and are proposed to play an important role in disease pathogenesis. In this study, six vpma genes were characterized in the M. agalactiae type strain PG2. All vpma genes clustered within an 8-kb region and shared highly conserved 5′ untranslated regions, lipoprotein signal sequences, and short N-terminal sequences. Analyses of the vpma loci from consecutive clonal isolates showed that vpma DNA rearrangements were site specific and that cleavage and strand exchange occurred within a minimal region of 21 bp located within the 5′ untranslated region of all vpma genes. This process controlled expression of vpma genes by effectively linking the open reading frame (ORF) of a silent gene to a unique active promoter sequence within the locus. An ORF (xer1) immediately adjacent to one end of the vpma locus did not undergo rearrangement and had significant homology to a distinct subset of genes belonging to the λ integrase family of site-specific xer recombinases. It is proposed that xer1 codes for a site-specific recombinase that is not involved in chromosome dimer resolution but rather is responsible for the observed vpma-specific recombination in M. agalactiae. PMID:12374833

  18. Retroviral vectors encoding ADA regulatory locus control region provide enhanced T-cell-specific transgene expression

    PubMed Central

    2009-01-01

    Background Murine retroviral vectors have been used in several hundred gene therapy clinical trials, but have fallen out of favor for a number of reasons. One issue is that gene expression from viral or internal promoters is highly variable and essentially unregulated. Moreover, with retroviral vectors, gene expression is usually silenced over time. Mammalian genes, in contrast, are characterized by highly regulated, precise levels of expression in both a temporal and a cell-specific manner. To ascertain if recapitulation of endogenous adenosine deaminase (ADA) expression can be achieved in a vector construct we created a new series of Moloney murine leukemia virus (MuLV) based retroviral vector that carry human regulatory elements including combinations of the ADA promoter, the ADA locus control region (LCR), ADA introns and human polyadenylation sequences in a self-inactivating vector backbone. Methods A MuLV-based retroviral vector with a self-inactivating (SIN) backbone, the phosphoglycerate kinase promoter (PGK) and the enhanced green fluorescent protein (eGFP), as a reporter gene, was generated. Subsequent vectors were constructed from this basic vector by deletion or addition of certain elements. The added elements that were assessed are the human ADA promoter, human ADA locus control region (LCR), introns 7, 8, and 11 from the human ADA gene, and human growth hormone polyadenylation signal. Retroviral vector particles were produced by transient three-plasmid transfection of 293T cells. Retroviral vectors encoding eGFP were titered by transducing 293A cells, and then the proportion of GFP-positive cells was determined using fluorescence-activated cell sorting (FACS). Non T-cell and T-cell lines were transduced at a multiplicity of infection (MOI) of 0.1 and the yield of eGFP transgene expression was evaluated by FACS analysis using mean fluorescent intensity (MFI) detection. Results Vectors that contained the ADA LCR were preferentially expressed in T-cell lines. Further improvements in T-cell specific gene expression were observed with the incorporation of additional cis-regulatory elements, such as a human polyadenylation signal and intron 7 from the human ADA gene. Conclusion These studies suggest that the combination of an authentically regulated ADA gene in a murine retroviral vector, together with additional locus-specific regulatory refinements, will yield a vector with a safer profile and greater efficacy in terms of high-level, therapeutic, regulated gene expression for the treatment of ADA-deficient severe combined immunodeficiency. PMID:20042112

  19. Development of B cells expressing surface immunoglobulin molecules that lack V(D)J-encoded determinants in the avian embryo bursa of Fabricius

    PubMed Central

    Sayegh, Camil E.; Demaries, Sandra L.; Iacampo, Sandra; Ratcliffe, Michael J. H.

    1999-01-01

    Immunoglobulin gene rearrangement in avian B cell precursors generates surface Ig receptors of limited diversity. It has been proposed that specificities encoded by these receptors play a critical role in B lineage development by recognizing endogenous ligands within the bursa of Fabricius. To address this issue directly we have introduced a truncated surface IgM, lacking variable region domains, into developing B precursors by retroviral gene transfer in vivo. Cells expressing this truncated receptor lack endogenous surface IgM, and the low level of endogenous Ig rearrangements that have occurred within this population of cells has not been selected for having a productive reading frame. Such cells proliferate rapidly within bursal epithelial buds of normal morphology. In addition, despite reduced levels of endogenous light chain rearrangement, those light chain rearrangements that have occurred have undergone variable region diversification by gene conversion. Therefore, although surface expression of an Ig receptor is required for bursal colonization and the induction of gene conversion, the specificity encoded by the prediversified receptor is irrelevant and, consequently, there is no obligate ligand for V(D)J-encoded determinants of prediversified avian cell surface IgM receptor. PMID:10485907

  20. Prediction of essential proteins based on gene expression programming.

    PubMed

    Zhong, Jiancheng; Wang, Jianxin; Peng, Wei; Zhang, Zhen; Pan, Yi

    2013-01-01

    Essential proteins are indispensable for cell survive. Identifying essential proteins is very important for improving our understanding the way of a cell working. There are various types of features related to the essentiality of proteins. Many methods have been proposed to combine some of them to predict essential proteins. However, it is still a big challenge for designing an effective method to predict them by integrating different features, and explaining how these selected features decide the essentiality of protein. Gene expression programming (GEP) is a learning algorithm and what it learns specifically is about relationships between variables in sets of data and then builds models to explain these relationships. In this work, we propose a GEP-based method to predict essential protein by combing some biological features and topological features. We carry out experiments on S. cerevisiae data. The experimental results show that the our method achieves better prediction performance than those methods using individual features. Moreover, our method outperforms some machine learning methods and performs as well as a method which is obtained by combining the outputs of eight machine learning methods. The accuracy of predicting essential proteins can been improved by using GEP method to combine some topological features and biological features.

  1. Genotypic, Phenotypic and Clinical Validation of GeneXpert in Extra-Pulmonary and Pulmonary Tuberculosis in India

    PubMed Central

    Singh, Urvashi B.; Pandey, Pooja; Mehta, Girija; Bhatnagar, Anuj K.; Mohan, Anant; Goyal, Vinay; Ahuja, Vineet; Ramachandran, Ranjani; Sachdeva, Kuldeep S.; Samantaray, Jyotish C.

    2016-01-01

    Background Newer molecular diagnostics have brought paradigm shift in early diagnosis of tuberculosis [TB]. WHO recommended use of GeneXpert MTB/RIF [Xpert] for Extra-pulmonary [EP] TB; critics have since questioned its efficiency. Methods The present study was designed to assess the performance of GeneXpert in 761 extra-pulmonary and 384 pulmonary specimens from patients clinically suspected of TB and compare with Phenotypic, Genotypic and Composite reference standards [CRS]. Results Comparison of GeneXpert results to CRS, demonstrated sensitivity of 100% and 90.68%, specificity of 100% and 99.62% for pulmonary and extra-pulmonary samples. On comparison with culture, sensitivity for Rifampicin [Rif] resistance detection was 87.5% and 81.82% respectively, while specificity was 100% for both pulmonary and extra-pulmonary TB. On comparison to sequencing of rpoB gene [Rif resistance determining region, RRDR], sensitivity was respectively 93.33% and 90% while specificity was 100% in both pulmonary and extra-pulmonary TB. GeneXpert assay missed 533CCG mutation in one sputum and dual mutation [517 & 519] in one pus sample, detected by sequencing. Sequencing picked dual mutation [529, 530] in a sputum sample sensitive to Rif, demonstrating, not all RRDR mutations lead to resistance. Conclusions Current study reports observations in a patient care setting in a high burden region, from a large collection of pulmonary and extra-pulmonary samples and puts to rest questions regarding sensitivity, specificity, detection of infrequent mutations and mutations responsible for low-level Rif resistance by GeneXpert. Improvements in the assay could offer further improvement in sensitivity of detection in different patient samples; nevertheless it may be difficult to improve sensitivity of Rif resistance detection if only one gene is targeted. Assay specificity was high both for TB detection and Rif resistance detection. Despite a few misses, the assay offers major boost to early diagnosis of TB and MDR-TB, in difficult to diagnose pauci-bacillary TB. PMID:26894283

  2. Sexually divergent induction of microglial-associated neuroinflammation with hippocampal aging.

    PubMed

    Mangold, Colleen A; Wronowski, Benjamin; Du, Mei; Masser, Dustin R; Hadad, Niran; Bixler, Georgina V; Brucklacher, Robert M; Ford, Matthew M; Sonntag, William E; Freeman, Willard M

    2017-07-21

    The necessity of including both males and females in molecular neuroscience research is now well understood. However, there is relatively limited basic biological data on brain sex differences across the lifespan despite the differences in age-related neurological dysfunction and disease between males and females. Whole genome gene expression of young (3 months), adult (12 months), and old (24 months) male and female C57BL6 mice hippocampus was analyzed. Subsequent bioinformatic analyses and confirmations of age-related changes and sex differences in hippocampal gene and protein expression were performed. Males and females demonstrate both common expression changes with aging and marked sex differences in the nature and magnitude of the aging responses. Age-related hippocampal induction of neuroinflammatory gene expression was sexually divergent and enriched for microglia-specific genes such as complement pathway components. Sexually divergent C1q protein expression was confirmed by immunoblotting and immunohistochemistry. Similar patterns of cortical sexually divergent gene expression were also evident. Additionally, inter-animal gene expression variability increased with aging in males, but not females. These findings demonstrate sexually divergent neuroinflammation with aging that may contribute to sex differences in age-related neurological diseases such as stroke and Alzheimer's, specifically in the complement system. The increased expression variability in males suggests a loss of fidelity in gene expression regulation with aging. These findings reveal a central role of sex in the transcriptomic response of the hippocampus to aging that warrants further, in depth, investigations.

  3. Colonizing the world in spite of reduced MHC variation

    USGS Publications Warehouse

    Gangoso, L.; Alcaide, M.; Grande, J.M.; Muñoz, J.; Talbot, Sandra L.; Sonsthagen, Sarah A.; Sage, Kevin; Figuerola, J.

    2012-01-01

    Reduced immune gene diversity is thought to negatively affect the capacity of organisms to adapt to pathogen challenges, which represent a major force in natural selection. Genes of the Major Histocompatibility Complex (MHC) are the most widely invoked adaptive loci in conservation biology, and have become the most popular genetic markers to investigate pathogen-host interactions in vertebrates. Although MHC genes are the most polymorphic genes described in the vertebrate genome, the extent to which MHC diversity determines the long-term persistence of populations is, unclear and often debated, as recent studies have documented the occurrence of natural populations thriving even after a depletion of MHC diversity caused by genetic drift. Here, we show that some phylogenetically related species belonging to the Falco genus (Aves: Falconidae) present a dramatically low MHC variability that has not precluded, nevertheless, the successful colonization of almost all existing regions and habitats worldwide. We found evidence for two remarkably different patterns of MHC variation within the genus. While kestrels show a high MHC variation according to the general theory, falcons exhibit an ancestrally low intra- and inter-specific MHC allelic diversity. We provide compelling evidence that this pattern is not caused by the degeneration of functional genes into pseudogenes, the inadvertent analyses of paralogous MHC genes, or the devastating action of genetic drift. Instead, our results strongly support the idea of an evolutionary transition driven and maintained by natural selection from primarily highly variable towards low polymorphic, but functional and expressed, MHC genes with species-specific pathogen-recognition capabilities.

  4. Specific PCR primers directed to identify cryI and cryIII genes within a Bacillus thuringiensis strain collection.

    PubMed Central

    Cerón, J; Ortíz, A; Quintero, R; Güereca, L; Bravo, A

    1995-01-01

    In this paper we describe a PCR strategy that can be used to rapidly identify Bacillus thuringiensis strains that harbor any of the known cryI or cryIII genes. Four general PCR primers which amplify DNA fragments from the known cryI or cryIII genes were selected from conserved regions. Once a strain was identified as an organism that contains a particular type of cry gene, it could be easily characterized by performing additional PCR with specific cryI and cryIII primers selected from variable regions. The method described in this paper can be used to identify the 10 different cryI genes and the five different cryIII genes. One feature of this screening method is that each cry gene is expected to produce a PCR product having a precise molecular weight. The genes which produce PCR products having different sizes probably represent strains that harbor a potentially novel cry gene. Finally, we present evidence that novel crystal genes can be identified by the method described in this paper. PMID:8526493

  5. Differential gene expression patterns in the autogamous plant Hordeum euclaston (Poaceae).

    PubMed

    Georg-Kraemer, J E; Ferreira, C A S; Cavalli, S S

    2011-02-22

    Sib-seedlings of 95 strains of the strictly autogamous grass Hordeum euclaston were analyzed by horizontal polyacrylamide gel electrophoresis for four isoenzyme systems at a specific ontogenetic stage. We found differences in the activity of some genes among individuals of this species. Hence, an ontogenetic analysis was carried out to investigate 12 strains at five ontogenetic stages, to determine the patterns of expression of these genes during development. The differences in the presence versus absence of certain isoenzyme bands may be due to differential regulatory activation in response to environmental differences, as all plants showed the same structural genes, although these genes were active in different tissues and/or times of development. These results indicate the importance of differential gene activation in the metabolic phenotype variability of this strictly autogamous, highly homozygous species. The same structural alleles for isoenzymes showed the active form of the enzymes (phenotypic expression) to be present in different tissues and/or stages of development. Differential isoenzyme gene activation was shown to be directly responsible for the enzymatic variability (metabolic phenotype) presented by the plants, which seem to possess almost no heterozygosis.

  6. MO-DE-207B-03: Improved Cancer Classification Using Patient-Specific Biological Pathway Information Via Gene Expression Data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Young, M; Craft, D

    Purpose: To develop an efficient, pathway-based classification system using network biology statistics to assist in patient-specific response predictions to radiation and drug therapies across multiple cancer types. Methods: We developed PICS (Pathway Informed Classification System), a novel two-step cancer classification algorithm. In PICS, a matrix m of mRNA expression values for a patient cohort is collapsed into a matrix p of biological pathways. The entries of p, which we term pathway scores, are obtained from either principal component analysis (PCA), normal tissue centroid (NTC), or gene expression deviation (GED). The pathway score matrix is clustered using both k-means and hierarchicalmore » clustering, and a clustering is judged by how well it groups patients into distinct survival classes. The most effective pathway scoring/clustering combination, per clustering p-value, thus generates various ‘signatures’ for conventional and functional cancer classification. Results: PICS successfully regularized large dimension gene data, separated normal and cancerous tissues, and clustered a large patient cohort spanning six cancer types. Furthermore, PICS clustered patient cohorts into distinct, statistically-significant survival groups. For a suboptimally-debulked ovarian cancer set, the pathway-classified Kaplan-Meier survival curve (p = .00127) showed significant improvement over that of a prior gene expression-classified study (p = .0179). For a pancreatic cancer set, the pathway-classified Kaplan-Meier survival curve (p = .00141) showed significant improvement over that of a prior gene expression-classified study (p = .04). Pathway-based classification confirmed biomarkers for the pyrimidine, WNT-signaling, glycerophosphoglycerol, beta-alanine, and panthothenic acid pathways for ovarian cancer. Despite its robust nature, PICS requires significantly less run time than current pathway scoring methods. Conclusion: This work validates the PICS method to improve cancer classification using biological pathways. Patients are classified with greater specificity and physiological relevance as compared to current gene-specific approaches. Focus now moves to utilizing PICS for pan-cancer patient-specific treatment response prediction.« less

  7. Configurations of a two-tiered amplified gene expression system in adenoviral vectors designed to improve the specificity of in vivo prostate cancer imaging

    PubMed Central

    Sato, M; Figueiredo, ML; Burton, JB; Johnson, M; Chen, M; Powell, R; Gambhir, SS; Carey, M; Wu, L

    2009-01-01

    Effective treatment for recurrent, disseminated prostate cancer is notably limited. We have developed adenoviral vectors with a prostate-specific two-step transcriptional amplification (TSTA) system that would express therapeutic genes at a robust level to target metastatic disease. The TSTA system employs the prostate-specific antigen (PSA) promoter/enhancer to drive a potent synthetic activator, which in turn activates the expression of the therapeutic gene. In this study, we explored different configurations of this bipartite system and discovered that physical separation of the two TSTA components into E1 and E3 regions of adenovirus was able to enhance androgen regulation and cell-discriminatory expression. The TSTA vectors that express imaging reporter genes were assessed by noninvasive imaging technologies in animal models. The improved selectivity of the E1E3 configured vector was reflected in silenced ectopic expression in the lung. Significantly, the enhanced specificity of the E1E3 vector enabled the detection of lung metastasis of prostate cancer. An E1E3 TSTA vector that expresses the herpes simplex virus thymidine kinase gene can effectively direct positron emission tomography (PET) imaging of the tumor. The prostate-targeted gene delivery vectors with robust and cell-specific expression capability will advance the development of safe and effective imaging guided therapy for recurrent metastatic stages of prostate cancer. PMID:18305574

  8. [Identification of new conserved and variable regions in the 16S rRNA gene of acetic acid bacteria and acetobacteraceae family].

    PubMed

    Chakravorty, S; Sarkar, S; Gachhui, R

    2015-01-01

    The Acetobacteraceae family of the class Alpha Proteobacteria is comprised of high sugar and acid tolerant bacteria. The Acetic Acid Bacteria are the economically most significant group of this family because of its association with food products like vinegar, wine etc. Acetobacteraceae are often hard to culture in laboratory conditions and they also maintain very low abundances in their natural habitats. Thus identification of the organisms in such environments is greatly dependent on modern tools of molecular biology which require a thorough knowledge of specific conserved gene sequences that may act as primers and or probes. Moreover unconserved domains in genes also become markers for differentiating closely related genera. In bacteria, the 16S rRNA gene is an ideal candidate for such conserved and variable domains. In order to study the conserved and variable domains of the 16S rRNA gene of Acetic Acid Bacteria and the Acetobacteraceae family, sequences from publicly available databases were aligned and compared. Near complete sequences of the gene were also obtained from Kombucha tea biofilm, a known Acetobacteraceae family habitat, in order to corroborate the domains obtained from the alignment studies. The study indicated that the degree of conservation in the gene is significantly higher among the Acetic Acid Bacteria than the whole Acetobacteraceae family. Moreover it was also observed that the previously described hypervariable regions V1, V3, V5, V6 and V7 were more or less conserved in the family and the spans of the variable regions are quite distinct as well.

  9. Adjusting for background mutation frequency biases improves the identification of cancer driver genes.

    PubMed

    Evans, Perry; Avey, Stefan; Kong, Yong; Krauthammer, Michael

    2013-09-01

    A common goal of tumor sequencing projects is finding genes whose mutations are selected for during tumor development. This is accomplished by choosing genes that have more non-synonymous mutations than expected from an estimated background mutation frequency. While this background frequency is unknown, it can be estimated using both the observed synonymous mutation frequency and the non-synonymous to synonymous mutation ratio. The synonymous mutation frequency can be determined across all genes or in a gene-specific manner. This choice introduces an interesting trade-off. A gene-specific frequency adjusts for an underlying mutation bias, but is difficult to estimate given missing synonymous mutation counts. Using a genome-wide synonymous frequency is more robust, but is less suited for adjusting biases. Studying four evaluation criteria for identifying genes with high non-synonymous mutation burden (reflecting preferential selection of expressed genes, genes with mutations in conserved bases, genes with many protein interactions, and genes that show loss of heterozygosity), we find that the gene-specific synonymous frequency is superior in the gene expression and protein interaction tests. In conclusion, the use of the gene-specific synonymous mutation frequency is well suited for assessing a gene's non-synonymous mutation burden.

  10. EXTRA-EMBRYONIC-SPECIFIC IMPRINTED EXPRESSION IS RESTRICTED TO DEFINED LINEAGES IN THE POST-IMPLANTATION EMBRYO

    PubMed Central

    Hudson, Quanah J.; Seidl, Christine I.M.; Kulinski, Tomasz M.; Huang, Ru; Warczok, Katarzyna E.; Bittner, Romana; Bartolomei, Marisa S.; Barlow, Denise P.

    2011-01-01

    A subset of imprinted genes in the mouse have been reported to show imprinted expression that is restricted to the placenta, a short-lived extra-embryonic organ. Notably these so-called 'placental-specific' imprinted genes are expressed from both parental alleles in embryo and adult tissues. The placenta is an embryonic-derived organ that is closely associated with maternal tissue and as a consequence, maternal contamination can be mistaken for maternal-specific imprinted expression. The complexity of the placenta, which arises from multiple embryonic lineages, poses additional problems in accurately assessing allele-specific repressive epigenetic modifications in genes that also show lineage-specific silencing in this organ. These problems require that extra evidence be obtained to support the imprinted status of genes whose imprinted expression is restricted to the placenta. We show here that the extra-embryonic visceral yolk sac (VYS), a nutritive membrane surrounding the developing embryo, shows a similar 'extra-embryonic-lineage-specific' pattern of imprinted expression. We present an improved enzymatic technique for separating the bilaminar VYS and show that this pattern of imprinted expression is restricted to the endoderm layer. Finally, we show that VYS 'extra-embryonic-lineage-specific' imprinted expression is regulated by DNA methylation in a similar manner as shown for genes showing multi-lineage imprinted expression in extra-embryonic, embryonic and adult tissues. These results show that the VYS is an improved model for studying the epigenetic mechanisms regulating extra-embryonic-lineage-specific imprinted expression. PMID:21354127

  11. Pasture-feeding of Charolais steers influences skeletal muscle metabolism and gene expression.

    PubMed

    Cassar-Malek, I; Jurie, C; Bernard, C; Barnola, I; Micol, D; Hocquette, J-F

    2009-10-01

    Extensive beef production systems on pasture are promoted to improve animal welfare and beef quality. This study aimed to compare the influence on muscle characteristics of two management approaches representative of intensive and extensive production systems. One group of 6 Charolais steers was fed maize-silage indoors and another group of 6 Charolais steers grazed on pasture. Activities of enzymes representative of glycolytic and oxidative (Isocitrate dehydrogenase [ICDH], citrate synthase [CS], hydroxyacyl-CoA dehydrogenase [HAD]) muscle metabolism were assessed in Rectus abdominis (RA) and Semitendinosus (ST) muscles. Activities of oxidative enzymes ICDH, CS and HAD were higher in muscles from grazing animals demonstrating a plasticity of muscle metabolism according to the production and feeding system. Gene expression profiling in RA and ST muscles was performed on both production groups using a multi-tissue bovine cDNA repertoire. Variance analysis showed an effect of the muscle type and of the production system on gene expression (P<0.001). A list of the 212 most variable genes according to the production system was established, of which 149 genes corresponded to identified genes. They were classified according to their gene function annotation mainly in the "protein metabolism and modification", "signal transduction", "cell cycle", "developmental processes" and "muscle contraction" biological processes. Selenoprotein W was found to be underexpressed in pasture-fed animals and could be proposed as a putative gene marker of the grass-based system. In conclusion, enzyme-specific adaptations and gene expression modifications were observed in response to the production system and some of them could be candidates for grazing or grass-feeding traceability.

  12. The role of glutamine synthetase and glutamate dehydrogenase in nitrogen assimilation and possibilities for improvement in the nitrogen utilization of crops.

    PubMed

    Miflin, Ben J; Habash, Dimah Z

    2002-04-01

    This short review outlines the central role of glutamine synthetase (GS) in plant nitrogen metabolism and discusses some possibilities for crop improvement. GS functions as the major assimilatory enzyme for ammonia produced from N fixation, and nitrate or ammonia nutrition. It also reassimilates ammonia released as a result of photorespiration and the breakdown of proteins and nitrogen transport compounds. GS is distributed in different subcellular locations (chloroplast and cytoplasm) and in different tissues and organs. This distribution probably changes as a function of the development of the tissue, for example, GS1 appears to play a key role in leaf senescence. The enzyme is the product of multiple genes with complex promoters that ensure the expression of the genes in an organ- and tissue-specific manner and in response to a number of environmental variables affecting the nutritional status of the cell. GS activity is also regulated post-translationally in a manner that involves 14-3-3 proteins and phosphorylation. GS and plant nitrogen metabolism is best viewed as a complex matrix continually changing during the development cycle of plants. Along with GS, a number of other enzymes play key roles in maintaining the balance of carbon and nitrogen. It is proposed that one of these is glutamate dehydrogenase (GDH). There is considerable evidence for a GDH shunt to return the carbon in amino acids back into reactions of carbon metabolism and the tri-carboxylic acid cycle. Results with transgenic plants containing transferred GS genes suggest that there may be ways in which it is possible to improve the efficiency with which crop plants use nitrogen. Marker-assisted breeding may also bring about such improvements.

  13. The influence of antibody fragment format on phage display based affinity maturation of IgG

    PubMed Central

    Steinwand, Miriam; Droste, Patrick; Frenzel, Andrè; Hust, Michael; Dübel, Stefan; Schirrmann, Thomas

    2014-01-01

    Today, most approved therapeutic antibodies are provided as immunoglobulin G (IgG), whereas small recombinant antibody formats are required for in vitro antibody generation and engineering during drug development. Particularly, single chain (sc) antibody fragments like scFv or scFab are well suited for phage display and bacterial expression, but some have been found to lose affinity during conversion into IgG.   In this study, we compared the influence of the antibody format on affinity maturation of the CD30-specific scFv antibody fragment SH313-F9, with the overall objective being improvement of the IgG. The variable genes of SH313-F9 were randomly mutated and then cloned into libraries encoding different recombinant antibody formats, including scFv, Fab, scFabΔC, and FabΔC. All tested antibody formats except Fab allowed functional phage display of the parental antibody SH313-F9, and the corresponding mutated antibody gene libraries allowed isolation of candidates with enhanced CD30 binding. Moreover, scFv and scFabΔC antibody variants retained improved antigen binding after subcloning into the single gene encoded IgG-like formats scFv-Fc or scIgG, but lost affinity after conversion into IgGs. Only affinity maturation using the Fab-like FabΔC format, which does not contain the carboxy terminal cysteines, allowed successful selection of molecules with improved binding that was retained after conversion to IgG. Thus, affinity maturation of IgGs is dependent on the antibody format employed for selection and screening. In this study, only FabΔC resulted in the efficient selection of IgG candidates with higher affinity by combination of Fab-like conformation and improved phage display compared with Fab. PMID:24262918

  14. Isolation and partial characterization of a root-specific promoter for stacking multiple traits into cassava (Manihot esculenta CRANTZ).

    PubMed

    Gbadegesin, M A; Beeching, J R

    2011-06-07

    Cassava can be cultivated on impoverished soils with minimum inputs, and its storage roots are a staple food for millions in Africa. However, these roots are low in bioavailable nutrients and in protein content, contain cyanogenic glycosides, and suffer from a very short post-harvest shelf-life, and the plant is susceptible to viral and bacterial diseases prevalent in Africa. The demand for improvement of cassava with respect to these traits comes from both farmers and national agricultural institutions. Genetic improvement of cassava cultivars by molecular biology techniques requires the availability of appropriate genes, a system to introduce these genes into cassava, and the use of suitable gene promoters. Cassava root-specific promoter for auxin-repressed protein was isolated using the gene walking approach, starting with a cDNA sequence. In silico analysis of promoter sequences revealed putative cis-acting regulatory elements, including root-specific elements, which may be required for gene expression in vascular tissues. Research on the activities of this promoter is continuing, with the development of plant expression cassettes for transformation into major African elite lines and farmers' preferred cassava cultivars to enable testing of tissue-specific expression patterns in the field.

  15. Epigenetic modification of the oxytocin receptor gene influences the perception of anger and fear in the human brain

    PubMed Central

    Puglia, Meghan H.; Lillard, Travis S.; Morris, James P.; Connelly, Jessica J.

    2015-01-01

    In humans, the neuropeptide oxytocin plays a critical role in social and emotional behavior. The actions of this molecule are dependent on a protein that acts as its receptor, which is encoded by the oxytocin receptor gene (OXTR). DNA methylation of OXTR, an epigenetic modification, directly influences gene transcription and is variable in humans. However, the impact of this variability on specific social behaviors is unknown. We hypothesized that variability in OXTR methylation impacts social perceptual processes often linked with oxytocin, such as perception of facial emotions. Using an imaging epigenetic approach, we established a relationship between OXTR methylation and neural activity in response to emotional face processing. Specifically, high levels of OXTR methylation were associated with greater amounts of activity in regions associated with face and emotion processing including amygdala, fusiform, and insula. Importantly, we found that these higher levels of OXTR methylation were also associated with decreased functional coupling of amygdala with regions involved in affect appraisal and emotion regulation. These data indicate that the human endogenous oxytocin system is involved in attenuation of the fear response, corroborating research implicating intranasal oxytocin in the same processes. Our findings highlight the importance of including epigenetic mechanisms in the description of the endogenous oxytocin system and further support a central role for oxytocin in social cognition. This approach linking epigenetic variability with neural endophenotypes may broadly explain individual differences in phenotype including susceptibility or resilience to disease. PMID:25675509

  16. QuASAR-MPRA: accurate allele-specific analysis for massively parallel reporter assays.

    PubMed

    Kalita, Cynthia A; Moyerbrailean, Gregory A; Brown, Christopher; Wen, Xiaoquan; Luca, Francesca; Pique-Regi, Roger

    2018-03-01

    The majority of the human genome is composed of non-coding regions containing regulatory elements such as enhancers, which are crucial for controlling gene expression. Many variants associated with complex traits are in these regions, and may disrupt gene regulatory sequences. Consequently, it is important to not only identify true enhancers but also to test if a variant within an enhancer affects gene regulation. Recently, allele-specific analysis in high-throughput reporter assays, such as massively parallel reporter assays (MPRAs), have been used to functionally validate non-coding variants. However, we are still missing high-quality and robust data analysis tools for these datasets. We have further developed our method for allele-specific analysis QuASAR (quantitative allele-specific analysis of reads) to analyze allele-specific signals in barcoded read counts data from MPRA. Using this approach, we can take into account the uncertainty on the original plasmid proportions, over-dispersion, and sequencing errors. The provided allelic skew estimate and its standard error also simplifies meta-analysis of replicate experiments. Additionally, we show that a beta-binomial distribution better models the variability present in the allelic imbalance of these synthetic reporters and results in a test that is statistically well calibrated under the null. Applying this approach to the MPRA data, we found 602 SNPs with significant (false discovery rate 10%) allele-specific regulatory function in LCLs. We also show that we can combine MPRA with QuASAR estimates to validate existing experimental and computational annotations of regulatory variants. Our study shows that with appropriate data analysis tools, we can improve the power to detect allelic effects in high-throughput reporter assays. http://github.com/piquelab/QuASAR/tree/master/mpra. fluca@wayne.edu or rpique@wayne.edu. Supplementary data are available online at Bioinformatics. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  17. Highly specific targeting of the TMPRSS2/ERG fusion gene using liposomal nanovectors

    PubMed Central

    Shao, Longjiang; Tekedereli, Ibrahim; Wang, Jianghua; Yuca, Erkan; Tsang, Susan; Sood, Anil; Lopez-Berestein, Gabriel; Ozpolat, Bulent; Ittmann, Michael

    2012-01-01

    Purpose The TMPRSS2/ERG (T/E) fusion gene is present in half of all prostate cancer (PCa) tumors. Fusion of the oncogenic ERG gene with the androgen-regulated TMPRSS2 gene promoter results in expression of fusion mRNAs in PCa cells. The junction of theTMPRSS2 and ERG derived portions of the fusion mRNA constitutes a cancer specific target in cells containing the T/E fusion gene. Targeting the most common alternatively spliced fusion gene mRNA junctional isoforms in vivo using siRNAs in liposomal nanovectors may potentially be a novel, low toxicity treatment for PCa. Experimental Design We designed and optimized siRNAs targeting the two most common T/E fusion gene mRNA junctional isoforms (Type III or Type VI). Specificity of siRNAs was assessed by transient co-transfection in vitro. To test their ability to inhibit growth of PCa cells expressing these fusion gene isoforms in vivo, specific siRNAs in liposomal nanovectors were used to treat mice bearing orthotopic or subcutaneous xenograft tumors expressing the targeted fusion isoforms. Results The targeting siRNAs were both potent and highly specific in vitro. In vivo they significantly inhibited tumor growth. The degree of growth inhibition was variable and was correlated with the extent of fusion gene knockdown. The growth inhibition was associated with marked inhibition of angiogenesis and, to a lesser degree, proliferation and a marked increase in apoptosis of tumor cells. No toxicity was observed. Conclusions Targeting the T/E fusion junction in vivo with specific siRNAs delivered via liposomal nanovectors is a promising therapy for men with PCa. PMID:23052253

  18. Highly specific targeting of the TMPRSS2/ERG fusion gene using liposomal nanovectors.

    PubMed

    Shao, Longjiang; Tekedereli, Ibrahim; Wang, Jianghua; Yuca, Erkan; Tsang, Susan; Sood, Anil; Lopez-Berestein, Gabriel; Ozpolat, Bulent; Ittmann, Michael

    2012-12-15

    The TMPRSS2/ERG (T/E) fusion gene is present in half of all prostate cancer tumors. Fusion of the oncogenic ERG gene with the androgen-regulated TMPRSS2 gene promoter results in expression of fusion mRNAs in prostate cancer cells. The junction of theTMPRSS2- and ERG-derived portions of the fusion mRNA constitutes a cancer-specific target in cells containing the T/E fusion gene. Targeting the most common alternatively spliced fusion gene mRNA junctional isoforms in vivo using siRNAs in liposomal nanovectors may potentially be a novel, low-toxicity treatment for prostate cancer. We designed and optimized siRNAs targeting the two most common T/E fusion gene mRNA junctional isoforms (type III or type VI). Specificity of siRNAs was assessed by transient co-transfection in vitro. To test their ability to inhibit growth of prostate cancer cells expressing these fusion gene isoforms in vivo, specific siRNAs in liposomal nanovectors were used to treat mice bearing orthotopic or subcutaneous xenograft tumors expressing the targeted fusion isoforms. The targeting siRNAs were both potent and highly specific in vitro. In vivo they significantly inhibited tumor growth. The degree of growth inhibition was variable and was correlated with the extent of fusion gene knockdown. The growth inhibition was associated with marked inhibition of angiogenesis and, to a lesser degree, proliferation and a marked increase in apoptosis of tumor cells. No toxicity was observed. Targeting the T/E fusion junction in vivo with specific siRNAs delivered via liposomal nanovectors is a promising therapy for men with prostate cancer. ©2012 AACR.

  19. The genome of melon (Cucumis melo L.)

    PubMed Central

    Garcia-Mas, Jordi; Benjak, Andrej; Sanseverino, Walter; Bourgeois, Michael; Mir, Gisela; González, Víctor M.; Hénaff, Elizabeth; Câmara, Francisco; Cozzuto, Luca; Lowy, Ernesto; Alioto, Tyler; Capella-Gutiérrez, Salvador; Blanca, Jose; Cañizares, Joaquín; Ziarsolo, Pello; Gonzalez-Ibeas, Daniel; Rodríguez-Moreno, Luis; Droege, Marcus; Du, Lei; Alvarez-Tejado, Miguel; Lorente-Galdos, Belen; Melé, Marta; Yang, Luming; Weng, Yiqun; Navarro, Arcadi; Marques-Bonet, Tomas; Aranda, Miguel A.; Nuez, Fernando; Picó, Belén; Gabaldón, Toni; Roma, Guglielmo; Guigó, Roderic; Casacuberta, Josep M.; Arús, Pere; Puigdomènech, Pere

    2012-01-01

    We report the genome sequence of melon, an important horticultural crop worldwide. We assembled 375 Mb of the double-haploid line DHL92, representing 83.3% of the estimated melon genome. We predicted 27,427 protein-coding genes, which we analyzed by reconstructing 22,218 phylogenetic trees, allowing mapping of the orthology and paralogy relationships of sequenced plant genomes. We observed the absence of recent whole-genome duplications in the melon lineage since the ancient eudicot triplication, and our data suggest that transposon amplification may in part explain the increased size of the melon genome compared with the close relative cucumber. A low number of nucleotide-binding site–leucine-rich repeat disease resistance genes were annotated, suggesting the existence of specific defense mechanisms in this species. The DHL92 genome was compared with that of its parental lines allowing the quantification of sequence variability in the species. The use of the genome sequence in future investigations will facilitate the understanding of evolution of cucurbits and the improvement of breeding strategies. PMID:22753475

  20. Asialoglycoprotein receptor 1 is a specific cell-surface marker for isolating hepatocytes derived from human pluripotent stem cells

    PubMed Central

    Peters, Derek T.; Henderson, Christopher A.; Warren, Curtis R.; Friesen, Max; Xia, Fang; Becker, Caroline E.; Musunuru, Kiran; Cowan, Chad A.

    2016-01-01

    ABSTRACT Hepatocyte-like cells (HLCs) are derived from human pluripotent stem cells (hPSCs) in vitro, but differentiation protocols commonly give rise to a heterogeneous mixture of cells. This variability confounds the evaluation of in vitro functional assays performed using HLCs. Increased differentiation efficiency and more accurate approximation of the in vivo hepatocyte gene expression profile would improve the utility of hPSCs. Towards this goal, we demonstrate the purification of a subpopulation of functional HLCs using the hepatocyte surface marker asialoglycoprotein receptor 1 (ASGR1). We analyzed the expression profile of ASGR1-positive cells by microarray, and tested their ability to perform mature hepatocyte functions (albumin and urea secretion, cytochrome activity). By these measures, ASGR1-positive HLCs are enriched for the gene expression profile and functional characteristics of primary hepatocytes compared with unsorted HLCs. We have demonstrated that ASGR1-positive sorting isolates a functional subpopulation of HLCs from among the heterogeneous cellular population produced by directed differentiation. PMID:27143754

  1. Plankton networks driving carbon export in the oligotrophic ocean

    NASA Astrophysics Data System (ADS)

    Guidi, L.; Chaffron, S.; Bittner, L.; Eveillard, D.; Raes, J.; Karsenti, E.; Bowler, C.; Gorsky, G.

    2016-02-01

    The biological carbon pump is the process by which CO2 is transformed to organic carbon via photosynthesis that sinks to the deep ocean as particles where it is sequestered. While the intensity of the pump correlates with plankton community composition, the underlying ecosystem structure and interactions driving the process remain largely uncharacterised. Here we use environmental and metagenomic data gathered during the Tara Oceans expedition to improve our understanding of the underlying processes. We show that specific plankton communities correlate with carbon export and highlight unexpected and overlooked taxa such as Radiolaria, alveolate parasites, as well as Synechococcus and their phages, as lineages most strongly associated with carbon export in the subtropical oligotrophic ocean. Additionally, we show that the relative abundance of just a few bacterial and viral genes can predict most of the variability in carbon export in these regions. Together these results help elucidate ecosystem drivers of the biological carbon pump and present a case study for scaling from genes-to-ecosystems.

  2. Determination of allergen specificity by heavy chains in grass pollen allergen-specific IgE antibodies.

    PubMed

    Gadermaier, Elisabeth; Flicker, Sabine; Lupinek, Christian; Steinberger, Peter; Valenta, Rudolf

    2013-04-01

    Affinity and clonality of allergen-specific IgE antibodies are important determinants for the magnitude of IgE-mediated allergic inflammation. We sought to analyze the contribution of heavy and light chains of human allergen-specific IgE antibodies for allergen specificity and to test whether promiscuous pairing of heavy and light chains with different allergen specificity allows binding and might affect affinity. Ten IgE Fabs specific for 3 non-cross-reactive major timothy grass pollen allergens (Phl p 1, Phl p 2, and Phl p 5) obtained by means of combinatorial cloning from patients with grass pollen allergy were used to construct stable recombinant single chain variable fragments (ScFvs) representing the original Fabs and shuffled ScFvs in which heavy chains were recombined with light chains from IgE Fabs with specificity for other allergens by using the pCANTAB 5 E expression system. Possible ancestor genes for the heavy chain and light chain variable region-encoding genes were determined by using sequence comparison with the ImMunoGeneTics database, and their chromosomal locations were determined. Recombinant ScFvs were tested for allergen specificity and epitope recognition by means of direct and sandwich ELISA, and affinity by using surface plasmon resonance experiments. The shuffling experiments demonstrate that promiscuous pairing of heavy and light chains is possible and maintains allergen specificity, which is mainly determined by the heavy chains. ScFvs consisting of different heavy and light chains exhibited different affinities and even epitope specificity for the corresponding allergen. Our results indicate that allergen specificity of allergen-specific IgE is mainly determined by the heavy chains. Different heavy and light chain pairings in allergen-specific IgE antibodies affect affinity and epitope specificity and thus might influence clinical reactivity to allergens. Copyright © 2012 American Academy of Allergy, Asthma & Immunology. Published by Mosby, Inc. All rights reserved.

  3. Expression of single-chain Fv gene specific for gamma-seminoprotein by RTS and its biological activity identification.

    PubMed

    Han, Yuedong; Haun, Yi; Deng, Jinlan; Gao, Feng; Pan, Bifeng; Cui, Daxiang

    2006-01-01

    Fabricating a single-chain variable fragment specific for human seminoprotein is very important in antibody-directed enzyme prodrug therapy and NMR imaging for prostate cancer. Here a single-chain Fv specific for gamma-seminoprotein was expressed by RTS. Its activity and the efficiency of entry into prostate cancer cells are investigated by immunoprecipitation and Western blotting and immunofluorescent staining, as well as entry of conjugated magnetic beads into cells. Results showed that ScFv peptides specific for gamma-seminoprotein were successfully prepared, which can bind with the prostate cells specifically and can bring magnetic beads into prostate cancer cells within 15 min, the amount of magnetic beads inside prostate cancer cells increased as the culture time prolonged. ScFv-conjugated magnetic beads did not enter into control cells. In conclusion, the ScFv peptide against human gamma-seminoprotein with biological activity was successfully fabricated, which can take magnetic beads to prostate cancer cells specifically and not to the control cells. This ScFv peptide against human gamma-seminoprotein should be useful in improving the detection and therapy of prostate cancer at early stages and NMR imaging.

  4. The Variable Regions of Lactobacillus rhamnosus Genomes Reveal the Dynamic Evolution of Metabolic and Host-Adaptation Repertoires

    PubMed Central

    Ceapa, Corina; Davids, Mark; Ritari, Jarmo; Lambert, Jolanda; Wels, Michiel; Douillard, François P.; Smokvina, Tamara; de Vos, Willem M.; Knol, Jan; Kleerebezem, Michiel

    2016-01-01

    Lactobacillus rhamnosus is a diverse Gram-positive species with strains isolated from different ecological niches. Here, we report the genome sequence analysis of 40 diverse strains of L. rhamnosus and their genomic comparison, with a focus on the variable genome. Genomic comparison of 40 L. rhamnosus strains discriminated the conserved genes (core genome) and regions of plasticity involving frequent rearrangements and horizontal transfer (variome). The L. rhamnosus core genome encompasses 2,164 genes, out of 4,711 genes in total (the pan-genome). The accessory genome is dominated by genes encoding carbohydrate transport and metabolism, extracellular polysaccharides (EPS) biosynthesis, bacteriocin production, pili production, the cas system, and the associated clustered regularly interspaced short palindromic repeat (CRISPR) loci, and more than 100 transporter functions and mobile genetic elements like phages, plasmid genes, and transposons. A clade distribution based on amino acid differences between core (shared) proteins matched with the clade distribution obtained from the presence–absence of variable genes. The phylogenetic and variome tree overlap indicated that frequent events of gene acquisition and loss dominated the evolutionary segregation of the strains within this species, which is paralleled by evolutionary diversification of core gene functions. The CRISPR-Cas system could have contributed to this evolutionary segregation. Lactobacillus rhamnosus strains contain the genetic and metabolic machinery with strain-specific gene functions required to adapt to a large range of environments. A remarkable congruency of the evolutionary relatedness of the strains’ core and variome functions, possibly favoring interspecies genetic exchanges, underlines the importance of gene-acquisition and loss within the L. rhamnosus strain diversification. PMID:27358423

  5. Biocatalytic Conversion of Avermectin to 4″-Oxo-Avermectin: Improvement of Cytochrome P450 Monooxygenase Specificity by Directed Evolution▿ †

    PubMed Central

    Trefzer, Axel; Jungmann, Volker; Molnár, István; Botejue, Ajit; Buckel, Dagmar; Frey, Gerhard; Hill, D. Steven; Jörg, Mario; Ligon, James M.; Mason, Dylan; Moore, David; Pachlatko, J. Paul; Richardson, Toby H.; Spangenberg, Petra; Wall, Mark A.; Zirkle, Ross; Stege, Justin T.

    2007-01-01

    Discovery of the CYP107Z subfamily of cytochrome P450 oxidases (CYPs) led to an alternative biocatalytic synthesis of 4″-oxo-avermectin, a key intermediate for the commercial production of the semisynthetic insecticide emamectin. However, under industrial process conditions, these wild-type CYPs showed lower yields due to side product formation. Molecular evolution employing GeneReassembly was used to improve the regiospecificity of these enzymes by a combination of random mutagenesis, protein structure-guided site-directed mutagenesis, and recombination of multiple natural and synthetic CYP107Z gene fragments. To assess the specificity of CYP mutants, a miniaturized, whole-cell biocatalytic reaction system that allowed high-throughput screening of large numbers of variants was developed. In an iterative process consisting of four successive rounds of GeneReassembly evolution, enzyme variants with significantly improved specificity for the production of 4″-oxo-avermectin were identified; these variants could be employed for a more economical industrial biocatalytic process to manufacture emamectin. PMID:17483257

  6. Synthetic spike-in standards for high-throughput 16S rRNA gene amplicon sequencing

    PubMed Central

    Tourlousse, Dieter M.; Yoshiike, Satowa; Ohashi, Akiko; Matsukura, Satoko; Noda, Naohiro

    2017-01-01

    Abstract High-throughput sequencing of 16S rRNA gene amplicons (16S-seq) has become a widely deployed method for profiling complex microbial communities but technical pitfalls related to data reliability and quantification remain to be fully addressed. In this work, we have developed and implemented a set of synthetic 16S rRNA genes to serve as universal spike-in standards for 16S-seq experiments. The spike-ins represent full-length 16S rRNA genes containing artificial variable regions with negligible identity to known nucleotide sequences, permitting unambiguous identification of spike-in sequences in 16S-seq read data from any microbiome sample. Using defined mock communities and environmental microbiota, we characterized the performance of the spike-in standards and demonstrated their utility for evaluating data quality on a per-sample basis. Further, we showed that staggered spike-in mixtures added at the point of DNA extraction enable concurrent estimation of absolute microbial abundances suitable for comparative analysis. Results also underscored that template-specific Illumina sequencing artifacts may lead to biases in the perceived abundance of certain taxa. Taken together, the spike-in standards represent a novel bioanalytical tool that can substantially improve 16S-seq-based microbiome studies by enabling comprehensive quality control along with absolute quantification. PMID:27980100

  7. Targeted polymeric nanoparticles for cancer gene therapy

    PubMed Central

    Kim, Jayoung; Wilson, David R.; Zamboni, Camila G.; Green, Jordan J.

    2015-01-01

    In this article, advances in designing polymeric nanoparticles for targeted cancer gene therapy are reviewed. Characterization and evaluation of biomaterials, targeting ligands, and transcriptional elements are each discussed. Advances in biomaterials have driven improvements to nanoparticle stability and tissue targeting, conjugation of ligands to the surface of polymeric nanoparticles enable binding to specific cancer cells, and the design of transcriptional elements has enabled selective DNA expression specific to the cancer cells. Together, these features have improved the performance of polymeric nanoparticles as targeted non-viral gene delivery vectors to treat cancer. As polymeric nanoparticles can be designed to be biodegradable, non-toxic, and to have reduced immunogenicity and tumorigenicity compared to viral platforms, they have significant potential for clinical use. Results of polymeric gene therapy in clinical trials and future directions for the engineering of nanoparticle systems for targeted cancer gene therapy are also presented. PMID:26061296

  8. Genome Comparison of Human and Non-Human Malaria Parasites Reveals Species Subset-Specific Genes Potentially Linked to Human Disease

    PubMed Central

    Frech, Christian; Chen, Nansheng

    2011-01-01

    Genes underlying important phenotypic differences between Plasmodium species, the causative agents of malaria, are frequently found in only a subset of species and cluster at dynamically evolving subtelomeric regions of chromosomes. We hypothesized that chromosome-internal regions of Plasmodium genomes harbour additional species subset-specific genes that underlie differences in human pathogenicity, human-to-human transmissibility, and human virulence. We combined sequence similarity searches with synteny block analyses to identify species subset-specific genes in chromosome-internal regions of six published Plasmodium genomes, including Plasmodium falciparum, Plasmodium vivax, Plasmodium knowlesi, Plasmodium yoelii, Plasmodium berghei, and Plasmodium chabaudi. To improve comparative analysis, we first revised incorrectly annotated gene models using homology-based gene finders and examined putative subset-specific genes within syntenic contexts. Confirmed subset-specific genes were then analyzed for their role in biological pathways and examined for molecular functions using publicly available databases. We identified 16 genes that are well conserved in the three primate parasites but not found in rodent parasites, including three key enzymes of the thiamine (vitamin B1) biosynthesis pathway. Thirteen genes were found to be present in both human parasites but absent in the monkey parasite P. knowlesi, including genes specifically upregulated in sporozoites or gametocytes that could be linked to parasite transmission success between humans. Furthermore, we propose 15 chromosome-internal P. falciparum-specific genes as new candidate genes underlying increased human virulence and detected a currently uncharacterized cluster of P. vivax-specific genes on chromosome 6 likely involved in erythrocyte invasion. In conclusion, Plasmodium species harbour many chromosome-internal differences in the form of protein-coding genes, some of which are potentially linked to human disease and thus promising leads for future laboratory research. PMID:22215999

  9. Variability in PAH-DNA adduct measurements in peripheral mononuclear cells: implications for quantitative cancer risk assessment.

    PubMed

    Dickey, C; Santella, R M; Hattis, D; Tang, D; Hsu, Y; Cooper, T; Young, T L; Perera, F P

    1997-10-01

    Biomarkers such as DNA adducts have significant potential to improve quantitative risk assessment by characterizing individual differences in metabolism of genotoxins and DNA repair and accounting for some of the factors that could affect interindividual variation in cancer risk. Inherent uncertainty in laboratory measurements and within-person variability of DNA adduct levels over time are putatively unrelated to cancer risk and should be subtracted from observed variation to better estimate interindividual variability of response to carcinogen exposure. A total of 41 volunteers, both smokers and nonsmokers, were asked to provide a peripheral blood sample every 3 weeks for several months in order to specifically assess intraindividual variability of polycyclic aromatic hydrocarbon (PAH)-DNA adduct levels. The intraindividual variance in PAH-DNA adduct levels, together with measurement uncertainty (laboratory variability and unaccounted for differences in exposure), constituted roughly 30% of the overall variance. An estimated 70% of the total variance was contributed by interindividual variability and is probably representative of the true biologic variability of response to carcinogenic exposure in lymphocytes. The estimated interindividual variability in DNA damage after subtracting intraindividual variability and measurement uncertainty was 24-fold. Inter-individual variance was higher (52-fold) in persons who constitutively lack the Glutathione S-Transferase M1 (GSTM1) gene which is important in the detoxification pathway of PAH. Risk assessment models that do not consider the variability of susceptibility to DNA damage following carcinogen exposure may underestimate risks to the general population, especially for those people who are most vulnerable.

  10. Primer sets for cloning the human repertoire of T cell Receptor Variable regions

    PubMed Central

    Boria, Ilenia; Cotella, Diego; Dianzani, Irma; Santoro, Claudio; Sblattero, Daniele

    2008-01-01

    Background Amplification and cloning of naïve T cell Receptor (TR) repertoires or antigen-specific TR is crucial to shape immune response and to develop immuno-based therapies. TR variable (V) regions are encoded by several genes that recombine during T cell development. The cloning of expressed genes as large diverse libraries from natural sources relies upon the availability of primers able to amplify as many V genes as possible. Results Here, we present a list of primers computationally designed on all functional TR V and J genes listed in the IMGT®, the ImMunoGeneTics information system®. The list consists of unambiguous or degenerate primers suitable to theoretically amplify and clone the entire TR repertoire. We show that it is possible to selectively amplify and clone expressed TR V genes in one single RT-PCR step and from as little as 1000 cells. Conclusion This new primer set will facilitate the creation of more diverse TR libraries than has been possible using currently available primer sets. PMID:18759974

  11. Systems Biophysics of Gene Expression

    PubMed Central

    Vilar, Jose M.G.; Saiz, Leonor

    2013-01-01

    Gene expression is a process central to any form of life. It involves multiple temporal and functional scales that extend from specific protein-DNA interactions to the coordinated regulation of multiple genes in response to intracellular and extracellular changes. This diversity in scales poses fundamental challenges to the use of traditional approaches to fully understand even the simplest gene expression systems. Recent advances in computational systems biophysics have provided promising avenues to reliably integrate the molecular detail of biophysical process into the system behavior. Here, we review recent advances in the description of gene regulation as a system of biophysical processes that extend from specific protein-DNA interactions to the combinatorial assembly of nucleoprotein complexes. There is now basic mechanistic understanding on how promoters controlled by multiple, local and distal, DNA binding sites for transcription factors can actively control transcriptional noise, cell-to-cell variability, and other properties of gene regulation, including precision and flexibility of the transcriptional responses. PMID:23790365

  12. Distinct gene-specific mechanisms of arrhythmia revealed by cardiac gene transfer of two long QT disease genes, HERG and KCNE1.

    PubMed

    Hoppe, U C; Marbán, E; Johns, D C

    2001-04-24

    The long QT syndrome (LQTS) is a heritable disorder that predisposes to sudden cardiac death. LQTS is caused by mutations in ion channel genes including HERG and KCNE1, but the precise mechanisms remain unclear. To clarify this situation we injected adenoviral vectors expressing wild-type or LQT mutants of HERG and KCNE1 into guinea pig myocardium. End points at 48-72 h included electrophysiology in isolated myocytes and electrocardiography in vivo. HERG increased the rapid component, I(Kr), of the delayed rectifier current, thereby accelerating repolarization, increasing refractoriness, and diminishing beat-to-beat action potential variability. Conversely, HERG-G628S suppressed I(Kr) without significantly delaying repolarization. Nevertheless, HERG-G628S abbreviated refractoriness and increased beat-to-beat variability, leading to early afterdepolarizations (EADs). KCNE1 increased the slow component of the delayed rectifier, I(Ks), without clear phenotypic sequelae. In contrast, KCNE1-D76N suppressed I(Ks) and markedly slowed repolarization, leading to frequent EADs and electrocardiographic QT prolongation. Thus, the two genes predispose to sudden death by distinct mechanisms: the KCNE1 mutant flagrantly undermines cardiac repolarization, and HERG-G628S subtly facilitates the genesis and propagation of premature beats. Our ability to produce electrocardiographic long QT in vivo with a clinical KCNE1 mutation demonstrates the utility of somatic gene transfer in creating genotype-specific disease models.

  13. Noise in gene expression is coupled to growth rate.

    PubMed

    Keren, Leeat; van Dijk, David; Weingarten-Gabbay, Shira; Davidi, Dan; Jona, Ghil; Weinberger, Adina; Milo, Ron; Segal, Eran

    2015-12-01

    Genetically identical cells exposed to the same environment display variability in gene expression (noise), with important consequences for the fidelity of cellular regulation and biological function. Although population average gene expression is tightly coupled to growth rate, the effects of changes in environmental conditions on expression variability are not known. Here, we measure the single-cell expression distributions of approximately 900 Saccharomyces cerevisiae promoters across four environmental conditions using flow cytometry, and find that gene expression noise is tightly coupled to the environment and is generally higher at lower growth rates. Nutrient-poor conditions, which support lower growth rates, display elevated levels of noise for most promoters, regardless of their specific expression values. We present a simple model of noise in expression that results from having an asynchronous population, with cells at different cell-cycle stages, and with different partitioning of the cells between the stages at different growth rates. This model predicts non-monotonic global changes in noise at different growth rates as well as overall higher variability in expression for cell-cycle-regulated genes in all conditions. The consistency between this model and our data, as well as with noise measurements of cells growing in a chemostat at well-defined growth rates, suggests that cell-cycle heterogeneity is a major contributor to gene expression noise. Finally, we identify gene and promoter features that play a role in gene expression noise across conditions. Our results show the existence of growth-related global changes in gene expression noise and suggest their potential phenotypic implications. © 2015 Keren et al.; Published by Cold Spring Harbor Laboratory Press.

  14. Noise in gene expression is coupled to growth rate

    PubMed Central

    Keren, Leeat; van Dijk, David; Weingarten-Gabbay, Shira; Davidi, Dan; Jona, Ghil; Weinberger, Adina; Milo, Ron; Segal, Eran

    2015-01-01

    Genetically identical cells exposed to the same environment display variability in gene expression (noise), with important consequences for the fidelity of cellular regulation and biological function. Although population average gene expression is tightly coupled to growth rate, the effects of changes in environmental conditions on expression variability are not known. Here, we measure the single-cell expression distributions of approximately 900 Saccharomyces cerevisiae promoters across four environmental conditions using flow cytometry, and find that gene expression noise is tightly coupled to the environment and is generally higher at lower growth rates. Nutrient-poor conditions, which support lower growth rates, display elevated levels of noise for most promoters, regardless of their specific expression values. We present a simple model of noise in expression that results from having an asynchronous population, with cells at different cell-cycle stages, and with different partitioning of the cells between the stages at different growth rates. This model predicts non-monotonic global changes in noise at different growth rates as well as overall higher variability in expression for cell-cycle–regulated genes in all conditions. The consistency between this model and our data, as well as with noise measurements of cells growing in a chemostat at well-defined growth rates, suggests that cell-cycle heterogeneity is a major contributor to gene expression noise. Finally, we identify gene and promoter features that play a role in gene expression noise across conditions. Our results show the existence of growth-related global changes in gene expression noise and suggest their potential phenotypic implications. PMID:26355006

  15. Frequency of Fanconi anemia in Brazil and efficacy of screening for the FANCA 3788-3790del mutation.

    PubMed

    Magdalena, N; Pilonetto, D V; Bitencourt, M A; Pereira, N F; Ribeiro, R C; Jeng, M; Pasquini, R

    2005-05-01

    Fanconi anemia (FA) is an autosomal recessive genetic disease characterized by progressive bone marrow failure, susceptibility to cancer and multiple congenital anomalies. There is important clinical variability among patients and the knowledge of factors which might predict outcome would greatly help the decision making regarding the choices of treatment and the appropriate time to start it. Future studies of the possible correlation between specific mutations with specific clinical presentations will provide the answer to one of these factors. At our Center we standardized a rapid and precise screening test using a mismatch PCR assay for a specific mutation (3788-3790del in exon 38 of gene FANCA) in Brazilian FA patients. We present the results obtained after screening 80 non-consanguineous FA patients referred from all regions of Brazil with a clinical diagnosis of FA supported by cellular hypersensitivity to diepoxybutane. We were able to detect the 3788-3790del allele in 24 of the 80 (30%) FA patients studied. Thirteen of the 80 (16.25%) were homozygotes and 11 of the 80 (13.75%) were compound heterozygotes, thus confirming the high frequency of the FANCA 3788-3790del mutation in Brazilian FA patients. The identification of patients with specific mutations in the FA genes may lead to a better clinical description of this condition, also providing data for genotype-phenotype correlations, to a better understanding of the interaction of this specific mutation with other mutations in compound heterozygote patients, and ultimately to the right choices of treatment for each patient with improvement of the prognosis on future studies.

  16. Functional analysis and transcriptional output of the Göttingen minipig genome.

    PubMed

    Heckel, Tobias; Schmucki, Roland; Berrera, Marco; Ringshandl, Stephan; Badi, Laura; Steiner, Guido; Ravon, Morgane; Küng, Erich; Kuhn, Bernd; Kratochwil, Nicole A; Schmitt, Georg; Kiialainen, Anna; Nowaczyk, Corinne; Daff, Hamina; Khan, Azinwi Phina; Lekolool, Isaac; Pelle, Roger; Okoth, Edward; Bishop, Richard; Daubenberger, Claudia; Ebeling, Martin; Certa, Ulrich

    2015-11-14

    In the past decade the Göttingen minipig has gained increasing recognition as animal model in pharmaceutical and safety research because it recapitulates many aspects of human physiology and metabolism. Genome-based comparison of drug targets together with quantitative tissue expression analysis allows rational prediction of pharmacology and cross-reactivity of human drugs in animal models thereby improving drug attrition which is an important challenge in the process of drug development. Here we present a new chromosome level based version of the Göttingen minipig genome together with a comparative transcriptional analysis of tissues with pharmaceutical relevance as basis for translational research. We relied on mapping and assembly of WGS (whole-genome-shotgun sequencing) derived reads to the reference genome of the Duroc pig and predict 19,228 human orthologous protein-coding genes. Genome-based prediction of the sequence of human drug targets enables the prediction of drug cross-reactivity based on conservation of binding sites. We further support the finding that the genome of Sus scrofa contains about ten-times less pseudogenized genes compared to other vertebrates. Among the functional human orthologs of these minipig pseudogenes we found HEPN1, a putative tumor suppressor gene. The genomes of Sus scrofa, the Tibetan boar, the African Bushpig, and the Warthog show sequence conservation of all inactivating HEPN1 mutations suggesting disruption before the evolutionary split of these pig species. We identify 133 Sus scrofa specific, conserved long non-coding RNAs (lncRNAs) in the minipig genome and show that these transcripts are highly conserved in the African pigs and the Tibetan boar suggesting functional significance. Using a new minipig specific microarray we show high conservation of gene expression signatures in 13 tissues with biomedical relevance between humans and adult minipigs. We underline this relationship for minipig and human liver where we could demonstrate similar expression levels for most phase I drug-metabolizing enzymes. Higher expression levels and metabolic activities were found for FMO1, AKR/CRs and for phase II drug metabolizing enzymes in minipig as compared to human. The variability of gene expression in equivalent human and minipig tissues is considerably higher in minipig organs, which is important for study design in case a human target belongs to this variable category in the minipig. The first analysis of gene expression in multiple tissues during development from young to adult shows that the majority of transcriptional programs are concluded four weeks after birth. This finding is in line with the advanced state of human postnatal organ development at comparative age categories and further supports the minipig as model for pediatric drug safety studies. Genome based assessment of sequence conservation combined with gene expression data in several tissues improves the translational value of the minipig for human drug development. The genome and gene expression data presented here are important resources for researchers using the minipig as model for biomedical research or commercial breeding. Potential impact of our data for comparative genomics, translational research, and experimental medicine are discussed.

  17. Inhibition of histone deacetylation and DNA methylation improves gene expression mediated by the adeno-associated virus/phage in cancer cells.

    PubMed

    Kia, Azadeh; Yata, Teerapong; Hajji, Nabil; Hajitou, Amin

    2013-10-22

    Bacteriophage (phage), viruses that infect bacteria only, have become promising vectors for targeted systemic delivery of genes to cancer, although, with poor efficiency. We previously designed an improved phage vector by incorporating cis genetic elements of adeno-associated virus (AAV). This novel AAV/phage hybrid (AAVP) specifically targeted systemic delivery of therapeutic genes into tumors. To advance the AAVP vector, we recently introduced the stress-inducible Grp78 tumor specific promoter and found that this dual tumor-targeted AAVP provides persistent gene expression, over time, in cancer cells compared to silenced gene expression from the CMV promoter in the parental AAVP. Herein, we investigated the effect of histone deacetylation and DNA methylation on AAVP-mediated gene expression in cancer cells and explored the effect of cell confluence state on AAVP gene expression efficacy. Using a combination of AAVP expressing the GFP reporter gene, flow cytometry, inhibitors of histone deacetylation, and DNA methylation, we have demonstrated that histone deacetylation and DNA methylation are associated with silencing of gene expression from the CMV promoter in the parental AAVP. Importantly, inhibitors of histone deacetylases boost gene expression in cancer cells from the Grp78 promoter in the dual tumor-targeted AAVP. However, cell confluence had no effect on AAVP-guided gene expression. Our findings prove that combination of histone deacetylase inhibitor drugs with the Grp78 promoter is an effective approach to improve AAVP-mediated gene expression in cancer cells and should be considered for AAVP-based clinical cancer gene therapy.

  18. Improved accuracy of supervised CRM discovery with interpolated Markov models and cross-species comparison.

    PubMed

    Kazemian, Majid; Zhu, Qiyun; Halfon, Marc S; Sinha, Saurabh

    2011-12-01

    Despite recent advances in experimental approaches for identifying transcriptional cis-regulatory modules (CRMs, 'enhancers'), direct empirical discovery of CRMs for all genes in all cell types and environmental conditions is likely to remain an elusive goal. Effective methods for computational CRM discovery are thus a critically needed complement to empirical approaches. However, existing computational methods that search for clusters of putative binding sites are ineffective if the relevant TFs and/or their binding specificities are unknown. Here, we provide a significantly improved method for 'motif-blind' CRM discovery that does not depend on knowledge or accurate prediction of TF-binding motifs and is effective when limited knowledge of functional CRMs is available to 'supervise' the search. We propose a new statistical method, based on 'Interpolated Markov Models', for motif-blind, genome-wide CRM discovery. It captures the statistical profile of variable length words in known CRMs of a regulatory network and finds candidate CRMs that match this profile. The method also uses orthologs of the known CRMs from closely related genomes. We perform in silico evaluation of predicted CRMs by assessing whether their neighboring genes are enriched for the expected expression patterns. This assessment uses a novel statistical test that extends the widely used Hypergeometric test of gene set enrichment to account for variability in intergenic lengths. We find that the new CRM prediction method is superior to existing methods. Finally, we experimentally validate 12 new CRM predictions by examining their regulatory activity in vivo in Drosophila; 10 of the tested CRMs were found to be functional, while 6 of the top 7 predictions showed the expected activity patterns. We make our program available as downloadable source code, and as a plugin for a genome browser installed on our servers. © The Author(s) 2011. Published by Oxford University Press.

  19. Distinct skeletal muscle fiber characteristics and gene expression in diet-sensitive versus diet-resistant obesity.

    PubMed

    Gerrits, Martin F; Ghosh, Sujoy; Kavaslar, Nihan; Hill, Benjamin; Tour, Anastasia; Seifert, Erin L; Beauchamp, Brittany; Gorman, Shelby; Stuart, Joan; Dent, Robert; McPherson, Ruth; Harper, Mary-Ellen

    2010-08-01

    Inter-individual variability in weight gain and loss under energy surfeit and deficit conditions, respectively, are well recognized but poorly understood phenomena. We documented weight loss variability in an intensively supervised clinical weight loss program and assessed skeletal muscle gene expression and phenotypic characteristics related to variable response to a 900 kcal regimen. Matched pairs of healthy, diet-compliant, obese diet-sensitive (ODS) and diet-resistant (ODR) subjects were defined as those in the highest and lowest quintiles for weight loss rate. Physical activity energy expenditure was minimal and comparable. Following program completion and weight stabilization, skeletal muscle biopsies were obtained. Gene expression analysis of rectus femoris and vastus lateralis indicated upregulation of genes and gene sets involved in oxidative phosphorylation and glucose and fatty acid metabolism in ODS compared with ODR. In vastus lateralis, there was a higher proportion of oxidative (type I) fibers in ODS compared with ODR women and lean controls, fiber hypertrophy in ODS compared with ODR women and lean controls, and lower succinate dehydrogenase in oxidative and oxidative-glycolytic fibers in all obese compared with lean subjects. Intramuscular lipid content was generally higher in obese versus lean, and specifically higher in ODS vs. lean women. Altogether, our findings demonstrate differences in muscle gene expression and fiber composition related to clinical weight loss success.

  20. Distinct skeletal muscle fiber characteristics and gene expression in diet-sensitive versus diet-resistant obesity

    PubMed Central

    Gerrits, Martin F.; Ghosh, Sujoy; Kavaslar, Nihan; Hill, Benjamin; Tour, Anastasia; Seifert, Erin L.; Beauchamp, Brittany; Gorman, Shelby; Stuart, Joan; Dent, Robert; McPherson, Ruth; Harper, Mary-Ellen

    2010-01-01

    Inter-individual variability in weight gain and loss under energy surfeit and deficit conditions, respectively, are well recognized but poorly understood phenomena. We documented weight loss variability in an intensively supervised clinical weight loss program and assessed skeletal muscle gene expression and phenotypic characteristics related to variable response to a 900 kcal regimen. Matched pairs of healthy, diet-compliant, obese diet-sensitive (ODS) and diet-resistant (ODR) subjects were defined as those in the highest and lowest quintiles for weight loss rate. Physical activity energy expenditure was minimal and comparable. Following program completion and weight stabilization, skeletal muscle biopsies were obtained. Gene expression analysis of rectus femoris and vastus lateralis indicated upregulation of genes and gene sets involved in oxidative phosphorylation and glucose and fatty acid metabolism in ODS compared with ODR. In vastus lateralis, there was a higher proportion of oxidative (type I) fibers in ODS compared with ODR women and lean controls, fiber hypertrophy in ODS compared with ODR women and lean controls, and lower succinate dehydrogenase in oxidative and oxidative-glycolytic fibers in all obese compared with lean subjects. Intramuscular lipid content was generally higher in obese versus lean, and specifically higher in ODS vs. lean women. Altogether, our findings demonstrate differences in muscle gene expression and fiber composition related to clinical weight loss success. PMID:20332421

  1. Sex linkage, sex-specific selection, and the role of recombination in the evolution of sexually dimorphic gene expression.

    PubMed

    Connallon, Tim; Clark, Andrew G

    2010-12-01

    Sex-biased genes--genes that are differentially expressed within males and females--are nonrandomly distributed across animal genomes, with sex chromosomes and autosomes often carrying markedly different concentrations of male- and female-biased genes. These linkage patterns are often gene- and lineage-dependent, differing between functional genetic categories and between species. Although sex-specific selection is often hypothesized to shape the evolution of sex-linked and autosomal gene content, population genetics theory has yet to account for many of the gene- and lineage-specific idiosyncrasies emerging from the empirical literature. With the goal of improving the connection between evolutionary theory and a rapidly growing body of genome-wide empirical studies, we extend previous population genetics theory of sex-specific selection by developing and analyzing a biologically informed model that incorporates sex linkage, pleiotropy, recombination, and epistasis, factors that are likely to vary between genes and between species. Our results demonstrate that sex-specific selection and sex-specific recombination rates can generate, and are compatible with, the gene- and species-specific linkage patterns reported in the genomics literature. The theory suggests that sexual selection may strongly influence the architectures of animal genomes, as well as the chromosomal distribution of fixed substitutions underlying sexually dimorphic traits. © 2010 The Author(s). Evolution© 2010 The Society for the Study of Evolution.

  2. Tumor vessel-injuring ability improves antitumor effect of cytotoxic T lymphocytes in adoptive immunotherapy.

    PubMed

    Kanagawa, N; Yanagawa, T; Nakagawa, T; Okada, N; Nakagawa, S

    2013-01-01

    Angiogenesis is required for normal physiologic processes, but it is also involved in tumor growth, progression and metastasis. Here, we report the development of an immune-based antiangiogenic strategy based on the generation of T lymphocytes that possess killing specificity for cells expressing vascular endothelial growth factor receptor 2 (VEGFR2). To target VEGFR2-expressing cells, we engineered cytotoxic T lymphocyte (CTL) expressing chimeric T-cell receptors (cTCR-CTL) comprised of a single-chain variable fragment (scFv) against VEGFR2 linked to an intracellular signaling sequence derived from the CD3ζ chain of the TCR and CD28 by retroviral gene transduction methods. The cTCR-CTL exhibited efficient killing specificity against VEGFR2 and a tumor-targeting function in vitro and in vivo. Reflecting such abilities, we confirmed that the cTCR-CTL strongly inhibited the growth of a variety of syngeneic tumors after adoptive transfer into tumor-bearing mice without consequent damage to normal tissue. In addition, CTL expressing both cTCR and tumor-specific TCR induced complete tumor regression due to enhanced tumor infiltration by the CTL and long-term antigen-specific function. These findings provide evidence that the tumor vessel-injuring ability improved the antitumor effect of CTLs in adoptive immunotherapy for a broad range of cancers by inducing immune-mediated destruction of the tumor neovasculature.

  3. Searching whole genome sequences for biochemical identification features of emerging and reemerging pathogenic Corynebacterium species.

    PubMed

    Santos, André S; Ramos, Rommel T; Silva, Artur; Hirata, Raphael; Mattos-Guaraldi, Ana L; Meyer, Roberto; Azevedo, Vasco; Felicori, Liza; Pacheco, Luis G C

    2018-05-11

    Biochemical tests are traditionally used for bacterial identification at the species level in clinical microbiology laboratories. While biochemical profiles are generally efficient for the identification of the most important corynebacterial pathogen Corynebacterium diphtheriae, their ability to differentiate between biovars of this bacterium is still controversial. Besides, the unambiguous identification of emerging human pathogenic species of the genus Corynebacterium may be hampered by highly variable biochemical profiles commonly reported for these species, including Corynebacterium striatum, Corynebacterium amycolatum, Corynebacterium minutissimum, and Corynebacterium xerosis. In order to identify the genomic basis contributing for the biochemical variabilities observed in phenotypic identification methods of these bacteria, we combined a comprehensive literature review with a bioinformatics approach based on reconstruction of six specific biochemical reactions/pathways in 33 recently released whole genome sequences. We used data retrieved from curated databases (MetaCyc, PathoSystems Resource Integration Center (PATRIC), The SEED, TransportDB, UniProtKB) associated with homology searches by BLAST and profile Hidden Markov Models (HMMs) to detect enzymes participating in the various pathways and performed ab initio protein structure modeling and molecular docking to confirm specific results. We found a differential distribution among the various strains of genes that code for some important enzymes, such as beta-phosphoglucomutase and fructokinase, and also for individual components of carbohydrate transport systems, including the fructose-specific phosphoenolpyruvate-dependent sugar phosphotransferase (PTS) and the ribose-specific ATP-binging cassette (ABC) transporter. Horizontal gene transfer plays a role in the biochemical variability of the isolates, as some genes needed for sucrose fermentation were seen to be present in genomic islands. Noteworthy, using profile HMMs, we identified an enzyme with putative alpha-1,6-glycosidase activity only in some specific strains of C. diphtheriae and this may aid to understanding of the differential abilities to utilize glycogen and starch between the biovars.

  4. Interaction of the putative tyrosine recombinases RipX (UU145), XerC (UU222), and CodV (UU529) of Ureaplasma parvum serovar 3 with specific DNA

    PubMed Central

    Zimmerman, Carl-Ulrich R; Rosengarten, Renate; Spergser, Joachim

    2013-01-01

    Phase variation of two loci (‘mba locus’ and ‘UU172 phase-variable element’) in Ureaplasma parvum serovar 3 has been suggested as result of site-specific DNA inversion occurring at short inverted repeats. Three potential tyrosine recombinases (RipX, XerC, and CodV encoded by the genes UU145, UU222, and UU529) have been annotated in the genome of U. parvum serovar 3, which could be mediators in the proposed recombination event. We document that only orthologs of the gene xerC are present in all strains that show phase variation in the two loci. We demonstrate in vitro binding of recombinant maltose-binding protein fusions of XerC to the inverted repeats of the phase-variable loci, of RipX to a direct repeat that flanks a 20-kbp region, which has been proposed as putative pathogenicity island, and of CodV to a putative dif site. Co-transformation of the model organism Mycoplasma pneumoniae M129 with both the ‘mba locus’ and the recombinase gene xerC behind an active promoter region resulted in DNA inversion in the ‘mba locus’. Results suggest that XerC of U. parvum serovar 3 is a mediator in the proposed DNA inversion event of the two phase-variable loci. PMID:23305333

  5. pelB gene in isolates of Colletotrichum gloeosporioides from several hosts.

    PubMed

    Medeiros, L V; Maciel, D B; Medeiros, V V; Houllou Kido, L M; Oliveira, N T

    2010-04-13

    Colletotrichum gloeosporioides is an important pathogen for a great number of economically important crops. During the necrotrophic phase of infection by Colletotrichum spp, the degradative enzymes of plant cell walls, such as pectate lyase, clearly increase. A gene pelB that expresses a pectate lyase was identified in isolates of C. gloeosporioides in avocado pathogens. Various molecular studies have identified a kind of specialization of C. gloeosporioides isolates with specific hosts; however, there have been no studies of this gene in isolates from hosts other than avocado. The same is true for other species of Colletotrichum. We examined genetic variability in order to design primers that would amplify pelB gene fragments and compared the products of this amplification in C. gloeosporioides isolates from different hosts. Genetic variability was assessed using ISSR primers; the resultant data were grouped based on the UPGMA clustering method. Primers for the pelB gene were designed from selected GenBank sequences using the Primer 3 program at an annealing temperature of 60 degrees C and product amplification of nearly 600 bp. The ISSR primers were efficient in demonstrating the genetic variability of the Colletotrichum isolates and in distinguishing C. gloeosporioides, C. acutatum and C. sublineolum species. The gene pelB was found in C. gloeosporioides, C. acutatum and C. sublineolum. Amplified restriction fragments using MspI did not reveal differences in pelB gene structure in isolates from the three different host species that we investigated.

  6. A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data.

    PubMed

    Li, Xiaohong; Brock, Guy N; Rouchka, Eric C; Cooper, Nigel G F; Wu, Dongfeng; O'Toole, Timothy E; Gill, Ryan S; Eteleeb, Abdallah M; O'Brien, Liz; Rai, Shesh N

    2017-01-01

    Normalization is an essential step with considerable impact on high-throughput RNA sequencing (RNA-seq) data analysis. Although there are numerous methods for read count normalization, it remains a challenge to choose an optimal method due to multiple factors contributing to read count variability that affects the overall sensitivity and specificity. In order to properly determine the most appropriate normalization methods, it is critical to compare the performance and shortcomings of a representative set of normalization routines based on different dataset characteristics. Therefore, we set out to evaluate the performance of the commonly used methods (DESeq, TMM-edgeR, FPKM-CuffDiff, TC, Med UQ and FQ) and two new methods we propose: Med-pgQ2 and UQ-pgQ2 (per-gene normalization after per-sample median or upper-quartile global scaling). Our per-gene normalization approach allows for comparisons between conditions based on similar count levels. Using the benchmark Microarray Quality Control Project (MAQC) and simulated datasets, we performed differential gene expression analysis to evaluate these methods. When evaluating MAQC2 with two replicates, we observed that Med-pgQ2 and UQ-pgQ2 achieved a slightly higher area under the Receiver Operating Characteristic Curve (AUC), a specificity rate > 85%, the detection power > 92% and an actual false discovery rate (FDR) under 0.06 given the nominal FDR (≤0.05). Although the top commonly used methods (DESeq and TMM-edgeR) yield a higher power (>93%) for MAQC2 data, they trade off with a reduced specificity (<70%) and a slightly higher actual FDR than our proposed methods. In addition, the results from an analysis based on the qualitative characteristics of sample distribution for MAQC2 and human breast cancer datasets show that only our gene-wise normalization methods corrected data skewed towards lower read counts. However, when we evaluated MAQC3 with less variation in five replicates, all methods performed similarly. Thus, our proposed Med-pgQ2 and UQ-pgQ2 methods perform slightly better for differential gene analysis of RNA-seq data skewed towards lowly expressed read counts with high variation by improving specificity while maintaining a good detection power with a control of the nominal FDR level.

  7. A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data

    PubMed Central

    Li, Xiaohong; Brock, Guy N.; Rouchka, Eric C.; Cooper, Nigel G. F.; Wu, Dongfeng; O’Toole, Timothy E.; Gill, Ryan S.; Eteleeb, Abdallah M.; O’Brien, Liz

    2017-01-01

    Normalization is an essential step with considerable impact on high-throughput RNA sequencing (RNA-seq) data analysis. Although there are numerous methods for read count normalization, it remains a challenge to choose an optimal method due to multiple factors contributing to read count variability that affects the overall sensitivity and specificity. In order to properly determine the most appropriate normalization methods, it is critical to compare the performance and shortcomings of a representative set of normalization routines based on different dataset characteristics. Therefore, we set out to evaluate the performance of the commonly used methods (DESeq, TMM-edgeR, FPKM-CuffDiff, TC, Med UQ and FQ) and two new methods we propose: Med-pgQ2 and UQ-pgQ2 (per-gene normalization after per-sample median or upper-quartile global scaling). Our per-gene normalization approach allows for comparisons between conditions based on similar count levels. Using the benchmark Microarray Quality Control Project (MAQC) and simulated datasets, we performed differential gene expression analysis to evaluate these methods. When evaluating MAQC2 with two replicates, we observed that Med-pgQ2 and UQ-pgQ2 achieved a slightly higher area under the Receiver Operating Characteristic Curve (AUC), a specificity rate > 85%, the detection power > 92% and an actual false discovery rate (FDR) under 0.06 given the nominal FDR (≤0.05). Although the top commonly used methods (DESeq and TMM-edgeR) yield a higher power (>93%) for MAQC2 data, they trade off with a reduced specificity (<70%) and a slightly higher actual FDR than our proposed methods. In addition, the results from an analysis based on the qualitative characteristics of sample distribution for MAQC2 and human breast cancer datasets show that only our gene-wise normalization methods corrected data skewed towards lower read counts. However, when we evaluated MAQC3 with less variation in five replicates, all methods performed similarly. Thus, our proposed Med-pgQ2 and UQ-pgQ2 methods perform slightly better for differential gene analysis of RNA-seq data skewed towards lowly expressed read counts with high variation by improving specificity while maintaining a good detection power with a control of the nominal FDR level. PMID:28459823

  8. A network of epigenetic modifiers and DNA repair genes controls tissue-specific copy number alteration preference.

    PubMed

    Cramer, Dina; Serrano, Luis; Schaefer, Martin H

    2016-11-10

    Copy number alterations (CNAs) in cancer patients show a large variability in their number, length and position, but the sources of this variability are not known. CNA number and length are linked to patient survival, suggesting clinical relevance. We have identified genes that tend to be mutated in samples that have few or many CNAs, which we term CONIM genes (COpy Number Instability Modulators). CONIM proteins cluster into a densely connected subnetwork of physical interactions and many of them are epigenetic modifiers. Therefore, we investigated how the epigenome of the tissue-of-origin influences the position of CNA breakpoints and the properties of the resulting CNAs. We found that the presence of heterochromatin in the tissue-of-origin contributes to the recurrence and length of CNAs in the respective cancer type.

  9. Mutant phenotypes for thousands of bacterial genes of unknown function

    DOE PAGES

    Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan; ...

    2018-05-16

    One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because theymore » are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.« less

  10. Mutant phenotypes for thousands of bacterial genes of unknown function

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan

    One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because theymore » are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.« less

  11. Structure and variation of the mitochondrial genome of fishes.

    PubMed

    Satoh, Takashi P; Miya, Masaki; Mabuchi, Kohji; Nishida, Mutsumi

    2016-09-07

    The mitochondrial (mt) genome has been used as an effective tool for phylogenetic and population genetic analyses in vertebrates. However, the structure and variability of the vertebrate mt genome are not well understood. A potential strategy for improving our understanding is to conduct a comprehensive comparative study of large mt genome data. The aim of this study was to characterize the structure and variability of the fish mt genome through comparative analysis of large datasets. An analysis of the secondary structure of proteins for 250 fish species (248 ray-finned and 2 cartilaginous fishes) illustrated that cytochrome c oxidase subunits (COI, COII, and COIII) and a cytochrome bc1 complex subunit (Cyt b) had substantial amino acid conservation. Among the four proteins, COI was the most conserved, as more than half of all amino acid sites were invariable among the 250 species. Our models identified 43 and 58 stems within 12S rRNA and 16S rRNA, respectively, with larger numbers than proposed previously for vertebrates. The models also identified 149 and 319 invariable sites in 12S rRNA and 16S rRNA, respectively, in all fishes. In particular, the present result verified that a region corresponding to the peptidyl transferase center in prokaryotic 23S rRNA, which is homologous to mt 16S rRNA, is also conserved in fish mt 16S rRNA. Concerning the gene order, we found 35 variations (in 32 families) that deviated from the common gene order in vertebrates. These gene rearrangements were mostly observed in the area spanning the ND5 gene to the control region as well as two tRNA gene cluster regions (IQM and WANCY regions). Although many of such gene rearrangements were unique to a specific taxon, some were shared polyphyletically between distantly related species. Through a large-scale comparative analysis of 250 fish species mt genomes, we elucidated various structural aspects of the fish mt genome and the encoded genes. The present results will be important for understanding functions of the mt genome and developing programs for nucleotide sequence analysis. This study demonstrated the significance of extensive comparisons for understanding the structure of the mt genome.

  12. A Semiquantitative Framework for Gene Regulatory Networks: Increasing the Time and Quantitative Resolution of Boolean Networks

    PubMed Central

    Kerkhofs, Johan; Geris, Liesbet

    2015-01-01

    Boolean models have been instrumental in predicting general features of gene networks and more recently also as explorative tools in specific biological applications. In this study we introduce a basic quantitative and a limited time resolution to a discrete (Boolean) framework. Quantitative resolution is improved through the employ of normalized variables in unison with an additive approach. Increased time resolution stems from the introduction of two distinct priority classes. Through the implementation of a previously published chondrocyte network and T helper cell network, we show that this addition of quantitative and time resolution broadens the scope of biological behaviour that can be captured by the models. Specifically, the quantitative resolution readily allows models to discern qualitative differences in dosage response to growth factors. The limited time resolution, in turn, can influence the reachability of attractors, delineating the likely long term system behaviour. Importantly, the information required for implementation of these features, such as the nature of an interaction, is typically obtainable from the literature. Nonetheless, a trade-off is always present between additional computational cost of this approach and the likelihood of extending the model’s scope. Indeed, in some cases the inclusion of these features does not yield additional insight. This framework, incorporating increased and readily available time and semi-quantitative resolution, can help in substantiating the litmus test of dynamics for gene networks, firstly by excluding unlikely dynamics and secondly by refining falsifiable predictions on qualitative behaviour. PMID:26067297

  13. EGFR-Targeted Adenovirus Dendrimer Coating for Improved Systemic Delivery of the Theranostic NIS Gene

    PubMed Central

    Grünwald, Geoffrey K; Vetter, Alexandra; Klutz, Kathrin; Willhauck, Michael J; Schwenk, Nathalie; Senekowitsch-Schmidtke, Reingard; Schwaiger, Markus; Zach, Christian; Wagner, Ernst; Göke, Burkhard; Holm, Per S; Ogris, Manfred; Spitzweg, Christine

    2013-01-01

    We recently demonstrated tumor-selective iodide uptake and therapeutic efficacy of combined radiovirotherapy after systemic delivery of the theranostic sodium iodide symporter (NIS) gene using a dendrimer-coated adenovirus. To further improve shielding and targeting we physically coated replication-selective adenoviruses carrying the hNIS gene with a conjugate consisting of cationic poly(amidoamine) (PAMAM) dendrimer linked to the peptidic, epidermal growth factor receptor (EGFR)-specific ligand GE11. In vitro experiments demonstrated coxsackie-adenovirus receptor-independent but EGFR-specific transduction efficiency. Systemic injection of the uncoated adenovirus in a liver cancer xenograft mouse model led to high levels of NIS expression in the liver due to hepatic sequestration, which were significantly reduced after coating as demonstrated by 123I-scintigraphy. Reduction of adenovirus liver pooling resulted in decreased hepatotoxicity and increased transduction efficiency in peripheral xenograft tumors. 124I-PET-imaging confirmed EGFR-specificity by significantly lower tumoral radioiodine accumulation after pretreatment with the EGFR-specific antibody cetuximab. A significantly enhanced oncolytic effect was observed following systemic application of dendrimer-coated adenovirus that was further increased by additional treatment with a therapeutic dose of 131I. These results demonstrate restricted virus tropism and tumor-selective retargeting after systemic application of coated, EGFR-targeted adenoviruses therefore representing a promising strategy for improved systemic adenoviral NIS gene therapy. PMID:24193032

  14. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Spreitzer, Robert Joseph

    Ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) catalyzes the rate-limiting step of CO 2 fixation in photosynthesis. However, it is a slow enzyme, and O 2 competes with CO 2 at the active site. Oxygenation initiates the photorespiratory pathway, which also results in the loss of CO 2. If carboxylation could be increased or oxygenation decreased, an increase in net CO 2 fixation would be realized. Because Rubisco provides the primary means by which carbon enters all life on earth, there is much interest in engineering Rubisco to increase the production of food and renewable energy. Rubisco is located in the chloroplasts of plants,more » and it is comprised of two subunits. Much is known about the chloroplast-gene-encoded large subunit (rbcL gene), which contains the active site, but much less is known about the role of the nuclear-gene-encoded small subunit in Rubisco function (rbcS gene). Both subunits are coded by multiple genes in plants, which makes genetic engineering difficult. In the eukaryotic, green alga Chlamydomonas reinhardtii, it has been possible to eliminate all the Rubisco genes. These Rubisco-less mutants can be maintained by providing acetate as an alternative carbon source. In this project, focus has been placed on determining whether the small subunit might be a better genetic-engineering target for improving Rubisco. Analysis of a variable-loop structure (βA-βB loop) of the small subunit by genetic selection, directed mutagenesis, and construction of chimeras has shown that the small subunit can influence CO 2/O 2 specificity. X-ray crystal structures of engineered chimeric-loop enzymes have indicated that additional residues and regions of the small subunit may also contribute to Rubisco function. Structural dynamics of the small-subunit carboxyl terminus was also investigated. Alanine-scanning mutagenesis of the most-conserved small-subunit residues has identified a possible structural pathway between the small-subunit βA-βB loop and alpha-helix 8 of the large-subunit α/β-barrel active site. Hybrid enzymes were also created comprised of plant small subunits and Chlamydomonas large subunits, and these enzymes have increases in CO 2/O 2 specificity, further indicating that small subunits may be the key for ultimately engineering an improved Rubisco enzyme.« less

  15. Integrated site-specific quantification of faecal bacteria and detection of DNA markers in faecal contamination source tracking as a microbial risk tracking tool in urban Lake ecosystems

    NASA Astrophysics Data System (ADS)

    Donde, Oscar Omondi; Tian, Cuicui; Xiao, Bangding

    2017-11-01

    The presence of feacal-derived pathogens in water is responsible for several infectious diseases and deaths worldwide. As a solution, sources of fecal pollution in waters must be accurately assessed, properly determined and strictly controlled. However, the exercise has remained challenging due to the existing overlapping characteristics by different members of faecal coliform bacteria and the inadequacy of information pertaining to the contribution of seasonality and weather condition on tracking the possible sources of pollution. There are continued efforts to improve the Faecal Contamination Source Tracking (FCST) techniques such as Microbial Source Tracking (MST). This study aimed to make contribution to MST by evaluating the efficacy of combining site specific quantification of faecal contamination indicator bacteria and detection of DNA markers while accounting for seasonality and weather conditions' effects in tracking the major sources of faecal contamination in a freshwater system (Donghu Lake, China). The results showed that the use of cyd gene in addition to lacZ and uidA genes differentiates E. coli from other closely related faecal bacteria. The use of selective media increases the pollution source tracking accuracy. BSA addition boosts PCR detection and increases FCST efficiency. Seasonality and weather variability also influence the detection limit for DNA markers.

  16. Gene-gene-environment interactions between drugs, transporters, receptors, and metabolizing enzymes: Statins, SLCO1B1, and CYP3A4 as an example.

    PubMed

    Sadee, Wolfgang

    2013-09-01

    Pharmacogenetic biomarker tests include mostly specific single gene-drug pairs, capable of accounting for a portion of interindividual variability in drug response and toxicity. However, multiple genes are likely to contribute, either acting independently or epistatically, with the CYP2C9-VKORC1-warfarin test panel, an example of a clinically used gene-gene-dug interaction. I discuss here further instances of gene-gene-drug interactions, including a proposed dynamic effect on statin therapy by genetic variants in both a transporter (SLCO1B1) and a metabolizing enzyme (CYP3A4) in liver cells, the main target site where statins block cholesterol synthesis. These examples set a conceptual framework for developing diagnostic panels involving multiple gene-drug combinations. Copyright © 2013 Wiley Periodicals, Inc.

  17. Do PTK2 gene polymorphisms contribute to the interindividual variability in muscle strength and the response to resistance training? A preliminary report.

    PubMed

    Erskine, Robert M; Williams, Alun G; Jones, David A; Stewart, Claire E; Degens, Hans

    2012-04-01

    The protein tyrosine kinase-2 (PTK2) gene encodes focal adhesion kinase, a structural protein involved in lateral transmission of muscle fiber force. We investigated whether single-nucleotide polymorphisms (SNPs) of the PTK2 gene were associated with various indexes of human skeletal muscle strength and the interindividual variability in the strength responses to resistance training. We determined unilateral knee extension single repetition maximum (1-RM), maximum isometric voluntary contraction (MVC) knee joint torque, and quadriceps femoris muscle specific force (maximum force per unit physiological cross-sectional area) before and after 9 wk of knee extension resistance training in 51 untrained young men. All participants were genotyped for the PTK2 intronic rs7843014 A/C and 3'-untranslated region (UTR) rs7460 A/T SNPs. There were no genotype associations with baseline measures or posttraining changes in 1-RM or MVC. Although the training-induced increase in specific force was similar for all PTK2 genotypes, baseline specific force was higher in PTK2 rs7843014 AA and rs7460 TT homozygotes than in the respective rs7843014 C- (P = 0.016) and rs7460 A-allele (P = 0.009) carriers. These associations between muscle specific force and PTK2 SNPs suggest that interindividual differences exist in the way force is transmitted from the muscle fibers to the tendon. Therefore, our results demonstrate for the first time the impact of genetic variation on the intrinsic strength of human skeletal muscle.

  18. Elucidating the role of highly homologous Nicotiana benthamiana ubiquitin E2 gene family members in plant immunity through an improved virus-induced gene silencing approach.

    PubMed

    Zhou, Bangjun; Zeng, Lirong

    2017-01-01

    Virus-induced gene silencing (VIGS) has been used in many plant species as an attractive post transcriptional gene silencing (PTGS) method for studying gene function either individually or at large-scale in a high-throughput manner. However, the specificity and efficiency for knocking down members of a highly homologous gene family have remained to date a significant challenge in VIGS due to silencing of off-targets. Here we present an improved method for the selection and evaluation of gene fragments used for VIGS to specifically and efficiently knock down members of a highly homologous gene family. Using this method, we knocked down twelve and four members, respectively of group III of the gene family encoding ubiquitin-conjugating enzymes (E2) in Nicotiana benthamiana . Assays using these VIGS-treated plants revealed that the group III E2s are essential for plant development, plant immunity-associated reactive oxygen species (ROS) production, expression of the gene NbRbohB that is required for ROS production, and suppression of immunity-associated programmed cell death (PCD) by AvrPtoB, an effector protein of the bacterial pathogen Pseudomons syringae . Moreover, functional redundancy for plant development and ROS production was found to exist among members of group III E2s. We have found that employment of a gene fragment as short as approximately 70 base pairs (bp) that contains at least three mismatched nucleotides to other genes within any 21-bp sequences prevents silencing of off-target(s) in VIGS. This improved approach in the selection and evaluation of gene fragments allows for specific and efficient knocking down of highly homologous members of a gene family. Using this approach, we implicated N. benthamiana group III E2s in plant development, immunity-associated ROS production, and suppression of multiple immunity-associated PCD by AvrPtoB. We also unraveled functional redundancy among group III members in their requirement for plant development and plant immunity-associated ROS production.

  19. Polymorphisms of genes encoding P2X7R, IL-1B, OPG and RANK in orthodontic-induced apical root resorption.

    PubMed

    Pereira, S; Lavado, N; Nogueira, L; Lopez, M; Abreu, J; Silva, H

    2014-10-01

    Orthodontic-induced external apical root resorption (EARR) is a complex phenotype determined by poorly defined mechanical and patient intrinsic factors. The aim of this work was to construct a multifactorial integrative model, including clinical and genetic susceptibility factors, to analyze the risk of developing this common orthodontic complication. This retrospective study included 195 orthodontic patients. Using a multiple-linear regression model, where the dependent variable was the maximum% of root resorption (%EARRmax) for each patient, we assessed the contribution of nine clinical variables and four polymorphisms of genes involved in bone and tooth root remodeling (rs1718119 from P2RX7, rs1143634 from IL1B, rs3102735 from TNFRSF11B, encoding OPG, and rs1805034 from TNFRSF11A, encoding RANK). Clinical and genetic variables explained 30% of%EARRmax variability. The variables with the most significant unique contribution to the model were: gender (P < 0.05), treatment duration (P < 0.001), premolar extractions (P < 0.01), Hyrax appliance (P < 0.001) and GG genotype of rs1718119 from P2RX7 gene (P < 0.01). Age, overjet, tongue thrust, skeletal class II and the other polymorphisms made minor contributions. This study highlights the P2RX7 gene as a possible factor of susceptibility to EARR. A more extensive genetic profile may improve this model. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  20. Partial Diversity Generates Effector Immunity Specificity of the Bac41-Like Bacteriocins of Enterococcus faecalis Clinical Strains.

    PubMed

    Kurushima, Jun; Ike, Yasuyoshi; Tomita, Haruyoshi

    2016-09-01

    Bacteriocin 41 (Bac41) is the plasmid-encoded bacteriocin produced by the opportunistic pathogen Enterococcus faecalis Its genetic determinant consists of bacL1 (effector), bacL2 (regulator), bacA (effector), and bacI (immunity). The secreted effectors BacL1 and BacA coordinate to induce the lytic cell death of E. faecalis Meanwhile, the immunity factor BacI provides self-resistance to the Bac41 producer, E. faecalis, against the action of BacL1 and BacA. In this study, we demonstrated that more than half of the 327 clinical strains of E. faecalis screened had functional Bac41 genes. Analysis of the genetic structure of the Bac41 genes in the DNA sequences of the E. faecalis strains revealed that the Bac41-like genes consist of a relatively conserved region and a variable region located downstream from bacA Based on similarities in the variable region, the Bac41-like genes could be classified into type I, type IIa, and type IIb. Interestingly, the distinct Bac41 types had specific immunity factors for self-resistance, BacI1 or BacI2, and did not show cross-immunity to the other type of effector. We also demonstrated experimentally that the specificity of the immunity was determined by the combination of the C-terminal region of BacA and the presence of the unique BacI1 or BacI2 factor. These observations suggested that Bac41-like bacteriocin genes are extensively disseminated among E. faecalis strains in the clinical environment and can be grouped into at least three types. It was also indicated that the partial diversity results in specificity of self-resistance which may offer these strains a competitive advantage. Bacteriocins are antibacterial effectors produced by bacteria. In general, a bacteriocin-coding gene is accompanied by a cognate immunity gene that confers self-resistance on the bacteriocin-producing bacterium itself. We demonstrated that one of the bacteriocins, Bac41, is disseminated among E. faecalis clinical strains and the Bac41 subtypes with partial diversity. The Bac41-like bacteriocins were found to be classified into type I, type IIa, and type IIb by variation of the cognate immunity factors. The antibacterial activity of the respective effectors was specifically inhibited by the immunity factor from the same type of Bac41 but not the other types. This specificity of effector-immunity pairs suggests that bacteriocin genes might have evolved to change the immunity specificity to acquire an advantage in interbacterial competition. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  1. Impact of strong selection for the PrP major gene on genetic variability of four French sheep breeds (Open Access publication)

    PubMed Central

    Palhiere, Isabelle; Brochard, Mickaël; Moazami-Goudarzi, Katayoun; Laloë, Denis; Amigues, Yves; Bed'hom, Bertrand; Neuts, Étienne; Leymarie, Cyril; Pantano, Thais; Cribiu, Edmond Paul; Bibé, Bernard; Verrier, Étienne

    2008-01-01

    Effective selection on the PrP gene has been implemented since October 2001 in all French sheep breeds. After four years, the ARR "resistant" allele frequency increased by about 35% in young males. The aim of this study was to evaluate the impact of this strong selection on genetic variability. It is focussed on four French sheep breeds and based on the comparison of two groups of 94 animals within each breed: the first group of animals was born before the selection began, and the second, 3–4 years later. Genetic variability was assessed using genealogical and molecular data (29 microsatellite markers). The expected loss of genetic variability on the PrP gene was confirmed. Moreover, among the five markers located in the PrP region, only the three closest ones were affected. The evolution of the number of alleles, heterozygote deficiency within population, expected heterozygosity and the Reynolds distances agreed with the criteria from pedigree and pointed out that neutral genetic variability was not much affected. This trend depended on breed, i.e. on their initial states (population size, PrP frequencies) and on the selection strategies for improving scrapie resistance while carrying out selection for production traits. PMID:18990357

  2. Breeding and Genetics Symposium: networks and pathways to guide genomic selection.

    PubMed

    Snelling, W M; Cushman, R A; Keele, J W; Maltecca, C; Thomas, M G; Fortes, M R S; Reverter, A

    2013-02-01

    Many traits affecting profitability and sustainability of meat, milk, and fiber production are polygenic, with no single gene having an overwhelming influence on observed variation. No knowledge of the specific genes controlling these traits has been needed to make substantial improvement through selection. Significant gains have been made through phenotypic selection enhanced by pedigree relationships and continually improving statistical methodology. Genomic selection, recently enabled by assays for dense SNP located throughout the genome, promises to increase selection accuracy and accelerate genetic improvement by emphasizing the SNP most strongly correlated to phenotype although the genes and sequence variants affecting phenotype remain largely unknown. These genomic predictions theoretically rely on linkage disequilibrium (LD) between genotyped SNP and unknown functional variants, but familial linkage may increase effectiveness when predicting individuals related to those in the training data. Genomic selection with functional SNP genotypes should be less reliant on LD patterns shared by training and target populations, possibly allowing robust prediction across unrelated populations. Although the specific variants causing polygenic variation may never be known with certainty, a number of tools and resources can be used to identify those most likely to affect phenotype. Associations of dense SNP genotypes with phenotype provide a 1-dimensional approach for identifying genes affecting specific traits; in contrast, associations with multiple traits allow defining networks of genes interacting to affect correlated traits. Such networks are especially compelling when corroborated by existing functional annotation and established molecular pathways. The SNP occurring within network genes, obtained from public databases or derived from genome and transcriptome sequences, may be classified according to expected effects on gene products. As illustrated by functionally informed genomic predictions being more accurate than naive whole-genome predictions of beef tenderness, coupling evidence from livestock genotypes, phenotypes, gene expression, and genomic variants with existing knowledge of gene functions and interactions may provide greater insight into the genes and genomic mechanisms affecting polygenic traits and facilitate functional genomic selection for economically important traits.

  3. Targeting of Magnetic Nanoparticle-coated Microbubbles to the Vascular Wall Empowers Site-specific Lentiviral Gene Delivery in vivo.

    PubMed

    Heun, Yvonn; Hildebrand, Staffan; Heidsieck, Alexandra; Gleich, Bernhard; Anton, Martina; Pircher, Joachim; Ribeiro, Andrea; Mykhaylyk, Olga; Eberbeck, Dietmar; Wenzel, Daniela; Pfeifer, Alexander; Woernle, Markus; Krötz, Florian; Pohl, Ulrich; Mannell, Hanna

    2017-01-01

    In the field of vascular gene therapy, targeting systems are promising advancements to improve site-specificity of gene delivery. Here, we studied whether incorporation of magnetic nanoparticles (MNP) with different magnetic properties into ultrasound sensitive microbubbles may represent an efficient way to enable gene targeting in the vascular system after systemic application. Thus, we associated novel silicon oxide-coated magnetic nanoparticle containing microbubbles (SO-Mag MMB) with lentiviral particles carrying therapeutic genes and determined their physico-chemical as well as biological properties compared to MMB coated with polyethylenimine-coated magnetic nanoparticles (PEI-Mag MMB). While there were no differences between both MMB types concerning size and lentivirus binding, SO-Mag MMB exhibited superior characteristics regarding magnetic moment, magnetizability as well as transduction efficiency under static and flow conditions in vitro . Focal disruption of lentiviral SO-Mag MMB by ultrasound within isolated vessels exposed to an external magnetic field decisively improved localized VEGF expression in aortic endothelium ex vivo and enhanced the angiogenic response. Using the same system in vivo , we achieved a highly effective, site-specific lentiviral transgene expression in microvessels of the mouse dorsal skin after arterial injection. Thus, we established a novel lentiviral MMB technique, which has great potential towards site-directed vascular gene therapy.

  4. Analytical performance of the ThyroSeq v3 genomic classifier for cancer diagnosis in thyroid nodules.

    PubMed

    Nikiforova, Marina N; Mercurio, Stephanie; Wald, Abigail I; Barbi de Moura, Michelle; Callenberg, Keith; Santana-Santos, Lucas; Gooding, William E; Yip, Linwah; Ferris, Robert L; Nikiforov, Yuri E

    2018-04-15

    Molecular tests have clinical utility for thyroid nodules with indeterminate fine-needle aspiration (FNA) cytology, although their performance requires further improvement. This study evaluated the analytical performance of the newly created ThyroSeq v3 test. ThyroSeq v3 is a DNA- and RNA-based next-generation sequencing assay that analyzes 112 genes for a variety of genetic alterations, including point mutations, insertions/deletions, gene fusions, copy number alterations, and abnormal gene expression, and it uses a genomic classifier (GC) to separate malignant lesions from benign lesions. It was validated in 238 tissue samples and 175 FNA samples with known surgical follow-up. Analytical performance studies were conducted. In the training tissue set of samples, ThyroSeq GC detected more than 100 genetic alterations, including BRAF, RAS, TERT, and DICER1 mutations, NTRK1/3, BRAF, and RET fusions, 22q loss, and gene expression alterations. GC cutoffs were established to distinguish cancer from benign nodules with 93.9% sensitivity, 89.4% specificity, and 92.1% accuracy. This correctly classified most papillary, follicular, and Hurthle cell lesions, medullary thyroid carcinomas, and parathyroid lesions. In the FNA validation set, the GC sensitivity was 98.0%, the specificity was 81.8%, and the accuracy was 90.9%. Analytical accuracy studies demonstrated a minimal required nucleic acid input of 2.5 ng, a 12% minimal acceptable tumor content, and reproducible test results under variable stress conditions. The ThyroSeq v3 GC analyzes 5 different classes of molecular alterations and provides high accuracy for detecting all common types of thyroid cancer and parathyroid lesions. The analytical sensitivity, specificity, and robustness of the test have been successfully validated and indicate its suitability for clinical use. Cancer 2018;124:1682-90. © 2018 American Cancer Society. © 2018 American Cancer Society.

  5. A polymorphic region in the human transcription factor AP-2beta gene is associated with specific personality traits.

    PubMed

    Damberg, M; Garpenstrand, H; Alfredsson, J; Ekblom, J; Forslund, K; Rylander, G; Oreland, L

    2000-03-01

    Transcription factor AP-2beta is implicated in playing an important role during embryonic development of different parts of the brain, eg, midbrain, hindbrain, spinal cord, dorsal and cranial root ganglia.1,2 The gene encoding AP-2beta contains a polymorphic region which includes a tetranucleotide repeat of [CAAA] four or five times, located in intron 2 between nucleotides 12593 and 12612.3 Since the midbrain contains structures important for variables such as mood and personality, we have investigated if the AP-2beta genotype is associated with personality traits estimated by the Karolinska Scales of Personality (KSP). Identification of transcription factor genes as candidate genes in psychiatric disorders is a novel approach to further elucidate the genetic factors that, together with environmental factors, are involved in the expression of specific psychiatric phenotypes. The AP-2beta genotype and KSP scores were determined for 137 Caucasian volunteers (73 females and 64 males). The personality traits muscular tension, guilt, somatic anxiety, psychastenia and indirect aggression were significantly associated with the specific AP-2beta genotype, albeit with significant difference between genders. Based on this result the human AP-2beta gene seems to be an important candidate gene for personality disorders. Moreover, the present results suggest that the structure of the intron 2 region of the AP-2beta gene is one factor that contributes to development of the constitutional component of specific personality traits.

  6. Rigorous tests of gene-environment interactions in a lab study of the oxytocin receptor gene (OXTR), alcohol exposure, and aggression.

    PubMed

    LoParo, Devon; Johansson, Ada; Walum, Hasse; Westberg, Lars; Santtila, Pekka; Waldman, Irwin

    2016-07-01

    Naturalistic studies of gene-environment interactions (G X E) have been plagued by several limitations, including difficulty isolating specific environmental risk factors from other correlated aspects of the environment, gene-environment correlation (rGE ), and the use of a single genetic variant to represent the influence of a gene. We present results from 235 Finnish young men in two lab studies of aggression and alcohol challenge that attempt to redress these limitations of the extant G X E literature. Specifically, we use a latent variable modeling approach in an attempt to more fully account for genetic variation across the oxytocin receptor gene (OXTR) and to robustly test its main effects on aggression and its interaction with alcohol exposure. We also modeled aggression as a latent variable comprising various indices, including the average and maximum levels of aggression, the earliest trial on which aggression was expressed, and the proportion of trials on which the minimum and maximum levels of aggression were expressed. The best fitting model for the genetic variation across OXTR included six factors derived from an exploratory factor analysis, roughly corresponding to six haplotype blocks. Aggression levels were higher on trials in which participants were administered alcohol, won, or were provoked. There was a significant main effect of OXTR on aggression across studies after controlling for covariates. The interaction of OXTR and alcohol was also significant across studies, such that OXTR had stronger effects on aggression in the alcohol administration condition. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.

  7. Epigenetic and genetic variants in the HTR1B gene and clinical improvement in children and adolescents treated with fluoxetine.

    PubMed

    Gassó, Patricia; Rodríguez, Natalia; Blázquez, Ana; Monteagudo, Ana; Boloc, Daniel; Plana, Maria Teresa; Lafuente, Amalia; Lázaro, Luisa; Arnaiz, Joan Albert; Mas, Sergi

    2017-04-03

    The serotonin 1B receptor (5-HT 1B ) is important to both the pathogenesis of major depressive disorder and the antidepressant effects of selective serotonin reuptake inhibitors. Although fluoxetine has been shown to be effective and safe in children and adolescents, not all patients experience a proper clinical response, which has led to further study into the main factors involved in this inter-individual variability. Our aim was to study the effect of epigenetic and genetic factors that could affect 5-hydroxytryptamine receptor 1B (HTR1B) gene expression, and thereby response to fluoxetine. A total of 83 children and adolescents were clinically assessed 12weeks after of initiating an antidepressant treatment with fluoxetine for the first time. We evaluated the influence of single nucleotide polymorphisms (SNPs) specifically located in transcription factor binding sites (TFBSs) on their clinical improvement. A combined genetic analysis considering the significant SNPs together with the functional variant rs130058 previously associated in our population was also performed. Moreover, we assessed, for the first time in the literature, whether methylation levels of the HTR1B promoter region could be associated with the pharmacological response. Two, rs9361233 and rs9361235, were significantly associated with clinical improvement after treatment with fluoxetine. The heterozygous genotype combination analysis showed a negative correlation with clinical improvement. The lowest improvement was experienced by patients who were heterozygous for all three SNPs. Moreover, a negative correlation was found between clinical improvement and the average methylation level of the HTR1B promoter. These results give new evidence for the role of epigenetic and genetic factors which could modulate HTR1B expression in the pharmacological response to antidepressants. Copyright © 2016 Elsevier Inc. All rights reserved.

  8. Why weight? Modelling sample and observational level variability improves power in RNA-seq analyses

    PubMed Central

    Liu, Ruijie; Holik, Aliaksei Z.; Su, Shian; Jansz, Natasha; Chen, Kelan; Leong, Huei San; Blewitt, Marnie E.; Asselin-Labat, Marie-Liesse; Smyth, Gordon K.; Ritchie, Matthew E.

    2015-01-01

    Variations in sample quality are frequently encountered in small RNA-sequencing experiments, and pose a major challenge in a differential expression analysis. Removal of high variation samples reduces noise, but at a cost of reducing power, thus limiting our ability to detect biologically meaningful changes. Similarly, retaining these samples in the analysis may not reveal any statistically significant changes due to the higher noise level. A compromise is to use all available data, but to down-weight the observations from more variable samples. We describe a statistical approach that facilitates this by modelling heterogeneity at both the sample and observational levels as part of the differential expression analysis. At the sample level this is achieved by fitting a log-linear variance model that includes common sample-specific or group-specific parameters that are shared between genes. The estimated sample variance factors are then converted to weights and combined with observational level weights obtained from the mean–variance relationship of the log-counts-per-million using ‘voom’. A comprehensive analysis involving both simulations and experimental RNA-sequencing data demonstrates that this strategy leads to a universally more powerful analysis and fewer false discoveries when compared to conventional approaches. This methodology has wide application and is implemented in the open-source ‘limma’ package. PMID:25925576

  9. Model-driven analysis of experimentally determined growth phenotypes for 465 yeast gene deletion mutants under 16 different conditions

    PubMed Central

    Snitkin, Evan S; Dudley, Aimée M; Janse, Daniel M; Wong, Kaisheen; Church, George M; Segrè, Daniel

    2008-01-01

    Background Understanding the response of complex biochemical networks to genetic perturbations and environmental variability is a fundamental challenge in biology. Integration of high-throughput experimental assays and genome-scale computational methods is likely to produce insight otherwise unreachable, but specific examples of such integration have only begun to be explored. Results In this study, we measured growth phenotypes of 465 Saccharomyces cerevisiae gene deletion mutants under 16 metabolically relevant conditions and integrated them with the corresponding flux balance model predictions. We first used discordance between experimental results and model predictions to guide a stage of experimental refinement, which resulted in a significant improvement in the quality of the experimental data. Next, we used discordance still present in the refined experimental data to assess the reliability of yeast metabolism models under different conditions. In addition to estimating predictive capacity based on growth phenotypes, we sought to explain these discordances by examining predicted flux distributions visualized through a new, freely available platform. This analysis led to insight into the glycerol utilization pathway and the potential effects of metabolic shortcuts on model results. Finally, we used model predictions and experimental data to discriminate between alternative raffinose catabolism routes. Conclusions Our study demonstrates how a new level of integration between high throughput measurements and flux balance model predictions can improve understanding of both experimental and computational results. The added value of a joint analysis is a more reliable platform for specific testing of biological hypotheses, such as the catabolic routes of different carbon sources. PMID:18808699

  10. [Polymorphism of KPI-A genes from plants of the subgenus Potatoe (sect. Petota, Estolonifera and Lycopersicum) and subgenus Solanum].

    PubMed

    Krinitsyna, A A; Mel'nikova, N V; Belenikin, M S; Poltronieri, P; Santino, A; Kudriavtseva, A V; Savilova, A M; Speranskaia, A S

    2013-01-01

    Kunitz-type proteinase inhibitor proteins of group A (KPI-A) are involved in the protection of potato plants from pathogens and pests. Although sequences of large number of the KPI-A genes from different species of cultivated potato (Solanum tuberosum subsp. tuberosum) and a few genes from tomato (Solanum lycopersicum) are known to date, information about the allelic diversity of these genes in other species of the genus Solanum is lacking. In our work, the consensus sequences of the KPI-A genes were established in two species of subgenus Potatoe sect. Petota (Solanum tuberosum subsp. andigenum--5 genes and Solanum stoloniferum--2 genes) and in the subgenus Solanum (Solanum nigrum--5 genes) by amplification, cloning, sequencing and subsequent analysis. The determined sequences of KPI-A genes were 97-100% identical to known sequences of the cultivated potato of sect. Petota (cultivated potato Solanum tuberosum subsp. tuberosum) and sect. Etuberosum (S. palustre). The interspecific variability of these genes did not exceed the intraspecific variability for all studied species except Solanum lycopersicum. The distribution of highly variable and conserved sequences in the mature protein-encoding regions was uniform for all investigated KPI-A genes. However, our attempts to amplify the homologous genes using the same primers and the genomes of Solanum dulcamarum, Solanum lycopersicum and Mandragora officinarum resulted in no product formation. Phylogenetic analysis of KPI-A diversity showed that the sequences of the S. lycopersicum form independent cluster, whereas KPI-A of S. nigrum and species of sect. Etuberosum and sect. Petota are closely related and do not form species-specific subclasters. Although Solanum nigrum is resistant to all known races of economically one of the most important diseases of solanaceous plants oomycete Phytophthora infestans aminoacid sequences encoding by KPI-A genes from its genome have nearly or absolutely no differences to the same from genomes of cultivated potatoes involved by P. infestans.

  11. Variable neighborhood search for reverse engineering of gene regulatory networks.

    PubMed

    Nicholson, Charles; Goodwin, Leslie; Clark, Corey

    2017-01-01

    A new search heuristic, Divided Neighborhood Exploration Search, designed to be used with inference algorithms such as Bayesian networks to improve on the reverse engineering of gene regulatory networks is presented. The approach systematically moves through the search space to find topologies representative of gene regulatory networks that are more likely to explain microarray data. In empirical testing it is demonstrated that the novel method is superior to the widely employed greedy search techniques in both the quality of the inferred networks and computational time. Copyright © 2016 Elsevier Inc. All rights reserved.

  12. Statistical analysis of an RNA titration series evaluates microarray precision and sensitivity on a whole-array basis

    PubMed Central

    Holloway, Andrew J; Oshlack, Alicia; Diyagama, Dileepa S; Bowtell, David DL; Smyth, Gordon K

    2006-01-01

    Background Concerns are often raised about the accuracy of microarray technologies and the degree of cross-platform agreement, but there are yet no methods which can unambiguously evaluate precision and sensitivity for these technologies on a whole-array basis. Results A methodology is described for evaluating the precision and sensitivity of whole-genome gene expression technologies such as microarrays. The method consists of an easy-to-construct titration series of RNA samples and an associated statistical analysis using non-linear regression. The method evaluates the precision and responsiveness of each microarray platform on a whole-array basis, i.e., using all the probes, without the need to match probes across platforms. An experiment is conducted to assess and compare four widely used microarray platforms. All four platforms are shown to have satisfactory precision but the commercial platforms are superior for resolving differential expression for genes at lower expression levels. The effective precision of the two-color platforms is improved by allowing for probe-specific dye-effects in the statistical model. The methodology is used to compare three data extraction algorithms for the Affymetrix platforms, demonstrating poor performance for the commonly used proprietary algorithm relative to the other algorithms. For probes which can be matched across platforms, the cross-platform variability is decomposed into within-platform and between-platform components, showing that platform disagreement is almost entirely systematic rather than due to measurement variability. Conclusion The results demonstrate good precision and sensitivity for all the platforms, but highlight the need for improved probe annotation. They quantify the extent to which cross-platform measures can be expected to be less accurate than within-platform comparisons for predicting disease progression or outcome. PMID:17118209

  13. Exploring internal features of 16S rRNA gene for identification of clinically relevant species of the genus Streptococcus

    PubMed Central

    2011-01-01

    Background Streptococcus is an economically important genus as a number of species belonging to this genus are human and animal pathogens. The genus has been divided into different groups based on 16S rRNA gene sequence similarity. The variability observed among the members of these groups is low and it is difficult to distinguish them. The present study was taken up to explore 16S rRNA gene sequence to develop methods that can be used for preliminary identification and can supplement the existing methods for identification of clinically-relevant isolates of the genus Streptococcus. Methods 16S rRNA gene sequences belonging to the isolates of S. dysgalactiae, S. equi, S. pyogenes, S. agalactiae, S. bovis, S. gallolyticus, S. mutans, S. sobrinus, S. mitis, S. pneumoniae, S. thermophilus and S. anginosus were analyzed with the purpose to define genetic variability within each species to generate a phylogenetic framework, to identify species-specific signatures and in-silico restriction enzyme analysis. Results The framework based analysis was used to segregate Streptococcus spp. previously identified upto genus level. This segregation was validated using species-specific signatures and in-silico restriction enzyme analysis. 43 uncharacterized Streptococcus spp. could be identified using this approach. Conclusions The markers generated exploring 16S rRNA gene sequences provided useful tool that can be further used for identification of different species of the genus Streptococcus. PMID:21702978

  14. Correlation between Hox code and vertebral morphology in archosaurs.

    PubMed

    Böhmer, Christine; Rauhut, Oliver W M; Wörheide, Gert

    2015-07-07

    The relationship between developmental genes and phenotypic variation is of central interest in evolutionary biology. An excellent example is the role of Hox genes in the anteroposterior regionalization of the vertebral column in vertebrates. Archosaurs (crocodiles, dinosaurs including birds) are highly variable both in vertebral morphology and number. Nevertheless, functionally equivalent Hox genes are active in the axial skeleton during embryonic development, indicating that the morphological variation across taxa is likely owing to modifications in the pattern of Hox gene expression. By using geometric morphometrics, we demonstrate a correlation between vertebral Hox code and quantifiable vertebral morphology in modern archosaurs, in which the boundaries between morphological subgroups of vertebrae can be linked to anterior Hox gene expression boundaries. Our findings reveal homologous units of cervical vertebrae in modern archosaurs, each with their specific Hox gene pattern, enabling us to trace these homologies in the extinct sauropodomorph dinosaurs, a group with highly variable vertebral counts. Based on the quantifiable vertebral morphology, this allows us to infer the underlying genetic mechanisms in vertebral evolution in fossils, which represents not only an important case study, but will lead to a better understanding of the origin of morphological disparity in recent archosaur vertebral columns.

  15. Correlation between Hox code and vertebral morphology in archosaurs

    PubMed Central

    Böhmer, Christine; Rauhut, Oliver W. M.; Wörheide, Gert

    2015-01-01

    The relationship between developmental genes and phenotypic variation is of central interest in evolutionary biology. An excellent example is the role of Hox genes in the anteroposterior regionalization of the vertebral column in vertebrates. Archosaurs (crocodiles, dinosaurs including birds) are highly variable both in vertebral morphology and number. Nevertheless, functionally equivalent Hox genes are active in the axial skeleton during embryonic development, indicating that the morphological variation across taxa is likely owing to modifications in the pattern of Hox gene expression. By using geometric morphometrics, we demonstrate a correlation between vertebral Hox code and quantifiable vertebral morphology in modern archosaurs, in which the boundaries between morphological subgroups of vertebrae can be linked to anterior Hox gene expression boundaries. Our findings reveal homologous units of cervical vertebrae in modern archosaurs, each with their specific Hox gene pattern, enabling us to trace these homologies in the extinct sauropodomorph dinosaurs, a group with highly variable vertebral counts. Based on the quantifiable vertebral morphology, this allows us to infer the underlying genetic mechanisms in vertebral evolution in fossils, which represents not only an important case study, but will lead to a better understanding of the origin of morphological disparity in recent archosaur vertebral columns. PMID:26085583

  16. Somatic diversification in the heavy chain variable region genes expressed by human autoantibodies bearing a lupus-associated nephritogenic anti-DNA idiotype

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Demaison, C.; Chastagner, P.; Theze, J.

    1994-01-18

    Monoclonal anti-DNA antibodies bearing a lupus nephritis-associated idiotype were derived from five patients with systemic lupus erythematosus (SLE). Genes encoding their heavy (H)-chain variable (V[sub H]) regions were cloned and sequenced. When compared with their closest V[sub h] germ-line gene relatives, these sequences exhibit a number of silent (S) and replacement (R) substitutions. The ratios of R/S mutations were much higher in the complementarity-determining regions (CDRs) of the antibodies than in the framework regions. Molecular amplification of genomic V[sub H] genes and Southern hybridization with somatic CDR2-specific oligonucleotide probes showed that the configuration of the V[sub H] genes corresponding tomore » V[sub H] sequences in the nephritogenic antibodies is not present in the patient's own germ-line DNA, implying that the B-cell clones underwent somatic mutation in vivo. These findings, together with the characteristics of the diversity and junctional gene elements utilized to form the antibody, indicate that these autoantibodies have been driven through somatic selection processes reminiscent of those that govern antibody responses triggered by exogenous stimuli.« less

  17. A Nested-Splicing by Overlap Extension PCR Improves Specificity of this Standard Method.

    PubMed

    Karkhane, Ali Asghar; Yakhchali, Bagher; Rastgar Jazii, Ferdous; Bambai, Bijan; Aminzadeh, Saeed; Rahimi, Fatemeh

    2015-06-01

    Splicing by overlap extension (SOE) PCR is used to create mutation in the coding sequence of an enzyme in order to study the role of specific residues in protein's structure and function. We introduced a nested-SOE-PCR (N -SOE-PCR) in order to increase the specificity and generating mutations in a gene by SOE-PCR. Genomic DNA from Bacillus thermocatenulatus was extracted. Nested PCR was used to amplify B. thermocatenulatus lipase gene variants, namely wild type and mutant, using gene specific and mutagenic specific primers, followed by cloning in a suitable vector. Briefly in N-SOE-PCR method, instead of two pairs of primers, three pairs of primers are used to amplify a mutagenic fragment. Moreover, the first and second PCR products are slightly longer than PCR products in a conventional SOE. PCR products obtained from the first round of PCR are used for the second PCR by applying the nested and mutated primers. Following to the purification of the amplified fragments, they will be subject of the further purification and will be used as template to perform the third round of PCR using gene specific primers. In the end, the products will be cloned into a suitable vector for subsequent application. In comparison to the conventional SOE-PCR, the improved method (i.e. N-SOE-PCR) increases the yield and specificity of the products. In addition, the proposed method shows a large reduction in the non-specific products. By applying two more primers in the conventional SOE, the specificity of the method will be improved. This would be in part due to annealing of the primers further inside the amplicon that increases both the efficiency and a better attachment of the primers. Positioning of the primer far from both ends of an amplicon leads to an enhanced binding as well as increased affinity in the third round of amplification in SOE.

  18. Automated analysis of high-throughput B-cell sequencing data reveals a high frequency of novel immunoglobulin V gene segment alleles.

    PubMed

    Gadala-Maria, Daniel; Yaari, Gur; Uduman, Mohamed; Kleinstein, Steven H

    2015-02-24

    Individual variation in germline and expressed B-cell immunoglobulin (Ig) repertoires has been associated with aging, disease susceptibility, and differential response to infection and vaccination. Repertoire properties can now be studied at large-scale through next-generation sequencing of rearranged Ig genes. Accurate analysis of these repertoire-sequencing (Rep-Seq) data requires identifying the germline variable (V), diversity (D), and joining (J) gene segments used by each Ig sequence. Current V(D)J assignment methods work by aligning sequences to a database of known germline V(D)J segment alleles. However, existing databases are likely to be incomplete and novel polymorphisms are hard to differentiate from the frequent occurrence of somatic hypermutations in Ig sequences. Here we develop a Tool for Ig Genotype Elucidation via Rep-Seq (TIgGER). TIgGER analyzes mutation patterns in Rep-Seq data to identify novel V segment alleles, and also constructs a personalized germline database containing the specific set of alleles carried by a subject. This information is then used to improve the initial V segment assignments from existing tools, like IMGT/HighV-QUEST. The application of TIgGER to Rep-Seq data from seven subjects identified 11 novel V segment alleles, including at least one in every subject examined. These novel alleles constituted 13% of the total number of unique alleles in these subjects, and impacted 3% of V(D)J segment assignments. These results reinforce the highly polymorphic nature of human Ig V genes, and suggest that many novel alleles remain to be discovered. The integration of TIgGER into Rep-Seq processing pipelines will increase the accuracy of V segment assignments, thus improving B-cell repertoire analyses.

  19. Toward Bridging the Mechanistic Gap Between Genes and Traits by Emphasizing the Role of Proteins in a Computational Environment

    NASA Astrophysics Data System (ADS)

    Haskel-Ittah, Michal; Yarden, Anat

    2017-12-01

    Previous studies have shown that students often ignore molecular mechanisms when describing genetic phenomena. Specifically, students tend to directly link genes to their encoded traits, ignoring the role of proteins as mediators in this process. We tested the ability of 10th grade students to connect genes to traits through proteins, using concept maps and reasoning questions. The context of this study was a computational learning environment developed specifically to foster this ability. This environment presents proteins as the mechanism-mediating genetic phenomena. We found that students' ability to connect genes, proteins, and traits, or to reason using this connection, was initially poor. However, significant improvement was obtained when using the learning environment. Our results suggest that visual representations of proteins' functions in the context of a specific trait contributed to this improvement. One significant aspect of these results is the indication that 10th graders are capable of accurately describing genetic phenomena and their underlying mechanisms, a task that has been shown to raise difficulties, even in higher grades of high school.

  20. The Transcriptome of the Reference Potato Genome Solanum tuberosum Group Phureja Clone DM1-3 516R44

    PubMed Central

    Massa, Alicia N.; Childs, Kevin L.; Lin, Haining; Bryan, Glenn J.; Giuliano, Giovanni; Buell, C. Robin

    2011-01-01

    Advances in molecular breeding in potato have been limited by its complex biological system, which includes vegetative propagation, autotetraploidy, and extreme heterozygosity. The availability of the potato genome and accompanying gene complement with corresponding gene structure, location, and functional annotation are powerful resources for understanding this complex plant and advancing molecular breeding efforts. Here, we report a reference for the potato transcriptome using 32 tissues and growth conditions from the doubled monoploid Solanum tuberosum Group Phureja clone DM1-3 516R44 for which a genome sequence is available. Analysis of greater than 550 million RNA-Seq reads permitted the detection and quantification of expression levels of over 22,000 genes. Hierarchical clustering and principal component analyses captured the biological variability that accounts for gene expression differences among tissues suggesting tissue-specific gene expression, and genes with tissue or condition restricted expression. Using gene co-expression network analysis, we identified 18 gene modules that represent tissue-specific transcriptional networks of major potato organs and developmental stages. This information provides a powerful resource for potato research as well as studies on other members of the Solanaceae family. PMID:22046362

  1. A Sorghum bicolor expression atlas reveals dynamic genotype-specific expression profiles for vegetative tissues of grain, sweet and bioenergy sorghums.

    PubMed

    Shakoor, Nadia; Nair, Ramesh; Crasta, Oswald; Morris, Geoffrey; Feltus, Alex; Kresovich, Stephen

    2014-01-23

    Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specific probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e.g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community.

  2. A Sorghum bicolor expression atlas reveals dynamic genotype-specific expression profiles for vegetative tissues of grain, sweet and bioenergy sorghums

    PubMed Central

    2014-01-01

    Background Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. Results This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specific probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e.g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Conclusions Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community. PMID:24456189

  3. Analysis of phylogenomic datasets reveals conflict, concordance, and gene duplications with examples from animals and plants.

    PubMed

    Smith, Stephen A; Moore, Michael J; Brown, Joseph W; Yang, Ya

    2015-08-05

    The use of transcriptomic and genomic datasets for phylogenetic reconstruction has become increasingly common as researchers attempt to resolve recalcitrant nodes with increasing amounts of data. The large size and complexity of these datasets introduce significant phylogenetic noise and conflict into subsequent analyses. The sources of conflict may include hybridization, incomplete lineage sorting, or horizontal gene transfer, and may vary across the phylogeny. For phylogenetic analysis, this noise and conflict has been accommodated in one of several ways: by binning gene regions into subsets to isolate consistent phylogenetic signal; by using gene-tree methods for reconstruction, where conflict is presumed to be explained by incomplete lineage sorting (ILS); or through concatenation, where noise is presumed to be the dominant source of conflict. The results provided herein emphasize that analysis of individual homologous gene regions can greatly improve our understanding of the underlying conflict within these datasets. Here we examined two published transcriptomic datasets, the angiosperm group Caryophyllales and the aculeate Hymenoptera, for the presence of conflict, concordance, and gene duplications in individual homologs across the phylogeny. We found significant conflict throughout the phylogeny in both datasets and in particular along the backbone. While some nodes in each phylogeny showed patterns of conflict similar to what might be expected with ILS alone, the backbone nodes also exhibited low levels of phylogenetic signal. In addition, certain nodes, especially in the Caryophyllales, had highly elevated levels of strongly supported conflict that cannot be explained by ILS alone. This study demonstrates that phylogenetic signal is highly variable in phylogenomic data sampled across related species and poses challenges when conducting species tree analyses on large genomic and transcriptomic datasets. Further insight into the conflict and processes underlying these complex datasets is necessary to improve and develop adequate models for sequence analysis and downstream applications. To aid this effort, we developed the open source software phyparts ( https://bitbucket.org/blackrim/phyparts ), which calculates unique, conflicting, and concordant bipartitions, maps gene duplications, and outputs summary statistics such as internode certainy (ICA) scores and node-specific counts of gene duplications.

  4. Biomarkers of susceptibility following benzene exposure: influence of genetic polymorphisms on benzene metabolism and health effects.

    PubMed

    Carbonari, Damiano; Chiarella, Pieranna; Mansi, Antonella; Pigini, Daniela; Iavicoli, Sergio; Tranfo, Giovanna

    2016-01-01

    Benzene is a ubiquitous occupational and environmental pollutant. Improved industrial hygiene allowed airborne concentrations close to the environmental context (1-1000 µg/m(3)). Conversely, new limits for benzene levels in urban air were set (5 µg/m(3)). The biomonitoring of exposure to such low benzene concentrations are performed measuring specific and sensitive biomarkers such as S-phenylmercapturic acid, trans, trans-muconic acid and urinary benzene: many studies referred high variability in the levels of these biomarkers, suggesting the involvement of polymorphic metabolic genes in the individual susceptibility to benzene toxicity. We reviewed the influence of metabolic polymorphisms on the biomarkers levels of benzene exposure and effect, in order to understand the real impact of benzene exposure on subjects with increased susceptibility.

  5. Modeling Bi-modality Improves Characterization of Cell Cycle on Gene Expression in Single Cells

    PubMed Central

    Danaher, Patrick; Finak, Greg; Krouse, Michael; Wang, Alice; Webster, Philippa; Beechem, Joseph; Gottardo, Raphael

    2014-01-01

    Advances in high-throughput, single cell gene expression are allowing interrogation of cell heterogeneity. However, there is concern that the cell cycle phase of a cell might bias characterizations of gene expression at the single-cell level. We assess the effect of cell cycle phase on gene expression in single cells by measuring 333 genes in 930 cells across three phases and three cell lines. We determine each cell's phase non-invasively without chemical arrest and use it as a covariate in tests of differential expression. We observe bi-modal gene expression, a previously-described phenomenon, wherein the expression of otherwise abundant genes is either strongly positive, or undetectable within individual cells. This bi-modality is likely both biologically and technically driven. Irrespective of its source, we show that it should be modeled to draw accurate inferences from single cell expression experiments. To this end, we propose a semi-continuous modeling framework based on the generalized linear model, and use it to characterize genes with consistent cell cycle effects across three cell lines. Our new computational framework improves the detection of previously characterized cell-cycle genes compared to approaches that do not account for the bi-modality of single-cell data. We use our semi-continuous modelling framework to estimate single cell gene co-expression networks. These networks suggest that in addition to having phase-dependent shifts in expression (when averaged over many cells), some, but not all, canonical cell cycle genes tend to be co-expressed in groups in single cells. We estimate the amount of single cell expression variability attributable to the cell cycle. We find that the cell cycle explains only 5%–17% of expression variability, suggesting that the cell cycle will not tend to be a large nuisance factor in analysis of the single cell transcriptome. PMID:25032992

  6. Fine Analysis of Genetic Diversity of the tpr Gene Family among Treponemal Species, Subspecies and Strains

    PubMed Central

    Centurion-Lara, Arturo; Giacani, Lorenzo; Godornes, Charmie; Molini, Barbara J.; Brinck Reid, Tara; Lukehart, Sheila A.

    2013-01-01

    Background The pathogenic non-cultivable treponemes include three subspecies of Treponema pallidum (pallidum, pertenue, endemicum), T. carateum, T. paraluiscuniculi, and the unclassified Fribourg-Blanc treponeme (Simian isolate). These treponemes are morphologically indistinguishable and antigenically and genetically highly similar, yet cross-immunity is variable or non-existent. Although all of these organisms cause chronic, multistage skin and systemic disease, they have historically been classified by mode of transmission, clinical presentations and host ranges. Whole genome studies underscore the high degree of sequence identity among species, subspecies and strains, pinpointing a limited number of genomic regions for variation. Many of these “hot spots” include members of the tpr gene family, composed of 12 paralogs encoding candidate virulence factors. We hypothesize that the distinct clinical presentations, host specificity, and variable cross-immunity might reside on virulence factors such as the tpr genes. Methodology/Principal Findings Sequence analysis of 11 tpr loci (excluding tprK) from 12 strains demonstrated an impressive heterogeneity, including SNPs, indels, chimeric genes, truncated gene products and large deletions. Comparative analyses of sequences and 3D models of predicted proteins in Subfamily I highlight the striking co-localization of discrete variable regions with predicted surface-exposed loops. A hallmark of Subfamily II is the presence of chimeric genes in the tprG and J loci. Diversity in Subfamily III is limited to tprA and tprL. Conclusions/Significance An impressive sequence variability was found in tpr sequences among the Treponema isolates examined in this study, with most of the variation being consistent within subspecies or species, or between syphilis vs. non-syphilis strains. Variability was seen in the pallidum subspecies, which can be divided into 5 genogroups. These findings support a genetic basis for the classification of these organisms into their respective subspecies and species. Future functional studies will determine whether the identified genetic differences relate to cross-immunity, clinical differences, or host ranges. PMID:23696912

  7. Comparative genomic and plasmid analysis of beer-spoiling and non-beer-spoiling Lactobacillus brevis isolates.

    PubMed

    Bergsveinson, Jordyn; Ziola, Barry

    2017-12-01

    Beer-spoilage-related lactic acid bacteria (BSR LAB) belong to multiple genera and species; however, beer-spoilage capacity is isolate-specific and partially acquired via horizontal gene transfer within the brewing environment. Thus, the extent to which genus-, species-, or environment- (i.e., brewery-) level genetic variability influences beer-spoilage phenotype is unknown. Publicly available Lactobacillus brevis genomes were analyzed via BlAst Diagnostic Gene findEr (BADGE) for BSR genes and assessed for pangenomic relationships. Also analyzed were functional coding capacities of plasmids of LAB inhabiting extreme niche environments. Considerable genetic variation was observed in L. brevis isolated from clinical samples, whereas 16 candidate genes distinguish BSR and non-BSR L. brevis genomes. These genes are related to nutrient scavenging of gluconate or pentoses, mannose, and metabolism of pectin. BSR L. brevis isolates also have higher average nucleotide identity and stronger pangenome association with one another, though isolation source (i.e., specific brewery) also appears to influence the plasmid coding capacity of BSR LAB. Finally, it is shown that niche-specific adaptation and phenotype are plasmid-encoded for both BSR and non-BSR LAB. The ultimate combination of plasmid-encoded genes dictates the ability of L. brevis to survive in the most extreme beer environment, namely, gassed (i.e., pressurized) beer.

  8. A Diverse Repertoire of Human Immunoglobulin Variable Genes in a Chicken B Cell Line is Generated by Both Gene Conversion and Somatic Hypermutation.

    PubMed

    Leighton, Philip A; Schusser, Benjamin; Yi, Henry; Glanville, Jacob; Harriman, William

    2015-01-01

    Chicken immune responses to human proteins are often more robust than rodent responses because of the phylogenetic relationship between the different species. For discovery of a diverse panel of unique therapeutic antibody candidates, chickens therefore represent an attractive host for human-derived targets. Recent advances in monoclonal antibody technology, specifically new methods for the molecular cloning of antibody genes directly from primary B cells, has ushered in a new era of generating monoclonal antibodies from non-traditional host animals that were previously inaccessible through hybridoma technology. However, such monoclonals still require post-discovery humanization in order to be developed as therapeutics. To obviate the need for humanization, a modified strain of chickens could be engineered to express a human-sequence immunoglobulin variable region repertoire. Here, human variable genes introduced into the chicken immunoglobulin loci through gene targeting were evaluated for their ability to be recognized and diversified by the native chicken recombination machinery that is present in the B-lineage cell line DT40. After expansion in culture the DT40 population accumulated genetic mutants that were detected via deep sequencing. Bioinformatic analysis revealed that the human targeted constructs are performing as expected in the cell culture system, and provide a measure of confidence that they will be functional in transgenic animals.

  9. Synthetic spike-in standards for high-throughput 16S rRNA gene amplicon sequencing.

    PubMed

    Tourlousse, Dieter M; Yoshiike, Satowa; Ohashi, Akiko; Matsukura, Satoko; Noda, Naohiro; Sekiguchi, Yuji

    2017-02-28

    High-throughput sequencing of 16S rRNA gene amplicons (16S-seq) has become a widely deployed method for profiling complex microbial communities but technical pitfalls related to data reliability and quantification remain to be fully addressed. In this work, we have developed and implemented a set of synthetic 16S rRNA genes to serve as universal spike-in standards for 16S-seq experiments. The spike-ins represent full-length 16S rRNA genes containing artificial variable regions with negligible identity to known nucleotide sequences, permitting unambiguous identification of spike-in sequences in 16S-seq read data from any microbiome sample. Using defined mock communities and environmental microbiota, we characterized the performance of the spike-in standards and demonstrated their utility for evaluating data quality on a per-sample basis. Further, we showed that staggered spike-in mixtures added at the point of DNA extraction enable concurrent estimation of absolute microbial abundances suitable for comparative analysis. Results also underscored that template-specific Illumina sequencing artifacts may lead to biases in the perceived abundance of certain taxa. Taken together, the spike-in standards represent a novel bioanalytical tool that can substantially improve 16S-seq-based microbiome studies by enabling comprehensive quality control along with absolute quantification. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. Combining transcription factor binding affinities with open-chromatin data for accurate gene expression prediction

    PubMed Central

    Schmidt, Florian; Gasparoni, Nina; Gasparoni, Gilles; Gianmoena, Kathrin; Cadenas, Cristina; Polansky, Julia K.; Ebert, Peter; Nordström, Karl; Barann, Matthias; Sinha, Anupam; Fröhler, Sebastian; Xiong, Jieyi; Dehghani Amirabad, Azim; Behjati Ardakani, Fatemeh; Hutter, Barbara; Zipprich, Gideon; Felder, Bärbel; Eils, Jürgen; Brors, Benedikt; Chen, Wei; Hengstler, Jan G.; Hamann, Alf; Lengauer, Thomas; Rosenstiel, Philip; Walter, Jörn; Schulz, Marcel H.

    2017-01-01

    The binding and contribution of transcription factors (TF) to cell specific gene expression is often deduced from open-chromatin measurements to avoid costly TF ChIP-seq assays. Thus, it is important to develop computational methods for accurate TF binding prediction in open-chromatin regions (OCRs). Here, we report a novel segmentation-based method, TEPIC, to predict TF binding by combining sets of OCRs with position weight matrices. TEPIC can be applied to various open-chromatin data, e.g. DNaseI-seq and NOMe-seq. Additionally, Histone-Marks (HMs) can be used to identify candidate TF binding sites. TEPIC computes TF affinities and uses open-chromatin/HM signal intensity as quantitative measures of TF binding strength. Using machine learning, we find low affinity binding sites to improve our ability to explain gene expression variability compared to the standard presence/absence classification of binding sites. Further, we show that both footprints and peaks capture essential TF binding events and lead to a good prediction performance. In our application, gene-based scores computed by TEPIC with one open-chromatin assay nearly reach the quality of several TF ChIP-seq data sets. Finally, these scores correctly predict known transcriptional regulators as illustrated by the application to novel DNaseI-seq and NOMe-seq data for primary human hepatocytes and CD4+ T-cells, respectively. PMID:27899623

  11. Autism genetics: searching for specificity and convergence

    PubMed Central

    2012-01-01

    Advances in genetics and genomics have improved our understanding of autism spectrum disorders. As many genes have been implicated, we look to points of convergence among these genes across biological systems to better understand and treat these disorders. PMID:22849751

  12. Genome-wide computational analysis reveals cardiomyocyte-specific transcriptional Cis-regulatory motifs that enable efficient cardiac gene therapy.

    PubMed

    Rincon, Melvin Y; Sarcar, Shilpita; Danso-Abeam, Dina; Keyaerts, Marleen; Matrai, Janka; Samara-Kuko, Ermira; Acosta-Sanchez, Abel; Athanasopoulos, Takis; Dickson, George; Lahoutte, Tony; De Bleser, Pieter; VandenDriessche, Thierry; Chuah, Marinee K

    2015-01-01

    Gene therapy is a promising emerging therapeutic modality for the treatment of cardiovascular diseases and hereditary diseases that afflict the heart. Hence, there is a need to develop robust cardiac-specific expression modules that allow for stable expression of the gene of interest in cardiomyocytes. We therefore explored a new approach based on a genome-wide bioinformatics strategy that revealed novel cardiac-specific cis-acting regulatory modules (CS-CRMs). These transcriptional modules contained evolutionary-conserved clusters of putative transcription factor binding sites that correspond to a "molecular signature" associated with robust gene expression in the heart. We then validated these CS-CRMs in vivo using an adeno-associated viral vector serotype 9 that drives a reporter gene from a quintessential cardiac-specific α-myosin heavy chain promoter. Most de novo designed CS-CRMs resulted in a >10-fold increase in cardiac gene expression. The most robust CRMs enhanced cardiac-specific transcription 70- to 100-fold. Expression was sustained and restricted to cardiomyocytes. We then combined the most potent CS-CRM4 with a synthetic heart and muscle-specific promoter (SPc5-12) and obtained a significant 20-fold increase in cardiac gene expression compared to the cytomegalovirus promoter. This study underscores the potential of rational vector design to improve the robustness of cardiac gene therapy.

  13. Genome scanning for detecting adaptive genes along environmental gradients in the Japanese conifer, Cryptomeria japonica.

    PubMed

    Tsumura, Y; Uchiyama, K; Moriguchi, Y; Ueno, S; Ihara-Ujino, T

    2012-12-01

    Local adaptation is important in evolutionary processes and speciation. We used multiple tests to identify several candidate genes that may be involved in local adaptation from 1026 loci in 14 natural populations of Cryptomeria japonica, the most economically important forestry tree in Japan. We also studied the relationships between genotypes and environmental variables to obtain information on the selective pressures acting on individual populations. Outlier loci were mapped onto a linkage map, and the positions of loci associated with specific environmental variables are considered. The outlier loci were not randomly distributed on the linkage map; linkage group 11 was identified as a genomic island of divergence. Three loci in this region were also associated with environmental variables such as mean annual temperature, daily maximum temperature, maximum snow depth, and so on. Outlier loci identified with high significance levels will be essential for conservation purposes and for future work on molecular breeding.

  14. Genome-Wide Identification and Expression Analyses of Aquaporin Gene Family during Development and Abiotic Stress in Banana

    PubMed Central

    Hu, Wei; Hou, Xiaowan; Huang, Chao; Yan, Yan; Tie, Weiwei; Ding, Zehong; Wei, Yunxie; Liu, Juhua; Miao, Hongxia; Lu, Zhiwei; Li, Meiying; Xu, Biyu; Jin, Zhiqiang

    2015-01-01

    Aquaporins (AQPs) function to selectively control the flow of water and other small molecules through biological membranes, playing crucial roles in various biological processes. However, little information is available on the AQP gene family in bananas. In this study, we identified 47 banana AQP genes based on the banana genome sequence. Evolutionary analysis of AQPs from banana, Arabidopsis, poplar, and rice indicated that banana AQPs (MaAQPs) were clustered into four subfamilies. Conserved motif analysis showed that all banana AQPs contained the typical AQP-like or major intrinsic protein (MIP) domain. Gene structure analysis suggested the majority of MaAQPs had two to four introns with a highly specific number and length for each subfamily. Expression analysis of MaAQP genes during fruit development and postharvest ripening showed that some MaAQP genes exhibited high expression levels during these stages, indicating the involvement of MaAQP genes in banana fruit development and ripening. Additionally, some MaAQP genes showed strong induction after stress treatment and therefore, may represent potential candidates for improving banana resistance to abiotic stress. Taken together, this study identified some excellent tissue-specific, fruit development- and ripening-dependent, and abiotic stress-responsive candidate MaAQP genes, which could lay a solid foundation for genetic improvement of banana cultivars. PMID:26307965

  15. A Prediction Algorithm for Drug Response in Patients with Mesial Temporal Lobe Epilepsy Based on Clinical and Genetic Information

    PubMed Central

    Carvalho, Benilton S.; Bilevicius, Elizabeth; Alvim, Marina K. M.; Lopes-Cendes, Iscia

    2017-01-01

    Mesial temporal lobe epilepsy is the most common form of adult epilepsy in surgical series. Currently, the only characteristic used to predict poor response to clinical treatment in this syndrome is the presence of hippocampal sclerosis. Single nucleotide polymorphisms (SNPs) located in genes encoding drug transporter and metabolism proteins could influence response to therapy. Therefore, we aimed to evaluate whether combining information from clinical variables as well as SNPs in candidate genes could improve the accuracy of predicting response to drug therapy in patients with mesial temporal lobe epilepsy. For this, we divided 237 patients into two groups: 75 responsive and 162 refractory to antiepileptic drug therapy. We genotyped 119 SNPs in ABCB1, ABCC2, CYP1A1, CYP1A2, CYP1B1, CYP2C9, CYP2C19, CYP2D6, CYP2E1, CYP3A4, and CYP3A5 genes. We used 98 additional SNPs to evaluate population stratification. We assessed a first scenario using only clinical variables and a second one including SNP information. The random forests algorithm combined with leave-one-out cross-validation was used to identify the best predictive model in each scenario and compared their accuracies using the area under the curve statistic. Additionally, we built a variable importance plot to present the set of most relevant predictors on the best model. The selected best model included the presence of hippocampal sclerosis and 56 SNPs. Furthermore, including SNPs in the model improved accuracy from 0.4568 to 0.8177. Our findings suggest that adding genetic information provided by SNPs, located on drug transport and metabolism genes, can improve the accuracy for predicting which patients with mesial temporal lobe epilepsy are likely to be refractory to drug treatment, making it possible to identify patients who may benefit from epilepsy surgery sooner. PMID:28052106

  16. Construction of an alternative glycerol-utilization pathway for improved β-carotene production in Escherichia coli.

    PubMed

    Guo, Jin-Ying; Hu, Kun-Le; Bi, Chang-Hao; Li, Qing-Yan; Zhang, Xue-Li

    2018-05-11

    Glycerol, which is an inevitable by-product of biodiesel production, is an ideal carbon source for the production of carotenoids due to its low price, good availability and chemically reduced status, which results in a low requirement for additional reducing equivalents. In this study, an alternative carbon-utilization pathway was constructed in Escherichia coli to enable more efficient β-carotene production from glycerol. An aldehyde reductase gene (alrd) and an aldehyde dehydrogenase gene (aldH) from Ralstonia eutropha H16 were integrated into the E. coli chromosome to form a novel glycerol-utilization pathway. The β-carotene specific production value was increased by 50% after the introduction of alrd and aldH. It was found that the glycerol kinase gene (garK), alrd and aldH were the bottleneck of the alternative glycerol metabolic pathway, and modulation of garK gene with an mRS library further increased the β-carotene specific production value by 13%. Finally, co-modulation of genes in the introduced aldH-alrd operon led to 86% more of β-carotene specific production value than that of the strain without the alternative glycerol-utilization pathway and the glycerol-utilization rate was also increased. In this work, β-carotene production of E. coli was significantly improved by constructing and optimizing an alternative glycerol-utilization pathway. This strategy can potentially be used to improve the production of other isoprenoids using glycerol as a cheap and abundant substrate, and therefore has industrial relevance.

  17. Pulsed Irradiation Improves Target Selectivity of Infrared Laser-Evoked Gene Operator for Single-Cell Gene Induction in the Nematode C. elegans

    PubMed Central

    Suzuki, Motoshi; Toyoda, Naoya; Takagi, Shin

    2014-01-01

    Methods for turning on/off gene expression at the experimenter’s discretion would be useful for various biological studies. Recently, we reported on a novel microscope system utilizing an infrared laser-evoked gene operator (IR-LEGO) designed for inducing heat shock response efficiently in targeted single cells in living organisms without cell damage, thereby driving expression of a transgene under the control of a heat shock promoter. Although the original IR-LEGO can be successfully used for gene induction, several limitations hinder its wider application. Here, using the nematode Caenorhabditis elegans (C. elegans) as a subject, we have made improvements in IR-LEGO. For better spatial control of heating, a pulsed irradiation method using an optical chopper was introduced. As a result, single cells of C. elegans embryos as early as the 2-cell stage and single neurons in ganglia can be induced to express genes selectively. In addition, the introduction of site-specific recombination systems to IR-LEGO enables the induction of gene expression controlled by constitutive and cell type-specific promoters. The strategies adopted here will be useful for future applications of IR-LEGO to other organisms. PMID:24465705

  18. Gene expression variability in human hepatic drug metabolizing enzymes and transporters.

    PubMed

    Yang, Lun; Price, Elvin T; Chang, Ching-Wei; Li, Yan; Huang, Ying; Guo, Li-Wu; Guo, Yongli; Kaput, Jim; Shi, Leming; Ning, Baitang

    2013-01-01

    Interindividual variability in the expression of drug-metabolizing enzymes and transporters (DMETs) in human liver may contribute to interindividual differences in drug efficacy and adverse reactions. Published studies that analyzed variability in the expression of DMET genes were limited by sample sizes and the number of genes profiled. We systematically analyzed the expression of 374 DMETs from a microarray data set consisting of gene expression profiles derived from 427 human liver samples. The standard deviation of interindividual expression for DMET genes was much higher than that for non-DMET genes. The 20 DMET genes with the largest variability in the expression provided examples of the interindividual variation. Gene expression data were also analyzed using network analysis methods, which delineates the similarities of biological functionalities and regulation mechanisms for these highly variable DMET genes. Expression variability of human hepatic DMET genes may affect drug-gene interactions and disease susceptibility, with concomitant clinical implications.

  19. Clinical and multiple gene expression variables in survival analysis of breast cancer: Analysis with the hypertabastic survival model

    PubMed Central

    2012-01-01

    Background We explore the benefits of applying a new proportional hazard model to analyze survival of breast cancer patients. As a parametric model, the hypertabastic survival model offers a closer fit to experimental data than Cox regression, and furthermore provides explicit survival and hazard functions which can be used as additional tools in the survival analysis. In addition, one of our main concerns is utilization of multiple gene expression variables. Our analysis treats the important issue of interaction of different gene signatures in the survival analysis. Methods The hypertabastic proportional hazards model was applied in survival analysis of breast cancer patients. This model was compared, using statistical measures of goodness of fit, with models based on the semi-parametric Cox proportional hazards model and the parametric log-logistic and Weibull models. The explicit functions for hazard and survival were then used to analyze the dynamic behavior of hazard and survival functions. Results The hypertabastic model provided the best fit among all the models considered. Use of multiple gene expression variables also provided a considerable improvement in the goodness of fit of the model, as compared to use of only one. By utilizing the explicit survival and hazard functions provided by the model, we were able to determine the magnitude of the maximum rate of increase in hazard, and the maximum rate of decrease in survival, as well as the times when these occurred. We explore the influence of each gene expression variable on these extrema. Furthermore, in the cases of continuous gene expression variables, represented by a measure of correlation, we were able to investigate the dynamics with respect to changes in gene expression. Conclusions We observed that use of three different gene signatures in the model provided a greater combined effect and allowed us to assess the relative importance of each in determination of outcome in this data set. These results point to the potential to combine gene signatures to a greater effect in cases where each gene signature represents some distinct aspect of the cancer biology. Furthermore we conclude that the hypertabastic survival models can be an effective survival analysis tool for breast cancer patients. PMID:23241496

  20. Structural polymorphism at LCR and its role in beta-globin gene regulation.

    PubMed

    Kukreti, Shrikant; Kaur, Harpreet; Kaushik, Mahima; Bansal, Aparna; Saxena, Sarika; Kaushik, Shikha; Kukreti, Ritushree

    2010-09-01

    Information on the secondary structures and conformational manifestations of eukaryotic DNA and their biological significance with reference to gene regulation and expression is limited. The human beta-globin gene Locus Control Region (LCR), a dominant regulator of globin gene expression, is a contiguous piece of DNA with five tissue-specific DNase I-hypersensitive sites (HSs). Since these HSs have a high density of transcription factor binding sites, structural interdependencies between HSs and different promoters may directly or indirectly regulate LCR functions. Mutations and SNPs may stabilize or destabilize the local secondary structures, affecting the gene expression by changes in the protein-DNA recognition patterns. Various palindromic or quasi-palindromic segments within LCR, could cause structural polymorphism and geometrical switching of DNA. This emphasizes the importance of understanding of the sequence-dependent variations of the DNA structure. Such structural motifs might act as regulatory elements. The local conformational variability of a DNA segment or action of a DNA specific protein is key to create and maintain active chromatin domains and affect transcription of various tissue specific beta-globin genes. We, summarize here the current status of beta-globin LCR structure and function. Further structural studies at molecular level and functional genomics might solve the regulatory puzzles that control the beta-globin gene locus. Copyright (c) 2010 Elsevier Masson SAS. All rights reserved.

  1. Highly Effective Serodiagnosis for Chagas' Disease ▿

    PubMed Central

    Hernández, Pilar; Heimann, Michael; Riera, Cristina; Solano, Marco; Santalla, José; Luquetti, Alejandro O.; Beck, Ewald

    2010-01-01

    Many proteins of Trypanosoma cruzi, the causative agent of Chagas' disease, contain characteristic arrays of highly repetitive immunogenic amino acid motifs. Diagnostic tests using these motifs in monomeric or dimeric form have proven to provide markedly improved specificity compared to conventional tests based on crude parasite extracts. However, in many cases the available tests still suffer from limited sensitivity. In this study we produced stable synthetic genes with maximal codon variability for the four diagnostic antigens, B13, CRA, TcD, and TcE, each containing between three and nine identical amino acid repeats. These genes were combined by linker sequences encoding short proline-rich peptides, giving rise to a 24-kDa fusion protein which was used as a novel diagnostic antigen in an enzyme-linked immunosorbent assay setup. Validation of the assay with a large number of well-characterized patient sera from Bolivia and Brazil revealed excellent diagnostic performance. The high sensitivity of the new test may allow future studies to use blood collected by finger prick and dried on filter paper, thus dramatically reducing the costs and effort for the detection of T. cruzi infection. PMID:20668136

  2. Improved heterologous production of the nonribosomal peptide-polyketide siderophore yersiniabactin through metabolic engineering and induction optimization.

    PubMed

    Ahmadi, Mahmoud Kamal; Pfeifer, Blaine A

    2016-11-01

    Biosynthesis of complex natural products like polyketides and nonribosomal peptides using Escherichia coli as a heterologous host provides an opportunity to access these molecules. The value in doing so stems from the fact that many compounds hold some therapeutic or other beneficial property and their original production hosts are intractable for a variety of reasons. In this work, metabolic engineering and induction variable optimization were used to increase production of the polyketide-nonribosomal peptide compound yersiniabactin, a siderophore that has been utilized to selectively remove metals from various solid and aqueous samples. Specifically, several precursor substrate support pathways were altered through gene expression and exogenous supplementation in order to boost production of the final compound. The gene expression induction process was also analyzed to identify the temperatures and inducer concentrations resulting in highest final production levels. When combined, yersiniabactin production was extended to ∼175 mg L -1 . © 2016 American Institute of Chemical Engineers Biotechnol. Prog., 32:1412-1417, 2016. © 2016 American Institute of Chemical Engineers.

  3. Asialoglycoprotein receptor 1 is a specific cell-surface marker for isolating hepatocytes derived from human pluripotent stem cells.

    PubMed

    Peters, Derek T; Henderson, Christopher A; Warren, Curtis R; Friesen, Max; Xia, Fang; Becker, Caroline E; Musunuru, Kiran; Cowan, Chad A

    2016-05-01

    Hepatocyte-like cells (HLCs) are derived from human pluripotent stem cells (hPSCs) in vitro, but differentiation protocols commonly give rise to a heterogeneous mixture of cells. This variability confounds the evaluation of in vitro functional assays performed using HLCs. Increased differentiation efficiency and more accurate approximation of the in vivo hepatocyte gene expression profile would improve the utility of hPSCs. Towards this goal, we demonstrate the purification of a subpopulation of functional HLCs using the hepatocyte surface marker asialoglycoprotein receptor 1 (ASGR1). We analyzed the expression profile of ASGR1-positive cells by microarray, and tested their ability to perform mature hepatocyte functions (albumin and urea secretion, cytochrome activity). By these measures, ASGR1-positive HLCs are enriched for the gene expression profile and functional characteristics of primary hepatocytes compared with unsorted HLCs. We have demonstrated that ASGR1-positive sorting isolates a functional subpopulation of HLCs from among the heterogeneous cellular population produced by directed differentiation. © 2016. Published by The Company of Biologists Ltd.

  4. Gene and Chromosomal Copy Number Variations as an Adaptive Mechanism Towards a Parasitic Lifestyle in Trypanosomatids.

    PubMed

    Reis-Cunha, João Luís; Valdivia, Hugo O; Bartholomeu, Daniella Castanheira

    2018-02-01

    Trypanosomatids are a group of kinetoplastid parasites including some of great public health importance, causing debilitating and life-long lasting diseases that affect more than 24 million people worldwide. Among the trypanosomatids, Trypanosoma cruzi, Trypanosoma brucei and species from the Leishmania genus are the most well studied parasites, due to their high prevalence in human infections. These parasites have an extreme genomic and phenotypic variability, with a massive expansion in the copy number of species-specific multigene families enrolled in host-parasite interactions that mediate cellular invasion and immune evasion processes. As most trypanosomatids are heteroxenous, and therefore their lifecycles involve the transition between different hosts, these parasites have developed several strategies to ensure a rapid adaptation to changing environments. Among these strategies, a rapid shift in the repertoire of expressed genes, genetic variability and genome plasticity are key mechanisms. Trypanosomatid genomes are organized into large directional gene clusters that are transcribed polycistronically, where genes derived from the same polycistron may have very distinct mRNA levels. This particular mode of transcription implies that the control of gene expression operates mainly at post-transcriptional level. In this sense, gene duplications/losses were already associated with changes in mRNA levels in these parasites. Gene duplications also allow the generation of sequence variability, as the newly formed copy can diverge without loss of function of the original copy. Recently, aneuploidies have been shown to occur in several Leishmania species and T. cruzi strains. Although aneuploidies are usually associated with debilitating phenotypes in superior eukaryotes, recent data shows that it could also provide increased fitness in stress conditions and generate drug resistance in unicellular eukaryotes. In this review, we will focus on gene and chromosomal copy number variations and their relevance to the evolution of trypanosomatid parasites.

  5. Genetic engineering of microbial pesticides

    Treesearch

    Bruce C. Carlton

    1985-01-01

    Recent advances in genetics and molecular biology make possible the cloning and genetic manipulation of genes for insecticidal activities from natural insect pathogens. Using recombinant DNA methods and site-directed mutagenesis of specific gene regions, production of new and improved biorationals should be possible.

  6. Same MSH2 Gene Mutation But Variable Phenotypes in 2 Families With Lynch Syndrome: Two Case Reports and Review of Genotype-Phenotype Correlation.

    PubMed

    Liccardo, Raffaella; De Rosa, Marina; Duraturo, Francesca

    2018-01-01

    Lynch syndrome is an autosomal dominant syndrome that can be subdivided into Lynch syndrome I, or site-specific colonic cancer, and Lynch syndrome II, or extracolonic cancers, particularly carcinomas of the stomach, endometrium, biliary and pancreatic systems, and urinary tract. Lynch syndrome is associated with point mutations and large rearrangements in DNA MisMatch Repair ( MMR ) genes. This syndrome shows a variable phenotypic expression in people who carry pathogenetic mutations. So far, a correlation in genotype-phenotype has not been definitely established. In this study, we describe 2 Lynch syndrome cases presenting with the same genotype but different phenotypes and discuss possible reasons for this.

  7. Bivalve-specific gene expansion in the pearl oyster genome: implications of adaptation to a sessile lifestyle.

    PubMed

    Takeuchi, Takeshi; Koyanagi, Ryo; Gyoja, Fuki; Kanda, Miyuki; Hisata, Kanako; Fujie, Manabu; Goto, Hiroki; Yamasaki, Shinichi; Nagai, Kiyohito; Morino, Yoshiaki; Miyamoto, Hiroshi; Endo, Kazuyoshi; Endo, Hirotoshi; Nagasawa, Hiromichi; Kinoshita, Shigeharu; Asakawa, Shuichi; Watabe, Shugo; Satoh, Noriyuki; Kawashima, Takeshi

    2016-01-01

    Bivalve molluscs have flourished in marine environments, and many species constitute important aquatic resources. Recently, whole genome sequences from two bivalves, the pearl oyster, Pinctada fucata, and the Pacific oyster, Crassostrea gigas, have been decoded, making it possible to compare genomic sequences among molluscs, and to explore general and lineage-specific genetic features and trends in bivalves. In order to improve the quality of sequence data for these purposes, we have updated the entire P. fucata genome assembly. We present a new genome assembly of the pearl oyster, Pinctada fucata (version 2.0). To update the assembly, we conducted additional sequencing, obtaining accumulated sequence data amounting to 193× the P. fucata genome. Sequence redundancy in contigs that was caused by heterozygosity was removed in silico, which significantly improved subsequent scaffolding. Gene model version 2.0 was generated with the aid of manual gene annotations supplied by the P. fucata research community. Comparison of mollusc and other bilaterian genomes shows that gene arrangements of Hox, ParaHox, and Wnt clusters in the P. fucata genome are similar to those of other molluscs. Like the Pacific oyster, P. fucata possesses many genes involved in environmental responses and in immune defense. Phylogenetic analyses of heat shock protein70 and C1q domain-containing protein families indicate that extensive expansion of genes occurred independently in each lineage. Several gene duplication events prior to the split between the pearl oyster and the Pacific oyster are also evident. In addition, a number of tandem duplications of genes that encode shell matrix proteins are also well characterized in the P. fucata genome. Both the Pinctada and Crassostrea lineages have expanded specific gene families in a lineage-specific manner. Frequent duplication of genes responsible for shell formation in the P. fucata genome explains the diversity of mollusc shell structures. These duplications reveal dynamic genome evolution to forge the complex physiology that enables bivalves to employ a sessile lifestyle in the intertidal zone.

  8. Acute myeloid/T-lymphoblastic leukaemia (AMTL): a distinct category of acute leukaemias with common pathogenesis in need of improved therapy.

    PubMed

    Gutierrez, Alejandro; Kentsis, Alex

    2018-03-01

    Advances in the classification of acute leukaemias have led to improved outcomes for a substantial fraction of patients. However, chemotherapy resistance remains a major problem for specific subsets of acute leukaemias. Here, we propose that a molecularly distinct subtype of acute leukaemia with shared myeloid and T cell lymphoblastic features, which we term acute myeloid/T-lymphoblastic leukaemia (AMTL), is divided across 3 diagnostic categories owing to variable expression of markers deemed to be defining of myeloid and T-lymphoid lineages, such as myeloperoxidase and CD3. This proposed diagnostic group is supported by (i) retained myeloid differentiation potential during early T cell lymphoid development, (ii) recognition that some cases of acute myeloid leukaemia (AML) harbour hallmarks of T cell development, such as T-cell receptor gene rearrangements and (iii) common gene mutations in subsets of AML and T cell acute lymphoblastic leukaemia (T-ALL), including WT1, PHF6, RUNX1 and BCL11B. This proposed diagnostic entity overlaps with early T cell precursor (ETP) T-ALL and T cell/myeloid mixed phenotype acute leukaemias (MPALs), and also includes a subset of leukaemias currently classified as AML with features of T-lymphoblastic development. The proposed classification of AMTL as a distinct entity would enable more precise prospective diagnosis and permit the development of improved therapies for patients whose treatment is inadequate with current approaches. © 2018 John Wiley & Sons Ltd.

  9. Identification of Optimal Epitopes for Plasmodium falciparum Rapid Diagnostic Tests That Target Histidine-Rich Proteins 2 and 3

    PubMed Central

    Lee, Nelson; Gatton, Michelle L.; Pelecanos, Anita; Bubb, Martin; Gonzalez, Iveth; Bell, David; Cheng, Qin

    2012-01-01

    Rapid diagnostic tests (RDTs) represent important tools to diagnose malaria infection. To improve understanding of the variable performance of RDTs that detect the major target in Plasmodium falciparum, namely, histidine-rich protein 2 (HRP2), and to inform the design of better tests, we undertook detailed mapping of the epitopes recognized by eight HRP-specific monoclonal antibodies (MAbs). To investigate the geographic skewing of this polymorphic protein, we analyzed the distribution of these epitopes in parasites from geographically diverse areas. To identify an ideal amino acid motif for a MAb to target in HRP2 and in the related protein HRP3, we used a purpose-designed script to perform bioinformatic analysis of 448 distinct gene sequences from pfhrp2 and from 99 sequences from the closely related gene pfhrp3. The frequency and distribution of these motifs were also compared to the MAb epitopes. Heat stability testing of MAbs immobilized on nitrocellulose membranes was also performed. Results of these experiments enabled the identification of MAbs with the most desirable characteristics for inclusion in RDTs, including copy number and coverage of target epitopes, geographic skewing, heat stability, and match with the most abundant amino acid motifs identified. This study therefore informs the selection of MAbs to include in malaria RDTs as well as in the generation of improved MAbs that should improve the performance of HRP-detecting malaria RDTs. PMID:22259210

  10. Delivery methods for site-specific nucleases: Achieving the full potential of therapeutic gene editing.

    PubMed

    Liu, Jia; Shui, Sai-Lan

    2016-12-28

    The advent of site-specific nucleases, particularly CRISPR/Cas9, provides researchers with the unprecedented ability to manipulate genomic sequences. These nucleases are used to create model cell lines, engineer metabolic pathways, produce transgenic animals and plants, perform genome-wide functional screen and, most importantly, treat human diseases that are difficult to tackle by traditional medications. Considerable efforts have been devoted to improving the efficiency and specificity of nucleases for clinical applications. However, safe and efficient delivery methods remain the major obstacle for therapeutic gene editing. In this review, we summarize the recent progress on nuclease delivery methods, highlight their impact on the outcomes of gene editing and discuss the potential of different delivery approaches for therapeutic gene editing. Copyright © 2016 Elsevier B.V. All rights reserved.

  11. African-specific variability in the acetylcholine muscarinic receptor M4: association with cocaine and heroin addiction.

    PubMed

    Levran, Orna; Randesi, Matthew; Peles, Einat; Correa da Rosa, Joel; Ott, Jurg; Rotrosen, John; Adelson, Miriam; Kreek, Mary Jeanne

    2016-06-01

    This study was designed to determine whether polymorphisms in acetylcholine receptors contribute to opioid dependence and/or cocaine dependence. The sample (n = 1860) was divided by drug and ancestry, and 55 polymorphisms (nine genes) were analyzed. Of the 20 SNPs that showed nominally significant associations, the association of the African-specific CHRM4 SNP rs2229163 (Asn417=) with cocaine dependence survived correction for multiple testing (Pcorrected = 0.047). CHRM4 is located in a region of strong linkage disequilibrium on chromosome 11 that includes genes associated with schizophrenia. CHRM4 SNP rs2229163 is in strong linkage disequilibrium with several African-specific SNPs in DGKZ and AMBRA1. Cholinergic receptors' variants may contribute to drug addiction and have a potential role as pharmacogenetic markers.

  12. Association of DNA methylation and monoamine oxidase A gene expression in the brains of different dog breeds.

    PubMed

    Eo, JungWoo; Lee, Hee-Eun; Nam, Gyu-Hwi; Kwon, Yun-Jeong; Choi, Yuri; Choi, Bong-Hwan; Huh, Jae-Won; Kim, Minkyu; Lee, Sang-Eun; Seo, Bohyun; Kim, Heui-Soo

    2016-04-15

    The monoamine oxidase A (MAOA) gene is an important candidate gene for human behavior that encodes an enzyme regulating the metabolism of key neurotransmitters. The regulatory mechanisms of the MAOA gene in dogs are yet to be elucidated. We measured MAOA gene transcription and analyzed the VNTR genotype and methylation status of the gene promoter region in different dog breeds to determine whether MAOA expression is correlated with the MAOA genotype or epigenetic modification in dogs. We found brain-specific expression of the MAOA gene and different transcription levels in different dog breeds including Beagle, Sapsaree, and German shepherd, and also a robust association of the DNA methylation of the gene promoter with mRNA levels. However, the 90 bp tandem repeats that we observed near the transcription start site were not variable, indicating no correlation with canine MAOA activity. These results show that differential DNA methylation in the MAOA promoter region may affect gene expression by modulating promoter activity. Moreover, the distinctive patterns of MAOA expression and DNA methylation may be involved in breed-specific or individual behavioral characteristics, such as aggression, because behavioral phenotypes are related to different physiological and neuroendocrine responses. Copyright © 2016 Elsevier B.V. All rights reserved.

  13. High Inter-Individual Diversity of Point Mutations, Insertions, and Deletions in Human Influenza Virus Nucleoprotein-Specific Memory B Cells

    PubMed Central

    Bussmann, Bianca M.; Horn, Susanne; Sieg, Michael; Jassoy, Christian

    2015-01-01

    The diversity of virus-specific antibodies and of B cells among different individuals is unknown. Using single-cell cloning of antibody genes, we generated recombinant human monoclonal antibodies from influenza nucleoprotein-specific memory B cells in four adult humans with and without preceding influenza vaccination. We examined the diversity of the antibody repertoires and found that NP-specific B cells used numerous immunoglobulin genes. The heavy chains (HCs) originated from 26 and the kappa light chains (LCs) from 19 different germ line genes. Matching HC and LC chains gave rise to 43 genetically distinct antibodies that bound influenza NP. The median lengths of the CDR3 of the HC, kappa and lambda LC were 14, 9 and 11 amino acids, respectively. We identified changes at 13.6% of the amino acid positions in the V gene of the antibody heavy chain, at 8.4 % in the kappa and at 10.6 % in the lambda V gene. We identified somatic insertions or deletions in 8.1% of the variable genes. We also found several small groups of clonal relatives that were highly diversified. Our findings demonstrate broadly diverse memory B cell repertoires for the influenza nucleoprotein. We found extensive variation within individuals with a high number of point mutations, insertions, and deletions, and extensive clonal diversification. Thus, structurally conserved proteins can elicit broadly diverse and highly mutated B-cell responses. PMID:26086076

  14. Antibody-independent Targeted Quantification of TMPRSS2-ERG Fusion Protein Products in Prostate Cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    He, Jintang; Sun, Xuefei; Shi, Tujin

    2014-10-01

    Fusions between the transmembrane protease serine 2 (TMPRSS2) and ETS related gene (ERG) represent one of the most specific biomarkers that define a distinct molecular subtype of prostate cancer. The studies on TMPRSS2-ERG gene fusions have seldom been performed at the protein level, primarily due to the lack of high-quality antibodies or an antibody-independent method that is sufficiently sensitive for detecting the truncated ERG protein products resulting from TMPRSS2-ERG gene fusions and alternative splicing. Herein, we applied a recently developed PRISM (high-pressure high-resolution separations with intelligent selection and multiplexing)-SRM (selected reaction monitoring) strategy for quantifying ERG protein in prostate cancermore » cell lines and tumors. The highly sensitive PRISM-SRM assays led to confident detection of 6 unique ERG peptides in either the TMPRSS2-ERG positive cell lines or tissues but not in the negative controls, indicating that ERG protein expression is highly correlated with TMPRSS2-ERG gene rearrangements. Significantly, our results demonstrated for the first time that at least two groups of ERG protein isoforms were simultaneously expressed at variable levels in TMPRSS2-ERG positive samples as evidenced by concomitant detection of two mutually exclusive peptides. Three peptides shared across almost all fusion protein products were determined to be the most abundant peptides, and hence can be used as “signature” peptides for detecting ERG overexpression resulting from TMPRSS2-ERG gene fusion. These PRISM-SRM assays provide valuable tools for studying TMPRSS2-ERG gene fusion protein products, thus improving our understanding of the role of TMPRSS2-ERG gene fusion in the biology of prostate cancer.« less

  15. Molecular Structure-Based Large-Scale Prediction of Chemical-Induced Gene Expression Changes.

    PubMed

    Liu, Ruifeng; AbdulHameed, Mohamed Diwan M; Wallqvist, Anders

    2017-09-25

    The quantitative structure-activity relationship (QSAR) approach has been used to model a wide range of chemical-induced biological responses. However, it had not been utilized to model chemical-induced genomewide gene expression changes until very recently, owing to the complexity of training and evaluating a very large number of models. To address this issue, we examined the performance of a variable nearest neighbor (v-NN) method that uses information on near neighbors conforming to the principle that similar structures have similar activities. Using a data set of gene expression signatures of 13 150 compounds derived from cell-based measurements in the NIH Library of Integrated Network-based Cellular Signatures program, we were able to make predictions for 62% of the compounds in a 10-fold cross validation test, with a correlation coefficient of 0.61 between the predicted and experimentally derived signatures-a reproducibility rivaling that of high-throughput gene expression measurements. To evaluate the utility of the predicted gene expression signatures, we compared the predicted and experimentally derived signatures in their ability to identify drugs known to cause specific liver, kidney, and heart injuries. Overall, the predicted and experimentally derived signatures had similar receiver operating characteristics, whose areas under the curve ranged from 0.71 to 0.77 and 0.70 to 0.73, respectively, across the three organ injury models. However, detailed analyses of enrichment curves indicate that signatures predicted from multiple near neighbors outperformed those derived from experiments, suggesting that averaging information from near neighbors may help improve the signal from gene expression measurements. Our results demonstrate that the v-NN method can serve as a practical approach for modeling large-scale, genomewide, chemical-induced, gene expression changes.

  16. Phenotype classification of single cells using SRS microscopy, RNA sequencing, and microfluidics (Conference Presentation)

    NASA Astrophysics Data System (ADS)

    Streets, Aaron M.; Cao, Chen; Zhang, Xiannian; Huang, Yanyi

    2016-03-01

    Phenotype classification of single cells reveals biological variation that is masked in ensemble measurement. This heterogeneity is found in gene and protein expression as well as in cell morphology. Many techniques are available to probe phenotypic heterogeneity at the single cell level, for example quantitative imaging and single-cell RNA sequencing, but it is difficult to perform multiple assays on the same single cell. In order to directly track correlation between morphology and gene expression at the single cell level, we developed a microfluidic platform for quantitative coherent Raman imaging and immediate RNA sequencing (RNA-Seq) of single cells. With this device we actively sort and trap cells for analysis with stimulated Raman scattering microscopy (SRS). The cells are then processed in parallel pipelines for lysis, and preparation of cDNA for high-throughput transcriptome sequencing. SRS microscopy offers three-dimensional imaging with chemical specificity for quantitative analysis of protein and lipid distribution in single cells. Meanwhile, the microfluidic platform facilitates single-cell manipulation, minimizes contamination, and furthermore, provides improved RNA-Seq detection sensitivity and measurement precision, which is necessary for differentiating biological variability from technical noise. By combining coherent Raman microscopy with RNA sequencing, we can better understand the relationship between cellular morphology and gene expression at the single-cell level.

  17. Comparative analysis of lentiviral vectors and modular protein nanovectors for traumatic brain injury gene therapy

    PubMed Central

    Negro-Demontel, María Luciana; Saccardo, Paolo; Giacomini, Cecilia; Yáñez-Muñoz, Rafael Joaquín; Ferrer-Miralles, Neus; Vazquez, Esther; Villaverde, Antonio; Peluffo, Hugo

    2014-01-01

    Traumatic brain injury (TBI) remains as one of the leading causes of mortality and morbidity worldwide and there are no effective treatments currently available. Gene therapy applications have emerged as important alternatives for the treatment of diverse nervous system injuries. New strategies are evolving with the notion that each particular pathological condition may require a specific vector. Moreover, the lack of detailed comparative studies between different vectors under similar conditions hampers the selection of an ideal vector for a given pathological condition. The potential use of lentiviral vectors versus several modular protein-based nanovectors was compared using a controlled cortical impact model of TBI under the same gene therapy conditions. We show that variables such as protein/DNA ratio, incubation volume, and presence of serum or chloroquine in the transfection medium impact on both nanovector formation and transfection efficiency in vitro. While lentiviral vectors showed GFP protein 1 day after TBI and increased expression at 14 days, nanovectors showed stable and lower GFP transgene expression from 1 to 14 days. No toxicity after TBI by any of the vectors was observed as determined by resulting levels of IL-1β or using neurological sticky tape test. In fact, both vector types induced functional improvement per se. PMID:26015985

  18. Flagellin diversity in Clostridium botulinum groups I and II: a new strategy for strain identification.

    PubMed

    Paul, Catherine J; Twine, Susan M; Tam, Kevin J; Mullen, James A; Kelly, John F; Austin, John W; Logan, Susan M

    2007-05-01

    Strains of Clostridium botulinum are traditionally identified by botulinum neurotoxin type; however, identification of an additional target for typing would improve differentiation. Isolation of flagellar filaments and analysis by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) showed that C. botulinum produced multiple flagellin proteins. Nano-liquid chromatography-tandem mass spectrometry (nLC-MS/MS) analysis of in-gel tryptic digests identified peptides in all flagellin bands that matched two homologous tandem flagellin genes identified in the C. botulinum Hall A genome. Designated flaA1 and flaA2, these open reading frames encode the major structural flagellins of C. botulinum. Colony PCR and sequencing of flaA1/A2 variable regions classified 80 environmental and clinical strains into group I or group II and clustered isolates into 12 flagellar types. Flagellar type was distinct from neurotoxin type, and epidemiologically related isolates clustered together. Sequencing a larger PCR product, obtained during amplification of flaA1/A2 from type E strain Bennett identified a second flagellin gene, flaB. LC-MS analysis confirmed that flaB encoded a large type E-specific flagellin protein, and the predicted molecular mass for FlaB matched that observed by SDS-PAGE. In contrast, the molecular mass of FlaA was 2 to 12 kDa larger than the mass predicted by the flaA1/A2 sequence of a given strain, suggesting that FlaA is posttranslationally modified. While identification of FlaB, and the observation by SDS-PAGE of different masses of the FlaA proteins, showed the flagellin proteins of C. botulinum to be diverse, the presence of the flaA1/A2 gene in all strains examined facilitates single locus sequence typing of C. botulinum using the flagellin variable region.

  19. Measuring semantic similarities by combining gene ontology annotations and gene co-function networks

    DOE PAGES

    Peng, Jiajie; Uygun, Sahra; Kim, Taehyong; ...

    2015-02-14

    Background: Gene Ontology (GO) has been used widely to study functional relationships between genes. The current semantic similarity measures rely only on GO annotations and GO structure. This limits the power of GO-based similarity because of the limited proportion of genes that are annotated to GO in most organisms. Results: We introduce a novel approach called NETSIM (network-based similarity measure) that incorporates information from gene co-function networks in addition to using the GO structure and annotations. Using metabolic reaction maps of yeast, Arabidopsis, and human, we demonstrate that NETSIM can improve the accuracy of GO term similarities. We also demonstratemore » that NETSIM works well even for genomes with sparser gene annotation data. We applied NETSIM on large Arabidopsis gene families such as cytochrome P450 monooxygenases to group the members functionally and show that this grouping could facilitate functional characterization of genes in these families. Conclusions: Using NETSIM as an example, we demonstrated that the performance of a semantic similarity measure could be significantly improved after incorporating genome-specific information. NETSIM incorporates both GO annotations and gene co-function network data as a priori knowledge in the model. Therefore, functional similarities of GO terms that are not explicitly encoded in GO but are relevant in a taxon-specific manner become measurable when GO annotations are limited.« less

  20. Genomics-Enabled Next-Generation Breeding Approaches for Developing System-Specific Drought Tolerant Hybrids in Maize

    PubMed Central

    Nepolean, Thirunavukkarsau; Kaul, Jyoti; Mukri, Ganapati; Mittal, Shikha

    2018-01-01

    Breeding science has immensely contributed to the global food security. Several varieties and hybrids in different food crops including maize have been released through conventional breeding. The ever growing population, decreasing agricultural land, lowering water table, changing climate, and other variables pose tremendous challenge to the researchers to improve the production and productivity of food crops. Drought is one of the major problems to sustain and improve the productivity of food crops including maize in tropical and subtropical production systems. With advent of novel genomics and breeding tools, the way of doing breeding has been tremendously changed in the last two decades. Drought tolerance is a combination of several component traits with a quantitative mode of inheritance. Rapid DNA and RNA sequencing tools and high-throughput SNP genotyping techniques, trait mapping, functional characterization, genomic selection, rapid generation advancement, and other tools are now available to understand the genetics of drought tolerance and to accelerate the breeding cycle. Informatics play complementary role by managing the big-data generated from the large-scale genomics and breeding experiments. Genome editing is the latest technique to alter specific genes to improve the trait expression. Integration of novel genomics, next-generation breeding, and informatics tools will accelerate the stress breeding process and increase the genetic gain under different production systems. PMID:29696027

  1. Familial aggregation of focal seizure semiology in the Epilepsy Phenome/Genome Project.

    PubMed

    Tobochnik, Steven; Fahlstrom, Robyn; Shain, Catherine; Winawer, Melodie R

    2017-07-04

    To improve phenotype definition in genetic studies of epilepsy, we assessed the familial aggregation of focal seizure types and of specific seizure symptoms within the focal epilepsies in families from the Epilepsy Phenome/Genome Project. We studied 302 individuals with nonacquired focal epilepsy from 149 families. Familial aggregation was assessed by logistic regression analysis of relatives' traits (dependent variable) by probands' traits (independent variable), estimating the odds ratio for each symptom in a relative given presence vs absence of the symptom in the proband. In families containing multiple individuals with nonacquired focal epilepsy, we found significant evidence for familial aggregation of ictal motor, autonomic, psychic, and aphasic symptoms. Within these categories, ictal whole body posturing, diaphoresis, dyspnea, fear/anxiety, and déjà vu/jamais vu showed significant familial aggregation. Focal seizure type aggregated as well, including complex partial, simple partial, and secondarily generalized tonic-clonic seizures. Our results provide insight into genotype-phenotype correlation in the nonacquired focal epilepsies and a framework for identifying subgroups of patients likely to share susceptibility genes. © 2017 American Academy of Neurology.

  2. Risk Classification with an Adaptive Naive Bayes Kernel Machine Model.

    PubMed

    Minnier, Jessica; Yuan, Ming; Liu, Jun S; Cai, Tianxi

    2015-04-22

    Genetic studies of complex traits have uncovered only a small number of risk markers explaining a small fraction of heritability and adding little improvement to disease risk prediction. Standard single marker methods may lack power in selecting informative markers or estimating effects. Most existing methods also typically do not account for non-linearity. Identifying markers with weak signals and estimating their joint effects among many non-informative markers remains challenging. One potential approach is to group markers based on biological knowledge such as gene structure. If markers in a group tend to have similar effects, proper usage of the group structure could improve power and efficiency in estimation. We propose a two-stage method relating markers to disease risk by taking advantage of known gene-set structures. Imposing a naive bayes kernel machine (KM) model, we estimate gene-set specific risk models that relate each gene-set to the outcome in stage I. The KM framework efficiently models potentially non-linear effects of predictors without requiring explicit specification of functional forms. In stage II, we aggregate information across gene-sets via a regularization procedure. Estimation and computational efficiency is further improved with kernel principle component analysis. Asymptotic results for model estimation and gene set selection are derived and numerical studies suggest that the proposed procedure could outperform existing procedures for constructing genetic risk models.

  3. PreCisIon: PREdiction of CIS-regulatory elements improved by gene's positION.

    PubMed

    Elati, Mohamed; Nicolle, Rémy; Junier, Ivan; Fernández, David; Fekih, Rim; Font, Julio; Képès, François

    2013-02-01

    Conventional approaches to predict transcriptional regulatory interactions usually rely on the definition of a shared motif sequence on the target genes of a transcription factor (TF). These efforts have been frustrated by the limited availability and accuracy of TF binding site motifs, usually represented as position-specific scoring matrices, which may match large numbers of sites and produce an unreliable list of target genes. To improve the prediction of binding sites, we propose to additionally use the unrelated knowledge of the genome layout. Indeed, it has been shown that co-regulated genes tend to be either neighbors or periodically spaced along the whole chromosome. This study demonstrates that respective gene positioning carries significant information. This novel type of information is combined with traditional sequence information by a machine learning algorithm called PreCisIon. To optimize this combination, PreCisIon builds a strong gene target classifier by adaptively combining weak classifiers based on either local binding sequence or global gene position. This strategy generically paves the way to the optimized incorporation of any future advances in gene target prediction based on local sequence, genome layout or on novel criteria. With the current state of the art, PreCisIon consistently improves methods based on sequence information only. This is shown by implementing a cross-validation analysis of the 20 major TFs from two phylogenetically remote model organisms. For Bacillus subtilis and Escherichia coli, respectively, PreCisIon achieves on average an area under the receiver operating characteristic curve of 70 and 60%, a sensitivity of 80 and 70% and a specificity of 60 and 56%. The newly predicted gene targets are demonstrated to be functionally consistent with previously known targets, as assessed by analysis of Gene Ontology enrichment or of the relevant literature and databases.

  4. Oxytocin and Opioid Receptor Gene Polymorphisms Associated with Greeting Behavior in Dogs.

    PubMed

    Kubinyi, Enikő; Bence, Melinda; Koller, Dora; Wan, Michele; Pergel, Eniko; Ronai, Zsolt; Sasvari-Szekely, Maria; Miklósi, Ádám

    2017-01-01

    Meeting humans is an everyday experience for most companion dogs, and their behavior in these situations and its genetic background is of major interest. Previous research in our laboratory reported that in German shepherd dogs the lack of G allele, and in Border collies the lack of A allele, of the oxytocin receptor gene (OXTR) 19208A/G single nucleotide polymorphism (SNP) was linked to increased friendliness, which suggests that although broad traits are affected by genetic variability, the specific links between alleles and behavioral variables might be breed-specific. In the current study, we found that Siberian huskies with the A allele approached a friendly unfamiliar woman less frequently in a greeting test, which indicates that certain polymorphisms are related to human directed behavior, but that the relationship patterns between polymorphisms and behavioral phenotypes differ between populations. This finding was further supported by our next investigation. According to primate studies, endogenous opioid peptide (e.g., endorphins) receptor genes have also been implicated in social relationships. Therefore, we examined the rs21912990 of the OPRM1 gene. Firstly, we found that the allele frequencies of Siberian huskies and gray wolves were similar, but differed from that of Border collies and German shepherd dogs, which might reflect their genetic relationship. Secondly, we detected significant associations between the OPRM1 SNP and greeting behavior among German shepherd dogs and a trend in Border collies, but we could not detect an association in Siberian huskies. Although our results with OXTR and OPRM1 gene variants should be regarded as preliminary due to the relatively low sample size, they suggest that (1) OXTR and OPRM1 gene variants in dogs affect human-directed social behavior and (2) their effects differ between breeds.

  5. Oxytocin and Opioid Receptor Gene Polymorphisms Associated with Greeting Behavior in Dogs

    PubMed Central

    Kubinyi, Enikő; Bence, Melinda; Koller, Dora; Wan, Michele; Pergel, Eniko; Ronai, Zsolt; Sasvari-Szekely, Maria; Miklósi, Ádám

    2017-01-01

    Meeting humans is an everyday experience for most companion dogs, and their behavior in these situations and its genetic background is of major interest. Previous research in our laboratory reported that in German shepherd dogs the lack of G allele, and in Border collies the lack of A allele, of the oxytocin receptor gene (OXTR) 19208A/G single nucleotide polymorphism (SNP) was linked to increased friendliness, which suggests that although broad traits are affected by genetic variability, the specific links between alleles and behavioral variables might be breed-specific. In the current study, we found that Siberian huskies with the A allele approached a friendly unfamiliar woman less frequently in a greeting test, which indicates that certain polymorphisms are related to human directed behavior, but that the relationship patterns between polymorphisms and behavioral phenotypes differ between populations. This finding was further supported by our next investigation. According to primate studies, endogenous opioid peptide (e.g., endorphins) receptor genes have also been implicated in social relationships. Therefore, we examined the rs21912990 of the OPRM1 gene. Firstly, we found that the allele frequencies of Siberian huskies and gray wolves were similar, but differed from that of Border collies and German shepherd dogs, which might reflect their genetic relationship. Secondly, we detected significant associations between the OPRM1 SNP and greeting behavior among German shepherd dogs and a trend in Border collies, but we could not detect an association in Siberian huskies. Although our results with OXTR and OPRM1 gene variants should be regarded as preliminary due to the relatively low sample size, they suggest that (1) OXTR and OPRM1 gene variants in dogs affect human-directed social behavior and (2) their effects differ between breeds. PMID:28936190

  6. The phosphatidylinositol synthase gene (GhPIS) contributes to longer, stronger, and finer fibers in cotton.

    PubMed

    Long, Qin; Yue, Fang; Liu, Ruochen; Song, Shuiqing; Li, Xianbi; Ding, Bo; Yan, Xingying; Pei, Yan

    2018-05-11

    Cotton fibers are the most important natural raw material used in textile industries world-wide. Fiber length, strength, and fineness are the three major traits which determine the quality and economic value of cotton. It is known that exogenous application of phosphatidylinositols (PtdIns), important structural phospholipids, can promote cotton fiber elongation. Here, we sought to increase the in planta production of PtdIns to improve fiber traits. Transgenic cotton plants were generated in which the expression of a cotton phosphatidylinositol synthase gene (i.e., GhPIS) was controlled by the fiber-specific SCFP promoter element, resulting in the specific up-regulation of GhPIS during cotton fiber development. We demonstrate that PtdIns content was significantly enhanced in transgenic cotton fibers and the elevated level of PtdIns stimulated the expression of genes involved in PtdIns phosphorylation as well as promoting lignin/lignin-like phenolic biosynthesis. Fiber length, strength and fineness were also improved in the transgenic plants as compared to the wild-type cotton, with no loss in overall fiber yield. Our data indicate that fiber-specific up-regulation of PtdIns synthesis is a promising strategy for cotton fiber quality improvement.

  7. Lessons learned from the dog genome.

    PubMed

    Wayne, Robert K; Ostrander, Elaine A

    2007-11-01

    Extensive genetic resources and a high-quality genome sequence position the dog as an important model species for understanding genome evolution, population genetics and genes underlying complex phenotypic traits. Newly developed genomic resources have expanded our understanding of canine evolutionary history and dog origins. Domestication involved genetic contributions from multiple populations of gray wolves probably through backcrossing. More recently, the advent of controlled breeding practices has segregated genetic variability into distinct dog breeds that possess specific phenotypic traits. Consequently, genome-wide association and selective sweep scans now allow the discovery of genes underlying breed-specific characteristics. The dog is finally emerging as a novel resource for studying the genetic basis of complex traits, including behavior.

  8. Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation

    PubMed Central

    Faria, José P.; Davis, James J.; Edirisinghe, Janaka N.; Taylor, Ronald C.; Weisenhorn, Pamela; Olson, Robert D.; Stevens, Rick L.; Rocha, Miguel; Rocha, Isabel; Best, Aaron A.; DeJongh, Matthew; Tintle, Nathan L.; Parrello, Bruce; Overbeek, Ross; Henry, Christopher S.

    2016-01-01

    Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. An important step toward meeting the challenge of understanding gene function and regulation is the identification of sets of genes that are always co-expressed. These gene sets, Atomic Regulons (ARs), represent fundamental units of function within a cell and could be used to associate genes of unknown function with cellular processes and to enable rational genetic engineering of cellular systems. Here, we describe an approach for inferring ARs that leverages large-scale expression data sets, gene context, and functional relationships among genes. We computed ARs for Escherichia coli based on 907 gene expression experiments and compared our results with gene clusters produced by two prevalent data-driven methods: Hierarchical clustering and k-means clustering. We compared ARs and purely data-driven gene clusters to the curated set of regulatory interactions for E. coli found in RegulonDB, showing that ARs are more consistent with gold standard regulons than are data-driven gene clusters. We further examined the consistency of ARs and data-driven gene clusters in the context of gene interactions predicted by Context Likelihood of Relatedness (CLR) analysis, finding that the ARs show better agreement with CLR predicted interactions. We determined the impact of increasing amounts of expression data on AR construction and find that while more data improve ARs, it is not necessary to use the full set of gene expression experiments available for E. coli to produce high quality ARs. In order to explore the conservation of co-regulated gene sets across different organisms, we computed ARs for Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus, each of which represents increasing degrees of phylogenetic distance from E. coli. Comparison of the organism-specific ARs showed that the consistency of AR gene membership correlates with phylogenetic distance, but there is clear variability in the regulatory networks of closely related organisms. As large scale expression data sets become increasingly common for model and non-model organisms, comparative analyses of atomic regulons will provide valuable insights into fundamental regulatory modules used across the bacterial domain. PMID:27933038

  9. RNA Structure Design Improves Activity and Specificity of trans-Splicing-Triggered Cell Death in a Suicide Gene Therapy Approach.

    PubMed

    Poddar, Sushmita; Loh, Pei She; Ooi, Zi Hao; Osman, Farhana; Eul, Joachim; Patzel, Volker

    2018-06-01

    Spliceosome-mediated RNA trans-splicing enables correction or labeling of pre-mRNA, but therapeutic applications are hampered by issues related to the activity and target specificity of trans-splicing RNA (tsRNA). We employed computational RNA structure design to improve both on-target activity and specificity of tsRNA in a herpes simplex virus thymidine kinase/ganciclovir suicide gene therapy approach targeting alpha fetoprotein (AFP), a marker of hepatocellular carcinoma (HCC) or human papillomavirus type 16 (HPV-16) pre-mRNA. While unstructured, mismatched target binding domains significantly improved 3' exon replacement (3'ER), 5' exon replacement (5'ER) correlated with the thermodynamic stability of the tsRNA 3' end. Alternative on-target trans-splicing was found to be a prevalent event. The specificity of trans-splicing with the intended target splice site was improved 10-fold by designing tsRNA that harbors secondary target binding domains shielding alternative on-target and blinding off-target splicing events. Such rationally designed suicide RNAs efficiently triggered death of HPV-16-transduced or hepatoblastoma-derived human tissue culture cells without evidence for off-target cell killing. Highest cell death activities were observed with novel dual-targeting tsRNAs programmed for trans-splicing toward AFP and a second HCC pre-mRNA biomarker. Our observations suggest trans-splicing represents a promising approach to suicide gene therapy. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.

  10. Arm-specific dynamics of chromosome evolution in malaria mosquitoes

    PubMed Central

    2011-01-01

    Background The malaria mosquito species of subgenus Cellia have rich inversion polymorphisms that correlate with environmental variables. Polymorphic inversions tend to cluster on the chromosomal arms 2R and 2L but not on X, 3R and 3L in Anopheles gambiae and homologous arms in other species. However, it is unknown whether polymorphic inversions on homologous chromosomal arms of distantly related species from subgenus Cellia nonrandomly share similar sets of genes. It is also unclear if the evolutionary breakage of inversion-poor chromosomal arms is under constraints. Results To gain a better understanding of the arm-specific differences in the rates of genome rearrangements, we compared gene orders and established syntenic relationships among Anopheles gambiae, Anopheles funestus, and Anopheles stephensi. We provided evidence that polymorphic inversions on the 2R arms in these three species nonrandomly captured similar sets of genes. This nonrandom distribution of genes was not only a result of preservation of ancestral gene order but also an outcome of extensive reshuffling of gene orders that created new combinations of homologous genes within independently originated polymorphic inversions. The statistical analysis of distribution of conserved gene orders demonstrated that the autosomal arms differ in their tolerance to generating evolutionary breakpoints. The fastest evolving 2R autosomal arm was enriched with gene blocks conserved between only a pair of species. In contrast, all identified syntenic blocks were preserved on the slowly evolving 3R arm of An. gambiae and on the homologous arms of An. funestus and An. stephensi. Conclusions Our results suggest that natural selection favors specific gene combinations within polymorphic inversions when distant species are exposed to similar environmental pressures. This knowledge could be useful for the discovery of genes responsible for an association of inversion polymorphisms with phenotypic variations in multiple species. Our data support the chromosomal arm specificity in rates of gene order disruption during mosquito evolution. We conclude that the distribution of breakpoint regions is evolutionary conserved on slowly evolving arms and tends to be lineage-specific on rapidly evolving arms. PMID:21473772

  11. Epigenetic hereditary transcription profiles III, evidence for an epigenetic network resulting in gender, tissue and age-specific variation in overall transcription

    PubMed Central

    Simons, Johannes WIM

    2009-01-01

    Background We have previously shown that deviations from the average transcription profile of a group of functionally related genes are not only heritable, but also demonstrate specific patterns associated with age, gender and differentiation, thereby implicating genome-wide nuclear programming as the cause. To determine whether these results could be reproduced, a different micro-array database (obtained from two types of muscle tissue, derived from 81 human donors aged between 16 to 89 years) was studied. Results This new database also revealed the existence of age, gender and tissue-specific features in a small group of functionally related genes. In order to further analyze this phenomenon, a method was developed for quantifying the contribution of different factors to the variability in gene expression, and for generating a database limited to residual values reflecting constitutional differences between individuals. These constitutional differences, presumably epigenetic in origin, contribute to about 50% of the observed residual variance which is connected with a network of interrelated changes in gene expression with some genes displaying a decrease or increase in residual variation with age. Conclusion Epigenetic variation in gene expression without a clear concomitant relation to gene function appears to be a widespread phenomenon. This variation is connected with interactions between genes, is gender and tissue specific and is related to cellular aging. This finding, together with the method developed for analysis, might contribute to the elucidation of the role of nuclear programming in differentiation, aging and carcinogenesis Reviewers This article was reviewed by Thiago M. Venancio (nominated by Aravind Iyer), Hua Li (nominated by Arcady Mushegian) and Arcady Mushegian and J.P.de Magelhaes (nominated by G. Church). PMID:19796384

  12. Lack of specific alleles for the bovine chemokine (C-X-C) receptor type 4 (CXCR4) gene in West African cattle questions its role as a candidate for trypanotolerance.

    PubMed

    Álvarez, Isabel; Pérez-Pardal, Lucía; Traoré, Amadou; Fernández, Iván; Goyache, Félix

    2016-08-01

    A panel of 81 Asian, African and European cattle (Bos taurus and B. indicus) was analysed for the whole sequence of the CXCR4 gene (3844bp), a strong candidate for cattle trypanotolerance. Thirty-one polymorphic sites identified gave 31 different haplotypes. Neutrality tests rejected the hypothesis of either positive or purifying selection. Bayesian phylogenetic tree showed differentiation of haplotypes into two clades gathering genetic variability predating domestication. Related with clades definition, linkage disequilibrium analyses suggested the existence of one only linkage block on the CXCR4 gene. Two tag SNPs identified on exon 2 captured 50% of variability. Whatever the analysis carried out, no clear separation between cattle groups was identified. Most haplotypes identified in West African taurine cattle were also found in European cattle and in Asian and West African zebu. West African taurine samples did not carry unique variants on the CXCR4 gene sequence. The current analysis failed in identifying a causal mutation on the CXCR4 gene underlying a previously reported QTL for cattle trypanotolerance on BTA2. Copyright © 2016 Elsevier B.V. All rights reserved.

  13. Structure and genetic variability of envelope glycoproteins of two antigenic variants of caprine arthritis-encephalitis lentivirus.

    PubMed

    Knowles, D P; Cheevers, W P; McGuire, T C; Brassfield, A L; Harwood, W G; Stem, T A

    1991-11-01

    To define the structure of the caprine arthritis-encephalitis virus (CAEV) env gene and characterize genetic changes which occur during antigenic variation, we sequenced the env genes of CAEV-63 and CAEV-Co, two antigenic variants of CAEV defined by serum neutralization. The deduced primary translation product of the CAEV env gene consists of a 60- to 80-amino-acid signal peptide followed by an amino-terminal surface protein (SU) and a carboxy-terminal transmembrane protein (TM) separated by an Arg-Lys-Lys-Arg cleavage site. The signal peptide cleavage site was verified by amino-terminal amino acid sequencing of native CAEV-63 SU. In addition, immunoprecipitation of [35S]methionine-labeled CAEV-63 proteins by sera from goats immunized with recombinant vaccinia virus expressing the CAEV-63 env gene confirmed that antibodies induced by env-encoded recombinant proteins react specifically with native virion SU and TM. The env genes of CAEV-63 and CAEV-Co encode 28 conserved cysteines and 25 conserved potential N-linked glycosylation sites. Nucleotide sequence variability results in 62 amino acid changes and one deletion within the SU and 34 amino acid changes within the TM.

  14. Structure and genetic variability of envelope glycoproteins of two antigenic variants of caprine arthritis-encephalitis lentivirus.

    PubMed Central

    Knowles, D P; Cheevers, W P; McGuire, T C; Brassfield, A L; Harwood, W G; Stem, T A

    1991-01-01

    To define the structure of the caprine arthritis-encephalitis virus (CAEV) env gene and characterize genetic changes which occur during antigenic variation, we sequenced the env genes of CAEV-63 and CAEV-Co, two antigenic variants of CAEV defined by serum neutralization. The deduced primary translation product of the CAEV env gene consists of a 60- to 80-amino-acid signal peptide followed by an amino-terminal surface protein (SU) and a carboxy-terminal transmembrane protein (TM) separated by an Arg-Lys-Lys-Arg cleavage site. The signal peptide cleavage site was verified by amino-terminal amino acid sequencing of native CAEV-63 SU. In addition, immunoprecipitation of [35S]methionine-labeled CAEV-63 proteins by sera from goats immunized with recombinant vaccinia virus expressing the CAEV-63 env gene confirmed that antibodies induced by env-encoded recombinant proteins react specifically with native virion SU and TM. The env genes of CAEV-63 and CAEV-Co encode 28 conserved cysteines and 25 conserved potential N-linked glycosylation sites. Nucleotide sequence variability results in 62 amino acid changes and one deletion within the SU and 34 amino acid changes within the TM. Images PMID:1656067

  15. Demonstration of mRNA editing and localization of guide RNA genes in kinetoplast-mitochondria of the plant trypanosomatid Phytomonas serpens.

    PubMed

    Maslov, D A; Hollar, L; Haghighat, P; Nawathean, P

    1998-06-01

    Maxicircle molecules of kDNA in several isolates of Phytomonas were detected by hybridization with the 12S rRNA gene probe from Leishmania tarentolae. The estimated size of maxicircles is isolate-specific and varies from 27 to 36 kb. Fully edited and polyadenylated mRNA for kinetoplast-encoded ribosomal protein S12 (RPS12) was found in the steady-state kinetoplast RNA isolated from Phytomonas serpens strain 1G. Two minicircles (1.45 kb) from this strain were also sequenced. Each minicircle contains two 120 bp conserved regions positioned 180 degrees apart, a region enriched with G and T bases and a variable region. One minicircle encodes a gRNA for the first block of editing of RPSl2 mRNA, and the other encodes a gRNA with unknown function. A gRNA gene for the second block of RPSl2 was found on a minicircle sequenced previously. On each minicircle, a gRNA gene is located in the variable region in a similar position and orientation with respect to the conserved regions.

  16. Transcriptomics of cortical gray matter thickness decline during normal aging

    PubMed Central

    Kochunov, P; Charlesworth, J; Winkler, A; Hong, LE; Nichols, T; Curran, JE; Sprooten, E; Jahanshad, N; Thompson, PM; Johnson, MP; Kent, JW; Landman, BA; Mitchell, B; Cole, SA; Dyer, TD; Moses, EK; Goring, HHH; Almasy, L; Duggirala, R; Olvera, RL; Glahn, DC; Blangero, J

    2013-01-01

    Introduction We performed a whole-transcriptome correlation analysis, followed by the pathway enrichment and testing of innate immune response pathways analyses to evaluate the hypothesis that transcriptional activity can predict cortical gray matter thickness (GMT) variability during normal cerebral aging Methods Transcriptome and GMT data were availabe for 379 individuals (age range=28–85) community-dwelling members of large extended Mexican-American families. Collection of transcriptome data preceded that of neuroimaging data by 17 years. Genome-wide gene transcriptome data consisted of 20,413 heritable lymphocytes-based transcripts. GMT measurements were performed from high-resolution (isotropic 800µm) T1-weighted MRI. Transcriptome-wide and pathway enrichment analysis was used to classify genes correlated with GMT. Transcripts for sixty genes from seven innate immune pathways were tested as specific predictors of GMT variability. Results Transcripts for eight genes (IGFBP3, LRRN3, CRIP2, SCD, IDS, TCF4, GATA3, HN1) passed the transcriptome-wide significance threshold. Four orthogonal factors extracted from this set predicted 31.9% of the variability in the whole-brain and between 23.4 and 35% of regional GMT measurements. Pathway enrichment analysis identified six functional categories including cellular proliferation, aggregation, differentiation, viral infection, and metabolism. The integrin signaling pathway was significantly (p<10−6) enriched with GMT. Finally, three innate immune pathways (complement signaling, toll-receptors and scavenger and immunoglobulins) were significantly associated with GMT. Conclusion Expression activity for the genes that regulate cellular proliferation, adhesion, differentiation and inflammation can explain a significant proportion of individual variability in cortical GMT. Our findings suggest that normal cerebral aging is the product of a progressive decline in regenerative capacity and increased neuroinflammation. PMID:23707588

  17. Transcriptomics of cortical gray matter thickness decline during normal aging.

    PubMed

    Kochunov, P; Charlesworth, J; Winkler, A; Hong, L E; Nichols, T E; Curran, J E; Sprooten, E; Jahanshad, N; Thompson, P M; Johnson, M P; Kent, J W; Landman, B A; Mitchell, B; Cole, S A; Dyer, T D; Moses, E K; Goring, H H H; Almasy, L; Duggirala, R; Olvera, R L; Glahn, D C; Blangero, J

    2013-11-15

    We performed a whole-transcriptome correlation analysis, followed by the pathway enrichment and testing of innate immune response pathway analyses to evaluate the hypothesis that transcriptional activity can predict cortical gray matter thickness (GMT) variability during normal cerebral aging. Transcriptome and GMT data were available for 379 individuals (age range=28-85) community-dwelling members of large extended Mexican American families. Collection of transcriptome data preceded that of neuroimaging data by 17 years. Genome-wide gene transcriptome data consisted of 20,413 heritable lymphocytes-based transcripts. GMT measurements were performed from high-resolution (isotropic 800 μm) T1-weighted MRI. Transcriptome-wide and pathway enrichment analysis was used to classify genes correlated with GMT. Transcripts for sixty genes from seven innate immune pathways were tested as specific predictors of GMT variability. Transcripts for eight genes (IGFBP3, LRRN3, CRIP2, SCD, IDS, TCF4, GATA3, and HN1) passed the transcriptome-wide significance threshold. Four orthogonal factors extracted from this set predicted 31.9% of the variability in the whole-brain and between 23.4 and 35% of regional GMT measurements. Pathway enrichment analysis identified six functional categories including cellular proliferation, aggregation, differentiation, viral infection, and metabolism. The integrin signaling pathway was significantly (p<10(-6)) enriched with GMT. Finally, three innate immune pathways (complement signaling, toll-receptors and scavenger and immunoglobulins) were significantly associated with GMT. Expression activity for the genes that regulate cellular proliferation, adhesion, differentiation and inflammation can explain a significant proportion of individual variability in cortical GMT. Our findings suggest that normal cerebral aging is the product of a progressive decline in regenerative capacity and increased neuroinflammation. Copyright © 2013 Elsevier Inc. All rights reserved.

  18. Molecular variability and evolution of the pectate lyase (pel-2) parasitism gene in cyst nematodes parasitizing different solanaceous plants.

    PubMed

    Geric Stare, Barbara; Fouville, Didier; Širca, Saša; Gallot, Aurore; Urek, Gregor; Grenier, Eric

    2011-02-01

    While pectate lyases are major parasitism factors in plant-parasitic nematodes, there is little information on the variability of these genes within species and their utility as pathotype or host range molecular markers. We have analysed polymorphisms of pectate lyase 2 (pel-2) gene, which degrades the unesterified polygalacturonate (pectate) of the host cell-wall, in the genus Globodera. Molecular variability of the pel-2 gene and the predicted protein was evaluated in populations of G. rostochiensis, G. pallida, G. "mexicana" and G. tabacum. Seventy eight pel-2 sequences were obtained and aligned. Point mutations were observed at 373 positions, 57% of these affect the coding part of the gene and produce 129 aa replacements. The observed polymorphism does not correlate either to the pathotypes proposed in potato cyst nematodes (PCN) or the subspecies described in tobacco cyst nematodes. The trees reveal a topology different from the admitted species topology as G. rostochiensis and G. pallida sequences are more similar to each other than to G. tabacum. Species-specific sites, potentially applicable for identification, and sites distinguishing PCN from tobacco cyst nematodes, were identified. As both G. rostochiensis and G. pallida display the same host range, but distinct from G. tabacum, which cannot parasitize potato plants, it is tempting to speculate that pel-2 genes polymorphism may be implicated in this adaptation, a view supported by the fact that no active pectate lyase 2 was found in G. "mexicana", a close relative of G. pallida that is unable to develop on cultivated potato varieties.

  19. Toward an Integration of Cognitive and Genetic Models of Risk for Depression

    PubMed Central

    Gibb, Brandon E.; Beevers, Christopher G.; McGeary, John E.

    2012-01-01

    There is growing interest in integrating cognitive and genetic models of depression risk. We review two ways in which these models can be meaningfully integrated. First, information-processing biases may represent intermediate phenotypes for specific genetic influences. These genetic influences may represent main effects on specific cognitive processes or may moderate the impact of environmental influences on information-processing biases. Second, cognitive and genetic influences may combine to increase reactivity to environmental stressors, increasing risk for depression in a gene × cognition × environment model of risk. There is now growing support for both of these ways of integrating cognitive and genetic models of depression risk. Specifically, there is support for genetic influences on information-processing biases, particularly the link between 5-HTTLPR and attentional biases, from both genetic association and gene × environment (G × E) studies. There is also initial support for gene × cognition × environment models of risk in which specific genetic influences contribute to increased reactivity to environmental influences. We review this research and discuss important areas of future research, particularly the need for larger samples that allow for a broader examination of genetic and epigenetic influences as well as the combined influence of variability across a number of genes. PMID:22920216

  20. [New perspectives on molecular and genic therapies in Down syndrome].

    PubMed

    Delabar, Jean Maurice

    2010-04-01

    Trisomy 21 was first described as a syndrome in the middle of the nineteenth century and associated to a chromosomic anomaly one hundred years later: the most salient feature of this syndrome is a mental retardation of variable intensity. Molecular mapping and DNA sequencing have allowed identifying the gene content of chromosome 21. Molecular quantitative analyses indicated that trisomy is inducing an overexpression for a large part of the triplicated genes and deregulates also pathways involving non HSA21 genes. Together with the physiological description of murine models overexpressing orthologous genes, these data have allowed to elaborate hypotheses on the cause of cognitive impairment. From these hypotheses and using murine models it is now possible to assess the efficiency of various therapeutic strategies. This paper reviews these new perspectives starting from the strategies targeting the level of HSA21 RNAs or HSA21 proteins; then it describes methods targeting activities either of proteins involved in cell cycle pathways or of proteins controlling the synaptic plasticity. It is promising that strategies targeting specific genes or specific pathways are already giving positive results.

  1. Expression of Selenoprotein Genes Is Affected by Obesity of Pigs Fed a High-Fat Diet123

    PubMed Central

    Zhao, Hua; Li, Ke; Tang, Jia-Yong; Zhou, Ji-Chang; Wang, Kang-Ning; Xia, Xin-Jie; Lei, Xin Gen

    2015-01-01

    Background: Relations of the 25 mammalian selenoprotein genes with obesity and the associated inflammation remain unclear. Objective: This study explored impacts of high-fat diet-induced obesity on inflammation and expressions of selenoprotein and obesity-related genes in 10 tissues of pigs. Methods: Plasma and 10 tissues were collected from pigs (n = 10) fed a corn-soy–based control diet or that diet containing 3–7% lard from weanling to finishing (180 d). Plasma concentrations (n = 8) of cytokines and thyroid hormones and tissue mRNA abundance (n = 4) of 25 selenoprotein genes and 16 obesity-related genes were compared between the pigs fed the control and high-fat diets. Stepwise regression was applied to analyze correlations among all these measures, including the previously reported body physical and plasma biochemical variables. Results: The high-fat diet elevated (P < 0.05) plasma concentrations of tumor necrosis factor α, interleukin-6, leptin, and leptin receptor by 29–42% and affected (P < 0.05–0.1) tissue mRNA levels of the selenoprotein and obesity-related genes in 3 patterns. Specifically, the high-fat diet up-regulated 12 selenoprotein genes in 6 tissues, down-regulated 13 selenoprotein genes in 7 tissues, and exerted no effect on 5 genes in any tissue. Body weights and plasma triglyceride concentrations of pigs showed the strongest regressions to tissue mRNA abundances of selenoprotein and obesity-related genes. Among the selenoprotein genes, selenoprotein V and I were ranked as the strongest independent variables for the regression of phenotypic and plasma measures. Meanwhile, agouti signaling protein, adiponectin, and resistin genes represented the strongest independent variables of the obesity-related genes for the regression of tissue selenoprotein mRNA. Conclusions: The high-fat diet induced inflammation in pigs and affected their gene expression of selenoproteins associated with thioredoxin and oxidoreductase systems, local tissue thyroid hormone activity, endoplasmic reticulum protein degradation, and phosphorylation of lipids. This porcine model may be used to study interactive mechanisms between excess fat intake and selenoprotein function. PMID:25972525

  2. Expression of Selenoprotein Genes Is Affected by Obesity of Pigs Fed a High-Fat Diet.

    PubMed

    Zhao, Hua; Li, Ke; Tang, Jia-Yong; Zhou, Ji-Chang; Wang, Kang-Ning; Xia, Xin-Jie; Lei, Xin Gen

    2015-07-01

    Relations of the 25 mammalian selenoprotein genes with obesity and the associated inflammation remain unclear. This study explored impacts of high-fat diet-induced obesity on inflammation and expressions of selenoprotein and obesity-related genes in 10 tissues of pigs. Plasma and 10 tissues were collected from pigs (n = 10) fed a corn-soy-based control diet or that diet containing 3-7% lard from weanling to finishing (180 d). Plasma concentrations (n = 8) of cytokines and thyroid hormones and tissue mRNA abundance (n = 4) of 25 selenoprotein genes and 16 obesity-related genes were compared between the pigs fed the control and high-fat diets. Stepwise regression was applied to analyze correlations among all these measures, including the previously reported body physical and plasma biochemical variables. The high-fat diet elevated (P < 0.05) plasma concentrations of tumor necrosis factor α, interleukin-6, leptin, and leptin receptor by 29-42% and affected (P < 0.05-0.1) tissue mRNA levels of the selenoprotein and obesity-related genes in 3 patterns. Specifically, the high-fat diet up-regulated 12 selenoprotein genes in 6 tissues, down-regulated 13 selenoprotein genes in 7 tissues, and exerted no effect on 5 genes in any tissue. Body weights and plasma triglyceride concentrations of pigs showed the strongest regressions to tissue mRNA abundances of selenoprotein and obesity-related genes. Among the selenoprotein genes, selenoprotein V and I were ranked as the strongest independent variables for the regression of phenotypic and plasma measures. Meanwhile, agouti signaling protein, adiponectin, and resistin genes represented the strongest independent variables of the obesity-related genes for the regression of tissue selenoprotein mRNA. The high-fat diet induced inflammation in pigs and affected their gene expression of selenoproteins associated with thioredoxin and oxidoreductase systems, local tissue thyroid hormone activity, endoplasmic reticulum protein degradation, and phosphorylation of lipids. This porcine model may be used to study interactive mechanisms between excess fat intake and selenoprotein function. © 2015 American Society for Nutrition.

  3. Immuno-Navigator, a batch-corrected coexpression database, reveals cell type-specific gene networks in the immune system

    PubMed Central

    Vandenbon, Alexis; Dinh, Viet H.; Mikami, Norihisa; Kitagawa, Yohko; Teraguchi, Shunsuke; Ohkura, Naganari; Sakaguchi, Shimon

    2016-01-01

    High-throughput gene expression data are one of the primary resources for exploring complex intracellular dynamics in modern biology. The integration of large amounts of public data may allow us to examine general dynamical relationships between regulators and target genes. However, obstacles for such analyses are study-specific biases or batch effects in the original data. Here we present Immuno-Navigator, a batch-corrected gene expression and coexpression database for 24 cell types of the mouse immune system. We systematically removed batch effects from the underlying gene expression data and showed that this removal considerably improved the consistency between inferred correlations and prior knowledge. The data revealed widespread cell type-specific correlation of expression. Integrated analysis tools allow users to use this correlation of expression for the generation of hypotheses about biological networks and candidate regulators in specific cell types. We show several applications of Immuno-Navigator as examples. In one application we successfully predicted known regulators of importance in naturally occurring Treg cells from their expression correlation with a set of Treg-specific genes. For one high-scoring gene, integrin β8 (Itgb8), we confirmed an association between Itgb8 expression in forkhead box P3 (Foxp3)-positive T cells and Treg-specific epigenetic remodeling. Our results also suggest that the regulation of Treg-specific genes within Treg cells is relatively independent of Foxp3 expression, supporting recent results pointing to a Foxp3-independent component in the development of Treg cells. PMID:27078110

  4. Interspecific and intraspecific gene variability in a 1-Mb region containing the highest density of NBS-LRR genes found in the melon genome.

    PubMed

    González, Víctor M; Aventín, Núria; Centeno, Emilio; Puigdomènech, Pere

    2014-12-17

    Plant NBS-LRR -resistance genes tend to be found in clusters, which have been shown to be hot spots of genome variability. In melon, half of the 81 predicted NBS-LRR genes group in nine clusters, and a 1 Mb region on linkage group V contains the highest density of R-genes and presence/absence gene polymorphisms found in the melon genome. This region is known to contain the locus of Vat, an agronomically important gene that confers resistance to aphids. However, the presence of duplications makes the sequencing and annotation of R-gene clusters difficult, usually resulting in multi-gapped sequences with higher than average errors. A 1-Mb sequence that contains the largest NBS-LRR gene cluster found in melon was improved using a strategy that combines Illumina paired-end mapping and PCR-based gap closing. Unknown sequence was decreased by 70% while about 3,000 SNPs and small indels were corrected. As a result, the annotations of 18 of a total of 23 NBS-LRR genes found in this region were modified, including additional coding sequences, amino acid changes, correction of splicing boundaries, or fussion of ORFs in common transcription units. A phylogeny analysis of the R-genes and their comparison with syntenic sequences in other cucurbits point to a pattern of local gene amplifications since the diversification of cucurbits from other families, and through speciation within the family. A candidate Vat gene is proposed based on the sequence similarity between a reported Vat gene from a Korean melon cultivar and a sequence fragment previously absent in the unrefined sequence. A sequence refinement strategy allowed substantial improvement of a 1 Mb fragment of the melon genome and the re-annotation of the largest cluster of NBS-LRR gene homologues found in melon. Analysis of the cluster revealed that resistance genes have been produced by sequence duplication in adjacent genome locations since the divergence of cucurbits from other close families, and through the process of speciation within the family a candidate Vat gene was also identified using sequence previously unavailable, which demonstrates the advantages of genome assembly refinements when analyzing complex regions such as those containing clusters of highly similar genes.

  5. RNAi screening comes of age: improved techniques and complementary approaches

    PubMed Central

    Mohr, Stephanie E.; Smith, Jennifer A.; Shamu, Caroline E.; Neumüller, Ralph A.; Perrimon, Norbert

    2014-01-01

    Gene silencing through sequence-specific targeting of mRNAs by RNAi has enabled genome-wide functional screens in cultured cells and in vivo in model organisms. These screens have resulted in the identification of new cellular pathways and potential drug targets. Considerable progress has been made to improve the quality of RNAi screen data through the development of new experimental and bioinformatics approaches. The recent availability of genome-editing strategies, such as the CRISPR (clustered regularly interspaced short palindromic repeats)-Cas9 system, when combined with RNAi, could lead to further improvements in screen data quality and follow-up experiments, thus promoting our understanding of gene function and gene regulatory networks. PMID:25145850

  6. oPOSSUM: integrated tools for analysis of regulatory motif over-representation

    PubMed Central

    Ho Sui, Shannan J.; Fulton, Debra L.; Arenillas, David J.; Kwon, Andrew T.; Wasserman, Wyeth W.

    2007-01-01

    The identification of over-represented transcription factor binding sites from sets of co-expressed genes provides insights into the mechanisms of regulation for diverse biological contexts. oPOSSUM, an internet-based system for such studies of regulation, has been improved and expanded in this new release. New features include a worm-specific version for investigating binding sites conserved between Caenorhabditis elegans and C. briggsae, as well as a yeast-specific version for the analysis of co-expressed sets of Saccharomyces cerevisiae genes. The human and mouse applications feature improvements in ortholog mapping, sequence alignments and the delineation of multiple alternative promoters. oPOSSUM2, introduced for the analysis of over-represented combinations of motifs in human and mouse genes, has been integrated with the original oPOSSUM system. Analysis using user-defined background gene sets is now supported. The transcription factor binding site models have been updated to include new profiles from the JASPAR database. oPOSSUM is available at http://www.cisreg.ca/oPOSSUM/ PMID:17576675

  7. The role of transcriptome resilience in resistance of corals to bleaching.

    PubMed

    Seneca, Francois O; Palumbi, Stephen R

    2015-04-01

    Wild populations increasingly experience extreme conditions as climate change amplifies environmental variability. How individuals respond to environmental extremes determines the impact of climate change overall. The variability of response from individual to individual can represent the opportunity for natural selection to occur as a result of extreme conditions. Here, we experimentally replicated the natural exposure to extreme temperatures of the reef lagoon at Ofu Island (American Samoa), where corals can experience severe heat stress during midday low tide. We investigated the bleaching and transcriptome response of 20 Acropora hyacinthus colonies 5 and 20 h after exposure to control (29 °C) or heated (35 °C) conditions. We found a highly dynamic transcriptome response: 27% of the coral transcriptome was significantly regulated 1 h postheat exposure. Yet 15 h later, when heat-induced coral bleaching became apparent, only 12% of the transcriptome was differentially regulated. A large proportion of responsive genes at the first time point returned to control levels, others remained differentially expressed over time, while an entirely different subset of genes was successively regulated at the second time point. However, a noteworthy variability in gene expression was observed among individual coral colonies. Among the genes of which expression lingered over time, fast return to normal levels was associated with low bleaching. Colonies that maintained higher expression levels of these genes bleached severely. Return to normal levels of gene expression after stress has been termed transcriptome resilience, and in the case of some specific genes may signal the physiological health and response ability of individuals to environmental stress. © 2015 John Wiley & Sons Ltd.

  8. Modular organization of the white spruce (Picea glauca) transcriptome reveals functional organization and evolutionary signatures.

    PubMed

    Raherison, Elie S M; Giguère, Isabelle; Caron, Sébastien; Lamara, Mebarek; MacKay, John J

    2015-07-01

    Transcript profiling has shown the molecular bases of several biological processes in plants but few studies have developed an understanding of overall transcriptome variation. We investigated transcriptome structure in white spruce (Picea glauca), aiming to delineate its modular organization and associated functional and evolutionary attributes. Microarray analyses were used to: identify and functionally characterize groups of co-expressed genes; investigate expressional and functional diversity of vascular tissue preferential genes which were conserved among Picea species, and identify expression networks underlying wood formation. We classified 22 857 genes as variable (79%; 22 coexpression groups) or invariant (21%) by profiling across several vegetative tissues. Modular organization and complex transcriptome restructuring among vascular tissue preferential genes was revealed by their assignment to coexpression groups with partially overlapping profiles and partially distinct functions. Integrated analyses of tissue-based and temporally variable profiles identified secondary xylem gene networks, showed their remodelling over a growing season and identified PgNAC-7 (no apical meristerm (NAM), Arabidopsis transcription activation factor (ATAF) and cup-shaped cotyledon (CUC) transcription factor 007 in Picea glauca) as a major hub gene specific to earlywood formation. Reference profiling identified comprehensive, statistically robust coexpressed groups, revealing that modular organization underpins the evolutionary conservation of the transcriptome structure. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  9. Hybrid Nanomaterial Complexes for Advanced Phage-guided Gene Delivery

    PubMed Central

    Yata, Teerapong; Lee, Koon-Yang; Dharakul, Tararaj; Songsivilai, Sirirurg; Bismarck, Alexander; Mintz, Paul J; Hajitou, Amin

    2014-01-01

    Developing nanomaterials that are effective, safe, and selective for gene transfer applications is challenging. Bacteriophages (phage), viruses that infect bacteria only, have shown promise for targeted gene transfer applications. Unfortunately, limited progress has been achieved in improving their potential to overcome mammalian cellular barriers. We hypothesized that chemical modification of the bacteriophage capsid could be applied to improve targeted gene delivery by phage vectors into mammalian cells. Here, we introduce a novel hybrid system consisting of two classes of nanomaterial systems, cationic polymers and M13 bacteriophage virus particles genetically engineered to display a tumor-targeting ligand and carry a transgene cassette. We demonstrate that the phage complex with cationic polymers generates positively charged phage and large aggregates that show enhanced cell surface attachment, buffering capacity, and improved transgene expression while retaining cell type specificity. Moreover, phage/polymer complexes carrying a therapeutic gene achieve greater cancer cell killing than phage alone. This new class of hybrid nanomaterial platform can advance targeted gene delivery applications by bacteriophage. PMID:25118171

  10. Sequences of heavy and light chain variable regions from four bovine immunoglobulins.

    PubMed

    Armour, K L; Tempest, P R; Fawcett, P H; Fernie, M L; King, S I; White, P; Taylor, G; Harris, W J

    1994-12-01

    Oligodeoxyribonucleotide primers based on the 5' ends of bovine IgG1/2 and lambda constant (C) region genes, together with primers encoding conserved amino acids at the N-terminus of mature variable (V) regions from other species, have been used in cDNA and polymerase chain reactions (PCRs) to amplify heavy and light chain V region cDNA from bovine heterohybridomas. The amino acid sequences of VH and V lambda from four bovine immunoglobulins of different specificities are presented.

  11. Genetic variability of psychrotolerant Acidithiobacillus ferrivorans revealed by (meta)genomic analysis.

    PubMed

    González, Carolina; Yanquepe, María; Cardenas, Juan Pablo; Valdes, Jorge; Quatrini, Raquel; Holmes, David S; Dopson, Mark

    2014-11-01

    Acidophilic microorganisms inhabit low pH environments such as acid mine drainage that is generated when sulfide minerals are exposed to air. The genome sequence of the psychrotolerant Acidithiobacillus ferrivorans SS3 was compared to a metagenome from a low temperature acidic stream dominated by an A. ferrivorans-like strain. Stretches of genomic DNA characterized by few matches to the metagenome, termed 'metagenomic islands', encoded genes associated with metal efflux and pH homeostasis. The metagenomic islands were enriched in mobile elements such as phage proteins, transposases, integrases and in one case, predicted to be flanked by truncated tRNAs. Cus gene clusters predicted to be involved in copper efflux and further Cus-like RND systems were predicted to be located in metagenomic islands and therefore, constitute part of the flexible gene complement of the species. Phylogenetic analysis of Cus clusters showed both lineage specificity within the Acidithiobacillus genus as well as niche specificity associated with an acidic environment. The metagenomic islands also contained a predicted copper efflux P-type ATPase system and a polyphosphate kinase potentially involved in polyphosphate mediated copper resistance. This study identifies genetic variability of low temperature acidophiles that likely reflects metal resistance selective pressures in the copper rich environment. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.

  12. Self-focusing therapeutic gene delivery with intelligent gene vector swarms: intra-swarm signalling through receptor transgene expression in targeted cells.

    PubMed

    Tolmachov, Oleg E

    2015-01-01

    Gene delivery in vivo that is tightly focused on the intended target cells is essential to maximize the benefits of gene therapy and to reduce unwanted side-effects. Cell surface markers are immediately available for probing by therapeutic gene vectors and are often used to direct gene transfer with these vectors to specific target cell populations. However, it is not unusual for the choice of available extra-cellular markers to be too scarce to provide a reliable definition of the desired therapeutically relevant set of target cells. Therefore, interrogation of intra-cellular determinants of cell-specificity, such as tissue-specific transcription factors, can be vital in order to provide detailed cell-guiding information to gene vector particles. An important improvement in cell-specific gene delivery can be achieved through auto-buildup in vector homing efficiency using intelligent 'self-focusing' of swarms of vector particles on target cells. Vector self-focusing was previously suggested to rely on the release of diffusible chemo-attractants after a successful target-specific hit by 'scout' vector particles. I hypothesize that intelligent self-focusing behaviour of swarms of cell-targeted therapeutic gene vectors can be accomplished without the employment of difficult-to-use diffusible chemo-attractants, instead relying on the intra-swarm signalling through cells expressing a non-diffusible extra-cellular receptor for the gene vectors. In the proposed model, cell-guiding information is gathered by the 'scout' gene vector particles, which: (1) attach to a variety of cells via a weakly binding (low affinity) receptor; (2) successfully facilitate gene transfer into these cells; (3) query intra-cellular determinants of cell-specificity with their transgene expression control elements and (4) direct the cell-specific biosynthesis of a vector-encoded strongly binding (high affinity) cell-surface receptor. Free members of the vector swarm loaded with therapeutic cargo are then attracted to and internalized into the intended target cells via the expressed cognate strongly binding extra-cellular receptor, causing escalation of gene transfer into these cells and increasing the copy number of the therapeutic gene expression modules. Such self-focusing swarms of gene vectors can be either homogeneous, with 'scout' and 'therapeutic' members of the swarm being structurally identical, or, alternatively, heterogeneous (split), with 'scout' and 'therapeutic' members of the swarm being structurally specialized. It is hoped that the proposed self-focusing cell-targeted gene vector swarms with receptor-mediated intra-swarm signalling could be particularly effective in 'top-up' gene delivery scenarios, achieving high-level and sustained expression of therapeutic transgenes that are prone to shut-down through degradation and silencing. Crucially, in contrast to low-precision 'general location' vector guidance by diffusible chemo-attractants, ear-marking non-diffusible receptors can provide high-accuracy targeting of therapeutic vector particles to the specific cell, which has undergone a 'successful cell-specific hit' by a 'scout' vector particle. Opportunities for cell targeting could be expanded, since in the proposed model of self-focusing it could be possible to probe a broad selection of intra-cellular determinants of cell-specificity and not just to rely exclusively on extra-cellular markers of cell-specificity. By employing such self-focusing gene vectors for the improvement of cell-targeted delivery of therapeutic genes, e.g., in cancer therapy or gene addition therapy of recessive genetic diseases, it could be possible to broaden a leeway for the reduction of the vector load and, consequently, to minimize undesired vector cytotoxicity, immune reactions, and the risk of inadvertent genetic modification of germline cells in genetic treatment in vivo. Copyright © 2014 Elsevier B.V. All rights reserved.

  13. The road ahead: working towards effective clinical translation of myocardial gene therapies

    PubMed Central

    Katz, Michael G; Fargnoli, Anthony S; Williams, Richard D; Bridges, Charles R

    2014-01-01

    During the last two decades the fields of molecular and cellular cardiology, and more recently molecular cardiac surgery, have developed rapidly. The concept of delivering cDNA encoding a therapeutic gene to cardiomyocytes using a vector system with substantial cardiac tropism, allowing for long-term expression of a therapeutic protein, has moved from hypothesis to bench to clinical application. However, the clinical results to date are still disappointing. The ideal gene transfer method should be explored in clinically relevant animal models of heart disease to evaluate the relative roles of specific molecular pathways in disease pathogenesis, helping to validate the potential targets for therapeutic intervention. Successful clinical cardiovascular gene therapy also requires the use of nonimmunogenic cardiotropic vectors capable of expressing the requisite amount of therapeutic protein in vivo and in situ. Depending on the desired application either regional or global myocardial gene delivery is required. Cardiac-specific delivery techniques incorporating mapping technologies for regional delivery and highly efficient methodologies for global delivery should improve the precision and specificity of gene transfer to the areas of interest and minimize collateral organ gene expression. PMID:24341816

  14. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kolker, Eugene

    Our project focused primarily on analysis of different types of data produced by global high-throughput technologies, data integration of gene annotation, and gene and protein expression information, as well as on getting a better functional annotation of Shewanella genes. Specifically, four of our numerous major activities and achievements include the development of: statistical models for identification and expression proteomics, superior to currently available approaches (including our own earlier ones); approaches to improve gene annotations on the whole-organism scale; standards for annotation, transcriptomics and proteomics approaches; and generalized approaches for data integration of gene annotation, gene and protein expression information.

  15. Improving accuracy of DNA diet estimates using food tissue control materials and an evaluation of proxies for digestion bias.

    PubMed

    Thomas, Austen C; Jarman, Simon N; Haman, Katherine H; Trites, Andrew W; Deagle, Bruce E

    2014-08-01

    Ecologists are increasingly interested in quantifying consumer diets based on food DNA in dietary samples and high-throughput sequencing of marker genes. It is tempting to assume that food DNA sequence proportions recovered from diet samples are representative of consumer's diet proportions, despite the fact that captive feeding studies do not support that assumption. Here, we examine the idea of sequencing control materials of known composition along with dietary samples in order to correct for technical biases introduced during amplicon sequencing and biological biases such as variable gene copy number. Using the Ion Torrent PGM(©) , we sequenced prey DNA amplified from scats of captive harbour seals (Phoca vitulina) fed a constant diet including three fish species in known proportions. Alongside, we sequenced a prey tissue mix matching the seals' diet to generate tissue correction factors (TCFs). TCFs improved the diet estimates (based on sequence proportions) for all species and reduced the average estimate error from 28 ± 15% (uncorrected) to 14 ± 9% (TCF-corrected). The experimental design also allowed us to infer the magnitude of prey-specific digestion biases and calculate digestion correction factors (DCFs). The DCFs were compared with possible proxies for differential digestion (e.g. fish protein%, fish lipid%) revealing a strong relationship between the DCFs and percent lipid of the fish prey, suggesting prey-specific corrections based on lipid content would produce accurate diet estimates in this study system. These findings demonstrate the value of parallel sequencing of food tissue mixtures in diet studies and offer new directions for future research in quantitative DNA diet analysis. © 2013 John Wiley & Sons Ltd.

  16. Recent Advances in the Molecular Genetics of Familial Hypertrophic Cardiomyopathy in South Asian Descendants

    PubMed Central

    Kraker, Jessica; Viswanathan, Shiv Kumar; Knöll, Ralph; Sadayappan, Sakthivel

    2016-01-01

    The South Asian population, numbered at 1.8 billion, is estimated to comprise around 20% of the global population and 1% of the American population, and has one of the highest rates of cardiovascular disease. While South Asians show increased classical risk factors for developing heart failure, the role of population-specific genetic risk factors has not yet been examined for this group. Hypertrophic cardiomyopathy (HCM) is one of the major cardiac genetic disorders among South Asians, leading to contractile dysfunction, heart failure, and sudden cardiac death. This disease displays autosomal dominant inheritance, and it is associated with a large number of variants in both sarcomeric and non-sarcomeric proteins. The South Asians, a population with large ethnic diversity, potentially carries region-specific polymorphisms. There is high variability in disease penetrance and phenotypic expression of variants associated with HCM. Thus, extensive studies are required to decipher pathogenicity and the physiological mechanisms of these variants, as well as the contribution of modifier genes and environmental factors to disease phenotypes. Conducting genotype-phenotype correlation studies will lead to improved understanding of HCM and, consequently, improved treatment options for this high-risk population. The objective of this review is to report the history of cardiovascular disease and HCM in South Asians, present previously published pathogenic variants, and introduce current efforts to study HCM using induced pluripotent stem cell-derived cardiomyocytes, next-generation sequencing, and gene editing technologies. The authors ultimately hope that this review will stimulate further research, drive novel discoveries, and contribute to the development of personalized medicine with the aim of expanding therapeutic strategies for HCM. PMID:27840609

  17. An overview of techniques for linking high-dimensional molecular data to time-to-event endpoints by risk prediction models.

    PubMed

    Binder, Harald; Porzelius, Christine; Schumacher, Martin

    2011-03-01

    Analysis of molecular data promises identification of biomarkers for improving prognostic models, thus potentially enabling better patient management. For identifying such biomarkers, risk prediction models can be employed that link high-dimensional molecular covariate data to a clinical endpoint. In low-dimensional settings, a multitude of statistical techniques already exists for building such models, e.g. allowing for variable selection or for quantifying the added value of a new biomarker. We provide an overview of techniques for regularized estimation that transfer this toward high-dimensional settings, with a focus on models for time-to-event endpoints. Techniques for incorporating specific covariate structure are discussed, as well as techniques for dealing with more complex endpoints. Employing gene expression data from patients with diffuse large B-cell lymphoma, some typical modeling issues from low-dimensional settings are illustrated in a high-dimensional application. First, the performance of classical stepwise regression is compared to stage-wise regression, as implemented by a component-wise likelihood-based boosting approach. A second issues arises, when artificially transforming the response into a binary variable. The effects of the resulting loss of efficiency and potential bias in a high-dimensional setting are illustrated, and a link to competing risks models is provided. Finally, we discuss conditions for adequately quantifying the added value of high-dimensional gene expression measurements, both at the stage of model fitting and when performing evaluation. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  18. Unpredictable Variable Prenatal Stress Programs Expression of Genes Involved in Appetite Control and Energy Expenditure

    NASA Technical Reports Server (NTRS)

    Moyer, E. L.; Al-Shayeb, B.; Baer, L. A.; Ronca, A. E.

    2016-01-01

    Exposure to stress in the womb shapes neurobiological and physiological outcomes of offspring in later life, including body weight regulation and metabolic profiles. Our previous work utilizing a centrifugation-induced hyper-gravity demonstrated significantly increased (8-15%) body mass in male, but not female, rats exposed throughout gestation to chronic 2-g from conception to birth. We reported a similar outcome in adult offspring exposed throughout gestation to Unpredictable Variable Prenatal Stress (UVPS). Here we examine gene expression changes and the plasma of animals treated with our UVPS model to identify a potential role for prenatal stress in this hypergravity programming effect. Specifically we focused on appetite control and energy expenditure pathways in prenatally stressed adult (90-day-old) male Sprague-Dawley rats.

  19. Phase Variable O Antigen Biosynthetic Genes Control Expression of the Major Protective Antigen and Bacteriophage Receptor in Vibrio cholerae O1

    PubMed Central

    Seed, Kimberley D.; Faruque, Shah M.; Mekalanos, John J.; Calderwood, Stephen B.; Qadri, Firdausi; Camilli, Andrew

    2012-01-01

    The Vibrio cholerae lipopolysaccharide O1 antigen is a major target of bacteriophages and the human immune system and is of critical importance for vaccine design. We used an O1-specific lytic bacteriophage as a tool to probe the capacity of V. cholerae to alter its O1 antigen and identified a novel mechanism by which this organism can modulate O antigen expression and exhibit intra-strain heterogeneity. We identified two phase variable genes required for O1 antigen biosynthesis, manA and wbeL. manA resides outside of the previously recognized O1 antigen biosynthetic locus, and encodes for a phosphomannose isomerase critical for the initial step in O1 antigen biosynthesis. We determined that manA and wbeL phase variants are attenuated for virulence, providing functional evidence to further support the critical role of the O1 antigen for infectivity. We provide the first report of phase variation modulating O1 antigen expression in V. cholerae, and show that the maintenance of these phase variable loci is an important means by which this facultative pathogen can generate the diverse subpopulations of cells needed for infecting the host intestinal tract and for escaping predation by an O1-specific phage. PMID:23028317

  20. BRILIA: Integrated Tool for High-Throughput Annotation and Lineage Tree Assembly of B-Cell Repertoires.

    PubMed

    Lee, Donald W; Khavrutskii, Ilja V; Wallqvist, Anders; Bavari, Sina; Cooper, Christopher L; Chaudhury, Sidhartha

    2016-01-01

    The somatic diversity of antigen-recognizing B-cell receptors (BCRs) arises from Variable (V), Diversity (D), and Joining (J) (VDJ) recombination and somatic hypermutation (SHM) during B-cell development and affinity maturation. The VDJ junction of the BCR heavy chain forms the highly variable complementarity determining region 3 (CDR3), which plays a critical role in antigen specificity and binding affinity. Tracking the selection and mutation of the CDR3 can be useful in characterizing humoral responses to infection and vaccination. Although tens to hundreds of thousands of unique BCR genes within an expressed B-cell repertoire can now be resolved with high-throughput sequencing, tracking SHMs is still challenging because existing annotation methods are often limited by poor annotation coverage, inconsistent SHM identification across the VDJ junction, or lack of B-cell lineage data. Here, we present B-cell repertoire inductive lineage and immunosequence annotator (BRILIA), an algorithm that leverages repertoire-wide sequencing data to globally improve the VDJ annotation coverage, lineage tree assembly, and SHM identification. On benchmark tests against simulated human and mouse BCR repertoires, BRILIA correctly annotated germline and clonally expanded sequences with 94 and 70% accuracy, respectively, and it has a 90% SHM-positive prediction rate in the CDR3 of heavily mutated sequences; these are substantial improvements over existing methods. We used BRILIA to process BCR sequences obtained from splenic germinal center B cells extracted from C57BL/6 mice. BRILIA returned robust B-cell lineage trees and yielded SHM patterns that are consistent across the VDJ junction and agree with known biological mechanisms of SHM. By contrast, existing BCR annotation tools, which do not account for repertoire-wide clonal relationships, systematically underestimated both the size of clonally related B-cell clusters and yielded inconsistent SHM frequencies. We demonstrate BRILIA's utility in B-cell repertoire studies related to VDJ gene usage, mechanisms for adenosine mutations, and SHM hot spot motifs. Furthermore, we show that the complete gene usage annotation and SHM identification across the entire CDR3 are essential for studying the B-cell affinity maturation process through immunosequencing methods.

  1. Multivariate Bayesian variable selection exploiting dependence structure among outcomes: Application to air pollution effects on DNA methylation.

    PubMed

    Lee, Kyu Ha; Tadesse, Mahlet G; Baccarelli, Andrea A; Schwartz, Joel; Coull, Brent A

    2017-03-01

    The analysis of multiple outcomes is becoming increasingly common in modern biomedical studies. It is well-known that joint statistical models for multiple outcomes are more flexible and more powerful than fitting a separate model for each outcome; they yield more powerful tests of exposure or treatment effects by taking into account the dependence among outcomes and pooling evidence across outcomes. It is, however, unlikely that all outcomes are related to the same subset of covariates. Therefore, there is interest in identifying exposures or treatments associated with particular outcomes, which we term outcome-specific variable selection. In this work, we propose a variable selection approach for multivariate normal responses that incorporates not only information on the mean model, but also information on the variance-covariance structure of the outcomes. The approach effectively leverages evidence from all correlated outcomes to estimate the effect of a particular covariate on a given outcome. To implement this strategy, we develop a Bayesian method that builds a multivariate prior for the variable selection indicators based on the variance-covariance of the outcomes. We show via simulation that the proposed variable selection strategy can boost power to detect subtle effects without increasing the probability of false discoveries. We apply the approach to the Normative Aging Study (NAS) epigenetic data and identify a subset of five genes in the asthma pathway for which gene-specific DNA methylations are associated with exposures to either black carbon, a marker of traffic pollution, or sulfate, a marker of particles generated by power plants. © 2016, The International Biometric Society.

  2. An RNA-Seq-based reference transcriptome for Citrus.

    PubMed

    Terol, Javier; Tadeo, Francisco; Ventimilla, Daniel; Talon, Manuel

    2016-03-01

    Previous RNA-Seq studies in citrus have been focused on physiological processes relevant to fruit quality and productivity of the major species, especially sweet orange. Less attention has been paid to vegetative or reproductive tissues, while most Citrus species have never been analysed. In this work, we characterized the transcriptome of vegetative and reproductive tissues from 12 Citrus species from all main phylogenetic groups. Our aims were to acquire a complete view of the citrus transcriptome landscape, to improve previous functional annotations and to obtain genetic markers associated with genes of agronomic interest. 28 samples were used for RNA-Seq analysis, obtained from 12 Citrus species: C. medica, C. aurantifolia, C. limon, C. bergamia, C. clementina, C. deliciosa, C. reshni, C. maxima, C. paradisi, C. aurantium, C. sinensis and Poncirus trifoliata. Four different organs were analysed: root, phloem, leaf and flower. A total of 3421 million Illumina reads were produced and mapped against the reference C. clementina genome sequence. Transcript discovery pipeline revealed 3326 new genes, the number of genes with alternative splicing was increased to 19,739, and a total of 73,797 transcripts were identified. Differential expression studies between the four tissues showed that gene expression is overall related to the physiological function of the specific organs above any other variable. Variants discovery analysis revealed the presence of indels and SNPs in genes associated with fruit quality and productivity. Pivotal pathways in citrus such as those of flavonoids, flavonols, ethylene and auxin were also analysed in detail. © 2015 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.

  3. UGT2B17 and SULT1A1 gene copy number variation (CNV) detection by LabChip microfluidic technology.

    PubMed

    Gaedigk, Andrea; Gaedigk, Roger; Leeder, J Steven

    2010-05-01

    Gene copy number variations (CNVs) are increasingly recognized to play important roles in the expression of genes and hence on their respective enzymatic activities. This has been demonstrated for a number of drug metabolizing genes, such as UDP-glucuronosyltransferases 2B17 (UGT2B17) and sulfotransferase 1A1 (SULT1A1), which are subject to genetic heterogeneity, including CNV. Quantitative assays to assess gene copy number are therefore becoming an integral part of accurate genotype assessment and phenotype prediction. In this study, we evaluated a microfluidics-based system, the Bio-Rad Experion system, to determine the power and utility of this platform to detect UGT2B17 and SULT1A1 CNV in DNA samples derived from blood and tissue. UGT2B17 is known to present with 0, 1 or 2 and SULT1A1 with up to 5 gene copies. Distinct clustering (p<0.001) into copy number groups was achieved for both genes. DNA samples derived from blood exhibited less inter-run variability compared to DNA samples obtained from liver tissue. This variability may be caused by tissue-specific PCR inhibitors as it could be overcome by using DNA from another tissue, or after the DNA had undergone whole genome amplification. This method produced results comparable to those reported for other quantitative test platforms.

  4. Genetic Variability of 27 Traits in a Core Collection of Flax (Linum usitatissimum L.)

    PubMed Central

    You, Frank M.; Jia, Gaofeng; Xiao, Jin; Duguid, Scott D.; Rashid, Khalid Y.; Booker, Helen M.; Cloutier, Sylvie

    2017-01-01

    Assessment of genetic variability of plant core germplasm is needed for efficient germplasm utilization in breeding improvement. A total of 391 accessions of a flax core collection, which preserves the variation present in the world collection of 3,378 accessions maintained by Plant Gene Resources of Canada (PGRC) and represents a broad range of geographical origins, different improvement statuses and two morphotypes, was evaluated in field trials in up to 8 year-location environments for 10 agronomic, eight seed quality, six fiber and three disease resistance traits. The large phenotypic variation in this subset was explained by morphotypes (22%), geographical origins (11%), and other variance components (67%). Both divergence and similarity between two basic morphotypes, namely oil or linseed and fiber types, were observed, whereby linseed accessions had greater thousand seed weight, seeds m−2, oil content, branching capability and resistance to powdery mildew while fiber accessions had greater straw weight, plant height, protein content and resistance to pasmo and fusarium wilt diseases, but they had similar performance in many traits and some of them shared common characteristics of fiber and linseed types. Weak geographical patterns within either fiber or linseed accessions were confirmed, but specific trait performance was identified in East Asia for fiber type, and South Asia and North America for linseed type. Relatively high broad-sense heritability was obtained for seed quality traits, followed by agronomic traits and resistance to powdery mildew and fusarium wilt. Diverse phenotypic and genetic variability in the flax core collection constitutes a useful resource for breeding. PMID:28993783

  5. Genetic Variability of 27 Traits in a Core Collection of Flax (Linum usitatissimum L.).

    PubMed

    You, Frank M; Jia, Gaofeng; Xiao, Jin; Duguid, Scott D; Rashid, Khalid Y; Booker, Helen M; Cloutier, Sylvie

    2017-01-01

    Assessment of genetic variability of plant core germplasm is needed for efficient germplasm utilization in breeding improvement. A total of 391 accessions of a flax core collection, which preserves the variation present in the world collection of 3,378 accessions maintained by Plant Gene Resources of Canada (PGRC) and represents a broad range of geographical origins, different improvement statuses and two morphotypes, was evaluated in field trials in up to 8 year-location environments for 10 agronomic, eight seed quality, six fiber and three disease resistance traits. The large phenotypic variation in this subset was explained by morphotypes (22%), geographical origins (11%), and other variance components (67%). Both divergence and similarity between two basic morphotypes, namely oil or linseed and fiber types, were observed, whereby linseed accessions had greater thousand seed weight, seeds m -2 , oil content, branching capability and resistance to powdery mildew while fiber accessions had greater straw weight, plant height, protein content and resistance to pasmo and fusarium wilt diseases, but they had similar performance in many traits and some of them shared common characteristics of fiber and linseed types. Weak geographical patterns within either fiber or linseed accessions were confirmed, but specific trait performance was identified in East Asia for fiber type, and South Asia and North America for linseed type. Relatively high broad-sense heritability was obtained for seed quality traits, followed by agronomic traits and resistance to powdery mildew and fusarium wilt. Diverse phenotypic and genetic variability in the flax core collection constitutes a useful resource for breeding.

  6. Recombinant DNA modification of gibberellin metabolism alters growth rate and biomass allocation in Populus

    DOE PAGES

    Lu, Haiwei; Viswanath, Venkatesh; Ma, Cathleen; ...

    2015-11-13

    Overexpression of genes that modify gibberellin (GA) metabolism and signaling have been previously shown to produce trees with improved biomass production but highly disturbed development. In order to examine if more subtle types of genetic modification of GA could improve growth rate and modify tree architecture, we transformed a model poplar genotype (Populus tremula × P. alba) with eight genes, including two cisgenes (intact copies of native genes), four intragenes (modified copies of native genes), and two transgenes (from sexually incompatible species), and studied their effects under greenhouse and field conditions. In the greenhouse, four out of the eight testedmore » genes produced a significant and often striking improvement of stem volume, and two constructs significantly modified the proportion of root or shoot biomass. Characterization of GA concentrations in the cisgenic population that had an additional copy of a poplar GA20-oxidase gene showed elevated concentrations of 13-hydroxylated GAs compared to wild-type poplars. In the field, we observed growth improvement for three of the six tested constructs, but it was significantly greater for only one of the constructs, a pRGL:GA20-oxidase intragene. The greenhouse and field responses were highly variable, possibly to due to cross-talk among the GA pathway and other stress response pathways, or due to interactions between the cisgenes and intragenes with highly similar endogenes. Our results indicate that extensive field trials, similar to those required for conventional breeding, will be critical to evaluating the value and pleiotropic effects of GA-modifying genes.« less

  7. Recombinant DNA modification of gibberellin metabolism alters growth rate and biomass allocation in Populus

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lu, Haiwei; Viswanath, Venkatesh; Ma, Cathleen

    Overexpression of genes that modify gibberellin (GA) metabolism and signaling have been previously shown to produce trees with improved biomass production but highly disturbed development. In order to examine if more subtle types of genetic modification of GA could improve growth rate and modify tree architecture, we transformed a model poplar genotype (Populus tremula × P. alba) with eight genes, including two cisgenes (intact copies of native genes), four intragenes (modified copies of native genes), and two transgenes (from sexually incompatible species), and studied their effects under greenhouse and field conditions. In the greenhouse, four out of the eight testedmore » genes produced a significant and often striking improvement of stem volume, and two constructs significantly modified the proportion of root or shoot biomass. Characterization of GA concentrations in the cisgenic population that had an additional copy of a poplar GA20-oxidase gene showed elevated concentrations of 13-hydroxylated GAs compared to wild-type poplars. In the field, we observed growth improvement for three of the six tested constructs, but it was significantly greater for only one of the constructs, a pRGL:GA20-oxidase intragene. The greenhouse and field responses were highly variable, possibly to due to cross-talk among the GA pathway and other stress response pathways, or due to interactions between the cisgenes and intragenes with highly similar endogenes. Our results indicate that extensive field trials, similar to those required for conventional breeding, will be critical to evaluating the value and pleiotropic effects of GA-modifying genes.« less

  8. Gene expression atlas of pigeonpea and its application to gain insights into genes associated with pollen fertility implicated in seed formation

    PubMed Central

    Pazhamala, Lekha T.; Purohit, Shilp; Saxena, Rachit K.; Garg, Vanika; Krishnamurthy, L.; Verdier, Jerome

    2017-01-01

    Abstract Pigeonpea (Cajanus cajan) is an important grain legume of the semi-arid tropics, mainly used for its protein rich seeds. To link the genome sequence information with agronomic traits resulting from specific developmental processes, a Cajanus cajan gene expression atlas (CcGEA) was developed using the Asha genotype. Thirty tissues/organs representing developmental stages from germination to senescence were used to generate 590.84 million paired-end RNA-Seq data. The CcGEA revealed a compendium of 28 793 genes with differential, specific, spatio-temporal and constitutive expression during various stages of development in different tissues. As an example to demonstrate the application of the CcGEA, a network of 28 flower-related genes analysed for cis-regulatory elements and splicing variants has been identified. In addition, expression analysis of these candidate genes in male sterile and male fertile genotypes suggested their critical role in normal pollen development leading to seed formation. Gene network analysis also identified two regulatory genes, a pollen-specific SF3 and a sucrose–proton symporter, that could have implications for improvement of agronomic traits such as seed production and yield. In conclusion, the CcGEA provides a valuable resource for pigeonpea to identify candidate genes involved in specific developmental processes and to understand the well-orchestrated growth and developmental process in this resilient crop. PMID:28338822

  9. Integrated analysis of gene expression and methylation profiles of 48 candidate genes in breast cancer patients.

    PubMed

    Li, Zibo; Heng, Jianfu; Yan, Jinhua; Guo, Xinwu; Tang, Lili; Chen, Ming; Peng, Limin; Wu, Yepeng; Wang, Shouman; Xiao, Zhi; Deng, Zhongping; Dai, Lizhong; Wang, Jun

    2016-11-01

    Gene-specific methylation and expression have shown biological and clinical importance for breast cancer diagnosis and prognosis. Integrated analysis of gene methylation and gene expression may identify genes associated with biology mechanism and clinical outcome of breast cancer and aid in clinical management. Using high-throughput microfluidic quantitative PCR, we analyzed the expression profiles of 48 candidate genes in 96 Chinese breast cancer patients and investigated their correlation with gene methylation and associations with breast cancer clinical parameters. Breast cancer-specific gene expression alternation was found in 25 genes with significant expression difference between paired tumor and normal tissues. A total of 9 genes (CCND2, EGFR, GSTP1, PGR, PTGS2, RECK, SOX17, TNFRSF10D, and WIF1) showed significant negative correlation between methylation and gene expression, which were validated in the TCGA database. Total 23 genes (ACADL, APC, BRCA2, CADM1, CAV1, CCND2, CST6, EGFR, ESR2, GSTP1, ICAM5, NPY, PGR, PTGS2, RECK, RUNX3, SFRP1, SOX17, SYK, TGFBR2, TNFRSF10D, WIF1, and WRN) annotated with potential TFBSs in the promoter regions showed negative correlation between methylation and expression. In logistics regression analysis, 31 of the 48 genes showed improved performance in disease prediction with combination of methylation and expression coefficient. Our results demonstrated the complex correlation and the possible regulatory mechanisms between DNA methylation and gene expression. Integration analysis of methylation and expression of candidate genes could improve performance in breast cancer prediction. These findings would contribute to molecular characterization and identification of biomarkers for potential clinical applications.

  10. Why weight? Modelling sample and observational level variability improves power in RNA-seq analyses.

    PubMed

    Liu, Ruijie; Holik, Aliaksei Z; Su, Shian; Jansz, Natasha; Chen, Kelan; Leong, Huei San; Blewitt, Marnie E; Asselin-Labat, Marie-Liesse; Smyth, Gordon K; Ritchie, Matthew E

    2015-09-03

    Variations in sample quality are frequently encountered in small RNA-sequencing experiments, and pose a major challenge in a differential expression analysis. Removal of high variation samples reduces noise, but at a cost of reducing power, thus limiting our ability to detect biologically meaningful changes. Similarly, retaining these samples in the analysis may not reveal any statistically significant changes due to the higher noise level. A compromise is to use all available data, but to down-weight the observations from more variable samples. We describe a statistical approach that facilitates this by modelling heterogeneity at both the sample and observational levels as part of the differential expression analysis. At the sample level this is achieved by fitting a log-linear variance model that includes common sample-specific or group-specific parameters that are shared between genes. The estimated sample variance factors are then converted to weights and combined with observational level weights obtained from the mean-variance relationship of the log-counts-per-million using 'voom'. A comprehensive analysis involving both simulations and experimental RNA-sequencing data demonstrates that this strategy leads to a universally more powerful analysis and fewer false discoveries when compared to conventional approaches. This methodology has wide application and is implemented in the open-source 'limma' package. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Differential expression of CURS gene during various growth stages, climatic condition and soil nutrients in turmeric (Curcuma longa): Towards site specific cultivation for high curcumin yield.

    PubMed

    Sandeep, I Sriram; Das, Suryasnata; Nasim, Noohi; Mishra, Antaryami; Acharya, Laxmikanta; Joshi, Raj Kumar; Nayak, Sanghamitra; Mohanty, Sujata

    2017-09-01

    Curcuma longa L., accumulates substantial amount of curcumin and essential oil. Little is known about the differential expression of curcumin synthase (CURS) gene and consequent curcumin content variations at different agroclimatic zones. The present study aimed to evaluate the effect of climate, soil and harvesting phase on expression of CURS gene for curcumin yield in two high yielding turmeric cultivars. Expression of CURS gene at different experimental zones as well as at different harvesting phase was studied through transcriptional analysis by qRT-PCR. Curcumin varied from 1.5 to 5% and 1.4-5% in Surama and Roma respectively. The expression of CURS also varied from 0.402 to 5.584 fold in Surama and 0.856-5.217 fold in Roma. Difference in curcumin content at a particular zone varied among different harvesting period from 3.95 to 4.31% in Surama and 3.57-3.83% in Roma. Expression of CURS gene was also effected by harvesting time of the rhizome which varied from 7.389 to 16.882 fold in Surama and 4.41-8.342 fold in Roma. The CURS gene expression was found regardless of variations in curcumin content at different experimental zones. This may be due to the effects of soil and environmental variables. Expression was positively correlated with curcumin content with different harvesting time at a particular zone. This find indicates effect of soil and environment on molecular and biochemical dynamics of curcumin biosynthesis and could be useful in genetic improvement of turmeric. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  12. Transcriptome Characterization of Cymbidium sinense 'Dharma' Using 454 Pyrosequencing and Its Application in the Identification of Genes Associated with Leaf Color Variation.

    PubMed

    Zhu, Genfa; Yang, Fengxi; Shi, Shanshan; Li, Dongmei; Wang, Zhen; Liu, Hailin; Huang, Dan; Wang, Caiyun

    2015-01-01

    The highly variable leaf color of Cymbidium sinense significantly improves its horticultural and economic value, and makes it highly desirable in the flower markets in China and Southeast Asia. However, little is understood about the molecular mechanism underlying leaf-color variations. In this study, we found the content of photosynthetic pigments, especially chlorophyll degradation metabolite in the leaf-color mutants is distinguished significantly from that in the wild type of Cymbidium sinense 'Dharma'. To further determine the candidate genes controlling leaf-color variations, we first sequenced the global transcriptome using 454 pyrosequencing. More than 0.7 million expressed sequence tags (ESTs) with an average read length of 445.9 bp were generated and assembled into 103,295 isotigs representing 68,460 genes. Of these isotigs, 43,433 were significantly aligned to known proteins in the public database, of which 29,299 could be categorized into 42 functional groups in the gene ontology system, 10,079 classified into 23 functional classifications in the clusters of orthologous groups system, and 23,092 assigned to 139 clusters of specific metabolic pathways in the Kyoto Encyclopedia of Genes and Genomes. Among these annotations, 95 isotigs were designated as involved in chlorophyll metabolism. On this basis, we identified 16 key enzyme-encoding genes in the chlorophyll metabolism pathway, the full length cDNAs and expressions of which were further confirmed. Expression pattern indicated that the key enzyme-encoding genes for chlorophyll degradation were more highly expressed in the leaf color mutants, as was consistent with their lower chlorophyll contents. This study is the first to supply an informative 454 EST dataset for Cymbidium sinense 'Dharma' and to identify original leaf color-associated genes, which provide important resources to facilitate gene discovery for molecular breeding, marketable trait discovery, and investigating various biological process in this species.

  13. Transcriptome Characterization of Cymbidium sinense 'Dharma' Using 454 Pyrosequencing and Its Application in the Identification of Genes Associated with Leaf Color Variation

    PubMed Central

    Shi, Shanshan; Li, Dongmei; Wang, Zhen; Liu, Hailin; Huang, Dan; Wang, Caiyun

    2015-01-01

    The highly variable leaf color of Cymbidium sinense significantly improves its horticultural and economic value, and makes it highly desirable in the flower markets in China and Southeast Asia. However, little is understood about the molecular mechanism underlying leaf-color variations. In this study, we found the content of photosynthetic pigments, especially chlorophyll degradation metabolite in the leaf-color mutants is distinguished significantly from that in the wild type of Cymbidium sinense 'Dharma'. To further determine the candidate genes controlling leaf-color variations, we first sequenced the global transcriptome using 454 pyrosequencing. More than 0.7 million expressed sequence tags (ESTs) with an average read length of 445.9 bp were generated and assembled into 103,295 isotigs representing 68,460 genes. Of these isotigs, 43,433 were significantly aligned to known proteins in the public database, of which 29,299 could be categorized into 42 functional groups in the gene ontology system, 10,079 classified into 23 functional classifications in the clusters of orthologous groups system, and 23,092 assigned to 139 clusters of specific metabolic pathways in the Kyoto Encyclopedia of Genes and Genomes. Among these annotations, 95 isotigs were designated as involved in chlorophyll metabolism. On this basis, we identified 16 key enzyme-encoding genes in the chlorophyll metabolism pathway, the full length cDNAs and expressions of which were further confirmed. Expression pattern indicated that the key enzyme-encoding genes for chlorophyll degradation were more highly expressed in the leaf color mutants, as was consistent with their lower chlorophyll contents. This study is the first to supply an informative 454 EST dataset for Cymbidium sinense 'Dharma' and to identify original leaf color-associated genes, which provide important resources to facilitate gene discovery for molecular breeding, marketable trait discovery, and investigating various biological process in this species. PMID:26042676

  14. Combining transcription factor binding affinities with open-chromatin data for accurate gene expression prediction.

    PubMed

    Schmidt, Florian; Gasparoni, Nina; Gasparoni, Gilles; Gianmoena, Kathrin; Cadenas, Cristina; Polansky, Julia K; Ebert, Peter; Nordström, Karl; Barann, Matthias; Sinha, Anupam; Fröhler, Sebastian; Xiong, Jieyi; Dehghani Amirabad, Azim; Behjati Ardakani, Fatemeh; Hutter, Barbara; Zipprich, Gideon; Felder, Bärbel; Eils, Jürgen; Brors, Benedikt; Chen, Wei; Hengstler, Jan G; Hamann, Alf; Lengauer, Thomas; Rosenstiel, Philip; Walter, Jörn; Schulz, Marcel H

    2017-01-09

    The binding and contribution of transcription factors (TF) to cell specific gene expression is often deduced from open-chromatin measurements to avoid costly TF ChIP-seq assays. Thus, it is important to develop computational methods for accurate TF binding prediction in open-chromatin regions (OCRs). Here, we report a novel segmentation-based method, TEPIC, to predict TF binding by combining sets of OCRs with position weight matrices. TEPIC can be applied to various open-chromatin data, e.g. DNaseI-seq and NOMe-seq. Additionally, Histone-Marks (HMs) can be used to identify candidate TF binding sites. TEPIC computes TF affinities and uses open-chromatin/HM signal intensity as quantitative measures of TF binding strength. Using machine learning, we find low affinity binding sites to improve our ability to explain gene expression variability compared to the standard presence/absence classification of binding sites. Further, we show that both footprints and peaks capture essential TF binding events and lead to a good prediction performance. In our application, gene-based scores computed by TEPIC with one open-chromatin assay nearly reach the quality of several TF ChIP-seq data sets. Finally, these scores correctly predict known transcriptional regulators as illustrated by the application to novel DNaseI-seq and NOMe-seq data for primary human hepatocytes and CD4+ T-cells, respectively. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Diversity in the 18S SSU rRNA V4 hyper-variable region of Theileria spp. in Cape buffalo (Syncerus caffer) and cattle from southern Africa.

    PubMed

    Mans, Ben J; Pienaar, Ronel; Latif, Abdalla A; Potgieter, Fred T

    2011-05-01

    Sequence variation within the 18S SSU rRNA V4 hyper-variable region can affect the accuracy of real-time hybridization probe-based diagnostics for the detection of Theileria spp. infections. This is relevant for assays that use non-specific primers, such as the real-time hybridization assay for T. parva (Sibeko et al. 2008). To assess the effect of sequence variation on this test, the Theileria 18S gene from 62 buffalo and 49 cattle samples was cloned and ∼1000 clones sequenced. Twenty-six genotypes were detected which included known and novel genotypes for the T. buffeli, T. mutans, T. taurotragi and T. velifera clades. A novel genotype related to T. sp. (sable) was also detected in 1 bovine sample. Theileria genotypic diversity was higher in buffalo compared to cattle. Polymorphism within the T. parva hyper-variable region was confirmed by aberrant real-time melting peaks and supported by sequencing of the S5 ribosomal gene. Analysis of the S5 gene suggests that this gene can be a marker for species differentiation. T. parva, T. sp. (buffalo) and T. sp. (bougasvlei) remain the only genotypes amplified by the primer set of the hybridization assay. Therefore, the 18S sequence diversity observed does not seem to affect the current real-time hybridization assay for T. parva.

  16. Performance of Glutamate Dehydrogenase and Triose Phosphate Isomerase Genes in the Analysis of Genotypic Variability of Isolates of Giardia duodenalis from Livestocks

    PubMed Central

    Fava, Natália M. N.; Soares, Rodrigo M.; Scalia, Luana A. M.; Kalapothakis, Evanguedes; Pena, Isabella F.; Vieira, Carlos U.; Faria, Elaine S. M.; Cunha, Maria J.; Couto, Talles R.; Cury, Márcia Cristina

    2013-01-01

    Giardia duodenalis is a small intestinal protozoan parasite of several terrestrial vertebrates. This work aims to assess the genotypic variability of Giardia duodenalis isolates from cattle, sheep and pigs in the Southeast of Brazil, by comparing the standard characterization between glutamate dehydrogenase (gdh) and triose phosphate isomerase (tpi) primers. Fecal samples from the three groups of animals were analyzed using the zinc sulphate centrifugal flotation technique. Out of 59 positive samples, 30 were from cattle, 26 from sheep and 3 from pigs. Cyst pellets were stored and submitted to PCR and nested-PCR reactions with gdh and tpi primers. Fragment amplification of gdh and tpi genes was observed in 25 (42.4%) and 36 (61.0%) samples, respectively. Regarding the sequencing, 24 sequences were obtained with gdh and 20 with tpi. For both genes, there was a prevalence of E specific species assemblage, although some isolates have been identified as A and B, by the tpi sequencing. This has also shown a larger number of heterogeneous sequences, which have been attribute to mixed infections between assemblages B and E. The largest variability of inter-assemblage associated to the frequency of heterogeneity provided by tpi sequencing reinforces the polymorphic nature of this gene and makes it an excellent target for studies on molecular epidemiology. PMID:24308010

  17. Assessment of sequence variability in a p23 gene region within and among three genotypes of the Theileria orientalis complex from south-eastern Australia.

    PubMed

    Perera, Piyumali K; Gasser, Robin B; Jabbar, Abdul

    2015-03-01

    Oriental theileriosis is a tick-borne, protozoan disease of cattle caused by one or more genotypes of Theileria orientalis complex. In this study, we assessed sequence variability in a region of the 23kDa piroplasm membrane protein (p23) gene within and among three T. orientalis genotypes (designated buffeli, chitose and ikeda) in south-eastern Australia. Genomic DNA (n=100) was extracted from blood of infected cattle from various locations endemic for oriental theileriosis and tested by polymerase chain reaction (PCR)-coupled mutation scanning (single-strand conformation polymorphism (SSCP)) and targeted sequencing analysis. Eight distinct sequences represented all DNA samples, and three genotypes were found: buffeli (n=3), chitose (3) and ikeda (2). Nucleotide pairwise comparisons among these eight sequences revealed considerably higher variability among the genotypes (6.6-11.7%) than within them (0-1.9%), indicating that the p23 gene region allows the accurate identification of T. orientalis genotypes. In the future, we will combine this gene with other molecular markers to study the genetic structure of T. orientalis populations in Australasia, which will pave the way to establish a highly sensitive and specific PCR-based assay for genotypic diagnosis of infection and for assessing levels of parasitaemia in cattle. Copyright © 2014 Elsevier GmbH. All rights reserved.

  18. Gene expression patterns of wood decay fungi Postia placenta and Phanerochaete chrysosporium are influenced by wood substrate composition during degradation

    Treesearch

    Oleksandr Skyba; Daniel Cullen; Carl J. Douglas; Shawn D. Mansfield

    2016-01-01

    Identification of the specific genes and enzymes involved in the fungal degradation of lignocellulosic biomass derived from feedstocks with various compositions is essential to the development of improved bioenergy processes. In order to elucidate the effect of substrate composition on gene expression in wood-rotting fungi, we employed microarrays based on the...

  19. Mission Advantages of Constant Power, Variable Isp Electrostatic Thrusters

    NASA Technical Reports Server (NTRS)

    Oleson, Steven R.

    2000-01-01

    Electric propulsion has moved from station-keeping capability for spacecraft to primary propulsion with the advent of both the Deep Space One asteroid flyby and geosynchronous spacecraft orbit insertion. In both cases notably more payload was delivered than would have been possible with chemical propulsion. To provide even greater improvements electrostatic thruster performance could be varied in specific impulse, but kept at constant power to provide better payload or trip time performance for different mission phases. Such variable specific impulse mission applications include geosynchronous and low earth orbit spacecraft stationkeeping and orbit insertion, geosynchronous reusable tug missions, and interplanetary probes. The application of variable specific impulse devices is shown to add from 5 to 15% payload for these missions. The challenges to building such devices include variable voltage power supplies and extending fuel throughput capabilities across the specific impulse range.

  20. A systems-wide comparison of red rice (Oryza longistaminata) tissues identifies rhizome specific genes and proteins that are targets for cultivated rice improvement

    PubMed Central

    2014-01-01

    Background The rhizome, the original stem of land plants, enables species to invade new territory and is a critical component of perenniality, especially in grasses. Red rice (Oryza longistaminata) is a perennial wild rice species with many valuable traits that could be used to improve cultivated rice cultivars, including rhizomatousness, disease resistance and drought tolerance. Despite these features, little is known about the molecular mechanisms that contribute to rhizome growth, development and function in this plant. Results We used an integrated approach to compare the transcriptome, proteome and metabolome of the rhizome to other tissues of red rice. 116 Gb of transcriptome sequence was obtained from various tissues and used to identify rhizome-specific and preferentially expressed genes, including transcription factors and hormone metabolism and stress response-related genes. Proteomics and metabolomics approaches identified 41 proteins and more than 100 primary metabolites and plant hormones with rhizome preferential accumulation. Of particular interest was the identification of a large number of gene transcripts from Magnaportha oryzae, the fungus that causes rice blast disease in cultivated rice, even though the red rice plants showed no sign of disease. Conclusions A significant set of genes, proteins and metabolites appear to be specifically or preferentially expressed in the rhizome of O. longistaminata. The presence of M. oryzae gene transcripts at a high level in apparently healthy plants suggests that red rice is resistant to this pathogen, and may be able to provide genes to cultivated rice that will enable resistance to rice blast disease. PMID:24521476

  1. Transcriptome analysis of two recombinant inbred lines of common bean contrasting for symbiotic nitrogen fixation

    USDA-ARS?s Scientific Manuscript database

    Common bean (Phaseolus vulgaris L.) is able to fix atmospheric nitrogen (N2) through symbiotic nitrogen fixation (SNF). Effective utilization of existing variability for SNF in common bean for genetic improvement requires an understanding of underlying genes and molecular mechanisms. The utility of ...

  2. Unusual Variability of the Drosophila Melanogaster Ref(2)p Protein Which Controls the Multiplication of Sigma Rhabdovirus

    PubMed Central

    Dru, P.; Bras, F.; Dezelee, S.; Gay, P.; Petitjean, A. M.; Pierre-Deneubourg, A.; Teninges, D.; Contamine, D.

    1993-01-01

    The ref(2)P gene of Drosophila melanogaster was identified by the discovery of two alleles, P(o) and P(p), respectively, permissive and restrictive for sigma rhabdovirus multiplication. A surprising variability of this gene was first noticed by the observation of size differences between the transcripts of permissive and restrictive alleles. In this paper, another restrictive allele, P(n), clearly distinct from P(p), is described: it exhibits a weaker antiviral effect than P(p) and differs from P(p) by its molecular structure. Five types of alleles were distinguished on the basis of their molecular structure, as revealed by S1 nuclease analysis of 17 D. melanogaster strains; three alleles were permissive and two restrictive. Comparison of the sequences of four haplotypes revealed numerous point mutations, two deletions (21 and 24 bp) and a complex event involving a 3-bp deletion, all affected the coding region. The unusual variability of the ref(2)P locus was confirmed by the high ratio of amino acid replacements to synonymous mutations (7:1), as compared to that of other genes, such as the Adh (2:42). Nevertheless, nucleotide sequence comparison with the Drosophila erecta ref(2)P gene shows that selective pressures are exerted to maintain the existence of a functional protein. The effects of this high variability on the ref(2)P protein are discussed in relation to its specific antiviral properties and to its function in D. melanogaster, where it is required for male fertility. PMID:8462852

  3. [Progress of gene editing technologies and prospect in traditional Chinese medicine].

    PubMed

    Ma, Yan-Yan; Li, Jing-Zhe; Gao, Er-Ning; Qian, Dan; Zhong, Ju-Ying; Liu, Chang-Zhen

    2017-01-01

    Gene editing is a kind of technologies that makes precise modification to the genome. It can be used to knock out/in and replace the specific DNA fragment, and make accurate gene editing on the genome level. The essence of the technique is the DNA sequence change with use of non homologous end link repair and homologous recombination repair, combined with specific DNA target recognition and endonuclease.This technology has wide range of development prospects and high application value in terms of scientific research, agriculture, medical treatment and other fields. In the field of gene therapy, gene editing technology has achieved cross-time success in cancers such as leukemia, genetic disorders such as hemophilia, thalassemia, multiple muscle nutritional disorders and retrovirus associated infectious diseases such as AIDS and other diseases. The preparation work for new experimental methods and animal models combined with gene editing technology is under rapid development and improvement. Laboratories around the world have also applied gene editing technique in prevention of malaria, organ transplantation, biological pharmaceuticals, agricultural breeding improvement, resurrection of extinct species, and other research areas. This paper summarizes the application and development status of gene editing technique in the above fields, and also preliminarily explores the potential application prospect of the technology in the field of traditional Chinese medicine, and discusses the present controversy and thoughts. Copyright© by the Chinese Pharmaceutical Association.

  4. FOXM1 promotes the progression of prostate cancer by regulating PSA gene transcription.

    PubMed

    Liu, Youhong; Liu, Yijun; Yuan, Bowen; Yin, Linglong; Peng, Yuchong; Yu, Xiaohui; Zhou, Weibing; Gong, Zhicheng; Liu, Jianye; He, Leye; Li, Xiong

    2017-03-07

    Androgen/AR is the primary contributor to prostate cancer (PCa) progression by regulating Prostate Specific Antigen (PSA) gene transcription. The disease inevitably evolves to androgen-independent (AI) status. Other mechanisms by which PSA is regulated and develops to AI have not yet been fully determined. FOXM1 is a cell proliferation-specific transcription factor highly expressed in PCa cells compared to non-malignant prostate epithelial cells, suggesting that the aberrant overexpression of FOXM1 contributes to PCa development. In addition to regulating AR gene transcription and cell cycle-regulatory genes, FOXM1 selectively regulates the gene transcription of KLK2 and PSA, typical androgen responsive genes. Screening the potential FOXM1-binding sites by ChIP-PCR, we found that FOXM1 directly binds to the FHK binding motifs in the PSA promoter/enhancer regions. AI C4-2 cells have more FOXM1 binding sites than androgen dependent LNCaP cells. The depletion of FOXM1 by small molecular inhibitors significantly improves the suppression of PSA gene transcription by the anti-AR agent Cadosax. This is the first report showing that FOXM1 promotes PCa progression by regulating PSA gene transcription, particularly in AI PCa cells. The combination of anti-AR agents and FOXM1 inhibitors has the potential to greatly improve therapy for late-stage PCa patients by suppressing PSA levels.

  5. Variability of Actinobacteria, a minor component of rumen microflora.

    PubMed

    Suľák, M; Sikorová, L; Jankuvová, J; Javorský, P; Pristaš, P

    2012-07-01

    Actinobacteria (Actinomycetes) are a significant and interesting group of gram-positive bacteria. They are regular, though infrequent, members of the microbial life in the rumen and represent up to 3 % of total rumen bacteria; there is considerable lack of information about ecology and biology of rumen actinobacteria. During the characterization of variability of rumen treponemas using non-cultivation approach, we also noted the variability of rumen actinobacteria. By using Treponema-specific primers a specific 16S rRNA gene library was prepared from cow and sheep rumen total DNA. About 10 % of recombinant clones contained actinobacteria-like sequences. Phylogenetic analyses of 11 clones obtained showed the high variability of actinobacteria in the ruminant digestive system. While some sequences are nearly identical to known sequences of actinobacteria, we detected completely new clusters of actinobacteria-like sequences, representing probably new, as yet undiscovered, group of rumen Actinobacteria. Further research will be necessary for understanding their nature and functions in the rumen.

  6. New Developments in CRISPR Technology: Improvements in Specificity and Efficiency.

    PubMed

    Safari, Fatemeh; Farajnia, Safar; Ghasemi, Younes; Zarghami, Nosratollah

    2017-01-01

    RNA-guided endonuclease as a versatile genome editing technology opened new windows in various fields of biology. The simplicity of this revolutionary technique provides a promising future for its application in a broad range of approaches from functional annotation of genes to diseases, to genetic manipulation and gene therapy. Besides the site-specific activity of Cas9 endonuclease, the unintended cleavage known as off-target effect is still a major challenge for this genome editing technique. Various strategies have been developed to resolve this bottleneck including development of new softwares for designing optimized guide RNA (gRNA), engineering Cas9 enzyme, improvement in off-target detection assays, etc. Results: This review dedicated to discuss on methods that have been used for optimizing Cas9, specificity with the aim of improving this technology for therapeutic applications. In addition, the applications and novel breakthroughs in the field of CRISPR technology will be described. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  7. Variable promoter methylation contributes to differential expression of key genes in human placenta-derived venous and arterial endothelial cells.

    PubMed

    Joo, Jihoon E; Hiden, Ursula; Lassance, Luciana; Gordon, Lavinia; Martino, David J; Desoye, Gernot; Saffery, Richard

    2013-07-15

    The endothelial compartment, comprising arterial, venous and lymphatic cell types, is established prenatally in association with rapid phenotypic and functional changes. The molecular mechanisms underpinning this process in utero have yet to be fully elucidated. The aim of this study was to investigate the potential for DNA methylation to act as a driver of the specific gene expression profiles of arterial and venous endothelial cells. Placenta-derived venous and arterial endothelial cells were collected at birth prior to culturing. DNA methylation was measured at >450,000 CpG sites in parallel with expression measurements taken from 25,000 annotated genes. A consistent set of genomic loci was found to show coordinate differential methylation between the arterial and venous cell types. This included many loci previously not investigated in relation to endothelial function. An inverse relationship was observed between gene expression and promoter methylation levels for a limited subset of genes implicated in endothelial function, including NOS3, encoding endothelial Nitric Oxide Synthase. Endothelial cells derived from the placental vasculature at birth contain widespread methylation of key regulatory genes. These are candidates involved in the specification of different endothelial cell types and represent potential target genes for environmentally mediated epigenetic disruption in utero in association with cardiovascular disease risk later in life.

  8. Grassland management regimens reduce small-scale heterogeneity and species diversity of beta-proteobacterial ammonia pxidizer populations.

    PubMed

    Webster, Gordon; Embley, T Martin; Prosser, James I

    2002-01-01

    The impact of soil management practices on ammonia oxidizer diversity and spatial heterogeneity was determined in improved (addition of N fertilizer), unimproved (no additions), and semi-improved (intermediate management) grassland pastures at the Sourhope Research Station in Scotland. Ammonia oxidizer diversity within each grassland soil was assessed by PCR amplification of microbial community DNA with both ammonia oxidizer-specific, 16S rRNA gene (rDNA) and functional, amoA, gene primers. PCR products were analysed by denaturing gradient gel electrophoresis, phylogenetic analysis of partial 16S rDNA and amoA sequences, and hybridization with ammonia oxidizer-specific oligonucleotide probes. Ammonia oxidizer populations in unimproved soils were more diverse than those in improved soils and were dominated by organisms representing Nitrosospira clusters 1 and 3 and Nitrosomonas cluster 7 (closely related phylogenetically to Nitrosomonas europaea). Improved soils were only dominated by Nitrosospira cluster 3 and Nitrosomonas cluster 7. These differences were also reflected in functional gene (amoA) diversity, with amoA gene sequences of both Nitrosomonas and Nitrosospira species detected. Replicate 0.5-g samples of unimproved soil demonstrated significant spatial heterogeneity in 16S rDNA-defined ammonia oxidizer clusters, which was reflected in heterogeneity in ammonium concentration and pH. Heterogeneity in soil characteristics and ammonia oxidizer diversity were lower in improved soils. The results therefore demonstrate significant effects of soil management on diversity and heterogeneity of ammonia oxidizer populations that are related to similar changes in relevant soil characteristics.

  9. Applications of CRISPR/Cas9 technology for targeted mutagenesis, gene replacement and stacking of genes in higher plants.

    PubMed

    Luo, Ming; Gilbert, Brian; Ayliffe, Michael

    2016-07-01

    Mutagenesis continues to play an essential role for understanding plant gene function and, in some instances, provides an opportunity for plant improvement. The development of gene editing technologies such as TALENs and zinc fingers has revolutionised the targeted mutation specificity that can now be achieved. The CRISPR/Cas9 system is the most recent addition to gene editing technologies and arguably the simplest requiring only two components; a small guide RNA molecule (sgRNA) and Cas9 endonuclease protein which complex to recognise and cleave a specific 20 bp target site present in a genome. Target specificity is determined by complementary base pairing between the sgRNA and target site sequence enabling highly specific, targeted mutation to be readily engineered. Upon target site cleavage, error-prone endogenous repair mechanisms produce small insertion/deletions at the target site usually resulting in loss of gene function. CRISPR/Cas9 gene editing has been rapidly adopted in plants and successfully undertaken in numerous species including major crop species. Its applications are not restricted to mutagenesis and target site cleavage can be exploited to promote sequence insertion or replacement by recombination. The multiple applications of this technology in plants are described.

  10. Molecular breeding in Brassica for salt tolerance: importance of microsatellite (SSR) markers for molecular breeding in Brassica.

    PubMed

    Kumar, Manu; Choi, Ju-Young; Kumari, Nisha; Pareek, Ashwani; Kim, Seong-Ryong

    2015-01-01

    Salinity is one of the important abiotic factors for any crop management in irrigated as well as rainfed areas, which leads to poor harvests. This yield reduction in salt affected soils can be overcome by improving salt tolerance in crops or by soil reclamation. Salty soils can be reclaimed by leaching the salt or by cultivation of salt tolerance crops. Salt tolerance is a quantitative trait controlled by several genes. Poor knowledge about mechanism of its inheritance makes slow progress in its introgression into target crops. Brassica is known to be a good reclamation crop. Inter and intra specific variation within Brassica species shows potential of molecular breeding to raise salinity tolerant genotypes. Among the various molecular markers, SSR markers are getting high attention, since they are randomly sparsed, highly variable and show co-dominant inheritance. Furthermore, as sequencing techniques are improving and softwares to find SSR markers are being developed, SSR markers technology is also evolving rapidly. Comparative SSR marker studies targeting Arabidopsis thaliana and Brassica species which lie in the same family will further aid in studying the salt tolerance related QTLs and subsequent identification of the "candidate genes" and finding out the origin of important QTLs. Although, there are a few reports on molecular breeding for improving salt tolerance using molecular markers in Brassica species, usage of SSR markers has a big potential to improve salt tolerance in Brassica crops. In order to obtain best harvests, role of SSR marker driven breeding approaches play important role and it has been discussed in this review especially for the introgression of salt tolerance traits in crops.

  11. Disruption of the mouse Necdin gene results in hypothalamic and behavioral alterations reminiscent of the human Prader-Willi syndrome.

    PubMed

    Muscatelli, F; Abrous, D N; Massacrier, A; Boccaccio, I; Le Moal, M; Cau, P; Cremer, H

    2000-12-12

    Prader-Willi syndrome (PWS) is a complex neurogenetic disorder with considerable clinical variability that is thought in large part to be the result of a hypothalamic defect. PWS results from the absence of paternal expression of imprinted genes localized in the 15q11-q13 region; however, none of the characterized genes has so far been shown to be involved in the etiology of PWS. Here, we provide a detailed investigation of a mouse model deficient for NECDIN: Linked to the mutation, a neonatal lethality of variable penetrance is observed. Viable NECDIN: mutants show a reduction in both oxytocin-producing and luteinizing hormone-releasing hormone (LHRH)-producing neurons in hypothalamus. This represents the first evidence of a hypothalamic deficiency in a mouse model of PWS. NECDIN:-deficient mice also display increased skin scraping activity in the open field test and improved spatial learning and memory in the Morris water maze. The latter features are reminiscent of the skin picking and improved spatial memory that are characteristics of the PWS phenotype. These striking parallels in hypothalamic structure, emotional and cognitive-related behaviors strongly suggest that NECDIN is responsible for at least a subset of the multiple clinical manifestations of PWS.

  12. Massive expression of germ cell-specific genes is a hallmark of cancer and a potential target for novel treatment development.

    PubMed

    Bruggeman, Jan Willem; Koster, Jan; Lodder, Paul; Repping, Sjoerd; Hamer, Geert

    2018-06-15

    Cancer cells have been found to frequently express genes that are normally restricted to the testis, often referred to as cancer/testis (CT) antigens or genes. Because germ cell-specific antigens are not recognized as "self" by the innate immune system, CT-genes have previously been suggested as ideal candidate targets for cancer therapy. The use of CT-genes in cancer therapy has thus far been unsuccessful, most likely because their identification has relied on gene expression in whole testis, including the testicular somatic cells, precluding the detection of true germ cell-specific genes. By comparing the transcriptomes of micro-dissected germ cell subtypes, representing the main developmental stages of human spermatogenesis, with the publicly accessible transcriptomes of 2617 samples from 49 different healthy somatic tissues and 9232 samples from 33 tumor types, we here discover hundreds of true germ cell-specific cancer expressed genes. Strikingly, we found these germ cell cancer genes (GC-genes) to be widely expressed in all analyzed tumors. Many GC-genes appeared to be involved in processes that are likely to actively promote tumor viability, proliferation and metastasis. Targeting these true GC-genes thus has the potential to inhibit tumor growth with infertility being the only possible side effect. Moreover, we identified a subset of GC-genes that are not expressed in spermatogonial stem cells. Targeting of this GC-gene subset is predicted to only lead to temporary infertility, as untargeted spermatogonial stem cells can recover spermatogenesis after treatment. Our GC-gene dataset enables improved understanding of tumor biology and provides multiple novel targets for cancer treatment.

  13. Monoamine Oxidase A (MAOA) Gene and Personality Traits from Late Adolescence through Early Adulthood: A Latent Variable Investigation

    PubMed Central

    Xu, Man K.; Gaysina, Darya; Tsonaka, Roula; Morin, Alexandre J. S.; Croudace, Tim J.; Barnett, Jennifer H.; Houwing-Duistermaat, Jeanine; Richards, Marcus; Jones, Peter B.

    2017-01-01

    Very few molecular genetic studies of personality traits have used longitudinal phenotypic data, therefore molecular basis for developmental change and stability of personality remains to be explored. We examined the role of the monoamine oxidase A gene (MAOA) on extraversion and neuroticism from adolescence to adulthood, using modern latent variable methods. A sample of 1,160 male and 1,180 female participants with complete genotyping data was drawn from a British national birth cohort, the MRC National Survey of Health and Development (NSHD). The predictor variable was based on a latent variable representing genetic variations of the MAOA gene measured by three SNPs (rs3788862, rs5906957, and rs979606). Latent phenotype variables were constructed using psychometric methods to represent cross-sectional and longitudinal phenotypes of extraversion and neuroticism measured at ages 16 and 26. In males, the MAOA genetic latent variable (AAG) was associated with lower extraversion score at age 16 (β = −0.167; CI: −0.289, −0.045; p = 0.007, FDRp = 0.042), as well as greater increase in extraversion score from 16 to 26 years (β = 0.197; CI: 0.067, 0.328; p = 0.003, FDRp = 0.036). No genetic association was found for neuroticism after adjustment for multiple testing. Although, we did not find statistically significant associations after multiple testing correction in females, this result needs to be interpreted with caution due to issues related to x-inactivation in females. The latent variable method is an effective way of modeling phenotype- and genetic-based variances and may therefore improve the methodology of molecular genetic studies of complex psychological traits. PMID:29075213

  14. Monoamine Oxidase A (MAOA) Gene and Personality Traits from Late Adolescence through Early Adulthood: A Latent Variable Investigation.

    PubMed

    Xu, Man K; Gaysina, Darya; Tsonaka, Roula; Morin, Alexandre J S; Croudace, Tim J; Barnett, Jennifer H; Houwing-Duistermaat, Jeanine; Richards, Marcus; Jones, Peter B

    2017-01-01

    Very few molecular genetic studies of personality traits have used longitudinal phenotypic data, therefore molecular basis for developmental change and stability of personality remains to be explored. We examined the role of the monoamine oxidase A gene ( MAOA ) on extraversion and neuroticism from adolescence to adulthood, using modern latent variable methods. A sample of 1,160 male and 1,180 female participants with complete genotyping data was drawn from a British national birth cohort, the MRC National Survey of Health and Development (NSHD). The predictor variable was based on a latent variable representing genetic variations of the MAOA gene measured by three SNPs (rs3788862, rs5906957, and rs979606). Latent phenotype variables were constructed using psychometric methods to represent cross-sectional and longitudinal phenotypes of extraversion and neuroticism measured at ages 16 and 26. In males, the MAOA genetic latent variable (AAG) was associated with lower extraversion score at age 16 (β = -0.167; CI: -0.289, -0.045; p = 0.007, FDRp = 0.042), as well as greater increase in extraversion score from 16 to 26 years (β = 0.197; CI: 0.067, 0.328; p = 0.003, FDRp = 0.036). No genetic association was found for neuroticism after adjustment for multiple testing. Although, we did not find statistically significant associations after multiple testing correction in females, this result needs to be interpreted with caution due to issues related to x-inactivation in females. The latent variable method is an effective way of modeling phenotype- and genetic-based variances and may therefore improve the methodology of molecular genetic studies of complex psychological traits.

  15. Application of DNA Machineries for the Barcode Patterned Detection of Genes or Proteins.

    PubMed

    Zhou, Zhixin; Luo, Guofeng; Wulf, Verena; Willner, Itamar

    2018-06-05

    The study introduces an analytical platform for the detection of genes or aptamer-ligand complexes by nucleic acid barcode patterns generated by DNA machineries. The DNA machineries consist of nucleic acid scaffolds that include specific recognition sites for the different genes or aptamer-ligand analytes. The binding of the analytes to the scaffolds initiate, in the presence of the nucleotide mixture, a cyclic polymerization/nicking machinery that yields displaced strands of variable lengths. The electrophoretic separation of the resulting strands provides barcode patterns for the specific detection of the different analytes. Mixtures of DNA machineries that yield, upon sensing of different genes (or aptamer ligands), one-, two-, or three-band barcode patterns are described. The combination of nucleic acid scaffolds acting, in the presence of polymerase/nicking enzyme and nucleotide mixture, as DNA machineries, that generate multiband barcode patterns provide an analytical platform for the detection of an individual gene out of many possible genes. The diversity of genes (or other analytes) that can be analyzed by the DNA machineries and the barcode patterned imaging is given by the Pascal's triangle. As a proof-of-concept, the detection of one of six genes, that is, TP53, Werner syndrome, Tay-Sachs normal gene, BRCA1, Tay-Sachs mutant gene, and cystic fibrosis disorder gene by six two-band barcode patterns is demonstrated. The advantages and limitations of the detection of analytes by polymerase/nicking DNA machineries that yield barcode patterns as imaging readout signals are discussed.

  16. Fast and robust group-wise eQTL mapping using sparse graphical models.

    PubMed

    Cheng, Wei; Shi, Yu; Zhang, Xiang; Wang, Wei

    2015-01-16

    Genome-wide expression quantitative trait loci (eQTL) studies have emerged as a powerful tool to understand the genetic basis of gene expression and complex traits. The traditional eQTL methods focus on testing the associations between individual single-nucleotide polymorphisms (SNPs) and gene expression traits. A major drawback of this approach is that it cannot model the joint effect of a set of SNPs on a set of genes, which may correspond to hidden biological pathways. We introduce a new approach to identify novel group-wise associations between sets of SNPs and sets of genes. Such associations are captured by hidden variables connecting SNPs and genes. Our model is a linear-Gaussian model and uses two types of hidden variables. One captures the set associations between SNPs and genes, and the other captures confounders. We develop an efficient optimization procedure which makes this approach suitable for large scale studies. Extensive experimental evaluations on both simulated and real datasets demonstrate that the proposed methods can effectively capture both individual and group-wise signals that cannot be identified by the state-of-the-art eQTL mapping methods. Considering group-wise associations significantly improves the accuracy of eQTL mapping, and the successful multi-layer regression model opens a new approach to understand how multiple SNPs interact with each other to jointly affect the expression level of a group of genes.

  17. Differential replication dynamics for large and small Vibrio chromosomes affect gene dosage, expression and location

    PubMed Central

    Dryselius, Rikard; Izutsu, Kaori; Honda, Takeshi; Iida, Tetsuya

    2008-01-01

    Background Replication of bacterial chromosomes increases copy numbers of genes located near origins of replication relative to genes located near termini. Such differential gene dosage depends on replication rate, doubling time and chromosome size. Although little explored, differential gene dosage may influence both gene expression and location. For vibrios, a diverse family of fast growing gammaproteobacteria, gene dosage may be particularly important as they harbor two chromosomes of different size. Results Here we examined replication dynamics and gene dosage effects for the separate chromosomes of three Vibrio species. We also investigated locations for specific gene types within the genome. The results showed consistently larger gene dosage differences for the large chromosome which also initiated replication long before the small. Accordingly, large chromosome gene expression levels were generally higher and showed an influence from gene dosage. This was reflected by a higher abundance of growth essential and growth contributing genes of which many locate near the origin of replication. In contrast, small chromosome gene expression levels were low and appeared independent of gene dosage. Also, species specific genes are highly abundant and an over-representation of genes involved in transcription could explain its gene dosage independent expression. Conclusion Here we establish a link between replication dynamics and differential gene dosage on one hand and gene expression levels and the location of specific gene types on the other. For vibrios, this relationship appears connected to a polarisation of genetic content between its chromosomes, which may both contribute to and be enhanced by an improved adaptive capacity. PMID:19032792

  18. Cellular and molecular mechanisms of HIV-1 integration targeting.

    PubMed

    Engelman, Alan N; Singh, Parmit K

    2018-07-01

    Integration is central to HIV-1 replication and helps mold the reservoir of cells that persists in AIDS patients. HIV-1 interacts with specific cellular factors to target integration to interior regions of transcriptionally active genes within gene-dense regions of chromatin. The viral capsid interacts with several proteins that are additionally implicated in virus nuclear import, including cleavage and polyadenylation specificity factor 6, to suppress integration into heterochromatin. The viral integrase protein interacts with transcriptional co-activator lens epithelium-derived growth factor p75 to principally position integration within gene bodies. The integrase additionally senses target DNA distortion and nucleotide sequence to help fine-tune the specific phosphodiester bonds that are cleaved at integration sites. Research into virus-host interactions that underlie HIV-1 integration targeting has aided the development of a novel class of integrase inhibitors and may help to improve the safety of viral-based gene therapy vectors.

  19. IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing

    PubMed Central

    Deonovic, Benjamin; Wang, Yunhao; Weirather, Jason; Wang, Xiu-Jie; Au, Kin Fai

    2017-01-01

    Abstract Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from laborious experiments or extra genome/family trio data. In addition, there is a lack of methods for gene isoform level ASE analysis. We developed a tool, IDP-ASE, for full ASE analysis. By innovative integration of Third Generation Sequencing (TGS) long reads with Second Generation Sequencing (SGS) short reads, the accuracy of haplotyping and ASE quantification at the gene and gene isoform level was greatly improved as demonstrated by the gold standard data GM12878 data and semi-simulation data. In addition to methodology development, applications of IDP-ASE to human embryonic stem cells and breast cancer cells indicate that the imbalance of ASE and non-uniformity of gene isoform ASE is widespread, including tumorigenesis relevant genes and pluripotency markers. These results show that gene isoform expression and allele-specific expression cooperate to provide high diversity and complexity of gene regulation and expression, highlighting the importance of studying ASE at the gene isoform level. Our study provides a robust bioinformatics solution to understand ASE using RNA sequencing data only. PMID:27899656

  20. Biotype-specific tcpA genes in Vibrio cholerae.

    PubMed

    Iredell, J R; Manning, P A

    1994-08-01

    The tcpA gene, encoding the structural subunit of the toxin-coregulated pilus, has been isolated from a variety of clinical isolates of Vibrio cholerae, and the nucleotide sequence determined. Strict biotype-specific conservation within both the coding and putative regulatory regions was observed, with important differences between the El Tor and classical biotypes. V. cholerae O139 Bengal strains appear to have El Tor-type tcpA genes. Environmental O1 and non-O1 isolates have sequences that bind an El Tor-specific tcpA DNA probe and that are weakly and variably amplified by tcpA-specific polymerase chain reaction primers, under conditions of reduced stringency. The data presented allow the selection of primer pairs to help distinguish between clinical and environmental isolates, and to distinguish El Tor (and Bengal) biotypes from classical biotypes of V. cholerae. While the role of TcpA in cholera vaccine preparations remains unclear, the data strongly suggest that TcpA-containing vaccines directed at O1 strains need include only the two forms of TcpA, and that such vaccines directed at (O139) Bengal strains should include the TcpA of El Tor biotype.

  1. Surface modification and endothelialization of biomaterials as potential scaffolds for vascular tissue engineering applications.

    PubMed

    Ren, Xiangkui; Feng, Yakai; Guo, Jintang; Wang, Haixia; Li, Qian; Yang, Jing; Hao, Xuefang; Lv, Juan; Ma, Nan; Li, Wenzhong

    2015-08-07

    Surface modification and endothelialization of vascular biomaterials are common approaches that are used to both resist the nonspecific adhesion of proteins and improve the hemocompatibility and long-term patency of artificial vascular grafts. Surface modification of vascular grafts using hydrophilic poly(ethylene glycol), zwitterionic polymers, heparin or other bioactive molecules can efficiently enhance hemocompatibility, and consequently prevent thrombosis on artificial vascular grafts. However, these modified surfaces may be excessively hydrophilic, which limits initial vascular endothelial cell adhesion and formation of a confluent endothelial lining. Therefore, the improvement of endothelialization on these grafts by chemical modification with specific peptides and genes is now arousing more and more interest. Several active peptides, such as RGD, CAG, REDV and YIGSR, can be specifically recognized by endothelial cells. Consequently, graft surfaces that are modified by these peptides can exhibit targeting selectivity for the adhesion of endothelial cells, and genes can be delivered by targeting carriers to specific tissues to enhance the promotion and regeneration of blood vessels. These methods could effectively accelerate selective endothelial cell recruitment and functional endothelialization. In this review, recent developments in the surface modification and endothelialization of biomaterials in vascular tissue engineering are summarized. Both gene engineering and targeting ligand immobilization are promising methods to improve the clinical outcome of artificial vascular grafts.

  2. Cell-type specific features of circular RNA expression.

    PubMed

    Salzman, Julia; Chen, Raymond E; Olsen, Mari N; Wang, Peter L; Brown, Patrick O

    2013-01-01

    Thousands of loci in the human and mouse genomes give rise to circular RNA transcripts; at many of these loci, the predominant RNA isoform is a circle. Using an improved computational approach for circular RNA identification, we found widespread circular RNA expression in Drosophila melanogaster and estimate that in humans, circular RNA may account for 1% as many molecules as poly(A) RNA. Analysis of data from the ENCODE consortium revealed that the repertoire of genes expressing circular RNA, the ratio of circular to linear transcripts for each gene, and even the pattern of splice isoforms of circular RNAs from each gene were cell-type specific. These results suggest that biogenesis of circular RNA is an integral, conserved, and regulated feature of the gene expression program.

  3. Preservation of Integron Types among Enterobacteriaceae Producing Extended-Spectrum β-Lactamases in a Spanish Hospital over a 15-Year Period (1988 to 2003)▿

    PubMed Central

    Machado, Elisabete; Ferreira, Joana; Novais, Ângela; Peixe, Luísa; Cantón, Rafael; Baquero, Fernando; Coque, Teresa M.

    2007-01-01

    The variable presence of integrons among extended-spectrum beta-lactamase (ESBL)-producing Enterobacteriaceae species (0 to 66%) is described. Association between blaESBL and integrons occurred when these are linked to specific ESBL-type genes (In60 bearing ISCR1 and blaCTX-M-9) or when ESBL genes were superimposed onto selected plasmids carrying integrons. Some integrons were identical to those found during decades worldwide, illustrating the preservation of the genetic elements carrying them. PMID:17404002

  4. How Does the Scientific Community Contribute to Gene Ontology?

    PubMed

    Lovering, Ruth C

    2017-01-01

    Collaborations between the scientific community and members of the Gene Ontology (GO) Consortium have led to an increase in the number and specificity of GO terms, as well as increasing the number of GO annotations. A variety of approaches have been taken to encourage research scientists to contribute to the GO, but the success of these approaches has been variable. This chapter reviews both the successes and failures of engaging the scientific community in GO development and annotation, as well as, providing motivation and advice to encourage individual researchers to contribute to GO.

  5. Selective DNA demethylation by fusion of TDG with a sequence-specific DNA-binding domain

    PubMed Central

    Gregory, David J.; Mikhaylova, Lyudmila; Fedulov, Alexey V.

    2012-01-01

    Our ability to selectively manipulate gene expression by epigenetic means is limited, as there is no approach for targeted reactivation of epigenetically silenced genes, in contrast to what is available for selective gene silencing. We aimed to develop a tool for selective transcriptional activation by DNA demethylation. Here we present evidence that direct targeting of thymine-DNA-glycosylase (TDG) to specific sequences in the DNA can result in local DNA demethylation at potential regulatory sequences and lead to enhanced gene induction. When TDG was fused to a well-characterized DNA-binding domain [the Rel-homology domain (RHD) of NFκB], we observed decreased DNA methylation and increased transcriptional response to unrelated stimulus of inducible nitric oxide synthase (NOS2). The effect was not seen for control genes lacking either RHD-binding sites or high levels of methylation, nor in control mock-transduced cells. Specific reactivation of epigenetically silenced genes may thus be achievable by this approach, which provides a broadly useful strategy to further our exploration of biological mechanisms and to improve control over the epigenome. PMID:22419066

  6. Recombinant Rp1 genes confer necrotic or nonspecific resistance phenotypes.

    PubMed

    Smith, Shavannor M; Steinau, Martin; Trick, Harold N; Hulbert, Scot H

    2010-06-01

    Genes at the Rp1 rust resistance locus of maize confer race-specific resistance to the common rust fungus Puccinia sorghi. Three variant genes with nonspecific effects (HRp1 -Kr1N, -D*21 and -MD*19) were found to be generated by intragenic crossing over within the LRR region. The LRR region of most NBS-LRR encoding genes is quite variable and codes for one of the regions in resistance gene proteins that controls specificity. Sequence comparisons demonstrated that the Rp1-Kr1N recombinant gene was identical to the N-terminus of the rp1-kp2 gene and C-terminus of another gene from its HRp1-K grandparent. The Rp1-D*21 recombinant gene consists of the N-terminus of the rp1-dp2 gene and C-terminus of the Rp1-D gene from the parental haplotype. Similarly, a recombinant gene from the Rp1-MD*19 haplotype has the N-terminus of an rp1 gene from the HRp1-M parent and C-terminus of the rp1-D19 gene from the HRp1-D parent. The recombinant Rp1 -Kr1N, -D*21 and -MD*19 genes activated defense responses in the absence of their AVR proteins triggering HR (hypersensitive response) in the absence of the pathogen. The results indicate that the frequent intragenic recombination events that occur in the Rp1 gene cluster not only recombine the genes into novel haplotypes, but also create genes with nonspecific effects. Some of these may contribute to nonspecific quantitative resistance but others have severe consequences for the fitness of the plant.

  7. Production of individualized V gene databases reveals high levels of immunoglobulin genetic diversity

    NASA Astrophysics Data System (ADS)

    Corcoran, Martin M.; Phad, Ganesh E.; Bernat, Néstor Vázquez; Stahl-Hennig, Christiane; Sumida, Noriyuki; Persson, Mats A. A.; Martin, Marcel; Hedestam, Gunilla B. Karlsson

    2016-12-01

    Comprehensive knowledge of immunoglobulin genetics is required to advance our understanding of B cell biology. Validated immunoglobulin variable (V) gene databases are close to completion only for human and mouse. We present a novel computational approach, IgDiscover, that identifies germline V genes from expressed repertoires to a specificity of 100%. IgDiscover uses a cluster identification process to produce candidate sequences that, once filtered, results in individualized germline V gene databases. IgDiscover was tested in multiple species, validated by genomic cloning and cross library comparisons and produces comprehensive gene databases even where limited genomic sequence is available. IgDiscover analysis of the allelic content of the Indian and Chinese-origin rhesus macaques reveals high levels of immunoglobulin gene diversity in this species. Further, we describe a novel human IGHV3-21 allele and confirm significant gene differences between Balb/c and C57BL6 mouse strains, demonstrating the power of IgDiscover as a germline V gene discovery tool.

  8. Production of individualized V gene databases reveals high levels of immunoglobulin genetic diversity

    PubMed Central

    Corcoran, Martin M.; Phad, Ganesh E.; Bernat, Néstor Vázquez; Stahl-Hennig, Christiane; Sumida, Noriyuki; Persson, Mats A.A.; Martin, Marcel; Hedestam, Gunilla B. Karlsson

    2016-01-01

    Comprehensive knowledge of immunoglobulin genetics is required to advance our understanding of B cell biology. Validated immunoglobulin variable (V) gene databases are close to completion only for human and mouse. We present a novel computational approach, IgDiscover, that identifies germline V genes from expressed repertoires to a specificity of 100%. IgDiscover uses a cluster identification process to produce candidate sequences that, once filtered, results in individualized germline V gene databases. IgDiscover was tested in multiple species, validated by genomic cloning and cross library comparisons and produces comprehensive gene databases even where limited genomic sequence is available. IgDiscover analysis of the allelic content of the Indian and Chinese-origin rhesus macaques reveals high levels of immunoglobulin gene diversity in this species. Further, we describe a novel human IGHV3-21 allele and confirm significant gene differences between Balb/c and C57BL6 mouse strains, demonstrating the power of IgDiscover as a germline V gene discovery tool. PMID:27995928

  9. Influence of smoking status and intensity on discovery of blood pressure loci through gene-smoking interactions

    PubMed Central

    Fuentes, Lisa de las; Schwander, Karen; Cupples, L. Adrienne; Rao, D. C.

    2015-01-01

    Background Genetic variation accounts for approximately 30% of blood pressure (BP) variability but most of that variability hasn't been attributed to specific variants. Interactions between genes and BP-associated factors may explain some ‘missing heritability.’ Cigarette smoking increases BP after short-term exposure and decreases BP with longer exposure. Gene-smoking interactions have discovered novel BP loci, but the contribution of smoking status and intensity to gene discovery is unknown. Methods We analyzed gene-smoking intensity interactions for association with systolic BP (SBP) in three subgroups from the Framingham Heart Study: current smokers only (N = 1,057), current and former smokers (‘ever smokers’, N = 3,374), and all subjects (N = 6,710). We used three smoking intensity variables defined at cutoffs of 10, 15, and 20 cigarettes per day (CPD). We evaluated the 1 degree-of-freedom (df) interaction and 2df joint test using generalized estimating equations. Results Analysis of current smokers using a CPD cutoff of 10 produced two loci associated with SBP. The rs9399633 minor allele was associated with increased SBP (5 mmHg) in heavy smokers (CPD>10) but decreased SBP (7 mmHg) in light smokers (CPD≤10). The rs11717948 minor allele was associated with decreased SBP (8 mmHg) in light smokers but decreased SBP (2 mmHg) in heavy smokers. Across all nine analyses, 19 additional loci reached p < 1×10−6. Discussion Analysis of current smokers may have the highest power to detect gene-smoking interactions, despite the reduced sample size. Associations of loci near SASH1 and KLHL6/KLHL24 with SBP may be modulated by tobacco smoking. PMID:25940791

  10. Influence of Smoking Status and Intensity on Discovery of Blood Pressure Loci Through Gene-Smoking Interactions.

    PubMed

    Basson, Jacob; Sung, Yun Ju; Fuentes, Lisa de Las; Schwander, Karen; Cupples, L Adrienne; Rao, D C

    2015-09-01

    Genetic variation accounts for approximately 30% of blood pressure (BP) variability but most of that variability has not been attributed to specific variants. Interactions between genes and BP-associated factors may explain some "missing heritability." Cigarette smoking increases BP after short-term exposure and decreases BP with longer exposure. Gene-smoking interactions have discovered novel BP loci, but the contribution of smoking status and intensity to gene discovery is unknown. We analyzed gene-smoking intensity interactions for association with systolic BP (SBP) in three subgroups from the Framingham Heart Study: current smokers only (N = 1,057), current and former smokers ("ever smokers," N = 3,374), and all subjects (N = 6,710). We used three smoking intensity variables defined at cutoffs of 10, 15, and 20 cigarettes per day (CPD). We evaluated the 1 degree-of-freedom (df) interaction and 2df joint test using generalized estimating equations. Analysis of current smokers using a CPD cutoff of 10 produced two loci associated with SBP. The rs9399633 minor allele was associated with increased SBP (5 mmHg) in heavy smokers (CPD > 10) but decreased SBP (7 mmHg) in light smokers (CPD ≤ 10). The rs11717948 minor allele was associated with decreased SBP (8 mmHg) in light smokers but decreased SBP (2 mmHg) in heavy smokers. Across all nine analyses, 19 additional loci reached P < 1 × 10(-6). Analysis of current smokers may have the highest power to detect gene-smoking interactions, despite the reduced sample size. Associations of loci near SASH1 and KLHL6/KLHL24 with SBP may be modulated by tobacco smoking. © 2015 WILEY PERIODICALS, INC.

  11. Transcripts and MicroRNAs Responding to Salt Stress in Musa acuminata Colla (AAA Group) cv. Berangan Roots

    PubMed Central

    Lee, Wan Sin; Gudimella, Ranganath; Wong, Gwo Rong; Tammi, Martti Tapani; Khalid, Norzulaani; Harikrishna, Jennifer Ann

    2015-01-01

    Physiological responses to stress are controlled by expression of a large number of genes, many of which are regulated by microRNAs. Since most banana cultivars are salt-sensitive, improved understanding of genetic regulation of salt induced stress responses in banana can support future crop management and improvement in the face of increasing soil salinity related to irrigation and climate change. In this study we focused on determining miRNA and their targets that respond to NaCl exposure and used transcriptome sequencing of RNA and small RNA from control and NaCl-treated banana roots to assemble a cultivar-specific reference transcriptome and identify orthologous and Musa-specific miRNA responding to salinity. We observed that, banana roots responded to salinity stress with changes in expression for a large number of genes (9.5% of 31,390 expressed unigenes) and reduction in levels of many miRNA, including several novel miRNA and banana-specific miRNA-target pairs. Banana roots expressed a unique set of orthologous and Musa-specific miRNAs of which 59 respond to salt stress in a dose-dependent manner. Gene expression patterns of miRNA compared with those of their predicted mRNA targets indicated that a majority of the differentially expressed miRNAs were down-regulated in response to increased salinity, allowing increased expression of targets involved in diverse biological processes including stress signaling, stress defence, transport, cellular homeostasis, metabolism and other stress-related functions. This study may contribute to the understanding of gene regulation and abiotic stress response of roots and the high-throughput sequencing data sets generated may serve as important resources related to salt tolerance traits for functional genomic studies and genetic improvement in banana. PMID:25993649

  12. Transcripts and MicroRNAs Responding to Salt Stress in Musa acuminata Colla (AAA Group) cv. Berangan Roots.

    PubMed

    Lee, Wan Sin; Gudimella, Ranganath; Wong, Gwo Rong; Tammi, Martti Tapani; Khalid, Norzulaani; Harikrishna, Jennifer Ann

    2015-01-01

    Physiological responses to stress are controlled by expression of a large number of genes, many of which are regulated by microRNAs. Since most banana cultivars are salt-sensitive, improved understanding of genetic regulation of salt induced stress responses in banana can support future crop management and improvement in the face of increasing soil salinity related to irrigation and climate change. In this study we focused on determining miRNA and their targets that respond to NaCl exposure and used transcriptome sequencing of RNA and small RNA from control and NaCl-treated banana roots to assemble a cultivar-specific reference transcriptome and identify orthologous and Musa-specific miRNA responding to salinity. We observed that, banana roots responded to salinity stress with changes in expression for a large number of genes (9.5% of 31,390 expressed unigenes) and reduction in levels of many miRNA, including several novel miRNA and banana-specific miRNA-target pairs. Banana roots expressed a unique set of orthologous and Musa-specific miRNAs of which 59 respond to salt stress in a dose-dependent manner. Gene expression patterns of miRNA compared with those of their predicted mRNA targets indicated that a majority of the differentially expressed miRNAs were down-regulated in response to increased salinity, allowing increased expression of targets involved in diverse biological processes including stress signaling, stress defence, transport, cellular homeostasis, metabolism and other stress-related functions. This study may contribute to the understanding of gene regulation and abiotic stress response of roots and the high-throughput sequencing data sets generated may serve as important resources related to salt tolerance traits for functional genomic studies and genetic improvement in banana.

  13. Detecting regulatory gene-environment interactions with unmeasured environmental factors.

    PubMed

    Fusi, Nicoló; Lippert, Christoph; Borgwardt, Karsten; Lawrence, Neil D; Stegle, Oliver

    2013-06-01

    Genomic studies have revealed a substantial heritable component of the transcriptional state of the cell. To fully understand the genetic regulation of gene expression variability, it is important to study the effect of genotype in the context of external factors such as alternative environmental conditions. In model systems, explicit environmental perturbations have been considered for this purpose, allowing to directly test for environment-specific genetic effects. However, such experiments are limited to species that can be profiled in controlled environments, hampering their use in important systems such as human. Moreover, even in seemingly tightly regulated experimental conditions, subtle environmental perturbations cannot be ruled out, and hence unknown environmental influences are frequent. Here, we propose a model-based approach to simultaneously infer unmeasured environmental factors from gene expression profiles and use them in genetic analyses, identifying environment-specific associations between polymorphic loci and individual gene expression traits. In extensive simulation studies, we show that our method is able to accurately reconstruct environmental factors and their interactions with genotype in a variety of settings. We further illustrate the use of our model in a real-world dataset in which one environmental factor has been explicitly experimentally controlled. Our method is able to accurately reconstruct the true underlying environmental factor even if it is not given as an input, allowing to detect genuine genotype-environment interactions. In addition to the known environmental factor, we find unmeasured factors involved in novel genotype-environment interactions. Our results suggest that interactions with both known and unknown environmental factors significantly contribute to gene expression variability. and implementation: Software available at http://pmbio.github.io/envGPLVM/. Supplementary data are available at Bioinformatics online.

  14. HIV promoter integration site primarily modulates transcriptional burst size rather than frequency.

    PubMed

    Skupsky, Ron; Burnett, John C; Foley, Jonathan E; Schaffer, David V; Arkin, Adam P

    2010-09-30

    Mammalian gene expression patterns, and their variability across populations of cells, are regulated by factors specific to each gene in concert with its surrounding cellular and genomic environment. Lentiviruses such as HIV integrate their genomes into semi-random genomic locations in the cells they infect, and the resulting viral gene expression provides a natural system to dissect the contributions of genomic environment to transcriptional regulation. Previously, we showed that expression heterogeneity and its modulation by specific host factors at HIV integration sites are key determinants of infected-cell fate and a possible source of latent infections. Here, we assess the integration context dependence of expression heterogeneity from diverse single integrations of a HIV-promoter/GFP-reporter cassette in Jurkat T-cells. Systematically fitting a stochastic model of gene expression to our data reveals an underlying transcriptional dynamic, by which multiple transcripts are produced during short, infrequent bursts, that quantitatively accounts for the wide, highly skewed protein expression distributions observed in each of our clonal cell populations. Interestingly, we find that the size of transcriptional bursts is the primary systematic covariate over integration sites, varying from a few to tens of transcripts across integration sites, and correlating well with mean expression. In contrast, burst frequencies are scattered about a typical value of several per cell-division time and demonstrate little correlation with the clonal means. This pattern of modulation generates consistently noisy distributions over the sampled integration positions, with large expression variability relative to the mean maintained even for the most productive integrations, and could contribute to specifying heterogeneous, integration-site-dependent viral production patterns in HIV-infected cells. Genomic environment thus emerges as a significant control parameter for gene expression variation that may contribute to structuring mammalian genomes, as well as be exploited for survival by integrating viruses.

  15. Quantification of Human Fecal Bifidobacterium Species by Use of Quantitative Real-Time PCR Analysis Targeting the groEL Gene

    PubMed Central

    Junick, Jana

    2012-01-01

    Quantitative real-time PCR assays targeting the groEL gene for the specific enumeration of 12 human fecal Bifidobacterium species were developed. The housekeeping gene groEL (HSP60 in eukaryotes) was used as a discriminative marker for the differentiation of Bifidobacterium adolescentis, B. angulatum, B. animalis, B. bifidum, B. breve, B. catenulatum, B. dentium, B. gallicum, B. longum, B. pseudocatenulatum, B. pseudolongum, and B. thermophilum. The bifidobacterial chromosome contains a single copy of the groEL gene, allowing the determination of the cell number by quantification of the groEL copy number. Real-time PCR assays were validated by comparing fecal samples spiked with known numbers of a given Bifidobacterium species. Independent of the Bifidobacterium species tested, the proportion of groEL copies recovered from fecal samples spiked with 5 to 9 log10 cells/g feces was approximately 50%. The quantification limit was 5 to 6 log10 groEL copies/g feces. The interassay variability was less than 10%, and variability between different DNA extractions was less than 23%. The method developed was applied to fecal samples from healthy adults and full-term breast-fed infants. Bifidobacterial diversity in both adults and infants was low, with mostly ≤3 Bifidobacterium species and B. longum frequently detected. The predominant species in infant and adult fecal samples were B. breve and B. adolescentis, respectively. It was possible to distinguish B. catenulatum and B. pseudocatenulatum. We conclude that the groEL gene is a suitable molecular marker for the specific and accurate quantification of human fecal Bifidobacterium species by real-time PCR. PMID:22307308

  16. Computational Selection of Transcriptomics Experiments Improves Guilt-by-Association Analyses

    PubMed Central

    Bhat, Prajwal; Yang, Haixuan; Bögre, László; Devoto, Alessandra; Paccanaro, Alberto

    2012-01-01

    The Guilt-by-Association (GBA) principle, according to which genes with similar expression profiles are functionally associated, is widely applied for functional analyses using large heterogeneous collections of transcriptomics data. However, the use of such large collections could hamper GBA functional analysis for genes whose expression is condition specific. In these cases a smaller set of condition related experiments should instead be used, but identifying such functionally relevant experiments from large collections based on literature knowledge alone is an impractical task. We begin this paper by analyzing, both from a mathematical and a biological point of view, why only condition specific experiments should be used in GBA functional analysis. We are able to show that this phenomenon is independent of the functional categorization scheme and of the organisms being analyzed. We then present a semi-supervised algorithm that can select functionally relevant experiments from large collections of transcriptomics experiments. Our algorithm is able to select experiments relevant to a given GO term, MIPS FunCat term or even KEGG pathways. We extensively test our algorithm on large dataset collections for yeast and Arabidopsis. We demonstrate that: using the selected experiments there is a statistically significant improvement in correlation between genes in the functional category of interest; the selected experiments improve GBA-based gene function prediction; the effectiveness of the selected experiments increases with annotation specificity; our algorithm can be successfully applied to GBA-based pathway reconstruction. Importantly, the set of experiments selected by the algorithm reflects the existing literature knowledge about the experiments. [A MATLAB implementation of the algorithm and all the data used in this paper can be downloaded from the paper website: http://www.paccanarolab.org/papers/CorrGene/]. PMID:22879875

  17. A Century of Shope Papillomavirus in Museum Rabbit Specimens

    PubMed Central

    Escudero Duch, Clara; Williams, Richard A. J.; Timm, Robert M.; Perez-Tris, Javier; Benitez, Laura

    2015-01-01

    Sylvilagus floridanus Papillomavirus (SfPV) causes growth of large horn-like tumors on rabbits. SfPV was described in cottontail rabbits (probably Sylvilagus floridanus) from Kansas and Iowa by Richard Shope in 1933, and detected in S. audubonii in 2011. It is known almost exclusively from the US Midwest. We explored the University of Kansas Natural History Museum for historical museum specimens infected with SfPV, using molecular techniques, to assess if additional wild species host SfPV, and whether SfPV occurs throughout the host range, or just in the Midwest. Secondary aims were to detect distinct strains, and evidence for strain spatio-temporal specificity. We found 20 of 1395 rabbits in the KU collection SfPV symptomatic. Three of 17 lagomorph species (S. nuttallii, and the two known hosts) were symptomatic, while Brachylagus, Lepus and eight additional Sylvilagus species were not. 13 symptomatic individuals were positive by molecular testing, including the first S. nuttallii detection. Prevalence of symptomatic individuals was significantly higher in Sylvilagus (1.8%) than Lepus. Half of these specimens came from Kansas, though new molecular detections were obtained from Jalisco—Mexico’s first—and Nebraska, Nevada, New Mexico, and Texas, USA. We document the oldest lab-confirmed case (Kansas, 1915), pre-dating Shope’s first case. SfPV amplification was possible from 63.2% of symptomatic museum specimens. Using multiple methodologies, rolling circle amplification and, multiple isothermal displacement amplification in addition to PCR, greatly improved detection rates. Short sequences were obtained from six individuals for two genes. L1 gene sequences were identical to all previously detected sequences; E7 gene sequences, were more variable, yielding five distinct SfPV1 strains that differing by less than 2% from strains circulating in the Midwest and Mexico, between 1915 and 2005. Our results do not clarify whether strains are host species specific, though they are consistent with SfPV specificity to genus Sylvilagus. PMID:26147570

  18. Improving wood properties for wood utilization through multi-omics integration in lignin biosynthesis

    DOE PAGES

    Wang, Jack P.; Matthews, Megan L.; Williams, Cranos M.; ...

    2018-04-20

    A multi-omics quantitative integrative analysis of lignin biosynthesis can advance the strategic engineering of wood for timber, pulp, and biofuels. Lignin is polymerized from three monomers (monolignols) produced by a grid-like pathway. The pathway in wood formation of Populus trichocarpa has at least 21 genes, encoding enzymes that mediate 37 reactions on 24 metabolites, leading to lignin and affecting wood properties. We perturb these 21 pathway genes and integrate transcriptomic, proteomic, fluxomic and phenomic data from 221 lines selected from ~2000 transgenics (6-month-old). The integrative analysis estimates how changing expression of pathway gene or gene combination affects protein abundance, metabolic-flux,more » metabolite concentrations, and 25 wood traits, including lignin, tree-growth, density, strength, and saccharification. The analysis then predicts improvements in any of these 25 traits individually or in combinations, through engineering expression of specific monolignol genes. The analysis may lead to greater understanding of other pathways for improved growth and adaptation.« less

  19. Improving wood properties for wood utilization through multi-omics integration in lignin biosynthesis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Jack P.; Matthews, Megan L.; Williams, Cranos M.

    A multi-omics quantitative integrative analysis of lignin biosynthesis can advance the strategic engineering of wood for timber, pulp, and biofuels. Lignin is polymerized from three monomers (monolignols) produced by a grid-like pathway. The pathway in wood formation of Populus trichocarpa has at least 21 genes, encoding enzymes that mediate 37 reactions on 24 metabolites, leading to lignin and affecting wood properties. We perturb these 21 pathway genes and integrate transcriptomic, proteomic, fluxomic and phenomic data from 221 lines selected from ~2000 transgenics (6-month-old). The integrative analysis estimates how changing expression of pathway gene or gene combination affects protein abundance, metabolic-flux,more » metabolite concentrations, and 25 wood traits, including lignin, tree-growth, density, strength, and saccharification. The analysis then predicts improvements in any of these 25 traits individually or in combinations, through engineering expression of specific monolignol genes. The analysis may lead to greater understanding of other pathways for improved growth and adaptation.« less

  20. Improving wood properties for wood utilization through multi-omics integration in lignin biosynthesis.

    PubMed

    Wang, Jack P; Matthews, Megan L; Williams, Cranos M; Shi, Rui; Yang, Chenmin; Tunlaya-Anukit, Sermsawat; Chen, Hsi-Chuan; Li, Quanzi; Liu, Jie; Lin, Chien-Yuan; Naik, Punith; Sun, Ying-Hsuan; Loziuk, Philip L; Yeh, Ting-Feng; Kim, Hoon; Gjersing, Erica; Shollenberger, Todd; Shuford, Christopher M; Song, Jina; Miller, Zachary; Huang, Yung-Yun; Edmunds, Charles W; Liu, Baoguang; Sun, Yi; Lin, Ying-Chung Jimmy; Li, Wei; Chen, Hao; Peszlen, Ilona; Ducoste, Joel J; Ralph, John; Chang, Hou-Min; Muddiman, David C; Davis, Mark F; Smith, Chris; Isik, Fikret; Sederoff, Ronald; Chiang, Vincent L

    2018-04-20

    A multi-omics quantitative integrative analysis of lignin biosynthesis can advance the strategic engineering of wood for timber, pulp, and biofuels. Lignin is polymerized from three monomers (monolignols) produced by a grid-like pathway. The pathway in wood formation of Populus trichocarpa has at least 21 genes, encoding enzymes that mediate 37 reactions on 24 metabolites, leading to lignin and affecting wood properties. We perturb these 21 pathway genes and integrate transcriptomic, proteomic, fluxomic and phenomic data from 221 lines selected from ~2000 transgenics (6-month-old). The integrative analysis estimates how changing expression of pathway gene or gene combination affects protein abundance, metabolic-flux, metabolite concentrations, and 25 wood traits, including lignin, tree-growth, density, strength, and saccharification. The analysis then predicts improvements in any of these 25 traits individually or in combinations, through engineering expression of specific monolignol genes. The analysis may lead to greater understanding of other pathways for improved growth and adaptation.

  1. Detection, Characterization, and Typing of Shiga Toxin-Producing Escherichia coli.

    PubMed

    Parsons, Brendon D; Zelyas, Nathan; Berenger, Byron M; Chui, Linda

    2016-01-01

    Shiga toxin-producing Escherichia coli (STEC) are responsible for gastrointestinal diseases reported in numerous outbreaks around the world. Given the public health importance of STEC, effective detection, characterization and typing is critical to any medical laboratory system. While non-O157 serotypes account for the majority of STEC infections, frontline microbiology laboratories may only screen for STEC using O157-specific agar-based methods. As a result, non-O157 STEC infections are significantly under-reported. This review discusses recent advances on the detection, characterization and typing of STEC with emphasis on work performed at the Alberta Provincial Laboratory for Public Health (ProvLab). Candidates for the detection of all STEC serotypes include chromogenic agars, enzyme immunoassays (EIA) and quantitative real time polymerase chain reaction (qPCR). Culture methods allow further characterization of isolates, whereas qPCR provides the greatest sensitivity and specificity, followed by EIA. The virulence gene profiles using PCR arrays and stx gene subtypes can subsequently be determined. Different non-O157 serotypes exhibit markedly different virulence gene profiles and a greater prevalence of stx1 than stx2 subtypes compared to O157:H7 isolates. Finally, recent innovations in whole genome sequencing (WGS) have allowed it to emerge as a candidate for the characterization and typing of STEC in diagnostic surveillance isolates. Methods of whole genome analysis such as single nucleotide polymorphisms and k-mer analysis are concordant with epidemiological data and standard typing methods, such as pulsed-field gel electrophoresis and multiple-locus variable number tandem repeat analysis while offering additional strain differentiation. Together these findings highlight improved strategies for STEC detection using currently available systems and the development of novel approaches for future surveillance.

  2. Detection, Characterization, and Typing of Shiga Toxin-Producing Escherichia coli

    PubMed Central

    Parsons, Brendon D.; Zelyas, Nathan; Berenger, Byron M.; Chui, Linda

    2016-01-01

    Shiga toxin-producing Escherichia coli (STEC) are responsible for gastrointestinal diseases reported in numerous outbreaks around the world. Given the public health importance of STEC, effective detection, characterization and typing is critical to any medical laboratory system. While non-O157 serotypes account for the majority of STEC infections, frontline microbiology laboratories may only screen for STEC using O157-specific agar-based methods. As a result, non-O157 STEC infections are significantly under-reported. This review discusses recent advances on the detection, characterization and typing of STEC with emphasis on work performed at the Alberta Provincial Laboratory for Public Health (ProvLab). Candidates for the detection of all STEC serotypes include chromogenic agars, enzyme immunoassays (EIA) and quantitative real time polymerase chain reaction (qPCR). Culture methods allow further characterization of isolates, whereas qPCR provides the greatest sensitivity and specificity, followed by EIA. The virulence gene profiles using PCR arrays and stx gene subtypes can subsequently be determined. Different non-O157 serotypes exhibit markedly different virulence gene profiles and a greater prevalence of stx1 than stx2 subtypes compared to O157:H7 isolates. Finally, recent innovations in whole genome sequencing (WGS) have allowed it to emerge as a candidate for the characterization and typing of STEC in diagnostic surveillance isolates. Methods of whole genome analysis such as single nucleotide polymorphisms and k-mer analysis are concordant with epidemiological data and standard typing methods, such as pulsed-field gel electrophoresis and multiple-locus variable number tandem repeat analysis while offering additional strain differentiation. Together these findings highlight improved strategies for STEC detection using currently available systems and the development of novel approaches for future surveillance. PMID:27148176

  3. Computational correction of copy number effect improves specificity of CRISPR-Cas9 essentiality screens in cancer cells. | Office of Cancer Genomics

    Cancer.gov

    The CRISPR-Cas9 system has revolutionized gene editing both at single genes and in multiplexed loss-of-function screens, thus enabling precise genome-scale identification of genes essential for proliferation and survival of cancer cells. However, previous studies have reported that a gene-independent antiproliferative effect of Cas9-mediated DNA cleavage confounds such measurement of genetic dependency, thereby leading to false-positive results in copy number-amplified regions.

  4. Polymorphisms in the Wilms Tumor Gene Are Associated With Interindividual Variations in Rubella Virus-Specific Cellular Immunity After Measles-Mumps-Rubella II Vaccination.

    PubMed

    Voigt, Emily A; Haralambieva, Iana H; Larrabee, Beth L; Kennedy, Richard B; Ovsyannikova, Inna G; Schaid, Daniel J; Poland, Gregory A

    2018-01-30

    Rubella vaccination induces widely variable immune responses in vaccine recipients. While rubella vaccination is effective at inducing immunity to rubella infection in most subjects, up to 5% of individuals do not achieve or maintain long-term protective immunity. To expand upon our previous work identifying genetic polymorphisms that are associated with these interindividual differences in humoral immunity to rubella virus, we performed a genome-wide association study in a large cohort of 1843 subjects to discover single-nucleotide polymorphisms (SNPs) associated with rubella virus-specific cellular immune responses. We identified SNPs in the Wilms tumor protein gene (WT1) that were significantly associated (P < 5 × 10-8) with interindividual variations in rubella-specific interleukin 6 secretion from subjects' peripheral blood mononuclear cells postvaccination. No SNPs were found to be significantly associated with variations in rubella-specific interferon-γ secretion. Our findings demonstrate that genetic polymorphisms in the WT1 gene in subjects of European ancestry are associated with interindividual differences in rubella virus-specific cellular immunity after measles-mumps-rubella II vaccination. © The Author(s) 2017. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail: journals.permissions@oup.com.

  5. Microbial strain improvement for organosulfur removal from coal. Final technical report, 1 September, 1992--31 August, 1993

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kilbane, J.J. II

    1993-12-31

    IGT has developed a microbial culture of Rhodococcus rhodochrous, designated as IGTS8, that is capable of specifically cleaving carbon-sulfur bonds in a range of organosulfur model compounds and is capable of removing organic sulfur from coal and petroleum without significantly sacrificing the calorific value of the fuel. Although IGTS8 possesses the ability to specifically remove organic sulfur from coal, a major research need is to develop improved strains of microorganisms that possess higher levels of desulfurization activity and therefore will permit more favorable biodesulfurization process conditions: faster rates, more complete removal, and smaller reactor size. Strain improvement is the singlemore » most important aspect to the development of a practical coal biodesulfurization process and accordingly is the focus of research in this project. During the past year, significant progress was made toward improving the biodesulfurization capabilities of Rhodococcus Rhodochrous IGTS8. The main objective was to identify and characterize strong promoters of IGTS8. The DNA sequencing of the promoter region and chloramphenicol resistance gene of pRF2, as well as six mutant promoters, was determined. The 16S structural gene of IGTS8 was isolated and used to identify the putative promoter of this gene. Four promoter probe vectors were constructed and are currently being used to analyze the strength of Rhodococcus promoters: from the IGTS8 genome, mutants of promoters from the chloramphenicol resistance gene of pRF2, the promoter from the 16S RNA gene, and various strong inducible promoters.« less

  6. Multiple productive immunoglobulin heavy chain gene rearrangements in chronic lymphocytic leukemia are mostly derived from independent clones

    PubMed Central

    Plevova, Karla; Francova, Hana Skuhrova; Burckova, Katerina; Brychtova, Yvona; Doubek, Michael; Pavlova, Sarka; Malcikova, Jitka; Mayer, Jiri; Tichy, Boris; Pospisilova, Sarka

    2014-01-01

    In chronic lymphocytic leukemia, usually a monoclonal disease, multiple productive immunoglobulin heavy chain gene rearrangements are identified sporadically. Prognostication of such cases based on immunoglobulin heavy variable gene mutational status can be problematic, especially if the different rearrangements have discordant mutational status. To gain insight into the possible biological mechanisms underlying the origin of the multiple rearrangements, we performed a comprehensive immunogenetic and immunophenotypic characterization of 31 cases with the multiple rearrangements identified in a cohort of 1147 patients with chronic lymphocytic leukemia. For the majority of cases (25/31), we provide evidence of the co-existence of at least two B lymphocyte clones with a chronic lymphocytic leukemia phenotype. We also identified clonal drifts in serial samples, likely driven by selection forces. More specifically, higher immunoglobulin variable gene identity to germline and longer complementarity determining region 3 were preferred in persistent or newly appearing clones, a phenomenon more pronounced in patients with stereotyped B-cell receptors. Finally, we report that other factors, such as TP53 gene defects and therapy administration, influence clonal selection. Our findings are relevant to clonal evolution in the context of antigen stimulation and transition of monoclonal B-cell lymphocytosis to chronic lymphocytic leukemia. PMID:24038023

  7. Metatranscriptome sequence analysis reveals diel periodicity of microbial community gene expression in the ocean's interior

    NASA Astrophysics Data System (ADS)

    Vislova, A.; Aylward, F.; Sosa, O.; DeLong, E.

    2016-02-01

    Previous work has revealed diel periodicity of gene expression in key metabolic pathways in both autotrophic and heterotrophic microbes in the surface ocean. In this study, we investigated patterns of diel periodicity of gene expression in depth profiles (25, 75, 125 and 250 meters). We postulated that microbial diel transcriptional signals would be increasingly dampened with depth, and that the timing of peak expression of specific transcripts would be shifted in time between depths, in accordance with depth-dependent diel light variability. Bacterioplankton were sampled from four depths every four hours at station ALOHA (22° 45' N 158° W) over 2 days. RNA was extracted from cells preserved on filters, converted to cDNA, and sequenced on the Illumina platform. Surprisingly, harmonic regression analysis revealed an increasing proportion of genes with diel periodic expression patterns with increasing depth between 25- 125 meters. At 250 meters, the proportion of genes exhibiting diel expression patterns decreased an order of magnitude compared to the photic zone. Community composition, functional gene categories, and diel patterns of gene expression were significantly different between the photic zone and 250 meter samples. The signals driving diel periodic gene expression in microbes at 250 meters is under further investigation. These data are now beginning provide a better understanding of the tempo and mode of microbial dynamics among specific taxa, throughout the ocean's interior.

  8. Predictive models for Escherichia coli concentrations at inland lake beaches and relationship of model variables to pathogen detection

    USGS Publications Warehouse

    Francy, Donna S.; Stelzer, Erin A.; Duris, Joseph W.; Brady, Amie M.G.; Harrison, John H.; Johnson, Heather E.; Ware, Michael W.

    2013-01-01

    Predictive models, based on environmental and water quality variables, have been used to improve the timeliness and accuracy of recreational water quality assessments, but their effectiveness has not been studied in inland waters. Sampling at eight inland recreational lakes in Ohio was done in order to investigate using predictive models for Escherichia coli and to understand the links between E. coli concentrations, predictive variables, and pathogens. Based upon results from 21 beach sites, models were developed for 13 sites, and the most predictive variables were rainfall, wind direction and speed, turbidity, and water temperature. Models were not developed at sites where the E. coli standard was seldom exceeded. Models were validated at nine sites during an independent year. At three sites, the model resulted in increased correct responses, sensitivities, and specificities compared to use of the previous day's E. coli concentration (the current method). Drought conditions during the validation year precluded being able to adequately assess model performance at most of the other sites. Cryptosporidium, adenovirus, eaeA (E. coli), ipaH (Shigella), and spvC (Salmonella) were found in at least 20% of samples collected for pathogens at five sites. The presence or absence of the three bacterial genes was related to some of the model variables but was not consistently related to E. coli concentrations. Predictive models were not effective at all inland lake sites; however, their use at two lakes with high swimmer densities will provide better estimates of public health risk than current methods and will be a valuable resource for beach managers and the public.

  9. Predictive models for Escherichia coli concentrations at inland lake beaches and relationship of model variables to pathogen detection.

    PubMed

    Francy, Donna S; Stelzer, Erin A; Duris, Joseph W; Brady, Amie M G; Harrison, John H; Johnson, Heather E; Ware, Michael W

    2013-03-01

    Predictive models, based on environmental and water quality variables, have been used to improve the timeliness and accuracy of recreational water quality assessments, but their effectiveness has not been studied in inland waters. Sampling at eight inland recreational lakes in Ohio was done in order to investigate using predictive models for Escherichia coli and to understand the links between E. coli concentrations, predictive variables, and pathogens. Based upon results from 21 beach sites, models were developed for 13 sites, and the most predictive variables were rainfall, wind direction and speed, turbidity, and water temperature. Models were not developed at sites where the E. coli standard was seldom exceeded. Models were validated at nine sites during an independent year. At three sites, the model resulted in increased correct responses, sensitivities, and specificities compared to use of the previous day's E. coli concentration (the current method). Drought conditions during the validation year precluded being able to adequately assess model performance at most of the other sites. Cryptosporidium, adenovirus, eaeA (E. coli), ipaH (Shigella), and spvC (Salmonella) were found in at least 20% of samples collected for pathogens at five sites. The presence or absence of the three bacterial genes was related to some of the model variables but was not consistently related to E. coli concentrations. Predictive models were not effective at all inland lake sites; however, their use at two lakes with high swimmer densities will provide better estimates of public health risk than current methods and will be a valuable resource for beach managers and the public.

  10. An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways

    PubMed Central

    Ander, Bradley P.; Zhang, Xiaoshuai; Xue, Fuzhong; Sharp, Frank R.; Yang, Xiaowei

    2013-01-01

    The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses face difficulty to incorporate correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with ‘large p, small n’ problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point view of systems biology, iBVS enables user to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilitic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed. PMID:23844055

  11. An integrative framework for Bayesian variable selection with informative priors for identifying genes and pathways.

    PubMed

    Peng, Bin; Zhu, Dianwen; Ander, Bradley P; Zhang, Xiaoshuai; Xue, Fuzhong; Sharp, Frank R; Yang, Xiaowei

    2013-01-01

    The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses face difficulty to incorporate correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with 'large p, small n' problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point view of systems biology, iBVS enables user to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilitic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed.

  12. Effects of Gene Duplication, Positive Selection, and Shifts in Gene Expression on the Evolution of the Venom Gland Transcriptome in Widow Spiders

    PubMed Central

    Haney, Robert A.; Clarke, Thomas H.; Gadgil, Rujuta; Fitzpatrick, Ryan; Hayashi, Cheryl Y.; Ayoub, Nadia A.; Garb, Jessica E.

    2016-01-01

    Gene duplication and positive selection can be important determinants of the evolution of venom, a protein-rich secretion used in prey capture and defense. In a typical model of venom evolution, gene duplicates switch to venom gland expression and change function under the action of positive selection, which together with further duplication produces large gene families encoding diverse toxins. Although these processes have been demonstrated for individual toxin families, high-throughput multitissue sequencing of closely related venomous species can provide insights into evolutionary dynamics at the scale of the entire venom gland transcriptome. By assembling and analyzing multitissue transcriptomes from the Western black widow spider and two closely related species with distinct venom toxicity phenotypes, we do not find that gene duplication and duplicate retention is greater in gene families with venom gland biased expression in comparison with broadly expressed families. Positive selection has acted on some venom toxin families, but does not appear to be in excess for families with venom gland biased expression. Moreover, we find 309 distinct gene families that have single transcripts with venom gland biased expression, suggesting that the switching of genes to venom gland expression in numerous unrelated gene families has been a dominant mode of evolution. We also find ample variation in protein sequences of venom gland–specific transcripts, lineage-specific family sizes, and ortholog expression among species. This variation might contribute to the variable venom toxicity of these species. PMID:26733576

  13. Identification of an ancestral resistance gene cluster involved in the coevolution process between Phaseolus vulgaris and its fungal pathogen Colletotrichum lindemuthianum.

    PubMed

    Geffroy, V; Sicard, D; de Oliveira, J C; Sévignac, M; Cohen, S; Gepts, P; Neema, C; Langin, T; Dron, M

    1999-09-01

    The recent cloning of plant resistance (R) genes and the sequencing of resistance gene clusters have shed light on the molecular evolution of R genes. However, up to now, no attempt has been made to correlate this molecular evolution with the host-pathogen coevolution process at the population level. Cross-inoculations were carried out between 26 strains of the fungal pathogen Colletotrichum lindemuthianum and 48 Phaseolus vulgaris plants collected in the three centers of diversity of the host species. A high level of diversity for resistance against the pathogen was revealed. Most of the resistance specificities were overcome in sympatric situations, indicating an adaptation of the pathogen to the local host. In contrast, plants were generally resistant to allopatric strains, suggesting that R genes that were efficient against exotic strains but had been overcome locally were maintained in the plant genome. These results indicated that coevolution processes between the two protagonists led to a differentiation for resistance in the three centers of diversity of the host. To improve our understanding of the molecular evolution of these different specificities, a recombinant inbred (RI) population derived from two representative genotypes of the Andean (JaloEEP558) and Mesoamerican (BAT93) gene pools was used to map anthracnose specificities. A gene cluster comprising both Andean (Co-y; Co-z) and Mesoamerican (Co-9) host resistance specificities was identified, suggesting that this locus existed prior to the separation of the two major gene pools of P. vulgaris. Molecular analysis revealed a high level of complexity at this locus. It harbors 11 restriction fragment length polymorphisms when R gene analog (RGA) clones are used. The relationship between the coevolution process and diversification of resistance specificities at resistance gene clusters is discussed.

  14. Analysis of the Highly Diverse Gene Borders in Ebola Virus Reveals a Distinct Mechanism of Transcriptional Regulation

    PubMed Central

    Brauburger, Kristina; Boehmann, Yannik; Tsuda, Yoshimi; Hoenen, Thomas; Olejnik, Judith; Schümann, Michael; Ebihara, Hideki

    2014-01-01

    ABSTRACT Ebola virus (EBOV) belongs to the group of nonsegmented negative-sense RNA viruses. The seven EBOV genes are separated by variable gene borders, including short (4- or 5-nucleotide) intergenic regions (IRs), a single long (144-nucleotide) IR, and gene overlaps, where the neighboring gene end and start signals share five conserved nucleotides. The unique structure of the gene overlaps and the presence of a single long IR are conserved among all filoviruses. Here, we sought to determine the impact of the EBOV gene borders during viral transcription. We show that readthrough mRNA synthesis occurs in EBOV-infected cells irrespective of the structure of the gene border, indicating that the gene overlaps do not promote recognition of the gene end signal. However, two consecutive gene end signals at the VP24 gene might improve termination at the VP24-L gene border, ensuring efficient L gene expression. We further demonstrate that the long IR is not essential for but regulates transcription reinitiation in a length-dependent but sequence-independent manner. Mutational analysis of bicistronic minigenomes and recombinant EBOVs showed no direct correlation between IR length and reinitiation rates but demonstrated that specific IR lengths not found naturally in filoviruses profoundly inhibit downstream gene expression. Intriguingly, although truncation of the 144-nucleotide-long IR to 5 nucleotides did not substantially affect EBOV transcription, it led to a significant reduction of viral growth. IMPORTANCE Our current understanding of EBOV transcription regulation is limited due to the requirement for high-containment conditions to study this highly pathogenic virus. EBOV is thought to share many mechanistic features with well-analyzed prototype nonsegmented negative-sense RNA viruses. A single polymerase entry site at the 3′ end of the genome determines that transcription of the genes is mainly controlled by gene order and cis-acting signals found at the gene borders. Here, we examined the regulatory role of the structurally unique EBOV gene borders during viral transcription. Our data suggest that transcriptional regulation in EBOV is highly complex and differs from that in prototype viruses and further the understanding of this most fundamental process in the filovirus replication cycle. Moreover, our results with recombinant EBOVs suggest a novel role of the long IR found in all filovirus genomes during the viral replication cycle. PMID:25142600

  15. Analysis of the highly diverse gene borders in Ebola virus reveals a distinct mechanism of transcriptional regulation.

    PubMed

    Brauburger, Kristina; Boehmann, Yannik; Tsuda, Yoshimi; Hoenen, Thomas; Olejnik, Judith; Schümann, Michael; Ebihara, Hideki; Mühlberger, Elke

    2014-11-01

    Ebola virus (EBOV) belongs to the group of nonsegmented negative-sense RNA viruses. The seven EBOV genes are separated by variable gene borders, including short (4- or 5-nucleotide) intergenic regions (IRs), a single long (144-nucleotide) IR, and gene overlaps, where the neighboring gene end and start signals share five conserved nucleotides. The unique structure of the gene overlaps and the presence of a single long IR are conserved among all filoviruses. Here, we sought to determine the impact of the EBOV gene borders during viral transcription. We show that readthrough mRNA synthesis occurs in EBOV-infected cells irrespective of the structure of the gene border, indicating that the gene overlaps do not promote recognition of the gene end signal. However, two consecutive gene end signals at the VP24 gene might improve termination at the VP24-L gene border, ensuring efficient L gene expression. We further demonstrate that the long IR is not essential for but regulates transcription reinitiation in a length-dependent but sequence-independent manner. Mutational analysis of bicistronic minigenomes and recombinant EBOVs showed no direct correlation between IR length and reinitiation rates but demonstrated that specific IR lengths not found naturally in filoviruses profoundly inhibit downstream gene expression. Intriguingly, although truncation of the 144-nucleotide-long IR to 5 nucleotides did not substantially affect EBOV transcription, it led to a significant reduction of viral growth. Our current understanding of EBOV transcription regulation is limited due to the requirement for high-containment conditions to study this highly pathogenic virus. EBOV is thought to share many mechanistic features with well-analyzed prototype nonsegmented negative-sense RNA viruses. A single polymerase entry site at the 3' end of the genome determines that transcription of the genes is mainly controlled by gene order and cis-acting signals found at the gene borders. Here, we examined the regulatory role of the structurally unique EBOV gene borders during viral transcription. Our data suggest that transcriptional regulation in EBOV is highly complex and differs from that in prototype viruses and further the understanding of this most fundamental process in the filovirus replication cycle. Moreover, our results with recombinant EBOVs suggest a novel role of the long IR found in all filovirus genomes during the viral replication cycle. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  16. Associations between gastric dilatation-volvulus in Great Danes and specific alleles of the canine immune-system genes DLA88, DRB1, and TLR5.

    PubMed

    Harkey, Michael A; Villagran, Alexandra M; Venkataraman, Gopalakrishnan M; Leisenring, Wendy M; Hullar, Meredith A J; Torok-Storb, Beverly J

    2017-08-01

    OBJECTIVE To determine whether specific alleles of candidate genes of the major histocompatibility complex (MHC) and innate immune system were associated with gastric dilatation-volvulus (GDV) in Great Danes. ANIMALS 42 healthy Great Danes (control group) and 39 Great Danes with ≥ 1 GDV episode. PROCEDURES Variable regions of the 2 most polymorphic MHC genes (DLA88 and DRB1) were amplified and sequenced from the dogs in each group. Similarly, regions of 3 genes associated with the innate immune system (TLR5, NOD2, and ATG16L1), which have been linked to inflammatory bowel disease, were amplified and sequenced. Alleles were evaluated for associations with GDV, controlling for age and dog family. RESULTS Specific alleles of genes DLA88, DRB1, and TLR5 were significantly associated with GDV. One allele of each gene had an OR > 2 in the unadjusted univariate analyses and retained a hazard ratio > 2 after controlling for temperament, age, and familial association in the multivariate analysis. CONCLUSIONS AND CLINICAL RELEVANCE The 3 GDV-associated alleles identified in this study may serve as diagnostic markers for identification of Great Danes at risk for GDV. Additional research is needed to determine whether other dog breeds have the same genetic associations. These findings also provided a new target for research into the etiology of, and potential treatments for, GDV in dogs.

  17. PAGER 2.0: an update to the pathway, annotated-list and gene-signature electronic repository for Human Network Biology

    PubMed Central

    Yue, Zongliang; Zheng, Qi; Neylon, Michael T; Yoo, Minjae; Shin, Jimin; Zhao, Zhiying; Tan, Aik Choon

    2018-01-01

    Abstract Integrative Gene-set, Network and Pathway Analysis (GNPA) is a powerful data analysis approach developed to help interpret high-throughput omics data. In PAGER 1.0, we demonstrated that researchers can gain unbiased and reproducible biological insights with the introduction of PAGs (Pathways, Annotated-lists and Gene-signatures) as the basic data representation elements. In PAGER 2.0, we improve the utility of integrative GNPA by significantly expanding the coverage of PAGs and PAG-to-PAG relationships in the database, defining a new metric to quantify PAG data qualities, and developing new software features to simplify online integrative GNPA. Specifically, we included 84 282 PAGs spanning 24 different data sources that cover human diseases, published gene-expression signatures, drug–gene, miRNA–gene interactions, pathways and tissue-specific gene expressions. We introduced a new normalized Cohesion Coefficient (nCoCo) score to assess the biological relevance of genes inside a PAG, and RP-score to rank genes and assign gene-specific weights inside a PAG. The companion web interface contains numerous features to help users query and navigate the database content. The database content can be freely downloaded and is compatible with third-party Gene Set Enrichment Analysis tools. We expect PAGER 2.0 to become a major resource in integrative GNPA. PAGER 2.0 is available at http://discovery.informatics.uab.edu/PAGER/. PMID:29126216

  18. Dynamic evolution of plant mitochondrial genomes: Mobile genes and introns and highly variable mutation rates

    PubMed Central

    Palmer, Jeffrey D.; Adams, Keith L.; Cho, Yangrae; Parkinson, Christopher L.; Qiu, Yin-Long; Song, Keming

    2000-01-01

    We summarize our recent studies showing that angiosperm mitochondrial (mt) genomes have experienced remarkably high rates of gene loss and concomitant transfer to the nucleus and of intron acquisition by horizontal transfer. Moreover, we find substantial lineage-specific variation in rates of these structural mutations and also point mutations. These findings mostly arise from a Southern blot survey of gene and intron distribution in 281 diverse angiosperms. These blots reveal numerous losses of mt ribosomal protein genes but, with one exception, only rare loss of respiratory genes. Some lineages of angiosperms have kept all of their mt ribosomal protein genes whereas others have lost most of them. These many losses appear to reflect remarkably high (and variable) rates of functional transfer of mt ribosomal protein genes to the nucleus in angiosperms. The recent transfer of cox2 to the nucleus in legumes provides both an example of interorganellar gene transfer in action and a starting point for discussion of the roles of mechanistic and selective forces in determining the distribution of genetic labor between organellar and nuclear genomes. Plant mt genomes also acquire sequences by horizontal transfer. A striking example of this is a homing group I intron in the mt cox1 gene. This extraordinarily invasive mobile element has probably been acquired over 1,000 times separately during angiosperm evolution via a recent wave of cross-species horizontal transfers. Finally, whereas all previously examined angiosperm mtDNAs have low rates of synonymous substitutions, mtDNAs of two distantly related angiosperms have highly accelerated substitution rates. PMID:10860957

  19. HCV proteins and immunoglobulin variable gene (IgV) subfamilies in HCV-induced type II mixed cryoglobulinemia: a concurrent pathogenetic role.

    PubMed

    Sautto, Giuseppe; Mancini, Nicasio; Solforosi, Laura; Diotti, Roberta A; Clementi, Massimo; Burioni, Roberto

    2012-01-01

    The association between hepatitis C virus (HCV) infection and type II mixed cryoglobulinemia (MCII) is well established, but the role played by distinct HCV proteins and by specific components of the anti-HCV humoral immune response remains to be clearly defined. It is widely accepted that HCV drives the expansion of few B-cell clones expressing a restricted pool of selected immunoglobulin variable (IgV) gene subfamilies frequently endowed with rheumatoid factor (RF) activity. Moreover, the same IgV subfamilies are frequently observed in HCV-transformed malignant B-cell clones occasionally complicating MCII. In this paper, we analyze both the humoral and viral counterparts at the basis of cryoglobulins production in HCV-induced MCII, with particular attention reserved to the single IgV subfamilies most frequently involved.

  20. HCV Proteins and Immunoglobulin Variable Gene (IgV) Subfamilies in HCV-Induced Type II Mixed Cryoglobulinemia: A Concurrent Pathogenetic Role

    PubMed Central

    Sautto, Giuseppe; Mancini, Nicasio; Solforosi, Laura; Diotti, Roberta A.; Clementi, Massimo; Burioni, Roberto

    2012-01-01

    The association between hepatitis C virus (HCV) infection and type II mixed cryoglobulinemia (MCII) is well established, but the role played by distinct HCV proteins and by specific components of the anti-HCV humoral immune response remains to be clearly defined. It is widely accepted that HCV drives the expansion of few B-cell clones expressing a restricted pool of selected immunoglobulin variable (IgV) gene subfamilies frequently endowed with rheumatoid factor (RF) activity. Moreover, the same IgV subfamilies are frequently observed in HCV-transformed malignant B-cell clones occasionally complicating MCII. In this paper, we analyze both the humoral and viral counterparts at the basis of cryoglobulins production in HCV-induced MCII, with particular attention reserved to the single IgV subfamilies most frequently involved. PMID:22690241

  1. Systematic assessment of cervical cancer initiation and progression uncovers genetic panels for deep learning-based early diagnosis and proposes novel diagnostic and prognostic biomarkers.

    PubMed

    Long, Nguyen Phuoc; Jung, Kyung Hee; Yoon, Sang Jun; Anh, Nguyen Hoang; Nghi, Tran Diem; Kang, Yun Pyo; Yan, Hong Hua; Min, Jung Eun; Hong, Soon-Sun; Kwon, Sung Won

    2017-12-12

    Although many outstanding achievements in the management of cervical cancer (CxCa) have obtained, it still imposes a major burden which has prompted scientists to discover and validate new CxCa biomarkers to improve the diagnostic and prognostic assessment of CxCa. In this study, eight different gene expression data sets containing 202 cancer, 115 cervical intraepithelial neoplasia (CIN), and 105 normal samples were utilized for an integrative systems biology assessment in a multi-stage carcinogenesis manner. Deep learning-based diagnostic models were established based on the genetic panels of intrinsic genes of cervical carcinogenesis as well as on the unbiased variable selection approach. Survival analysis was also conducted to explore the potential biomarker candidates for prognostic assessment. Our results showed that cell cycle, RNA transport, mRNA surveillance, and one carbon pool by folate were the key regulatory mechanisms involved in the initiation, progression, and metastasis of CxCa. Various genetic panels combined with machine learning algorithms successfully differentiated CxCa from CIN and normalcy in cross-study normalized data sets. In particular, the 168-gene deep learning model for the differentiation of cancer from normalcy achieved an externally validated accuracy of 97.96% (99.01% sensitivity and 95.65% specificity). Survival analysis revealed that ZNF281 and EPHB6 were the two most promising prognostic genetic markers for CxCa among others. Our findings open new opportunities to enhance current understanding of the characteristics of CxCa pathobiology. In addition, the combination of transcriptomics-based signatures and deep learning classification may become an important approach to improve CxCa diagnosis and management in clinical practice.

  2. Systematic assessment of cervical cancer initiation and progression uncovers genetic panels for deep learning-based early diagnosis and proposes novel diagnostic and prognostic biomarkers

    PubMed Central

    Long, Nguyen Phuoc; Jung, Kyung Hee; Yoon, Sang Jun; Anh, Nguyen Hoang; Nghi, Tran Diem; Kang, Yun Pyo; Yan, Hong Hua; Min, Jung Eun; Hong, Soon-Sun; Kwon, Sung Won

    2017-01-01

    Although many outstanding achievements in the management of cervical cancer (CxCa) have obtained, it still imposes a major burden which has prompted scientists to discover and validate new CxCa biomarkers to improve the diagnostic and prognostic assessment of CxCa. In this study, eight different gene expression data sets containing 202 cancer, 115 cervical intraepithelial neoplasia (CIN), and 105 normal samples were utilized for an integrative systems biology assessment in a multi-stage carcinogenesis manner. Deep learning-based diagnostic models were established based on the genetic panels of intrinsic genes of cervical carcinogenesis as well as on the unbiased variable selection approach. Survival analysis was also conducted to explore the potential biomarker candidates for prognostic assessment. Our results showed that cell cycle, RNA transport, mRNA surveillance, and one carbon pool by folate were the key regulatory mechanisms involved in the initiation, progression, and metastasis of CxCa. Various genetic panels combined with machine learning algorithms successfully differentiated CxCa from CIN and normalcy in cross-study normalized data sets. In particular, the 168-gene deep learning model for the differentiation of cancer from normalcy achieved an externally validated accuracy of 97.96% (99.01% sensitivity and 95.65% specificity). Survival analysis revealed that ZNF281 and EPHB6 were the two most promising prognostic genetic markers for CxCa among others. Our findings open new opportunities to enhance current understanding of the characteristics of CxCa pathobiology. In addition, the combination of transcriptomics-based signatures and deep learning classification may become an important approach to improve CxCa diagnosis and management in clinical practice. PMID:29312619

  3. Detection of aberrant methylation of a six-gene panel in serum DNA for diagnosis of breast cancer

    PubMed Central

    Li, Junnan; Li, Xiaobo; Wang, Dong; Su, Yonghui; Niu, Ming; Zhong, Zhenbin; Wang, Ji; Zhang, Xianyu; Kang, Wenli; Pang, Da

    2016-01-01

    Detection of breast cancer at an early stage is the key for successful treatment and improvement of outcome. However the limitations of mammography are well recognized, especially for those women with premenopausal breast cancer. Novel approaches to breast cancer screening are necessary, especially in the developing world where mammography is not feasible. In this study, we examined the promoter methylation of six genes (SFN, P16, hMLH1, HOXD13, PCDHGB7 and RASSF1a) in circulating free DNA (cfDNA) extracted from serum. We used a high-throughput DNA methylation assay (MethyLight) to examine serum from 749 cases including breast cancer patients, patients with benign breast diseases and healthy women. The six-gene methylation panel test achieved 79.6% and 82.4% sensitivity with a specificity of 72.4% and 78.1% in diagnosis of breast cancer when compared with healthy and benign disease controls, respectively. Moreover, the methylation panel positive group showed significant differences in the following independent variables: (a) involvement of family history of tumors; (b) a low proliferative index, ki-67; (c) high ratios in luminal subtypes. Additionally the panel also complemented some breast cancer cases which were neglected by mammography or ultrasound. These data suggest that epigenetic markers in serum have potential for diagnosis of breast cancer. PMID:26918343

  4. Modeling forest ecosystem responses to elevated carbon dioxide and ozone using artificial neural networks.

    PubMed

    Larsen, Peter E; Cseke, Leland J; Miller, R Michael; Collart, Frank R

    2014-10-21

    Rising atmospheric levels of carbon dioxide and ozone will impact productivity and carbon sequestration in forest ecosystems. The scale of this process and the potential economic consequences provide an incentive for the development of models to predict the types and rates of ecosystem responses and feedbacks that result from and influence of climate change. In this paper, we use phenotypic and molecular data derived from the Aspen Free Air CO2 Enrichment site (Aspen-FACE) to evaluate modeling approaches for ecosystem responses to changing conditions. At FACE, it was observed that different aspen clones exhibit clone-specific responses to elevated atmospheric levels of carbon dioxide and ozone. To identify the molecular basis for these observations, we used artificial neural networks (ANN) to examine above and below-ground community phenotype responses to elevated carbon dioxide, elevated ozone and gene expression profiles. The aspen community models generated using this approach identified specific genes and subnetworks of genes associated with variable sensitivities for aspen clones. The ANN model also predicts specific co-regulated gene clusters associated with differential sensitivity to elevated carbon dioxide and ozone in aspen species. The results suggest ANN is an effective approach to predict relevant gene expression changes resulting from environmental perturbation and provides useful information for the rational design of future biological experiments. Copyright © 2014 Elsevier Ltd. All rights reserved.

  5. Diverse hematological phenotypes of β-thalassemia carriers.

    PubMed

    Luo, Hong-Yuan; Chui, David H K

    2016-03-01

    Most β-thalassemia carriers have mild anemia, low mean corpuscular volume and mean corpuscular hemoglobin, and elevated hemoglobin α2 (HbA2 ). However, there is considerable variability resulting from coinheritance with α- and/or δ-globin gene mutations, dominant inheritance of β-thalassemia mutations, highly unstable variant globin chains, large deletions removing part or all of the β-globin gene cluster, loss of heterozygosity of the β-globin gene cluster during development, or concomitant erythroid enzyme or membrane protein abnormalities. Recognition of the specific abnormality and correct diagnosis can allay anxiety and unnecessary investigation, help formulate treatment programs, and deliver appropriate genetic and family counseling. © 2016 New York Academy of Sciences.

  6. Diversity surveys and evolutionary relationships of aoxB genes in aerobic arsenite-oxidizing bacteria.

    PubMed

    Quéméneur, Marianne; Heinrich-Salmeron, Audrey; Muller, Daniel; Lièvremont, Didier; Jauzein, Michel; Bertin, Philippe N; Garrido, Francis; Joulian, Catherine

    2008-07-01

    A new primer set was designed to specifically amplify ca. 1,100 bp of aoxB genes encoding the As(III) oxidase catalytic subunit from taxonomically diverse aerobic As(III)-oxidizing bacteria. Comparative analysis of AoxB protein sequences showed variable conservation levels and highlighted the conservation of essential amino acids and structural motifs. AoxB phylogeny of pure strains showed well-discriminated taxonomic groups and was similar to 16S rRNA phylogeny. Alphaproteobacteria-, Betaproteobacteria-, and Gammaproteobacteria-related sequences were retrieved from environmental surveys, demonstrating their prevalence in mesophilic As-contaminated soils. Our study underlines the usefulness of the aoxB gene as a functional marker of aerobic As(III) oxidizers.

  7. Gene expression atlas of pigeonpea and its application to gain insights into genes associated with pollen fertility implicated in seed formation.

    PubMed

    Pazhamala, Lekha T; Purohit, Shilp; Saxena, Rachit K; Garg, Vanika; Krishnamurthy, L; Verdier, Jerome; Varshney, Rajeev K

    2017-04-01

    Pigeonpea (Cajanus cajan) is an important grain legume of the semi-arid tropics, mainly used for its protein rich seeds. To link the genome sequence information with agronomic traits resulting from specific developmental processes, a Cajanus cajan gene expression atlas (CcGEA) was developed using the Asha genotype. Thirty tissues/organs representing developmental stages from germination to senescence were used to generate 590.84 million paired-end RNA-Seq data. The CcGEA revealed a compendium of 28 793 genes with differential, specific, spatio-temporal and constitutive expression during various stages of development in different tissues. As an example to demonstrate the application of the CcGEA, a network of 28 flower-related genes analysed for cis-regulatory elements and splicing variants has been identified. In addition, expression analysis of these candidate genes in male sterile and male fertile genotypes suggested their critical role in normal pollen development leading to seed formation. Gene network analysis also identified two regulatory genes, a pollen-specific SF3 and a sucrose-proton symporter, that could have implications for improvement of agronomic traits such as seed production and yield. In conclusion, the CcGEA provides a valuable resource for pigeonpea to identify candidate genes involved in specific developmental processes and to understand the well-orchestrated growth and developmental process in this resilient crop. © The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  8. Field evaluation of apple overexpressing a peach CBF gene confirms its effect on cold hardiness, dormancy, and growth

    USDA-ARS?s Scientific Manuscript database

    In recent years, the scientific literature has become replete with examples of the improvement of abiotic stress tolerance by overexpression of specific genes. Few studies, however, have evaluated transgenic plants under field conditions or the impact of overexpression on non-target traits. We pre...

  9. Quantitative genetic analysis of agronomic and morphological traits in sorghum, Sorghum bicolor

    PubMed Central

    Mohammed, Riyazaddin; Are, Ashok K.; Bhavanasi, Ramaiah; Munghate, Rajendra S.; Kavi Kishor, Polavarapu B.; Sharma, Hari C.

    2015-01-01

    The productivity in sorghum is low, owing to various biotic and abiotic constraints. Combining insect resistance with desirable agronomic and morphological traits is important to increase sorghum productivity. Therefore, it is important to understand the variability for various agronomic traits, their heritabilities and nature of gene action to develop appropriate strategies for crop improvement. Therefore, a full diallel set of 10 parents and their 90 crosses including reciprocals were evaluated in replicated trials during the 2013–14 rainy and postrainy seasons. The crosses between the parents with early- and late-flowering flowered early, indicating dominance of earliness for anthesis in the test material used. Association between the shoot fly resistance, morphological, and agronomic traits suggested complex interactions between shoot fly resistance and morphological traits. Significance of the mean sum of squares for GCA (general combining ability) and SCA (specific combining ability) of all the studied traits suggested the importance of both additive and non-additive components in inheritance of these traits. The GCA/SCA, and the predictability ratios indicated predominance of additive gene effects for majority of the traits studied. High broad-sense and narrow-sense heritability estimates were observed for most of the morphological and agronomic traits. The significance of reciprocal combining ability effects for days to 50% flowering, plant height and 100 seed weight, suggested maternal effects for inheritance of these traits. Plant height and grain yield across seasons, days to 50% flowering, inflorescence exsertion, and panicle shape in the postrainy season showed greater specific combining ability variance, indicating the predominance of non-additive type of gene action/epistatic interactions in controlling the expression of these traits. Additive gene action in the rainy season, and dominance in the postrainy season for days to 50% flowering and plant height suggested G X E interactions for these traits. PMID:26579183

  10. Panel of 23S rRNA Gene-Based Real-Time PCR Assays for Improved Universal and Group-Specific Detection of Phytoplasmas▿ †

    PubMed Central

    Hodgetts, Jennifer; Boonham, Neil; Mumford, Rick; Dickinson, Matthew

    2009-01-01

    Primers and probes based on the 23S rRNA gene have been utilized to design a range of real-time PCR assays for routine phytoplasma diagnostics. These assays have been authenticated as phytoplasma specific and shown to be at least as sensitive as nested PCR. A universal assay to detect all phytoplasmas has been developed, along with a multiplex assay to discriminate 16SrI group phytoplasmas from members of all of the other 16Sr groups. Assays for the 16SrII, 16SrIV, and 16SrXII groups have also been developed to confirm that the 23S rRNA gene can be used to design group-specific assays. PMID:19270148

  11. Genetic Algorithms Applied to Multi-Objective Aerodynamic Shape Optimization

    NASA Technical Reports Server (NTRS)

    Holst, Terry L.

    2004-01-01

    A genetic algorithm approach suitable for solving multi-objective optimization problems is described and evaluated using a series of aerodynamic shape optimization problems. Several new features including two variations of a binning selection algorithm and a gene-space transformation procedure are included. The genetic algorithm is suitable for finding pareto optimal solutions in search spaces that are defined by any number of genes and that contain any number of local extrema. A new masking array capability is included allowing any gene or gene subset to be eliminated as decision variables from the design space. This allows determination of the effect of a single gene or gene subset on the pareto optimal solution. Results indicate that the genetic algorithm optimization approach is flexible in application and reliable. The binning selection algorithms generally provide pareto front quality enhancements and moderate convergence efficiency improvements for most of the problems solved.

  12. Genetic Algorithms Applied to Multi-Objective Aerodynamic Shape Optimization

    NASA Technical Reports Server (NTRS)

    Holst, Terry L.

    2005-01-01

    A genetic algorithm approach suitable for solving multi-objective problems is described and evaluated using a series of aerodynamic shape optimization problems. Several new features including two variations of a binning selection algorithm and a gene-space transformation procedure are included. The genetic algorithm is suitable for finding Pareto optimal solutions in search spaces that are defined by any number of genes and that contain any number of local extrema. A new masking array capability is included allowing any gene or gene subset to be eliminated as decision variables from the design space. This allows determination of the effect of a single gene or gene subset on the Pareto optimal solution. Results indicate that the genetic algorithm optimization approach is flexible in application and reliable. The binning selection algorithms generally provide Pareto front quality enhancements and moderate convergence efficiency improvements for most of the problems solved.

  13. Natural selection on marine carnivores elaborated a diverse family of classical MHC class I genes exhibiting haplotypic gene content variation and allelic polymorphism

    PubMed Central

    Norman, Paul J.; Parham, Peter

    2012-01-01

    Pinnipeds, marine carnivores, diverged from terrestrial carnivores ~45 million years ago, before their adaptation to marine environments. This lifestyle change exposed pinnipeds to different microbiota and pathogens, with probable impact on their MHC class I genes. Investigating this question, genomic sequences were determined for 71 MHC class I variants: 27 from harbor seal and 44 from gray seal. These variants form three MHC class I gene lineages, one comprising a pseudogene. The second, a candidate nonclassical MHC class I gene, comprises a nonpolymorphic transcribed gene related to dog DLA-79 and giant panda Aime-1906. The third is the diversity lineage, which includes 62 of the 71 seal MHC class I variants. All are transcribed, and they minimally represent six harbor and 12 gray seal MHC class I genes. Besides species-specific differences in gene number, seal MHC class I haplotypes exhibit gene content variation and allelic polymorphism. Patterns of sequence variation, and of positions for positively selected sites, indicate the diversity lineage genes are the seals’ classical MHC class I genes. Evidence that expansion of diversity lineage genes began before gray and harbor seals diverged is the presence in both species of two distinctive sublineages of diversity lineage genes. Pointing to further expansion following the divergence are the presence of species-specific genes and greater MHC class I diversity in gray seals than harbor seals. The elaboration of a complex variable family of classical MHC class I genes in pinnipeds contrasts with the single, highly polymorphic classical MHC class I gene of dog and giant panda, terrestrial carnivores. PMID:23001684

  14. A New Chicken Genome Assembly Provides Insight into Avian Genome Structure.

    PubMed

    Warren, Wesley C; Hillier, LaDeana W; Tomlinson, Chad; Minx, Patrick; Kremitzki, Milinn; Graves, Tina; Markovic, Chris; Bouk, Nathan; Pruitt, Kim D; Thibaud-Nissen, Francoise; Schneider, Valerie; Mansour, Tamer A; Brown, C Titus; Zimin, Aleksey; Hawken, Rachel; Abrahamsen, Mitch; Pyrkosz, Alexis B; Morisson, Mireille; Fillon, Valerie; Vignal, Alain; Chow, William; Howe, Kerstin; Fulton, Janet E; Miller, Marcia M; Lovell, Peter; Mello, Claudio V; Wirthlin, Morgan; Mason, Andrew S; Kuo, Richard; Burt, David W; Dodgson, Jerry B; Cheng, Hans H

    2017-01-05

    The importance of the Gallus gallus (chicken) as a model organism and agricultural animal merits a continuation of sequence assembly improvement efforts. We present a new version of the chicken genome assembly (Gallus_gallus-5.0; GCA_000002315.3), built from combined long single molecule sequencing technology, finished BACs, and improved physical maps. In overall assembled bases, we see a gain of 183 Mb, including 16.4 Mb in placed chromosomes with a corresponding gain in the percentage of intact repeat elements characterized. Of the 1.21 Gb genome, we include three previously missing autosomes, GGA30, 31, and 33, and improve sequence contig length 10-fold over the previous Gallus_gallus-4.0. Despite the significant base representation improvements made, 138 Mb of sequence is not yet located to chromosomes. When annotated for gene content, Gallus_gallus-5.0 shows an increase of 4679 annotated genes (2768 noncoding and 1911 protein-coding) over those in Gallus_gallus-4.0. We also revisited the question of what genes are missing in the avian lineage, as assessed by the highest quality avian genome assembly to date, and found that a large fraction of the original set of missing genes are still absent in sequenced bird species. Finally, our new data support a detailed map of MHC-B, encompassing two segments: one with a highly stable gene copy number and another in which the gene copy number is highly variable. The chicken model has been a critical resource for many other fields of study, and this new reference assembly will substantially further these efforts. Copyright © 2017 Warren et al.

  15. Massively parallel rRNA gene sequencing exacerbates the potential for biased community diversity comparisons due to variable library sizes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gihring, Thomas; Green, Stefan; Schadt, Christopher Warren

    2011-01-01

    Technologies for massively parallel sequencing are revolutionizing microbial ecology and are vastly increasing the scale of ribosomal RNA (rRNA) gene studies. Although pyrosequencing has increased the breadth and depth of possible rRNA gene sampling, one drawback is that the number of reads obtained per sample is difficult to control. Pyrosequencing libraries typically vary widely in the number of sequences per sample, even within individual studies, and there is a need to revisit the behaviour of richness estimators and diversity indices with variable gene sequence library sizes. Multiple reports and review papers have demonstrated the bias in non-parametric richness estimators (e.g.more » Chao1 and ACE) and diversity indices when using clone libraries. However, we found that biased community comparisons are accumulating in the literature. Here we demonstrate the effects of sample size on Chao1, ACE, CatchAll, Shannon, Chao-Shen and Simpson's estimations specifically using pyrosequencing libraries. The need to equalize the number of reads being compared across libraries is reiterated, and investigators are directed towards available tools for making unbiased diversity comparisons.« less

  16. Plankton networks driving carbon export in the oligotrophic ocean

    PubMed Central

    Larhlimi, Abdelhalim; Roux, Simon; Darzi, Youssef; Audic, Stephane; Berline, Léo; Brum, Jennifer; Coelho, Luis Pedro; Espinoza, Julio Cesar Ignacio; Malviya, Shruti; Sunagawa, Shinichi; Dimier, Céline; Kandels-Lewis, Stefanie; Picheral, Marc; Poulain, Julie; Searson, Sarah; Stemmann, Lars; Not, Fabrice; Hingamp, Pascal; Speich, Sabrina; Follows, Mick; Karp-Boss, Lee; Boss, Emmanuel; Ogata, Hiroyuki; Pesant, Stephane; Weissenbach, Jean; Wincker, Patrick; Acinas, Silvia G.; Bork, Peer; de Vargas, Colomban; Iudicone, Daniele; Sullivan, Matthew B.; Raes, Jeroen; Karsenti, Eric; Bowler, Chris; Gorsky, Gabriel

    2015-01-01

    The biological carbon pump is the process by which CO2 is transformed to organic carbon via photosynthesis, exported through sinking particles, and finally sequestered in the deep ocean. While the intensity of the pump correlates with plankton community composition, the underlying ecosystem structure driving the process remains largely uncharacterised. Here we use environmental and metagenomic data gathered during the Tara Oceans expedition to improve our understanding of carbon export in the oligotrophic ocean. We show that specific plankton communities, from the surface and deep chlorophyll maximum, correlate with carbon export at 150 m and highlight unexpected taxa such as Radiolaria, alveolate parasites, as well as Synechococcus and their phages, as lineages most strongly associated with carbon export in the subtropical, nutrient-depleted, oligotrophic ocean. Additionally, we show that the relative abundance of just a few bacterial and viral genes can predict most of the variability in carbon export in these regions. PMID:26863193

  17. Plankton networks driving carbon export in the oligotrophic ocean

    NASA Astrophysics Data System (ADS)

    2016-04-01

    The biological carbon pump is the process by which CO2 is transformed to organic carbon via photosynthesis, exported through sinking particles, and finally sequestered in the deep ocean. While the intensity of the pump correlates with plankton community composition, the underlying ecosystem structure driving the process remains largely uncharacterized. Here we use environmental and metagenomic data gathered during the Tara Oceans expedition to improve our understanding of carbon export in the oligotrophic ocean. We show that specific plankton communities, from the surface and deep chlorophyll maximum, correlate with carbon export at 150 m and highlight unexpected taxa such as Radiolaria and alveolate parasites, as well as Synechococcus and their phages, as lineages most strongly associated with carbon export in the subtropical, nutrient-depleted, oligotrophic ocean. Additionally, we show that the relative abundance of a few bacterial and viral genes can predict a significant fraction of the variability in carbon export in these regions.

  18. Transcript Isoform Variation Associated with Cytosine Modification in Human Lymphoblastoid Cell Lines.

    PubMed

    Zhang, Xu; Zhang, Wei

    2016-06-01

    Cytosine modification on DNA is variable among individuals, which could correlate with gene expression variation. The effect of cytosine modification on interindividual transcript isoform variation (TIV), however, remains unclear. In this study, we assessed the extent of cytosine modification-specific TIV in lymphoblastoid cell lines (LCLs) derived from unrelated individuals of European and African descent. Our study detected cytosine modification-specific TIVs for 17% of the analyzed genes at a 5% false discovery rate. Forty-five percent of the TIV-associated cytosine modifications correlated with the overall gene expression levels as well, with the corresponding CpG sites overrepresented in transcript initiation sites, transcription factor binding sites, and distinct histone modification peaks, suggesting that alternative isoform transcription underlies the TIVs. Our analysis also revealed 33% of the TIV-associated cytosine modifications that affected specific exons, with the corresponding CpG sites overrepresented in exon/intron junctions, splicing branching points, and transcript termination sites, implying that the TIVs are attributable to alternative splicing or transcription termination. Genetic and epigenetic regulation of TIV shared target preference but exerted independent effects on 61% of the common exon targets. Cytosine modification-specific TIVs detected from LCLs were differentially enriched in those detected from various tissues in The Cancer Genome Atlas, indicating their developmental dependency. Genes containing cytosine modification-specific TIVs were enriched in pathways of cancers and metabolic disorders. Our study demonstrated a prominent effect of cytosine modification variation on the transcript isoform spectrum over gross transcript abundance and revealed epigenetic contributions to diseases that were mediated through cytosine modification-specific TIV. Copyright © 2016 by the Genetics Society of America.

  19. Biasogram: Visualization of Confounding Technical Bias in Gene Expression Data

    PubMed Central

    Krzystanek, Marcin; Szallasi, Zoltan; Eklund, Aron C.

    2013-01-01

    Gene expression profiles of clinical cohorts can be used to identify genes that are correlated with a clinical variable of interest such as patient outcome or response to a particular drug. However, expression measurements are susceptible to technical bias caused by variation in extraneous factors such as RNA quality and array hybridization conditions. If such technical bias is correlated with the clinical variable of interest, the likelihood of identifying false positive genes is increased. Here we describe a method to visualize an expression matrix as a projection of all genes onto a plane defined by a clinical variable and a technical nuisance variable. The resulting plot indicates the extent to which each gene is correlated with the clinical variable or the technical variable. We demonstrate this method by applying it to three clinical trial microarray data sets, one of which identified genes that may have been driven by a confounding technical variable. This approach can be used as a quality control step to identify data sets that are likely to yield false positive results. PMID:23613961

  20. From gene engineering to gene modulation and manipulation: can we prevent or detect gene doping in sports?

    PubMed

    Fischetto, Giuseppe; Bermon, Stéphane

    2013-10-01

    During the last 2 decades, progress in deciphering the human gene map as well as the discovery of specific defective genes encoding particular proteins in some serious human diseases have resulted in attempts to treat sick patients with gene therapy. There has been considerable focus on human recombinant proteins which were gene-engineered and produced in vitro (insulin, growth hormone, insulin-like growth factor-1, erythropoietin). Unfortunately, these substances and methods also became improper tools for unscrupulous athletes. Biomedical research has focused on the possible direct insertion of gene material into the body, in order to replace some defective genes in vivo and/or to promote long-lasting endogenous synthesis of deficient proteins. Theoretically, diabetes, anaemia, muscular dystrophies, immune deficiency, cardiovascular diseases and numerous other illnesses could benefit from such innovative biomedical research, though much work remains to be done. Considering recent findings linking specific genotypes and physical performance, it is tempting to submit the young athletic population to genetic screening or, alternatively, to artificial gene expression modulation. Much research is already being conducted in order to achieve a safe transfer of genetic material to humans. This is of critical importance since uncontrolled production of the specifically coded protein, with serious secondary adverse effects (polycythaemia, acute cardiovascular problems, cancer, etc.), could occur. Other unpredictable reactions (immunogenicity of vectors or DNA-vector complex, autoimmune anaemia, production of wild genetic material) also remain possible at the individual level. Some new substances (myostatin blockers or anti-myostatin antibodies), although not gene material, might represent a useful and well-tolerated treatment to prevent progression of muscular dystrophies. Similarly, other molecules, in the roles of gene or metabolic activators [5-aminoimidazole-4-carboxamide 1-β-D-ribofuranoside (AICAR), GW1516], might concomitantly improve endurance exercise capacity in ischaemic conditions but also in normal conditions. Undoubtedly, some athletes will attempt to take advantage of these new molecules to increase strength or endurance. Antidoping laboratories are improving detection methods. These are based both on direct identification of new substances or their metabolites and on indirect evaluation of changes in gene, protein or metabolite patterns (genomics, proteomics or metabolomics).

  1. Treacher Collins syndrome: clinical implications for the paediatrician--a new mutation in a severely affected newborn and comparison with three further patients with the same mutation, and review of the literature.

    PubMed

    Schlump, Jan-Ulrich; Stein, Anja; Hehr, Ute; Karen, Tanja; Möller-Hartmann, Claudia; Elcioglu, Nursel H; Bogdanova, Nadja; Woike, Hartmut Fritz; Lohmann, Dietmar R; Felderhoff-Mueser, Ursula; Linz, Annette; Wieczorek, Dagmar

    2012-11-01

    Treacher Collins syndrome (TCS) is the most common and well-known mandibulofacial dysostosis caused by mutations in at least three genes involved in pre-rRNA transcription, the TCOF1, POLR1D and POLR1C genes. We present a severely affected male individual with TCS with a heterozygous de novo frameshift mutation within the TCOF1 gene (c.790_791delAG,p.Ser264GlnfsX7) and compare the clinical findings with three previously unpublished, milder affected individuals from two families with the same mutation. We elucidate typical clinical features of TCS and its clinical implications for the paediatrician and mandibulofacial surgeon, especially in severely affected individuals and give a short review of the literature. The clinical data of these three families illustrate that the phenotype associated with this specific mutation has a wide intra- and interfamilial variability, which confirms that variable expressivity in carriers of TCOF1 mutations is not a simple consequence of the mutation but might be modified by the combination of genetic, environmental and stochastic factors. Being such a highly complex disease treatment of individuals with TCS should be tailored to the specific needs of each individual, preferably by a multidisciplinary team consisting of paediatricians, craniofacial surgeons and geneticists.

  2. Analysis, Characterization, and Loci of the tuf Genes in Lactobacillus and Bifidobacterium Species and Their Direct Application for Species Identification

    PubMed Central

    Ventura, Marco; Canchaya, Carlos; Meylan, Valèrie; Klaenhammer, Todd R.; Zink, Ralf

    2003-01-01

    We analyzed the tuf gene, encoding elongation factor Tu, from 33 strains representing 17 Lactobacillus species and 8 Bifidobacterium species. The tuf sequences were aligned and used to infer phylogenesis among species of lactobacilli and bifidobacteria. We demonstrated that the synonymous substitution affecting this gene renders elongation factor Tu a reliable molecular clock for investigating evolutionary distances of lactobacilli and bifidobacteria. In fact, the phylogeny generated by these tuf sequences is consistent with that derived from 16S rRNA analysis. The investigation of a multiple alignment of tuf sequences revealed regions conserved among strains belonging to the same species but distinct from those of other species. PCR primers complementary to these regions allowed species-specific identification of closely related species, such as Lactobacillus casei group members. These tuf gene-based assays developed in this study provide an alternative to present methods for the identification for lactic acid bacterial species. Since a variable number of tuf genes have been described for bacteria, the presence of multiple genes was examined. Southern analysis revealed one tuf gene in the genomes of lactobacilli and bifidobacteria, but the tuf gene was arranged differently in the genomes of these two taxa. Our results revealed that the tuf gene in bifidobacteria is flanked by the same gene constellation as the str operon, as originally reported for Escherichia coli. In contrast, bioinformatic and transcriptional analyses of the DNA region flanking the tuf gene in four Lactobacillus species indicated the same four-gene unit and suggested a novel tuf operon specific for the genus Lactobacillus. PMID:14602655

  3. Targeted transfection increases siRNA uptake and gene silencing of primary endothelial cells in vitro--a quantitative study.

    PubMed

    Asgeirsdóttir, Sigridur A; Talman, Eduard G; de Graaf, Inge A; Kamps, Jan A A M; Satchell, Simon C; Mathieson, Peter W; Ruiters, Marcel H J; Molema, Grietje

    2010-01-25

    Applications of small-interfering RNA (siRNA) call for specific and efficient delivery of siRNA into particular cell types. We developed a novel, non-viral targeting system to deliver siRNA specifically into inflammation-activated endothelial cells. This was achieved by conjugating the cationic amphiphilic lipid SAINT to antibodies recognizing the inflammatory cell adhesion molecule E-selectin. These anti-E-selectin-SAINT lipoplexes (SAINTarg) maintained antigen recognition capacity of the parental antibody in vitro, and ex vivo in human kidney tissue slices subjected to inflammatory conditions. Regular SAINT mediated transfection resulted in efficient gene silencing in human microvascular endothelial cells (HMEC-1) and conditionally immortalized glomerular endothelial cells (ciGEnC). However, primary human umbilical vein endothelial cells (HUVEC) transfected poorly, a phenomenon that we could quantitatively correlate with a cell-type specific capacity to facilitate siRNA uptake. Importantly, SAINTarg increased siRNA uptake and transfection specificity for activated endothelial cells. Transfection with SAINTarg delivered significantly more siRNA into activated HUVEC, compared to transfection with non-targeted SAINT. The enhanced uptake of siRNA was corroborated by improved silencing of both gene- and protein expression of VE-cadherin in activated HUVEC, indicating that SAINTarg delivered functionally active siRNA into endothelial cells. The obtained results demonstrate a successful design of a small nucleotide carrier system with improved and specific siRNA delivery into otherwise difficult-to-transfect primary endothelial cells, which in addition reduced considerably the amount of siRNA needed for gene silencing. Copyright 2009 Elsevier B.V. All rights reserved.

  4. A Sorghum bicolor expression atlas reveals dynamic genotype-specific expression profiles for vegetative tissues of grain, sweet and bioenergy sorghums

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shakoor, N; Nair, R; Crasta, O

    2014-01-23

    Background: Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. Results: This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specificmore » probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e. g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Conclusions: Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community.« less

  5. Preliminary definition of improvement in juvenile arthritis.

    PubMed

    Giannini, E H; Ruperto, N; Ravelli, A; Lovell, D J; Felson, D T; Martini, A

    1997-07-01

    To identify a core set of outcome variables for the assessment of children with juvenile arthritis (JA), to use the core set to develop a definition of improvement to determine whether individual patients demonstrate clinically important improvement, and to promote this definition as a single efficacy measure in JA clinical trials. A core set of outcome variables was established using a combination of statistical and consensus formation techniques. Variables in the core set consisted of 1) physician global assessment of disease activity; 2) parent/patient assessment of overall well-being; 3) functional ability; 4) number of joints with active arthritis; 5) number of joints with limited range of motion; and 6) erythrocyte sedimentation rate. To establish a definition of improvement using this core set, 21 pediatric rheumatologists from 14 countries met, and, using consensus formation techniques, scored each of 72 patient profiles as improved or not improved. Using the physicians' consensus as the gold standard, the chi-square, sensitivity, and specificity were calculated for each of 240 possible definitions of improvement. Definitions with sensitivity or specificity of <80% were eliminated. The ability of the remaining definitions to discriminate between the effects of active agent and those of placebo, using actual trial data, was then observed. Each definition was also ranked for face validity, and the sum of the ranks was then multiplied by the kappa statistic. The definition of improvement with the highest final score was as follows: at least 30% improvement from baseline in 3 of any 6 variables in the core set, with no more than 1 of the remaining variables worsening by >30%. The second highest scoring definition was closely related to the first; the third highest was similar to the Paulus criteria used in adult rheumatoid arthritis trials, except with different variables. This indicates convergent validity of the process used. We propose a definition of improvement for JA. Use of a uniform definition will help standardize the conduct and reporting of clinical trials, and should help practitioners decide if a child with JA has responded adequately to therapy. We are in the process of prospectively validating this definition and several others that scored highly.

  6. Deletion of Specific Immune-Modulatory Genes from Modified Vaccinia Virus Ankara-Based HIV Vaccines Engenders Improved Immunogenicity in Rhesus Macaques

    PubMed Central

    O'Mara, Leigh A.; Gangadhara, Sailaja; McQuoid, Monica; Zhang, Xiugen; Zheng, Rui; Gill, Kiran; Verma, Meena; Yu, Tianwei; Johnson, Brent; Li, Bing; Derdeyn, Cynthia A.; Ibegbu, Chris; Altman, John D.; Hunter, Eric; Feinberg, Mark B.

    2012-01-01

    Modified vaccinia virus Ankara (MVA) is a safe, attenuated orthopoxvirus that is being developed as a vaccine vector but has demonstrated limited immunogenicity in several early-phase clinical trials. Our objective was to rationally improve the immunogenicity of MVA-based HIV/AIDS vaccines via the targeted deletion of specific poxvirus immune-modulatory genes. Vaccines expressing codon-optimized HIV subtype C consensus Env and Gag antigens were generated from MVA vector backbones that (i) harbor simultaneous deletions of four viral immune-modulatory genes, encoding an interleukin-18 (IL-18) binding protein, an IL-1β receptor, a dominant negative Toll/IL-1 signaling adapter, and CC-chemokine binding protein (MVAΔ4-HIV); (ii) harbor a deletion of an additional (fifth) viral gene, encoding uracil-DNA glycosylase (MVAΔ5-HIV); or (iii) represent the parental MVA backbone as a control (MVA-HIV). We performed head-to-head comparisons of the cellular and humoral immune responses that were elicited by these vectors during homologous prime-boost immunization regimens utilizing either high-dose (2 × 108 PFU) or low-dose (1 × 107 PFU) intramuscular immunization of rhesus macaques. At all time points, a majority of the HIV-specific T cell responses, elicited by all vectors, were directed against Env, rather than Gag, determinants, as previously observed with other vector systems. Both modified vectors elicited up to 6-fold-higher frequencies of HIV-specific CD8 and CD4 T cell responses and up to 25-fold-higher titers of Env (gp120)-specific binding (nonneutralizing) antibody responses that were relatively transient in nature. While the correlates of protection against HIV infection remain incompletely defined, our results indicate that the rational deletion of specific genes from MVA vectors can positively alter their cellular and humoral immunogenicity profiles in nonhuman primates. PMID:22973033

  7. NETWORK ASSISTED ANALYSIS TO REVEAL THE GENETIC BASIS OF AUTISM1

    PubMed Central

    Liu, Li; Lei, Jing; Roeder, Kathryn

    2016-01-01

    While studies show that autism is highly heritable, the nature of the genetic basis of this disorder remains illusive. Based on the idea that highly correlated genes are functionally interrelated and more likely to affect risk, we develop a novel statistical tool to find more potentially autism risk genes by combining the genetic association scores with gene co-expression in specific brain regions and periods of development. The gene dependence network is estimated using a novel partial neighborhood selection (PNS) algorithm, where node specific properties are incorporated into network estimation for improved statistical and computational efficiency. Then we adopt a hidden Markov random field (HMRF) model to combine the estimated network and the genetic association scores in a systematic manner. The proposed modeling framework can be naturally extended to incorporate additional structural information concerning the dependence between genes. Using currently available genetic association data from whole exome sequencing studies and brain gene expression levels, the proposed algorithm successfully identified 333 genes that plausibly affect autism risk. PMID:27134692

  8. Origins of extrinsic variability in eukaryotic gene expression

    NASA Astrophysics Data System (ADS)

    Volfson, Dmitri; Marciniak, Jennifer; Blake, William J.; Ostroff, Natalie; Tsimring, Lev S.; Hasty, Jeff

    2006-02-01

    Variable gene expression within a clonal population of cells has been implicated in a number of important processes including mutation and evolution, determination of cell fates and the development of genetic disease. Recent studies have demonstrated that a significant component of expression variability arises from extrinsic factors thought to influence multiple genes simultaneously, yet the biological origins of this extrinsic variability have received little attention. Here we combine computational modelling with fluorescence data generated from multiple promoter-gene inserts in Saccharomyces cerevisiae to identify two major sources of extrinsic variability. One unavoidable source arising from the coupling of gene expression with population dynamics leads to a ubiquitous lower limit for expression variability. A second source, which is modelled as originating from a common upstream transcription factor, exemplifies how regulatory networks can convert noise in upstream regulator expression into extrinsic noise at the output of a target gene. Our results highlight the importance of the interplay of gene regulatory networks with population heterogeneity for understanding the origins of cellular diversity.

  9. Origins of extrinsic variability in eukaryotic gene expression

    NASA Astrophysics Data System (ADS)

    Volfson, Dmitri; Marciniak, Jennifer; Blake, William J.; Ostroff, Natalie; Tsimring, Lev S.; Hasty, Jeff

    2006-03-01

    Variable gene expression within a clonal population of cells has been implicated in a number of important processes including mutation and evolution, determination of cell fates and the development of genetic disease. Recent studies have demonstrated that a significant component of expression variability arises from extrinsic factors thought to influence multiple genes in concert, yet the biological origins of this extrinsic variability have received little attention. Here we combine computational modeling with fluorescence data generated from multiple promoter-gene inserts in Saccharomyces cerevisiae to identify two major sources of extrinsic variability. One unavoidable source arising from the coupling of gene expression with population dynamics leads to a ubiquitous noise floor in expression variability. A second source which is modeled as originating from a common upstream transcription factor exemplifies how regulatory networks can convert noise in upstream regulator expression into extrinsic noise at the output of a target gene. Our results highlight the importance of the interplay of gene regulatory networks with population heterogeneity for understanding the origins of cellular diversity.

  10. ΔN-P63α and TA-P63α exhibit intrinsic differences in transactivation specificities that depend on distinct features of DNA target sites

    PubMed Central

    Foggetti, Giorgia; Raimondi, Ivan; Campomenosi, Paola; Menichini, Paola

    2014-01-01

    TP63 is a member of the TP53 gene family that encodes for up to ten different TA and ΔN isoforms through alternative promoter usage and alternative splicing. Besides being a master regulator of gene expression for squamous epithelial proliferation, differentiation and maintenance, P63, through differential expression of its isoforms, plays important roles in tumorigenesis. All P63 isoforms share an immunoglobulin-like folded DNA binding domain responsible for binding to sequence-specific response elements (REs), whose overall consensus sequence is similar to that of the canonical p53 RE. Using a defined assay in yeast, where P63 isoforms and RE sequences are the only variables, and gene expression assays in human cell lines, we demonstrated that human TA- and ΔN-P63α proteins exhibited differences in transactivation specificity not observed with the corresponding P73 or P53 protein isoforms. These differences 1) were dependent on specific features of the RE sequence, 2) could be related to intrinsic differences in their oligomeric state and cooperative DNA binding, and 3) appeared to be conserved in evolution. Since genotoxic stress can change relative ratio of TA- and ΔN-P63α protein levels, the different transactivation specificity of each P63 isoform could potentially influence cellular responses to specific stresses. PMID:24926492

  11. Variables and Strategies in Development of Therapeutic Post-Transcriptional Gene Silencing Agents

    PubMed Central

    Sullivan, Jack M.; Yau, Edwin H.; Kolniak, Tiffany A.; Sheflin, Lowell G.; Taggart, R. Thomas; Abdelmaksoud, Heba E.

    2011-01-01

    Post-transcriptional gene silencing (PTGS) agents such as ribozymes, RNAi and antisense have substantial potential for gene therapy of human retinal degenerations. These technologies are used to knockdown a specific target RNA and its cognate protein. The disease target mRNA may be a mutant mRNA causing an autosomal dominant retinal degeneration or a normal mRNA that is overexpressed in certain diseases. All PTGS technologies depend upon the initial critical annealing event of the PTGS ligand to the target RNA. This event requires that the PTGS agent is in a conformational state able to support hybridization and that the target have a large and accessible single-stranded platform to allow rapid annealing, although such platforms are rare. We address the biocomplexity that currently limits PTGS therapeutic development with particular emphasis on biophysical variables that influence cellular performance. We address the different strategies that can be used for development of PTGS agents intended for therapeutic translation. These issues apply generally to the development of PTGS agents for retinal, ocular, or systemic diseases. This review should assist the interested reader to rapidly appreciate critical variables in PTGS development and facilitate initial design and testing of such agents against new targets of clinical interest. PMID:21785698

  12. Individuals with Type 1 and Type 2 Diabetes Mellitus Trade Increased Hyperglycemia for Decreased Hypoglycemia When Glycemic Variability is not Improved.

    PubMed

    Jangam, Sujit R; Hayter, Gary; Dunn, Timothy C

    2018-02-01

    Glycemic variability refers to oscillations in blood glucose within a day and differences in blood glucose at the same time on different days. Glycemic variability is linked to hypoglycemia and hyperglycemia. The relationship among these three important metrics is examined here, specifically to show how reduction in both hypo- and hyperglycemia risk is dependent on changes in variability. To understand the importance of glycemic variability in the simultaneous reduction of hypoglycemia and hyperglycemia risk, we introduce the glycemic risk plot-estimated HbA1c % (eA1c) vs. minutes below 70 mg/dl (MB70) with constant variability contours for predicting post-intervention risks in the absence of a change in glycemic variability. The glycemic risk plot illustrates that individuals who do not reduce glycemic variability improve one of the two metrics (hypoglycemia risk or hyperglycemia risk) at the cost of the other. It is important to reduce variability to improve both risks. These results were confirmed by data collected in a randomized controlled trial consisting of individuals with type 1 and type 2 diabetes on insulin therapy. For type 1, a total of 28 individuals out of 35 (80%) showed improvement in at least one of the risks (hypo and/or hyper) during the 100-day course of the study. Seven individuals (20%) showed improvement in both. Similar data were observed for type 2 where a total of 36 individuals out of 43 (84%) showed improvement in at least one risk and 8 individuals (19%) showed improvement in both. All individuals in the study who showed improvement in both hypoglycemia and hyperglycemia risk also showed a reduction in variability. Therapy changes intended to improve an individual's hypoglycemia or hyperglycemia risk often result in the reduction of one risk at the expense of another. It is important to improve glucose variability to reduce both risks or at least maintain one risk while reducing the other. Abbott Diabetes Care.

  13. Catecholaminergic systems in stress: structural and molecular genetic approaches.

    PubMed

    Kvetnansky, Richard; Sabban, Esther L; Palkovits, Miklos

    2009-04-01

    Stressful stimuli evoke complex endocrine, autonomic, and behavioral responses that are extremely variable and specific depending on the type and nature of the stressors. We first provide a short overview of physiology, biochemistry, and molecular genetics of sympatho-adrenomedullary, sympatho-neural, and brain catecholaminergic systems. Important processes of catecholamine biosynthesis, storage, release, secretion, uptake, reuptake, degradation, and transporters in acutely or chronically stressed organisms are described. We emphasize the structural variability of catecholamine systems and the molecular genetics of enzymes involved in biosynthesis and degradation of catecholamines and transporters. Characterization of enzyme gene promoters, transcriptional and posttranscriptional mechanisms, transcription factors, gene expression and protein translation, as well as different phases of stress-activated transcription and quantitative determination of mRNA levels in stressed organisms are discussed. Data from catecholamine enzyme gene knockout mice are shown. Interaction of catecholaminergic systems with other neurotransmitter and hormonal systems are discussed. We describe the effects of homotypic and heterotypic stressors, adaptation and maladaptation of the organism, and the specificity of stressors (physical, emotional, metabolic, etc.) on activation of catecholaminergic systems at all levels from plasma catecholamines to gene expression of catecholamine enzymes. We also discuss cross-adaptation and the effect of novel heterotypic stressors on organisms adapted to long-term monotypic stressors. The extra-adrenal nonneuronal adrenergic system is described. Stress-related central neuronal regulatory circuits and central organization of responses to various stressors are presented with selected examples of regulatory molecular mechanisms. Data summarized here indicate that catecholaminergic systems are activated in different ways following exposure to distinct stressful stimuli.

  14. Gene set analysis using variance component tests.

    PubMed

    Huang, Yen-Tsung; Lin, Xihong

    2013-06-28

    Gene set analyses have become increasingly important in genomic research, as many complex diseases are contributed jointly by alterations of numerous genes. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). We develop a gene set analyses method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and global test in both simulation and a diabetes microarray data.

  15. Partial least squares based identification of Duchenne muscular dystrophy specific genes.

    PubMed

    An, Hui-bo; Zheng, Hua-cheng; Zhang, Li; Ma, Lin; Liu, Zheng-yan

    2013-11-01

    Large-scale parallel gene expression analysis has provided a greater ease for investigating the underlying mechanisms of Duchenne muscular dystrophy (DMD). Previous studies typically implemented variance/regression analysis, which would be fundamentally flawed when unaccounted sources of variability in the arrays existed. Here we aim to identify genes that contribute to the pathology of DMD using partial least squares (PLS) based analysis. We carried out PLS-based analysis with two datasets downloaded from the Gene Expression Omnibus (GEO) database to identify genes contributing to the pathology of DMD. Except for the genes related to inflammation, muscle regeneration and extracellular matrix (ECM) modeling, we found some genes with high fold change, which have not been identified by previous studies, such as SRPX, GPNMB, SAT1, and LYZ. In addition, downregulation of the fatty acid metabolism pathway was found, which may be related to the progressive muscle wasting process. Our results provide a better understanding for the downstream mechanisms of DMD.

  16. Generation of a mouse scFv library specific for porcine aminopeptidase N using the T7 phage display system.

    PubMed

    Sun, Dongbo; Shi, Hongyan; Chen, Jianfei; Shi, Da; Zhu, Qinghe; Zhang, Hong; Liu, Shengwang; Wang, Yunfeng; Qiu, Huaji; Feng, Li

    2012-06-01

    Porcine aminopeptidase N (pAPN) is a common cellular receptor for swine transmissible gastroenteritis virus (TGEV) and porcine epidemic diarrhea virus (PEDV). To investigate single-chain fragment variable (scFv) repertoire against pAPN, the genes encoding the immunoglobulin light chain variable region (VL) and heavy chain variable region (VH) were amplified by reverse transcript polymerase chain reaction (RT-PCR) using a series of degenerate primers from the spleen of BABL/c mice immunized with native pAPN. The VL and VH amplicons were combined randomly by a 12 amino acid flexible linker by splicing by overlap extension PCR (SOE-PCR), which produced the scFv gene repertoire. After ligation of the scFv gene repertoire into the T7Select10-3b vector, a mouse scFv phage library specific for pAPN was produced through in vitro packaging. The primary scFv library against pAPN contained 2.0×10(7) recombinant phage clones, and the titer of the amplified library was 3.6×10(9)pfu/mL. BstNI restriction analysis and DNA sequencing revealed that 28 phage clones from the primary pAPN scFv library showed excellent diversity. The effectiveness of the scFv library against pAPN was verified further by phage ELISA using the recombinant protein of the pAPN C subunit as coating antigen. The construction and evaluation of a murine scFv library against the common receptor pAPN of porcine coronaviruses TGEV and PEDV using the T7 phage display system are described. Copyright © 2012 Elsevier B.V. All rights reserved.

  17. Short and long-term genome stability analysis of prokaryotic genomes.

    PubMed

    Brilli, Matteo; Liò, Pietro; Lacroix, Vincent; Sagot, Marie-France

    2013-05-08

    Gene organization dynamics is actively studied because it provides useful evolutionary information, makes functional annotation easier and often enables to characterize pathogens. There is therefore a strong interest in understanding the variability of this trait and the possible correlations with life-style. Two kinds of events affect genome organization: on one hand translocations and recombinations change the relative position of genes shared by two genomes (i.e. the backbone gene order); on the other, insertions and deletions leave the backbone gene order unchanged but they alter the gene neighborhoods by breaking the syntenic regions. A complete picture about genome organization evolution therefore requires to account for both kinds of events. We developed an approach where we model chromosomes as graphs on which we compute different stability estimators; we consider genome rearrangements as well as the effect of gene insertions and deletions. In a first part of the paper, we fit a measure of backbone gene order conservation (hereinafter called backbone stability) against phylogenetic distance for over 3000 genome comparisons, improving existing models for the divergence in time of backbone stability. Intra- and inter-specific comparisons were treated separately to focus on different time-scales. The use of multiple genomes of a same species allowed to identify genomes with diverging gene order with respect to their conspecific. The inter-species analysis indicates that pathogens are more often unstable with respect to non-pathogens. In a second part of the text, we show that in pathogens, gene content dynamics (insertions and deletions) have a much more dramatic effect on genome organization stability than backbone rearrangements. In this work, we studied genome organization divergence taking into account the contribution of both genome order rearrangements and genome content dynamics. By studying species with multiple sequenced genomes available, we were able to explore genome organization stability at different time-scales and to find significant differences for pathogen and non-pathogen species. The output of our framework also allows to identify the conserved gene clusters and/or partial occurrences thereof, making possible to explore how gene clusters assembled during evolution.

  18. Production, characteristics and applications of the cell-bound phytase of Pichia anomala.

    PubMed

    Vohra, Ashima; Kaur, Parvinder; Satyanarayana, T

    2011-01-01

    Among several yeasts isolated from dried flowers of Woodfordia fruticosa, Pichia anomala produced a high titre of cell-bound phytase. The optimization of fermentation variables led to formulation of media and selection of cultural variables that supported enhanced phytase production. The enzyme productivity was very high in fed batch fermentation in air-lift fermentor as compared to that in stirred tank fermentor. Amelioration in the cell-bound phytase activity was observed when yeast cells were permeabilized with Triton-X-100. The enzyme is thermostable and acid stable with broad substrate specificity, the characteristics that are desirable for enzymes to be used in the animal feed industry. The phytase-encoding gene was cloned and sequenced. The 3D structure of the enzyme was proposed by comparative modeling using phytase of Debaryomyces occidentalis (50% sequence identity) as template. When broiler chicks, and fresh water and marine fishes were fed with the feed supplemented with yeast biomass containing phytase, improvement in growth and phosphorus retention, and decrease in the excretion of phosphorus in the faeces were recorded. The cell-bound phytase of P. anomala could effectively dephytinize wheat flour and soymilk.

  19. Landscape of Conditional eQTL in Dorsolateral Prefrontal Cortex and Co-localization with Schizophrenia GWAS.

    PubMed

    Dobbyn, Amanda; Huckins, Laura M; Boocock, James; Sloofman, Laura G; Glicksberg, Benjamin S; Giambartolomei, Claudia; Hoffman, Gabriel E; Perumal, Thanneer M; Girdhar, Kiran; Jiang, Yan; Raj, Towfique; Ruderfer, Douglas M; Kramer, Robin S; Pinto, Dalila; Akbarian, Schahram; Roussos, Panos; Domenici, Enrico; Devlin, Bernie; Sklar, Pamela; Stahl, Eli A; Sieberts, Solveig K

    2018-06-07

    Causal genes and variants within genome-wide association study (GWAS) loci can be identified by integrating GWAS statistics with expression quantitative trait loci (eQTL) and determining which variants underlie both GWAS and eQTL signals. Most analyses, however, consider only the marginal eQTL signal, rather than dissect this signal into multiple conditionally independent signals for each gene. Here we show that analyzing conditional eQTL signatures, which could be important under specific cellular or temporal contexts, leads to improved fine mapping of GWAS associations. Using genotypes and gene expression levels from post-mortem human brain samples (n = 467) reported by the CommonMind Consortium (CMC), we find that conditional eQTL are widespread; 63% of genes with primary eQTL also have conditional eQTL. In addition, genomic features associated with conditional eQTL are consistent with context-specific (e.g., tissue-, cell type-, or developmental time point-specific) regulation of gene expression. Integrating the 2014 Psychiatric Genomics Consortium schizophrenia (SCZ) GWAS and CMC primary and conditional eQTL data reveals 40 loci with strong evidence for co-localization (posterior probability > 0.8), including six loci with co-localization of conditional eQTL. Our co-localization analyses support previously reported genes, identify novel genes associated with schizophrenia risk, and provide specific hypotheses for their functional follow-up. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  20. Combining Evidence of Preferential Gene-Tissue Relationships from Multiple Sources

    PubMed Central

    Guo, Jing; Hammar, Mårten; Öberg, Lisa; Padmanabhuni, Shanmukha S.; Bjäreland, Marcus; Dalevi, Daniel

    2013-01-01

    An important challenge in drug discovery and disease prognosis is to predict genes that are preferentially expressed in one or a few tissues, i.e. showing a considerably higher expression in one tissue(s) compared to the others. Although several data sources and methods have been published explicitly for this purpose, they often disagree and it is not evident how to retrieve these genes and how to distinguish true biological findings from those that are due to choice-of-method and/or experimental settings. In this work we have developed a computational approach that combines results from multiple methods and datasets with the aim to eliminate method/study-specific biases and to improve the predictability of preferentially expressed human genes. A rule-based score is used to merge and assign support to the results. Five sets of genes with known tissue specificity were used for parameter pruning and cross-validation. In total we identify 3434 tissue-specific genes. We compare the genes of highest scores with the public databases: PaGenBase (microarray), TiGER (EST) and HPA (protein expression data). The results have 85% overlap to PaGenBase, 71% to TiGER and only 28% to HPA. 99% of our predictions have support from at least one of these databases. Our approach also performs better than any of the databases on identifying drug targets and biomarkers with known tissue-specificity. PMID:23950964

  1. Gender-specific responses to climate variability in a semi-arid ecosystem in northern Benin.

    PubMed

    Dah-Gbeto, Afiavi P; Villamor, Grace B

    2016-12-01

    Highly erratic rainfall patterns in northern Benin complicate the ability of rural farmers to engage in subsistence agriculture. This research explores gender-specific responses to climate variability in the context of agrarian Benin through a household survey (n = 260) and an experimental gaming exercise among a subset of the survey respondents. Although men and women from the sample population are equally aware of climate variability and share similar coping strategies, their specific land-use strategies, preferences, and motivations are distinct. Over the long term, these differences would likely lead to dissimilar coping strategies and vulnerability to the effects of climate change. Examination of gender-specific land-use responses to climate change and anticipatory learning can enhance efforts to improve adaptability and resilience among rural subsistence farmers.

  2. Aging Shapes the Population-Mean and -Dispersion of Gene Expression in Human Brains

    PubMed Central

    Brinkmeyer-Langford, Candice L.; Guan, Jinting; Ji, Guoli; Cai, James J.

    2016-01-01

    Human aging is associated with cognitive decline and an increased risk of neurodegenerative disease. Our objective for this study was to evaluate potential relationships between age and variation in gene expression across different regions of the brain. We analyzed the Genotype-Tissue Expression (GTEx) data from 54 to 101 tissue samples across 13 brain regions in post-mortem donors of European descent aged between 20 and 70 years at death. After accounting for the effects of covariates and hidden confounding factors, we identified 1446 protein-coding genes whose expression in one or more brain regions is correlated with chronological age at a false discovery rate of 5%. These genes are involved in various biological processes including apoptosis, mRNA splicing, amino acid biosynthesis, and neurotransmitter transport. The distribution of these genes among brain regions is uneven, suggesting variable regional responses to aging. We also found that the aging response of many genes, e.g., TP37 and C1QA, depends on individuals' genotypic backgrounds. Finally, using dispersion-specific analysis, we identified genes such as IL7R, MS4A4E, and TERF1/TERF2 whose expressions are differentially dispersed by aging, i.e., variances differ between age groups. Our results demonstrate that age-related gene expression is brain region-specific, genotype-dependent, and associated with both mean and dispersion changes. Our findings provide a foundation for more sophisticated gene expression modeling in the studies of age-related neurodegenerative diseases. PMID:27536236

  3. Database of cattle candidate genes and genetic markers for milk production and mastitis

    PubMed Central

    Ogorevc, J; Kunej, T; Razpet, A; Dovc, P

    2009-01-01

    A cattle database of candidate genes and genetic markers for milk production and mastitis has been developed to provide an integrated research tool incorporating different types of information supporting a genomic approach to study lactation, udder development and health. The database contains 943 genes and genetic markers involved in mammary gland development and function, representing candidates for further functional studies. The candidate loci were drawn on a genetic map to reveal positional overlaps. For identification of candidate loci, data from seven different research approaches were exploited: (i) gene knockouts or transgenes in mice that result in specific phenotypes associated with mammary gland (143 loci); (ii) cattle QTL for milk production (344) and mastitis related traits (71); (iii) loci with sequence variations that show specific allele-phenotype interactions associated with milk production (24) or mastitis (10) in cattle; (iv) genes with expression profiles associated with milk production (207) or mastitis (107) in cattle or mouse; (v) cattle milk protein genes that exist in different genetic variants (9); (vi) miRNAs expressed in bovine mammary gland (32) and (vii) epigenetically regulated cattle genes associated with mammary gland function (1). Fourty-four genes found by multiple independent analyses were suggested as the most promising candidates and were further in silico analysed for expression levels in lactating mammary gland, genetic variability and top biological functions in functional networks. A miRNA target search for mammary gland expressed miRNAs identified 359 putative binding sites in 3′UTRs of candidate genes. PMID:19508288

  4. Coupled Ethical–Epistemic Analysis of Public Health Research and Practice: Categorizing Variables to Improve Population Health and Equity

    PubMed Central

    Katikireddi, S. Vittal; Valles, Sean A

    2015-01-01

    The categorization of variables can stigmatize populations, which is ethically problematic and threatens the central purpose of public health: to improve population health and reduce health inequities. How social variables (e.g., behavioral risks for HIV) are categorized can reinforce stigma and cause unintended harms to the populations practitioners and researchers strive to serve. Although debates about the validity or ethical consequences of epidemiological variables are familiar for specific variables (e.g., ethnicity), these issues apply more widely. We argue that these tensions and debates regarding epidemiological variables should be analyzed simultaneously as ethical and epistemic challenges. We describe a framework derived from the philosophy of science that may be usefully applied to public health, and we illustrate its application. PMID:25393193

  5. Phenotypes of Recessive Pediatric Cataract in a Cohort of Children with Identified Homozygous Gene Mutations (An American Ophthalmological Society Thesis)

    PubMed Central

    Khan, Arif O.; Aldahmesh, Mohammed A.; Alkuraya, Fowzan S.

    2015-01-01

    Purpose: To assess for phenotype-genotype correlations in families with recessive pediatric cataract and identified gene mutations. Methods: Retrospective review (2004 through 2013) of 26 Saudi Arabian apparently nonsyndromic pediatric cataract families referred to one of the authors (A.O.K.) and for which recessive gene mutations were identified. Results: Fifteen different homozygous recessive gene mutations were identified in the 26 consanguineous families; two genes and five families are novel to this study. Ten families had a founder CRYBB1 deletion (all with bilateral central pulverulent cataract), two had the same missense mutation in CRYAB (both with bilateral juvenile cataract with marked variable expressivity), and two had different mutations in FYCO1 (both with bilateral posterior capsular abnormality). The remaining 12 families each had mutations in 12 different genes (CRYAA, CRYBA1, AKR1E2, AGK, BFSP2, CYP27A1, CYP51A1, EPHA2, GCNT2, LONP1, RNLS, WDR87) with unique phenotypes noted for CYP27A1 (bilateral juvenile fleck with anterior and/or posterior capsular cataract and later cerebrotendinous xanthomatosis), EPHA2 (bilateral anterior persistent fetal vasculature), and BFSP2 (bilateral flecklike with cloudy cortex). Potential carrier signs were documented for several families. Conclusions: In this recessive pediatric cataract case series most identified genes are noncrystallin. Recessive pediatric cataract phenotypes are generally nonspecific, but some notable phenotypes are distinct and associated with specific gene mutations. Marked variable expressivity can occur from a recessive missense CRYAB mutation. Genetic analysis of apparently isolated pediatric cataract can sometimes uncover mutations in a syndromic gene. Some gene mutations seem to be associated with apparent heterozygous carrier signs. PMID:26622071

  6. Gene expression in gastrointestinal stromal tumors is distinguished by KIT genotype and anatomic site.

    PubMed

    Antonescu, Cristina R; Viale, Agnes; Sarran, Lisa; Tschernyavsky, Sylvia J; Gonen, Mithat; Segal, Neil H; Maki, Robert G; Socci, Nicholas D; DeMatteo, Ronald P; Besmer, Peter

    2004-05-15

    Gastrointestinal stromal tumors (GISTs) are specific KIT expressing and KIT-signaling driven mesenchymal tumors of the human digestive tract, many of which have KIT-activating mutations. Previous studies have found a relatively homogeneous gene expression profile in GIST, as compared with other histological types of sarcomas. Transcriptional heterogeneity within clinically or molecularly defined subsets of GISTs has not been previously reported. We tested the hypothesis that the gene expression profile in GISTs might be related to KIT genotype and possibly to other clinicopathological factors. An HG-U133A Affymetrix chip (22,000 genes) platform was used to determine the variability of gene expression in 28 KIT-expressing GIST samples from 24 patients. A control group of six intra-abdominal leiomyosarcomas was also included for comparison. Statistical analyses (t tests) were performed to identify discriminatory gene lists among various GIST subgroups. The levels of expression of various GIST subsets were also linked to a modified version of the growth factor/KIT signaling pathway to analyze differences at various steps in signal transduction. Genes involved in KIT signaling were differentially expressed among wild-type and mutant GISTs. High gene expression of potential drug targets, such as VEGF, MCSF, and BCL2 in the wild-type group, and Mesothelin in exon 9 GISTs were found. There was a striking difference in gene expression between stomach and small bowel GISTs. This finding was validated in four separate tumors, two gastric and two intestinal, from a patient with familial GIST with a germ-line KIT W557R substitution. GISTs have heterogeneous gene expression depending on KIT genotype and tumor location, which is seen at both the genomic level and the KIT signaling pathway in particular. These findings may explain their variable clinical behavior and response to therapy.

  7. Two different secondary metabolism gene clusters occupied the same ancestral locus in fungal dermatophytes of the arthrodermataceae.

    PubMed

    Zhang, Han; Rokas, Antonis; Slot, Jason C

    2012-01-01

    Dermatophyte fungi of the family Arthrodermataceae (Eurotiomycetes) colonize keratinized tissue, such as skin, frequently causing superficial mycoses in humans and other mammals, reptiles, and birds. Competition with native microflora likely underlies the propensity of these dermatophytes to produce a diversity of antibiotics and compounds for scavenging iron, which is extremely scarce, as well as the presence of an unusually large number of putative secondary metabolism gene clusters, most of which contain non-ribosomal peptide synthetases (NRPS), in their genomes. To better understand the historical origins and diversification of NRPS-containing gene clusters we examined the evolution of a variable locus (VL) that exists in one of three alternative conformations among the genomes of seven dermatophyte species. The first conformation of the VL (termed VLA) contains only 539 base pairs of sequence and lacks protein-coding genes, whereas the other two conformations (termed VLB and VLC) span 36 Kb and 27 Kb and contain 12 and 10 genes, respectively. Interestingly, both VLB and VLC appear to contain distinct secondary metabolism gene clusters; VLB contains a NRPS gene as well as four porphyrin metabolism genes never found to be physically linked in the genomes of 128 other fungal species, whereas VLC also contains a NRPS gene as well as several others typically found associated with secondary metabolism gene clusters. Phylogenetic evidence suggests that the VL locus was present in the ancestor of all seven species achieving its present distribution through subsequent differential losses or retentions of specific conformations. We propose that the existence of variable loci, similar to the one we studied, in fungal genomes could potentially explain the dramatic differences in secondary metabolic diversity between closely related species of filamentous fungi, and contribute to host adaptation and the generation of metabolic diversity.

  8. Cell-Type Specific Features of Circular RNA Expression

    PubMed Central

    Salzman, Julia; Chen, Raymond E.; Olsen, Mari N.; Wang, Peter L.; Brown, Patrick O.

    2013-01-01

    Thousands of loci in the human and mouse genomes give rise to circular RNA transcripts; at many of these loci, the predominant RNA isoform is a circle. Using an improved computational approach for circular RNA identification, we found widespread circular RNA expression in Drosophila melanogaster and estimate that in humans, circular RNA may account for 1% as many molecules as poly(A) RNA. Analysis of data from the ENCODE consortium revealed that the repertoire of genes expressing circular RNA, the ratio of circular to linear transcripts for each gene, and even the pattern of splice isoforms of circular RNAs from each gene were cell-type specific. These results suggest that biogenesis of circular RNA is an integral, conserved, and regulated feature of the gene expression program. PMID:24039610

  9. The genome sequence of taurine cattle: a window to ruminant biology and evolution.

    PubMed

    Elsik, Christine G; Tellam, Ross L; Worley, Kim C; Gibbs, Richard A; Muzny, Donna M; Weinstock, George M; Adelson, David L; Eichler, Evan E; Elnitski, Laura; Guigó, Roderic; Hamernik, Debora L; Kappes, Steve M; Lewin, Harris A; Lynn, David J; Nicholas, Frank W; Reymond, Alexandre; Rijnkels, Monique; Skow, Loren C; Zdobnov, Evgeny M; Schook, Lawrence; Womack, James; Alioto, Tyler; Antonarakis, Stylianos E; Astashyn, Alex; Chapple, Charles E; Chen, Hsiu-Chuan; Chrast, Jacqueline; Câmara, Francisco; Ermolaeva, Olga; Henrichsen, Charlotte N; Hlavina, Wratko; Kapustin, Yuri; Kiryutin, Boris; Kitts, Paul; Kokocinski, Felix; Landrum, Melissa; Maglott, Donna; Pruitt, Kim; Sapojnikov, Victor; Searle, Stephen M; Solovyev, Victor; Souvorov, Alexandre; Ucla, Catherine; Wyss, Carine; Anzola, Juan M; Gerlach, Daniel; Elhaik, Eran; Graur, Dan; Reese, Justin T; Edgar, Robert C; McEwan, John C; Payne, Gemma M; Raison, Joy M; Junier, Thomas; Kriventseva, Evgenia V; Eyras, Eduardo; Plass, Mireya; Donthu, Ravikiran; Larkin, Denis M; Reecy, James; Yang, Mary Q; Chen, Lin; Cheng, Ze; Chitko-McKown, Carol G; Liu, George E; Matukumalli, Lakshmi K; Song, Jiuzhou; Zhu, Bin; Bradley, Daniel G; Brinkman, Fiona S L; Lau, Lilian P L; Whiteside, Matthew D; Walker, Angela; Wheeler, Thomas T; Casey, Theresa; German, J Bruce; Lemay, Danielle G; Maqbool, Nauman J; Molenaar, Adrian J; Seo, Seongwon; Stothard, Paul; Baldwin, Cynthia L; Baxter, Rebecca; Brinkmeyer-Langford, Candice L; Brown, Wendy C; Childers, Christopher P; Connelley, Timothy; Ellis, Shirley A; Fritz, Krista; Glass, Elizabeth J; Herzig, Carolyn T A; Iivanainen, Antti; Lahmers, Kevin K; Bennett, Anna K; Dickens, C Michael; Gilbert, James G R; Hagen, Darren E; Salih, Hanni; Aerts, Jan; Caetano, Alexandre R; Dalrymple, Brian; Garcia, Jose Fernando; Gill, Clare A; Hiendleder, Stefan G; Memili, Erdogan; Spurlock, Diane; Williams, John L; Alexander, Lee; Brownstein, Michael J; Guan, Leluo; Holt, Robert A; Jones, Steven J M; Marra, Marco A; Moore, Richard; Moore, Stephen S; Roberts, Andy; Taniguchi, Masaaki; Waterman, Richard C; Chacko, Joseph; Chandrabose, Mimi M; Cree, Andy; Dao, Marvin Diep; Dinh, Huyen H; Gabisi, Ramatu Ayiesha; Hines, Sandra; Hume, Jennifer; Jhangiani, Shalini N; Joshi, Vandita; Kovar, Christie L; Lewis, Lora R; Liu, Yih-Shin; Lopez, John; Morgan, Margaret B; Nguyen, Ngoc Bich; Okwuonu, Geoffrey O; Ruiz, San Juana; Santibanez, Jireh; Wright, Rita A; Buhay, Christian; Ding, Yan; Dugan-Rocha, Shannon; Herdandez, Judith; Holder, Michael; Sabo, Aniko; Egan, Amy; Goodell, Jason; Wilczek-Boney, Katarzyna; Fowler, Gerald R; Hitchens, Matthew Edward; Lozado, Ryan J; Moen, Charles; Steffen, David; Warren, James T; Zhang, Jingkun; Chiu, Readman; Schein, Jacqueline E; Durbin, K James; Havlak, Paul; Jiang, Huaiyang; Liu, Yue; Qin, Xiang; Ren, Yanru; Shen, Yufeng; Song, Henry; Bell, Stephanie Nicole; Davis, Clay; Johnson, Angela Jolivet; Lee, Sandra; Nazareth, Lynne V; Patel, Bella Mayurkumar; Pu, Ling-Ling; Vattathil, Selina; Williams, Rex Lee; Curry, Stacey; Hamilton, Cerissa; Sodergren, Erica; Wheeler, David A; Barris, Wes; Bennett, Gary L; Eggen, André; Green, Ronnie D; Harhay, Gregory P; Hobbs, Matthew; Jann, Oliver; Keele, John W; Kent, Matthew P; Lien, Sigbjørn; McKay, Stephanie D; McWilliam, Sean; Ratnakumar, Abhirami; Schnabel, Robert D; Smith, Timothy; Snelling, Warren M; Sonstegard, Tad S; Stone, Roger T; Sugimoto, Yoshikazu; Takasuga, Akiko; Taylor, Jeremy F; Van Tassell, Curtis P; Macneil, Michael D; Abatepaulo, Antonio R R; Abbey, Colette A; Ahola, Virpi; Almeida, Iassudara G; Amadio, Ariel F; Anatriello, Elen; Bahadue, Suria M; Biase, Fernando H; Boldt, Clayton R; Carroll, Jeffery A; Carvalho, Wanessa A; Cervelatti, Eliane P; Chacko, Elsa; Chapin, Jennifer E; Cheng, Ye; Choi, Jungwoo; Colley, Adam J; de Campos, Tatiana A; De Donato, Marcos; Santos, Isabel K F de Miranda; de Oliveira, Carlo J F; Deobald, Heather; Devinoy, Eve; Donohue, Kaitlin E; Dovc, Peter; Eberlein, Annett; Fitzsimmons, Carolyn J; Franzin, Alessandra M; Garcia, Gustavo R; Genini, Sem; Gladney, Cody J; Grant, Jason R; Greaser, Marion L; Green, Jonathan A; Hadsell, Darryl L; Hakimov, Hatam A; Halgren, Rob; Harrow, Jennifer L; Hart, Elizabeth A; Hastings, Nicola; Hernandez, Marta; Hu, Zhi-Liang; Ingham, Aaron; Iso-Touru, Terhi; Jamis, Catherine; Jensen, Kirsty; Kapetis, Dimos; Kerr, Tovah; Khalil, Sari S; Khatib, Hasan; Kolbehdari, Davood; Kumar, Charu G; Kumar, Dinesh; Leach, Richard; Lee, Justin C-M; Li, Changxi; Logan, Krystin M; Malinverni, Roberto; Marques, Elisa; Martin, William F; Martins, Natalia F; Maruyama, Sandra R; Mazza, Raffaele; McLean, Kim L; Medrano, Juan F; Moreno, Barbara T; Moré, Daniela D; Muntean, Carl T; Nandakumar, Hari P; Nogueira, Marcelo F G; Olsaker, Ingrid; Pant, Sameer D; Panzitta, Francesca; Pastor, Rosemeire C P; Poli, Mario A; Poslusny, Nathan; Rachagani, Satyanarayana; Ranganathan, Shoba; Razpet, Andrej; Riggs, Penny K; Rincon, Gonzalo; Rodriguez-Osorio, Nelida; Rodriguez-Zas, Sandra L; Romero, Natasha E; Rosenwald, Anne; Sando, Lillian; Schmutz, Sheila M; Shen, Libing; Sherman, Laura; Southey, Bruce R; Lutzow, Ylva Strandberg; Sweedler, Jonathan V; Tammen, Imke; Telugu, Bhanu Prakash V L; Urbanski, Jennifer M; Utsunomiya, Yuri T; Verschoor, Chris P; Waardenberg, Ashley J; Wang, Zhiquan; Ward, Robert; Weikard, Rosemarie; Welsh, Thomas H; White, Stephen N; Wilming, Laurens G; Wunderlich, Kris R; Yang, Jianqi; Zhao, Feng-Qi

    2009-04-24

    To understand the biology and evolution of ruminants, the cattle genome was sequenced to about sevenfold coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1217 are absent or undetected in noneutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides a resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production.

  10. Correlation of Shiga Toxin Gene Frequency with Commonly Used Microbial Indicators of Recreational Water Quality▿ †

    PubMed Central

    Smith, Cody J.; Olszewski, Adam M.; Mauro, Steven A.

    2009-01-01

    Shiga toxin (Stx) genes produce proteins that are pathogenic to humans, leading to severe gastrointestinal illness. This work focuses on examining the abundance and distribution of stx genes in relation to common microbial indicators in beach water and streams in the vicinity of Presque Isle State Park in Erie, PA. By use of quantitative PCR, the relative abundance levels of stx DNA in over 700 samples in the sampling area were determined. The results demonstrate that the abundance and distribution of stx genes are variable and do not correlate with the abundance of Escherichia coli bacteria, enterococci, or viral particles. These results suggest that microbial indicators of water quality are not adequate in predicting the occurrence of organisms that harbor stx genes and highlight the need for standardized pathogen-specific detection protocols for waters utilized for recreational swimming. PMID:19011065

  11. Is Each Light-Harvesting Complex Protein Important for Plant Fitness?1[w

    PubMed Central

    Ganeteg, Ulrika; Külheim, Carsten; Andersson, Jenny; Jansson, Stefan

    2004-01-01

    Many of the photosynthetic genes are conserved among all higher plants, indicating that there is strong selective pressure to maintain the genes of each protein. However, mutants of these genes often lack visible growth phenotypes, suggesting that they are important only under certain conditions or have overlapping functions. To assess the importance of specific genes encoding the light-harvesting complex (LHC) proteins for the survival of the plant in the natural environment, we have combined two different scientific traditions by using an ecological fitness assay on a set of genetically modified Arabidopsis plants with differing LHC protein contents. The fitness of all of the LHC-deficient plants was reduced in some of the growth environments, supporting the hypothesis that each of the genes has been conserved because they provide ecological flexibility, which is of great adaptive value given the highly variable conditions encountered in nature. PMID:14730076

  12. Smoking and diabetes. Epigenetics involvement in osseointegration.

    PubMed

    Razzouk, Sleiman; Sarkis, Rami

    2013-03-01

    Bone quality is a poorly defined parameter for successful implant placement, which largely depends upon many environmental and genetic factors unique to every individual. Smoking and diabetes are among the environmental factors that most impact osseointegration. However, there is an inter-individual variability of bone response in smokers and diabetic patients. Recent data on gene-environment interactions highlight the major role of epigenetic changes to induce a specific phenotype. Histone acetylation and DNA methylation are the main events that occur and modulate the gene expression. In this paper, we emphasize the impact of epigenetics on diabetes and smoking and describe their significance in bone healing. Also, we underscore the importance of adopting a new approach in clinical management for implant placement by customizing the treatment according to the patient's specific characteristics.

  13. Adaptive molecular evolution of the Major Histocompatibility Complex genes, DRA and DQA, in the genus Equus

    PubMed Central

    2011-01-01

    Background Major Histocompatibility Complex (MHC) genes are central to vertebrate immune response and are believed to be under balancing selection by pathogens. This hypothesis has been supported by observations of extremely high polymorphism, elevated nonsynonymous to synonymous base pair substitution rates and trans-species polymorphisms at these loci. In equids, the organization and variability of this gene family has been described, however the full extent of diversity and selection is unknown. As selection is not expected to act uniformly on a functional gene, maximum likelihood codon-based models of selection that allow heterogeneity in selection across codon positions can be valuable for examining MHC gene evolution and the molecular basis for species adaptations. Results We investigated the evolution of two class II MHC genes of the Equine Lymphocyte Antigen (ELA), DRA and DQA, in the genus Equus with the addition of novel alleles identified in plains zebra (E. quagga, formerly E. burchelli). We found that both genes exhibited a high degree of polymorphism and inter-specific sharing of allele lineages. To our knowledge, DRA allelic diversity was discovered to be higher than has ever been observed in vertebrates. Evidence was also found to support a duplication of the DQA locus. Selection analyses, evaluated in terms of relative rates of nonsynonymous to synonymous mutations (dN/dS) averaged over the gene region, indicated that the majority of codon sites were conserved and under purifying selection (dN

  14. Genome‐scale diversity and niche adaptation analysis of Lactococcus lactis by comparative genome hybridization using multi‐strain arrays

    PubMed Central

    Siezen, Roland J.; Bayjanov, Jumamurat R.; Felis, Giovanna E.; van der Sijde, Marijke R.; Starrenburg, Marjo; Molenaar, Douwe; Wels, Michiel; van Hijum, Sacha A. F. T.; van Hylckama Vlieg, Johan E. T.

    2011-01-01

    Summary Lactococcus lactis produces lactic acid and is widely used in the manufacturing of various fermented dairy products. However, the species is also frequently isolated from non‐dairy niches, such as fermented plant material. Recently, these non‐dairy strains have gained increasing interest, as they have been described to possess flavour‐forming activities that are rarely found in dairy isolates and have diverse metabolic properties. We performed an extensive whole‐genome diversity analysis on 39 L. lactis strains, isolated from dairy and plant sources. Comparative genome hybridization analysis with multi‐strain microarrays was used to assess presence or absence of genes and gene clusters in these strains, relative to all L. lactis sequences in public databases, whereby chromosomal and plasmid‐encoded genes were computationally analysed separately. Nearly 3900 chromosomal orthologous groups (chrOGs) were defined on basis of four sequenced chromosomes of L. lactis strains (IL1403, KF147, SK11, MG1363). Of these, 1268 chrOGs are present in at least 35 strains and represent the presently known core genome of L. lactis, and 72 chrOGs appear to be unique for L. lactis. Nearly 600 and 400 chrOGs were found to be specific for either the subspecies lactis or subspecies cremoris respectively. Strain variability was found in presence or absence of gene clusters related to growth on plant substrates, such as genes involved in the consumption of arabinose, xylan, α‐galactosides and galacturonate. Further niche‐specific differences were found in gene clusters for exopolysaccharides biosynthesis, stress response (iron transport, osmotolerance) and bacterial defence mechanisms (nisin biosynthesis). Strain variability of functions encoded on known plasmids included proteolysis, lactose fermentation, citrate uptake, metal ion resistance and exopolysaccharides biosynthesis. The present study supports the view of L. lactis as a species with a very flexible genome. PMID:21338475

  15. Diverse Cis-Regulatory Mechanisms Contribute to Expression Evolution of Tandem Gene Duplicates

    PubMed Central

    Baudouin-Gonzalez, Luís; Santos, Marília A; Tempesta, Camille; Sucena, Élio; Roch, Fernando; Tanaka, Kohtaro

    2017-01-01

    Abstract Pairs of duplicated genes generally display a combination of conserved expression patterns inherited from their unduplicated ancestor and newly acquired domains. However, how the cis-regulatory architecture of duplicated loci evolves to produce these expression patterns is poorly understood. We have directly examined the gene-regulatory evolution of two tandem duplicates, the Drosophila Ly6 genes CG9336 and CG9338, which arose at the base of the drosophilids between 40 and 60 Ma. Comparing the expression patterns of the two paralogs in four Drosophila species with that of the unduplicated ortholog in the tephritid Ceratitis capitata, we show that they diverged from each other as well as from the unduplicated ortholog. Moreover, the expression divergence appears to have occurred close to the duplication event and also more recently in a lineage-specific manner. The comparison of the tissue-specific cis-regulatory modules (CRMs) controlling the paralog expression in the four Drosophila species indicates that diverse cis-regulatory mechanisms, including the novel tissue-specific enhancers, differential inactivation, and enhancer sharing, contributed to the expression evolution. Our analysis also reveals a surprisingly variable cis-regulatory architecture, in which the CRMs driving conserved expression domains change in number, location, and specificity. Altogether, this study provides a detailed historical account that uncovers a highly dynamic picture of how the paralog expression patterns and their underlying cis-regulatory landscape evolve. We argue that our findings will encourage studying cis-regulatory evolution at the whole-locus level to understand how interactions between enhancers and other regulatory levels shape the evolution of gene expression. PMID:28961967

  16. Natural history of left ventricular mechanics in transplanted hearts: relationships with clinical variables and genetic expression profiles of allograft rejection.

    PubMed

    Eleid, Mackram F; Caracciolo, Giuseppe; Cho, Eun Joo; Scott, Robert L; Steidley, D Eric; Wilansky, Susan; Arabia, Francisco A; Khandheria, Bijoy K; Sengupta, Partho P

    2010-10-01

    The aim of this study was to explore the temporal evolution of left ventricular (LV) mechanics in relation to clinical variables and genetic expression profiles implicated in cardiac allograft function. Considerable uncertainty exists regarding the range and determinants of variability in LV systolic performance in transplanted hearts (TXH). Fifty-one patients (mean age 53 ± 12 years; 37 men) underwent serial assessment of echocardiograms, cardiac catheterization, gene expression profiles, and endomyocardial biopsy data within 2 weeks and at 3, 6, 12, and 24 months after transplantation. Two-dimensional speckle-tracking data were compared between patients with TXH and 37 controls (including 12 post-coronary artery bypass patients). Post-transplantation mortality and hospitalizations were recorded with a median follow-up period of 944 days. Global longitudinal strain (LS) and radial strain remained attenuated in patients with TXH at all time points (p < 0.001 and p = 0.005), independent of clinical rejection episodes. Failure to improve global LS at 3 months (≥ 1 SD) was associated with higher incidence of death and cardiac events (hazard ratio: 5.92; 95% confidence interval: 1.96 to 17.91; p = 0.049). Multivariate analysis revealed gene expression score as the only independent predictor of global LS (R(2) = 0.53, p = 0.005), with SEMA7A gene expression having the highest correlation with global LS (r = -0.84, p < 0.001). Speckle tracking-derived LV strains are helpful in estimating the burden of LV dysfunction in patients with TXH that evolves independent of biopsy-detected cellular rejection. Failure to improve global LS at 3 months after transplantation is associated with a higher incidence of death and cardiac events. Serial changes in LV mechanics correlate with peripheral blood gene expression profiles and may affect the clinical assessment of long-term prognosis in patients with TXH. Copyright © 2010 American College of Cardiology Foundation. Published by Elsevier Inc. All rights reserved.

  17. Development and Event-specific Detection of Transgenic Glyphosate-resistant Rice Expressing the G2-EPSPS Gene

    PubMed Central

    Dong, Yufeng; Jin, Xi; Tang, Qiaoling; Zhang, Xin; Yang, Jiangtao; Liu, Xiaojing; Cai, Junfeng; Zhang, Xiaobing; Wang, Xujing; Wang, Zhixing

    2017-01-01

    Glyphosate is a widely used herbicide, due to its broad spectrum, low cost, low toxicity, high efficiency, and non-selective characteristics. Rice farmers rarely use glyphosate as a herbicide, because the crop is sensitive to this chemical. The development of transgenic glyphosate-tolerant rice could greatly improve the economics of rice production. Here, we transformed the Pseudomonas fluorescens G2 5-enolpyruvyl shikimate-3-phosphate synthase (EPSPS) gene G2-EPSPS, which conferred tolerance to glyphosate herbicide into a widely used japonica rice cultivar, Zhonghua 11 (ZH11), to develop two highly glyphosate-tolerant transgenic rice lines, G2-6 and G2-7, with one exogenous gene integration. Seed germination tests and glyphosate-tolerance assays of plants grown in a greenhouse showed that the two transgenic lines could greatly improve glyphosate-tolerance compared with the wild-type; The glyphosate-tolerance field test indicated that both transgenic lines could grow at concentrations of 20,000 ppm glyphosate, which is more than 20-times the recommended concentration in the field. Isolation of the flanking sequence of transgenic rice G2-6 indicated that the 5′-terminal of T-DNA was inserted into chromosome 8 of the rice genome. An event-specific PCR test system was established and the limit of detection of the primers reached five copies. Overall, the G2-EPSPS gene significantly improved glyphosate-tolerance in transgenic rice; furthermore, it is a useful candidate gene for the future development of commercial transgenic rice. PMID:28611804

  18. Leveraging Genetic-Background Effects in Saccharomyces cerevisiae To Improve Lignocellulosic Hydrolysate Tolerance

    DOE PAGES

    Sardi, Maria; Rovinskiy, Nikolay; Zhang, Yaoping; ...

    2016-07-22

    We report a major obstacle to sustainable lignocellulosic biofuel production is microbe inhibition by the combinatorial stresses in pretreated plant hydrolysate. Chemical biomass pretreatment releases a suite of toxins that interact with other stressors, including high osmolarity and temperature, which together can have poorly understood synergistic effects on cells. Improving tolerance in industrial strains has been hindered, in part because the mechanisms of tolerance reported in the literature often fail to recapitulate in other strain backgrounds. Here, we explored and then exploited variations in stress tolerance, toxin-induced transcriptomic responses, and fitness effects of gene overexpression in different Saccharomyces cerevisiae (yeast)more » strains to identify genes and processes linked to tolerance of hydrolysate stressors. Using six different S. cerevisiae strains that together maximized phenotypic and genetic diversity, first we explored transcriptomic differences between resistant and sensitive strains to identify common and strain-specific responses. This comparative analysis implicated primary cellular targets of hydrolysate toxins, secondary effects of defective defense strategies, and mechanisms of tolerance. Dissecting the responses to individual hydrolysate components across strains pointed to synergistic interactions between osmolarity, pH, hydrolysate toxins, and nutrient composition. By characterizing the effects of high-copy gene overexpression in three different strains, we revealed the breadth of the background-specific effects of gene fitness contributions in synthetic hydrolysate. Lastly, our approach identified new genes for engineering improved stress tolerance in diverse strains while illuminating the effects of genetic background on molecular mechanisms.« less

  19. Influence of rol genes in floriculture.

    PubMed

    Casanova, Eva; Trillas, Maria Isabel; Moysset, Lluïsa; Vainstein, Alexander

    2005-01-01

    Traditionally, new traits have been introduced into ornamental plants through classical breeding. However, genetic engineering now enables specific alterations of single traits in already successful varieties. New or improved varieties of floricultural crops can be obtained by acting on floral traits, such as color, shape or fragrance, on vase life in cut-flower species, and on rooting potential or overall plant morphology. Overexpression of the rol genes of the Ri plasmid of Agrobacterium rhizogenes in plants alters several of the plant's developmental processes and affects their architecture. Both A. rhizogenes- and rol-transgenic plants display the "hairy-root phenotype", although specific differences are found between species and between transgenic lines. In general, these plants show a dwarfed phenotype, reduced apical dominance, smaller, wrinkled leaves, increased rooting, altered flowering and reduced fertility. Among the rol genes, termed rolA, B, C and D, rolC has been the most widely studied because its effects are the most advantageous in terms of improving ornamental and horticultural traits. In addition to the dwarfness and the increase in lateral shoots that lead to a bushy phenotype, rolC-plants display more, smaller flowers, and advanced flowering; surprisingly, these plants may have better rooting capacity and they show almost no undesirable traits. rolD, the least studied among the rol genes, offers promising applications due to its promotion of flowering. Although the biochemical functions of rol genes remain poorly understood, they are useful tools for improving ornamental flowers, as their expression in transgenic plants yields many beneficial traits.

  20. Codon optimisation to improve expression of a Mycobacterium avium ssp. paratuberculosis-specific membrane-associated antigen by Lactobacillus salivarius.

    PubMed

    Johnston, Christopher; Douarre, Pierre E; Soulimane, Tewfik; Pletzer, Daniel; Weingart, Helge; MacSharry, John; Coffey, Aidan; Sleator, Roy D; O'Mahony, Jim

    2013-06-01

    Subunit and DNA-based vaccines against Mycobacterium avium ssp. paratuberculosis (MAP) attempt to overcome inherent issues associated with whole-cell formulations. However, these vaccines can be hampered by poor expression of recombinant antigens from a number of disparate hosts. The high G+C content of MAP invariably leads to a codon bias throughout gene expression. To investigate if the codon bias affects recombinant MAP antigen expression, the open reading frame of a MAP-specific antigen MptD (MAP3733c) was codon optimised for expression against a Lactobacillus salivarius host. Of the total 209 codons which constitute MAP3733c, 172 were modified resulting in a reduced G+C content from 61% for the native gene to 32.7% for the modified form. Both genes were placed under the transcriptional control of the PnisA promoter; allowing controlled heterologous expression in L. salivarius. Expression was monitored using fluorescence microscopy and microplate fluorometry via GFP tags translationally fused to the C-termini of the two MptD genes. A > 37-fold increase in expression was observed for the codon-optimised MAP3733synth variant over the native gene. Due to the low cost and improved expression achieved, codon optimisation significantly improves the potential of L. salivarius as an oral vaccine stratagem against Johne's disease. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.

  1. Poxvirus Host Range Genes and Virus–Host Spectrum: A Critical Review

    PubMed Central

    Oliveira, Graziele Pereira; Rodrigues, Rodrigo Araújo Lima; Lima, Maurício Teixeira; Drumond, Betânia Paiva; Abrahão, Jônatas Santos

    2017-01-01

    The Poxviridae family is comprised of double-stranded DNA viruses belonging to nucleocytoplasmic large DNA viruses (NCLDV). Among the NCLDV, poxviruses exhibit the widest known host range, which is likely observed because this viral family has been more heavily investigated. However, relative to each member of the Poxviridae family, the spectrum of the host is variable, where certain viruses can infect a large range of hosts, while others are restricted to only one host species. It has been suggested that the variability in host spectrum among poxviruses is linked with the presence or absence of some host range genes. Would it be possible to extrapolate the restriction of viral replication in a specific cell lineage to an animal, a far more complex organism? In this study, we compare and discuss the relationship between the host range of poxvirus species and the abundance/diversity of host range genes. We analyzed the sequences of 38 previously identified and putative homologs of poxvirus host range genes, and updated these data with deposited sequences of new poxvirus genomes. Overall, the term host range genes might not be the most appropriate for these genes, since no correlation between them and the viruses’ host spectrum was observed, and a change in nomenclature should be considered. Finally, we analyzed the evolutionary history of these genes, and reaffirmed the occurrence of horizontal gene transfer (HGT) for certain elements, as previously suggested. Considering the data presented in this study, it is not possible to associate the diversity of host range factors with the amount of hosts of known poxviruses, and this traditional nomenclature creates misunderstandings. PMID:29112165

  2. Species composition of the genus Saprolegnia in fin fish aquaculture environments, as determined by nucleotide sequence analysis of the nuclear rDNA ITS regions.

    PubMed

    de la Bastide, Paul Y; Leung, Wai Lam; Hintz, William E

    2015-01-01

    The ITS region of the rDNA gene was compared for Saprolegnia spp. in order to improve our understanding of nucleotide sequence variability within and between species of this genus, determine species composition in Canadian fin fish aquaculture facilities, and to assess the utility of ITS sequence variability in genetic marker development. From a collection of more than 400 field isolates, ITS region nucleotide sequences were studied and it was determined that there was sufficient consistent inter-specific variation to support the designation of species identity based on ITS sequence data. This non-subjective approach to species identification does not rely upon transient morphological features. Phylogenetic analyses comparing our ITS sequences and species designations with data from previous studies generally supported the clade scheme of Diéguez-Uribeondo et al. (2007) and found agreement with the molecular taxonomic cluster system of Sandoval-Sierra et al. (2014). Our Canadian ITS sequence collection will thus contribute to the public database and assist the clarification of Saprolegnia spp. taxonomy. The analysis of ITS region sequence variability facilitated genus- and species-level identification of unknown samples from aquaculture facilities and provided useful information on species composition. A unique ITS-RFLP for the identification of S. parasitica was also described. Copyright © 2014 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.

  3. tRNAomics: tRNA gene copy number variation and codon use provide bioinformatic evidence of a new anticodon:codon wobble pair in a eukaryote

    PubMed Central

    Iben, James R.; Maraia, Richard J.

    2012-01-01

    tRNA genes are interspersed throughout eukaryotic DNA, contributing to genome architecture and evolution in addition to translation of the transcriptome. Codon use correlates with tRNA gene copy number in noncomplex organisms including yeasts. Synonymous codons impact translation with various outcomes, dependent on relative tRNA abundances. Availability of whole-genome sequences allowed us to examine tRNA gene copy number variation (tgCNV) and codon use in four Schizosaccharomyces species and Saccharomyces cerevisiae. tRNA gene numbers vary from 171 to 322 in the four Schizosaccharomyces despite very high similarity in other features of their genomes. In addition, we performed whole-genome sequencing of several related laboratory strains of Schizosaccharomyces pombe and found tgCNV at a cluster of tRNA genes. We examined for the first time effects of wobble rules on correlation of tRNA gene number and codon use and showed improvement for S. cerevisiae and three of the Schizosaccharomyces species. In contrast, correlation in Schizosaccharomyces japonicus is poor due to markedly divergent tRNA gene content, and much worsened by the wobble rules. In japonicus, some tRNA iso-acceptor genes are absent and others are greatly reduced relative to the other yeasts, while genes for synonymous wobble iso-acceptors are amplified, indicating wobble use not apparent in any other eukaryote. We identified a subset of japonicus-specific wobbles that improves correlation of codon use and tRNA gene content in japonicus. We conclude that tgCNV is high among Schizo species and occurs in related laboratory strains of S. pombe (and expectedly other species), and tRNAome-codon analyses can provide insight into species-specific wobble decoding. PMID:22586155

  4. Cotransduction with MGMT and Ubiquitous or Erythroid-Specific GFP Lentiviruses Allows Enrichment of Dual-Positive Hematopoietic Progenitor Cells In Vivo

    PubMed Central

    Roth, Justin C.; Ismail, Mourad; Reese, Jane S.; Lingas, Karen T.; Ferrari, Giuliana; Gerson, Stanton L.

    2012-01-01

    The P140K point mutant of MGMT allows robust hematopoietic stem cell (HSC) enrichment in vivo. Thus, dual-gene vectors that couple MGMT and therapeutic gene expression have allowed enrichment of gene-corrected HSCs in animal models. However, expression levels from dual-gene vectors are often reduced for one or both genes. Further, it may be desirable to express selection and therapeutic genes at distinct stages of cell differentiation. In this regard, we evaluated whether hematopoietic cells could be efficiently cotransduced using low MOIs of two separate single-gene lentiviruses, including MGMT for dual-positive cell enrichment. Cotransduction efficiencies were evaluated using a range of MGMT : GFP virus ratios, MOIs, and selection stringencies in vitro. Cotransduction was optimal when equal proportions of each virus were used, but low MGMT : GFP virus ratios resulted in the highest proportion of dual-positive cells after selection. This strategy was then evaluated in murine models for in vivo selection of HSCs cotransduced with a ubiquitous MGMT expression vector and an erythroid-specific GFP vector. Although the MGMT and GFP expression percentages were variable among engrafted recipients, drug selection enriched MGMT-positive leukocyte and GFP-positive erythroid cell populations. These data demonstrate cotransduction as a mean to rapidly enrich and evaluate therapeutic lentivectors in vivo. PMID:22888445

  5. Gene Expression Profile Analysis is Directly Affected by the Selected Reference Gene: The Case of Leaf-Cutting Atta Sexdens

    PubMed Central

    Máximo, Wesley P. F.; Zanetti, Ronald; Paiva, Luciano V.

    2018-01-01

    Although several ant species are important targets for the development of molecular control strategies, only a few studies focus on identifying and validating reference genes for quantitative reverse transcription polymerase chain reaction (RT-qPCR) data normalization. We provide here an extensive study to identify and validate suitable reference genes for gene expression analysis in the ant Atta sexdens, a threatening agricultural pest in South America. The optimal number of reference genes varies according to each sample and the result generated by RefFinder differed about which is the most suitable reference gene. Results suggest that the RPS16, NADH and SDHB genes were the best reference genes in the sample pool according to stability values. The SNF7 gene expression pattern was stable in all evaluated sample set. In contrast, when using less stable reference genes for normalization a large variability in SNF7 gene expression was recorded. There is no universal reference gene suitable for all conditions under analysis, since these genes can also participate in different cellular functions, thus requiring a systematic validation of possible reference genes for each specific condition. The choice of reference genes on SNF7 gene normalization confirmed that unstable reference genes might drastically change the expression profile analysis of target candidate genes. PMID:29419794

  6. Association of Malignancy Prevalence With Test Properties and Performance of the Gene Expression Classifier in Indeterminate Thyroid Nodules.

    PubMed

    Al-Qurayshi, Zaid; Deniwar, Ahmed; Thethi, Tina; Mallik, Tilak; Srivastav, Sudesh; Murad, Fadi; Bhatia, Parisha; Moroz, Krzysztof; Sholl, Andrew B; Kandil, Emad

    2017-04-01

    It is crucial for clinicians to know the malignancy prevalence within each indeterminate cytologic category to estimate the performance of the gene expression classifier (GEC). To examine the variability in the performance of the GEC. This retrospective cohort study of patients with Bethesda category III and IV thyroid nodules used single-institution data from January 1, 2013, through February 29, 2016. Expected negative predictive value (NPV) was calculated by adopting published sensitivity and specificity. Observed NPV was calculated based on the true-negative rate. Outcomes were compared with pooled data from 11 studies published January 1, 2010, to January 31, 2016. A total of 145 patients with 154 thyroid nodules were included in the study (mean [SD] age, 56.0 [16.2] years; 106 females [73.1%]). Malignancy prevalence was 45%. On the basis of this prevalence, the expected NPV is 85% and the observed NPV is 69%. If the prevalence is assumed to be 25%, the expected NPV would be 94%, whereas the observed NPV would be 85%. Pooled data analysis of 11 studies comprising 1303 participants revealed a malignancy prevalence of 31% (95% CI, 29%-34%) and a pooled NPV of 92% (95% CI, 87%-96%). In this study, variability in the performance of the GEC was not solely a function of malignancy prevalence and may have been attributable to intrinsic variability of the test sensitivity and specificity. The utility of the GEC in practice is elusive because of this variability. A better definition of the GEC's intrinsic properties is needed.

  7. Transcriptional Analysis of In vivo Plasmodium yoelii Liver Stage Gene Expression

    DTIC Science & Technology

    2005-04-26

    reaction was added to a PCR master mix with one of several oligonucleotide primer pairs (Table S6). The primers were designed and the specific conditions ...Koonin EV. Using the COG database to improve gene recognition in complete genomes. Genetica 2000;108:9–17. 26] Florens L, Washburn MP, Raine JD, et al

  8. [Implementation of vanA and vanB genes by PCR technique research interest in system (Xpert vanA/vanB CepheidR) closed in a laboratory of microbiology in managing an outbreak to Enterococcus faecium resistant glycopeptide (EfRG)].

    PubMed

    Dekeyser, S; Beclin, E; Descamps, D

    2011-04-01

    The closed system PCR for the rapid detection of vanA and vanB genes (Xpert vanA/vanB Cepheid(®)) was evaluated in our laboratory, to improve the rapidity of the response and thus the management of patients and isolation measures during two GRE outbreaks. From March to December2009, 565 samples were analysed by PCR associated to bacterial culture initially for all samples for 2months (n = 75), and thereafter for PCR-positive samples only. In this study, sensitivity and negative predictive values of the PCR were 100%. Specificity was evaluated in the presence and absence of outbreak: 69.3 and 76.8% respectively. The variability of false positive rates between units were lower in nonepidemic than during epidemic phase. The global false positive rate was 23.9%. This easy-to-use technology provides rapid results… four samples are tested in 1h versus 72h for culture. Despite its reagent cost, it represents an important hospital diagnostic tool: improvement of the management of cohorting areas and patient transfer between units, adaptation of isolation measures and treatments. However, culture remains necessary to confirm any positive result obtained by PCR and for epidemiological surveillance. Copyright © 2010 Elsevier Masson SAS. All rights reserved.

  9. Whole genome sequencing revealed host adaptation-focused genomic plasticity of pathogenic Leptospira

    PubMed Central

    Xu, Yinghua; Zhu, Yongzhang; Wang, Yuezhu; Chang, Yung-Fu; Zhang, Ying; Jiang, Xiugao; Zhuang, Xuran; Zhu, Yongqiang; Zhang, Jinlong; Zeng, Lingbing; Yang, Minjun; Li, Shijun; Wang, Shengyue; Ye, Qiang; Xin, Xiaofang; Zhao, Guoping; Zheng, Huajun; Guo, Xiaokui; Wang, Junzhi

    2016-01-01

    Leptospirosis, caused by pathogenic Leptospira spp., has recently been recognized as an emerging infectious disease worldwide. Despite its severity and global importance, knowledge about the molecular pathogenesis and virulence evolution of Leptospira spp. remains limited. Here we sequenced and analyzed 102 isolates representing global sources. A high genomic variability were observed among different Leptospira species, which was attributed to massive gene gain and loss events allowing for adaptation to specific niche conditions and changing host environments. Horizontal gene transfer and gene duplication allowed the stepwise acquisition of virulence factors in pathogenic Leptospira evolved from a recent common ancestor. More importantly, the abundant expansion of specific virulence-related protein families, such as metalloproteases-associated paralogs, were exclusively identified in pathogenic species, reflecting the importance of these protein families in the pathogenesis of leptospirosis. Our observations also indicated that positive selection played a crucial role on this bacteria adaptation to hosts. These novel findings may lead to greater understanding of the global diversity and virulence evolution of Leptospira spp. PMID:26833181

  10. Genetic characterization of the oxytocin-neurophysin I gene (OXT) and its regulatory regions analysis in domestic Old and New World camelids

    PubMed Central

    Ogah, Danlami Moses; Iannaccone, Marco; Erhardt, Georg; Di Stasio, Liliana; Cosenza, Gianfranco

    2018-01-01

    Oxytocin is a neurohypophysial peptide linked to a wide range of biological functions, including milk ejection, temperament and reproduction. Aims of the present study were a) the characterization of the OXT (Oxytocin-neurophysin I) gene and its regulatory regions in Old and New world camelids; b) the investigation of the genetic diversity and the discovery of markers potentially affecting the gene regulation. On average, the gene extends over 814 bp, ranging between 825 bp in dromedary, 811 bp in Bactrian and 810 bp in llama and alpaca. Such difference in size is due to a duplication event of 21 bp in dromedary. The main regulatory elements, including the composite hormone response elements (CHREs), were identified in the promoter, whereas the presence of mature microRNAs binding sequences in the 3’UTR improves the knowledge on the factors putatively involved in the OXT gene regulation, although their specific biological effect needs to be still elucidated. The sequencing of genomic DNA allowed the identification of 17 intraspecific polymorphisms and 69 nucleotide differences among the four species. One of these (MF464535:g.622C>G) is responsible, in alpaca, for the loss of a consensus sequence for the transcription factor SP1. Furthermore, the same SNP falls within a CpG island and it creates a new methylation site, thus opening future possibilities of investigation to verify the influence of the novel allelic variant in the OXT gene regulation. A PCR-RFLP method was setup for the genotyping and the frequency of the allele C was 0.93 in a population of 71 alpacas. The obtained data clarify the structure of OXT gene in domestic camelids and add knowledge to the genetic variability of a genomic region, which has received little investigation so far. These findings open the opportunity for new investigations, including association studies with productive and reproductive traits. PMID:29608621

  11. Genetic characterization of the oxytocin-neurophysin I gene (OXT) and its regulatory regions analysis in domestic Old and New World camelids.

    PubMed

    Pauciullo, Alfredo; Ogah, Danlami Moses; Iannaccone, Marco; Erhardt, Georg; Di Stasio, Liliana; Cosenza, Gianfranco

    2018-01-01

    Oxytocin is a neurohypophysial peptide linked to a wide range of biological functions, including milk ejection, temperament and reproduction. Aims of the present study were a) the characterization of the OXT (Oxytocin-neurophysin I) gene and its regulatory regions in Old and New world camelids; b) the investigation of the genetic diversity and the discovery of markers potentially affecting the gene regulation. On average, the gene extends over 814 bp, ranging between 825 bp in dromedary, 811 bp in Bactrian and 810 bp in llama and alpaca. Such difference in size is due to a duplication event of 21 bp in dromedary. The main regulatory elements, including the composite hormone response elements (CHREs), were identified in the promoter, whereas the presence of mature microRNAs binding sequences in the 3'UTR improves the knowledge on the factors putatively involved in the OXT gene regulation, although their specific biological effect needs to be still elucidated. The sequencing of genomic DNA allowed the identification of 17 intraspecific polymorphisms and 69 nucleotide differences among the four species. One of these (MF464535:g.622C>G) is responsible, in alpaca, for the loss of a consensus sequence for the transcription factor SP1. Furthermore, the same SNP falls within a CpG island and it creates a new methylation site, thus opening future possibilities of investigation to verify the influence of the novel allelic variant in the OXT gene regulation. A PCR-RFLP method was setup for the genotyping and the frequency of the allele C was 0.93 in a population of 71 alpacas. The obtained data clarify the structure of OXT gene in domestic camelids and add knowledge to the genetic variability of a genomic region, which has received little investigation so far. These findings open the opportunity for new investigations, including association studies with productive and reproductive traits.

  12. Genome-Wide Analysis of ZmDREB Genes and Their Association with Natural Variation in Drought Tolerance at Seedling Stage of Zea mays L

    PubMed Central

    Wang, Hongwei; Xin, Haibo; Yang, Xiaohong; Yan, Jianbing; Li, Jiansheng; Tran, Lam-Son Phan; Shinozaki, Kazuo; Yamaguchi-Shinozaki, Kazuko; Qin, Feng

    2013-01-01

    The worldwide production of maize (Zea mays L.) is frequently impacted by water scarcity and as a result, increased drought tolerance is a priority target in maize breeding programs. While DREB transcription factors have been demonstrated to play a central role in desiccation tolerance, whether or not natural sequence variations in these genes are associated with the phenotypic variability of this trait is largely unknown. In the present study, eighteen ZmDREB genes present in the maize B73 genome were cloned and systematically analyzed to determine their phylogenetic relationship, synteny with rice, maize and sorghum genomes; pattern of drought-responsive gene expression, and protein transactivation activity. Importantly, the association between the nucleic acid variation of each ZmDREB gene with drought tolerance was evaluated using a diverse population of maize consisting of 368 varieties from tropical and temperate regions. A significant association between the genetic variation of ZmDREB2.7 and drought tolerance at seedling stage was identified. Further analysis found that the DNA polymorphisms in the promoter region of ZmDREB2.7, but not the protein coding region itself, was associated with different levels of drought tolerance among maize varieties, likely due to distinct patterns of gene expression in response to drought stress. In vitro, protein-DNA binding assay demonstrated that ZmDREB2.7 protein could specifically interact with the target DNA sequences. The transgenic Arabidopsis overexpressing ZmDREB2.7 displayed enhanced tolerance to drought stress. Moreover, a favorable allele of ZmDREB2.7, identified in the drought-tolerant maize varieties, was effective in imparting plant tolerance to drought stress. Based upon these findings, we conclude that natural variation in the promoter of ZmDREB2.7 contributes to maize drought tolerance, and that the gene and its favorable allele may be an important genetic resource for the genetic improvement of drought tolerance in maize. PMID:24086146

  13. Genome editing for crop improvement: Challenges and opportunities

    PubMed Central

    Abdallah, Naglaa A; Prakash, Channapatna S; McHughen, Alan G

    2015-01-01

    ABSTRACT Genome or gene editing includes several new techniques to help scientists precisely modify genome sequences. The techniques also enables us to alter the regulation of gene expression patterns in a pre-determined region and facilitates novel insights into the functional genomics of an organism. Emergence of genome editing has brought considerable excitement especially among agricultural scientists because of its simplicity, precision and power as it offers new opportunities to develop improved crop varieties with clear-cut addition of valuable traits or removal of undesirable traits. Research is underway to improve crop varieties with higher yields, strengthen stress tolerance, disease and pest resistance, decrease input costs, and increase nutritional value. Genome editing encompasses a wide variety of tools using either a site-specific recombinase (SSR) or a site-specific nuclease (SSN) system. Both systems require recognition of a known sequence. The SSN system generates single or double strand DNA breaks and activates endogenous DNA repair pathways. SSR technology, such as Cre/loxP and Flp/FRT mediated systems, are able to knockdown or knock-in genes in the genome of eukaryotes, depending on the orientation of the specific sites (loxP, FLP, etc.) flanking the target site. There are 4 main classes of SSN developed to cleave genomic sequences, mega-nucleases (homing endonuclease), zinc finger nucleases (ZFNs), transcriptional activator-like effector nucleases (TALENs), and the CRISPR/Cas nuclease system (clustered regularly interspaced short palindromic repeat/CRISPR-associated protein). The recombinase mediated genome engineering depends on recombinase (sub-) family and target-site and induces high frequencies of homologous recombination. Improving crops with gene editing provides a range of options: by altering only a few nucleotides from billions found in the genomes of living cells, altering the full allele or by inserting a new gene in a targeted region of the genome. Due to its precision, gene editing is more precise than either conventional crop breeding methods or standard genetic engineering methods. Thus this technology is a very powerful tool that can be used toward securing the world's food supply. In addition to improving the nutritional value of crops, it is the most effective way to produce crops that can resist pests and thrive in tough climates. There are 3 types of modifications produced by genome editing; Type I includes altering a few nucleotides, Type II involves replacing an allele with a pre-existing one and Type III allows for the insertion of new gene(s) in predetermined regions in the genome. Because most genome-editing techniques can leave behind traces of DNA alterations evident in a small number of nucleotides, crops created through gene editing could avoid the stringent regulation procedures commonly associated with GM crop development. For this reason many scientists believe plants improved with the more precise gene editing techniques will be more acceptable to the public than transgenic plants. With genome editing comes the promise of new crops being developed more rapidly with a very low risk of off-target effects. It can be performed in any laboratory with any crop, even those that have complex genomes and are not easily bred using conventional methods. PMID:26930114

  14. DNA Damage Response and Repair Gene Alterations Are Associated with Improved Survival in Patients with Platinum-Treated Advanced Urothelial Carcinoma.

    PubMed

    Teo, Min Yuen; Bambury, Richard M; Zabor, Emily C; Jordan, Emmet; Al-Ahmadie, Hikmat; Boyd, Mariel E; Bouvier, Nancy; Mullane, Stephanie A; Cha, Eugene K; Roper, Nitin; Ostrovnaya, Irina; Hyman, David M; Bochner, Bernard H; Arcila, Maria E; Solit, David B; Berger, Michael F; Bajorin, Dean F; Bellmunt, Joaquim; Iyer, Gopakumar; Rosenberg, Jonathan E

    2017-07-15

    Purpose: Platinum-based chemotherapy remains the standard treatment for advanced urothelial carcinoma by inducing DNA damage. We hypothesize that somatic alterations in DNA damage response and repair (DDR) genes are associated with improved sensitivity to platinum-based chemotherapy. Experimental Design: Patients with diagnosis of locally advanced and metastatic urothelial carcinoma treated with platinum-based chemotherapy who had exon sequencing with the Memorial Sloan Kettering-Integrated Mutation Profiling of Actionable Cancer Targets (MSK-IMPACT) assay were identified. Patients were dichotomized based on the presence/absence of alterations in a panel of 34 DDR genes. DDR alteration status was correlated with clinical outcomes and disease features. Results: One hundred patients were identified, of which 47 harbored alterations in DDR genes. Patients with DDR alterations had improved progression-free survival (9.3 vs. 6.0 months, log-rank P = 0.007) and overall survival (23.7 vs. 13.0 months, log-rank P = 0.006). DDR alterations were also associated with higher number mutations and copy-number alterations. A trend toward positive correlation between DDR status and nodal metastases and inverse correlation with visceral metastases were observed. Different DDR pathways also suggested variable impact on clinical outcomes. Conclusions: Somatic DDR alteration is associated with improved clinical outcomes in platinum-treated patients with advanced urothelial carcinoma. Once validated, it can improve patient selection for clinical practice and future study enrollment. Clin Cancer Res; 23(14); 3610-8. ©2017 AACR . ©2017 American Association for Cancer Research.

  15. A pragmatic approach for mortality prediction after surgery in infective endocarditis: optimizing and refining EuroSCORE.

    PubMed

    Fernández-Hidalgo, N; Ferreria-González, I; Marsal, J R; Ribera, A; Aznar, M L; de Alarcón, A; García-Cabrera, E; Gálvez-Acebal, J; Sánchez-Espín, G; Reguera-Iglesias, J M; De La Torre-Lima, J; Lomas, J M; Hidalgo-Tenorio, C; Vallejo, N; Miranda, B; Santos-Ortega, A; Castro, M A; Tornos, P; García-Dorado, D; Almirante, B

    2018-03-03

    To simplify and optimize the ability of EuroSCORE I and II to predict early mortality after surgery for infective endocarditis (IE). Multicentre retrospective study (n = 775). Simplified scores, eliminating irrelevant variables, and new specific scores, adding specific IE variables, were created. The performance of the original, recalibrated and specific EuroSCOREs was assessed by Brier score, C-statistic and calibration plot in bootstrap samples. The Net Reclassification Index was quantified. Recalibrated scores including age, previous cardiac surgery, critical preoperative state, New York Heart Association >I, and emergent surgery (EuroSCORE I and II); renal failure and pulmonary hypertension (EuroSCORE I); and urgent surgery (EuroSCORE II) performed better than the original EuroSCOREs (Brier original and recalibrated: EuroSCORE I: 0.1770 and 0.1667; EuroSCORE II: 0.2307 and 0.1680). Performance improved with the addition of fistula, staphylococci and mitral location (EuroSCORE I and II) (Brier specific: EuroSCORE I 0.1587, EuroSCORE II 0.1592). Discrimination improved in specific models (C-statistic original, recalibrated and specific: EuroSCORE I: 0.7340, 0.7471 and 0.7728; EuroSCORE II: 0.7442, 0.7423 and 0.7700). Calibration improved in both EuroSCORE I models (intercept 0.295, slope 0.829 (original); intercept -0.094, slope 0.888 (recalibrated); intercept -0.059, slope 0.925 (specific)) but only in specific EuroSCORE II model (intercept 2.554, slope 1.114 (original); intercept -0.260, slope 0.703 (recalibrated); intercept -0.053, slope 0.930 (specific)). Net Reclassification Index was 5.1% and 20.3% for the specific EuroSCORE I and II CONCLUSIONS: The use of simplified EuroSCORE I and EuroSCORE II models in IE with the addition of specific variables may lead to simpler and more accurate models. Copyright © 2018 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.

  16. Evolutionary Trails of Plant Group II Pyridoxal Phosphate-Dependent Decarboxylase Genes.

    PubMed

    Kumar, Rahul

    2016-01-01

    Type II pyridoxal phosphate-dependent decarboxylase (PLP_deC) enzymes play important metabolic roles during nitrogen metabolism. Recent evolutionary profiling of these genes revealed a sharp expansion of histidine decarboxylase genes in the members of Solanaceae family. In spite of the high sequence homology shared by PLP_deC orthologs, these enzymes display remarkable differences in their substrate specificities. Currently, limited information is available on the gene repertoires and substrate specificities of PLP_deCs which renders their precise annotation challenging and offers technical challenges in the immediate identification and biochemical characterization of their full gene complements in plants. Herein, we explored their evolutionary trails in a comprehensive manner by taking advantage of high-throughput data accessibility and computational approaches. We discussed the premise that has enabled an improved reconstruction of their evolutionary lineage and evaluated the factors offering constraints in their rapid functional characterization, till date. We envisage that the synthesized information herein would act as a catalyst for the rapid exploration of their biochemical specificity and physiological roles in more plant species.

  17. Computational correction of copy number effect improves specificity of CRISPR-Cas9 essentiality screens in cancer cells.

    PubMed

    Meyers, Robin M; Bryan, Jordan G; McFarland, James M; Weir, Barbara A; Sizemore, Ann E; Xu, Han; Dharia, Neekesh V; Montgomery, Phillip G; Cowley, Glenn S; Pantel, Sasha; Goodale, Amy; Lee, Yenarae; Ali, Levi D; Jiang, Guozhi; Lubonja, Rakela; Harrington, William F; Strickland, Matthew; Wu, Ting; Hawes, Derek C; Zhivich, Victor A; Wyatt, Meghan R; Kalani, Zohra; Chang, Jaime J; Okamoto, Michael; Stegmaier, Kimberly; Golub, Todd R; Boehm, Jesse S; Vazquez, Francisca; Root, David E; Hahn, William C; Tsherniak, Aviad

    2017-12-01

    The CRISPR-Cas9 system has revolutionized gene editing both at single genes and in multiplexed loss-of-function screens, thus enabling precise genome-scale identification of genes essential for proliferation and survival of cancer cells. However, previous studies have reported that a gene-independent antiproliferative effect of Cas9-mediated DNA cleavage confounds such measurement of genetic dependency, thereby leading to false-positive results in copy number-amplified regions. We developed CERES, a computational method to estimate gene-dependency levels from CRISPR-Cas9 essentiality screens while accounting for the copy number-specific effect. In our efforts to define a cancer dependency map, we performed genome-scale CRISPR-Cas9 essentiality screens across 342 cancer cell lines and applied CERES to this data set. We found that CERES decreased false-positive results and estimated sgRNA activity for both this data set and previously published screens performed with different sgRNA libraries. We further demonstrate the utility of this collection of screens, after CERES correction, for identifying cancer-type-specific vulnerabilities.

  18. Computational correction of copy-number effect improves specificity of CRISPR-Cas9 essentiality screens in cancer cells

    PubMed Central

    Meyers, Robin M.; Bryan, Jordan G.; McFarland, James M.; Weir, Barbara A.; Sizemore, Ann E.; Xu, Han; Dharia, Neekesh V.; Montgomery, Phillip G.; Cowley, Glenn S.; Pantel, Sasha; Goodale, Amy; Lee, Yenarae; Ali, Levi D.; Jiang, Guozhi; Lubonja, Rakela; Harrington, William F.; Strickland, Matthew; Wu, Ting; Hawes, Derek C.; Zhivich, Victor A.; Wyatt, Meghan R.; Kalani, Zohra; Chang, Jaime J.; Okamoto, Michael; Stegmaier, Kimberly; Golub, Todd R.; Boehm, Jesse S.; Vazquez, Francisca; Root, David E.; Hahn, William C.; Tsherniak, Aviad

    2017-01-01

    The CRISPR-Cas9 system has revolutionized gene editing both on single genes and in multiplexed loss-of-function screens, enabling precise genome-scale identification of genes essential to proliferation and survival of cancer cells1,2. However, previous studies reported that a gene-independent anti-proliferative effect of Cas9-mediated DNA cleavage confounds such measurement of genetic dependency, leading to false positive results in copy number amplified regions3,4. We developed CERES, a computational method to estimate gene dependency levels from CRISPR-Cas9 essentiality screens while accounting for the copy-number-specific effect. As part of our efforts to define a cancer dependency map, we performed genome-scale CRISPR-Cas9 essentiality screens across 342 cancer cell lines and applied CERES to this dataset. We found that CERES reduced false positive results and estimated sgRNA activity for both this dataset and previously published screens performed with different sgRNA libraries. Here, we demonstrate the utility of this collection of screens, upon CERES correction, in revealing cancer-type-specific vulnerabilities. PMID:29083409

  19. The Genetics of Pulmonary Arterial Hypertension

    PubMed Central

    Austin, Eric D.; Loyd, James E.

    2014-01-01

    Pulmonary arterial hypertension (PAH) is a progressive and fatal disease for which there is an ever-expanding body of genetic and related pathophysiological information on disease pathogenesis. A number of germline gene mutations have now been described, including mutations in the gene coding bone morphogenic protein receptor type 2 (BMPR2) and related genes. Recent advanced gene sequencing methods have facilitated the discovery of additional genes with mutations among those with and without familial forms of PAH (CAV1, KCNK3, EIF2AK4). The reduced penetrance, variable expressivity, and female predominance of PAH suggest that genetic, genomic and other factors modify disease expression. These multi-faceted variations are an active area of investigation in the field, including but not limited to common genetic variants and epigenetic processes, and may provide novel opportunities for pharmacologic intervention in the near future. They also highlight the need for a systems-oriented multi-level approach to incorporate the multitude of biologic variations now associated with PAH. Ultimately, improved understanding provides the opportunity for improved patient and family counseling about this devastating disease, but do require in depth understanding of the genetic factors relevant to PAH. PMID:24951767

  20. Comprehensive genetic dissection of wood properties in a widely-grown tropical tree: Eucalyptus

    PubMed Central

    2011-01-01

    Background Eucalyptus is an important genus in industrial plantations throughout the world and is grown for use as timber, pulp, paper and charcoal. Several breeding programmes have been launched worldwide to concomitantly improve growth performance and wood properties (WPs). In this study, an interspecific cross between Eucalyptus urophylla and E. grandis was used to identify major genomic regions (Quantitative Trait Loci, QTL) controlling the variability of WPs. Results Linkage maps were generated for both parent species. A total of 117 QTLs were detected for a series of wood and end-use related traits, including chemical, technological, physical, mechanical and anatomical properties. The QTLs were mainly clustered into five linkage groups. In terms of distribution of QTL effects, our result agrees with the typical L-shape reported in most QTL studies, i.e. most WP QTLs had limited effects and only a few (13) had major effects (phenotypic variance explained > 15%). The co-locations of QTLs for different WPs as well as QTLs and candidate genes are discussed in terms of phenotypic correlations between traits, and of the function of the candidate genes. The major wood property QTL harbours a gene encoding a Cinnamoyl CoA reductase (CCR), a structural enzyme of the monolignol-specific biosynthesis pathway. Conclusions Given the number of traits analysed, this study provides a comprehensive understanding of the genetic architecture of wood properties in this Eucalyptus full-sib pedigree. At the dawn of Eucalyptus genome sequence, it will provide a framework to identify the nature of genes underlying these important quantitative traits. PMID:21651758

  1. ModuleMiner - improved computational detection of cis-regulatory modules: are there different modes of gene regulation in embryonic development and adult tissues?

    PubMed Central

    Van Loo, Peter; Aerts, Stein; Thienpont, Bernard; De Moor, Bart; Moreau, Yves; Marynen, Peter

    2008-01-01

    We present ModuleMiner, a novel algorithm for computationally detecting cis-regulatory modules (CRMs) in a set of co-expressed genes. ModuleMiner outperforms other methods for CRM detection on benchmark data, and successfully detects CRMs in tissue-specific microarray clusters and in embryonic development gene sets. Interestingly, CRM predictions for differentiated tissues exhibit strong enrichment close to the transcription start site, whereas CRM predictions for embryonic development gene sets are depleted in this region. PMID:18394174

  2. [The application of genome editing in identification of plant gene function and crop breeding].

    PubMed

    Zhou, Xiang-chun; Xing, Yong-zhong

    2016-03-01

    Plant genome can be modified via current biotechnology with high specificity and excellent efficiency. Zinc finger nucleases (ZFN), transcription activator-like effector nucleases (TALEN) and clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated 9 (Cas9) system are the key engineered nucleases used in the genome editing. Genome editing techniques enable gene targeted mutagenesis, gene knock-out, gene insertion or replacement at the target sites during the endogenous DNA repair process, including non-homologous end joining (NHEJ) and homologous recombination (HR), triggered by the induction of DNA double-strand break (DSB). Genome editing has been successfully applied in the genome modification of diverse plant species, such as Arabidopsis thaliana, Oryza sativa, and Nicotiana tabacum. In this review, we summarize the application of genome editing in identification of plant gene function and crop breeding. Moreover, we also discuss the improving points of genome editing in crop precision genetic improvement for further study.

  3. Dysfunctional BLK in common variable immunodeficiency perturbs B-cell proliferation and ability to elicit antigen-specific CD4+ T-cell help.

    PubMed

    Compeer, Ewoud B; Janssen, Willemijn; van Royen-Kerkhof, Annet; van Gijn, Marielle; van Montfrans, Joris M; Boes, Marianne

    2015-05-10

    Common Variable Immunodeficiency (CVID) is the most prevalent primary antibody deficiency, and characterized by defective generation of high-affinity antibodies. Patients have therefore increased risk to recurrent infections of the respiratory and intestinal tract. Development of high-affinity antigen-specific antibodies involves two key actions of B-cell receptors (BCR): transmembrane signaling through BCR-complexes to induce B-cell differentiation and proliferation, and BCR-mediated antigen internalization for class-II MHC-mediated presentation to acquire antigen-specific CD4(+) T-cell help.We identified a variant (L3P) in the B-lymphoid tyrosine kinase (BLK) gene of 2 related CVID-patients, which was absent in healthy relatives. BLK belongs to the Src-kinases family and involved in BCR-signaling. Here, we sought to clarify BLK function in healthy human B-cells and its association to CVID.BLK expression was comparable in patient and healthy B-cells. Functional analysis of L3P-BLK showed reduced BCR crosslinking-induced Syk phosphorylation and proliferation, in both primary B-cells and B-LCLs. B-cells expressing L3P-BLK showed accelerated destruction of BCR-internalized antigen and reduced ability to elicit CD40L-expression on antigen-specific CD4(+) T-cells.In conclusion, we found a novel BLK gene variant in CVID-patients that causes suppressed B-cell proliferation and reduced ability of B-cells to elicit antigen-specific CD4(+) T-cell responses. Both these mechanisms may contribute to hypogammaglobulinemia in CVID-patients.

  4. Mutation-profile-based methods for understanding selection forces in cancer somatic mutations: a comparative analysis.

    PubMed

    Zhou, Zhan; Zou, Yangyun; Liu, Gangbiao; Zhou, Jingqi; Wu, Jingcheng; Zhao, Shimin; Su, Zhixi; Gu, Xun

    2017-08-29

    Human genes exhibit different effects on fitness in cancer and normal cells. Here, we present an evolutionary approach to measure the selection pressure on human genes, using the well-known ratio of the nonsynonymous to synonymous substitution rate in both cancer genomes ( C N / C S ) and normal populations ( p N / p S ). A new mutation-profile-based method that adopts sample-specific mutation rate profiles instead of conventional substitution models was developed. We found that cancer-specific selection pressure is quite different from the selection pressure at the species and population levels. Both the relaxation of purifying selection on passenger mutations and the positive selection of driver mutations may contribute to the increased C N / C S values of human genes in cancer genomes compared with the p N / p S values in human populations. The C N / C S values also contribute to the improved classification of cancer genes and a better understanding of the onco-functionalization of cancer genes during oncogenesis. The use of our computational pipeline to identify cancer-specific positively and negatively selected genes may provide useful information for understanding the evolution of cancers and identifying possible targets for therapeutic intervention.

  5. Inferring gene dependency network specific to phenotypic alteration based on gene expression data and clinical information of breast cancer.

    PubMed

    Zhou, Xionghui; Liu, Juan

    2014-01-01

    Although many methods have been proposed to reconstruct gene regulatory network, most of them, when applied in the sample-based data, can not reveal the gene regulatory relations underlying the phenotypic change (e.g. normal versus cancer). In this paper, we adopt phenotype as a variable when constructing the gene regulatory network, while former researches either neglected it or only used it to select the differentially expressed genes as the inputs to construct the gene regulatory network. To be specific, we integrate phenotype information with gene expression data to identify the gene dependency pairs by using the method of conditional mutual information. A gene dependency pair (A,B) means that the influence of gene A on the phenotype depends on gene B. All identified gene dependency pairs constitute a directed network underlying the phenotype, namely gene dependency network. By this way, we have constructed gene dependency network of breast cancer from gene expression data along with two different phenotype states (metastasis and non-metastasis). Moreover, we have found the network scale free, indicating that its hub genes with high out-degrees may play critical roles in the network. After functional investigation, these hub genes are found to be biologically significant and specially related to breast cancer, which suggests that our gene dependency network is meaningful. The validity has also been justified by literature investigation. From the network, we have selected 43 discriminative hubs as signature to build the classification model for distinguishing the distant metastasis risks of breast cancer patients, and the result outperforms those classification models with published signatures. In conclusion, we have proposed a promising way to construct the gene regulatory network by using sample-based data, which has been shown to be effective and accurate in uncovering the hidden mechanism of the biological process and identifying the gene signature for phenotypic change.

  6. Alcohol-related Genes Show an Enrichment of Associations with a Persistent Externalizing Factor

    PubMed Central

    Ashenhurst, James R.; Harden, K. Paige; Corbin, William R.; Fromme, Kim

    2016-01-01

    Research using twins has found that much of the variability in externalizing phenotypes – including alcohol and drug use, impulsive personality traits, risky sex and property crime – is explained by genetic factors. Nevertheless, identification of specific genes and variants associated with these traits has proven to be difficult, likely because individual differences in externalizing are explained by many genes of small individual effect. Moreover, twin research indicates that heritable variance in externalizing behaviors is mostly shared across the externalizing spectrum rather than specific to any behavior. We use a longitudinal, “deep phenotyping” approach to model a general externalizing factor reflecting persistent engagement in a variety of socially problematic behaviors measured at eleven assessment occasions spanning early adulthood (ages 18 to 28). In an ancestrally homogenous sample of non-Hispanic Whites (N = 337), we then tested for enrichment of associations between the persistent externalizing factor and a set of 3,281 polymorphisms within 104 genes that were previously identified as associated with alcohol-use behaviors. Next we tested for enrichment among domain-specific factors (e.g., property crime) composed of residual variance not accounted for by the common factor. Significance was determined relative to bootstrapped empirical thresholds derived from permutations of phenotypic data. Results indicated significant enrichment of genetic associations for persistent externalizing, but not for domain-specific factors. Consistent with twin research findings, these results suggest that genetic variants are broadly associated with externalizing behaviors rather than unique to specific behaviors. General Scientific Summary This study shows that variation in 104 genes is associated with socially problematic “externalizing” behavior, including substance misuse, property crime, risky sex, and aspects of impulsive personality. Importantly, this association was with the common variation across these behaviors rather than with the variation unique to any given behavior. The manuscript demonstrates a potentially advantageous technique for relating sets of hypothesized genes to complex traits or behaviors. PMID:27505405

  7. Transposable element islands facilitate adaptation to novel environments in an invasive species

    PubMed Central

    Schrader, Lukas; Kim, Jay W.; Ence, Daniel; Zimin, Aleksey; Klein, Antonia; Wyschetzki, Katharina; Weichselgartner, Tobias; Kemena, Carsten; Stökl, Johannes; Schultner, Eva; Wurm, Yannick; Smith, Christopher D.; Yandell, Mark; Heinze, Jürgen; Gadau, Jürgen; Oettler, Jan

    2014-01-01

    Adaptation requires genetic variation, but founder populations are generally genetically depleted. Here we sequence two populations of an inbred ant that diverge in phenotype to determine how variability is generated. Cardiocondyla obscurior has the smallest of the sequenced ant genomes and its structure suggests a fundamental role of transposable elements (TEs) in adaptive evolution. Accumulations of TEs (TE islands) comprising 7.18% of the genome evolve faster than other regions with regard to single-nucleotide variants, gene/exon duplications and deletions and gene homology. A non-random distribution of gene families, larvae/adult specific gene expression and signs of differential methylation in TE islands indicate intragenomic differences in regulation, evolutionary rates and coalescent effective population size. Our study reveals a tripartite interplay between TEs, life history and adaptation in an invasive species. PMID:25510865

  8. Molecular basis of length polymorphism in the human zeta-globin gene complex.

    PubMed Central

    Goodbourn, S E; Higgs, D R; Clegg, J B; Weatherall, D J

    1983-01-01

    The length polymorphism between the human zeta-globin gene and its pseudogene is caused by an allele-specific variation in the copy number of a tandemly repeating 36-base-pair sequence. This sequence is related to a tandemly repeated 14-base-pair sequence in the 5' flanking region of the human insulin gene, which is known to cause length polymorphism, and to a repetitive sequence in intervening sequence (IVS) 1 of the pseudo-zeta-globin gene. Evidence is presented that the latter is also of variable length, probably because of differences in the copy number of the tandem repeat. The homology between the three length polymorphisms may be an indication of the presence of a more widespread group of related sequences in the human genome, which might be useful for generalized linkage studies. PMID:6308667

  9. Quantitative Real-Time Legionella PCR for Environmental Water Samples: Data Interpretation

    PubMed Central

    Joly, Philippe; Falconnet, Pierre-Alain; André, Janine; Weill, Nicole; Reyrolle, Monique; Vandenesch, François; Maurin, Max; Etienne, Jerome; Jarraud, Sophie

    2006-01-01

    Quantitative Legionella PCRs targeting the 16S rRNA gene (specific for the genus Legionella) and the mip gene (specific for the species Legionella pneumophila) were applied to a total of 223 hot water system samples (131 in one laboratory and 92 in another laboratory) and 37 cooling tower samples (all in the same laboratory). The PCR results were compared with those of conventional culture. 16S rRNA gene PCR results were nonquantifiable for 2.8% of cooling tower samples and up to 39.1% of hot water system samples, and this was highly predictive of Legionella CFU counts below 250/liter. PCR cutoff values for identifying hot water system samples containing >103 CFU/liter legionellae were determined separately in each laboratory. The cutoffs differed widely between the laboratories and had sensitivities from 87.7 to 92.9% and specificities from 77.3 to 96.5%. The best specificity was obtained with mip PCR. PCR cutoffs could not be determined for cooling tower samples, as the results were highly variable and often high for culture-negative samples. Thus, quantitative Legionella PCR appears to be applicable to samples from hot water systems, but the positivity cutoff has to be determined in each laboratory. PMID:16597985

  10. Leveraging Gene-Environment Interactions and Endotypes for Asthma Gene Discovery

    PubMed Central

    Bønnelykke, Klaus; Ober, Carole

    2016-01-01

    Asthma is a heterogeneous clinical syndrome that includes subtypes of disease with different underlying causes and disease mechanisms. Asthma is caused by a complex interaction between genes and environmental exposures; early-life exposures in particular play an important role. Asthma is also heritable, and a number of susceptibility variants have been discovered in genome-wide association studies, although the known risk alleles explain only a small proportion of the heritability. In this review, we present evidence supporting the hypothesis that focusing on more specific asthma phenotypes, such as childhood asthma with severe exacerbations, and on relevant exposures that are involved in gene-environment interactions (GEIs), such as rhinovirus infections, will improve detection of asthma genes and our understanding of the underlying mechanisms. We will discuss the challenges of considering GEIs and the advantages of studying responses to asthma-associated exposures in clinical birth cohorts, as well as in cell models of GEIs, to dissect the context-specific nature of genotypic risks, to prioritize variants in genome-wide association studies, and to identify pathways involved in pathogenesis in subgroups of patients. We propose that such approaches, in spite of their many challenges, present great opportunities for better understanding of asthma pathogenesis and heterogeneity and, ultimately, for improving prevention and treatment of disease. PMID:26947980

  11. Kindness in the blood: A randomized controlled trial of the gene regulatory impact of prosocial behavior.

    PubMed

    Nelson-Coffey, S Katherine; Fritz, Megan M; Lyubomirsky, Sonja; Cole, Steve W

    2017-07-01

    Prosocial behavior is linked to longevity, but few studies have experimentally manipulated prosocial behavior to identify the causal mechanisms underlying this association. One possible mediating pathway involves changes in gene expression that may subsequently influence disease development or resistance. In the current study, we examined changes in a leukocyte gene expression profile known as the Conserved Transcriptional Response to Adversity (CTRA) in 159 adults who were randomly assigned for 4 weeks to engage in prosocial behavior directed towards specific others, prosocial behavior directed towards the world in general, self-focused kindness, or a neutral control task. Those randomized to prosocial behavior towards specific others demonstrated improvements (i.e., reductions) in leukocyte expression of CTRA indicator genes. No significant changes in CTRA gene expression were observed in the other 3 conditions. These findings suggest that prosocial behavior can causally impact leukocyte gene expression profiles in ways that might potentially help explain the previously observed health advantages associated with social ties. Copyright © 2017 Elsevier Ltd. All rights reserved.

  12. Gene-specific of endocannabinoid receptor 1 (cnr1a) by ethanol probably leads to the development of fetal alcohol spectrum disorder (FASD) phenotypes in Japanese rice fish (Oryzias latipes) embryogenesis

    USDA-ARS?s Scientific Manuscript database

    Developmental ethanol exposure is able to induce Fetal Alcohol Spectrum Disorder (FASD) phenotypes in Japanese rice fish (Oryzias latipes). This study investigated possible differential expression of cannabinoid receptor (cnr) mRNAs during Japanese rice fish embryogenesis and variability to ethanol-...

  13. High Frequency and Diversity of Antimicrobial Activities Produced by Nasal Staphylococcus Strains against Bacterial Competitors

    PubMed Central

    Janek, Daniela; Zipperer, Alexander; Kulik, Andreas; Krismer, Bernhard; Peschel, Andreas

    2016-01-01

    The human nasal microbiota is highly variable and dynamic often enclosing major pathogens such as Staphylococcus aureus. The potential roles of bacteriocins or other mechanisms allowing certain bacterial clones to prevail in this nutrient-poor habitat have hardly been studied. Of 89 nasal Staphylococcus isolates, unexpectedly, the vast majority (84%) was found to produce antimicrobial substances in particular under habitat-specific stress conditions, such as iron limitation or exposure to hydrogen peroxide. Activity spectra were generally narrow but highly variable with activities against certain nasal members of the Actinobacteria, Proteobacteria, Firmicutes, or several groups of bacteria. Staphylococcus species and many other Firmicutes were insusceptible to most of the compounds. A representative bacteriocin was identified as a nukacin-related peptide whose inactivation reduced the capacity of the producer Staphylococcus epidermidis IVK45 to limit growth of other nasal bacteria. Of note, the bacteriocin genes were found on mobile genetic elements exhibiting signs of extensive horizontal gene transfer and rearrangements. Thus, continuously evolving bacteriocins appear to govern bacterial competition in the human nose and specific bacteriocins may become important agents for eradication of notorious opportunistic pathogens from human microbiota. PMID:27490492

  14. Immortalization of MEF is characterized by the deregulation of specific miRNAs with potential tumor suppressor activity

    PubMed Central

    Rizzo, Milena; Evangelista, Monica; Simili, Marcella; Mariani, Laura; Pitto, Letizia; Rainaldi, Giuseppe

    2011-01-01

    The life span (Hayflick limit) of primary mouse embryo fibroblasts (MEF) in culture is variable but it is still unclear if the escape of the Hayflick limit is also variable. To address this point MEF were expanded every fifteen days (6T15) instead of every three days (6T3) until they became immortal. With this protocol MEF lifespan was extended and immortalization accordingly delayed. By testing a panel of genes (p19ARF, p16, p21) and miRNAs (miR-20a, miR-21, miR-28, miR-290) related to primary MEF senescence, a switch of p21 from up to down regulation, the down regulation of specific miRNAs as well as a massive shift from diploidy to hyperdiploidy were observed in coincidence with the resumption of cell proliferation. Collectively, these data indicate that the inactivation of genes and miRNAs, important in controlling cell proliferation, might be determinant for the escape from the Hayflick limit. In support of this hypothesis was the finding that some of the down regulated miRNAs transfected in immortalized MEF inhibited cell proliferation thus displaying a tumor suppressor-like activity. PMID:21765199

  15. Immortalization of MEF is characterized by the deregulation of specific miRNAs with potential tumor suppressor activity.

    PubMed

    Rizzo, Milena; Evangelista, Monica; Simili, Marcella; Mariani, Laura; Pitto, Letizia; Rainaldi, Giuseppe

    2011-07-01

    The life span (Hayflick limit) of primary mouse embryo fibroblasts (MEF) in culture is variable but it is still unclear if the escape of the Hayflick limit is also variable. To address this point MEF were expanded every fifteen days (6T15) instead of every three days (6T3) until they became immortal. With this protocol MEF lifespan was extended and immortalization accordingly delayed. By testing a panel of genes (p19ARF, p16, p21) and miRNAs (miR-20a, miR-21, miR-28, miR-290) related to primary MEF senescence, a switch of p21 from up to down regulation, the down regulation of specific miRNAs as well as a massive shift from diploidy to hyperdiploidy were observed in coincidence with the resumption of cell proliferation. Collectively, these data indicate that the inactivation of genes and miRNAs, important in controlling cell proliferation, might be determinant for the escape from the Hayflick limit. In support of this hypothesis was the finding that some of the down regulated miRNAs transfected in immortalized MEF inhibited cell proliferation thus displaying a tumor suppressor-like activity.

  16. DM-BLD: differential methylation detection using a hierarchical Bayesian model exploiting local dependency.

    PubMed

    Wang, Xiao; Gu, Jinghua; Hilakivi-Clarke, Leena; Clarke, Robert; Xuan, Jianhua

    2017-01-15

    The advent of high-throughput DNA methylation profiling techniques has enabled the possibility of accurate identification of differentially methylated genes for cancer research. The large number of measured loci facilitates whole genome methylation study, yet posing great challenges for differential methylation detection due to the high variability in tumor samples. We have developed a novel probabilistic approach, D: ifferential M: ethylation detection using a hierarchical B: ayesian model exploiting L: ocal D: ependency (DM-BLD), to detect differentially methylated genes based on a Bayesian framework. The DM-BLD approach features a joint model to capture both the local dependency of measured loci and the dependency of methylation change in samples. Specifically, the local dependency is modeled by Leroux conditional autoregressive structure; the dependency of methylation changes is modeled by a discrete Markov random field. A hierarchical Bayesian model is developed to fully take into account the local dependency for differential analysis, in which differential states are embedded as hidden variables. Simulation studies demonstrate that DM-BLD outperforms existing methods for differential methylation detection, particularly when the methylation change is moderate and the variability of methylation in samples is high. DM-BLD has been applied to breast cancer data to identify important methylated genes (such as polycomb target genes and genes involved in transcription factor activity) associated with breast cancer recurrence. A Matlab package of DM-BLD is available at http://www.cbil.ece.vt.edu/software.htm CONTACT: Xuan@vt.eduSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  17. High-frequency expression of a conserved kappa light-chain variable-region gene in chronic lymphocytic leukemia

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kipps, T.J.; Fong, S.; Tomhave, E.

    Malignant B lymphocytes from several patients with chronic lymphocytic leukemia (CLL) were examined for reactivity with murine monoclonal antibody 17.109. This antibody, prepared against the rheumatoid factor (RF) paraprotein Sie, recognizes a cross reactive idiotype on 48% of human IgM RF paraproteins, but does not react with IgM paraproteins without RF activity or substantially with normal pooled immunoglobulin. The 17.109-reactive idiotype is a marker for a kappa III variable-region gene, designated V/sub kappa/RF, that is conserved in outbred human populations. In a limited study of 31 CLL patients, the leukemic cells from 5 of 20 patients with kappa light chain-expressingmore » CLL were recognized by the 17.109 monoclonal antibody. Despite having malignant cells specifically reactive with this antibody, patients with 17.109-positive CLL did not have elevated serum levels of circulating antibody bearing 17.109-reactive determinants. Total RNAs isolated from the CLL B lymphocytes, or from hybridomas produced by fusing the CLL cells with the WI-L2-729-HF/sub 2/ cell line, were fractionated electrophoretically and examined by blot hybridization. Under stringent hybridization conditions capable of discerning a single base-pair mismatch, RNA from the 17.109-idiotype-positive CLL cells hybridized to synthetic oligonucleotide probes corresponding to framework and complementary-determining regions in the V/sub kappa/RF gene. The high frequency of the 17.109-associated idiotype and the V/sub kappa/RF gene in CLL suggests that the disease may arise from B lymphocytes that express a restricted set of inherited immunoglobulin variable-region genes with little or no somatic mutation.« less

  18. Association study of ERβ, AR, and CYP19A1 genes and MtF transsexualism.

    PubMed

    Fernández, Rosa; Esteva, Isabel; Gómez-Gil, Esther; Rumbo, Teresa; Almaraz, Mari Cruz; Roda, Ester; Haro-Mora, Juan-Jesús; Guillamón, Antonio; Pásaro, Eduardo

    2014-12-01

    The etiology of male-to-female (MtF) transsexualism is unknown. Both genetic and neurological factors may play an important role. To investigate the possible influence of the genetic factor on the etiology of MtF transsexualism. We carried out a cytogenetic and molecular analysis in 442 MtFs and 473 healthy, age- and geographical origin-matched XY control males. The karyotype was investigated by G-banding and by high-density array in the transsexual group. The molecular analysis involved three tandem variable regions of genes estrogen receptor β (ERβ) (CA tandem repeats in intron 5), androgen receptor (AR) (CAG tandem repeats in exon 1), and CYP19A1 (TTTA tandem repeats in intron 4). The allele and genotype frequencies, after division into short and long alleles, were obtained. We investigated the association between genotype and transsexualism by performing a molecular analysis of three variable regions of genes ERβ, AR, and CYP19A1 in 915 individuals (442 MtFs and 473 control males). Most MtFs showed an unremarkable 46,XY karyotype (97.96%). No specific chromosome aberration was associated with MtF transsexualism, and prevalence of aneuploidy (2.04%) was slightly higher than in the general population. Molecular analyses showed no significant difference in allelic or genotypic distribution of the genes examined between MtFs and controls. Moreover, molecular findings presented no evidence of an association between the sex hormone-related genes (ERβ, AR, and CYP19A1) and MtF transsexualism. The study suggests that the analysis of karyotype provides limited information in these subjects. Variable regions analyzed from ERβ, AR, and CYP19A1 are not associated with MtF transsexualism. Nevertheless, this does not exclude other polymorphic regions not analyzed. © 2014 International Society for Sexual Medicine.

  19. A preliminary study of genetic diversity of MSP-1 types in Plasmodium falciparum in southern province of Sistan Baluchistan of Iran.

    PubMed

    Zahra, Zamani; Reza, Razavi Mohammad; Mehdi, Assmar; Sedigheh, Sadeghi; Fatemeh, Pourfallah; Nikoo, Nasoohi; Ashraf, Sheibani; Mohammad, Raisi

    2007-02-01

    Plasmodiumfalciparum merozoite surface protein-1 (MSP-1) shows extensive antigenic diversity. This is due to the presence of seven variable blocks, five semi-conserved and also five conserved blocks. The variable blocks in the MSP-1 gene are principally dimorphic, displaying either K1 or MAD20 type; except for the block 2 region which is represented by three alleles, an RO33 type in addition to the other two. Allelic diversity is reported to be generated by intra-genic recombination between the variable blocks. A study of allelic variation of MSP-1 gene in Plasmodium falciparum was carried out in the southern province of Sistan Baluchistan in Iran in 2001-2003. Samples were obtained from 30 febrile patients and DNA was extracted and association types between blocks 2 and 6 was identified on each block using specific primers and compared with those from Vietnam, Brazil and Africa. The association types obtained, were similar though less in number than the ones from Vietnam, but more than those from Africa and Brazil.

  20. Chronic lymphocytic leukemia patients exposed to ionizing radiation due to the Chernobyl NPP accident--with focus on immunoglobulin heavy chain gene analysis.

    PubMed

    Abramenko, Iryna; Bilous, Nadia; Chumak, Anatoliy; Davidova, Ekaterina; Kryachok, Iryna; Martina, Zoya; Nechaev, Stanislav; Dyagil, Iryna; Bazyka, Dmytriy; Bebeshko, Vladimir

    2008-04-01

    Clinical data and immunoglobulin variable heavy chain (IgVH) gene configuration were analyzed in 47 CLL patients, exposed to ionizing radiation (IR) due to Chernobyl NPP accident, and 141 non-exposed patients. Clean-up workers of the second quarter of 1986 (n=19) were picked out as separate group with the highest number of unmutated cases (94.4%), increased usage of IgVH1-69 (33.3%) and IgVH3-21 (16.7%) genes, high frequency of secondary solid tumors (6 cases) and Richter transformation (4 cases). These preliminary data suggest that CLL in the most suffered contingent due to Chernobyl NPP accident might have some specific features.

Top