Chen, Bor-Sen; Lin, Ying-Po
2011-01-01
In the evolutionary process, the random transmission and mutation of genes provide biological diversities for natural selection. In order to preserve functional phenotypes between generations, gene networks need to evolve robustly under the influence of random perturbations. Therefore, the robustness of the phenotype, in the evolutionary process, exerts a selection force on gene networks to keep network functions. However, gene networks need to adjust, by variations in genetic content, to generate phenotypes for new challenges in the network’s evolution, ie, the evolvability. Hence, there should be some interplay between the evolvability and network robustness in evolutionary gene networks. In this study, the interplay between the evolvability and network robustness of a gene network and a biochemical network is discussed from a nonlinear stochastic system point of view. It was found that if the genetic robustness plus environmental robustness is less than the network robustness, the phenotype of the biological network is robust in evolution. The tradeoff between the genetic robustness and environmental robustness in evolution is discussed from the stochastic stability robustness and sensitivity of the nonlinear stochastic biological network, which may be relevant to the statistical tradeoff between bias and variance, the so-called bias/variance dilemma. Further, the tradeoff could be considered as an antagonistic pleiotropic action of a gene network and discussed from the systems biology perspective. PMID:22084563
Female mating preferences determine system-level evolution in a gene network model.
Fierst, Janna L
2013-06-01
Environmental patterns of directional, stabilizing and fluctuating selection can influence the evolution of system-level properties like evolvability and mutational robustness. Intersexual selection produces strong phenotypic selection and these dynamics may also affect the response to mutation and the potential for future adaptation. In order to to assess the influence of mating preferences on these evolutionary properties, I modeled a male trait and female preference determined by separate gene regulatory networks. I studied three sexual selection scenarios: sexual conflict, a Gaussian model of the Fisher process described in Lande (in Proc Natl Acad Sci 78(6):3721-3725, 1981) and a good genes model in which the male trait signalled his mutational condition. I measured the effects these mating preferences had on the potential for traits and preferences to evolve towards new states, and mutational robustness of both the phenotype and the individual's overall viability. All types of sexual selection increased male phenotypic robustness relative to a randomly mating population. The Fisher model also reduced male evolvability and mutational robustness for viability. Under good genes sexual selection, males evolved an increased mutational robustness for viability. Females choosing their mates is a scenario that is sufficient to create selective forces that impact genetic evolution and shape the evolutionary response to mutation and environmental selection. These dynamics will inevitably develop in any population where sexual selection is operating, and affect the potential for future adaptation.
Relevance of phenotypic noise to adaptation and evolution.
Kaneko, K; Furusawa, C
2008-09-01
Biological processes are inherently noisy, as highlighted in recent measurements of stochasticity in gene expression. Here, the authors show that such phenotypic noise is essential to the adaptation of organisms to a variety of environments and also to the evolution of robustness against mutations. First, the authors show that for any growing cell showing stochastic gene expression, the adaptive cellular state is inevitably selected by noise, without the use of a specific signal transduction network. In general, changes in any protein concentration in a cell are products of its synthesis minus dilution and degradation, both of which are proportional to the rate of cell growth. In an adaptive state, both the synthesis and dilution terms of proteins are large, and so the adaptive state is less affected by stochasticity in gene expression, whereas for a non-adaptive state, both terms are smaller, and so cells are easily knocked out of their original state by noise. This leads to a novel, generic mechanism for the selection of adaptive states. The authors have confirmed this selection by model simulations. Secondly, the authors consider the evolution of gene networks to acquire robustness of the phenotype against noise and mutation. Through simulations using a simple stochastic gene expression network that undergoes mutation and selection, the authors show that a threshold level of noise in gene expression is required for the network to acquire both types of robustness. The results reveal how the noise that cells encounter during growth and development shapes any network's robustness, not only to noise but also to mutations. The authors also establish a relationship between developmental and mutational robustness.
Identification of Conflicting Selective Effects on Highly Expressed Genes
Higgs, Paul G.; Hao, Weilong; Golding, G. Brian
2007-01-01
Many different selective effects on DNA and proteins influence the frequency of codons and amino acids in coding sequences. Selection is often stronger on highly expressed genes. Hence, by comparing high- and low-expression genes it is possible to distinguish the factors that are selected by evolution. It has been proposed that highly expressed genes should (i) preferentially use codons matching abundant tRNAs (translational efficiency), (ii) preferentially use amino acids with low cost of synthesis, (iii) be under stronger selection to maintain the required amino acid content, and (iv) be selected for translational robustness. These effects act simultaneously and can be contradictory. We develop a model that combines these factors, and use Akaike’s Information Criterion for model selection. We consider pairs of paralogues that arose by whole-genome duplication in Saccharmyces cerevisiae. A codon-based model is used that includes asymmetric effects due to selection on highly expressed genes. The largest effect is translational efficiency, which is found to strongly influence synonymous, but not non-synonymous rates. Minimization of the cost of amino acid synthesis is implicated. However, when a more general measure of selection for amino acid usage is used, the cost minimization effect becomes redundant. Small effects that we attribute to selection for translational robustness can be identified as an improvement in the model fit on top of the effects of translational efficiency and amino acid usage. PMID:19430600
Transcriptional Regulation of Pattern-Triggered Immunity in Plants.
Li, Bo; Meng, Xiangzong; Shan, Libo; He, Ping
2016-05-11
Perception of microbe-associated molecular patterns (MAMPs) by cell-surface-resident pattern recognition receptors (PRRs) induces rapid, robust, and selective transcriptional reprogramming, which is central for launching effective pattern-triggered immunity (PTI) in plants. Signal relay from PRR complexes to the nuclear transcriptional machinery via intracellular kinase cascades rapidly activates primary immune response genes. The coordinated action of gene-specific transcription factors and the general transcriptional machinery contribute to the selectivity of immune gene activation. In addition, PRR complexes and signaling components are often transcriptionally upregulated upon MAMP perception to ensure the robustness and sustainability of PTI outputs. In this review, we discuss recent advances in deciphering the signaling pathways and regulatory mechanisms that coordinately lead to timely and accurate MAMP-induced gene expression in plants. Copyright © 2016 Elsevier Inc. All rights reserved.
A novel feature extraction approach for microarray data based on multi-algorithm fusion
Jiang, Zhu; Xu, Rong
2015-01-01
Feature extraction is one of the most important and effective method to reduce dimension in data mining, with emerging of high dimensional data such as microarray gene expression data. Feature extraction for gene selection, mainly serves two purposes. One is to identify certain disease-related genes. The other is to find a compact set of discriminative genes to build a pattern classifier with reduced complexity and improved generalization capabilities. Depending on the purpose of gene selection, two types of feature extraction algorithms including ranking-based feature extraction and set-based feature extraction are employed in microarray gene expression data analysis. In ranking-based feature extraction, features are evaluated on an individual basis, without considering inter-relationship between features in general, while set-based feature extraction evaluates features based on their role in a feature set by taking into account dependency between features. Just as learning methods, feature extraction has a problem in its generalization ability, which is robustness. However, the issue of robustness is often overlooked in feature extraction. In order to improve the accuracy and robustness of feature extraction for microarray data, a novel approach based on multi-algorithm fusion is proposed. By fusing different types of feature extraction algorithms to select the feature from the samples set, the proposed approach is able to improve feature extraction performance. The new approach is tested against gene expression dataset including Colon cancer data, CNS data, DLBCL data, and Leukemia data. The testing results show that the performance of this algorithm is better than existing solutions. PMID:25780277
A novel feature extraction approach for microarray data based on multi-algorithm fusion.
Jiang, Zhu; Xu, Rong
2015-01-01
Feature extraction is one of the most important and effective method to reduce dimension in data mining, with emerging of high dimensional data such as microarray gene expression data. Feature extraction for gene selection, mainly serves two purposes. One is to identify certain disease-related genes. The other is to find a compact set of discriminative genes to build a pattern classifier with reduced complexity and improved generalization capabilities. Depending on the purpose of gene selection, two types of feature extraction algorithms including ranking-based feature extraction and set-based feature extraction are employed in microarray gene expression data analysis. In ranking-based feature extraction, features are evaluated on an individual basis, without considering inter-relationship between features in general, while set-based feature extraction evaluates features based on their role in a feature set by taking into account dependency between features. Just as learning methods, feature extraction has a problem in its generalization ability, which is robustness. However, the issue of robustness is often overlooked in feature extraction. In order to improve the accuracy and robustness of feature extraction for microarray data, a novel approach based on multi-algorithm fusion is proposed. By fusing different types of feature extraction algorithms to select the feature from the samples set, the proposed approach is able to improve feature extraction performance. The new approach is tested against gene expression dataset including Colon cancer data, CNS data, DLBCL data, and Leukemia data. The testing results show that the performance of this algorithm is better than existing solutions.
A Tightly Regulated Genetic Selection System with Signaling-Active Alleles of Phytochrome B.
Hu, Wei; Lagarias, J Clark
2017-01-01
Selectable markers derived from plant genes circumvent the potential risk of antibiotic/herbicide-resistance gene transfer into neighboring plant species, endophytic bacteria, and mycorrhizal fungi. Toward this goal, we have engineered and validated signaling-active alleles of phytochrome B (eYHB) as plant-derived selection marker genes in the model plant Arabidopsis (Arabidopsis thaliana). By probing the relationship of construct size and induction conditions to optimal phenotypic selection, we show that eYHB-based alleles are robust substitutes for antibiotic/herbicide-dependent marker genes as well as surprisingly sensitive reporters of off-target transgene expression. © 2017 American Society of Plant Biologists. All Rights Reserved.
A Tightly Regulated Genetic Selection System with Signaling-Active Alleles of Phytochrome B1[OPEN
2017-01-01
Selectable markers derived from plant genes circumvent the potential risk of antibiotic/herbicide-resistance gene transfer into neighboring plant species, endophytic bacteria, and mycorrhizal fungi. Toward this goal, we have engineered and validated signaling-active alleles of phytochrome B (eYHB) as plant-derived selection marker genes in the model plant Arabidopsis (Arabidopsis thaliana). By probing the relationship of construct size and induction conditions to optimal phenotypic selection, we show that eYHB-based alleles are robust substitutes for antibiotic/herbicide-dependent marker genes as well as surprisingly sensitive reporters of off-target transgene expression. PMID:27881727
Robust gene selection methods using weighting schemes for microarray data analysis.
Kang, Suyeon; Song, Jongwoo
2017-09-02
A common task in microarray data analysis is to identify informative genes that are differentially expressed between two different states. Owing to the high-dimensional nature of microarray data, identification of significant genes has been essential in analyzing the data. However, the performances of many gene selection techniques are highly dependent on the experimental conditions, such as the presence of measurement error or a limited number of sample replicates. We have proposed new filter-based gene selection techniques, by applying a simple modification to significance analysis of microarrays (SAM). To prove the effectiveness of the proposed method, we considered a series of synthetic datasets with different noise levels and sample sizes along with two real datasets. The following findings were made. First, our proposed methods outperform conventional methods for all simulation set-ups. In particular, our methods are much better when the given data are noisy and sample size is small. They showed relatively robust performance regardless of noise level and sample size, whereas the performance of SAM became significantly worse as the noise level became high or sample size decreased. When sufficient sample replicates were available, SAM and our methods showed similar performance. Finally, our proposed methods are competitive with traditional methods in classification tasks for microarrays. The results of simulation study and real data analysis have demonstrated that our proposed methods are effective for detecting significant genes and classification tasks, especially when the given data are noisy or have few sample replicates. By employing weighting schemes, we can obtain robust and reliable results for microarray data analysis.
Steiner, Christopher F.
2012-01-01
The ability of organisms to adapt and persist in the face of environmental change is accepted as a fundamental feature of natural systems. More contentious is whether the capacity of organisms to adapt (or “evolvability”) can itself evolve and the mechanisms underlying such responses. Using model gene networks, I provide evidence that evolvability emerges more readily when populations experience positively autocorrelated environmental noise (red noise) compared to populations in stable or randomly varying (white noise) environments. Evolvability was correlated with increasing genetic robustness to effects on network viability and decreasing robustness to effects on phenotypic expression; populations whose networks displayed greater viability robustness and lower phenotypic robustness produced more additive genetic variation and adapted more rapidly in novel environments. Patterns of selection for robustness varied antagonistically with epistatic effects of mutations on viability and phenotypic expression, suggesting that trade-offs between these properties may constrain their evolutionary responses. Evolution of evolvability and robustness was stronger in sexual populations compared to asexual populations indicating that enhanced genetic variation under fluctuating selection combined with recombination load is a primary driver of the emergence of evolvability. These results provide insight into the mechanisms potentially underlying rapid adaptation as well as the environmental conditions that drive the evolution of genetic interactions. PMID:23284934
Pervasive positive selection on duplicated and nonduplicated vertebrate protein coding genes.
Studer, Romain A; Penel, Simon; Duret, Laurent; Robinson-Rechavi, Marc
2008-09-01
A stringent branch-site codon model was used to detect positive selection in vertebrate evolution. We show that the test is robust to the large evolutionary distances involved. Positive selection was detected in 77% of 884 genes studied. Most positive selection concerns a few sites on a single branch of the phylogenetic tree: Between 0.9% and 4.7% of sites are affected by positive selection depending on the branches. No functional category was overrepresented among genes under positive selection. Surprisingly, whole genome duplication had no effect on the prevalence of positive selection, whether the fish-specific genome duplication or the two rounds at the origin of vertebrates. Thus positive selection has not been limited to a few gene classes, or to specific evolutionary events such as duplication, but has been pervasive during vertebrate evolution.
Robust Design of Biological Circuits: Evolutionary Systems Biology Approach
Chen, Bor-Sen; Hsu, Chih-Yuan; Liou, Jing-Jia
2011-01-01
Artificial gene circuits have been proposed to be embedded into microbial cells that function as switches, timers, oscillators, and the Boolean logic gates. Building more complex systems from these basic gene circuit components is one key advance for biologic circuit design and synthetic biology. However, the behavior of bioengineered gene circuits remains unstable and uncertain. In this study, a nonlinear stochastic system is proposed to model the biological systems with intrinsic parameter fluctuations and environmental molecular noise from the cellular context in the host cell. Based on evolutionary systems biology algorithm, the design parameters of target gene circuits can evolve to specific values in order to robustly track a desired biologic function in spite of intrinsic and environmental noise. The fitness function is selected to be inversely proportional to the tracking error so that the evolutionary biological circuit can achieve the optimal tracking mimicking the evolutionary process of a gene circuit. Finally, several design examples are given in silico with the Monte Carlo simulation to illustrate the design procedure and to confirm the robust performance of the proposed design method. The result shows that the designed gene circuits can robustly track desired behaviors with minimal errors even with nontrivial intrinsic and external noise. PMID:22187523
Robust design of biological circuits: evolutionary systems biology approach.
Chen, Bor-Sen; Hsu, Chih-Yuan; Liou, Jing-Jia
2011-01-01
Artificial gene circuits have been proposed to be embedded into microbial cells that function as switches, timers, oscillators, and the Boolean logic gates. Building more complex systems from these basic gene circuit components is one key advance for biologic circuit design and synthetic biology. However, the behavior of bioengineered gene circuits remains unstable and uncertain. In this study, a nonlinear stochastic system is proposed to model the biological systems with intrinsic parameter fluctuations and environmental molecular noise from the cellular context in the host cell. Based on evolutionary systems biology algorithm, the design parameters of target gene circuits can evolve to specific values in order to robustly track a desired biologic function in spite of intrinsic and environmental noise. The fitness function is selected to be inversely proportional to the tracking error so that the evolutionary biological circuit can achieve the optimal tracking mimicking the evolutionary process of a gene circuit. Finally, several design examples are given in silico with the Monte Carlo simulation to illustrate the design procedure and to confirm the robust performance of the proposed design method. The result shows that the designed gene circuits can robustly track desired behaviors with minimal errors even with nontrivial intrinsic and external noise.
Tao, Yongxin; van Peer, Arend Frans; Huang, Qianhui; Shao, Yanping; Zhang, Lei; Xie, Bin; Jiang, Yuji; Zhu, Jian; Xie, Baogui
2016-07-12
The selection of appropriate internal control genes (ICGs) is a crucial step in the normalization of real-time quantitative PCR (RT-qPCR) data. Housekeeping genes are habitually selected for this purpose, despite accumulating evidence on their instability. We screened for novel, robust ICGs in the mushroom forming fungus Volvariella volvacea. Nine commonly used and five newly selected ICGs were evaluated for expression stability using RT-qPCR data in eight different stages of the life cycle of V. volvacea. Three different algorithms consistently determined that three novel ICGs (SPRYp, Ras and Vps26) exhibited the highest expression stability in V. volvacea. Subsequent analysis of ICGs in twenty-four expression profiles from nine filamentous fungi revealed that Ras was the most stable ICG amongst the Basidiomycetous samples, followed by SPRYp, Vps26 and ACTB. Vps26 was expressed most stably within the analyzed data of Ascomycetes, followed by HH3 and β-TUB. No ICG was universally stable for all fungal species, or for all experimental conditions within a species. Ultimately, the choice of an ICG will depend on a specific set of experiments. This study provides novel, robust ICGs for Basidiomycetes and Ascomycetes. Together with the presented guiding principles, this enables the efficient selection of suitable ICGs for RT-qPCR.
Bhanot, Gyan; Alexe, Gabriela; Levine, Arnold J; Stolovitzky, Gustavo
2005-01-01
A major challenge in cancer diagnosis from microarray data is the need for robust, accurate, classification models which are independent of the analysis techniques used and can combine data from different laboratories. We propose such a classification scheme originally developed for phenotype identification from mass spectrometry data. The method uses a robust multivariate gene selection procedure and combines the results of several machine learning tools trained on raw and pattern data to produce an accurate meta-classifier. We illustrate and validate our method by applying it to gene expression datasets: the oligonucleotide HuGeneFL microarray dataset of Shipp et al. (www.genome.wi.mit.du/MPR/lymphoma) and the Hu95Av2 Affymetrix dataset (DallaFavera's laboratory, Columbia University). Our pattern-based meta-classification technique achieves higher predictive accuracies than each of the individual classifiers , is robust against data perturbations and provides subsets of related predictive genes. Our techniques predict that combinations of some genes in the p53 pathway are highly predictive of phenotype. In particular, we find that in 80% of DLBCL cases the mRNA level of at least one of the three genes p53, PLK1 and CDK2 is elevated, while in 80% of FL cases, the mRNA level of at most one of them is elevated.
Evolution and inheritance of early embryonic patterning in D. simulans and D. sechellia
Lott, Susan E.; Ludwig, Michael Z.; Kreitman, Martin
2010-01-01
Pattern formation in Drosophila is a widely studied example of a robust developmental system. Such robust systems pose a challenge to adaptive evolution, as they mask variation which selection may otherwise act upon. Yet we find variation in the localization of expression domains (henceforth ‘stripe allometry’) in the pattern formation pathway. Specifically, we characterize differences in the gap genes giant and Kruppel, and the pair-rule gene even-skipped, which differ between the sibling species D. simulans and D. sechellia. In a double-backcross experiment, stripe allometry is consistent with maternal inheritance of stripe positioning and multiple genetic factors, with a distinct genetic basis from embryo length. Embryos produced by F1 and F2 backcross mothers exhibit novel spatial patterns of gene expression relative to the parental species, with no measurable increase in positional variance among individuals. Buffering of novel spatial patterns in the backcross genotypes suggests that robustness need not be disrupted in order for the trait to evolve, and perhaps the system is incapable of evolving to prevent the expression of all genetic variation. This limitation, and the ability of natural selection to act on minute genetic differences that are within the “margin of error” for the buffering mechanism, indicates that developmentally buffered traits can evolve without disruption of robustness PMID:21121913
Kadota, Koji; Konishi, Tomokazu; Shimizu, Kentaro
2007-05-01
Large-scale expression profiling using DNA microarrays enables identification of tissue-selective genes for which expression is considerably higher and/or lower in some tissues than in others. Among numerous possible methods, only two outlier-detection-based methods (an AIC-based method and Sprent's non-parametric method) can treat equally various types of selective patterns, but they produce substantially different results. We investigated the performance of these two methods for different parameter settings and for a reduced number of samples. We focused on their ability to detect selective expression patterns robustly. We applied them to public microarray data collected from 36 normal human tissue samples and analyzed the effects of both changing the parameter settings and reducing the number of samples. The AIC-based method was more robust in both cases. The findings confirm that the use of the AIC-based method in the recently proposed ROKU method for detecting tissue-selective expression patterns is correct and that Sprent's method is not suitable for ROKU.
Stability Depends on Positive Autoregulation in Boolean Gene Regulatory Networks
Pinho, Ricardo; Garcia, Victor; Irimia, Manuel; Feldman, Marcus W.
2014-01-01
Network motifs have been identified as building blocks of regulatory networks, including gene regulatory networks (GRNs). The most basic motif, autoregulation, has been associated with bistability (when positive) and with homeostasis and robustness to noise (when negative), but its general importance in network behavior is poorly understood. Moreover, how specific autoregulatory motifs are selected during evolution and how this relates to robustness is largely unknown. Here, we used a class of GRN models, Boolean networks, to investigate the relationship between autoregulation and network stability and robustness under various conditions. We ran evolutionary simulation experiments for different models of selection, including mutation and recombination. Each generation simulated the development of a population of organisms modeled by GRNs. We found that stability and robustness positively correlate with autoregulation; in all investigated scenarios, stable networks had mostly positive autoregulation. Assuming biological networks correspond to stable networks, these results suggest that biological networks should often be dominated by positive autoregulatory loops. This seems to be the case for most studied eukaryotic transcription factor networks, including those in yeast, flies and mammals. PMID:25375153
Strain-Dependent Transcriptome Signatures for Robustness in Lactococcus lactis
Dijkstra, Annereinou R.; Alkema, Wynand; Starrenburg, Marjo J. C.; van Hijum, Sacha A. F. T.; Bron, Peter A.
2016-01-01
Recently, we demonstrated that fermentation conditions have a strong impact on subsequent survival of Lactococcus lactis strain MG1363 during heat and oxidative stress, two important parameters during spray drying. Moreover, employment of a transcriptome-phenotype matching approach revealed groups of genes associated with robustness towards heat and/or oxidative stress. To investigate if other strains have similar or distinct transcriptome signatures for robustness, we applied an identical transcriptome-robustness phenotype matching approach on the L. lactis strains IL1403, KF147 and SK11, which have previously been demonstrated to display highly diverse robustness phenotypes. These strains were subjected to an identical fermentation regime as was performed earlier for strain MG1363 and consisted of twelve conditions, varying in the level of salt and/or oxygen, as well as fermentation temperature and pH. In the exponential phase of growth, cells were harvested for transcriptome analysis and assessment of heat and oxidative stress survival phenotypes. The variation in fermentation conditions resulted in differences in heat and oxidative stress survival of up to five 10-log units. Effects of the fermentation conditions on stress survival of the L. lactis strains were typically strain-dependent, although the fermentation conditions had mainly similar effects on the growth characteristics of the different strains. By association of the transcriptomes and robustness phenotypes highly strain-specific transcriptome signatures for robustness towards heat and oxidative stress were identified, indicating that multiple mechanisms exist to increase robustness and, as a consequence, robustness of each strain requires individual optimization. However, a relatively small overlap in the transcriptome responses of the strains was also identified and this generic transcriptome signature included genes previously associated with stress (ctsR and lplL) and novel genes, including nanE and genes encoding transport proteins. The transcript levels of these genes can function as indicators of robustness and could aid in selection of fermentation parameters, potentially resulting in more optimal robustness during spray drying. PMID:27973578
Tsuda, Kenichi; Mine, Akira; Bethke, Gerit; Igarashi, Daisuke; Botanga, Christopher J; Tsuda, Yayoi; Glazebrook, Jane; Sato, Masanao; Katagiri, Fumiaki
2013-01-01
Network robustness is a crucial property of the plant immune signaling network because pathogens are under a strong selection pressure to perturb plant network components to dampen plant immune responses. Nevertheless, modulation of network robustness is an area of network biology that has rarely been explored. While two modes of plant immunity, Effector-Triggered Immunity (ETI) and Pattern-Triggered Immunity (PTI), extensively share signaling machinery, the network output is much more robust against perturbations during ETI than PTI, suggesting modulation of network robustness. Here, we report a molecular mechanism underlying the modulation of the network robustness in Arabidopsis thaliana. The salicylic acid (SA) signaling sector regulates a major portion of the plant immune response and is important in immunity against biotrophic and hemibiotrophic pathogens. In Arabidopsis, SA signaling was required for the proper regulation of the vast majority of SA-responsive genes during PTI. However, during ETI, regulation of most SA-responsive genes, including the canonical SA marker gene PR1, could be controlled by SA-independent mechanisms as well as by SA. The activation of the two immune-related MAPKs, MPK3 and MPK6, persisted for several hours during ETI but less than one hour during PTI. Sustained MAPK activation was sufficient to confer SA-independent regulation of most SA-responsive genes. Furthermore, the MPK3 and SA signaling sectors were compensatory to each other for inhibition of bacterial growth as well as for PR1 expression during ETI. These results indicate that the duration of the MAPK activation is a critical determinant for modulation of robustness of the immune signaling network. Our findings with the plant immune signaling network imply that the robustness level of a biological network can be modulated by the activities of network components.
Recursive feature selection with significant variables of support vectors.
Tsai, Chen-An; Huang, Chien-Hsun; Chang, Ching-Wei; Chen, Chun-Houh
2012-01-01
The development of DNA microarray makes researchers screen thousands of genes simultaneously and it also helps determine high- and low-expression level genes in normal and disease tissues. Selecting relevant genes for cancer classification is an important issue. Most of the gene selection methods use univariate ranking criteria and arbitrarily choose a threshold to choose genes. However, the parameter setting may not be compatible to the selected classification algorithms. In this paper, we propose a new gene selection method (SVM-t) based on the use of t-statistics embedded in support vector machine. We compared the performance to two similar SVM-based methods: SVM recursive feature elimination (SVMRFE) and recursive support vector machine (RSVM). The three methods were compared based on extensive simulation experiments and analyses of two published microarray datasets. In the simulation experiments, we found that the proposed method is more robust in selecting informative genes than SVMRFE and RSVM and capable to attain good classification performance when the variations of informative and noninformative genes are different. In the analysis of two microarray datasets, the proposed method yields better performance in identifying fewer genes with good prediction accuracy, compared to SVMRFE and RSVM.
Fuzzy support vector machine: an efficient rule-based classification technique for microarrays.
Hajiloo, Mohsen; Rabiee, Hamid R; Anooshahpour, Mahdi
2013-01-01
The abundance of gene expression microarray data has led to the development of machine learning algorithms applicable for tackling disease diagnosis, disease prognosis, and treatment selection problems. However, these algorithms often produce classifiers with weaknesses in terms of accuracy, robustness, and interpretability. This paper introduces fuzzy support vector machine which is a learning algorithm based on combination of fuzzy classifiers and kernel machines for microarray classification. Experimental results on public leukemia, prostate, and colon cancer datasets show that fuzzy support vector machine applied in combination with filter or wrapper feature selection methods develops a robust model with higher accuracy than the conventional microarray classification models such as support vector machine, artificial neural network, decision trees, k nearest neighbors, and diagonal linear discriminant analysis. Furthermore, the interpretable rule-base inferred from fuzzy support vector machine helps extracting biological knowledge from microarray data. Fuzzy support vector machine as a new classification model with high generalization power, robustness, and good interpretability seems to be a promising tool for gene expression microarray classification.
Evolution and inheritance of early embryonic patterning in Drosophila simulans and D. sechellia.
Lott, Susan E; Ludwig, Michael Z; Kreitman, Martin
2011-05-01
Pattern formation in Drosophila is a widely studied example of a robust developmental system. Such robust systems pose a challenge to adaptive evolution, as they mask variation that selection may otherwise act upon. Yet we find variation in the localization of expression domains (henceforth "stripe allometry") in the pattern formation pathway. Specifically, we characterize differences in the gap genes giant and Kruppel, and the pair-rule gene even-skipped, which differ between the sibling species Drosophila simulans and D. sechellia. In a double-backcross experiment, stripe allometry is consistent with maternal inheritance of stripe positioning and multiple genetic factors, with a distinct genetic basis from embryo length. Embryos produced by F1 and F2 backcross mothers exhibit novel spatial patterns of gene expression relative to the parental species, with no measurable increase in positional variance among individuals. Buffering of novel spatial patterns in the backcross genotypes suggests that robustness need not be disrupted in order for the trait to evolve, and perhaps the system is incapable of evolving to prevent the expression of all genetic variation. This limitation, and the ability of natural selection to act on minute genetic differences that are within the "margin of error" for the buffering mechanism, indicates that developmentally buffered traits can evolve without disruption of robustness. © 2010 The Author(s). Evolution© 2010 The Society for the Study of Evolution.
Lu, Tao
2016-01-01
The gene regulation network (GRN) evaluates the interactions between genes and look for models to describe the gene expression behavior. These models have many applications; for instance, by characterizing the gene expression mechanisms that cause certain disorders, it would be possible to target those genes to block the progress of the disease. Many biological processes are driven by nonlinear dynamic GRN. In this article, we propose a nonparametric differential equation (ODE) to model the nonlinear dynamic GRN. Specially, we address following questions simultaneously: (i) extract information from noisy time course gene expression data; (ii) model the nonlinear ODE through a nonparametric smoothing function; (iii) identify the important regulatory gene(s) through a group smoothly clipped absolute deviation (SCAD) approach; (iv) test the robustness of the model against possible shortening of experimental duration. We illustrate the usefulness of the model and associated statistical methods through a simulation and a real application examples.
Kadota, Koji; Konishi, Tomokazu; Shimizu, Kentaro
2007-01-01
Large-scale expression profiling using DNA microarrays enables identification of tissue-selective genes for which expression is considerably higher and/or lower in some tissues than in others. Among numerous possible methods, only two outlier-detection-based methods (an AIC-based method and Sprent’s non-parametric method) can treat equally various types of selective patterns, but they produce substantially different results. We investigated the performance of these two methods for different parameter settings and for a reduced number of samples. We focused on their ability to detect selective expression patterns robustly. We applied them to public microarray data collected from 36 normal human tissue samples and analyzed the effects of both changing the parameter settings and reducing the number of samples. The AIC-based method was more robust in both cases. The findings confirm that the use of the AIC-based method in the recently proposed ROKU method for detecting tissue-selective expression patterns is correct and that Sprent’s method is not suitable for ROKU. PMID:19936074
Modelling the influence of parental effects on gene-network evolution.
Odorico, Andreas; Rünneburger, Estelle; Le Rouzic, Arnaud
2018-05-01
Understanding the importance of nongenetic heredity in the evolutionary process is a major topic in modern evolutionary biology. We modified a classical gene-network model by allowing parental transmission of gene expression and studied its evolutionary properties through individual-based simulations. We identified ontogenetic time (i.e. the time gene networks have to stabilize before being submitted to natural selection) as a crucial factor in determining the evolutionary impact of this phenotypic inheritance. Indeed, fast-developing organisms display enhanced adaptation and greater robustness to mutations when evolving in presence of nongenetic inheritance (NGI). In contrast, in our model, long development reduces the influence of the inherited state of the gene network. NGI thus had a negligible effect on the evolution of gene networks when the speed at which transcription levels reach equilibrium is not constrained. Nevertheless, simulations show that intergenerational transmission of the gene-network state negatively affects the evolution of robustness to environmental disturbances for either fast- or slow-developing organisms. Therefore, these results suggest that the evolutionary consequences of NGI might not be sought only in the way species respond to selection, but also on the evolution of emergent properties (such as environmental and genetic canalization) in complex genetic architectures. © 2018 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2018 European Society For Evolutionary Biology.
Fann, Monchou; Godlove, Jason M.; Catalfamo, Marta; Wood, William H.; Chrest, Francis J.; Chun, Nicholas; Granger, Larry; Wersto, Robert; Madara, Karen; Becker, Kevin; Henkart, Pierre A.; Weng, Nan-ping
2006-01-01
To understand the molecular basis for the rapid and robust memory T-cell responses, we examined gene expression and chromatin modification by histone H3 lysine 9 (H3K9) acetylation in resting and activated human naive and memory CD8+ T cells. We found that, although overall gene expression patterns were similar, a number of genes are differentially expressed in either memory or naive cells in their resting and activated states. To further elucidate the basis for differential gene expression, we assessed the role of histone H3K9 acetylation in differential gene expression. Strikingly, higher H3K9 acetylation levels were detected in resting memory cells, prior to their activation, for those genes that were differentially expressed following activation, indicating that hyperacetylation of histone H3K9 may play a role in selective and rapid gene expression of memory CD8+ T cells. Consistent with this model, we showed that inducing high levels of H3K9 acetylation resulted in an increased expression in naive cells of those genes that are normally expressed differentially in memory cells. Together, these findings suggest that differential gene expression mediated at least in part by histone H3K9 hyperacetylation may be responsible for the rapid and robust memory CD8+ T-cell response. PMID:16868257
Microarray-based cancer prediction using soft computing approach.
Wang, Xiaosheng; Gotoh, Osamu
2009-05-26
One of the difficulties in using gene expression profiles to predict cancer is how to effectively select a few informative genes to construct accurate prediction models from thousands or ten thousands of genes. We screen highly discriminative genes and gene pairs to create simple prediction models involved in single genes or gene pairs on the basis of soft computing approach and rough set theory. Accurate cancerous prediction is obtained when we apply the simple prediction models for four cancerous gene expression datasets: CNS tumor, colon tumor, lung cancer and DLBCL. Some genes closely correlated with the pathogenesis of specific or general cancers are identified. In contrast with other models, our models are simple, effective and robust. Meanwhile, our models are interpretable for they are based on decision rules. Our results demonstrate that very simple models may perform well on cancerous molecular prediction and important gene markers of cancer can be detected if the gene selection approach is chosen reasonably.
Xi, Jianing; Wang, Minghui; Li, Ao
2018-06-05
Discovery of mutated driver genes is one of the primary objective for studying tumorigenesis. To discover some relatively low frequently mutated driver genes from somatic mutation data, many existing methods incorporate interaction network as prior information. However, the prior information of mRNA expression patterns are not exploited by these existing network-based methods, which is also proven to be highly informative of cancer progressions. To incorporate prior information from both interaction network and mRNA expressions, we propose a robust and sparse co-regularized nonnegative matrix factorization to discover driver genes from mutation data. Furthermore, our framework also conducts Frobenius norm regularization to overcome overfitting issue. Sparsity-inducing penalty is employed to obtain sparse scores in gene representations, of which the top scored genes are selected as driver candidates. Evaluation experiments by known benchmarking genes indicate that the performance of our method benefits from the two type of prior information. Our method also outperforms the existing network-based methods, and detect some driver genes that are not predicted by the competing methods. In summary, our proposed method can improve the performance of driver gene discovery by effectively incorporating prior information from interaction network and mRNA expression patterns into a robust and sparse co-regularized matrix factorization framework.
Eveno, Emmanuelle; Collada, Carmen; Guevara, M Angeles; Léger, Valérie; Soto, Alvaro; Díaz, Luis; Léger, Patrick; González-Martínez, Santiago C; Cervera, M Teresa; Plomion, Christophe; Garnier-Géré, Pauline H
2008-02-01
The importance of natural selection for shaping adaptive trait differentiation among natural populations of allogamous tree species has long been recognized. Determining the molecular basis of local adaptation remains largely unresolved, and the respective roles of selection and demography in shaping population structure are actively debated. Using a multilocus scan that aims to detect outliers from simulated neutral expectations, we analyzed patterns of nucleotide diversity and genetic differentiation at 11 polymorphic candidate genes for drought stress tolerance in phenotypically contrasted Pinus pinaster Ait. populations across its geographical range. We compared 3 coalescent-based methods: 2 frequentist-like, including 1 approach specifically developed for biallelic single nucleotide polymorphisms (SNPs) here and 1 Bayesian. Five genes showed outlier patterns that were robust across methods at the haplotype level for 2 of them. Two genes presented higher F(ST) values than expected (PR-AGP4 and erd3), suggesting that they could have been affected by the action of diversifying selection among populations. In contrast, 3 genes presented lower F(ST) values than expected (dhn-1, dhn2, and lp3-1), which could represent signatures of homogenizing selection among populations. A smaller proportion of outliers were detected at the SNP level suggesting the potential functional significance of particular combinations of sites in drought-response candidate genes. The Bayesian method appeared robust to low sample sizes, flexible to assumptions regarding migration rates, and powerful for detecting selection at the haplotype level, but the frequentist-like method adapted to SNPs was more efficient for the identification of outlier SNPs showing low differentiation. Population-specific effects estimated in the Bayesian method also revealed populations with lower immigration rates, which could have led to favorable situations for local adaptation. Outlier patterns are discussed in relation to the different genes' putative involvement in drought tolerance responses, from published results in transcriptomics and association mapping in P. pinaster and other related species. These genes clearly constitute relevant candidates for future association studies in P. pinaster.
Feature weight estimation for gene selection: a local hyperlinear learning approach
2014-01-01
Background Modeling high-dimensional data involving thousands of variables is particularly important for gene expression profiling experiments, nevertheless,it remains a challenging task. One of the challenges is to implement an effective method for selecting a small set of relevant genes, buried in high-dimensional irrelevant noises. RELIEF is a popular and widely used approach for feature selection owing to its low computational cost and high accuracy. However, RELIEF based methods suffer from instability, especially in the presence of noisy and/or high-dimensional outliers. Results We propose an innovative feature weighting algorithm, called LHR, to select informative genes from highly noisy data. LHR is based on RELIEF for feature weighting using classical margin maximization. The key idea of LHR is to estimate the feature weights through local approximation rather than global measurement, which is typically used in existing methods. The weights obtained by our method are very robust in terms of degradation of noisy features, even those with vast dimensions. To demonstrate the performance of our method, extensive experiments involving classification tests have been carried out on both synthetic and real microarray benchmark datasets by combining the proposed technique with standard classifiers, including the support vector machine (SVM), k-nearest neighbor (KNN), hyperplane k-nearest neighbor (HKNN), linear discriminant analysis (LDA) and naive Bayes (NB). Conclusion Experiments on both synthetic and real-world datasets demonstrate the superior performance of the proposed feature selection method combined with supervised learning in three aspects: 1) high classification accuracy, 2) excellent robustness to noise and 3) good stability using to various classification algorithms. PMID:24625071
Evolutionary Origins of Cancer Driver Genes and Implications for Cancer Prognosis
Chu, Xin-Yi; Zhou, Xiong-Hui; Cui, Ze-Jia; Zhang, Hong-Yu
2017-01-01
The cancer atavistic theory suggests that carcinogenesis is a reverse evolution process. It is thus of great interest to explore the evolutionary origins of cancer driver genes and the relevant mechanisms underlying the carcinogenesis. Moreover, the evolutionary features of cancer driver genes could be helpful in selecting cancer biomarkers from high-throughput data. In this study, through analyzing the cancer endogenous molecular networks, we revealed that the subnetwork originating from eukaryota could control the unlimited proliferation of cancer cells, and the subnetwork originating from eumetazoa could recapitulate the other hallmarks of cancer. In addition, investigations based on multiple datasets revealed that cancer driver genes were enriched in genes originating from eukaryota, opisthokonta, and eumetazoa. These results have important implications for enhancing the robustness of cancer prognosis models through selecting the gene signatures by the gene age information. PMID:28708071
Evolutionary Origins of Cancer Driver Genes and Implications for Cancer Prognosis.
Chu, Xin-Yi; Jiang, Ling-Han; Zhou, Xiong-Hui; Cui, Ze-Jia; Zhang, Hong-Yu
2017-07-14
The cancer atavistic theory suggests that carcinogenesis is a reverse evolution process. It is thus of great interest to explore the evolutionary origins of cancer driver genes and the relevant mechanisms underlying the carcinogenesis. Moreover, the evolutionary features of cancer driver genes could be helpful in selecting cancer biomarkers from high-throughput data. In this study, through analyzing the cancer endogenous molecular networks, we revealed that the subnetwork originating from eukaryota could control the unlimited proliferation of cancer cells, and the subnetwork originating from eumetazoa could recapitulate the other hallmarks of cancer. In addition, investigations based on multiple datasets revealed that cancer driver genes were enriched in genes originating from eukaryota, opisthokonta, and eumetazoa. These results have important implications for enhancing the robustness of cancer prognosis models through selecting the gene signatures by the gene age information.
Auffret, Marc D.; Stewart, Robert; Dewhurst, Richard J.; Duthie, Carol-Anne; Rooke, John A.; Wallace, Robert J.; Freeman, Tom C.; Snelling, Timothy J.; Watson, Mick; Roehe, Rainer
2018-01-01
Previous shotgun metagenomic analyses of ruminal digesta identified some microbial information that might be useful as biomarkers to select cattle that emit less methane (CH4), which is a potent greenhouse gas. It is known that methane production (g/kgDMI) and to an extent the microbial community is heritable and therefore biomarkers can offer a method of selecting cattle for low methane emitting phenotypes. In this study a wider range of Bos Taurus cattle, varying in breed and diet, was investigated to determine microbial communities and genetic markers associated with high/low CH4 emissions. Digesta samples were taken from 50 beef cattle, comprising four cattle breeds, receiving two basal diets containing different proportions of concentrate and also including feed additives (nitrate or lipid), that may influence methane emissions. A combination of partial least square analysis and network analysis enabled the identification of the most significant and robust biomarkers of CH4 emissions (VIP > 0.8) across diets and breeds when comparing all potential biomarkers together. Genes associated with the hydrogenotrophic methanogenesis pathway converting carbon dioxide to methane, provided the dominant biomarkers of CH4 emissions and methanogens were the microbial populations most closely correlated with CH4 emissions and identified by metagenomics. Moreover, these genes grouped together as confirmed by network analysis for each independent experiment and when combined. Finally, the genes involved in the methane synthesis pathway explained a higher proportion of variation in CH4 emissions by PLS analysis compared to phylogenetic parameters or functional genes. These results confirmed the reproducibility of the analysis and the advantage to use these genes as robust biomarkers of CH4 emissions. Volatile fatty acid concentrations and ratios were significantly correlated with CH4, but these factors were not identified as robust enough for predictive purposes. Moreover, the methanotrophic Methylomonas genus was found to be negatively correlated with CH4. Finally, this study confirmed the importance of using robust and applicable biomarkers from the microbiome as a proxy of CH4 emissions across diverse production systems and environments. PMID:29375511
Auffret, Marc D; Stewart, Robert; Dewhurst, Richard J; Duthie, Carol-Anne; Rooke, John A; Wallace, Robert J; Freeman, Tom C; Snelling, Timothy J; Watson, Mick; Roehe, Rainer
2017-01-01
Previous shotgun metagenomic analyses of ruminal digesta identified some microbial information that might be useful as biomarkers to select cattle that emit less methane (CH 4 ), which is a potent greenhouse gas. It is known that methane production (g/kgDMI) and to an extent the microbial community is heritable and therefore biomarkers can offer a method of selecting cattle for low methane emitting phenotypes. In this study a wider range of Bos Taurus cattle, varying in breed and diet, was investigated to determine microbial communities and genetic markers associated with high/low CH 4 emissions. Digesta samples were taken from 50 beef cattle, comprising four cattle breeds, receiving two basal diets containing different proportions of concentrate and also including feed additives (nitrate or lipid), that may influence methane emissions. A combination of partial least square analysis and network analysis enabled the identification of the most significant and robust biomarkers of CH 4 emissions (VIP > 0.8) across diets and breeds when comparing all potential biomarkers together. Genes associated with the hydrogenotrophic methanogenesis pathway converting carbon dioxide to methane, provided the dominant biomarkers of CH 4 emissions and methanogens were the microbial populations most closely correlated with CH 4 emissions and identified by metagenomics. Moreover, these genes grouped together as confirmed by network analysis for each independent experiment and when combined. Finally, the genes involved in the methane synthesis pathway explained a higher proportion of variation in CH 4 emissions by PLS analysis compared to phylogenetic parameters or functional genes. These results confirmed the reproducibility of the analysis and the advantage to use these genes as robust biomarkers of CH 4 emissions. Volatile fatty acid concentrations and ratios were significantly correlated with CH 4 , but these factors were not identified as robust enough for predictive purposes. Moreover, the methanotrophic Methylomonas genus was found to be negatively correlated with CH 4 . Finally, this study confirmed the importance of using robust and applicable biomarkers from the microbiome as a proxy of CH 4 emissions across diverse production systems and environments.
NASA Astrophysics Data System (ADS)
Faure, Guilhem; Koonin, Eugene V.
2015-05-01
Robustness to destabilizing effects of mutations is thought of as a key factor of protein evolution. The connections between two measures of robustness, the relative core size and the computationally estimated effect of mutations on protein stability (ΔΔG), protein abundance and the selection pressure on protein-coding genes (dN/dS) were analyzed for the organisms with a large number of available protein structures including four eukaryotes, two bacteria and one archaeon. The distribution of the effects of mutations in the core on protein stability is universal and indistinguishable in eukaryotes and bacteria, centered at slightly destabilizing amino acid replacements, and with a heavy tail of more strongly destabilizing replacements. The distribution of mutational effects in the hyperthermophilic archaeon Thermococcus gammatolerans is significantly shifted toward strongly destabilizing replacements which is indicative of stronger constraints that are imposed on proteins in hyperthermophiles. The median effect of mutations is strongly, positively correlated with the relative core size, in evidence of the congruence between the two measures of protein robustness. However, both measures show only limited correlations to the expression level and selection pressure on protein-coding genes. Thus, the degree of robustness reflected in the universal distribution of mutational effects appears to be a fundamental, ancient feature of globular protein folds whereas the observed variations are largely neutral and uncoupled from short term protein evolution. A weak anticorrelation between protein core size and selection pressure is observed only for surface residues in prokaryotes but a stronger anticorrelation is observed for all residues in eukaryotic proteins. This substantial difference between proteins of prokaryotes and eukaryotes is likely to stem from the demonstrable higher compactness of prokaryotic proteins.
Rue-Albrecht, Kévin; McGettigan, Paul A; Hernández, Belinda; Nalpas, Nicolas C; Magee, David A; Parnell, Andrew C; Gordon, Stephen V; MacHugh, David E
2016-03-11
Identification of gene expression profiles that differentiate experimental groups is critical for discovery and analysis of key molecular pathways and also for selection of robust diagnostic or prognostic biomarkers. While integration of differential expression statistics has been used to refine gene set enrichment analyses, such approaches are typically limited to single gene lists resulting from simple two-group comparisons or time-series analyses. In contrast, functional class scoring and machine learning approaches provide powerful alternative methods to leverage molecular measurements for pathway analyses, and to compare continuous and multi-level categorical factors. We introduce GOexpress, a software package for scoring and summarising the capacity of gene ontology features to simultaneously classify samples from multiple experimental groups. GOexpress integrates normalised gene expression data (e.g., from microarray and RNA-seq experiments) and phenotypic information of individual samples with gene ontology annotations to derive a ranking of genes and gene ontology terms using a supervised learning approach. The default random forest algorithm allows interactions between all experimental factors, and competitive scoring of expressed genes to evaluate their relative importance in classifying predefined groups of samples. GOexpress enables rapid identification and visualisation of ontology-related gene panels that robustly classify groups of samples and supports both categorical (e.g., infection status, treatment) and continuous (e.g., time-series, drug concentrations) experimental factors. The use of standard Bioconductor extension packages and publicly available gene ontology annotations facilitates straightforward integration of GOexpress within existing computational biology pipelines.
Statistical Analysis of Big Data on Pharmacogenomics
Fan, Jianqing; Liu, Han
2013-01-01
This paper discusses statistical methods for estimating complex correlation structure from large pharmacogenomic datasets. We selectively review several prominent statistical methods for estimating large covariance matrix for understanding correlation structure, inverse covariance matrix for network modeling, large-scale simultaneous tests for selecting significantly differently expressed genes and proteins and genetic markers for complex diseases, and high dimensional variable selection for identifying important molecules for understanding molecule mechanisms in pharmacogenomics. Their applications to gene network estimation and biomarker selection are used to illustrate the methodological power. Several new challenges of Big data analysis, including complex data distribution, missing data, measurement error, spurious correlation, endogeneity, and the need for robust statistical methods, are also discussed. PMID:23602905
2012-01-01
Background Previous studies on tumor classification based on gene expression profiles suggest that gene selection plays a key role in improving the classification performance. Moreover, finding important tumor-related genes with the highest accuracy is a very important task because these genes might serve as tumor biomarkers, which is of great benefit to not only tumor molecular diagnosis but also drug development. Results This paper proposes a novel gene selection method with rich biomedical meaning based on Heuristic Breadth-first Search Algorithm (HBSA) to find as many optimal gene subsets as possible. Due to the curse of dimensionality, this type of method could suffer from over-fitting and selection bias problems. To address these potential problems, a HBSA-based ensemble classifier is constructed using majority voting strategy from individual classifiers constructed by the selected gene subsets, and a novel HBSA-based gene ranking method is designed to find important tumor-related genes by measuring the significance of genes using their occurrence frequencies in the selected gene subsets. The experimental results on nine tumor datasets including three pairs of cross-platform datasets indicate that the proposed method can not only obtain better generalization performance but also find many important tumor-related genes. Conclusions It is found that the frequencies of the selected genes follow a power-law distribution, indicating that only a few top-ranked genes can be used as potential diagnosis biomarkers. Moreover, the top-ranked genes leading to very high prediction accuracy are closely related to specific tumor subtype and even hub genes. Compared with other related methods, the proposed method can achieve higher prediction accuracy with fewer genes. Moreover, they are further justified by analyzing the top-ranked genes in the context of individual gene function, biological pathway, and protein-protein interaction network. PMID:22830977
Weber, K; Mock, U; Petrowitz, B; Bartsch, U; Fehse, B
2010-04-01
Vector-encoded fluorescent proteins (FPs) facilitate unambiguous identification or sorting of gene-modified cells by fluorescence-activated cell sorting (FACS). Exploiting this feature, we have recently developed lentiviral gene ontology (LeGO) vectors (www.LentiGO-Vectors.de) for multi-gene analysis in different target cells. In this study, we extend the LeGO principle by introducing 10 different drug-selectable FPs created by fusing one of the five selection marker (protecting against blasticidin, hygromycin, neomycin, puromycin and zeocin) and one of the five FP genes (Cerulean, eGFP, Venus, dTomato and mCherry). All tested fusion proteins allowed both fluorescence-mediated detection and drug-mediated selection of LeGO-transduced cells. Newly generated codon-optimized hygromycin- and neomycin-resistance genes showed improved expression as compared with their ancestors. New LeGO constructs were produced at titers >10(6) per ml (for non-concentrated supernatants). We show efficient combinatorial marking and selection of various cells, including mesenchymal stem cells, simultaneously transduced with different LeGO constructs. Inclusion of the cytomegalovirus early enhancer/chicken beta-actin promoter into LeGO vectors facilitated robust transgene expression in and selection of neural stem cells and their differentiated progeny. We suppose that the new drug-selectable markers combining advantages of FACS and drug selection are well suited for numerous applications and vector systems. Their inclusion into LeGO vectors opens new possibilities for (stem) cell tracking and functional multi-gene analysis.
Qiu, Huazhang; Wu, Namei; Zheng, Yanjie; Chen, Min; Weng, Shaohuang; Chen, Yuanzhong; Lin, Xinhua
2015-01-01
A robust and versatile signal-on fluorescence sensing strategy was developed to provide label-free detection of various target analytes. The strategy used SYBR Green I dye and graphene oxide as signal reporter and signal-to-background ratio enhancer, respectively. Multidrug resistance protein 1 (MDR1) gene and mercury ion (Hg2+) were selected as target analytes to investigate the generality of the method. The linear relationship and specificity of the detections showed that the sensitive and selective analyses of target analytes could be achieved by the proposed strategy with low detection limits of 0.5 and 2.2 nM for MDR1 gene and Hg2+, respectively. Moreover, the strategy was used to detect real samples. Analytical results of MDR1 gene in the serum indicated that the developed method is a promising alternative approach for real applications in complex systems. Furthermore, the recovery of the proposed method for Hg2+ detection was acceptable. Thus, the developed label-free signal-on fluorescence sensing strategy exhibited excellent universality, sensitivity, and handling convenience. PMID:25565810
Su, Zhao-Zhong; Sarkar, Devanand; Emdad, Luni; Duigou, Gregory J; Young, Charles S H; Ware, Joy; Randolph, Aaron; Valerie, Kristoffer; Fisher, Paul B
2005-01-25
One impediment to effective cancer-specific gene therapy is the rarity of regulatory sequences targeting gene expression selectively in tumor cells. Although many tissue-specific promoters are recognized, few cancer-selective gene promoters are available. Progression-elevated gene-3 (PEG-3) is a rodent gene identified by subtraction hybridization that displays elevated expression as a function of transformation by diversely acting oncogenes, DNA damage, and cancer cell progression. The promoter of PEG-3, PEG-Prom, displays robust expression in a broad spectrum of human cancer cell lines with marginal expression in normal cellular counterparts. Whereas GFP expression, when under the control of a CMV promoter, is detected in both normal and cancer cells, when GFP is expressed under the control of the PEG-Prom, cancer-selective expression is evident. Mutational analysis identifies the AP-1 and PEA-3 transcription factors as primary mediators of selective, cancer-specific expression of the PEG-Prom. Synthesis of apoptosis-inducing genes, under the control of the CMV promoter, inhibits the growth of both normal and cancer cells, whereas PEG-Prom-mediated expression of these genes kills only cancer cells and spares normal cells. The efficacy of the PEG-Prom as part of a cancer gene therapeutic regimen is further documented by in vivo experiments in which PEG-Prom-controlled expression of an apoptosis-inducing gene completely inhibited prostate cancer xenograft growth in nude mice. These compelling observations indicate that the PEG-Prom, with its cancer-specific expression, provides a means of selectively delivering genes to cancer cells, thereby providing a crucial component in developing effective cancer gene therapies.
Population Level Purifying Selection and Gene Expression Shape Subgenome Evolution in Maize.
Pophaly, Saurabh D; Tellier, Aurélien
2015-12-01
The maize ancestor experienced a recent whole-genome duplication (WGD) followed by gene erosion which generated two subgenomes, the dominant subgenome (maize1) experiencing fewer deletions than maize2. We take advantage of available extensive polymorphism and gene expression data in maize to study purifying selection and gene expression divergence between WGD retained paralog pairs. We first report a strong correlation in nucleotide diversity between duplicate pairs, except for upstream regions. We then show that maize1 genes are under stronger purifying selection than maize2. WGD retained genes have higher gene dosage and biased Gene Ontologies consistent with previous studies. The relative gene expression of paralogs across tissues demonstrates that 98% of duplicate pairs have either subfunctionalized in a tissuewise manner or have diverged consistently in their expression thereby preventing functional complementation. Tissuewise subfunctionalization seems to be a hallmark of transcription factors, whereas consistent repression occurs for macromolecular complexes. We show that dominant gene expression is a strong determinant of the strength of purifying selection, explaining the inferred stronger negative selection on maize1 genes. We propose a novel expression-based classification of duplicates which is more robust to explain observed polymorphism patterns than the subgenome location. Finally, upstream regions of repressed genes exhibit an enrichment in transposable elements which indicates a possible mechanism for expression divergence. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Killian, Tobias; Dickopf, Steffen; Haas, Alexander K; Kirstenpfad, Claudia; Mayer, Klaus; Brinkmann, Ulrich
2017-11-13
We have devised an effective and robust method for the characterization of gene-editing events. The efficacy of editing-mediated mono- and bi-allelic gene inactivation and integration events is quantified based on colony counts. The combination of diphtheria toxin (DT) and puromycin (PM) selection enables analyses of 10,000-100,000 individual cells, assessing hundreds of clones with inactivated genes per experiment. Mono- and bi-allelic gene inactivation is differentiated by DT resistance, which occurs only upon bi-allelic inactivation. PM resistance indicates integration. The robustness and generalizability of the method were demonstrated by quantifying the frequency of gene inactivation and cassette integration under different editing approaches: CRISPR/Cas9-mediated complete inactivation was ~30-50-fold more frequent than cassette integration. Mono-allelic inactivation without integration occurred >100-fold more frequently than integration. Assessment of gRNA length confirmed 20mers to be most effective length for inactivation, while 16-18mers provided the highest overall integration efficacy. The overall efficacy was ~2-fold higher for CRISPR/Cas9 than for zinc-finger nuclease and was significantly increased upon modulation of non-homologous end joining or homology-directed repair. The frequencies and ratios of editing events were similar for two different DPH genes (independent of the target sequence or chromosomal location), which indicates that the optimization parameters identified with this method can be generalized.
Popova, Blagovesta; Schubert, Steffen; Bulla, Ingo; Buchwald, Daniela; Kramer, Wilfried
2015-01-01
A major challenge in gene library generation is to guarantee a large functional size and diversity that significantly increases the chances of selecting different functional protein variants. The use of trinucleotides mixtures for controlled randomization results in superior library diversity and offers the ability to specify the type and distribution of the amino acids at each position. Here we describe the generation of a high diversity gene library using tHisF of the hyperthermophile Thermotoga maritima as a scaffold. Combining various rational criteria with contingency, we targeted 26 selected codons of the thisF gene sequence for randomization at a controlled level. We have developed a novel method of creating full-length gene libraries by combinatorial assembly of smaller sub-libraries. Full-length libraries of high diversity can easily be assembled on demand from smaller and much less diverse sub-libraries, which circumvent the notoriously troublesome long-term archivation and repeated proliferation of high diversity ensembles of phages or plasmids. We developed a generally applicable software tool for sequence analysis of mutated gene sequences that provides efficient assistance for analysis of library diversity. Finally, practical utility of the library was demonstrated in principle by assessment of the conformational stability of library members and isolating protein variants with HisF activity from it. Our approach integrates a number of features of nucleic acids synthetic chemistry, biochemistry and molecular genetics to a coherent, flexible and robust method of combinatorial gene synthesis. PMID:26355961
Popova, Blagovesta; Schubert, Steffen; Bulla, Ingo; Buchwald, Daniela; Kramer, Wilfried
2015-01-01
A major challenge in gene library generation is to guarantee a large functional size and diversity that significantly increases the chances of selecting different functional protein variants. The use of trinucleotides mixtures for controlled randomization results in superior library diversity and offers the ability to specify the type and distribution of the amino acids at each position. Here we describe the generation of a high diversity gene library using tHisF of the hyperthermophile Thermotoga maritima as a scaffold. Combining various rational criteria with contingency, we targeted 26 selected codons of the thisF gene sequence for randomization at a controlled level. We have developed a novel method of creating full-length gene libraries by combinatorial assembly of smaller sub-libraries. Full-length libraries of high diversity can easily be assembled on demand from smaller and much less diverse sub-libraries, which circumvent the notoriously troublesome long-term archivation and repeated proliferation of high diversity ensembles of phages or plasmids. We developed a generally applicable software tool for sequence analysis of mutated gene sequences that provides efficient assistance for analysis of library diversity. Finally, practical utility of the library was demonstrated in principle by assessment of the conformational stability of library members and isolating protein variants with HisF activity from it. Our approach integrates a number of features of nucleic acids synthetic chemistry, biochemistry and molecular genetics to a coherent, flexible and robust method of combinatorial gene synthesis.
NASA Astrophysics Data System (ADS)
Noirel, Josselin; Simonson, Thomas
2008-11-01
Following Kimura's neutral theory of molecular evolution [M. Kimura, The Neutral Theory of Molecular Evolution (Cambridge University Press, Cambridge, 1983) (reprinted in 1986)], it has become common to assume that the vast majority of viable mutations of a gene confer little or no functional advantage. Yet, in silico models of protein evolution have shown that mutational robustness of sequences could be selected for, even in the context of neutral evolution. The evolution of a biological population can be seen as a diffusion on the network of viable sequences. This network is called a "neutral network." Depending on the mutation rate μ and the population size N, the biological population can evolve purely randomly (μN ≪1) or it can evolve in such a way as to select for sequences of higher mutational robustness (μN ≫1). The stringency of the selection depends not only on the product μN but also on the exact topology of the neutral network, the special arrangement of which was named "superfunnel." Even though the relation between mutation rate, population size, and selection was thoroughly investigated, a study of the salient topological features of the superfunnel that could affect the strength of the selection was wanting. This question is addressed in this study. We use two different models of proteins: on lattice and off lattice. We compare neutral networks computed using these models to random networks. From this, we identify two important factors of the topology that determine the stringency of the selection for mutationally robust sequences. First, the presence of highly connected nodes ("hubs") in the network increases the selection for mutationally robust sequences. Second, the stringency of the selection increases when the correlation between a sequence's mutational robustness and its neighbors' increases. The latter finding relates a global characteristic of the neutral network to a local one, which is attainable through experiments or molecular modeling.
Noirel, Josselin; Simonson, Thomas
2008-11-14
Following Kimura's neutral theory of molecular evolution [M. Kimura, The Neutral Theory of Molecular Evolution (Cambridge University Press, Cambridge, 1983) (reprinted in 1986)], it has become common to assume that the vast majority of viable mutations of a gene confer little or no functional advantage. Yet, in silico models of protein evolution have shown that mutational robustness of sequences could be selected for, even in the context of neutral evolution. The evolution of a biological population can be seen as a diffusion on the network of viable sequences. This network is called a "neutral network." Depending on the mutation rate mu and the population size N, the biological population can evolve purely randomly (muN<1) or it can evolve in such a way as to select for sequences of higher mutational robustness (muN>1). The stringency of the selection depends not only on the product muN but also on the exact topology of the neutral network, the special arrangement of which was named "superfunnel." Even though the relation between mutation rate, population size, and selection was thoroughly investigated, a study of the salient topological features of the superfunnel that could affect the strength of the selection was wanting. This question is addressed in this study. We use two different models of proteins: on lattice and off lattice. We compare neutral networks computed using these models to random networks. From this, we identify two important factors of the topology that determine the stringency of the selection for mutationally robust sequences. First, the presence of highly connected nodes ("hubs") in the network increases the selection for mutationally robust sequences. Second, the stringency of the selection increases when the correlation between a sequence's mutational robustness and its neighbors' increases. The latter finding relates a global characteristic of the neutral network to a local one, which is attainable through experiments or molecular modeling.
Robust Gaussian Graphical Modeling via l1 Penalization
Sun, Hokeun; Li, Hongzhe
2012-01-01
Summary Gaussian graphical models have been widely used as an effective method for studying the conditional independency structure among genes and for constructing genetic networks. However, gene expression data typically have heavier tails or more outlying observations than the standard Gaussian distribution. Such outliers in gene expression data can lead to wrong inference on the dependency structure among the genes. We propose a l1 penalized estimation procedure for the sparse Gaussian graphical models that is robustified against possible outliers. The likelihood function is weighted according to how the observation is deviated, where the deviation of the observation is measured based on its own likelihood. An efficient computational algorithm based on the coordinate gradient descent method is developed to obtain the minimizer of the negative penalized robustified-likelihood, where nonzero elements of the concentration matrix represents the graphical links among the genes. After the graphical structure is obtained, we re-estimate the positive definite concentration matrix using an iterative proportional fitting algorithm. Through simulations, we demonstrate that the proposed robust method performs much better than the graphical Lasso for the Gaussian graphical models in terms of both graph structure selection and estimation when outliers are present. We apply the robust estimation procedure to an analysis of yeast gene expression data and show that the resulting graph has better biological interpretation than that obtained from the graphical Lasso. PMID:23020775
NASA Astrophysics Data System (ADS)
Szejka, Agnes; Drossel, Barbara
2010-02-01
We study the evolution of Boolean networks as model systems for gene regulation. Inspired by biological networks, we select simultaneously for robust attractors and for the ability to respond to external inputs by changing the attractor. Mutations change the connections between the nodes and the update functions. In order to investigate the influence of the type of update functions, we perform our simulations with canalizing as well as with threshold functions. We compare the properties of the fitness landscapes that result for different versions of the selection criterion and the update functions. We find that for all studied cases the fitness landscape has a plateau with maximum fitness resulting in the fact that structurally very different networks are able to fulfill the same task and are connected by neutral paths in network (“genotype”) space. We find furthermore a connection between the attractor length and the mutational robustness, and an extremely long memory of the initial evolutionary stage.
Surprisingly Low Limits of Selection in Plant Domestication
Allaby, Robin G.; Kitchen, James L.; Fuller, Dorian Q.
2015-01-01
Current debate concerns the pace at which domesticated plants emerged from cultivated wild populations and how many genes were involved. Using an individual-based model, based on the assumptions of Haldane and Maynard Smith, respectively, we estimate that a surprisingly low number of 50–100 loci are the most that could be under selection in a cultivation regime at the selection strengths observed in the archaeological record. This finding is robust to attempts to rescue populations from extinction through selection from high standing genetic variation, gene flow, and the Maynard Smith-based model of threshold selection. Selective sweeps come at a cost, reducing the capacity of plants to adapt to new environments, which may contribute to the explanation of why selective sweeps have not been detected more frequently and why expansion of the agrarian package during the Neolithic was so frequently associated with collapse. PMID:27081302
Nwakanma, Davis C.; Duffy, Craig W.; Amambua-Ngwa, Alfred; Oriero, Eniyou C.; Bojang, Kalifa A.; Pinder, Margaret; Drakeley, Chris J.; Sutherland, Colin J.; Milligan, Paul J.; MacInnis, Bronwyn; Kwiatkowski, Dominic P.; Clark, Taane G.; Greenwood, Brian M.; Conway, David J.
2014-01-01
Background. Analysis of genome-wide polymorphism in many organisms has potential to identify genes under recent selection. However, data on historical allele frequency changes are rarely available for direct confirmation. Methods. We genotyped single nucleotide polymorphisms (SNPs) in 4 Plasmodium falciparum drug resistance genes in 668 archived parasite-positive blood samples of a Gambian population between 1984 and 2008. This covered a period before antimalarial resistance was detected locally, through subsequent failure of multiple drugs until introduction of artemisinin combination therapy. We separately performed genome-wide sequence analysis of 52 clinical isolates from 2008 to prospect for loci under recent directional selection. Results. Resistance alleles increased from very low frequencies, peaking in 2000 for chloroquine resistance-associated crt and mdr1 genes and at the end of the survey period for dhfr and dhps genes respectively associated with pyrimethamine and sulfadoxine resistance. Temporal changes fit a model incorporating likely selection coefficients over the period. Three of the drug resistance loci were in the top 4 regions under strong selection implicated by the genome-wide analysis. Conclusions. Genome-wide polymorphism analysis of an endemic population sample robustly identifies loci with detailed documentation of recent selection, demonstrating power to prospectively detect emerging drug resistance genes. PMID:24265439
Rough set soft computing cancer classification and network: one stone, two birds.
Zhang, Yue
2010-07-15
Gene expression profiling provides tremendous information to help unravel the complexity of cancer. The selection of the most informative genes from huge noise for cancer classification has taken centre stage, along with predicting the function of such identified genes and the construction of direct gene regulatory networks at different system levels with a tuneable parameter. A new study by Wang and Gotoh described a novel Variable Precision Rough Sets-rooted robust soft computing method to successfully address these problems and has yielded some new insights. The significance of this progress and its perspectives will be discussed in this article.
Roth, Justin C.; Ismail, Mourad; Reese, Jane S.; Lingas, Karen T.; Ferrari, Giuliana; Gerson, Stanton L.
2012-01-01
The P140K point mutant of MGMT allows robust hematopoietic stem cell (HSC) enrichment in vivo. Thus, dual-gene vectors that couple MGMT and therapeutic gene expression have allowed enrichment of gene-corrected HSCs in animal models. However, expression levels from dual-gene vectors are often reduced for one or both genes. Further, it may be desirable to express selection and therapeutic genes at distinct stages of cell differentiation. In this regard, we evaluated whether hematopoietic cells could be efficiently cotransduced using low MOIs of two separate single-gene lentiviruses, including MGMT for dual-positive cell enrichment. Cotransduction efficiencies were evaluated using a range of MGMT : GFP virus ratios, MOIs, and selection stringencies in vitro. Cotransduction was optimal when equal proportions of each virus were used, but low MGMT : GFP virus ratios resulted in the highest proportion of dual-positive cells after selection. This strategy was then evaluated in murine models for in vivo selection of HSCs cotransduced with a ubiquitous MGMT expression vector and an erythroid-specific GFP vector. Although the MGMT and GFP expression percentages were variable among engrafted recipients, drug selection enriched MGMT-positive leukocyte and GFP-positive erythroid cell populations. These data demonstrate cotransduction as a mean to rapidly enrich and evaluate therapeutic lentivectors in vivo. PMID:22888445
Studying the genetic basis of speciation in high gene flow marine invertebrates
2016-01-01
A growing number of genes responsible for reproductive incompatibilities between species (barrier loci) exhibit the signals of positive selection. However, the possibility that genes experiencing positive selection diverge early in speciation and commonly cause reproductive incompatibilities has not been systematically investigated on a genome-wide scale. Here, I outline a research program for studying the genetic basis of speciation in broadcast spawning marine invertebrates that uses a priori genome-wide information on a large, unbiased sample of genes tested for positive selection. A targeted sequence capture approach is proposed that scores single-nucleotide polymorphisms (SNPs) in widely separated species populations at an early stage of allopatric divergence. The targeted capture of both coding and non-coding sequences enables SNPs to be characterized at known locations across the genome and at genes with known selective or neutral histories. The neutral coding and non-coding SNPs provide robust background distributions for identifying FST-outliers within genes that can, in principle, identify specific mutations experiencing diversifying selection. If natural hybridization occurs between species, the neutral coding and non-coding SNPs can provide a neutral admixture model for genomic clines analyses aimed at finding genes exhibiting strong blocks to introgression. Strongylocentrotid sea urchins are used as a model system to outline the approach but it can be used for any group that has a complete reference genome available. PMID:29491951
Gene set analysis approaches for RNA-seq data: performance evaluation and application guideline
Rahmatallah, Yasir; Emmert-Streib, Frank
2016-01-01
Transcriptome sequencing (RNA-seq) is gradually replacing microarrays for high-throughput studies of gene expression. The main challenge of analyzing microarray data is not in finding differentially expressed genes, but in gaining insights into the biological processes underlying phenotypic differences. To interpret experimental results from microarrays, gene set analysis (GSA) has become the method of choice, in particular because it incorporates pre-existing biological knowledge (in a form of functionally related gene sets) into the analysis. Here we provide a brief review of several statistically different GSA approaches (competitive and self-contained) that can be adapted from microarrays practice as well as those specifically designed for RNA-seq. We evaluate their performance (in terms of Type I error rate, power, robustness to the sample size and heterogeneity, as well as the sensitivity to different types of selection biases) on simulated and real RNA-seq data. Not surprisingly, the performance of various GSA approaches depends only on the statistical hypothesis they test and does not depend on whether the test was developed for microarrays or RNA-seq data. Interestingly, we found that competitive methods have lower power as well as robustness to the samples heterogeneity than self-contained methods, leading to poor results reproducibility. We also found that the power of unsupervised competitive methods depends on the balance between up- and down-regulated genes in tested gene sets. These properties of competitive methods have been overlooked before. Our evaluation provides a concise guideline for selecting GSA approaches, best performing under particular experimental settings in the context of RNA-seq. PMID:26342128
Optimized Sleeping Beauty transposons rapidly generate stable transgenic cell lines.
Kowarz, Eric; Löscher, Denise; Marschalek, Rolf
2015-04-01
Stable gene expression in mammalian cells is a prerequisite for many in vitro and in vivo experiments. However, either the integration of plasmids into mammalian genomes or the use of retro-/lentiviral systems have intrinsic limitations. The use of transposable elements, e.g. the Sleeping Beauty system (SB), circumvents most of these drawbacks (integration sites, size limitations) and allows the quick generation of stable cell lines. The integration process of SB is catalyzed by a transposase and the handling of this gene transfer system is easy, fast and safe. Here, we report our improvements made to the existing SB vector system and present two new vector types for robust constitutive or inducible expression of any gene of interest. Both types are available in 16 variants with different selection marker (puromycin, hygromycin, blasticidin, neomycin) and fluorescent protein expression (GFP, RFP, BFP) to fit most experimental requirements. With this system it is possible to generate cell lines from stable transfected cells quickly and reliably in a medium-throughput setting (three to five days). Cell lines robustly express any gene-of-interest, either constitutively or tightly regulated by doxycycline. This allows many laboratory experiments to speed up generation of data in a rapid and robust manner. Copyright © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Evans, Perry; Avey, Stefan; Kong, Yong; Krauthammer, Michael
2013-09-01
A common goal of tumor sequencing projects is finding genes whose mutations are selected for during tumor development. This is accomplished by choosing genes that have more non-synonymous mutations than expected from an estimated background mutation frequency. While this background frequency is unknown, it can be estimated using both the observed synonymous mutation frequency and the non-synonymous to synonymous mutation ratio. The synonymous mutation frequency can be determined across all genes or in a gene-specific manner. This choice introduces an interesting trade-off. A gene-specific frequency adjusts for an underlying mutation bias, but is difficult to estimate given missing synonymous mutation counts. Using a genome-wide synonymous frequency is more robust, but is less suited for adjusting biases. Studying four evaluation criteria for identifying genes with high non-synonymous mutation burden (reflecting preferential selection of expressed genes, genes with mutations in conserved bases, genes with many protein interactions, and genes that show loss of heterozygosity), we find that the gene-specific synonymous frequency is superior in the gene expression and protein interaction tests. In conclusion, the use of the gene-specific synonymous mutation frequency is well suited for assessing a gene's non-synonymous mutation burden.
Sehgal, Vasudha; Seviour, Elena G; Moss, Tyler J; Mills, Gordon B; Azencott, Robert; Ram, Prahlad T
2015-01-01
MicroRNAs (miRNAs) play a crucial role in the maintenance of cellular homeostasis by regulating the expression of their target genes. As such, the dysregulation of miRNA expression has been frequently linked to cancer. With rapidly accumulating molecular data linked to patient outcome, the need for identification of robust multi-omic molecular markers is critical in order to provide clinical impact. While previous bioinformatic tools have been developed to identify potential biomarkers in cancer, these methods do not allow for rapid classification of oncogenes versus tumor suppressors taking into account robust differential expression, cutoffs, p-values and non-normality of the data. Here, we propose a methodology, Robust Selection Algorithm (RSA) that addresses these important problems in big data omics analysis. The robustness of the survival analysis is ensured by identification of optimal cutoff values of omics expression, strengthened by p-value computed through intensive random resampling taking into account any non-normality in the data and integration into multi-omic functional networks. Here we have analyzed pan-cancer miRNA patient data to identify functional pathways involved in cancer progression that are associated with selected miRNA identified by RSA. Our approach demonstrates the way in which existing survival analysis techniques can be integrated with a functional network analysis framework to efficiently identify promising biomarkers and novel therapeutic candidates across diseases.
Cooper, Tara E.; Krause, David J.
2013-01-01
Sulfolobus species have become the model organisms for studying the unique biology of the crenarchaeal division of the archaeal domain. In particular, Sulfolobus islandicus provides a powerful opportunity to explore natural variation via experimental functional genomics. To support these efforts, we further expanded genetic tools for S. islandicus by developing a stringent positive selection for agmatine prototrophs in strains in which the argD gene, encoding arginine decarboxylase, has been deleted. Strains with deletions in argD were shown to be auxotrophic for agmatine even in nutrient-rich medium, but growth could be restored by either supplementation of exogenous agmatine or reintroduction of a functional copy of the argD gene from S. solfataricus P2 into the ΔargD host. Using this stringent selection, a robust targeted gene knockout system was established via an improved next generation of the MID (marker insertion and unmarked target gene deletion) method. Application of this novel system was validated by targeted knockout of the upsEF genes involved in UV-inducible cell aggregation formation. PMID:23835176
Data-adaptive test statistics for microarray data.
Mukherjee, Sach; Roberts, Stephen J; van der Laan, Mark J
2005-09-01
An important task in microarray data analysis is the selection of genes that are differentially expressed between different tissue samples, such as healthy and diseased. However, microarray data contain an enormous number of dimensions (genes) and very few samples (arrays), a mismatch which poses fundamental statistical problems for the selection process that have defied easy resolution. In this paper, we present a novel approach to the selection of differentially expressed genes in which test statistics are learned from data using a simple notion of reproducibility in selection results as the learning criterion. Reproducibility, as we define it, can be computed without any knowledge of the 'ground-truth', but takes advantage of certain properties of microarray data to provide an asymptotically valid guide to expected loss under the true data-generating distribution. We are therefore able to indirectly minimize expected loss, and obtain results substantially more robust than conventional methods. We apply our method to simulated and oligonucleotide array data. By request to the corresponding author.
Compartmentalized partnered replication for the directed evolution of genetic parts and circuits.
Abil, Zhanar; Ellefson, Jared W; Gollihar, Jimmy D; Watkins, Ella; Ellington, Andrew D
2017-12-01
Compartmentalized partnered replication (CPR) is an emulsion-based directed evolution method based on a robust and modular phenotype-genotype linkage. In contrast to other in vivo directed evolution approaches, CPR largely mitigates host fitness effects due to a relatively short expression time of the gene of interest. CPR is based on gene circuits in which the selection of a 'partner' function from a library leads to the production of a thermostable polymerase. After library preparation, bacteria produce partner proteins that can potentially lead to enhancement of transcription, translation, gene regulation, and other aspects of cellular metabolism that reinforce thermostable polymerase production. Individual cells are then trapped in water-in-oil emulsion droplets in the presence of primers and dNTPs, followed by the recovery of the partner genes via emulsion PCR. In this step, droplets with cells expressing partner proteins that promote polymerase production will produce higher copy numbers of the improved partner gene. The resulting partner genes can subsequently be recloned for the next round of selection. Here, we present a step-by-step guideline for the procedure by providing examples of (i) selection of T7 RNA polymerases that recognize orthogonal promoters and (ii) selection of tRNA for enhanced amber codon suppression. A single round of CPR should take ∼3-5 d, whereas a whole directed evolution can be performed in 3-10 rounds, depending on selection efficiency.
Rough Set Soft Computing Cancer Classification and Network: One Stone, Two Birds
Zhang, Yue
2010-01-01
Gene expression profiling provides tremendous information to help unravel the complexity of cancer. The selection of the most informative genes from huge noise for cancer classification has taken centre stage, along with predicting the function of such identified genes and the construction of direct gene regulatory networks at different system levels with a tuneable parameter. A new study by Wang and Gotoh described a novel Variable Precision Rough Sets-rooted robust soft computing method to successfully address these problems and has yielded some new insights. The significance of this progress and its perspectives will be discussed in this article. PMID:20706619
Vermeirssen, Vanessa; De Clercq, Inge; Van Parys, Thomas; Van Breusegem, Frank; Van de Peer, Yves
2014-01-01
The abiotic stress response in plants is complex and tightly controlled by gene regulation. We present an abiotic stress gene regulatory network of 200,014 interactions for 11,938 target genes by integrating four complementary reverse-engineering solutions through average rank aggregation on an Arabidopsis thaliana microarray expression compendium. This ensemble performed the most robustly in benchmarking and greatly expands upon the availability of interactions currently reported. Besides recovering 1182 known regulatory interactions, cis-regulatory motifs and coherent functionalities of target genes corresponded with the predicted transcription factors. We provide a valuable resource of 572 abiotic stress modules of coregulated genes with functional and regulatory information, from which we deduced functional relationships for 1966 uncharacterized genes and many regulators. Using gain- and loss-of-function mutants of seven transcription factors grown under control and salt stress conditions, we experimentally validated 141 out of 271 predictions (52% precision) for 102 selected genes and mapped 148 additional transcription factor-gene regulatory interactions (49% recall). We identified an intricate core oxidative stress regulatory network where NAC13, NAC053, ERF6, WRKY6, and NAC032 transcription factors interconnect and function in detoxification. Our work shows that ensemble reverse-engineering can generate robust biological hypotheses of gene regulation in a multicellular eukaryote that can be tested by medium-throughput experimental validation. PMID:25549671
Vermeirssen, Vanessa; De Clercq, Inge; Van Parys, Thomas; Van Breusegem, Frank; Van de Peer, Yves
2014-12-01
The abiotic stress response in plants is complex and tightly controlled by gene regulation. We present an abiotic stress gene regulatory network of 200,014 interactions for 11,938 target genes by integrating four complementary reverse-engineering solutions through average rank aggregation on an Arabidopsis thaliana microarray expression compendium. This ensemble performed the most robustly in benchmarking and greatly expands upon the availability of interactions currently reported. Besides recovering 1182 known regulatory interactions, cis-regulatory motifs and coherent functionalities of target genes corresponded with the predicted transcription factors. We provide a valuable resource of 572 abiotic stress modules of coregulated genes with functional and regulatory information, from which we deduced functional relationships for 1966 uncharacterized genes and many regulators. Using gain- and loss-of-function mutants of seven transcription factors grown under control and salt stress conditions, we experimentally validated 141 out of 271 predictions (52% precision) for 102 selected genes and mapped 148 additional transcription factor-gene regulatory interactions (49% recall). We identified an intricate core oxidative stress regulatory network where NAC13, NAC053, ERF6, WRKY6, and NAC032 transcription factors interconnect and function in detoxification. Our work shows that ensemble reverse-engineering can generate robust biological hypotheses of gene regulation in a multicellular eukaryote that can be tested by medium-throughput experimental validation. © 2014 American Society of Plant Biologists. All rights reserved.
Arnholdt-Schmitt, Birgit
2017-01-01
Respiration traits allow calculating temperature-dependent carbon use efficiency and prediction of growth rates. This protocol aims (1) to enable validation of respiration traits as non-DNA biomarkers for breeding on robust plants in support of sustainable and healthy plant production; (2) to provide an efficient, novel way to identify and predict functionality of DNA-based markers (genes, polymorphisms, edited genes, transgenes, genomes, and hologenomes), and (3) to directly help farmers select robust material appropriate for a specified region. The protocol is based on applying isothermal calorespirometry and consists of four steps: plant tissue preparation, calorespirometry measurements, data processing, and final validation through massive field-based data.The methodology can serve selection and improvement for a wide range of crops. Several of them are currently being tested in the author's lab. Among them are important cereals, such as wheat, barley, and rye, and diverse vegetables. However, it is critical that the protocol for measuring respiration traits be well adjusted to the plant species by considering deep knowledge on the specific physiology and functional cell biology behind the final target trait for production. Here, Daucus carota L. is chosen as an advanced example to demonstrate critical species-specific steps for protocol development. Carrot is an important global vegetable that is grown worldwide and in all climate regions (moderate, subtropical, and tropical). Recently, this species is also used in my lab as a model for studies on alternative oxidase (AOX) gene diversity and evolutionary dynamics in interaction with endophytes.
High throughput selection of antibiotic-resistant transgenic Arabidopsis plants.
Nagashima, Yukihiro; Koiwa, Hisashi
2017-05-15
Kanamycin resistance is the most frequently used antibiotic-resistance marker for Arabidopsis transformations, however, this method frequently causes escape of untransformed plants, particularly at the high seedling density during the selection. Here we developed a robust high-density selection method using top agar for Arabidopsis thaliana. Top agar effectively suppressed growth of untransformed wild-type plants on selection media at high density. Survival of the transformed plants during the selection were confirmed by production of green true leaves and expression of a firefly luciferase reporter gene. Top agar method allowed selection using a large amount of seeds in Arabidopsis transformation. Copyright © 2017 Elsevier Inc. All rights reserved.
Reddy, Anupama; Growney, Joseph D; Wilson, Nick S; Emery, Caroline M; Johnson, Jennifer A; Ward, Rebecca; Monaco, Kelli A; Korn, Joshua; Monahan, John E; Stump, Mark D; Mapa, Felipa A; Wilson, Christopher J; Steiger, Janine; Ledell, Jebediah; Rickles, Richard J; Myer, Vic E; Ettenberg, Seth A; Schlegel, Robert; Sellers, William R; Huet, Heather A; Lehár, Joseph
2015-01-01
Death Receptor 5 (DR5) agonists demonstrate anti-tumor activity in preclinical models but have yet to demonstrate robust clinical responses. A key limitation may be the lack of patient selection strategies to identify those most likely to respond to treatment. To overcome this limitation, we screened a DR5 agonist Nanobody across >600 cell lines representing 21 tumor lineages and assessed molecular features associated with response. High expression of DR5 and Casp8 were significantly associated with sensitivity, but their expression thresholds were difficult to translate due to low dynamic ranges. To address the translational challenge of establishing thresholds of gene expression, we developed a classifier based on ratios of genes that predicted response across lineages. The ratio classifier outperformed the DR5+Casp8 classifier, as well as standard approaches for feature selection and classification using genes, instead of ratios. This classifier was independently validated using 11 primary patient-derived pancreatic xenograft models showing perfect predictions as well as a striking linearity between prediction probability and anti-tumor response. A network analysis of the genes in the ratio classifier captured important biological relationships mediating drug response, specifically identifying key positive and negative regulators of DR5 mediated apoptosis, including DR5, CASP8, BID, cFLIP, XIAP and PEA15. Importantly, the ratio classifier shows translatability across gene expression platforms (from Affymetrix microarrays to RNA-seq) and across model systems (in vitro to in vivo). Our approach of using gene expression ratios presents a robust and novel method for constructing translatable biomarkers of compound response, which can also probe the underlying biology of treatment response.
Reddy, Anupama; Growney, Joseph D.; Wilson, Nick S.; Emery, Caroline M.; Johnson, Jennifer A.; Ward, Rebecca; Monaco, Kelli A.; Korn, Joshua; Monahan, John E.; Stump, Mark D.; Mapa, Felipa A.; Wilson, Christopher J.; Steiger, Janine; Ledell, Jebediah; Rickles, Richard J.; Myer, Vic E.; Ettenberg, Seth A.; Schlegel, Robert; Sellers, William R.
2015-01-01
Death Receptor 5 (DR5) agonists demonstrate anti-tumor activity in preclinical models but have yet to demonstrate robust clinical responses. A key limitation may be the lack of patient selection strategies to identify those most likely to respond to treatment. To overcome this limitation, we screened a DR5 agonist Nanobody across >600 cell lines representing 21 tumor lineages and assessed molecular features associated with response. High expression of DR5 and Casp8 were significantly associated with sensitivity, but their expression thresholds were difficult to translate due to low dynamic ranges. To address the translational challenge of establishing thresholds of gene expression, we developed a classifier based on ratios of genes that predicted response across lineages. The ratio classifier outperformed the DR5+Casp8 classifier, as well as standard approaches for feature selection and classification using genes, instead of ratios. This classifier was independently validated using 11 primary patient-derived pancreatic xenograft models showing perfect predictions as well as a striking linearity between prediction probability and anti-tumor response. A network analysis of the genes in the ratio classifier captured important biological relationships mediating drug response, specifically identifying key positive and negative regulators of DR5 mediated apoptosis, including DR5, CASP8, BID, cFLIP, XIAP and PEA15. Importantly, the ratio classifier shows translatability across gene expression platforms (from Affymetrix microarrays to RNA-seq) and across model systems (in vitro to in vivo). Our approach of using gene expression ratios presents a robust and novel method for constructing translatable biomarkers of compound response, which can also probe the underlying biology of treatment response. PMID:26378449
Takahashi, Hiro; Kobayashi, Takeshi; Honda, Hiroyuki
2005-01-15
For establishing prognostic predictors of various diseases using DNA microarray analysis technology, it is desired to find selectively significant genes for constructing the prognostic model and it is also necessary to eliminate non-specific genes or genes with error before constructing the model. We applied projective adaptive resonance theory (PART) to gene screening for DNA microarray data. Genes selected by PART were subjected to our FNN-SWEEP modeling method for the construction of a cancer class prediction model. The model performance was evaluated through comparison with a conventional screening signal-to-noise (S2N) method or nearest shrunken centroids (NSC) method. The FNN-SWEEP predictor with PART screening could discriminate classes of acute leukemia in blinded data with 97.1% accuracy and classes of lung cancer with 90.0% accuracy, while the predictor with S2N was only 85.3 and 70.0% or the predictor with NSC was 88.2 and 90.0%, respectively. The results have proven that PART was superior for gene screening. The software is available upon request from the authors. honda@nubio.nagoya-u.ac.jp
Breeding and Genetics Symposium: networks and pathways to guide genomic selection.
Snelling, W M; Cushman, R A; Keele, J W; Maltecca, C; Thomas, M G; Fortes, M R S; Reverter, A
2013-02-01
Many traits affecting profitability and sustainability of meat, milk, and fiber production are polygenic, with no single gene having an overwhelming influence on observed variation. No knowledge of the specific genes controlling these traits has been needed to make substantial improvement through selection. Significant gains have been made through phenotypic selection enhanced by pedigree relationships and continually improving statistical methodology. Genomic selection, recently enabled by assays for dense SNP located throughout the genome, promises to increase selection accuracy and accelerate genetic improvement by emphasizing the SNP most strongly correlated to phenotype although the genes and sequence variants affecting phenotype remain largely unknown. These genomic predictions theoretically rely on linkage disequilibrium (LD) between genotyped SNP and unknown functional variants, but familial linkage may increase effectiveness when predicting individuals related to those in the training data. Genomic selection with functional SNP genotypes should be less reliant on LD patterns shared by training and target populations, possibly allowing robust prediction across unrelated populations. Although the specific variants causing polygenic variation may never be known with certainty, a number of tools and resources can be used to identify those most likely to affect phenotype. Associations of dense SNP genotypes with phenotype provide a 1-dimensional approach for identifying genes affecting specific traits; in contrast, associations with multiple traits allow defining networks of genes interacting to affect correlated traits. Such networks are especially compelling when corroborated by existing functional annotation and established molecular pathways. The SNP occurring within network genes, obtained from public databases or derived from genome and transcriptome sequences, may be classified according to expected effects on gene products. As illustrated by functionally informed genomic predictions being more accurate than naive whole-genome predictions of beef tenderness, coupling evidence from livestock genotypes, phenotypes, gene expression, and genomic variants with existing knowledge of gene functions and interactions may provide greater insight into the genes and genomic mechanisms affecting polygenic traits and facilitate functional genomic selection for economically important traits.
Hartmann, Fanny E.; Croll, Daniel
2017-01-01
Abstract Differences in gene content are a significant source of variability within species and have an impact on phenotypic traits. However, little is known about the mechanisms responsible for the most recent gene gains and losses. We screened the genomes of 123 worldwide isolates of the major pathogen of wheat Zymoseptoria tritici for robust evidence of gene copy number variation. Based on orthology relationships in three closely related fungi, we identified 599 gene gains and 1,024 gene losses that have not yet reached fixation within the focal species. Our analyses of gene gains and losses segregating in populations showed that gene copy number variation arose preferentially in subtelomeres and in proximity to transposable elements. Recently lost genes were enriched in virulence factors and secondary metabolite gene clusters. In contrast, recently gained genes encoded mostly secreted protein lacking a conserved domain. We analyzed the frequency spectrum at loci segregating a gene presence–absence polymorphism in four worldwide populations. Recent gene losses showed a significant excess in low-frequency variants compared with genome-wide single nucleotide polymorphism, which is indicative of strong negative selection against gene losses. Recent gene gains were either under weak negative selection or neutral. We found evidence for strong divergent selection among populations at individual loci segregating a gene presence–absence polymorphism. Hence, gene gains and losses likely contributed to local adaptation. Our study shows that microbial eukaryotes harbor extensive copy number variation within populations and that functional differences among recently gained and lost genes led to distinct evolutionary trajectories. PMID:28981698
Niklaus, Michael; Gruissem, Wilhelm; Vanderschuren, Hervé
2011-01-01
Cassava is one of the most important crops in the tropics. Its industrial use for starch and biofuel production is also increasing its importance for agricultural production in tropical countries. In the last decade cassava biotechnology has emerged as a valuable alternative to the breeding constraints of this highly heterozygous crop for improved trait development of cassava germplasm. Cassava transformation remains difficult and time-consuming because of limitations in selecting transgenic tissues and regeneration of transgenic plantlets. We have recently reported an efficient and robust cassava transformation protocol using the hygromycin phosphotransferase II (hptII) gene as selection marker and the aminoglycoside hygromycin at optimal concentrations to maximize the regeneration of transgenic plantlets. In the present work, we expanded the transformation protocol to the use of the neomycin phosphotransferase II (nptII) gene as selection marker. Several aminoglycosides compatible with the use of nptII were tested and optimal concentrations for cassava transformation were determined. Given its efficiency equivalent to hptII as selection marker with the described protocol, the use of nptII opens new possibilities to engineer transgenic cassava lines with multiple T-DNA insertions and to produce transgenic cassava with a resistance marker gene that is already deregulated in several commercial transgenic crops.
YamiPred: A Novel Evolutionary Method for Predicting Pre-miRNAs and Selecting Relevant Features.
Kleftogiannis, Dimitrios; Theofilatos, Konstantinos; Likothanassis, Spiros; Mavroudi, Seferina
2015-01-01
MicroRNAs (miRNAs) are small non-coding RNAs, which play a significant role in gene regulation. Predicting miRNA genes is a challenging bioinformatics problem and existing experimental and computational methods fail to deal with it effectively. We developed YamiPred, an embedded classification method that combines the efficiency and robustness of support vector machines (SVM) with genetic algorithms (GA) for feature selection and parameters optimization. YamiPred was tested in a new and realistic human dataset and was compared with state-of-the-art computational intelligence approaches and the prevalent SVM-based tools for miRNA prediction. Experimental results indicate that YamiPred outperforms existing approaches in terms of accuracy and of geometric mean of sensitivity and specificity. The embedded feature selection component selects a compact feature subset that contributes to the performance optimization. Further experimentation with this minimal feature subset has achieved very high classification performance and revealed the minimum number of samples required for developing a robust predictor. YamiPred also confirmed the important role of commonly used features such as entropy and enthalpy, and uncovered the significance of newly introduced features, such as %A-U aggregate nucleotide frequency and positional entropy. The best model trained on human data has successfully predicted pre-miRNAs to other organisms including the category of viruses.
Horvath, Robert
2017-01-01
Abstract To avoid negative effects of transposable element (TE) proliferation, plants epigenetically silence TEs using a number of mechanisms, including RNA-directed DNA methylation. These epigenetic modifications can extend outside the boundaries of TE insertions and lead to silencing of nearby genes, resulting in a trade-off between TE silencing and interference with nearby gene regulation. Therefore, purifying selection is expected to remove silenced TE insertions near genes more efficiently and prevent their accumulation within a population. To explore how effects of TE silencing on gene regulation shapes purifying selection on TEs, we analyzed whole genome sequencing data from 166 individuals of a large population of the outcrossing species Capsella grandiflora. We found that most TEs are rare, and in chromosome arms, silenced TEs are exposed to stronger purifying selection than those that are not silenced by 24-nucleotide small RNAs, especially with increasing proximity to genes. An age-of-allele test of neutrality on a subset of TEs supports our inference of purifying selection on silenced TEs, suggesting that our results are robust to varying transposition rates. Our results provide new insights into the processes affecting the accumulation of TEs in an outcrossing species and support the view that epigenetic silencing of TEs results in a trade-off between preventing TE proliferation and interference with nearby gene regulation. We also suggest that in the centromeric and pericentromeric regions, the negative aspects of epigenetic TE silencing are missing. PMID:29036316
Selecting informative subsets of sparse supermatrices increases the chance to find correct trees.
Misof, Bernhard; Meyer, Benjamin; von Reumont, Björn Marcus; Kück, Patrick; Misof, Katharina; Meusemann, Karen
2013-12-03
Character matrices with extensive missing data are frequently used in phylogenomics with potentially detrimental effects on the accuracy and robustness of tree inference. Therefore, many investigators select taxa and genes with high data coverage. Drawbacks of these selections are their exclusive reliance on data coverage without consideration of actual signal in the data which might, thus, not deliver optimal data matrices in terms of potential phylogenetic signal. In order to circumvent this problem, we have developed a heuristics implemented in a software called mare which (1) assesses information content of genes in supermatrices using a measure of potential signal combined with data coverage and (2) reduces supermatrices with a simple hill climbing procedure to submatrices with high total information content. We conducted simulation studies using matrices of 50 taxa × 50 genes with heterogeneous phylogenetic signal among genes and data coverage between 10-30%. With matrices of 50 taxa × 50 genes with heterogeneous phylogenetic signal among genes and data coverage between 10-30% Maximum Likelihood (ML) tree reconstructions failed to recover correct trees. A selection of a data subset with the herein proposed approach increased the chance to recover correct partial trees more than 10-fold. The selection of data subsets with the herein proposed simple hill climbing procedure performed well either considering the information content or just a simple presence/absence information of genes. We also applied our approach on an empirical data set, addressing questions of vertebrate systematics. With this empirical dataset selecting a data subset with high information content and supporting a tree with high average boostrap support was most successful if information content of genes was considered. Our analyses of simulated and empirical data demonstrate that sparse supermatrices can be reduced on a formal basis outperforming the usually used simple selections of taxa and genes with high data coverage.
Mathematical Modeling of RNA-Based Architectures for Closed Loop Control of Gene Expression.
Agrawal, Deepak K; Tang, Xun; Westbrook, Alexandra; Marshall, Ryan; Maxwell, Colin S; Lucks, Julius; Noireaux, Vincent; Beisel, Chase L; Dunlop, Mary J; Franco, Elisa
2018-05-08
Feedback allows biological systems to control gene expression precisely and reliably, even in the presence of uncertainty, by sensing and processing environmental changes. Taking inspiration from natural architectures, synthetic biologists have engineered feedback loops to tune the dynamics and improve the robustness and predictability of gene expression. However, experimental implementations of biomolecular control systems are still far from satisfying performance specifications typically achieved by electrical or mechanical control systems. To address this gap, we present mathematical models of biomolecular controllers that enable reference tracking, disturbance rejection, and tuning of the temporal response of gene expression. These controllers employ RNA transcriptional regulators to achieve closed loop control where feedback is introduced via molecular sequestration. Sensitivity analysis of the models allows us to identify which parameters influence the transient and steady state response of a target gene expression process, as well as which biologically plausible parameter values enable perfect reference tracking. We quantify performance using typical control theory metrics to characterize response properties and provide clear selection guidelines for practical applications. Our results indicate that RNA regulators are well-suited for building robust and precise feedback controllers for gene expression. Additionally, our approach illustrates several quantitative methods useful for assessing the performance of biomolecular feedback control systems.
Smura, Teemu; Blomqvist, Soile; Vuorinen, Tytti; Ivanova, Olga; Samoilovich, Elena; Al-Hello, Haider; Savolainen-Kopra, Carita; Hovi, Tapani; Roivainen, Merja
2014-01-01
Genus Enterovirus (Family Picornaviridae,) consists of twelve species divided into genetically diverse types by their capsid protein VP1 coding sequences. Each enterovirus type can further be divided into intra-typic sub-clusters (genotypes). The aim of this study was to elucidate what leads to the emergence of novel enterovirus clades (types and genotypes). An evolutionary analysis was conducted for a sub-group of Enterovirus C species that contains types Coxsackievirus A21 (CVA-21), CVA-24, Enterovirus C95 (EV-C95), EV-C96 and EV-C99. VP1 gene datasets were collected and analysed to infer the phylogeny, rate of evolution, nucleotide and amino acid substitution patterns and signs of selection. In VP1 coding gene, high intra-typic sequence diversities and robust grouping into distinct genotypes within each type were detected. Within each type the majority of nucleotide substitutions were synonymous and the non-synonymous substitutions tended to cluster in distinct highly polymorphic sites. Signs of positive selection were detected in some of these highly polymorphic sites, while strong negative selection was indicated in most of the codons. Despite robust clustering to intra-typic genotypes, only few genotype-specific ‘signature’ amino acids were detected. In contrast, when different enterovirus types were compared, there was a clear tendency towards fixation of type-specific ‘signature’ amino acids. The results suggest that permanent fixation of type-specific amino acids is a hallmark associated with evolution of different enterovirus types, whereas neutral evolution and/or (frequency-dependent) positive selection in few highly polymorphic amino acid sites are the dominant forms of evolution when strains within an enterovirus type are compared. PMID:24695547
Smura, Teemu; Blomqvist, Soile; Vuorinen, Tytti; Ivanova, Olga; Samoilovich, Elena; Al-Hello, Haider; Savolainen-Kopra, Carita; Hovi, Tapani; Roivainen, Merja
2014-01-01
Genus Enterovirus (Family Picornaviridae,) consists of twelve species divided into genetically diverse types by their capsid protein VP1 coding sequences. Each enterovirus type can further be divided into intra-typic sub-clusters (genotypes). The aim of this study was to elucidate what leads to the emergence of novel enterovirus clades (types and genotypes). An evolutionary analysis was conducted for a sub-group of Enterovirus C species that contains types Coxsackievirus A21 (CVA-21), CVA-24, Enterovirus C95 (EV-C95), EV-C96 and EV-C99. VP1 gene datasets were collected and analysed to infer the phylogeny, rate of evolution, nucleotide and amino acid substitution patterns and signs of selection. In VP1 coding gene, high intra-typic sequence diversities and robust grouping into distinct genotypes within each type were detected. Within each type the majority of nucleotide substitutions were synonymous and the non-synonymous substitutions tended to cluster in distinct highly polymorphic sites. Signs of positive selection were detected in some of these highly polymorphic sites, while strong negative selection was indicated in most of the codons. Despite robust clustering to intra-typic genotypes, only few genotype-specific 'signature' amino acids were detected. In contrast, when different enterovirus types were compared, there was a clear tendency towards fixation of type-specific 'signature' amino acids. The results suggest that permanent fixation of type-specific amino acids is a hallmark associated with evolution of different enterovirus types, whereas neutral evolution and/or (frequency-dependent) positive selection in few highly polymorphic amino acid sites are the dominant forms of evolution when strains within an enterovirus type are compared.
Verification of STS markers for leaf rust resistance genes of wheat by seven European laboratories.
Błaszczyk, Lidia; Chełkowski, Jerzy; Korzun, Victor; Kraic, Jan; Ordon, Frank; Ovesná, Jaroslava; Purnhauser, Laszlo; Tar, Melinda; Vida, Gyula
2004-01-01
A set of Thatcher near-isogenic lines and two breeding lines were used to examine sequence tagged site (STS) markers linked to leaf rust resistance genes Lr9, Lr10, Lr19, Lr24, Lr28, Lr29, Lr35, and a simple sequenced repeat (SSR) marker for Lr39. The selected STS markers for resistance genes Lr9, Lr10, Lr19, Lr24 and Lr28 were identified in seven accessions by seven European laboratories. Near-isogenic lines of the spring wheat Thatcher were used as positive controls. Markers for resistance genes Lr9, Lr10, Lr19, Lr24 were identified in all seven laboratories as amplification products of 1100 bp, 310 bp, 130 bp and 310 bp, respectively. The STS markers linked to resistance genes Lr9, Lr10, Lr19, Lr24, Lr29, Lr35 and the SSR marker for Lr39 were robust and highly specific for these genes and will be useful in marker-assisted selection in wheat. However, the amplification product of 378 bp that corresponded with resistance gene Lr28 was detected in all accessions including genotypes lacking this gene in all seven laboratories. This marker needs to be improved.
Hartmann, Fanny E; Croll, Daniel
2017-11-01
Differences in gene content are a significant source of variability within species and have an impact on phenotypic traits. However, little is known about the mechanisms responsible for the most recent gene gains and losses. We screened the genomes of 123 worldwide isolates of the major pathogen of wheat Zymoseptoria tritici for robust evidence of gene copy number variation. Based on orthology relationships in three closely related fungi, we identified 599 gene gains and 1,024 gene losses that have not yet reached fixation within the focal species. Our analyses of gene gains and losses segregating in populations showed that gene copy number variation arose preferentially in subtelomeres and in proximity to transposable elements. Recently lost genes were enriched in virulence factors and secondary metabolite gene clusters. In contrast, recently gained genes encoded mostly secreted protein lacking a conserved domain. We analyzed the frequency spectrum at loci segregating a gene presence-absence polymorphism in four worldwide populations. Recent gene losses showed a significant excess in low-frequency variants compared with genome-wide single nucleotide polymorphism, which is indicative of strong negative selection against gene losses. Recent gene gains were either under weak negative selection or neutral. We found evidence for strong divergent selection among populations at individual loci segregating a gene presence-absence polymorphism. Hence, gene gains and losses likely contributed to local adaptation. Our study shows that microbial eukaryotes harbor extensive copy number variation within populations and that functional differences among recently gained and lost genes led to distinct evolutionary trajectories. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Kaneko, Kunihiko
2011-06-01
Here I present and discuss a model that, among other things, appears able to describe the dynamics of cancer cell origin from the perspective of stable and unstable gene expression profiles. In identifying such aberrant gene expression profiles as lying outside the normal stable states attracted through development and normal cell differentiation, the hypothesis explains why cancer cells accumulate mutations, to which they are not robust, and why these mutations create a new stable state far from the normal gene expression profile space. Such cells are in strong contrast with normal cell types that appeared as an attractor state in the gene expression dynamical system under cell-cell interaction and achieved robustness to noise through evolution, which in turn also conferred robustness to mutation. In complex gene regulation networks, other aberrant cellular states lacking such high robustness are expected to remain, which would correspond to cancer cells. Copyright © 2011 WILEY Periodicals, Inc.
Mansourian, Robert; Mutch, David M; Antille, Nicolas; Aubert, Jerome; Fogel, Paul; Le Goff, Jean-Marc; Moulin, Julie; Petrov, Anton; Rytz, Andreas; Voegel, Johannes J; Roberts, Matthew-Alan
2004-11-01
Microarray technology has become a powerful research tool in many fields of study; however, the cost of microarrays often results in the use of a low number of replicates (k). Under circumstances where k is low, it becomes difficult to perform standard statistical tests to extract the most biologically significant experimental results. Other more advanced statistical tests have been developed; however, their use and interpretation often remain difficult to implement in routine biological research. The present work outlines a method that achieves sufficient statistical power for selecting differentially expressed genes under conditions of low k, while remaining as an intuitive and computationally efficient procedure. The present study describes a Global Error Assessment (GEA) methodology to select differentially expressed genes in microarray datasets, and was developed using an in vitro experiment that compared control and interferon-gamma treated skin cells. In this experiment, up to nine replicates were used to confidently estimate error, thereby enabling methods of different statistical power to be compared. Gene expression results of a similar absolute expression are binned, so as to enable a highly accurate local estimate of the mean squared error within conditions. The model then relates variability of gene expression in each bin to absolute expression levels and uses this in a test derived from the classical ANOVA. The GEA selection method is compared with both the classical and permutational ANOVA tests, and demonstrates an increased stability, robustness and confidence in gene selection. A subset of the selected genes were validated by real-time reverse transcription-polymerase chain reaction (RT-PCR). All these results suggest that GEA methodology is (i) suitable for selection of differentially expressed genes in microarray data, (ii) intuitive and computationally efficient and (iii) especially advantageous under conditions of low k. The GEA code for R software is freely available upon request to authors.
Camarasa, María Vicenta; Gálvez, Víctor Miguel
2016-02-09
Cystic fibrosis is one of the most frequent inherited rare diseases, caused by mutations in the cystic fibrosis transmembrane conductance regulator gene. Apart from symptomatic treatments, therapeutic protocols for curing the disease have not yet been established. The regeneration of genetically corrected, disease-free epithelia in cystic fibrosis patients is envisioned by designing a stem cell/genetic therapy in which patient-derived pluripotent stem cells are genetically corrected, from which target tissues are derived. In this framework, we present an efficient method for seamless correction of pF508del mutation in patient-specific induced pluripotent stem cells by gene edited homologous recombination. Gene edition has been performed by transcription activator-like effector nucleases and a homologous recombination donor vector which contains a PiggyBac transposon-based double selectable marker cassette.This new method has been designed to partially avoid xenobiotics from the culture system, improve cell culture efficiency and genome stability by using a robust culture system method, and optimize timings. Overall, once the pluripotent cells have been amplified for the first nucleofection, the procedure can be completed in 69 days, and can be easily adapted to edit and change any gene of interest.
Phylotranscriptomic consolidation of the jawed vertebrate timetree.
Irisarri, Iker; Baurain, Denis; Brinkmann, Henner; Delsuc, Frédéric; Sire, Jean-Yves; Kupfer, Alexander; Petersen, Jörn; Jarek, Michael; Meyer, Axel; Vences, Miguel; Philippe, Hervé
2017-09-01
Phylogenomics is extremely powerful but introduces new challenges as no agreement exists on "standards" for data selection, curation and tree inference. We use jawed vertebrates (Gnathostomata) as model to address these issues. Despite considerable efforts in resolving their evolutionary history and macroevolution, few studies have included a full phylogenetic diversity of gnathostomes and some relationships remain controversial. We tested a novel bioinformatic pipeline to assemble large and accurate phylogenomic datasets from RNA sequencing and find this phylotranscriptomic approach successful and highly cost-effective. Increased sequencing effort up to ca. 10Gbp allows recovering more genes, but shallower sequencing (1.5Gbp) is sufficient to obtain thousands of full-length orthologous transcripts. We reconstruct a robust and strongly supported timetree of jawed vertebrates using 7,189 nuclear genes from 100 taxa, including 23 new transcriptomes from previously unsampled key species. Gene jackknifing of genomic data corroborates the robustness of our tree and allows calculating genome-wide divergence times by overcoming gene sampling bias. Mitochondrial genomes prove insufficient to resolve the deepest relationships because of limited signal and among-lineage rate heterogeneity. Our analyses emphasize the importance of large curated nuclear datasets to increase the accuracy of phylogenomics and provide a reference framework for the evolutionary history of jawed vertebrates.
Springer, M S; Burk, A; Kavanagh, J R; Waddell, V G; Stanhope, M J
1997-12-09
The subclass Theria of Mammalia includes marsupials (infraclass Metatheria) and placentals (infraclass Eutheria). Within each group, interordinal relationships remain unclear. One limitation of many studies is incomplete ordinal representation. Here, we analyze DNA sequences for part of exon 1 of the interphotoreceptor retinoid binding protein gene, including 10 that are newly reported, for representatives of all therian orders. Among placentals, the most robust clades are Cetartiodactyla, Paenungulata, and an expanded African clade that includes paenungulates, tubulidentates, and macroscelideans. Anagalida, Archonta, Altungulata, Hyracoidea + Perissodactyla, Ungulata, and the "flying primate" hypothesis are rejected by statistical tests. Among marsupials, the most robust clade includes all orders except Didelphimorphia. The phylogenetic placement of the monito del monte and the marsupial mole remains unclear. However, the marsupial mole sequence contains three frameshift indels and numerous stop codons in all three reading frames. Given that the interphotoreceptor retinoid binding protein gene is a single-copy gene that functions in the visual cycle and that the marsupial mole is blind with degenerate eyes, this finding suggests that phenotypic degeneration of the eyes is accompanied by parallel changes at the molecular level as a result of relaxed selective constraints.
Moorthy, Sakthi D.; Davidson, Scott; Shchuka, Virlana M.; Singh, Gurdeep; Malek-Gilani, Nakisa; Langroudi, Lida; Martchenko, Alexandre; So, Vincent; Macpherson, Neil N.; Mitchell, Jennifer A.
2017-01-01
Transcriptional enhancers are critical for maintaining cell-type–specific gene expression and driving cell fate changes during development. Highly transcribed genes are often associated with a cluster of individual enhancers such as those found in locus control regions. Recently, these have been termed stretch enhancers or super-enhancers, which have been predicted to regulate critical cell identity genes. We employed a CRISPR/Cas9-mediated deletion approach to study the function of several enhancer clusters (ECs) and isolated enhancers in mouse embryonic stem (ES) cells. Our results reveal that the effect of deleting ECs, also classified as ES cell super-enhancers, is highly variable, resulting in target gene expression reductions ranging from 12% to as much as 92%. Partial deletions of these ECs which removed only one enhancer or a subcluster of enhancers revealed partially redundant control of the regulated gene by multiple enhancers within the larger cluster. Many highly transcribed genes in ES cells are not associated with a super-enhancer; furthermore, super-enhancer predictions ignore 81% of the potentially active regulatory elements predicted by cobinding of five or more pluripotency-associated transcription factors. Deletion of these additional enhancer regions revealed their robust regulatory role in gene transcription. In addition, select super-enhancers and enhancers were identified that regulated clusters of paralogous genes. We conclude that, whereas robust transcriptional output can be achieved by an isolated enhancer, clusters of enhancers acting on a common target gene act in a partially redundant manner to fine tune transcriptional output of their target genes. PMID:27895109
2013-01-01
Background High–throughput (HT) technologies provide huge amount of gene expression data that can be used to identify biomarkers useful in the clinical practice. The most frequently used approaches first select a set of genes (i.e. gene signature) able to characterize differences between two or more phenotypical conditions, and then provide a functional assessment of the selected genes with an a posteriori enrichment analysis, based on biological knowledge. However, this approach comes with some drawbacks. First, gene selection procedure often requires tunable parameters that affect the outcome, typically producing many false hits. Second, a posteriori enrichment analysis is based on mapping between biological concepts and gene expression measurements, which is hard to compute because of constant changes in biological knowledge and genome analysis. Third, such mapping is typically used in the assessment of the coverage of gene signature by biological concepts, that is either score–based or requires tunable parameters as well, limiting its power. Results We present Knowledge Driven Variable Selection (KDVS), a framework that uses a priori biological knowledge in HT data analysis. The expression data matrix is transformed, according to prior knowledge, into smaller matrices, easier to analyze and to interpret from both computational and biological viewpoints. Therefore KDVS, unlike most approaches, does not exclude a priori any function or process potentially relevant for the biological question under investigation. Differently from the standard approach where gene selection and functional assessment are applied independently, KDVS embeds these two steps into a unified statistical framework, decreasing the variability derived from the threshold–dependent selection, the mapping to the biological concepts, and the signature coverage. We present three case studies to assess the usefulness of the method. Conclusions We showed that KDVS not only enables the selection of known biological functionalities with accuracy, but also identification of new ones. An efficient implementation of KDVS was devised to obtain results in a fast and robust way. Computing time is drastically reduced by the effective use of distributed resources. Finally, integrated visualization techniques immediately increase the interpretability of results. Overall, KDVS approach can be considered as a viable alternative to enrichment–based approaches. PMID:23302187
Li, Jingtao; Sun, Xinhua; Liu, Yanzhi; Wang, Xueliang; Zhang, Hao; Pan, Hongyu
2017-01-01
Plant productivity is limited by salinity stress, both in natural and agricultural systems. Identification of salt stress-related genes from halophyte can provide insights into mechanisms of salt stress tolerance in plants. Atriplex canescens is a xero-halophyte that exhibits optimum growth in the presence of 400 mM NaCl. A cDNA library derived from highly salt-treated A. canescens plants was constructed based on a yeast expression system. A total of 53 transgenic yeast clones expressing enhanced salt tolerance were selected from 105 transformants. Their plasmids were sequenced and the gene characteristics were annotated using a BLASTX search. Retransformation of yeast cells with the selected plasmids conferred salt tolerance to the resulting transformants. The expression patterns of 28 of these stress-related genes were further investigated in A. canescens leaves by quantitative reverse transcription-PCR. In this study, we provided a rapid and robust assay system for large-scale screening of genes for varied abiotic stress tolerance with high efficiency in A. canescens. PMID:29149055
Comparisons of Robustness and Sensitivity between Cancer and Normal Cells by Microarray Data
Chu, Liang-Hui; Chen, Bor-Sen
2008-01-01
Robustness is defined as the ability to uphold performance in face of perturbations and uncertainties, and sensitivity is a measure of the system deviations generated by perturbations to the system. While cancer appears as a robust but fragile system, few computational and quantitative evidences demonstrate robustness tradeoffs in cancer. Microarrays have been widely applied to decipher gene expression signatures in human cancer research, and quantification of global gene expression profiles facilitates precise prediction and modeling of cancer in systems biology. We provide several efficient computational methods based on system and control theory to compare robustness and sensitivity between cancer and normal cells by microarray data. Measurement of robustness and sensitivity by linear stochastic model is introduced in this study, which shows oscillations in feedback loops of p53 and demonstrates robustness tradeoffs that cancer is a robust system with some extreme fragilities. In addition, we measure sensitivity of gene expression to perturbations in other gene expression and kinetic parameters, discuss nonlinear effects in feedback loops of p53 and extend our method to robustness-based cancer drug design. PMID:19259409
Safo, Sandra E; Li, Shuzhao; Long, Qi
2018-03-01
Integrative analysis of high dimensional omics data is becoming increasingly popular. At the same time, incorporating known functional relationships among variables in analysis of omics data has been shown to help elucidate underlying mechanisms for complex diseases. In this article, our goal is to assess association between transcriptomic and metabolomic data from a Predictive Health Institute (PHI) study that includes healthy adults at a high risk of developing cardiovascular diseases. Adopting a strategy that is both data-driven and knowledge-based, we develop statistical methods for sparse canonical correlation analysis (CCA) with incorporation of known biological information. Our proposed methods use prior network structural information among genes and among metabolites to guide selection of relevant genes and metabolites in sparse CCA, providing insight on the molecular underpinning of cardiovascular disease. Our simulations demonstrate that the structured sparse CCA methods outperform several existing sparse CCA methods in selecting relevant genes and metabolites when structural information is informative and are robust to mis-specified structural information. Our analysis of the PHI study reveals that a number of gene and metabolic pathways including some known to be associated with cardiovascular diseases are enriched in the set of genes and metabolites selected by our proposed approach. © 2017, The International Biometric Society.
Sato, M; Figueiredo, ML; Burton, JB; Johnson, M; Chen, M; Powell, R; Gambhir, SS; Carey, M; Wu, L
2009-01-01
Effective treatment for recurrent, disseminated prostate cancer is notably limited. We have developed adenoviral vectors with a prostate-specific two-step transcriptional amplification (TSTA) system that would express therapeutic genes at a robust level to target metastatic disease. The TSTA system employs the prostate-specific antigen (PSA) promoter/enhancer to drive a potent synthetic activator, which in turn activates the expression of the therapeutic gene. In this study, we explored different configurations of this bipartite system and discovered that physical separation of the two TSTA components into E1 and E3 regions of adenovirus was able to enhance androgen regulation and cell-discriminatory expression. The TSTA vectors that express imaging reporter genes were assessed by noninvasive imaging technologies in animal models. The improved selectivity of the E1E3 configured vector was reflected in silenced ectopic expression in the lung. Significantly, the enhanced specificity of the E1E3 vector enabled the detection of lung metastasis of prostate cancer. An E1E3 TSTA vector that expresses the herpes simplex virus thymidine kinase gene can effectively direct positron emission tomography (PET) imaging of the tumor. The prostate-targeted gene delivery vectors with robust and cell-specific expression capability will advance the development of safe and effective imaging guided therapy for recurrent metastatic stages of prostate cancer. PMID:18305574
Genes involved in virulence of the entomopathogenic fungus Beauveria bassiana.
Valero-Jiménez, Claudio A; Wiegers, Harm; Zwaan, Bas J; Koenraadt, Constantianus J M; van Kan, Jan A L
2016-01-01
Pest insects cause severe damage to global crop production and pose a threat to human health by transmitting diseases. Traditionally, chemical pesticides (insecticides) have been used to control such pests and have proven to be effective only for a limited amount of time because of the rapid spread of genetic insecticide resistance. The basis of this resistance is mostly caused by (co)dominant mutations in single genes, which explains why insecticide use alone is an unsustainable solution. Therefore, robust solutions for insect pest control need to be sought in alternative methods such as biological control agents for which single-gene resistance is less likely to evolve. The entomopathogenic fungus Beauveria bassiana has shown potential as a biological control agent of insects, and insight into the mechanisms of virulence is essential to show the robustness of its use. With the recent availability of the whole genome sequence of B. bassiana, progress in understanding the genetics that constitute virulence toward insects can be made more quickly. In this review we divide the infection process into distinct steps and provide an overview of what is currently known about genes and mechanisms influencing virulence in B. bassiana. We also discuss the need for novel strategies and experimental methods to better understand the infection mechanisms deployed by entomopathogenic fungi. Such knowledge can help improve biocontrol agents, not only by selecting the most virulent genotypes, but also by selecting the genotypes that use combinations of virulence mechanisms for which resistance in the insect host is least likely to develop. Copyright © 2015 Elsevier Inc. All rights reserved.
Nonlinear Dynamics in Gene Regulation Promote Robustness and Evolvability of Gene Expression Levels.
Steinacher, Arno; Bates, Declan G; Akman, Ozgur E; Soyer, Orkun S
2016-01-01
Cellular phenotypes underpinned by regulatory networks need to respond to evolutionary pressures to allow adaptation, but at the same time be robust to perturbations. This creates a conflict in which mutations affecting regulatory networks must both generate variance but also be tolerated at the phenotype level. Here, we perform mathematical analyses and simulations of regulatory networks to better understand the potential trade-off between robustness and evolvability. Examining the phenotypic effects of mutations, we find an inverse correlation between robustness and evolvability that breaks only with nonlinearity in the network dynamics, through the creation of regions presenting sudden changes in phenotype with small changes in genotype. For genotypes embedding low levels of nonlinearity, robustness and evolvability correlate negatively and almost perfectly. By contrast, genotypes embedding nonlinear dynamics allow expression levels to be robust to small perturbations, while generating high diversity (evolvability) under larger perturbations. Thus, nonlinearity breaks the robustness-evolvability trade-off in gene expression levels by allowing disparate responses to different mutations. Using analytical derivations of robustness and system sensitivity, we show that these findings extend to a large class of gene regulatory network architectures and also hold for experimentally observed parameter regimes. Further, the effect of nonlinearity on the robustness-evolvability trade-off is ensured as long as key parameters of the system display specific relations irrespective of their absolute values. We find that within this parameter regime genotypes display low and noisy expression levels. Examining the phenotypic effects of mutations, we find an inverse correlation between robustness and evolvability that breaks only with nonlinearity in the network dynamics. Our results provide a possible solution to the robustness-evolvability trade-off, suggest an explanation for the ubiquity of nonlinear dynamics in gene expression networks, and generate useful guidelines for the design of synthetic gene circuits.
Individualised cancer therapeutics: dream or reality? Therapeutics construction.
Shen, Yuqiao; Senzer, Neil; Nemunaitis, John
2005-11-01
The analysis of DNA microarray and proteomic data, and the subsequent integration into functional expression sets, provides a circuit map of the hierarchical cellular networks responsible for sustaining the viability and environmental competitiveness of cancer cells, that is, their robust systematics. These technologies can be used to 'snapshot' the unique patterns of molecular derangements and modified interactions in cancer, and allow for strategic selection of therapeutics that best match the individual profile of the tumour. This review highlights technology that can be used to selectively disrupt critical molecular targets and describes possible vehicles to deliver the synthesised molecular therapeutics to the relevant cellular compartments of the malignant cells. RNA interference (RNAi) involves a group of evolutionarily conserved gene silencing mechanisms in which small sequences of double-stranded RNA or intrinsic antisense RNA trigger mRNA cleavage or translational repression, respectively. Although RNAi molecules can be synthesised to 'silence' virtually any gene, even if upregulated, a mechanism for selective delivery of RNAi effectors to sites of malignant disease remains challenging. The authors will discuss gene-modified conditionally replicating viruses as candidate vehicles for the delivery of RNAi.
Sandhu, Maninder; Sureshkumar, V; Prakash, Chandra; Dixit, Rekha; Solanke, Amolkumar U; Sharma, Tilak Raj; Mohapatra, Trilochan; S V, Amitha Mithra
2017-09-30
Genome-wide microarray has enabled development of robust databases for functional genomics studies in rice. However, such databases do not directly cater to the needs of breeders. Here, we have attempted to develop a web interface which combines the information from functional genomic studies across different genetic backgrounds with DNA markers so that they can be readily deployed in crop improvement. In the current version of the database, we have included drought and salinity stress studies since these two are the major abiotic stresses in rice. RiceMetaSys, a user-friendly and freely available web interface provides comprehensive information on salt responsive genes (SRGs) and drought responsive genes (DRGs) across genotypes, crop development stages and tissues, identified from multiple microarray datasets. 'Physical position search' is an attractive tool for those using QTL based approach for dissecting tolerance to salt and drought stress since it can provide the list of SRGs and DRGs in any physical interval. To identify robust candidate genes for use in crop improvement, the 'common genes across varieties' search tool is useful. Graphical visualization of expression profiles across genes and rice genotypes has been enabled to facilitate the user and to make the comparisons more impactful. Simple Sequence Repeat (SSR) search in the SRGs and DRGs is a valuable tool for fine mapping and marker assisted selection since it provides primers for survey of polymorphism. An external link to intron specific markers is also provided for this purpose. Bulk retrieval of data without any limit has been enabled in case of locus and SSR search. The aim of this database is to facilitate users with a simple and straight-forward search options for identification of robust candidate genes from among thousands of SRGs and DRGs so as to facilitate linking variation in expression profiles to variation in phenotype. Database URL: http://14.139.229.201.
Ranking of Prokaryotic Genomes Based on Maximization of Sortedness of Gene Lengths
Bolshoy, A; Salih, B; Cohen, I; Tatarinova, T
2014-01-01
How variations of gene lengths (some genes become longer than their predecessors, while other genes become shorter and the sizes of these factions are randomly different from organism to organism) depend on organismal evolution and adaptation is still an open question. We propose to rank the genomes according to lengths of their genes, and then find association between the genome rank and variousproperties, such as growth temperature, nucleotide composition, and pathogenicity. This approach reveals evolutionary driving factors. The main purpose of this study is to test effectiveness and robustness of several ranking methods. The selected method of evaluation is measuring of overall sortedness of the data. We have demonstrated that all considered methods give consistent results and Bubble Sort and Simulated Annealing achieve the highest sortedness. Also, Bubble Sort is considerably faster than the Simulated Annealing method. PMID:26146586
Ranking of Prokaryotic Genomes Based on Maximization of Sortedness of Gene Lengths.
Bolshoy, A; Salih, B; Cohen, I; Tatarinova, T
How variations of gene lengths (some genes become longer than their predecessors, while other genes become shorter and the sizes of these factions are randomly different from organism to organism) depend on organismal evolution and adaptation is still an open question. We propose to rank the genomes according to lengths of their genes, and then find association between the genome rank and variousproperties, such as growth temperature, nucleotide composition, and pathogenicity. This approach reveals evolutionary driving factors. The main purpose of this study is to test effectiveness and robustness of several ranking methods. The selected method of evaluation is measuring of overall sortedness of the data. We have demonstrated that all considered methods give consistent results and Bubble Sort and Simulated Annealing achieve the highest sortedness. Also, Bubble Sort is considerably faster than the Simulated Annealing method.
Robust synthetic biology design: stochastic game theory approach.
Chen, Bor-Sen; Chang, Chia-Hung; Lee, Hsiao-Ching
2009-07-15
Synthetic biology is to engineer artificial biological systems to investigate natural biological phenomena and for a variety of applications. However, the development of synthetic gene networks is still difficult and most newly created gene networks are non-functioning due to uncertain initial conditions and disturbances of extra-cellular environments on the host cell. At present, how to design a robust synthetic gene network to work properly under these uncertain factors is the most important topic of synthetic biology. A robust regulation design is proposed for a stochastic synthetic gene network to achieve the prescribed steady states under these uncertain factors from the minimax regulation perspective. This minimax regulation design problem can be transformed to an equivalent stochastic game problem. Since it is not easy to solve the robust regulation design problem of synthetic gene networks by non-linear stochastic game method directly, the Takagi-Sugeno (T-S) fuzzy model is proposed to approximate the non-linear synthetic gene network via the linear matrix inequality (LMI) technique through the Robust Control Toolbox in Matlab. Finally, an in silico example is given to illustrate the design procedure and to confirm the efficiency and efficacy of the proposed robust gene design method. http://www.ee.nthu.edu.tw/bschen/SyntheticBioDesign_supplement.pdf.
An integrative approach for measuring semantic similarities using gene ontology.
Peng, Jiajie; Li, Hongxiang; Jiang, Qinghua; Wang, Yadong; Chen, Jin
2014-01-01
Gene Ontology (GO) provides rich information and a convenient way to study gene functional similarity, which has been successfully used in various applications. However, the existing GO based similarity measurements have limited functions for only a subset of GO information is considered in each measure. An appropriate integration of the existing measures to take into account more information in GO is demanding. We propose a novel integrative measure called InteGO2 to automatically select appropriate seed measures and then to integrate them using a metaheuristic search method. The experiment results show that InteGO2 significantly improves the performance of gene similarity in human, Arabidopsis and yeast on both molecular function and biological process GO categories. InteGO2 computes gene-to-gene similarities more accurately than tested existing measures and has high robustness. The supplementary document and software are available at http://mlg.hit.edu.cn:8082/.
iPcc: a novel feature extraction method for accurate disease class discovery and prediction
Ren, Xianwen; Wang, Yong; Zhang, Xiang-Sun; Jin, Qi
2013-01-01
Gene expression profiling has gradually become a routine procedure for disease diagnosis and classification. In the past decade, many computational methods have been proposed, resulting in great improvements on various levels, including feature selection and algorithms for classification and clustering. In this study, we present iPcc, a novel method from the feature extraction perspective to further propel gene expression profiling technologies from bench to bedside. We define ‘correlation feature space’ for samples based on the gene expression profiles by iterative employment of Pearson’s correlation coefficient. Numerical experiments on both simulated and real gene expression data sets demonstrate that iPcc can greatly highlight the latent patterns underlying noisy gene expression data and thus greatly improve the robustness and accuracy of the algorithms currently available for disease diagnosis and classification based on gene expression profiles. PMID:23761440
Liu, Li-Zhi; Wu, Fang-Xiang; Zhang, Wen-Jun
2014-01-01
As an abstract mapping of the gene regulations in the cell, gene regulatory network is important to both biological research study and practical applications. The reverse engineering of gene regulatory networks from microarray gene expression data is a challenging research problem in systems biology. With the development of biological technologies, multiple time-course gene expression datasets might be collected for a specific gene network under different circumstances. The inference of a gene regulatory network can be improved by integrating these multiple datasets. It is also known that gene expression data may be contaminated with large errors or outliers, which may affect the inference results. A novel method, Huber group LASSO, is proposed to infer the same underlying network topology from multiple time-course gene expression datasets as well as to take the robustness to large error or outliers into account. To solve the optimization problem involved in the proposed method, an efficient algorithm which combines the ideas of auxiliary function minimization and block descent is developed. A stability selection method is adapted to our method to find a network topology consisting of edges with scores. The proposed method is applied to both simulation datasets and real experimental datasets. It shows that Huber group LASSO outperforms the group LASSO in terms of both areas under receiver operating characteristic curves and areas under the precision-recall curves. The convergence analysis of the algorithm theoretically shows that the sequence generated from the algorithm converges to the optimal solution of the problem. The simulation and real data examples demonstrate the effectiveness of the Huber group LASSO in integrating multiple time-course gene expression datasets and improving the resistance to large errors or outliers.
Development and Validation of a qRT-PCR Classifier for Lung Cancer Prognosis
Chen, Guoan; Kim, Sinae; Taylor, Jeremy MG; Wang, Zhuwen; Lee, Oliver; Ramnath, Nithya; Reddy, Rishindra M; Lin, Jules; Chang, Andrew C; Orringer, Mark B; Beer, David G
2011-01-01
Purpose This prospective study aimed to develop a robust and clinically-applicable method to identify high-risk early stage lung cancer patients and then to validate this method for use in future translational studies. Patients and Methods Three published Affymetrix microarray data sets representing 680 primary tumors were used in the survival-related gene selection procedure using clustering, Cox model and random survival forest (RSF) analysis. A final set of 91 genes was selected and tested as a predictor of survival using a qRT-PCR-based assay utilizing an independent cohort of 101 lung adenocarcinomas. Results The RSF model built from 91 genes in the training set predicted patient survival in an independent cohort of 101 lung adenocarcinomas, with a prediction error rate of 26.6%. The mortality risk index (MRI) was significantly related to survival (Cox model p < 0.00001) and separated all patients into low, medium, and high-risk groups (HR = 1.00, 2.82, 4.42). The MRI was also related to survival in stage 1 patients (Cox model p = 0.001), separating patients into low, medium, and high-risk groups (HR = 1.00, 3.29, 3.77). Conclusions The development and validation of this robust qRT-PCR platform allows prediction of patient survival with early stage lung cancer. Utilization will now allow investigators to evaluate it prospectively by incorporation into new clinical trials with the goal of personalized treatment of lung cancer patients and improving patient survival. PMID:21792073
Zerbini, Francesca; Zanella, Ilaria; Fraccascia, Davide; König, Enrico; Irene, Carmela; Frattini, Luca F; Tomasi, Michele; Fantappiè, Laura; Ganfini, Luisa; Caproni, Elena; Parri, Matteo; Grandi, Alberto; Grandi, Guido
2017-04-24
The exploitation of the CRISPR/Cas9 machinery coupled to lambda (λ) recombinase-mediated homologous recombination (recombineering) is becoming the method of choice for genome editing in E. coli. First proposed by Jiang and co-workers, the strategy has been subsequently fine-tuned by several authors who demonstrated, by using few selected loci, that the efficiency of mutagenesis (number of mutant colonies over total number of colonies analyzed) can be extremely high (up to 100%). However, from published data it is difficult to appreciate the robustness of the technology, defined as the number of successfully mutated loci over the total number of targeted loci. This information is particularly relevant in high-throughput genome editing, where repetition of experiments to rescue missing mutants would be impractical. This work describes a "brute force" validation activity, which culminated in the definition of a robust, simple and rapid protocol for single or multiple gene deletions. We first set up our own version of the CRISPR/Cas9 protocol and then we evaluated the mutagenesis efficiency by changing different parameters including sequence of guide RNAs, length and concentration of donor DNAs, and use of single stranded and double stranded donor DNAs. We then validated the optimized conditions targeting 78 "dispensable" genes. This work led to the definition of a protocol, featuring the use of double stranded synthetic donor DNAs, which guarantees mutagenesis efficiencies consistently higher than 10% and a robustness of 100%. The procedure can be applied also for simultaneous gene deletions. This work defines for the first time the robustness of a CRISPR/Cas9-based protocol based on a large sample size. Since the technical solutions here proposed can be applied to other similar procedures, the data could be of general interest for the scientific community working on bacterial genome editing and, in particular, for those involved in synthetic biology projects requiring high throughput procedures.
Genetic Architecture Promotes the Evolution and Maintenance of Cooperation
Frénoy, Antoine; Taddei, François; Misevic, Dusan
2013-01-01
When cooperation has a direct cost and an indirect benefit, a selfish behavior is more likely to be selected for than an altruistic one. Kin and group selection do provide evolutionary explanations for the stability of cooperation in nature, but we still lack the full understanding of the genomic mechanisms that can prevent cheater invasion. In our study we used Aevol, an agent-based, in silico genomic platform to evolve populations of digital organisms that compete, reproduce, and cooperate by secreting a public good for tens of thousands of generations. We found that cooperating individuals may share a phenotype, defined as the amount of public good produced, but have very different abilities to resist cheater invasion. To understand the underlying genetic differences between cooperator types, we performed bio-inspired genomics analyses of our digital organisms by recording and comparing the locations of metabolic and secretion genes, as well as the relevant promoters and terminators. Association between metabolic and secretion genes (promoter sharing, overlap via frame shift or sense-antisense encoding) was characteristic for populations with robust cooperation and was more likely to evolve when secretion was costly. In mutational analysis experiments, we demonstrated the potential evolutionary consequences of the genetic association by performing a large number of mutations and measuring their phenotypic and fitness effects. The non-cooperating mutants arising from the individuals with genetic association were more likely to have metabolic deleterious mutations that eventually lead to selection eliminating such mutants from the population due to the accompanying fitness decrease. Effectively, cooperation evolved to be protected and robust to mutations through entangled genetic architecture. Our results confirm the importance of second-order selection on evolutionary outcomes, uncover an important genetic mechanism for the evolution and maintenance of cooperation, and suggest promising methods for preventing gene loss in synthetically engineered organisms. PMID:24278000
Springer, Mark S.; Burk, Angela; Kavanagh, John R.; Waddell, Victor G.; Stanhope, Michael J.
1997-01-01
The subclass Theria of Mammalia includes marsupials (infraclass Metatheria) and placentals (infraclass Eutheria). Within each group, interordinal relationships remain unclear. One limitation of many studies is incomplete ordinal representation. Here, we analyze DNA sequences for part of exon 1 of the interphotoreceptor retinoid binding protein gene, including 10 that are newly reported, for representatives of all therian orders. Among placentals, the most robust clades are Cetartiodactyla, Paenungulata, and an expanded African clade that includes paenungulates, tubulidentates, and macroscelideans. Anagalida, Archonta, Altungulata, Hyracoidea + Perissodactyla, Ungulata, and the “flying primate” hypothesis are rejected by statistical tests. Among marsupials, the most robust clade includes all orders except Didelphimorphia. The phylogenetic placement of the monito del monte and the marsupial mole remains unclear. However, the marsupial mole sequence contains three frameshift indels and numerous stop codons in all three reading frames. Given that the interphotoreceptor retinoid binding protein gene is a single-copy gene that functions in the visual cycle and that the marsupial mole is blind with degenerate eyes, this finding suggests that phenotypic degeneration of the eyes is accompanied by parallel changes at the molecular level as a result of relaxed selective constraints. PMID:9391099
Tsai, Yu-Shuen; Aguan, Kripamoy; Pal, Nikhil R.; Chung, I-Fang
2011-01-01
Informative genes from microarray data can be used to construct prediction model and investigate biological mechanisms. Differentially expressed genes, the main targets of most gene selection methods, can be classified as single- and multiple-class specific signature genes. Here, we present a novel gene selection algorithm based on a Group Marker Index (GMI), which is intuitive, of low-computational complexity, and efficient in identification of both types of genes. Most gene selection methods identify only single-class specific signature genes and cannot identify multiple-class specific signature genes easily. Our algorithm can detect de novo certain conditions of multiple-class specificity of a gene and makes use of a novel non-parametric indicator to assess the discrimination ability between classes. Our method is effective even when the sample size is small as well as when the class sizes are significantly different. To compare the effectiveness and robustness we formulate an intuitive template-based method and use four well-known datasets. We demonstrate that our algorithm outperforms the template-based method in difficult cases with unbalanced distribution. Moreover, the multiple-class specific genes are good biomarkers and play important roles in biological pathways. Our literature survey supports that the proposed method identifies unique multiple-class specific marker genes (not reported earlier to be related to cancer) in the Central Nervous System data. It also discovers unique biomarkers indicating the intrinsic difference between subtypes of lung cancer. We also associate the pathway information with the multiple-class specific signature genes and cross-reference to published studies. We find that the identified genes participate in the pathways directly involved in cancer development in leukemia data. Our method gives a promising way to find genes that can involve in pathways of multiple diseases and hence opens up the possibility of using an existing drug on other diseases as well as designing a single drug for multiple diseases. PMID:21909426
A functional metagenomic approach for expanding the synthetic biology toolbox for biomass conversion
Sommer, Morten OA; Church, George M; Dantas, Gautam
2010-01-01
Sustainable biofuel alternatives to fossil fuel energy are hampered by recalcitrance and toxicity of biomass substrates to microbial biocatalysts. To address this issue, we present a culture-independent functional metagenomic platform for mining Nature's vast enzymatic reservoir and show its relevance to biomass conversion. We performed functional selections on 4.7 Gb of metagenomic fosmid libraries and show that genetic elements conferring tolerance toward seven important biomass inhibitors can be identified. We select two metagenomic fosmids that improve the growth of Escherichia coli by 5.7- and 6.9-fold in the presence of inhibitory concentrations of syringaldehyde and 2-furoic acid, respectively, and identify the individual genes responsible for these tolerance phenotypes. Finally, we combine the individual genes to create a three-gene construct that confers tolerance to mixtures of these important biomass inhibitors. This platform presents a route for expanding the repertoire of genetic elements available to synthetic biology and provides a starting point for efforts to engineer robust strains for biofuel generation. PMID:20393580
DOE Office of Scientific and Technical Information (OSTI.GOV)
Uehara, Takeki, E-mail: takeki.uehara@shionogi.co.jp; Toxicogenomics Informatics Project, National Institute of Biomedical Innovation, 7-6-8 Asagi, Ibaraki, Osaka 567-0085; Minowa, Yohsuke
2011-09-15
The present study was performed to develop a robust gene-based prediction model for early assessment of potential hepatocarcinogenicity of chemicals in rats by using our toxicogenomics database, TG-GATEs (Genomics-Assisted Toxicity Evaluation System developed by the Toxicogenomics Project in Japan). The positive training set consisted of high- or middle-dose groups that received 6 different non-genotoxic hepatocarcinogens during a 28-day period. The negative training set consisted of high- or middle-dose groups of 54 non-carcinogens. Support vector machine combined with wrapper-type gene selection algorithms was used for modeling. Consequently, our best classifier yielded prediction accuracies for hepatocarcinogenicity of 99% sensitivity and 97% specificitymore » in the training data set, and false positive prediction was almost completely eliminated. Pathway analysis of feature genes revealed that the mitogen-activated protein kinase p38- and phosphatidylinositol-3-kinase-centered interactome and the v-myc myelocytomatosis viral oncogene homolog-centered interactome were the 2 most significant networks. The usefulness and robustness of our predictor were further confirmed in an independent validation data set obtained from the public database. Interestingly, similar positive predictions were obtained in several genotoxic hepatocarcinogens as well as non-genotoxic hepatocarcinogens. These results indicate that the expression profiles of our newly selected candidate biomarker genes might be common characteristics in the early stage of carcinogenesis for both genotoxic and non-genotoxic carcinogens in the rat liver. Our toxicogenomic model might be useful for the prospective screening of hepatocarcinogenicity of compounds and prioritization of compounds for carcinogenicity testing. - Highlights: >We developed a toxicogenomic model to predict hepatocarcinogenicity of chemicals. >The optimized model consisting of 9 probes had 99% sensitivity and 97% specificity. >This model enables us to detect genotoxic as well as non-genotoxic hepatocarcinogens.« less
Wang, Junbai; Wu, Qianqian; Hu, Xiaohua Tony; Tian, Tianhai
2016-11-01
Investigating the dynamics of genetic regulatory networks through high throughput experimental data, such as microarray gene expression profiles, is a very important but challenging task. One of the major hindrances in building detailed mathematical models for genetic regulation is the large number of unknown model parameters. To tackle this challenge, a new integrated method is proposed by combining a top-down approach and a bottom-up approach. First, the top-down approach uses probabilistic graphical models to predict the network structure of DNA repair pathway that is regulated by the p53 protein. Two networks are predicted, namely a network of eight genes with eight inferred interactions and an extended network of 21 genes with 17 interactions. Then, the bottom-up approach using differential equation models is developed to study the detailed genetic regulations based on either a fully connected regulatory network or a gene network obtained by the top-down approach. Model simulation error, parameter identifiability and robustness property are used as criteria to select the optimal network. Simulation results together with permutation tests of input gene network structures indicate that the prediction accuracy and robustness property of the two predicted networks using the top-down approach are better than those of the corresponding fully connected networks. In particular, the proposed approach reduces computational cost significantly for inferring model parameters. Overall, the new integrated method is a promising approach for investigating the dynamics of genetic regulation. Copyright © 2016 Elsevier Inc. All rights reserved.
Andries, Erik; Hagstrom, Thomas; Atlas, Susan R; Willman, Cheryl
2007-02-01
Linear discrimination, from the point of view of numerical linear algebra, can be treated as solving an ill-posed system of linear equations. In order to generate a solution that is robust in the presence of noise, these problems require regularization. Here, we examine the ill-posedness involved in the linear discrimination of cancer gene expression data with respect to outcome and tumor subclasses. We show that a filter factor representation, based upon Singular Value Decomposition, yields insight into the numerical ill-posedness of the hyperplane-based separation when applied to gene expression data. We also show that this representation yields useful diagnostic tools for guiding the selection of classifier parameters, thus leading to improved performance.
Selective robust optimization: A new intensity-modulated proton therapy optimization strategy
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Yupeng; Niemela, Perttu; Siljamaki, Sami
2015-08-15
Purpose: To develop a new robust optimization strategy for intensity-modulated proton therapy as an important step in translating robust proton treatment planning from research to clinical applications. Methods: In selective robust optimization, a worst-case-based robust optimization algorithm is extended, and terms of the objective function are selectively computed from either the worst-case dose or the nominal dose. Two lung cancer cases and one head and neck cancer case were used to demonstrate the practical significance of the proposed robust planning strategy. The lung cancer cases had minimal tumor motion less than 5 mm, and, for the demonstration of the methodology,more » are assumed to be static. Results: Selective robust optimization achieved robust clinical target volume (CTV) coverage and at the same time increased nominal planning target volume coverage to 95.8%, compared to the 84.6% coverage achieved with CTV-based robust optimization in one of the lung cases. In the other lung case, the maximum dose in selective robust optimization was lowered from a dose of 131.3% in the CTV-based robust optimization to 113.6%. Selective robust optimization provided robust CTV coverage in the head and neck case, and at the same time improved controls over isodose distribution so that clinical requirements may be readily met. Conclusions: Selective robust optimization may provide the flexibility and capability necessary for meeting various clinical requirements in addition to achieving the required plan robustness in practical proton treatment planning settings.« less
2015-01-01
Background Over the past 50,000 years, shifts in human-environmental or human-human interactions shaped genetic differences within and among human populations, including variants under positive selection. Shaped by environmental factors, such variants influence the genetics of modern health, disease, and treatment outcome. Because evolutionary processes tend to act on gene regulation, we test whether regulatory variants are under positive selection. We introduce a new approach to enhance detection of genetic markers undergoing positive selection, using conditional entropy to capture recent local selection signals. Results We use conditional logistic regression to compare our Adjusted Haplotype Conditional Entropy (H|H) measure of positive selection to existing positive selection measures. H|H and existing measures were applied to published regulatory variants acting in cis (cis-eQTLs), with conditional logistic regression testing whether regulatory variants undergo stronger positive selection than the surrounding gene. These cis-eQTLs were drawn from six independent studies of genotype and RNA expression. The conditional logistic regression shows that, overall, H|H is substantially more powerful than existing positive-selection methods in identifying cis-eQTLs against other Single Nucleotide Polymorphisms (SNPs) in the same genes. When broken down by Gene Ontology, H|H predictions are particularly strong in some biological process categories, where regulatory variants are under strong positive selection compared to the bulk of the gene, distinct from those GO categories under overall positive selection. . However, cis-eQTLs in a second group of genes lack positive selection signatures detectable by H|H, consistent with ancient short haplotypes compared to the surrounding gene (for example, in innate immunity GO:0042742); under such other modes of selection, H|H would not be expected to be a strong predictor.. These conditional logistic regression models are adjusted for Minor allele frequency(MAF); otherwise, ascertainment bias is a huge factor in all eQTL data sets. Relationships between Gene Ontology categories, positive selection and eQTL specificity were replicated with H|H in a single larger data set. Our measure, Adjusted Haplotype Conditional Entropy (H|H), was essential in generating all of the results above because it: 1) is a stronger overall predictor for eQTLs than comparable existing approaches, and 2) shows low sequential auto-correlation, overcoming problems with convergence of these conditional regression statistical models. Conclusions Our new method, H|H, provides a consistently more robust signal associated with cis-eQTLs compared to existing methods. We interpret this to indicate that some cis-eQTLs are under positive selection compared to their surrounding genes. Conditional entropy indicative of a selective sweep is an especially strong predictor of eQTLs for genes in several biological processes of medical interest. Where conditional entropy is a weak or negative predictor of eQTLs, such as innate immune genes, this would be consistent with balancing selection acting on such eQTLs over long time periods. Different measures of selection may be needed for variant prioritization under other modes of evolutionary selection. PMID:26111110
da Silva, Paula Renata Alves; Vidal, Marcia Soares; de Paula Soares, Cleiton; Polese, Valéria; Simões-Araújo, Jean Luís; Baldani, José Ivo
2016-11-01
Among the members of the genus Burkholderia, Burkholderia tropica has the ability to fix nitrogen and promote sugarcane plant growth as well as act as a biological control agent. There is little information about how this bacterium metabolizes carbohydrates as well as those carbon sources found in the sugarcane juice that accumulates in stems during plant growth. Reverse transcription quantitative PCR (RT-qPCR) can be used to evaluate changes in gene expression during bacterial growth on different carbon sources. Here we tested the expression of six reference genes, lpxC, gyrB, recA, rpoA, rpoB, and rpoD, when cells were grown with glucose, fructose, sucrose, mannitol, aconitic acid, and sugarcane juice as carbon sources. The lpxC, gyrB, and recA were selected as the most stable reference genes based on geNorm and NormFinder software analyses. Validation of these three reference genes during strain Ppe8 growth on the same carbon sources showed that genes involved in glycogen biosynthesis (glgA, glgB, glgC) and trehalose biosynthesis (treY and treZ) were highly expressed when Ppe8 was grown in aconitic acid relative to other carbon sources, while otsA expression (trehalose biosynthesis) was reduced with all carbon sources. In addition, the expression level of the ORF_6066 (gluconolactonase) gene was reduced on sugarcane juice. The results confirmed the stability of the three selected reference genes (lpxC, gyrB, and recA) during the RT-qPCR and also their robustness by evaluating the relative expression of genes involved in glycogen and trehalose biosynthesis when strain Ppe8 was grown on different carbon sources and sugarcane juice.
Peterson, Christopher W.; Wang, Jianbin; Deleage, Claire; Reddy, Sowmya; Kaur, Jasbir; Polacino, Patricia; Reik, Andreas; Huang, Meei-Li; Holmes, Michael C.; Estes, Jacob D.
2018-01-01
Autologous transplantation and engraftment of HIV-resistant cells in sufficient numbers should recapitulate the functional cure of the Berlin Patient, with applicability to a greater number of infected individuals and with a superior safety profile. A robust preclinical model of suppressed HIV infection is critical in order to test such gene therapy-based cure strategies, both alone and in combination with other cure strategies. Here, we present a nonhuman primate (NHP) model of latent infection using simian/human immunodeficiency virus (SHIV) and combination antiretroviral therapy (cART) in pigtail macaques. We demonstrate that transplantation of CCR5 gene-edited hematopoietic stem/progenitor cells (HSPCs) persist in infected and suppressed animals, and that protected cells expand through virus-dependent positive selection. CCR5 gene-edited cells are readily detectable in tissues, namely those closely associated with viral reservoirs such as lymph nodes and gastrointestinal tract. Following autologous transplantation, tissue-associated SHIV DNA and RNA levels in suppressed animals are significantly reduced (p ≤ 0.05), relative to suppressed, untransplanted control animals. In contrast, the size of the peripheral reservoir, measured by QVOA, is variably impacted by transplantation. Our studies demonstrate that CCR5 gene editing is equally feasible in infected and uninfected animals, that edited cells persist, traffic to, and engraft in tissue reservoirs, and that this approach significantly reduces secondary lymphoid tissue viral reservoir size. Our robust NHP model of HIV gene therapy and viral persistence can be immediately applied to the investigation of combinatorial approaches that incorporate anti-HIV gene therapy, immune modulators, therapeutic vaccination, and latency reversing agents. PMID:29672640
Pechenick, Dov A.; Payne, Joshua L.; Moore, Jason H.
2011-01-01
Gene regulatory networks (GRNs) drive the cellular processes that sustain life. To do so reliably, GRNs must be robust to perturbations, such as gene deletion and the addition or removal of regulatory interactions. GRNs must also be robust to genetic changes in regulatory regions that define the logic of signal-integration, as these changes can affect how specific combinations of regulatory signals are mapped to particular gene expression states. Previous theoretical analyses have demonstrated that the robustness of a GRN is influenced by its underlying topological properties, such as degree distribution and modularity. Another important topological property is assortativity, which measures the propensity with which nodes of similar connectivity are connected to one another. How assortativity influences the robustness of the signal-integration logic of GRNs remains an open question. Here, we use computational models of GRNs to investigate this relationship. We separately consider each of the three dynamical regimes of this model for a variety of degree distributions. We find that in the chaotic regime, robustness exhibits a pronounced increase as assortativity becomes more positive, while in the critical and ordered regimes, robustness is generally less sensitive to changes in assortativity. We attribute the increased robustness to a decrease in the duration of the gene expression pattern, which is caused by a reduction in the average size of a GRN’s in-components. This study provides the first direct evidence that assortativity influences the robustness of the signal-integration logic of computational models of GRNs, illuminates a mechanistic explanation for this influence, and furthers our understanding of the relationship between topology and robustness in complex biological systems. PMID:22155134
Kim, SungHwan; Lin, Chien-Wei; Tseng, George C
2016-07-01
Supervised machine learning is widely applied to transcriptomic data to predict disease diagnosis, prognosis or survival. Robust and interpretable classifiers with high accuracy are usually favored for their clinical and translational potential. The top scoring pair (TSP) algorithm is an example that applies a simple rank-based algorithm to identify rank-altered gene pairs for classifier construction. Although many classification methods perform well in cross-validation of single expression profile, the performance usually greatly reduces in cross-study validation (i.e. the prediction model is established in the training study and applied to an independent test study) for all machine learning methods, including TSP. The failure of cross-study validation has largely diminished the potential translational and clinical values of the models. The purpose of this article is to develop a meta-analytic top scoring pair (MetaKTSP) framework that combines multiple transcriptomic studies and generates a robust prediction model applicable to independent test studies. We proposed two frameworks, by averaging TSP scores or by combining P-values from individual studies, to select the top gene pairs for model construction. We applied the proposed methods in simulated data sets and three large-scale real applications in breast cancer, idiopathic pulmonary fibrosis and pan-cancer methylation. The result showed superior performance of cross-study validation accuracy and biomarker selection for the new meta-analytic framework. In conclusion, combining multiple omics data sets in the public domain increases robustness and accuracy of the classification model that will ultimately improve disease understanding and clinical treatment decisions to benefit patients. An R package MetaKTSP is available online. (http://tsenglab.biostat.pitt.edu/software.htm). ctseng@pitt.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Chen, Bor-Sen; Lin, Ying-Po
2013-01-01
Robust stabilization and environmental disturbance attenuation are ubiquitous systematic properties that are observed in biological systems at many different levels. The underlying principles for robust stabilization and environmental disturbance attenuation are universal to both complex biological systems and sophisticated engineering systems. In many biological networks, network robustness should be large enough to confer: intrinsic robustness for tolerating intrinsic parameter fluctuations; genetic robustness for buffering genetic variations; and environmental robustness for resisting environmental disturbances. Network robustness is needed so phenotype stability of biological network can be maintained, guaranteeing phenotype robustness. Synthetic biology is foreseen to have important applications in biotechnology and medicine; it is expected to contribute significantly to a better understanding of functioning of complex biological systems. This paper presents a unifying mathematical framework for investigating the principles of both robust stabilization and environmental disturbance attenuation for synthetic gene networks in synthetic biology. Further, from the unifying mathematical framework, we found that the phenotype robustness criterion for synthetic gene networks is the following: if intrinsic robustness + genetic robustness + environmental robustness ≦ network robustness, then the phenotype robustness can be maintained in spite of intrinsic parameter fluctuations, genetic variations, and environmental disturbances. Therefore, the trade-offs between intrinsic robustness, genetic robustness, environmental robustness, and network robustness in synthetic biology can also be investigated through corresponding phenotype robustness criteria from the systematic point of view. Finally, a robust synthetic design that involves network evolution algorithms with desired behavior under intrinsic parameter fluctuations, genetic variations, and environmental disturbances, is also proposed, together with a simulation example. PMID:23515190
Chen, Bor-Sen; Lin, Ying-Po
2013-01-01
Robust stabilization and environmental disturbance attenuation are ubiquitous systematic properties that are observed in biological systems at many different levels. The underlying principles for robust stabilization and environmental disturbance attenuation are universal to both complex biological systems and sophisticated engineering systems. In many biological networks, network robustness should be large enough to confer: intrinsic robustness for tolerating intrinsic parameter fluctuations; genetic robustness for buffering genetic variations; and environmental robustness for resisting environmental disturbances. Network robustness is needed so phenotype stability of biological network can be maintained, guaranteeing phenotype robustness. Synthetic biology is foreseen to have important applications in biotechnology and medicine; it is expected to contribute significantly to a better understanding of functioning of complex biological systems. This paper presents a unifying mathematical framework for investigating the principles of both robust stabilization and environmental disturbance attenuation for synthetic gene networks in synthetic biology. Further, from the unifying mathematical framework, we found that the phenotype robustness criterion for synthetic gene networks is the following: if intrinsic robustness + genetic robustness + environmental robustness ≦ network robustness, then the phenotype robustness can be maintained in spite of intrinsic parameter fluctuations, genetic variations, and environmental disturbances. Therefore, the trade-offs between intrinsic robustness, genetic robustness, environmental robustness, and network robustness in synthetic biology can also be investigated through corresponding phenotype robustness criteria from the systematic point of view. Finally, a robust synthetic design that involves network evolution algorithms with desired behavior under intrinsic parameter fluctuations, genetic variations, and environmental disturbances, is also proposed, together with a simulation example.
AUCTSP: an improved biomarker gene pair class predictor.
Kagaris, Dimitri; Khamesipour, Alireza; Yiannoutsos, Constantin T
2018-06-26
The Top Scoring Pair (TSP) classifier, based on the concept of relative ranking reversals in the expressions of pairs of genes, has been proposed as a simple, accurate, and easily interpretable decision rule for classification and class prediction of gene expression profiles. The idea that differences in gene expression ranking are associated with presence or absence of disease is compelling and has strong biological plausibility. Nevertheless, the TSP formulation ignores significant available information which can improve classification accuracy and is vulnerable to selecting genes which do not have differential expression in the two conditions ("pivot" genes). We introduce the AUCTSP classifier as an alternative rank-based estimator of the magnitude of the ranking reversals involved in the original TSP. The proposed estimator is based on the Area Under the Receiver Operating Characteristic (ROC) Curve (AUC) and as such, takes into account the separation of the entire distribution of gene expression levels in gene pairs under the conditions considered, as opposed to comparing gene rankings within individual subjects as in the original TSP formulation. Through extensive simulations and case studies involving classification in ovarian, leukemia, colon, breast and prostate cancers and diffuse large b-cell lymphoma, we show the superiority of the proposed approach in terms of improving classification accuracy, avoiding overfitting and being less prone to selecting non-informative (pivot) genes. The proposed AUCTSP is a simple yet reliable and robust rank-based classifier for gene expression classification. While the AUCTSP works by the same principle as TSP, its ability to determine the top scoring gene pair based on the relative rankings of two marker genes across all subjects as opposed to each individual subject results in significant performance gains in classification accuracy. In addition, the proposed method tends to avoid selection of non-informative (pivot) genes as members of the top-scoring pair.
Amelio, Antonio L; Caputi, Massimo; Conkright, Michael D
2009-01-01
The CREB regulated transcription co-activators (CRTCs) regulate many biological processes by integrating and converting environmental inputs into transcriptional responses. Although the mechanisms by which CRTCs sense cellular signals are characterized, little is known regarding how CRTCs contribute to the regulation of cAMP inducible genes. Here we show that these dynamic regulators, unlike other co-activators, independently direct either pre-mRNA splice-site selection or transcriptional activation depending on the cell type or promoter context. Moreover, in other scenarios, the CRTC co-activators coordinately regulate transcription and splicing. Mutational analyses showed that CRTCs possess distinct functional domains responsible for regulating either pre-mRNA splicing or transcriptional activation. Interestingly, the CRTC1–MAML2 oncoprotein lacks the splicing domain and is incapable of altering splice-site selection despite robustly activating transcription. The differential usage of these distinct domains allows CRTCs to selectively mediate multiple facets of gene regulation, indicating that co-activators are not solely restricted to coordinating alternative splicing with increase in transcriptional activity. PMID:19644446
2014-01-01
Background Network inference of gene expression data is an important challenge in systems biology. Novel algorithms may provide more detailed gene regulatory networks (GRN) for complex, chronic inflammatory diseases such as rheumatoid arthritis (RA), in which activated synovial fibroblasts (SFBs) play a major role. Since the detailed mechanisms underlying this activation are still unclear, simultaneous investigation of multi-stimuli activation of SFBs offers the possibility to elucidate the regulatory effects of multiple mediators and to gain new insights into disease pathogenesis. Methods A GRN was therefore inferred from RA-SFBs treated with 4 different stimuli (IL-1 β, TNF- α, TGF- β, and PDGF-D). Data from time series microarray experiments (0, 1, 2, 4, 12 h; Affymetrix HG-U133 Plus 2.0) were batch-corrected applying ‘ComBat’, analyzed for differentially expressed genes over time with ‘Limma’, and used for the inference of a robust GRN with NetGenerator V2.0, a heuristic ordinary differential equation-based method with soft integration of prior knowledge. Results Using all genes differentially expressed over time in RA-SFBs for any stimulus, and selecting the genes belonging to the most significant gene ontology (GO) term, i.e., ‘cartilage development’, a dynamic, robust, moderately complex multi-stimuli GRN was generated with 24 genes and 57 edges in total, 31 of which were gene-to-gene edges. Prior literature-based knowledge derived from Pathway Studio or manual searches was reflected in the final network by 25/57 confirmed edges (44%). The model contained known network motifs crucial for dynamic cellular behavior, e.g., cross-talk among pathways, positive feed-back loops, and positive feed-forward motifs (including suppression of the transcriptional repressor OSR2 by all 4 stimuli. Conclusion A multi-stimuli GRN highly concordant with literature data was successfully generated by network inference from the gene expression of stimulated RA-SFBs. The GRN showed high reliability, since 10 predicted edges were independently validated by literature findings post network inference. The selected GO term ‘cartilage development’ contained a number of differentiation markers, growth factors, and transcription factors with potential relevance for RA. Finally, the model provided new insight into the response of RA-SFBs to multiple stimuli implicated in the pathogenesis of RA, in particular to the ‘novel’ potent growth factor PDGF-D. PMID:24989895
Hung, Hsiao-Tung; Koh, Kyunghee; Sowcik, Mallory; Sehgal, Amita; Kelz, Max B.
2013-01-01
A robust, bistable switch regulates the fluctuations between wakefulness and natural sleep as well as those between wakefulness and anesthetic-induced unresponsiveness. We previously provided experimental evidence for the existence of a behavioral barrier to transitions between these states of arousal, which we call neural inertia. Here we show that neural inertia is controlled by processes that contribute to sleep homeostasis and requires four genes involved in electrical excitability: Sh, sss, na and unc79. Although loss of function mutations in these genes can increase or decrease sensitivity to anesthesia induction, surprisingly, they all collapse neural inertia. These effects are genetically selective: neural inertia is not perturbed by loss-of-function mutations in all genes required for the sleep/wake cycle. These effects are also anatomically selective: sss acts in different neurons to influence arousal-promoting and arousal-suppressing processes underlying neural inertia. Supporting the idea that anesthesia and sleep share some, but not all, genetic and anatomical arousal-regulating pathways, we demonstrate that increasing homeostatic sleep drive widens the neural inertial barrier. We propose that processes selectively contributing to sleep homeostasis and neural inertia may be impaired in pathophysiological conditions such as coma and persistent vegetative states. PMID:24039590
Hagan, Christy R; Regan, Tarah M; Dressing, Gwen E; Lange, Carol A
2011-06-01
Progesterone receptors (PR) are critical mediators of mammary gland development and contribute to breast cancer progression. Progestin-induced rapid activation of cytoplasmic protein kinases leads to selective regulation of growth-promoting genes by phospho-PR species. Herein, we show that phosphorylation of PR Ser81 is ck2 dependent and progestin regulated in intact cells but also occurs in the absence of PR ligands when cells enter the G(1)/S phase of the cell cycle. T47D breast cancer cells stably expressing a PR-B mutant receptor that cannot be phosphorylated at Ser79/81 (S79/81A) formed fewer soft agar colonies. Regulation of selected genes by PR-B, but not PR-A, also required Ser79/81 phosphorylation for basal and/or progestin-regulated (BIRC3, HSD11β2, and HbEGF) expression. Additionally, wild-type (wt) PR-B, but not S79/81A mutant PR, was robustly recruited to a progesterone response element (PRE)-containing transcriptional enhancer region of BIRC3; abundant ck2 also associated with this region in cells expressing wt but not S79/81A PR. We conclude that phospho-Ser81 PR provides a platform for ck2 recruitment and regulation of selected PR-B target genes. Understanding how ligand-independent PRs function in the context of high levels of kinase activities characteristic of breast cancer is critical to understanding the basis of tumor-specific changes in gene expression and will speed the development of highly selective treatments.
Hagan, Christy R.; Regan, Tarah M.; Dressing, Gwen E.; Lange, Carol A.
2011-01-01
Progesterone receptors (PR) are critical mediators of mammary gland development and contribute to breast cancer progression. Progestin-induced rapid activation of cytoplasmic protein kinases leads to selective regulation of growth-promoting genes by phospho-PR species. Herein, we show that phosphorylation of PR Ser81 is ck2 dependent and progestin regulated in intact cells but also occurs in the absence of PR ligands when cells enter the G1/S phase of the cell cycle. T47D breast cancer cells stably expressing a PR-B mutant receptor that cannot be phosphorylated at Ser79/81 (S79/81A) formed fewer soft agar colonies. Regulation of selected genes by PR-B, but not PR-A, also required Ser79/81 phosphorylation for basal and/or progestin-regulated (BIRC3, HSD11β2, and HbEGF) expression. Additionally, wild-type (wt) PR-B, but not S79/81A mutant PR, was robustly recruited to a progesterone response element (PRE)-containing transcriptional enhancer region of BIRC3; abundant ck2 also associated with this region in cells expressing wt but not S79/81A PR. We conclude that phospho-Ser81 PR provides a platform for ck2 recruitment and regulation of selected PR-B target genes. Understanding how ligand-independent PRs function in the context of high levels of kinase activities characteristic of breast cancer is critical to understanding the basis of tumor-specific changes in gene expression and will speed the development of highly selective treatments. PMID:21518957
A P-Norm Robust Feature Extraction Method for Identifying Differentially Expressed Genes
Liu, Jian; Liu, Jin-Xing; Gao, Ying-Lian; Kong, Xiang-Zhen; Wang, Xue-Song; Wang, Dong
2015-01-01
In current molecular biology, it becomes more and more important to identify differentially expressed genes closely correlated with a key biological process from gene expression data. In this paper, based on the Schatten p-norm and Lp-norm, a novel p-norm robust feature extraction method is proposed to identify the differentially expressed genes. In our method, the Schatten p-norm is used as the regularization function to obtain a low-rank matrix and the Lp-norm is taken as the error function to improve the robustness to outliers in the gene expression data. The results on simulation data show that our method can obtain higher identification accuracies than the competitive methods. Numerous experiments on real gene expression data sets demonstrate that our method can identify more differentially expressed genes than the others. Moreover, we confirmed that the identified genes are closely correlated with the corresponding gene expression data. PMID:26201006
A P-Norm Robust Feature Extraction Method for Identifying Differentially Expressed Genes.
Liu, Jian; Liu, Jin-Xing; Gao, Ying-Lian; Kong, Xiang-Zhen; Wang, Xue-Song; Wang, Dong
2015-01-01
In current molecular biology, it becomes more and more important to identify differentially expressed genes closely correlated with a key biological process from gene expression data. In this paper, based on the Schatten p-norm and Lp-norm, a novel p-norm robust feature extraction method is proposed to identify the differentially expressed genes. In our method, the Schatten p-norm is used as the regularization function to obtain a low-rank matrix and the Lp-norm is taken as the error function to improve the robustness to outliers in the gene expression data. The results on simulation data show that our method can obtain higher identification accuracies than the competitive methods. Numerous experiments on real gene expression data sets demonstrate that our method can identify more differentially expressed genes than the others. Moreover, we confirmed that the identified genes are closely correlated with the corresponding gene expression data.
Kuramae, Eiko E; Robert, Vincent; Echavarri-Erasun, Carlos; Boekhout, Teun
2007-01-01
Background The construction of robust and well resolved phylogenetic trees is important for our understanding of many, if not all biological processes, including speciation and origin of higher taxa, genome evolution, metabolic diversification, multicellularity, origin of life styles, pathogenicity and so on. Many older phylogenies were not well supported due to insufficient phylogenetic signal present in the single or few genes used in phylogenetic reconstructions. Importantly, single gene phylogenies were not always found to be congruent. The phylogenetic signal may, therefore, be increased by enlarging the number of genes included in phylogenetic studies. Unfortunately, concatenation of many genes does not take into consideration the evolutionary history of each individual gene. Here, we describe an approach to select informative phylogenetic proteins to be used in the Tree of Life (TOL) and barcoding projects by comparing the cophenetic correlation coefficients (CCC) among individual protein distance matrices of proteins, using the fungi as an example. The method demonstrated that the quality and number of concatenated proteins is important for a reliable estimation of TOL. Approximately 40–45 concatenated proteins seem needed to resolve fungal TOL. Results In total 4852 orthologous proteins (KOGs) were assigned among 33 fungal genomes from the Asco- and Basidiomycota and 70 of these represented single copy proteins. The individual protein distance matrices based on 531 concatenated proteins that has been used for phylogeny reconstruction before [14] were compared one with another in order to select those with the highest CCC, which then was used as a reference. This reference distance matrix was compared with those of the 70 single copy proteins selected and their CCC values were calculated. Sixty four KOGs showed a CCC above 0.50 and these were further considered for their phylogenetic potential. Proteins belonging to the cellular processes and signaling KOG category seem more informative than those belonging to the other three categories: information storage and processing; metabolism; and the poorly characterized category. After concatenation of 40 proteins the topology of the phylogenetic tree remained stable, but after concatenation of 60 or more proteins the bootstrap support values of some branches decreased, most likely due to the inclusion of proteins with lowers CCC values. The selection of protein sequences to be used in various TOL projects remains a critical and important process. The method described in this paper will contribute to a more objective selection of phylogenetically informative protein sequences. Conclusion This study provides candidate protein sequences to be considered as phylogenetic markers in different branches of fungal TOL. The selection procedure described here will be useful to select informative protein sequences to resolve branches of TOL that contain few or no species with completely sequenced genomes. The robust phylogenetic trees resulting from this method may contribute to our understanding of organismal diversification processes. The method proposed can be extended easily to other branches of TOL. PMID:17688684
Hsu, Chih-Yuan; Pan, Zhen-Ming; Hu, Rei-Hsing; Chang, Chih-Chun; Cheng, Hsiao-Chun; Lin, Che; Chen, Bor-Sen
2015-01-01
In this study, robust biological filters with an external control to match a desired input/output (I/O) filtering response are engineered based on the well-characterized promoter-RBS libraries and a cascade gene circuit topology. In the field of synthetic biology, the biological filter system serves as a powerful detector or sensor to sense different molecular signals and produces a specific output response only if the concentration of the input molecular signal is higher or lower than a specified threshold. The proposed systematic design method of robust biological filters is summarized into three steps. Firstly, several well-characterized promoter-RBS libraries are established for biological filter design by identifying and collecting the quantitative and qualitative characteristics of their promoter-RBS components via nonlinear parameter estimation method. Then, the topology of synthetic biological filter is decomposed into three cascade gene regulatory modules, and an appropriate promoter-RBS library is selected for each module to achieve the desired I/O specification of a biological filter. Finally, based on the proposed systematic method, a robust externally tunable biological filter is engineered by searching the promoter-RBS component libraries and a control inducer concentration library to achieve the optimal reference match for the specified I/O filtering response.
Liao, J. G.; Mcmurry, Timothy; Berg, Arthur
2014-01-01
Empirical Bayes methods have been extensively used for microarray data analysis by modeling the large number of unknown parameters as random effects. Empirical Bayes allows borrowing information across genes and can automatically adjust for multiple testing and selection bias. However, the standard empirical Bayes model can perform poorly if the assumed working prior deviates from the true prior. This paper proposes a new rank-conditioned inference in which the shrinkage and confidence intervals are based on the distribution of the error conditioned on rank of the data. Our approach is in contrast to a Bayesian posterior, which conditions on the data themselves. The new method is almost as efficient as standard Bayesian methods when the working prior is close to the true prior, and it is much more robust when the working prior is not close. In addition, it allows a more accurate (but also more complex) non-parametric estimate of the prior to be easily incorporated, resulting in improved inference. The new method’s prior robustness is demonstrated via simulation experiments. Application to a breast cancer gene expression microarray dataset is presented. Our R package rank.Shrinkage provides a ready-to-use implementation of the proposed methodology. PMID:23934072
Super-delta: a new differential gene expression analysis procedure with robust data normalization.
Liu, Yuhang; Zhang, Jinfeng; Qiu, Xing
2017-12-21
Normalization is an important data preparation step in gene expression analyses, designed to remove various systematic noise. Sample variance is greatly reduced after normalization, hence the power of subsequent statistical analyses is likely to increase. On the other hand, variance reduction is made possible by borrowing information across all genes, including differentially expressed genes (DEGs) and outliers, which will inevitably introduce some bias. This bias typically inflates type I error; and can reduce statistical power in certain situations. In this study we propose a new differential expression analysis pipeline, dubbed as super-delta, that consists of a multivariate extension of the global normalization and a modified t-test. A robust procedure is designed to minimize the bias introduced by DEGs in the normalization step. The modified t-test is derived based on asymptotic theory for hypothesis testing that suitably pairs with the proposed robust normalization. We first compared super-delta with four commonly used normalization methods: global, median-IQR, quantile, and cyclic loess normalization in simulation studies. Super-delta was shown to have better statistical power with tighter control of type I error rate than its competitors. In many cases, the performance of super-delta is close to that of an oracle test in which datasets without technical noise were used. We then applied all methods to a collection of gene expression datasets on breast cancer patients who received neoadjuvant chemotherapy. While there is a substantial overlap of the DEGs identified by all of them, super-delta were able to identify comparatively more DEGs than its competitors. Downstream gene set enrichment analysis confirmed that all these methods selected largely consistent pathways. Detailed investigations on the relatively small differences showed that pathways identified by super-delta have better connections to breast cancer than other methods. As a new pipeline, super-delta provides new insights to the area of differential gene expression analysis. Solid theoretical foundation supports its asymptotic unbiasedness and technical noise-free properties. Implementation on real and simulated datasets demonstrates its decent performance compared with state-of-art procedures. It also has the potential of expansion to be incorporated with other data type and/or more general between-group comparison problems.
Physiology is rocking the foundations of evolutionary biology.
Noble, Denis
2013-08-01
The 'Modern Synthesis' (Neo-Darwinism) is a mid-20th century gene-centric view of evolution, based on random mutations accumulating to produce gradual change through natural selection. Any role of physiological function in influencing genetic inheritance was excluded. The organism became a mere carrier of the real objects of selection, its genes. We now know that genetic change is far from random and often not gradual. Molecular genetics and genome sequencing have deconstructed this unnecessarily restrictive view of evolution in a way that reintroduces physiological function and interactions with the environment as factors influencing the speed and nature of inherited change. Acquired characteristics can be inherited, and in a few but growing number of cases that inheritance has now been shown to be robust for many generations. The 21st century can look forward to a new synthesis that will reintegrate physiology with evolutionary biology.
Keel, Brittney N; Zarek, Christina M; Keele, John W; Kuehn, Larry A; Snelling, Warren M; Oliver, William T; Freetly, Harvey C; Lindholm-Perry, Amanda K
2018-06-04
Feed intake and body weight gain are economically important inputs and outputs of beef production systems. The purpose of this study was to discover differentially expressed genes that will be robust for feed intake and gain across a large segment of the cattle industry. Transcriptomic studies often suffer from issues with reproducibility and cross-validation. One way to improve reproducibility is by integrating multiple datasets via meta-analysis. RNA sequencing (RNA-Seq) was performed on longissimus dorsi muscle from 80 steers (5 cohorts, each with 16 animals) selected from the outside fringe of a bivariate gain and feed intake distribution to understand the genes and pathways involved in feed efficiency. In each cohort, 16 steers were selected from one of four gain and feed intake phenotypes (n = 4 per phenotype) in a 2 × 2 factorial arrangement with gain and feed intake as main effect variables. Each cohort was analyzed as a single experiment using a generalized linear model and results from the 5 cohort analyses were combined in a meta-analysis to identify differentially expressed genes (DEG) across the cohorts. A total of 51 genes were differentially expressed for the main effect of gain, 109 genes for the intake main effect, and 11 genes for the gain x intake interaction (P corrected < 0.05). A jackknife sensitivity analysis showed that, in general, the meta-analysis produced robust DEGs for the two main effects and their interaction. Pathways identified from over-represented genes included mitochondrial energy production and oxidative stress pathways for the main effect of gain due to DEG including GPD1, NDUFA6, UQCRQ, ACTC1, and MGST3. For intake, metabolic pathways including amino acid biosynthesis and degradation were identified, and for the interaction analysis the pathways identified included GADD45, pyridoxal 5'phosphate salvage, and caveolar mediated endocytosis signaling. Variation among DEG identified by cohort suggests that environment and breed may play large roles in the expression of genes associated with feed efficiency in the muscle of beef cattle. Meta-analyses of transcriptome data from groups of animals over multiple cohorts may be necessary to elucidate the genetics contributing these types of biological phenotypes.
Directional selection causes decanalization in a group I ribozyme.
Hayden, Eric J; Weikert, Christian; Wagner, Andreas
2012-01-01
A canalized genotype is robust to environmental or genetic perturbations. Canalization is expected to result from stabilizing selection on a well-adapted phenotype. Decanalization, the loss of robustness, might follow periods of directional selection toward a new optimum. The evolutionary forces causing decanalization are still unknown, in part because it is difficult to determine the fitness effects of mutations in populations of organisms with complex genotypes and phenotypes. Here, we report direct experimental measurements of robustness in a system with a simple genotype and phenotype, the catalytic activity of an RNA enzyme. We find that the robustness of a population of RNA enzymes decreases during a period of directional selection in the laboratory. The decrease in robustness is primarily caused by the selective sweep of a genotype that is decanalized relative to the wild-type, both in terms of mutational robustness and environmental robustness (thermodynamic stability). Our results experimentally demonstrate that directional selection can cause decanalization on short time scales, and demonstrate co-evolution of mutational and environmental robustness.
Lipinski, Kamil A; Kaniak-Golik, Aneta; Golik, Pawel
2010-01-01
As a legacy of their endosymbiotic eubacterial origin, mitochondria possess a residual genome, encoding only a few proteins and dependent on a variety of factors encoded by the nuclear genome for its maintenance and expression. As a facultative anaerobe with well understood genetics and molecular biology, Saccharomyces cerevisiae is the model system of choice for studying nucleo-mitochondrial genetic interactions. Maintenance of the mitochondrial genome is controlled by a set of nuclear-coded factors forming intricately interconnected circuits responsible for replication, recombination, repair and transmission to buds. Expression of the yeast mitochondrial genome is regulated mostly at the post-transcriptional level, and involves many general and gene-specific factors regulating splicing, RNA processing and stability and translation. A very interesting aspect of the yeast mitochondrial system is the relationship between genome maintenance and gene expression. Deletions of genes involved in many different aspects of mitochondrial gene expression, notably translation, result in an irreversible loss of functional mtDNA. The mitochondrial genetic system viewed from the systems biology perspective is therefore very fragile and lacks robustness compared to the remaining systems of the cell. This lack of robustness could be a legacy of the reductive evolution of the mitochondrial genome, but explanations involving selective advantages of increased evolvability have also been postulated. Copyright © 2009 Elsevier B.V. All rights reserved.
Wear, Emma K; Wilbanks, Elizabeth G; Nelson, Craig E; Carlson, Craig A
2018-03-09
Primers targeting the 16S small subunit ribosomal RNA marker gene, used to characterize bacterial and archaeal communities, have recently been re-evaluated for marine planktonic habitats. To investigate whether primer selection affects the ecological interpretation of bacterioplankton populations and community dynamics, amplicon sequencing with four primer sets targeting several hypervariable regions of the 16S rRNA gene was conducted on both mock communities constructed from cloned 16S rRNA genes and a time-series of DNA samples from the temperate coastal Santa Barbara Channel. Ecological interpretations of community structure (delineation of depth and seasonality, correlations with environmental factors) were similar across primer sets, while population dynamics varied. We observed substantial differences in relative abundances of taxa known to be poorly resolved by some primer sets, such as Thaumarchaeota and SAR11, and unexpected taxa including Roseobacter clades. Though the magnitude of relative abundances of common OTUs differed between primer sets, the relative abundances of the OTUs were nonetheless strongly correlated. We do not endorse one primer set but rather enumerate strengths and weaknesses to facilitate selection appropriate to a system or experimental goal. While 16S rRNA gene primer bias suggests caution in assessing quantitative population dynamics, community dynamics appear robust across studies using different primers. © 2018 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.
Chabris, Christopher F; Lee, James J; Benjamin, Daniel J; Beauchamp, Jonathan P; Glaeser, Edward L; Borst, Gregoire; Pinker, Steven; Laibson, David I
2013-10-01
We explain why traits of interest to behavioral scientists may have a genetic architecture featuring hundreds or thousands of loci with tiny individual effects rather than a few with large effects and why such an architecture makes it difficult to find robust associations between traits and genes. We conducted a genome-wide association study at 2 sites, Harvard University and Union College, measuring more than 100 physical and behavioral traits with a sample size typical of candidate gene studies. We evaluated predictions that alleles with large effect sizes would be rare and most traits of interest to social science are likely characterized by a lack of strong directional selection. We also carried out a theoretical analysis of the genetic architecture of traits based on R.A. Fisher's geometric model of natural selection and empirical analyses of the effects of selection bias and phenotype measurement stability on the results of genetic association studies. Although we replicated several known genetic associations with physical traits, we found only 2 associations with behavioral traits that met the nominal genome-wide significance threshold, indicating that physical and behavioral traits are mainly affected by numerous genes with small effects. The challenge for social science genomics is the likelihood that genes are connected to behavioral variation by lengthy, nonlinear, interactive causal chains, and unraveling these chains requires allying with personal genomics to take advantage of the potential for large sample sizes as well as continuing with traditional epidemiological studies.
GeneStoryTeller: a mobile app for quick and comprehensive information retrieval of human genes.
Eleftheriou, Stergiani V; Bourdakou, Marilena M; Athanasiadis, Emmanouil I; Spyrou, George M
2015-01-01
In the last few years, mobile devices such as smartphones and tablets have become an integral part of everyday life, due to their software/hardware rapid development, as well as the increased portability they offer. Nevertheless, up to now, only few Apps have been developed in the field of bioinformatics, capable to perform fast and robust access to services. We have developed the GeneStoryTeller, a mobile application for Android platforms, where users are able to instantly retrieve information regarding any recorded human gene, derived from eight publicly available databases, as a summary story. Complementary information regarding gene-drugs interactions, functional annotation and disease associations for each selected gene is also provided in the gene story. The most challenging part during the development of the GeneStoryTeller was to keep balance between storing data locally within the app and obtaining the updated content dynamically via a network connection. This was accomplished with the implementation of an administrative site where data are curated and synchronized with the application requiring a minimum human intervention. © The Author(s) 2015. Published by Oxford University Press.
Autoimmunity as a Driving Force of Cognitive Evolution
Nataf, Serge
2017-01-01
In the last decades, increasingly robust experimental approaches have formally demonstrated that autoimmunity is a physiological process involved in a large range of functions including cognition. On this basis, the recently enunciated “brain superautoantigens” theory proposes that autoimmunity has been a driving force of cognitive evolution. It is notably suggested that the immune and nervous systems have somehow co-evolved and exerted a mutual selection pressure benefiting to both systems. In this two-way process, the evolutionary-determined emergence of neurons expressing specific immunogenic antigens (brain superautoantigens) has exerted a selection pressure on immune genes shaping the T-cell repertoire. Such a selection pressure on immune genes has translated into the emergence of a finely tuned autoimmune T-cell repertoire that promotes cognition. In another hand, the evolutionary-determined emergence of brain-autoreactive T-cells has exerted a selection pressure on neural genes coding for brain superautoantigens. Such a selection pressure has translated into the emergence of a neural repertoire (defined here as the whole of neurons, synapses and non-neuronal cells involved in cognitive functions) expressing brain superautoantigens. Overall, the brain superautoantigens theory suggests that cognitive evolution might have been primarily driven by internal cues rather than external environmental conditions. Importantly, while providing a unique molecular connection between neural and T-cell repertoires under physiological conditions, brain superautoantigens may also constitute an Achilles heel responsible for the particular susceptibility of Homo sapiens to “neuroimmune co-pathologies” i.e., disorders affecting both neural and T-cell repertoires. These may notably include paraneoplastic syndromes, multiple sclerosis as well as autism, schizophrenia and neurodegenerative diseases. In the context of this theoretical frame, a specific emphasis is given here to the potential evolutionary role exerted by two families of genes, namely the MHC class II genes, involved in antigen presentation to T-cells, and the Foxp genes, which play crucial roles in language (Foxp2) and the regulation of autoimmunity (Foxp3). PMID:29123465
Zhang, Lei; Zhao, Xihua; Zhang, Guoxiu; Zhang, Jiajia; Wang, Xuedong; Zhang, Suping; Wang, Wei; Wei, Dongzhi
2016-02-09
Filamentous fungi play important roles in the production of plant cell-wall degrading enzymes. In recent years, homologous recombinant technologies have contributed significantly to improved enzymes production and system design of genetically manipulated strains. When introducing multiple gene deletions, we need a robust and convenient way to control selectable marker genes, especially when only a limited number of markers are available in filamentous fungi. Integration after transformation is predominantly nonhomologous in most fungi other than yeast. Fungal strains deficient in the non-homologous end-joining (NHEJ) pathway have limitations associated with gene function analyses despite they are excellent recipient strains for gene targets. We describe strategies and methods to address these challenges above and leverage the power of resilient NHEJ deficiency strains. We have established a foolproof light-inducible platform for one-step unmarked genetic modification in industrial eukaryotic microorganisms designated as 'LML 3.0', and an on-off control protocol of NHEJ pathway called 'OFN 1.0', using a synthetic light-switchable transactivation to control Cre recombinase-based excision and inversion. The methods provide a one-step strategy to sequentially modify genes without introducing selectable markers and NHEJ-deficiency. The strategies can be used to manipulate many biological processes in a wide range of eukaryotic cells.
Automatic design of synthetic gene circuits through mixed integer non-linear programming.
Huynh, Linh; Kececioglu, John; Köppe, Matthias; Tagkopoulos, Ilias
2012-01-01
Automatic design of synthetic gene circuits poses a significant challenge to synthetic biology, primarily due to the complexity of biological systems, and the lack of rigorous optimization methods that can cope with the combinatorial explosion as the number of biological parts increases. Current optimization methods for synthetic gene design rely on heuristic algorithms that are usually not deterministic, deliver sub-optimal solutions, and provide no guaranties on convergence or error bounds. Here, we introduce an optimization framework for the problem of part selection in synthetic gene circuits that is based on mixed integer non-linear programming (MINLP), which is a deterministic method that finds the globally optimal solution and guarantees convergence in finite time. Given a synthetic gene circuit, a library of characterized parts, and user-defined constraints, our method can find the optimal selection of parts that satisfy the constraints and best approximates the objective function given by the user. We evaluated the proposed method in the design of three synthetic circuits (a toggle switch, a transcriptional cascade, and a band detector), with both experimentally constructed and synthetic promoter libraries. Scalability and robustness analysis shows that the proposed framework scales well with the library size and the solution space. The work described here is a step towards a unifying, realistic framework for the automated design of biological circuits.
Zhao, Huifen; Pestina, Tamara I; Nasimuzzaman, Md; Mehta, Perdeep; Hargrove, Phillip W; Persons, Derek A
2009-06-04
Correction of murine models of beta-thalassemia has been achieved through high-level globin lentiviral vector gene transfer into mouse hematopoietic stem cells (HSCs). However, transduction of human HSCs is less robust and may be inadequate to achieve therapeutic levels of genetically modified erythroid cells. We therefore developed a double gene lentiviral vector encoding both human gamma-globin under the transcriptional control of erythroid regulatory elements and methylguanine methyltransferase (MGMT), driven by a constitutive cellular promoter. MGMT expression provides cellular resistance to alkylator drugs, which can be administered to kill residual untransduced, diseased HSCs, whereas transduced cells are protected. Mice transplanted with beta-thalassemic HSCs transduced with a gamma-globin/MGMT vector initially had subtherapeutic levels of red cells expressing gamma-globin. To enrich gamma-globin-expressing cells, transplanted mice were treated with the alkylator agent 1,3-bis-chloroethyl-1-nitrosourea. This resulted in significant increases in the number of gamma-globin-expressing red cells and the amount of fetal hemoglobin, leading to resolution of anemia. Selection of transduced HSCs was also obtained when cells were drug-treated before transplantation. Mice that received these cells demonstrated reconstitution with therapeutic levels of gamma-globin-expressing cells. These data suggest that MGMT-based drug selection holds promise as a modality to improve gene therapy for beta-thalassemia.
Phenotypic Robustness and the Assortativity Signature of Human Transcription Factor Networks
Pechenick, Dov A.; Payne, Joshua L.; Moore, Jason H.
2014-01-01
Many developmental, physiological, and behavioral processes depend on the precise expression of genes in space and time. Such spatiotemporal gene expression phenotypes arise from the binding of sequence-specific transcription factors (TFs) to DNA, and from the regulation of nearby genes that such binding causes. These nearby genes may themselves encode TFs, giving rise to a transcription factor network (TFN), wherein nodes represent TFs and directed edges denote regulatory interactions between TFs. Computational studies have linked several topological properties of TFNs — such as their degree distribution — with the robustness of a TFN's gene expression phenotype to genetic and environmental perturbation. Another important topological property is assortativity, which measures the tendency of nodes with similar numbers of edges to connect. In directed networks, assortativity comprises four distinct components that collectively form an assortativity signature. We know very little about how a TFN's assortativity signature affects the robustness of its gene expression phenotype to perturbation. While recent theoretical results suggest that increasing one specific component of a TFN's assortativity signature leads to increased phenotypic robustness, the biological context of this finding is currently limited because the assortativity signatures of real-world TFNs have not been characterized. It is therefore unclear whether these earlier theoretical findings are biologically relevant. Moreover, it is not known how the other three components of the assortativity signature contribute to the phenotypic robustness of TFNs. Here, we use publicly available DNaseI-seq data to measure the assortativity signatures of genome-wide TFNs in 41 distinct human cell and tissue types. We find that all TFNs share a common assortativity signature and that this signature confers phenotypic robustness to model TFNs. Lastly, we determine the extent to which each of the four components of the assortativity signature contributes to this robustness. PMID:25121490
Robust Learning of High-dimensional Biological Networks with Bayesian Networks
NASA Astrophysics Data System (ADS)
Nägele, Andreas; Dejori, Mathäus; Stetter, Martin
Structure learning of Bayesian networks applied to gene expression data has become a potentially useful method to estimate interactions between genes. However, the NP-hardness of Bayesian network structure learning renders the reconstruction of the full genetic network with thousands of genes unfeasible. Consequently, the maximal network size is usually restricted dramatically to a small set of genes (corresponding with variables in the Bayesian network). Although this feature reduction step makes structure learning computationally tractable, on the downside, the learned structure might be adversely affected due to the introduction of missing genes. Additionally, gene expression data are usually very sparse with respect to the number of samples, i.e., the number of genes is much greater than the number of different observations. Given these problems, learning robust network features from microarray data is a challenging task. This chapter presents several approaches tackling the robustness issue in order to obtain a more reliable estimation of learned network features.
Gibson, Scott M; Ficklin, Stephen P; Isaacson, Sven; Luo, Feng; Feltus, Frank A; Smith, Melissa C
2013-01-01
The study of gene relationships and their effect on biological function and phenotype is a focal point in systems biology. Gene co-expression networks built using microarray expression profiles are one technique for discovering and interpreting gene relationships. A knowledge-independent thresholding technique, such as Random Matrix Theory (RMT), is useful for identifying meaningful relationships. Highly connected genes in the thresholded network are then grouped into modules that provide insight into their collective functionality. While it has been shown that co-expression networks are biologically relevant, it has not been determined to what extent any given network is functionally robust given perturbations in the input sample set. For such a test, hundreds of networks are needed and hence a tool to rapidly construct these networks. To examine functional robustness of networks with varying input, we enhanced an existing RMT implementation for improved scalability and tested functional robustness of human (Homo sapiens), rice (Oryza sativa) and budding yeast (Saccharomyces cerevisiae). We demonstrate dramatic decrease in network construction time and computational requirements and show that despite some variation in global properties between networks, functional similarity remains high. Moreover, the biological function captured by co-expression networks thresholded by RMT is highly robust.
Biomarkers of the Hedgehog/Smoothened pathway in healthy volunteers
Kadam, Sunil K; Patel, Bharvin K R; Jones, Emma; Nguyen, Tuan S; Verma, Lalit K; Landschulz, Katherine T; Stepaniants, Sergey; Li, Bin; Brandt, John T; Brail, Leslie H
2012-01-01
The Hedgehog (Hh) pathway is involved in oncogenic transformation and tumor maintenance. The primary objective of this study was to select surrogate tissue to measure messenger ribonucleic acid (mRNA) levels of Hh pathway genes for measurement of pharmacodynamic effect. Expression of Hh pathway specific genes was measured by quantitative real time polymerase chain reaction (qRT-PCR) and global gene expression using Affymetrix U133 microarrays. Correlations were made between the expression of specific genes determined by qRT-PCR and normalized microarray data. Gene ontology analysis using microarray data for a broader set of Hh pathway genes was performed to identify additional Hh pathway-related markers in the surrogate tissue. RNA extracted from blood, hair follicle, and skin obtained from healthy subjects was analyzed by qRT-PCR for 31 genes, whereas 8 samples were analyzed for a 7-gene subset. Twelve sample sets, each with ≤500 ng total RNA derived from hair, skin, and blood, were analyzed using Affymetrix U133 microarrays. Transcripts for several Hh pathway genes were undetectable in blood using qRT-PCR. Skin was the most desirable matrix, followed by hair follicle. Whether processed by robust multiarray average or microarray suite 5 (MAS5), expression patterns of individual samples showed co-clustered signals; both normalization methods were equally effective for unsupervised analysis. The MAS5- normalized probe sets appeared better suited for supervised analysis. This work provides the basis for selection of a surrogate tissue and an expression analysis-based approach to evaluate pathway-related genes as markers of pharmacodynamic effect with novel inhibitors of the Hh pathway. PMID:22611475
Analysis of gene network robustness based on saturated fixed point attractors
2014-01-01
The analysis of gene network robustness to noise and mutation is important for fundamental and practical reasons. Robustness refers to the stability of the equilibrium expression state of a gene network to variations of the initial expression state and network topology. Numerical simulation of these variations is commonly used for the assessment of robustness. Since there exists a great number of possible gene network topologies and initial states, even millions of simulations may be still too small to give reliable results. When the initial and equilibrium expression states are restricted to being saturated (i.e., their elements can only take values 1 or −1 corresponding to maximum activation and maximum repression of genes), an analytical gene network robustness assessment is possible. We present this analytical treatment based on determination of the saturated fixed point attractors for sigmoidal function models. The analysis can determine (a) for a given network, which and how many saturated equilibrium states exist and which and how many saturated initial states converge to each of these saturated equilibrium states and (b) for a given saturated equilibrium state or a given pair of saturated equilibrium and initial states, which and how many gene networks, referred to as viable, share this saturated equilibrium state or the pair of saturated equilibrium and initial states. We also show that the viable networks sharing a given saturated equilibrium state must follow certain patterns. These capabilities of the analytical treatment make it possible to properly define and accurately determine robustness to noise and mutation for gene networks. Previous network research conclusions drawn from performing millions of simulations follow directly from the results of our analytical treatment. Furthermore, the analytical results provide criteria for the identification of model validity and suggest modified models of gene network dynamics. The yeast cell-cycle network is used as an illustration of the practical application of this analytical treatment. PMID:24650364
Shah, Sheel; Lubeck, Eric; Schwarzkopf, Maayan; He, Ting-Fang; Greenbaum, Alon; Sohn, Chang Ho; Lignell, Antti; Choi, Harry M T; Gradinaru, Viviana; Pierce, Niles A; Cai, Long
2016-08-01
Accurate and robust detection of mRNA molecules in thick tissue samples can reveal gene expression patterns in single cells within their native environment. Preserving spatial relationships while accessing the transcriptome of selected cells is a crucial feature for advancing many biological areas - from developmental biology to neuroscience. However, because of the high autofluorescence background of many tissue samples, it is difficult to detect single-molecule fluorescence in situ hybridization (smFISH) signals robustly in opaque thick samples. Here, we draw on principles from the emerging discipline of dynamic nucleic acid nanotechnology to develop a robust method for multi-color, multi-RNA imaging in deep tissues using single-molecule hybridization chain reaction (smHCR). Using this approach, single transcripts can be imaged using epifluorescence, confocal or selective plane illumination microscopy (SPIM) depending on the imaging depth required. We show that smHCR has high sensitivity in detecting mRNAs in cell culture and whole-mount zebrafish embryos, and that combined with SPIM and PACT (passive CLARITY technique) tissue hydrogel embedding and clearing, smHCR can detect single mRNAs deep within thick (0.5 mm) brain slices. By simultaneously achieving ∼20-fold signal amplification and diffraction-limited spatial resolution, smHCR offers a robust and versatile approach for detecting single mRNAs in situ, including in thick tissues where high background undermines the performance of unamplified smFISH. © 2016. Published by The Company of Biologists Ltd.
Nyaboga, Evans; Tripathi, Jaindra N.; Manoharan, Rajesh; Tripathi, Leena
2014-01-01
Although genetic transformation of clonally propagated crops has been widely studied as a tool for crop improvement and as a vital part of the development of functional genomics resources, there has been no report of any existing Agrobacterium-mediated transformation of yam (Dioscorea spp.) with evidence of stable integration of T-DNA. Yam is an important crop in the tropics and subtropics providing food security and income to over 300 million people. However, yam production remains constrained by increasing levels of field and storage pests and diseases. A major constraint to the development of biotechnological approaches for yam improvement has been the lack of an efficient and robust transformation and regeneration system. In this study, we developed an Agrobacterium-mediated transformation of Dioscorea rotundata using axillary buds as explants. Two cultivars of D. rotundata were transformed using Agrobacterium tumefaciens harboring the binary vectors containing selectable marker and reporter genes. After selection with appropriate concentrations of antibiotic, shoots were developed on shoot induction and elongation medium. The elongated antibiotic-resistant shoots were subsequently rooted on medium supplemented with selection agent. Successful transformation was confirmed by polymerase chain reaction, Southern blot analysis, and reporter genes assay. Expression of gusA gene in transgenic plants was also verified by reverse transcription polymerase chain reaction analysis. Transformation efficiency varied from 9.4 to 18.2% depending on the cultivars, selectable marker genes, and the Agrobacterium strain used for transformation. It took 3–4 months from Agro-infection to regeneration of complete transgenic plant. Here we report an efficient, fast and reproducible protocol for Agrobacterium-mediated transformation of D. rotundata using axillary buds as explants, which provides a useful platform for future genetic engineering studies in this economically important crop. PMID:25309562
Selfish cellular networks and the evolution of complex organisms.
Kourilsky, Philippe
2012-03-01
Human gametogenesis takes years and involves many cellular divisions, particularly in males. Consequently, gametogenesis provides the opportunity to acquire multiple de novo mutations. A significant portion of these is likely to impact the cellular networks linking genes, proteins, RNA and metabolites, which constitute the functional units of cells. A wealth of literature shows that these individual cellular networks are complex, robust and evolvable. To some extent, they are able to monitor their own performance, and display sufficient autonomy to be termed "selfish". Their robustness is linked to quality control mechanisms which are embedded in and act upon the individual networks, thereby providing a basis for selection during gametogenesis. These selective processes are equally likely to affect cellular functions that are not gamete-specific, and the evolution of the most complex organisms, including man, is therefore likely to occur via two pathways: essential housekeeping functions would be regulated and evolve during gametogenesis within the parents before being transmitted to their progeny, while classical selection would operate on other traits of the organisms that shape their fitness with respect to the environment. Copyright © 2012 Académie des sciences. Published by Elsevier SAS. All rights reserved.
Robustness, evolvability, and the logic of genetic regulation.
Payne, Joshua L; Moore, Jason H; Wagner, Andreas
2014-01-01
In gene regulatory circuits, the expression of individual genes is commonly modulated by a set of regulating gene products, which bind to a gene's cis-regulatory region. This region encodes an input-output function, referred to as signal-integration logic, that maps a specific combination of regulatory signals (inputs) to a particular expression state (output) of a gene. The space of all possible signal-integration functions is vast and the mapping from input to output is many-to-one: For the same set of inputs, many functions (genotypes) yield the same expression output (phenotype). Here, we exhaustively enumerate the set of signal-integration functions that yield identical gene expression patterns within a computational model of gene regulatory circuits. Our goal is to characterize the relationship between robustness and evolvability in the signal-integration space of regulatory circuits, and to understand how these properties vary between the genotypic and phenotypic scales. Among other results, we find that the distributions of genotypic robustness are skewed, so that the majority of signal-integration functions are robust to perturbation. We show that the connected set of genotypes that make up a given phenotype are constrained to specific regions of the space of all possible signal-integration functions, but that as the distance between genotypes increases, so does their capacity for unique innovations. In addition, we find that robust phenotypes are (i) evolvable, (ii) easily identified by random mutation, and (iii) mutationally biased toward other robust phenotypes. We explore the implications of these latter observations for mutation-based evolution by conducting random walks between randomly chosen source and target phenotypes. We demonstrate that the time required to identify the target phenotype is independent of the properties of the source phenotype.
Designing climate-resilient rice with ideal grain quality suited for high-temperature stress
Sreenivasulu, Nese; Butardo, Vito M.; Misra, Gopal; Cuevas, Rosa Paula; Anacleto, Roslen; Kavi Kishor, Polavarpu B.
2015-01-01
To ensure rice food security, the target outputs of future rice breeding programmes should focus on developing climate-resilient rice varieties with emphasis on increased head rice yield coupled with superior grain quality. This challenge is made greater by a world that is increasingly becoming warmer. Such environmental changes dramatically impact head rice and milling yield as well as increasing chalkiness because of impairment in starch accumulation and other storage biosynthetic pathways in the grain. This review highlights the knowledge gained through gene discovery via quantitative trait locus (QTL) cloning and structural–functional genomic strategies to reduce chalk, increase head rice yield, and develop stable lines with optimum grain quality in challenging environments. The newly discovered genes and the knowledge gained on the influence of specific alleles related to stability of grain quality attributes provide a robust platform for marker-assisted selection in breeding to design heat-tolerant rice varieties with superior grain quality. Using the chalkiness trait in rice as a case study, we demonstrate here that the emerging field of systems genetics can help fast-track the identification of novel alleles and gene targets that can be pyramided for the development of environmentally robust rice varieties that possess improved grain quality. PMID:25662847
Ranking metrics in gene set enrichment analysis: do they matter?
Zyla, Joanna; Marczyk, Michal; Weiner, January; Polanska, Joanna
2017-05-12
There exist many methods for describing the complex relation between changes of gene expression in molecular pathways or gene ontologies under different experimental conditions. Among them, Gene Set Enrichment Analysis seems to be one of the most commonly used (over 10,000 citations). An important parameter, which could affect the final result, is the choice of a metric for the ranking of genes. Applying a default ranking metric may lead to poor results. In this work 28 benchmark data sets were used to evaluate the sensitivity and false positive rate of gene set analysis for 16 different ranking metrics including new proposals. Furthermore, the robustness of the chosen methods to sample size was tested. Using k-means clustering algorithm a group of four metrics with the highest performance in terms of overall sensitivity, overall false positive rate and computational load was established i.e. absolute value of Moderated Welch Test statistic, Minimum Significant Difference, absolute value of Signal-To-Noise ratio and Baumgartner-Weiss-Schindler test statistic. In case of false positive rate estimation, all selected ranking metrics were robust with respect to sample size. In case of sensitivity, the absolute value of Moderated Welch Test statistic and absolute value of Signal-To-Noise ratio gave stable results, while Baumgartner-Weiss-Schindler and Minimum Significant Difference showed better results for larger sample size. Finally, the Gene Set Enrichment Analysis method with all tested ranking metrics was parallelised and implemented in MATLAB, and is available at https://github.com/ZAEDPolSl/MrGSEA . Choosing a ranking metric in Gene Set Enrichment Analysis has critical impact on results of pathway enrichment analysis. The absolute value of Moderated Welch Test has the best overall sensitivity and Minimum Significant Difference has the best overall specificity of gene set analysis. When the number of non-normally distributed genes is high, using Baumgartner-Weiss-Schindler test statistic gives better outcomes. Also, it finds more enriched pathways than other tested metrics, which may induce new biological discoveries.
Liu, Wan-Ting; Wang, Yang; Zhang, Jing; Ye, Fei; Huang, Xiao-Hui; Li, Bin; He, Qing-Yu
2018-07-01
Lung adenocarcinoma (LAC) is the most lethal cancer and the leading cause of cancer-related death worldwide. The identification of meaningful clusters of co-expressed genes or representative biomarkers may help improve the accuracy of LAC diagnoses. Public databases, such as the Gene Expression Omnibus (GEO), provide rich resources of valuable information for clinics, however, the integration of multiple microarray datasets from various platforms and institutes remained a challenge. To determine potential indicators of LAC, we performed genome-wide relative significance (GWRS), genome-wide global significance (GWGS) and support vector machine (SVM) analyses progressively to identify robust gene biomarker signatures from 5 different microarray datasets that included 330 samples. The top 200 genes with robust signatures were selected for integrative analysis according to "guilt-by-association" methods, including protein-protein interaction (PPI) analysis and gene co-expression analysis. Of these 200 genes, only 10 genes showed both intensive PPI network and high gene co-expression correlation (r > 0.8). IPA analysis of this regulatory networks suggested that the cell cycle process is a crucial determinant of LAC. CENPA, as well as two linked hub genes CDK1 and CDC20, are determined to be potential indicators of LAC. Immunohistochemical staining showed that CENPA, CDK1 and CDC20 were highly expressed in LAC cancer tissue with co-expression patterns. A Cox regression model indicated that LAC patients with CENPA + /CDK1 + and CENPA + /CDC20 + were high-risk groups in terms of overall survival. In conclusion, our integrated microarray analysis demonstrated that CENPA, CDK1 and CDC20 might serve as novel cluster of prognostic biomarkers for LAC, and the cooperative unit of three genes provides a technically simple approach for identification of LAC patients. Copyright © 2018 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bhattacharya, Tanmoy; Stanton, Jennifer; Kim, Eun - Young
2008-01-01
A selective advantage against infectious diseases such as HIV/AIDS is associated with differences in the genes relevant to immunity and virus replication. The CC chemokine receptor 5 (CCR5), the principal coreceptor for HIV, and its chemokine ligands, including CCL3L1, influences the CD4+ target cells susceptibility to infection. The CCL3L1 gene is in a region of segmental duplication on the q-arm of human chromosome 17. Increased numbers of CCL3L1 gene copies that affect the gene expression phenotype might have substantial protective effects. Here we show that the population-specific CCL3L1 gene copy number and the CCR5 {Delta}32 protein-inactivating deletion that categorizes themore » CCL3L1-CCR5 genotype do not influence HIV/AIDS susceptibility or the robustness of immune recovery after the initiation of highly active antiretroviral therapy (HAART).« less
A large-scale benchmark of gene prioritization methods.
Guala, Dimitri; Sonnhammer, Erik L L
2017-04-21
In order to maximize the use of results from high-throughput experimental studies, e.g. GWAS, for identification and diagnostics of new disease-associated genes, it is important to have properly analyzed and benchmarked gene prioritization tools. While prospective benchmarks are underpowered to provide statistically significant results in their attempt to differentiate the performance of gene prioritization tools, a strategy for retrospective benchmarking has been missing, and new tools usually only provide internal validations. The Gene Ontology(GO) contains genes clustered around annotation terms. This intrinsic property of GO can be utilized in construction of robust benchmarks, objective to the problem domain. We demonstrate how this can be achieved for network-based gene prioritization tools, utilizing the FunCoup network. We use cross-validation and a set of appropriate performance measures to compare state-of-the-art gene prioritization algorithms: three based on network diffusion, NetRank and two implementations of Random Walk with Restart, and MaxLink that utilizes network neighborhood. Our benchmark suite provides a systematic and objective way to compare the multitude of available and future gene prioritization tools, enabling researchers to select the best gene prioritization tool for the task at hand, and helping to guide the development of more accurate methods.
Truong, Cong-Doan; Kwon, Yung-Keun
2017-12-21
Biological networks consisting of molecular components and interactions are represented by a graph model. There have been some studies based on that model to analyze a relationship between structural characteristics and dynamical behaviors in signaling network. However, little attention has been paid to changes of modularity and robustness in mutant networks. In this paper, we investigated the changes of modularity and robustness by edge-removal mutations in three signaling networks. We first observed that both the modularity and robustness increased on average in the mutant network by the edge-removal mutations. However, the modularity change was negatively correlated with the robustness change. This implies that it is unlikely that both the modularity and the robustness values simultaneously increase by the edge-removal mutations. Another interesting finding is that the modularity change was positively correlated with the degree, the number of feedback loops, and the edge betweenness of the removed edges whereas the robustness change was negatively correlated with them. We note that these results were consistently observed in randomly structure networks. Additionally, we identified two groups of genes which are incident to the highly-modularity-increasing and the highly-robustness-decreasing edges with respect to the edge-removal mutations, respectively, and observed that they are likely to be central by forming a connected component of a considerably large size. The gene-ontology enrichment of each of these gene groups was significantly different from the rest of genes. Finally, we showed that the highly-robustness-decreasing edges can be promising edgetic drug-targets, which validates the usefulness of our analysis. Taken together, the analysis of changes of robustness and modularity against edge-removal mutations can be useful to unravel novel dynamical characteristics underlying in signaling networks.
A powerful and robust test in genetic association studies.
Cheng, Kuang-Fu; Lee, Jen-Yu
2014-01-01
There are several well-known single SNP tests presented in the literature for detecting gene-disease association signals. Having in place an efficient and robust testing process across all genetic models would allow a more comprehensive approach to analysis. Although some studies have shown that it is possible to construct such a test when the variants are common and the genetic model satisfies certain conditions, the model conditions are too restrictive and in general difficult to verify. In this paper, we propose a powerful and robust test without assuming any model restrictions. Our test is based on the selected 2 × 2 tables derived from the usual 2 × 3 table. By signals from these tables, we show through simulations across a wide range of allele frequencies and genetic models that this approach may produce a test which is almost uniformly most powerful in the analysis of low- and high-frequency variants. Two cancer studies are used to demonstrate applications of the proposed test. © 2014 S. Karger AG, Basel.
Matsuzaki, Yasunori; Oue, Miho; Hirai, Hirokazu
2014-02-15
Certain inherited progressive neurodegenerative disorders, such as spinocerebellar ataxia (SCA), affect neurons in large areas of the central nervous system (CNS). The selective expression of disease-causing and therapeutic genes in susceptible regions and cell types is critical for the generation of animal models and development of gene therapies for these diseases. Previous studies have demonstrated the advantages of the short synapsin I (SynI) promoter (0.5 kb) as a neuron-specific promoter for robust transgene expression. However, the short SynI promoter has also shown some promoter activity in glia and a lack of transgene expression in significant areas of the CNS. New methods: To improve the SynI promoter, we used a SynI promoter that is twice as long (1.0 kb) as the short SynI promoter and incorporated a minimal CMV (minCMV) sequence. We observed that the 1.0 kb rat SynI promoter with minCMV [rSynI(1.0)-minCMV] exhibited robust promoter strength, excellent neuronal specificity and wide-ranging transgene expression throughout the CNS. Comparison with existing methods: Compared with the two previously reported short (0.5 kb) promoters, the new promoter was superior with respect to neuronal specificity and more efficiently transduced neurons. Moreover, transgenic mice expressing the mutant protein ATXN1[Q98], which causes SCA type 1 (SCA1), under the control of the rSynI(1.0)-minCMV promoter showed robust transgene expression specifically in neurons throughout the CNS and exhibited progressive ataxia. rSynI(1.0)-minCMV drives robust and neuron-specific transgene expression throughout the CNS and is therefore useful for viral vector-mediated neuron-specific gene delivery and generation of neuron-specific transgenic animals. Copyright © 2013 Elsevier B.V. All rights reserved.
Evolution of robustness to damage in artificial 3-dimensional development.
Joachimczak, Michał; Wróbel, Borys
2012-09-01
GReaNs is an Artificial Life platform we have built to investigate the general principles that guide evolution of multicellular development and evolution of artificial gene regulatory networks. The embryos develop in GReaNs in a continuous 3-dimensional (3D) space with simple physics. The developmental trajectories are indirectly encoded in linear genomes. The genomes are not limited in size and determine the topology of gene regulatory networks that are not limited in the number of nodes. The expression of the genes is continuous and can be modified by adding environmental noise. In this paper we evolved development of structures with a specific shape (an ellipsoid) and asymmetrical pattering (a 3D pattern inspired by the French flag problem), and investigated emergence of the robustness to damage in development and the emergence of the robustness to noise. Our results indicate that both types of robustness are related, and that including noise during evolution promotes higher robustness to damage. Interestingly, we have observed that some evolved gene regulatory networks rely on noise for proper behaviour. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Defining Aggressive Prostate Cancer Using a 12-Gene Model1
Riva, Alberto; Kim, Robert; Varambally, Sooryanarayana; He, Le; Kutok, Jeff; Aster, Jonathan C; Tang, Jeffery; Kuefer, Rainer; Hofer, Matthias D; Febbo, Phillip G; Chinnaiyan, Arul M; Rubin, Mark A
2006-01-01
Abstract The critical clinical question in prostate cancer research is: How do we develop means of distinguishing aggressive disease from indolent disease? Using a combination of proteomic and expression array data, we identified a set of 36 genes with concordant dysregulation of protein products that could be evaluated in situ by quantitative immunohistochemistry. Another five prostate cancer biomarkers were included using linear discriminant analysis, we determined that the optimal model used to predict prostate cancer progression consisted of 12 proteins. Using a separate patient population, transcriptional levels of the 12 genes encoding for these proteins predicted prostate-specific antigen failure in 79 men following surgery for clinically localized prostate cancer (P = .0015). This study demonstrates that cross-platform models can lead to predictive models with the possible advantage of being more robust through this selection process. PMID:16533427
Analytical validation of a psychiatric pharmacogenomic test.
Jablonski, Michael R; King, Nina; Wang, Yongbao; Winner, Joel G; Watterson, Lucas R; Gunselman, Sandra; Dechairo, Bryan M
2018-05-01
The aim of this study was to validate the analytical performance of a combinatorial pharmacogenomics test designed to aid in the appropriate medication selection for neuropsychiatric conditions. Genomic DNA was isolated from buccal swabs. Twelve genes (65 variants/alleles) associated with psychotropic medication metabolism, side effects, and mechanisms of actions were evaluated by bead array, MALDI-TOF mass spectrometry, and/or capillary electrophoresis methods (GeneSight Psychotropic, Assurex Health, Inc.). The combinatorial pharmacogenomics test has a dynamic range of 2.5-20 ng/μl of input genomic DNA, with comparable performance for all assays included in the test. Both the precision and accuracy of the test were >99.9%, with individual gene components between 99.4 and 100%. This study demonstrates that the combinatorial pharmacogenomics test is robust and reproducible, making it suitable for clinical use.
Li, Ziyi; Safo, Sandra E; Long, Qi
2017-07-11
Sparse principal component analysis (PCA) is a popular tool for dimensionality reduction, pattern recognition, and visualization of high dimensional data. It has been recognized that complex biological mechanisms occur through concerted relationships of multiple genes working in networks that are often represented by graphs. Recent work has shown that incorporating such biological information improves feature selection and prediction performance in regression analysis, but there has been limited work on extending this approach to PCA. In this article, we propose two new sparse PCA methods called Fused and Grouped sparse PCA that enable incorporation of prior biological information in variable selection. Our simulation studies suggest that, compared to existing sparse PCA methods, the proposed methods achieve higher sensitivity and specificity when the graph structure is correctly specified, and are fairly robust to misspecified graph structures. Application to a glioblastoma gene expression dataset identified pathways that are suggested in the literature to be related with glioblastoma. The proposed sparse PCA methods Fused and Grouped sparse PCA can effectively incorporate prior biological information in variable selection, leading to improved feature selection and more interpretable principal component loadings and potentially providing insights on molecular underpinnings of complex diseases.
Guyon, Laurent; Lajaunie, Christian; Fer, Frédéric; Bhajun, Ricky; Sulpice, Eric; Pinna, Guillaume; Campalans, Anna; Radicella, J Pablo; Rouillier, Philippe; Mary, Mélissa; Combe, Stéphanie; Obeid, Patricia; Vert, Jean-Philippe; Gidrol, Xavier
2015-09-18
Phenotypic screening monitors phenotypic changes induced by perturbations, including those generated by drugs or RNA interference. Currently-used methods for scoring screen hits have proven to be problematic, particularly when applied to physiologically relevant conditions such as low cell numbers or inefficient transfection. Here, we describe the Φ-score, which is a novel scoring method for the identification of phenotypic modifiers or hits in cell-based screens. Φ-score performance was assessed with simulations, a validation experiment and its application to gene identification in a large-scale RNAi screen. Using robust statistics and a variance model, we demonstrated that the Φ-score showed better sensitivity, selectivity and reproducibility compared to classical approaches. The improved performance of the Φ-score paves the way for cell-based screening of primary cells, which are often difficult to obtain from patients in sufficient numbers. We also describe a dedicated merging procedure to pool scores from small interfering RNAs targeting the same gene so as to provide improved visualization and hit selection.
Guyon, Laurent; Lajaunie, Christian; fer, Frédéric; bhajun, Ricky; sulpice, Eric; pinna, Guillaume; campalans, Anna; radicella, J. Pablo; rouillier, Philippe; mary, Mélissa; combe, Stéphanie; obeid, Patricia; vert, Jean-Philippe; gidrol, Xavier
2015-01-01
Phenotypic screening monitors phenotypic changes induced by perturbations, including those generated by drugs or RNA interference. Currently-used methods for scoring screen hits have proven to be problematic, particularly when applied to physiologically relevant conditions such as low cell numbers or inefficient transfection. Here, we describe the Φ-score, which is a novel scoring method for the identification of phenotypic modifiers or hits in cell-based screens. Φ-score performance was assessed with simulations, a validation experiment and its application to gene identification in a large-scale RNAi screen. Using robust statistics and a variance model, we demonstrated that the Φ-score showed better sensitivity, selectivity and reproducibility compared to classical approaches. The improved performance of the Φ-score paves the way for cell-based screening of primary cells, which are often difficult to obtain from patients in sufficient numbers. We also describe a dedicated merging procedure to pool scores from small interfering RNAs targeting the same gene so as to provide improved visualization and hit selection. PMID:26382112
Chowdhury, Nilotpal; Sapru, Shantanu
2015-01-01
Microarray analysis has revolutionized the role of genomic prognostication in breast cancer. However, most studies are single series studies, and suffer from methodological problems. We sought to use a meta-analytic approach in combining multiple publicly available datasets, while correcting for batch effects, to reach a more robust oncogenomic analysis. The aim of the present study was to find gene sets associated with distant metastasis free survival (DMFS) in systemically untreated, node-negative breast cancer patients, from publicly available genomic microarray datasets. Four microarray series (having 742 patients) were selected after a systematic search and combined. Cox regression for each gene was done for the combined dataset (univariate, as well as multivariate - adjusted for expression of Cell cycle related genes) and for the 4 major molecular subtypes. The centre and microarray batch effects were adjusted by including them as random effects variables. The Cox regression coefficients for each analysis were then ranked and subjected to a Gene Set Enrichment Analysis (GSEA). Gene sets representing protein translation were independently negatively associated with metastasis in the Luminal A and Luminal B subtypes, but positively associated with metastasis in Basal tumors. Proteinaceous extracellular matrix (ECM) gene set expression was positively associated with metastasis, after adjustment for expression of cell cycle related genes on the combined dataset. Finally, the positive association of the proliferation-related genes with metastases was confirmed. To the best of our knowledge, the results depicting mixed prognostic significance of protein translation in breast cancer subtypes are being reported for the first time. We attribute this to our study combining multiple series and performing a more robust meta-analytic Cox regression modeling on the combined dataset, thus discovering 'hidden' associations. This methodology seems to yield new and interesting results and may be used as a tool to guide new research.
Chowdhury, Nilotpal; Sapru, Shantanu
2015-01-01
Introduction Microarray analysis has revolutionized the role of genomic prognostication in breast cancer. However, most studies are single series studies, and suffer from methodological problems. We sought to use a meta-analytic approach in combining multiple publicly available datasets, while correcting for batch effects, to reach a more robust oncogenomic analysis. Aim The aim of the present study was to find gene sets associated with distant metastasis free survival (DMFS) in systemically untreated, node-negative breast cancer patients, from publicly available genomic microarray datasets. Methods Four microarray series (having 742 patients) were selected after a systematic search and combined. Cox regression for each gene was done for the combined dataset (univariate, as well as multivariate – adjusted for expression of Cell cycle related genes) and for the 4 major molecular subtypes. The centre and microarray batch effects were adjusted by including them as random effects variables. The Cox regression coefficients for each analysis were then ranked and subjected to a Gene Set Enrichment Analysis (GSEA). Results Gene sets representing protein translation were independently negatively associated with metastasis in the Luminal A and Luminal B subtypes, but positively associated with metastasis in Basal tumors. Proteinaceous extracellular matrix (ECM) gene set expression was positively associated with metastasis, after adjustment for expression of cell cycle related genes on the combined dataset. Finally, the positive association of the proliferation-related genes with metastases was confirmed. Conclusion To the best of our knowledge, the results depicting mixed prognostic significance of protein translation in breast cancer subtypes are being reported for the first time. We attribute this to our study combining multiple series and performing a more robust meta-analytic Cox regression modeling on the combined dataset, thus discovering 'hidden' associations. This methodology seems to yield new and interesting results and may be used as a tool to guide new research. PMID:26080057
Wen, Shuxiang; Chen, Xiaoling; Xu, Fuzhou; Sun, Huiling
2016-01-01
Real-time quantitative reverse transcription PCR (qRT-PCR) offers a robust method for measurement of gene expression levels. Selection of reliable reference gene(s) for gene expression study is conducive to reduce variations derived from different amounts of RNA and cDNA, the efficiency of the reverse transcriptase or polymerase enzymes. Until now reference genes identified for other members of the family Pasteurellaceae have not been validated for Avibacterium paragallinarum. The aim of this study was to validate nine reference genes of serovars A, B, and C strains of A. paragallinarum in different growth phase by qRT-PCR. Three of the most widely used statistical algorithms, geNorm, NormFinder and ΔCT method were used to evaluate the expression stability of reference genes. Data analyzed by overall rankings showed that in exponential and stationary phase of serovar A, the most stable reference genes were gyrA and atpD respectively; in exponential and stationary phase of serovar B, the most stable reference genes were atpD and recN respectively; in exponential and stationary phase of serovar C, the most stable reference genes were rpoB and recN respectively. This study provides recommendations for stable endogenous control genes for use in further studies involving measurement of gene expression levels.
Gong, Pichang; Ao, Xiang; Liu, Gaixiu; Cheng, Fangyun; He, Chaoying
2017-03-01
Herbaceous peony (Paeonia lactiflora) is a globally important ornamental plant. Spontaneous floral mutations occur frequently during cultivation, and are selected as a way to release new cultivars, but the underlying evolutionary developmental genetics remain largely elusive. Here, we investigated a collection of spontaneous corolla mutational plants (SCMPs) whose other floral organs were virtually unaffected. Unlike the corolla in normal plants (NPs) that withered soon after fertilization, the transformed corolla (petals) in SCMPs was greenish and persistent similar to the calyx (sepals). Epidermal cellular morphology of the SCMP corolla was also similar to that of calyx cells, further suggesting a sepaloid corolla in SCMPs. Ten floral MADS-box genes from these Paeonia plants were comparatively characterized with respect to sequence and expression. Codogenic sequence variation of these MADS-box genes was not linked to corolla changes in SCMPs. However, we found that both APETALA3 (AP3) and PISTILLATA (PI) lineages of B-class MADS-box genes were duplicated, and subsequent selective expression alterations of these genes were closely associated with the origin of SCMPs. AP3-PI obligate heterodimerization, essential for organ identity of corolla and stamens, was robustly detected. However, selective down-regulation of these duplicated genes might result in a reduction of this obligate heterodimer concentration in a corolla-specific manner, leading to the sepaloid corolla in SCMPs, thus representing a new sepaloid corolla model taking advantage of gene duplication. Our work suggests that modifying floral MADS-box genes could facilitate the breeding of novel cultivars with distinct floral morphology in ornamental plants, and also provides new insights into the functional evolution of the MADS-box genes in plants. © The Author 2016. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Park, Mira; Hong, Soon Gyu; Park, Hyun; Lee, Byeong-ha
2018-01-01
Sanionia uncinata is a dominant moss species in the maritime Antarctic. Due to its high adaptability to harsh environments, this extremophile plant has been considered a good target for studying the molecular adaptation mechanisms of plants to a variety of environmental stresses. Despite the importance of S. uncinata as a representative Antarctic plant species for the identification and characterization of genes associated with abiotic stress tolerance, suitable reference genes, which are critical for RT-qPCR analyses, have not yet been identified. In this report, 11 traditionally used and 6 novel candidate reference genes were selected from transcriptome data of S. uncinata and the expression stability of these genes was evaluated under various abiotic stress conditions using three statistical algorithms; geNorm, NormFinder, and BestKeeper. The stability ranking analysis selected the best reference genes depending on the stress conditions. Among the 17 candidates, the most stable references were POB1 and UFD2 for cold stress, POB1 and AKB for drought treatment, and UFD2 and AKB for the field samples from a different water contents in Antarctica. Overall, novel genes POB1 and AKB were the most reliable references across all samples, irrespective of experimental conditions. In addition, 6 novel candidate genes including AKB, POB1 and UFD2, were more stable than the housekeeping genes traditionally used for internal controls, indicating that transcriptome data can be useful for identifying novel robust normalizers. The reference genes validated in this study will be useful for improving the accuracy of RT-qPCR analysis for gene expression studies of S. uncinata in Antarctica and for further functional genomic analysis of bryophytes. PMID:29920565
Isaacson, Sven; Luo, Feng; Feltus, Frank A.; Smith, Melissa C.
2013-01-01
The study of gene relationships and their effect on biological function and phenotype is a focal point in systems biology. Gene co-expression networks built using microarray expression profiles are one technique for discovering and interpreting gene relationships. A knowledge-independent thresholding technique, such as Random Matrix Theory (RMT), is useful for identifying meaningful relationships. Highly connected genes in the thresholded network are then grouped into modules that provide insight into their collective functionality. While it has been shown that co-expression networks are biologically relevant, it has not been determined to what extent any given network is functionally robust given perturbations in the input sample set. For such a test, hundreds of networks are needed and hence a tool to rapidly construct these networks. To examine functional robustness of networks with varying input, we enhanced an existing RMT implementation for improved scalability and tested functional robustness of human (Homo sapiens), rice (Oryza sativa) and budding yeast (Saccharomyces cerevisiae). We demonstrate dramatic decrease in network construction time and computational requirements and show that despite some variation in global properties between networks, functional similarity remains high. Moreover, the biological function captured by co-expression networks thresholded by RMT is highly robust. PMID:23409071
Screening and selection of artificial riboswitches.
Harbaugh, Svetlana V; Martin, Jennifer; Weinstein, Jenna; Ingram, Grant; Kelley-Loughnane, Nancy
2018-05-17
Synthetic riboswitches are engineered to regulate gene expression in response to a variety of non-endogenous small molecules, and a challenge to select this engineered response requires robust screening tools. A new synthetic riboswitch can be created by linking an in vitro-selected aptamer library with a randomized expression platform followed by in vivo selection and screening. In order to determine response to analyte, we developed a dual-color reporter comprising elements of the E. coli fimbriae phase variation system: recombinase FimE controlled by a synthetic riboswitch and an invertible DNA segment (fimS) containing a constitutively active promoter placed between two fluorescent protein genes. Without an analyte, the fluorescent reporter constitutively expressed green fluorescent protein (GFPa1). Addition of the analyte initiated translation of fimE causing unidirectional inversion of the fimS segment and constitutive expression of red fluorescent protein (mKate2). The dual color reporter system can be used to select and to optimize artificial riboswitches in E. coli cells. In this work, the enriched library of aptamers incorporated into the riboswitch architecture reduces the sequence search space by offering a higher percentage of potential ligand binders. The study was designed to produce structure switching aptamers, a necessary feature for riboswitch function and efficiently quantify this function using the dual color reporter system. Copyright © 2018. Published by Elsevier Inc.
Genome-wide signatures of convergent evolution in echolocating mammals
Parker, Joe; Tsagkogeorga, Georgia; Cotton, James A.; Liu, Yuan; Provero, Paolo; Stupka, Elia; Rossiter, Stephen J.
2013-01-01
Evolution is typically thought to proceed through divergence of genes, proteins, and ultimately phenotypes1-3. However, similar traits might also evolve convergently in unrelated taxa due to similar selection pressures4,5. Adaptive phenotypic convergence is widespread in nature, and recent results from a handful of genes have suggested that this phenomenon is powerful enough to also drive recurrent evolution at the sequence level6-9. Where homoplasious substitutions do occur these have long been considered the result of neutral processes. However, recent studies have demonstrated that adaptive convergent sequence evolution can be detected in vertebrates using statistical methods that model parallel evolution9,10 although the extent to which sequence convergence between genera occurs across genomes is unknown. Here we analyse genomic sequence data in mammals that have independently evolved echolocation and show for the first time that convergence is not a rare process restricted to a handful of loci but is instead widespread, continuously distributed and commonly driven by natural selection acting on a small number of sites per locus. Systematic analyses of convergent sequence evolution in 805,053 amino acids within 2,326 orthologous coding gene sequences compared across 22 mammals (including four new bat genomes) revealed signatures consistent with convergence in nearly 200 loci. Strong and significant support for convergence among bats and the dolphin was seen in numerous genes linked to hearing or deafness, consistent with an involvement in echolocation. Surprisingly we also found convergence in many genes linked to vision: the convergent signal of many sensory genes was robustly correlated with the strength of natural selection. This first attempt to detect genome-wide convergent sequence evolution across divergent taxa reveals the phenomenon to be much more pervasive than previously recognised. PMID:24005325
Enhanced transgene expression in rice following selection controlled by weak promoters.
Zhou, Jie; Yang, Yong; Wang, Xuming; Yu, Feibo; Yu, Chulang; Chen, Juan; Cheng, Ye; Yan, Chenqi; Chen, Jianping
2013-03-27
Techniques that enable high levels of transgene expression in plants are attractive for the commercial production of plant-made recombinant pharmaceutical proteins or other gene transfer related strategies. The conventional way to increase the yield of desired transgenic products is to use strong promoters to control the expression of the transgene. Although many such promoters have been identified and characterized, the increase obtainable from a single promoter is ultimately limited to a certain extent. In this study, we report a method to magnify the effect of a single promoter by using a weak promoter-based selection system in transgenic rice. tCUP1, a fragment derived from the tobacco cryptic promoter (tCUP), was tested for its activity in rice by fusion to both a β-glucuronidase (GUS) reporter and a hygromycin phosphotransferase (HPT) selectable marker. The tCUP1 promoter allowed the recovery of transformed rice plants and conferred tissue specific expression of the GUS reporter, but was much weaker than the CaMV 35S promoter in driving a selectable marker for growth of resistant calli. However, in the resistant calli and regenerated transgenic plants selected by the use of tCUP1, the constitutive expression of green fluorescent protein (GFP) was dramatically increased as a result of the additive effect of multiple T-DNA insertions. The correlation between attenuated selection by a weak promoter and elevation of copy number and foreign gene expression was confirmed by using another relatively weak promoter from nopaline synthase (Nos). The use of weak promoter derived selectable markers leads to a high T-DNA copy number and then greatly increases the expression of the foreign gene. The method described here provides an effective approach to robustly enhance the expression of heterogenous transgenes through copy number manipulation in rice.
Cell-of-Origin in Diffuse Large B-Cell Lymphoma: Are the Assays Ready for the Clinic?
Scott, David W
2015-01-01
Diffuse large B-cell lymphoma (DLBCL) is the most common lymphoma worldwide and consists of a heterogeneous group of cancers classified together on the basis of shared morphology, immunophenotype, and aggressive clinical behavior. It is now recognized that this malignancy comprises at least two distinct molecular subtypes identified by gene expression profiling: the activated B-cell-like (ABC) and the germinal center B-cell-like (GCB) groups-the cell-of-origin (COO) classification. These two groups have different genetic mutation landscapes, pathobiology, and outcomes following treatment. Evidence is accumulating that novel agents have selective activity in one or the other COO group, making COO a predictive biomarker. Thus, there is now a pressing need for accurate and robust methods to assign COO, to support clinical trials, and ultimately guide treatment decisions for patients. The "gold standard" methods for COO are based on gene expression profiling (GEP) of RNA from fresh frozen tissue using microarray technology, which is an impractical solution when formalin-fixed paraffin-embedded tissue (FFPET) biopsies are the standard diagnostic material. This review outlines the history of the COO classification before examining the practical implementation of COO assays applicable to FFPET biopsies. The immunohistochemistry (IHC)-based algorithms and gene expression-based assays suitable for the highly degraded RNA from FFPET are discussed. Finally, the technical and practical challenges that still need to be addressed are outlined before robust gene expression-based assays are used in the routine management of patients with DLBCL.
CRC-113 gene expression signature for predicting prognosis in patients with colorectal cancer
Nguyen, Dinh Truong; Kim, Jin-Hwan; Jo, Yong Hwa; Shahid, Muhammad; Akter, Salima; Aryal, Saurav Nath; Yoo, Ji Youn; Ahn, Yong-Joo; Cho, Kyoung Min; Lee, Ju-Seog; Choe, Wonchae; Kang, Insug; Ha, Joohun; Kim, Sung Soo
2015-01-01
Colorectal cancer (CRC) is the third leading cause of global cancer mortality. Recent studies have proposed several gene signatures to predict CRC prognosis, but none of those have proven reliable for predicting prognosis in clinical practice yet due to poor reproducibility and molecular heterogeneity. Here, we have established a prognostic signature of 113 probe sets (CRC-113) that include potential biomarkers and reflect the biological and clinical characteristics. Robustness and accuracy were significantly validated in external data sets from 19 centers in five countries. In multivariate analysis, CRC-113 gene signature showed a stronger prognostic value for survival and disease recurrence in CRC patients than current clinicopathological risk factors and molecular alterations. We also demonstrated that the CRC-113 gene signature reflected both genetic and epigenetic molecular heterogeneity in CRC patients. Furthermore, incorporation of the CRC-113 gene signature into a clinical context and molecular markers further refined the selection of the CRC patients who might benefit from postoperative chemotherapy. Conclusively, CRC-113 gene signature provides new possibilities for improving prognostic models and personalized therapeutic strategies. PMID:26397224
CRC-113 gene expression signature for predicting prognosis in patients with colorectal cancer.
Nguyen, Minh Nam; Choi, Tae Gyu; Nguyen, Dinh Truong; Kim, Jin-Hwan; Jo, Yong Hwa; Shahid, Muhammad; Akter, Salima; Aryal, Saurav Nath; Yoo, Ji Youn; Ahn, Yong-Joo; Cho, Kyoung Min; Lee, Ju-Seog; Choe, Wonchae; Kang, Insug; Ha, Joohun; Kim, Sung Soo
2015-10-13
Colorectal cancer (CRC) is the third leading cause of global cancer mortality. Recent studies have proposed several gene signatures to predict CRC prognosis, but none of those have proven reliable for predicting prognosis in clinical practice yet due to poor reproducibility and molecular heterogeneity. Here, we have established a prognostic signature of 113 probe sets (CRC-113) that include potential biomarkers and reflect the biological and clinical characteristics. Robustness and accuracy were significantly validated in external data sets from 19 centers in five countries. In multivariate analysis, CRC-113 gene signature showed a stronger prognostic value for survival and disease recurrence in CRC patients than current clinicopathological risk factors and molecular alterations. We also demonstrated that the CRC-113 gene signature reflected both genetic and epigenetic molecular heterogeneity in CRC patients. Furthermore, incorporation of the CRC-113 gene signature into a clinical context and molecular markers further refined the selection of the CRC patients who might benefit from postoperative chemotherapy. Conclusively, CRC-113 gene signature provides new possibilities for improving prognostic models and personalized therapeutic strategies.
NASA Astrophysics Data System (ADS)
Sutrisno, Agung; Gunawan, Indra; Vanany, Iwan
2017-11-01
In spite of being integral part in risk - based quality improvement effort, studies improving quality of selection of corrective action priority using FMEA technique are still limited in literature. If any, none is considering robustness and risk in selecting competing improvement initiatives. This study proposed a theoretical model to select risk - based competing corrective action by considering robustness and risk of competing corrective actions. We incorporated the principle of robust design in counting the preference score among corrective action candidates. Along with considering cost and benefit of competing corrective actions, we also incorporate the risk and robustness of corrective actions. An example is provided to represent the applicability of the proposed model.
Rincon, Melvin Y; Sarcar, Shilpita; Danso-Abeam, Dina; Keyaerts, Marleen; Matrai, Janka; Samara-Kuko, Ermira; Acosta-Sanchez, Abel; Athanasopoulos, Takis; Dickson, George; Lahoutte, Tony; De Bleser, Pieter; VandenDriessche, Thierry; Chuah, Marinee K
2015-01-01
Gene therapy is a promising emerging therapeutic modality for the treatment of cardiovascular diseases and hereditary diseases that afflict the heart. Hence, there is a need to develop robust cardiac-specific expression modules that allow for stable expression of the gene of interest in cardiomyocytes. We therefore explored a new approach based on a genome-wide bioinformatics strategy that revealed novel cardiac-specific cis-acting regulatory modules (CS-CRMs). These transcriptional modules contained evolutionary-conserved clusters of putative transcription factor binding sites that correspond to a "molecular signature" associated with robust gene expression in the heart. We then validated these CS-CRMs in vivo using an adeno-associated viral vector serotype 9 that drives a reporter gene from a quintessential cardiac-specific α-myosin heavy chain promoter. Most de novo designed CS-CRMs resulted in a >10-fold increase in cardiac gene expression. The most robust CRMs enhanced cardiac-specific transcription 70- to 100-fold. Expression was sustained and restricted to cardiomyocytes. We then combined the most potent CS-CRM4 with a synthetic heart and muscle-specific promoter (SPc5-12) and obtained a significant 20-fold increase in cardiac gene expression compared to the cytomegalovirus promoter. This study underscores the potential of rational vector design to improve the robustness of cardiac gene therapy.
Turning science on robust cattle into improved genetic selection decisions.
Amer, P R
2012-04-01
More robust cattle have the potential to increase farm profitability, improve animal welfare, reduce the contribution of ruminant livestock to greenhouse gas emissions and decrease the risk of food shortages in the face of increased variability in the farm environment. Breeding is a powerful tool for changing the robustness of cattle; however, insufficient recording of breeding goal traits and selection of animals at younger ages tend to favour genetic change in productivity traits relative to robustness traits. This paper has extended a previously proposed theory of artificial evolution to demonstrate, using deterministic simulation, how choice of breeding scheme design can be used as a tool to manipulate the direction of genetic progress, whereas the breeding goal remains focussed on the factors motivating individual farm decision makers. Particular focus was placed on the transition from progeny testing or mass selection to genomic selection breeding strategies. Transition to genomic selection from a breeding strategy where candidates are selected before records from progeny being available was shown to be highly likely to favour genetic progress in robustness traits relative to productivity traits. This was shown even with modest numbers of animals available for training and when heritability for robustness traits was only slightly lower than that for productivity traits. When transitioning from progeny testing to a genomic selection strategy without progeny testing, it was shown that there is a significant risk that robustness traits could become less influential in selection relative to productivity traits. Augmentations of training populations using genotyped cows and support for industry-wide improvements in phenotypic recording of robustness traits were put forward as investment opportunities for stakeholders wishing to facilitate the application of science on robust cattle into improved genetic selection schemes.
Adaptive Evolution Is Substantially Impeded by Hill–Robertson Interference in Drosophila
Castellano, David; Coronado-Zamora, Marta; Campos, Jose L.; Barbadilla, Antonio; Eyre-Walker, Adam
2016-01-01
Hill–Robertson interference (HRi) is expected to reduce the efficiency of natural selection when two or more linked selected sites do not segregate freely, but no attempt has been done so far to quantify the overall impact of HRi on the rate of adaptive evolution for any given genome. In this work, we estimate how much HRi impedes the rate of adaptive evolution in the coding genome of Drosophila melanogaster. We compiled a data set of 6,141 autosomal protein-coding genes from Drosophila, from which polymorphism levels in D. melanogaster and divergence out to D. yakuba were estimated. The rate of adaptive evolution was calculated using a derivative of the McDonald–Kreitman test that controls for slightly deleterious mutations. We find that the rate of adaptive amino acid substitution at a given position of the genome is positively correlated to both the rate of recombination and the mutation rate, and negatively correlated to the gene density of the region. These correlations are robust to controlling for each other, for synonymous codon bias and for gene functions related to immune response and testes. We show that HRi diminishes the rate of adaptive evolution by approximately 27%. Interestingly, genes with low mutation rates embedded in gene poor regions lose approximately 17% of their adaptive substitutions whereas genes with high mutation rates embedded in gene rich regions lose approximately 60%. We conclude that HRi hampers the rate of adaptive evolution in Drosophila and that the variation in recombination, mutation, and gene density along the genome affects the HRi effect. PMID:26494843
Robustness, Evolvability, and the Logic of Genetic Regulation
Moore, Jason H.; Wagner, Andreas
2014-01-01
In gene regulatory circuits, the expression of individual genes is commonly modulated by a set of regulating gene products, which bind to a gene’s cis-regulatory region. This region encodes an input-output function, referred to as signal-integration logic, that maps a specific combination of regulatory signals (inputs) to a particular expression state (output) of a gene. The space of all possible signal-integration functions is vast and the mapping from input to output is many-to-one: for the same set of inputs, many functions (genotypes) yield the same expression output (phenotype). Here, we exhaustively enumerate the set of signal-integration functions that yield idential gene expression patterns within a computational model of gene regulatory circuits. Our goal is to characterize the relationship between robustness and evolvability in the signal-integration space of regulatory circuits, and to understand how these properties vary between the genotypic and phenotypic scales. Among other results, we find that the distributions of genotypic robustness are skewed, such that the majority of signal-integration functions are robust to perturbation. We show that the connected set of genotypes that make up a given phenotype are constrained to specific regions of the space of all possible signal-integration functions, but that as the distance between genotypes increases, so does their capacity for unique innovations. In addition, we find that robust phenotypes are (i) evolvable, (ii) easily identified by random mutation, and (iii) mutationally biased toward other robust phenotypes. We explore the implications of these latter observations for mutation-based evolution by conducting random walks between randomly chosen source and target phenotypes. We demonstrate that the time required to identify the target phenotype is independent of the properties of the source phenotype. PMID:23373974
Waterhouse, Matthew D.; Erb, Liesl P.; Beever, Erik; Russello, Michael A.
2018-01-01
The American pika is a thermally sensitive, alpine lagomorph species. Recent climate-associated population extirpations and genetic signatures of reduced population sizes range-wide indicate the viability of this species is sensitive to climate change. To test for potential adaptive responses to climate stress, we sampled pikas along two elevational gradients (each ~470 to 1640 m) and employed three outlier detection methods, BAYESCAN, LFMM, and BAYPASS, to scan for genotype-environment associations in samples genotyped at 30,763 SNP loci. We resolved 173 loci with robust evidence of natural selection detected by either two independent analyses or replicated in both transects. A BLASTN search of these outlier loci revealed several genes associated with metabolic function and oxygen transport, indicating natural selection from thermal stress and hypoxia. We also found evidence of directional gene flow primarily downslope from large high-elevation populations and reduced gene flow at outlier loci, a pattern suggesting potential impediments to the upward elevational movement of adaptive alleles in response to contemporary climate change. Finally, we documented evidence of reduced genetic diversity associated the south-facing transect and an increase in corticosterone stress levels associated with inbreeding. This study suggests the American pika is already undergoing climate-associated natural selection at multiple genomic regions. Further analysis is needed to determine if the rate of climate adaptation in the American pika and other thermally sensitive species will be able to keep pace with rapidly changing climate conditions.
GeneStoryTeller: a mobile app for quick and comprehensive information retrieval of human genes
Eleftheriou, Stergiani V.; Bourdakou, Marilena M.; Athanasiadis, Emmanouil I.; Spyrou, George M.
2015-01-01
In the last few years, mobile devices such as smartphones and tablets have become an integral part of everyday life, due to their software/hardware rapid development, as well as the increased portability they offer. Nevertheless, up to now, only few Apps have been developed in the field of bioinformatics, capable to perform fast and robust access to services. We have developed the GeneStoryTeller, a mobile application for Android platforms, where users are able to instantly retrieve information regarding any recorded human gene, derived from eight publicly available databases, as a summary story. Complementary information regarding gene–drugs interactions, functional annotation and disease associations for each selected gene is also provided in the gene story. The most challenging part during the development of the GeneStoryTeller was to keep balance between storing data locally within the app and obtaining the updated content dynamically via a network connection. This was accomplished with the implementation of an administrative site where data are curated and synchronized with the application requiring a minimum human intervention. Database URL: http://bioserver-3.bioacademy.gr/Bioserver/GeneStoryTeller/. PMID:26055097
Automatic Design of Synthetic Gene Circuits through Mixed Integer Non-linear Programming
Huynh, Linh; Kececioglu, John; Köppe, Matthias; Tagkopoulos, Ilias
2012-01-01
Automatic design of synthetic gene circuits poses a significant challenge to synthetic biology, primarily due to the complexity of biological systems, and the lack of rigorous optimization methods that can cope with the combinatorial explosion as the number of biological parts increases. Current optimization methods for synthetic gene design rely on heuristic algorithms that are usually not deterministic, deliver sub-optimal solutions, and provide no guaranties on convergence or error bounds. Here, we introduce an optimization framework for the problem of part selection in synthetic gene circuits that is based on mixed integer non-linear programming (MINLP), which is a deterministic method that finds the globally optimal solution and guarantees convergence in finite time. Given a synthetic gene circuit, a library of characterized parts, and user-defined constraints, our method can find the optimal selection of parts that satisfy the constraints and best approximates the objective function given by the user. We evaluated the proposed method in the design of three synthetic circuits (a toggle switch, a transcriptional cascade, and a band detector), with both experimentally constructed and synthetic promoter libraries. Scalability and robustness analysis shows that the proposed framework scales well with the library size and the solution space. The work described here is a step towards a unifying, realistic framework for the automated design of biological circuits. PMID:22536398
Knorr, Eileen; Fishilevich, Elane; Tenbusch, Linda; Frey, Meghan L F; Rangasamy, Murugesan; Billion, Andre; Worden, Sarah E; Gandra, Premchand; Arora, Kanika; Lo, Wendy; Schulenberg, Greg; Valverde-Garcia, Pablo; Vilcinskas, Andreas; Narva, Kenneth E
2018-02-01
RNAi shows potential as an agricultural technology for insect control, yet, a relatively low number of robust lethal RNAi targets have been demonstrated to control insects of agricultural interest. In the current study, a selection of lethal RNAi target genes from the iBeetle (Tribolium castaneum) screen were used to demonstrate efficacy of orthologous targets in the economically important coleopteran pests Diabrotica virgifera virgifera and Meligethes aeneus. Transcript orthologs of 50 selected genes were analyzed in D. v. virgifera diet-based RNAi bioassays; 21 of these RNAi targets showed mortality and 36 showed growth inhibition. Low dose injection- and diet-based dsRNA assays in T. castaneum and D. v. virgifera, respectively, enabled the identification of the four highly potent RNAi target genes: Rop, dre4, ncm, and RpII140. Maize was genetically engineered to express dsRNA directed against these prioritized candidate target genes. T 0 plants expressing Rop, dre4, or RpII140 RNA hairpins showed protection from D. v. virgifera larval feeding damage. dsRNA targeting Rop, dre4, ncm, and RpII140 in M. aeneus also caused high levels of mortality both by injection and feeding. In summary, high throughput systems for model organisms can be successfully used to identify potent RNA targets for difficult-to-work with agricultural insect pests.
Zhang, Lei; Zhao, Xihua; Zhang, Guoxiu; Zhang, Jiajia; Wang, Xuedong; Zhang, Suping; Wang, Wei; Wei, Dongzhi
2016-01-01
Filamentous fungi play important roles in the production of plant cell-wall degrading enzymes. In recent years, homologous recombinant technologies have contributed significantly to improved enzymes production and system design of genetically manipulated strains. When introducing multiple gene deletions, we need a robust and convenient way to control selectable marker genes, especially when only a limited number of markers are available in filamentous fungi. Integration after transformation is predominantly nonhomologous in most fungi other than yeast. Fungal strains deficient in the non-homologous end-joining (NHEJ) pathway have limitations associated with gene function analyses despite they are excellent recipient strains for gene targets. We describe strategies and methods to address these challenges above and leverage the power of resilient NHEJ deficiency strains. We have established a foolproof light-inducible platform for one-step unmarked genetic modification in industrial eukaryotic microorganisms designated as ‘LML 3.0’, and an on-off control protocol of NHEJ pathway called ‘OFN 1.0’, using a synthetic light-switchable transactivation to control Cre recombinase-based excision and inversion. The methods provide a one-step strategy to sequentially modify genes without introducing selectable markers and NHEJ-deficiency. The strategies can be used to manipulate many biological processes in a wide range of eukaryotic cells. PMID:26857594
Testing Gene-Gene Interactions in the Case-Parents Design
Yu, Zhaoxia
2011-01-01
The case-parents design has been widely used to detect genetic associations as it can prevent spurious association that could occur in population-based designs. When examining the effect of an individual genetic locus on a disease, logistic regressions developed by conditioning on parental genotypes provide complete protection from spurious association caused by population stratification. However, when testing gene-gene interactions, it is unknown whether conditional logistic regressions are still robust. Here we evaluate the robustness and efficiency of several gene-gene interaction tests that are derived from conditional logistic regressions. We found that in the presence of SNP genotype correlation due to population stratification or linkage disequilibrium, tests with incorrectly specified main-genetic-effect models can lead to inflated type I error rates. We also found that a test with fully flexible main genetic effects always maintains correct test size and its robustness can be achieved with negligible sacrifice of its power. When testing gene-gene interactions is the focus, the test allowing fully flexible main effects is recommended to be used. PMID:21778736
Selection of reference genes for expression studies with fish myogenic cell cultures.
Bower, Neil I; Johnston, Ian A
2009-08-10
Relatively few studies have used cell culture systems to investigate gene expression and the regulation of myogenesis in fish. To produce robust data from quantitative real-time PCR mRNA levels need to be normalised using internal reference genes which have stable expression across all experimental samples. We have investigated the expression of eight candidate genes to identify suitable reference genes for use in primary myogenic cell cultures from Atlantic salmon (Salmo salar L.). The software analysis packages geNorm, Normfinder and Best keeper were used to rank genes according to their stability across 42 samples during the course of myogenic differentiation. Initial results showed several of the candidate genes exhibited stable expression throughout myogenic culture while Sdha was identified as the least stable gene. Further analysis with geNorm, Normfinder and Bestkeeper identified Ef1alpha, Hprt1, Ppia and RNApolII as stably expressed. Comparison of data normalised with the geometric average obtained from combinations of any three of these genes showed no significant differences, indicating that any combination of these genes is valid. The geometric average of any three of Hprt1, Ef1alpha, Ppia and RNApolII is suitable for normalisation of gene expression data in primary myogenic cultures from Atlantic salmon.
Ponce-Toledo, Rafael I; Moreira, David; López-García, Purificación; Deschamps, Philippe
2018-06-19
Endosymbiosis has been common all along eukaryotic evolution, providing opportunities for genomic and organellar innovation. Plastids are a prominent example. After the primary endosymbiosis of the cyanobacterial plastid ancestor, photosynthesis spread in many eukaryotic lineages via secondary endosymbioses involving red or green algal endosymbionts and diverse heterotrophic hosts. However, the number of secondary endosymbioses and how they occurred remain poorly understood. In particular, contrasting patterns of endosymbiotic gene transfer (EGT) have been detected and subjected to various interpretations. In this context, accurate detection of EGTs is essential to avoid wrong evolutionary conclusions. We have assembled a strictly selected set of markers that provides robust phylogenomic evidence suggesting that nuclear genes involved in the function and maintenance of green secondary plastids in chlorarachniophytes and euglenids have unexpected mixed red and green algal origins. This mixed ancestry contrasts with the clear red algal origin of most nuclear genes carrying similar functions in secondary algae with red plastids.
Namkung, Junghyun; Nam, Jin-Wu; Park, Taesung
2007-01-01
Many genes with major effects on quantitative traits have been reported to interact with other genes. However, finding a group of interacting genes from thousands of SNPs is challenging. Hence, an efficient and robust algorithm is needed. The genetic algorithm (GA) is useful in searching for the optimal solution from a very large searchable space. In this study, we show that genome-wide interaction analysis using GA and a statistical interaction model can provide a practical method to detect biologically interacting loci. We focus our search on transcriptional regulators by analyzing gene x gene interactions for cancer-related genes. The expression values of three cancer-related genes were selected from the expression data of the Genetic Analysis Workshop 15 Problem 1 data set. We implemented a GA to identify the expression quantitative trait loci that are significantly associated with expression levels of the cancer-related genes. The time complexity of the GA was compared with that of an exhaustive search algorithm. As a result, our GA, which included heuristic methods, such as archive, elitism, and local search, has greatly reduced computational time in a genome-wide search for gene x gene interactions. In general, the GA took one-fifth the computation time of an exhaustive search for the most significant pair of single-nucleotide polymorphisms.
Namkung, Junghyun; Nam, Jin-Wu; Park, Taesung
2007-01-01
Many genes with major effects on quantitative traits have been reported to interact with other genes. However, finding a group of interacting genes from thousands of SNPs is challenging. Hence, an efficient and robust algorithm is needed. The genetic algorithm (GA) is useful in searching for the optimal solution from a very large searchable space. In this study, we show that genome-wide interaction analysis using GA and a statistical interaction model can provide a practical method to detect biologically interacting loci. We focus our search on transcriptional regulators by analyzing gene × gene interactions for cancer-related genes. The expression values of three cancer-related genes were selected from the expression data of the Genetic Analysis Workshop 15 Problem 1 data set. We implemented a GA to identify the expression quantitative trait loci that are significantly associated with expression levels of the cancer-related genes. The time complexity of the GA was compared with that of an exhaustive search algorithm. As a result, our GA, which included heuristic methods, such as archive, elitism, and local search, has greatly reduced computational time in a genome-wide search for gene × gene interactions. In general, the GA took one-fifth the computation time of an exhaustive search for the most significant pair of single-nucleotide polymorphisms. PMID:18466570
The emergence of human-evolutionary medical genomics
Crespi, Bernard J
2011-01-01
In this review, I describe how evolutionary genomics is uniquely suited to spearhead advances in understanding human disease risk, owing to the privileged position of genes as fundamental causes of phenotypic variation, and the ability of population genetic and phylogenetic methods to robustly infer processes of natural selection, drift, and mutation from genetic variation at the levels of family, population, species, and clade. I first provide an overview of models for the origins and maintenance of genetically based disease risk in humans. I then discuss how analyses of genetic disease risk can be dovetailed with studies of positive and balancing selection, to evaluate the degree to which the ‘genes that make us human’ also represent the genes that mediate risk of polygenic disease. Finally, I present four basic principles for the nascent field of human evolutionary medical genomics, each of which represents a process that is nonintuitive from a proximate perspective. Joint consideration of these principles compels novel forms of interdisciplinary analyses, most notably studies that (i) analyze tradeoffs at the level of molecular genetics, and (ii) identify genetic variants that are derived in the human lineage or in specific populations, and then compare individuals with derived versus ancestral alleles. PMID:25567974
Designing climate-resilient rice with ideal grain quality suited for high-temperature stress.
Sreenivasulu, Nese; Butardo, Vito M; Misra, Gopal; Cuevas, Rosa Paula; Anacleto, Roslen; Kavi Kishor, Polavarpu B
2015-04-01
To ensure rice food security, the target outputs of future rice breeding programmes should focus on developing climate-resilient rice varieties with emphasis on increased head rice yield coupled with superior grain quality. This challenge is made greater by a world that is increasingly becoming warmer. Such environmental changes dramatically impact head rice and milling yield as well as increasing chalkiness because of impairment in starch accumulation and other storage biosynthetic pathways in the grain. This review highlights the knowledge gained through gene discovery via quantitative trait locus (QTL) cloning and structural-functional genomic strategies to reduce chalk, increase head rice yield, and develop stable lines with optimum grain quality in challenging environments. The newly discovered genes and the knowledge gained on the influence of specific alleles related to stability of grain quality attributes provide a robust platform for marker-assisted selection in breeding to design heat-tolerant rice varieties with superior grain quality. Using the chalkiness trait in rice as a case study, we demonstrate here that the emerging field of systems genetics can help fast-track the identification of novel alleles and gene targets that can be pyramided for the development of environmentally robust rice varieties that possess improved grain quality. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Bayesian Variable Selection for Hierarchical Gene-Environment and Gene-Gene Interactions
Liu, Changlu; Ma, Jianzhong; Amos, Christopher I.
2014-01-01
We propose a Bayesian hierarchical mixture model framework that allows us to investigate the genetic and environmental effects, gene by gene interactions and gene by environment interactions in the same model. Our approach incorporates the natural hierarchical structure between the main effects and interaction effects into a mixture model, such that our methods tend to remove the irrelevant interaction effects more effectively, resulting in more robust and parsimonious models. We consider both strong and weak hierarchical models. For a strong hierarchical model, both of the main effects between interacting factors must be present for the interactions to be considered in the model development, while for a weak hierarchical model, only one of the two main effects is required to be present for the interaction to be evaluated. Our simulation results show that the proposed strong and weak hierarchical mixture models work well in controlling false positive rates and provide a powerful approach for identifying the predisposing effects and interactions in gene-environment interaction studies, in comparison with the naive model that does not impose this hierarchical constraint in most of the scenarios simulated. We illustrated our approach using data for lung cancer and cutaneous melanoma. PMID:25154630
Toward a comprehensive and systematic methylome signature in colorectal cancers.
Ashktorab, Hassan; Rahi, Hamed; Wansley, Daniel; Varma, Sudhir; Shokrani, Babak; Lee, Edward; Daremipouran, Mohammad; Laiyemo, Adeyinka; Goel, Ajay; Carethers, John M; Brim, Hassan
2013-08-01
CpG Island Methylator Phenotype (CIMP) is one of the underlying mechanisms in colorectal cancer (CRC). This study aimed to define a methylome signature in CRC through a methylation microarray analysis and a compilation of promising CIMP markers from the literature. Illumina HumanMethylation27 (IHM27) array data was generated and analyzed based on statistical differences in methylation data (1st approach) or based on overall differences in methylation percentages using lower 95% CI (2nd approach). Pyrosequencing was performed for the validation of nine genes. A meta-analysis was used to identify CIMP and non-CIMP markers that were hypermethylated in CRC but did not yet make it to the CIMP genes' list. Our 1st approach for array data analysis demonstrated the limitations in selecting genes for further validation, highlighting the need for the 2nd bioinformatics approach to adequately select genes with differential aberrant methylation. A more comprehensive list, which included non-CIMP genes, such as APC, EVL, CD109, PTEN, TWIST1, DCC, PTPRD, SFRP1, ICAM5, RASSF1A, EYA4, 30ST2, LAMA1, KCNQ5, ADHEF1, and TFPI2, was established. Array data are useful to categorize and cluster colonic lesions based on their global methylation profiles; however, its usefulness in identifying robust methylation markers is limited and rely on the data analysis method. We have identified 16 non-CIMP-panel genes for which we provide rationale for inclusion in a more comprehensive characterization of CIMP+ CRCs. The identification of a definitive list for methylome specific genes in CRC will contribute to better clinical management of CRC patients.
The zipper effect: Why different positions along the chromosome suffer different selection pressures
NASA Astrophysics Data System (ADS)
de Oliveira, P. M. C.; Moss de Oliveira, S.
2011-02-01
Variability within diploid sexual populations comes from two ingredients: mutations and recombination (or crossing-over). On average, the first introduces genetic defects in offspring genomes, while the second is a mechanism which tends to eliminate them, continuously “cleaning” the population genetic pool from harmful mutations along the generations. Here, we propose that loci near the chromosome tips are more effectively cleaned by the recombination mechanism than loci near the chromosome centre. This result implies that clusters of neighbouring, orchestrated-functioning genes, supposed to be more robust against the effects of genetic mutations, are more likely found near the chromosome centres, while isolated genes are more likely found near the tips. We confirm the tip-centre asymmetry through a simple computer agent-based model. In order to test this effect in reality, we also analyse as an example the particular case of HOX genes distributed along the 24 human chromosomes and verify that indeed, most HOX genes belong to such clustered networks located near the chromosome centres. Accordingly, isolated HOX genes are located closer to the tips.
A Penalized Robust Method for Identifying Gene-Environment Interactions
Shi, Xingjie; Liu, Jin; Huang, Jian; Zhou, Yong; Xie, Yang; Ma, Shuangge
2015-01-01
In high-throughput studies, an important objective is to identify gene-environment interactions associated with disease outcomes and phenotypes. Many commonly adopted methods assume specific parametric or semiparametric models, which may be subject to model mis-specification. In addition, they usually use significance level as the criterion for selecting important interactions. In this study, we adopt the rank-based estimation, which is much less sensitive to model specification than some of the existing methods and includes several commonly encountered data and models as special cases. Penalization is adopted for the identification of gene-environment interactions. It achieves simultaneous estimation and identification and does not rely on significance level. For computation feasibility, a smoothed rank estimation is further proposed. Simulation shows that under certain scenarios, for example with contaminated or heavy-tailed data, the proposed method can significantly outperform the existing alternatives with more accurate identification. We analyze a lung cancer prognosis study with gene expression measurements under the AFT (accelerated failure time) model. The proposed method identifies interactions different from those using the alternatives. Some of the identified genes have important implications. PMID:24616063
NASA Astrophysics Data System (ADS)
Erickson, Keesha; Chatterjee, Anushree
2014-03-01
Microbial pathogens are able to rapidly acquire tolerance to chemical toxins. Developing next-generation antibiotics that impede the emergence of resistance will help avoid a world-wide health crisis. Conversely, the ability to induce rapid tolerance gains could lead to high-yielding strains for sustainable production of biofuels and commodity chemicals. Achieving these goals requires an understanding of the general mechanisms allowing microbes to become resistant to diverse toxins. We apply top-down and bottom-up methodologies to identify biological network changes leading to adaptation and tolerance. Using a top-down approach, we perform evolution experiments to isolate resistant strains, collect samples for transcriptomic and proteomic analysis, and use the omics data to inform mathematical gene regulatory models. Using a bottom-up approach, we build and test synthetic genetic devices that enable increased or decreased expression of selected genes. Unique patterns in gene expression are identified in cultures actively gaining resistance, especially in pathways known to be involved with stress response, efflux, and mutagenesis. Genes correlated with tolerance could potentially allow the design of resistance-free antibiotics or robust chemical production strains.
Detection of histone modifications in plant leaves.
Jaskiewicz, Michal; Peterhansel, Christoph; Conrath, Uwe
2011-09-23
Chromatin structure is important for the regulation of gene expression in eukaryotes. In this process, chromatin remodeling, DNA methylation, and covalent modifications on the amino-terminal tails of histones H3 and H4 play essential roles(1-2). H3 and H4 histone modifications include methylation of lysine and arginine, acetylation of lysine, and phosphorylation of serine residues(1-2). These modifications are associated either with gene activation, repression, or a primed state of gene that supports more rapid and robust activation of expression after perception of appropriate signals (microbe-associated molecular patterns, light, hormones, etc.)(3-7). Here, we present a method for the reliable and sensitive detection of specific chromatin modifications on selected plant genes. The technique is based on the crosslinking of (modified) histones and DNA with formaldehyde(8,9), extraction and sonication of chromatin, chromatin immunoprecipitation (ChIP) with modification-specific antibodies(9,10), de-crosslinking of histone-DNA complexes, and gene-specific real-time quantitative PCR. The approach has proven useful for detecting specific histone modifications associated with C(4;) photosynthesis in maize(5,11) and systemic immunity in Arabidopsis(3).
Automatic segmentation and supervised learning-based selection of nuclei in cancer tissue images.
Nandy, Kaustav; Gudla, Prabhakar R; Amundsen, Ryan; Meaburn, Karen J; Misteli, Tom; Lockett, Stephen J
2012-09-01
Analysis of preferential localization of certain genes within the cell nuclei is emerging as a new technique for the diagnosis of breast cancer. Quantitation requires accurate segmentation of 100-200 cell nuclei in each tissue section to draw a statistically significant result. Thus, for large-scale analysis, manual processing is too time consuming and subjective. Fortuitously, acquired images generally contain many more nuclei than are needed for analysis. Therefore, we developed an integrated workflow that selects, following automatic segmentation, a subpopulation of accurately delineated nuclei for positioning of fluorescence in situ hybridization-labeled genes of interest. Segmentation was performed by a multistage watershed-based algorithm and screening by an artificial neural network-based pattern recognition engine. The performance of the workflow was quantified in terms of the fraction of automatically selected nuclei that were visually confirmed as well segmented and by the boundary accuracy of the well-segmented nuclei relative to a 2D dynamic programming-based reference segmentation method. Application of the method was demonstrated for discriminating normal and cancerous breast tissue sections based on the differential positioning of the HES5 gene. Automatic results agreed with manual analysis in 11 out of 14 cancers, all four normal cases, and all five noncancerous breast disease cases, thus showing the accuracy and robustness of the proposed approach. Published 2012 Wiley Periodicals, Inc.
Benjamin, Aimee L; Green, Benjamin B; Crooker, Brian A; McKay, Stephanie D; Kerr, David E
2016-03-24
We have previously found substantial animal-to-animal and age-dependent variation in the response of Holstein fibroblast cultures challenged with LPS. To expand on this finding, fibroblast cultures were established from dairy (Holstein) and beef (Angus) cattle and challenged with LPS to examine breed-dependent differences in the innate immune response. Global gene expression was measured by RNA-Seq, while an epigenetic basis for expression differences was examined by methylated CpG island recovery assay sequencing (MIRA-Seq) analysis. The Holstein breed displayed a more robust response to LPS than the Angus breed based on RNA-Seq analysis of cultures challenged with LPS for 0, 2, and 8 h. Several immune-associated genes were expressed at greater levels (FDR < 0.05) in Holstein cultures including TLR4 at all time points and a number of pro-inflammatory genes such as IL8, CCL20, CCL5, and TNF following LPS exposure. Despite extensive breed differences in the transcriptome, MIRA-Seq unveiled relatively similar patterns of genome-wide DNA methylation between breeds, with an overall hypomethylation of gene promoters. However, by examining the genome in 3Kb windows, 49 regions of differential methylation were discovered between Holstein and Angus fibroblasts, and two of these regions fell within the promoter region (-2500 to +500 bp of the transcription start site) of the genes NTRK2 and ADAMTS5. Fibroblasts isolated from Holstein cattle display a more robust response to LPS in comparison to cultures from Angus cattle. Different selection strategies and management practices exist between these two breeds that likely give rise to genetic and epigenetic factors contributing to the different immune response phenotypes.
Nikolova, Olga; Moser, Russell; Kemp, Christopher; Gönen, Mehmet; Margolin, Adam A
2017-05-01
In recent years, vast advances in biomedical technologies and comprehensive sequencing have revealed the genomic landscape of common forms of human cancer in unprecedented detail. The broad heterogeneity of the disease calls for rapid development of personalized therapies. Translating the readily available genomic data into useful knowledge that can be applied in the clinic remains a challenge. Computational methods are needed to aid these efforts by robustly analyzing genome-scale data from distinct experimental platforms for prioritization of targets and treatments. We propose a novel, biologically motivated, Bayesian multitask approach, which explicitly models gene-centric dependencies across multiple and distinct genomic platforms. We introduce a gene-wise prior and present a fully Bayesian formulation of a group factor analysis model. In supervised prediction applications, our multitask approach leverages similarities in response profiles of groups of drugs that are more likely to be related to true biological signal, which leads to more robust performance and improved generalization ability. We evaluate the performance of our method on molecularly characterized collections of cell lines profiled against two compound panels, namely the Cancer Cell Line Encyclopedia and the Cancer Therapeutics Response Portal. We demonstrate that accounting for the gene-centric dependencies enables leveraging information from multi-omic input data and improves prediction and feature selection performance. We further demonstrate the applicability of our method in an unsupervised dimensionality reduction application by inferring genes essential to tumorigenesis in the pancreatic ductal adenocarcinoma and lung adenocarcinoma patient cohorts from The Cancer Genome Atlas. : The code for this work is available at https://github.com/olganikolova/gbgfa. : nikolova@ohsu.edu or margolin@ohsu.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Kaltenbach, Miriam; Emond, Stephane; Hollfelder, Florian; Tokuriki, Nobuhiko
2016-10-01
The extent to which an emerging new function trades off with the original function is a key characteristic of the dynamics of enzyme evolution. Various cases of laboratory evolution have unveiled a characteristic trend; a large increase in a new, promiscuous activity is often accompanied by only a mild reduction of the native, original activity. A model that associates weak trade-offs with "evolvability" was put forward, which proposed that enzymes possess mutational robustness in the native activity and plasticity in promiscuous activities. This would enable the acquisition of a new function without compromising the original one, reducing the benefit of early gene duplication and therefore the selection pressure thereon. Yet, to date, no experimental study has examined this hypothesis directly. Here, we investigate the causes of weak trade-offs by systematically characterizing adaptive mutations that occurred in two cases of evolutionary transitions in enzyme function: (1) from phosphotriesterase to arylesterase, and (2) from atrazine chlorohydrolase to melamine deaminase. Mutational analyses in various genetic backgrounds revealed that, in contrast to the prevailing model, the native activity is less robust to mutations than the promiscuous activity. For example, in phosphotriesterase, the deleterious effect of individual mutations on the native phosphotriesterase activity is much larger than their positive effect on the promiscuous arylesterase activity. Our observations suggest a revision of the established model: weak trade-offs are not caused by an intrinsic robustness of the native activity and plasticity of the promiscuous activity. We propose that upon strong adaptive pressure for the new activity without selection against the original one, selected mutations will lead to the largest possible increases in the new function, but whether and to what extent they decrease the old function is irrelevant, creating a bias towards initially weak trade-offs and the emergence of generalist enzymes.
Commentary on Some Recent Theses Relevant to Combating Aging: August 2017.
Zealley, Benjamin; de Grey, Aubrey D N J
2017-08-01
Theses reviewed in this issue include "Engineering Cellular Input-Output for the Robust Control of Mammalian Cell-Based Therapies"; "Enzyme-Instructed Self-Assembly (EISA) Selectively Targets Cancer Cells"; "Exploration of helminth-derived immunoregulatory molecules as options for therapeutic intervention in allograft rejection and autoimmune disease"; "Expression of the Medial HOXA genes is indispensible for self-renewal in human hematopoietic stem cells"; "Gamma frequency entrainment attenuates amyloid load and modifies microglia"; and "Heterogeneous Distribution of Microvascular Blood Flow Contributes to Impaired Skeletal Muscle Oxygenation in Diabetes".
A Statistical Approach Reveals Designs for the Most Robust Stochastic Gene Oscillators
2016-01-01
The engineering of transcriptional networks presents many challenges due to the inherent uncertainty in the system structure, changing cellular context, and stochasticity in the governing dynamics. One approach to address these problems is to design and build systems that can function across a range of conditions; that is they are robust to uncertainty in their constituent components. Here we examine the parametric robustness landscape of transcriptional oscillators, which underlie many important processes such as circadian rhythms and the cell cycle, plus also serve as a model for the engineering of complex and emergent phenomena. The central questions that we address are: Can we build genetic oscillators that are more robust than those already constructed? Can we make genetic oscillators arbitrarily robust? These questions are technically challenging due to the large model and parameter spaces that must be efficiently explored. Here we use a measure of robustness that coincides with the Bayesian model evidence, combined with an efficient Monte Carlo method to traverse model space and concentrate on regions of high robustness, which enables the accurate evaluation of the relative robustness of gene network models governed by stochastic dynamics. We report the most robust two and three gene oscillator systems, plus examine how the number of interactions, the presence of autoregulation, and degradation of mRNA and protein affects the frequency, amplitude, and robustness of transcriptional oscillators. We also find that there is a limit to parametric robustness, beyond which there is nothing to be gained by adding additional feedback. Importantly, we provide predictions on new oscillator systems that can be constructed to verify the theory and advance design and modeling approaches to systems and synthetic biology. PMID:26835539
Interrogating the topological robustness of gene regulatory circuits by randomization
Levine, Herbert; Onuchic, Jose N.
2017-01-01
One of the most important roles of cells is performing their cellular tasks properly for survival. Cells usually achieve robust functionality, for example, cell-fate decision-making and signal transduction, through multiple layers of regulation involving many genes. Despite the combinatorial complexity of gene regulation, its quantitative behavior has been typically studied on the basis of experimentally verified core gene regulatory circuitry, composed of a small set of important elements. It is still unclear how such a core circuit operates in the presence of many other regulatory molecules and in a crowded and noisy cellular environment. Here we report a new computational method, named random circuit perturbation (RACIPE), for interrogating the robust dynamical behavior of a gene regulatory circuit even without accurate measurements of circuit kinetic parameters. RACIPE generates an ensemble of random kinetic models corresponding to a fixed circuit topology, and utilizes statistical tools to identify generic properties of the circuit. By applying RACIPE to simple toggle-switch-like motifs, we observed that the stable states of all models converge to experimentally observed gene state clusters even when the parameters are strongly perturbed. RACIPE was further applied to a proposed 22-gene network of the Epithelial-to-Mesenchymal Transition (EMT), from which we identified four experimentally observed gene states, including the states that are associated with two different types of hybrid Epithelial/Mesenchymal phenotypes. Our results suggest that dynamics of a gene circuit is mainly determined by its topology, not by detailed circuit parameters. Our work provides a theoretical foundation for circuit-based systems biology modeling. We anticipate RACIPE to be a powerful tool to predict and decode circuit design principles in an unbiased manner, and to quantitatively evaluate the robustness and heterogeneity of gene expression. PMID:28362798
Liu, Jian; Cheng, Yuhu; Wang, Xuesong; Zhang, Lin; Liu, Hui
2017-08-17
It is urgent to diagnose colorectal cancer in the early stage. Some feature genes which are important to colorectal cancer development have been identified. However, for the early stage of colorectal cancer, less is known about the identity of specific cancer genes that are associated with advanced clinical stage. In this paper, we conducted a feature extraction method named Optimal Mean based Block Robust Feature Extraction method (OMBRFE) to identify feature genes associated with advanced colorectal cancer in clinical stage by using the integrated colorectal cancer data. Firstly, based on the optimal mean and L 2,1 -norm, a novel feature extraction method called Optimal Mean based Robust Feature Extraction method (OMRFE) is proposed to identify feature genes. Then the OMBRFE method which introduces the block ideology into OMRFE method is put forward to process the colorectal cancer integrated data which includes multiple genomic data: copy number alterations, somatic mutations, methylation expression alteration, as well as gene expression changes. Experimental results demonstrate that the OMBRFE is more effective than previous methods in identifying the feature genes. Moreover, genes identified by OMBRFE are verified to be closely associated with advanced colorectal cancer in clinical stage.
Efficient Variable Selection Method for Exposure Variables on Binary Data
NASA Astrophysics Data System (ADS)
Ohno, Manabu; Tarumi, Tomoyuki
In this paper, we propose a new variable selection method for "robust" exposure variables. We define "robust" as property that the same variable can select among original data and perturbed data. There are few studies of effective for the selection method. The problem that selects exposure variables is almost the same as a problem that extracts correlation rules without robustness. [Brin 97] is suggested that correlation rules are possible to extract efficiently using chi-squared statistic of contingency table having monotone property on binary data. But the chi-squared value does not have monotone property, so it's is easy to judge the method to be not independent with an increase in the dimension though the variable set is completely independent, and the method is not usable in variable selection for robust exposure variables. We assume anti-monotone property for independent variables to select robust independent variables and use the apriori algorithm for it. The apriori algorithm is one of the algorithms which find association rules from the market basket data. The algorithm use anti-monotone property on the support which is defined by association rules. But independent property does not completely have anti-monotone property on the AIC of independent probability model, but the tendency to have anti-monotone property is strong. Therefore, selected variables with anti-monotone property on the AIC have robustness. Our method judges whether a certain variable is exposure variable for the independent variable using previous comparison of the AIC. Our numerical experiments show that our method can select robust exposure variables efficiently and precisely.
Adaptive Evolution Is Substantially Impeded by Hill-Robertson Interference in Drosophila.
Castellano, David; Coronado-Zamora, Marta; Campos, Jose L; Barbadilla, Antonio; Eyre-Walker, Adam
2016-02-01
Hill-Robertson interference (HRi) is expected to reduce the efficiency of natural selection when two or more linked selected sites do not segregate freely, but no attempt has been done so far to quantify the overall impact of HRi on the rate of adaptive evolution for any given genome. In this work, we estimate how much HRi impedes the rate of adaptive evolution in the coding genome of Drosophila melanogaster. We compiled a data set of 6,141 autosomal protein-coding genes from Drosophila, from which polymorphism levels in D. melanogaster and divergence out to D. yakuba were estimated. The rate of adaptive evolution was calculated using a derivative of the McDonald-Kreitman test that controls for slightly deleterious mutations. We find that the rate of adaptive amino acid substitution at a given position of the genome is positively correlated to both the rate of recombination and the mutation rate, and negatively correlated to the gene density of the region. These correlations are robust to controlling for each other, for synonymous codon bias and for gene functions related to immune response and testes. We show that HRi diminishes the rate of adaptive evolution by approximately 27%. Interestingly, genes with low mutation rates embedded in gene poor regions lose approximately 17% of their adaptive substitutions whereas genes with high mutation rates embedded in gene rich regions lose approximately 60%. We conclude that HRi hampers the rate of adaptive evolution in Drosophila and that the variation in recombination, mutation, and gene density along the genome affects the HRi effect. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The emotion system promotes diversity and evolvability
Giske, Jarl; Eliassen, Sigrunn; Fiksen, Øyvind; Jakobsen, Per J.; Aksnes, Dag L.; Mangel, Marc; Jørgensen, Christian
2014-01-01
Studies on the relationship between the optimal phenotype and its environment have had limited focus on genotype-to-phenotype pathways and their evolutionary consequences. Here, we study how multi-layered trait architecture and its associated constraints prescribe diversity. Using an idealized model of the emotion system in fish, we find that trait architecture yields genetic and phenotypic diversity even in absence of frequency-dependent selection or environmental variation. That is, for a given environment, phenotype frequency distributions are predictable while gene pools are not. The conservation of phenotypic traits among these genetically different populations is due to the multi-layered trait architecture, in which one adaptation at a higher architectural level can be achieved by several different adaptations at a lower level. Our results emphasize the role of convergent evolution and the organismal level of selection. While trait architecture makes individuals more constrained than what has been assumed in optimization theory, the resulting populations are genetically more diverse and adaptable. The emotion system in animals may thus have evolved by natural selection because it simultaneously enhances three important functions, the behavioural robustness of individuals, the evolvability of gene pools and the rate of evolutionary innovation at several architectural levels. PMID:25100697
The emotion system promotes diversity and evolvability.
Giske, Jarl; Eliassen, Sigrunn; Fiksen, Øyvind; Jakobsen, Per J; Aksnes, Dag L; Mangel, Marc; Jørgensen, Christian
2014-09-22
Studies on the relationship between the optimal phenotype and its environment have had limited focus on genotype-to-phenotype pathways and their evolutionary consequences. Here, we study how multi-layered trait architecture and its associated constraints prescribe diversity. Using an idealized model of the emotion system in fish, we find that trait architecture yields genetic and phenotypic diversity even in absence of frequency-dependent selection or environmental variation. That is, for a given environment, phenotype frequency distributions are predictable while gene pools are not. The conservation of phenotypic traits among these genetically different populations is due to the multi-layered trait architecture, in which one adaptation at a higher architectural level can be achieved by several different adaptations at a lower level. Our results emphasize the role of convergent evolution and the organismal level of selection. While trait architecture makes individuals more constrained than what has been assumed in optimization theory, the resulting populations are genetically more diverse and adaptable. The emotion system in animals may thus have evolved by natural selection because it simultaneously enhances three important functions, the behavioural robustness of individuals, the evolvability of gene pools and the rate of evolutionary innovation at several architectural levels.
Teng, Linhong; Han, Wentao; Fan, Xiao; Xu, Dong; Zhang, Xiaowen; Dittami, Simon M.; Ye, Naihao
2017-01-01
Lipoxygenase (LOX) plays important roles in fatty acid oxidation and lipid mediator biosynthesis. In this study, we give first insights into brown algal LOX evolution. Whole genome searches revealed four, three, and eleven LOXs in Ectocarpus siliculosus, Cladosiphon okamuranus, and Saccharina japonica, respectively. In phylogenetic analyses, LOXs from brown algae form a robust clade with those from prokaryotes, suggesting an ancestral origin and slow evolution. Brown algal LOXs were divided into two clades, C1 and C2 in a phylogenetic tree. Compared to the two species of Ectocarpales, LOX gene expansion occurred in the kelp S. japonica through tandem duplication and segmental duplication. Selection pressure analysis showed that LOX genes in brown algae have undergone strong purifying selection, while the selective constraint in the C2 clade was more relaxed than that in the C1 clade. Furthermore, within each clade, LOXs of S. japonica evolved under more relaxed selection constraints than E. siliculosus and C. okamuranus. Structural modeling showed that unlike LOXs of plants and animals, which contain a β barrel in the N-terminal part of the protein, LOXs in brown algae fold into a single domain. Analysis of previously published transcriptomic data showed that LOXs in E. siliculosus are responsive to hyposaline, hypersaline, oxidative, and copper stresses. Moreover, clear divergence of expression patterns was observed among different life stages, as well as between duplicate gene pairs. In E. siliculosus, all four LOXs are male-biased in immature gametophytes, and mature gametophytes showed significantly higher LOX mRNA levels than immature gametophytes and sporophytes. In S. japonica, however, our RNA-Seq data showed that most LOXs are highly expressed in sporophytes. Even the most recently duplicated gene pairs showed divergent expression patterns, suggesting that functional divergence has likely occurred since LOX genes duplicated, which potentially contributes to the production of various oxylipins in brown algae. PMID:29234336
Muleke, Everlyne M’mbone; Jabir, Bashir Mohammed; Xie, Yang; Zhu, Xianwen; Cheng, Wanwan
2017-01-01
NAC (NAM, no apical meristem; ATAF, Arabidopsis transcription activation factor and CUC, cup-shaped cotyledon) proteins are among the largest transcription factor (TF) families playing fundamental biological processes, including cell expansion and differentiation, and hormone signaling in response to biotic and abiotic stresses. In this study, 172 RsNACs comprising 17 membrane-bound members were identified from the whole radish genome. In total, 98 RsNAC genes were non-uniformly distributed across the nine radish chromosomes. In silico analysis revealed that expression patterns of several NAC genes were tissue-specific such as a preferential expression in roots and leaves. In addition, 21 representative NAC genes were selected to investigate their responses to heavy metals (HMs), salt, heat, drought and abscisic acid (ABA) stresses using real-time polymerase chain reaction (RT-qPCR). As a result, differential expressions among these genes were identified where RsNAC023 and RsNAC080 genes responded positively to all stresses except ABA, while RsNAC145 responded more actively to salt, heat and drought stresses compared with other genes. The results provides more valuable information and robust candidate genes for future functional analysis for improving abiotic stress tolerances in radish. PMID:29259849
Yokoi, Kakeru; Koyama, Hiroaki; Minakuchi, Chieka; Tanaka, Toshiharu; Miura, Ken
2012-01-01
Using Tribolium castaneum, we quantitatively investigated the induction of nine antimicrobial peptide (AMP) genes by live gram-negative bacteria (Escherichia coli and Enterobacter cloacae), gram-positive bacteria (Micrococcus luteus and Bacillus subtilis) and the budding yeast (Saccharomyces cerevisiae). Then, five representative AMP genes were selected, and the involvement of the Toll and IMD pathways in their induction by E. coli, M. luteus and S. cerevisiae was examined by utilizing RNA interference of either MyD88 or IMD. Results indicated: Robust and acute induction of three genes by the two bacterial species was mediated mainly by the IMD pathway; slow and sustained induction of one gene by the two bacteria was mediated mainly by the Toll pathway; induction of the remaining one gene by the two bacteria was mediated by both pathways; induction of the five genes by the yeast was mediated by the Toll and/or IMD pathways depending on respective genes. These results suggest that more promiscuous activation and usage of the two pathways may occur in T. castaneum than in Drosophila melanogaster. In addition, the IMD pathway was revealed to dominantly contribute to defense against two bacterial species, gram-negative E. cloacae and gram-positive B. subtilis that possesses DAP-type peptidoglycan.
A Medium-Throughput Single Cell CRISPR-Cas9 Assay to Assess Gene Essentiality.
Grassian, A R; Scales, T M E; Knutson, S K; Kuntz, K W; McCarthy, N J; Lowe, C E; Moore, J D; Copeland, R A; Keilhack, H; Smith, J J; Wickenden, J A; Ribich, S
2015-01-01
Target selection for oncology is a crucial step in the successful development of therapeutics. Clustered regularly interspaced short palindromic repeats (CRISPR)-Cas9 editing of specific loci offers an alternative method to RNA interference and small molecule inhibitors for determining whether a cell line is dependent on a specific gene product for proliferation or survival. In our initial studies using CRISPR-Cas9 to verify the dependence on EZH2 activity for proliferation of a SMARCB1/SNF5/INI1 mutant malignant rhabdoid tumor (MRT) cell line, we noted that the initial reduction in proliferation was lost over time. We hypothesized that in the few cells that retain proliferative capacity, at least one allele of EZH2 remains functional. To verify this, we developed an assay to analyze 10s-100s of clonal cell populations for target gene disruption using restriction digest and fluorescent fragment length analyses. Our results clearly show that in cell lines in which EZH2 is essential for proliferation, at least one potentially functional allele of EZH2 is retained in the clones that survive. This assay clearly indicates whether or not a specific gene is essential for survival and/or proliferation in a given cell line. Such data can aid the development of more robust therapeutics by increasing confidence in target selection.
Bajaj, Deepak; Das, Shouvik; Upadhyaya, Hari D.; Ranjan, Rajeev; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. Laxmipathi; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
The study identified 9045 high-quality SNPs employing both genome-wide GBS- and candidate gene-based SNP genotyping assays in 172, including 93 cultivated (desi and kabuli) and 79 wild chickpea accessions. The GWAS in a structured population of 93 sequenced accessions detected 15 major genomic loci exhibiting significant association with seed coat color. Five seed color-associated major genomic loci underlying robust QTLs mapped on a high-density intra-specific genetic linkage map were validated by QTL mapping. The integration of association and QTL mapping with gene haplotype-specific LD mapping and transcript profiling identified novel allelic variants (non-synonymous SNPs) and haplotypes in a MATE secondary transporter gene regulating light/yellow brown and beige seed coat color differentiation in chickpea. The down-regulation and decreased transcript expression of beige seed coat color-associated MATE gene haplotype was correlated with reduced proanthocyanidins accumulation in the mature seed coats of beige than light/yellow brown seed colored desi and kabuli accessions for their coloration/pigmentation. This seed color-regulating MATE gene revealed strong purifying selection pressure primarily in LB/YB seed colored desi and wild Cicer reticulatum accessions compared with the BE seed colored kabuli accessions. The functionally relevant molecular tags identified have potential to decipher the complex transcriptional regulatory gene function of seed coat coloration and for understanding the selective sweep-based seed color trait evolutionary pattern in cultivated and wild accessions during chickpea domestication. The genome-wide integrated approach employed will expedite marker-assisted genetic enhancement for developing cultivars with desirable seed coat color types in chickpea. PMID:26635822
Modeling stochasticity and robustness in gene regulatory networks.
Garg, Abhishek; Mohanram, Kartik; Di Cara, Alessandro; De Micheli, Giovanni; Xenarios, Ioannis
2009-06-15
Understanding gene regulation in biological processes and modeling the robustness of underlying regulatory networks is an important problem that is currently being addressed by computational systems biologists. Lately, there has been a renewed interest in Boolean modeling techniques for gene regulatory networks (GRNs). However, due to their deterministic nature, it is often difficult to identify whether these modeling approaches are robust to the addition of stochastic noise that is widespread in gene regulatory processes. Stochasticity in Boolean models of GRNs has been addressed relatively sparingly in the past, mainly by flipping the expression of genes between different expression levels with a predefined probability. This stochasticity in nodes (SIN) model leads to over representation of noise in GRNs and hence non-correspondence with biological observations. In this article, we introduce the stochasticity in functions (SIF) model for simulating stochasticity in Boolean models of GRNs. By providing biological motivation behind the use of the SIF model and applying it to the T-helper and T-cell activation networks, we show that the SIF model provides more biologically robust results than the existing SIN model of stochasticity in GRNs. Algorithms are made available under our Boolean modeling toolbox, GenYsis. The software binaries can be downloaded from http://si2.epfl.ch/ approximately garg/genysis.html.
Das, Shouvik; Upadhyaya, Hari D.; Bajaj, Deepak; Kujur, Alice; Badoni, Saurabh; Laxmi; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. Laxmipathi; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
A rapid high-resolution genome-wide strategy for molecular mapping of major QTL(s)/gene(s) regulating important agronomic traits is vital for in-depth dissection of complex quantitative traits and genetic enhancement in chickpea. The present study for the first time employed a NGS-based whole-genome QTL-seq strategy to identify one major genomic region harbouring a robust 100-seed weight QTL using an intra-specific 221 chickpea mapping population (desi cv. ICC 7184 × desi cv. ICC 15061). The QTL-seq-derived major SW QTL (CaqSW1.1) was further validated by single-nucleotide polymorphism (SNP) and simple sequence repeat (SSR) marker-based traditional QTL mapping (47.6% R2 at higher LOD >19). This reflects the reliability and efficacy of QTL-seq as a strategy for rapid genome-wide scanning and fine mapping of major trait regulatory QTLs in chickpea. The use of QTL-seq and classical QTL mapping in combination narrowed down the 1.37 Mb (comprising 177 genes) major SW QTL (CaqSW1.1) region into a 35 kb genomic interval on desi chickpea chromosome 1 containing six genes. One coding SNP (G/A)-carrying constitutive photomorphogenic9 (COP9) signalosome complex subunit 8 (CSN8) gene of these exhibited seed-specific expression, including pronounced differential up-/down-regulation in low and high seed weight mapping parents and homozygous individuals during seed development. The coding SNP mined in this potential seed weight-governing candidate CSN8 gene was found to be present exclusively in all cultivated species/genotypes, but not in any wild species/genotypes of primary, secondary and tertiary gene pools. This indicates the effect of strong artificial and/or natural selection pressure on target SW locus during chickpea domestication. The proposed QTL-seq-driven integrated genome-wide strategy has potential to delineate major candidate gene(s) harbouring a robust trait regulatory QTL rapidly with optimal use of resources. This will further assist us to extrapolate the molecular mechanism underlying complex quantitative traits at a genome-wide scale leading to fast-paced marker-assisted genetic improvement in diverse crop plants, including chickpea. PMID:25922536
Robust Smoothing: Smoothing Parameter Selection and Applications to Fluorescence Spectroscopy∂
Lee, Jong Soo; Cox, Dennis D.
2009-01-01
Fluorescence spectroscopy has emerged in recent years as an effective way to detect cervical cancer. Investigation of the data preprocessing stage uncovered a need for a robust smoothing to extract the signal from the noise. Various robust smoothing methods for estimating fluorescence emission spectra are compared and data driven methods for the selection of smoothing parameter are suggested. The methods currently implemented in R for smoothing parameter selection proved to be unsatisfactory, and a computationally efficient procedure that approximates robust leave-one-out cross validation is presented. PMID:20729976
Sima, Chao; Amundson, Sally A.; Zenhausern, Frederic
2018-01-01
Purpose To compile a list of genes that have been reported to be affected by external ionizing radiation (IR) and to assess their performance as candidate biomarkers for individual human radiation dosimetry. Methods Eligible studies were identified through extensive searches of the online databases from 1978 to 2017. Original English-language publications of microarray studies assessing radiation-induced changes in gene expression levels in human blood after external IR were included. Genes identified in at least half of the selected studies were retained for bio-statistical analysis in order to evaluate their diagnostic ability. Results 24 studies met the criteria and were included in this study. Radiation-induced expression of 10,170 unique genes was identified and the 31 genes that have been identified in at least 50% of studies (12/24 studies) were selected for diagnostic power analysis. Twenty-seven genes showed a significant Spearman’s correlation with radiation dose. Individually, TNFSF4, FDXR, MYC, ZMAT3 and GADD45A provided the best discrimination of radiation dose < 2 Gy and dose ≥ 2 Gy according to according to their maximized Youden’s index (0.67, 0.55, 0.55, 0.55 and 0.53 respectively). Moreover, 12 combinations of three genes display an area under the Receiver Operating Curve (ROC) curve (AUC) = 1 reinforcing the concept of biomarker combinations instead of looking for an ideal and unique biomarker. Conclusion Gene expression is a promising approach for radiation dosimetry assessment. A list of robust candidate biomarkers has been identified from analysis of the studies published to date, confirming for example the potential of well-known genes such as FDXR and TNFSF4 or highlighting other promising gene such as ZMAT3. However, heterogeneity in protocols and analysis methods will require additional studies to confirm these results. PMID:29879226
On the robustness of complex heterogeneous gene expression networks.
Gómez-Gardeñes, Jesús; Moreno, Yamir; Floría, Luis M
2005-04-01
We analyze a continuous gene expression model on the underlying topology of a complex heterogeneous network. Numerical simulations aimed at studying the chaotic and periodic dynamics of the model are performed. The results clearly indicate that there is a region in which the dynamical and structural complexity of the system avoid chaotic attractors. However, contrary to what has been reported for Random Boolean Networks, the chaotic phase cannot be completely suppressed, which has important bearings on network robustness and gene expression modeling.
Genetic architecture and the evolution of sex.
Lohaus, Rolf; Burch, Christina L; Azevedo, Ricardo B R
2010-01-01
Theoretical investigations of the advantages of sex have tended to treat the genetic architecture of organisms as static and have not considered that genetic architecture might coevolve with reproductive mode. As a result, some potential advantages of sex may have been missed. Using a gene network model, we recently showed that recombination imposes selection for robustness to mutation and that negative epistasis can evolve as a by-product of this selection. These results motivated a detailed exploration of the mutational deterministic hypothesis, a hypothesis in which the advantage of sex depends critically on epistasis. We found that sexual populations do evolve higher mean fitness and lower genetic load than asexual populations at equilibrium, and, under moderate stabilizing selection and large population size, these equilibrium sexual populations resist invasion by asexuals. However, we found no evidence that these long- and short-term advantages to sex were explained by the negative epistasis that evolved in our experiments. The long-term advantage of sex was that sexual populations evolved a lower deleterious mutation rate, but this property was not sufficient to account for the ability of sexual populations to resist invasion by asexuals. The ability to resist asexual invasion was acquired simultaneously with an increase in recombinational robustness that minimized the cost of sex. These observations provide the first direct evidence that sexual reproduction does indeed select for conditions that favor its own maintenance. Furthermore, our results highlight the importance of considering a dynamic view of the genetic architecture to understand the evolution of sex and recombination.
Luo, Xiong-Jian; Mattheisen, Manuel; Li, Ming; Huang, Liang; Rietschel, Marcella; Børglum, Anders D.; Als, Thomas D.; van den Oord, Edwin J.; Aberg, Karolina A.; Mors, Ole; Mortensen, Preben Bo; Luo, Zhenwu; Degenhardt, Franziska; Cichon, Sven; Schulze, Thomas G.; Nöthen, Markus M.; Su, Bing; Zhao, Zhongming; Gan, Lin; Yao, Yong-Gang
2015-01-01
Genome-wide association studies have identified multiple risk variants and loci that show robust association with schizophrenia. Nevertheless, it remains unclear how these variants confer risk to schizophrenia. In addition, the driving force that maintains the schizophrenia risk variants in human gene pool is poorly understood. To investigate whether expression-associated genetic variants contribute to schizophrenia susceptibility, we systematically integrated brain expression quantitative trait loci and genome-wide association data of schizophrenia using Sherlock, a Bayesian statistical framework. Our analyses identified ZNF323 as a schizophrenia risk gene (P = 2.22×10–6). Subsequent analyses confirmed the association of the ZNF323 and its expression-associated single nucleotide polymorphism rs1150711 in independent samples (gene-expression: P = 1.40×10–6; single-marker meta-analysis in the combined discovery and replication sample comprising 44123 individuals: P = 6.85×10−10). We found that the ZNF323 was significantly downregulated in hippocampus and frontal cortex of schizophrenia patients (P = .0038 and P = .0233, respectively). Evidence for pleiotropic effects was detected (association of rs1150711 with lung function and gene expression of ZNF323 in lung: P = 6.62×10–5 and P = 9.00×10–5, respectively) with the risk allele (T allele) for schizophrenia acting as protective allele for lung function. Subsequent population genetics analyses suggest that the risk allele (T) of rs1150711 might have undergone recent positive selection in human population. Our findings suggest that the ZNF323 is a schizophrenia susceptibility gene whose expression may influence schizophrenia risk. Our study also illustrates a possible mechanism for maintaining schizophrenia risk variants in the human gene pool. PMID:25759474
NASA Astrophysics Data System (ADS)
Zarindast, Atousa; Seyed Hosseini, Seyed Mohamad; Pishvaee, Mir Saman
2017-06-01
Robust supplier selection problem, in a scenario-based approach has been proposed, when the demand and exchange rates are subject to uncertainties. First, a deterministic multi-objective mixed integer linear programming is developed; then, the robust counterpart of the proposed mixed integer linear programming is presented using the recent extension in robust optimization theory. We discuss decision variables, respectively, by a two-stage stochastic planning model, a robust stochastic optimization planning model which integrates worst case scenario in modeling approach and finally by equivalent deterministic planning model. The experimental study is carried out to compare the performances of the three models. Robust model resulted in remarkable cost saving and it illustrated that to cope with such uncertainties, we should consider them in advance in our planning. In our case study different supplier were selected due to this uncertainties and since supplier selection is a strategic decision, it is crucial to consider these uncertainties in planning approach.
Wang, Ya-Xuan; Gao, Ying-Lian; Liu, Jin-Xing; Kong, Xiang-Zhen; Li, Hai-Jun
2017-09-01
Identifying differentially expressed genes from the thousands of genes is a challenging task. Robust principal component analysis (RPCA) is an efficient method in the identification of differentially expressed genes. RPCA method uses nuclear norm to approximate the rank function. However, theoretical studies showed that the nuclear norm minimizes all singular values, so it may not be the best solution to approximate the rank function. The truncated nuclear norm is defined as the sum of some smaller singular values, which may achieve a better approximation of the rank function than nuclear norm. In this paper, a novel method is proposed by replacing nuclear norm of RPCA with the truncated nuclear norm, which is named robust principal component analysis regularized by truncated nuclear norm (TRPCA). The method decomposes the observation matrix of genomic data into a low-rank matrix and a sparse matrix. Because the significant genes can be considered as sparse signals, the differentially expressed genes are viewed as the sparse perturbation signals. Thus, the differentially expressed genes can be identified according to the sparse matrix. The experimental results on The Cancer Genome Atlas data illustrate that the TRPCA method outperforms other state-of-the-art methods in the identification of differentially expressed genes.
Priesner, Christoph; Aleksandrova, Krasimira; Esser, Ruth; Mockel-Tenbrinck, Nadine; Leise, Jana; Drechsel, Katharina; Marburger, Michael; Quaiser, Andrea; Goudeva, Lilia; Arseniev, Lubomir; Kaiser, Andrew D.; Glienke, Wolfgang; Koehl, Ulrike
2016-01-01
Multiple clinical studies have demonstrated that adaptive immunotherapy using redirected T cells against advanced cancer has led to promising results with improved patient survival. The continuously increasing interest in those advanced gene therapy medicinal products (GTMPs) leads to a manufacturing challenge regarding automation, process robustness, and cell storage. Therefore, this study addresses the proof of principle in clinical-scale selection, stimulation, transduction, and expansion of T cells using the automated closed CliniMACS® Prodigy system. Naïve and central memory T cells from apheresis products were first immunomagnetically enriched using anti-CD62L magnetic beads and further processed freshly (n = 3) or split for cryopreservation and processed after thawing (n = 1). Starting with 0.5 × 108 purified CD3+ T cells, three mock runs and one run including transduction with green fluorescent protein (GFP)-containing vector resulted in a median final cell product of 16 × 108 T cells (32-fold expansion) up to harvesting after 2 weeks. Expression of CD62L was downregulated on T cells after thawing, which led to the decision to purify CD62L+CD3+ T cells freshly with cryopreservation thereafter. Most important in the split product, a very similar expansion curve was reached comparing the overall freshly CD62L selected cells with those after thawing, which could be demonstrated in the T cell subpopulations as well by showing a nearly identical conversion of the CD4/CD8 ratio. In the GFP run, the transduction efficacy was 83%. In-process control also demonstrated sufficient glucose levels during automated feeding and medium removal. The robustness of the process and the constant quality of the final product in a closed and automated system give rise to improve harmonized manufacturing protocols for engineered T cells in future gene therapy studies. PMID:27562135
Interferon-γ Inhibits Ebola Virus Infection.
Rhein, Bethany A; Powers, Linda S; Rogers, Kai; Anantpadma, Manu; Singh, Brajesh K; Sakurai, Yasuteru; Bair, Thomas; Miller-Hunt, Catherine; Sinn, Patrick; Davey, Robert A; Monick, Martha M; Maury, Wendy
2015-01-01
Ebola virus outbreaks, such as the 2014 Makona epidemic in West Africa, are episodic and deadly. Filovirus antivirals are currently not clinically available. Our findings suggest interferon gamma, an FDA-approved drug, may serve as a novel and effective prophylactic or treatment option. Using mouse-adapted Ebola virus, we found that murine interferon gamma administered 24 hours before or after infection robustly protects lethally-challenged mice and reduces morbidity and serum viral titers. Furthermore, we demonstrated that interferon gamma profoundly inhibits Ebola virus infection of macrophages, an early cellular target of infection. As early as six hours following in vitro infection, Ebola virus RNA levels in interferon gamma-treated macrophages were lower than in infected, untreated cells. Addition of the protein synthesis inhibitor, cycloheximide, to interferon gamma-treated macrophages did not further reduce viral RNA levels, suggesting that interferon gamma blocks life cycle events that require protein synthesis such as virus replication. Microarray studies with interferon gamma-treated human macrophages identified more than 160 interferon-stimulated genes. Ectopic expression of a select group of these genes inhibited Ebola virus infection. These studies provide new potential avenues for antiviral targeting as these genes that have not previously appreciated to inhibit negative strand RNA viruses and specifically Ebola virus infection. As treatment of interferon gamma robustly protects mice from lethal Ebola virus infection, we propose that interferon gamma should be further evaluated for its efficacy as a prophylactic and/or therapeutic strategy against filoviruses. Use of this FDA-approved drug could rapidly be deployed during future outbreaks.
The developmental genetics of biological robustness
Mestek Boukhibar, Lamia; Barkoulas, Michalis
2016-01-01
Background Living organisms are continuously confronted with perturbations, such as environmental changes that include fluctuations in temperature and nutrient availability, or genetic changes such as mutations. While some developmental systems are affected by such challenges and display variation in phenotypic traits, others continue consistently to produce invariable phenotypes despite perturbation. This ability of a living system to maintain an invariable phenotype in the face of perturbations is termed developmental robustness. Biological robustness is a phenomenon observed across phyla, and studying its mechanisms is central to deciphering the genotype–phenotype relationship. Recent work in yeast, animals and plants has shown that robustness is genetically controlled and has started to reveal the underlying mechinisms behind it. Scope and Conclusions Studying biological robustness involves focusing on an important property of developmental traits, which is the phenotypic distribution within a population. This is often neglected because the vast majority of developmental biology studies instead focus on population aggregates, such as trait averages. By drawing on findings in animals and yeast, this Viewpoint considers how studies on plant developmental robustness may benefit from strict definitions of what is the developmental system of choice and what is the relevant perturbation, and also from clear distinctions between gene effects on the trait mean and the trait variance. Recent advances in quantitative developmental biology and high-throughput phenotyping now allow the design of targeted genetic screens to identify genes that amplify or restrict developmental trait variance and to study how variation propagates across different phenotypic levels in biological systems. The molecular characterization of more quantitative trait loci affecting trait variance will provide further insights into the evolution of genes modulating developmental robustness. The study of robustness mechanisms in closely related species will address whether mechanisms of robustness are evolutionarily conserved. PMID:26292993
Parkinson's Disease Gene Therapy: Success by Design Meets Failure by Efficacy
Bartus, Raymond T; Weinberg, Marc S; Samulski, R. Jude
2014-01-01
Over the past decade, nine gene therapy clinical trials for Parkinson's disease (PD) have been initiated and completed. Starting with considerable optimism at the initiation of each trial, none of the programs has yet borne sufficiently robust clinical efficacy or found a clear path toward regulatory approval. Despite the immediately disappointing nature of the efficacy outcomes in these trials, the clinical data garnered from the individual studies nonetheless represent tangible and significant progress for the gene therapy field. Collectively, the clinical trials demonstrate that we have overcome the major safety hurdles previously suppressing central nervous system (CNS) gene therapy, for none produced any evidence of untoward risk or harm after administration of various vector-delivery systems. More importantly, these studies also demonstrated controlled, highly persistent generation of biologically active proteins targeted to structures deep in the human brain. Therefore, a renewed, focused emphasis must be placed on advancing clinical efficacy by improving clinical trial design, patient selection and outcome measures, developing more predictive animal models to support clinical testing, carefully performing retrospective analyses, and most importantly moving forward—beyond our past limits. PMID:24356252
Genetic Mapping and Exome Sequencing Identify Variants Associated with Five Novel Diseases
Puffenberger, Erik G.; Jinks, Robert N.; Sougnez, Carrie; Cibulskis, Kristian; Willert, Rebecca A.; Achilly, Nathan P.; Cassidy, Ryan P.; Fiorentini, Christopher J.; Heiken, Kory F.; Lawrence, Johnny J.; Mahoney, Molly H.; Miller, Christopher J.; Nair, Devika T.; Politi, Kristin A.; Worcester, Kimberly N.; Setton, Roni A.; DiPiazza, Rosa; Sherman, Eric A.; Eastman, James T.; Francklyn, Christopher; Robey-Bond, Susan; Rider, Nicholas L.; Gabriel, Stacey; Morton, D. Holmes; Strauss, Kevin A.
2012-01-01
The Clinic for Special Children (CSC) has integrated biochemical and molecular methods into a rural pediatric practice serving Old Order Amish and Mennonite (Plain) children. Among the Plain people, we have used single nucleotide polymorphism (SNP) microarrays to genetically map recessive disorders to large autozygous haplotype blocks (mean = 4.4 Mb) that contain many genes (mean = 79). For some, uninformative mapping or large gene lists preclude disease-gene identification by Sanger sequencing. Seven such conditions were selected for exome sequencing at the Broad Institute; all had been previously mapped at the CSC using low density SNP microarrays coupled with autozygosity and linkage analyses. Using between 1 and 5 patient samples per disorder, we identified sequence variants in the known disease-causing genes SLC6A3 and FLVCR1, and present evidence to strongly support the pathogenicity of variants identified in TUBGCP6, BRAT1, SNIP1, CRADD, and HARS. Our results reveal the power of coupling new genotyping technologies to population-specific genetic knowledge and robust clinical data. PMID:22279524
Evolution of functional specialization and division of labor.
Rueffler, Claus; Hermisson, Joachim; Wagner, Günter P
2012-02-07
Division of labor among functionally specialized modules occurs at all levels of biological organization in both animals and plants. Well-known examples include the evolution of specialized enzymes after gene duplication, the evolution of specialized cell types, limb diversification in arthropods, and the evolution of specialized colony members in many taxa of marine invertebrates and social insects. Here, we identify conditions favoring the evolution of division of labor by means of a general mathematical model. Our starting point is the assumption that modules contribute to two different biological tasks and that the potential of modules to contribute to these tasks is traded off. Our results are phrased in terms of properties of performance functions that map the phenotype of modules to measures of performance. We show that division of labor is favored by three factors: positional effects that predispose modules for one of the tasks, accelerating performance functions, and synergistic interactions between modules. If modules can be lost or damaged, selection for robustness can counteract selection for functional specialization. To illustrate our theory we apply it to the evolution of specialized enzymes coded by duplicated genes.
High-efficiency targeted editing of large viral genomes by RNA-guided nucleases.
Bi, Yanwei; Sun, Le; Gao, Dandan; Ding, Chen; Li, Zhihua; Li, Yadong; Cun, Wei; Li, Qihan
2014-05-01
A facile and efficient method for the precise editing of large viral genomes is required for the selection of attenuated vaccine strains and the construction of gene therapy vectors. The type II prokaryotic CRISPR-Cas (clustered regularly interspaced short palindromic repeats (CRISPR)-associated (Cas)) RNA-guided nuclease system can be introduced into host cells during viral replication. The CRISPR-Cas9 system robustly stimulates targeted double-stranded breaks in the genomes of DNA viruses, where the non-homologous end joining (NHEJ) and homology-directed repair (HDR) pathways can be exploited to introduce site-specific indels or insert heterologous genes with high frequency. Furthermore, CRISPR-Cas9 can specifically inhibit the replication of the original virus, thereby significantly increasing the abundance of the recombinant virus among progeny virus. As a result, purified recombinant virus can be obtained with only a single round of selection. In this study, we used recombinant adenovirus and type I herpes simplex virus as examples to demonstrate that the CRISPR-Cas9 system is a valuable tool for editing the genomes of large DNA viruses.
High-Efficiency Targeted Editing of Large Viral Genomes by RNA-Guided Nucleases
Gao, Dandan; Ding, Chen; Li, Zhihua; Li, Yadong; Cun, Wei; Li, Qihan
2014-01-01
A facile and efficient method for the precise editing of large viral genomes is required for the selection of attenuated vaccine strains and the construction of gene therapy vectors. The type II prokaryotic CRISPR-Cas (clustered regularly interspaced short palindromic repeats (CRISPR)-associated (Cas)) RNA-guided nuclease system can be introduced into host cells during viral replication. The CRISPR-Cas9 system robustly stimulates targeted double-stranded breaks in the genomes of DNA viruses, where the non-homologous end joining (NHEJ) and homology-directed repair (HDR) pathways can be exploited to introduce site-specific indels or insert heterologous genes with high frequency. Furthermore, CRISPR-Cas9 can specifically inhibit the replication of the original virus, thereby significantly increasing the abundance of the recombinant virus among progeny virus. As a result, purified recombinant virus can be obtained with only a single round of selection. In this study, we used recombinant adenovirus and type I herpes simplex virus as examples to demonstrate that the CRISPR-Cas9 system is a valuable tool for editing the genomes of large DNA viruses. PMID:24788700
Yu, Yiyang; Yan, Fang; Chen, Yun; Jin, Christopher; Guo, Jian-Hua; Chai, Yunrong
2016-01-01
Bacillus subtilis is long known to produce poly-γ-glutamic acids (γ-PGA) as one of the major secreted polymeric substances. In B. subtilis, the regulation of γ-PGA production and its physiological role are still unclear. B. subtilis is also capable of forming structurally complex multicellular communities, or biofilms, in which an extracellular matrix consisting of secreted proteins and polysaccharides holds individual cells together. Biofilms were shown to facilitate B. subtilis–plant interactions. In this study, we show that different environmental isolates of B. subtilis, all capable of forming biofilms, vary significantly in γ-PGA production. This is possibly due to differential regulation of γ-PGA biosynthesis genes. In many of those environmental isolates, γ-PGA seems to contribute to robustness and complex morphology of the colony biofilms, suggesting a role of γ-PGA in biofilm formation. Our evidence further shows that in selected B. subtilis strains, γ-PGA also plays a role in root colonization by the bacteria, pinpointing a possible function of γ-PGA in B. subtilis–plant interactions. Finally, we found that several pathways co-regulate both γ-PGA biosynthesis genes and genes for the biofilm matrix in B. subtilis, but in an opposing fashion. We discussed potential biological significance of that. PMID:27891125
Xiao, Yang; Cheng, Xuanjin; Liu, Jun; Li, Chuang; Nong, Wenyan; Bian, Yinbing; Cheung, Man Kit; Kwan, Hoi Shan
2016-11-10
The elucidation of genome-wide variations could help reveal aspects of divergence, domestication, and adaptation of edible mushrooms. Here, we resequenced the whole genomes of 39 wild and 21 cultivated strains of Chinese Lentinula edodes, the shiitake mushroom. We identified three distinct genetic groups in the Chinese L. edodes population with robust differentiation. Results of phylogenetic and population structure analyses suggest that the cultivated strains and most of the wild trains of L. edodes in China possess different gene pools and two outlier strains show signatures of hybridization between groups. Eighty-four candidate genes contributing to population divergence were detected in outlier analysis, 18 of which are involved in response to environmental stresses. Gene enrichment analysis of group-specific single nucleotide polymorphisms showed that the cultivated strains were genetically diversified in biological processes related to stress response. As the formation of fruiting bodies is a stress-response process, we postulate that environment factors, such as temperature, drove the population divergence of L. edodes in China by natural or artificial selection. We also found phenotypic variations between groups and identified some wild strains that have potential to diversify the genetic pool for improving agricultural traits of L. edodes cultivars in China.
Xiao, Yang; Cheng, Xuanjin; Liu, Jun; Li, Chuang; Nong, Wenyan; Bian, Yinbing; Cheung, Man Kit; Kwan, Hoi Shan
2016-01-01
The elucidation of genome-wide variations could help reveal aspects of divergence, domestication, and adaptation of edible mushrooms. Here, we resequenced the whole genomes of 39 wild and 21 cultivated strains of Chinese Lentinula edodes, the shiitake mushroom. We identified three distinct genetic groups in the Chinese L. edodes population with robust differentiation. Results of phylogenetic and population structure analyses suggest that the cultivated strains and most of the wild trains of L. edodes in China possess different gene pools and two outlier strains show signatures of hybridization between groups. Eighty-four candidate genes contributing to population divergence were detected in outlier analysis, 18 of which are involved in response to environmental stresses. Gene enrichment analysis of group-specific single nucleotide polymorphisms showed that the cultivated strains were genetically diversified in biological processes related to stress response. As the formation of fruiting bodies is a stress-response process, we postulate that environment factors, such as temperature, drove the population divergence of L. edodes in China by natural or artificial selection. We also found phenotypic variations between groups and identified some wild strains that have potential to diversify the genetic pool for improving agricultural traits of L. edodes cultivars in China. PMID:27830835
Antinociceptive action of NOP and opioid receptor agonists in the mouse orofacial formalin test.
Rizzi, A; Ruzza, C; Bianco, S; Trapella, C; Calo', G
2017-08-01
Nociceptin/orphanin FQ (N/OFQ) modulates several biological functions, including pain transmission via selective activation of a specific receptor named NOP. The aim of this study was the investigation of the antinociceptive properties of NOP agonists and their interaction with opioids in the trigeminal territory. The orofacial formalin (OFF) test in mice was used to investigate the antinociceptive potential associated to the activation of NOP and opioid receptors. Mice subjected to OFF test displayed the typical biphasic nociceptive response and sensitivity to opioid and NSAID drugs. Mice knockout for the NOP gene displayed a robust pronociceptive phenotype. The NOP selective agonist Ro 65-6570 (0.1-1mgkg -1 ) and morphine (0.1-10mgkg -1 ) elicited dose dependent antinociceptive effects in the OFF with the alkaloid showing larger effects; the isobologram analysis of their actions demonstrated an additive type of interaction. The mixed NOP/opioid receptor agonist cebranopadol elicited potent (0.01-0.1mgkg -1 ) and robust antinociceptive effects. In the investigated dose range, all drugs did not modify the motor performance of the mice in the rotarod test. Collectively the results of this study demonstrated that selective NOP agonists and particularly mixed NOP/opioid agonists are worthy of development as innovative drugs to treat painful conditions of the trigeminal territory. Copyright © 2017 Elsevier Inc. All rights reserved.
Dong, Zuoli; Zhang, Naiqian; Li, Chun; Wang, Haiyun; Fang, Yun; Wang, Jun; Zheng, Xiaoqi
2015-06-30
An enduring challenge in personalized medicine is to select right drug for individual patients. Testing drugs on patients in large clinical trials is one way to assess their efficacy and toxicity, but it is impractical to test hundreds of drugs currently under development. Therefore the preclinical prediction model is highly expected as it enables prediction of drug response to hundreds of cell lines in parallel. Recently, two large-scale pharmacogenomic studies screened multiple anticancer drugs on over 1000 cell lines in an effort to elucidate the response mechanism of anticancer drugs. To this aim, we here used gene expression features and drug sensitivity data in Cancer Cell Line Encyclopedia (CCLE) to build a predictor based on Support Vector Machine (SVM) and a recursive feature selection tool. Robustness of our model was validated by cross-validation and an independent dataset, the Cancer Genome Project (CGP). Our model achieved good cross validation performance for most drugs in the Cancer Cell Line Encyclopedia (≥80% accuracy for 10 drugs, ≥75% accuracy for 19 drugs). Independent tests on eleven common drugs between CCLE and CGP achieved satisfactory performance for three of them, i.e., AZD6244, Erlotinib and PD-0325901, using expression levels of only twelve, six and seven genes, respectively. These results suggest that drug response could be effectively predicted from genomic features. Our model could be applied to predict drug response for some certain drugs and potentially play a complementary role in personalized medicine.
An Expanded Lateral Interactive Clonal Selection Algorithm and Its Application
NASA Astrophysics Data System (ADS)
Gao, Shangce; Dai, Hongwei; Zhang, Jianchen; Tang, Zheng
Based on the clonal selection principle proposed by Burnet, in the immune response process there is no crossover of genetic material between members of the repertoire, i. e., there is no knowledge communication during different elite pools in the previous clonal selection models. As a result, the search performance of these models is ineffective. To solve this problem, inspired by the concept of the idiotypic network theory, an expanded lateral interactive clonal selection algorithm (LICS) is put forward. In LICS, an antibody is matured not only through the somatic hypermutation and the receptor editing from the B cell, but also through the stimuli from other antibodies. The stimuli is realized by memorizing some common gene segment on the idiotypes, based on which a lateral interactive receptor editing operator is also introduced. Then, LICS is applied to several benchmark instances of the traveling salesman problem. Simulation results show the efficiency and robustness of LICS when compared to other traditional algorithms.
Mollah, Mohammad Manir Hossain; Jamal, Rahman; Mokhtar, Norfilza Mohd; Harun, Roslan; Mollah, Md. Nurul Haque
2015-01-01
Background Identifying genes that are differentially expressed (DE) between two or more conditions with multiple patterns of expression is one of the primary objectives of gene expression data analysis. Several statistical approaches, including one-way analysis of variance (ANOVA), are used to identify DE genes. However, most of these methods provide misleading results for two or more conditions with multiple patterns of expression in the presence of outlying genes. In this paper, an attempt is made to develop a hybrid one-way ANOVA approach that unifies the robustness and efficiency of estimation using the minimum β-divergence method to overcome some problems that arise in the existing robust methods for both small- and large-sample cases with multiple patterns of expression. Results The proposed method relies on a β-weight function, which produces values between 0 and 1. The β-weight function with β = 0.2 is used as a measure of outlier detection. It assigns smaller weights (≥ 0) to outlying expressions and larger weights (≤ 1) to typical expressions. The distribution of the β-weights is used to calculate the cut-off point, which is compared to the observed β-weight of an expression to determine whether that gene expression is an outlier. This weight function plays a key role in unifying the robustness and efficiency of estimation in one-way ANOVA. Conclusion Analyses of simulated gene expression profiles revealed that all eight methods (ANOVA, SAM, LIMMA, EBarrays, eLNN, KW, robust BetaEB and proposed) perform almost identically for m = 2 conditions in the absence of outliers. However, the robust BetaEB method and the proposed method exhibited considerably better performance than the other six methods in the presence of outliers. In this case, the BetaEB method exhibited slightly better performance than the proposed method for the small-sample cases, but the the proposed method exhibited much better performance than the BetaEB method for both the small- and large-sample cases in the presence of more than 50% outlying genes. The proposed method also exhibited better performance than the other methods for m > 2 conditions with multiple patterns of expression, where the BetaEB was not extended for this condition. Therefore, the proposed approach would be more suitable and reliable on average for the identification of DE genes between two or more conditions with multiple patterns of expression. PMID:26413858
Accurate and sensitive quantification of protein-DNA binding affinity.
Rastogi, Chaitanya; Rube, H Tomas; Kribelbauer, Judith F; Crocker, Justin; Loker, Ryan E; Martini, Gabriella D; Laptenko, Oleg; Freed-Pastor, William A; Prives, Carol; Stern, David L; Mann, Richard S; Bussemaker, Harmen J
2018-04-17
Transcription factors (TFs) control gene expression by binding to genomic DNA in a sequence-specific manner. Mutations in TF binding sites are increasingly found to be associated with human disease, yet we currently lack robust methods to predict these sites. Here, we developed a versatile maximum likelihood framework named No Read Left Behind (NRLB) that infers a biophysical model of protein-DNA recognition across the full affinity range from a library of in vitro selected DNA binding sites. NRLB predicts human Max homodimer binding in near-perfect agreement with existing low-throughput measurements. It can capture the specificity of the p53 tetramer and distinguish multiple binding modes within a single sample. Additionally, we confirm that newly identified low-affinity enhancer binding sites are functional in vivo, and that their contribution to gene expression matches their predicted affinity. Our results establish a powerful paradigm for identifying protein binding sites and interpreting gene regulatory sequences in eukaryotic genomes. Copyright © 2018 the Author(s). Published by PNAS.
Accurate and sensitive quantification of protein-DNA binding affinity
Rastogi, Chaitanya; Rube, H. Tomas; Kribelbauer, Judith F.; Crocker, Justin; Loker, Ryan E.; Martini, Gabriella D.; Laptenko, Oleg; Freed-Pastor, William A.; Prives, Carol; Stern, David L.; Mann, Richard S.; Bussemaker, Harmen J.
2018-01-01
Transcription factors (TFs) control gene expression by binding to genomic DNA in a sequence-specific manner. Mutations in TF binding sites are increasingly found to be associated with human disease, yet we currently lack robust methods to predict these sites. Here, we developed a versatile maximum likelihood framework named No Read Left Behind (NRLB) that infers a biophysical model of protein-DNA recognition across the full affinity range from a library of in vitro selected DNA binding sites. NRLB predicts human Max homodimer binding in near-perfect agreement with existing low-throughput measurements. It can capture the specificity of the p53 tetramer and distinguish multiple binding modes within a single sample. Additionally, we confirm that newly identified low-affinity enhancer binding sites are functional in vivo, and that their contribution to gene expression matches their predicted affinity. Our results establish a powerful paradigm for identifying protein binding sites and interpreting gene regulatory sequences in eukaryotic genomes. PMID:29610332
Oran, Amanda R.; Adams, Clare M.; Zhang, Xiao-yong; Gennaro, Victoria J.; Pfeiffer, Harla K.; Mellert, Hestia S.; Seidel, Hans E.; Mascioli, Kirsten; Kaplan, Jordan; Gaballa, Mahmoud R.; Shen, Chen; Rigoutsos, Isidore; King, Michael P.; Cotney, Justin L.; Arnold, Jamie J.; Sharma, Suresh D.; Martinez, Ubaldo E.; Vakoc, Christopher R.; Chodosh, Lewis A.; Thompson, James E.; Bradner, James E.; Cameron, Craig E.; Shadel, Gerald S.; Eischen, Christine M.; McMahon, Steven B.
2016-01-01
Despite ubiquitous activation in human cancer, essential downstream effector pathways of the MYC transcription factor have been difficult to define and target. Using a structure/function-based approach, we identified the mitochondrial RNA polymerase (POLRMT) locus as a critical downstream target of MYC. The multifunctional POLRMT enzyme controls mitochondrial gene expression, a process required both for mitochondrial function and mitochondrial biogenesis. We further demonstrate that inhibition of this newly defined MYC effector pathway causes robust and selective tumor cell apoptosis, via an acute, checkpoint-like mechanism linked to aberrant electron transport chain complex assembly and mitochondrial reactive oxygen species (ROS) production. Fortuitously, MYC-dependent tumor cell death can be induced by inhibiting the mitochondrial gene expression pathway using a variety of strategies, including treatment with FDA-approved antibiotics. In vivo studies using a mouse model of Burkitt's Lymphoma provide pre-clinical evidence that these antibiotics can successfully block progression of MYC-dependent tumors. PMID:27590350
Oran, Amanda R; Adams, Clare M; Zhang, Xiao-Yong; Gennaro, Victoria J; Pfeiffer, Harla K; Mellert, Hestia S; Seidel, Hans E; Mascioli, Kirsten; Kaplan, Jordan; Gaballa, Mahmoud R; Shen, Chen; Rigoutsos, Isidore; King, Michael P; Cotney, Justin L; Arnold, Jamie J; Sharma, Suresh D; Martinez-Outschoorn, Ubaldo E; Vakoc, Christopher R; Chodosh, Lewis A; Thompson, James E; Bradner, James E; Cameron, Craig E; Shadel, Gerald S; Eischen, Christine M; McMahon, Steven B
2016-11-08
Despite ubiquitous activation in human cancer, essential downstream effector pathways of the MYC transcription factor have been difficult to define and target. Using a structure/function-based approach, we identified the mitochondrial RNA polymerase (POLRMT) locus as a critical downstream target of MYC. The multifunctional POLRMT enzyme controls mitochondrial gene expression, a process required both for mitochondrial function and mitochondrial biogenesis. We further demonstrate that inhibition of this newly defined MYC effector pathway causes robust and selective tumor cell apoptosis, via an acute, checkpoint-like mechanism linked to aberrant electron transport chain complex assembly and mitochondrial reactive oxygen species (ROS) production. Fortuitously, MYC-dependent tumor cell death can be induced by inhibiting the mitochondrial gene expression pathway using a variety of strategies, including treatment with FDA-approved antibiotics. In vivo studies using a mouse model of Burkitt's Lymphoma provide pre-clinical evidence that these antibiotics can successfully block progression of MYC-dependent tumors.
Wu, Cen; Jiang, Yu; Ren, Jie; Cui, Yuehua; Ma, Shuangge
2018-02-10
Identification of gene-environment (G × E) interactions associated with disease phenotypes has posed a great challenge in high-throughput cancer studies. The existing marginal identification methods have suffered from not being able to accommodate the joint effects of a large number of genetic variants, while some of the joint-effect methods have been limited by failing to respect the "main effects, interactions" hierarchy, by ignoring data contamination, and by using inefficient selection techniques under complex structural sparsity. In this article, we develop an effective penalization approach to identify important G × E interactions and main effects, which can account for the hierarchical structures of the 2 types of effects. Possible data contamination is accommodated by adopting the least absolute deviation loss function. The advantage of the proposed approach over the alternatives is convincingly demonstrated in both simulation and a case study on lung cancer prognosis with gene expression measurements and clinical covariates under the accelerated failure time model. Copyright © 2017 John Wiley & Sons, Ltd.
Van Waes, Vincent; Beverley, Joel; Marinelli, Michela; Steiner, Heinz
2010-01-01
The psychostimulant methylphenidate (Ritalin) is used in conjunction with selective serotonin reuptake inhibitors (SSRIs) in the treatment of medical conditions such as attention-deficit hyperactivity disorder with anxiety/depression comorbidity and major depression. Co-exposure also occurs in patients on SSRIs that use psychostimulant “cognitive enhancers”. Methylphenidate is a dopamine/norepinephrine reuptake inhibitor that produces altered gene expression in the forebrain; these effects partly mimic gene regulation by cocaine (dopamine/norepinephrine/serotonin reuptake inhibitor). We investigated whether the addition of SSRIs (fluoxetine or citalopram; 5 mg/kg) modified gene regulation by methylphenidate (2–5 mg/kg) in the striatum and cortex of adolescent rats. Our results show that SSRIs potentiate methylphenidate-induced expression of the transcription factors zif 268 and c-fos in the striatum, rendering these molecular changes more cocaine-like. Present throughout most of the striatum, this potentiation was most robust in its sensorimotor parts. The methylphenidate + SSRI combination also enhanced behavioral stereotypies, consistent with dysfunction in sensorimotor striatal circuits. In so far as such gene regulation is implicated in psychostimulant addiction, our findings suggest that SSRIs may enhance the addiction liability of methylphenidate. PMID:20704593
Hayashi, Masamichi; Bernert, Heike; Kagohara, Luciane Tsukamoto; Maldonado, Leonel; Brait, Mariana; Schoenberg, Mark; Bivalacqua, Trinity; Netto, George J; Koch, Wayne; Sidransky, David; Hoque, Mohammad O
2014-05-30
To identify new epigenetic markers and further characterize Urothelial Cell Carcinoma (UCC), we tested the promoter methylation (PM) status of 19 genes previously identified as cancer specific methylated genes in other solid tumors. We used bisulfite sequencing, methylation specific PCR and quantitative methylation specific PCR (QMSP) to test the PM status of 19 genes in urothelial cancer cell lines. Among the 19 genes tested, VGF was found to be completely methylated in several UCC cell lines. VGF QMSP analysis showed that methylation values of almost all the primary 19 UCC tissues were higher than the paired normal tissues (P=0.009). In another cohort, 12/35 (34.3%) of low grade UCC cases displayed VGF methylation. As a biomarker for non-invasive detection of UCC, VGF showed a significantly higher frequency of methylation in urine from UCC cases (8/20) compared to controls (1/20) (P=0.020). After treatment of cell lines with 5-Aza-2'-deoxycytidine, VGF was robustly re-expressed. Forced expression of VGF in bladder cancer cell lines inhibited cell growth. Selection of candidates from genome-wide screening approach in other solid tumors successfully identified UCC specific methylated genes.
Paix, Alexandre; Wang, Yuemeng; Smith, Harold E.; Lee, Chih-Yung S.; Calidas, Deepika; Lu, Tu; Smith, Jarrett; Schmidt, Helen; Krause, Michael W.; Seydoux, Geraldine
2014-01-01
Homology-directed repair (HDR) of double-strand DNA breaks is a promising method for genome editing, but is thought to be less efficient than error-prone nonhomologous end joining in most cell types. We have investigated HDR of double-strand breaks induced by CRISPR-associated protein 9 (Cas9) in Caenorhabditis elegans. We find that HDR is very robust in the C. elegans germline. Linear repair templates with short (∼30–60 bases) homology arms support the integration of base and gene-sized edits with high efficiency, bypassing the need for selection. Based on these findings, we developed a systematic method to mutate, tag, or delete any gene in the C. elegans genome without the use of co-integrated markers or long homology arms. We generated 23 unique edits at 11 genes, including premature stops, whole-gene deletions, and protein fusions to antigenic peptides and GFP. Whole-genome sequencing of five edited strains revealed the presence of passenger variants, but no mutations at predicted off-target sites. The method is scalable for multi-gene editing projects and could be applied to other animals with an accessible germline. PMID:25249454
Robustness of meta-analyses in finding gene × environment interactions
Shi, Gang; Nehorai, Arye
2017-01-01
Meta-analyses that synthesize statistical evidence across studies have become important analytical tools for genetic studies. Inspired by the success of genome-wide association studies of the genetic main effect, researchers are searching for gene × environment interactions. Confounders are routinely included in the genome-wide gene × environment interaction analysis as covariates; however, this does not control for any confounding effects on the results if covariate × environment interactions are present. We carried out simulation studies to evaluate the robustness to the covariate × environment confounder for meta-regression and joint meta-analysis, which are two commonly used meta-analysis methods for testing the gene × environment interaction or the genetic main effect and interaction jointly. Here we show that meta-regression is robust to the covariate × environment confounder while joint meta-analysis is subject to the confounding effect with inflated type I error rates. Given vast sample sizes employed in genome-wide gene × environment interaction studies, non-significant covariate × environment interactions at the study level could substantially elevate the type I error rate at the consortium level. When covariate × environment confounders are present, type I errors can be controlled in joint meta-analysis by including the covariate × environment terms in the analysis at the study level. Alternatively, meta-regression can be applied, which is robust to potential covariate × environment confounders. PMID:28362796
Minaya, Miguel; Díaz-Pérez, Antonio; Mason-Gamer, Roberta; Pimentel, Manuel; Catalán, Pilar
2015-10-01
Low-copy nuclear genes (LCNGs) have complex genetic architectures and evolutionary dynamics. However, unlike multicopy nuclear genes, LCNGs are rarely subject to gene conversion or concerted evolution, and they have higher mutation rates than organellar or nuclear ribosomal DNA markers, so they have great potential for improving the robustness of phylogenetic reconstructions at all taxonomic levels. In this study, our first objective is to evaluate the evolutionary dynamics of the LCNG β-amylase by testing for potential pseudogenization, paralogy, homeology, recombination, and phylogenetic incongruence within a broad representation of the main Pooideae lineages. Our second objective is to determine whether β-amylase shows sufficient phylogenetic signal to reconstruct the evolutionary history of the Pooid grasses. A multigenic (ITS, matK, ndhF, trnTL, and trnLF) tree of the study group provided a framework for assessing the β-amylase phylogeny. Eight accessions showed complete absence of selection, suggesting putative pseudogenic copies or other relaxed selection pressures; resolution of Vulpia alopecuros 2x clones indicated its potential (semi) paralogy; and homeologous copies of allopolyploid species Festuca simensis, F. fenas, and F. arundinacea tracked their Mediterranean origin. Two recombination events were found within early-diverged Pooideae lineages, and five within the PACCMAD clade. The unexpected phylogenetic relationships of 37 grass species (26% of the sampled species) highlight the frequent occurrence of non-treelike evolutionary events, so this LCNG should be used with caution as a phylogenetic marker. However, once the pitfalls are identified and removed, the phylogenetic reconstruction of the grasses based on the β-amylase exon+intron positions is optimal at all taxonomic levels. Copyright © 2015 Elsevier Inc. All rights reserved.
Evidence of a Conserved Molecular Response to Selection for Increased Brain Size in Primates
Harrison, Peter W.; Caravas, Jason A.; Raghanti, Mary Ann; Phillips, Kimberley A.; Mundy, Nicholas I.
2017-01-01
The adaptive significance of human brain evolution has been frequently studied through comparisons with other primates. However, the evolution of increased brain size is not restricted to the human lineage but is a general characteristic of primate evolution. Whether or not these independent episodes of increased brain size share a common genetic basis is unclear. We sequenced and de novo assembled the transcriptome from the neocortical tissue of the most highly encephalized nonhuman primate, the tufted capuchin monkey (Cebus apella). Using this novel data set, we conducted a genome-wide analysis of orthologous brain-expressed protein coding genes to identify evidence of conserved gene–phenotype associations and species-specific adaptations during three independent episodes of brain size increase. We identify a greater number of genes associated with either total brain mass or relative brain size across these six species than show species-specific accelerated rates of evolution in individual large-brained lineages. We test the robustness of these associations in an expanded data set of 13 species, through permutation tests and by analyzing how genome-wide patterns of substitution co-vary with brain size. Many of the genes targeted by selection during brain expansion have glutamatergic functions or roles in cell cycle dynamics. We also identify accelerated evolution in a number of individual capuchin genes whose human orthologs are associated with human neuropsychiatric disorders. These findings demonstrate the value of phenotypically informed genome analyses, and suggest at least some aspects of human brain evolution have occurred through conserved gene–phenotype associations. Understanding these commonalities is essential for distinguishing human-specific selection events from general trends in brain evolution. PMID:28391320
Validation of endogenous reference genes for qRT-PCR analysis of human visceral adipose samples
2010-01-01
Background Given the epidemic proportions of obesity worldwide and the concurrent prevalence of metabolic syndrome, there is an urgent need for better understanding the underlying mechanisms of metabolic syndrome, in particular, the gene expression differences which may participate in obesity, insulin resistance and the associated series of chronic liver conditions. Real-time PCR (qRT-PCR) is the standard method for studying changes in relative gene expression in different tissues and experimental conditions. However, variations in amount of starting material, enzymatic efficiency and presence of inhibitors can lead to quantification errors. Hence the need for accurate data normalization is vital. Among several known strategies for data normalization, the use of reference genes as an internal control is the most common approach. Recent studies have shown that both obesity and presence of insulin resistance influence an expression of commonly used reference genes in omental fat. In this study we validated candidate reference genes suitable for qRT-PCR profiling experiments using visceral adipose samples from obese and lean individuals. Results Cross-validation of expression stability of eight selected reference genes using three popular algorithms, GeNorm, NormFinder and BestKeeper found ACTB and RPII as most stable reference genes. Conclusions We recommend ACTB and RPII as stable reference genes most suitable for gene expression studies of human visceral adipose tissue. The use of these genes as a reference pair may further enhance the robustness of qRT-PCR in this model system. PMID:20492695
Validation of endogenous reference genes for qRT-PCR analysis of human visceral adipose samples.
Mehta, Rohini; Birerdinc, Aybike; Hossain, Noreen; Afendy, Arian; Chandhoke, Vikas; Younossi, Zobair; Baranova, Ancha
2010-05-21
Given the epidemic proportions of obesity worldwide and the concurrent prevalence of metabolic syndrome, there is an urgent need for better understanding the underlying mechanisms of metabolic syndrome, in particular, the gene expression differences which may participate in obesity, insulin resistance and the associated series of chronic liver conditions. Real-time PCR (qRT-PCR) is the standard method for studying changes in relative gene expression in different tissues and experimental conditions. However, variations in amount of starting material, enzymatic efficiency and presence of inhibitors can lead to quantification errors. Hence the need for accurate data normalization is vital. Among several known strategies for data normalization, the use of reference genes as an internal control is the most common approach. Recent studies have shown that both obesity and presence of insulin resistance influence an expression of commonly used reference genes in omental fat. In this study we validated candidate reference genes suitable for qRT-PCR profiling experiments using visceral adipose samples from obese and lean individuals. Cross-validation of expression stability of eight selected reference genes using three popular algorithms, GeNorm, NormFinder and BestKeeper found ACTB and RPII as most stable reference genes. We recommend ACTB and RPII as stable reference genes most suitable for gene expression studies of human visceral adipose tissue. The use of these genes as a reference pair may further enhance the robustness of qRT-PCR in this model system.
Primary Airway Epithelial Cell Gene Editing Using CRISPR-Cas9.
Everman, Jamie L; Rios, Cydney; Seibold, Max A
2018-01-01
The adaptation of the clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR associated endonuclease 9 (CRISPR-Cas9) machinery from prokaryotic organisms has resulted in a gene editing system that is highly versatile, easily constructed, and can be leveraged to generate human cells knocked out (KO) for a specific gene. While standard transfection techniques can be used for the introduction of CRISPR-Cas9 expression cassettes to many cell types, delivery by this method is not efficient in many primary cell types, including primary human airway epithelial cells (AECs). More efficient delivery in AECs can be achieved through lentiviral-mediated transduction, allowing the CRISPR-Cas9 system to be integrated into the genome of the cell, resulting in stable expression of the nuclease machinery and increasing editing rates. In parallel, advancements have been made in the culture, expansion, selection, and differentiation of AECs, which allow the robust generation of a bulk edited AEC population from transduced cells. Applying these methods, we detail here our latest protocol to generate mucociliary epithelial cultures knocked out for a specific gene from donor-isolated primary human basal airway epithelial cells. This protocol includes methods to: (1) design and generate lentivirus which targets a specific gene for KO with CRISPR-Cas9 machinery, (2) efficiently transduce AECs, (3) culture and select for a bulk edited AEC population, (4) molecularly screen AECs for Cas9 cutting and specific sequence edits, and (5) further expand and differentiate edited cells to a mucociliary airway epithelial culture. The AEC knockouts generated using this protocol provide an excellent primary cell model system with which to characterize the function of genes involved in airway dysfunction and disease.
Yuan, Ming-Long; Zhang, Qi-Lin; Zhang, Li; Jia, Cheng-Lin; Li, Xiao-Peng; Yang, Xing-Zhuo; Feng, Run-Qiu
2018-05-01
Grassland caterpillars (Lepidoptera: Lymantriinae: Gynaephora) are the most important pests in alpine meadows of the Tibetan Plateau (TP) and have well adapted to high-altitude environments. To further understand the evolutionary history and their adaptation to the TP, we newly determined seven complete TP Gynaephora mitogenomes. Compared to single genes, whole mitogenomes provided the best phylogenetic signals and obtained robust results, supporting the monophyly of the TP Gynaephora species and a phylogeny of Arctiinae + (Aganainae + Lymantriinae). Incongruent phylogenetic signals were found among single mitochondrial genes, none of which recovered the same phylogeny as the whole mitogenome. We identified six best-performing single genes using Shimodaira-Hasegawa tests and found that the combinations of rrnS and either cox1 or cox3 generated the same phylogeny as the whole mitogenome, indicating the phylogenetic potential of these three genes for future evolutionary studies of Gynaephora. The TP Gynaephora species were estimated to radiate on the TP during the Pliocene and Quaternary, supporting an association of the diversification and speciation of the TP Gynaephora species with the TP uplifts and associated climate changes during this time. Selection analyses revealed accelerated evolutionary rates of the mitochondrial protein-coding genes in the TP Gynaephora species, suggesting that they accumulated more nonsynonymous substitutions that may benefit their adaptation to high altitudes. Furthermore, signals of positive selection were detected in nad5 of two Gynaephora species with the highest altitude-distributions, indicating that this gene may contribute to Gynaephora's adaptation to divergent altitudes. This study adds to the understanding of the TP Gynaephora evolutionary relationships and suggests a link between mitogenome evolution and ecological adaptation to high-altitude environments in grassland caterpillars. Copyright © 2018 Elsevier Inc. All rights reserved.
Liu, Xin; Guan, Huirui; Song, Min; Fu, Yanping; Han, Xiaomin; Lei, Meng; Ren, Jingyu; Guo, Bin; He, Wei; Wei, Yahui
2018-01-01
Stellera chamaejasme Linn, an important poisonous plant of the China grassland, is toxic to humans and livestock. The rapid expansion of S. chamaejasme has greatly damaged the grassland ecology and, consequently, seriously endangered the development of animal husbandry. To draft efficient prevention and control measures, it has become more urgent to carry out research on its adaptive and expansion mechanisms in different unfavorable habitats at the genetic level. Quantitative real-time polymerase chain reaction (qRT-PCR) is a widely used technique for studying gene expression at the transcript level; however, qRT-PCR requires reference genes (RGs) as endogenous controls for data normalization and only through appropriate RG selection and qRT-PCR can we guarantee the reliability and robustness of expression studies and RNA-seq data analysis. Unfortunately, little research on the selection of RGs for gene expression data normalization in S. chamaejasme has been reported. In this study, 10 candidate RGs namely, 18S , 60S , CYP , GAPCP1 , GAPDH2 , EF1B , MDH , SAND , TUA1 , and TUA6 , were singled out from the transcriptome database of S. chamaejasme , and their expression stability under three abiotic stresses (drought, cold, and salt) and three hormone treatments (abscisic acid, ABA; gibberellin, GA; ethephon, ETH) were estimated with the programs geNorm, NormFinder, and BestKeeper. Our results showed that GAPCP1 and EF1B were the best combination for the three abiotic stresses, whereas TUA6 and SAND , TUA1 and CYP , GAPDH2 and 60S were the best choices for ABA, GA, and ETH treatment, respectively. Moreover, GAPCP1 and 60S were assessed to be the best combination for all samples, and 18S was the least stable RG for use as an internal control in all of the experimental subsets. The expression patterns of two target genes ( P5CS2 and GI ) further verified that the RGs that we selected were suitable for gene expression normalization. This work is the first attempt to comprehensively estimate the stability of RGs in S. chamaejasme . Our results provide suitable RGs for high-precision normalization in qRT-PCR analysis, thereby making it more convenient to analyze gene expression under these experimental conditions.
A Luciferase Reporter Gene System for High-Throughput Screening of γ-Globin Gene Activators.
Xie, Wensheng; Silvers, Robert; Ouellette, Michael; Wu, Zining; Lu, Quinn; Li, Hu; Gallagher, Kathleen; Johnson, Kathy; Montoute, Monica
2016-01-01
Luciferase reporter gene assays have long been used for drug discovery due to their high sensitivity and robust signal. A dual reporter gene system contains a gene of interest and a control gene to monitor non-specific effects on gene expression. In our dual luciferase reporter gene system, a synthetic promoter of γ-globin gene was constructed immediately upstream of the firefly luciferase gene, followed downstream by a synthetic β-globin gene promoter in front of the Renilla luciferase gene. A stable cell line with the dual reporter gene was cloned and used for all assay development and HTS work. Due to the low activity of the control Renilla luciferase, only the firefly luciferase activity was further optimized for HTS. Several critical factors, such as cell density, serum concentration, and miniaturization, were optimized using tool compounds to achieve maximum robustness and sensitivity. Using the optimized reporter assay, the HTS campaign was successfully completed and approximately 1000 hits were identified. In this chapter, we also describe strategies to triage hits that non-specifically interfere with firefly luciferase.
Some scale-free networks could be robust under selective node attacks
NASA Astrophysics Data System (ADS)
Zheng, Bojin; Huang, Dan; Li, Deyi; Chen, Guisheng; Lan, Wenfei
2011-04-01
It is a mainstream idea that scale-free network would be fragile under the selective attacks. Internet is a typical scale-free network in the real world, but it never collapses under the selective attacks of computer viruses and hackers. This phenomenon is different from the deduction of the idea above because this idea assumes the same cost to delete an arbitrary node. Hence this paper discusses the behaviors of the scale-free network under the selective node attack with different cost. Through the experiments on five complex networks, we show that the scale-free network is possibly robust under the selective node attacks; furthermore, the more compact the network is, and the larger the average degree is, then the more robust the network is; with the same average degrees, the more compact the network is, the more robust the network is. This result would enrich the theory of the invulnerability of the network, and can be used to build robust social, technological and biological networks, and also has the potential to find the target of drugs.
MIYAGAWA, Shuji; MATSUNARI, Hitomi; WATANABE, Masahito; NAKANO, Kazuaki; UMEYAMA, Kazuhiro; SAKAI, Rieko; TAKAYANAGI, Shuko; TAKEISHI, Toki; FUKUDA, Tooru; YASHIMA, Sayaka; MAEDA, Akira; EGUCHI, Hiroshi; OKUYAMA, Hiroomi; NAGAYA, Masaki; NAGASHIMA, Hiroshi
2015-01-01
Zinc-finger nucleases (ZFNs) and transcription activator-like effector nucleases (TALENs) are new tools for producing gene knockout (KO) animals. The current study reports produced genetically modified pigs, in which two endogenous genes were knocked out. Porcine fibroblast cell lines were derived from homozygous α1,3-galactosyltransferase (GalT) KO pigs. These cells were subjected to an additional KO for the cytidine monophospho-N-acetylneuraminic acid hydroxylase (CMAH) gene. A pair of ZFN-encoding mRNAs targeting exon 8 of the CMAH gene was used to generate the heterozygous CMAH KO cells, from which cloned pigs were produced by somatic cell nuclear transfer (SCNT). One of the cloned pigs obtained was re-cloned after additional KO of the remaining CMAH allele using the same ZFN-encoding mRNAs to generate GalT/CMAH-double homozygous KO pigs. On the other hand, the use of TALEN-encoding mRNAs targeting exon 7 of the CMAH gene resulted in efficient generation of homozygous CMAH KO cells. These cells were used for SCNT to produce cloned pigs homozygous for a double GalT/CMAH KO. These results demonstrate that the combination of TALEN-encoding mRNA, in vitro selection of the nuclear donor cells and SCNT provides a robust method for generating KO pigs. PMID:26227017
Miyagawa, Shuji; Matsunari, Hitomi; Watanabe, Masahito; Nakano, Kazuaki; Umeyama, Kazuhiro; Sakai, Rieko; Takayanagi, Shuko; Takeishi, Toki; Fukuda, Tooru; Yashima, Sayaka; Maeda, Akira; Eguchi, Hiroshi; Okuyama, Hiroomi; Nagaya, Masaki; Nagashima, Hiroshi
2015-01-01
Zinc-finger nucleases (ZFNs) and transcription activator-like effector nucleases (TALENs) are new tools for producing gene knockout (KO) animals. The current study reports produced genetically modified pigs, in which two endogenous genes were knocked out. Porcine fibroblast cell lines were derived from homozygous α1,3-galactosyltransferase (GalT) KO pigs. These cells were subjected to an additional KO for the cytidine monophospho-N-acetylneuraminic acid hydroxylase (CMAH) gene. A pair of ZFN-encoding mRNAs targeting exon 8 of the CMAH gene was used to generate the heterozygous CMAH KO cells, from which cloned pigs were produced by somatic cell nuclear transfer (SCNT). One of the cloned pigs obtained was re-cloned after additional KO of the remaining CMAH allele using the same ZFN-encoding mRNAs to generate GalT/CMAH-double homozygous KO pigs. On the other hand, the use of TALEN-encoding mRNAs targeting exon 7 of the CMAH gene resulted in efficient generation of homozygous CMAH KO cells. These cells were used for SCNT to produce cloned pigs homozygous for a double GalT/CMAH KO. These results demonstrate that the combination of TALEN-encoding mRNA, in vitro selection of the nuclear donor cells and SCNT provides a robust method for generating KO pigs.
Robust dynamics in minimal hybrid models of genetic networks
Perkins, Theodore J.; Wilds, Roy; Glass, Leon
2010-01-01
Many gene-regulatory networks necessarily display robust dynamics that are insensitive to noise and stable under evolution. We propose that a class of hybrid systems can be used to relate the structure of these networks to their dynamics and provide insight into the origin of robustness. In these systems, the genes are represented by logical functions, and the controlling transcription factor protein molecules are real variables, which are produced and destroyed. As the transcription factor concentrations cross thresholds, they control the production of other transcription factors. We discuss mathematical analysis of these systems and show how the concepts of robustness and minimality can be used to generate putative logical organizations based on observed symbolic sequences. We apply the methods to control of the cell cycle in yeast. PMID:20921006
Robust dynamics in minimal hybrid models of genetic networks.
Perkins, Theodore J; Wilds, Roy; Glass, Leon
2010-11-13
Many gene-regulatory networks necessarily display robust dynamics that are insensitive to noise and stable under evolution. We propose that a class of hybrid systems can be used to relate the structure of these networks to their dynamics and provide insight into the origin of robustness. In these systems, the genes are represented by logical functions, and the controlling transcription factor protein molecules are real variables, which are produced and destroyed. As the transcription factor concentrations cross thresholds, they control the production of other transcription factors. We discuss mathematical analysis of these systems and show how the concepts of robustness and minimality can be used to generate putative logical organizations based on observed symbolic sequences. We apply the methods to control of the cell cycle in yeast.
Preprocessing of gene expression data by optimally robust estimators
2010-01-01
Background The preprocessing of gene expression data obtained from several platforms routinely includes the aggregation of multiple raw signal intensities to one expression value. Examples are the computation of a single expression measure based on the perfect match (PM) and mismatch (MM) probes for the Affymetrix technology, the summarization of bead level values to bead summary values for the Illumina technology or the aggregation of replicated measurements in the case of other technologies including real-time quantitative polymerase chain reaction (RT-qPCR) platforms. The summarization of technical replicates is also performed in other "-omics" disciplines like proteomics or metabolomics. Preprocessing methods like MAS 5.0, Illumina's default summarization method, RMA, or VSN show that the use of robust estimators is widely accepted in gene expression analysis. However, the selection of robust methods seems to be mainly driven by their high breakdown point and not by efficiency. Results We describe how optimally robust radius-minimax (rmx) estimators, i.e. estimators that minimize an asymptotic maximum risk on shrinking neighborhoods about an ideal model, can be used for the aggregation of multiple raw signal intensities to one expression value for Affymetrix and Illumina data. With regard to the Affymetrix data, we have implemented an algorithm which is a variant of MAS 5.0. Using datasets from the literature and Monte-Carlo simulations we provide some reasoning for assuming approximate log-normal distributions of the raw signal intensities by means of the Kolmogorov distance, at least for the discussed datasets, and compare the results of our preprocessing algorithms with the results of Affymetrix's MAS 5.0 and Illumina's default method. The numerical results indicate that when using rmx estimators an accuracy improvement of about 10-20% is obtained compared to Affymetrix's MAS 5.0 and about 1-5% compared to Illumina's default method. The improvement is also visible in the analysis of technical replicates where the reproducibility of the values (in terms of Pearson and Spearman correlation) is increased for all Affymetrix and almost all Illumina examples considered. Our algorithms are implemented in the R package named RobLoxBioC which is publicly available via CRAN, The Comprehensive R Archive Network (http://cran.r-project.org/web/packages/RobLoxBioC/). Conclusions Optimally robust rmx estimators have a high breakdown point and are computationally feasible. They can lead to a considerable gain in efficiency for well-established bioinformatics procedures and thus, can increase the reproducibility and power of subsequent statistical analysis. PMID:21118506
Multilevel selection analysis of a microbial social trait
de Vargas Roditi, Laura; Boyle, Kerry E; Xavier, Joao B
2013-01-01
The study of microbial communities often leads to arguments for the evolution of cooperation due to group benefits. However, multilevel selection models caution against the uncritical assumption that group benefits will lead to the evolution of cooperation. We analyze a microbial social trait to precisely define the conditions favoring cooperation. We combine the multilevel partition of the Price equation with a laboratory model system: swarming in Pseudomonas aeruginosa. We parameterize a population dynamics model using competition experiments where we manipulate expression, and therefore the cost-to-benefit ratio of swarming cooperation. Our analysis shows that multilevel selection can favor costly swarming cooperation because it causes population expansion. However, due to high costs and diminishing returns constitutive cooperation can only be favored by natural selection when relatedness is high. Regulated expression of cooperative genes is a more robust strategy because it provides the benefits of swarming expansion without the high cost or the diminishing returns. Our analysis supports the key prediction that strong group selection does not necessarily mean that microbial cooperation will always emerge. PMID:23959025
Convergent evolution of Darwin's finches caused by introgressive hybridization and selection.
Grant, Peter R; Grant, B Rosemary; Markert, Jeffrey A; Keller, Lukas F; Petren, K
2004-07-01
Between 1973 and 2003 mean morphological features of the cactus finch, Geospiza scandens, and the medium ground finch, G. fortis, populations on the Galápagos island of Daphne Major were subject to fluctuating directional selection. An increase in bluntness or robustness in the beak of G. scandens after 1990 can only partly be explained by selection. We use 16 microsatellite loci to test predictions of the previously proposed hypothesis that introgressive hybridization contributed to the trend, resulting in genes flowing predominantly from G. fortis to G. scandens. To identify F1 hybrids and backcrosses we use pedigrees where known, supplemented by the results of assignment tests based on 14 autosomal loci when parents were not known. We analyze changes in morphology and allelic composition in the two populations over a period of 15-20 years. With samples that included F1 hybrids and backcrosses, the G. scandens population became more similar to the G. fortis population both genetically and morphologically. Gene flow between species was estimated to be three times greater from G. fortis to G. scandens than in the opposite direction, resulting in a 20% reduction in the genetic difference between the species. Nevertheless, removing identified F1 hybrids and backcrosses from the total sample and reanalyzing the traits did not eliminate the convergence. The two species also converged in beak shape by 22.2% and in body size by 45.5%. A combination of introgressive hybridization and selection jointly provide the best explanation of convergence in morphology and genetic constitution under the changed ecological conditions following a major El Niño event in 1983. The study illustrates how species without postmating barriers to gene exchange can alternate between convergence and divergence when environmental conditions oscillate.
Webster, A. Francina; Chepelev, Nikolai; Gagné, Rémi; Kuo, Byron; Recio, Leslie; Williams, Andrew; Yauk, Carole L.
2015-01-01
Many regulatory agencies are exploring ways to integrate toxicogenomic data into their chemical risk assessments. The major challenge lies in determining how to distill the complex data produced by high-content, multi-dose gene expression studies into quantitative information. It has been proposed that benchmark dose (BMD) values derived from toxicogenomics data be used as point of departure (PoD) values in chemical risk assessments. However, there is limited information regarding which genomics platforms are most suitable and how to select appropriate PoD values. In this study, we compared BMD values modeled from RNA sequencing-, microarray-, and qPCR-derived gene expression data from a single study, and explored multiple approaches for selecting a single PoD from these data. The strategies evaluated include several that do not require prior mechanistic knowledge of the compound for selection of the PoD, thus providing approaches for assessing data-poor chemicals. We used RNA extracted from the livers of female mice exposed to non-carcinogenic (0, 2 mg/kg/day, mkd) and carcinogenic (4, 8 mkd) doses of furan for 21 days. We show that transcriptional BMD values were consistent across technologies and highly predictive of the two-year cancer bioassay-based PoD. We also demonstrate that filtering data based on statistically significant changes in gene expression prior to BMD modeling creates more conservative BMD values. Taken together, this case study on mice exposed to furan demonstrates that high-content toxicogenomics studies produce robust data for BMD modelling that are minimally affected by inter-technology variability and highly predictive of cancer-based PoD doses. PMID:26313361
Che Omar, Sarena; Bentley, Michael A; Morieri, Giulia; Preston, Gail M; Gurr, Sarah J
2016-01-01
The rice blast fungus causes significant annual harvest losses. It also serves as a genetically-tractable model to study fungal ingress. Whilst pathogenicity determinants have been unmasked and changes in global gene expression described, we know little about Magnaporthe oryzae cell wall remodelling. Our interests, in wall remodelling genes expressed during infection, vegetative growth and under exogenous wall stress, demand robust choice of reference genes for quantitative Real Time-PCR (qRT-PCR) data normalisation. We describe the expression stability of nine candidate reference genes profiled by qRT-PCR with cDNAs derived during asexual germling development, from sexual stage perithecia and from vegetative mycelium grown under various exogenous stressors. Our Minimum Information for Publication of qRT-PCR Experiments (MIQE) compliant analysis reveals a set of robust reference genes used to track changes in the expression of the cell wall remodelling gene MGG_Crh2 (MGG_00592). We ranked nine candidate reference genes by their expression stability (M) and report the best gene combination needed for reliable gene expression normalisation, when assayed in three tissue groups (Infective, Vegetative, and Global) frequently used in M. oryzae expression studies. We found that MGG_Actin (MGG_03982) and the 40S 27a ribosomal subunit MGG_40s (MGG_02872) proved to be robust reference genes for the Infection group and MGG_40s and MGG_Ef1 (Elongation Factor1-α) for both Vegetative and Global groups. Using the above validated reference genes, M. oryzae MGG_Crh2 expression was found to be significantly (p<0.05) elevated three-fold during vegetative growth as compared with dormant spores and two fold higher under cell wall stress (Congo Red) compared to growth under optimal conditions. We recommend the combinatorial use of two reference genes, belonging to the cytoskeleton and ribosomal synthesis functional groups, MGG_Actin, MGG_40s, MGG_S8 (Ribosomal subunit 40S S8) or MGG_Ef1, which demonstrated low M values across heterogeneous tissues. By contrast, metabolic pathway genes MGG_Fad (FAD binding domain-containing protein) and MGG_Gapdh (Glyceraldehyde-3-phosphate dehydrogenase) performed poorly, due to their lack of expression stability across samples.
Zuo, Zhenqiang; Gong, Ting; Che, You; Liu, Ruihua; Xu, Ping; Jiang, Hong; Qiao, Chuanling; Song, Cunjiang; Yang, Chao
2015-06-01
Agricultural soils are usually co-contaminated with organophosphate (OP) and pyrethroid pesticides. To develop a stable and marker-free Pseudomonas putida for co-expression of two pesticide-degrading enzymes, we constructed a suicide plasmid with expression cassettes containing a constitutive promoter J23119, an OP-degrading gene (mpd), a pyrethroid-hydrolyzing carboxylesterase gene (pytH) that utilizes the upp gene as a counter-selectable marker for upp-deficient P. putida. By introduction of suicide plasmid and two-step homologous recombination, both mpd and pytH genes were integrated into the chromosome of a robust soil bacterium P. putida KT2440 and no selection marker was left on chromosome. Functional expression of mpd and pytH in P. putida KT2440 was demonstrated by Western blot analysis and enzyme activity assays. Degradation experiments with liquid cultures showed that the mixed pesticides including methyl parathion, fenitrothion, chlorpyrifos, permethrin, fenpropathrin, and cypermethrin (0.2 mM each) were degraded completely within 48 h. The inoculation of engineered strain (10(6) cells/g) to soils treated with the above mixed pesticides resulted in a higher degradation rate than in noninoculated soils. All six pesticides could be degraded completely within 15 days in fumigated and nonfumigated soils with inoculation. Theses results highlight the potential of the engineered strain to be used for in situ bioremediation of soils co-contaminated with OP and pyrethroid pesticides.
Matarazzo, Valery; Muscatelli, Françoise
2013-01-01
Genomic imprinting is a normal process of epigenetic regulation leading some autosomal genes to be expressed from one parental allele only, the other parental allele being silenced. The reasons why this mechanism has been selected throughout evolution are not clear; however, expression dosage is critical for imprinted genes. There is a paradox between the fact that genomic imprinting is a robust mechanism controlling the expression of specific genes and the fact that this mechanism is based on epigenetic regulation that, per se, should present some flexibility. The robustness has been well studied, revealing the epigenetic modifications at the imprinted locus, but the flexibility has been poorly investigated. Prader-Willi syndrome is the best-studied disease involving imprinted genes caused by the absence of expression of paternally inherited alleles of genes located in the human 15q11-q13 region. Until now, the silencing of the maternally inherited alleles was like a dogma. Rieusset et al. showed that in absence of the paternal Ndn allele, in Ndn +m/-p mice, the maternal Ndn allele is expressed at an extremely low level with a high degree of non-genetic heterogeneity. In about 50% of these mutant mice, this stochastic expression reduces birth lethality and severity of the breathing deficiency, correlated with a reduction in the loss of serotonergic neurons. Furthermore, using several mouse models, they reveal a competition between non-imprinted Ndn promoters, which results in monoallelic (paternal or maternal) Ndn expression, suggesting that Ndn monoallelic expression occurs in the absence of imprinting regulation. Importantly, specific expression of the maternal NDN allele is also detected in post-mortem brain samples of PWS individuals. Here, similar expression of the Magel2 maternal allele is reported in Magel2 +m/-p mice, suggesting that this loss of imprinting can be extended to other PWS genes. These data reveal an unexpected epigenetic flexibility of PWS imprinted genes that could be exploited to reactivate the functional but dormant maternal alleles in PWS.
Gálvez, Juan Manuel; Castillo, Daniel; Herrera, Luis Javier; San Román, Belén; Valenzuela, Olga; Ortuño, Francisco Manuel; Rojas, Ignacio
2018-01-01
Most of the research studies developed applying microarray technology to the characterization of different pathological states of any disease may fail in reaching statistically significant results. This is largely due to the small repertoire of analysed samples, and to the limitation in the number of states or pathologies usually addressed. Moreover, the influence of potential deviations on the gene expression quantification is usually disregarded. In spite of the continuous changes in omic sciences, reflected for instance in the emergence of new Next-Generation Sequencing-related technologies, the existing availability of a vast amount of gene expression microarray datasets should be properly exploited. Therefore, this work proposes a novel methodological approach involving the integration of several heterogeneous skin cancer series, and a later multiclass classifier design. This approach is thus a way to provide the clinicians with an intelligent diagnosis support tool based on the use of a robust set of selected biomarkers, which simultaneously distinguishes among different cancer-related skin states. To achieve this, a multi-platform combination of microarray datasets from Affymetrix and Illumina manufacturers was carried out. This integration is expected to strengthen the statistical robustness of the study as well as the finding of highly-reliable skin cancer biomarkers. Specifically, the designed operation pipeline has allowed the identification of a small subset of 17 differentially expressed genes (DEGs) from which to distinguish among 7 involved skin states. These genes were obtained from the assessment of a number of potential batch effects on the gene expression data. The biological interpretation of these genes was inspected in the specific literature to understand their underlying information in relation to skin cancer. Finally, in order to assess their possible effectiveness in cancer diagnosis, a cross-validation Support Vector Machines (SVM)-based classification including feature ranking was performed. The accuracy attained exceeded the 92% in overall recognition of the 7 different cancer-related skin states. The proposed integration scheme is expected to allow the co-integration with other state-of-the-art technologies such as RNA-seq.
Rouillard, Andrew D; Hurle, Mark R; Agarwal, Pankaj
2018-05-01
Target selection is the first and pivotal step in drug discovery. An incorrect choice may not manifest itself for many years after hundreds of millions of research dollars have been spent. We collected a set of 332 targets that succeeded or failed in phase III clinical trials, and explored whether Omic features describing the target genes could predict clinical success. We obtained features from the recently published comprehensive resource: Harmonizome. Nineteen features appeared to be significantly correlated with phase III clinical trial outcomes, but only 4 passed validation schemes that used bootstrapping or modified permutation tests to assess feature robustness and generalizability while accounting for target class selection bias. We also used classifiers to perform multivariate feature selection and found that classifiers with a single feature performed as well in cross-validation as classifiers with more features (AUROC = 0.57 and AUPR = 0.81). The two predominantly selected features were mean mRNA expression across tissues and standard deviation of expression across tissues, where successful targets tended to have lower mean expression and higher expression variance than failed targets. This finding supports the conventional wisdom that it is favorable for a target to be present in the tissue(s) affected by a disease and absent from other tissues. Overall, our results suggest that it is feasible to construct a model integrating interpretable target features to inform target selection. We anticipate deeper insights and better models in the future, as researchers can reuse the data we have provided to improve methods for handling sample biases and learn more informative features. Code, documentation, and data for this study have been deposited on GitHub at https://github.com/arouillard/omic-features-successful-targets.
Functional cohesion of gene sets determined by latent semantic indexing of PubMed abstracts.
Xu, Lijing; Furlotte, Nicholas; Lin, Yunyue; Heinrich, Kevin; Berry, Michael W; George, Ebenezer O; Homayouni, Ramin
2011-04-14
High-throughput genomic technologies enable researchers to identify genes that are co-regulated with respect to specific experimental conditions. Numerous statistical approaches have been developed to identify differentially expressed genes. Because each approach can produce distinct gene sets, it is difficult for biologists to determine which statistical approach yields biologically relevant gene sets and is appropriate for their study. To address this issue, we implemented Latent Semantic Indexing (LSI) to determine the functional coherence of gene sets. An LSI model was built using over 1 million Medline abstracts for over 20,000 mouse and human genes annotated in Entrez Gene. The gene-to-gene LSI-derived similarities were used to calculate a literature cohesion p-value (LPv) for a given gene set using a Fisher's exact test. We tested this method against genes in more than 6,000 functional pathways annotated in Gene Ontology (GO) and found that approximately 75% of gene sets in GO biological process category and 90% of the gene sets in GO molecular function and cellular component categories were functionally cohesive (LPv<0.05). These results indicate that the LPv methodology is both robust and accurate. Application of this method to previously published microarray datasets demonstrated that LPv can be helpful in selecting the appropriate feature extraction methods. To enable real-time calculation of LPv for mouse or human gene sets, we developed a web tool called Gene-set Cohesion Analysis Tool (GCAT). GCAT can complement other gene set enrichment approaches by determining the overall functional cohesion of data sets, taking into account both explicit and implicit gene interactions reported in the biomedical literature. GCAT is freely available at http://binf1.memphis.edu/gcat.
NASA Astrophysics Data System (ADS)
Song, Yunquan; Lin, Lu; Jian, Ling
2016-07-01
Single-index varying-coefficient model is an important mathematical modeling method to model nonlinear phenomena in science and engineering. In this paper, we develop a variable selection method for high-dimensional single-index varying-coefficient models using a shrinkage idea. The proposed procedure can simultaneously select significant nonparametric components and parametric components. Under defined regularity conditions, with appropriate selection of tuning parameters, the consistency of the variable selection procedure and the oracle property of the estimators are established. Moreover, due to the robustness of the check loss function to outliers in the finite samples, our proposed variable selection method is more robust than the ones based on the least squares criterion. Finally, the method is illustrated with numerical simulations.
Robust and sparse correlation matrix estimation for the analysis of high-dimensional genomics data.
Serra, Angela; Coretto, Pietro; Fratello, Michele; Tagliaferri, Roberto; Stegle, Oliver
2018-02-15
Microarray technology can be used to study the expression of thousands of genes across a number of different experimental conditions, usually hundreds. The underlying principle is that genes sharing similar expression patterns, across different samples, can be part of the same co-expression system, or they may share the same biological functions. Groups of genes are usually identified based on cluster analysis. Clustering methods rely on the similarity matrix between genes. A common choice to measure similarity is to compute the sample correlation matrix. Dimensionality reduction is another popular data analysis task which is also based on covariance/correlation matrix estimates. Unfortunately, covariance/correlation matrix estimation suffers from the intrinsic noise present in high-dimensional data. Sources of noise are: sampling variations, presents of outlying sample units, and the fact that in most cases the number of units is much larger than the number of genes. In this paper, we propose a robust correlation matrix estimator that is regularized based on adaptive thresholding. The resulting method jointly tames the effects of the high-dimensionality, and data contamination. Computations are easy to implement and do not require hand tunings. Both simulated and real data are analyzed. A Monte Carlo experiment shows that the proposed method is capable of remarkable performances. Our correlation metric is more robust to outliers compared with the existing alternatives in two gene expression datasets. It is also shown how the regularization allows to automatically detect and filter spurious correlations. The same regularization is also extended to other less robust correlation measures. Finally, we apply the ARACNE algorithm on the SyNTreN gene expression data. Sensitivity and specificity of the reconstructed network is compared with the gold standard. We show that ARACNE performs better when it takes the proposed correlation matrix estimator as input. The R software is available at https://github.com/angy89/RobustSparseCorrelation. aserra@unisa.it or robtag@unisa.it. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Almli, Lynn M; Duncan, Richard; Feng, Hao; Ghosh, Debashis; Binder, Elisabeth B; Bradley, Bekh; Ressler, Kerry J; Conneely, Karen N; Epstein, Michael P
2014-12-01
Genetic association studies of psychiatric outcomes often consider interactions with environmental exposures and, in particular, apply tests that jointly consider gene and gene-environment interaction effects for analysis. Using a genome-wide association study (GWAS) of posttraumatic stress disorder (PTSD), we report that heteroscedasticity (defined as variability in outcome that differs by the value of the environmental exposure) can invalidate traditional joint tests of gene and gene-environment interaction. To identify the cause of bias in traditional joint tests of gene and gene-environment interaction in a PTSD GWAS and determine whether proposed robust joint tests are insensitive to this problem. The PTSD GWAS data set consisted of 3359 individuals (978 men and 2381 women) from the Grady Trauma Project (GTP), a cohort study from Atlanta, Georgia. The GTP performed genome-wide genotyping of participants and collected environmental exposures using the Childhood Trauma Questionnaire and Trauma Experiences Inventory. We performed joint interaction testing of the Beck Depression Inventory and modified PTSD Symptom Scale in the GTP GWAS. We assessed systematic bias in our interaction analyses using quantile-quantile plots and genome-wide inflation factors. Application of the traditional joint interaction test to the GTP GWAS yielded systematic inflation across different outcomes and environmental exposures (inflation-factor estimates ranging from 1.07 to 1.21), whereas application of the robust joint test to the same data set yielded no such inflation (inflation-factor estimates ranging from 1.01 to 1.02). Simulated data further revealed that the robust joint test is valid in different heteroscedasticity models, whereas the traditional joint test is invalid. The robust joint test also has power similar to the traditional joint test when heteroscedasticity is not an issue. We believe the robust joint test should be used in candidate-gene studies and GWASs of psychiatric outcomes that consider environmental interactions. To make the procedure useful for applied investigators, we created a software tool that can be called from the popular PLINK package for analysis.
Correcting Systematic Inflation in Genetic Association Tests That Consider Interaction Effects
Almli, Lynn M.; Duncan, Richard; Feng, Hao; Ghosh, Debashis; Binder, Elisabeth B.; Bradley, Bekh; Ressler, Kerry J.; Conneely, Karen N.; Epstein, Michael P.
2015-01-01
IMPORTANCE Genetic association studies of psychiatric outcomes often consider interactions with environmental exposures and, in particular, apply tests that jointly consider gene and gene-environment interaction effects for analysis. Using a genome-wide association study (GWAS) of posttraumatic stress disorder (PTSD), we report that heteroscedasticity (defined as variability in outcome that differs by the value of the environmental exposure) can invalidate traditional joint tests of gene and gene-environment interaction. OBJECTIVES To identify the cause of bias in traditional joint tests of gene and gene-environment interaction in a PTSD GWAS and determine whether proposed robust joint tests are insensitive to this problem. DESIGN, SETTING, AND PARTICIPANTS The PTSD GWAS data set consisted of 3359 individuals (978 men and 2381 women) from the Grady Trauma Project (GTP), a cohort study from Atlanta, Georgia. The GTP performed genome-wide genotyping of participants and collected environmental exposures using the Childhood Trauma Questionnaire and Trauma Experiences Inventory. MAIN OUTCOMES AND MEASURES We performed joint interaction testing of the Beck Depression Inventory and modified PTSD Symptom Scale in the GTP GWAS. We assessed systematic bias in our interaction analyses using quantile-quantile plots and genome-wide inflation factors. RESULTS Application of the traditional joint interaction test to the GTP GWAS yielded systematic inflation across different outcomes and environmental exposures (inflation-factor estimates ranging from 1.07 to 1.21), whereas application of the robust joint test to the same data set yielded no such inflation (inflation-factor estimates ranging from 1.01 to 1.02). Simulated data further revealed that the robust joint test is valid in different heteroscedasticity models, whereas the traditional joint test is invalid. The robust joint test also has power similar to the traditional joint test when heteroscedasticity is not an issue. CONCLUSIONS AND RELEVANCE We believe the robust joint test should be used in candidate-gene studies and GWASs of psychiatric outcomes that consider environmental interactions. To make the procedure useful for applied investigators, we created a software tool that can be called from the popular PLINK package for analysis. PMID:25354142
Manual Gene Ontology annotation workflow at the Mouse Genome Informatics Database
Drabkin, Harold J.; Blake, Judith A.
2012-01-01
The Mouse Genome Database, the Gene Expression Database and the Mouse Tumor Biology database are integrated components of the Mouse Genome Informatics (MGI) resource (http://www.informatics.jax.org). The MGI system presents both a consensus view and an experimental view of the knowledge concerning the genetics and genomics of the laboratory mouse. From genotype to phenotype, this information resource integrates information about genes, sequences, maps, expression analyses, alleles, strains and mutant phenotypes. Comparative mammalian data are also presented particularly in regards to the use of the mouse as a model for the investigation of molecular and genetic components of human diseases. These data are collected from literature curation as well as downloads of large datasets (SwissProt, LocusLink, etc.). MGI is one of the founding members of the Gene Ontology (GO) and uses the GO for functional annotation of genes. Here, we discuss the workflow associated with manual GO annotation at MGI, from literature collection to display of the annotations. Peer-reviewed literature is collected mostly from a set of journals available electronically. Selected articles are entered into a master bibliography and indexed to one of eight areas of interest such as ‘GO’ or ‘homology’ or ‘phenotype’. Each article is then either indexed to a gene already contained in the database or funneled through a separate nomenclature database to add genes. The master bibliography and associated indexing provide information for various curator-reports such as ‘papers selected for GO that refer to genes with NO GO annotation’. Once indexed, curators who have expertise in appropriate disciplines enter pertinent information. MGI makes use of several controlled vocabularies that ensure uniform data encoding, enable robust analysis and support the construction of complex queries. These vocabularies range from pick-lists to structured vocabularies such as the GO. All data associations are supported with statements of evidence as well as access to source publications. PMID:23110975
Manual Gene Ontology annotation workflow at the Mouse Genome Informatics Database.
Drabkin, Harold J; Blake, Judith A
2012-01-01
The Mouse Genome Database, the Gene Expression Database and the Mouse Tumor Biology database are integrated components of the Mouse Genome Informatics (MGI) resource (http://www.informatics.jax.org). The MGI system presents both a consensus view and an experimental view of the knowledge concerning the genetics and genomics of the laboratory mouse. From genotype to phenotype, this information resource integrates information about genes, sequences, maps, expression analyses, alleles, strains and mutant phenotypes. Comparative mammalian data are also presented particularly in regards to the use of the mouse as a model for the investigation of molecular and genetic components of human diseases. These data are collected from literature curation as well as downloads of large datasets (SwissProt, LocusLink, etc.). MGI is one of the founding members of the Gene Ontology (GO) and uses the GO for functional annotation of genes. Here, we discuss the workflow associated with manual GO annotation at MGI, from literature collection to display of the annotations. Peer-reviewed literature is collected mostly from a set of journals available electronically. Selected articles are entered into a master bibliography and indexed to one of eight areas of interest such as 'GO' or 'homology' or 'phenotype'. Each article is then either indexed to a gene already contained in the database or funneled through a separate nomenclature database to add genes. The master bibliography and associated indexing provide information for various curator-reports such as 'papers selected for GO that refer to genes with NO GO annotation'. Once indexed, curators who have expertise in appropriate disciplines enter pertinent information. MGI makes use of several controlled vocabularies that ensure uniform data encoding, enable robust analysis and support the construction of complex queries. These vocabularies range from pick-lists to structured vocabularies such as the GO. All data associations are supported with statements of evidence as well as access to source publications.
Lachowiec, Jennifer; Queitsch, Christine; Kliebenstein, Daniel J.
2016-01-01
Background Robustness to genetic and environmental perturbation is a salient feature of multicellular organisms. Loss of developmental robustness can lead to severe phenotypic defects and fitness loss. However, perfect robustness, i.e. no variation at all, is evolutionarily unfit as organisms must be able to change phenotype to properly respond to changing environments and biotic challenges. Plasticity is the ability to adjust phenotypes predictably in response to specific environmental stimuli, which can be considered a transient shift allowing an organism to move from one robust phenotypic state to another. Plants, as sessile organisms that undergo continuous development, are particularly dependent on an exquisite fine-tuning of the processes that balance robustness and plasticity to maximize fitness. Scope and Conclusions This paper reviews recently identified mechanisms, both systems-level and molecular, that modulate robustness, and discusses their implications for the optimization of plant fitness. Robustness in living systems arises from the structure of genetic networks, the specific molecular functions of the underlying genes, and their interactions. This very same network responsible for the robustness of specific developmental states also has to be built such that it enables plastic yet robust shifts in response to environmental changes. In plants, the interactions and functions of signal transduction pathways activated by phytohormones and the tendency for plants to tolerate whole-genome duplications, tandem gene duplication and hybridization are emerging as major regulators of robustness in development. Despite their obvious implications for plant evolution and plant breeding, the mechanistic underpinnings by which plants modulate precise levels of robustness, plasticity and evolvability in networks controlling different phenotypes are under-studied. PMID:26473020
Regulation of behaviorally associated gene networks in worker honey bee ovaries
Wang, Ying; Kocher, Sarah D.; Linksvayer, Timothy A.; Grozinger, Christina M.; Page, Robert E.; Amdam, Gro V.
2012-01-01
SUMMARY Several lines of evidence support genetic links between ovary size and division of labor in worker honey bees. However, it is largely unknown how ovaries influence behavior. To address this question, we first performed transcriptional profiling on worker ovaries from two genotypes that differ in social behavior and ovary size. Then, we contrasted the differentially expressed ovarian genes with six sets of available brain transcriptomes. Finally, we probed behavior-related candidate gene networks in wild-type ovaries of different sizes. We found differential expression in 2151 ovarian transcripts in these artificially selected honey bee strains, corresponding to approximately 20.3% of the predicted gene set of honey bees. Differences in gene expression overlapped significantly with changes in the brain transcriptomes. Differentially expressed genes were associated with neural signal transmission (tyramine receptor, TYR) and ecdysteroid signaling; two independently tested nuclear hormone receptors (HR46 and ftz-f1) were also significantly correlated with ovary size in wild-type bees. We suggest that the correspondence between ovary and brain transcriptomes identified here indicates systemic regulatory networks among hormones (juvenile hormone and ecdysteroids), pheromones (queen mandibular pheromone), reproductive organs and nervous tissues in worker honey bees. Furthermore, robust correlations between ovary size and neuraland endocrine response genes are consistent with the hypothesized roles of the ovaries in honey bee behavioral regulation. PMID:22162860
An Efficient Method for Generation of Knockout Human Embryonic Stem Cells Using CRISPR/Cas9 System.
Bohaciakova, Dasa; Renzova, Tereza; Fedorova, Veronika; Barak, Martin; Kunova Bosakova, Michaela; Hampl, Ales; Cajanek, Lukas
2017-11-01
Human embryonic stem cells (hESCs) represent a promising tool to study functions of genes during development, to model diseases, and to even develop therapies when combined with gene editing techniques such as CRISPR/CRISPR-associated protein-9 nuclease (Cas9) system. However, the process of disruption of gene expression by generation of null alleles is often inefficient and tedious. To circumvent these limitations, we developed a simple and efficient protocol to permanently downregulate expression of a gene of interest in hESCs using CRISPR/Cas9. We selected p53 for our proof of concept experiments. The methodology is based on series of hESC transfection, which leads to efficient downregulation of p53 expression even in polyclonal population (p53 Low cells), here proven by a loss of regulation of the expression of p53 target gene, microRNA miR-34a. We demonstrate that our approach achieves over 80% efficiency in generating hESC clonal sublines that do not express p53 protein. Importantly, we document by a set of functional experiments that such genetically modified hESCs do retain typical stem cells characteristics. In summary, we provide a simple and robust protocol to efficiently target expression of gene of interest in hESCs that can be useful for laboratories aiming to employ gene editing in their hESC applications/protocols.
Shortening tobacco life cycle accelerates functional gene identification in genomic research.
Ning, G; Xiao, X; Lv, H; Li, X; Zuo, Y; Bao, M
2012-11-01
Definitive allocation of function requires the introduction of genetic mutations and analysis of their phenotypic consequences. Novel, rapid and convenient techniques or materials are very important and useful to accelerate gene identification in functional genomics research. Here, over-expression of PmFT (Prunus mume), a novel FT orthologue, and PtFT (Populus tremula) lead to shortening of the tobacco life cycle. A series of novel short life cycle stable tobacco lines (30-50 days) were developed through repeated self-crossing selection breeding. Based on the second transformation via a gusA reporter gene, the promoter from BpFULL1 in silver birch (Betula pendula) and the gene (CPC) from Arabidopsis thaliana were effectively tested using short life cycle tobacco lines. Comparative analysis among wild type, short life cycle tobacco and Arabidopsis transformation system verified that it is optional to accelerate functional gene studies by shortening host plant material life cycle, at least in these short life cycle tobacco lines. The results verified that the novel short life cycle transgenic tobacco lines not only combine the advantages of economic nursery requirements and a simple transformation system, but also provide a robust, effective and stable host system to accelerate gene analysis. Thus, shortening tobacco life cycle strategy is feasible to accelerate heterologous or homologous functional gene identification in genomic research. © 2012 German Botanical Society and The Royal Botanical Society of the Netherlands.
2006-07-21
This fundermental architecture can be illustrated as bow-tie architeture , that is claimed to be the fundermenal architecture of robust systems (Fig...sequenced genomes86,87. The Gene Ontology’s biological process hierarchy88 was used to annotate functional categories to each gene, and proportions of... proportions in the early Drosophila embryo. Nature 415, 798-802 (2002). 14. Kitano, H. Cancer robustness: tumour tactics. Nature 426, 125 (2003). 15
2008-07-21
size. This fundermental architecture can be illustrated as bow-tie architeture , that is claimed to be the fundermenal architecture of robust systems...sequenced genomes86,87. The Gene Ontology’s biological process hierarchy88 was used to annotate functional categories to each gene, and proportions ...and proportions in the early Drosophila embryo. Nature 415, 798-802 (2002). 14. Kitano, H. Cancer robustness: tumour tactics. Nature 426, 125 (2003
Robust, synergistic regulation of human gene expression using TALE activators.
Maeder, Morgan L; Linder, Samantha J; Reyon, Deepak; Angstman, James F; Fu, Yanfang; Sander, Jeffry D; Joung, J Keith
2013-03-01
Artificial activators designed using transcription activator-like effector (TALE) technology have broad utility, but previous studies suggest that these monomeric proteins often exhibit low activities. Here we demonstrate that TALE activators can robustly function individually or in synergistic combinations to increase expression of endogenous human genes over wide dynamic ranges. These findings will encourage applications of TALE activators for research and therapy, and guide design of monomeric TALE-based fusion proteins.
Wisdom of crowds for robust gene network inference
Marbach, Daniel; Costello, James C.; Küffner, Robert; Vega, Nicci; Prill, Robert J.; Camacho, Diogo M.; Allison, Kyle R.; Kellis, Manolis; Collins, James J.; Stolovitzky, Gustavo
2012-01-01
Reconstructing gene regulatory networks from high-throughput data is a long-standing problem. Through the DREAM project (Dialogue on Reverse Engineering Assessment and Methods), we performed a comprehensive blind assessment of over thirty network inference methods on Escherichia coli, Staphylococcus aureus, Saccharomyces cerevisiae, and in silico microarray data. We characterize performance, data requirements, and inherent biases of different inference approaches offering guidelines for both algorithm application and development. We observe that no single inference method performs optimally across all datasets. In contrast, integration of predictions from multiple inference methods shows robust and high performance across diverse datasets. Thereby, we construct high-confidence networks for E. coli and S. aureus, each comprising ~1700 transcriptional interactions at an estimated precision of 50%. We experimentally test 53 novel interactions in E. coli, of which 23 were supported (43%). Our results establish community-based methods as a powerful and robust tool for the inference of transcriptional gene regulatory networks. PMID:22796662
Computing Prediction and Functional Analysis of Prokaryotic Propionylation.
Wang, Li-Na; Shi, Shao-Ping; Wen, Ping-Ping; Zhou, Zhi-You; Qiu, Jian-Ding
2017-11-27
Identification and systematic analysis of candidates for protein propionylation are crucial steps for understanding its molecular mechanisms and biological functions. Although several proteome-scale methods have been performed to delineate potential propionylated proteins, the majority of lysine-propionylated substrates and their role in pathological physiology still remain largely unknown. By gathering various databases and literatures, experimental prokaryotic propionylation data were collated to be trained in a support vector machine with various features via a three-step feature selection method. A novel online tool for seeking potential lysine-propionylated sites (PropSeek) ( http://bioinfo.ncu.edu.cn/PropSeek.aspx ) was built. Independent test results of leave-one-out and n-fold cross-validation were similar to each other, showing that PropSeek is a stable and robust predictor with satisfying performance. Meanwhile, analyses of Gene Ontology, Kyoto Encyclopedia of Genes and Genomes pathways, and protein-protein interactions implied a potential role of prokaryotic propionylation in protein synthesis and metabolism.
Detection of growth hormone doping by gene expression profiling of peripheral blood.
Mitchell, Christopher J; Nelson, Anne E; Cowley, Mark J; Kaplan, Warren; Stone, Glenn; Sutton, Selina K; Lau, Amie; Lee, Carol M Y; Ho, Ken K Y
2009-12-01
GH abuse is a significant problem in many sports, and there is currently no robust test that allows detection of doping beyond a short window after administration. Our objective was to evaluate gene expression profiling in peripheral blood leukocytes in-vivo as a test for GH doping in humans. Seven men and thirteen women were administered GH, 2 mg/d sc for 8 wk. Blood was collected at baseline and at 8 wk. RNA was extracted from the white cell fraction. Microarray analysis was undertaken using Agilent 44K G4112F arrays using a two-color design. Quantitative RT-PCR using TaqMan gene expression assays was performed for validation of selected differentially expressed genes. GH induced an approximately 2-fold increase in circulating IGF-I that was maintained throughout the 8 wk of the study. GH induced significant changes in gene expression with 353 in women and 41 in men detected with a false discovery rate of less than 5%. None of the differentially expressed genes were common between men and women. The maximal changes were a doubling for up-regulated or halving for down-regulated genes, similar in magnitude to the variation between individuals. Quantitative RT-PCR for seven target genes showed good concordance between microarray and quantitative PCR data in women but not in men. Gene expression analysis of peripheral blood leukocytes is unlikely to be a viable approach for the detection of GH doping.
Robust Variable Selection with Exponential Squared Loss.
Wang, Xueqin; Jiang, Yunlu; Huang, Mian; Zhang, Heping
2013-04-01
Robust variable selection procedures through penalized regression have been gaining increased attention in the literature. They can be used to perform variable selection and are expected to yield robust estimates. However, to the best of our knowledge, the robustness of those penalized regression procedures has not been well characterized. In this paper, we propose a class of penalized robust regression estimators based on exponential squared loss. The motivation for this new procedure is that it enables us to characterize its robustness that has not been done for the existing procedures, while its performance is near optimal and superior to some recently developed methods. Specifically, under defined regularity conditions, our estimators are [Formula: see text] and possess the oracle property. Importantly, we show that our estimators can achieve the highest asymptotic breakdown point of 1/2 and that their influence functions are bounded with respect to the outliers in either the response or the covariate domain. We performed simulation studies to compare our proposed method with some recent methods, using the oracle method as the benchmark. We consider common sources of influential points. Our simulation studies reveal that our proposed method performs similarly to the oracle method in terms of the model error and the positive selection rate even in the presence of influential points. In contrast, other existing procedures have a much lower non-causal selection rate. Furthermore, we re-analyze the Boston Housing Price Dataset and the Plasma Beta-Carotene Level Dataset that are commonly used examples for regression diagnostics of influential points. Our analysis unravels the discrepancies of using our robust method versus the other penalized regression method, underscoring the importance of developing and applying robust penalized regression methods.
Robust Variable Selection with Exponential Squared Loss
Wang, Xueqin; Jiang, Yunlu; Huang, Mian; Zhang, Heping
2013-01-01
Robust variable selection procedures through penalized regression have been gaining increased attention in the literature. They can be used to perform variable selection and are expected to yield robust estimates. However, to the best of our knowledge, the robustness of those penalized regression procedures has not been well characterized. In this paper, we propose a class of penalized robust regression estimators based on exponential squared loss. The motivation for this new procedure is that it enables us to characterize its robustness that has not been done for the existing procedures, while its performance is near optimal and superior to some recently developed methods. Specifically, under defined regularity conditions, our estimators are n-consistent and possess the oracle property. Importantly, we show that our estimators can achieve the highest asymptotic breakdown point of 1/2 and that their influence functions are bounded with respect to the outliers in either the response or the covariate domain. We performed simulation studies to compare our proposed method with some recent methods, using the oracle method as the benchmark. We consider common sources of influential points. Our simulation studies reveal that our proposed method performs similarly to the oracle method in terms of the model error and the positive selection rate even in the presence of influential points. In contrast, other existing procedures have a much lower non-causal selection rate. Furthermore, we re-analyze the Boston Housing Price Dataset and the Plasma Beta-Carotene Level Dataset that are commonly used examples for regression diagnostics of influential points. Our analysis unravels the discrepancies of using our robust method versus the other penalized regression method, underscoring the importance of developing and applying robust penalized regression methods. PMID:23913996
Bottari, Benedetta; Felis, Giovanna E; Salvetti, Elisa; Castioni, Anna; Campedelli, Ilenia; Torriani, Sandra; Bernini, Valentina; Gatti, Monica
2017-07-01
Lactobacillus casei,Lactobacillus paracasei and Lactobacillusrhamnosus form a closely related taxonomic group (the L. casei group) within the facultatively heterofermentative lactobacilli. Strains of these species have been used for a long time as probiotics in a wide range of products, and they represent the dominant species of nonstarter lactic acid bacteria in ripened cheeses, where they contribute to flavour development. The close genetic relationship among those species, as well as the similarity of biochemical properties of the strains, hinders the development of an adequate selective method to identify these bacteria. Despite this being a hot topic, as demonstrated by the large amount of literature about it, the results of different proposed identification methods are often ambiguous and unsatisfactory. The aim of this study was to develop a more robust species-specific identification assay for differentiating the species of the L. casei group. A taxonomy-driven comparative genomic analysis was carried out to select the potential target genes whose similarity could better reflect genome-wide diversity. The gene mutL appeared to be the most promising one and, therefore, a novel species-specific multiplex PCR assay was developed to rapidly and effectively distinguish L. casei, L. paracasei and L. rhamnosus strains. The analysis of a collection of 76 wild dairy isolates, previously identified as members of the L. casei group combining the results of multiple approaches, revealed that the novel designed primers, especially in combination with already existing ones, were able to improve the discrimination power at the species level and reveal previously undiscovered intraspecific biodiversity.
Tripathi, Jaindra N; Oduor, Richard O; Tripathi, Leena
2015-01-01
Banana (Musa spp.) is an important staple food as well as cash crop in tropical and subtropical countries. Various bacterial, fungal, and viral diseases and pests such as nematodes are major constraints in its production and are currently destabilizing the banana production in sub-Saharan Africa. Genetic engineering is a complementary option used for incorporating useful traits in banana to bypass the long generation time, polyploidy, and sterility of most of the cultivated varieties. A robust transformation protocol for farmer preferred varieties is crucial for banana genomics and improvement. A robust and reproducible system for genetic transformation of banana using embryogenic cell suspensions (ECS) has been developed in this study. Two different types of explants (immature male flowers and multiple buds) were tested for their ability to develop ECS in several varieties of banana locally grown in Africa. ECS of banana varieties "Cavendish Williams" and "Gros Michel" were developed using multiple buds, whereas ECS of "Sukali Ndiizi" was developed using immature male flowers. Regeneration efficiency of ECS was about 20,000-50,000 plantlets per ml of settled cell volume (SCV) depending on variety. ECS of three different varieties were transformed through Agrobacterium-mediated transformation using gusA reporter gene and 20-70 independent transgenic events per ml SCV of ECS were regenerated on selective medium. The presence and integration of gusA gene in transgenic plants was confirmed by PCR, dot blot, and Southern blot analysis and expression by histochemical GUS assays. The robust transformation platform was successfully used to generate hundreds of transgenic lines with disease resistance. Such a platform will facilitate the transfer of technologies to national agricultural research systems (NARS) in Africa.
Tripathi, Jaindra N.; Oduor, Richard O.; Tripathi, Leena
2015-01-01
Banana (Musa spp.) is an important staple food as well as cash crop in tropical and subtropical countries. Various bacterial, fungal, and viral diseases and pests such as nematodes are major constraints in its production and are currently destabilizing the banana production in sub-Saharan Africa. Genetic engineering is a complementary option used for incorporating useful traits in banana to bypass the long generation time, polyploidy, and sterility of most of the cultivated varieties. A robust transformation protocol for farmer preferred varieties is crucial for banana genomics and improvement. A robust and reproducible system for genetic transformation of banana using embryogenic cell suspensions (ECS) has been developed in this study. Two different types of explants (immature male flowers and multiple buds) were tested for their ability to develop ECS in several varieties of banana locally grown in Africa. ECS of banana varieties “Cavendish Williams” and “Gros Michel” were developed using multiple buds, whereas ECS of “Sukali Ndiizi” was developed using immature male flowers. Regeneration efficiency of ECS was about 20,000–50,000 plantlets per ml of settled cell volume (SCV) depending on variety. ECS of three different varieties were transformed through Agrobacterium-mediated transformation using gusA reporter gene and 20–70 independent transgenic events per ml SCV of ECS were regenerated on selective medium. The presence and integration of gusA gene in transgenic plants was confirmed by PCR, dot blot, and Southern blot analysis and expression by histochemical GUS assays. The robust transformation platform was successfully used to generate hundreds of transgenic lines with disease resistance. Such a platform will facilitate the transfer of technologies to national agricultural research systems (NARS) in Africa. PMID:26635849
Sevillano, Laura; Díaz, Margarita; Santamaría, Ramón I
2017-09-26
The industrial use of enzymes produced by microorganisms is continuously growing due to the need for sustainable solutions. Nevertheless, many of the plasmids used for recombinant production of proteins in bacteria are based on the use of antibiotic resistance genes as selection markers. The safety concerns and legal requirements surrounding the increased use of antibiotic resistance genes have made the development of new antibiotic-free approaches essential. In this work, a system completely free of antibiotic resistance genes and useful for the production of high yields of proteins in Streptomyces is described. This system is based on the separation of the two components of the yefM/yoeBsl (antitoxin/toxin) operon; the toxin (yoeBsl) gene, responsible for host death, is integrated into the genome and the antitoxin gene (yefMsl), which inactivates the toxin, is located in the expression plasmid. To develop this system, the toxin gene was integrated into the genome of a strain lacking the complete operon, and the antibiotic resistance gene integrated along with the toxin was eliminated by Cre recombinase to generate a final host strain free of any antibiotic resistance marker. In the same way, the antibiotic resistance gene from the final expression plasmid was removed by Dre recombinase. The usefulness of this system was analysed by checking the production of two hydrolases from different Streptomyces. Production of both proteins, with potential industrial use, was high and stable over time after strain storage and after serial subcultures. These results support the robustness and stability of the positive selection system developed. The total absence of antibiotic resistance genes makes this system a powerful tool for using Streptomyces as a host to produce proteins at the industrial level. This work is the first Streptomyces antibiotic marker-free system to be described. Graphical abstract Antibiotic marker-free platform for protein expression in Streptomyces. The antitoxin gene present in the expression plasmid counteracts the effect of the toxin gene in the genome. In absence of the expression plasmid, the toxin causes cell death ensuring that only plasmid-containing cells persist.
Covariate selection with group lasso and doubly robust estimation of causal effects
Koch, Brandon; Vock, David M.; Wolfson, Julian
2017-01-01
Summary The efficiency of doubly robust estimators of the average causal effect (ACE) of a treatment can be improved by including in the treatment and outcome models only those covariates which are related to both treatment and outcome (i.e., confounders) or related only to the outcome. However, it is often challenging to identify such covariates among the large number that may be measured in a given study. In this paper, we propose GLiDeR (Group Lasso and Doubly Robust Estimation), a novel variable selection technique for identifying confounders and predictors of outcome using an adaptive group lasso approach that simultaneously performs coefficient selection, regularization, and estimation across the treatment and outcome models. The selected variables and corresponding coefficient estimates are used in a standard doubly robust ACE estimator. We provide asymptotic results showing that, for a broad class of data generating mechanisms, GLiDeR yields a consistent estimator of the ACE when either the outcome or treatment model is correctly specified. A comprehensive simulation study shows that GLiDeR is more efficient than doubly robust methods using standard variable selection techniques and has substantial computational advantages over a recently proposed doubly robust Bayesian model averaging method. We illustrate our method by estimating the causal treatment effect of bilateral versus single-lung transplant on forced expiratory volume in one year after transplant using an observational registry. PMID:28636276
Covariate selection with group lasso and doubly robust estimation of causal effects.
Koch, Brandon; Vock, David M; Wolfson, Julian
2018-03-01
The efficiency of doubly robust estimators of the average causal effect (ACE) of a treatment can be improved by including in the treatment and outcome models only those covariates which are related to both treatment and outcome (i.e., confounders) or related only to the outcome. However, it is often challenging to identify such covariates among the large number that may be measured in a given study. In this article, we propose GLiDeR (Group Lasso and Doubly Robust Estimation), a novel variable selection technique for identifying confounders and predictors of outcome using an adaptive group lasso approach that simultaneously performs coefficient selection, regularization, and estimation across the treatment and outcome models. The selected variables and corresponding coefficient estimates are used in a standard doubly robust ACE estimator. We provide asymptotic results showing that, for a broad class of data generating mechanisms, GLiDeR yields a consistent estimator of the ACE when either the outcome or treatment model is correctly specified. A comprehensive simulation study shows that GLiDeR is more efficient than doubly robust methods using standard variable selection techniques and has substantial computational advantages over a recently proposed doubly robust Bayesian model averaging method. We illustrate our method by estimating the causal treatment effect of bilateral versus single-lung transplant on forced expiratory volume in one year after transplant using an observational registry. © 2017, The International Biometric Society.
2012-01-01
Background While the genetic transformation of the major cereal crops has become relatively routine, to date only a few reports were published on transgenic triticale, and robust data on T-DNA integration and segregation have not been available in this species. Results Here, we present a comprehensive analysis of stable transgenic winter triticale cv. Bogo carrying the selectable marker gene HYGROMYCIN PHOSPHOTRANSFERASE (HPT) and a synthetic green fluorescent protein gene (gfp). Progeny of four independent transgenic plants were comprehensively investigated with regard to the number of integrated T-DNA copies, the number of plant genomic integration loci, the integrity and functionality of individual T-DNA copies, as well as the segregation of transgenes in T1 and T2 generations, which also enabled us to identify homozygous transgenic lines. The truncation of some integrated T-DNAs at their left end along with the occurrence of independent segregation of multiple T-DNAs unintendedly resulted in a single-copy segregant that is selectable marker-free and homozygous for the gfp gene. The heritable expression of gfp driven by the maize UBI-1 promoter was demonstrated by confocal laser scanning microscopy. Conclusions The used transformation method is a valuable tool for the genetic engineering of triticale. Here we show that comprehensive molecular analyses are required for the correct interpretation of phenotypic data collected from the transgenic plants. PMID:23006412
Hensel, Goetz; Oleszczuk, Sylwia; Daghma, Diaa Eldin S; Zimny, Janusz; Melzer, Michael; Kumlehn, Jochen
2012-09-25
While the genetic transformation of the major cereal crops has become relatively routine, to date only a few reports were published on transgenic triticale, and robust data on T-DNA integration and segregation have not been available in this species. Here, we present a comprehensive analysis of stable transgenic winter triticale cv. Bogo carrying the selectable marker gene HYGROMYCIN PHOSPHOTRANSFERASE (HPT) and a synthetic green fluorescent protein gene (gfp). Progeny of four independent transgenic plants were comprehensively investigated with regard to the number of integrated T-DNA copies, the number of plant genomic integration loci, the integrity and functionality of individual T-DNA copies, as well as the segregation of transgenes in T1 and T2 generations, which also enabled us to identify homozygous transgenic lines. The truncation of some integrated T-DNAs at their left end along with the occurrence of independent segregation of multiple T-DNAs unintendedly resulted in a single-copy segregant that is selectable marker-free and homozygous for the gfp gene. The heritable expression of gfp driven by the maize UBI-1 promoter was demonstrated by confocal laser scanning microscopy. The used transformation method is a valuable tool for the genetic engineering of triticale. Here we show that comprehensive molecular analyses are required for the correct interpretation of phenotypic data collected from the transgenic plants.
From the ORFeome concept to highly comprehensive, full-genome screening libraries.
Rid, Raphaela; Abdel-Hadi, Omar; Maier, Richard; Wagner, Martin; Hundsberger, Harald; Hintner, Helmut; Bauer, Johann; Onder, Kamil
2013-02-01
Recombination-based cloning techniques have in recent times facilitated the establishment of genome-scale single-gene ORFeome repositories. Their further handling and downstream application in systematic fashion is, however, practically impeded because of logistical plus economic challenges. At this juncture, simultaneously transferring entire gene collections in compiled pool format could represent an advanced compromise between systematic ORFeome (an organism's entire set of protein-encoding open reading frames) projects and traditional random library approaches, but has not yet been considered in great detail. In our endeavor to merge the comprehensiveness of ORFeomes with a basically simple, streamlined, and easily executable single-tube design, we have here produced five different pooled screening-ready libraries for both Staphylococcus aureus and Homo sapiens. By evaluating the parallel transfer efficiencies of differentially sized genes from initial polymerase chain reaction (PCR) product amplification to entry and final destination library construction via quantitative real-time PCR, we found that the complexity of the gene population is fairly stably maintained once an entry resource has been successfully established, and that no apparent size-selection bias loss of large inserts takes place. Recombinational transfer processes are hence robust enough for straightforwardly achieving such pooled screening libraries.
Inference of combinatorial Boolean rules of synergistic gene sets from cancer microarray datasets.
Park, Inho; Lee, Kwang H; Lee, Doheon
2010-06-15
Gene set analysis has become an important tool for the functional interpretation of high-throughput gene expression datasets. Moreover, pattern analyses based on inferred gene set activities of individual samples have shown the ability to identify more robust disease signatures than individual gene-based pattern analyses. Although a number of approaches have been proposed for gene set-based pattern analysis, the combinatorial influence of deregulated gene sets on disease phenotype classification has not been studied sufficiently. We propose a new approach for inferring combinatorial Boolean rules of gene sets for a better understanding of cancer transcriptome and cancer classification. To reduce the search space of the possible Boolean rules, we identify small groups of gene sets that synergistically contribute to the classification of samples into their corresponding phenotypic groups (such as normal and cancer). We then measure the significance of the candidate Boolean rules derived from each group of gene sets; the level of significance is based on the class entropy of the samples selected in accordance with the rules. By applying the present approach to publicly available prostate cancer datasets, we identified 72 significant Boolean rules. Finally, we discuss several identified Boolean rules, such as the rule of glutathione metabolism (down) and prostaglandin synthesis regulation (down), which are consistent with known prostate cancer biology. Scripts written in Python and R are available at http://biosoft.kaist.ac.kr/~ihpark/. The refined gene sets and the full list of the identified Boolean rules are provided in the Supplementary Material. Supplementary data are available at Bioinformatics online.
Optimal Scaling of Digital Transcriptomes
Glusman, Gustavo; Caballero, Juan; Robinson, Max; Kutlu, Burak; Hood, Leroy
2013-01-01
Deep sequencing of transcriptomes has become an indispensable tool for biology, enabling expression levels for thousands of genes to be compared across multiple samples. Since transcript counts scale with sequencing depth, counts from different samples must be normalized to a common scale prior to comparison. We analyzed fifteen existing and novel algorithms for normalizing transcript counts, and evaluated the effectiveness of the resulting normalizations. For this purpose we defined two novel and mutually independent metrics: (1) the number of “uniform” genes (genes whose normalized expression levels have a sufficiently low coefficient of variation), and (2) low Spearman correlation between normalized expression profiles of gene pairs. We also define four novel algorithms, one of which explicitly maximizes the number of uniform genes, and compared the performance of all fifteen algorithms. The two most commonly used methods (scaling to a fixed total value, or equalizing the expression of certain ‘housekeeping’ genes) yielded particularly poor results, surpassed even by normalization based on randomly selected gene sets. Conversely, seven of the algorithms approached what appears to be optimal normalization. Three of these algorithms rely on the identification of “ubiquitous” genes: genes expressed in all the samples studied, but never at very high or very low levels. We demonstrate that these include a “core” of genes expressed in many tissues in a mutually consistent pattern, which is suitable for use as an internal normalization guide. The new methods yield robustly normalized expression values, which is a prerequisite for the identification of differentially expressed and tissue-specific genes as potential biomarkers. PMID:24223126
Friedrich, Torben; Rahmann, Sven; Weigel, Wilfried; Rabsch, Wolfgang; Fruth, Angelika; Ron, Eliora; Gunzer, Florian; Dandekar, Thomas; Hacker, Jörg; Müller, Tobias; Dobrindt, Ulrich
2010-10-21
The Enterobacteriaceae comprise a large number of clinically relevant species with several individual subspecies. Overlapping virulence-associated gene pools and the high overall genome plasticity often interferes with correct enterobacterial strain typing and risk assessment. Array technology offers a fast, reproducible and standardisable means for bacterial typing and thus provides many advantages for bacterial diagnostics, risk assessment and surveillance. The development of highly discriminative broad-range microbial diagnostic microarrays remains a challenge, because of marked genome plasticity of many bacterial pathogens. We developed a DNA microarray for strain typing and detection of major antimicrobial resistance genes of clinically relevant enterobacteria. For this purpose, we applied a global genome-wide probe selection strategy on 32 available complete enterobacterial genomes combined with a regression model for pathogen classification. The discriminative power of the probe set was further tested in silico on 15 additional complete enterobacterial genome sequences. DNA microarrays based on the selected probes were used to type 92 clinical enterobacterial isolates. Phenotypic tests confirmed the array-based typing results and corroborate that the selected probes allowed correct typing and prediction of major antibiotic resistances of clinically relevant Enterobacteriaceae, including the subspecies level, e.g. the reliable distinction of different E. coli pathotypes. Our results demonstrate that the global probe selection approach based on longest common factor statistics as well as the design of a DNA microarray with a restricted set of discriminative probes enables robust discrimination of different enterobacterial variants and represents a proof of concept that can be adopted for diagnostics of a wide range of microbial pathogens. Our approach circumvents misclassifications arising from the application of virulence markers, which are highly affected by horizontal gene transfer. Moreover, a broad range of pathogens have been covered by an efficient probe set size enabling the design of high-throughput diagnostics.
Genetic selection of cattle for improved immunity and health.
Mallard, Bonnie A; Emam, Mehdi; Paibomesai, Marlene; Thompson-Crispi, Kathleen; Wagter-Lesperance, Lauraine
2015-02-01
The immune system is a sensing structure composed of tissues and molecules that are well integrated with the neuroendocrine system. This integrate system ensures non-self from self-discrimination. In this capacity the immune system provides detection and protection from a wide range of pathogens. In mammals, the immune system is regulated by several thousand genes (8-9% of the genome) which indicate its high genetic priority as a critical fitness trait providing survival of the species. Identifying and selectively breeding livestock with the inherent ability to make superior immune responses can reduce disease occurrence, improve milk quality and increase farm profitability. Healthier animals also may be expected to demonstrate improvements in other traits, including reproductive fitness. Using the University of Guelph's patented High Immune Response technology it is possible to classify animals as high, average, or low responders based on their genetic estimated breeding value for immune responsiveness. High responders have the inherent ability to produce more balanced and robust immune responses compared with average or low responders. High responders dairy cattle essentially have about one-half the disease occurrence of low responders, and can pass their superior immune response genes on to future generations thereby accumulating health benefits within the dairy herd.
Yu, Ron X.; Liu, Jie; True, Nick; Wang, Wei
2008-01-01
A major challenge in the post-genome era is to reconstruct regulatory networks from the biological knowledge accumulated up to date. The development of tools for identifying direct target genes of transcription factors (TFs) is critical to this endeavor. Given a set of microarray experiments, a probabilistic model called TRANSMODIS has been developed which can infer the direct targets of a TF by integrating sequence motif, gene expression and ChIP-chip data. The performance of TRANSMODIS was first validated on a set of transcription factor perturbation experiments (TFPEs) involving Pho4p, a well studied TF in Saccharomyces cerevisiae. TRANSMODIS removed elements of arbitrariness in manual target gene selection process and produced results that concur with one's intuition. TRANSMODIS was further validated on a genome-wide scale by comparing it with two other methods in Saccharomyces cerevisiae. The usefulness of TRANSMODIS was then demonstrated by applying it to the identification of direct targets of DAF-16, a critical TF regulating ageing in Caenorhabditis elegans. We found that 189 genes were tightly regulated by DAF-16. In addition, DAF-16 has differential preference for motifs when acting as an activator or repressor, which awaits experimental verification. TRANSMODIS is computationally efficient and robust, making it a useful probabilistic framework for finding immediate targets. PMID:18350157
Kagohara, Luciane Tsukamoto; Maldonado, Leonel; Brait, Mariana; Schoenberg, Mark; Bivalacqua, Trinity; Netto, George J; Koch, Wayne; Sidransky, David; Hoque, Mohammad O.
2014-01-01
Background: To identify new epigenetic markers and further characterize Urothelial Cell Carcinoma (UCC), we tested the promoter methylation (PM) status of 19 genes previously identified as cancer specific methylated genes in other solid tumors. Methods: We used bisulfite sequencing, methylation specific PCR and quantitative methylation specific PCR (QMSP) to test the PM status of 19 genes in urothelial cancer cell lines. Results: Among the 19 genes tested, VGF was found to be completely methylated in several UCC cell lines. VGF QMSP analysis showed that methylation values of almost all the primary 19 UCC tissues were higher than the paired normal tissues (P=0.009). In another cohort, 12/35 (34.3%) of low grade UCC cases displayed VGF methylation. As a biomarker for non-invasive detection of UCC, VGF showed a significantly higher frequency of methylation in urine from UCC cases (8/20) compared to controls (1/20) (P=0.020). After treatment of cell lines with 5-Aza-2'-deoxycytidine, VGF was robustly re-expressed. Forced expression of VGF in bladder cancer cell lines inhibited cell growth. Conclusion: Selection of candidates from genome-wide screening approach in other solid tumors successfully identified UCC specific methylated genes. PMID:24830820
Leveraging Distant Relatedness to Quantify Human Mutation and Gene-Conversion Rates
Palamara, Pier Francesco; Francioli, Laurent C.; Wilton, Peter R.; Genovese, Giulio; Gusev, Alexander; Finucane, Hilary K.; Sankararaman, Sriram; Sunyaev, Shamil R.; de Bakker, Paul I.W.; Wakeley, John; Pe’er, Itsik; Price, Alkes L.
2015-01-01
The rate at which human genomes mutate is a central biological parameter that has many implications for our ability to understand demographic and evolutionary phenomena. We present a method for inferring mutation and gene-conversion rates by using the number of sequence differences observed in identical-by-descent (IBD) segments together with a reconstructed model of recent population-size history. This approach is robust to, and can quantify, the presence of substantial genotyping error, as validated in coalescent simulations. We applied the method to 498 trio-phased sequenced Dutch individuals and inferred a point mutation rate of 1.66 × 10−8 per base per generation and a rate of 1.26 × 10−9 for <20 bp indels. By quantifying how estimates varied as a function of allele frequency, we inferred the probability that a site is involved in non-crossover gene conversion as 5.99 × 10−6. We found that recombination does not have observable mutagenic effects after gene conversion is accounted for and that local gene-conversion rates reflect recombination rates. We detected a strong enrichment of recent deleterious variation among mismatching variants found within IBD regions and observed summary statistics of local sharing of IBD segments to closely match previously proposed metrics of background selection; however, we found no significant effects of selection on our mutation-rate estimates. We detected no evidence of strong variation of mutation rates in a number of genomic annotations obtained from several recent studies. Our analysis suggests that a mutation-rate estimate higher than that reported by recent pedigree-based studies should be adopted in the context of DNA-based demographic reconstruction. PMID:26581902
Mutlib, Abdul; Jiang, Ping; Atherton, Jim; Obert, Leslie; Kostrubsky, Seva; Madore, Steven; Nelson, Sidney
2006-10-01
The inability to predict if a metabolically bioactivated compound will cause toxicity in later stages of drug development or post-marketing is of serious concern. One approach for improving the predictive success of compound toxicity has been to compare the gene expression profile in preclinical models dosed with novel compounds to a gene expression database generated from compounds with known toxicity. While this guilt-by-association approach can be useful, it is often difficult to elucidate gene expression changes that may be related to the generation of reactive metabolites. In an effort to address this issue, we compared the gene expression profiles obtained from animals treated with a soft-electrophile-producing hepatotoxic compound against corresponding deuterium labeled analogues resistant to metabolic processing. Our aim was to identify a subset of potential biomarker genes for hepatotoxicity caused by soft-electrophile-producing compounds. The current study utilized a known hepatotoxic compound N-methylformamide (NMF) and its two analogues labeled with deuterium at different positions to block metabolic oxidation at the formyl (d(1)) and methyl (d(3)) moieties. Groups of mice were dosed with each compound, and their livers were harvested at different time intervals. RNA was prepared and analyzed on Affymetrix GeneChip arrays. RNA transcripts showing statistically significant changes were identified, and selected changes were confirmed using TaqMan RT-PCR. Serum clinical chemistry and histopathologic evaluations were performed on selected samples as well. The data set generated from the different groups of animals enabled us to determine which gene expression changes were attributed to the bioactivating pathway. We were able to selectively modulate the metabolism of NMF by labeling various positions of the molecule with a stable isotope, allowing us to monitor gene changes specifically due to a particular metabolic pathway. Two groups of genes were identified, which were associated with the metabolism of a certain part of the NMF molecule. The metabolic pathway leading to the production of reactive methyl isocyanate resulted in distinct expression patterns that correlated with histopathologic findings. There was a clear correlation between the expression of certain genes involved in the cell cycle/apoptosis and inflammatory pathways and the presence of reactive metabolite. These genes may serve as potential genomic biomarkers of hepatotoxicity induced by soft-electrophile-producing compounds. However, the robustness of these potential genomic biomarkers will need to be validated using other hepatotoxicants (both soft- and hard-electrophile-producing agents) and compounds known to cause idiosyncratic liver toxicity before being adopted into the drug discovery screening process.
Chen, Bor-Sen; Lin, Ying-Po
2013-01-01
In ecological networks, network robustness should be large enough to confer intrinsic robustness for tolerating intrinsic parameter fluctuations, as well as environmental robustness for resisting environmental disturbances, so that the phenotype stability of ecological networks can be maintained, thus guaranteeing phenotype robustness. However, it is difficult to analyze the network robustness of ecological systems because they are complex nonlinear partial differential stochastic systems. This paper develops a unifying mathematical framework for investigating the principles of both robust stabilization and environmental disturbance sensitivity in ecological networks. We found that the phenotype robustness criterion for ecological networks is that if intrinsic robustness + environmental robustness ≦ network robustness, then the phenotype robustness can be maintained in spite of intrinsic parameter fluctuations and environmental disturbances. These results in robust ecological networks are similar to that in robust gene regulatory networks and evolutionary networks even they have different spatial-time scales. PMID:23515112
Host–parasite fluctuating selection in the absence of specificity
Ashby, Ben; White, Andy; Bowers, Roger; Buckling, Angus; Koskella, Britt
2017-01-01
Fluctuating selection driven by coevolution between hosts and parasites is important for the generation of host and parasite diversity across space and time. Theory has focused primarily on infection genetics, with highly specific ‘matching-allele’ frameworks more likely to generate fluctuating selection dynamics (FSD) than ‘gene-for-gene’ (generalist–specialist) frameworks. However, the environment, ecological feedbacks and life-history characteristics may all play a role in determining when FSD occurs. Here, we develop eco-evolutionary models with explicit ecological dynamics to explore the ecological, epidemiological and host life-history drivers of FSD. Our key result is to demonstrate for the first time, to our knowledge, that specificity between hosts and parasites is not required to generate FSD. Furthermore, highly specific host–parasite interactions produce unstable, less robust stochastic fluctuations in contrast to interactions that lack specificity altogether or those that vary from generalist to specialist, which produce predictable limit cycles. Given the ubiquity of ecological feedbacks and the variation in the nature of specificity in host–parasite interactions, our work emphasizes the underestimated potential for host–parasite coevolution to generate fluctuating selection. PMID:29093222
Luo, Xiong-Jian; Mattheisen, Manuel; Li, Ming; Huang, Liang; Rietschel, Marcella; Børglum, Anders D; Als, Thomas D; van den Oord, Edwin J; Aberg, Karolina A; Mors, Ole; Mortensen, Preben Bo; Luo, Zhenwu; Degenhardt, Franziska; Cichon, Sven; Schulze, Thomas G; Nöthen, Markus M; Su, Bing; Zhao, Zhongming; Gan, Lin; Yao, Yong-Gang
2015-11-01
Genome-wide association studies have identified multiple risk variants and loci that show robust association with schizophrenia. Nevertheless, it remains unclear how these variants confer risk to schizophrenia. In addition, the driving force that maintains the schizophrenia risk variants in human gene pool is poorly understood. To investigate whether expression-associated genetic variants contribute to schizophrenia susceptibility, we systematically integrated brain expression quantitative trait loci and genome-wide association data of schizophrenia using Sherlock, a Bayesian statistical framework. Our analyses identified ZNF323 as a schizophrenia risk gene (P = 2.22×10(-6)). Subsequent analyses confirmed the association of the ZNF323 and its expression-associated single nucleotide polymorphism rs1150711 in independent samples (gene-expression: P = 1.40×10(-6); single-marker meta-analysis in the combined discovery and replication sample comprising 44123 individuals: P = 6.85×10(-10)). We found that the ZNF323 was significantly downregulated in hippocampus and frontal cortex of schizophrenia patients (P = .0038 and P = .0233, respectively). Evidence for pleiotropic effects was detected (association of rs1150711 with lung function and gene expression of ZNF323 in lung: P = 6.62×10(-5) and P = 9.00×10(-5), respectively) with the risk allele (T allele) for schizophrenia acting as protective allele for lung function. Subsequent population genetics analyses suggest that the risk allele (T) of rs1150711 might have undergone recent positive selection in human population. Our findings suggest that the ZNF323 is a schizophrenia susceptibility gene whose expression may influence schizophrenia risk. Our study also illustrates a possible mechanism for maintaining schizophrenia risk variants in the human gene pool. © The Author 2015. Published by Oxford University Press on behalf of the Maryland Psychiatric Research Center. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Fernandez-Leon, Jose A; Acosta, Gerardo G; Rozenfeld, Alejandro
2014-10-01
Researchers in diverse fields, such as in neuroscience, systems biology and autonomous robotics, have been intrigued by the origin and mechanisms for biological robustness. Darwinian evolution, in general, has suggested that adaptive mechanisms as a way of reaching robustness, could evolve by natural selection acting successively on numerous heritable variations. However, is this understanding enough for realizing how biological systems remain robust during their interactions with the surroundings? Here, we describe selected studies of bio-inspired systems that show behavioral robustness. From neurorobotics, cognitive, self-organizing and artificial immune system perspectives, our discussions focus mainly on how robust behaviors evolve or emerge in these systems, having the capacity of interacting with their surroundings. These descriptions are twofold. Initially, we introduce examples from autonomous robotics to illustrate how the process of designing robust control can be idealized in complex environments for autonomous navigation in terrain and underwater vehicles. We also include descriptions of bio-inspired self-organizing systems. Then, we introduce other studies that contextualize experimental evolution with simulated organisms and physical robots to exemplify how the process of natural selection can lead to the evolution of robustness by means of adaptive behaviors. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Using Public Data for Comparative Proteome Analysis in Precision Medicine Programs.
Hughes, Christopher S; Morin, Gregg B
2018-03-01
Maximizing the clinical utility of information obtained in longitudinal precision medicine programs would benefit from robust comparative analyses to known information to assess biological features of patient material toward identifying the underlying features driving their disease phenotype. Herein, the potential for utilizing publically deposited mass-spectrometry-based proteomics data to perform inter-study comparisons of cell-line or tumor-tissue materials is investigated. To investigate the robustness of comparison between MS-based proteomics studies carried out with different methodologies, deposited data representative of label-free (MS1) and isobaric tagging (MS2 and MS3 quantification) are utilized. In-depth quantitative proteomics data acquired from analysis of ovarian cancer cell lines revealed the robust recapitulation of observable gene expression dynamics between individual studies carried out using significantly different methodologies. The observed signatures enable robust inter-study clustering of cell line samples. In addition, the ability to classify and cluster tumor samples based on observed gene expression trends when using a single patient sample is established. With this analysis, relevant gene expression dynamics are obtained from a single patient tumor, in the context of a precision medicine analysis, by leveraging a large cohort of repository data as a comparator. Together, these data establish the potential for state-of-the-art MS-based proteomics data to serve as resources for robust comparative analyses in precision medicine applications. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A Scaled Framework for CRISPR Editing of Human Pluripotent Stem Cells to Study Psychiatric Disease.
Hazelbaker, Dane Z; Beccard, Amanda; Bara, Anne M; Dabkowski, Nicole; Messana, Angelica; Mazzucato, Patrizia; Lam, Daisy; Manning, Danielle; Eggan, Kevin; Barrett, Lindy E
2017-10-10
Scaling of CRISPR-Cas9 technology in human pluripotent stem cells (hPSCs) represents an important step for modeling complex disease and developing drug screens in human cells. However, variables affecting the scaling efficiency of gene editing in hPSCs remain poorly understood. Here, we report a standardized CRISPR-Cas9 approach, with robust benchmarking at each step, to successfully target and genotype a set of psychiatric disease-implicated genes in hPSCs and provide a resource of edited hPSC lines for six of these genes. We found that transcriptional state and nucleosome positioning around targeted loci was not correlated with editing efficiency. However, editing frequencies varied between different hPSC lines and correlated with genomic stability, underscoring the need for careful cell line selection and unbiased assessments of genomic integrity. Together, our step-by-step quantification and in-depth analyses provide an experimental roadmap for scaling Cas9-mediated editing in hPSCs to study psychiatric disease, with broader applicability for other polygenic diseases. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
Antibiotic Resistance in Plant-Pathogenic Bacteria.
Sundin, George W; Wang, Nian
2018-06-01
Antibiotics have been used for the management of relatively few bacterial plant diseases and are largely restricted to high-value fruit crops because of the expense involved. Antibiotic resistance in plant-pathogenic bacteria has become a problem in pathosystems where these antibiotics have been used for many years. Where the genetic basis for resistance has been examined, antibiotic resistance in plant pathogens has most often evolved through the acquisition of a resistance determinant via horizontal gene transfer. For example, the strAB streptomycin-resistance genes occur in Erwinia amylovora, Pseudomonas syringae, and Xanthomonas campestris, and these genes have presumably been acquired from nonpathogenic epiphytic bacteria colocated on plant hosts under antibiotic selection. We currently lack knowledge of the effect of the microbiome of commensal organisms on the potential of plant pathogens to evolve antibiotic resistance. Such knowledge is critical to the development of robust resistance management strategies to ensure the safe and effective continued use of antibiotics in the management of critically important diseases. Expected final online publication date for the Annual Review of Phytopathology Volume 56 is August 25, 2018. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Versatile Genetic Tool Box for the Crenarchaeote Sulfolobus acidocaldarius
Wagner, Michaela; van Wolferen, Marleen; Wagner, Alexander; Lassak, Kerstin; Meyer, Benjamin H.; Reimann, Julia; Albers, Sonja-Verena
2012-01-01
For reverse genetic approaches inactivation or selective modification of genes are required to elucidate their putative function. Sulfolobus acidocaldarius is a thermoacidophilic Crenarchaeon which grows optimally at 76°C and pH 3. As many antibiotics do not withstand these conditions the development of a genetic system in this organism is dependent on auxotrophies. Therefore we constructed a pyrE deletion mutant of S. acidocaldarius wild type strain DSM639 missing 322 bp called MW001. Using this strain as the starting point, we describe here different methods using single as well as double crossover events to obtain markerless deletion mutants, tag genes genomically and ectopically integrate foreign DNA into MW001. These methods enable us to construct single, double, and triple deletions strains that can still be complemented with the pRN1 based expression vector. Taken together we have developed a versatile and robust genetic tool box for the crenarchaeote S. acidocaldarius that will promote the study of unknown gene functions in this organism and makes it a suitable host for synthetic biology approaches. PMID:22707949
Steiner, Heinz; Van Waes, Vincent; Marinelli, Michela
2009-01-01
Background There is growing use of psychostimulant cognitive enhancers such as methylphenidate (Ritalin). Methylphenidate differs from the psychostimulant cocaine because it does not enhance brain levels of serotonin. We investigated whether exposure to methylphenidate combined with a serotonin-enhancing medication, the prototypical selective serotonin reuptake inhibitor (SSRI) fluoxetine (Prozac), would produce more “cocaine-like” molecular and behavioral changes. Methods We measured the effects of fluoxetine on gene expression induced by the cognitive enhancer methylphenidate in the striatum and nucleus accumbens of rats, by in situ hybridization histochemistry. We also determined whether fluoxetine modified behavioral effects of methylphenidate. Results Fluoxetine robustly potentiated methylphenidate-induced expression of the transcription factors c-fos and zif 268 throughout the striatum and to some degree in the nucleus accumbens. Fluoxetine also enhanced methylphenidate-induced stereotypical behavior. Conclusions Both potentiated gene regulation in the striatum and the behavioral effects indicate that combining the SSRI fluoxetine with the cognitive enhancer methylphenidate mimics cocaine effects, consistent with an increased risk for substance use disorder. PMID:19931852
The impact of target site accessibility on the design of effective siRNAs.
Tafer, Hakim; Ameres, Stefan L; Obernosterer, Gregor; Gebeshuber, Christoph A; Schroeder, Renée; Martinez, Javier; Hofacker, Ivo L
2008-05-01
Small-interfering RNAs (siRNAs) assemble into RISC, the RNA-induced silencing complex, which cleaves complementary mRNAs. Despite their fluctuating efficacy, siRNAs are widely used to assess gene function. Although this limitation could be ascribed, in part, to variations in the assembly and activation of RISC, downstream events in the RNA interference (RNAi) pathway, such as target site accessibility, have so far not been investigated extensively. In this study we present a comprehensive analysis of target RNA structure effects on RNAi by computing the accessibility of the target site for interaction with the siRNA. Based on our observations, we developed a novel siRNA design tool, RNAxs, by combining known siRNA functionality criteria with target site accessibility. We calibrated our method on two data sets comprising 573 siRNAs for 38 genes, and tested it on an independent set of 360 siRNAs targeting four additional genes. Overall, RNAxs proves to be a robust siRNA selection tool that substantially improves the prediction of highly efficient siRNAs.
Robust patterning of gene expression based on internal coordinate system of cells.
Ogawa, Ken-ichiro; Miyake, Yoshihiro
2015-06-01
Cell-to-cell communication in multicellular organisms is established through the transmission of various kinds of chemical substances such as proteins. It is well known that gene expression triggered by a chemical substance in individuals has stable spatial patterns despite the individual differences in concentration patterns of the chemical substance. This fact reveals an important property of multicellular organisms called "robustness", which allows the organisms to generate their forms while maintaining proportion. Robustness has been conventionally accounted for by the stability of solutions of dynamical equations that represent a specific interaction network of chemical substances. However, any biological system is composed of autonomous elements. In general, an autonomous element does not merely accept information on the chemical substance from the environment; instead, it accepts the information based on its own criteria for reaction. Therefore, this phenomenon needs to be considered from the viewpoint of cells. Such a viewpoint is expected to allow the consideration of the autonomy of cells in multicellular organisms. This study aims to explain theoretically the robust patterning of gene expression from the viewpoint of cells. For this purpose, we introduced a new operator for transforming a state variable of a chemical substance from an external coordinate system to an internal coordinate system of each cell, which describes the observation of the chemical substance by cells. We then applied this operator to the simplest reaction-diffusion model of the chemical substance to investigate observation effects by cells. Our mathematical analysis of this extended model indicates that the robust patterning of gene expression against individual differences in concentration pattern of the chemical substance can be explained from the viewpoint of cells if there is a regulation field that compensates for the difference between cells seen in the observation results. This result provides a new insight into the investigation of the mechanism of robust patterning in biological systems composed of individual elements. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Buzdin, Anton; Sorokin, Maxim; Garazha, Andrew; Sekacheva, Marina; Kim, Ella; Zhukov, Nikolay; Wang, Ye; Li, Xinmin; Kar, Souvik; Hartmann, Christian; Samii, Amir; Giese, Alf; Borisov, Nicolas
2018-06-20
Anticancer target drugs (ATDs) specifically bind and inhibit molecular targets that play important roles in cancer development and progression, being deeply implicated in intracellular signaling pathways. To date, hundreds of different ATDs were approved for clinical use in the different countries. Compared to previous chemotherapy treatments, ATDs often demonstrate reduced side effects and increased efficiency, but also have higher costs. However, the efficiency of ATDs for the advanced stage tumors is still insufficient. Different ATDs have different mechanisms of action and are effective in different cohorts of patients. Personalized approaches are therefore needed to select the best ATD candidates for the individual patients. In this review, we focus on a new generation of biomarkers - molecular pathway activation - and on their applications for predicting individual tumor response to ATDs. The success in high throughput gene expression profiling and emergence of novel bioinformatic tools reinforced quick development of pathway related field of molecular biomedicine. The ability to quantitatively measure degree of a pathway activation using gene expression data has revolutionized this field and made the corresponding analysis quick, robust and inexpensive. This success was further enhanced by using machine learning algorithms for selection of the best biomarkers. We review here the current progress in translating these studies to clinical oncology and patient-oriented adjustment of cancer therapy. Copyright © 2018. Published by Elsevier Ltd.
Evolution of the Division of Labor between Genes and Enzymes in the RNA World
Boza, Gergely; Szilágyi, András; Kun, Ádám; Santos, Mauro; Szathmáry, Eörs
2014-01-01
The RNA world is a very likely interim stage of the evolution after the first replicators and before the advent of the genetic code and translated proteins. Ribozymes are known to be able to catalyze many reaction types, including cofactor-aided metabolic transformations. In a metabolically complex RNA world, early division of labor between genes and enzymes could have evolved, where the ribozymes would have been transcribed from the genes more often than the other way round, benefiting the encapsulating cells through this dosage effect. Here we show, by computer simulations of protocells harboring unlinked RNA replicators, that the origin of replicational asymmetry producing more ribozymes from a gene template than gene strands from a ribozyme template is feasible and robust. Enzymatic activities of the two modeled ribozymes are in trade-off with their replication rates, and the relative replication rates compared to those of complementary strands are evolvable traits of the ribozymes. The degree of trade-off is shown to have the strongest effect in favor of the division of labor. Although some asymmetry between gene and enzymatic strands could have evolved even in earlier, surface-bound systems, the shown mechanism in protocells seems inevitable and under strong positive selection. This could have preadapted the genetic system for transcription after the subsequent origin of chromosomes and DNA. PMID:25474573
Evolution of the division of labor between genes and enzymes in the RNA world.
Boza, Gergely; Szilágyi, András; Kun, Ádám; Santos, Mauro; Szathmáry, Eörs
2014-12-01
The RNA world is a very likely interim stage of the evolution after the first replicators and before the advent of the genetic code and translated proteins. Ribozymes are known to be able to catalyze many reaction types, including cofactor-aided metabolic transformations. In a metabolically complex RNA world, early division of labor between genes and enzymes could have evolved, where the ribozymes would have been transcribed from the genes more often than the other way round, benefiting the encapsulating cells through this dosage effect. Here we show, by computer simulations of protocells harboring unlinked RNA replicators, that the origin of replicational asymmetry producing more ribozymes from a gene template than gene strands from a ribozyme template is feasible and robust. Enzymatic activities of the two modeled ribozymes are in trade-off with their replication rates, and the relative replication rates compared to those of complementary strands are evolvable traits of the ribozymes. The degree of trade-off is shown to have the strongest effect in favor of the division of labor. Although some asymmetry between gene and enzymatic strands could have evolved even in earlier, surface-bound systems, the shown mechanism in protocells seems inevitable and under strong positive selection. This could have preadapted the genetic system for transcription after the subsequent origin of chromosomes and DNA.
Wan, Qiu-Hong; Pan, Sheng-Kai; Hu, Li; Zhu, Ying; Xu, Peng-Wei; Xia, Jin-Quan; Chen, Hui; He, Gen-Yun; He, Jing; Ni, Xiao-Wei; Hou, Hao-Long; Liao, Sheng-Guang; Yang, Hai-Qiong; Chen, Ying; Gao, Shu-Kun; Ge, Yun-Fa; Cao, Chang-Chang; Li, Peng-Fei; Fang, Li-Ming; Liao, Li; Zhang, Shu; Wang, Meng-Zhen; Dong, Wei; Fang, Sheng-Guo
2013-01-01
Crocodilians are diving reptiles that can hold their breath under water for long periods of time and are crepuscular animals with excellent sensory abilities. They comprise a sister lineage of birds and have no sex chromosome. Here we report the genome sequence of the endangered Chinese alligator (Alligator sinensis) and describe its unique features. The next-generation sequencing generated 314 Gb of raw sequence, yielding a genome size of 2.3 Gb. A total of 22 200 genes were predicted in Alligator sinensis using a de novo, homology- and RNA-based combined model. The genetic basis of long-diving behavior includes duplication of the bicarbonate-binding hemoglobin gene, co-functioning of routine phosphate-binding and special bicarbonate-binding oxygen transport, and positively selected energy metabolism, ammonium bicarbonate excretion and cardiac muscle contraction. Further, we elucidated the robust Alligator sinensis sensory system, including a significantly expanded olfactory receptor repertoire, rapidly evolving nerve-related cellular components and visual perception, and positive selection of the night vision-related opsin and sound detection-associated otopetrin. We also discovered a well-developed immune system with a considerable number of lineage-specific antigen-presentation genes for adaptive immunity as well as expansion of the tripartite motif-containing C-type lectin and butyrophilin genes for innate immunity and expression of antibacterial peptides. Multifluorescence in situ hybridization showed that alligator chromosome 3, which encodes DMRT1, exhibits significant synteny with chicken chromosome Z. Finally, population history analysis indicated population admixture 0.60-1.05 million years ago, when the Qinghai-Tibetan Plateau was uplifted. PMID:23917531
Vanhoutte, Kurt; de Asmundis, Carlo; Francesconi, Anna; Figysl, Jurgen; Steurs, Griet; Boussy, Tim; Roos, Markus; Mueller, Andreas; Massimo, Lucio; Paparella, Gaetano; Van Caelenberg, Kristien; Chierchia, Gian Battista; Sarkozy, Andrea; Terradellas, Pedro Brugada Y; Zizi, Martin
2009-01-01
Atrial fibrillation (AF) is a frequent chronic dysrythmia with an incidence that increases with age (>40). Because of its medical and socio-economic impacts it is expected to become an increasing burden on most health care systems. AF is a multi-factorial disease for which the identification of subtypes is warranted. Novel approaches based on the broad concepts of systems biology may overcome the blurred notion of normal and pathological phenotype, which is inherent to high throughput molecular arrays analysis. Here we apply an internal contrast algorithm on AF patient data with an analytical focus on potential entry pathways into the disease. We used a RMA (Robust Multichip Average) normalized Affymetrix micro-array data set from 10 AF patients (geo_accession #GSE2240). Four series of probes were selected based on physiopathogenic links with AF entryways: apoptosis (remodeling), MAP kinase (cell remodeling), OXPHOS (ability to sustain hemodynamic workload) and glycolysis (ischemia). Annotated probe lists were polled with Bioconductor packages in R (version 2.7.1). Genetic profile contrasts were analysed with hierarchical clustering and principal component analysis. The analysis revealed distinct patient groups for all probe sets. A substantial part (54% till 67%) of the variance is explained in the first 2 principal components. Genes in PC1/2 with high discriminatory value were selected and analyzed in detail. We aim for reliable molecular stratification of AF. We show that stratification is possible based on physiologically relevant gene sets. Genes with high contrast value are likely to give pathophysiological insight into permanent AF subtypes.
Wan, Qiu-Hong; Pan, Sheng-Kai; Hu, Li; Zhu, Ying; Xu, Peng-Wei; Xia, Jin-Quan; Chen, Hui; He, Gen-Yun; He, Jing; Ni, Xiao-Wei; Hou, Hao-Long; Liao, Sheng-Guang; Yang, Hai-Qiong; Chen, Ying; Gao, Shu-Kun; Ge, Yun-Fa; Cao, Chang-Chang; Li, Peng-Fei; Fang, Li-Ming; Liao, Li; Zhang, Shu; Wang, Meng-Zhen; Dong, Wei; Fang, Sheng-Guo
2013-09-01
Crocodilians are diving reptiles that can hold their breath under water for long periods of time and are crepuscular animals with excellent sensory abilities. They comprise a sister lineage of birds and have no sex chromosome. Here we report the genome sequence of the endangered Chinese alligator (Alligator sinensis) and describe its unique features. The next-generation sequencing generated 314 Gb of raw sequence, yielding a genome size of 2.3 Gb. A total of 22 200 genes were predicted in Alligator sinensis using a de novo, homology- and RNA-based combined model. The genetic basis of long-diving behavior includes duplication of the bicarbonate-binding hemoglobin gene, co-functioning of routine phosphate-binding and special bicarbonate-binding oxygen transport, and positively selected energy metabolism, ammonium bicarbonate excretion and cardiac muscle contraction. Further, we elucidated the robust Alligator sinensis sensory system, including a significantly expanded olfactory receptor repertoire, rapidly evolving nerve-related cellular components and visual perception, and positive selection of the night vision-related opsin and sound detection-associated otopetrin. We also discovered a well-developed immune system with a considerable number of lineage-specific antigen-presentation genes for adaptive immunity as well as expansion of the tripartite motif-containing C-type lectin and butyrophilin genes for innate immunity and expression of antibacterial peptides. Multifluorescence in situ hybridization showed that alligator chromosome 3, which encodes DMRT1, exhibits significant synteny with chicken chromosome Z. Finally, population history analysis indicated population admixture 0.60-1.05 million years ago, when the Qinghai-Tibetan Plateau was uplifted.
He, Fei; Fromion, Vincent; Westerhoff, Hans V
2013-11-21
Metabolic control analysis (MCA) and supply-demand theory have led to appreciable understanding of the systems properties of metabolic networks that are subject exclusively to metabolic regulation. Supply-demand theory has not yet considered gene-expression regulation explicitly whilst a variant of MCA, i.e. Hierarchical Control Analysis (HCA), has done so. Existing analyses based on control engineering approaches have not been very explicit about whether metabolic or gene-expression regulation would be involved, but designed different ways in which regulation could be organized, with the potential of causing adaptation to be perfect. This study integrates control engineering and classical MCA augmented with supply-demand theory and HCA. Because gene-expression regulation involves time integration, it is identified as a natural instantiation of the 'integral control' (or near integral control) known in control engineering. This study then focuses on robustness against and adaptation to perturbations of process activities in the network, which could result from environmental perturbations, mutations or slow noise. It is shown however that this type of 'integral control' should rarely be expected to lead to the 'perfect adaptation': although the gene-expression regulation increases the robustness of important metabolite concentrations, it rarely makes them infinitely robust. For perfect adaptation to occur, the protein degradation reactions should be zero order in the concentration of the protein, which may be rare biologically for cells growing steadily. A proposed new framework integrating the methodologies of control engineering and metabolic and hierarchical control analysis, improves the understanding of biological systems that are regulated both metabolically and by gene expression. In particular, the new approach enables one to address the issue whether the intracellular biochemical networks that have been and are being identified by genomics and systems biology, correspond to the 'perfect' regulatory structures designed by control engineering vis-à-vis optimal functions such as robustness. To the extent that they are not, the analyses suggest how they may become so and this in turn should facilitate synthetic biology and metabolic engineering.
2013-01-01
Background Metabolic control analysis (MCA) and supply–demand theory have led to appreciable understanding of the systems properties of metabolic networks that are subject exclusively to metabolic regulation. Supply–demand theory has not yet considered gene-expression regulation explicitly whilst a variant of MCA, i.e. Hierarchical Control Analysis (HCA), has done so. Existing analyses based on control engineering approaches have not been very explicit about whether metabolic or gene-expression regulation would be involved, but designed different ways in which regulation could be organized, with the potential of causing adaptation to be perfect. Results This study integrates control engineering and classical MCA augmented with supply–demand theory and HCA. Because gene-expression regulation involves time integration, it is identified as a natural instantiation of the ‘integral control’ (or near integral control) known in control engineering. This study then focuses on robustness against and adaptation to perturbations of process activities in the network, which could result from environmental perturbations, mutations or slow noise. It is shown however that this type of ‘integral control’ should rarely be expected to lead to the ‘perfect adaptation’: although the gene-expression regulation increases the robustness of important metabolite concentrations, it rarely makes them infinitely robust. For perfect adaptation to occur, the protein degradation reactions should be zero order in the concentration of the protein, which may be rare biologically for cells growing steadily. Conclusions A proposed new framework integrating the methodologies of control engineering and metabolic and hierarchical control analysis, improves the understanding of biological systems that are regulated both metabolically and by gene expression. In particular, the new approach enables one to address the issue whether the intracellular biochemical networks that have been and are being identified by genomics and systems biology, correspond to the ‘perfect’ regulatory structures designed by control engineering vis-à-vis optimal functions such as robustness. To the extent that they are not, the analyses suggest how they may become so and this in turn should facilitate synthetic biology and metabolic engineering. PMID:24261908
Efficient robust doubly adaptive regularized regression with applications.
Karunamuni, Rohana J; Kong, Linglong; Tu, Wei
2018-01-01
We consider the problem of estimation and variable selection for general linear regression models. Regularized regression procedures have been widely used for variable selection, but most existing methods perform poorly in the presence of outliers. We construct a new penalized procedure that simultaneously attains full efficiency and maximum robustness. Furthermore, the proposed procedure satisfies the oracle properties. The new procedure is designed to achieve sparse and robust solutions by imposing adaptive weights on both the decision loss and the penalty function. The proposed method of estimation and variable selection attains full efficiency when the model is correct and, at the same time, achieves maximum robustness when outliers are present. We examine the robustness properties using the finite-sample breakdown point and an influence function. We show that the proposed estimator attains the maximum breakdown point. Furthermore, there is no loss in efficiency when there are no outliers or the error distribution is normal. For practical implementation of the proposed method, we present a computational algorithm. We examine the finite-sample and robustness properties using Monte Carlo studies. Two datasets are also analyzed.
Predicting effects of structural stress in a genome-reduced model bacterial metabolism
NASA Astrophysics Data System (ADS)
Güell, Oriol; Sagués, Francesc; Serrano, M. Ángeles
2012-08-01
Mycoplasma pneumoniae is a human pathogen recently proposed as a genome-reduced model for bacterial systems biology. Here, we study the response of its metabolic network to different forms of structural stress, including removal of individual and pairs of reactions and knockout of genes and clusters of co-expressed genes. Our results reveal a network architecture as robust as that of other model bacteria regarding multiple failures, although less robust against individual reaction inactivation. Interestingly, metabolite motifs associated to reactions can predict the propagation of inactivation cascades and damage amplification effects arising in double knockouts. We also detect a significant correlation between gene essentiality and damages produced by single gene knockouts, and find that genes controlling high-damage reactions tend to be expressed independently of each other, a functional switch mechanism that, simultaneously, acts as a genetic firewall to protect metabolism. Prediction of failure propagation is crucial for metabolic engineering or disease treatment.
Geographical Genomics of Human Leukocyte Gene Expression Variation in Southern Morocco
Idaghdour, Youssef; Czika, Wendy; Shianna, Kevin V.; Lee, S. Hong; Visscher, Peter M.; Martin, Hilary C.; Miclaus, Kelci; Jadallah, Sami J.; Goldstein, David B.; Wolfinger, Russell D.; Gibson, Greg
2009-01-01
Studies of the genetics of gene expression reveal expression SNPs that explain variation in transcript abundance. Here we address the robustness of eSNP associations to environmental geography and population structure in a comparison of 194 Arab and Amazigh individuals from a city and two villages in southern Morocco. Gene expression differed between pairs of locations for up to a third of all transcripts, with notable enrichment for ribosomal biosynthesis and oxidative phosphorylation. Robust associations were observed in the leukocyte samples with cis-eSNPs (P < 10−08) for 346 genes, and trans-eSNPs (P < 10−11) with 10 genes. All of these were consistent across the three sample locations and after controlling for ethnicity and relatedness. No evidence for large-effect trans-acting mediators of the pervasive environmental influence was found and instead genetic and environmental factors acted in a largely additive manner. PMID:19966804
From Cellular Attractor Selection to Adaptive Signal Control for Traffic Networks
Tian, Daxin; Zhou, Jianshan; Sheng, Zhengguo; Wang, Yunpeng; Ma, Jianming
2016-01-01
The management of varying traffic flows essentially depends on signal controls at intersections. However, design an optimal control that considers the dynamic nature of a traffic network and coordinates all intersections simultaneously in a centralized manner is computationally challenging. Inspired by the stable gene expressions of Escherichia coli in response to environmental changes, we explore the robustness and adaptability performance of signalized intersections by incorporating a biological mechanism in their control policies, specifically, the evolution of each intersection is induced by the dynamics governing an adaptive attractor selection in cells. We employ a mathematical model to capture such biological attractor selection and derive a generic, adaptive and distributed control algorithm which is capable of dynamically adapting signal operations for the entire dynamical traffic network. We show that the proposed scheme based on attractor selection can not only promote the balance of traffic loads on each link of the network but also allows the global network to accommodate dynamical traffic demands. Our work demonstrates the potential of bio-inspired intelligence emerging from cells and provides a deep understanding of adaptive attractor selection-based control formation that is useful to support the designs of adaptive optimization and control in other domains. PMID:26972968
From Cellular Attractor Selection to Adaptive Signal Control for Traffic Networks.
Tian, Daxin; Zhou, Jianshan; Sheng, Zhengguo; Wang, Yunpeng; Ma, Jianming
2016-03-14
The management of varying traffic flows essentially depends on signal controls at intersections. However, design an optimal control that considers the dynamic nature of a traffic network and coordinates all intersections simultaneously in a centralized manner is computationally challenging. Inspired by the stable gene expressions of Escherichia coli in response to environmental changes, we explore the robustness and adaptability performance of signalized intersections by incorporating a biological mechanism in their control policies, specifically, the evolution of each intersection is induced by the dynamics governing an adaptive attractor selection in cells. We employ a mathematical model to capture such biological attractor selection and derive a generic, adaptive and distributed control algorithm which is capable of dynamically adapting signal operations for the entire dynamical traffic network. We show that the proposed scheme based on attractor selection can not only promote the balance of traffic loads on each link of the network but also allows the global network to accommodate dynamical traffic demands. Our work demonstrates the potential of bio-inspired intelligence emerging from cells and provides a deep understanding of adaptive attractor selection-based control formation that is useful to support the designs of adaptive optimization and control in other domains.
Paten, A M; Pain, S J; Peterson, S W; Blair, H T; Kenyon, P R; Dearden, P K; Duncan, E J
2014-08-01
The mammary gland is a complex tissue consisting of multiple cell types which, over the lifetime of an animal, go through repeated cycles of development associated with pregnancy, lactation and involution. The mammary gland is also known to be sensitive to maternal programming by environmental stimuli such as nutrition. The molecular basis of these adaptations is of significant interest, but requires robust methods to measure gene expression. Reverse-transcription quantitative PCR (RT-qPCR) is commonly used to measure gene expression, and is currently the method of choice for validating genome-wide expression studies. RT-qPCR requires the selection of reference genes that are stably expressed over physiological states and treatments. In this study we identify suitable reference genes to normalize RT-qPCR data for the ovine mammary gland in two physiological states; late pregnancy and lactation. Biopsies were collected from offspring of ewes that had been subjected to different nutritional paradigms during pregnancy to examine effects of maternal programming on the mammary gland of the offspring. We evaluated eight candidate reference genes and found that two reference genes (PRPF3 and CUL1) are required for normalising RT-qPCR data from pooled RNA samples, but five reference genes are required for analyzing gene expression in individual animals (SENP2, EIF6, MRPL39, ATP1A1, CUL1). Using these stable reference genes, we showed that TET1, a key regulator of DNA methylation, is responsive to maternal programming and physiological state. The identification of these novel reference genes will be of utility to future studies of gene expression in the ovine mammary gland. Copyright © 2014 the American Physiological Society.
NASA Astrophysics Data System (ADS)
Gliddon, H. D.; Howes, P. D.; Kaforou, M.; Levin, M.; Stevens, M. M.
2016-05-01
The development of rapid, robust and high performance point-of-care diagnostics relies on the advancement and combination of various areas of research. We have developed an assay for the detection of multiple mRNA molecules that combines DNA nanotechnology with fluorescent nanomaterials. The core switching mechanism is toehold-mediated strand displacement. We have used fluorescent quantum dots (QDs) as signal transducers in this assay, as they bring many benefits including bright fluorescence and multiplexing abilities. The resulting assay is capable of multiplexed detection of long RNA targets against a high concentration of background non-target RNA, with high sensitivity and specificity and limits of detection in the nanomolar range using only a standard laboratory plate reader. We demonstrate the utility of our QD-based system for the detection of two genes selected from a microarray-derived tuberculosis-specific gene expression signature. Levels of up- and downregulated gene transcripts comprising this signature can be combined to give a disease risk score, making the signature more amenable for use as a diagnostic marker. Our QD-based approach to detect these transcripts could pave the way for novel diagnostic assays for tuberculosis.The development of rapid, robust and high performance point-of-care diagnostics relies on the advancement and combination of various areas of research. We have developed an assay for the detection of multiple mRNA molecules that combines DNA nanotechnology with fluorescent nanomaterials. The core switching mechanism is toehold-mediated strand displacement. We have used fluorescent quantum dots (QDs) as signal transducers in this assay, as they bring many benefits including bright fluorescence and multiplexing abilities. The resulting assay is capable of multiplexed detection of long RNA targets against a high concentration of background non-target RNA, with high sensitivity and specificity and limits of detection in the nanomolar range using only a standard laboratory plate reader. We demonstrate the utility of our QD-based system for the detection of two genes selected from a microarray-derived tuberculosis-specific gene expression signature. Levels of up- and downregulated gene transcripts comprising this signature can be combined to give a disease risk score, making the signature more amenable for use as a diagnostic marker. Our QD-based approach to detect these transcripts could pave the way for novel diagnostic assays for tuberculosis. Electronic supplementary information (ESI) available: Base pair mismatch tuning of CProbes. Binding capacity of the QDs. Theoretical limit of detection (LOD) for the monoplex systems. Kinetics of strand displacement. Kinetics of QProbe-CProbe binding. LOD and saturation point calculations. See DOI: 10.1039/c6nr00484a
2014-01-01
Background Gene expression analysis using quantitative reverse transcription PCR (qRT-PCR) is a robust method wherein the expression levels of target genes are normalised using internal control genes, known as reference genes, to derive changes in gene expression levels. Although reference genes have recently been suggested for olive tissues, combined/independent analysis on different cultivars has not yet been tested. Therefore, an assessment of reference genes was required to validate the recent findings and select stably expressed genes across different olive cultivars. Results A total of eight candidate reference genes [glyceraldehyde 3-phosphate dehydrogenase (GAPDH), serine/threonine-protein phosphatase catalytic subunit (PP2A), elongation factor 1 alpha (EF1-alpha), polyubiquitin (OUB2), aquaporin tonoplast intrinsic protein (TIP2), tubulin alpha (TUBA), 60S ribosomal protein L18-3 (60S RBP L18-3) and polypyrimidine tract-binding protein homolog 3 (PTB)] were chosen based on their stability in olive tissues as well as in other plants. Expression stability was examined by qRT-PCR across 12 biological samples, representing mesocarp tissues at various developmental stages in three different olive cultivars, Barnea, Frantoio and Picual, independently and together during the 2009 season with two software programs, GeNorm and BestKeeper. Both software packages identified GAPDH, EF1-alpha and PP2A as the three most stable reference genes across the three cultivars and in the cultivar, Barnea. GAPDH, EF1-alpha and 60S RBP L18-3 were found to be most stable reference genes in the cultivar Frantoio while 60S RBP L18-3, OUB2 and PP2A were found to be most stable reference genes in the cultivar Picual. Conclusions The analyses of expression stability of reference genes using qRT-PCR revealed that GAPDH, EF1-alpha, PP2A, 60S RBP L18-3 and OUB2 are suitable reference genes for expression analysis in developing Olea europaea mesocarp tissues, displaying the highest level of expression stability across three different olive cultivars, Barnea, Frantoio and Picual, however the combination of the three most stable reference genes do vary amongst individual cultivars. This study will provide guidance to other researchers to select reference genes for normalization against target genes by qPCR across tissues obtained from the mesocarp region of the olive fruit in the cultivars, Barnea, Frantoio and Picual. PMID:24884716
Mazaki-Tovi, Shali; Tarca, Adi L; Vaisbuch, Edi; Kusanovic, Juan Pedro; Than, Nandor Gabor; Chaiworapongsa, Tinnakorn; Dong, Zhong; Hassan, Sonia S; Romero, Roberto
2016-10-01
The aim of this study was to determine gene expression and splicing changes associated with parturition and regions (visceral vs. subcutaneous) of the adipose tissue of pregnant women. The transcriptome of visceral and abdominal subcutaneous adipose tissue from pregnant women at term with (n=15) and without (n=25) spontaneous labor was profiled with the Affymetrix GeneChip Human Exon 1.0 ST array. Overall gene expression changes and the differential exon usage rate were compared between patient groups (unpaired analyses) and adipose tissue regions (paired analyses). Selected genes were tested by quantitative reverse transcription-polymerase chain reaction. Four hundred and eighty-two genes were differentially expressed between visceral and subcutaneous fat of pregnant women with spontaneous labor at term (q-value <0.1; fold change >1.5). Biological processes enriched in this comparison included tissue and vasculature development as well as inflammatory and metabolic pathways. Differential splicing was found for 42 genes [q-value <0.1; differences in Finding Isoforms using Robust Multichip Analysis scores >2] between adipose tissue regions of women not in labor. Differential exon usage associated with parturition was found for three genes (LIMS1, HSPA5, and GSTK1) in subcutaneous tissues. We show for the first time evidence of implication of mRNA splicing and processing machinery in the subcutaneous adipose tissue of women in labor compared to those without labor.
Conn, P Jeffrey; Yohn, Samantha; Stansley, Branden; Foster, Dan; Plumley, Hyekyung; Lindsley, Craig
2018-01-01
Abstract Background A large number of clinical and preclinical studies suggest that dysfunction at synapses for the excitatory neurotransmitter glutamate may play a critical role in the pathophysiological changes that underlie each of the major symptom clusters observed in schizophrenia patients. Interestingly, recent genetic studies identified multiple nonsynonymous single nucleotide polymorphisms (SNPs) in the human genes encoding two specific subtypes of metabotropic glutamate (mGlu) receptors that are associated with schizophrenia. These include GRM1 and GRM3, the genes encoding for the mGlu1 and mGlu3 receptor subtypes respectively. Furthermore, postmortem studies suggest that expression of these mGlu receptor subtypes is altered brains of schizophrenia patients compared to controls. Mutations in GRM1 were identified a range of schizophrenia patients, whereas SNPs in the human gene encoding mGlu3 (GRM3) are selectively associated with poor performance on cognitive tests that are dependent on function of the prefrontal cortex (PFC) and hippocampus. These studies raise the possibility that disrupted signaling of mGlu1 and/or mGlu3 could contribute to the symptoms of schizophrenia and that selective modulators of these receptors could provide a novel approach to treatment of this disorder. Methods Wild-type and mutant forms of mGlu receptors were expressed in cell lines and used for discovery and optimization of highly selective positive allosteric modulators (PAMs) of mGlu1 and mGlu3. Optimized mGlu1 and mGlu3 PAMs were then used along with mouse genetic studies to evaluate the roles of these receptors in specific basal ganglia and forebrain circuits that have been implicated in schizophrenia. Finally, these compounds were used in animal models to assess potential efficacy in rodent models that are relevant for reducing positive, negative, and cognitive symptoms that are observed in schizophrenia patients. Results GRM1 mutations associated with schizophrenia were found to reduce mGlu1 signaling, suggesting that loss of function of this receptor could contribute to symptoms associated with schizophrenia. Furthermore, we found that highly selective mGlu1 PAMs reverse deficits in mGlu1 signaling observed in these mutant receptors, induced a profound reduction in dopamine release in striatal areas implicated in schizophrenia, and have robust antipsychotic-like effects that are mediated by localized inhibition of dopamine release in striatum. In contrast to existing antipsychotic medications, selective mGlu1 PAMs also improve motivation and reduce anhedonia in animal models. Interestingly, selective mGlu3 PAMs have multiple effects in the prefrontal cortex and hippocampus that would be expected to improve cognitive function. Consistent with this, highly selective mGlu3 PAMs have robust cognition-enhancing effects in rodent models that are relevant for the cognitive deficits observed in schizophrenia patients. Discussion These studies provide exciting new evidence that highly selective activators of two glutamate receptors identified in human genetic studies have potential utility in treatment of positive (mGLu1), negative (mGlu1), and cognitive (mGlu3) symptoms of schizophrenia patients. Furthermore, the novel mGlu1 and mGlu3 PAMs discovered in these studies provide excellent drug leads for further optimization and ultimate clinical testing.
Robust support vector regression networks for function approximation with outliers.
Chuang, Chen-Chia; Su, Shun-Feng; Jeng, Jin-Tsong; Hsiao, Chih-Ching
2002-01-01
Support vector regression (SVR) employs the support vector machine (SVM) to tackle problems of function approximation and regression estimation. SVR has been shown to have good robust properties against noise. When the parameters used in SVR are improperly selected, overfitting phenomena may still occur. However, the selection of various parameters is not straightforward. Besides, in SVR, outliers may also possibly be taken as support vectors. Such an inclusion of outliers in support vectors may lead to seriously overfitting phenomena. In this paper, a novel regression approach, termed as the robust support vector regression (RSVR) network, is proposed to enhance the robust capability of SVR. In the approach, traditional robust learning approaches are employed to improve the learning performance for any selected parameters. From the simulation results, our RSVR can always improve the performance of the learned systems for all cases. Besides, it can be found that even the training lasted for a long period, the testing errors would not go up. In other words, the overfitting phenomenon is indeed suppressed.
Almeida, Daniela; Maldonado, Emanuel; Vasconcelos, Vitor; Antunes, Agostinho
2015-01-01
Mitochondrial protein-coding genes (mt genes) encode subunits forming complexes of crucial cellular pathways, including those involved in the vital process of oxidative phosphorylation (OXPHOS). Despite the vital role of the mitochondrial genome (mt genome) in the survival of organisms, little is known with respect to its adaptive implications within marine invertebrates. The molluscan Class Cephalopoda is represented by a marine group of species known to occupy contrasting environments ranging from the intertidal to the deep sea, having distinct metabolic requirements, varied body shapes and highly advanced visual and nervous systems that make them highly competitive and successful worldwide predators. Thus, cephalopods are valuable models for testing natural selection acting on their mitochondrial subunits (mt subunits). Here, we used concatenated mt genes from 17 fully sequenced mt genomes of diverse cephalopod species to generate a robust mitochondrial phylogeny for the Class Cephalopoda. We followed an integrative approach considering several branches of interest–covering cephalopods with distinct morphologies, metabolic rates and habitats–to identify sites under positive selection and localize them in the respective protein alignment and/or tridimensional structure of the mt subunits. Our results revealed significant adaptive variation in several mt subunits involved in the energy production pathway of cephalopods: ND5 and ND6 from Complex I, CYTB from Complex III, COX2 and COX3 from Complex IV, and in ATP8 from Complex V. Furthermore, we identified relevant sites involved in protein-interactions, lining proton translocation channels, as well as disease/deficiencies related sites in the aforementioned complexes. A particular case, revealed by this study, is the involvement of some positively selected sites, found in Octopoda lineage in lining proton translocation channels (site 74 from ND5) and in interactions between subunits (site 507 from ND5) of Complex I. PMID:26285039
Selective Targeting of CTNNB1-, KRAS- or MYC-Driven Cell Growth by Combinations of Existing Drugs
Uitdehaag, Joost C. M.; de Roos, Jeroen A. D. M.; van Doornmalen, Antoon M.; Prinsen, Martine B. W.; Spijkers-Hagelstein, Jill A. P.; de Vetter, Judith R. F.; de Man, Jos; Buijsman, Rogier C.; Zaman, Guido J. R.
2015-01-01
The aim of combination drug treatment in cancer therapy is to improve response rate and to decrease the probability of the development of drug resistance. Preferably, drug combinations are synergistic rather than additive, and, ideally, drug combinations work synergistically only in cancer cells and not in non-malignant cells. We have developed a workflow to identify such targeted synergies, and applied this approach to selectively inhibit the proliferation of cell lines with mutations in genes that are difficult to modulate with small molecules. The approach is based on curve shift analysis, which we demonstrate is a more robust method of determining synergy than combination matrix screening with Bliss-scoring. We show that the MEK inhibitor trametinib is more synergistic in combination with the BRAF inhibitor dabrafenib than with vemurafenib, another BRAF inhibitor. In addition, we show that the combination of MEK and BRAF inhibitors is synergistic in BRAF-mutant melanoma cells, and additive or antagonistic in, respectively, BRAF-wild type melanoma cells and non-malignant fibroblasts. This combination exemplifies that synergistic action of drugs can depend on cancer genotype. Next, we used curve shift analysis to identify new drug combinations that specifically inhibit cancer cell proliferation driven by difficult-to-drug cancer genes. Combination studies were performed with compounds that as single agents showed preference for inhibition of cancer cells with mutations in either the CTNNB1 gene (coding for β-catenin), KRAS, or cancer cells expressing increased copy numbers of MYC. We demonstrate that the Wnt-pathway inhibitor ICG-001 and trametinib acted synergistically in Wnt-pathway-mutant cell lines. The ERBB2 inhibitor TAK-165 was synergistic with trametinib in KRAS-mutant cell lines. The EGFR/ERBB2 inhibitor neratinib acted synergistically with the spindle poison docetaxel and with the Aurora kinase inhibitor GSK-1070916 in cell lines with MYC amplification. Our approach can therefore efficiently discover novel drug combinations that selectively target cancer genes. PMID:26018524
Fallico, V; Ross, R P; Fitzgerald, G F; McAuliffe, O
2012-07-01
A collection of 17 natural lactococcal isolates from raw milk cheeses were studied in terms of their plasmid distribution, content, and diversity. All strains in the collection harbored an abundance of plasmids, including Lactococcus lactis ssp. cremoris DPC3758, whose 8-plasmid complement was selected for sequencing. The complete sequences of pAF22 (22,388 kb), pAF14 (14,419 kb), pAF12 (12,067 kb), pAF07 (7,435 kb), and pAF04 (3,801 kb) were obtained, whereas gene functions of technological interest were mapped to pAF65 (65 kb) and pAF45 (45 kb) by PCR. The plasmids of L. lactis DPC3758 were found to encode many genes with the potential to improve the technological properties of dairy starters. These included 3 anti-phage restriction/modification (R/M) systems (1 of type I and 2 of type II) and genes for immunity/resistance to nisin, lacticin 481, cadmium, and copper. Regions encoding conjugative/mobilization functions were present in 6 of the 8 plasmids, including those containing the R/M systems, thus enabling the food-grade transfer of these mechanisms to industrial strains. Using cadmium selection, the sequential stacking of the R/M plasmids into a plasmid-free host provided the recipient with increased protection against 936- and c2-type phages. The association of food-grade selectable markers and mobilization functions on L. lactis DPC3758 plasmids will facilitate their exploitation to obtain industrial strains with enhanced phage protection and robustness. These natural plasmids also provide another example of the major role of plasmids in contributing to host fitness and preservation within its ecological niche. Copyright © 2012 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Guerra, Susana; López-Fernández, Luis A.; Conde, Raquel; Pascual-Montano, Alberto; Harshman, Keith; Esteban, Mariano
2004-01-01
The potential use of the modified vaccinia virus Ankara (MVA) strain as a live recombinant vector to deliver antigens and elicit protective immune responses against infectious diseases demands a comprehensive understanding of the effect of MVA infection on human host gene expression. We used microarrays containing more than 15,000 human cDNAs to identify gene expression changes in human HeLa cell cultures at 2, 6, and 16 h postinfection. Clustering of the 410 differentially regulated genes identified 11 discrete gene clusters with altered expression patterns after MVA infection. Clusters 1 and 2 (accounting for 16.59% [68 of 410] of the genes) contained 68 transcripts showing a robust induction pattern that was maintained during the course of infection. Changes in cellular gene transcription detected by microarrays after MVA infection were confirmed for selected genes by Northern blot analysis and by real-time reverse transcription-PCR. Upregulated transcripts in clusters 1 and 2 included 20 genes implicated in immune responses, including interleukin 1A (IL-1A), IL-6, IL-7, IL-8, and IL-15 genes. MVA infection also stimulated the expression of NF-κB and components of the NF-κB signal transduction pathway, including p50 and TRAF-interacting protein. A marked increase in the expression of histone family members was also induced during MVA infection. Expression of the Wiskott-Aldrich syndrome family members WAS, WASF1, and the small GTP-binding protein RAC-1, which are involved in actin cytoskeleton reorganization, was enhanced after MVA infection. This study demonstrates that MVA infection triggered the induction of groups of genes, some of which may be involved in host resistance and immune modulation during virus infection. PMID:15140980
Maize GO annotation—methods, evaluation, and review (maize-GAMER)
USDA-ARS?s Scientific Manuscript database
We created a new high-coverage, robust, and reproducible functional annotation of maize protein-coding genes based on Gene Ontology (GO) term assignments. Whereas the existing Phytozome and Gramene maize GO annotation sets only cover 41% and 56% of maize protein-coding genes, respectively, this stu...
Canales, Javier; Moyano, Tomás C.; Villarroel, Eva; Gutiérrez, Rodrigo A.
2014-01-01
Nitrogen (N) is an essential macronutrient for plant growth and development. Plants adapt to changes in N availability partly by changes in global gene expression. We integrated publicly available root microarray data under contrasting nitrate conditions to identify new genes and functions important for adaptive nitrate responses in Arabidopsis thaliana roots. Overall, more than 2000 genes exhibited changes in expression in response to nitrate treatments in Arabidopsis thaliana root organs. Global regulation of gene expression by nitrate depends largely on the experimental context. However, despite significant differences from experiment to experiment in the identity of regulated genes, there is a robust nitrate response of specific biological functions. Integrative gene network analysis uncovered relationships between nitrate-responsive genes and 11 highly co-expressed gene clusters (modules). Four of these gene network modules have robust nitrate responsive functions such as transport, signaling, and metabolism. Network analysis hypothesized G2-like transcription factors are key regulatory factors controlling transport and signaling functions. Our meta-analysis highlights the role of biological processes not studied before in the context of the nitrate response such as root hair development and provides testable hypothesis to advance our understanding of nitrate responses in plants. PMID:24570678
Grinchuk, Oleg V; Yenamandra, Surya P; Iyer, Ramakrishnan; Singh, Malay; Lee, Hwee Kuan; Lim, Kiat Hon; Chow, Pierce Kah-Hoe; Kuznetsov, Vladamir A
2018-01-01
Currently, molecular markers are not used when determining the prognosis and treatment strategy for patients with hepatocellular carcinoma (HCC). In the present study, we proposed that the identification of common pro-oncogenic pathways in primary tumors (PT) and adjacent non-malignant tissues (AT) typically used to predict HCC patient risks may result in HCC biomarker discovery. We examined the genome-wide mRNA expression profiles of paired PT and AT samples from 321 HCC patients. The workflow integrated differentially expressed gene selection, gene ontology enrichment, computational classification, survival predictions, image analysis and experimental validation methods. We developed a 24-ribosomal gene-based HCC classifier (RGC), which is prognostically significant in both PT and AT. The RGC gene overexpression in PT was associated with a poor prognosis in the training (hazard ratio = 8.2, P = 9.4 × 10 -6 ) and cross-cohort validation (hazard ratio = 2.63, P = 0.004) datasets. The multivariate survival analysis demonstrated the significant and independent prognostic value of the RGC. The RGC displayed a significant prognostic value in AT of the training (hazard ratio = 5.0, P = 0.03) and cross-validation (hazard ratio = 1.9, P = 0.03) HCC groups, confirming the accuracy and robustness of the RGC. Our experimental and bioinformatics analyses suggested a key role for c-MYC in the pro-oncogenic pattern of ribosomal biogenesis co-regulation in PT and AT. Microarray, quantitative RT-PCR and quantitative immunohistochemical studies of the PT showed that DKK1 in PT is the perspective biomarker for poor HCC outcomes. The common co-transcriptional pattern of ribosome biogenesis genes in PT and AT from HCC patients suggests a new scalable prognostic system, as supported by the model of tumor-like metabolic redirection/assimilation in non-malignant AT. The RGC, comprising 24 ribosomal genes, is introduced as a robust and reproducible prognostic model for stratifying HCC patient risks. The adjacent non-malignant liver tissue alone, or in combination with HCC tissue biopsy, could be an important target for developing predictive and monitoring strategies, as well as evidence-based therapeutic interventions, that aim to reduce the risk of post-surgery relapse in HCC patients. © 2017 The Authors. Published by FEBS Press and John Wiley & Sons Ltd.
A Graphical Model of Smoking-Induced Global Instability in Lung Cancer.
Wang, Yanbo; Qian, Weikang; Yuan, Bo
2018-01-01
Smoking is the major cause of lung cancer and the leading cause of cancer-related death in the world. The most current view about lung cancer is no longer limited to individual genes being mutated by any carcinogenic insults from smoking. Instead, tumorigenesis is a phenotype conferred by many systematic and global alterations, leading to extensive heterogeneity and variation for both the genotypes and phenotypes of individual cancer cells. Thus, strategically it is foremost important to develop a methodology to capture any consistent and global alterations presumably shared by most of the cancerous cells for a given population. This is particularly true that almost all of the data collected from solid cancers (including lung cancers) are usually distant apart over a large span of temporal or even spatial contexts. Here, we report a multiple non-Gaussian graphical model to reconstruct the gene interaction network using two previously published gene expression datasets. Our graphical model aims to selectively detect gross structural changes at the level of gene interaction networks. Our methodology is extensively validated, demonstrating good robustness, as well as the selectivity and specificity expected based on our biological insights. In summary, gene regulatory networks are still relatively stable during presumably the early stage of neoplastic transformation. But drastic structural differences can be found between lung cancer and its normal control, including the gain of functional modules for cellular proliferations such as EGFR and PDGFRA, as well as the lost of the important IL6 module, supporting their roles as potential drug targets. Interestingly, our method can also detect early modular changes, with the ALDH3A1 and its associated interactions being strongly implicated as a potential early marker, whose activations appear to alter LCN2 module as well as its interactions with the important TP53-MDM2 circuitry. Our strategy using the graphical model to reconstruct gene interaction work with biologically-inspired constraints exemplifies the importance and beauty of biology in developing any bio-computational approach.
Landscape Phage: Evolution from Phage Display to Nanobiotechnology.
Petrenko, Valery A
2018-06-07
The development of phage engineering technology has led to the construction of a novel type of phage display library-a collection of nanofiber materials with diverse molecular landscapes accommodated on the surface of phage particles. These new nanomaterials, called the "landscape phage", serve as a huge resource of diagnostic/detection probes and versatile construction materials for the preparation of phage-functionalized biosensors and phage-targeted nanomedicines. Landscape-phage-derived probes interact with biological threat agents and generate detectable signals as a part of robust and inexpensive molecular recognition interfaces introduced in mobile detection devices. The use of landscape-phage-based interfaces may greatly improve the sensitivity, selectivity, robustness, and longevity of these devices. In another area of bioengineering, landscape-phage technology has facilitated the development and testing of targeted nanomedicines. The development of high-throughput phage selection methods resulted in the discovery of a variety of cancer cell-associated phages and phage proteins demonstrating natural proficiency to self-assemble into various drug- and gene-targeting nanovehicles. The application of this new "phage-programmed-nanomedicines" concept led to the development of a number of cancer cell-targeting nanomedicine platforms, which demonstrated anticancer efficacy in both in vitro and in vivo experiments. This review was prepared to attract the attention of chemical scientists and bioengineers seeking to develop functionalized nanomaterials and use them in different areas of bioscience, medicine, and engineering.
Cheng, Jun; He, Jun; Liu, Huaping; Cai, Hao; Hong, Guini; Zhang, Jiahui; Li, Na; Ao, Lu; Guo, Zheng
2017-01-01
Formalin-fixed paraffin-embedded (FFPE) samples represent a valuable resource for clinical researches. However, FFPE samples are usually considered an unreliable source for gene expression analysis due to the partial RNA degradation. In this study, through comparing gene expression profiles between FFPE samples and paired fresh-frozen (FF) samples for three cancer types, we firstly showed that expression measurements of thousands of genes had at least two-fold change in FFPE samples compared with paired FF samples. Therefore, for a transcriptional signature based on risk scores summarized from the expression levels of the signature genes, the risk score thresholds trained from FFPE (or FF) samples could not be applied to FF (or FFPE) samples. On the other hand, we found that more than 90% of the relative expression orderings (REOs) of gene pairs in the FF samples were maintained in their paired FFPE samples and largely unaffected by the storage time. The result suggested that the REOs of gene pairs were highly robust against partial RNA degradation in FFPE samples. Finally, as a case study, we developed a REOs-based signature to distinguish liver cirrhosis from hepatocellular carcinoma (HCC) using FFPE samples. The signature was validated in four datasets of FFPE samples and eight datasets of FF samples. In conclusion, the valuable FFPE samples can be fully exploited to identify REOs-based diagnostic and prognostic signatures which could be robustly applicable to both FF samples and FFPE samples with degraded RNA. PMID:28036264
Bouchet, Sophie; Pot, David; Deu, Monique; Rami, Jean-François; Billot, Claire; Perrier, Xavier; Rivallan, Ronan; Gardes, Laëtitia; Xia, Ling; Wenzl, Peter; Kilian, Andrzej; Glaszmann, Jean-Christophe
2012-01-01
Population structure, extent of linkage disequilibrium (LD) as well as signatures of selection were investigated in sorghum using a core sample representative of worldwide diversity. A total of 177 accessions were genotyped with 1122 informative physically anchored DArT markers. The properties of DArTs to describe sorghum genetic structure were compared to those of SSRs and of previously published RFLP markers. Model-based (STRUCTURE software) and Neighbor-Joining diversity analyses led to the identification of 6 groups and confirmed previous evolutionary hypotheses. Results were globally consistent between the different marker systems. However, DArTs appeared more robust in terms of data resolution and bayesian group assignment. Whole genome linkage disequilibrium as measured by mean r2 decreased from 0.18 (between 0 to 10 kb) to 0.03 (between 100 kb to 1 Mb), stabilizing at 0.03 after 1 Mb. Effects on LD estimations of sample size and genetic structure were tested using i. random sampling, ii. the Maximum Length SubTree algorithm (MLST), and iii. structure groups. Optimizing population composition by the MLST reduced the biases in small samples and seemed to be an efficient way of selecting samples to make the best use of LD as a genome mapping approach in structured populations. These results also suggested that more than 100,000 markers may be required to perform genome-wide association studies in collections covering worldwide sorghum diversity. Analysis of DArT markers differentiation between the identified genetic groups pointed out outlier loci potentially linked to genes controlling traits of interest, including disease resistance genes for which evidence of selection had already been reported. In addition, evidence of selection near a homologous locus of FAR1 concurred with sorghum phenotypic diversity for sensitivity to photoperiod. PMID:22428056
Bouchet, Sophie; Pot, David; Deu, Monique; Rami, Jean-François; Billot, Claire; Perrier, Xavier; Rivallan, Ronan; Gardes, Laëtitia; Xia, Ling; Wenzl, Peter; Kilian, Andrzej; Glaszmann, Jean-Christophe
2012-01-01
Population structure, extent of linkage disequilibrium (LD) as well as signatures of selection were investigated in sorghum using a core sample representative of worldwide diversity. A total of 177 accessions were genotyped with 1122 informative physically anchored DArT markers. The properties of DArTs to describe sorghum genetic structure were compared to those of SSRs and of previously published RFLP markers. Model-based (STRUCTURE software) and Neighbor-Joining diversity analyses led to the identification of 6 groups and confirmed previous evolutionary hypotheses. Results were globally consistent between the different marker systems. However, DArTs appeared more robust in terms of data resolution and bayesian group assignment. Whole genome linkage disequilibrium as measured by mean r(2) decreased from 0.18 (between 0 to 10 kb) to 0.03 (between 100 kb to 1 Mb), stabilizing at 0.03 after 1 Mb. Effects on LD estimations of sample size and genetic structure were tested using i. random sampling, ii. the Maximum Length SubTree algorithm (MLST), and iii. structure groups. Optimizing population composition by the MLST reduced the biases in small samples and seemed to be an efficient way of selecting samples to make the best use of LD as a genome mapping approach in structured populations. These results also suggested that more than 100,000 markers may be required to perform genome-wide association studies in collections covering worldwide sorghum diversity. Analysis of DArT markers differentiation between the identified genetic groups pointed out outlier loci potentially linked to genes controlling traits of interest, including disease resistance genes for which evidence of selection had already been reported. In addition, evidence of selection near a homologous locus of FAR1 concurred with sorghum phenotypic diversity for sensitivity to photoperiod.
Marden, James H
2013-12-01
Metabolic enzyme loci were some of the first genes accessible for molecular evolution and ecology research. New technologies now make the whole genome, transcriptome or proteome readily accessible, allowing unbiased scans for loci exhibiting significant differences in allele frequency or expression level and associated with phenotypes and/or responses to natural selection. With surprising frequency and in many cases in proportions greater than chance relative to other genes, glycolysis and TCA cycle enzyme loci appear among the genes with significant associations in these studies. Hence, there is an ongoing need to understand the basis for fitness effects of metabolic enzyme polymorphisms. Allele-specific effects on the binding affinity and catalytic rate of individual enzymes are well known, but often of uncertain significance because metabolic control theory and in vivo studies indicate that many individual metabolic enzymes do not affect pathway flux rate. I review research, so far little used in evolutionary biology, showing that metabolic enzyme substrates affect signalling pathways that regulate cell and organismal biology, and that these enzymes have moonlighting functions. To date there is little knowledge of how alleles in natural populations affect these phenotypes. I discuss an example in which alleles of a TCA enzyme locus associate with differences in a signalling pathway and development, organismal performance, and ecological dynamics. Ultimately, understanding how metabolic enzyme polymorphisms map to phenotypes and fitness remains a compelling and ongoing need for gaining robust knowledge of ecological and evolutionary processes. © 2013 John Wiley & Sons Ltd.
[Injudicious and excessive use of antibiotics: public health and salmon aquaculture in Chile].
Millanao B, Ana; Barrientos H, Marcela; Gómez C, Carolina; Tomova, Alexandra; Buschmann, Alejandro; Dölz, Humberto; Cabello, Felipe C
2011-01-01
Salmon aquaculture was one of the major growing and exporting industries in Chile. Its development was accompanied by an increasing and excessive use of large amounts of antimicrobials, such as quinolones, tetracyclines and florfenicol. The examination of the sanitary conditions in the industry as part of a more general investigation into the uncontrolled and extensive dissemination of the ISA virus epizootic in 2008, found numerous and wide-ranging shortcomings and limitations in management of preventive fish health. There was a growing industrial use of large amounts of antimicrobials as an attempt at prophylaxis of bacterial infections resulting from widespread unsanitary and unhealthy fish rearing conditions. As might be expected, these attempts were unsuccessful and this heavy antimicrobial use failed to prevent viral and parasitic epizootics. Comparative analysis of the amounts of antimicrobials, especially quinolones, consumed in salmon aquaculture and in human medicine in Chile robustly suggests that the most important selective pressure for antibiotic resistant bacteria in the country will be excessive antibiotic use in this industry. This excessive use will facilitate selection of resistant bacteria and resistance genes in water environments. The commonality of antibiotic resistance genes and the mobilome between environmental aquatic bacteria, fish pathogens and pathogens of terrestrial animals and humans suggests that horizontal gene transfer occurs between the resistome of these apparently independent and isolated bacterial populations. Thus, excessive antibiotic use in the marine environment in aquaculture is not innocuous and can potentially negatively affect therapy of bacterial infections of humans and terrestrial animals.
Dendrites Enable a Robust Mechanism for Neuronal Stimulus Selectivity.
Cazé, Romain D; Jarvis, Sarah; Foust, Amanda J; Schultz, Simon R
2017-09-01
Hearing, vision, touch: underlying all of these senses is stimulus selectivity, a robust information processing operation in which cortical neurons respond more to some stimuli than to others. Previous models assume that these neurons receive the highest weighted input from an ensemble encoding the preferred stimulus, but dendrites enable other possibilities. Nonlinear dendritic processing can produce stimulus selectivity based on the spatial distribution of synapses, even if the total preferred stimulus weight does not exceed that of nonpreferred stimuli. Using a multi-subunit nonlinear model, we demonstrate that stimulus selectivity can arise from the spatial distribution of synapses. We propose this as a general mechanism for information processing by neurons possessing dendritic trees. Moreover, we show that this implementation of stimulus selectivity increases the neuron's robustness to synaptic and dendritic failure. Importantly, our model can maintain stimulus selectivity for a larger range of loss of synapses or dendrites than an equivalent linear model. We then use a layer 2/3 biophysical neuron model to show that our implementation is consistent with two recent experimental observations: (1) one can observe a mixture of selectivities in dendrites that can differ from the somatic selectivity, and (2) hyperpolarization can broaden somatic tuning without affecting dendritic tuning. Our model predicts that an initially nonselective neuron can become selective when depolarized. In addition to motivating new experiments, the model's increased robustness to synapses and dendrites loss provides a starting point for fault-resistant neuromorphic chip development.
The Stochastic Evolutionary Game for a Population of Biological Networks Under Natural Selection
Chen, Bor-Sen; Ho, Shih-Ju
2014-01-01
In this study, a population of evolutionary biological networks is described by a stochastic dynamic system with intrinsic random parameter fluctuations due to genetic variations and external disturbances caused by environmental changes in the evolutionary process. Since information on environmental changes is unavailable and their occurrence is unpredictable, they can be considered as a game player with the potential to destroy phenotypic stability. The biological network needs to develop an evolutionary strategy to improve phenotypic stability as much as possible, so it can be considered as another game player in the evolutionary process, ie, a stochastic Nash game of minimizing the maximum network evolution level caused by the worst environmental disturbances. Based on the nonlinear stochastic evolutionary game strategy, we find that some genetic variations can be used in natural selection to construct negative feedback loops, efficiently improving network robustness. This provides larger genetic robustness as a buffer against neutral genetic variations, as well as larger environmental robustness to resist environmental disturbances and maintain a network phenotypic traits in the evolutionary process. In this situation, the robust phenotypic traits of stochastic biological networks can be more frequently selected by natural selection in evolution. However, if the harbored neutral genetic variations are accumulated to a sufficiently large degree, and environmental disturbances are strong enough that the network robustness can no longer confer enough genetic robustness and environmental robustness, then the phenotype robustness might break down. In this case, a network phenotypic trait may be pushed from one equilibrium point to another, changing the phenotypic trait and starting a new phase of network evolution through the hidden neutral genetic variations harbored in network robustness by adaptive evolution. Further, the proposed evolutionary game is extended to an n-tuple evolutionary game of stochastic biological networks with m players (competitive populations) and k environmental dynamics. PMID:24558296
Coram, Tristan E; Pang, Edwin C K
2006-11-01
Using microarray technology and a set of chickpea (Cicer arietinum L.) unigenes, grasspea (Lathyrus sativus L.) expressed sequence tags (ESTs) and lentil (Lens culinaris Med.) resistance gene analogues, the ascochyta blight (Ascochyta rabiei (Pass.) L.) resistance response was studied in four chickpea genotypes, including resistant, moderately resistant, susceptible and wild relative (Cicer echinospermum L.) genotypes. The experimental system minimized environmental effects and was conducted in reference design, in which samples from mock-inoculated controls acted as reference against post-inoculation samples. Robust data quality was achieved through the use of three biological replicates (including a dye swap), the inclusion of negative controls and strict selection criteria for differentially expressed genes, including a fold change cut-off determined by self-self hybridizations, Student's t-test and multiple testing correction (P < 0.05). Microarray observations were also validated by quantitative reverse transcriptase-polymerase chain reaction (RT-PCR). The time course expression patterns of 756 microarray features resulted in the differential expression of 97 genes in at least one genotype at one time point. k-means clustering grouped the genes into clusters of similar observations for each genotype, and comparisons between A. rabiei-resistant and A. rabiei-susceptible genotypes revealed potential gene 'signatures' predictive of effective A. rabiei resistance. These genes included several pathogenesis-related proteins, SNAKIN2 antimicrobial peptide, proline-rich protein, disease resistance response protein DRRG49-C, environmental stress-inducible protein, leucine-zipper protein, polymorphic antigen membrane protein, Ca-binding protein and several unknown proteins. The potential involvement of these genes and their pathways of induction are discussed. This study represents the first large-scale gene expression profiling in chickpea, and future work will focus on the functional validation of the genes of interest.
Mendoza-Mendoza, Artemio; Steyaert, Johanna; Nieto-Jacobo, Maria Fernanda; Holyoake, Andrew; Braithwaite, Mark; Stewart, Alison
2015-11-01
Several members of the genus Trichoderma are biocontrol agents of soil-borne fungal plant pathogens. The effectiveness of biocontrol agents depends heavily on how they perform in the complex field environment. Therefore, the ability to monitor and track Trichoderma within the environment is essential to understanding biocontrol efficacy. The objectives of this work were to: (a) identify key genes involved in Trichoderma sp. 'atroviride type B' morphogenesis; (b) develop a robust RNA isolation method from soil; and (c) develop molecular marker assays for characterizing morphogenesis whilst in the soil environment. Four cDNA libraries corresponding to conidia, germination, vegetative growth and conidiogenesis were created, and the genes identified by sequencing. Stage specificity of the different genes was confirmed by either Northern blot or quantitative reverse-transcriptase PCR (qRT-PCR) analysis using RNA from the four stages. con10, a conidial-specific gene, was observed in conidia, as well as one gene also involved in subsequent stages of germination (L-lactate/malate dehydrogenase encoding gene). The germination stage revealed high expression rates of genes involved in amino acid and protein biosynthesis, while in the vegetative-growth stage, genes involved in differentiation, including the mitogen-activated protein kinase kinase similar to Kpp7 from Ustilago maydis and the orthologue to stuA from Aspergillus nidulans, were preferentially expressed. Genes involved in cell-wall synthesis were expressed during conidiogenesis. We standardized total RNA isolation from Trichoderma sp. 'atroviride type B' growing in soil and then examined the expression profiles of selected genes using qRT-PCR. The results suggested that the relative expression patterns were cyclic and not accumulative.
Shaikh, Saame Raza; Shaver, Patti R; Shewchuk, Brian M
2018-05-08
Dietary fat composition can modulate gene expression in peripheral tissues in obesity. Observations of the dysregulation of growth hormone (GH) in obesity indicate that these effects extend to the hypothalamic-pituitary (H-P) axis. The authors thus determine whether specific high fat (HF) diets influence the levels of Gh and other key gene transcripts in the H-P axis. C57BL/6 mice are fed a lean control diet or a HF diet in the absence or presence of OA, EPA, or DHA ethyl esters. Comparative studies are conducted with menhaden fish oil. The HF diet lowered pituitary Gh mRNA and protein levels, and cell culture studies reveal that elevated insulin and glucose can reduce Gh transcripts. Supplementation of the HF diet with OA, EPA, DHA, or menhaden fish oil do not improve pituitary Gh levels. The HF diet also impaired the levels of additional genes in the pituitary and hypothalamus, which are selectively rescued with EPA or DHA ethyl esters. The effects of EPA and DHA are more robust relative to fish oil. A HF diet can affect H-P axis transcription, which can be mitigated in some genes by EPA and DHA, but not fish oil in most cases. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Construction and comparison of gene co-expression networks shows complex plant immune responses
López, Camilo; López-Kleine, Liliana
2014-01-01
Gene co-expression networks (GCNs) are graphic representations that depict the coordinated transcription of genes in response to certain stimuli. GCNs provide functional annotations of genes whose function is unknown and are further used in studies of translational functional genomics among species. In this work, a methodology for the reconstruction and comparison of GCNs is presented. This approach was applied using gene expression data that were obtained from immunity experiments in Arabidopsis thaliana, rice, soybean, tomato and cassava. After the evaluation of diverse similarity metrics for the GCN reconstruction, we recommended the mutual information coefficient measurement and a clustering coefficient-based method for similarity threshold selection. To compare GCNs, we proposed a multivariate approach based on the Principal Component Analysis (PCA). Branches of plant immunity that were exemplified by each experiment were analyzed in conjunction with the PCA results, suggesting both the robustness and the dynamic nature of the cellular responses. The dynamic of molecular plant responses produced networks with different characteristics that are differentiable using our methodology. The comparison of GCNs from plant pathosystems, showed that in response to similar pathogens plants could activate conserved signaling pathways. The results confirmed that the closeness of GCNs projected on the principal component space is an indicative of similarity among GCNs. This also can be used to understand global patterns of events triggered during plant immune responses. PMID:25320678
del Val, Coral; Rivas, Elena; Torres-Quesada, Omar; Toro, Nicolás; Jiménez-Zurdo, José I
2007-01-01
Bacterial small non-coding RNAs (sRNAs) are being recognized as novel widespread regulators of gene expression in response to environmental signals. Here, we present the first search for sRNA-encoding genes in the nitrogen-fixing endosymbiont Sinorhizobium meliloti, performed by a genome-wide computational analysis of its intergenic regions. Comparative sequence data from eight related α-proteobacteria were obtained, and the interspecies pairwise alignments were scored with the programs eQRNA and RNAz as complementary predictive tools to identify conserved and stable secondary structures corresponding to putative non-coding RNAs. Northern experiments confirmed that eight of the predicted loci, selected among the original 32 candidates as most probable sRNA genes, expressed small transcripts. This result supports the combined use of eQRNA and RNAz as a robust strategy to identify novel sRNAs in bacteria. Furthermore, seven of the transcripts accumulated differentially in free-living and symbiotic conditions. Experimental mapping of the 5′-ends of the detected transcripts revealed that their encoding genes are organized in autonomous transcription units with recognizable promoter and, in most cases, termination signatures. These findings suggest novel regulatory functions for sRNAs related to the interactions of α-proteobacteria with their eukaryotic hosts. PMID:17971083
Unraveling the genetic basis of xylose consumption in engineered Saccharomyces cerevisiae strains.
Dos Santos, Leandro Vieira; Carazzolle, Marcelo Falsarella; Nagamatsu, Sheila Tiemi; Sampaio, Nádia Maria Vieira; Almeida, Ludimila Dias; Pirolla, Renan Augusto Siqueira; Borelli, Guilherme; Corrêa, Thamy Lívia Ribeiro; Argueso, Juan Lucas; Pereira, Gonçalo Amarante Guimarães
2016-12-21
The development of biocatalysts capable of fermenting xylose, a five-carbon sugar abundant in lignocellulosic biomass, is a key step to achieve a viable production of second-generation ethanol. In this work, a robust industrial strain of Saccharomyces cerevisiae was modified by the addition of essential genes for pentose metabolism. Subsequently, taken through cycles of adaptive evolution with selection for optimal xylose utilization, strains could efficiently convert xylose to ethanol with a yield of about 0.46 g ethanol/g xylose. Though evolved independently, two strains carried shared mutations: amplification of the xylose isomerase gene and inactivation of ISU1, a gene encoding a scaffold protein involved in the assembly of iron-sulfur clusters. In addition, one of evolved strains carried a mutation in SSK2, a member of MAPKKK signaling pathway. In validation experiments, mutating ISU1 or SSK2 improved the ability to metabolize xylose of yeast cells without adaptive evolution, suggesting that these genes are key players in a regulatory network for xylose fermentation. Furthermore, addition of iron ion to the growth media improved xylose fermentation even by non-evolved cells. Our results provide promising new targets for metabolic engineering of C5-yeasts and point to iron as a potential new additive for improvement of second-generation ethanol production.
Unraveling the genetic basis of xylose consumption in engineered Saccharomyces cerevisiae strains
dos Santos, Leandro Vieira; Carazzolle, Marcelo Falsarella; Nagamatsu, Sheila Tiemi; Sampaio, Nádia Maria Vieira; Almeida, Ludimila Dias; Pirolla, Renan Augusto Siqueira; Borelli, Guilherme; Corrêa, Thamy Lívia Ribeiro; Argueso, Juan Lucas; Pereira, Gonçalo Amarante Guimarães
2016-01-01
The development of biocatalysts capable of fermenting xylose, a five-carbon sugar abundant in lignocellulosic biomass, is a key step to achieve a viable production of second-generation ethanol. In this work, a robust industrial strain of Saccharomyces cerevisiae was modified by the addition of essential genes for pentose metabolism. Subsequently, taken through cycles of adaptive evolution with selection for optimal xylose utilization, strains could efficiently convert xylose to ethanol with a yield of about 0.46 g ethanol/g xylose. Though evolved independently, two strains carried shared mutations: amplification of the xylose isomerase gene and inactivation of ISU1, a gene encoding a scaffold protein involved in the assembly of iron-sulfur clusters. In addition, one of evolved strains carried a mutation in SSK2, a member of MAPKKK signaling pathway. In validation experiments, mutating ISU1 or SSK2 improved the ability to metabolize xylose of yeast cells without adaptive evolution, suggesting that these genes are key players in a regulatory network for xylose fermentation. Furthermore, addition of iron ion to the growth media improved xylose fermentation even by non-evolved cells. Our results provide promising new targets for metabolic engineering of C5-yeasts and point to iron as a potential new additive for improvement of second-generation ethanol production. PMID:28000736
Spanagel, Rainer
2013-01-01
Convergent functional genomics (CFG) is a translational methodology that integrates in a Bayesian fashion multiple lines of evidence from studies in human and animal models to get a better understanding of the genetics of a disease or pathological behavior. Here the integration of data sets that derive from forward genetics in animals and genetic association studies including genome wide association studies (GWAS) in humans is described for addictive behavior. The aim of forward genetics in animals and association studies in humans is to identify mutations (e.g. SNPs) that produce a certain phenotype; i.e. "from phenotype to genotype". Most powerful in terms of forward genetics is combined quantitative trait loci (QTL) analysis and gene expression profiling in recombinant inbreed rodent lines or genetically selected animals for a specific phenotype, e.g. high vs. low drug consumption. By Bayesian scoring genomic information from forward genetics in animals is then combined with human GWAS data on a similar addiction-relevant phenotype. This integrative approach generates a robust candidate gene list that has to be functionally validated by means of reverse genetics in animals; i.e. "from genotype to phenotype". It is proposed that studying addiction relevant phenotypes and endophenotypes by this CFG approach will allow a better determination of the genetics of addictive behavior.
Pagan, Rafael F; Massey, Steven E
2014-02-01
Proteins are regarded as being robust to the deleterious effects of mutations. Here, the neutral emergence of mutational robustness in a population of single domain proteins is explored using computer simulations. A pairwise contact model was used to calculate the ΔG of folding (ΔG folding) using the three dimensional protein structure of leech eglin C. A random amino acid sequence with low mutational robustness, defined as the average ΔΔG resulting from a point mutation (ΔΔG average), was threaded onto the structure. A population of 1,000 threaded sequences was evolved under selection for stability, using an upper and lower energy threshold. Under these conditions, mutational robustness increased over time in the most common sequence in the population. In contrast, when the wild type sequence was used it did not show an increase in robustness. This implies that the emergence of mutational robustness is sequence specific and that wild type sequences may be close to maximal robustness. In addition, an inverse relationship between ∆∆G average and protein stability is shown, resulting partly from a larger average effect of point mutations in more stable proteins. The emergence of mutational robustness was also observed in the Escherichia coli colE1 Rop and human CD59 proteins, implying that the property may be common in single domain proteins under certain simulation conditions. The results indicate that at least a portion of mutational robustness in small globular proteins might have arisen by a process of neutral emergence, and could be an example of a beneficial trait that has not been directly selected for, termed a "pseudaptation."
Larriba, Yolanda; Rueda, Cristina; Fernández, Miguel A; Peddada, Shyamal D
2018-01-01
Motivation: Gene-expression data obtained from high throughput technologies are subject to various sources of noise and accordingly the raw data are pre-processed before formally analyzed. Normalization of the data is a key pre-processing step, since it removes systematic variations across arrays. There are numerous normalization methods available in the literature. Based on our experience, in the context of oscillatory systems, such as cell-cycle, circadian clock, etc., the choice of the normalization method may substantially impact the determination of a gene to be rhythmic. Thus rhythmicity of a gene can purely be an artifact of how the data were normalized. Since the determination of rhythmic genes is an important component of modern toxicological and pharmacological studies, it is important to determine truly rhythmic genes that are robust to the choice of a normalization method. Results: In this paper we introduce a rhythmicity measure and a bootstrap methodology to detect rhythmic genes in an oscillatory system. Although the proposed methodology can be used for any high-throughput gene expression data, in this paper we illustrate the proposed methodology using several publicly available circadian clock microarray gene-expression datasets. We demonstrate that the choice of normalization method has very little effect on the proposed methodology. Specifically, for any pair of normalization methods considered in this paper, the resulting values of the rhythmicity measure are highly correlated. Thus it suggests that the proposed measure is robust to the choice of a normalization method. Consequently, the rhythmicity of a gene is potentially not a mere artifact of the normalization method used. Lastly, as demonstrated in the paper, the proposed bootstrap methodology can also be used for simulating data for genes participating in an oscillatory system using a reference dataset. Availability: A user friendly code implemented in R language can be downloaded from http://www.eio.uva.es/~miguel/robustdetectionprocedure.html.
Larriba, Yolanda; Rueda, Cristina; Fernández, Miguel A.; Peddada, Shyamal D.
2018-01-01
Motivation: Gene-expression data obtained from high throughput technologies are subject to various sources of noise and accordingly the raw data are pre-processed before formally analyzed. Normalization of the data is a key pre-processing step, since it removes systematic variations across arrays. There are numerous normalization methods available in the literature. Based on our experience, in the context of oscillatory systems, such as cell-cycle, circadian clock, etc., the choice of the normalization method may substantially impact the determination of a gene to be rhythmic. Thus rhythmicity of a gene can purely be an artifact of how the data were normalized. Since the determination of rhythmic genes is an important component of modern toxicological and pharmacological studies, it is important to determine truly rhythmic genes that are robust to the choice of a normalization method. Results: In this paper we introduce a rhythmicity measure and a bootstrap methodology to detect rhythmic genes in an oscillatory system. Although the proposed methodology can be used for any high-throughput gene expression data, in this paper we illustrate the proposed methodology using several publicly available circadian clock microarray gene-expression datasets. We demonstrate that the choice of normalization method has very little effect on the proposed methodology. Specifically, for any pair of normalization methods considered in this paper, the resulting values of the rhythmicity measure are highly correlated. Thus it suggests that the proposed measure is robust to the choice of a normalization method. Consequently, the rhythmicity of a gene is potentially not a mere artifact of the normalization method used. Lastly, as demonstrated in the paper, the proposed bootstrap methodology can also be used for simulating data for genes participating in an oscillatory system using a reference dataset. Availability: A user friendly code implemented in R language can be downloaded from http://www.eio.uva.es/~miguel/robustdetectionprocedure.html PMID:29456555
Future treatments for Parkinson's disease: surfing the PD pipeline.
Hauser, Robert A
2011-01-01
Our current wish list for the treatment of Parkinson's disease (PD) includes therapies that will provide robust and sustained antiparkinsonian benefit through the day, ameliorate or prevent dyskinesia, and slow or prevent the progression of the disease. In this article, I review selected new therapies in clinical development for motor features or treatment complications of PD, and some that may slow disease progression. These include adenosine 2a (A2a) antagonists (istradefylline, preladenant, and SYN115), levodopa/carbidopa intestinal gel (LCIG), IPX066--an extended-release formulation of carbidopa/levodopa, XP21279--a sustained-release levodopa prodrug, ND0611--a carbidopa subcutaneous patch, safinamide--a mixed mechanism of action medication that may provide both MAO-B and glutamate inhibition, PMY50028--an oral neurotrophic factor inducer, antidyskinesia medications (AFQ056 and fipamezole), and gene therapies (AAV2-neurturin and glutamic acid decarboxylase gene transfer). Some of these therapies will never be proven efficacious and will not come to market while others may play a key role in the future treatment of PD.
A new variance stabilizing transformation for gene expression data analysis.
Kelmansky, Diana M; Martínez, Elena J; Leiva, Víctor
2013-12-01
In this paper, we introduce a new family of power transformations, which has the generalized logarithm as one of its members, in the same manner as the usual logarithm belongs to the family of Box-Cox power transformations. Although the new family has been developed for analyzing gene expression data, it allows a wider scope of mean-variance related data to be reached. We study the analytical properties of the new family of transformations, as well as the mean-variance relationships that are stabilized by using its members. We propose a methodology based on this new family, which includes a simple strategy for selecting the family member adequate for a data set. We evaluate the finite sample behavior of different classical and robust estimators based on this strategy by Monte Carlo simulations. We analyze real genomic data by using the proposed transformation to empirically show how the new methodology allows the variance of these data to be stabilized.
Sparse whole genome sequencing identifies two loci for major depressive disorder
2015-01-01
Major depressive disorder (MDD), one of the most frequently encountered forms of mental illness and a leading cause of disability worldwide1, poses a major challenge to genetic analysis. To date no robustly replicated genetic loci have been identified 2, despite analysis of more than 9,000 cases3. Using low coverage genome sequence of 5,303 Chinese women with recurrent MDD selected to reduce phenotypic heterogeneity, and 5,337 controls screened to exclude MDD, we identified and replicated two genome-wide significant loci contributing to risk of MDD on chromosome 10: one near the SIRT1 gene (P-value = 2.53×10−10) the other in an intron of the LHPP gene (P = 6.45×10−12). Analysis of 4,509 cases with a severe subtype of MDD, melancholia, yielded an increased genetic signal at the SIRT1 locus. We attribute our success to the recruitment of relatively homogeneous cases with severe illness. PMID:26176920
Multivariate Methods for Meta-Analysis of Genetic Association Studies.
Dimou, Niki L; Pantavou, Katerina G; Braliou, Georgia G; Bagos, Pantelis G
2018-01-01
Multivariate meta-analysis of genetic association studies and genome-wide association studies has received a remarkable attention as it improves the precision of the analysis. Here, we review, summarize and present in a unified framework methods for multivariate meta-analysis of genetic association studies and genome-wide association studies. Starting with the statistical methods used for robust analysis and genetic model selection, we present in brief univariate methods for meta-analysis and we then scrutinize multivariate methodologies. Multivariate models of meta-analysis for a single gene-disease association studies, including models for haplotype association studies, multiple linked polymorphisms and multiple outcomes are discussed. The popular Mendelian randomization approach and special cases of meta-analysis addressing issues such as the assumption of the mode of inheritance, deviation from Hardy-Weinberg Equilibrium and gene-environment interactions are also presented. All available methods are enriched with practical applications and methodologies that could be developed in the future are discussed. Links for all available software implementing multivariate meta-analysis methods are also provided.
Vanhoutte, Kurt; de Asmundis, Carlo; Francesconi, Anna; Figys1, Jurgen; Steurs, Griet; Boussy, Tim; Roos, Markus; Mueller, Andreas; Massimo, Lucio; Paparella, Gaetano; Van Caelenberg, Kristien; Chierchia, Gian Battista; Sarkozy, Andrea; Y Terradellas, Pedro Brugada; Zizi, Martin
2009-01-01
Atrial fibrillation (AF) is a frequent chronic dysrythmia with an incidence that increases with age (>40). Because of its medical and socio-economic impacts it is expected to become an increasing burden on most health care systems. AF is a multi-factorial disease for which the identification of subtypes is warranted. Novel approaches based on the broad concepts of systems biology may overcome the blurred notion of normal and pathological phenotype, which is inherent to high throughput molecular arrays analysis. Here we apply an internal contrast algorithm on AF patient data with an analytical focus on potential entry pathways into the disease. We used a RMA (Robust Multichip Average) normalized Affymetrix micro-array data set from 10 AF patients (geo_accession #GSE2240). Four series of probes were selected based on physiopathogenic links with AF entryways: apoptosis (remodeling), MAP kinase (cell remodeling), OXPHOS (ability to sustain hemodynamic workload) and glycolysis (ischemia). Annotated probe lists were polled with Bioconductor packages in R (version 2.7.1). Genetic profile contrasts were analysed with hierarchical clustering and principal component analysis. The analysis revealed distinct patient groups for all probe sets. A substantial part (54% till 67%) of the variance is explained in the first 2 principal components. Genes in PC1/2 with high discriminatory value were selected and analyzed in detail. We aim for reliable molecular stratification of AF. We show that stratification is possible based on physiologically relevant gene sets. Genes with high contrast value are likely to give pathophysiological insight into permanent AF subtypes. PMID:19255648
Chaudhury, Arindam; Kongchan, Natee; Gengler, Jon P.; Mohanty, Vakul; Christiansen, Audrey E.; Fachini, Joseph M.; Martin, James F.; Neilson, Joel R.
2014-01-01
Regulation of messenger ribonucleic acid (mRNA) subcellular localization, stability and translation is a central aspect of gene expression. Much of this control is mediated via recognition of mRNA 3′ untranslated regions (UTRs) by microRNAs (miRNAs) and RNA-binding proteins. The gold standard approach to assess the regulation imparted by a transcript's 3′ UTR is to fuse the UTR to a reporter coding sequence and assess the relative expression of this reporter as compared to a control. Yet, transient transfection approaches or the use of highly active viral promoter elements may overwhelm a cell's post-transcriptional regulatory machinery in this context. To circumvent this issue, we have developed and validated a novel, scalable piggyBac-based vector for analysis of 3′ UTR-mediated regulation in vitro and in vivo. The vector delivers three independent transcription units to the target genome—a selection cassette, a turboGFP control reporter and an experimental reporter expressed under the control of a 3′ UTR of interest. The pBUTR (piggyBac-based 3′ UnTranslated Region reporter) vector performs robustly as a siRNA/miRNA sensor, in established in vitro models of post-transcriptional regulation, and in both arrayed and pooled screening approaches. The vector is robustly expressed as a transgene during murine embryogenesis, highlighting its potential usefulness for revealing post-transcriptional regulation in an in vivo setting. PMID:24753411
A DNA fingerprinting procedure for ultra high-throughput genetic analysis of insects.
Schlipalius, D I; Waldron, J; Carroll, B J; Collins, P J; Ebert, P R
2001-12-01
Existing procedures for the generation of polymorphic DNA markers are not optimal for insect studies in which the organisms are often tiny and background molecular information is often non-existent. We have used a new high throughput DNA marker generation protocol called randomly amplified DNA fingerprints (RAF) to analyse the genetic variability in three separate strains of the stored grain pest, Rhyzopertha dominica. This protocol is quick, robust and reliable even though it requires minimal sample preparation, minute amounts of DNA and no prior molecular analysis of the organism. Arbitrarily selected oligonucleotide primers routinely produced approximately 50 scoreable polymorphic DNA markers, between individuals of three independent field isolates of R. dominica. Multivariate cluster analysis using forty-nine arbitrarily selected polymorphisms generated from a single primer reliably separated individuals into three clades corresponding to their geographical origin. The resulting clades were quite distinct, with an average genetic difference of 37.5 +/- 6.0% between clades and of 21.0 +/- 7.1% between individuals within clades. As a prelude to future gene mapping efforts, we have also assessed the performance of RAF under conditions commonly used in gene mapping. In this analysis, fingerprints from pooled DNA samples accurately and reproducibly reflected RAF profiles obtained from individual DNA samples that had been combined to create the bulked samples.
Jugessur, Astanand; Murray, Jeffrey C.; Moreno, Lina; Wilcox, Allen; Lie, Rolv T.
2011-01-01
This study uses instrumental variable (IV) models with genetic instruments to assess the effects of maternal smoking on the child’s risk of orofacial clefts (OFC), a common birth defect. The study uses genotypic variants in neurotransmitter and detoxification genes relateded to smoking as instruments for cigarette smoking before and during pregnancy. Conditional maximum likelihood and two-stage IV probit models are used to estimate the IV model. The data are from a population-level sample of affected and unaffected children in Norway. The selected genetic instruments generally fit the IV assumptions but may be considered “weak” in predicting cigarette smoking. We find that smoking before and during pregnancy increases OFC risk substantially under the IV model (by about 4–5 times at the sample average smoking rate). This effect is greater than that found with classical analytic models. This may be because the usual models are not able to consider self-selection into smoking based on unobserved confounders, or it may to some degree reflect limitations of the instruments. Inference based on weak-instrument robust confidence bounds is consistent with standard inference. Genetic instruments may provide a valuable approach to estimate the “causal” effects of risk behaviors with genetic-predisposing factors (such as smoking) on health and socioeconomic outcomes. PMID:22102793
Leempoel, Kevin; Parisod, Christian; Geiser, Céline; Joost, Stéphane
2018-02-01
Plant species are known to adapt locally to their environment, particularly in mountainous areas where conditions can vary drastically over short distances. The climate of such landscapes being largely influenced by topography, using fine-scale models to evaluate environmental heterogeneity may help detecting adaptation to micro-habitats. Here, we applied a multiscale landscape genomic approach to detect evidence of local adaptation in the alpine plant Biscutella laevigata . The two gene pools identified, experiencing limited gene flow along a 1-km ridge, were different in regard to several habitat features derived from a very high resolution (VHR) digital elevation model (DEM). A correlative approach detected signatures of selection along environmental gradients such as altitude, wind exposure, and solar radiation, indicating adaptive pressures likely driven by fine-scale topography. Using a large panel of DEM-derived variables as ecologically relevant proxies, our results highlighted the critical role of spatial resolution. These high-resolution multiscale variables indeed indicate that the robustness of associations between genetic loci and environmental features depends on spatial parameters that are poorly documented. We argue that the scale issue is critical in landscape genomics and that multiscale ecological variables are key to improve our understanding of local adaptation in highly heterogeneous landscapes.
Krebber, A; Bornhauser, S; Burmester, J; Honegger, A; Willuda, J; Bosshard, H R; Plückthun, A
1997-02-14
A prerequisite for the use of recombinant antibody technologies starting from hybridomas or immune repertoires is the reliable cloning of functional immunoglobulin genes. For this purpose, a standard phage display system was optimized for robustness, vector stability, tight control of scFv-delta geneIII expression, primer usage for PCR amplification of variable region genes, scFv assembly strategy and subsequent directional cloning using a single rare cutting restriction enzyme. This integrated cloning, screening and selection system allowed us to rapidly obtain antigen binding scFvs derived from spleen-cell repertoires of mice immunized with ampicillin as well as from all hybridoma cell lines tested to date. As representative examples, cloning of monoclonal antibodies against a his tag, leucine zippers, the tumor marker EGP-2 and the insecticide DDT is presented. Several hybridomas whose genes could not be cloned in previous experimental setups, but were successfully obtained with the present system, expressed high amounts of aberrant heavy and light chain mRNAs, which were amplified by PCR and greatly exceeded the amount of binding antibody sequences. These contaminating variable region genes were successfully eliminated by employing the optimized phage display system, thus avoiding time consuming sequencing of non-binding scFv genes. To maximize soluble expression of functional scFvs subsequent to cloning, a compatible vector series to simplify modification, detection, multimerization and rapid purification of recombinant antibody fragments was constructed.
Robust Feature Selection Technique using Rank Aggregation.
Sarkar, Chandrima; Cooley, Sarah; Srivastava, Jaideep
2014-01-01
Although feature selection is a well-developed research area, there is an ongoing need to develop methods to make classifiers more efficient. One important challenge is the lack of a universal feature selection technique which produces similar outcomes with all types of classifiers. This is because all feature selection techniques have individual statistical biases while classifiers exploit different statistical properties of data for evaluation. In numerous situations this can put researchers into dilemma as to which feature selection method and a classifiers to choose from a vast range of choices. In this paper, we propose a technique that aggregates the consensus properties of various feature selection methods to develop a more optimal solution. The ensemble nature of our technique makes it more robust across various classifiers. In other words, it is stable towards achieving similar and ideally higher classification accuracy across a wide variety of classifiers. We quantify this concept of robustness with a measure known as the Robustness Index (RI). We perform an extensive empirical evaluation of our technique on eight data sets with different dimensions including Arrythmia, Lung Cancer, Madelon, mfeat-fourier, internet-ads, Leukemia-3c and Embryonal Tumor and a real world data set namely Acute Myeloid Leukemia (AML). We demonstrate not only that our algorithm is more robust, but also that compared to other techniques our algorithm improves the classification accuracy by approximately 3-4% (in data set with less than 500 features) and by more than 5% (in data set with more than 500 features), across a wide range of classifiers.
OGT (O-GlcNAc Transferase) Selectively Modifies Multiple Residues Unique to Lamin A.
Simon, Dan N; Wriston, Amanda; Fan, Qiong; Shabanowitz, Jeffrey; Florwick, Alyssa; Dharmaraj, Tejas; Peterson, Sherket B; Gruenbaum, Yosef; Carlson, Cathrine R; Grønning-Wang, Line M; Hunt, Donald F; Wilson, Katherine L
2018-05-17
The LMNA gene encodes lamins A and C with key roles in nuclear structure, signaling, gene regulation, and genome integrity. Mutations in LMNA cause over 12 diseases ('laminopathies'). Lamins A and C are identical for their first 566 residues. However, they form separate filaments in vivo, with apparently distinct roles. We report that lamin A is β- O -linked N -acetylglucosamine- (O -GlcNAc)-modified in human hepatoma (Huh7) cells and in mouse liver. In vitro assays with purified O -GlcNAc transferase (OGT) enzyme showed robust O -GlcNAcylation of recombinant mature lamin A tails (residues 385⁻646), with no detectable modification of lamin B1, lamin C, or 'progerin' (Δ50) tails. Using mass spectrometry, we identified 11 O -GlcNAc sites in a 'sweet spot' unique to lamin A, with up to seven sugars per peptide. Most sites were unpredicted by current algorithms. Double-mutant (S612A/T643A) lamin A tails were still robustly O -GlcNAc-modified at seven sites. By contrast, O -GlcNAcylation was undetectable on tails bearing deletion Δ50, which causes Hutchinson⁻Gilford progeria syndrome, and greatly reduced by deletion Δ35. We conclude that residues deleted in progeria are required for substrate recognition and/or modification by OGT in vitro. Interestingly, deletion Δ35, which does not remove the majority of identified O -GlcNAc sites, does remove potential OGT-association motifs (lamin A residues 622⁻625 and 639⁻645) homologous to that in mouse Tet1. These biochemical results are significant because they identify a novel molecular pathway that may profoundly influence lamin A function. The hypothesis that lamin A is selectively regulated by OGT warrants future testing in vivo, along with two predictions: genetic variants may contribute to disease by perturbing OGT-dependent regulation, and nutrient or other stresses might cause OGT to misregulate wildtype lamin A.
Optimal Robust Motion Controller Design Using Multiobjective Genetic Algorithm
Svečko, Rajko
2014-01-01
This paper describes the use of a multiobjective genetic algorithm for robust motion controller design. Motion controller structure is based on a disturbance observer in an RIC framework. The RIC approach is presented in the form with internal and external feedback loops, in which an internal disturbance rejection controller and an external performance controller must be synthesised. This paper involves novel objectives for robustness and performance assessments for such an approach. Objective functions for the robustness property of RIC are based on simple even polynomials with nonnegativity conditions. Regional pole placement method is presented with the aims of controllers' structures simplification and their additional arbitrary selection. Regional pole placement involves arbitrary selection of central polynomials for both loops, with additional admissible region of the optimized pole location. Polynomial deviation between selected and optimized polynomials is measured with derived performance objective functions. A multiobjective function is composed of different unrelated criteria such as robust stability, controllers' stability, and time-performance indexes of closed loops. The design of controllers and multiobjective optimization procedure involve a set of the objectives, which are optimized simultaneously with a genetic algorithm—differential evolution. PMID:24987749
Quantification Assays for Total and Polyglutamine-Expanded Huntingtin Proteins
Boogaard, Ivette; Smith, Melanie; Pulli, Kristiina; Szynol, Agnieszka; Albertus, Faywell; Lamers, Marieke B. A. C.; Dijkstra, Sipke; Kordt, Daniel; Reindl, Wolfgang; Herrmann, Frank; McAllister, George; Fischer, David F.; Munoz-Sanjuan, Ignacio
2014-01-01
The expansion of a CAG trinucleotide repeat in the huntingtin gene, which produces huntingtin protein with an expanded polyglutamine tract, is the cause of Huntington's disease (HD). Recent studies have reported that RNAi suppression of polyglutamine-expanded huntingtin (mutant HTT) in HD animal models can ameliorate disease phenotypes. A key requirement for such preclinical studies, as well as eventual clinical trials, aimed to reduce mutant HTT exposure is a robust method to measure HTT protein levels in select tissues. We have developed several sensitive and selective assays that measure either total human HTT or polyglutamine-expanded human HTT proteins on the electrochemiluminescence Meso Scale Discovery detection platform with an increased dynamic range over other methods. In addition, we have developed an assay to detect endogenous mouse and rat HTT proteins in pre-clinical models of HD to monitor effects on the wild type protein of both allele selective and non-selective interventions. We demonstrate the application of these assays to measure HTT protein in several HD in vitro cellular and in vivo animal model systems as well as in HD patient biosamples. Furthermore, we used purified recombinant HTT proteins as standards to quantitate the absolute amount of HTT protein in such biosamples. PMID:24816435
Pathological Bases for a Robust Application of Cancer Molecular Classification
Diaz-Cano, Salvador J.
2015-01-01
Any robust classification system depends on its purpose and must refer to accepted standards, its strength relying on predictive values and a careful consideration of known factors that can affect its reliability. In this context, a molecular classification of human cancer must refer to the current gold standard (histological classification) and try to improve it with key prognosticators for metastatic potential, staging and grading. Although organ-specific examples have been published based on proteomics, transcriptomics and genomics evaluations, the most popular approach uses gene expression analysis as a direct correlate of cellular differentiation, which represents the key feature of the histological classification. RNA is a labile molecule that varies significantly according with the preservation protocol, its transcription reflect the adaptation of the tumor cells to the microenvironment, it can be passed through mechanisms of intercellular transference of genetic information (exosomes), and it is exposed to epigenetic modifications. More robust classifications should be based on stable molecules, at the genetic level represented by DNA to improve reliability, and its analysis must deal with the concept of intratumoral heterogeneity, which is at the origin of tumor progression and is the byproduct of the selection process during the clonal expansion and progression of neoplasms. The simultaneous analysis of multiple DNA targets and next generation sequencing offer the best practical approach for an analytical genomic classification of tumors. PMID:25898411
GFD-Net: A novel semantic similarity methodology for the analysis of gene networks.
Díaz-Montaña, Juan J; Díaz-Díaz, Norberto; Gómez-Vela, Francisco
2017-04-01
Since the popularization of biological network inference methods, it has become crucial to create methods to validate the resulting models. Here we present GFD-Net, the first methodology that applies the concept of semantic similarity to gene network analysis. GFD-Net combines the concept of semantic similarity with the use of gene network topology to analyze the functional dissimilarity of gene networks based on Gene Ontology (GO). The main innovation of GFD-Net lies in the way that semantic similarity is used to analyze gene networks taking into account the network topology. GFD-Net selects a functionality for each gene (specified by a GO term), weights each edge according to the dissimilarity between the nodes at its ends and calculates a quantitative measure of the network functional dissimilarity, i.e. a quantitative value of the degree of dissimilarity between the connected genes. The robustness of GFD-Net as a gene network validation tool was demonstrated by performing a ROC analysis on several network repositories. Furthermore, a well-known network was analyzed showing that GFD-Net can also be used to infer knowledge. The relevance of GFD-Net becomes more evident in Section "GFD-Net applied to the study of human diseases" where an example of how GFD-Net can be applied to the study of human diseases is presented. GFD-Net is available as an open-source Cytoscape app which offers a user-friendly interface to configure and execute the algorithm as well as the ability to visualize and interact with the results(http://apps.cytoscape.org/apps/gfdnet). Copyright © 2017 Elsevier Inc. All rights reserved.
Transcriptomic Assessment of Isozymes in the Biphenyl Pathway of Rhodococcus sp. Strain RHA1†
Gonçalves, Edmilson R.; Hara, Hirofumi; Miyazawa, Daisuke; Davies, Julian E.; Eltis, Lindsay D.; Mohn, William W.
2006-01-01
Rhodococcus sp. RHA1 grows on a broad range of aromatic compounds and vigorously degrades polychlorinated biphenyls (PCBs). Previous work identified RHA1 genes encoding multiple isozymes for most of the seven steps of the biphenyl (BPH) pathway, provided evidence for coexpression of some of these isozymes, and indicated the involvement of some of these enzymes in the degradation of BPH, ethylbenzene (ETB), and PCBs. To investigate the expression of these isozymes and better understand how they contribute to the robust degradative capacity of RHA1, we comprehensively analyzed the 9.7-Mb genome of RHA1 for BPH pathway genes and characterized the transcriptome of RHA1 growing on benzoate (BEN), BPH, and ETB. Sequence analyses revealed 54 potential BPH pathway genes, including 28 not previously reported. Transcriptomic analysis with a DNA microarray containing 70-mer probes for 8,213 RHA1 genes revealed a suite of 320 genes of diverse functions that were upregulated during growth both on BPH and on ETB, relative to growth on the control substrate, pyruvate. By contrast, only 65 genes were upregulated during growth on BEN. Quantitative PCR assays confirmed microarray results for selected genes and indicated that some of the catabolic genes were upregulated over 10,000-fold. Our analysis suggests that up to 22 enzymes, including 8 newly identified ones, may function in the BPH pathway of RHA1. The relative expression levels of catabolic genes did not differ for BPH and ETB, suggesting a common regulatory mechanism. This study delineated a suite of catabolic enzymes for biphenyl and alkyl-benzenes in RHA1, which is larger than previously recognized and which may serve as a model for catabolism in other environmentally important bacteria having large genomes. PMID:16957245
NASA Astrophysics Data System (ADS)
Xia, Wei; Chen, Ying; Zhang, Rui; Yan, Zhuangzhi; Zhou, Xiaobo; Zhang, Bo; Gao, Xin
2018-02-01
Our objective was to identify prognostic imaging biomarkers for hepatocellular carcinoma in contrast-enhanced computed tomography (CECT) with biological interpretations by associating imaging features and gene modules. We retrospectively analyzed 371 patients who had gene expression profiles. For the 38 patients with CECT imaging data, automatic intra-tumor partitioning was performed, resulting in three spatially distinct subregions. We extracted a total of 37 quantitative imaging features describing intensity, geometry, and texture from each subregion. Imaging features were selected after robustness and redundancy analysis. Gene modules acquired from clustering were chosen for their prognostic significance. By constructing an association map between imaging features and gene modules with Spearman rank correlations, the imaging features that significantly correlated with gene modules were obtained. These features were evaluated with Cox’s proportional hazard models and Kaplan-Meier estimates to determine their prognostic capabilities for overall survival (OS). Eight imaging features were significantly correlated with prognostic gene modules, and two of them were associated with OS. Among these, the geometry feature volume fraction of the subregion, which was significantly correlated with all prognostic gene modules representing cancer-related interpretation, was predictive of OS (Cox p = 0.022, hazard ratio = 0.24). The texture feature cluster prominence in the subregion, which was correlated with the prognostic gene module representing lipid metabolism and complement activation, also had the ability to predict OS (Cox p = 0.021, hazard ratio = 0.17). Imaging features depicting the volume fraction and textural heterogeneity in subregions have the potential to be predictors of OS with interpretable biological meaning.
A fast, robust and tunable synthetic gene oscillator.
Stricker, Jesse; Cookson, Scott; Bennett, Matthew R; Mather, William H; Tsimring, Lev S; Hasty, Jeff
2008-11-27
One defining goal of synthetic biology is the development of engineering-based approaches that enable the construction of gene-regulatory networks according to 'design specifications' generated from computational modelling. This approach provides a systematic framework for exploring how a given regulatory network generates a particular phenotypic behaviour. Several fundamental gene circuits have been developed using this approach, including toggle switches and oscillators, and these have been applied in new contexts such as triggered biofilm development and cellular population control. Here we describe an engineered genetic oscillator in Escherichia coli that is fast, robust and persistent, with tunable oscillatory periods as fast as 13 min. The oscillator was designed using a previously modelled network architecture comprising linked positive and negative feedback loops. Using a microfluidic platform tailored for single-cell microscopy, we precisely control environmental conditions and monitor oscillations in individual cells through multiple cycles. Experiments reveal remarkable robustness and persistence of oscillations in the designed circuit; almost every cell exhibited large-amplitude fluorescence oscillations throughout observation runs. The oscillatory period can be tuned by altering inducer levels, temperature and the media source. Computational modelling demonstrates that the key design principle for constructing a robust oscillator is a time delay in the negative feedback loop, which can mechanistically arise from the cascade of cellular processes involved in forming a functional transcription factor. The positive feedback loop increases the robustness of the oscillations and allows for greater tunability. Examination of our refined model suggested the existence of a simplified oscillator design without positive feedback, and we construct an oscillator strain confirming this computational prediction.
Castells, Xavier; Acebes, Juan José; Majós, Carles; Boluda, Susana; Julià-Sapé, Margarida; Candiota, Ana Paula; Ariño, Joaquín; Barceló, Anna; Arús, Carles
2015-01-01
Glioblastoma (Gb) is one of the most deadly tumors. Its molecular subtypes are yet to be fully characterized while the attendant efforts for personalized medicine need to be intensified in relation to glioblastoma diagnosis, treatment, and prognosis. Several molecular signatures based on gene expression microarrays were reported, but the use of microarrays for routine clinical practice is challenged by attendant economic costs. Several authors have proposed discriminant equations based on RT-PCR. Still, the discriminant threshold is often incompletely described, which makes proper validation difficult. In a previous work, we have reported two Gb subtypes based on the expression levels of four genes: CHI3L1, LDHA, LGALS1, and IGFBP3. One Gb subtype presented with low expression of the four genes mentioned, and of MGMT in a large portion of the patients (with anticipated high methylation of its promoter), and mutated IDH1. Here, we evaluate the robustness of the equations fitted with these genes using RT-PCR values in a set of 64 cases and importantly, define an unequivocal discriminant threshold with a view to prognostic implications. We developed two approaches to generate the discriminant equations: 1) using the expression level of the four genes mentioned above, and 2) using those genes displaying the highest correlation with survival among the aforementioned four ones, plus MGMT, as an attempt to further reduce the number of genes. The ease of equations' applicability, reduction in cost for raw data, and robustness in terms of resampling-based classification accuracy warrant further evaluation of these equations to discern Gb tumor biopsy heterogeneity at molecular level, diagnose potential malignancy, and prognosis of individual patients with glioblastomas.
How does epistasis influence the response to selection?
Barton, N H
2017-01-01
Much of quantitative genetics is based on the ‘infinitesimal model', under which selection has a negligible effect on the genetic variance. This is typically justified by assuming a very large number of loci with additive effects. However, it applies even when genes interact, provided that the number of loci is large enough that selection on each of them is weak relative to random drift. In the long term, directional selection will change allele frequencies, but even then, the effects of epistasis on the ultimate change in trait mean due to selection may be modest. Stabilising selection can maintain many traits close to their optima, even when the underlying alleles are weakly selected. However, the number of traits that can be optimised is apparently limited to ~4Ne by the ‘drift load', and this is hard to reconcile with the apparent complexity of many organisms. Just as for the mutation load, this limit can be evaded by a particular form of negative epistasis. A more robust limit is set by the variance in reproductive success. This suggests that selection accumulates information most efficiently in the infinitesimal regime, when selection on individual alleles is weak, and comparable with random drift. A review of evidence on selection strength suggests that although most variance in fitness may be because of alleles with large Nes, substantial amounts of adaptation may be because of alleles in the infinitesimal regime, in which epistasis has modest effects. PMID:27901509
How does epistasis influence the response to selection?
Barton, N H
2017-01-01
Much of quantitative genetics is based on the 'infinitesimal model', under which selection has a negligible effect on the genetic variance. This is typically justified by assuming a very large number of loci with additive effects. However, it applies even when genes interact, provided that the number of loci is large enough that selection on each of them is weak relative to random drift. In the long term, directional selection will change allele frequencies, but even then, the effects of epistasis on the ultimate change in trait mean due to selection may be modest. Stabilising selection can maintain many traits close to their optima, even when the underlying alleles are weakly selected. However, the number of traits that can be optimised is apparently limited to ~4N e by the 'drift load', and this is hard to reconcile with the apparent complexity of many organisms. Just as for the mutation load, this limit can be evaded by a particular form of negative epistasis. A more robust limit is set by the variance in reproductive success. This suggests that selection accumulates information most efficiently in the infinitesimal regime, when selection on individual alleles is weak, and comparable with random drift. A review of evidence on selection strength suggests that although most variance in fitness may be because of alleles with large N e s, substantial amounts of adaptation may be because of alleles in the infinitesimal regime, in which epistasis has modest effects.
Nucleotide exchange and excision technology DNA shuffling and directed evolution.
Speck, Janina; Stebel, Sabine C; Arndt, Katja M; Müller, Kristian M
2011-01-01
Remarkable success in optimizing complex properties within DNA and proteins has been achieved by directed evolution. In contrast to various random mutagenesis methods and high-throughput selection methods, the number of available DNA shuffling procedures is limited, and protocols are often difficult to adjust. The strength of the nucleotide exchange and excision technology (NExT) DNA shuffling described here is the robust, efficient, and easily controllable DNA fragmentation step based on random incorporation of the so-called 'exchange nucleotides' by PCR. The exchange nucleotides are removed enzymatically, followed by chemical cleavage of the DNA backbone. The oligonucleotide pool is reassembled into full-length genes by internal primer extension, and the recombined gene library is amplified by standard PCR. The technique has been demonstrated by shuffling a defined gene library of chloramphenicol acetyltransferase variants using uridine as fragmentation defining exchange nucleotide. Substituting 33% of the dTTP with dUTP in the incorporation PCR resulted in shuffled clones with an average parental fragment size of 86 bases and revealed a mutation rate of only 0.1%. Additionally, a computer program (NExTProg) has been developed that predicts the fragment size distribution depending on the relative amount of the exchange nucleotide.
Allele exchange at the EPSPS locus confers glyphosate tolerance in cassava.
Hummel, Aaron W; Chauhan, Raj Deepika; Cermak, Tomas; Mutka, Andrew M; Vijayaraghavan, Anupama; Boyher, Adam; Starker, Colby G; Bart, Rebecca; Voytas, Daniel F; Taylor, Nigel J
2017-12-09
Effective weed control can protect yields of cassava (Manihot esculenta) storage roots. Farmers could benefit from using herbicide with a tolerant cultivar. We applied traditional transgenesis and gene editing to generate robust glyphosate tolerance in cassava. By comparing promoters regulating expression of transformed 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) genes with various paired amino acid substitutions, we found that strong constitutive expression is required to achieve glyphosate tolerance during in vitro selection and in whole cassava plants. Using strategies that exploit homologous recombination (HR) and nonhomologous end-joining (NHEJ) DNA repair pathways, we precisely introduced the best-performing allele into the cassava genome, simultaneously creating a promoter swap and dual amino acid substitutions at the endogenous EPSPS locus. Primary EPSPS-edited plants were phenotypically normal, tolerant to high doses of glyphosate, with some free of detectable T-DNA integrations. Our methods demonstrate an editing strategy for creating glyphosate tolerance in crop plants and demonstrate the potential of gene editing for further improvement of cassava. © 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
An anatomically comprehensive atlas of the adult human brain transcriptome
Guillozet-Bongaarts, Angela L.; Shen, Elaine H.; Ng, Lydia; Miller, Jeremy A.; van de Lagemaat, Louie N.; Smith, Kimberly A.; Ebbert, Amanda; Riley, Zackery L.; Abajian, Chris; Beckmann, Christian F.; Bernard, Amy; Bertagnolli, Darren; Boe, Andrew F.; Cartagena, Preston M.; Chakravarty, M. Mallar; Chapin, Mike; Chong, Jimmy; Dalley, Rachel A.; David Daly, Barry; Dang, Chinh; Datta, Suvro; Dee, Nick; Dolbeare, Tim A.; Faber, Vance; Feng, David; Fowler, David R.; Goldy, Jeff; Gregor, Benjamin W.; Haradon, Zeb; Haynor, David R.; Hohmann, John G.; Horvath, Steve; Howard, Robert E.; Jeromin, Andreas; Jochim, Jayson M.; Kinnunen, Marty; Lau, Christopher; Lazarz, Evan T.; Lee, Changkyu; Lemon, Tracy A.; Li, Ling; Li, Yang; Morris, John A.; Overly, Caroline C.; Parker, Patrick D.; Parry, Sheana E.; Reding, Melissa; Royall, Joshua J.; Schulkin, Jay; Sequeira, Pedro Adolfo; Slaughterbeck, Clifford R.; Smith, Simon C.; Sodt, Andy J.; Sunkin, Susan M.; Swanson, Beryl E.; Vawter, Marquis P.; Williams, Derric; Wohnoutka, Paul; Zielke, H. Ronald; Geschwind, Daniel H.; Hof, Patrick R.; Smith, Stephen M.; Koch, Christof; Grant, Seth G. N.; Jones, Allan R.
2014-01-01
Neuroanatomically precise, genome-wide maps of transcript distributions are critical resources to complement genomic sequence data and to correlate functional and genetic brain architecture. Here we describe the generation and analysis of a transcriptional atlas of the adult human brain, comprising extensive histological analysis and comprehensive microarray profiling of ~900 neuroanatomically precise subdivisions in two individuals. Transcriptional regulation varies enormously by anatomical location, with different regions and their constituent cell types displaying robust molecular signatures that are highly conserved between individuals. Analysis of differential gene expression and gene co-expression relationships demonstrates that brain-wide variation strongly reflects the distributions of major cell classes such as neurons, oligodendrocytes, astrocytes and microglia. Local neighbourhood relationships between fine anatomical subdivisions are associated with discrete neuronal subtypes and genes involved with synaptic transmission. The neocortex displays a relatively homogeneous transcriptional pattern, but with distinct features associated selectively with primary sensorimotor cortices and with enriched frontal lobe expression. Notably, the spatial topography of the neocortex is strongly reflected in its molecular topography— the closer two cortical regions, the more similar their transcriptomes. This freely accessible online data resource forms a high-resolution transcriptional baseline for neurogenetic studies of normal and abnormal human brain function. PMID:22996553
Metabolic gene regulation in a dynamically changing environment.
Bennett, Matthew R; Pang, Wyming Lee; Ostroff, Natalie A; Baumgartner, Bridget L; Nayak, Sujata; Tsimring, Lev S; Hasty, Jeff
2008-08-28
Natural selection dictates that cells constantly adapt to dynamically changing environments in a context-dependent manner. Gene-regulatory networks often mediate the cellular response to perturbation, and an understanding of cellular adaptation will require experimental approaches aimed at subjecting cells to a dynamic environment that mimics their natural habitat. Here we monitor the response of Saccharomyces cerevisiae metabolic gene regulation to periodic changes in the external carbon source by using a microfluidic platform that allows precise, dynamic control over environmental conditions. We show that the metabolic system acts as a low-pass filter that reliably responds to a slowly changing environment, while effectively ignoring fast fluctuations. The sensitive low-frequency response was significantly faster than in predictions arising from our computational modelling, and this discrepancy was resolved by the discovery that two key galactose transcripts possess half-lives that depend on the carbon source. Finally, to explore how induction characteristics affect frequency response, we compare two S. cerevisiae strains and show that they have the same frequency response despite having markedly different induction properties. This suggests that although certain characteristics of the complex networks may differ when probed in a static environment, the system has been optimized for a robust response to a dynamically changing environment.
Ribas, Vinicius T.; Costa, Marcos R.
2017-01-01
Limited axon regeneration in the injured adult mammalian central nervous system (CNS) usually results in irreversible functional deficits. Both the presence of extrinsic inhibitory molecules at the injury site and the intrinsically low capacity of adult neurons to grow axons are responsible for the diminished capacity of regeneration in the adult CNS. Conversely, in the embryonic CNS, neurons show a high regenerative capacity, mostly due to the expression of genes that positively control axon growth and downregulation of genes that inhibit axon growth. A better understanding of the role of these key genes controlling pro-regenerative mechanisms is pivotal to develop strategies to promote robust axon regeneration following adult CNS injury. Genetic manipulation techniques have been widely used to investigate the role of specific genes or a combination of different genes in axon regrowth. This review summarizes a myriad of studies that used genetic manipulations to promote axon growth in the injured CNS. We also review the roles of some of these genes during CNS development and suggest possible approaches to identify new candidate genes. Finally, we critically address the main advantages and pitfalls of gene-manipulation techniques, and discuss new strategies to promote robust axon regeneration in the mature CNS. PMID:28824380
Klett, Hagen; Fuellgraf, Hannah; Levit-Zerdoun, Ella; Hussung, Saskia; Kowar, Silke; Küsters, Simon; Bronsert, Peter; Werner, Martin; Wittel, Uwe; Fritsch, Ralph; Busch, Hauke; Boerries, Melanie
2018-01-01
Late diagnosis and systemic dissemination essentially contribute to the invariably poor prognosis of pancreatic ductal adenocarcinoma (PDAC). Therefore, the development of diagnostic biomarkers for PDAC are urgently needed to improve patient stratification and outcome in the clinic. By studying the transcriptomes of independent PDAC patient cohorts of tumor and non-tumor tissues, we identified 81 robustly regulated genes, through a novel, generally applicable meta-analysis. Using consensus clustering on co-expression values revealed four distinct clusters with genes originating from exocrine/endocrine pancreas, stromal and tumor cells. Three clusters were strongly associated with survival of PDAC patients based on TCGA database underlining the prognostic potential of the identified genes. With the added information of impact of survival and the robustness within the meta-analysis, we extracted a 17-gene subset for further validation. We show that it did not only discriminate PDAC from non-tumor tissue and stroma in fresh-frozen as well as formalin-fixed paraffin embedded samples, but also detected pancreatic precursor lesions and singled out pancreatitis samples. Moreover, the classifier discriminated PDAC from other cancers in the TCGA database. In addition, we experimentally validated the classifier in PDAC patients on transcript level using qPCR and exemplify the usage on protein level for three proteins (AHNAK2, LAMC2, TFF1) using immunohistochemistry and for two secreted proteins (TFF1, SERPINB5) using ELISA-based protein detection in blood-plasma. In conclusion, we present a novel robust diagnostic and prognostic gene signature for PDAC with future potential applicability in the clinic.
Klett, Hagen; Fuellgraf, Hannah; Levit-Zerdoun, Ella; Hussung, Saskia; Kowar, Silke; Küsters, Simon; Bronsert, Peter; Werner, Martin; Wittel, Uwe; Fritsch, Ralph; Busch, Hauke; Boerries, Melanie
2018-01-01
Late diagnosis and systemic dissemination essentially contribute to the invariably poor prognosis of pancreatic ductal adenocarcinoma (PDAC). Therefore, the development of diagnostic biomarkers for PDAC are urgently needed to improve patient stratification and outcome in the clinic. By studying the transcriptomes of independent PDAC patient cohorts of tumor and non-tumor tissues, we identified 81 robustly regulated genes, through a novel, generally applicable meta-analysis. Using consensus clustering on co-expression values revealed four distinct clusters with genes originating from exocrine/endocrine pancreas, stromal and tumor cells. Three clusters were strongly associated with survival of PDAC patients based on TCGA database underlining the prognostic potential of the identified genes. With the added information of impact of survival and the robustness within the meta-analysis, we extracted a 17-gene subset for further validation. We show that it did not only discriminate PDAC from non-tumor tissue and stroma in fresh-frozen as well as formalin-fixed paraffin embedded samples, but also detected pancreatic precursor lesions and singled out pancreatitis samples. Moreover, the classifier discriminated PDAC from other cancers in the TCGA database. In addition, we experimentally validated the classifier in PDAC patients on transcript level using qPCR and exemplify the usage on protein level for three proteins (AHNAK2, LAMC2, TFF1) using immunohistochemistry and for two secreted proteins (TFF1, SERPINB5) using ELISA-based protein detection in blood-plasma. In conclusion, we present a novel robust diagnostic and prognostic gene signature for PDAC with future potential applicability in the clinic. PMID:29675033
Genome-wide analysis of drought induced gene expression changes in flax (Linum usitatissimum).
Dash, Prasanta K; Cao, Yongguo; Jailani, Abdul K; Gupta, Payal; Venglat, Prakash; Xiang, Daoquan; Rai, Rhitu; Sharma, Rinku; Thirunavukkarasu, Nepolean; Abdin, Malik Z; Yadava, Devendra K; Singh, Nagendra K; Singh, Jas; Selvaraj, Gopalan; Deyholos, Mike; Kumar, Polumetla Ananda; Datla, Raju
2014-01-01
A robust phenotypic plasticity to ward off adverse environmental conditions determines performance and productivity in crop plants. Flax (linseed), is an important cash crop produced for natural textile fiber (linen) or oilseed with many health promoting products. This crop is prone to drought stress and yield losses in many parts of the world. Despite recent advances in drought research in a number of important crops, related progress in flax is very limited. Since, response of this plant to drought stress has not been addressed at the molecular level; we conducted microarray analysis to capture transcriptome associated with induced drought in flax. This study identified 183 differentially expressed genes (DEGs) associated with diverse cellular, biophysical and metabolic programs in flax. The analysis also revealed especially the altered regulation of cellular and metabolic pathways governing photosynthesis. Additionally, comparative transcriptome analysis identified a plethora of genes that displayed differential regulation both spatially and temporally. These results revealed co-regulated expression of 26 genes in both shoot and root tissues with implications for drought stress response. Furthermore, the data also showed that more genes are upregulated in roots compared to shoots, suggesting that roots may play important and additional roles in response to drought in flax. With prolonged drought treatment, the number of DEGs increased in both tissue types. Differential expression of selected genes was confirmed by qRT-PCR, thus supporting the suggested functional association of these intrinsic genes in maintaining growth and homeostasis in response to imminent drought stress in flax. Together the present study has developed foundational and new transcriptome data sets for drought stress in flax.
2013-01-01
Background Understanding the function of a particular gene under various stresses is important for engineering plants for broad-spectrum stress tolerance. Although virus-induced gene silencing (VIGS) has been used to characterize genes involved in abiotic stress tolerance, currently available gene silencing and stress imposition methodology at the whole plant level is not suitable for high-throughput functional analyses of genes. This demands a robust and reliable methodology for characterizing genes involved in abiotic and multi-stress tolerance. Results Our methodology employs VIGS-based gene silencing in leaf disks combined with simple stress imposition and effect quantification methodologies for easy and faster characterization of genes involved in abiotic and multi-stress tolerance. By subjecting leaf disks from gene-silenced plants to various abiotic stresses and inoculating silenced plants with various pathogens, we show the involvement of several genes for multi-stress tolerance. In addition, we demonstrate that VIGS can be used to characterize genes involved in thermotolerance. Our results also showed the functional relevance of NtEDS1 in abiotic stress, NbRBX1 and NbCTR1 in oxidative stress; NtRAR1 and NtNPR1 in salinity stress; NbSOS1 and NbHSP101 in biotic stress; and NtEDS1, NbETR1, NbWRKY2 and NbMYC2 in thermotolerance. Conclusions In addition to widening the application of VIGS, we developed a robust, easy and high-throughput methodology for functional characterization of genes involved in multi-stress tolerance. PMID:24289810
GO-based functional dissimilarity of gene sets.
Díaz-Díaz, Norberto; Aguilar-Ruiz, Jesús S
2011-09-01
The Gene Ontology (GO) provides a controlled vocabulary for describing the functions of genes and can be used to evaluate the functional coherence of gene sets. Many functional coherence measures consider each pair of gene functions in a set and produce an output based on all pairwise distances. A single gene can encode multiple proteins that may differ in function. For each functionality, other proteins that exhibit the same activity may also participate. Therefore, an identification of the most common function for all of the genes involved in a biological process is important in evaluating the functional similarity of groups of genes and a quantification of functional coherence can helps to clarify the role of a group of genes working together. To implement this approach to functional assessment, we present GFD (GO-based Functional Dissimilarity), a novel dissimilarity measure for evaluating groups of genes based on the most relevant functions of the whole set. The measure assigns a numerical value to the gene set for each of the three GO sub-ontologies. Results show that GFD performs robustly when applied to gene set of known functionality (extracted from KEGG). It performs particularly well on randomly generated gene sets. An ROC analysis reveals that the performance of GFD in evaluating the functional dissimilarity of gene sets is very satisfactory. A comparative analysis against other functional measures, such as GS2 and those presented by Resnik and Wang, also demonstrates the robustness of GFD.
Evaluation of reference genes for insect olfaction studies.
Omondi, Bonaventure Aman; Latorre-Estivalis, Jose Manuel; Rocha Oliveira, Ivana Helena; Ignell, Rickard; Lorenzo, Marcelo Gustavo
2015-04-22
Quantitative reverse transcription PCR (qRT-PCR) is a robust and accessible method to assay gene expression and to infer gene regulation. Being a chain of procedures, this technique is subject to systematic error due to biological and technical limitations mainly set by the starting material and downstream procedures. Thus, rigorous data normalization is critical to grant reliability and repeatability of gene expression quantification by qRT-PCR. A number of 'housekeeping genes', involved in basic cellular functions, have been commonly used as internal controls for this normalization process. However, these genes could themselves be regulated and must therefore be tested a priori. We evaluated eight potential reference genes for their stability as internal controls for RT-qPCR studies of olfactory gene expression in the antennae of Rhodnius prolixus, a Chagas disease vector. The set of genes included were: α-tubulin; β-actin; Glyceraldehyde-3-phosphate dehydrogenase; Eukaryotic initiation factor 1A; Glutathione-S-transferase; Serine protease; Succinate dehydrogenase; and Glucose-6-phosphate dehydrogenase. Five experimental conditions, including changes in age,developmental stage and feeding status were tested in both sexes. We show that the evaluation of candidate reference genes is necessary for each combination of sex, tissue and physiological condition analyzed in order to avoid inconsistent results and conclusions. Although, Normfinder and geNorm software yielded different results between males and females, five genes (SDH, Tub, GAPDH, Act and G6PDH) appeared in the first positions in all rankings obtained. By using gene expression data of a single olfactory coreceptor gene as an example, we demonstrated the extent of changes expected using different internal standards. This work underlines the need for a rigorous selection of internal standards to grant the reliability of normalization processes in qRT-PCR studies. Furthermore, we show that particular physiological or developmental conditions require independent evaluation of a diverse set of potential reference genes.
Robust portfolio selection based on asymmetric measures of variability of stock returns
NASA Astrophysics Data System (ADS)
Chen, Wei; Tan, Shaohua
2009-10-01
This paper addresses a new uncertainty set--interval random uncertainty set for robust optimization. The form of interval random uncertainty set makes it suitable for capturing the downside and upside deviations of real-world data. These deviation measures capture distributional asymmetry and lead to better optimization results. We also apply our interval random chance-constrained programming to robust mean-variance portfolio selection under interval random uncertainty sets in the elements of mean vector and covariance matrix. Numerical experiments with real market data indicate that our approach results in better portfolio performance.
A robust optimisation approach to the problem of supplier selection and allocation in outsourcing
NASA Astrophysics Data System (ADS)
Fu, Yelin; Keung Lai, Kin; Liang, Liang
2016-03-01
We formulate the supplier selection and allocation problem in outsourcing under an uncertain environment as a stochastic programming problem. Both the decision-maker's attitude towards risk and the penalty parameters for demand deviation are considered in the objective function. A service level agreement, upper bound for each selected supplier's allocation and the number of selected suppliers are considered as constraints. A novel robust optimisation approach is employed to solve this problem under different economic situations. Illustrative examples are presented with managerial implications highlighted to support decision-making.
Chen, Bor-Sen; Lin, Ying-Po
2013-01-01
Robust stabilization and environmental disturbance attenuation are ubiquitous systematic properties observed in biological systems at different levels. The underlying principles for robust stabilization and environmental disturbance attenuation are universal to both complex biological systems and sophisticated engineering systems. In many biological networks, network robustness should be enough to confer intrinsic robustness in order to tolerate intrinsic parameter fluctuations, genetic robustness for buffering genetic variations, and environmental robustness for resisting environmental disturbances. With this, the phenotypic stability of biological network can be maintained, thus guaranteeing phenotype robustness. This paper presents a survey on biological systems and then develops a unifying mathematical framework for investigating the principles of both robust stabilization and environmental disturbance attenuation in systems and evolutionary biology. Further, from the unifying mathematical framework, it was discovered that the phenotype robustness criterion for biological networks at different levels relies upon intrinsic robustness + genetic robustness + environmental robustness ≦ network robustness. When this is true, the phenotype robustness can be maintained in spite of intrinsic parameter fluctuations, genetic variations, and environmental disturbances. Therefore, the trade-offs between intrinsic robustness, genetic robustness, environmental robustness, and network robustness in systems and evolutionary biology can also be investigated through their corresponding phenotype robustness criterion from the systematic point of view. PMID:23515240
Chen, Bor-Sen; Lin, Ying-Po
2013-01-01
Robust stabilization and environmental disturbance attenuation are ubiquitous systematic properties observed in biological systems at different levels. The underlying principles for robust stabilization and environmental disturbance attenuation are universal to both complex biological systems and sophisticated engineering systems. In many biological networks, network robustness should be enough to confer intrinsic robustness in order to tolerate intrinsic parameter fluctuations, genetic robustness for buffering genetic variations, and environmental robustness for resisting environmental disturbances. With this, the phenotypic stability of biological network can be maintained, thus guaranteeing phenotype robustness. This paper presents a survey on biological systems and then develops a unifying mathematical framework for investigating the principles of both robust stabilization and environmental disturbance attenuation in systems and evolutionary biology. Further, from the unifying mathematical framework, it was discovered that the phenotype robustness criterion for biological networks at different levels relies upon intrinsic robustness + genetic robustness + environmental robustness ≦ network robustness. When this is true, the phenotype robustness can be maintained in spite of intrinsic parameter fluctuations, genetic variations, and environmental disturbances. Therefore, the trade-offs between intrinsic robustness, genetic robustness, environmental robustness, and network robustness in systems and evolutionary biology can also be investigated through their corresponding phenotype robustness criterion from the systematic point of view.
Celis, Juan Sebastián; Edgell, David R; Stelbrink, Björn; Wibberg, Daniel; Hauffe, Torsten; Blom, Jochen; Kalinowski, Jörn; Wilke, Thomas
2017-01-01
Group I introns and homing endonuclease genes (HEGs) are mobile genetic elements, capable of invading target sequences in intron-less genomes. LAGLIDADG HEGs are the largest family of endonucleases, playing a key role in the mobility of group I introns in a process known as 'homing'. Group I introns and HEGs are rare in metazoans, and can be mainly found inserted in the COXI gene of some sponges and cnidarians, including stony corals (Scleractinia) and mushroom corals (Corallimorpharia). Vertical and horizontal intron transfer mechanisms have been proposed as explanations for intron occurrence in cnidarians. However, the central role of LAGLIDADG motifs in intron mobility mechanisms remains poorly understood. To resolve questions regarding the evolutionary origin and distribution of group I introns and HEGs in Scleractinia and Corallimorpharia, we examined intron/HEGs sequences within a comprehensive phylogenetic framework. Analyses of LAGLIDADG motif conservation showed a high degree of degradation in complex Scleractinia and Corallimorpharia. Moreover, the two motifs lack the respective acidic residues necessary for metal-ion binding and catalysis, potentially impairing horizontal intron mobility. In contrast, both motifs are highly conserved within robust Scleractinia, indicating a fully functional endonuclease capable of promoting horizontal intron transference. A higher rate of non-synonymous substitutions (Ka) detected in the HEGs of complex Scleractinia and Corallimorpharia suggests degradation of the HEG, whereas lower Ka rates in robust Scleractinia are consistent with a scenario of purifying selection. Molecular-clock analyses and ancestral inference of intron type indicated an earlier intron insertion in complex Scleractinia and Corallimorpharia in comparison to robust Scleractinia. These findings suggest that the lack of horizontal intron transfers in the former two groups is related to an age-dependent degradation of the endonuclease activity. Moreover, they also explain the peculiar geographical patterns of introns in stony and mushroom corals.
LOD score exclusion analyses for candidate genes using random population samples.
Deng, H W; Li, J; Recker, R R
2001-05-01
While extensive analyses have been conducted to test for, no formal analyses have been conducted to test against, the importance of candidate genes with random population samples. We develop a LOD score approach for exclusion analyses of candidate genes with random population samples. Under this approach, specific genetic effects and inheritance models at candidate genes can be analysed and if a LOD score is < or = - 2.0, the locus can be excluded from having an effect larger than that specified. Computer simulations show that, with sample sizes often employed in association studies, this approach has high power to exclude a gene from having moderate genetic effects. In contrast to regular association analyses, population admixture will not affect the robustness of our analyses; in fact, it renders our analyses more conservative and thus any significant exclusion result is robust. Our exclusion analysis complements association analysis for candidate genes in random population samples and is parallel to the exclusion mapping analyses that may be conducted in linkage analyses with pedigrees or relative pairs. The usefulness of the approach is demonstrated by an application to test the importance of vitamin D receptor and estrogen receptor genes underlying the differential risk to osteoporotic fractures.
Fang, Xin; Sastry, Anand; Mih, Nathan; Kim, Donghyuk; Tan, Justin; Lloyd, Colton J.; Gao, Ye; Yang, Laurence; Palsson, Bernhard O.
2017-01-01
Transcriptional regulatory networks (TRNs) have been studied intensely for >25 y. Yet, even for the Escherichia coli TRN—probably the best characterized TRN—several questions remain. Here, we address three questions: (i) How complete is our knowledge of the E. coli TRN; (ii) how well can we predict gene expression using this TRN; and (iii) how robust is our understanding of the TRN? First, we reconstructed a high-confidence TRN (hiTRN) consisting of 147 transcription factors (TFs) regulating 1,538 transcription units (TUs) encoding 1,764 genes. The 3,797 high-confidence regulatory interactions were collected from published, validated chromatin immunoprecipitation (ChIP) data and RegulonDB. For 21 different TF knockouts, up to 63% of the differentially expressed genes in the hiTRN were traced to the knocked-out TF through regulatory cascades. Second, we trained supervised machine learning algorithms to predict the expression of 1,364 TUs given TF activities using 441 samples. The algorithms accurately predicted condition-specific expression for 86% (1,174 of 1,364) of the TUs, while 193 TUs (14%) were predicted better than random TRNs. Third, we identified 10 regulatory modules whose definitions were robust against changes to the TRN or expression compendium. Using surrogate variable analysis, we also identified three unmodeled factors that systematically influenced gene expression. Our computational workflow comprehensively characterizes the predictive capabilities and systems-level functions of an organism’s TRN from disparate data types. PMID:28874552
Genome-wide screen identifies a novel prognostic signature for breast cancer survival
Mao, Xuan Y.; Lee, Matthew J.; Zhu, Jeffrey; ...
2017-01-21
Large genomic datasets in combination with clinical data can be used as an unbiased tool to identify genes important in patient survival and discover potential therapeutic targets. We used a genome-wide screen to identify 587 genes significantly and robustly deregulated across four independent breast cancer (BC) datasets compared to normal breast tissue. Gene expression of 381 genes was significantly associated with relapse-free survival (RFS) in BC patients. We used a gene co-expression network approach to visualize the genetic architecture in normal breast and BCs. In normal breast tissue, co-expression cliques were identified enriched for cell cycle, gene transcription, cell adhesion,more » cytoskeletal organization and metabolism. In contrast, in BC, only two major co-expression cliques were identified enriched for cell cycle-related processes or blood vessel development, cell adhesion and mammary gland development processes. Interestingly, gene expression levels of 7 genes were found to be negatively correlated with many cell cycle related genes, highlighting these genes as potential tumor suppressors and novel therapeutic targets. A forward-conditional Cox regression analysis was used to identify a 12-gene signature associated with RFS. A prognostic scoring system was created based on the 12-gene signature. This scoring system robustly predicted BC patient RFS in 60 sampling test sets and was further validated in TCGA and METABRIC BC data. Our integrated study identified a 12-gene prognostic signature that could guide adjuvant therapy for BC patients and includes novel potential molecular targets for therapy.« less
Genome-wide screen identifies a novel prognostic signature for breast cancer survival
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mao, Xuan Y.; Lee, Matthew J.; Zhu, Jeffrey
Large genomic datasets in combination with clinical data can be used as an unbiased tool to identify genes important in patient survival and discover potential therapeutic targets. We used a genome-wide screen to identify 587 genes significantly and robustly deregulated across four independent breast cancer (BC) datasets compared to normal breast tissue. Gene expression of 381 genes was significantly associated with relapse-free survival (RFS) in BC patients. We used a gene co-expression network approach to visualize the genetic architecture in normal breast and BCs. In normal breast tissue, co-expression cliques were identified enriched for cell cycle, gene transcription, cell adhesion,more » cytoskeletal organization and metabolism. In contrast, in BC, only two major co-expression cliques were identified enriched for cell cycle-related processes or blood vessel development, cell adhesion and mammary gland development processes. Interestingly, gene expression levels of 7 genes were found to be negatively correlated with many cell cycle related genes, highlighting these genes as potential tumor suppressors and novel therapeutic targets. A forward-conditional Cox regression analysis was used to identify a 12-gene signature associated with RFS. A prognostic scoring system was created based on the 12-gene signature. This scoring system robustly predicted BC patient RFS in 60 sampling test sets and was further validated in TCGA and METABRIC BC data. Our integrated study identified a 12-gene prognostic signature that could guide adjuvant therapy for BC patients and includes novel potential molecular targets for therapy.« less
Towards programmable plant genetic circuits.
Medford, June I; Prasad, Ashok
2016-07-01
Synthetic biology enables the construction of genetic circuits with predictable gene functions in plants. Detailed quantitative descriptions of the transfer function or input-output function for genetic parts (promoters, 5' and 3' untranslated regions, etc.) are collected. These data are then used in computational simulations to determine their robustness and desired properties, thereby enabling the best components to be selected for experimental testing in plants. In addition, the process forms an iterative workflow which allows vast improvement to validated elements with sub-optimal function. These processes enable computational functions such as digital logic in living plants and follow the pathway of technological advances which took us from vacuum tubes to cell phones. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.
Verdugo, Ricardo A; Zeller, Tanja; Rotival, Maxime; Wild, Philipp S; Münzel, Thomas; Lackner, Karl J; Weidmann, Henri; Ninio, Ewa; Trégouët, David-Alexandre; Cambien, François; Blankenberg, Stefan; Tiret, Laurence
2013-01-01
Smoking is a risk factor for atherosclerosis with reported widespread effects on gene expression in circulating blood cells. We hypothesized that a molecular signature mediating the relation between smoking and atherosclerosis may be found in the transcriptome of circulating monocytes. Genome-wide expression profiles and counts of atherosclerotic plaques in carotid arteries were collected in 248 smokers and 688 non-smokers from the general population. Patterns of co-expressed genes were identified by Independent Component Analysis (ICA) and network structure of the pattern-specific gene modules was inferred by the PC-algorithm. A likelihood-based causality test was implemented to select patterns that fit models containing a path "smoking→gene expression→plaques". Robustness of the causal inference was assessed by bootstrapping. At a FDR ≤0.10, 3,368 genes were associated to smoking or plaques, of which 93% were associated to smoking only. SASH1 showed the strongest association to smoking and PPARG the strongest association to plaques. Twenty-nine gene patterns were identified by ICA. Modules containing SASH1 and PPARG did not show evidence for the "smoking→gene expression→plaques" causality model. Conversely, three modules had good support for causal effects and exhibited a network topology consistent with gene expression mediating the relation between smoking and plaques. The network with the strongest support for causal effects was connected to plaques through SLC39A8, a gene with known association to HDL-cholesterol and cellular uptake of cadmium from tobacco, while smoking was directly connected to GAS6, a gene reported to have anti-inflammatory effects in atherosclerosis and to be up-regulated in the placenta of women smoking during pregnancy. Our analysis of the transcriptome of monocytes recovered genes relevant for association to smoking and atherosclerosis, and connected genes that before, were only studied in separate contexts. Inspection of correlation structure revealed candidates that would be missed by expression-phenotype association analysis alone.
Verdugo, Ricardo A.; Zeller, Tanja; Rotival, Maxime; Wild, Philipp S.; Münzel, Thomas; Lackner, Karl J.; Weidmann, Henri; Ninio, Ewa; Trégouët, David-Alexandre; Cambien, François; Blankenberg, Stefan; Tiret, Laurence
2013-01-01
Smoking is a risk factor for atherosclerosis with reported widespread effects on gene expression in circulating blood cells. We hypothesized that a molecular signature mediating the relation between smoking and atherosclerosis may be found in the transcriptome of circulating monocytes. Genome-wide expression profiles and counts of atherosclerotic plaques in carotid arteries were collected in 248 smokers and 688 non-smokers from the general population. Patterns of co-expressed genes were identified by Independent Component Analysis (ICA) and network structure of the pattern-specific gene modules was inferred by the PC-algorithm. A likelihood-based causality test was implemented to select patterns that fit models containing a path “smoking→gene expression→plaques”. Robustness of the causal inference was assessed by bootstrapping. At a FDR ≤0.10, 3,368 genes were associated to smoking or plaques, of which 93% were associated to smoking only. SASH1 showed the strongest association to smoking and PPARG the strongest association to plaques. Twenty-nine gene patterns were identified by ICA. Modules containing SASH1 and PPARG did not show evidence for the “smoking→gene expression→plaques” causality model. Conversely, three modules had good support for causal effects and exhibited a network topology consistent with gene expression mediating the relation between smoking and plaques. The network with the strongest support for causal effects was connected to plaques through SLC39A8, a gene with known association to HDL-cholesterol and cellular uptake of cadmium from tobacco, while smoking was directly connected to GAS6, a gene reported to have anti-inflammatory effects in atherosclerosis and to be up-regulated in the placenta of women smoking during pregnancy. Our analysis of the transcriptome of monocytes recovered genes relevant for association to smoking and atherosclerosis, and connected genes that before, were only studied in separate contexts. Inspection of correlation structure revealed candidates that would be missed by expression-phenotype association analysis alone. PMID:23372645
Hough, Denise; Swart, Pieter; Cloete, Schalk
2013-05-17
It is a difficult task to improve animal production by means of genetic selection, if the environment does not allow full expression of the animal's genetic potential. This concept may well be the future for animal welfare, because it highlights the need to incorporate traits related to production and robustness, simultaneously, to reach sustainable breeding goals. This review explores the identification of potential genetic markers for robustness within the hypothalamic-pituitary-adrenal axis (HPAA), since this axis plays a vital role in the stress response. If genetic selection for superior HPAA responses to stress is possible, then it ought to be possible to breed robust and easily managed genotypes that might be able to adapt to a wide range of environmental conditions whilst expressing a high production potential. This approach is explored in this review by means of lessons learnt from research on Merino sheep, which were divergently selected for their multiple rearing ability. These two selection lines have shown marked differences in reproduction, production and welfare, which makes this breeding programme ideal to investigate potential genetic markers of robustness. The HPAA function is explored in detail to elucidate where such genetic markers are likely to be found.
Samsonraj, Rebekah; Paradise, Christopher R; Dudakovic, Amel; Sen, Buer; Nair, Asha A; Dietz, Allan B; Deyle, David R; Cool, Simon M; Rubin, Janet; van Wijnen, Andre
2018-06-08
Differentiation of mesenchymal stromal/stem cells (MSCs) involves a series of molecular signals and gene transcription events required for attaining cell lineage commitment. Modulation of the actin cytoskeleton using cytochalasin D (CytoD) drives osteogenesis at early time points in bone marrow-derived MSCs, and also initiates a robust osteogenic differentiation program in adipose-derived MSCs. To understand the molecular basis for these pronounced effects on osteogenic differentiation, we investigated global changes in gene expression in CytoD-treated murine and human MSCs by high-resolution RNA-sequencing (RNA-seq) analysis. A three-way bioinformatic comparison between human adipose-derived, human bone marrow-derived and mouse bone marrow-derived MSCs revealed significant upregulation of genes linked to extracellular matrix organization, cell adhesion and bone metabolism. As anticipated, the activation of these differentiation related genes is accompanied by a downregulation of nuclear and cell cycle-related genes presumably reflecting cytostatic effects of CytoD. We also identified eight novel CytoD activated genes - VGLL4, ARHGAP24, KLHL24, RCBTB2, BDH2, SCARF2, ACAD10, HEPH - which are commonly upregulated across the two species and tissue sources of our MSC samples. We selected the Hippo-pathway related VGLL4 gene, which encodes the transcriptional co-factor Vestigial-like 4, for further study because this pathway is linked to osteogenesis. VGLL4 siRNA depletion reduces mineralization of adipose-derived MSCs during CytoD-induced osteogenic differentiation. Together, our RNA-seq analyses suggest that while the stimulatory effects of CytoD on osteogenesis are pleiotropic and depend on the biological state of the cell type, a small group of genes including VGLL4 may contribute to MSC commitment towards the bone lineage.
In Situ Gene Therapy via AAV-CRISPR-Cas9-Mediated Targeted Gene Regulation.
Moreno, Ana M; Fu, Xin; Zhu, Jie; Katrekar, Dhruva; Shih, Yu-Ru V; Marlett, John; Cabotaje, Jessica; Tat, Jasmine; Naughton, John; Lisowski, Leszek; Varghese, Shyni; Zhang, Kang; Mali, Prashant
2018-04-25
Development of efficacious in vivo delivery platforms for CRISPR-Cas9-based epigenome engineering will be critical to enable the ability to target human diseases without permanent modification of the genome. Toward this, we utilized split-Cas9 systems to develop a modular adeno-associated viral (AAV) vector platform for CRISPR-Cas9 delivery to enable the full spectrum of targeted in situ gene regulation functionalities, demonstrating robust transcriptional repression (up to 80%) and activation (up to 6-fold) of target genes in cell culture and mice. We also applied our platform for targeted in vivo gene-repression-mediated gene therapy for retinitis pigmentosa. Specifically, we engineered targeted repression of Nrl, a master regulator of rod photoreceptor determination, and demonstrated Nrl knockdown mediates in situ reprogramming of rod cells into cone-like cells that are resistant to retinitis pigmentosa-specific mutations, with concomitant prevention of secondary cone loss. Furthermore, we benchmarked our results from Nrl knockdown with those from in vivo Nrl knockout via gene editing. Taken together, our AAV-CRISPR-Cas9 platform for in vivo epigenome engineering enables a robust approach to target disease in a genomically scarless and potentially reversible manner. Copyright © 2018 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.
Liu, Xing-Cai; He, Shi-Wei; Song, Rui; Sun, Yang; Li, Hao-Dong
2014-01-01
Railway freight center location problem is an important issue in railway freight transport programming. This paper focuses on the railway freight center location problem in uncertain environment. Seeing that the expected value model ignores the negative influence of disadvantageous scenarios, a robust optimization model was proposed. The robust optimization model takes expected cost and deviation value of the scenarios as the objective. A cloud adaptive clonal selection algorithm (C-ACSA) was presented. It combines adaptive clonal selection algorithm with Cloud Model which can improve the convergence rate. Design of the code and progress of the algorithm were proposed. Result of the example demonstrates the model and algorithm are effective. Compared with the expected value cases, the amount of disadvantageous scenarios in robust model reduces from 163 to 21, which prove the result of robust model is more reliable.
GPS baseline configuration design based on robustness analysis
NASA Astrophysics Data System (ADS)
Yetkin, M.; Berber, M.
2012-11-01
The robustness analysis results obtained from a Global Positioning System (GPS) network are dramatically influenced by the configuration
Critical Dynamics in Genetic Regulatory Networks: Examples from Four Kingdoms
Balleza, Enrique; Alvarez-Buylla, Elena R.; Chaos, Alvaro; Kauffman, Stuart; Shmulevich, Ilya; Aldana, Maximino
2008-01-01
The coordinated expression of the different genes in an organism is essential to sustain functionality under the random external perturbations to which the organism might be subjected. To cope with such external variability, the global dynamics of the genetic network must possess two central properties. (a) It must be robust enough as to guarantee stability under a broad range of external conditions, and (b) it must be flexible enough to recognize and integrate specific external signals that may help the organism to change and adapt to different environments. This compromise between robustness and adaptability has been observed in dynamical systems operating at the brink of a phase transition between order and chaos. Such systems are termed critical. Thus, criticality, a precise, measurable, and well characterized property of dynamical systems, makes it possible for robustness and adaptability to coexist in living organisms. In this work we investigate the dynamical properties of the gene transcription networks reported for S. cerevisiae, E. coli, and B. subtilis, as well as the network of segment polarity genes of D. melanogaster, and the network of flower development of A. thaliana. We use hundreds of microarray experiments to infer the nature of the regulatory interactions among genes, and implement these data into the Boolean models of the genetic networks. Our results show that, to the best of the current experimental data available, the five networks under study indeed operate close to criticality. The generality of this result suggests that criticality at the genetic level might constitute a fundamental evolutionary mechanism that generates the great diversity of dynamically robust living forms that we observe around us. PMID:18560561
Clustering by soft-constraint affinity propagation: applications to gene-expression data.
Leone, Michele; Sumedha; Weigt, Martin
2007-10-15
Similarity-measure-based clustering is a crucial problem appearing throughout scientific data analysis. Recently, a powerful new algorithm called Affinity Propagation (AP) based on message-passing techniques was proposed by Frey and Dueck (2007a). In AP, each cluster is identified by a common exemplar all other data points of the same cluster refer to, and exemplars have to refer to themselves. Albeit its proved power, AP in its present form suffers from a number of drawbacks. The hard constraint of having exactly one exemplar per cluster restricts AP to classes of regularly shaped clusters, and leads to suboptimal performance, e.g. in analyzing gene expression data. This limitation can be overcome by relaxing the AP hard constraints. A new parameter controls the importance of the constraints compared to the aim of maximizing the overall similarity, and allows to interpolate between the simple case where each data point selects its closest neighbor as an exemplar and the original AP. The resulting soft-constraint affinity propagation (SCAP) becomes more informative, accurate and leads to more stable clustering. Even though a new a priori free parameter is introduced, the overall dependence of the algorithm on external tuning is reduced, as robustness is increased and an optimal strategy for parameter selection emerges more naturally. SCAP is tested on biological benchmark data, including in particular microarray data related to various cancer types. We show that the algorithm efficiently unveils the hierarchical cluster structure present in the data sets. Further on, it allows to extract sparse gene expression signatures for each cluster.
Arpón, A; Riezu-Boj, J I; Milagro, F I; Marti, A; Razquin, C; Martínez-González, M A; Corella, D; Estruch, R; Casas, R; Fitó, M; Ros, E; Salas-Salvadó, J; Martínez, J A
2016-08-01
Epigenetic processes, including DNA methylation, might be modulated by environmental factors such as the diet, which in turn have been associated with the onset of several diseases such as obesity or cardiovascular events. Meanwhile, Mediterranean diet (MedDiet) has demonstrated favourable effects on cardiovascular risk, blood pressure, inflammation and other complications related to excessive adiposity. Some of these effects could be mediated by epigenetic modifications. Therefore, the objective of this study was to investigate whether the adherence to MedDiet is associated with changes in the methylation status from peripheral blood cells. A subset of 36 individuals was selected within the Prevención con Dieta Mediterránea (PREDIMED)-Navarra study, a randomised, controlled, parallel trial with three groups of intervention in high cardiovascular risk volunteers, two with a MedDiet and one low-fat control group. Changes in methylation between baseline and 5 years were studied. DNA methylation arrays were analysed by several robust statistical tests and functional classifications. Eight genes related to inflammation and immunocompetence (EEF2, COL18A1, IL4I1, LEPR, PLAGL1, IFRD1, MAPKAPK2, PPARGC1B) were finally selected as changes in their methylation levels correlated with adherence to MedDiet and because they presented sensitivity related to a high variability in methylation changes. Additionally, EEF2 methylation levels positively correlated with concentrations of TNF-α and CRP. This report is apparently the first showing that adherence to MedDiet is associated with the methylation of the reported genes related to inflammation with a potential regulatory impact.
Autism-like behavioral phenotypes in BTBR T+tf/J mice.
McFarlane, H G; Kusek, G K; Yang, M; Phoenix, J L; Bolivar, V J; Crawley, J N
2008-03-01
Autism is a behaviorally defined neurodevelopmental disorder of unknown etiology. Mouse models with face validity to the core symptoms offer an experimental approach to test hypotheses about the causes of autism and translational tools to evaluate potential treatments. We discovered that the inbred mouse strain BTBR T+tf/J (BTBR) incorporates multiple behavioral phenotypes relevant to all three diagnostic symptoms of autism. BTBR displayed selectively reduced social approach, low reciprocal social interactions and impaired juvenile play, as compared with C57BL/6J (B6) controls. Impaired social transmission of food preference in BTBR suggests communication deficits. Repetitive behaviors appeared as high levels of self-grooming by juvenile and adult BTBR mice. Comprehensive analyses of procedural abilities confirmed that social recognition and olfactory abilities were normal in BTBR, with no evidence for high anxiety-like traits or motor impairments, supporting an interpretation of highly specific social deficits. Database comparisons between BTBR and B6 on 124 putative autism candidate genes showed several interesting single nucleotide polymorphisms (SNPs) in the BTBR genetic background, including a nonsynonymous coding region polymorphism in Kmo. The Kmo gene encodes kynurenine 3-hydroxylase, an enzyme-regulating metabolism of kynurenic acid, a glutamate antagonist with neuroprotective actions. Sequencing confirmed this coding SNP in Kmo, supporting further investigation into the contribution of this polymorphism to autism-like behavioral phenotypes. Robust and selective social deficits, repetitive self-grooming, genetic stability and commercial availability of the BTBR inbred strain encourage its use as a research tool to search for background genes relevant to the etiology of autism, and to explore therapeutics to treat the core symptoms.
Adapting legume crops to climate change using genomic approaches.
Mousavi-Derazmahalleh, Mahsa; Bayer, Philipp E; Hane, James K; Valliyodan, Babu; Nguyen, Henry T; Nelson, Matthew N; Erskine, William; Varshney, Rajeev K; Papa, Roberto; Edwards, David
2018-03-30
Our agricultural system and hence food security is threatened by combination of events, such as increasing population, the impacts of climate change, and the need to a more sustainable development. Evolutionary adaptation may help some species to overcome environmental changes through new selection pressures driven by climate change. However, success of evolutionary adaptation is dependent on various factors, one of which is the extent of genetic variation available within species. Genomic approaches provide an exceptional opportunity to identify genetic variation that can be employed in crop improvement programs. In this review, we illustrate some of the routinely used genomics-based methods as well as recent breakthroughs, which facilitate assessment of genetic variation and discovery of adaptive genes in legumes. Although additional information is needed, the current utility of selection tools indicate a robust ability to utilize existing variation among legumes to address the challenges of climate uncertainty. © 2018 The Authors. Plant, Cell & Environment Published by John Wiley & Sons Ltd.
CRISPR-UMI: single-cell lineage tracing of pooled CRISPR-Cas9 screens.
Michlits, Georg; Hubmann, Maria; Wu, Szu-Hsien; Vainorius, Gintautas; Budusan, Elena; Zhuk, Sergei; Burkard, Thomas R; Novatchkova, Maria; Aichinger, Martin; Lu, Yiqing; Reece-Hoyes, John; Nitsch, Roberto; Schramek, Daniel; Hoepfner, Dominic; Elling, Ulrich
2017-12-01
Pooled CRISPR screens are a powerful tool for assessments of gene function. However, conventional analysis is based exclusively on the relative abundance of integrated single guide RNAs (sgRNAs) between populations, which does not discern distinct phenotypes and editing outcomes generated by identical sgRNAs. Here we present CRISPR-UMI, a single-cell lineage-tracing methodology for pooled screening to account for cell heterogeneity. We generated complex sgRNA libraries with unique molecular identifiers (UMIs) that allowed for screening of clonally expanded, individually tagged cells. A proof-of-principle CRISPR-UMI negative-selection screen provided increased sensitivity and robustness compared with conventional analysis by accounting for underlying cellular and editing-outcome heterogeneity and detection of outlier clones. Furthermore, a CRISPR-UMI positive-selection screen uncovered new roadblocks in reprogramming mouse embryonic fibroblasts as pluripotent stem cells, distinguishing reprogramming frequency and speed (i.e., effect size and probability). CRISPR-UMI boosts the predictive power, sensitivity, and information content of pooled CRISPR screens.
Neutrality and Robustness in Evo-Devo: Emergence of Lateral Inhibition
Munteanu, Andreea; Solé, Ricard V.
2008-01-01
Embryonic development is defined by the hierarchical dynamical process that translates genetic information (genotype) into a spatial gene expression pattern (phenotype) providing the positional information for the correct unfolding of the organism. The nature and evolutionary implications of genotype–phenotype mapping still remain key topics in evolutionary developmental biology (evo-devo). We have explored here issues of neutrality, robustness, and diversity in evo-devo by means of a simple model of gene regulatory networks. The small size of the system allowed an exhaustive analysis of the entire fitness landscape and the extent of its neutrality. This analysis shows that evolution leads to a class of robust genetic networks with an expression pattern characteristic of lateral inhibition. This class is a repertoire of distinct implementations of this key developmental process, the diversity of which provides valuable clues about its underlying causal principles. PMID:19023404
Robust Transgene Expression from Bicistronic mRNA in the Green Alga Chlamydomonas reinhardtii
Onishi, Masayuki; Pringle, John R.
2016-01-01
The unicellular green alga Chlamydomonas reinhardtii is a model organism that provides an opportunity to understand the evolution and functional biology of the lineage that includes the land plants, as well as aspects of the fundamental core biology conserved throughout the eukaryotic phylogeny. Although many tools are available to facilitate genetic, molecular biological, biochemical, and cell biological studies in Chlamydomonas, expression of unselected transgenes of interest (GOIs) has been challenging. In most methods used previously, the GOI and a selectable marker are expressed from two separate mRNAs, so that their concomitant expression is not guaranteed. In this study, we developed constructs that allow expression of an upstream GOI and downstream selectable marker from a single bicistronic mRNA. Although this approach in other systems has typically required a translation-enhancing element such as an internal ribosome entry site for the downstream marker, we found that a short stretch of unstructured junction sequence was sufficient to obtain adequate expression of the downstream gene, presumably through post-termination reinitiation. With this system, we obtained robust expression of both endogenous and heterologous GOIs, including fluorescent proteins and tagged fusion proteins, in the vast majority of transformants, thus eliminating the need for tedious secondary screening for GOI-expressing transformants. This improved efficiency should greatly facilitate a variety of genetic and cell-biological studies in Chlamydomonas and also enable new applications such as expression-based screens and large-scale production of foreign proteins. PMID:27770025
Saxena, Maneesha S.; Bajaj, Deepak; Das, Shouvik; Kujur, Alice; Kumar, Vinod; Singh, Mohar; Bansal, Kailash C.; Tyagi, Akhilesh K.; Parida, Swarup K.
2014-01-01
The identification and fine mapping of robust quantitative trait loci (QTLs)/genes governing important agro-morphological traits in chickpea still lacks systematic efforts at a genome-wide scale involving wild Cicer accessions. In this context, an 834 simple sequence repeat and single-nucleotide polymorphism marker-based high-density genetic linkage map between cultivated and wild parental accessions (Cicer arietinum desi cv. ICC 4958 and Cicer reticulatum wild cv. ICC 17160) was constructed. This inter-specific genetic map comprising eight linkage groups spanned a map length of 949.4 cM with an average inter-marker distance of 1.14 cM. Eleven novel major genomic regions harbouring 15 robust QTLs (15.6–39.8% R2 at 4.2–15.7 logarithm of odds) associated with four agro-morphological traits (100-seed weight, pod and branch number/plant and plant hairiness) were identified and mapped on chickpea chromosomes. Most of these QTLs showed positive additive gene effects with effective allelic contribution from ICC 4958, particularly for increasing seed weight (SW) and pod and branch number. One robust SW-influencing major QTL region (qSW4.2) has been narrowed down by combining QTL mapping with high-resolution QTL region-specific association analysis, differential expression profiling and gene haplotype-based association/LD mapping. This enabled to delineate a strong SW-regulating ABI3VP1 transcription factor (TF) gene at trait-specific QTL interval and consequently identified favourable natural allelic variants and superior high seed weight-specific haplotypes in the upstream regulatory region of this gene showing increased transcript expression during seed development. The genes (TFs) harbouring diverse trait-regulating QTLs, once validated and fine-mapped by our developed rapid integrated genomic approach and through gene/QTL map-based cloning, can be utilized as potential candidates for marker-assisted genetic enhancement of chickpea. PMID:25335477
Lee, Won Jun; Kim, Sang Cheol; Lee, Seul Ji; Lee, Jeongmi; Park, Jeong Hill; Yu, Kyung-Sang; Lim, Johan; Kwon, Sung Won
2014-01-01
Based on the process of carcinogenesis, carcinogens are classified as either genotoxic or non-genotoxic. In contrast to non-genotoxic carcinogens, many genotoxic carcinogens have been reported to cause tumor in carcinogenic bioassays in animals. Thus evaluating the genotoxicity potential of chemicals is important to discriminate genotoxic from non-genotoxic carcinogens for health care and pharmaceutical industry safety. Additionally, investigating the difference between the mechanisms of genotoxic and non-genotoxic carcinogens could provide the foundation for a mechanism-based classification for unknown compounds. In this study, we investigated the gene expression of HepG2 cells treated with genotoxic or non-genotoxic carcinogens and compared their mechanisms of action. To enhance our understanding of the differences in the mechanisms of genotoxic and non-genotoxic carcinogens, we implemented a gene set analysis using 12 compounds for the training set (12, 24, 48 h) and validated significant gene sets using 22 compounds for the test set (24, 48 h). For a direct biological translation, we conducted a gene set analysis using Globaltest and selected significant gene sets. To validate the results, training and test compounds were predicted by the significant gene sets using a prediction analysis for microarrays (PAM). Finally, we obtained 6 gene sets, including sets enriched for genes involved in the adherens junction, bladder cancer, p53 signaling pathway, pathways in cancer, peroxisome and RNA degradation. Among the 6 gene sets, the bladder cancer and p53 signaling pathway sets were significant at 12, 24 and 48 h. We also found that the DDB2, RRM2B and GADD45A, genes related to the repair and damage prevention of DNA, were consistently up-regulated for genotoxic carcinogens. Our results suggest that a gene set analysis could provide a robust tool in the investigation of the different mechanisms of genotoxic and non-genotoxic carcinogens and construct a more detailed understanding of the perturbation of significant pathways.
Lee, Won Jun; Kim, Sang Cheol; Lee, Seul Ji; Lee, Jeongmi; Park, Jeong Hill; Yu, Kyung-Sang; Lim, Johan; Kwon, Sung Won
2014-01-01
Based on the process of carcinogenesis, carcinogens are classified as either genotoxic or non-genotoxic. In contrast to non-genotoxic carcinogens, many genotoxic carcinogens have been reported to cause tumor in carcinogenic bioassays in animals. Thus evaluating the genotoxicity potential of chemicals is important to discriminate genotoxic from non-genotoxic carcinogens for health care and pharmaceutical industry safety. Additionally, investigating the difference between the mechanisms of genotoxic and non-genotoxic carcinogens could provide the foundation for a mechanism-based classification for unknown compounds. In this study, we investigated the gene expression of HepG2 cells treated with genotoxic or non-genotoxic carcinogens and compared their mechanisms of action. To enhance our understanding of the differences in the mechanisms of genotoxic and non-genotoxic carcinogens, we implemented a gene set analysis using 12 compounds for the training set (12, 24, 48 h) and validated significant gene sets using 22 compounds for the test set (24, 48 h). For a direct biological translation, we conducted a gene set analysis using Globaltest and selected significant gene sets. To validate the results, training and test compounds were predicted by the significant gene sets using a prediction analysis for microarrays (PAM). Finally, we obtained 6 gene sets, including sets enriched for genes involved in the adherens junction, bladder cancer, p53 signaling pathway, pathways in cancer, peroxisome and RNA degradation. Among the 6 gene sets, the bladder cancer and p53 signaling pathway sets were significant at 12, 24 and 48 h. We also found that the DDB2, RRM2B and GADD45A, genes related to the repair and damage prevention of DNA, were consistently up-regulated for genotoxic carcinogens. Our results suggest that a gene set analysis could provide a robust tool in the investigation of the different mechanisms of genotoxic and non-genotoxic carcinogens and construct a more detailed understanding of the perturbation of significant pathways. PMID:24497971
Huang, X N; Ren, H P
2016-05-13
Robust adaptation is a critical ability of gene regulatory network (GRN) to survive in a fluctuating environment, which represents the system responding to an input stimulus rapidly and then returning to its pre-stimulus steady state timely. In this paper, the GRN is modeled using the Michaelis-Menten rate equations, which are highly nonlinear differential equations containing 12 undetermined parameters. The robust adaption is quantitatively described by two conflicting indices. To identify the parameter sets in order to confer the GRNs with robust adaptation is a multi-variable, multi-objective, and multi-peak optimization problem, which is difficult to acquire satisfactory solutions especially high-quality solutions. A new best-neighbor particle swarm optimization algorithm is proposed to implement this task. The proposed algorithm employs a Latin hypercube sampling method to generate the initial population. The particle crossover operation and elitist preservation strategy are also used in the proposed algorithm. The simulation results revealed that the proposed algorithm could identify multiple solutions in one time running. Moreover, it demonstrated a superior performance as compared to the previous methods in the sense of detecting more high-quality solutions within an acceptable time. The proposed methodology, owing to its universality and simplicity, is useful for providing the guidance to design GRN with superior robust adaptation.
Criticality Is an Emergent Property of Genetic Networks that Exhibit Evolvability
Torres-Sosa, Christian; Huang, Sui; Aldana, Maximino
2012-01-01
Accumulating experimental evidence suggests that the gene regulatory networks of living organisms operate in the critical phase, namely, at the transition between ordered and chaotic dynamics. Such critical dynamics of the network permits the coexistence of robustness and flexibility which are necessary to ensure homeostatic stability (of a given phenotype) while allowing for switching between multiple phenotypes (network states) as occurs in development and in response to environmental change. However, the mechanisms through which genetic networks evolve such critical behavior have remained elusive. Here we present an evolutionary model in which criticality naturally emerges from the need to balance between the two essential components of evolvability: phenotype conservation and phenotype innovation under mutations. We simulated the Darwinian evolution of random Boolean networks that mutate gene regulatory interactions and grow by gene duplication. The mutating networks were subjected to selection for networks that both (i) preserve all the already acquired phenotypes (dynamical attractor states) and (ii) generate new ones. Our results show that this interplay between extending the phenotypic landscape (innovation) while conserving the existing phenotypes (conservation) suffices to cause the evolution of all the networks in a population towards criticality. Furthermore, the networks produced by this evolutionary process exhibit structures with hubs (global regulators) similar to the observed topology of real gene regulatory networks. Thus, dynamical criticality and certain elementary topological properties of gene regulatory networks can emerge as a byproduct of the evolvability of the phenotypic landscape. PMID:22969419
A Cancer Gene Selection Algorithm Based on the K-S Test and CFS.
Su, Qiang; Wang, Yina; Jiang, Xiaobing; Chen, Fuxue; Lu, Wen-Cong
2017-01-01
To address the challenging problem of selecting distinguished genes from cancer gene expression datasets, this paper presents a gene subset selection algorithm based on the Kolmogorov-Smirnov (K-S) test and correlation-based feature selection (CFS) principles. The algorithm selects distinguished genes first using the K-S test, and then, it uses CFS to select genes from those selected by the K-S test. We adopted support vector machines (SVM) as the classification tool and used the criteria of accuracy to evaluate the performance of the classifiers on the selected gene subsets. This approach compared the proposed gene subset selection algorithm with the K-S test, CFS, minimum-redundancy maximum-relevancy (mRMR), and ReliefF algorithms. The average experimental results of the aforementioned gene selection algorithms for 5 gene expression datasets demonstrate that, based on accuracy, the performance of the new K-S and CFS-based algorithm is better than those of the K-S test, CFS, mRMR, and ReliefF algorithms. The experimental results show that the K-S test-CFS gene selection algorithm is a very effective and promising approach compared to the K-S test, CFS, mRMR, and ReliefF algorithms.
Fromm, Bastian; Billipp, Tyler; Peck, Liam E.; Johansen, Morten; Tarver, James E.; King, Benjamin L.; Newcomb, James M.; Sempere, Lorenzo F.; Flatmark, Kjersti; Hovig, Eivind; Peterson, Kevin J.
2016-01-01
Although microRNAs (miRNAs) are among the most intensively studied molecules of the past 20 years, determining what is and what is not a miRNA has not been straightforward. Here, we present a uniform system for the annotation and nomenclature of miRNA genes. We show that fewer than a third of the 1,881 human miRBase entries, and only approximately 16% of the 7,095 metazoan miRBase entries, are robustly supported as miRNA genes. Furthermore, we show that the human repertoire of miRNAs has been shaped by periods of intense miRNA innovation, and that mature gene products show a very different tempo and mode of sequence evolution than star products. We establish a new open access database -- MirGeneDB (http://mirgenedb.org) -- to catalog this set of robustly supported miRNAs, which complements the efforts of miRBase, but differs from it by annotating the mature versus star products, and by imposing an evolutionary hierarchy upon this curated and consistently named repertoire. PMID:26473382
Dunn, Corey G.; Angermeier, Paul
2016-01-01
Understanding relationships between habitat associations for individuals and habitat factors that limit populations is a primary challenge for managers of stream fishes. Although habitat use by individuals can provide insight into the adaptive significance of selected microhabitats, not all habitat parameters will be significant at the population level, particularly when distributional patterns partially result from habitat degradation. We used underwater observation to quantify microhabitat selection by an imperiled stream fish, the Candy Darter Etheostoma osburni, in two streams with robust populations. We developed multiple-variable and multiple-life-stage habitat suitability indices (HSIs) from microhabitat selection patterns and used them to assess the suitability of available habitat in streams where Candy Darter populations were extirpated, localized, or robust. Next, we used a comparative framework to examine relationships among (1) habitat availability across streams, (2) projected habitat suitability of each stream, and (3) a rank for the likely long-term viability (robustness) of the population inhabiting each stream. Habitat selection was characterized by ontogenetic shifts from the low-velocity, slightly embedded areas used by age-0 Candy Darters to the swift, shallow areas with little fine sediment and complex substrate, which were used by adults. Overall, HSIs were strongly correlated with population rank. However, we observed weak or inverse relationships between predicted individual habitat suitability and population robustness for multiple life stages and variables. The results demonstrated that microhabitat selection by individuals does not always reflect population robustness, particularly when based on a single life stage or season, which highlights the risk of generalizing habitat selection that is observed during nonstressful periods or for noncritical resources. These findings suggest that stream fish managers may need to be cautious when implementing conservation measures based solely on observations of habitat selection by individuals and that detailed study at the individual and population levels may be necessary to identify habitat that limits populations.
Genome-wide identification of Saccharomyces cerevisiae genes required for tolerance to acetic acid.
Mira, Nuno P; Palma, Margarida; Guerreiro, Joana F; Sá-Correia, Isabel
2010-10-25
Acetic acid is a byproduct of Saccharomyces cerevisiae alcoholic fermentation. Together with high concentrations of ethanol and other toxic metabolites, acetic acid may contribute to fermentation arrest and reduced ethanol productivity. This weak acid is also a present in lignocellulosic hydrolysates, a highly interesting non-feedstock substrate in industrial biotechnology. Therefore, the better understanding of the molecular mechanisms underlying S. cerevisiae tolerance to acetic acid is essential for the rational selection of optimal fermentation conditions and the engineering of more robust industrial strains to be used in processes in which yeast is explored as cell factory. The yeast genes conferring protection against acetic acid were identified in this study at a genome-wide scale, based on the screening of the EUROSCARF haploid mutant collection for susceptibility phenotypes to this weak acid (concentrations in the range 70-110 mM, at pH 4.5). Approximately 650 determinants of tolerance to acetic acid were identified. Clustering of these acetic acid-resistance genes based on their biological function indicated an enrichment of genes involved in transcription, internal pH homeostasis, carbohydrate metabolism, cell wall assembly, biogenesis of mitochondria, ribosome and vacuole, and in the sensing, signalling and uptake of various nutrients in particular iron, potassium, glucose and amino acids. A correlation between increased resistance to acetic acid and the level of potassium in the growth medium was found. The activation of the Snf1p signalling pathway, involved in yeast response to glucose starvation, is demonstrated to occur in response to acetic acid stress but no evidence was obtained supporting the acetic acid-induced inhibition of glucose uptake. Approximately 490 of the 650 determinants of tolerance to acetic acid identified in this work are implicated, for the first time, in tolerance to this weak acid. These are novel candidate genes for genetic engineering to obtain more robust yeast strains against acetic acid toxicity. Among these genes there are number of transcription factors that are documented regulators of a large percentage of the genes found to exert protection against acetic acid thus being considered interesting targets for subsequent genetic engineering. The increase of potassium concentration in the growth medium was found to improve the expression of maximal tolerance to acetic acid, consistent with the idea that the adequate manipulation of nutrient concentration of industrial growth medium can be an interesting strategy to surpass the deleterious effects of this weak acid in yeast cells.
Dual CRISPR-Cas9 Cleavage Mediated Gene Excision and Targeted Integration in Yarrowia lipolytica.
Gao, Difeng; Smith, Spencer; Spagnuolo, Michael; Rodriguez, Gabriel; Blenner, Mark
2018-05-29
CRISPR-Cas9 technology has been successfully applied in Yarrowia lipolytica for targeted genomic editing including gene disruption and integration; however, disruptions by existing methods typically result from small frameshift mutations caused by indels within the coding region, which usually resulted in unnatural protein. In this study, a dual cleavage strategy directed by paired sgRNAs is developed for gene knockout. This method allows fast and robust gene excision, demonstrated on six genes of interest. The targeted regions for excision vary in length from 0.3 kb up to 3.5 kb and contain both non-coding and coding regions. The majority of the gene excisions are repaired by perfect nonhomologous end-joining without indel. Based on this dual cleavage system, two targeted markerless integration methods are developed by providing repair templates. While both strategies are effective, homology mediated end joining (HMEJ) based method are twice as efficient as homology recombination (HR) based method. In both cases, dual cleavage leads to similar or improved gene integration efficiencies compared to gene excision without integration. This dual cleavage strategy will be useful for not only generating more predictable and robust gene knockout, but also for efficient targeted markerless integration, and simultaneous knockout and integration in Y. lipolytica. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Curved Microneedle Array-Based sEMG Electrode for Robust Long-Term Measurements and High Selectivity
Kim, Minjae; Kim, Taewan; Kim, Dong Sung; Chung, Wan Kyun
2015-01-01
Surface electromyography is widely used in many fields to infer human intention. However, conventional electrodes are not appropriate for long-term measurements and are easily influenced by the environment, so the range of applications of sEMG is limited. In this paper, we propose a flexible band-integrated, curved microneedle array electrode for robust long-term measurements, high selectivity, and easy applicability. Signal quality, in terms of long-term usability and sensitivity to perspiration, was investigated. Its motion-discriminating performance was also evaluated. The results show that the proposed electrode is robust to perspiration and can maintain a high-quality measuring ability for over 8 h. The proposed electrode also has high selectivity for motion compared with a commercial wet electrode and dry electrode. PMID:26153773
Liu, Jin; Shao, Luyao; Trang, Phong; Yang, Zhu; Reeves, Michael; Sun, Xu; Vu, Gia-Phong; Wang, Yu; Li, Hongjian; Zheng, Congyi; Lu, Sangwei; Liu, Fenyong
2016-06-09
An external guide sequence (EGS) is a RNA sequence which can interact with a target mRNA to form a tertiary structure like a pre-tRNA and recruit intracellular ribonuclease P (RNase P), a tRNA processing enzyme, to degrade target mRNA. Previously, an in vitro selection procedure has been used by us to engineer new EGSs that are more robust in inducing human RNase P to cleave their targeted mRNAs. In this study, we constructed EGSs from a variant to target the mRNA encoding herpes simplex virus 1 (HSV-1) major transcription regulator ICP4, which is essential for the expression of viral early and late genes and viral growth. The EGS variant induced human RNase P cleavage of ICP4 mRNA sequence 60 times better than the EGS generated from a natural pre-tRNA. A decrease of about 97% and 75% in the level of ICP4 gene expression and an inhibition of about 7,000- and 500-fold in viral growth were observed in HSV infected cells expressing the variant and the pre-tRNA-derived EGS, respectively. This study shows that engineered EGSs can inhibit HSV-1 gene expression and viral growth. Furthermore, these results demonstrate the potential for engineered EGS RNAs to be developed and used as anti-HSV therapeutics.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sheppod, Timothy; Satterfield, Brent; Hukari, Kyle W.
2006-10-01
The advancement of DNA cloning has significantly augmented the potential threat of a focused bioweapon assault, such as a terrorist attack. With current DNA cloning techniques, toxin genes from the most dangerous (but environmentally labile) bacterial or viral organism can now be selected and inserted into robust organism to produce an infinite number of deadly chimeric bioweapons. In order to neutralize such a threat, accurate detection of the expressed toxin genes, rather than classification on strain or genealogical decent of these organisms, is critical. The development of a high-throughput microarray approach will enable the detection of unknowns chimeric bioweapons. Themore » development of a high-throughput microarray approach will enable the detection of unknown bioweapons. We have developed a unique microfluidic approach to capture and concentrate these threat genes (mRNA's) upto a 30 fold concentration. These captured oligonucleotides can then be used to synthesize in situ oligonucleotide copies (cDNA probes) of the captured genes. An integrated microfluidic architecture will enable us to control flows of reagents, perform clean-up steps and finally elute nanoliter volumes of synthesized oligonucleotides probes. The integrated approach has enabled a process where chimeric or conventional bioweapons can rapidly be identified based on their toxic function, rather than being restricted to information that may not identify the critical nature of the threat.« less
Liu, Jin; Shao, Luyao; Trang, Phong; Yang, Zhu; Reeves, Michael; Sun, Xu; Vu, Gia-Phong; Wang, Yu; Li, Hongjian; Zheng, Congyi; Lu, Sangwei; Liu, Fenyong
2016-01-01
An external guide sequence (EGS) is a RNA sequence which can interact with a target mRNA to form a tertiary structure like a pre-tRNA and recruit intracellular ribonuclease P (RNase P), a tRNA processing enzyme, to degrade target mRNA. Previously, an in vitro selection procedure has been used by us to engineer new EGSs that are more robust in inducing human RNase P to cleave their targeted mRNAs. In this study, we constructed EGSs from a variant to target the mRNA encoding herpes simplex virus 1 (HSV-1) major transcription regulator ICP4, which is essential for the expression of viral early and late genes and viral growth. The EGS variant induced human RNase P cleavage of ICP4 mRNA sequence 60 times better than the EGS generated from a natural pre-tRNA. A decrease of about 97% and 75% in the level of ICP4 gene expression and an inhibition of about 7,000- and 500-fold in viral growth were observed in HSV infected cells expressing the variant and the pre-tRNA-derived EGS, respectively. This study shows that engineered EGSs can inhibit HSV-1 gene expression and viral growth. Furthermore, these results demonstrate the potential for engineered EGS RNAs to be developed and used as anti-HSV therapeutics. PMID:27279482
Efficiency of nuclear and mitochondrial markers recovering and supporting known amniote groups.
Lambret-Frotté, Julia; Perini, Fernando Araújo; de Moraes Russo, Claudia Augusta
2012-01-01
We have analysed the efficiency of all mitochondrial protein coding genes and six nuclear markers (Adora3, Adrb2, Bdnf, Irbp, Rag2 and Vwf) in reconstructing and statistically supporting known amniote groups (murines, rodents, primates, eutherians, metatherians, therians). The efficiencies of maximum likelihood, Bayesian inference, maximum parsimony, neighbor-joining and UPGMA were also evaluated, by assessing the number of correct and incorrect recovered groupings. In addition, we have compared support values using the conservative bootstrap test and the Bayesian posterior probabilities. First, no correlation was observed between gene size and marker efficiency in recovering or supporting correct nodes. As expected, tree-building methods performed similarly, even UPGMA that, in some cases, outperformed other most extensively used methods. Bayesian posterior probabilities tend to show much higher support values than the conservative bootstrap test, for correct and incorrect nodes. Our results also suggest that nuclear markers do not necessarily show a better performance than mitochondrial genes. The so-called dependency among mitochondrial markers was not observed comparing genome performances. Finally, the amniote groups with lowest recovery rates were therians and rodents, despite the morphological support for their monophyletic status. We suggest that, regardless of the tree-building method, a few carefully selected genes are able to unfold a detailed and robust scenario of phylogenetic hypotheses, particularly if taxon sampling is increased.
Brain selective transgene expression in zebrafish using an NRSE derived motif
Bergeron, Sadie A.; Hannan, Markus C.; Codore, Hiba; Fero, Kandice; Li, Grace H.; Moak, Zachary; Yokogawa, Tohei; Burgess, Harold A.
2012-01-01
Transgenic technologies enable the manipulation and observation of circuits controlling behavior by permitting expression of genetically encoded reporter genes in neurons. Frequently though, neuronal expression is accompanied by transgene expression in non-neuronal tissues, which may preclude key experimental manipulations, including assessment of the contribution of neurons to behavior by ablation. To better restrict transgene expression to the nervous system in zebrafish larvae, we have used DNA sequences derived from the neuron-restrictive silencing element (NRSE). We find that one such sequence, REx2, when used in conjunction with several basal promoters, robustly suppresses transgene expression in non-neuronal tissues. Both in transient transgenic experiments and in stable enhancer trap lines, suppression is achieved without compromising expression within the nervous system. Furthermore, in REx2 enhancer trap lines non-neuronal expression can be de-repressed by knocking down expression of the NRSE binding protein RE1-silencing transcription factor (Rest). In one line, we show that the resulting pattern of reporter gene expression coincides with that of the adjacent endogenous gene, hapln3. We demonstrate that three common basal promoters are susceptible to the effects of the REx2 element, suggesting that this method may be useful for confining expression from many other promoters to the nervous system. This technique enables neural specific targeting of reporter genes and thus will facilitate the use of transgenic methods to manipulate circuit function in freely behaving larvae. PMID:23293587
A comparative analysis of biclustering algorithms for gene expression data
Eren, Kemal; Deveci, Mehmet; Küçüktunç, Onur; Çatalyürek, Ümit V.
2013-01-01
The need to analyze high-dimension biological data is driving the development of new data mining methods. Biclustering algorithms have been successfully applied to gene expression data to discover local patterns, in which a subset of genes exhibit similar expression levels over a subset of conditions. However, it is not clear which algorithms are best suited for this task. Many algorithms have been published in the past decade, most of which have been compared only to a small number of algorithms. Surveys and comparisons exist in the literature, but because of the large number and variety of biclustering algorithms, they are quickly outdated. In this article we partially address this problem of evaluating the strengths and weaknesses of existing biclustering methods. We used the BiBench package to compare 12 algorithms, many of which were recently published or have not been extensively studied. The algorithms were tested on a suite of synthetic data sets to measure their performance on data with varying conditions, such as different bicluster models, varying noise, varying numbers of biclusters and overlapping biclusters. The algorithms were also tested on eight large gene expression data sets obtained from the Gene Expression Omnibus. Gene Ontology enrichment analysis was performed on the resulting biclusters, and the best enrichment terms are reported. Our analyses show that the biclustering method and its parameters should be selected based on the desired model, whether that model allows overlapping biclusters, and its robustness to noise. In addition, we observe that the biclustering algorithms capable of finding more than one model are more successful at capturing biologically relevant clusters. PMID:22772837
Myoblast-mediated gene transfer for therapeutic angiogenesis and arteriogenesis.
von Degenfeld, Georges; Banfi, Andrea; Springer, Matthew L; Blau, Helen M
2003-10-01
Therapeutic angiogenesis aims at generating new blood vessels by delivering growth factors such as VEGF and FGF. Clinical trials are underway in patients with peripheral vascular and coronary heart disease. However, increasing evidence indicates that the new vasculature needs to be stabilized to avoid deleterious effects such as edema and hemangioma formation. Moreover, a major challenge is to induce new vessels that persist following cessation of the angiogenic stimulus. Mature vessels may be generated by modulating timing and dosage of growth factor expression, or by combination of 'growth' factors with 'maturation' factors like PDGF-BB, angiopoietin-1 or TGF-beta. Myoblast-mediated gene transfer has unique characteristics that make it a useful tool for studying promising novel approaches to therapeutic angiogenesis. It affords robust and long-lasting expression, and can be considered as a relatively rapid form of 'adult transgenesis' in muscle. The combined insertion of different gene constructs into single myoblasts and their progeny allows the simultaneous expression of different 'growth' and 'maturation' factors within the same cell in vivo. The additional insertion of a reporter gene makes it possible to analyze the phenotype of the vessels surrounding the transgenic muscle fibers into which the myoblasts have fused. The effects of timing and duration of gene expression can be studied by using tetracycline-inducible constructs, and dosage effects by selecting subpopulations consistently expressing distinct levels of growth factors. Finally, the autologous cell-based approach using transduced myoblasts could be an alternative gene delivery system for therapeutic angiogenesis in patients, avoiding the toxicities seen with some viral vectors.
mPeriod2 Brdm1 and other single Period mutant mice have normal food anticipatory activity.
Pendergast, Julie S; Wendroth, Robert H; Stenner, Rio C; Keil, Charles D; Yamazaki, Shin
2017-11-14
Animals anticipate the timing of food availability via the food-entrainable oscillator (FEO). The anatomical location and timekeeping mechanism of the FEO are unknown. Several studies showed the circadian gene, Period 2, is critical for FEO timekeeping. However, other studies concluded that canonical circadian genes are not essential for FEO timekeeping. In this study, we re-examined the effects of the Per2 Brdm1 mutation on food entrainment using methods that have revealed robust food anticipatory activity in other mutant lines. We examined food anticipatory activity, which is the output of the FEO, in single Period mutant mice. Single Per1, Per2, and Per3 mutant mice had robust food anticipatory activity during restricted feeding. In addition, we found that two different lines of Per2 mutant mice (ldc and Brdm1) anticipated restricted food availability. To determine if FEO timekeeping persisted in the absence of the food cue, we assessed activity during fasting. Food anticipatory (wheel-running) activity in all Period mutant mice was also robust during food deprivation. Together, our studies demonstrate that the Period genes are not necessary for the expression of food anticipatory activity.
Kitano, Hiroaki
2004-11-01
Robustness is a ubiquitously observed property of biological systems. It is considered to be a fundamental feature of complex evolvable systems. It is attained by several underlying principles that are universal to both biological organisms and sophisticated engineering systems. Robustness facilitates evolvability and robust traits are often selected by evolution. Such a mutually beneficial process is made possible by specific architectural features observed in robust systems. But there are trade-offs between robustness, fragility, performance and resource demands, which explain system behaviour, including the patterns of failure. Insights into inherent properties of robust systems will provide us with a better understanding of complex diseases and a guiding principle for therapy design.
Hacking DNA copy number for circuit engineering.
Wu, Feilun; You, Lingchong
2017-07-27
DNA copy number represents an essential parameter in the dynamics of synthetic gene circuits but typically is not explicitly considered. A new study demonstrates how dynamic control of DNA copy number can serve as an effective strategy to program robust oscillations in gene expression circuits.
Ozerov, Ivan V; Lezhnina, Ksenia V; Izumchenko, Evgeny; Artemov, Artem V; Medintsev, Sergey; Vanhaelen, Quentin; Aliper, Alexander; Vijg, Jan; Osipov, Andreyan N; Labat, Ivan; West, Michael D; Buzdin, Anton; Cantor, Charles R; Nikolsky, Yuri; Borisov, Nikolay; Irincheeva, Irina; Khokhlovich, Edward; Sidransky, David; Camargo, Miguel Luiz; Zhavoronkov, Alex
2016-11-16
Signalling pathway activation analysis is a powerful approach for extracting biologically relevant features from large-scale transcriptomic and proteomic data. However, modern pathway-based methods often fail to provide stable pathway signatures of a specific phenotype or reliable disease biomarkers. In the present study, we introduce the in silico Pathway Activation Network Decomposition Analysis (iPANDA) as a scalable robust method for biomarker identification using gene expression data. The iPANDA method combines precalculated gene coexpression data with gene importance factors based on the degree of differential gene expression and pathway topology decomposition for obtaining pathway activation scores. Using Microarray Analysis Quality Control (MAQC) data sets and pretreatment data on Taxol-based neoadjuvant breast cancer therapy from multiple sources, we demonstrate that iPANDA provides significant noise reduction in transcriptomic data and identifies highly robust sets of biologically relevant pathway signatures. We successfully apply iPANDA for stratifying breast cancer patients according to their sensitivity to neoadjuvant therapy.
Ozerov, Ivan V.; Lezhnina, Ksenia V.; Izumchenko, Evgeny; Artemov, Artem V.; Medintsev, Sergey; Vanhaelen, Quentin; Aliper, Alexander; Vijg, Jan; Osipov, Andreyan N.; Labat, Ivan; West, Michael D.; Buzdin, Anton; Cantor, Charles R.; Nikolsky, Yuri; Borisov, Nikolay; Irincheeva, Irina; Khokhlovich, Edward; Sidransky, David; Camargo, Miguel Luiz; Zhavoronkov, Alex
2016-01-01
Signalling pathway activation analysis is a powerful approach for extracting biologically relevant features from large-scale transcriptomic and proteomic data. However, modern pathway-based methods often fail to provide stable pathway signatures of a specific phenotype or reliable disease biomarkers. In the present study, we introduce the in silico Pathway Activation Network Decomposition Analysis (iPANDA) as a scalable robust method for biomarker identification using gene expression data. The iPANDA method combines precalculated gene coexpression data with gene importance factors based on the degree of differential gene expression and pathway topology decomposition for obtaining pathway activation scores. Using Microarray Analysis Quality Control (MAQC) data sets and pretreatment data on Taxol-based neoadjuvant breast cancer therapy from multiple sources, we demonstrate that iPANDA provides significant noise reduction in transcriptomic data and identifies highly robust sets of biologically relevant pathway signatures. We successfully apply iPANDA for stratifying breast cancer patients according to their sensitivity to neoadjuvant therapy. PMID:27848968
Selection of Phototransduction Genes in Homo sapiens.
Christopher, Mark; Scheetz, Todd E; Mullins, Robert F; Abràmoff, Michael D
2013-08-13
We investigated the evidence of recent positive selection in the human phototransduction system at single nucleotide polymorphism (SNP) and gene level. SNP genotyping data from the International HapMap Project for European, Eastern Asian, and African populations was used to discover differences in haplotype length and allele frequency between these populations. Numeric selection metrics were computed for each SNP and aggregated into gene-level metrics to measure evidence of recent positive selection. The level of recent positive selection in phototransduction genes was evaluated and compared to a set of genes shown previously to be under recent selection, and a set of highly conserved genes as positive and negative controls, respectively. Six of 20 phototransduction genes evaluated had gene-level selection metrics above the 90th percentile: RGS9, GNB1, RHO, PDE6G, GNAT1, and SLC24A1. The selection signal across these genes was found to be of similar magnitude to the positive control genes and much greater than the negative control genes. There is evidence for selective pressure in the genes involved in retinal phototransduction, and traces of this selective pressure can be demonstrated using SNP-level and gene-level metrics of allelic variation. We hypothesize that the selective pressure on these genes was related to their role in low light vision and retinal adaptation to ambient light changes. Uncovering the underlying genetics of evolutionary adaptations in phototransduction not only allows greater understanding of vision and visual diseases, but also the development of patient-specific diagnostic and intervention strategies.
Hough, Denise; Swart, Pieter; Cloete, Schalk
2013-01-01
Simple Summary Breeding sheep that are robust and easily managed may be beneficial for both animal welfare and production. Sheep that are more readily able to adapt to stressful situations and a wide variety of environmental conditions are likely to have more resources available for a higher expression of their production potential. This review explores the utilization of one of the stress response pathways, namely the hypothalamic-pituitary-adrenal axis, to locate potential sites where genetic markers might be identified that contribute to sheep robustness. A South African Merino breeding programme is used to demonstrate the potential benefits of this approach. Abstract It is a difficult task to improve animal production by means of genetic selection, if the environment does not allow full expression of the animal’s genetic potential. This concept may well be the future for animal welfare, because it highlights the need to incorporate traits related to production and robustness, simultaneously, to reach sustainable breeding goals. This review explores the identification of potential genetic markers for robustness within the hypothalamic-pituitary-adrenal axis (HPAA), since this axis plays a vital role in the stress response. If genetic selection for superior HPAA responses to stress is possible, then it ought to be possible to breed robust and easily managed genotypes that might be able to adapt to a wide range of environmental conditions whilst expressing a high production potential. This approach is explored in this review by means of lessons learnt from research on Merino sheep, which were divergently selected for their multiple rearing ability. These two selection lines have shown marked differences in reproduction, production and welfare, which makes this breeding programme ideal to investigate potential genetic markers of robustness. The HPAA function is explored in detail to elucidate where such genetic markers are likely to be found. PMID:26487412
Silver, Matt; Chen, Peng; Li, Ruoying; Cheng, Ching-Yu; Wong, Tien-Yin; Tai, E-Shyong; Teo, Yik-Ying; Montana, Giovanni
2013-01-01
Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune function. PMID:24278029
Silver, Matt; Chen, Peng; Li, Ruoying; Cheng, Ching-Yu; Wong, Tien-Yin; Tai, E-Shyong; Teo, Yik-Ying; Montana, Giovanni
2013-11-01
Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune function.
Zhan, Xue-yan; Zhao, Na; Lin, Zhao-zhou; Wu, Zhi-sheng; Yuan, Rui-juan; Qiao, Yan-jiang
2014-12-01
The appropriate algorithm for calibration set selection was one of the key technologies for a good NIR quantitative model. There are different algorithms for calibration set selection, such as Random Sampling (RS) algorithm, Conventional Selection (CS) algorithm, Kennard-Stone(KS) algorithm and Sample set Portioning based on joint x-y distance (SPXY) algorithm, et al. However, there lack systematic comparisons between two algorithms of the above algorithms. The NIR quantitative models to determine the asiaticoside content in Centella total glucosides were established in the present paper, of which 7 indexes were classified and selected, and the effects of CS algorithm, KS algorithm and SPXY algorithm for calibration set selection on the accuracy and robustness of NIR quantitative models were investigated. The accuracy indexes of NIR quantitative models with calibration set selected by SPXY algorithm were significantly different from that with calibration set selected by CS algorithm or KS algorithm, while the robustness indexes, such as RMSECV and |RMSEP-RMSEC|, were not significantly different. Therefore, SPXY algorithm for calibration set selection could improve the predicative accuracy of NIR quantitative models to determine asiaticoside content in Centella total glucosides, and have no significant effect on the robustness of the models, which provides a reference to determine the appropriate algorithm for calibration set selection when NIR quantitative models are established for the solid system of traditional Chinese medcine.
2013-01-01
Background Genomic selection is an appealing method to select purebreds for crossbred performance. In the case of crossbred records, single nucleotide polymorphism (SNP) effects can be estimated using an additive model or a breed-specific allele model. In most studies, additive gene action is assumed. However, dominance is the likely genetic basis of heterosis. Advantages of incorporating dominance in genomic selection were investigated in a two-way crossbreeding program for a trait with different magnitudes of dominance. Training was carried out only once in the simulation. Results When the dominance variance and heterosis were large and overdominance was present, a dominance model including both additive and dominance SNP effects gave substantially greater cumulative response to selection than the additive model. Extra response was the result of an increase in heterosis but at a cost of reduced purebred performance. When the dominance variance and heterosis were realistic but with overdominance, the advantage of the dominance model decreased but was still significant. When overdominance was absent, the dominance model was slightly favored over the additive model, but the difference in response between the models increased as the number of quantitative trait loci increased. This reveals the importance of exploiting dominance even in the absence of overdominance. When there was no dominance, response to selection for the dominance model was as high as for the additive model, indicating robustness of the dominance model. The breed-specific allele model was inferior to the dominance model in all cases and to the additive model except when the dominance variance and heterosis were large and with overdominance. However, the advantage of the dominance model over the breed-specific allele model may decrease as differences in linkage disequilibrium between the breeds increase. Retraining is expected to reduce the advantage of the dominance model over the alternatives, because in general, the advantage becomes important only after five or six generations post-training. Conclusion Under dominance and without retraining, genomic selection based on the dominance model is superior to the additive model and the breed-specific allele model to maximize crossbred performance through purebred selection. PMID:23621868
Pierson, Lisa; Mousley, Angela; Devine, Lynda; Marks, Nikki J; Day, Tim A; Maule, Aaron G
2010-04-01
Evolving RNA interference (RNAi) platforms are providing opportunities to probe gene function in parasitic helminths using reverse genetics. Although relatively robust methods for the application of RNAi in parasitic flatworms have been established, reports of successful RNAi are confined to three genera and there are no known reports of the application of RNAi to the class Cestoda. Here we report the successful application of RNAi to a cestode. Our target species was the common ruminant tapeworm, Moniezia expansa which can significantly impact the health/productivity of cattle, sheep and goats. Initial efforts aimed to silence the neuronally expressed neuropeptide F gene (Me-npf-1), which encodes one of the most abundant neuropeptides in flatworms and a homologue of vertebrate neuropeptide Y (NPY). Double stranded (ds)RNAs, delivered by electroporation and soaking (4-8h), failed to trigger consistent Me-npf-1 transcript knock-down in adult worms; small interfering RNAs (siRNAs) were also ineffective. Identical approaches resulted in significant and consistent transcript knock-down of actin transcript (71+/-4%) following soaking in Me-act-1 dsRNA. Similar successes were seen with hydrophobic lipid-binding protein (Me-lbp-1), with a dsRNA inducing significant target transcript reduction (72+/-5%). To confirm the validity of the observed transcript knock-downs we further investigated Me-act-1 RNAi worms for associated changes in protein levels, morphology and phenotype. Me-act-1 RNAi worms displayed significant reductions in both filamentous actin immunostaining (62+/-3%) and the amount of actin detected in Western blots (54+/-13%). Morphologically, Me-act-1 RNAi worms displayed profound tegumental disruption/blebbing. Further, muscle tension recordings from Me-act-1 RNAi worms revealed a significant reduction in both the number of worms contracting in response to praziquantel (20+/-12%) and in their contractile ability. These data demonstrate, to our knowledge for the first time, a functional RNAi pathway in a cestode and show that the robust knock-down of abundant gene transcripts is achievable using long dsRNAs following short exposure times. 2009 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
Aravindan, Sheeja; Natarajan, Mohan; Herman, Terence S; Awasthi, Vibhudutta; Aravindan, Natarajan
2013-03-04
Heterogeneously distributed hypoxic areas are a characteristic property of locally advanced breast cancers (BCa) and generally associated with therapeutic resistance, metastases, and poor patient survival. About 50% of locally advanced BCa, where radiotherapy is less effective are suggested to be due to hypoxic regions. In this study, we investigated the potential of bioactive phytochemicals in radio-sensitizing hypoxic BCa cells. Hypoxic (O2-2.5%; N2-92.5%; CO2-5%) MCF-7 cells were exposed to 4 Gy radiation (IR) alone or after pretreatment with Curcumin (CUR), curcumin analog EF24, neem leaf extract (NLE), Genistein (GEN), Resveratrol (RES) or raspberry extract (RSE). The cells were examined for inhibition of NFκB activity, transcriptional modulation of 88 NFκB signaling pathway genes, activation and cellular localization of radio-responsive NFκB related mediators, eNos, Erk1/2, SOD2, Akt1/2/3, p50, p65, pIκBα, TNFα, Birc-1, -2, -5 and associated induction of cell death. EMSA revealed that cells exposed to phytochemicals showed complete suppression of IR-induced NFκB. Relatively, cells exposed EF24 revealed a robust inhibition of IR-induced NFκB. QPCR profiling showed induced expression of 53 NFκB signaling pathway genes after IR. Conversely, 53, 50, 53, 53, 53 and 53 of IR-induced genes were inhibited with EF24, NLE, CUR, GEN, RES and RSE respectively. In addition, 25, 29, 24, 16, 11 and 21 of 35 IR-suppressed genes were further inhibited with EF24, NLE, CUR, GEN, RES and RSE respectively. Immunoblotting revealed a significant attenuating effect of IR-modulated radio-responsive eNos, Erk1/2, SOD2, Akt1/2/3, p50, p65, pIκBα, TNFα, Birc-1, -2 and -5 with EF24, NLE, CUR, GEN, RES or RSE. Annexin V-FITC staining showed a consistent and significant induction of IR-induced cell death with these phytochemicals. Notably, EF24 robustly conferred IR-induced cell death. Together, these data identifies the potential hypoxic cell radio-sensitizers and further implies that the induced radio-sensitization may be exerted by selectively targeting IR-induced NFκB signaling.
MISSION LentiPlex pooled shRNA library screening in mammalian cells.
Coussens, Matthew J; Corman, Courtney; Fischer, Ashley L; Sago, Jack; Swarthout, John
2011-12-21
RNA interference (RNAi) is an intrinsic cellular mechanism for the regulation of gene expression. Harnessing the innate power of this system enables us to knockdown gene expression levels in loss of gene function studies. There are two main methods for performing RNAi. The first is the use of small interfering RNAs (siRNAs) that are chemically synthesized, and the second utilizes short-hairpin RNAs (shRNAs) encoded within plasmids. The latter can be transfected into cells directly or packaged into replication incompetent lentiviral particles. The main advantages of using lentiviral shRNAs is the ease of introduction into a wide variety of cell types, their ability to stably integrate into the genome for long term gene knockdown and selection, and their efficacy in conducting high-throughput loss of function screens. To facilitate this we have created the LentiPlex pooled shRNA library. The MISSION LentiPlex Human shRNA Pooled Library is a genome-wide lentiviral pool produced using a proprietary process. The library consists of over 75,000 shRNA constructs from the TRC collection targeting 15,000+ human genes. Each library is tested for shRNA representation before product release to ensure robust library coverage. The library is provided in a ready-to-use lentiviral format at titers of at least 5 x 10(8) TU/ml via p24 assay and is pre-divided into ten subpools of approximately 8,000 shRNA constructs each. Amplification and sequencing primers are also provided for downstream target identification. Previous studies established a synergistic antitumor activity of TRAIL when combined with Paclitaxel in A549 cells, a human lung carcinoma cell line. In this study we demonstrate the application of a pooled LentiPlex shRNA library to rapidly conduct a positive selection screen for genes involved in the cytotoxicity of A549 cells when exposed to TRAIL and Paclitaxel. One barrier often encountered with high-throughput screens is the cost and difficulty in deconvolution; we also detail a cost-effective polyclonal approach utilizing traditional sequencing.
Arunkumar, Ramesh; Josephs, Emily B; Williamson, Robert J; Wright, Stephen I
2013-11-01
Selection on the gametophyte can be a major force shaping plant genomes as 7-11% of genes are expressed only in that phase and 60% of genes are expressed in both the gametophytic and sporophytic phases. The efficacy of selection on gametophytic tissues is likely to be influenced by sexual selection acting on male and female functions of hermaphroditic plants. Moreover, the haploid nature of the gametophytic phase allows selection to be efficient in removing recessive deleterious mutations and fixing recessive beneficial mutations. To assess the importance of gametophytic selection, we compared the strength of purifying selection and extent of positive selection on gametophyte- and sporophyte-specific genes in the highly outcrossing plant Capsella grandiflora. We found that pollen-exclusive genes had a larger fraction of sites under strong purifying selection, a greater proportion of adaptive substitutions, and faster protein evolution compared with seedling-exclusive genes. In contrast, sperm cell-exclusive genes had a smaller fraction of sites under strong purifying selection, a lower proportion of adaptive substitutions, and slower protein evolution compared with seedling-exclusive genes. Observations of strong selection acting on pollen-expressed genes are likely explained by sexual selection resulting from pollen competition aided by the haploid nature of that tissue. The relaxation of selection in sperm might be due to the reduced influence of intrasexual competition, but reduced gene expression may also be playing an important role.
Genotype by environment interaction and breeding for robustness in livestock
Rauw, Wendy M.; Gomez-Raya, Luis
2015-01-01
The increasing size of the human population is projected to result in an increase in meat consumption. However, at the same time, the dominant position of meat as the center of meals is on the decline. Modern objections to the consumption of meat include public concerns with animal welfare in livestock production systems. Animal breeding practices have become part of the debate since it became recognized that animals in a population that have been selected for high production efficiency are more at risk for behavioral, physiological and immunological problems. As a solution, animal breeding practices need to include selection for robustness traits, which can be implemented through the use of reaction norms analysis, or though the direct inclusion of robustness traits in the breeding objective and in the selection index. This review gives an overview of genotype × environment interactions (the influence of the environment, reaction norms, phenotypic plasticity, canalization, and genetic homeostasis), reaction norms analysis in livestock production, options for selection for increased levels of production and against environmental sensitivity, and direct inclusion of robustness traits in the selection index. Ethical considerations of breeding for improved animal welfare are discussed. The discussion on animal breeding practices has been initiated and is very alive today. This positive trend is part of the sustainable food production movement that aims at feeding 9.15 billion people not just in the near future but also beyond. PMID:26539207
Naciff, Jorge M; Khambatta, Zubin S; Thomason, Ryan G; Carr, Gregory J; Tiesman, Jay P; Singleton, David W; Khan, Sohaib A; Daston, George P
2009-01-01
We have determined the gene expression profile induced by 17 alpha-ethynyl estradiol (EE) in Ishikawa cells, a human uterine-derived estrogen-sensitive cell line, at various doses (1 pM, 100 pM, 10 nM, and 1 microM) and time points (8, 24, and 48 h). The transcript profiles were compared between treatment groups and controls (vehicle-treated) using high-density oligonucleotide arrays to determine the expression level of approximately 38,500 human genes. By trend analysis, we determined that the expression of 2560 genes was modified by exposure to EE in a dose- and time-dependent manner (p = 0.0001). The annotation available for the genes affected indicates that EE exposure results in changes in multiple molecular pathways affecting various biological processes, particularly associated with development, morphogenesis, organogenesis, cell proliferation, cell organization, and biogenesis. All of these processes are also affected by estrogen exposure in the uterus of the rat. Comparison of the response to EE in both the rat uterus and the Ishikawa cells showed that 71 genes are regulated in a similar manner in vivo as well as in vitro. Further, some of the genes that show a robust response to estrogen exposure in Ishikawa cells are well known to be estrogen responsive, in various in vivo studies, such as PGR, MMP7, IGFBP3, IGFBP5, SOX4, MYC, EGR1, FOS, CKB, and CCND2, among others. These results indicate that transcript profiling can serve as a viable tool to select reliable in vitro systems to evaluate potential estrogenic activities of target chemicals and to identify genes that are relevant for the estrogen response.
Munkácsy, Gyöngyi; Sztupinszki, Zsófia; Herman, Péter; Bán, Bence; Pénzváltó, Zsófia; Szarvas, Nóra; Győrffy, Balázs
2016-09-27
No independent cross-validation of success rate for studies utilizing small interfering RNA (siRNA) for gene silencing has been completed before. To assess the influence of experimental parameters like cell line, transfection technique, validation method, and type of control, we have to validate these in a large set of studies. We utilized gene chip data published for siRNA experiments to assess success rate and to compare methods used in these experiments. We searched NCBI GEO for samples with whole transcriptome analysis before and after gene silencing and evaluated the efficiency for the target and off-target genes using the array-based expression data. Wilcoxon signed-rank test was used to assess silencing efficacy and Kruskal-Wallis tests and Spearman rank correlation were used to evaluate study parameters. All together 1,643 samples representing 429 experiments published in 207 studies were evaluated. The fold change (FC) of down-regulation of the target gene was above 0.7 in 18.5% and was above 0.5 in 38.7% of experiments. Silencing efficiency was lowest in MCF7 and highest in SW480 cells (FC = 0.59 and FC = 0.30, respectively, P = 9.3E-06). Studies utilizing Western blot for validation performed better than those with quantitative polymerase chain reaction (qPCR) or microarray (FC = 0.43, FC = 0.47, and FC = 0.55, respectively, P = 2.8E-04). There was no correlation between type of control, transfection method, publication year, and silencing efficiency. Although gene silencing is a robust feature successfully cross-validated in the majority of experiments, efficiency remained insufficient in a significant proportion of studies. Selection of cell line model and validation method had the highest influence on silencing proficiency.
MICCA: a complete and accurate software for taxonomic profiling of metagenomic data.
Albanese, Davide; Fontana, Paolo; De Filippo, Carlotta; Cavalieri, Duccio; Donati, Claudio
2015-05-19
The introduction of high throughput sequencing technologies has triggered an increase of the number of studies in which the microbiota of environmental and human samples is characterized through the sequencing of selected marker genes. While experimental protocols have undergone a process of standardization that makes them accessible to a large community of scientist, standard and robust data analysis pipelines are still lacking. Here we introduce MICCA, a software pipeline for the processing of amplicon metagenomic datasets that efficiently combines quality filtering, clustering of Operational Taxonomic Units (OTUs), taxonomy assignment and phylogenetic tree inference. MICCA provides accurate results reaching a good compromise among modularity and usability. Moreover, we introduce a de-novo clustering algorithm specifically designed for the inference of Operational Taxonomic Units (OTUs). Tests on real and synthetic datasets shows that thanks to the optimized reads filtering process and to the new clustering algorithm, MICCA provides estimates of the number of OTUs and of other common ecological indices that are more accurate and robust than currently available pipelines. Analysis of public metagenomic datasets shows that the higher consistency of results improves our understanding of the structure of environmental and human associated microbial communities. MICCA is an open source project.
MICCA: a complete and accurate software for taxonomic profiling of metagenomic data
Albanese, Davide; Fontana, Paolo; De Filippo, Carlotta; Cavalieri, Duccio; Donati, Claudio
2015-01-01
The introduction of high throughput sequencing technologies has triggered an increase of the number of studies in which the microbiota of environmental and human samples is characterized through the sequencing of selected marker genes. While experimental protocols have undergone a process of standardization that makes them accessible to a large community of scientist, standard and robust data analysis pipelines are still lacking. Here we introduce MICCA, a software pipeline for the processing of amplicon metagenomic datasets that efficiently combines quality filtering, clustering of Operational Taxonomic Units (OTUs), taxonomy assignment and phylogenetic tree inference. MICCA provides accurate results reaching a good compromise among modularity and usability. Moreover, we introduce a de-novo clustering algorithm specifically designed for the inference of Operational Taxonomic Units (OTUs). Tests on real and synthetic datasets shows that thanks to the optimized reads filtering process and to the new clustering algorithm, MICCA provides estimates of the number of OTUs and of other common ecological indices that are more accurate and robust than currently available pipelines. Analysis of public metagenomic datasets shows that the higher consistency of results improves our understanding of the structure of environmental and human associated microbial communities. MICCA is an open source project. PMID:25988396
Sterner, Reinhard; Merkl, Rainer
2016-01-01
Modern enzymes are highly optimized biocatalysts that process their substrates with extreme efficiency. Many enzymes catalyze more than one reaction; however, the persistence of such ambiguities, their consequences and evolutionary causes are largely unknown. As a paradigmatic case, we study the history of bi-functionality for a time span of approximately two billion years for the sugar isomerase HisA from histidine biosynthesis. To look back in time, we computationally reconstructed and experimentally characterized three HisA predecessors. We show that these ancient enzymes catalyze not only the HisA reaction but also the isomerization of a similar substrate, which is commonly processed by the isomerase TrpF in tryptophan biosynthesis. Moreover, we found that three modern-day HisA enzymes from Proteobacteria and Thermotogae also possess low TrpF activity. We conclude that this bi-functionality was conserved for at least two billion years, most likely without any evolutionary pressure. Although not actively selected for, this trait can become advantageous in the case of a gene loss. Such exaptation is exemplified by the Actinobacteria that have lost the trpF gene but possess the bi-functional HisA homolog PriA, which adopts the roles of both HisA and TrpF. Our findings demonstrate that bi-functionality can perpetuate in the absence of selection for very long time-spans. PMID:26824644
The Chromatin Remodeler SPLAYED Regulates Specific Stress Signaling Pathways
Walley, Justin W.; Rowe, Heather C.; Xiao, Yanmei; Chehab, E. Wassim; Kliebenstein, Daniel J.; Wagner, Doris; Dehesh, Katayoon
2008-01-01
Organisms are continuously exposed to a myriad of environmental stresses. Central to an organism's survival is the ability to mount a robust transcriptional response to the imposed stress. An emerging mechanism of transcriptional control involves dynamic changes in chromatin structure. Alterations in chromatin structure are brought about by a number of different mechanisms, including chromatin modifications, which covalently modify histone proteins; incorporation of histone variants; and chromatin remodeling, which utilizes ATP hydrolysis to alter histone-DNA contacts. While considerable insight into the mechanisms of chromatin remodeling has been gained, the biological role of chromatin remodeling complexes beyond their function as regulators of cellular differentiation and development has remained poorly understood. Here, we provide genetic, biochemical, and biological evidence for the critical role of chromatin remodeling in mediating plant defense against specific biotic stresses. We found that the Arabidopsis SWI/SNF class chromatin remodeling ATPase SPLAYED (SYD) is required for the expression of selected genes downstream of the jasmonate (JA) and ethylene (ET) signaling pathways. SYD is also directly recruited to the promoters of several of these genes. Furthermore, we show that SYD is required for resistance against the necrotrophic pathogen Botrytis cinerea but not the biotrophic pathogen Pseudomonas syringae. These findings demonstrate not only that chromatin remodeling is required for selective pathogen resistance, but also that chromatin remodelers such as SYD can regulate specific pathways within biotic stress signaling networks. PMID:19079584
The chromatin remodeler SPLAYED regulates specific stress signaling pathways.
Walley, Justin W; Rowe, Heather C; Xiao, Yanmei; Chehab, E Wassim; Kliebenstein, Daniel J; Wagner, Doris; Dehesh, Katayoon
2008-12-01
Organisms are continuously exposed to a myriad of environmental stresses. Central to an organism's survival is the ability to mount a robust transcriptional response to the imposed stress. An emerging mechanism of transcriptional control involves dynamic changes in chromatin structure. Alterations in chromatin structure are brought about by a number of different mechanisms, including chromatin modifications, which covalently modify histone proteins; incorporation of histone variants; and chromatin remodeling, which utilizes ATP hydrolysis to alter histone-DNA contacts. While considerable insight into the mechanisms of chromatin remodeling has been gained, the biological role of chromatin remodeling complexes beyond their function as regulators of cellular differentiation and development has remained poorly understood. Here, we provide genetic, biochemical, and biological evidence for the critical role of chromatin remodeling in mediating plant defense against specific biotic stresses. We found that the Arabidopsis SWI/SNF class chromatin remodeling ATPase SPLAYED (SYD) is required for the expression of selected genes downstream of the jasmonate (JA) and ethylene (ET) signaling pathways. SYD is also directly recruited to the promoters of several of these genes. Furthermore, we show that SYD is required for resistance against the necrotrophic pathogen Botrytis cinerea but not the biotrophic pathogen Pseudomonas syringae. These findings demonstrate not only that chromatin remodeling is required for selective pathogen resistance, but also that chromatin remodelers such as SYD can regulate specific pathways within biotic stress signaling networks.
Recent Developments on Genetic Engineering of Microalgae for Biofuels and Bio-Based Chemicals.
Ng, I-Son; Tan, Shih-I; Kao, Pei-Hsun; Chang, Yu-Kaung; Chang, Jo-Shu
2017-10-01
Microalgae serve as a promising source for the production of biofuels and bio-based chemicals. They are superior to terrestrial plants as feedstock in many aspects and their biomass is naturally rich in lipids, carbohydrates, proteins, pigments, and other valuable compounds. Due to the relatively slow growth rate and high cultivation cost of microalgae, to screen efficient and robust microalgal strains as well as genetic modifications of the available strains for further improvement are of urgent demand in the development of microalgae-based biorefinery. In genetic engineering of microalgae, transformation and selection methods are the key steps to accomplish the target gene modification. However, determination of the preferable type and dosage of antibiotics used for transformant selection is usually time-consuming and microalgal-strain-dependent. Therefore, more powerful and efficient techniques should be developed to meet this need. In this review, the conventional and emerging genome-editing tools (e.g., CRISPR-Cas9, TALEN, and ZFN) used in editing the genomes of nuclear, mitochondria, and chloroplast of microalgae are thoroughly surveyed. Although all the techniques mentioned above demonstrate their abilities to perform gene editing and desired phenotype screening, there still need to overcome higher production cost and lower biomass productivity, to achieve efficient production of the desired products in microalgal biorefineries. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Logical analysis of diffuse large B-cell lymphomas.
Alexe, G; Alexe, S; Axelrod, D E; Hammer, P L; Weissmann, D
2005-07-01
The goal of this study is to re-examine the oligonucleotide microarray dataset of Shipp et al., which contains the intensity levels of 6817 genes of 58 patients with diffuse large B-cell lymphoma (DLBCL) and 19 with follicular lymphoma (FL), by means of the combinatorics, optimisation, and logic-based methodology of logical analysis of data (LAD). The motivations for this new analysis included the previously demonstrated capabilities of LAD and its expected potential (1) to identify different informative genes than those discovered by conventional statistical methods, (2) to identify combinations of gene expression levels capable of characterizing different types of lymphoma, and (3) to assemble collections of such combinations that if considered jointly are capable of accurately distinguishing different types of lymphoma. The central concept of LAD is a pattern or combinatorial biomarker, a concept that resembles a rule as used in decision tree methods. LAD is able to exhaustively generate the collection of all those patterns which satisfy certain quality constraints, through a systematic combinatorial process guided by clear optimization criteria. Then, based on a set covering approach, LAD aggregates the collection of patterns into classification models. In addition, LAD is able to use the information provided by large collections of patterns in order to extract subsets of variables, which collectively are able to distinguish between different types of disease. For the differential diagnosis of DLBCL versus FL, a model based on eight significant genes is constructed and shown to have a sensitivity of 94.7% and a specificity of 100% on the test set. For the prognosis of good versus poor outcome among the DLBCL patients, a model is constructed on another set consisting also of eight significant genes, and shown to have a sensitivity of 87.5% and a specificity of 90% on the test set. The genes selected by LAD also work well as a basis for other kinds of statistical analysis, indicating their robustness. These two models exhibit accuracies that compare favorably to those in the original study. In addition, the current study also provides a ranking by importance of the genes in the selected significant subsets as well as a library of dozens of combinatorial biomarkers (i.e. pairs or triplets of genes) that can serve as a source of mathematically generated, statistically significant research hypotheses in need of biological explanation.
Genomic Signature of Kin Selection in an Ant with Obligately Sterile Workers
Warner, Michael R.; Mikheyev, Alexander S.
2017-01-01
Abstract Kin selection is thought to drive the evolution of cooperation and conflict, but the specific genes and genome-wide patterns shaped by kin selection are unknown. We identified thousands of genes associated with the sterile ant worker caste, the archetype of an altruistic phenotype shaped by kin selection, and then used population and comparative genomic approaches to study patterns of molecular evolution at these genes. Consistent with population genetic theoretical predictions, worker-upregulated genes experienced reduced selection compared with genes upregulated in reproductive castes. Worker-upregulated genes included more taxonomically restricted genes, indicating that the worker caste has recruited more novel genes, yet these genes also experienced reduced selection. Our study identifies a putative genomic signature of kin selection and helps to integrate emerging sociogenomic data with longstanding social evolution theory. PMID:28419349
Gene Expression Signatures Based on Variability can Robustly Predict Tumor Progression and Prognosis
Dinalankara, Wikum; Bravo, Héctor Corrada
2015-01-01
Gene expression signatures are commonly used to create cancer prognosis and diagnosis methods, yet only a small number of them are successfully deployed in the clinic since many fail to replicate performance on subsequent validation. A primary reason for this lack of reproducibility is the fact that these signatures attempt to model the highly variable and unstable genomic behavior of cancer. Our group recently introduced gene expression anti-profiles as a robust methodology to derive gene expression signatures based on the observation that while gene expression measurements are highly heterogeneous across tumors of a specific cancer type relative to the normal tissue, their degree of deviation from normal tissue expression in specific genes involved in tissue differentiation is a stable tumor mark that is reproducible across experiments and cancer types. Here we show that constructing gene expression signatures based on variability and the anti-profile approach yields classifiers capable of successfully distinguishing benign growths from cancerous growths based on deviation from normal expression. We then show that this same approach generates stable and reproducible signatures that predict probability of relapse and survival based on tumor gene expression. These results suggest that using the anti-profile framework for the discovery of genomic signatures is an avenue leading to the development of reproducible signatures suitable for adoption in clinical settings. PMID:26078586
Network Hubs Buffer Environmental Variation in Saccharomyces cerevisiae
Levy, Sasha F; Siegal, Mark L
2008-01-01
Regulatory and developmental systems produce phenotypes that are robust to environmental and genetic variation. A gene product that normally contributes to this robustness is termed a phenotypic capacitor. When a phenotypic capacitor fails, for example when challenged by a harsh environment or mutation, the system becomes less robust and thus produces greater phenotypic variation. A functional phenotypic capacitor provides a mechanism by which hidden polymorphism can accumulate, whereas its failure provides a mechanism by which evolutionary change might be promoted. The primary example to date of a phenotypic capacitor is Hsp90, a molecular chaperone that targets a large set of signal transduction proteins. In both Drosophila and Arabidopsis, compromised Hsp90 function results in pleiotropic phenotypic effects dependent on the underlying genotype. For some traits, Hsp90 also appears to buffer stochastic variation, yet the relationship between environmental and genetic buffering remains an important unresolved question. We previously used simulations of knockout mutations in transcriptional networks to predict that many gene products would act as phenotypic capacitors. To test this prediction, we use high-throughput morphological phenotyping of individual yeast cells from single-gene deletion strains to identify gene products that buffer environmental variation in Saccharomyces cerevisiae. We find more than 300 gene products that, when absent, increase morphological variation. Overrepresented among these capacitors are gene products that control chromosome organization and DNA integrity, RNA elongation, protein modification, cell cycle, and response to stimuli such as stress. Capacitors have a high number of synthetic-lethal interactions but knockouts of these genes do not tend to cause severe decreases in growth rate. Each capacitor can be classified based on whether or not it is encoded by a gene with a paralog in the genome. Capacitors with a duplicate are highly connected in the protein–protein interaction network and show considerable divergence in expression from their paralogs. In contrast, capacitors encoded by singleton genes are part of highly interconnected protein clusters whose other members also tend to affect phenotypic variability or fitness. These results suggest that buffering and release of variation is a widespread phenomenon that is caused by incomplete functional redundancy at multiple levels in the genetic architecture. PMID:18986213
Wei, Jiangyong; Hu, Xiaohua; Zou, Xiufen; Tian, Tianhai
2017-12-28
Recent advances in omics technologies have raised great opportunities to study large-scale regulatory networks inside the cell. In addition, single-cell experiments have measured the gene and protein activities in a large number of cells under the same experimental conditions. However, a significant challenge in computational biology and bioinformatics is how to derive quantitative information from the single-cell observations and how to develop sophisticated mathematical models to describe the dynamic properties of regulatory networks using the derived quantitative information. This work designs an integrated approach to reverse-engineer gene networks for regulating early blood development based on singel-cell experimental observations. The wanderlust algorithm is initially used to develop the pseudo-trajectory for the activities of a number of genes. Since the gene expression data in the developed pseudo-trajectory show large fluctuations, we then use Gaussian process regression methods to smooth the gene express data in order to obtain pseudo-trajectories with much less fluctuations. The proposed integrated framework consists of both bioinformatics algorithms to reconstruct the regulatory network and mathematical models using differential equations to describe the dynamics of gene expression. The developed approach is applied to study the network regulating early blood cell development. A graphic model is constructed for a regulatory network with forty genes and a dynamic model using differential equations is developed for a network of nine genes. Numerical results suggests that the proposed model is able to match experimental data very well. We also examine the networks with more regulatory relations and numerical results show that more regulations may exist. We test the possibility of auto-regulation but numerical simulations do not support the positive auto-regulation. In addition, robustness is used as an importantly additional criterion to select candidate networks. The research results in this work shows that the developed approach is an efficient and effective method to reverse-engineer gene networks using single-cell experimental observations.
Zhu, LiuCun; Chen, XiJia; Kong, Xiangyin; Cai, Yu-Dong
2016-11-01
Hepatitis is a type of infectious disease that induces inflammation of the liver without pinpointing a particular pathogen or pathogenesis. Type C hepatitis, as a type of hepatitis, has been reported to induce cirrhosis and hepatocellular carcinoma within a very short amount of time. It is a great threat to human health. Some studies have revealed that trace elements are associated with infection with and immune rejection against hepatitis C virus (HCV). However, the mechanism underlying this phenomenon is still unclear. In this study, we aimed to expand our knowledge of this phenomenon by designing a computational method to identify genes that may be related to both HCV and trace element metabolic processes. The searching procedure included three stages. First, a shortest path algorithm was applied to a large network, constructed by protein-protein interactions, to identify potential genes of interest. Second, a permutation test was executed to exclude false discoveries. Finally, some rules based on the betweenness and associations between candidate genes and HCV and trace elements were built to select core genes among the remaining genes. 12 lists of genes, corresponding to 12 types of trace elements, were obtained. These genes are deemed to be associated with HCV infection and trace elements metabolism. The analyses indicate that some genes may be related to both HCV and trace element metabolic processes, further confirming the associations between HCV and trace elements. The method was further tested on another set of HCV genes, the results indicate that this method is quite robustness. The newly found genes may partially reveal unknown mechanisms between HCV infection and trace element metabolism. This article is part of a Special Issue entitled "System Genetics" Guest Editor: Dr. Yudong Cai and Dr. Tao Huang. Copyright © 2016 Elsevier B.V. All rights reserved.
Genome-wide gene order distances support clustering the gram-positive bacteria
House, Christopher H.; Pellegrini, Matteo; Fitz-Gibbon, Sorel T.
2015-01-01
Initially using 143 genomes, we developed a method for calculating the pair-wise distance between prokaryotic genomes using a Monte Carlo method to estimate the conservation of gene order. The method was based on repeatedly selecting five or six non-adjacent random orthologs from each of two genomes and determining if the chosen orthologs were in the same order. The raw distances were then corrected for gene order convergence using an adaptation of the Jukes-Cantor model, as well as using the common distance correction D′ = −ln(1-D). First, we compared the distances found via the order of six orthologs to distances found based on ortholog gene content and small subunit rRNA sequences. The Jukes-Cantor gene order distances are reasonably well correlated with the divergence of rRNA (R2 = 0.24), especially at rRNA Jukes-Cantor distances of less than 0.2 (R2 = 0.52). Gene content is only weakly correlated with rRNA divergence (R2 = 0.04) over all distances, however, it is especially strongly correlated at rRNA Jukes-Cantor distances of less than 0.1 (R2 = 0.67). This initial work suggests that gene order may be useful in conjunction with other methods to help understand the relatedness of genomes. Using the gene order distances in 143 genomes, the relations of prokaryotes were studied using neighbor joining and agreement subtrees. We then repeated our study of the relations of prokaryotes using gene order in 172 complete genomes better representing a wider-diversity of prokaryotes. Consistently, our trees show the Actinobacteria as a sister group to the bulk of the Firmicutes. In fact, the robustness of gene order support was found to be considerably greater for uniting these two phyla than for uniting any of the proteobacterial classes together. The results are supportive of the idea that Actinobacteria and Firmicutes are closely related, which in turn implies a single origin for the gram-positive cell. PMID:25653643
Whole genome sequencing data and de novo draft assemblies for 66 teleost species
Malmstrøm, Martin; Matschiner, Michael; Tørresen, Ole K.; Jakobsen, Kjetill S.; Jentoft, Sissel
2017-01-01
Teleost fishes comprise more than half of all vertebrate species, yet genomic data are only available for 0.2% of their diversity. Here, we present whole genome sequencing data for 66 new species of teleosts, vastly expanding the availability of genomic data for this important vertebrate group. We report on de novo assemblies based on low-coverage (9–39×) sequencing and present detailed methodology for all analyses. To facilitate further utilization of this data set, we present statistical analyses of the gene space completeness and verify the expected phylogenetic position of the sequenced genomes in a large mitogenomic context. We further present a nuclear marker set used for phylogenetic inference and evaluate each gene tree in relation to the species tree to test for homogeneity in the phylogenetic signal. Collectively, these analyses illustrate the robustness of this highly diverse data set and enable extensive reuse of the selected phylogenetic markers and the genomic data in general. This data set covers all major teleost lineages and provides unprecedented opportunities for comparative studies of teleosts. PMID:28094797
Cho, Min Seok; Jin, Yong Ju; Kang, Bo Kyoung; Park, Yu Kyoung; Kim, ChangKug; Park, Dong Suk
2018-05-04
Bacillus subtilis and B. velezensis are frequently isolated from various niches, including fermented foods, water, and soil. Within the Bacillus subtilis group, B. velezensis and B. subtilis subsp. subtilis have received significant attention as biological resources for biotechnology-associated industries. Nevertheless, radical solutions are urgently needed to identify microbes during their ecological succession to accurately confirm their action at the species or subspecies level in diverse environments, such as fermented materials. Thus, in this study, previously published genome data of the B. subtilis group were compared to exploit species- or subspecies-specific genes for use as improved qPCR targets to detect B. velezensis and B. subtilis subsp. subtilis in kimchi samples. In silico analyses of the selected genes and designed primer sequences, in conjunction with SYBR Green real-time PCR, confirmed the robustness of this newly developed assay. Consequently, this study will allow for new insights into the ontogeny and succession of B. velezensis and B. subtilis subsp. subtilis in various niches. Interestingly, in white kimchi without red pepper powder, neither B. subtilis subsp. subtilis nor B. velezensis was detected.
Nature and function of insulator protein binding sites in the Drosophila genome
Schwartz, Yuri B.; Linder-Basso, Daniela; Kharchenko, Peter V.; Tolstorukov, Michael Y.; Kim, Maria; Li, Hua-Bing; Gorchakov, Andrey A.; Minoda, Aki; Shanower, Gregory; Alekseyenko, Artyom A.; Riddle, Nicole C.; Jung, Youngsook L.; Gu, Tingting; Plachetka, Annette; Elgin, Sarah C.R.; Kuroda, Mitzi I.; Park, Peter J.; Savitsky, Mikhail; Karpen, Gary H.; Pirrotta, Vincenzo
2012-01-01
Chromatin insulator elements and associated proteins have been proposed to partition eukaryotic genomes into sets of independently regulated domains. Here we test this hypothesis by quantitative genome-wide analysis of insulator protein binding to Drosophila chromatin. We find distinct combinatorial binding of insulator proteins to different classes of sites and uncover a novel type of insulator element that binds CP190 but not any other known insulator proteins. Functional characterization of different classes of binding sites indicates that only a small fraction act as robust insulators in standard enhancer-blocking assays. We show that insulators restrict the spreading of the H3K27me3 mark but only at a small number of Polycomb target regions and only to prevent repressive histone methylation within adjacent genes that are already transcriptionally inactive. RNAi knockdown of insulator proteins in cultured cells does not lead to major alterations in genome expression. Taken together, these observations argue against the concept of a genome partitioned by specialized boundary elements and suggest that insulators are reserved for specific regulation of selected genes. PMID:22767387
Simple-MSSM: a simple and efficient method for simultaneous multi-site saturation mutagenesis.
Cheng, Feng; Xu, Jian-Miao; Xiang, Chao; Liu, Zhi-Qiang; Zhao, Li-Qing; Zheng, Yu-Guo
2017-04-01
To develop a practically simple and robust multi-site saturation mutagenesis (MSSM) method that enables simultaneously recombination of amino acid positions for focused mutant library generation. A general restriction enzyme-free and ligase-free MSSM method (Simple-MSSM) based on prolonged overlap extension PCR (POE-PCR) and Simple Cloning techniques. As a proof of principle of Simple-MSSM, the gene of eGFP (enhanced green fluorescent protein) was used as a template gene for simultaneous mutagenesis of five codons. Forty-eight randomly selected clones were sequenced. Sequencing revealed that all the 48 clones showed at least one mutant codon (mutation efficiency = 100%), and 46 out of the 48 clones had mutations at all the five codons. The obtained diversities at these five codons are 27, 24, 26, 26 and 22, respectively, which correspond to 84, 75, 81, 81, 69% of the theoretical diversity offered by NNK-degeneration (32 codons; NNK, K = T or G). The enzyme-free Simple-MSSM method can simultaneously and efficiently saturate five codons within one day, and therefore avoid missing interactions between residues in interacting amino acid networks.
The validity and value of inclusive fitness theory
Bourke, Andrew F. G.
2011-01-01
Social evolution is a central topic in evolutionary biology, with the evolution of eusociality (societies with altruistic, non-reproductive helpers) representing a long-standing evolutionary conundrum. Recent critiques have questioned the validity of the leading theory for explaining social evolution and eusociality, namely inclusive fitness (kin selection) theory. I review recent and past literature to argue that these critiques do not succeed. Inclusive fitness theory has added fundamental insights to natural selection theory. These are the realization that selection on a gene for social behaviour depends on its effects on co-bearers, the explanation of social behaviours as unalike as altruism and selfishness using the same underlying parameters, and the explanation of within-group conflict in terms of non-coinciding inclusive fitness optima. A proposed alternative theory for eusocial evolution assumes mistakenly that workers' interests are subordinate to the queen's, contains no new elements and fails to make novel predictions. The haplodiploidy hypothesis has yet to be rigorously tested and positive relatedness within diploid eusocial societies supports inclusive fitness theory. The theory has made unique, falsifiable predictions that have been confirmed, and its evidence base is extensive and robust. Hence, inclusive fitness theory deserves to keep its position as the leading theory for social evolution. PMID:21920980
The validity and value of inclusive fitness theory.
Bourke, Andrew F G
2011-11-22
Social evolution is a central topic in evolutionary biology, with the evolution of eusociality (societies with altruistic, non-reproductive helpers) representing a long-standing evolutionary conundrum. Recent critiques have questioned the validity of the leading theory for explaining social evolution and eusociality, namely inclusive fitness (kin selection) theory. I review recent and past literature to argue that these critiques do not succeed. Inclusive fitness theory has added fundamental insights to natural selection theory. These are the realization that selection on a gene for social behaviour depends on its effects on co-bearers, the explanation of social behaviours as unalike as altruism and selfishness using the same underlying parameters, and the explanation of within-group conflict in terms of non-coinciding inclusive fitness optima. A proposed alternative theory for eusocial evolution assumes mistakenly that workers' interests are subordinate to the queen's, contains no new elements and fails to make novel predictions. The haplodiploidy hypothesis has yet to be rigorously tested and positive relatedness within diploid eusocial societies supports inclusive fitness theory. The theory has made unique, falsifiable predictions that have been confirmed, and its evidence base is extensive and robust. Hence, inclusive fitness theory deserves to keep its position as the leading theory for social evolution.
Fleming, Damarius S.; Weigend, Steffen; Simianer, Henner; Weigend, Annett; Rothschild, Max; Schmidt, Carl; Ashwell, Chris; Persia, Mike; Reecy, James; Lamont, Susan J.
2017-01-01
Global climate change is increasing the magnitude of environmental stressors, such as temperature, pathogens, and drought, that limit the survivability and sustainability of livestock production. Poultry production and its expansion is dependent upon robust animals that are able to cope with stressors in multiple environments. Understanding the genetic strategies that indigenous, noncommercial breeds have evolved to survive in their environment could help to elucidate molecular mechanisms underlying biological traits of environmental adaptation. We examined poultry from diverse breeds and climates of Africa and Northern Europe for selection signatures that have allowed them to adapt to their indigenous environments. Selection signatures were studied using a combination of population genomic methods that employed FST, integrated haplotype score (iHS), and runs of homozygosity (ROH) procedures. All the analyses indicated differences in environment as a driver of selective pressure in both groups of populations. The analyses revealed unique differences in the genomic regions under selection pressure from the environment for each population. The African chickens showed stronger selection toward stress signaling and angiogenesis, while the Northern European chickens showed more selection pressure toward processes related to energy homeostasis. The results suggest that chromosomes 2 and 27 are the most diverged between populations and the most selected upon within the African (chromosome 27) and Northern European (chromosome 2) birds. Examination of the divergent populations has provided new insight into genes under possible selection related to tolerance of a population’s indigenous environment that may be baselines for examining the genomic contribution to tolerance adaptions. PMID:28341699
Fleming, Damarius S; Weigend, Steffen; Simianer, Henner; Weigend, Annett; Rothschild, Max; Schmidt, Carl; Ashwell, Chris; Persia, Mike; Reecy, James; Lamont, Susan J
2017-05-05
Global climate change is increasing the magnitude of environmental stressors, such as temperature, pathogens, and drought, that limit the survivability and sustainability of livestock production. Poultry production and its expansion is dependent upon robust animals that are able to cope with stressors in multiple environments. Understanding the genetic strategies that indigenous, noncommercial breeds have evolved to survive in their environment could help to elucidate molecular mechanisms underlying biological traits of environmental adaptation. We examined poultry from diverse breeds and climates of Africa and Northern Europe for selection signatures that have allowed them to adapt to their indigenous environments. Selection signatures were studied using a combination of population genomic methods that employed F ST , integrated haplotype score (iHS), and runs of homozygosity (ROH) procedures. All the analyses indicated differences in environment as a driver of selective pressure in both groups of populations. The analyses revealed unique differences in the genomic regions under selection pressure from the environment for each population. The African chickens showed stronger selection toward stress signaling and angiogenesis, while the Northern European chickens showed more selection pressure toward processes related to energy homeostasis. The results suggest that chromosomes 2 and 27 are the most diverged between populations and the most selected upon within the African (chromosome 27) and Northern European (chromosome 2) birds. Examination of the divergent populations has provided new insight into genes under possible selection related to tolerance of a population's indigenous environment that may be baselines for examining the genomic contribution to tolerance adaptions. Copyright © 2017 Fleming et al.
Considerations for designing chemical screening strategies in plant biology
Serrano, Mario; Kombrink, Erich; Meesters, Christian
2015-01-01
Traditionally, biologists regularly used classical genetic approaches to characterize and dissect plant processes. However, this strategy is often impaired by redundancy, lethality or pleiotropy of gene functions, which prevent the isolation of viable mutants. The chemical genetic approach has been recognized as an alternative experimental strategy, which has the potential to circumvent these problems. It relies on the capacity of small molecules to modify biological processes by specific binding to protein target(s), thereby conditionally modifying protein function(s), which phenotypically resemble mutation(s) of the encoding gene(s). A successful chemical screening campaign comprises three equally important elements: (1) a reliable, robust, and quantitative bioassay, which allows to distinguish between potent and less potent compounds, (2) a rigorous validation process for candidate compounds to establish their selectivity, and (3) an experimental strategy for elucidating a compound's mode of action and molecular target. In this review we will discuss details of this general strategy and additional aspects that deserve consideration in order to take full advantage of the power provided by the chemical approach to plant biology. In addition, we will highlight some success stories of recent chemical screenings in plant systems, which may serve as teaching examples for the implementation of future chemical biology projects. PMID:25904921
Survey on the phage resistance mechanisms displayed by a dairy Lactobacillus helveticus strain.
Zago, Miriam; Orrù, Luigi; Rossetti, Lia; Lamontanara, Antonella; Fornasari, Maria Emanuela; Bonvini, Barbara; Meucci, Aurora; Carminati, Domenico; Cattivelli, Luigi; Giraffa, Giorgio
2017-09-01
In this study the presence and functionality of phage defence mechanisms in Lactobacillus helveticus ATCC 10386, a strain of dairy origin which is sensitive to ΦLh56, were investigated. After exposure of ATCC 10386 to ΦLh56, the whole-genome sequences of ATCC 10386 and of a phage-resistant derivative (LhM3) were compared. LhM3 showed deletions in the S-layer protein and a higher expression of the genes involved in the restriction/modification (R/M) system. Genetic data were substantiated by measurements of bacteriophage adsorption rates, efficiency of plaquing, cell wall protein size and by gene expression analysis. In LhM3 two phage resistance mechanisms, the inhibition of phage adsorption and the upregulation of Type I R/M genes, take place and explain its resistance to ΦLh56. Although present in both ATCC 10386 and LhM3 genomes, the CRISPR machinery did not seem to play a role in the phage resistance of LhM3. Overall, the natural selection of phage resistant strains resulted successful in detecting variants carrying multiple phage defence mechanisms in L. helveticus. The concurrent presence of multiple phage-resistance systems should provide starter strains with increased fitness and robustness in dairy ecosystems. Copyright © 2017 Elsevier Ltd. All rights reserved.
2015-01-01
Background microRNA (miRNA) expression plays an influential role in cancer classification and malignancy, and miRNAs are feasible as alternative diagnostic markers for pancreatic cancer, a highly aggressive neoplasm with silent early symptoms, high metastatic potential, and resistance to conventional therapies. Methods In this study, we evaluated the benefits of multi-omics data analysis by integrating miRNA and mRNA expression data in pancreatic cancer. Using support vector machine (SVM) modelling and leave-one-out cross validation (LOOCV), we evaluated the diagnostic performance of single- or multi-markers based on miRNA and mRNA expression profiles from 104 PDAC tissues and 17 benign pancreatic tissues. For selecting even more reliable and robust markers, we performed validation by independent datasets from the Gene Expression Omnibus (GEO) and the Cancer Genome Atlas (TCGA) data depositories. For validation, miRNA activity was estimated by miRNA-target gene interaction and mRNA expression datasets in pancreatic cancer. Results Using a comprehensive identification approach, we successfully identified 705 multi-markers having powerful diagnostic performance for PDAC. In addition, these marker candidates annotated with cancer pathways using gene ontology analysis. Conclusions Our prediction models have strong potential for the diagnosis of pancreatic cancer. PMID:26328610
Large scale genomic reorganization of topological domains at the HoxD locus.
Fabre, Pierre J; Leleu, Marion; Mormann, Benjamin H; Lopez-Delisle, Lucille; Noordermeer, Daan; Beccari, Leonardo; Duboule, Denis
2017-08-07
The transcriptional activation of HoxD genes during mammalian limb development involves dynamic interactions with two topologically associating domains (TADs) flanking the HoxD cluster. In particular, the activation of the most posterior HoxD genes in developing digits is controlled by regulatory elements located in the centromeric TAD (C-DOM) through long-range contacts. To assess the structure-function relationships underlying such interactions, we measured compaction levels and TAD discreteness using a combination of chromosome conformation capture (4C-seq) and DNA FISH. We assessed the robustness of the TAD architecture by using a series of genomic deletions and inversions that impact the integrity of this chromatin domain and that remodel long-range contacts. We report multi-partite associations between HoxD genes and up to three enhancers. We find that the loss of native chromatin topology leads to the remodeling of TAD structure following distinct parameters. Our results reveal that the recomposition of TAD architectures after large genomic re-arrangements is dependent on a boundary-selection mechanism in which CTCF mediates the gating of long-range contacts in combination with genomic distance and sequence specificity. Accordingly, the building of a recomposed TAD at this locus depends on distinct functional and constitutive parameters.
Utilizing cattle genetic trends to evaluate the robust nature of gene bank collections
USDA-ARS?s Scientific Manuscript database
The National Animal Germplasm Program has developed substantial germplasm collections (>800,000 samples) for more than 300 unique livestock populations or breeds. Gene bank utilization is relatively new to the livestock sector and the long term genetic relevance of such collections has not been docu...
Identification of a Pi9 containing rice germplasm with a newly developed robust marker
USDA-ARS?s Scientific Manuscript database
The Pi9 gene, originating from Oryza minuta, is an effective resistance gene for controlling rice blast disease (Magnaporthe oryzae). However, currently available linked DNA markers do not accurately identify the function of Pi9, thus hindering its efficient incorporation into new cultivars through...
Acute ozone-induced pulmonary injury and inflammation are well characterized. A few studies have used gene expression profiling to determine the types of changes induced by ozone; however the mechanisms or the pathways involved are less well understood. We presumed that robust bi...
Elyasigomari, V; Lee, D A; Screen, H R C; Shaheed, M H
2017-03-01
For each cancer type, only a few genes are informative. Due to the so-called 'curse of dimensionality' problem, the gene selection task remains a challenge. To overcome this problem, we propose a two-stage gene selection method called MRMR-COA-HS. In the first stage, the minimum redundancy and maximum relevance (MRMR) feature selection is used to select a subset of relevant genes. The selected genes are then fed into a wrapper setup that combines a new algorithm, COA-HS, using the support vector machine as a classifier. The method was applied to four microarray datasets, and the performance was assessed by the leave one out cross-validation method. Comparative performance assessment of the proposed method with other evolutionary algorithms suggested that the proposed algorithm significantly outperforms other methods in selecting a fewer number of genes while maintaining the highest classification accuracy. The functions of the selected genes were further investigated, and it was confirmed that the selected genes are biologically relevant to each cancer type. Copyright © 2017. Published by Elsevier Inc.
Gene Therapy for Fracture Repair
2005-12-01
therapeutic benefits. We have identified a murine leukemia virus (MLV) vector that provides robust transgene expression in fracture tissues, and applied it to...During the second year of funding, we used the surgical technique to apply the murine leukemia virus (MLV)-based vector to the fracture tissues and...trochanter. ii ) Fracture Injection The therapeutic gene chosen was the BMP-2/4 hybrid gene. To most accurately establish the expression of the
Noise-aided computation within a synthetic gene network through morphable and robust logic gates
NASA Astrophysics Data System (ADS)
Dari, Anna; Kia, Behnam; Wang, Xiao; Bulsara, Adi R.; Ditto, William
2011-04-01
An important goal for synthetic biology is to build robust and tunable genetic regulatory networks that are capable of performing assigned operations, usually in the presence of noise. In this work, a synthetic gene network derived from the bacteriophage λ underpins a reconfigurable logic gate wherein we exploit noise and nonlinearity through the application of the logical stochastic resonance paradigm. This biological logic gate can emulate or “morph” the AND and OR operations through varying internal system parameters in a noisy background. Such genetic circuits can afford intriguing possibilities in the realization of engineered genetic networks in which the actual function of the gate can be changed after the network has been built, via an external control parameter. In this article, the full system characterization is reported, with the logic gate performance studied in the presence of external and internal noise. The robustness of the gate, to noise, is studied and illustrated through numerical simulations.
Cimarelli, Lucia; Singh, Kumar Saurabh; Mai, Nguyen Thi Nhu; Dhar, Bidhan Chandra; Brandi, Anna; Brandi, Letizia; Spurio, Roberto
2015-05-22
Our understanding of the composition of diatom communities and their response to environmental changes is currently limited by laborious taxonomic identification procedures. Advances in molecular technologies are expected to contribute more efficient, robust and sensitive tools for the detection of these ecologically relevant microorganisms. There is a need to explore and test phylogenetic markers as an alternative to the use of rRNA genes, whose limited sequence divergence does not allow the accurate discrimination of diatoms at the species level. In this work, nine diatom species belonging to eight genera, isolated from epylithic environmental samples collected in central Italy, were chosen to implement a panel of diatoms covering the full range of ecological status of freshwaters. The procedure described in this work relies on the PCR amplification of specific regions in two conserved diatom genes, elongation factor 1-a (eEF1-a) and silicic acid transporter (SIT), as a first step to narrow down the complexity of the targets, followed by microarray hybridization experiments. Oligonucleotide probes with the potential to discriminate closely related species were designed taking into account the genetic polymorphisms found in target genes. These probes were tested, refined and validated on a small-scale prototype DNA chip. Overall, we obtained 17 highly specific probes targeting eEF1-a and SIT, along with 19 probes having lower discriminatory power recognizing at the same time two or three species. This basic array was validated in a laboratory setting and is ready for tests with crude environmental samples eventually to be scaled-up to include a larger panel of diatoms. Its possible use for the simultaneous detection of diatoms selected from the classes of water quality identified by the European Water Framework Directive is discussed.
Doi, Masao
2013-12-01
Recent advances in circadian biology strongly suggest that there are still genes involved in the generation and maintenance of biological rhythms that remain to be identified. It has been generally appreciated that circadian rhythms are generated intracellularly through transcription/translation-based autoregulatory feedback circuits of the clock genes. However, the existence of new intracellular clock machinery that cannot be explained by existing clock genes has recently been reported. This clock manifests as oxidation-reduction cycles of peroxiredoxin proteins, implying that as-yet-undiscovered clock genes may exist within cells to regulate redox cycling. Moreover, great strides have also been made in understanding the cell-cell communication-based robust circadian oscillations of the suprachiasmatic nucleus (SCN), the central pacemaker in the brain. Thousands of neurons that constitute the SCN maintain a high degree of synchrony in a way that allows the SCN neurons to create coherent signals as a whole. Inactivation of the genes involved in the cell-cell synchronization of the SCN, which include the genes encoding VIP, VPAC2, and RGS16, leads to altered circadian rhythms in behavior and physiologies. The purpose of this review is to provide an overview of recent advances in the circadian biology, with a special emphasis on the importance of cell-cell interactions within the SCN.
Shrinkage covariance matrix approach based on robust trimmed mean in gene sets detection
NASA Astrophysics Data System (ADS)
Karjanto, Suryaefiza; Ramli, Norazan Mohamed; Ghani, Nor Azura Md; Aripin, Rasimah; Yusop, Noorezatty Mohd
2015-02-01
Microarray involves of placing an orderly arrangement of thousands of gene sequences in a grid on a suitable surface. The technology has made a novelty discovery since its development and obtained an increasing attention among researchers. The widespread of microarray technology is largely due to its ability to perform simultaneous analysis of thousands of genes in a massively parallel manner in one experiment. Hence, it provides valuable knowledge on gene interaction and function. The microarray data set typically consists of tens of thousands of genes (variables) from just dozens of samples due to various constraints. Therefore, the sample covariance matrix in Hotelling's T2 statistic is not positive definite and become singular, thus it cannot be inverted. In this research, the Hotelling's T2 statistic is combined with a shrinkage approach as an alternative estimation to estimate the covariance matrix to detect significant gene sets. The use of shrinkage covariance matrix overcomes the singularity problem by converting an unbiased to an improved biased estimator of covariance matrix. Robust trimmed mean is integrated into the shrinkage matrix to reduce the influence of outliers and consequently increases its efficiency. The performance of the proposed method is measured using several simulation designs. The results are expected to outperform existing techniques in many tested conditions.
Croze, Myriam; Živković, Daniel; Stephan, Wolfgang; Hutter, Stephan
2016-08-01
Balancing selection has been widely assumed to be an important evolutionary force, yet even today little is known about its abundance and its impact on the patterns of genetic diversity. Several studies have shown examples of balancing selection in humans, plants or parasites, and many genes under balancing selection are involved in immunity. It has been proposed that host-parasite coevolution is one of the main forces driving immune genes to evolve under balancing selection. In this paper, we review the literature on balancing selection on immunity genes in several organisms, including Drosophila. Furthermore, we performed a genome scan for balancing selection in an African population of Drosophila melanogaster using coalescent simulations of a demographic model with and without selection. We find very few genes under balancing selection and only one novel candidate gene related to immunity. Finally, we discuss the possible causes of the low number of genes under balancing selection. Copyright © 2016 The Authors. Published by Elsevier GmbH.. All rights reserved.
Concave 1-norm group selection
Jiang, Dingfeng; Huang, Jian
2015-01-01
Grouping structures arise naturally in many high-dimensional problems. Incorporation of such information can improve model fitting and variable selection. Existing group selection methods, such as the group Lasso, require correct membership. However, in practice it can be difficult to correctly specify group membership of all variables. Thus, it is important to develop group selection methods that are robust against group mis-specification. Also, it is desirable to select groups as well as individual variables in many applications. We propose a class of concave \\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{upgreek} \\usepackage{mathrsfs} \\setlength{\\oddsidemargin}{-69pt} \\begin{document} }{}$1$\\end{document}-norm group penalties that is robust to grouping structure and can perform bi-level selection. A coordinate descent algorithm is developed to calculate solutions of the proposed group selection method. Theoretical convergence of the algorithm is proved under certain regularity conditions. Comparison with other methods suggests the proposed method is the most robust approach under membership mis-specification. Simulation studies and real data application indicate that the \\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{upgreek} \\usepackage{mathrsfs} \\setlength{\\oddsidemargin}{-69pt} \\begin{document} }{}$1$\\end{document}-norm concave group selection approach achieves better control of false discovery rates. An R package grppenalty implementing the proposed method is available at CRAN. PMID:25417206
Okubo, Nami; Hayward, David C; Forêt, Sylvain; Ball, Eldon E
2016-02-29
Research into various aspects of coral biology has greatly increased in recent years due to anthropogenic threats to coral health including pollution, ocean warming and acidification. However, knowledge of coral early development has lagged. The present paper describes the embryonic development of two previously uncharacterized robust corals, Favia lizardensis (a massive brain coral) and Ctenactis echinata (a solitary coral) and compares it to that of the previously characterized complex coral, Acropora millepora, both morphologically and in terms of the expression of a set of key developmental genes. Illumina sequencing of mixed age embryos was carried out, resulting in embryonic transcriptomes consisting of 40605 contigs for C.echinata (N50 = 1080 bp) and 48536 contigs for F.lizardensis (N50 = 1496 bp). The transcriptomes have been annotated against Swiss-Prot and were sufficiently complete to enable the identification of orthologs of many key genes controlling development in bilaterians. Developmental series of images of whole mounts and sections reveal that the early stages of both species contain a blastocoel, consistent with their membership of the robust clade. In situ hybridization was used to examine the expression of the developmentally important genes brachyury, chordin and forkhead. The expression of brachyury and forkhead was consistent with that previously reported for Acropora and allowed us to confirm that the pseudo-blastopore sometimes seen in robust corals such as Favia spp. is not directly associated with gastrulation. C.echinata chordin expression, however, differed from that seen in the other two corals. Embryonic transcriptomes were assembled for the brain coral Favia lizardensis and the solitary coral Ctenactis echinata. Both species have a blastocoel in their early developmental stages, consistent with their phylogenetic position as members of the robust clade. Expression of the key developmental genes brachyury, chordin and forkhead was investigated, allowing comparison to that of their orthologs in Acropora, Nematostella and bilaterians and demonstrating that even within the Anthozoa there are significant differences in expression patterns.
Gérard, Claude; Novák, Béla
2013-01-01
microRNAs (miRNAs) are small noncoding RNAs that are important post-transcriptional regulators of gene expression. miRNAs can induce thresholds in protein synthesis. Such thresholds in protein output can be also achieved by oligomerization of transcription factors (TF) for the control of gene expression. First, we propose a minimal model for protein expression regulated by miRNA and by oligomerization of TF. We show that miRNA and oligomerization of TF generate a buffer, which increases the robustness of protein output towards molecular noise as well as towards random variation of kinetics parameters. Next, we extend the model by considering that the same miRNA can bind to multiple messenger RNAs, which accounts for the dynamics of a minimal competing endogenous RNAs (ceRNAs) network. The model shows that, through common miRNA regulation, TF can control the expression of all proteins formed by the ceRNA network, even if it drives the expression of only one gene in the network. The model further suggests that the threshold in protein synthesis mediated by the oligomerization of TF can be propagated to the other genes, which can increase the robustness of the expression of all genes in such ceRNA network. Furthermore, we show that a miRNA could increase the time delay of a “Goodwin-like” oscillator model, which may favor the occurrence of oscillations of large amplitude. This result predicts important roles of miRNAs in the control of the molecular mechanisms leading to the emergence of biological rhythms. Moreover, a model for the latter oscillator embedded in a ceRNA network indicates that the oscillatory behavior can be propagated, via the shared miRNA, to all proteins formed by such ceRNA network. Thus, by means of computational models, we show that miRNAs could act as vectors allowing the propagation of robustness in protein synthesis as well as oscillatory behaviors within ceRNA networks. PMID:24376695
Zinser, Erik R; Schneider, Dominique; Blot, Michel; Kolter, Roberto
2003-01-01
The loss of preexisting genes or gene activities during evolution is a major mechanism of ecological specialization. Evolutionary processes that can account for gene loss or inactivation have so far been restricted to one of two mechanisms: direct selection for the loss of gene activities that are disadvantageous under the conditions of selection (i.e., antagonistic pleiotropy) and selection-independent genetic drift of neutral (or nearly neutral) mutations (i.e., mutation accumulation). In this study we demonstrate with an evolved strain of Escherichia coli that a third, distinct mechanism exists by which gene activities can be lost. This selection-dependent mechanism involves the expropriation of one gene's upstream regulatory element by a second gene via a homologous recombination event. Resulting from this genetic exchange is the activation of the second gene and a concomitant inactivation of the first gene. This gene-for-gene expression tradeoff provides a net fitness gain, even if the forfeited activity of the first gene can play a positive role in fitness under the conditions of selection. PMID:12930738
2012-01-01
Background Water stress limits plant survival and production in many parts of the world. Identification of genes and alleles responding to water stress conditions is important in breeding plants better adapted to drought. Currently there are no studies examining the transcriptome wide gene and allelic expression patterns under water stress conditions. We used RNA sequencing (RNA-seq) to identify the candidate genes and alleles and to explore the evolutionary signatures of selection. Results We studied the effect of water stress on gene expression in Eucalyptus camaldulensis seedlings derived from three natural populations. We used reference-guided transcriptome mapping to study gene expression. Several genes showed differential expression between control and stress conditions. Gene ontology (GO) enrichment tests revealed up-regulation of 140 stress-related gene categories and down-regulation of 35 metabolic and cell wall organisation gene categories. More than 190,000 single nucleotide polymorphisms (SNPs) were detected and 2737 of these showed differential allelic expression. Allelic expression of 52% of these variants was correlated with differential gene expression. Signatures of selection patterns were studied by estimating the proportion of nonsynonymous to synonymous substitution rates (Ka/Ks). The average Ka/Ks ratio among the 13,719 genes was 0.39 indicating that most of the genes are under purifying selection. Among the positively selected genes (Ka/Ks > 1.5) apoptosis and cell death categories were enriched. Of the 287 positively selected genes, ninety genes showed differential expression and 27 SNPs from 17 positively selected genes showed differential allelic expression between treatments. Conclusions Correlation of allelic expression of several SNPs with total gene expression indicates that these variants may be the cis-acting variants or in linkage disequilibrium with such variants. Enrichment of apoptosis and cell death gene categories among the positively selected genes reveals the past selection pressures experienced by the populations used in this study. PMID:22853646