Origins of extrinsic variability in eukaryotic gene expression
NASA Astrophysics Data System (ADS)
Volfson, Dmitri; Marciniak, Jennifer; Blake, William J.; Ostroff, Natalie; Tsimring, Lev S.; Hasty, Jeff
2006-02-01
Variable gene expression within a clonal population of cells has been implicated in a number of important processes including mutation and evolution, determination of cell fates and the development of genetic disease. Recent studies have demonstrated that a significant component of expression variability arises from extrinsic factors thought to influence multiple genes simultaneously, yet the biological origins of this extrinsic variability have received little attention. Here we combine computational modelling with fluorescence data generated from multiple promoter-gene inserts in Saccharomyces cerevisiae to identify two major sources of extrinsic variability. One unavoidable source arising from the coupling of gene expression with population dynamics leads to a ubiquitous lower limit for expression variability. A second source, which is modelled as originating from a common upstream transcription factor, exemplifies how regulatory networks can convert noise in upstream regulator expression into extrinsic noise at the output of a target gene. Our results highlight the importance of the interplay of gene regulatory networks with population heterogeneity for understanding the origins of cellular diversity.
Origins of extrinsic variability in eukaryotic gene expression
NASA Astrophysics Data System (ADS)
Volfson, Dmitri; Marciniak, Jennifer; Blake, William J.; Ostroff, Natalie; Tsimring, Lev S.; Hasty, Jeff
2006-03-01
Variable gene expression within a clonal population of cells has been implicated in a number of important processes including mutation and evolution, determination of cell fates and the development of genetic disease. Recent studies have demonstrated that a significant component of expression variability arises from extrinsic factors thought to influence multiple genes in concert, yet the biological origins of this extrinsic variability have received little attention. Here we combine computational modeling with fluorescence data generated from multiple promoter-gene inserts in Saccharomyces cerevisiae to identify two major sources of extrinsic variability. One unavoidable source, arising from the coupling of gene expression with population dynamics, leads to a ubiquitous noise floor in expression variability. A second source, which is modeled as originating from a common upstream transcription factor, exemplifies how regulatory networks can convert noise in upstream regulator expression into extrinsic noise at the output of a target gene. Our results highlight the importance of the interplay of gene regulatory networks with population heterogeneity for understanding the origins of cellular diversity.
Multiple-input multiple-output causal strategies for gene selection.
Bontempi, Gianluca; Haibe-Kains, Benjamin; Desmedt, Christine; Sotiriou, Christos; Quackenbush, John
2011-11-25
Traditional strategies for selecting variables in high dimensional classification problems aim to find sets of maximally relevant variables able to explain the target variations. Although these techniques may be effective in terms of generalization accuracy, they often do not reveal direct causes. This is essentially because high correlation (or relevance) does not imply causation. In this study, we show how to efficiently incorporate causal information into gene selection by moving from a single-input single-output to a multiple-input multiple-output setting. We show in a synthetic case study that a better prioritization of causal variables can be obtained by considering a relevance score which incorporates a causal term. In addition, we show in a meta-analysis of six publicly available breast cancer microarray datasets that accuracy also improves. The biological interpretation of the results confirms the potential of a causal approach to gene selection. Integrating causal information into gene selection algorithms is effective both in terms of prediction accuracy and biological interpretation.
Akimoto, Yuki; Yugi, Katsuyuki; Uda, Shinsuke; Kudo, Takamasa; Komori, Yasunori; Kubota, Hiroyuki; Kuroda, Shinya
2013-01-01
Cells use common signaling molecules for the selective control of downstream gene expression and cell-fate decisions. The relationship between signaling molecules and downstream gene expression and cellular phenotypes is a multiple-input and multiple-output (MIMO) system and is difficult to understand due to its complexity. For example, it has been reported that, in PC12 cells, different types of growth factors activate MAP kinases (MAPKs) including ERK, JNK, and p38, and CREB, for selective protein expression of immediate early genes (IEGs) such as c-FOS, c-JUN, EGR1, JUNB, and FOSB, leading to cell differentiation, proliferation and cell death; however, how multiple inputs such as MAPKs and CREB regulate multiple outputs such as expression of the IEGs and cellular phenotypes remains unclear. To address this issue, we employed a statistical method called partial least squares (PLS) regression, which involves a reduction of the dimensionality of the inputs and outputs into latent variables and a linear regression between these latent variables. We measured 1,200 data points for MAPKs and CREB as the inputs and 1,900 data points for IEGs and cellular phenotypes as the outputs, and we constructed the PLS model from these data. The PLS model highlighted the complexity of the MIMO system and growth factor-specific input-output relationships of cell-fate decisions in PC12 cells. Furthermore, to reduce the complexity, we applied a backward elimination method to the PLS regression, in which 60 input variables were reduced to 5 variables, including the phosphorylation of ERK at 10 min, CREB at 5 min and 60 min, AKT at 5 min and JNK at 30 min. The simple PLS model with only 5 input variables demonstrated a predictive ability comparable to that of the full PLS model. The 5 input variables effectively extracted the growth factor-specific simple relationships within the MIMO system in cell-fate decisions in PC12 cells.
Plevova, Karla; Francova, Hana Skuhrova; Burckova, Katerina; Brychtova, Yvona; Doubek, Michael; Pavlova, Sarka; Malcikova, Jitka; Mayer, Jiri; Tichy, Boris; Pospisilova, Sarka
2014-01-01
In chronic lymphocytic leukemia, usually a monoclonal disease, multiple productive immunoglobulin heavy chain gene rearrangements are identified sporadically. Prognostication of such cases based on immunoglobulin heavy variable gene mutational status can be problematic, especially if the different rearrangements have discordant mutational status. To gain insight into the possible biological mechanisms underlying the origin of the multiple rearrangements, we performed a comprehensive immunogenetic and immunophenotypic characterization of 31 cases with the multiple rearrangements identified in a cohort of 1147 patients with chronic lymphocytic leukemia. For the majority of cases (25/31), we provide evidence of the co-existence of at least two B lymphocyte clones with a chronic lymphocytic leukemia phenotype. We also identified clonal drifts in serial samples, likely driven by selection forces. More specifically, higher immunoglobulin variable gene identity to germline and longer complementarity determining region 3 were preferred in persistent or newly appearing clones, a phenomenon more pronounced in patients with stereotyped B-cell receptors. Finally, we report that other factors, such as TP53 gene defects and therapy administration, influence clonal selection. Our findings are relevant to clonal evolution in the context of antigen stimulation and transition of monoclonal B-cell lymphocytosis to chronic lymphocytic leukemia. PMID:24038023
2012-01-01
Background: We explore the benefits of applying a new proportional hazard model to analyze survival of breast cancer patients. As a parametric model, the hypertabastic survival model offers a closer fit to experimental data than Cox regression, and furthermore provides explicit survival and hazard functions which can be used as additional tools in the survival analysis. In addition, one of our main concerns is utilization of multiple gene expression variables. Our analysis treats the important issue of interaction of different gene signatures in the survival analysis. Methods: The hypertabastic proportional hazards model was applied in survival analysis of breast cancer patients. This model was compared, using statistical measures of goodness of fit, with models based on the semi-parametric Cox proportional hazards model and the parametric log-logistic and Weibull models. The explicit functions for hazard and survival were then used to analyze the dynamic behavior of hazard and survival functions. Results: The hypertabastic model provided the best fit among all the models considered. Use of multiple gene expression variables also provided a considerable improvement in the goodness of fit of the model, as compared to use of only one. By utilizing the explicit survival and hazard functions provided by the model, we were able to determine the magnitude of the maximum rate of increase in hazard, and the maximum rate of decrease in survival, as well as the times when these occurred. We explore the influence of each gene expression variable on these extrema. Furthermore, in the cases of continuous gene expression variables, represented by a measure of correlation, we were able to investigate the dynamics with respect to changes in gene expression. Conclusions: We observed that use of three different gene signatures in the model provided a greater combined effect and allowed us to assess the relative importance of each in determination of outcome in this data set.
These results point to the potential to combine gene signatures to a greater effect in cases where each gene signature represents some distinct aspect of the cancer biology. Furthermore we conclude that the hypertabastic survival models can be an effective survival analysis tool for breast cancer patients. PMID:23241496
Pounds, Stan; Cheng, Cheng; Cao, Xueyuan; Crews, Kristine R; Plunkett, William; Gandhi, Varsha; Rubnitz, Jeffrey; Ribeiro, Raul C; Downing, James R; Lamba, Jatinder
2009-08-15
In some applications, prior biological knowledge can be used to define a specific pattern of association of multiple endpoint variables with a genomic variable that is biologically most interesting. However, to our knowledge, there is no statistical procedure designed to detect specific patterns of association with multiple endpoint variables. Projection onto the most interesting statistical evidence (PROMISE) is proposed as a general procedure to identify genomic variables that exhibit a specific biologically interesting pattern of association with multiple endpoint variables. Biological knowledge of the endpoint variables is used to define a vector that represents the biologically most interesting values for statistics that characterize the associations of the endpoint variables with a genomic variable. A test statistic is defined as the dot-product of the vector of the observed association statistics and the vector of the most interesting values of the association statistics. By definition, this test statistic is proportional to the length of the projection of the observed vector of correlations onto the vector of most interesting associations. Statistical significance is determined via permutation. In simulation studies and an example application, PROMISE shows greater statistical power to identify genes with the interesting pattern of associations than classical multivariate procedures, individual endpoint analyses or listing genes that have the pattern of interest and are significant in more than one individual endpoint analysis. Documented R routines are freely available from www.stjuderesearch.org/depts/biostats and will soon be available as a Bioconductor package from www.bioconductor.org.
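The dot-product statistic and permutation test described in this abstract can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' R routines: the function names (`pearson`, `promise_stat`, `permutation_pvalue`), the use of Pearson correlation as the association statistic, and the one-sided permutation scheme are all assumptions made for the example.

```python
import random
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def promise_stat(x, endpoints, pattern):
    """Dot product of observed associations with the pattern of interest.

    x         -- genomic variable, one value per subject
    endpoints -- list of endpoint variables, each one value per subject
    pattern   -- hypothetical 'most interesting' association values,
                 e.g. [1, -1] for 'up with endpoint 1, down with endpoint 2'
    """
    return sum(w * pearson(x, y) for w, y in zip(pattern, endpoints))


def permutation_pvalue(x, endpoints, pattern, n_perm=2000, seed=0):
    """One-sided permutation p-value (assumed scheme: shuffle x across
    subjects, which leaves the correlation among endpoints intact)."""
    rng = random.Random(seed)
    observed = promise_stat(x, endpoints, pattern)
    shuffled = list(x)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if promise_stat(shuffled, endpoints, pattern) >= observed:
            hits += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return observed, (hits + 1) / (n_perm + 1)
```

A gene whose expression rises with the first endpoint and falls with the second would score highly against the pattern `[1, -1]`, whereas a gene associated with both endpoints in the same direction would score near zero, which is the sense in which the statistic targets one specific pattern rather than any association.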
Konradi, Christine; Sillivan, Stephanie E.; Clay, Hayley B.
2011-01-01
Gene expression studies of bipolar disorder (BPD) have shown changes in transcriptome profiles in multiple brain regions. Here we summarize the most consistent findings in the scientific literature, and compare them to data from schizophrenia (SZ) and major depressive disorder (MDD). The transcriptome profiles of all three disorders overlap, making the existence of a BPD-specific profile unlikely. Three groups of functionally related genes are consistently expressed at altered levels in BPD, SZ and MDD. Genes involved in energy metabolism and mitochondrial function are downregulated, genes involved in immune response and inflammation are upregulated, and genes expressed in oligodendrocytes are downregulated. Experimental paradigms for multiple sclerosis demonstrate a tight link between energy metabolism, inflammation and demyelination. These studies also show variabilities in the extent of oligodendrocyte stress, which can vary from a downregulation of oligodendrocyte genes, such as observed in psychiatric disorders, to cell death and brain lesions seen in multiple sclerosis. We conclude that experimental models of multiple sclerosis could be of interest for the research of BPD, SZ and MDD. PMID:21310238
Patel, Chirag J; Manrai, Arjun K; Corona, Erik; Kohane, Isaac S
2017-02-01
It is hypothesized that environmental exposures and behaviour influence telomere length, an indicator of cellular ageing. We systematically associated 461 indicators of environmental exposures, physiology and self-reported behaviour with telomere length in data from the US National Health and Nutrition Examination Survey (NHANES) in 1999-2002. Further, we tested whether factors identified in the NHANES participants are also correlated with gene expression of telomere length modifying genes. We correlated 461 environmental exposures, behaviours and clinical variables with telomere length, using survey-weighted linear regression, adjusting for sex, age, age squared, race/ethnicity, poverty level, education and born outside the USA, and estimated the false discovery rate to adjust for multiple hypotheses. We conducted a secondary analysis to investigate the correlation between identified environmental variables and gene expression levels of telomere-associated genes in publicly available gene expression samples. After correlating 461 variables with telomere length, we found 22 variables significantly associated with telomere length after adjustment for multiple hypotheses. Of these variables, 14 were associated with longer telomeres, including biomarkers of polychlorinated biphenyls [PCBs; 0.1 to 0.2 standard deviation (SD) increase for 1 SD increase in PCB level, P < 0.002] and a form of vitamin A, retinyl stearate. Eight variables were associated with shorter telomeres, including biomarkers of cadmium, C-reactive protein and lack of physical activity. We could not conclude that PCBs are correlated with gene expression of telomere-associated genes. Both environmental exposures and chronic disease-related risk factors may play a role in telomere length. Our secondary analysis found no evidence of association between PCBs/smoking and gene expression of telomere-associated genes.
All correlations between exposures, behaviours and clinical factors and changes in telomere length will require further investigation regarding biological influence of exposure. © The Author 2016. Published by Oxford University Press on behalf of the International Epidemiological Association
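The abstract above mentions estimating the false discovery rate across 461 tests without naming the procedure. As a hedged illustration, the following Python sketch implements the Benjamini-Hochberg step-up adjustment, a common choice that is assumed here rather than taken from the source; the function name `benjamini_hochberg` is likewise hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values (q-values).

    Assumption: BH is used purely for illustration; the abstract does
    not state which FDR procedure the authors applied.
    """
    m = len(pvalues)
    # Indices of the p-values from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

A variable would then be reported as significant when its adjusted value falls below the chosen FDR threshold, for example 0.05.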
Pounds, Stan; Cheng, Cheng; Cao, Xueyuan; Crews, Kristine R.; Plunkett, William; Gandhi, Varsha; Rubnitz, Jeffrey; Ribeiro, Raul C.; Downing, James R.; Lamba, Jatinder
2009-01-01
Motivation: In some applications, prior biological knowledge can be used to define a specific pattern of association of multiple endpoint variables with a genomic variable that is biologically most interesting. However, to our knowledge, there is no statistical procedure designed to detect specific patterns of association with multiple endpoint variables. Results: Projection onto the most interesting statistical evidence (PROMISE) is proposed as a general procedure to identify genomic variables that exhibit a specific biologically interesting pattern of association with multiple endpoint variables. Biological knowledge of the endpoint variables is used to define a vector that represents the biologically most interesting values for statistics that characterize the associations of the endpoint variables with a genomic variable. A test statistic is defined as the dot-product of the vector of the observed association statistics and the vector of the most interesting values of the association statistics. By definition, this test statistic is proportional to the length of the projection of the observed vector of correlations onto the vector of most interesting associations. Statistical significance is determined via permutation. In simulation studies and an example application, PROMISE shows greater statistical power to identify genes with the interesting pattern of associations than classical multivariate procedures, individual endpoint analyses or listing genes that have the pattern of interest and are significant in more than one individual endpoint analysis. Availability: Documented R routines are freely available from www.stjuderesearch.org/depts/biostats and will soon be available as a Bioconductor package from www.bioconductor.org. Contact: stanley.pounds@stjude.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19528086
Olson, Nathan D.; Lund, Steven P.; Zook, Justin M.; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S.; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B.
2015-01-01
This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030
Liu, Na; Ding, Longzhen; Li, Haijun; Zhang, Pengpeng; Zheng, Jixing; Weng, Chih-Huang
2018-08-01
The study aimed to determine the possible contribution of specific growth conditions and community structures to variable carbon enrichment factor (Ɛ-carbon) values for the degradation of chlorinated ethenes (CEs) by a bacterial consortium with multiple dechlorinating genes. Ɛ-carbon values for trichloroethylene, cis-1,2-dichloroethylene, and vinyl chloride were -7.24% ± 0.59%, -14.6% ± 1.71%, and -21.1% ± 1.14%, respectively, during their degradation by a microbial consortium containing multiple dechlorinating genes including tceA and vcrA. The Ɛ-carbon values of all CEs were not greatly affected by changes in growth conditions and community structures, which directly or indirectly affected reductive dechlorination of CEs by this consortium. Stability analysis provided evidence that the presence of multiple dechlorinating genes within a microbial consortium had little effect on carbon isotope fractionation, as long as the genes have definite, non-overlapping functions. Copyright © 2018 Elsevier Ltd. All rights reserved.
Sadee, Wolfgang
2013-09-01
Pharmacogenetic biomarker tests include mostly specific single gene-drug pairs, capable of accounting for a portion of interindividual variability in drug response and toxicity. However, multiple genes are likely to contribute, either acting independently or epistatically, with the CYP2C9-VKORC1-warfarin test panel being an example of a clinically used gene-gene-drug interaction. I discuss here further instances of gene-gene-drug interactions, including a proposed dynamic effect on statin therapy by genetic variants in both a transporter (SLCO1B1) and a metabolizing enzyme (CYP3A4) in liver cells, the main target site where statins block cholesterol synthesis. These examples set a conceptual framework for developing diagnostic panels involving multiple gene-drug combinations. Copyright © 2013 Wiley Periodicals, Inc.
Effect of promoter architecture on the cell-to-cell variability in gene expression.
Sanchez, Alvaro; Garcia, Hernan G; Jones, Daniel; Phillips, Rob; Kondev, Jané
2011-03-01
According to recent experimental evidence, promoter architecture, defined by the number, strength and regulatory role of the operators that control transcription, plays a major role in determining the level of cell-to-cell variability in gene expression. These quantitative experiments call for a corresponding modeling effort that addresses the question of how changes in promoter architecture affect variability in gene expression in a systematic rather than case-by-case fashion. In this article we make such a systematic investigation, based on a microscopic model of gene regulation that incorporates stochastic effects. In particular, we show how operator strength and operator multiplicity affect this variability. We examine different modes of transcription factor binding to complex promoters (cooperative, independent, simultaneous) and how each of these affects the level of variability in transcriptional output from cell-to-cell. We propose that direct comparison between in vivo single-cell experiments and theoretical predictions for the moments of the probability distribution of mRNA number per cell can be used to test kinetic models of gene regulation. The emphasis of the discussion is on prokaryotic gene regulation, but our analysis can be extended to eukaryotic cells as well.
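To make the idea of comparing measured mRNA moments with model predictions concrete, here is a minimal Python sketch for the simplest stochastic promoter model, a two-state ("telegraph") promoter that switches between an active and an inactive state. This model is chosen as an illustrative assumption and is not claimed to be one of the specific architectures analysed in the article; the function name and the parameter names (`k_on`, `k_off`, `r`, `gamma`) are hypothetical. The closed-form mean and Fano factor used are the standard steady-state results for this model.

```python
def telegraph_moments(k_on, k_off, r, gamma):
    """Steady-state mean and Fano factor of mRNA number per cell.

    k_on  -- rate of switching from the inactive to the active state
    k_off -- rate of switching from the active to the inactive state
    r     -- transcription rate while the promoter is active
    gamma -- mRNA degradation rate
    """
    p_on = k_on / (k_on + k_off)          # fraction of time active
    mean = (r / gamma) * p_on
    # Fano factor = variance / mean; equals 1 for a Poisson distribution.
    fano = 1.0 + r * k_off / ((k_on + k_off) * (k_on + k_off + gamma))
    return mean, fano
```

In this sketch a promoter that never switches off (`k_off = 0`) gives a Fano factor of exactly 1, while slow switching relative to mRNA decay pushes the Fano factor above 1, which is the kind of signature that single-cell mRNA counts could be checked against.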
Effect of Promoter Architecture on the Cell-to-Cell Variability in Gene Expression
Sanchez, Alvaro; Garcia, Hernan G.; Jones, Daniel; Phillips, Rob; Kondev, Jané
2011-01-01
According to recent experimental evidence, promoter architecture, defined by the number, strength and regulatory role of the operators that control transcription, plays a major role in determining the level of cell-to-cell variability in gene expression. These quantitative experiments call for a corresponding modeling effort that addresses the question of how changes in promoter architecture affect variability in gene expression in a systematic rather than case-by-case fashion. In this article we make such a systematic investigation, based on a microscopic model of gene regulation that incorporates stochastic effects. In particular, we show how operator strength and operator multiplicity affect this variability. We examine different modes of transcription factor binding to complex promoters (cooperative, independent, simultaneous) and how each of these affects the level of variability in transcriptional output from cell-to-cell. We propose that direct comparison between in vivo single-cell experiments and theoretical predictions for the moments of the probability distribution of mRNA number per cell can be used to test kinetic models of gene regulation. The emphasis of the discussion is on prokaryotic gene regulation, but our analysis can be extended to eukaryotic cells as well. PMID:21390269
Systems Biophysics of Gene Expression
Vilar, Jose M.G.; Saiz, Leonor
2013-01-01
Gene expression is a process central to any form of life. It involves multiple temporal and functional scales that extend from specific protein-DNA interactions to the coordinated regulation of multiple genes in response to intracellular and extracellular changes. This diversity in scales poses fundamental challenges to the use of traditional approaches to fully understand even the simplest gene expression systems. Recent advances in computational systems biophysics have provided promising avenues to reliably integrate the molecular detail of biophysical process into the system behavior. Here, we review recent advances in the description of gene regulation as a system of biophysical processes that extend from specific protein-DNA interactions to the combinatorial assembly of nucleoprotein complexes. There is now basic mechanistic understanding on how promoters controlled by multiple, local and distal, DNA binding sites for transcription factors can actively control transcriptional noise, cell-to-cell variability, and other properties of gene regulation, including precision and flexibility of the transcriptional responses. PMID:23790365
Hahntow, Ines N; Mairuhu, Gideon; van Valkengoed, Irene Gm; Koopmans, Richard P; Michel, Martin C
2010-06-02
Genotype-phenotype association studies are typically based upon polymorphisms or haplotypes comprised of multiple polymorphisms within a single gene. It has been proposed that combinations of polymorphisms in distinct genes, which functionally impact the same phenotype, may have stronger phenotype associations than those within a single gene. We have tested this hypothesis using genes encoding components of the renin-angiotensin-aldosterone system and the high blood pressure phenotype. Our analysis is based on 1379 participants of the cross-sectional SUNSET study randomly selected from the population register of Amsterdam. Each subject was genotyped for the angiotensinogen M235T, the angiotensin-converting enzyme insertion/deletion and the angiotensin II type 1 receptor A1166C polymorphism. The phenotype high blood pressure was defined either as a categorical variable comparing hypertension versus normotension as in most previous studies or as a continuous variable using systolic, diastolic and mean blood pressure in a multiple regression analysis with gender, ethnicity, age, body-mass-index and antihypertensive medication as covariates. Genotype-phenotype relationships were explored for each polymorphism in isolation and for double and triple polymorphism combinations. At the single polymorphism level, only the A allele of the angiotensin II type 1 receptor was associated with a high blood pressure phenotype. Using combinations of polymorphisms of two or all three genes did not yield stronger/more consistent associations. We conclude that combinations of physiologically related polymorphisms of multiple genes, at least with regard to the renin-angiotensin-aldosterone system and the hypertensive phenotype, do not necessarily offer additional benefit in analyzing genotype/phenotype associations.
Sánchez, Brisa N; Kang, Shan; Mukherjee, Bhramar
2012-06-01
Many existing cohort studies initially designed to investigate disease risk as a function of environmental exposures have collected genomic data in recent years with the objective of testing for gene-environment interaction (G × E) effects. In environmental epidemiology, interest in G × E arises primarily after a significant effect of the environmental exposure has been documented. Cohort studies often collect rich exposure data; as a result, assessing G × E effects in the presence of multiple exposure markers further increases the burden of multiple testing, an issue already present in both genetic and environmental health studies. Latent variable (LV) models have been used in environmental epidemiology to reduce dimensionality of the exposure data, gain power by reducing multiplicity issues via condensing exposure data, and avoid collinearity problems due to the presence of multiple correlated exposures. We extend the LV framework to characterize gene-environment interaction in the presence of multiple correlated exposures and genotype categories. Further, similar to what has been done in case-control G × E studies, we use the assumption of gene-environment (G-E) independence to boost the power of tests for interaction. Neither the consequences of making this assumption nor the issue of how to explicitly model G-E association has been previously investigated in LV models. We postulate a hierarchy of assumptions about the LV model regarding the different forms of G-E dependence and show that making such assumptions may influence inferential results on the G, E, and G × E parameters. We implement a class of shrinkage estimators that data-adaptively trade off between the most restrictive and the most flexible forms of the G-E dependence assumption, and note that such a class of compromise estimators can serve as a benchmark of model adequacy in LV models.
We demonstrate the methods with an example from the Early Life Exposures in Mexico City to Neuro-Toxicants Study of lead exposure, iron metabolism genes, and birth weight. © 2011, The International Biometric Society.
PanACEA: a bioinformatics tool for the exploration and visualization of bacterial pan-chromosomes.
Clarke, Thomas H; Brinkac, Lauren M; Inman, Jason M; Sutton, Granger; Fouts, Derrick E
2018-06-27
Bacterial pan-genomes, comprised of conserved and variable genes across multiple sequenced bacterial genomes, allow for identification of genomic regions that are phylogenetically discriminating or functionally important. Pan-genomes consist of large amounts of data, which can restrict researchers' ability to locate and analyze these regions. Multiple software packages are available to visualize pan-genomes, but currently their ability to address these concerns is limited by using only pre-computed data sets, prioritizing core over variable gene clusters, or not accounting for pan-chromosome positioning in the viewer. We introduce PanACEA (Pan-genome Atlas with Chromosome Explorer and Analyzer), which utilizes locally-computed interactive web-pages to view ordered pan-genome data. It consists of multi-tiered, hierarchical display pages that extend from pan-chromosomes to both core and variable regions to single genes. Regions and genes are functionally annotated to allow for rapid searching and visual identification of regions of interest, with the option to incorporate user-supplied genomic phylogenies and metadata. PanACEA's memory and time requirements are within the capacities of standard laptops. The capability of PanACEA as a research tool is demonstrated by highlighting a variable region important in differentiating strains of Enterobacter hormaechei. PanACEA can rapidly translate the results of pan-chromosome programs into an intuitive and interactive visual representation. It will empower researchers to visually explore and identify regions of the pan-chromosome that are most biologically interesting, and to obtain publication quality images of these regions.
Tapia, Lorena I.; Shaw, Chad A.; Aideyan, Letisha O.; Jewell, Alan M.; Dawson, Brian C.; Haq, Taha R.; Piedra, Pedro A.
2014-01-01
Human respiratory syncytial virus (HRSV) has three surface glycoproteins: small hydrophobic (SH), attachment (G) and fusion (F), encoded by three consecutive genes (SH-G-F). A 270-nt fragment of the G gene is used to genotype HRSV isolates. This study genotyped and investigated the variability of the gene and amino acid sequences of the three surface proteins of HRSV strains collected from 1987 to 2005 from one center. Sixty original clinical isolates and 5 prototype strains were analyzed. Sequences containing SH, F and G genes were generated, and multiple alignments and phylogenetic trees were analyzed. Genetic variability by protein domains comparing virus genotypes was assessed. Complete sequences of the SH-G-F genes were obtained for all 65 samples: HRSV-A = 35; HRSV-B = 30. In group A strains, genotypes GA5 and GA2 were predominant. For HRSV-B strains, the genotype GB4 was predominant from 1992 to 1994 and only genotype BA viruses were detected in 2004–2005. Genetic variability at the nucleotide level differed between the genes, with the G gene being the most variable and the highest variability detected in the 270-nt G fragment that is frequently used to genotype the virus. High variability (>10%) was also detected in the signal peptide and transmembrane domains of the F gene of HRSV-A strains. Variability among the HRSV strains resulting in non-synonymous changes was detected in hypervariable domains of the G protein, the signal peptide of the F protein, a previously undefined domain in the F protein, and the antigenic site Ø in the pre-fusion F. Divergent trends were observed between HRSV-A and HRSV-B groups for some functional domains. A diverse population of HRSV-A and HRSV-B genotypes circulated in Houston during an 18-year period. We hypothesize that diverse sequence variation of the surface protein genes provides HRSV strains a survival advantage in a partially immune-protected community. PMID:24625544
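The abstract above reports nucleotide variability as a percentage but does not say how it was computed. One common measure is the proportion of aligned sites at which two sequences differ (the p-distance). The sketch below illustrates that measure only; the function name and the toy fragments are hypothetical and are not HRSV data or the authors' method.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of aligned, ungapped sites at which two sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # skip alignment gaps
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no ungapped sites to compare")
    return differing / compared


# Toy, made-up fragments (assumption: already aligned)
frag_1 = "ATGGCTAACC"
frag_2 = "ATGACTAATC"
print(f"p-distance: {p_distance(frag_1, frag_2):.2f}")
```

Under this measure, a domain would exceed a 10% variability cutoff when more than one in ten compared sites differ.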
Intelligence: shared genetic basis between Mendelian disorders and a polygenic trait.
Franić, Sanja; Groen-Blokhuis, Maria M; Dolan, Conor V; Kattenberg, Mathijs V; Pool, René; Xiao, Xiangjun; Scheet, Paul A; Ehli, Erik A; Davies, Gareth E; van der Sluis, Sophie; Abdellaoui, Abdel; Hansell, Narelle K; Martin, Nicholas G; Hudziak, James J; van Beijsterveldt, Catherina E M; Swagerman, Suzanne C; Hulshoff Pol, Hilleke E; de Geus, Eco J C; Bartels, Meike; Ropers, H Hilger; Hottenga, Jouke-Jan; Boomsma, Dorret I
2015-10-01
Multiple inquiries into the genetic etiology of human traits indicated an overlap between genes underlying monogenic disorders (eg, skeletal growth defects) and those affecting continuous variability of related quantitative traits (eg, height). Extending the idea of a shared genetic basis between a Mendelian disorder and a classic polygenic trait, we performed an association study to examine the effect of 43 genes implicated in autosomal recessive cognitive disorders on intelligence in an unselected Dutch population (N=1316). Using both single-nucleotide polymorphism (SNP)- and gene-based association testing, we detected an association between intelligence and the genes of interest, with genes ELP2, TMEM135, PRMT10, and RGS7 showing the strongest associations. This is a demonstration of the relevance of genes implicated in monogenic disorders of intelligence to normal-range intelligence, and a corroboration of the utility of employing knowledge on monogenic disorders in identifying the genetic variability underlying complex traits.
A Fast Multiple-Kernel Method With Applications to Detect Gene-Environment Interaction.
Marceau, Rachel; Lu, Wenbin; Holloway, Shannon; Sale, Michèle M; Worrall, Bradford B; Williams, Stephen R; Hsu, Fang-Chi; Tzeng, Jung-Ying
2015-09-01
Kernel machine (KM) models are a powerful tool for exploring associations between sets of genetic variants and complex traits. Although most KM methods use a single kernel function to assess the marginal effect of a variable set, KM analyses involving multiple kernels have become increasingly popular. Multikernel analysis allows researchers to study more complex problems, such as assessing gene-gene or gene-environment interactions, incorporating variance-component based methods for population substructure into rare-variant association testing, and assessing the conditional effects of a variable set adjusting for other variable sets. The KM framework is robust, powerful, and provides efficient dimension reduction for multifactor analyses, but requires the estimation of high dimensional nuisance parameters. Traditional estimation techniques, including regularization and the "expectation-maximization (EM)" algorithm, have a large computational cost and are not scalable to large sample sizes needed for rare variant analysis. Therefore, under the context of gene-environment interaction, we propose a computationally efficient and statistically rigorous "fastKM" algorithm for multikernel analysis that is based on a low-rank approximation to the nuisance effect kernel matrices. Our algorithm is applicable to various trait types (e.g., continuous, binary, and survival traits) and can be implemented using any existing single-kernel analysis software. Through extensive simulation studies, we show that our algorithm has similar performance to an EM-based KM approach for quantitative traits while running much faster. We also apply our method to the Vitamin Intervention for Stroke Prevention (VISP) clinical trial, examining gene-by-vitamin effects on recurrent stroke risk and gene-by-age effects on change in homocysteine level. © 2015 WILEY PERIODICALS, INC.
Genomic and Epigenomic Insights into Nutrition and Brain Disorders
Dauncey, Margaret Joy
2013-01-01
Considerable evidence links many neuropsychiatric, neurodevelopmental and neurodegenerative disorders with multiple complex interactions between genetics and environmental factors such as nutrition. Mental health problems, autism, eating disorders, Alzheimer’s disease, schizophrenia, Parkinson’s disease and brain tumours are related to individual variability in numerous protein-coding and non-coding regions of the genome. However, genotype does not necessarily determine neurological phenotype because the epigenome modulates gene expression in response to endogenous and exogenous regulators, throughout the life-cycle. Studies using both genome-wide analysis of multiple genes and comprehensive analysis of specific genes are providing new insights into genetic and epigenetic mechanisms underlying nutrition and neuroscience. This review provides a critical evaluation of the following related areas: (1) recent advances in genomic and epigenomic technologies, and their relevance to brain disorders; (2) the emerging role of non-coding RNAs as key regulators of transcription, epigenetic processes and gene silencing; (3) novel approaches to nutrition, epigenetics and neuroscience; (4) gene-environment interactions, especially in the serotonergic system, as a paradigm of the multiple signalling pathways affected in neuropsychiatric and neurological disorders. Current and future advances in these four areas should contribute significantly to the prevention, amelioration and treatment of multiple devastating brain disorders. PMID:23503168
Pravica, Vera; Popadic, Dusan; Savic, Emina; Markovic, Milos; Drulovic, Jelena; Mostarica-Stojkovic, Marija
2012-04-01
Multiple sclerosis (MS) is a chronic inflammatory demyelinating and neurodegenerative disease of the central nervous system characterized by an unpredictable and variable clinical course. The etiology of MS involves both genetic and environmental factors. New technologies have identified genetic polymorphisms associated with MS susceptibility, among which immunologically relevant genes are significantly overrepresented. Although individual genes contribute only a small part to MS susceptibility, they might be used as biomarkers, helping to establish an accurate diagnosis and to predict the clinical disease course and response to therapy. This review focuses on recent progress in research on MS genetics, with special emphasis on the possibility of using single nucleotide polymorphisms of candidate genes as biomarkers of susceptibility to disease and response to therapy.
MU OPIOID RECEPTORS IN PAIN MANAGEMENT
Pasternak, Gavril; Pan, Ying-Xian
2014-01-01
Most of the potent analgesics currently in use act through the mu opioid receptor. Although they are classified as mu opioids, clinical experience suggests differences among them. The relative potencies of the agents can vary from patient to patient, as well as the side-effect profiles. These observations, coupled with pharmacological approaches in preclinical models, led to the suggestion of multiple subtypes of mu receptors. The explosion in molecular biology has led to the identification of a single gene encoding mu opioid receptors. It now appears that this gene undergoes extensive splicing, in which a single gene can generate multiple proteins. Evidence now suggests that these splice variants may help explain the clinical variability in responses among patients. PMID:21453899
Xu, Man K.; Gaysina, Darya; Tsonaka, Roula; Morin, Alexandre J. S.; Croudace, Tim J.; Barnett, Jennifer H.; Houwing-Duistermaat, Jeanine; Richards, Marcus; Jones, Peter B.
2017-01-01
Very few molecular genetic studies of personality traits have used longitudinal phenotypic data; therefore, the molecular basis for developmental change and stability of personality remains to be explored. We examined the role of the monoamine oxidase A gene (MAOA) on extraversion and neuroticism from adolescence to adulthood, using modern latent variable methods. A sample of 1,160 male and 1,180 female participants with complete genotyping data was drawn from a British national birth cohort, the MRC National Survey of Health and Development (NSHD). The predictor variable was based on a latent variable representing genetic variations of the MAOA gene measured by three SNPs (rs3788862, rs5906957, and rs979606). Latent phenotype variables were constructed using psychometric methods to represent cross-sectional and longitudinal phenotypes of extraversion and neuroticism measured at ages 16 and 26. In males, the MAOA genetic latent variable (AAG) was associated with lower extraversion score at age 16 (β = −0.167; CI: −0.289, −0.045; p = 0.007, FDRp = 0.042), as well as greater increase in extraversion score from 16 to 26 years (β = 0.197; CI: 0.067, 0.328; p = 0.003, FDRp = 0.036). No genetic association was found for neuroticism after adjustment for multiple testing. Although we did not find statistically significant associations after multiple testing correction in females, this result needs to be interpreted with caution due to issues related to X-inactivation in females. The latent variable method is an effective way of modeling phenotype- and genetic-based variances and may therefore improve the methodology of molecular genetic studies of complex psychological traits. PMID:29075213
Dru, P.; Bras, F.; Dezelee, S.; Gay, P.; Petitjean, A. M.; Pierre-Deneubourg, A.; Teninges, D.; Contamine, D.
1993-01-01
The ref(2)P gene of Drosophila melanogaster was identified by the discovery of two alleles, P(o) and P(p), respectively, permissive and restrictive for sigma rhabdovirus multiplication. A surprising variability of this gene was first noticed by the observation of size differences between the transcripts of permissive and restrictive alleles. In this paper, another restrictive allele, P(n), clearly distinct from P(p), is described: it exhibits a weaker antiviral effect than P(p) and differs from P(p) by its molecular structure. Five types of alleles were distinguished on the basis of their molecular structure, as revealed by S1 nuclease analysis of 17 D. melanogaster strains; three alleles were permissive and two restrictive. Comparison of the sequences of four haplotypes revealed numerous point mutations, two deletions (21 and 24 bp) and a complex event involving a 3-bp deletion, all affecting the coding region. The unusual variability of the ref(2)P locus was confirmed by the high ratio of amino acid replacements to synonymous mutations (7:1), as compared to that of other genes, such as Adh (2:42). Nevertheless, nucleotide sequence comparison with the Drosophila erecta ref(2)P gene shows that selective pressures are exerted to maintain the existence of a functional protein. The effects of this high variability on the ref(2)P protein are discussed in relation to its specific antiviral properties and to its function in D. melanogaster, where it is required for male fertility. PMID:8462852
Centanni, T M; Pantazis, D; Truong, D T; Gruen, J R; Gabrieli, J D E; Hogan, T P
2018-05-26
Individuals with dyslexia exhibit increased brainstem variability in response to sound. It is unknown whether increased variability extends to neocortical regions associated with audition and reading, whether it extends to visual stimuli, and whether it characterizes all children with dyslexia or, instead, a specific subset of children. We evaluated the consistency of stimulus-evoked neural responses in children with (N = 20) or without dyslexia (N = 12) as measured by magnetoencephalography (MEG). Approximately half of the children with dyslexia had significantly higher levels of variability in cortical responses to both auditory and visual stimuli in multiple nodes of the reading network. There was a significant and positive relationship between the number of risk alleles at rs6935076 in the dyslexia-susceptibility gene KIAA0319 and the degree of neural variability in primary auditory cortex across all participants. This gene has been linked with neural variability in rodents and in typical readers. These findings indicate that unstable representations of auditory and visual stimuli in auditory and other reading-related neocortical regions are present in a subset of children with dyslexia and support the link between the gene KIAA0319 and auditory neural variability across children with or without dyslexia. Copyright © 2018 The Authors. Published by Elsevier Ltd. All rights reserved.
Haake, David A.; Suchard, Marc A.; Kelley, Melissa M.; Dundoo, Manjula; Alt, David P.; Zuerner, Richard L.
2004-01-01
Leptospires belong to a genus of parasitic bacterial spirochetes that have adapted to a broad range of mammalian hosts. Mechanisms of leptospiral molecular evolution were explored by sequence analysis of four genes shared by 38 strains belonging to the core group of pathogenic Leptospira species: L. interrogans, L. kirschneri, L. noguchii, L. borgpetersenii, L. santarosai, and L. weilii. The 16S rRNA and lipL32 genes were highly conserved, and the lipL41 and ompL1 genes were significantly more variable. Synonymous substitutions are distributed throughout the ompL1 gene, whereas nonsynonymous substitutions are clustered in four variable regions encoding surface loops. While phylogenetic trees for the 16S, lipL32, and lipL41 genes were relatively stable, 8 of 38 (20%) ompL1 sequences had mosaic compositions consistent with horizontal transfer of DNA between related bacterial species. A novel Bayesian multiple change point model was used to identify the most likely sites of recombination and to determine the phylogenetic relatedness of the segments of the mosaic ompL1 genes. Segments of the mosaic ompL1 genes encoding two of the surface-exposed loops were likely acquired by horizontal transfer from a peregrine allele of unknown ancestry. Identification of the most likely sites of recombination with the Bayesian multiple change point model, an approach which has not previously been applied to prokaryotic gene sequence analysis, serves as a model for future studies of recombination in molecular evolution of genes. PMID:15090524
Integrative Exploratory Analysis of Two or More Genomic Datasets.
Meng, Chen; Culhane, Aedin
2016-01-01
Exploratory analysis is an essential step in the analysis of high throughput data. Multivariate approaches such as correspondence analysis (CA), principal component analysis, and multidimensional scaling are widely used in the exploratory analysis of a single dataset. Modern biological studies often assay multiple types of biological molecules (e.g., mRNA, protein, phosphoproteins) on the same set of biological samples, thereby creating multiple different types of omics data or multiassay data. Integrative exploratory analysis of these multiple omics data is required to leverage the potential of multiple omics studies. In this chapter, we describe the application of co-inertia analysis (CIA; for analyzing two datasets) and multiple co-inertia analysis (MCIA; for three or more datasets) to address this problem. These methods are powerful yet simple multivariate approaches that represent samples using a lower number of variables, allowing easier identification of the correlated structure in and between multiple high dimensional datasets. Graphical representations can be employed for this purpose. In addition, the methods simultaneously project samples and variables (genes, proteins) onto the same lower dimensional space, so the most variant variables from each dataset can be selected and associated with samples, which can be further used to facilitate biological interpretation and pathway analysis. We applied CIA to explore the concordance between mRNA and protein expression in a panel of 60 tumor cell lines from the National Cancer Institute. In the same 60 cell lines, we used MCIA to perform a cross-platform comparison of mRNA gene expression profiles obtained on four different microarray platforms. Last, as an example of integrative analysis of multiassay or multi-omics data, we analyzed transcriptomic, proteomic, and phosphoproteomic data from pluripotent (iPS) and embryonic stem (ES) cell lines.
Shi, Yuhong; Azimzadeh, Pedram; Jamingal, Sarada; Wentworth, Shannon; Ferlitch, Janice; Koh, James; Balenga, Nariman; Olson, John A
2018-01-01
Parathyroid tumors are mostly considered monoclonal neoplasms, the rationale for focused parathyroidectomy in primary hyperparathyroidism. We reported that flow sorting parathyroid tumor cells and methylation-sensitive polymerase chain reaction (me-PCR) of polymorphic human androgen receptor gene and phosphoglycerate kinase gene alleles in deoxyribonucleic acid reveals that ≤35% of parathyroid tumors are polyclonal. We sought to confirm these findings and assess for clinical relevance. Parathyroid tumors from 286 female primary hyperparathyroidism patients were analyzed for clonal status. Tumor clonal status was compared with clinical variables and operative findings. Statistical analysis was performed and significance was established at P < .05. In the study, 176 (62%) patients were informative for human androgen receptor gene and/or phosphoglycerate kinase gene. Assignment of clonal status was made in 119 (68%) tumors, of which 64 (54%) were monoclonal and 55 (46%) were polyclonal. Comparison of tumor clonal status to clinical variables in patients with complete operative data (N = 82) showed that while clinical features were the same between tumor types, patients with polyclonal tumors more often had multiple gland disease (risk ratio 4.066, confidence interval, 1.016-16.26; P = .039) potentially missed at unilateral neck exploration. This work confirms that primary hyperparathyroidism is often the result of polyclonal tumors and that parathyroid tumor clonal status may be associated with multiple gland disease. Copyright © 2017 Elsevier Inc. All rights reserved.
A stochastic model for optimizing composite predictors based on gene expression profiles.
Ramanathan, Murali
2003-07-01
This project was done to develop a mathematical model for optimizing composite predictors based on gene expression profiles from DNA arrays and proteomics. The problem was amenable to a formulation and solution analogous to the portfolio optimization problem in mathematical finance: it requires the optimization of a quadratic function subject to linear constraints. The performance of the approach was compared to that of neighborhood analysis using a data set containing cDNA array-derived gene expression profiles from 14 multiple sclerosis patients receiving intramuscular interferon-beta1a. The Markowitz portfolio model predicts that the covariance between genes can be exploited to construct an efficient composite. The model predicts that a composite is not needed for maximizing the mean value of a treatment effect: only a single gene is needed, but the usefulness of the effect measure may be compromised by high variability. The model optimized the composite to yield the highest mean for a given level of variability or the least variability for a given mean level. The choices that meet these optimization criteria lie on a curve in the plot of composite mean vs. composite variability, referred to as the "efficient frontier." When a composite is constructed using the model, it outperforms the composite constructed using the neighborhood analysis method. The Markowitz portfolio model may find potential applications in constructing composite biomarkers and in the pharmacogenomic modeling of treatment effects derived from gene expression endpoints.
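The trade-off described in the abstract above can be sketched for the simplest case of a two-gene composite with weights w and 1 - w. The formulas are the standard mean and variance of a weighted sum; the function names and all numbers are hypothetical illustrations, not values or code from the paper.

```python
def composite_stats(w, mean_1, mean_2, var_1, var_2, cov_12):
    """Mean and variance of the composite w*gene_1 + (1 - w)*gene_2."""
    mean = w * mean_1 + (1 - w) * mean_2
    var = w * w * var_1 + (1 - w) ** 2 * var_2 + 2 * w * (1 - w) * cov_12
    return mean, var


def min_variance_weight(var_1, var_2, cov_12):
    """Weight on gene_1 that minimizes composite variance (may fall outside [0, 1])."""
    denom = var_1 + var_2 - 2 * cov_12  # equals Var(gene_1 - gene_2)
    if denom <= 0:
        raise ValueError("variance is not strictly convex in w for these inputs")
    return (var_2 - cov_12) / denom


# Made-up treatment-effect statistics for two genes (assumption)
mean_1, var_1 = 3.0, 4.0   # large effect, noisy
mean_2, var_2 = 1.0, 1.0   # small effect, stable
cov_12 = 0.0

w_star = min_variance_weight(var_1, var_2, cov_12)
for w in (1.0, w_star, 0.0):
    m, v = composite_stats(w, mean_1, mean_2, var_1, var_2, cov_12)
    print(f"w={w:.2f}  mean={m:.2f}  variance={v:.2f}")
```

In this toy case the highest mean comes from gene 1 alone (w = 1), while a mixed composite has lower variance than either gene on its own, which is the pattern the abstract attributes to the model.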
Lindstrom, Stephen E.; Hiromoto, Yasuaki; Nishimura, Hidekazu; Saito, Takehiko; Nerome, Reiko; Nerome, Kuniaki
1999-01-01
Phylogenetic profiles of the genes coding for the hemagglutinin (HA) protein, nucleoprotein (NP), matrix (M) protein, and nonstructural (NS) proteins of influenza B viruses isolated from 1940 to 1998 were analyzed in a parallel manner in order to understand the evolutionary mechanisms of these viruses. Unlike human influenza A (H3N2) viruses, the evolutionary pathways of all four genes of recent influenza B viruses revealed similar patterns of genetic divergence into two major lineages. Although evolutionary rates of the HA, NP, M, and NS genes of influenza B viruses were estimated to be generally lower than those of human influenza A viruses, genes of influenza B viruses demonstrated complex phylogenetic patterns, indicating alternative mechanisms for generation of virus variability. Topologies of the evolutionary trees of each gene were determined to be quite distinct from one another, showing that these genes were evolving in an independent manner. Furthermore, variable topologies were apparently the result of frequent genetic exchange among cocirculating epidemic viruses. Evolutionary analysis done in the present study provided further evidence for cocirculation of multiple lineages as well as sequestering and reemergence of phylogenetic lineages of the internal genes. In addition, comparison of deduced amino acid sequences revealed a novel amino acid deletion in the HA1 domain of the HA protein of recent isolates from 1998 belonging to the B/Yamagata/16/88-like lineage. It thus became apparent that, despite lower evolutionary rates, influenza B viruses were able to generate genetic diversity among circulating viruses through a combination of evolutionary mechanisms involving cocirculating lineages and genetic reassortment by which new variants with distinct gene constellations emerged. PMID:10196339
Tiffin, Nicki; Meintjes, Ayton; Ramesar, Rajkumar; Bajic, Vladimir B.; Rayner, Brian
2010-01-01
Multiple factors underlie susceptibility to essential hypertension, including a significant genetic and ethnic component, and environmental effects. Blood pressure response of hypertensive individuals to salt is heterogeneous, but salt sensitivity appears more prevalent in people of indigenous African origin. The underlying genetics of salt-sensitive hypertension, however, are poorly understood. In this study, computational methods including text- and data-mining have been used to select and prioritize candidate aetiological genes for salt-sensitive hypertension. Additionally, we have compared allele frequencies and copy number variation for single nucleotide polymorphisms in candidate genes between indigenous Southern African and Caucasian populations, with the aim of identifying candidate genes with significant variability between the population groups: identifying genetic variability between population groups can exploit ethnic differences in disease prevalence to aid with prioritisation of good candidate genes. Our top-ranking candidate genes include parathyroid hormone precursor (PTH) and type-1 angiotensin II receptor (AGTR1). We propose that the candidate genes identified in this study warrant further investigation as potential aetiological genes for salt-sensitive hypertension. PMID:20886000
Nock, NL; Zhang, LX
2011-11-29
Methods that can evaluate aggregate effects of rare and common variants are limited. Therefore, we applied a two-stage approach to evaluate aggregate gene effects in the 1000 Genomes Project data, which contain 24,487 single-nucleotide polymorphisms (SNPs) in 697 unrelated individuals from 7 populations. In stage 1, we identified potentially interesting genes (PIGs) as those having at least one SNP meeting Bonferroni correction using univariate, multiple regression models. In stage 2, we evaluated aggregate PIG effects on trait Q1 by modeling each gene as a latent construct, which is defined by multiple common and rare variants, using the multivariate statistical framework of structural equation modeling (SEM). In stage 1, we found that PIGs varied markedly between a randomly selected replicate (replicate 137) and 100 other replicates, with the exception of FLT1. In stage 1, collapsing rare variants decreased false positives but increased false negatives. In stage 2, we developed a good-fitting SEM model that included all nine genes simulated to affect Q1 (FLT1, KDR, ARNT, ELAV4, FLT4, HIF1A, HIF3A, VEGFA, VEGFC) and found that FLT1 had the largest effect on Q1 (βstd = 0.33 ± 0.05). Using replicate 137 estimates as population values, we found that the mean relative bias in the parameters (loadings, paths, residuals) and their standard errors across 100 replicates was, on average, less than 5%. Our latent variable SEM approach provides a viable framework for modeling aggregate effects of rare and common variants in multiple genes, but more elegant methods are needed in stage 1 to minimize type I and type II error.
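The stage 1 screen described above (keep a gene if at least one of its SNPs passes a Bonferroni-corrected threshold) can be sketched as follows. The abstract does not state the significance level, so alpha = 0.05 is an assumption here, and the function names, gene labels and p-values are hypothetical placeholders rather than results from the study.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


def screen_genes(snp_pvalues: dict, alpha: float = 0.05) -> list:
    """Return genes with at least one SNP p-value below the corrected threshold.

    snp_pvalues maps a gene label to the list of p-values of its SNPs.
    """
    n_tests = sum(len(pvals) for pvals in snp_pvalues.values())
    threshold = bonferroni_threshold(alpha, n_tests)
    return sorted(
        gene for gene, pvals in snp_pvalues.items()
        if any(p < threshold for p in pvals)
    )


# With the 24,487 SNPs mentioned in the abstract and an assumed alpha of 0.05:
print(f"{bonferroni_threshold(0.05, 24487):.3e}")

# Toy input with placeholder gene labels and made-up p-values
toy = {"GENE_A": [1e-4, 0.2], "GENE_B": [0.03, 0.5], "GENE_C": [0.9]}
print(screen_genes(toy))
```

Because the threshold shrinks as the number of tests grows, a screen of this kind becomes more conservative with more SNPs, which fits the abstract's remark that stage 1 needs methods that better balance type I and type II error.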
Learning style and concept acquisition of community college students in introductory biology
NASA Astrophysics Data System (ADS)
Bobick, Sandra Burin
This study investigated the influence of learning style on concept acquisition within a sample of community college students in a general biology course. There are two subproblems within the larger problem: (1) the influence of demographic variables (age, gender, number of college credits, prior exposure to scientific information) on learning style, and (2) the correlations between prior scientific knowledge, learning style and student understanding of the concept of the gene. The sample included all students enrolled in an introductory general biology course during two consecutive semesters at an urban community college. Initial data was gathered during the first week of the semester, at which time students filled in a short questionnaire (age, gender, number of college credits, prior exposure to science information either through reading/visual sources or a prior biology course). Subjects were then given the Inventory of Learning Processes-Revised (ILP-R), which measures general preferences in five learning styles: Deep Learning, Elaborative Learning, Agentic Learning, Methodical Learning and Literal Memorization. Subjects were then given the Gene Conceptual Knowledge pretest: a 15-question objective section and an essay section. Subjects were exposed to specific concepts during lecture and laboratory exercises. At the last lab, students were given the Genetics Conceptual Knowledge Posttest. Pretest/posttest gains were correlated with demographic variables and learning styles were analyzed for significant correlations. Learning styles, as the independent variable in a simultaneous multiple regression, were significant predictors of results on the gene assessment tests, including pretest, posttest and gain. Of the learning styles, Deep Learning accounted for the greatest positive predictive value of pretest essay and pretest objective results. Literal Memorization was a significant negative predictor for posttest essay, essay gain and objective gain.
Simultaneous multiple regression indicated that demographic variables were significant positive predictors for Methodical, Deep and Elaborative Learning Styles. Stepwise multiple regression resulted in number of credits, Read Science and gender (female) as significant predictors of learning styles. The findings of this study emphasize the importance of learning styles in conceptual understanding of the gene and the correlation of nonformal exposure to science information with learning style and conceptual understanding.
Maye, Peter; Stover, Mary Louise; Liu, Yaling; Rowe, David W; Gong, Shiaochin; Lichtler, Alexander C
2009-03-13
Reporter gene mice are valuable animal models for biological research providing a gene expression readout that can contribute to cellular characterization within the context of a developmental process. With the advancement of bacterial recombination techniques to engineer reporter gene constructs from BAC genomic clones and the generation of optically distinguishable fluorescent protein reporter genes, there is an unprecedented capability to engineer more informative transgenic reporter mouse models relative to what has been traditionally available. We demonstrate here our first effort on the development of a three stage bacterial recombination strategy to physically link multiple genes together with their respective fluorescent protein (FP) reporters in one DNA fragment. This strategy uses bacterial recombination techniques to: (1) subclone genes of interest into BAC linking vectors, (2) insert desired reporter genes into respective genes and (3) link different gene-reporters together. As proof of concept, we have generated a single DNA fragment containing the genes Trap, Dmp1, and Ibsp driving the expression of ECFP, mCherry, and Topaz FP reporter genes, respectively. Using this DNA construct, we have successfully generated transgenic reporter mice that retain two to three gene readouts. The three stage methodology to link multiple genes with their respective fluorescent protein reporter works with reasonable efficiency. Moreover, gene linkage allows for their common chromosomal integration into a single locus. However, the testing of this multi-reporter DNA construct by transgenesis does suggest that the linkage of two different genes together, despite their large size, can still create a positional effect. We believe that gene choice, genomic DNA fragment size and the presence of endogenous insulator elements are critical variables.
Cammarata-Scalisi, Francisco; Cozar, Mónica; Grinberg, Daniel; Balcells, Susana; Asteggiano, Carla G; Martínez-Domenech, Gustavo; Bracho, Ana; Sánchez, Yanira; Stock, Frances; Delgado-Luengo, Wilmer; Zara-Chirinos, Carmen; Chacín, José Antonio
2015-04-01
Hereditary forms of multiple exostoses, now called EXT1/EXT2-CDG within Congenital Disorders of Glycosylation, are the most common benign bone tumors in humans. The clinical picture consists of the formation of several cartilage-capped bone tumors, usually benign and localized in the juxta-epiphyseal region of long bones, although wide body dissemination in severe cases is not uncommon. Onset of the disease is variable, ranging from 2-3 years up to 13-15 years, with an estimated incidence ranging from 1/18,000 to 1/50,000 cases in European countries. We present double mutant alleles in the EXT1 gene, not previously reported, in a teenager and her family with hereditary multiple exostoses.
Tao, Yebin; Sánchez, Brisa N; Mukherjee, Bhramar
2015-03-30
Many existing cohort studies designed to investigate health effects of environmental exposures also collect data on genetic markers. The Early Life Exposures in Mexico to Environmental Toxicants project, for instance, has been genotyping single nucleotide polymorphisms on candidate genes involved in metal and nutrient metabolism and also in potentially shared metabolic pathways with the environmental exposures. Given the longitudinal nature of these cohort studies, rich exposure and outcome data are available to address novel questions regarding gene-environment interaction (G × E). Latent variable (LV) models have been effectively used for dimension reduction, helping with multiple testing and multicollinearity issues in the presence of correlated multivariate exposures and outcomes. In this paper, we first propose a modeling strategy, based on LV models, to examine the association between repeated outcome measures (e.g., child weight) and a set of correlated exposure biomarkers (e.g., prenatal lead exposure). We then construct novel tests for G × E effects within the LV framework to examine effect modification of outcome-exposure association by genetic factors (e.g., the hemochromatosis gene). We consider two scenarios: one allowing dependence of the LV models on genes and the other assuming independence between the LV models and genes. We combine the two sets of estimates by shrinkage estimation to trade off bias and efficiency in a data-adaptive way. Using simulations, we evaluate the properties of the shrinkage estimates, and in particular, we demonstrate the need for this data-adaptive shrinkage given repeated outcome measures, exposure measures possibly repeated and time-varying gene-environment association. Copyright © 2014 John Wiley & Sons, Ltd.
Chakraborty, Sutirtha
2018-05-26
RNA-Seq technology has revolutionized the face of gene expression profiling by generating read count data measuring the transcript abundances for each queried gene on multiple experimental subjects. But on the downside, the underlying technical artefacts and hidden biological profiles of the samples generate a wide variety of latent effects that may potentially distort the actual transcript/gene expression signals. Standard normalization techniques fail to correct for these hidden variables and lead to flawed downstream analyses. In this work I demonstrate the use of Partial Least Squares (built as an R package 'SVAPLSseq') to correct for the traces of extraneous variability in RNA-Seq data. A novel and thorough comparative analysis of the PLS based method is presented along with some of the other popularly used approaches for latent variable correction in RNA-Seq. Overall, the method is found to achieve a substantially improved estimation of the hidden effect signatures in the RNA-Seq transcriptome expression landscape compared to other available techniques. Copyright © 2017. Published by Elsevier Inc.
NASA Astrophysics Data System (ADS)
Tolar, B. B.; Reji, L.; Smith, J. M.; Chavez, F.; Francis, C.
2016-12-01
Thaumarchaeota are among the most abundant microorganisms on the planet, and are significant players in the global nitrogen cycle. All cultivated members of the phylum are capable of performing the first and rate-limiting step of nitrification - the aerobic oxidation of ammonia to nitrite. In marine environments, ammonia-oxidizing archaea (AOA) have been found to greatly outnumber their bacterial counterparts. However, much about their ecology remains unknown. Monterey Bay, a non-estuarine embayment on the central California coast, is an ideal site for studying the dynamics of natural thaumarchaeal assemblages, given the highly dynamic nature of the Bay waters with seasonal upwelling episodes and the associated steep gradients in environmental variables. In the present study, we examined thaumarchaeal population dynamics in the upper Monterey Bay water column (0-500 m) using multiple molecular markers. Following high-resolution spatiotemporal sampling (i.e., up to 10 depths sampled monthly over a period of 2 years) at two stations in the Bay, we quantified thaumarchaeal functional genes - the ammonia monooxygenase (amoA) gene and its `shallow' and `deep' marine ecotypes, and variants of the marine nitrite reductase (nirK) gene. The abundances of both genes were regressed against environmental variables to gain insights into factors shaping their spatiotemporal dynamics in the Bay. Gene abundances at both stations varied with depth and season, with winter months generally having several orders of magnitude greater abundances. Statistical analyses point to differential controls on the gene abundances, with depth and temperature potentially being the major environmental determinants of thaumarchaeal population size. Our results also highlight the importance of employing multiple marker genes to gain a more highly resolved picture of thaumarchaeal population dynamics in complex environmental systems such as the coastal ocean.
Diagnostic Challenges in Retinitis Pigmentosa: Genotypic Multiplicity and Phenotypic Variability
Chang, Susie; Vaccarella, Leah; Olatunji, Sunday; Cebulla, Colleen; Christoforidis, John
2011-01-01
Retinitis pigmentosa (RP) is a heterogeneous group of inherited retinal disorders. Diagnosis can be challenging as more than 40 genes are known to cause non-syndromic RP and phenotypic expression can differ significantly resulting in variations in disease severity, age of onset, rate of progression, and clinical findings. We describe the clinical manifestations of RP, the more commonly known causative gene mutations, and the genotypic-phenotypic correlation of RP. PMID:22131872
Aberrant gene promoter methylation associated with sporadic multiple colorectal cancer.
Gonzalo, Victoria; Lozano, Juan José; Muñoz, Jenifer; Balaguer, Francesc; Pellisé, Maria; Rodríguez de Miguel, Cristina; Andreu, Montserrat; Jover, Rodrigo; Llor, Xavier; Giráldez, M Dolores; Ocaña, Teresa; Serradesanferm, Anna; Alonso-Espinaco, Virginia; Jimeno, Mireya; Cuatrecasas, Miriam; Sendino, Oriol; Castellví-Bel, Sergi; Castells, Antoni
2010-01-19
Colorectal cancer (CRC) multiplicity has been mainly related to polyposis and non-polyposis hereditary syndromes. In sporadic CRC, aberrant gene promoter methylation has been shown to play a key role in carcinogenesis, although little is known about its involvement in multiplicity. To assess the effect of methylation in tumor multiplicity in sporadic CRC, hypermethylation of key tumor suppressor genes was evaluated in patients with both multiple and solitary tumors, as a proof-of-concept of an underlying epigenetic defect. We examined a total of 47 synchronous/metachronous primary CRC from 41 patients, and 41 gender, age (5-year intervals) and tumor location-paired patients with solitary tumors. Exclusion criteria were polyposis syndromes, Lynch syndrome and inflammatory bowel disease. DNA methylation at the promoter region of the MGMT, CDKN2A, SFRP1, TMEFF2, HS3ST2 (3OST2), RASSF1A and GATA4 genes was evaluated by quantitative methylation specific PCR in both tumor and corresponding normal appearing colorectal mucosa samples. Overall, patients with multiple lesions exhibited a higher degree of methylation in tumor samples than those with solitary tumors regarding all evaluated genes. After adjusting for age and gender, binomial logistic regression analysis identified methylation of MGMT2 (OR, 1.48; 95% CI, 1.10 to 1.97; p = 0.008) and RASSF1A (OR, 2.04; 95% CI, 1.01 to 4.13; p = 0.047) as variables independently associated with tumor multiplicity; the risk related to methylation of either of these two genes was 4.57 (95% CI, 1.53 to 13.61; p = 0.006). Moreover, in six patients in whom both tumors were available, we found a correlation in the methylation levels of MGMT2 (r = 0.64, p = 0.17), SFRP1 (r = 0.83, p = 0.06), HPP1 (r = 0.64, p = 0.17), 3OST2 (r = 0.83, p = 0.06) and GATA4 (r = 0.6, p = 0.24). Methylation in normal appearing colorectal mucosa from patients with multiple and solitary CRC showed no relevant difference in any evaluated gene.
These results provide a proof-of-concept that gene promoter methylation is associated with tumor multiplicity. This underlying epigenetic defect may have noteworthy implications in the prevention of patients with sporadic CRC.
USING GENOMICS TO EXAMINE MULTIPLE EXPOSURE VARIABLES IN BIOINDICATORS RESEARCH
Genomics technologies provide a powerful tool for rapid assessment of differentially expressed genes in laboratory and field animals exposed to toxicants, and a means by which to link the earliest indicators of exposure to diverse effects in organisms and populations. However, a...
Improving RNA-Seq expression estimation by modeling isoform- and exon-specific read sequencing rate.
Liu, Xuejun; Shi, Xinxin; Chen, Chunlin; Zhang, Li
2015-10-16
The high-throughput sequencing technology, RNA-Seq, has been widely used to quantify gene and isoform expression in the study of transcriptome in recent years. Accurate expression measurement from the millions or billions of short generated reads is obstructed by difficulties. One is ambiguous mapping of reads to reference transcriptome caused by alternative splicing. This increases the uncertainty in estimating isoform expression. The other is non-uniformity of read distribution along the reference transcriptome due to positional, sequencing, mappability and other undiscovered sources of biases. This violates the uniform assumption of read distribution for many expression calculation approaches, such as the direct RPKM calculation and Poisson-based models. Many methods have been proposed to address these difficulties. Some approaches employ latent variable models to discover the underlying pattern of read sequencing. However, most of these methods make bias correction based on surrounding sequence contents and share the bias models by all genes. They therefore cannot estimate gene- and isoform-specific biases as revealed by recent studies. We propose a latent variable model, NLDMseq, to estimate gene and isoform expression. Our method adopts latent variables to model the unknown isoforms, from which reads originate, and the underlying percentage of multiple spliced variants. The isoform- and exon-specific read sequencing biases are modeled to account for the non-uniformity of read distribution, and are identified by utilizing the replicate information of multiple lanes of a single library run. We employ simulation and real data to verify the performance of our method in terms of accuracy in the calculation of gene and isoform expression. Results show that NLDMseq obtains competitive gene and isoform expression compared to popular alternatives. 
Finally, the proposed method is applied to the detection of differential expression (DE) to show its usefulness in the downstream analysis. The proposed NLDMseq method provides an approach to accurately estimate gene and isoform expression from RNA-Seq data by modeling the isoform- and exon-specific read sequencing biases. It makes use of a latent variable model to discover the hidden pattern of read sequencing. We have shown that it works well in both simulations and real datasets, and has competitive performance compared to popular methods. The method has been implemented as a freely available software which can be found at https://github.com/PUGEA/NLDMseq.
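The abstract above contrasts NLDMseq with the direct RPKM calculation, which assumes reads are spread uniformly along a transcript. As a minimal sketch of that baseline only (not of NLDMseq itself), the standard RPKM formula can be written as follows; the function name and the example numbers are hypothetical.

```python
def rpkm(read_count: int, length_bp: int, total_mapped_reads: int) -> float:
    """Reads Per Kilobase of transcript per Million mapped reads.

    Standard definition: reads * 1e9 / (length in bp * total mapped reads).
    It implicitly assumes reads are uniformly distributed along the transcript,
    which is the assumption the abstract says is violated in practice.
    """
    if length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and total mapped reads must be positive")
    return read_count * 1e9 / (length_bp * total_mapped_reads)


if __name__ == "__main__":
    # Hypothetical gene: 500 reads, 2,000 bp long, 10 million mapped reads in the library.
    print(rpkm(500, 2_000, 10_000_000))
```

Because the formula divides by length and library size only, any position- or isoform-specific bias in read coverage passes straight through to the estimate.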
Emaneini, Mohammad; Jabalameli, Leila; Iman-Eini, Hossein; Aligholi, Marzieh; Ghasemi, Amir; Nakhjavani, Farrokh Akbari; Taherikalani, Morovat; Khoramian, Babak; Asadollahi, Parisa; Jabalameli, Fereshteh
2011-01-01
Methicillin resistant Staphylococcus aureus (MRSA), particularly strains with type III staphylococcal cassette chromosome mec (SCCmec), represent a serious human pathogen in Tehran, Iran. The disease-causing capability depends on their ability to produce a wide variety of virulent factors. The prevalence of exotoxin genes and multiple-locus variable number of tandem repeats fingerprinting (MLVF) profile among MRSA isolates, from patients in Tehran, was evaluated by PCR and Multiplex-PCR. The MLVF typing of 144 MRSA isolates with type III SCCmec produced 5 different MLVF types. Generally, 97.2% (140/144) of all the isolates were positive for at least one of the tested exotoxin genes. The most prevalent genes were hld, found in 87.5% (126/144) of the isolates followed by lukE-lukD and hla found in 72.9% (105/144) and 70.1% (101/144) of the isolates, respectively. The tst gene, belonging to MLVF types I, IV and V, was found among three of the isolates from blood and wound samples. The sea gene was detected in 58.3% (84/144) of the isolates and the sed and see genes were found in one isolate with MLVF type V. The coexistence of genes was observed in the 87.5% (126/144) of the isolates. The rate of coexistence of hld with lukE-lukD, hla with lukE-lukD and sea with lukE-lukD were 66.7% (96/144), 44.4% (64/144) and 44.4% (64/144), respectively. The present study demonstrated that MRSA strains with type III SCCmec show different MLVF patterns and exotoxin profiles.
Gal, Moran; Levanon, Erez Y; Hujeirat, Yasir; Khayat, Morad; Pe'er, Jacob; Shalev, Stavit
2014-12-01
Developmental malformations of the vitreoretinal vasculature are a heterogeneous group of conditions with various modes of inheritance, and include familial exudative vitreoretinopathy (FEVR), persistent fetal vasculature (PFV), and Norrie disease. We investigated a large consanguineous kindred with multiple affected individuals exhibiting variable phenotypes of abnormal vitreoretinal vasculature, consistent with the three above-mentioned conditions and compatible with autosomal recessive inheritance. Exome sequencing identified a novel apparent mutation, c.542G > T (p.C181F), in the TSPAN12 gene that segregated with the ocular disease in the family. The TSPAN12 gene was previously reported to cause dominant and recessive FEVR, but has not yet been associated with other vitreoretinal manifestations. The intra-familial clinical variability caused by a single mutation in the TSPAN12 gene underscores the complicated phenotype-genotype correlation of mutations in this gene, and suggests that there are additional genetic and environmental factors involved in the complex process of ocular vascularization during embryonic development. Our study supports considering PFV, FEVR, and Norrie disease a spectrum of disorders, with clinical and genetic overlap, caused by mutations in distinct genes acting in the Norrin/β-catenin signaling pathway. © 2014 Wiley Periodicals, Inc.
Pleiotropic biological activities of alternatively spliced TMPRSS2/ERG fusion gene transcripts
Wang, Jianghua; Cai, Yi; Yu, Wendong; Ren, Chengxi; Spencer, David M.; Ittmann, Michael
2008-01-01
TMPRSS2/ERG gene fusions are found in the majority of prostate cancers; however, there is significant heterogeneity in the 5′ region of the alternatively spliced fusion gene transcripts. We have found that there is also significant heterogeneity within the coding exons. There is variable inclusion of a 72-bp exon and other novel alternatively spliced isoforms. To assess the biological significance of these alternatively spliced transcripts, we expressed various transcripts in primary prostatic epithelial cells and in an immortalized prostatic epithelial cell line, PNT1a. The fusion gene transcripts promoted proliferation, invasion and motility with variable activities that depended on the structure of the 5′ region encoding the TMPRSS2/ERG fusion and the presence of the 72-bp exon. Cotransfection of different isoforms further enhanced biological activity, mimicking the situation in vivo, in which multiple isoforms are expressed. Finally, knockdown of the fusion gene in VCaP cells resulted in inhibition of proliferation in vitro and tumor progression in an in vivo orthotopic mouse model. Our results indicate that TMPRSS2/ERG fusion isoforms have variable biological activities promoting tumor initiation and progression and are consistent with our previous clinical observations indicating that certain TMPRSS2/ERG fusion isoforms are significantly correlated with more aggressive disease. PMID:18922926
Mutations of the Birt–Hogg–Dubé gene in patients with multiple lung cysts and recurrent pneumothorax
Gunji, Yoko; Akiyoshi, Taeko; Sato, Teruhiko; Kurihara, Masatoshi; Tominaga, Shigeru; Takahashi, Kazuhisa; Seyama, Kuniaki
2007-01-01
Rationale Birt–Hogg–Dubé (BHD) syndrome, a rare inherited autosomal genodermatosis first recognised in 1977, is characterised by fibrofolliculomas of the skin, an increased risk of renal tumours and multiple lung cysts with spontaneous pneumothorax. The BHD gene, a tumour suppressor gene located at chromosome 17p11.2, has recently been shown to be defective. Recent genetic studies revealed that clinical pictures of the disease may be variable and may not always present the full expression of the phenotypes. Objectives We hypothesised that mutations of the BHD gene are responsible for patients who have multiple lung cysts of which the underlying causes have not yet been elucidated. Methods We studied eight patients with lung cysts, without skin and renal disease; seven of these patients have a history of spontaneous pneumothorax and five have a family history of pneumothorax. The BHD gene was examined using PCR, denaturing high‐performance liquid chromatography and direct sequencing. Main results We found that five of the eight patients had a BHD germline mutation. All mutations were unique and four of them were novel, including three different deletions or insertions detected in exons 6, 12 and 13, respectively and one splice acceptor site mutation in intron 5 resulting in an in‐frame deletion of exon 6. Conclusions We found that germline mutations of the BHD gene are involved in some patients with multiple lung cysts and pneumothorax. Pulmonologists should be aware that BHD syndrome can occur as an isolated phenotype with pulmonary involvement. PMID:17496196
Bullich, Gemma; Trujillano, Daniel; Santín, Sheila; Ossowski, Stephan; Mendizábal, Santiago; Fraga, Gloria; Madrid, Álvaro; Ariceta, Gema; Ballarín, José; Torra, Roser; Estivill, Xavier; Ars, Elisabet
2015-09-01
Genetic diagnosis of steroid-resistant nephrotic syndrome (SRNS) using Sanger sequencing is complicated by the high genetic heterogeneity and phenotypic variability of this disease. We aimed to improve the genetic diagnosis of SRNS by simultaneously sequencing 26 glomerular genes using massive parallel sequencing and to study whether mutations in multiple genes increase disease severity. High-throughput mutation analysis was performed in 50 SRNS and/or focal segmental glomerulosclerosis (FSGS) patients, a validation cohort of 25 patients with known pathogenic mutations, and a discovery cohort of 25 uncharacterized patients with probable genetic etiology. In the validation cohort, we identified the 42 previously known pathogenic mutations across NPHS1, NPHS2, WT1, TRPC6, and INF2 genes. In the discovery cohort, disease-causing mutations in SRNS/FSGS genes were found in nine patients. We detected three patients with mutations in an SRNS/FSGS gene and COL4A3. Two of them were familial cases and presented a more severe phenotype than family members with mutation in only one gene. In conclusion, our results show that massive parallel sequencing is feasible and robust for genetic diagnosis of SRNS/FSGS. Our results indicate that patients carrying mutations in an SRNS/FSGS gene and also in COL4A3 gene have increased disease severity.
Chérif, Thouraya; Saidani, Mabrouka; Decré, Dominique; Boutiba-Ben Boubaker, Ilhem; Arlet, Guillaume
2016-01-01
Over a period of 40 months, plasmid-mediated AmpC β-lactamases were detected in Tunis, Tunisia, in 78 isolates (0.59%) of Escherichia coli, Klebsiella pneumoniae, and Proteus mirabilis. In 67 isolates, only one ampC gene was detected, i.e., blaCMY-2-type (n = 33), blaACC (n = 23), blaDHA (n = 6) or blaEBC (n = 5). Multiple ampC genes were detected in 11 isolates, with the following distribution: blaMOX-2, blaFOX-3, and blaCMY-4/16 (n = 6), blaFOX-3 and blaMOX-2 (n = 3), and blaCMY-4 and blaMOX-2 (n = 2). A great variety of plasmids carrying these genes was found, independently of the species and the bla gene. If the genetic context of blaCMY-2-type is variable, that of blaMOX-2, reported in part previously, is unique and that of blaFOX-3 is unique and new. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Clinical Trials With Large Numbers of Variables: Important Advantages of Canonical Analysis.
Cleophas, Ton J
2016-01-01
Canonical analysis assesses the combined effects of a set of predictor variables on a set of outcome variables, but it is little used in clinical trials despite the omnipresence of multiple variables. The aim of this study was to assess the performance of canonical analysis as compared with traditional multivariate methods using multivariate analysis of covariance (MANCOVA). As an example, a simulated data file with 12 gene expression levels and 4 drug efficacy scores was used. The correlation coefficient between the 12 predictor and 4 outcome variables was 0.87 (P = 0.0001), meaning that 76% of the variability in the outcome variables was explained by the 12 covariates. Repeated testing after the removal of 5 unimportant predictor variables and 1 outcome variable produced virtually the same overall result. The MANCOVA identified identical unimportant variables, but it was unable to provide overall statistics. (1) Canonical analysis is remarkable, because it can handle many more variables than traditional multivariate methods such as MANCOVA can. (2) At the same time, it accounts for the relative importance of the separate variables, their interactions and differences in units. (3) Canonical analysis provides overall statistics of the effects of sets of variables, whereas traditional multivariate methods only provide the statistics of the separate variables. (4) Unlike other methods for combining the effects of multiple variables such as factor analysis/partial least squares, canonical analysis is scientifically entirely rigorous. (5) Limitations include that it is less flexible than factor analysis/partial least squares, because only 2 sets of variables are used and because multiple solutions instead of one are offered. We do hope that this article will stimulate clinical investigators to start using this remarkable method.
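The step from a correlation coefficient of 0.87 to "76% of the variability explained" in the abstract above is the usual squaring of the correlation. A minimal sketch of that arithmetic, with a hypothetical function name, is:

```python
def explained_variance(r: float) -> float:
    """Share of variance explained by a correlation coefficient r (i.e. r squared)."""
    if not -1.0 <= r <= 1.0:
        raise ValueError("a correlation coefficient must lie in [-1, 1]")
    return r * r


if __name__ == "__main__":
    r = 0.87  # canonical correlation reported in the abstract
    print(f"{explained_variance(r):.0%} of outcome variability explained")
```

This only reproduces the reported percentage; it does not perform the canonical analysis itself, which would require the underlying data.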
Qin, Shengfang; Wang, Xueyan; Li, Yunxing; Wei, Ping; Chen, Chun; Zeng, Lan
2016-02-01
To explore the genetic mechanism for the phenotypic variability in a patient carrying a rare ring chromosome 9. The karyotype of the patient was analyzed with cytogenetic methods. Presence of sex chromosome was confirmed with fluorescence in situ hybridization. The SRY gene was subjected to PCR amplification and direct sequencing. Potential deletion and duplication were detected with array-based comparative genomic hybridization (array-CGH). The karyotype of the patient comprised 6 types of cell lines containing a ring chromosome 9. The SRY gene sequence was normal. By array-CGH, the patient carried a hemizygous deletion at 9p24.3-p23 (174 201-9 721 761) encompassing 30 genes from Online Mendelian Inheritance in Man. The phenotypic variability of the 9p deletion syndrome in conjunction with ring chromosome 9 may be attributable to multiple factors including loss of chromosomal material, insufficient dosage of genes, instability of ring chromosome, and pattern of inheritance.
Parkin, Derek B; Archer, Linda L; Childress, April L; Wellehan, James F X
2009-07-01
Bearded dragons (Pogona vitticeps) are popular pets in the United States. Agamid Adenovirus 1 (AgAdV1) is an important infectious agent of bearded dragons. The only AgAdV1 sequences available to date are from a highly conserved region of the DNA polymerase gene. Degenerate primers were designed to amplify a variable region of the AgAdV1 hexon gene for sequencing. Genetic differences were identified within the hexon gene of 17 bearded dragons from 4 collections. Much less diversity was present in the polymerase gene. Bayesian analysis of the hexon nucleotide alignment identified two larger groups and two isolates that did not tightly cluster with these two groups. Multiple genotypes were identified within collections, and individual genotypes were seen in different collections. Three bearded dragons appeared to be infected by multiple strains. These findings show that this hexon region is useful for AgAdV1 genotyping, which can be used epidemiologically as well as in future investigations of AgAdV1 evolution and clinical implications of strain differences.
Johnston, Jennifer J; Walker, Robert L; Davis, Sean; Facio, Flavia; Turner, Joyce T; Bick, David P; Daentl, Donna L; Ellison, Jay W; Meltzer, Paul S; Biesecker, Leslie G
2007-01-01
Contiguous gene syndromes cause disorders via haploinsufficiency for adjacent genes. Some contiguous gene syndromes (CGS) have stereotypical breakpoints, but others have variable breakpoints. In CGS that have variable breakpoints, the extent of the deletions may be correlated with severity. The Greig cephalopolysyndactyly contiguous gene syndrome (GCPS‐CGS) is a multiple malformation syndrome caused by haploinsufficiency of GLI3 and adjacent genes. In addition, non‐CGS GCPS can be caused by deletions or duplications in GLI3. Although fluorescence in situ hybridisation (FISH) can identify large deletion mutations in patients with GCPS or GCPS‐CGS, it is not practical for identification of small intragenic deletions or insertions, and it is difficult to accurately characterise the extent of the large deletions using this technique. We have designed a custom comparative genomic hybridisation (CGH) array that allows identification of deletions and duplications at kilobase resolution in the vicinity of GLI3. The array averages one probe every 730 bp for a total of about 14 000 probes over 10 Mb. We have analysed 16 individuals with known or suspected deletions or duplications. In 15 of 16 individuals (14 deletions and 1 duplication), the array confirmed the prior results. In the remaining patient, the normal CGH array result was correct, and the prior assessment was a false positive quantitative polymerase chain reaction result. We conclude that high‐density CGH array analysis is more sensitive than FISH analysis for detecting deletions and provides clinically useful results on the extent of the deletion. We suggest that high‐density CGH array analysis should replace FISH analysis for assessment of deletions and duplications in patients with contiguous gene syndromes caused by variable deletions. PMID:17098889
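The probe figures quoted in the abstract above (one probe every 730 bp, about 14 000 probes, 10 Mb) can be cross-checked with simple arithmetic. The sketch below assumes evenly spaced probes, which is a simplification since the abstract only gives an average spacing; the function name is hypothetical.

```python
def probes_for_region(region_bp: int, mean_spacing_bp: int) -> int:
    """Approximate probe count for a region tiled at a given mean spacing."""
    if region_bp <= 0 or mean_spacing_bp <= 0:
        raise ValueError("region size and spacing must be positive")
    return round(region_bp / mean_spacing_bp)


if __name__ == "__main__":
    # 10 Mb region at one probe per 730 bp, as stated in the abstract.
    print(probes_for_region(10_000_000, 730))
```

The result lands within a few percent of the "about 14 000" stated in the abstract, so the three figures are mutually consistent.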
Pereira, S; Lavado, N; Nogueira, L; Lopez, M; Abreu, J; Silva, H
2014-10-01
Orthodontic-induced external apical root resorption (EARR) is a complex phenotype determined by poorly defined mechanical and patient intrinsic factors. The aim of this work was to construct a multifactorial integrative model, including clinical and genetic susceptibility factors, to analyze the risk of developing this common orthodontic complication. This retrospective study included 195 orthodontic patients. Using a multiple-linear regression model, where the dependent variable was the maximum % of root resorption (%EARRmax) for each patient, we assessed the contribution of nine clinical variables and four polymorphisms of genes involved in bone and tooth root remodeling (rs1718119 from P2RX7, rs1143634 from IL1B, rs3102735 from TNFRSF11B, encoding OPG, and rs1805034 from TNFRSF11A, encoding RANK). Clinical and genetic variables explained 30% of %EARRmax variability. The variables with the most significant unique contribution to the model were: gender (P < 0.05), treatment duration (P < 0.001), premolar extractions (P < 0.01), Hyrax appliance (P < 0.001) and GG genotype of rs1718119 from P2RX7 gene (P < 0.01). Age, overjet, tongue thrust, skeletal class II and the other polymorphisms made minor contributions. This study highlights the P2RX7 gene as a possible factor of susceptibility to EARR. A more extensive genetic profile may improve this model. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Gene regulation and noise reduction by coupling of stochastic processes
NASA Astrophysics Data System (ADS)
Ramos, Alexandre F.; Hornos, José Eduardo M.; Reinitz, John
2015-02-01
Here we characterize the low-noise regime of a stochastic model for a negative self-regulating binary gene. The model has two stochastic variables, the protein number and the state of the gene. Each state of the gene behaves as a protein source governed by a Poisson process. The coupling between the two gene states depends on protein number. This fact has a very important implication: There exist protein production regimes characterized by sub-Poissonian noise because of negative covariance between the two stochastic variables of the model. Hence the protein numbers obey a probability distribution that has a peak that is sharper than those of the two coupled Poisson processes that are combined to produce it. Biochemically, the noise reduction in protein number occurs when the switching of the genetic state is more rapid than protein synthesis or degradation. We consider the chemical reaction rates necessary for Poisson and sub-Poisson processes in prokaryotes and eucaryotes. Our results suggest that the coupling of multiple stochastic processes in a negative covariance regime might be a widespread mechanism for noise reduction.
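The mechanism described in the abstract above (a binary gene that its own protein switches off, with switching faster than synthesis or degradation) can be explored numerically. The following is a rough sketch using the Gillespie stochastic simulation algorithm, not the authors' analytical treatment. All rate names and values (`k`, `rho`, `f`, `h`) are hypothetical, and it assumes the off state produces no protein and that switching leaves the protein count unchanged. A Fano factor (variance divided by mean) below 1 indicates sub-Poissonian noise.

```python
import random


def simulate_binary_gene(k=100.0, rho=1.0, f=1000.0, h=100.0,
                         t_end=200.0, burn_in=20.0, seed=0):
    """Gillespie simulation of a self-repressing two-state gene.

    Hypothetical rates: k = synthesis (gene on), rho = per-protein degradation,
    h * n = on-to-off switching with n proteins present, f = off-to-on switching.
    Returns the time-averaged mean, variance and Fano factor of the protein count.
    """
    rng = random.Random(seed)
    t, n, gene_on = 0.0, 0, True
    weight = sum_n = sum_n2 = 0.0
    while t < t_end:
        a_prod = k if gene_on else 0.0
        a_deg = rho * n
        a_switch = h * n if gene_on else f
        total = a_prod + a_deg + a_switch  # always positive for these rates
        dt = rng.expovariate(total)
        # Accumulate time-weighted statistics for the part of [t, t + dt)
        # that lies after the burn-in period and before t_end.
        lo, hi = max(t, burn_in), min(t + dt, t_end)
        if hi > lo:
            weight += hi - lo
            sum_n += n * (hi - lo)
            sum_n2 += n * n * (hi - lo)
        t += dt
        # Pick which reaction fires, proportionally to its propensity.
        u = rng.random() * total
        if u < a_prod:
            n += 1
        elif u < a_prod + a_deg:
            n -= 1
        else:
            gene_on = not gene_on
    mean = sum_n / weight
    var = sum_n2 / weight - mean * mean
    return mean, var, var / mean


if __name__ == "__main__":
    mean, var, fano = simulate_binary_gene()
    print(f"mean={mean:.2f} variance={var:.2f} Fano={fano:.2f}")
```

With these illustrative rates the gene switches hundreds of times per protein lifetime, which corresponds to the fast-switching regime the abstract associates with noise reduction. Slowing the switching rates `f` and `h` relative to `k` and `rho` should move the Fano factor back toward or above 1.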
Gene regulation and noise reduction by coupling of stochastic processes
Hornos, José Eduardo M.; Reinitz, John
2015-01-01
Here we characterize the low-noise regime of a stochastic model for a negative self-regulating binary gene. The model has two stochastic variables, the protein number and the state of the gene. Each state of the gene behaves as a protein source governed by a Poisson process. The coupling between the two gene states depends on protein number. This fact has a very important implication: there exist protein production regimes characterized by sub-Poissonian noise because of negative covariance between the two stochastic variables of the model. Hence the protein numbers obey a probability distribution that has a peak that is sharper than those of the two coupled Poisson processes that are combined to produce it. Biochemically, the noise reduction in protein number occurs when the switching of genetic state is more rapid than protein synthesis or degradation. We consider the chemical reaction rates necessary for Poisson and sub-Poisson processes in prokaryotes and eucaryotes. Our results suggest that the coupling of multiple stochastic processes in a negative covariance regime might be a widespread mechanism for noise reduction. PMID:25768447
Genomic Methods for Clinical and Translational Pain Research
Wang, Dan; Kim, Hyungsuk; Wang, Xiao-Min; Dionne, Raymond
2012-01-01
Pain is a complex sensory experience for which the molecular mechanisms are yet to be fully elucidated. Individual differences in pain sensitivity are mediated by a complex network of multiple gene polymorphisms, physiological and psychological processes, and environmental factors. Here, we present the methods for applying unbiased molecular-genetic approaches, genome-wide association study (GWAS), and global gene expression analysis, to help better understand the molecular basis of pain sensitivity in humans and variable responses to analgesic drugs. PMID:22351080
Bessenyei, Beáta; Nagy, Andrea; Balogh, Erzsébet; Novák, László; Bognár, László; Knegt, Alida C; Oláh, Eva
2013-10-01
We report on a female patient with an exceedingly rare combination of achondroplasia and multiple-suture craniosynostosis. Besides the specific features of achondroplasia, synostosis of the metopic, coronal, lambdoid, and squamosal sutures was found. A series of neurosurgical interventions was carried out, principally for acrocephaly and posterior plagiocephaly. The most common achondroplasia mutation, a p.Gly380Arg in the fibroblast growth factor receptor 3 (FGFR3) gene, was detected. Cytogenetic and array CGH analyses, as well as molecular genetic testing of FGFR1, 2, 3 and TWIST1 genes failed to identify any additional genetic alteration. It is suggested that this unusual phenotype is a result of variable expressivity of the common achondroplasia mutation. Copyright © 2013 Wiley Periodicals, Inc.
Tsumura, Y; Uchiyama, K; Moriguchi, Y; Ueno, S; Ihara-Ujino, T
2012-12-01
Local adaptation is important in evolutionary processes and speciation. We used multiple tests to identify several candidate genes that may be involved in local adaptation from 1026 loci in 14 natural populations of Cryptomeria japonica, the most economically important forestry tree in Japan. We also studied the relationships between genotypes and environmental variables to obtain information on the selective pressures acting on individual populations. Outlier loci were mapped onto a linkage map, and the positions of loci associated with specific environmental variables are considered. The outlier loci were not randomly distributed on the linkage map; linkage group 11 was identified as a genomic island of divergence. Three loci in this region were also associated with environmental variables such as mean annual temperature, daily maximum temperature, maximum snow depth, and so on. Outlier loci identified with high significance levels will be essential for conservation purposes and for future work on molecular breeding.
High-throughput discovery of novel developmental phenotypes.
Dickinson, Mary E; Flenniken, Ann M; Ji, Xiao; Teboul, Lydia; Wong, Michael D; White, Jacqueline K; Meehan, Terrence F; Weninger, Wolfgang J; Westerberg, Henrik; Adissu, Hibret; Baker, Candice N; Bower, Lynette; Brown, James M; Caddle, L Brianna; Chiani, Francesco; Clary, Dave; Cleak, James; Daly, Mark J; Denegre, James M; Doe, Brendan; Dolan, Mary E; Edie, Sarah M; Fuchs, Helmut; Gailus-Durner, Valerie; Galli, Antonella; Gambadoro, Alessia; Gallegos, Juan; Guo, Shiying; Horner, Neil R; Hsu, Chih-Wei; Johnson, Sara J; Kalaga, Sowmya; Keith, Lance C; Lanoue, Louise; Lawson, Thomas N; Lek, Monkol; Mark, Manuel; Marschall, Susan; Mason, Jeremy; McElwee, Melissa L; Newbigging, Susan; Nutter, Lauryl M J; Peterson, Kevin A; Ramirez-Solis, Ramiro; Rowland, Douglas J; Ryder, Edward; Samocha, Kaitlin E; Seavitt, John R; Selloum, Mohammed; Szoke-Kovacs, Zsombor; Tamura, Masaru; Trainor, Amanda G; Tudose, Ilinca; Wakana, Shigeharu; Warren, Jonathan; Wendling, Olivia; West, David B; Wong, Leeyean; Yoshiki, Atsushi; MacArthur, Daniel G; Tocchini-Valentini, Glauco P; Gao, Xiang; Flicek, Paul; Bradley, Allan; Skarnes, William C; Justice, Monica J; Parkinson, Helen E; Moore, Mark; Wells, Sara; Braun, Robert E; Svenson, Karen L; de Angelis, Martin Hrabe; Herault, Yann; Mohun, Tim; Mallon, Ann-Marie; Henkelman, R Mark; Brown, Steve D M; Adams, David J; Lloyd, K C Kent; McKerlie, Colin; Beaudet, Arthur L; Bućan, Maja; Murray, Stephen A
2016-09-22
Approximately one-third of all mammalian genes are essential for life. Phenotypes resulting from knockouts of these genes in mice have provided tremendous insight into gene function and congenital disorders. As part of the International Mouse Phenotyping Consortium effort to generate and phenotypically characterize 5,000 knockout mouse lines, here we identify 410 lethal genes during the production of the first 1,751 unique gene knockouts. Using a standardized phenotyping platform that incorporates high-resolution 3D imaging, we identify phenotypes at multiple time points for previously uncharacterized genes and additional phenotypes for genes with previously reported mutant phenotypes. Unexpectedly, our analysis reveals that incomplete penetrance and variable expressivity are common even on a defined genetic background. In addition, we show that human disease genes are enriched for essential genes, thus providing a dataset that facilitates the prioritization and validation of mutations identified in clinical sequencing efforts.
High-throughput discovery of novel developmental phenotypes
Dickinson, Mary E.; Flenniken, Ann M.; Ji, Xiao; Teboul, Lydia; Wong, Michael D.; White, Jacqueline K.; Meehan, Terrence F.; Weninger, Wolfgang J.; Westerberg, Henrik; Adissu, Hibret; Baker, Candice N.; Bower, Lynette; Brown, James M.; Caddle, L. Brianna; Chiani, Francesco; Clary, Dave; Cleak, James; Daly, Mark J.; Denegre, James M.; Doe, Brendan; Dolan, Mary E.; Edie, Sarah M.; Fuchs, Helmut; Gailus-Durner, Valerie; Galli, Antonella; Gambadoro, Alessia; Gallegos, Juan; Guo, Shiying; Horner, Neil R.; Hsu, Chih-wei; Johnson, Sara J.; Kalaga, Sowmya; Keith, Lance C.; Lanoue, Louise; Lawson, Thomas N.; Lek, Monkol; Mark, Manuel; Marschall, Susan; Mason, Jeremy; McElwee, Melissa L.; Newbigging, Susan; Nutter, Lauryl M.J.; Peterson, Kevin A.; Ramirez-Solis, Ramiro; Rowland, Douglas J.; Ryder, Edward; Samocha, Kaitlin E.; Seavitt, John R.; Selloum, Mohammed; Szoke-Kovacs, Zsombor; Tamura, Masaru; Trainor, Amanda G; Tudose, Ilinca; Wakana, Shigeharu; Warren, Jonathan; Wendling, Olivia; West, David B.; Wong, Leeyean; Yoshiki, Atsushi; MacArthur, Daniel G.; Tocchini-Valentini, Glauco P.; Gao, Xiang; Flicek, Paul; Bradley, Allan; Skarnes, William C.; Justice, Monica J.; Parkinson, Helen E.; Moore, Mark; Wells, Sara; Braun, Robert E.; Svenson, Karen L.; de Angelis, Martin Hrabe; Herault, Yann; Mohun, Tim; Mallon, Ann-Marie; Henkelman, R. Mark; Brown, Steve D.M.; Adams, David J.; Lloyd, K.C. Kent; McKerlie, Colin; Beaudet, Arthur L.; Bucan, Maja; Murray, Stephen A.
2016-01-01
Approximately one third of all mammalian genes are essential for life. Phenotypes resulting from mouse knockouts of these genes have provided tremendous insight into gene function and congenital disorders. As part of the International Mouse Phenotyping Consortium effort to generate and phenotypically characterize 5000 knockout mouse lines, we have identified 410 lethal genes during the production of the first 1751 unique gene knockouts. Using a standardised phenotyping platform that incorporates high-resolution 3D imaging, we identified novel phenotypes at multiple time points for previously uncharacterized genes and additional phenotypes for genes with previously reported mutant phenotypes. Unexpectedly, our analysis reveals that incomplete penetrance and variable expressivity are common even on a defined genetic background. In addition, we show that human disease genes are enriched for essential genes identified in our screen, thus providing a novel dataset that facilitates prioritization and validation of mutations identified in clinical sequencing efforts. PMID:27626380
2011-01-01
Background: Copepods are highly diverse and abundant, resulting in extensive ecological radiation in marine ecosystems. Calanus sinicus dominates continental shelf waters in the northwest Pacific Ocean and plays an important role in the local ecosystem by linking primary production to higher trophic levels. A lack of effective molecular markers has hindered phylogenetic and population genetic studies concerning copepods. As they are genome-level informative, mitochondrial DNA sequences can be used as markers for population genetic studies and phylogenetic studies. Results: The mitochondrial genome of C. sinicus is distinct from other arthropods owing to the concurrence of multiple non-coding regions and a reshuffled gene arrangement. Further particularities in the mitogenome of C. sinicus include low A + T-content, symmetrical nucleotide composition between strands, abbreviated stop codons for several PCGs and extended lengths of the genes atp6 and atp8 relative to other copepods. The monophyletic Copepoda should be placed within the Vericrustacea. The close affinity between Cyclopoida and Poecilostomatoida suggests reassigning the latter as subordinate to the former. Monophyly of Maxillopoda is rejected. Within the alignment of 11 C. sinicus mitogenomes, there are 397 variable sites harbouring three 'hotspot' variable sites and three microsatellite loci. Conclusion: The occurrence of the circular subgenomic fragment during laboratory assays suggests that special caution should be taken when sequencing mitogenomes using long PCR. Such a phenomenon may provide additional evidence of mitochondrial DNA recombination, which appears to have been a prerequisite for shaping the present mitochondrial profile of C. sinicus during its evolution. The lack of synapomorphic gene arrangements among copepods has cast doubt on the utility of gene order as a useful molecular marker for deep phylogenetic analysis. However, mitochondrial genomic sequences have been valuable markers for resolving phylogenetic issues concerning copepods. The variable site maps of C. sinicus mitogenomes provide a solid foundation for population genetic studies. PMID:21269523
Meta-analytic framework for liquid association.
Wang, Lin; Liu, Silvia; Ding, Ying; Yuan, Shin-Sheng; Ho, Yen-Yi; Tseng, George C
2017-07-15
Although coexpression analysis via pair-wise expression correlation is popularly used to elucidate gene-gene interactions at the whole-genome scale, many complicated multi-gene regulations require more advanced detection methods. Liquid association (LA) is a powerful tool to detect the dynamic correlation of two gene variables depending on the expression level of a third variable (LA scouting gene). LA detection from a single transcriptomic study, however, is often unstable and not generalizable due to cohort bias, biological variation and limited sample size. With the rapid development of microarray and NGS technology, LA analysis combining multiple gene expression studies can provide more accurate and stable results. In this article, we proposed two meta-analytic approaches for LA analysis (MetaLA and MetaMLA) to combine multiple transcriptomic studies. To offset the heavy computational cost, we also proposed a two-step fast screening algorithm for more efficient genome-wide screening: bootstrap filtering and sign filtering. We applied the methods to five Saccharomyces cerevisiae datasets related to environmental changes. The fast screening algorithm reduced running time by 98%. When compared with single study analysis, MetaLA and MetaMLA provided stronger detection signal and more consistent and stable results. The top triplets are highly enriched in fundamental biological processes related to environmental changes. Our method can help biologists understand underlying regulatory mechanisms under different environmental exposure or disease states. A MetaLA R package, data and code for this article are available at http://tsenglab.biostat.pitt.edu/software.htm. Contact: ctseng@pitt.edu. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press.
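To make the liquid association idea concrete, here is a minimal single-study sketch, assuming the commonly used three-product-moment estimator: X and Y are standardized, the scouting gene Z is replaced by rank-based normal scores, and the LA score is the mean of the triple product. This is not the MetaLA or MetaMLA procedure from the abstract, and the function names are hypothetical.

```python
from statistics import NormalDist, mean, pstdev

def standardize(values):
    """Center to mean 0 and scale to unit (population) standard deviation."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]

def normal_scores(values):
    """Rank-based normal quantile transform (ties broken by input order)."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    scores = [0.0] * n
    for rank, i in enumerate(order, start=1):
        scores[i] = NormalDist().inv_cdf(rank / (n + 1))
    return scores

def liquid_association(x, y, z):
    """Mean of the triple product X*Y*Z after the transforms above.

    A positive score suggests the X-Y correlation rises as Z increases;
    a negative score suggests it falls.
    """
    xs, ys, zs = standardize(x), standardize(y), normal_scores(z)
    return mean(a * b * c for a, b, c in zip(xs, ys, zs))

if __name__ == "__main__":
    # Toy data: X and Y move together when Z is high, in opposition when low.
    x = [(-1) ** i * (1 + i % 3) for i in range(40)]
    z = list(range(40))
    y = [xi if zi >= 20 else -xi for xi, zi in zip(x, z)]
    print(liquid_association(x, y, z))
```

A plain correlation between X and Y on this toy data would be close to zero, which is the kind of relationship pair-wise coexpression analysis misses.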
Elastic-net regularization approaches for genome-wide association studies of rheumatoid arthritis.
Cho, Seoae; Kim, Haseong; Oh, Sohee; Kim, Kyunga; Park, Taesung
2009-12-15
The current trend in genome-wide association studies is to identify regions where the true disease-causing genes may lie by evaluating thousands of single-nucleotide polymorphisms (SNPs) across the whole genome. However, many challenges exist in detecting disease-causing genes among the thousands of SNPs. Examples include multicollinearity and multiple testing issues, especially when a large number of correlated SNPs are simultaneously tested. Multicollinearity can often occur when predictor variables in a multiple regression model are highly correlated, and can cause imprecise estimation of association. In this study, we propose a simple stepwise procedure that identifies disease-causing SNPs simultaneously by employing elastic-net regularization, a variable selection method that allows one to address multicollinearity. At Step 1, the single-marker association analysis was conducted to screen SNPs. At Step 2, the multiple-marker association was scanned based on the elastic-net regularization. The proposed approach was applied to the rheumatoid arthritis (RA) case-control data set of Genetic Analysis Workshop 16. While the selected SNPs at the screening step are located mostly on chromosome 6, the elastic-net approach identified putative RA-related SNPs on other chromosomes in an increased proportion. For some of those putative RA-related SNPs, we identified the interactions with sex, a well known factor affecting RA susceptibility.
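The two-step procedure in this abstract (screen single markers, then fit an elastic net on the survivors) can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: it uses a quantitative response and absolute correlation as the screening statistic, whereas the study analyzed case-control data, and the function names, `keep`, `lam` and `alpha` are hypothetical. The elastic-net fit uses standard cyclic coordinate descent with soft thresholding.

```python
def column(X, j):
    return [row[j] for row in X]

def abs_corr(u, v):
    """Absolute Pearson correlation; 0.0 if either vector is constant."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = sum((a - mu) ** 2 for a in u) ** 0.5
    sv = sum((b - mv) ** 2 for b in v) ** 0.5
    return abs(cov / (su * sv)) if su > 0 and sv > 0 else 0.0

def screen(X, y, keep):
    """Step 1: keep the `keep` markers with the strongest marginal signal."""
    scores = [abs_corr(column(X, j), y) for j in range(len(X[0]))]
    ranked = sorted(range(len(scores)), key=lambda j: -scores[j])
    return sorted(ranked[:keep])

def soft_threshold(value, gamma):
    if value > gamma:
        return value - gamma
    if value < -gamma:
        return value + gamma
    return 0.0

def elastic_net(X, y, lam=0.1, alpha=0.5, n_sweeps=200):
    """Step 2: minimize (1/2n)||y - Xb||^2 + lam*(alpha*|b|_1 + (1-alpha)/2*|b|_2^2)."""
    n, p = len(X), len(X[0])
    means = [sum(column(X, j)) / n for j in range(p)]
    cols = [[row[j] - means[j] for row in X] for j in range(p)]  # centered
    y_mean = sum(y) / n
    resid = [v - y_mean for v in y]  # residual for beta = 0
    beta = [0.0] * p
    for _ in range(n_sweeps):
        for j in range(p):
            xj = cols[j]
            denom = sum(a * a for a in xj) / n + lam * (1 - alpha)
            if denom == 0:
                continue  # constant column with a pure lasso penalty
            # covariance of x_j with the residual that excludes x_j's own term
            rho = sum(a * (r + a * beta[j]) for a, r in zip(xj, resid)) / n
            new = soft_threshold(rho, lam * alpha) / denom
            delta = new - beta[j]
            if delta != 0.0:
                resid = [r - a * delta for a, r in zip(xj, resid)]
                beta[j] = new
    return beta
```

With `alpha=1.0` the penalty reduces to the lasso and with `alpha=0.0` to ridge; intermediate values are what let correlated markers share weight rather than one being dropped arbitrarily.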
Geographic setting influences Great Lakes beach microbiological water quality
Haack, Sheridan K.; Fogarty, Lisa R.; Stelzer, Erin A.; Fuller, Lori M.; Brennan, Angela K.; Isaacs, Natasha M.; Johnson, Heather E.
2013-01-01
Understanding of factors that influence Escherichia coli (EC) and enterococci (ENT) concentrations, pathogen occurrence, and microbial sources at Great Lakes beaches comes largely from individual beach studies. Using 12 representative beaches, we tested enrichment cultures from 273 beach water and 22 tributary samples for EC, ENT, and genes indicating the bacterial pathogens Shiga-toxin producing E. coli (STEC), Shigella spp., Salmonella spp., Campylobacter jejuni/coli, and methicillin-resistant Staphylococcus aureus, and 108–145 samples for Bacteroides human, ruminant, and gull source-marker genes. EC/ENT temporal patterns, general Bacteroides concentration, and pathogen types and occurrence were regionally consistent (up to 40 km), but beach catchment variables (drains/creeks, impervious surface, urban land cover) influenced exceedances of EC/ENT standards and detections of Salmonella and STEC. Pathogen detections were more numerous when the EC/ENT Beach Action Value (but not when the Geometric Mean and Statistical Threshold Value) was exceeded. EC, ENT, and pathogens were not necessarily influenced by the same variables. Multiple Bacteroides sources, varying by date, occurred at every beach. Study of multiple beaches in different geographic settings provided new insights on the contrasting influences of regional and local variables, and a broader-scale perspective, on significance of EC/ENT exceedances, bacterial sources, and pathogen occurrence.
Candidate gene analysis for Alzheimer's disease in adults with Down syndrome.
Lee, Joseph H; Lee, Annie J; Dang, Lam-Ha; Pang, Deborah; Kisselev, Sergey; Krinsky-McHale, Sharon J; Zigman, Warren B; Luchsinger, José A; Silverman, Wayne; Tycko, Benjamin; Clark, Lorraine N; Schupf, Nicole
2017-08-01
Individuals with Down syndrome (DS) overexpress many genes on chromosome 21 due to trisomy and have high risk of dementia due to the Alzheimer's disease (AD) neuropathology. However, there is a wide range of phenotypic differences (e.g., age at onset of AD, amyloid β levels) among adults with DS, suggesting the importance of factors that modify risk within this particularly vulnerable population, including genotypic variability. Previous genetic studies in the general population have identified multiple genes that are associated with AD. This study examined the contribution of polymorphisms in these genes to the risk of AD in adults with DS ranging from 30 to 78 years of age at study entry (N = 320). We used multiple logistic regressions to estimate the likelihood of AD using single-nucleotide polymorphisms (SNPs) in candidate genes, adjusting for age, sex, race/ethnicity, level of intellectual disability and APOE genotype. This study identified multiple SNPs in APP and CST3 that were associated with AD at a gene-wise level empirical p-value of 0.05, with odds ratios in the range of 1.5-2. SNPs in MARK4 were marginally associated with AD. CST3 and MARK4 may contribute to our understanding of potential mechanisms where CST3 may contribute to the amyloid pathway by inhibiting plaque formation, and MARK4 may contribute to the regulation of the transition between stable and dynamic microtubules. Copyright © 2017 Elsevier Inc. All rights reserved.
Lee, Kyu Ha; Tadesse, Mahlet G; Baccarelli, Andrea A; Schwartz, Joel; Coull, Brent A
2017-03-01
The analysis of multiple outcomes is becoming increasingly common in modern biomedical studies. It is well-known that joint statistical models for multiple outcomes are more flexible and more powerful than fitting a separate model for each outcome; they yield more powerful tests of exposure or treatment effects by taking into account the dependence among outcomes and pooling evidence across outcomes. It is, however, unlikely that all outcomes are related to the same subset of covariates. Therefore, there is interest in identifying exposures or treatments associated with particular outcomes, which we term outcome-specific variable selection. In this work, we propose a variable selection approach for multivariate normal responses that incorporates not only information on the mean model, but also information on the variance-covariance structure of the outcomes. The approach effectively leverages evidence from all correlated outcomes to estimate the effect of a particular covariate on a given outcome. To implement this strategy, we develop a Bayesian method that builds a multivariate prior for the variable selection indicators based on the variance-covariance of the outcomes. We show via simulation that the proposed variable selection strategy can boost power to detect subtle effects without increasing the probability of false discoveries. We apply the approach to the Normative Aging Study (NAS) epigenetic data and identify a subset of five genes in the asthma pathway for which gene-specific DNA methylations are associated with exposures to either black carbon, a marker of traffic pollution, or sulfate, a marker of particles generated by power plants. © 2016, The International Biometric Society.
Crocco, Paolina; Barale, Roberto; Rose, Giuseppina; Rizzato, Cosmeri; Santoro, Aurelia; De Rango, Francesco; Carrai, Maura; Fogar, Paola; Monti, Daniela; Biondi, Fiammetta; Bucci, Laura; Ostan, Rita; Tallaro, Federica; Montesanto, Alberto; Zambon, Carlo-Federico; Franceschi, Claudio; Canzian, Federico; Passarino, Giuseppe; Campa, Daniele
2015-06-01
Leukocyte telomere length (LTL) has been observed to be heritable and correlated with longevity. However, contrasting results have been reported in different populations on the value of LTL heritability and on how biology of telomeres influences longevity. We investigated whether the variability of genes correlated to telomere maintenance is associated with telomere length and affects longevity in a population from Southern Italy (20-106 years). For this purpose we analyzed thirty-one polymorphisms in eight telomerase-associated genes, of which twelve are in the genes coding for the core enzyme (TERT and TERC) and the remainder in genes coding for components of the telomerase complex (TERF1, TERF2, TERF2IP, TNKS, TNKS2 and TEP1). We did not observe (after correcting for multiple testing) statistically significant associations between SNPs and LTL, possibly suggesting a low genetic influence of the variability of these genes on LTL in the elderly. On the other hand, we found that the variability of genes encoding for TERF1 and TNKS2, not directly involved in LTL, but important for keeping the integrity of the structure, shows a significant association with longevity. This suggests that the maintenance of these chromosomal structures may be critically important for preventing, or delaying, senescence and aging. Such a correlation was not observed in a population from northern Italy that we used as an independent replication set. This discrepancy is in line with previous reports regarding both the population specificity of results on telomere biology and the differences of aging in northern and southern Italy.
Practical applications of the bioinformatics toolbox for narrowing quantitative trait loci.
Burgess-Herbert, Sarah L; Cox, Allison; Tsaih, Shirng-Wern; Paigen, Beverly
2008-12-01
Dissecting the genes involved in complex traits can be confounded by multiple factors, including extensive epistatic interactions among genes, the involvement of epigenetic regulators, and the variable expressivity of traits. Although quantitative trait locus (QTL) analysis has been a powerful tool for localizing the chromosomal regions underlying complex traits, systematically identifying the causal genes remains challenging. Here, through its application to plasma levels of high-density lipoprotein cholesterol (HDL) in mice, we demonstrate a strategy for narrowing QTL that utilizes comparative genomics and bioinformatics techniques. We show how QTL detected in multiple crosses are subjected to both combined cross analysis and haplotype block analysis; how QTL from one species are mapped to the concordant regions in another species; and how genomewide scans associating haplotype groups with their phenotypes can be used to prioritize the narrowed regions. Then we illustrate how these individual methods for narrowing QTL can be systematically integrated for mouse chromosomes 12 and 15, resulting in a significantly reduced number of candidate genes, often from hundreds to <10. Finally, we give an example of how additional bioinformatics resources can be combined with experiments to determine the most likely quantitative trait genes.
Why replication is important in landscape genetics: American black bear in the Rocky Mountains
Short Bull, R.A.; Cushman, S.A.; Mace, R.; Chilton, T.; Kendall, K.C.; Landguth, E.L.; Schwartz, Maurice L.; McKelvey, K.; Allendorf, F.W.; Luikart, G.
2011-01-01
We investigated how landscape features influence gene flow of black bears by testing the relative support for 36 alternative landscape resistance hypotheses, including isolation by distance (IBD), in each of 12 study areas in the north central U.S. Rocky Mountains. The study areas all contained the same basic elements, but differed in extent of forest fragmentation, altitude, variation in elevation and road coverage. In all but one of the study areas, isolation by landscape resistance was more supported than IBD, suggesting gene flow is likely influenced by elevation, forest cover, and roads. However, the landscape features influencing gene flow varied among study areas. Using subsets of loci usually gave models with very similar landscape features influencing gene flow to those obtained with all loci, suggesting the landscape features influencing gene flow were correctly identified. To test whether the variability of supported landscape features among study areas resulted from landscape differences among study areas, we conducted a limiting factor analysis. We found that features were supported in landscape models only when the features were highly variable. This is perhaps not surprising but suggests an important cautionary note: if landscape features are not found to influence gene flow, researchers should not automatically conclude that the features are unimportant to the species’ movement and gene flow. Failure to investigate multiple study areas that have a range of variability in landscape features could cause misleading inferences about which landscape features generally limit gene flow. This could lead to potentially erroneous identification of corridors and barriers if models are transferred between areas with different landscape characteristics.
Kamradt, Jaclyn M.; Nigg, Joel T.; Friderici, Karen H.; Nikolas, Molly A.
2016-01-01
Genetic influences on dopaminergic neurotransmission have been implicated in attention-deficit hyperactivity disorder (ADHD) and are theorized to impact cognitive functioning via alterations in frontal–striatal circuitry. Neuropsychological functioning has been proposed to account for the potential associations between dopamine candidate genes and ADHD. However, to date, this mediation hypothesis has not been directly tested. Participants were 498 youth ages 6–17 years (mean M = 10.8 years, SD = 2.4 years, 55.0% male). All youth completed a multistage, multiple-informant assessment procedure to identify ADHD and non-ADHD cases, as well as a comprehensive neuropsychological battery. Youth provided a saliva sample for DNA analyses; the 480 base pair variable number of tandem repeat polymorphism of the dopamine active transporter 1 gene (DAT1) and the 120 base pair promoter polymorphism of the dopamine receptor D4 gene (DRD4) were genotyped. Multiple mediation analysis revealed significant indirect associations between DAT1 genotype and inattention, hyperactivity–impulsivity, and oppositionality, with specific indirect effects through response inhibition. The results highlight the role of neurocognitive task performance, particularly response inhibition, as a potential intermediate phenotype for ADHD, further elucidating the relationship between genetic polymorphisms and externalizing psychopathology. PMID:27049476
Serotonin transporter gene and childhood trauma--a G × E effect on anxiety sensitivity.
Klauke, Benedikt; Deckert, Jürgen; Reif, Andreas; Pauli, Paul; Zwanzger, Peter; Baumann, Christian; Arolt, Volker; Glöckner-Rist, Angelika; Domschke, Katharina
2011-12-21
Genetic factors and environmental factors are assumed to interactively influence the pathogenesis of anxiety disorders. Thus, a gene-environment interaction (G × E) study was conducted with respect to anxiety sensitivity (AS) as a promising intermediate phenotype of anxiety disorders. Healthy subjects (N = 363) were assessed for AS, childhood maltreatment (Childhood Trauma Questionnaire), and genotyped for functional serotonin transporter gene variants (5-HTTLPR/5-HTT rs25531). The influence of genetic and environmental variables on AS and its subdimensions was determined by a step-wise hierarchical regression and a multiple indicator multiple cause (MIMIC) model. A significant G × E effect of the more active 5-HTT genotypes and childhood maltreatment on AS was observed. Furthermore, genotype (LL)-childhood trauma interaction particularly influenced somatic AS subdimensions, whereas cognitive subdimensions were affected by childhood maltreatment only. Results indicate a G × E effect of the more active 5-HTT genotypes and childhood maltreatment on AS, with particular impact on its somatic subcomponent. © 2011 Wiley Periodicals, Inc.
Wang, Zhuo; Jin, Shuilin; Liu, Guiyou; Zhang, Xiurui; Wang, Nan; Wu, Deliang; Hu, Yang; Zhang, Chiping; Jiang, Qinghua; Xu, Li; Wang, Yadong
2017-05-23
The development of single-cell RNA sequencing has enabled profound discoveries in biology, ranging from the dissection of the composition of complex tissues to the identification of novel cell types and dynamics in some specialized cellular environments. However, the large-scale generation of single-cell RNA-seq (scRNA-seq) data collected at multiple time points remains a challenge to the effective measurement of gene expression patterns in transcriptome analysis. We present an algorithm based on the Dynamic Time Warping score (DTWscore) combined with time-series data that enables the detection of gene expression changes across scRNA-seq samples and recovery of potential cell types from complex mixtures of multiple cell types. The DTWscore successfully classifies cells of different types with the most highly variable genes from time-series scRNA-seq data. The study was confined to methods that are implemented and available within the R framework. Sample datasets and R packages are available at https://github.com/xiaoxiaoxier/DTWscore.
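For readers unfamiliar with the building block named in this abstract, the following is a minimal sketch of the classic dynamic time warping distance between two expression time series. It is the textbook dynamic-programming recurrence with an absolute-difference local cost, not the DTWscore statistic itself, and the function name is an assumption.

```python
def dtw_distance(a, b):
    """Classic DTW distance between two numeric sequences.

    D[i][j] holds the minimal cumulative cost of aligning a[:i] with b[:j];
    each step may advance in a, in b, or in both.
    """
    inf = float("inf")
    n, m = len(a), len(b)
    D = [[inf] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            D[i][j] = cost + min(D[i - 1][j],      # advance in a only
                                 D[i][j - 1],      # advance in b only
                                 D[i - 1][j - 1])  # advance in both
    return D[n][m]

if __name__ == "__main__":
    # The same profile sampled at different paces aligns at zero cost.
    print(dtw_distance([1, 2, 3], [1, 2, 2, 3]))
    # A constant offset of 1 across three points costs 3.
    print(dtw_distance([0, 0, 0], [1, 1, 1]))
```

Because warping absorbs differences in timing, two genes with the same temporal profile but shifted or stretched sampling score as similar, which a point-by-point Euclidean distance would not capture.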
SMARCB1/INI1 germline mutations contribute to 10% of sporadic schwannomatosis.
Rousseau, Guillaume; Noguchi, Tetsuro; Bourdon, Violaine; Sobol, Hagay; Olschwang, Sylviane
2011-01-24
Schwannomatosis is a disease characterized by multiple non-vestibular schwannomas. Although biallelic NF2 mutations are found in schwannomas, no germ line event is detected in schwannomatosis patients. In contrast, germline mutations of the SMARCB1 (INI1) tumor suppressor gene were described in familial and sporadic schwannomatosis patients. To delineate the SMARCB1 gene contribution, the nine coding exons were sequenced in a series of 56 patients affected with a variable number of non-vestibular schwannomas. Nine variants scattered along the sequence of SMARCB1 were identified. Five of them were classified as deleterious. All five patients carrying a SMARCB1 mutation had more multiple schwannomas, corresponding to 10.2% of patients with schwannomatosis. They were also diagnosed before 35 years of age. These results suggest that patients with schwannomas have a significant probability of carrying a SMARCB1 mutation. Combined with data available from other studies, they confirm the clinical indications for genetic screening of the SMARCB1 gene.
SMARCB1/INI1 germline mutations contribute to 10% of sporadic schwannomatosis
2011-01-01
Background Schwannomatosis is a disease characterized by multiple non-vestibular schwannomas. Although biallelic NF2 mutations are found in schwannomas, no germ line event is detected in schwannomatosis patients. In contrast, germline mutations of the SMARCB1 (INI1) tumor suppressor gene were described in familial and sporadic schwannomatosis patients. Methods To delineate the SMARCB1 gene contribution, the nine coding exons were sequenced in a series of 56 patients affected with a variable number of non-vestibular schwannomas. Results Nine variants scattered along the sequence of SMARCB1 were identified. Five of them were classified as deleterious. All five patients carrying a SMARCB1 mutation had more multiple schwannomas, corresponding to 10.2% of patients with schwannomatosis. They were also diagnosed before 35 years of age. Conclusions These results suggest that patients with schwannomas have a significant probability of carrying a SMARCB1 mutation. Combined with data available from other studies, they confirm the clinical indications for genetic screening of the SMARCB1 gene. PMID:21255467
Johnson, Brent A
2009-10-01
We consider estimation and variable selection in the partial linear model for censored data. The partial linear model for censored data is a direct extension of the accelerated failure time model, the latter of which is a very important alternative model to the proportional hazards model. We extend rank-based lasso-type estimators to a model that may contain nonlinear effects. Variable selection in such a partial linear model has direct application to high-dimensional survival analyses that attempt to adjust for clinical predictors. In the microarray setting, previous methods can adjust for other clinical predictors by assuming that clinical and gene expression data enter the model linearly in the same fashion. Here, we select important variables after adjusting for prognostic clinical variables, but the clinical effects are assumed to be nonlinear. Our estimator is based on stratification and can be extended naturally to account for multiple nonlinear effects. We illustrate the utility of our method through simulation studies and application to the Wisconsin prognostic breast cancer data set.
Ventura, Marco; Canchaya, Carlos; Meylan, Valèrie; Klaenhammer, Todd R.; Zink, Ralf
2003-01-01
We analyzed the tuf gene, encoding elongation factor Tu, from 33 strains representing 17 Lactobacillus species and 8 Bifidobacterium species. The tuf sequences were aligned and used to infer phylogenesis among species of lactobacilli and bifidobacteria. We demonstrated that the synonymous substitution affecting this gene renders elongation factor Tu a reliable molecular clock for investigating evolutionary distances of lactobacilli and bifidobacteria. In fact, the phylogeny generated by these tuf sequences is consistent with that derived from 16S rRNA analysis. The investigation of a multiple alignment of tuf sequences revealed regions conserved among strains belonging to the same species but distinct from those of other species. PCR primers complementary to these regions allowed species-specific identification of closely related species, such as Lactobacillus casei group members. The tuf gene-based assays developed in this study provide an alternative to present methods for the identification of lactic acid bacterial species. Since a variable number of tuf genes have been described for bacteria, the presence of multiple genes was examined. Southern analysis revealed one tuf gene in the genomes of lactobacilli and bifidobacteria, but the tuf gene was arranged differently in the genomes of these two taxa. Our results revealed that the tuf gene in bifidobacteria is flanked by the same gene constellation as the str operon, as originally reported for Escherichia coli. In contrast, bioinformatic and transcriptional analyses of the DNA region flanking the tuf gene in four Lactobacillus species indicated the same four-gene unit and suggested a novel tuf operon specific for the genus Lactobacillus. PMID:14602655
Peeling skin syndrome associated with novel variant in FLG2 gene.
Alfares, Ahmed; Al-Khenaizan, Sultan; Al Mutairi, Fuad
2017-12-01
Peeling skin syndrome is a rare genodermatosis characterized by variably pruritic, superficial, generalized peeling of the skin, with several genes involved. Until now, little has been known about the association between FLG2 and peeling skin syndrome. We describe multiple family members from a consanguineous Saudi family with peeling skin syndrome. Next Generation Sequencing identified a cosegregating novel variant in FLG2, c.632C>G (p.Ser211*), as a likely etiology in this family. Here, we report on the clinical manifestation of a homozygous loss-of-function variant in FLG2 as a disease-causing gene for peeling skin syndrome and expand the dermatology findings. © 2017 Wiley Periodicals, Inc.
A novel sodium bicarbonate cotransporter-like gene in an ancient duplicated region: SLC4A9 at 5q31
Lipovich, Leonard; Lynch, Eric D; Lee, Ming K; King, Mary-Claire
2001-01-01
Background: Sodium bicarbonate cotransporter (NBC) genes encode proteins that execute coupled Na+ and HCO3- transport across epithelial cell membranes. We report the discovery, characterization, and genomic context of a novel human NBC-like gene, SLC4A9, on chromosome 5q31. Results: SLC4A9 was initially discovered by genomic sequence annotation and further characterized by sequencing of long-insert cDNA library clones. The predicted protein of 990 amino acids has 12 transmembrane domains and high sequence similarity to other NBCs. The 23-exon gene has 14 known mRNA isoforms. In three regions, mRNA sequence variation is generated by the inclusion or exclusion of portions of an exon. Noncoding SLC4A9 cDNAs were recovered multiple times from different libraries. The 3' untranslated region is fragmented into six alternatively spliced exons and contains expressed Alu, LINE and MER repeats. SLC4A9 has two alternative stop codons and six polyadenylation sites. Its expression is largely restricted to the kidney. In silico approaches were used to characterize two additional novel SLC4A genes and to place SLC4A9 within the context of multiple paralogous gene clusters containing members of the epidermal growth factor (EGF), ankyrin (ANK) and fibroblast growth factor (FGF) families. Seven human EGF-SLC4A-ANK-FGF clusters were found. Conclusion: The novel sodium bicarbonate cotransporter-like gene SLC4A9 demonstrates abundant alternative mRNA processing. It belongs to a growing class of functionally diverse genes characterized by inefficient highly variable splicing. The evolutionary history of the EGF-SLC4A-ANK-FGF gene clusters involves multiple rounds of duplication, apparently followed by large insertions and deletions at paralogous loci and genome-wide gene shuffling. PMID:11305939
Silver, Nicholas; Cotroneo, Emanuele; Proctor, Gordon; Osailan, Samira; Paterson, Katherine L; Carpenter, Guy H
2008-01-01
Background Real-time PCR is a reliable tool with which to measure mRNA transcripts, and provides valuable information on gene expression profiles. Endogenous controls such as housekeeping genes are used to normalise mRNA levels between samples for sensitive comparisons of mRNA transcription. Selection of the most stable control gene(s) is therefore critical for the reliable interpretation of gene expression data. For the purpose of this study, 7 commonly used housekeeping genes were investigated in salivary submandibular glands under normal, inflamed, atrophic and regenerative states. Results The program NormFinder identified the suitability of HPRT to use as a single gene for normalisation within the normal, inflamed and regenerative states, and GAPDH in the atrophic state. For normalisation to multiple housekeeping genes, for each individual state, the optimal number of housekeeping genes as given by geNorm was: ACTB/UBC in the normal, ACTB/YWHAZ in the inflamed, ACTB/HPRT in the atrophic and ACTB/GAPDH in the regenerative state. The most stable housekeeping gene identified between states (compared to normal) was UBC. However, ACTB, identified as one of the most stably expressed genes within states, was found to be one of the most variable between states. Furthermore we demonstrated that normalising between states to ACTB, rather than UBC, introduced an approximately 3 fold magnitude of error. Conclusion Using NormFinder, our studies demonstrated the suitability of HPRT to use as a single gene for normalisation within the normal, inflamed and regenerative groups and GAPDH in the atrophic group. However, if normalising to multiple housekeeping genes, we recommend normalising to those identified by geNorm. For normalisation across the physiological states, we recommend the use of UBC. PMID:18637167
A search for association between schizophrenia and dopamine-related alleles.
Jönsson, E; Brené, S; Geijer, T; Terenius, L; Tylec, A; Persson, M L; Sedvall, G
1996-01-01
Dopamine receptor dysfunction and altered tyrosine hydroxylase activity have both been implicated in the pathophysiology of schizophrenia. Schizophrenic patients and control subjects were examined for allele frequencies in the tyrosine hydroxylase and dopamine D2 and D4 receptor genes. No significant differences of allele or genotype frequencies were found between the two groups after adjustment for multiple comparisons. Neither were any significant relationships observed between allele frequencies and a number of clinical variables within the schizophrenic subsample. When no adjustment was made for multiple testing a few significant tendencies were obtained which warrant further research in extended patient and control materials. The results are compatible with the view that the tyrosine hydroxylase, dopamine receptor D2 and D4 gene polymorphisms examined are not of major importance in the aetiology or pathophysiology of schizophrenia.
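The abstract above does not state which multiple-comparison adjustment was used. As a minimal sketch, assuming a Bonferroni or Holm correction (an assumption, not the authors' documented procedure), the Python functions below show how raw p-values that look significant in isolation can lose significance after adjustment. The p-values are hypothetical and are not data from the study.

```python
def bonferroni(p_values):
    """Bonferroni adjustment: multiply each p-value by the number of tests."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


def holm(p_values):
    """Holm step-down adjustment, returned in the original input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        # The k-th smallest p-value (rank = k - 1) is scaled by (m - k + 1).
        value = min(1.0, (m - rank) * p_values[idx])
        # Enforce monotonicity so adjusted values never decrease with rank.
        running_max = max(running_max, value)
        adjusted[idx] = running_max
    return adjusted


# Hypothetical raw p-values from four allele-frequency comparisons.
raw = [0.01, 0.04, 0.03, 0.20]
alpha = 0.05

unadjusted_hits = [p < alpha for p in raw]
bonferroni_hits = [p < alpha for p in bonferroni(raw)]
holm_hits = [p < alpha for p in holm(raw)]

print(sum(unadjusted_hits), sum(bonferroni_hits), sum(holm_hits))
```

With these made-up values, three comparisons fall below 0.05 before adjustment and only one does afterwards under either method, which mirrors the pattern the abstract describes for unadjusted versus adjusted testing.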
Pounds, Stan; Cao, Xueyuan; Cheng, Cheng; Yang, Jun; Campana, Dario; Evans, William E.; Pui, Ching-Hon; Relling, Mary V.
2010-01-01
Powerful methods for integrated analysis of multiple biological data sets are needed to maximize interpretation capacity and acquire meaningful knowledge. We recently developed Projection Onto the Most Interesting Statistical Evidence (PROMISE). PROMISE is a statistical procedure that incorporates prior knowledge about the biological relationships among endpoint variables into an integrated analysis of microarray gene expression data with multiple biological and clinical endpoints. Here, PROMISE is adapted to the integrated analysis of pharmacologic, clinical, and genome-wide genotype data, incorporating knowledge about the biological relationships among pharmacologic and clinical response data. An efficient permutation-testing algorithm is introduced so that statistical calculations are computationally feasible in this higher-dimension setting. The new method is applied to a pediatric leukemia data set. The results clearly indicate that PROMISE is a powerful statistical tool for identifying genomic features that exhibit a biologically meaningful pattern of association with multiple endpoint variables. PMID:21516175
Fast and robust group-wise eQTL mapping using sparse graphical models.
Cheng, Wei; Shi, Yu; Zhang, Xiang; Wang, Wei
2015-01-16
Genome-wide expression quantitative trait loci (eQTL) studies have emerged as a powerful tool to understand the genetic basis of gene expression and complex traits. The traditional eQTL methods focus on testing the associations between individual single-nucleotide polymorphisms (SNPs) and gene expression traits. A major drawback of this approach is that it cannot model the joint effect of a set of SNPs on a set of genes, which may correspond to hidden biological pathways. We introduce a new approach to identify novel group-wise associations between sets of SNPs and sets of genes. Such associations are captured by hidden variables connecting SNPs and genes. Our model is a linear-Gaussian model and uses two types of hidden variables. One captures the set associations between SNPs and genes, and the other captures confounders. We develop an efficient optimization procedure which makes this approach suitable for large scale studies. Extensive experimental evaluations on both simulated and real datasets demonstrate that the proposed methods can effectively capture both individual and group-wise signals that cannot be identified by the state-of-the-art eQTL mapping methods. Considering group-wise associations significantly improves the accuracy of eQTL mapping, and the successful multi-layer regression model opens a new approach to understand how multiple SNPs interact with each other to jointly affect the expression level of a group of genes.
Maho, Angaya; Rossano, Alexandra; Hächler, Herbert; Holzer, Anita; Schelling, Esther; Zinsstag, Jakob; Hassane, Mahamat H.; Toguebaye, Bhen S.; Akakpo, Ayayi J.; Van Ert, Matthew; Keim, Paul; Kenefic, Leo; Frey, Joachim; Perreten, Vincent
2006-01-01
We genotyped 15 Bacillus anthracis isolates from Chad, Africa, using multiple-locus variable-number tandem repeat analysis and three additional direct-repeat markers. We identified two unique genotypes that represent a novel genetic lineage in the A cluster. Chadian isolates were susceptible to 11 antibiotics and free of 94 antibiotic resistance genes. PMID:16954291
Taddei, Lucilla; Stella, Giulio Rocco; Rogato, Alessandra; Bailleul, Benjamin; Fortunato, Antonio Emidio; Annunziata, Rossella; Sanges, Remo; Thaler, Michael; Lepetit, Bernard; Lavaud, Johann; Jaubert, Marianne; Finazzi, Giovanni; Bouly, Jean-Pierre; Falciatore, Angela
2016-01-01
Diatoms are phytoplanktonic organisms that grow successfully in the ocean where light conditions are highly variable. Studies of the molecular mechanisms of light acclimation in the marine diatom Phaeodactylum tricornutum show that carotenoid de-epoxidation enzymes and LHCX1, a member of the light-harvesting protein family, both contribute to dissipate excess light energy through non-photochemical quenching (NPQ). In this study, we investigate the role of the other members of the LHCX family in diatom stress responses. Our analysis of available genomic data shows that the presence of multiple LHCX genes is a conserved feature of diatom species living in different ecological niches. Moreover, an analysis of the levels of four P. tricornutum LHCX transcripts in relation to protein expression and photosynthetic activity indicates that LHCXs are differentially regulated under different light intensities and nutrient starvation, mostly modulating NPQ capacity. We conclude that multiple abiotic stress signals converge to regulate the LHCX content of cells, providing a way to fine-tune light harvesting and photoprotection. Moreover, our data indicate that the expansion of the LHCX gene family reflects functional diversification of its members which could benefit cells responding to highly variable ocean environments. PMID:27225826
An etiologic classification of autism spectrum disorders.
Gabis, Lidia V; Pomeroy, John
2014-05-01
Autism spectrum disorders (ASD) represent a common phenotype related to multiple etiologies, such as genetic, brain injury (e.g., prematurity), environmental (e.g., viral, toxic), multiple or unknown causes. The objective was to devise a clinical classification of children diagnosed with ASD according to etiologic workup. Children diagnosed with ASD (n = 436) from two databases were divided into symptomatic, cryptogenic or idiopathic groups, and variables within each database and diagnostic category were compared. By analyzing the two separate databases, 5.4% of the children were classified as symptomatic, 27% as cryptogenic and 67.75% as idiopathic. Among other findings, the entire symptomatic group demonstrated language delays, but almost none showed evidence of regression. Our results indicate similarities between the idiopathic and cryptogenic subgroups in most of the examined variables, and mutual differences from the symptomatic subgroup. The similarities between the first two subgroups support prior evidence that most perinatal factors and minor physical anomalies do not contribute to the development of core symptoms of autism. Differences in gender and clinical and diagnostic features were found when etiology was used to create subtypes of ASD. This classification could have heuristic importance in the search for an autism gene(s).
NASA Astrophysics Data System (ADS)
Corcoran, Martin M.; Phad, Ganesh E.; Bernat, Néstor Vázquez; Stahl-Hennig, Christiane; Sumida, Noriyuki; Persson, Mats A. A.; Martin, Marcel; Hedestam, Gunilla B. Karlsson
2016-12-01
Comprehensive knowledge of immunoglobulin genetics is required to advance our understanding of B cell biology. Validated immunoglobulin variable (V) gene databases are close to completion only for human and mouse. We present a novel computational approach, IgDiscover, that identifies germline V genes from expressed repertoires to a specificity of 100%. IgDiscover uses a cluster identification process to produce candidate sequences that, once filtered, results in individualized germline V gene databases. IgDiscover was tested in multiple species, validated by genomic cloning and cross library comparisons and produces comprehensive gene databases even where limited genomic sequence is available. IgDiscover analysis of the allelic content of the Indian and Chinese-origin rhesus macaques reveals high levels of immunoglobulin gene diversity in this species. Further, we describe a novel human IGHV3-21 allele and confirm significant gene differences between Balb/c and C57BL6 mouse strains, demonstrating the power of IgDiscover as a germline V gene discovery tool.
Corcoran, Martin M.; Phad, Ganesh E.; Bernat, Néstor Vázquez; Stahl-Hennig, Christiane; Sumida, Noriyuki; Persson, Mats A.A.; Martin, Marcel; Hedestam, Gunilla B. Karlsson
2016-01-01
Comprehensive knowledge of immunoglobulin genetics is required to advance our understanding of B cell biology. Validated immunoglobulin variable (V) gene databases are close to completion only for human and mouse. We present a novel computational approach, IgDiscover, that identifies germline V genes from expressed repertoires to a specificity of 100%. IgDiscover uses a cluster identification process to produce candidate sequences that, once filtered, results in individualized germline V gene databases. IgDiscover was tested in multiple species, validated by genomic cloning and cross library comparisons and produces comprehensive gene databases even where limited genomic sequence is available. IgDiscover analysis of the allelic content of the Indian and Chinese-origin rhesus macaques reveals high levels of immunoglobulin gene diversity in this species. Further, we describe a novel human IGHV3-21 allele and confirm significant gene differences between Balb/c and C57BL6 mouse strains, demonstrating the power of IgDiscover as a germline V gene discovery tool. PMID:27995928
Moorthy, Sakthi D.; Davidson, Scott; Shchuka, Virlana M.; Singh, Gurdeep; Malek-Gilani, Nakisa; Langroudi, Lida; Martchenko, Alexandre; So, Vincent; Macpherson, Neil N.; Mitchell, Jennifer A.
2017-01-01
Transcriptional enhancers are critical for maintaining cell-type–specific gene expression and driving cell fate changes during development. Highly transcribed genes are often associated with a cluster of individual enhancers such as those found in locus control regions. Recently, these have been termed stretch enhancers or super-enhancers, which have been predicted to regulate critical cell identity genes. We employed a CRISPR/Cas9-mediated deletion approach to study the function of several enhancer clusters (ECs) and isolated enhancers in mouse embryonic stem (ES) cells. Our results reveal that the effect of deleting ECs, also classified as ES cell super-enhancers, is highly variable, resulting in target gene expression reductions ranging from 12% to as much as 92%. Partial deletions of these ECs which removed only one enhancer or a subcluster of enhancers revealed partially redundant control of the regulated gene by multiple enhancers within the larger cluster. Many highly transcribed genes in ES cells are not associated with a super-enhancer; furthermore, super-enhancer predictions ignore 81% of the potentially active regulatory elements predicted by cobinding of five or more pluripotency-associated transcription factors. Deletion of these additional enhancer regions revealed their robust regulatory role in gene transcription. In addition, select super-enhancers and enhancers were identified that regulated clusters of paralogous genes. We conclude that, whereas robust transcriptional output can be achieved by an isolated enhancer, clusters of enhancers acting on a common target gene act in a partially redundant manner to fine tune transcriptional output of their target genes. PMID:27895109
Resurgence of Pertussis and Emergence of the Ptxp3 Toxin Promoter Allele in South Italy.
Loconsole, Daniela; De Robertis, Anna Lisa; Morea, Anna; Metallo, Angela; Lopalco, Pier Luigi; Chironna, Maria
2018-05-01
Despite universal immunization programs, pertussis remains a major public health concern. This study aimed to describe the pertussis epidemiology in the Puglia region in 2006-2015 and to identify recent polymorphisms in Bordetella pertussis virulence-associated genes. The pertussis cases in 2006-2015 were identified from the National Hospital Discharge Database and the Information System of Infectious Diseases. Samples of pertussis cases in 2014-2016 that were confirmed by the Regional Reference Laboratory were subjected to ptxA, ptxP and prn gene sequencing and, in 10 cases, multiple-locus variable-number tandem repeat analysis. In Puglia in 2006-2015, the pertussis incidence rose from an average of 1.39/100,000 inhabitants in 2006-2013 to 2.56-2.54/100,000 in 2014-2015. In infants <1 year of age, the incidence rose from an average of 60.4/100,000 infants in 2006-2013 to 149.9/100,000 in 2015. Of the 661 cases recorded in 2006-2015, 80.3% required hospitalization; of these, 45.4% were <1 year of age. Of the 80 sequenced samples, the allelic profile ptxA1-ptxP3-prn2 was detected in 74. This variant was detected in both vaccinated and unvaccinated people. Six Bordetella pertussis samples were prn deficient. The cases examined by multiple-locus variable-number tandem repeat analysis exhibited type 27. The pertussis incidence in Puglia has risen. The hypervirulent strain was also found in vaccinated people. This suggests bacterial adaptation to the vaccine and raises questions about acellular vaccine effectiveness. Prevention of infant pertussis cases is best achieved by immunizing the pregnant mother. Surveillance and systematic laboratory confirmation of pertussis should be enhanced in Italy.
Gene genealogies for genetic association mapping, with application to Crohn's disease
Burkett, Kelly M.; Greenwood, Celia M. T.; McNeney, Brad; Graham, Jinko
2013-01-01
A gene genealogy describes relationships among haplotypes sampled from a population. Knowledge of the gene genealogy for a set of haplotypes is useful for estimation of population genetic parameters and it also has potential application in finding disease-predisposing genetic variants. As the true gene genealogy is unknown, Markov chain Monte Carlo (MCMC) approaches have been used to sample genealogies conditional on data at multiple genetic markers. We previously implemented an MCMC algorithm to sample from an approximation to the distribution of the gene genealogy conditional on haplotype data. Our approach samples ancestral trees, recombination and mutation rates at a genomic focal point. In this work, we describe how our sampler can be used to find disease-predisposing genetic variants in samples of cases and controls. We use a tree-based association statistic that quantifies the degree to which case haplotypes are more closely related to each other around the focal point than control haplotypes, without relying on a disease model. As the ancestral tree is a latent variable, so is the tree-based association statistic. We show how the sampler can be used to estimate the posterior distribution of the latent test statistic and corresponding latent p-values, which together comprise a fuzzy p-value. We illustrate the approach on a publicly-available dataset from a study of Crohn's disease that consists of genotypes at multiple SNP markers in a small genomic region. We estimate the posterior distribution of the tree-based association statistic and the recombination rate at multiple focal points in the region. Reassuringly, the posterior mean recombination rates estimated at the different focal points are consistent with previously published estimates. The tree-based association approach finds multiple sub-regions where the case haplotypes are more genetically related than the control haplotypes, and suggests that there may be one or multiple disease-predisposing loci. PMID:24348515
The evolution of Dscam genes across the arthropods.
Armitage, Sophie A O; Freiburg, Rebecca Y; Kurtz, Joachim; Bravo, Ignacio G
2012-04-13
One way of creating phenotypic diversity is through alternative splicing of precursor mRNAs. A gene that has evolved a hypervariable form is Down syndrome cell adhesion molecule (Dscam-hv), which in Drosophila melanogaster can produce thousands of isoforms via mutually exclusive alternative splicing. The extracellular region of this protein is encoded by three variable exon clusters, each containing multiple exon variants. The protein is vital for neuronal wiring where the extreme variability at the somatic level is required for axonal guidance, and it plays a role in immunity where the variability has been hypothesised to relate to recognition of different antigens. Dscam-hv has been found across the Pancrustacea. Additionally, three paralogous non-hypervariable Dscam-like genes have also been described for D. melanogaster. Here we took a bioinformatics approach, building profile Hidden Markov Models to search across species for putative orthologs to the Dscam genes and for hypervariable alternatively spliced exons, and inferring the phylogenetic relationships among them. Our aims were to examine whether Dscam orthologs exist outside the Bilateria, whether the origin of Dscam-hv could lie outside the Pancrustacea, when the Dscam-like orthologs arose, how many alternatively spliced exons of each exon cluster were present in the most recent common ancestor, and how these clusters evolved. Our results suggest that the origin of Dscam genes may lie after the split between the Cnidaria and the Bilateria and support the hypothesis that Dscam-hv originated in the common ancestor of the Pancrustacea. Our phylogeny of Dscam gene family members shows six well-supported clades: five containing Dscam-like genes and one containing all the Dscam-hv genes; a seventh clade contains arachnid putative Dscam genes. Furthermore, the exon clusters appear to have experienced different evolutionary histories.
Dscam genes have undergone independent duplication events in the insects and in an arachnid genome, which adds to the more well-known tandem duplications that have taken place within Dscam-hv genes. Therefore, two forms of gene expansion seem to be active within this gene family. The evolutionary history of this dynamic gene family will be further unfolded as genomes of species from more disparate groups become available.
The evolution of Dscam genes across the arthropods
2012-01-01
Background One way of creating phenotypic diversity is through alternative splicing of precursor mRNAs. A gene that has evolved a hypervariable form is Down syndrome cell adhesion molecule (Dscam-hv), which in Drosophila melanogaster can produce thousands of isoforms via mutually exclusive alternative splicing. The extracellular region of this protein is encoded by three variable exon clusters, each containing multiple exon variants. The protein is vital for neuronal wiring where the extreme variability at the somatic level is required for axonal guidance, and it plays a role in immunity where the variability has been hypothesised to relate to recognition of different antigens. Dscam-hv has been found across the Pancrustacea. Additionally, three paralogous non-hypervariable Dscam-like genes have also been described for D. melanogaster. Here we took a bioinformatics approach, building profile Hidden Markov Models to search across species for putative orthologs to the Dscam genes and for hypervariable alternatively spliced exons, and inferring the phylogenetic relationships among them. Our aims were to examine whether Dscam orthologs exist outside the Bilateria, whether the origin of Dscam-hv could lie outside the Pancrustacea, when the Dscam-like orthologs arose, how many alternatively spliced exons of each exon cluster were present in the most recent common ancestor, and how these clusters evolved. Results Our results suggest that the origin of Dscam genes may lie after the split between the Cnidaria and the Bilateria and support the hypothesis that Dscam-hv originated in the common ancestor of the Pancrustacea. Our phylogeny of Dscam gene family members shows six well-supported clades: five containing Dscam-like genes and one containing all the Dscam-hv genes; a seventh clade contains arachnid putative Dscam genes. Furthermore, the exon clusters appear to have experienced different evolutionary histories.
Conclusions Dscam genes have undergone independent duplication events in the insects and in an arachnid genome, which adds to the more well-known tandem duplications that have taken place within Dscam-hv genes. Therefore, two forms of gene expansion seem to be active within this gene family. The evolutionary history of this dynamic gene family will be further unfolded as genomes of species from more disparate groups become available. PMID:22500922
Chowdhury, Nilotpal; Sapru, Shantanu
2015-01-01
Introduction Microarray analysis has revolutionized the role of genomic prognostication in breast cancer. However, most studies are single series studies, and suffer from methodological problems. We sought to use a meta-analytic approach in combining multiple publicly available datasets, while correcting for batch effects, to reach a more robust oncogenomic analysis. Aim The aim of the present study was to find gene sets associated with distant metastasis free survival (DMFS) in systemically untreated, node-negative breast cancer patients, from publicly available genomic microarray datasets. Methods Four microarray series (having 742 patients) were selected after a systematic search and combined. Cox regression for each gene was done for the combined dataset (univariate, as well as multivariate – adjusted for expression of Cell cycle related genes) and for the 4 major molecular subtypes. The centre and microarray batch effects were adjusted by including them as random effects variables. The Cox regression coefficients for each analysis were then ranked and subjected to a Gene Set Enrichment Analysis (GSEA). Results Gene sets representing protein translation were independently negatively associated with metastasis in the Luminal A and Luminal B subtypes, but positively associated with metastasis in Basal tumors. Proteinaceous extracellular matrix (ECM) gene set expression was positively associated with metastasis, after adjustment for expression of cell cycle related genes on the combined dataset. Finally, the positive association of the proliferation-related genes with metastases was confirmed. Conclusion To the best of our knowledge, the results depicting mixed prognostic significance of protein translation in breast cancer subtypes are being reported for the first time. We attribute this to our study combining multiple series and performing a more robust meta-analytic Cox regression modeling on the combined dataset, thus discovering 'hidden' associations. 
This methodology seems to yield new and interesting results and may be used as a tool to guide new research. PMID:26080057
Zhang, Wensheng; Edwards, Andrea; Zhu, Dongxiao; Flemington, Erik K.; Deininger, Prescott; Zhang, Kun
2012-01-01
In metazoans, miRNAs regulate gene expression primarily through binding to target sites in the 3′ UTRs (untranslated regions) of messenger RNAs (mRNAs). Cis-acting variants within, or close to, a gene are crucial in explaining the variability of gene expression measures. Single nucleotide polymorphisms (SNPs) in the 3′ UTRs of genes can affect the base-pairing between miRNAs and mRNAs, and hence disrupt existing target sites (in the reference sequence) or create novel target sites, suggesting a possible mechanism for cis regulation of gene expression. Moreover, because the alleles of different SNPs within a DNA sequence of limited length tend to be in strong linkage disequilibrium (LD), we hypothesize the variants of miRNA target sites caused by SNPs potentially function as bridges linking the documented cis-SNP markers to the expression of the associated genes. A large-scale analysis was herein performed to test this hypothesis. By systematically integrating multiple latest information sources, we found 21 significant gene-level SNP-involved miRNA-mediated post-transcriptional regulation modules (SNP-MPRMs) in the form of SNP-miRNA-mRNA triplets in lymphocyte cell lines for the CEU and YRI populations. Among the cognate genes, six including ALG8, DGKE, GNA12, KLF11, LRPAP1, and MMAB are related to multiple genetic diseases such as depressive disorder and Type-II diabetes. Furthermore, we found that ∼35% of the documented transcript intensity-related cis-SNPs (∼950) in a recent publication are identical to, or in significant linkage disequilibrium (LD) (p<0.01) with, one or multiple SNPs located in miRNA target sites. Based on these associations (or identities), 69 significant exon-level SNP-MPRMs and 12 disease genes were further determined for two populations. These results provide concrete in silico evidence for the proposed hypothesis. The discovered modules warrant additional follow-up in independent laboratory studies. PMID:22348086
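The linkage disequilibrium between two SNPs mentioned above is commonly summarized by the squared correlation r² between alleles. The following is a minimal sketch, assuming phased haplotype counts for two biallelic SNPs are available; the function name and the counts are hypothetical and are not taken from the study, which reports LD significance (p<0.01) rather than an r² threshold.

```python
def ld_r2(n_AB, n_Ab, n_aB, n_ab):
    """Squared allelic correlation (r^2) between two biallelic SNPs.

    Arguments are counts of the four phased haplotypes (hypothetical input).
    """
    n = n_AB + n_Ab + n_aB + n_ab
    p_AB = n_AB / n                      # frequency of the A-B haplotype
    p_A = (n_AB + n_Ab) / n              # frequency of allele A at SNP 1
    p_B = (n_AB + n_aB) / n              # frequency of allele B at SNP 2
    d = p_AB - p_A * p_B                 # disequilibrium coefficient D
    denom = p_A * (1 - p_A) * p_B * (1 - p_B)
    if denom == 0:
        raise ValueError("r^2 is undefined when a SNP is monomorphic")
    return d * d / denom


perfect = ld_r2(50, 0, 0, 50)       # alleles always travel together
independent = ld_r2(25, 25, 25, 25) # alleles combine at random
```

An r² near 1 means a cis-SNP marker is almost interchangeable with the SNP in the miRNA target site; an r² near 0 means the two vary independently.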
Ander, Bradley P.; Zhang, Xiaoshuai; Xue, Fuzhong; Sharp, Frank R.; Yang, Xiaowei
2013-01-01
The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses have difficulty incorporating correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with ‘large p, small n’ problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point of view of systems biology, iBVS enables users to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilistic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed. PMID:23844055
Peng, Bin; Zhu, Dianwen; Ander, Bradley P; Zhang, Xiaoshuai; Xue, Fuzhong; Sharp, Frank R; Yang, Xiaowei
2013-01-01
The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses have difficulty incorporating correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with 'large p, small n' problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point of view of systems biology, iBVS enables users to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilistic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed.
Carvalho, Claudia M B; Vasanth, Shivakumar; Shinawi, Marwan; Russell, Chad; Ramocki, Melissa B; Brown, Chester W; Graakjaer, Jesper; Skytte, Anne-Bine; Vianna-Morgante, Angela M; Krepischi, Ana C V; Patel, Gayle S; Immken, LaDonna; Aleck, Kyrieckos; Lim, Cynthia; Cheung, Sau Wai; Rosenberg, Carla; Katsanis, Nicholas; Lupski, James R
2014-11-06
The 17p13.1 microdeletion syndrome is a recently described genomic disorder with a core clinical phenotype of intellectual disability, poor to absent speech, dysmorphic features, and a constellation of more variable clinical features, most prominently microcephaly. We identified five subjects with copy-number variants (CNVs) on 17p13.1 for whom we performed detailed clinical and molecular studies. Breakpoint mapping and retrospective analysis of published cases refined the smallest region of overlap (SRO) for microcephaly to a genomic interval containing nine genes. Dissection of this phenotype in zebrafish embryos revealed a complex genetic architecture: dosage perturbation of four genes (ASGR1, ACADVL, DVL2, and GABARAP) impeded neurodevelopment, and decreased dosage of the same loci caused a reduced mitotic index in vitro. Moreover, epistatic analyses in vivo showed that dosage perturbations of discrete gene pairings induce microcephaly. Taken together, these studies support a model in which concomitant dosage perturbation of multiple genes within the CNV drives the microcephaly and possibly other neurodevelopmental phenotypes associated with rearrangements in the 17p13.1 SRO. Copyright © 2014 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Genome complexity in the coelacanth is reflected in its adaptive immune system
Saha, Nil Ratan; Ota, Tatsuya; Litman, Gary W.; Hansen, John; Parra, Zuly; Hsu, Ellen; Buonocore, Francesco; Canapa, Adriana; Cheng, Jan-Fang; Amemiya, Chris T.
2014-01-01
We have analyzed the available genome and transcriptome resources from the coelacanth in order to characterize genes involved in adaptive immunity. Two highly distinctive IgW-encoding loci have been identified that exhibit a unique genomic organization, including a multiplicity of tandemly repeated constant region exons. The overall organization of the IgW loci precludes typical heavy chain class switching. A locus encoding IgM could not be identified either computationally or by using several different experimental strategies. Four distinct sets of genes encoding Ig light chains were identified. These include a variant sigma-type Ig light chain previously identified only in cartilaginous fishes, which is now provisionally denoted sigma-2. Genes encoding α/β and γ/δ T-cell receptors, and CD3, CD4, and CD8 co-receptors also were characterized. Ig heavy chain variable region genes and TCR components are interspersed within the TCR α/δ locus; this organization previously was reported only in tetrapods and raises questions regarding evolution and functional cooption of genes encoding variable regions. The composition, organization and syntenic conservation of the major histocompatibility complex locus have been characterized. We also identified large numbers of genes encoding cytokines and their receptors, and other genes associated with adaptive immunity. In terms of sequence identity and organization, the adaptive immune genes of the coelacanth more closely resemble orthologous genes in tetrapods than those in teleost fishes, consistent with current phylogenomic interpretations. Overall, the work described herein highlights the complexity inherent in the coelacanth genome and provides a rich catalog of immune genes for future investigations.
Soneson, Charlotte; Fontes, Magnus
2012-01-01
Analysis of multivariate data sets from, for example, microarray studies frequently results in lists of genes which are associated with some response of interest. The biological interpretation is often complicated by the statistical instability of the obtained gene lists, which may partly be due to the functional redundancy among genes, implying that multiple genes can play exchangeable roles in the cell. In this paper, we use the concept of exchangeability of random variables to model this functional redundancy and thereby account for the instability. We present a flexible framework to incorporate the exchangeability into the representation of lists. The proposed framework supports straightforward comparison between any 2 lists. It can also be used to generate new more stable gene rankings incorporating more information from the experimental data. Using 2 microarray data sets, we show that the proposed method provides more robust gene rankings than existing methods with respect to sampling variations, without compromising the biological significance of the rankings.
Linkage analysis of schizophrenia with five dopamine receptor genes in nine pedigrees
DOE Office of Scientific and Technical Information (OSTI.GOV)
Coon, H.; Byerley, W.; Holik, J.
Alterations in dopamine neurotransmission have been strongly implicated in the pathogenesis of schizophrenia for nearly 2 decades. Recently, the genes for five dopamine receptors have been cloned and characterized, and genetic and physical map information has become available. Using these five loci as candidate genes, the authors have tested for genetic linkage to schizophrenia in nine multigenerational families which include multiple affected individuals. In addition to testing conservative disease models, they have used a neurophysiological indicator variable, the P50 auditory evoked response. Deficits in gating of the P50 response have been shown to segregate with schizophrenia in this sample and may identify carriers of gene(s) predisposing for schizophrenia. Linkage results were consistently negative, indicating that a defect at any of the actual receptor sites is unlikely to be a major contributor to schizophrenia in the nine families studied. 47 refs., 1 fig., 4 tabs.
Combinatorial interaction between CCM pathway genes precipitates hemorrhagic stroke.
Gore, Aniket V; Lampugnani, Maria Grazia; Dye, Louis; Dejana, Elisabetta; Weinstein, Brant M
2008-01-01
Intracranial hemorrhage (ICH) is a particularly severe form of stroke whose etiology remains poorly understood, with a highly variable appearance and onset of the disease (Felbor et al., 2006; Frizzell, 2005; Lucas et al., 2003). In humans, mutations in any one of three CCM genes cause an autosomal dominant genetic ICH disorder characterized by cerebral cavernous malformations (CCM). Recent evidence highlighting multiple interactions between the three CCM gene products and other proteins regulating endothelial junctional integrity suggests that minor deficits in these other proteins could potentially predispose to, or help to initiate, CCM, and that combinations of otherwise silent genetic deficits in both the CCM and interacting proteins might explain some of the variability in penetrance and expressivity of human ICH disorders. Here, we test this idea by combined knockdown of CCM pathway genes in zebrafish. Reducing the function of rap1b, which encodes a Ras GTPase effector protein for CCM1/Krit1, disrupts endothelial junctions in vivo and in vitro, showing it is a crucial player in the CCM pathway. Importantly, a minor reduction of Rap1b in combination with similar reductions in the products of other CCM pathway genes results in a high incidence of ICH. These findings support the idea that minor polygenic deficits in the CCM pathway can strongly synergize to initiate ICH.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gihring, Thomas; Green, Stefan; Schadt, Christopher Warren
2011-01-01
Technologies for massively parallel sequencing are revolutionizing microbial ecology and are vastly increasing the scale of ribosomal RNA (rRNA) gene studies. Although pyrosequencing has increased the breadth and depth of possible rRNA gene sampling, one drawback is that the number of reads obtained per sample is difficult to control. Pyrosequencing libraries typically vary widely in the number of sequences per sample, even within individual studies, and there is a need to revisit the behaviour of richness estimators and diversity indices with variable gene sequence library sizes. Multiple reports and review papers have demonstrated the bias in non-parametric richness estimators (e.g. Chao1 and ACE) and diversity indices when using clone libraries. However, we found that biased community comparisons are accumulating in the literature. Here we demonstrate the effects of sample size on Chao1, ACE, CatchAll, Shannon, Chao-Shen and Simpson's estimations specifically using pyrosequencing libraries. The need to equalize the number of reads being compared across libraries is reiterated, and investigators are directed towards available tools for making unbiased diversity comparisons.
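To illustrate why read depth has to be equalized before richness is compared, here is a minimal sketch of the bias-corrected Chao1 estimator together with random subsampling (rarefaction) to a common depth. The function names, the seed and the example counts are illustrative assumptions; this is not the tooling used or recommended by the authors.

```python
import random
from collections import Counter


def chao1(counts):
    """Bias-corrected Chao1 richness estimate from per-taxon read counts."""
    counts = [c for c in counts if c > 0]
    s_obs = len(counts)                      # observed number of taxa
    f1 = sum(1 for c in counts if c == 1)    # singletons
    f2 = sum(1 for c in counts if c == 2)    # doubletons
    return s_obs + f1 * (f1 - 1) / (2 * (f2 + 1))


def rarefy(counts, depth, seed=0):
    """Subsample `depth` reads without replacement; return new taxon counts.

    `depth` must not exceed the total number of reads in the library.
    """
    reads = [taxon for taxon, c in enumerate(counts) for _ in range(c)]
    rng = random.Random(seed)                # fixed seed for repeatability
    sample = rng.sample(reads, depth)
    return list(Counter(sample).values())


# Hypothetical libraries of very different size.
large_library = [400, 250, 120, 60, 30, 10, 5, 2, 2, 1, 1, 1]
small_library = [40, 25, 12, 6, 3, 1, 1]

# Compare both libraries at the depth of the smaller one.
depth = min(sum(large_library), sum(small_library))
richness_large = chao1(rarefy(large_library, depth))
richness_small = chao1(rarefy(small_library, depth))
```

Comparing `chao1(large_library)` directly with `chao1(small_library)` would confound community differences with sequencing effort, which is the bias the abstract warns about.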
Regularized rare variant enrichment analysis for case-control exome sequencing data.
Larson, Nicholas B; Schaid, Daniel J
2014-02-01
Rare variants have recently garnered an immense amount of attention in genetic association analysis. However, unlike methods traditionally used for single marker analysis in GWAS, rare variant analysis often requires some method of aggregation, since single marker approaches are poorly powered for typical sequencing study sample sizes. Advancements in sequencing technologies have rendered next-generation sequencing platforms a realistic alternative to traditional genotyping arrays. Exome sequencing in particular not only provides base-level resolution of genetic coding regions, but also a natural paradigm for aggregation via genes and exons. Here, we propose the use of penalized regression in combination with variant aggregation measures to identify rare variant enrichment in exome sequencing data. In contrast to marginal gene-level testing, we simultaneously evaluate the effects of rare variants in multiple genes, focusing on gene-based least absolute shrinkage and selection operator (LASSO) and exon-based sparse group LASSO models. By using gene membership as a grouping variable, the sparse group LASSO can be used as a gene-centric analysis of rare variants while also providing a penalized approach toward identifying specific regions of interest. We apply extensive simulations to evaluate the performance of these approaches with respect to specificity and sensitivity, comparing these results to multiple competing marginal testing methods. Finally, we discuss our findings and outline future research. © 2013 WILEY PERIODICALS, INC.
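The penalized-regression idea can be sketched with a plain LASSO fitted by coordinate descent, where each column is one gene-level aggregate of rare variants. This is only an illustration under simplifying assumptions: it uses squared-error loss rather than a case-control (logistic) likelihood, it omits the sparse group penalty, and the data and function names are hypothetical.

```python
def soft_threshold(z, lam):
    """Shrink z toward zero by lam; values within [-lam, lam] become 0."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def lasso_cd(X, y, lam, n_iter=200):
    """Minimize (1/2n)*||y - X b||^2 + lam*||b||_1 by coordinate descent."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            rho = 0.0  # covariance of column j with the partial residual
            z = 0.0    # mean squared value of column j
            for i in range(n):
                pred_without_j = sum(X[i][k] * beta[k] for k in range(p) if k != j)
                rho += X[i][j] * (y[i] - pred_without_j)
                z += X[i][j] ** 2
            rho /= n
            z /= n
            beta[j] = soft_threshold(rho, lam) / z if z > 0 else 0.0
    return beta


# Hypothetical gene-level burden scores: rows are subjects, columns are genes.
X = [[1, 0], [0, 1], [1, 0], [0, 1]]
y = [2, 0, 2, 0]
beta = lasso_cd(X, y, lam=0.1)
```

In this toy input only the first gene tracks the outcome, so its coefficient is shrunk but retained while the second is set exactly to zero. That exact zeroing is what lets a penalized fit evaluate many genes jointly and still return a short list.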
He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei
2016-01-01
WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. oleracea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to all three stresses simultaneously, which suggests that these three WRKYs may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. 
We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions, indicating a role for tandem duplicate WRKYs in the adaptive responses to environmental stimuli during the evolution process. Our results provide a framework for future studies regarding the function of WRKY genes in response to stress in B. napus. PMID:27322342
IRAK1 variant is protective for orthodontic-induced external apical root resorption.
Pereira, S; Nogueira, L; Canova, F; Lopez, M; Silva, H C
2016-10-01
Interleukin-1 beta (IL1B) pathway is a key player in orthodontic-induced external apical root resorption (EARR). The aim of this work was to identify the genes related to the IL1 pathway as possible candidate genes for EARR, which might be included in an integrative predictive model of this complex phenotype. Using a stepwise multiple linear regression model, 195 patients who had undergone orthodontic treatment were assessed for clinical and genetic factors associated with %EARRmax (maximum %EARR value obtained for each patient). The four maxillary incisors and the two maxillary canines were assessed. Three functional single nucleotide polymorphisms (SNPs) were genotyped: rs1143634 in IL1B gene, rs315952 in IL1RN gene, and rs1059703 in X-linked IRAK1 gene. The model showed that four of the nine clinical variables and one SNP explained 30% of the %EARRmax variability. The most significant unique contributions to the model were gender (P = 0.001), treatment duration (P < 0.001), premolar extractions (P = 0.003), Hyrax appliance (P < 0.001), and homozygosity/hemizygosity for variant C from IRAK1 gene (P = 0.018), which proved to be a protective factor. IRAK1 polymorphism is proposed as a protective variant for EARR. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Gaysina, Darya; Xu, Man K.; Barnett, Jennifer H.; Croudace, Tim J.; Wong, Andrew; Richards, Marcus; Jones, Peter B.
2013-01-01
Genetic variation in the catechol-O-methyltransferase gene (COMT) can influence cognitive function, and this effect may depend on developmental stage. Using a large representative British birth cohort, we investigated the effect of COMT on cognitive function (verbal and non-verbal) at ages 8 and 15 years, taking into account the possible modifying effect of pubertal stage. Five functional COMT polymorphisms, rs6269, rs4818, rs4680, rs737865 and rs165599, were analysed. Associations between COMT polymorphisms and cognition were tested using regression and latent variable structural equation modelling (SEM). Before correction for multiple testing, COMT rs737865 showed association with reading comprehension, verbal ability and global cognition at age 15 years in pubescent boys only. Although there was some evidence for age- and sex-specific effects of COMT rs737865, none remained significant after correction for multiple testing. Further studies are necessary in order to draw firmer conclusions. PMID:23178897
Lazea, Cecilia; Grigorescu-Sido, Paula; Popp, Radu; Legendre, Marie; Amselem, Serge; Al-Khzouz, Camelia; Bucerzan, Simona; Creţ, Victoria; Crişan, Mirela; Brad, Cristian
2015-09-01
To establish the frequency of the c.301_302delAG mutation of the PROP1 gene in Romanian patients with multiple pituitary hormone deficiency (MPHD). Somatic assessment, hormonal tests, bone age, magnetic resonance imaging of the pituitary gland, and molecular diagnosis were performed in 26 patients with MPHD (7 patients with the familial form of MPHD and 19 patients with the sporadic form of MPHD). The c.301_302delAG mutation was detected in the homozygous state in 10 patients belonging to 5 unrelated families (7 patients with familial history of MPHD and 3 patients with the sporadic form of MPHD). Those 10 patients presented variable pituitary hormone deficiency and pituitary morphology. The c.301_302delAG homozygous genotype had a high frequency of 38% (10/26), reaching 100% (7/7) in the group with familial cases of MPHD and 16% (3/19) in the group with sporadic forms of MPHD.
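The reported frequencies follow directly from the patient counts given in the abstract. A short worked check, with the group labels chosen here purely for illustration:

```python
def percent(k, n):
    """Share of k carriers among n patients, rounded to a whole percent."""
    return round(100 * k / n)


# (homozygous carriers, patients) per group, as stated in the abstract
groups = {"all MPHD": (10, 26), "familial": (7, 7), "sporadic": (3, 19)}
freqs = {name: percent(k, n) for name, (k, n) in groups.items()}
```

The familial and sporadic counts add up to the overall figure (7 + 3 = 10 carriers among 7 + 19 = 26 patients), so the three percentages are mutually consistent.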
2011-01-01
Background Because biotechnological uses of bacteriophage gene products as alternatives to conventional antibiotics will require a thorough understanding of their genomic context, we sequenced and analyzed the genomes of four closely related phages isolated from Clostridium perfringens, an important agricultural and human pathogen. Results Phage whole-genome tetra-nucleotide signatures and proteomic tree topologies correlated closely with host phylogeny. Comparisons of our phage genomes to 26 others revealed three shared COGs; of particular interest within this core genome was an endolysin (PF01520, an N-acetylmuramoyl-L-alanine amidase) and a holin (PF04531). Comparative analyses of the evolutionary history and genomic context of these common phage proteins revealed two important results: 1) strongly significant host-specific sequence variation within the endolysin, and 2) a protein domain architecture apparently unique to our phage genomes in which the endolysin is located upstream of its associated holin. Endolysin sequences from our phages were one of two very distinct genotypes distinguished by variability within the putative enzymatically-active domain. The shared or core genome was comprised of genes with multiple sequence types belonging to five pfam families, and genes belonging to 12 pfam families, including the holin genes, which were nearly identical. Conclusions Significant genomic diversity exists even among closely-related bacteriophages. Holins and endolysins represent conserved functions across divergent phage genomes and, as we demonstrate here, endolysins can have significant variability and host-specificity even among closely-related genomes. Endolysins in our phage genomes may be subject to different selective pressures than the rest of the genome. These findings may have important implications for potential biotechnological applications of phage gene products. PMID:21631945
Mansour, Hader A; Wood, Joel; Chowdari, Kodavali V; Tumuluru, Divya; Bamne, Mikhil; Monk, Timothy H; Hall, Martica H; Buysse, Daniel J; Nimgaonkar, Vishwajit L
2017-01-01
A variable number tandem repeat polymorphism (VNTR) in the period 3 (PER3) gene has been associated with heritable sleep and circadian variables, including self-rated chronotypes, polysomnographic (PSG) variables, insomnia and circadian sleep-wake disorders. This report describes novel molecular and clinical analyses of PER3 VNTR polymorphisms to better define their functional consequences. As the PER3 VNTR is located in the exonic (protein coding) region of PER3, we initially investigated whether both alleles (variants) are transcribed into messenger RNA in human fibroblasts. The VNTR showed bi-allelic gene expression. We next investigated genetic associations in relation to clinical variables in 274 older adult Caucasian individuals. Independent variables included genotypes for the PER3 VNTR as well as a representative set of single nucleotide polymorphisms (SNPs) that tag common variants at the PER3 locus (linkage disequilibrium (LD) between genetic variants < 0.5). In order to comprehensively evaluate variables analyzed individually in prior analyses, dependent measures included PSG total sleep time and sleep latency, self-rated chronotype, estimated with the Composite Scale (CS), and lifestyle regularity, estimated using the social rhythm metric (SRM). Initially, genetic polymorphisms were individually analyzed in relation to each outcome variable using analysis of variance (ANOVA). Nominally significant associations were further tested using regression analyses that incorporated individual ANOVA-associated DNA variants as potential predictors and each of the selected sleep/circadian variables as outcomes. The covariates included age, gender, body mass index and an index of medical co-morbidity. Significant genetic associations with the VNTR were not detected with the sleep or circadian variables. 
Nominally significant associations were detected between SNP rs1012477 and CS scores (p = 0.003) and between rs10462021 and SRM (p = 0.047); rs11579477 and average delta power (p = 0.043) (analyses uncorrected for multiple comparisons). In conclusion, alleles of the VNTR are expressed at the transcript level and may have a functional effect in cells expressing the PER3 gene. PER3 polymorphisms had a modest impact on selected sleep/circadian variables in our sample, suggesting that PER3 is associated with sleep and circadian function beyond VNTR polymorphisms. Further replicate analyses in larger, independent samples are recommended.
Some Like It Hot, Some Like It Warm: Phenotyping to Explore Thermotolerance Diversity
Yeh, Ching-Hui; Kaplinsky, Nicholas J.; Hu, Catherine; Charng, Yee-yung
2012-01-01
Plants have evolved overlapping but distinct cellular responses to different aspects of high temperature stress. These responses include basal thermotolerance, short- and long-term acquired thermotolerance, and thermotolerance to moderately high temperatures. This ‘thermotolerance diversity’ means that multiple phenotypic assays are essential for fully describing the functions of genes involved in heat stress responses. A large number of genes with potential roles in heat stress responses have been identified using genetic screens and genome wide expression studies. We examine the range of phenotypic assays that have been used to characterize thermotolerance phenotypes in both Arabidopsis and crop plants. Three major variables differentiate thermotolerance assays: 1) the heat stress regime used, 2) the developmental stage of the plants being studied, and 3) the actual phenotype which is scored. Consideration of these variables will be essential for deepening our understanding of the molecular genetics of plant thermotolerance. PMID:22920995
Bayesian state space models for dynamic genetic network construction across multiple tissues.
Liang, Yulan; Kelemen, Arpad
2016-08-01
Construction of gene-gene interaction networks and potential pathways is a challenging and important problem in genomic research for complex diseases while estimating the dynamic changes of the temporal correlations and non-stationarity are the keys in this process. In this paper, we develop dynamic state space models with hierarchical Bayesian settings to tackle this challenge for inferring the dynamic profiles and genetic networks associated with disease treatments. We treat both the stochastic transition matrix and the observation matrix time-variant and include temporal correlation structures in the covariance matrix estimations in the multivariate Bayesian state space models. The unevenly spaced short time courses with unseen time points are treated as hidden state variables. Hierarchical Bayesian approaches with various prior and hyper-prior models with Monte Carlo Markov Chain and Gibbs sampling algorithms are used to estimate the model parameters and the hidden state variables. We apply the proposed Hierarchical Bayesian state space models to multiple tissues (liver, skeletal muscle, and kidney) Affymetrix time course data sets following corticosteroid (CS) drug administration. Both simulation and real data analysis results show that the genomic changes over time and gene-gene interaction in response to CS treatment can be well captured by the proposed models. The proposed dynamic Hierarchical Bayesian state space modeling approaches could be expanded and applied to other large scale genomic data, such as next generation sequence (NGS) combined with real time and time varying electronic health record (EHR) for more comprehensive and robust systematic and network based analysis in order to transform big biomedical data into predictions and diagnostics for precision medicine and personalized healthcare with better decision making and patient outcomes.
Meiotic gene-conversion rate and tract length variation in the human genome.
Padhukasahasram, Badri; Rannala, Bruce
2013-02-27
Meiotic recombination occurs in the form of two different mechanisms called crossing-over and gene-conversion and both processes have an important role in shaping genetic variation in populations. Although variation in crossing-over rates has been studied extensively using sperm-typing experiments, pedigree studies and population genetic approaches, our knowledge of variation in gene-conversion parameters (i.e., rates and mean tract lengths) remains far from complete. To explore variability in population gene-conversion rates and its relationship to crossing-over rate variation patterns, we have developed and validated using coalescent simulations a comprehensive Bayesian full-likelihood method that can jointly infer crossing-over and gene-conversion rates as well as tract lengths from population genomic data under general variable rate models with recombination hotspots. Here, we apply this new method to SNP data from multiple human populations and attempt to characterize for the first time the fine-scale variation in gene-conversion parameters along the human genome. We find that the estimated ratio of gene-conversion to crossing-over rates varies considerably across genomic regions as well as between populations. However, there is a great degree of uncertainty associated with such estimates. We also find substantial evidence for variation in the mean conversion tract length. The estimated tract lengths did not show any negative relationship with the local heterozygosity levels in our analysis. European Journal of Human Genetics advance online publication, 27 February 2013; doi:10.1038/ejhg.2013.30.
Schaid, Daniel J; Sinnwell, Jason P; Jenkins, Gregory D; McDonnell, Shannon K; Ingle, James N; Kubo, Michiaki; Goss, Paul E; Costantino, Joseph P; Wickerham, D Lawrence; Weinshilboum, Richard M
2012-01-01
Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc "fixes." To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted P-values for each gene-set. Application of our methods to two different GWAS provides guidance on the potential strengths and weaknesses of our proposed gene-set analyses. © 2011 Wiley Periodicals, Inc.
FLO1 is a variable green beard gene that drives biofilm-like cooperation in budding yeast
Smukalla, Scott; Caldara, Marina; Pochet, Nathalie; Beauvais, Anne; Guadagnini, Stephanie; Yan, Chen; Vinces, Marcelo D.; Jansen, An; Prevost, Marie Christine; Latgé, Jean-Paul; Fink, Gerald R.; Foster, Kevin R.; Verstrepen, Kevin J.
2008-01-01
The budding yeast, Saccharomyces cerevisiae, has emerged as an archetype of eukaryotic cell biology. Here we show that S. cerevisiae is also a model for the evolution of cooperative behavior by revisiting flocculation, a self-adherence phenotype lacking in most laboratory strains. Expression of the gene FLO1 in the laboratory strain S288C restores flocculation, an altered physiological state, reminiscent of bacterial biofilms. Flocculation protects the FLO1-expressing cells from multiple stresses, including antimicrobials and ethanol. Furthermore, FLO1+ cells avoid exploitation by non-expressing flo1 cells by self/non-self recognition: FLO1+ cells preferentially stick to one another, regardless of genetic relatedness across the rest of the genome. Flocculation, therefore, is driven by one of a few known “green beard genes”, which direct cooperation towards other carriers of the same gene. Moreover, FLO1 is highly variable among strains both in expression and in sequence, suggesting that flocculation in S. cerevisiae is a dynamic, rapidly-evolving social trait. PMID:19013280
DiPrete, Thomas A.; Burik, Casper A. P.; Koellinger, Philipp D.
2018-01-01
Identifying causal effects in nonexperimental data is an enduring challenge. One proposed solution that recently gained popularity is the idea to use genes as instrumental variables [i.e., Mendelian randomization (MR)]. However, this approach is problematic because many variables of interest are genetically correlated, which implies the possibility that many genes could affect both the exposure and the outcome directly or via unobserved confounding factors. Thus, pleiotropic effects of genes are themselves a source of bias in nonexperimental data that would also undermine the ability of MR to correct for endogeneity bias from nongenetic sources. Here, we propose an alternative approach, genetic instrumental variable (GIV) regression, that provides estimates for the effect of an exposure on an outcome in the presence of pleiotropy. As a valuable byproduct, GIV regression also provides accurate estimates of the chip heritability of the outcome variable. GIV regression uses polygenic scores (PGSs) for the outcome of interest which can be constructed from genome-wide association study (GWAS) results. By splitting the GWAS sample for the outcome into nonoverlapping subsamples, we obtain multiple indicators of the outcome PGSs that can be used as instruments for each other and, in combination with other methods such as sibling fixed effects, can address endogeneity bias from both pleiotropy and the environment. In two empirical applications, we demonstrate that our approach produces reasonable estimates of the chip heritability of educational attainment (EA) and show that standard regression and MR provide upwardly biased estimates of the effect of body height on EA. PMID:29686100
DiPrete, Thomas A; Burik, Casper A P; Koellinger, Philipp D
2018-05-29
Identifying causal effects in nonexperimental data is an enduring challenge. One proposed solution that recently gained popularity is the idea to use genes as instrumental variables [i.e., Mendelian randomization (MR)]. However, this approach is problematic because many variables of interest are genetically correlated, which implies the possibility that many genes could affect both the exposure and the outcome directly or via unobserved confounding factors. Thus, pleiotropic effects of genes are themselves a source of bias in nonexperimental data that would also undermine the ability of MR to correct for endogeneity bias from nongenetic sources. Here, we propose an alternative approach, genetic instrumental variable (GIV) regression, that provides estimates for the effect of an exposure on an outcome in the presence of pleiotropy. As a valuable byproduct, GIV regression also provides accurate estimates of the chip heritability of the outcome variable. GIV regression uses polygenic scores (PGSs) for the outcome of interest which can be constructed from genome-wide association study (GWAS) results. By splitting the GWAS sample for the outcome into nonoverlapping subsamples, we obtain multiple indicators of the outcome PGSs that can be used as instruments for each other and, in combination with other methods such as sibling fixed effects, can address endogeneity bias from both pleiotropy and the environment. In two empirical applications, we demonstrate that our approach produces reasonable estimates of the chip heritability of educational attainment (EA) and show that standard regression and MR provide upwardly biased estimates of the effect of body height on EA. Copyright © 2018 the Author(s). Published by PNAS.
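The idea in the abstract above of using multiple indicators of a score "as instruments for each other" can be illustrated with a small simulation. The sketch below is not the authors' GIV regression and does not model pleiotropy; it is a minimal example, under assumed parameters, of how two independently noisy indicators of the same latent score let an instrumental-variable ratio undo the attenuation that a naive regression on one noisy indicator suffers. The function name, sample size, effect size, and noise levels are all hypothetical choices made for illustration.

```python
import numpy as np

def simulate_split_sample_iv(n=200_000, beta=1.0, noise_sd=1.0, seed=0):
    """Compare a naive slope with an IV slope when the predictor is a noisy score.

    All parameter values are illustrative assumptions, not taken from the paper.
    """
    rng = np.random.default_rng(seed)
    g = rng.normal(size=n)                        # latent "true" score
    s1 = g + rng.normal(scale=noise_sd, size=n)   # noisy indicator from subsample 1
    s2 = g + rng.normal(scale=noise_sd, size=n)   # noisy indicator from subsample 2
    y = beta * g + rng.normal(size=n)             # outcome driven by the latent score

    # Naive regression of y on s1: attenuated toward zero by the noise in s1.
    ols = np.cov(s1, y)[0, 1] / np.var(s1, ddof=1)
    # IV (Wald ratio) estimate: s2 instruments s1; their noise terms are independent.
    iv = np.cov(s2, y)[0, 1] / np.cov(s2, s1)[0, 1]
    return ols, iv

ols_slope, iv_slope = simulate_split_sample_iv()
print(f"naive slope: {ols_slope:.3f}, IV slope: {iv_slope:.3f}")
```

With unit variance for both the latent score and the indicator noise, the naive slope is expected to shrink toward roughly half of the true effect, while the IV ratio stays close to it, because the covariance between the two indicators isolates the shared latent component.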
Wang, Juan-Juan; Cai, Qing; Qiu, Lei; Ying, Sheng-Hua; Feng, Ming-Guang
2017-05-01
Intracellular trehalose accumulation is relevant to fungal life and pathogenicity. Trehalose-6-phosphate synthase (TPS) is known to control the first step of trehalose synthesis, but functions of multiple TPS genes in some filamentous fungi are variable. Here, we examined the functions of two TPS genes (tpsA and tpsB) in Beauveria bassiana, a fungal insect pathogen widely applied in arthropod pest control. Intracellular TPS activity and trehalose content decreased by 71-75 and 72-80% in ΔtpsA, and 21-30 and 15-45% in ΔtpsB, respectively, and to undetectable levels in ΔtpsAΔtpsB, under normal and stressful conditions. The three mutants lost 33, 50, and 98% of conidiation capacity in standard cultures. Conidial quality indicated by viability, density, intracellular trehalose content, cell wall integrity, and hydrophobicity was more impaired in ΔtpsA than in ΔtpsB and most impaired in ΔtpsAΔtpsB, which was also most sensitive to nutritional, chemical, and environmental stresses and least virulent to Galleria mellonella larvae. Almost all phenotypic defects in ΔtpsAΔtpsB approached the sums of those observed in ΔtpsA and ΔtpsB and were restored by targeted gene complementation. Altogether, TpsA and TpsB play complementary roles in sustaining trehalose synthesis, conidiation capacity, conidial quality, multiple stress tolerance, and virulence, highlighting the significance of both for fungal adaptation to environment and host.
Overexpression of the Cytokine BAFF and Autoimmunity Risk.
Steri, Maristella; Orrù, Valeria; Idda, M Laura; Pitzalis, Maristella; Pala, Mauro; Zara, Ilenia; Sidore, Carlo; Faà, Valeria; Floris, Matteo; Deiana, Manila; Asunis, Isadora; Porcu, Eleonora; Mulas, Antonella; Piras, Maria G; Lobina, Monia; Lai, Sandra; Marongiu, Mara; Serra, Valentina; Marongiu, Michele; Sole, Gabriella; Busonero, Fabio; Maschio, Andrea; Cusano, Roberto; Cuccuru, Gianmauro; Deidda, Francesca; Poddie, Fausto; Farina, Gabriele; Dei, Mariano; Virdis, Francesca; Olla, Stefania; Satta, Maria A; Pani, Mario; Delitala, Alessandro; Cocco, Eleonora; Frau, Jessica; Coghe, Giancarlo; Lorefice, Lorena; Fenu, Giuseppe; Ferrigno, Paola; Ban, Maria; Barizzone, Nadia; Leone, Maurizio; Guerini, Franca R; Piga, Matteo; Firinu, Davide; Kockum, Ingrid; Lima Bomfim, Izaura; Olsson, Tomas; Alfredsson, Lars; Suarez, Ana; Carreira, Patricia E; Castillo-Palma, Maria J; Marcus, Joseph H; Congia, Mauro; Angius, Andrea; Melis, Maurizio; Gonzalez, Antonio; Alarcón Riquelme, Marta E; da Silva, Berta M; Marchini, Maurizio; Danieli, Maria G; Del Giacco, Stefano; Mathieu, Alessandro; Pani, Antonello; Montgomery, Stephen B; Rosati, Giulio; Hillert, Jan; Sawcer, Stephen; D'Alfonso, Sandra; Todd, John A; Novembre, John; Abecasis, Gonçalo R; Whalen, Michael B; Marrosu, Maria G; Meloni, Alessandra; Sanna, Serena; Gorospe, Myriam; Schlessinger, David; Fiorillo, Edoardo; Zoledziewska, Magdalena; Cucca, Francesco
2017-04-27
Genomewide association studies of autoimmune diseases have mapped hundreds of susceptibility regions in the genome. However, only for a few association signals has the causal gene been identified, and for even fewer have the causal variant and underlying mechanism been defined. Coincident associations of DNA variants affecting both the risk of autoimmune disease and quantitative immune variables provide an informative route to explore disease mechanisms and drug-targetable pathways. Using case-control samples from Sardinia, Italy, we performed a genomewide association study in multiple sclerosis followed by TNFSF13B locus-specific association testing in systemic lupus erythematosus (SLE). Extensive phenotyping of quantitative immune variables, sequence-based fine mapping, cross-population and cross-phenotype analyses, and gene-expression studies were used to identify the causal variant and elucidate its mechanism of action. Signatures of positive selection were also investigated. A variant in TNFSF13B, encoding the cytokine and drug target B-cell activating factor (BAFF), was associated with multiple sclerosis as well as SLE. The disease-risk allele was also associated with up-regulated humoral immunity through increased levels of soluble BAFF, B lymphocytes, and immunoglobulins. The causal variant was identified: an insertion-deletion variant, GCTGT→A (in which A is the risk allele), yielded a shorter transcript that escaped microRNA inhibition and increased production of soluble BAFF, which in turn up-regulated humoral immunity. Population genetic signatures indicated that this autoimmunity variant has been evolutionarily advantageous, most likely by augmenting resistance to malaria. A TNFSF13B variant was associated with multiple sclerosis and SLE, and its effects were clarified at the population, cellular, and molecular levels. (Funded by the Italian Foundation for Multiple Sclerosis and others.).
Oncogenes and tumor suppressors in the molecular pathogenesis of acute promyelocytic leukemia.
Pandolfi, P P
2001-04-01
Acute promyelocytic leukemia (APL) is associated with reciprocal chromosomal translocations always involving the retinoic acid receptor alpha (RARalpha) gene on chromosome 17 and variable partner genes (X genes) on distinct chromosomes. RARalpha fuses to the PML gene in the vast majority of APL cases, and in a few cases to the PLZF, NPM, NuMA and Stat5b genes, respectively, leading to the generation of RARalpha-X and X-RARalpha fusion genes. Both fusion proteins can exert oncogenic functions through their ability to interfere with the activities of X and RARalpha proteins. Here, it will be discussed in detail how an extensive biochemical analysis as well as a systematic in vivo genetic approach in the mouse has allowed the definition of the multiple oncogenic activities of PML-RARalpha, and how it has become apparent that this oncoprotein is able to impair RARalpha at the transcription level and the tumor suppressive function of the PML protein.
Skeletal muscle repair in a mouse model of nemaline myopathy
Sanoudou, Despina; Corbett, Mark A.; Han, Mei; Ghoddusi, Majid; Nguyen, Mai-Anh T.; Vlahovich, Nicole; Hardeman, Edna C.; Beggs, Alan H.
2012-01-01
Nemaline myopathy (NM), the most common non-dystrophic congenital myopathy, is a variably severe neuromuscular disorder for which no effective treatment is available. Although a number of genes have been identified in which mutations can cause NM, the pathogenetic mechanisms leading to the phenotypes are poorly understood. To address this question, we examined gene expression patterns in an NM mouse model carrying the human Met9Arg mutation of alpha-tropomyosin slow (Tpm3). We assessed five different skeletal muscles from affected mice, which are representative of muscles with differing fiber-type compositions, different physiological specializations and variable degrees of pathology. Although these same muscles in non-affected mice showed marked variation in patterns of gene expression, with diaphragm being the most dissimilar, the presence of the mutant protein in nemaline muscles resulted in a more similar pattern of gene expression among the muscles. This result suggests a common process or mechanism operating in nemaline muscles independent of the variable degrees of pathology. Transcriptional and protein expression data indicate the presence of a repair process and possibly delayed maturation in nemaline muscles. Markers indicative of satellite cell number, activated satellite cells and immature fibers including M-Cadherin, MyoD, desmin, Pax7 and Myf6 were elevated by western-blot analysis or immunohistochemistry. Evidence suggesting elevated focal repair was observed in nemaline muscle in electron micrographs. This analysis reveals that NM is characterized by a novel repair feature operating in multiple different muscles. PMID:16877500
Skeletal muscle repair in a mouse model of nemaline myopathy.
Sanoudou, Despina; Corbett, Mark A; Han, Mei; Ghoddusi, Majid; Nguyen, Mai-Anh T; Vlahovich, Nicole; Hardeman, Edna C; Beggs, Alan H
2006-09-01
Nemaline myopathy (NM), the most common non-dystrophic congenital myopathy, is a variably severe neuromuscular disorder for which no effective treatment is available. Although a number of genes have been identified in which mutations can cause NM, the pathogenetic mechanisms leading to the phenotypes are poorly understood. To address this question, we examined gene expression patterns in an NM mouse model carrying the human Met9Arg mutation of alpha-tropomyosin slow (Tpm3). We assessed five different skeletal muscles from affected mice, which are representative of muscles with differing fiber-type compositions, different physiological specializations and variable degrees of pathology. Although these same muscles in non-affected mice showed marked variation in patterns of gene expression, with diaphragm being the most dissimilar, the presence of the mutant protein in nemaline muscles resulted in a more similar pattern of gene expression among the muscles. This result suggests a common process or mechanism operating in nemaline muscles independent of the variable degrees of pathology. Transcriptional and protein expression data indicate the presence of a repair process and possibly delayed maturation in nemaline muscles. Markers indicative of satellite cell number, activated satellite cells and immature fibers including M-Cadherin, MyoD, desmin, Pax7 and Myf6 were elevated by western-blot analysis or immunohistochemistry. Evidence suggesting elevated focal repair was observed in nemaline muscle in electron micrographs. This analysis reveals that NM is characterized by a novel repair feature operating in multiple different muscles.
Dose response relationship in anti-stress gene regulatory networks.
Zhang, Qiang; Andersen, Melvin E
2007-03-02
To maintain a stable intracellular environment, cells utilize complex and specialized defense systems against a variety of external perturbations, such as electrophilic stress, heat shock, and hypoxia. Irrespective of the type of stress, many adaptive mechanisms contributing to cellular homeostasis appear to operate through gene regulatory networks that are organized into negative feedback loops. In general, the degree of deviation of the controlled variables, such as electrophiles, misfolded proteins, and O2, is first detected by specialized sensor molecules, then the signal is transduced to specific transcription factors. Transcription factors can regulate the expression of a suite of anti-stress genes, many of which encode enzymes functioning to counteract the perturbed variables. The objective of this study was to explore, using control theory and computational approaches, the theoretical basis that underlies the steady-state dose response relationship between cellular stressors and intracellular biochemical species (controlled variables, transcription factors, and gene products) in these gene regulatory networks. Our work indicated that the shape of dose response curves (linear, superlinear, or sublinear) depends on changes in the specific values of local response coefficients (gains) distributed in the feedback loop. Multimerization of anti-stress enzymes and transcription factors into homodimers, homotrimers, or even higher-order multimers plays a significant role in maintaining robust homeostasis. Moreover, our simulation noted that dose response curves for the controlled variables can transition sequentially through four distinct phases as stressor level increases: initial superlinear with lesser control, superlinear more highly controlled, linear uncontrolled, and sublinear catastrophic. Each phase relies on specific gain-changing events that come into play as stressor level increases.
The low-dose region is intrinsically nonlinear, and depending on the level of local gains, presence of gain-changing events, and degree of feedforward gene activation, this region can appear as superlinear, sublinear, or even J-shaped. The general dose response transition proposed here was further examined in a complex anti-electrophilic stress pathway, which involves multiple genes, enzymes, and metabolic reactions. This work would help biologists and especially toxicologists to better assess and predict the cellular impact brought about by biological stressors.
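The negative-feedback picture described in this abstract can be made concrete with a toy steady-state calculation. The sketch below is not the authors' model: it assumes a single controlled variable Y produced at a rate equal to the stressor level S and removed at rate k·G·Y, where the anti-stress gene product G is induced by Y through a Hill function. The function names and every parameter value (k, g0, gmax, K, n) are hypothetical and chosen only for illustration.

```python
def induced_enzyme(y, g0=1.0, gmax=10.0, K=1.0, n=2):
    """Hypothetical anti-stress enzyme level: basal g0 plus Hill-type induction by y."""
    return g0 + gmax * y**n / (K**n + y**n)

def steady_state(stressor, k=1.0, g0=1.0, gmax=10.0, K=1.0, n=2):
    """Solve stressor = k * G(Y) * Y for Y by bisection.

    k * G(Y) * Y increases monotonically in Y, and G(Y) >= g0, so the root
    lies between 0 and the open-loop value stressor / (k * g0).
    """
    lo, hi = 0.0, stressor / (k * g0)
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if k * induced_enzyme(mid, g0, gmax, K, n) * mid < stressor:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Dose response of the controlled variable with and without feedback induction.
for s in (0.1, 1.0, 10.0, 100.0):
    closed = steady_state(s)
    open_loop = s / (1.0 * 1.0)  # no induction: enzyme stays at basal level g0
    print(f"S={s:>6}: Y with feedback = {closed:.4f}, without = {open_loop:.4f}")
```

In this toy setting the feedback loop always holds the controlled variable below its open-loop value, and the gap widens where induction is steep and narrows again once the enzyme saturates, which is one simple way a single loop can produce different curve shapes across dose ranges.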
Chattaway, Marie Anne; Day, Michaela; Mtwale, Julia; White, Emma; Rogers, James; Day, Martin; Powell, David; Ahmad, Marwa; Harris, Ross; Talukder, Kaisar Ali; Wain, John; Jenkins, Claire; Cravioto, Alejandro
2017-10-01
This study investigates the virulence and antimicrobial resistance in association with common clonal complexes (CCs) of enteroaggregative Escherichia coli (EAEC) isolated from Bangladesh. The aim was to determine whether specific CCs were more likely to be associated with putative virulence genes and/or antimicrobial resistance. The presence of 15 virulence genes (by PCR) and susceptibility to 18 antibiotics were determined for 151 EAEC isolated from cases and controls during an intestinal infectious disease study carried out between 2007 and 2011 in the rural setting of Mirzapur, Bangladesh (Kotloff KL, Blackwelder WC, Nasrin D, Nataro JP, Farag TH et al. Clin Infect Dis 2012;55:S232-S245). These data were then analysed in the context of previously determined serotypes and clonal complexes defined by multi-locus sequence typing. Overall there was no association between the presence of virulence or antimicrobial resistance genes in isolates of EAEC from cases versus controls. However, when stratified by clonal complex (CC) one CC associated with cases harboured more virulence factors (CC40) and one CC harboured more resistance genes (CC38) than the average. There was no direct link between the virulence gene content and antibiotic resistance. Strains within a single CC had variable virulence and resistance gene content indicating independent and multiple gene acquisitions over time. In Bangladesh, there are multiple clonal complexes of EAEC harbouring a variety of virulence and resistance genes. The emergence of two of the most successful clones appeared to be linked to either increased virulence (CC40) or antimicrobial resistance (CC38), but increased resistance and virulence were not found in the same clonal complexes.
Evolutionary origins of a novel host plant detoxification gene in butterflies.
Fischer, Hanna M; Wheat, Christopher W; Heckel, David G; Vogel, Heiko
2008-05-01
Chemical interactions between plants and their insect herbivores provide an excellent opportunity to study the evolution of species interactions on a molecular level. Here, we investigate the molecular evolutionary events that gave rise to a novel detoxifying enzyme (nitrile-specifier protein [NSP]) in the butterfly family Pieridae, previously identified as a coevolutionary key innovation. By generating and sequencing expressed sequence tags, genomic libraries, and screening databases we found NSP to be a member of an insect-specific gene family, which we characterized and named the NSP-like gene family. Members consist of variable tandem repeats, are gut expressed, and are found across Insecta evolving in a dynamic, ongoing birth-death process. In the Lepidoptera, multiple copies of single-domain major allergen genes are present and originate via tandem duplications. Multiple domain genes are found solely within the brassicaceous-feeding Pieridae butterflies, one of them being NSP and another called major allergen (MA). Analyses suggest that NSP and its paralog MA have a unique single-domain evolutionary origin, being formed by intragenic domain duplication followed by tandem whole-gene duplication. Duplicates subsequently experienced a period of relaxed constraint followed by an increase in constraint, perhaps after neofunctionalization. NSP and its ortholog MA are still experiencing high rates of change, reflecting a dynamic evolution consistent with the known role of NSP in plant-insect interactions. Our results provide direct evidence to the hypothesis that gene duplication is one of the driving forces for speciation and adaptation, showing that both within- and whole-gene tandem duplications are a powerful force underlying evolutionary adaptation.
Rajkumar, A P; Poonkuzhali, B; Kuruvilla, A; Srivastava, A; Jacob, M; Jacob, K S
2012-12-01
Pharmacogenetics of schizophrenia has not yet delivered anticipated clinical dividends. Clinical heterogeneity of schizophrenia contributes to the poor replication of the findings of pharmacogenetic association studies. Functionally important HTR3A gene single-nucleotide polymorphisms (SNPs) were reported to be associated with response to clozapine. The aim of this study was to investigate how the association between HTR3A gene SNP and response to clozapine is influenced by various clinical predictors and by differing outcome definitions in patients with treatment-resistant schizophrenia (TRS). We recruited 101 consecutive patients with TRS, on stable doses of clozapine, and evaluated their HTR3A gene SNP (rs1062613 and rs2276302), psychopathology, and serum clozapine levels. We assessed their socio-demographic and clinical profiles, premorbid adjustment, traumatic events, cognition, and disability using standard assessment schedules. We evaluated their response to clozapine, by employing six differing outcome definitions. We employed appropriate multivariate statistics to calculate allelic and genotypic association, accounting for the effects of various clinical variables. T allele of rs1062613 and G allele of rs2276302 were significantly associated with good clinical response to clozapine (p = 0.02). However, varying outcome definitions make these associations inconsistent. rs1062613 and rs2276302 could explain only 13.8 % variability in the responses to clozapine, while combined clinical predictors and HTR3A pharmacogenetic association model could explain 38 % variability. We demonstrated that the results of pharmacogenetic studies in schizophrenia depend heavily on their outcome definitions and that combined clinical and pharmacogenetic models have better predictive values. Future pharmacogenetic studies should employ multiple outcome definitions and should evaluate associated clinical variables.
Chau, John H; Rahfeldt, Wolfgang A; Olmstead, Richard G
2018-03-01
Targeted sequence capture can be used to efficiently gather sequence data for large numbers of loci, such as single-copy nuclear loci. Most published studies in plants have used taxon-specific locus sets developed individually for a clade using multiple genomic and transcriptomic resources. General locus sets can also be developed from loci that have been identified as single-copy and have orthologs in large clades of plants. We identify and compare a taxon-specific locus set and three general locus sets (conserved ortholog set [COSII], shared single-copy nuclear [APVO SSC] genes, and pentatricopeptide repeat [PPR] genes) for targeted sequence capture in Buddleja (Scrophulariaceae) and outgroups. We evaluate their performance in terms of assembly success, sequence variability, and resolution and support of inferred phylogenetic trees. The taxon-specific locus set had the most target loci. Assembly success was high for all locus sets in Buddleja samples. For outgroups, general locus sets had greater assembly success. Taxon-specific and PPR loci had the highest average variability. The taxon-specific data set produced the best-supported tree, but all data sets showed improved resolution over previous non-sequence capture data sets. General locus sets can be a useful source of sequence capture targets, especially if multiple genomic resources are not available for a taxon.
Pathway-Based Kernel Boosting for the Analysis of Genome-Wide Association Studies
Manitz, Juliane; Burger, Patricia; Amos, Christopher I.; Chang-Claude, Jenny; Wichmann, Heinz-Erich; Kneib, Thomas; Bickeböller, Heike
2017-01-01
The analysis of genome-wide association studies (GWAS) benefits from the investigation of biologically meaningful gene sets, such as gene-interaction networks (pathways). We propose an extension to a successful kernel-based pathway analysis approach by integrating kernel functions into a powerful algorithmic framework for variable selection, to enable investigation of multiple pathways simultaneously. We employ genetic similarity kernels from the logistic kernel machine test (LKMT) as base-learners in a boosting algorithm. A model to explain case-control status is created iteratively by selecting pathways that improve its prediction ability. We evaluated our method in simulation studies adopting 50 pathways for different sample sizes and genetic effect strengths. Additionally, we included an exemplary application of kernel boosting to a rheumatoid arthritis and a lung cancer dataset. Simulations indicate that kernel boosting outperforms the LKMT in certain genetic scenarios. Applications to GWAS data on rheumatoid arthritis and lung cancer resulted in sparse models which were based on pathways interpretable in a clinical sense. Kernel boosting is highly flexible in terms of considered variables and overcomes the problem of multiple testing. Additionally, it enables the prediction of clinical outcomes. Thus, kernel boosting constitutes a new, powerful tool in the analysis of GWAS data and towards the understanding of biological processes involved in disease susceptibility. PMID:28785300
Pathway-Based Kernel Boosting for the Analysis of Genome-Wide Association Studies.
Friedrichs, Stefanie; Manitz, Juliane; Burger, Patricia; Amos, Christopher I; Risch, Angela; Chang-Claude, Jenny; Wichmann, Heinz-Erich; Kneib, Thomas; Bickeböller, Heike; Hofner, Benjamin
2017-01-01
The analysis of genome-wide association studies (GWAS) benefits from the investigation of biologically meaningful gene sets, such as gene-interaction networks (pathways). We propose an extension to a successful kernel-based pathway analysis approach by integrating kernel functions into a powerful algorithmic framework for variable selection, to enable investigation of multiple pathways simultaneously. We employ genetic similarity kernels from the logistic kernel machine test (LKMT) as base-learners in a boosting algorithm. A model to explain case-control status is created iteratively by selecting pathways that improve its prediction ability. We evaluated our method in simulation studies adopting 50 pathways for different sample sizes and genetic effect strengths. Additionally, we included an exemplary application of kernel boosting to a rheumatoid arthritis and a lung cancer dataset. Simulations indicate that kernel boosting outperforms the LKMT in certain genetic scenarios. Applications to GWAS data on rheumatoid arthritis and lung cancer resulted in sparse models which were based on pathways interpretable in a clinical sense. Kernel boosting is highly flexible in terms of considered variables and overcomes the problem of multiple testing. Additionally, it enables the prediction of clinical outcomes. Thus, kernel boosting constitutes a new, powerful tool in the analysis of GWAS data and towards the understanding of biological processes involved in disease susceptibility.
Reed, Jessica L; D'Ambrosio, Enrico; Marenco, Stefano; Ursini, Gianluca; Zheutlin, Amanda B; Blasi, Giuseppe; Spencer, Barbara E; Romano, Raffaella; Hochheiser, Jesse; Reifman, Ann; Sturm, Justin; Berman, Karen F; Bertolino, Alessandro; Weinberger, Daniel R; Callicott, Joseph H
2018-01-01
Brain phenotypes showing environmental influence may help clarify unexplained associations between urban exposure and psychiatric risk. Heritable prefrontal fMRI activation during working memory (WM) is such a phenotype. We hypothesized that urban upbringing (childhood urbanicity) would alter this phenotype and interact with dopamine genes that regulate prefrontal function during WM. Further, dopamine has been hypothesized to mediate urban-associated factors like social stress. WM-related prefrontal function was tested for main effects of urbanicity, main effects of three dopamine genes, namely catechol-O-methyltransferase (COMT), dopamine receptor D1 (DRD1), and dopamine receptor D2 (DRD2), and, importantly, dopamine gene-by-urbanicity interactions. For COMT, three independent human samples were recruited (total n = 487). We also studied 253 subjects genotyped for DRD1 and DRD2. 3T fMRI activation during the N-back WM task was the dependent variable, while childhood urbanicity, dopamine genotype, and urbanicity-dopamine interactions were independent variables. Main effects of dopamine genes and of urbanicity were found. Individuals raised in an urban environment showed altered prefrontal activation relative to those raised in rural or town settings. For each gene, dopamine genotype-by-urbanicity interactions were shown in prefrontal cortex; the COMT interaction was replicated in two independent samples. An urban childhood upbringing altered prefrontal function and interacted with each gene to alter genotype-phenotype relationships. Gene-environment interactions between multiple dopamine genes and urban upbringing suggest that neural effects of developmental environmental exposure could mediate, at least partially, increased risk for psychiatric illness in urban environments via dopamine genes expressed into adulthood.
Ozcan, Gozde; Balta, Burhan; Sekerci, Ahmet Ercan; Etoz, Osman A; Martinuzzi, Claudia; Kara, Ozlem; Pastorino, Lorenza; Kocoglu, Fatma; Ulker, Omer; Erdogan, Murat
2016-01-01
Gorlin-Goltz syndrome (GGS) is an uncommon autosomal dominant inherited disorder which comprises the triad of basal cell carcinomas (BCCs), odontogenic keratocysts, and musculoskeletal malformations. Besides this triad, neurological, ophthalmic, endocrine, and genital manifestations are known to be variable. It is occasionally associated with aggressive BCC and internal malignancies. This report documents a case of GGS with a novel mutation in the PTCH1 gene in an 11-year-old child. The clinical, radiographic, histopathologic and molecular findings of this condition, and treatment are described, and a review of GGS was carried out.
Qin, Y; Duquette, P; Zhang, Y; Talbot, P; Poole, R; Antel, J
1998-01-01
The cerebrospinal fluid (CSF) of multiple sclerosis (MS) patients is characterized by increased concentrations of immunoglobulin (Ig), which on electrophoretic analysis shows restricted heterogeneity (oligoclonal bands). CSF Ig is composed of both serum and intrathecally produced components. To examine the properties of intrathecal antibody-producing B cells, we analyzed Ig heavy-chain variable (V(H)) region genes of B cells recovered from the CSF of 12 MS patients and 15 patients with other neurological diseases (OND). Using a PCR technique, we could detect rearrangements of Ig V(H) genes in all samples. Sequence analysis of complementarity-determining region 3 (CDR3) of rearranged VDJ genes revealed expansion of a dominant clone or clones in 10 of the 12 MS patients. B cell clonal expansion was identified in 3 of 15 OND. The nucleotide sequences of V(H) genes from clonally expanded CSF B cells in MS patients demonstrated the preferential usage of the V(H) IV family. There were numerous somatic mutations, mainly in the CDRs, with a high replacement-to-silent ratio; the mutations were distributed in a way suggesting that these B cells had been positively selected through their antigen receptor. Our results demonstrate that in MS CSF, there is a high frequency of clonally expanded B cells that have properties of postgerminal center memory or antibody-forming lymphocytes. PMID:9727074
Musani, Vesna; Ozretić, Petar; Trnski, Diana; Sabol, Maja; Poduje, Sanja; Tošić, Mateja; Šitum, Mirna; Levanat, Sonja
2018-02-28
We describe a case of twins with sporadic Gorlin syndrome. Both twins had common Gorlin syndrome features including calcification of the falx cerebri, multiple jaw keratocysts, and multiple basal cell carcinomas, but with different expressivity. One brother also had benign testicular mesothelioma. We propose this tumor type as a possible new feature of Gorlin syndrome. Gorlin syndrome is a rare autosomal dominant disorder characterized by both developmental abnormalities and cancer predisposition, with variable expression of various developmental abnormalities and different types of tumors. The syndrome is primarily caused by mutations in the Patched 1 (PTCH1) gene, although rare mutations of Patched 2 (PTCH2) or Suppressor of Fused (SUFU) genes have also been found. Neither founder mutations nor hot spot locations have been described for PTCH1 in Gorlin syndrome patients. Although de novo mutations of the PTCH1 gene occur in almost 50% of Gorlin syndrome cases, there are a few recurrent mutations. Our twin patients were carriers of a de novo mutation in the PTCH1 gene, c.3364_3365delAT (p.Met1122ValfsX22). This is, to our knowledge, the first Gorlin syndrome-causing mutation that has been reported four independent times in distant geographical locations. Therefore, we propose the location of the described mutation as a potential hot spot for mutations in PTCH1.
Sung, Yun Ju; Di, Yanming; Fu, Audrey Q; Rothstein, Joseph H; Sieh, Weiva; Tong, Liping; Thompson, Elizabeth A; Wijsman, Ellen M
2007-01-01
We performed multipoint linkage analyses with multiple programs and models for several gene expression traits in the Centre d'Etude du Polymorphisme Humain families. All analyses provided consistent results for both peak location and shape. Variance-components (VC) analysis gave wider peaks and Bayes factors gave fewer peaks. Among programs from the MORGAN package, lm_multiple performed better than lm_markers, resulting in less Markov-chain Monte Carlo (MCMC) variability between runs, and the program lm_twoqtl provided higher LOD scores by also including either a polygenic component or an additional quantitative trait locus. PMID:18466597
Conti, Sara; Condò, Maria; Posar, Annio; Mari, Francesca; Resta, Nicoletta; Renieri, Alessandra; Neri, Iria; Patrizi, Annalisa; Parmeggiani, Antonia
2012-03-01
Phosphatase and tensin homolog (PTEN) gene mutations are associated with a spectrum of clinical disorders characterized by skin lesions, macrocephaly, hamartomatous overgrowth of tissues, and an increased risk of cancers. Autism has rarely been described in association with these variable clinical features. At present, 24 patients with phosphatase and tensin homolog gene mutation, autism, macrocephaly, and some clinical findings described in phosphatase and tensin homolog syndromes have been reported in the literature. We describe a 14-year-old boy with autistic disorder, focal epilepsy, severe and progressive macrocephaly, and multiple papular skin lesions and palmoplantar punctate keratoses, characteristic of Cowden syndrome. The boy has a de novo phosphatase and tensin homolog gene mutation. Our patient is the first case described to present a typical Cowden syndrome and autism associated with epilepsy.
Medical Sequencing at the extremes of Human Body Mass
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ahituv, Nadav; Kavaslar, Nihan; Schackwitz, Wendy
2006-09-01
Body weight is a quantitative trait with significant heritability in humans. To identify potential genetic contributors to this phenotype, we resequenced the coding exons and splice junctions of 58 genes in 379 obese and 378 lean individuals. Our 96 Mb survey included 21 genes associated with monogenic forms of obesity in humans or mice, as well as 37 genes that function in body weight-related pathways. We found that the monogenic obesity-associated gene group was enriched for rare nonsynonymous variants unique to the obese (n=46) versus lean (n=26) populations. Computational analysis further predicted a significantly greater fraction of deleterious variants within the obese cohort. Consistent with the complex inheritance of body weight, we did not observe obvious familial segregation in the majority of the 28 available kindreds. Taken together, these data suggest that multiple rare alleles with variable penetrance contribute to obesity in the population and provide a deep medical sequencing based approach to detect them.
Independence of heritable influences on the food intake of free-living humans.
de Castro, John M
2002-01-01
The time of day of meal ingestion, the number of people present at the meal, the subjective state of hunger, and the estimated before-meal contents in the stomach have been established as influences on the amount eaten in a meal and these influences have been shown to be heritable. Because these factors intercorrelate, the calculated heritabilities for some of these variables might result indirectly from their covariation with one of the other heritable variables. The independence of the heritability of the influence of these four factors was investigated with 110 identical and 102 fraternal same-sex and 53 fraternal mixed-sex adult twin pairs who were paid to maintain 7-d food-intake diaries. From the diary reports, the meal sizes were calculated and subjected to multiple regression analysis using the estimated before-meal stomach contents, the reported number of other people present, the subjective hunger ratings, and the time of day of the meal as predictors. Linear structural modeling was applied to the beta-coefficients from the multiple regression to investigate whether the heritability of the influences of these four variables was independent. Significant genetic effects were found for the beta-coefficients for all four variables, indicating that the heritability of their relationship with intake is to some extent independent and heritable. This suggests that influences of multiple factors on intake are influenced by the genes and become part of the total package of genetically determined physiologic, sociocultural, and psychological processes that regulate energy balance.
Heritability of diurnal changes in food intake in free-living humans.
de Castro, J M
2001-09-01
The time of day of meal ingestion, the number of people present at the meal, the subjective state of hunger, and the estimated before-meal contents in the stomach have been established as influences on the amount eaten in a meal, and this influence has been shown to be heritable. Because these factors intercorrelate, the possibility that the calculated heritabilities for some of these variables could result indirectly from their covariation with one of the other heritable variables was assessed. The independence of the heritability of the influence of these four factors was investigated with 110 identical and 102 fraternal same-sex and 53 fraternal mixed-sex adult twin pairs who were paid to maintain 7-d food intake diaries. From the diary reports, the meal sizes were calculated and subjected to multiple regression analysis using the estimated before-meal stomach contents, the reported number of other people present, the subjective hunger ratings, and the time of day of the meal as predictors. Linear structural modeling was applied to the beta coefficients from the multiple regression to investigate whether the heritability of the influences of these four variables was independent. Significant genetic effects were found for the beta coefficients for all four variables, indicating that the heritability of their relationship with intake is to some extent heritable. These results suggest that the influences of multiple factors on intake are influenced by the genes and become part of the total package of genetically determined physiologic, sociocultural, and psychological processes that regulate energy balance.
Chuma, Izumi; Isobe, Chihiro; Hotta, Yuma; Ibaragi, Kana; Futamata, Natsuru; Kusaba, Motoaki; Yoshida, Kentaro; Terauchi, Ryohei; Fujita, Yoshikatsu; Nakayashiki, Hitoshi; Valent, Barbara; Tosa, Yukio
2011-01-01
Magnaporthe oryzae is the causal agent of rice blast disease, a devastating problem worldwide. This fungus has caused breakdown of resistance conferred by newly developed commercial cultivars. To address how the rice blast fungus adapts itself to new resistance genes so quickly, we examined chromosomal locations of AVR-Pita, a subtelomeric gene family corresponding to the Pita resistance gene, in various isolates of M. oryzae (including wheat and millet pathogens) and its related species. We found that AVR-Pita (AVR-Pita1 and AVR-Pita2) is highly variable in its genome location, occurring in chromosomes 1, 3, 4, 5, 6, 7, and supernumerary chromosomes, particularly in rice-infecting isolates. When expressed in M. oryzae, most of the AVR-Pita homologs could elicit Pita-mediated resistance, even those from non-rice isolates. AVR-Pita was flanked by a retrotransposon, which presumably contributed to its multiple translocation across the genome. On the other hand, family member AVR-Pita3, which lacks avirulence activity, was stably located on chromosome 7 in a vast majority of isolates. These results suggest that the diversification in genome location of AVR-Pita in the rice isolates is a consequence of recognition by Pita in rice. We propose a model that the multiple translocation of AVR-Pita may be associated with its frequent loss and recovery mediated by its transfer among individuals in asexual populations. This model implies that the high mobility of AVR-Pita is a key mechanism accounting for the rapid adaptation toward Pita. Dynamic adaptation of some fungal plant pathogens may be achieved by deletion and recovery of avirulence genes using a population as a unit of adaptation. PMID:21829350
Ben-Moshe, Zohar; Vatine, Gad; Alon, Shahar; Tovin, Adi; Mracek, Philipp; Foulkes, Nicholas S; Gothilf, Yoav
2010-09-01
Circadian rhythms of physiology and behavior are generated by an autonomous circadian oscillator that is synchronized daily with the environment, mainly by light input. The PAR subfamily of transcriptional activators and the related E4BP4 repressor belonging to the basic leucine zipper (bZIP) family are clock-controlled genes that are suggested to mediate downstream circadian clock processes and to feedback onto the core oscillator. Here, the authors report the characterization of these genes in the zebrafish, an increasingly important model in the field of chronobiology. Five novel PAR and six novel e4bp4 zebrafish homolog genes were identified using bioinformatic tools and their coding sequences were cloned. Based on their evolutionary relationships, these genes were annotated as ztef2, zhlf1 and zhlf2, zdbp1 and zdbp2, and ze4bp4-1 to -6. The spatial and temporal mRNA expression pattern of each of these factors was characterized in zebrafish embryos in the context of a functional circadian clock and regulation by light. Nine of the factors exhibited augmented and rhythmic expression in the pineal gland, a central clock organ in zebrafish. Moreover, these genes were found to be regulated, to variable extents, by the circadian clock and/or by light. Differential expression patterns of multiple paralogs in zebrafish suggest multiple roles for these factors within the vertebrate circadian clock. This study, in the genetically accessible zebrafish model, lays the foundation for further research regarding the involvement and specific roles of PAR and E4BP4 transcription factors in the vertebrate circadian clock mechanism.
Lee, I-M; Bottner-Parker, K D; Zhao, Y; Bertaccini, A; Davis, R E
2012-09-01
The pigeon pea witches'-broom phytoplasma group (16SrIX) comprises diverse strains that cause numerous diseases in leguminous trees and herbaceous crops, vegetables, a fruit, a nut tree and a forest tree. At least 14 strains have been reported worldwide. Comparative phylogenetic analyses of the highly conserved 16S rRNA gene and the moderately conserved rplV (rpl22)-rpsC (rps3) and secY genes indicated that the 16SrIX group consists of at least six distinct genetic lineages. Some of these lineages cannot be readily differentiated based on analysis of 16S rRNA gene sequences alone. The relative genetic distances among these closely related lineages were better assessed by including more variable genes [e.g. ribosomal protein (rp) and secY genes]. The present study demonstrated that virtual RFLP analyses using rp and secY gene sequences allowed unambiguous identification of such lineages. A coding system is proposed to designate each distinct rp and secY subgroup in the 16SrIX group.
Nicita, Francesco; Torrente, Isabella; Spalice, Alberto; Bottillo, Irene; Papetti, Laura; Pinna, Valentina; Ursitti, Fabiana; Ruggieri, Martino
2014-02-01
Familial spinal neurofibromatosis (FSNF) is a rare form of neurofibromatosis type 1 (NF1) characterized by multiple, histologically proven neurofibromas of the spinal roots leaving no intact segments and associated neurofibromas of major peripheral nerves. It is sometimes associated with other NF1 stigmata. Most patients have NF1 gene mutations. We describe a patient who fulfilled the diagnostic criteria for spinal neurofibromatosis and belonged to a family in which other affected members exhibited classical NF1 stigmata. A novel missense (c.7109 T>A; p.Val2370Asp) mutation in exon 39 of the NF1 gene was present in the affected family members. The family displayed extreme phenotypic variability in the spectrum of NF1. To our knowledge, this is the first patient with spinal neurofibromatosis in the context of classical NF1 with an NF1 gene mutation. The term FSNF is inaccurate as this condition simply reflects the typical autosomal dominant pattern of NF1 inheritance with phenotypic variability and does not encompass patients with sporadic disease or those in the context of a classical NF1 phenotype as reported in the present family. The term could be replaced by "spinal neurofibromatosis". Copyright © 2013 Elsevier Ltd. All rights reserved.
Tchetgen Tchetgen, Eric
2011-03-01
This article considers the detection and evaluation of genetic effects incorporating gene-environment interaction and independence. Whereas ordinary logistic regression cannot exploit the assumption of gene-environment independence, the proposed approach makes explicit use of the independence assumption to improve estimation efficiency. This method, which uses both cases and controls, fits a constrained retrospective regression in which the genetic variant plays the role of the response variable, and the disease indicator and the environmental exposure are the independent variables. The regression model constrains the association of the environmental exposure with the genetic variant among the controls to be null, thus explicitly encoding the gene-environment independence assumption, which yields substantial gain in accuracy in the evaluation of genetic effects. The proposed retrospective regression approach has several advantages. It is easy to implement with standard software, and it readily accounts for multiple environmental exposures of a polytomous or of a continuous nature, while easily incorporating extraneous covariates. Unlike the profile likelihood approach of Chatterjee and Carroll (Biometrika. 2005;92:399-418), the proposed method does not require a model for the association of a polytomous or continuous exposure with the disease outcome, and, therefore, it is agnostic to the functional form of such a model and completely robust to its possible misspecification.
Mallik, Saurav; Bhadra, Tapas; Mukherji, Ayan
2018-04-01
Association rule mining is an important technique for identifying interesting relationships between gene pairs in a biological data set. Earlier methods basically work for a single biological data set, and, in most cases, a single minimum support cutoff can be applied globally, i.e., across all genesets/itemsets. To overcome this limitation, in this paper, we propose a dynamic threshold-based FP-growth rule mining algorithm that integrates gene expression, methylation and protein-protein interaction profiles based on weighted shortest distance to find novel associations among different pairs of genes in multi-view data sets. For this purpose, we introduce three new thresholds, namely, Distance-based Variable/Dynamic Supports (DVS), Distance-based Variable Confidences (DVC), and Distance-based Variable Lifts (DVL) for each rule by integrating the co-expression, co-methylation, and protein-protein interactions existing in the multi-omics data set. We develop the proposed algorithm utilizing these three novel multiple threshold measures. In the proposed algorithm, the values of DVS, DVC, and DVL are computed for each rule separately, and subsequently it is verified whether the support, confidence, and lift of each evolved rule are greater than or equal to the corresponding individual DVS, DVC, and DVL values, respectively. If all these three conditions for a rule are found to be true, the rule is treated as a resultant rule. One of the major advantages of the proposed method compared with other related state-of-the-art methods is that it considers both the quantitative and interactive significance among all pairwise genes belonging to each rule. Moreover, the proposed method generates fewer rules, takes less running time, and provides greater biological significance for the resultant top-ranking rules compared to previous methods.
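The per-rule acceptance test described in this abstract can be sketched in a few lines. This is only an illustration under stated assumptions: the abstract does not say how DVS, DVC, and DVL are derived from the weighted shortest distances, so the sketch takes them as precomputed inputs. The `Rule` class, the function names, the gene names, and all numbers are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass


@dataclass
class Rule:
    """One candidate association rule with its own thresholds (hypothetical layout)."""
    name: str
    support: float
    confidence: float
    lift: float
    dvs: float  # per-rule support threshold, assumed precomputed
    dvc: float  # per-rule confidence threshold, assumed precomputed
    dvl: float  # per-rule lift threshold, assumed precomputed


def passes_thresholds(rule: Rule) -> bool:
    """A rule passes only if all three measures reach their own thresholds."""
    return (
        rule.support >= rule.dvs
        and rule.confidence >= rule.dvc
        and rule.lift >= rule.dvl
    )


def resultant_rules(rules):
    """Keep the rules that satisfy all three per-rule conditions."""
    return [r for r in rules if passes_thresholds(r)]


# Toy values, invented purely for illustration.
rules = [
    Rule("geneA -> geneB", 0.30, 0.80, 1.5, dvs=0.25, dvc=0.70, dvl=1.2),
    Rule("geneC -> geneD", 0.30, 0.60, 1.5, dvs=0.25, dvc=0.70, dvl=1.2),
    Rule("geneE -> geneF", 0.10, 0.90, 2.0, dvs=0.10, dvc=0.90, dvl=2.0),
]
kept = [r.name for r in resultant_rules(rules)]
print(kept)
```

The second rule is dropped because its confidence falls below its own DVC, while the third is kept because equality with a threshold counts as passing ("greater than or equal to").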
Sun, Lan; Irudayaraj, Joseph
2009-01-01
We demonstrate a surface enhanced Raman spectroscopy (SERS) based array platform to monitor gene expression in cancer cells in a multiplex and quantitative format without amplification steps. A strategy comprising DNA/RNA hybridization, S1 nuclease digestion, and alkaline hydrolysis was adopted to obtain DNA targets specific to two splice junction variants Δ(9, 10) and Δ(5) of the breast cancer susceptibility gene 1 (BRCA1) from MCF-7 and MDA-MB-231 breast cancer cell lines. These two targets were identified simultaneously and their absolute quantities were estimated by a SERS strategy utilizing the inherent plasmon-phonon Raman mode of gold nanoparticle probes as a self-referencing standard to correct for variability in surface enhancement. Results were then validated by reverse transcription PCR (RT-PCR). Our proposed methodology could be expanded to a higher level of multiplexing for quantitative gene expression analysis of any gene without any amplification steps. PMID:19780515
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture.
Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen; Burge, Christopher B
2017-12-27
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning ('intron definition') or exon-spanning ('exon definition') pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60-70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.
Chen, Zhiyuan; Hagen, Darren E.; Elsik, Christine G.; Ji, Tieming; Morris, Collin James; Moon, Laura Emily; Rivera, Rocío Melissa
2015-01-01
Embryos generated with the use of assisted reproductive technologies (ART) can develop overgrowth syndromes. In ruminants, the condition is referred to as large offspring syndrome (LOS) and exhibits variable phenotypic abnormalities including overgrowth, enlarged tongue, and abdominal wall defects. These characteristics recapitulate those observed in the human loss-of-imprinting (LOI) overgrowth syndrome Beckwith–Wiedemann (BWS). We have recently shown LOI at the KCNQ1 locus in LOS, the most common epimutation in BWS. Although the first case of ART-induced LOS was reported in 1995, studies have not yet determined the extent of LOI in this condition. Here, we determined allele-specific expression of imprinted genes previously identified in human and/or mouse in day ∼105 Bos taurus indicus × Bos taurus taurus F1 hybrid control and LOS fetuses using RNAseq. Our analysis allowed us to determine the monoallelic expression of 20 genes in tissues of control fetuses. LOS fetuses displayed variable LOI compared with controls. Biallelic expression of imprinted genes in LOS was associated with tissue-specific hypomethylation of the normally methylated parental allele. In addition, a positive correlation was observed between body weight and the number of biallelically expressed imprinted genes in LOS fetuses. Furthermore, not only was there loss of allele-specific expression of imprinted genes in LOS, but also differential transcript amounts of these genes between control and overgrown fetuses. In summary, we characterized previously unidentified imprinted genes in bovines and identified misregulation of imprinting at multiple loci in LOS. We concluded that LOS is a multilocus LOI syndrome, as is BWS. PMID:25825726
Chen, Zhiyuan; Hagen, Darren E; Elsik, Christine G; Ji, Tieming; Morris, Collin James; Moon, Laura Emily; Rivera, Rocío Melissa
2015-04-14
Embryos generated with the use of assisted reproductive technologies (ART) can develop overgrowth syndromes. In ruminants, the condition is referred to as large offspring syndrome (LOS) and exhibits variable phenotypic abnormalities including overgrowth, enlarged tongue, and abdominal wall defects. These characteristics recapitulate those observed in the human loss-of-imprinting (LOI) overgrowth syndrome Beckwith-Wiedemann (BWS). We have recently shown LOI at the KCNQ1 locus in LOS, the most common epimutation in BWS. Although the first case of ART-induced LOS was reported in 1995, studies have not yet determined the extent of LOI in this condition. Here, we determined allele-specific expression of imprinted genes previously identified in human and/or mouse in day ∼105 Bos taurus indicus × Bos taurus taurus F1 hybrid control and LOS fetuses using RNAseq. Our analysis allowed us to determine the monoallelic expression of 20 genes in tissues of control fetuses. LOS fetuses displayed variable LOI compared with controls. Biallelic expression of imprinted genes in LOS was associated with tissue-specific hypomethylation of the normally methylated parental allele. In addition, a positive correlation was observed between body weight and the number of biallelically expressed imprinted genes in LOS fetuses. Furthermore, not only was there loss of allele-specific expression of imprinted genes in LOS, but also differential transcript amounts of these genes between control and overgrown fetuses. In summary, we characterized previously unidentified imprinted genes in bovines and identified misregulation of imprinting at multiple loci in LOS. We concluded that LOS is a multilocus LOI syndrome, as is BWS.
Boosting for detection of gene-environment interactions.
Pashova, H; LeBlanc, M; Kooperberg, C
2013-01-30
In genetic association studies, it is typically thought that genetic variants and environmental variables jointly will explain more of the inheritance of a phenotype than either of these two components separately. Traditional methods to identify gene-environment interactions typically consider only one measured environmental variable at a time. However, in practice, multiple environmental factors may each be imprecise surrogates for the underlying physiological process that actually interacts with the genetic factors. In this paper, we develop a variant of L(2) boosting that is specifically designed to identify combinations of environmental variables that jointly modify the effect of a gene on a phenotype. Because the effect modifiers might have a small signal compared with the main effects, working in a space that is orthogonal to the main predictors allows us to focus on the interaction space. In a simulation study that investigates some plausible underlying model assumptions, our method outperforms the least absolute shrinkage and selection operator and Akaike Information Criterion and Bayesian Information Criterion model selection procedures as having the lowest test error. In an example for the Women's Health Initiative-Population Architecture using Genomics and Epidemiology study, the dedicated boosting method was able to pick out two single-nucleotide polymorphisms for which effect modification appears present. The performance was evaluated on an independent test set, and the results are promising. Copyright © 2012 John Wiley & Sons, Ltd.
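The idea of boosting in a space orthogonal to the main predictors can be illustrated with a generic sketch. It is not the authors' variant, whose details the abstract does not give. It assumes a continuous phenotype, projects the response and the gene-by-environment product terms onto the orthogonal complement of the main effects (intercept, gene, environmental variables), and then runs plain componentwise L2 boosting on those residualized terms. All function names, the toy data, and the step size `nu` are hypothetical.

```python
def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def residualize(v, basis):
    """Remove from v its projection onto each vector of an orthogonal basis."""
    out = list(v)
    for b in basis:
        coef = dot(out, b) / dot(b, b)
        out = [o - coef * bi for o, bi in zip(out, b)]
    return out


def orthogonal_basis(columns):
    """Gram-Schmidt: turn the main-effect columns into an orthogonal basis."""
    basis = []
    for c in columns:
        r = residualize(c, basis)
        if dot(r, r) > 1e-12:  # skip columns already spanned by the basis
            basis.append(r)
    return basis


def l2_boost(features, response, steps=100, nu=0.1):
    """Componentwise L2 boosting: each step refits the single best feature."""
    coefs = [0.0] * len(features)
    resid = list(response)
    for _ in range(steps):
        best_j, best_gain, best_b = None, 0.0, 0.0
        for j, x in enumerate(features):
            xx = dot(x, x)
            if xx <= 1e-12:
                continue
            xr = dot(x, resid)
            gain = xr * xr / xx  # reduction in residual sum of squares
            if gain > best_gain:
                best_j, best_gain, best_b = j, gain, xr / xx
        if best_j is None:
            break
        coefs[best_j] += nu * best_b
        resid = [r - nu * best_b * xi for r, xi in zip(resid, features[best_j])]
    return coefs


# Toy data, invented for illustration: genotype g, two exposures e1 and e2.
g = [0.0, 1.0, 2.0, 0.0, 1.0, 2.0, 1.0, 0.0]
e1 = [0.5, 1.0, -1.0, 2.0, 0.0, 1.5, -0.5, 1.0]
e2 = [1.0, -1.0, 0.5, 0.0, 2.0, -0.5, 1.5, -2.0]
# Phenotype with main effects and one true interaction (g * e1, coefficient 2).
y = [1.0 + 0.5 * gi + a + 2.0 * gi * a for gi, a in zip(g, e1)]

main = orthogonal_basis([[1.0] * len(g), g, e1, e2])
interactions = [
    residualize([gi * a for gi, a in zip(g, e1)], main),  # g x e1
    residualize([gi * b for gi, b in zip(g, e2)], main),  # g x e2
]
coefs = l2_boost(interactions, residualize(y, main))
print(coefs)
```

Because the main effects are projected out first, the boosting steps spend all of their updates on the interaction terms. The returned coefficients refer to the residualized product columns, so in this noise-free toy case the first one approaches the true interaction coefficient of 2.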
Herrgård, Markus J.
2014-01-01
High-cell-density fermentation for industrial production of chemicals can impose numerous stresses on cells due to high substrate, product, and by-product concentrations; high osmolarity; reactive oxygen species; and elevated temperatures. There is a need to develop platform strains of industrial microorganisms that are more tolerant toward these typical processing conditions. In this study, the growth of six industrially relevant strains of Escherichia coli was characterized under eight stress conditions representative of fed-batch fermentation, and strains W and BL21(DE3) were selected as platforms for transposon (Tn) mutagenesis due to favorable resistance characteristics. Selection experiments, followed by either targeted or genome-wide next-generation-sequencing-based Tn insertion site determination, were performed to identify mutants with improved growth properties under a subset of three stress conditions and two combinations of individual stresses. A subset of the identified loss-of-function mutants were selected for a combinatorial approach, where strains with combinations of two and three gene deletions were systematically constructed and tested for single and multistress resistance. These approaches allowed identification of (i) strain-background-specific stress resistance phenotypes, (ii) novel gene deletion mutants in E. coli that confer single and multistress resistance in a strain-background-dependent manner, and (iii) synergistic effects of multiple gene deletions that confer improved resistance over single deletions. The results of this study underscore the suboptimality and strain-specific variability of the genetic network regulating growth under stressful conditions and suggest that further exploration of the combinatorial gene deletion space in multiple strain backgrounds is needed for optimizing strains for microbial bioprocessing applications. PMID:25085490
Chattaway, Marie Anne; Day, Michaela; Mtwale, Julia; White, Emma; Rogers, James; Day, Martin; Powell, David; Ahmad, Marwa; Harris, Ross; Talukder, Kaisar Ali; Wain, John; Jenkins, Claire; Cravioto, Alejandro
2017-01-01
Purpose. This study investigates the virulence and antimicrobial resistance in association with common clonal complexes (CCs) of enteroaggregative Escherichia coli (EAEC) isolated from Bangladesh. The aim was to determine whether specific CCs were more likely to be associated with putative virulence genes and/or antimicrobial resistance. Methodology. The presence of 15 virulence genes (by PCR) and susceptibility to 18 antibiotics were determined for 151 EAEC isolated from cases and controls during an intestinal infectious disease study carried out between 2007 and 2011 in the rural setting of Mirzapur, Bangladesh (Kotloff KL, Blackwelder WC, Nasrin D, Nataro JP, Farag TH et al. Clin Infect Dis 2012;55:S232–S245). These data were then analysed in the context of previously determined serotypes and clonal complexes defined by multi-locus sequence typing. Results. Overall there was no association between the presence of virulence or antimicrobial resistance genes in isolates of EAEC from cases versus controls. However, when stratified by clonal complex (CC), one CC associated with cases harboured more virulence factors (CC40) and one CC harboured more resistance genes (CC38) than the average. There was no direct link between the virulence gene content and antibiotic resistance. Strains within a single CC had variable virulence and resistance gene content, indicating independent and multiple gene acquisitions over time. Conclusion. In Bangladesh, there are multiple clonal complexes of EAEC harbouring a variety of virulence and resistance genes. The emergence of two of the most successful clones appeared to be linked to either increased virulence (CC40) or antimicrobial resistance (CC38), but increased resistance and virulence were not found in the same clonal complexes. PMID:28945190
Charbonneau, Bridget; Maurer, Matthew J.; Fredericksen, Zachary S.; Zent, Clive S.; Link, Brian K.; Novak, Anne J.; Ansell, Stephen M.; Weiner, George J.; Wang, Alice H.; Witzig, Thomas E.; Dogan, Ahmet; Slager, Susan L.; Habermann, Thomas M.; Cerhan, James R.
2013-01-01
The complement pathway plays a central role in innate immunity, and also functions as a regulator of the overall immune response. We evaluated whether polymorphisms in complement genes are associated with event-free survival (EFS) in follicular (FL) and diffuse large B-cell (DLBCL) lymphoma. We genotyped 167 single nucleotide polymorphisms (SNPs) from 30 complement pathway genes in a prospective cohort study of newly diagnosed FL (N=107) and DLBCL (N=82) patients enrolled at the Mayo Clinic from 2002–2005. Cox regression was used to estimate Hazard Ratios (HRs) for individual SNPs with EFS, adjusting for FLIPI or IPI and treatment. For gene-level analyses, we used a principal components based gene-level test. In gene-level analyses for FL EFS, CFH (p=0.009), CD55 (p=0.006), CFHR5 (p=0.01), C9 (p=0.02), CFHR1 (p=0.03), and CD46 (p=0.03) were significant at p<0.05, and these genes remained noteworthy after accounting for multiple testing (q<0.15). SNPs in CFH, CFHR1, and CFHR5 showed stronger associations among patients receiving any rituximab, while SNPs from CD55 and CD46 showed stronger associations among patients who were observed. For DLBCL, only CLU (p=0.001) and C7 (p=0.03) were associated with EFS, but did not remain noteworthy after accounting for multiple testing (q>0.15). Genes from the Regulators of Complement Activation (CFH, CD55, CFHR1, CFHR5, CD46) at 1q32-q32.1, along with C9, were associated with FL EFS after adjusting for clinical variables, and if replicated, these findings add further support for the role of host innate immunity in FL prognosis. PMID:22718493
Gene expression variability in human hepatic drug metabolizing enzymes and transporters.
Yang, Lun; Price, Elvin T; Chang, Ching-Wei; Li, Yan; Huang, Ying; Guo, Li-Wu; Guo, Yongli; Kaput, Jim; Shi, Leming; Ning, Baitang
2013-01-01
Interindividual variability in the expression of drug-metabolizing enzymes and transporters (DMETs) in human liver may contribute to interindividual differences in drug efficacy and adverse reactions. Published studies that analyzed variability in the expression of DMET genes were limited by sample sizes and the number of genes profiled. We systematically analyzed the expression of 374 DMETs from a microarray data set consisting of gene expression profiles derived from 427 human liver samples. The standard deviation of interindividual expression for DMET genes was much higher than that for non-DMET genes. The 20 DMET genes with the largest variability in expression provided examples of the interindividual variation. Gene expression data were also analyzed using network analysis methods, which delineate the similarities of biological functionalities and regulation mechanisms for these highly variable DMET genes. Expression variability of human hepatic DMET genes may affect drug-gene interactions and disease susceptibility, with concomitant clinical implications.
The shaping and functional consequences of the dosage effect landscape in multiple myeloma.
Samur, Mehmet K; Shah, Parantu K; Wang, Xujun; Minvielle, Stéphane; Magrangeas, Florence; Avet-Loiseau, Hervé; Munshi, Nikhil C; Li, Cheng
2013-10-02
Multiple myeloma (MM) is a malignant proliferation of plasma B cells. Based on recurrent aneuploidy such as copy number alterations (CNAs), myeloma is divided into two subtypes with different CNA patterns and patient survival outcomes. How aneuploidy events arise, and whether they contribute to cancer cell evolution, are actively studied. The large number of transcriptomic changes resulting from CNAs (the dosage effect) poses a major challenge for identifying the functional consequences of CNAs in myeloma in terms of specific driver genes and pathways. In this study, we hypothesize that the gene-wise dosage effect varies as a result of complex regulatory networks that translate the impact of CNAs to gene expression, and that studying this variation can provide insights into the functional effects of CNAs. We propose a gene-wise dosage effect score and a genome-wide karyotype plot as tools to measure and visualize concordant copy number and expression changes across cancer samples. We find that the dosage effect in myeloma is widespread yet variable, and it is correlated with gene expression level and CNA frequencies in different chromosomes. Our analysis suggests that despite the enrichment of differentially expressed genes between hyperdiploid MM and non-hyperdiploid MM in the trisomy chromosomes, the chromosomal proportion of dosage sensitive genes is higher in the non-trisomy chromosomes. Dosage-sensitive genes are enriched for genes with protein translation and localization functions, and dosage-resistant genes are enriched for apoptosis genes. These results point to future studies on differential dosage sensitivity and resistance of pro- and anti-proliferation pathways and their variation across patients as therapeutic targets and prognosis markers. Our findings support the hypothesis that recurrent CNAs in myeloma are selected by their functional consequences.
The novel dosage effect score defined in this work will facilitate integration of copy number and expression data for identifying driver genes in cancer genomics studies. The accompanying R code is available at http://www.canevolve.org/dosageEffect/.
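The abstract does not give the formula for the dosage effect score, so the following is only a stand-in to make the idea concrete: it scores each gene by the Pearson correlation between its copy number and its expression across samples. The gene names, the toy values, and the choice of correlation as the score are assumptions, not the definition used in the accompanying R code.

```python
import math

def pearson(x, y):
    """Pearson correlation; returns 0.0 when either vector is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)

def dosage_effect_scores(copy_number, expression):
    """Map gene -> concordance between copy number and expression across samples.

    Both arguments are dicts of gene -> list of per-sample values, with
    samples in the same order. This correlation-based score is a
    hypothetical proxy for the score defined in the paper.
    """
    return {gene: pearson(copy_number[gene], expression[gene])
            for gene in copy_number if gene in expression}

# Toy data for five samples (hypothetical gene labels and values).
copy_number = {
    "geneA": [2, 3, 3, 2, 4],
    "geneB": [2, 3, 3, 2, 4],
}
expression = {
    "geneA": [1.0, 1.5, 1.5, 1.0, 2.0],   # tracks copy number: dosage sensitive
    "geneB": [1.2, 1.2, 1.2, 1.2, 1.2],   # flat despite gains: dosage resistant
}
scores = dosage_effect_scores(copy_number, expression)
print(scores)
```

Under this proxy, a score near 1 marks a gene whose expression follows its copy number, and a score near 0 marks a gene that is buffered against copy number change.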
Halberg, Richard B.; Chen, Xiaodi; Amos-Landgraf, James M.; White, Alanna; Rasmussen, Kristin; Clipson, Linda; Pasch, Cheri; Sullivan, Ruth; Pitot, Henry C.; Dove, William F.
2008-01-01
Familial adenomatous polyposis (FAP) is a human cancer syndrome characterized by the development of hundreds to thousands of colonic polyps and extracolonic lesions including desmoid fibromas, osteomas, epidermoid cysts, and congenital hypertrophy of the pigmented retinal epithelium. Afflicted individuals are heterozygous for mutations in the APC gene. Detailed investigations of mice heterozygous for mutations in the ortholog Apc have shown that other genetic factors strongly influence the phenotype. Here we report qualitative and quantitative modifications of the phenotype of Apc mutants as a function of three genetic variables: Apc allele, p53 allele, and genetic background. We have found major differences between the Apc alleles Min and 1638N in multiplicity and regionality of intestinal tumors, as well as in incidence of extracolonic lesions. By contrast, Min mice homozygous for either of two different knockout alleles of p53 show similar phenotypic effects. These studies illustrate the classic principle that functional genetics is enriched by assessing penetrance and expressivity with allelic series. The mouse permits study of an allelic gene series on multiple genetic backgrounds, thereby leading to a better understanding of gene action in a range of biological processes. PMID:18723878
Halberg, Richard B; Chen, Xiaodi; Amos-Landgraf, James M; White, Alanna; Rasmussen, Kristin; Clipson, Linda; Pasch, Cheri; Sullivan, Ruth; Pitot, Henry C; Dove, William F
2008-09-01
Familial adenomatous polyposis (FAP) is a human cancer syndrome characterized by the development of hundreds to thousands of colonic polyps and extracolonic lesions including desmoid fibromas, osteomas, epidermoid cysts, and congenital hypertrophy of the pigmented retinal epithelium. Afflicted individuals are heterozygous for mutations in the APC gene. Detailed investigations of mice heterozygous for mutations in the ortholog Apc have shown that other genetic factors strongly influence the phenotype. Here we report qualitative and quantitative modifications of the phenotype of Apc mutants as a function of three genetic variables: Apc allele, p53 allele, and genetic background. We have found major differences between the Apc alleles Min and 1638N in multiplicity and regionality of intestinal tumors, as well as in incidence of extracolonic lesions. By contrast, Min mice homozygous for either of two different knockout alleles of p53 show similar phenotypic effects. These studies illustrate the classic principle that functional genetics is enriched by assessing penetrance and expressivity with allelic series. The mouse permits study of an allelic gene series on multiple genetic backgrounds, thereby leading to a better understanding of gene action in a range of biological processes.
Lessons learned from the dog genome.
Wayne, Robert K; Ostrander, Elaine A
2007-11-01
Extensive genetic resources and a high-quality genome sequence position the dog as an important model species for understanding genome evolution, population genetics and genes underlying complex phenotypic traits. Newly developed genomic resources have expanded our understanding of canine evolutionary history and dog origins. Domestication involved genetic contributions from multiple populations of gray wolves probably through backcrossing. More recently, the advent of controlled breeding practices has segregated genetic variability into distinct dog breeds that possess specific phenotypic traits. Consequently, genome-wide association and selective sweep scans now allow the discovery of genes underlying breed-specific characteristics. The dog is finally emerging as a novel resource for studying the genetic basis of complex traits, including behavior.
Lessons learned: Optimization of a murine small bowel resection model
Taylor, Janice A.; Martin, Colin A.; Nair, Rajalakshmi; Guo, Jun; Erwin, Christopher R.; Warner, Brad W.
2008-01-01
Background/Purpose. Central to the use of murine models of disease is the ability to derive reproducible data. The purpose of this study was to determine factors contributing to variability in our murine model of small bowel resection (SBR). Methods. Male C57Bl/6 mice were randomized to sham or 50% SBR. The effect of housing type (pathogen-free versus standard housing), nutrition (reconstituted powder versus tube feeding formulation), and correlates of intestinal morphology with gene expression changes were investigated. Multiple linear regression modeling or one-way ANOVA was used for data analysis. Results. Pathogen-free mice had significantly shorter ileal villi at baseline and demonstrated greater villus growth after SBR compared to mice housed in standard rooms. Food type did not affect adaptation. Gene expression changes were more consistent and significant in isolated crypt cells that demonstrated adaptive growth when compared with crypts that did not deepen after SBR. Conclusion. Maintenance of mice in pathogen-free conditions and restricting gene expression analysis to individual animals exhibiting morphologic adaptation enhance the sensitivity and specificity of data derived from this model. These refinements will minimize experimental variability and lead to improved understanding of the complex process of intestinal adaptation. PMID:18558176
Antonov, Valery A; Tkachenko, Galina A; Altukhova, Viktoriya V; Savchenko, Sergey S; Zinchenko, Olga V; Viktorov, Dmitry V; Zamaraev, Valery S; Ilyukhin, Vladimir I; Alekseev, Vladimir V
2008-12-01
Burkholderia mallei and B. pseudomallei are highly pathogenic microorganisms for both humans and animals. Moreover, they are regarded as potential agents of bioterrorism. Thus, rapid and unequivocal detection and identification of these dangerous pathogens is critical. In the present study, we describe the use of an optimized protocol for the early diagnosis of experimental glanders and melioidosis and for the rapid differentiation and typing of Burkholderia strains. This experience with PCR-based identification methods indicates that single PCR targets (23S and 16S rRNA genes, 16S-23S intergenic region, fliC and type III secretion gene cluster) should be used with caution for identification of B. mallei and B. pseudomallei, and need to be used alongside molecular methods such as gene sequencing. Several molecular typing procedures have been used to identify genetically related B. pseudomallei and B. mallei isolates, including ribotyping, pulsed-field gel electrophoresis and multilocus sequence typing. However, these methods are time consuming and technically challenging for many laboratories. RAPD, variable amplicon typing scheme, Rep-PCR, BOX-PCR and multiple-locus variable-number tandem repeat analysis have been recommended by us for the rapid differentiation of B. mallei and B. pseudomallei strains.
Frequency of mononuclear diploid cardiomyocytes underlies natural variation in heart regeneration.
Patterson, Michaela; Barske, Lindsey; Van Handel, Ben; Rau, Christoph D; Gan, Peiheng; Sharma, Avneesh; Parikh, Shan; Denholtz, Matt; Huang, Ying; Yamaguchi, Yukiko; Shen, Hua; Allayee, Hooman; Crump, J Gage; Force, Thomas I; Lien, Ching-Ling; Makita, Takako; Lusis, Aldons J; Kumar, S Ram; Sucov, Henry M
2017-09-01
Adult mammalian cardiomyocyte regeneration after injury is thought to be minimal. Mononuclear diploid cardiomyocytes (MNDCMs), a relatively small subpopulation in the adult heart, may account for the observed degree of regeneration, but this has not been tested. We surveyed 120 inbred mouse strains and found that the frequency of adult mononuclear cardiomyocytes was surprisingly variable (>7-fold). Cardiomyocyte proliferation and heart functional recovery after coronary artery ligation both correlated with pre-injury MNDCM content. Using genome-wide association, we identified Tnni3k as one gene that influences variation in this composition and demonstrated that Tnni3k knockout resulted in elevated MNDCM content and increased cardiomyocyte proliferation after injury. Reciprocally, overexpression of Tnni3k in zebrafish promoted cardiomyocyte polyploidization and compromised heart regeneration. Our results corroborate the relevance of MNDCMs in heart regeneration. Moreover, they imply that intrinsic heart regeneration is neither limited nor uniform in all individuals, but rather is a variable trait influenced by multiple genes.
Huang, Tianhong; Yang, Guilin; Dang, Xiao; Ao, Feijian; Li, Jiankang; He, Yizhou; Tang, Qiyuan; He, Qing
2017-11-01
Alagille syndrome (AGS) is a highly variable, autosomal dominant disease that affects multiple structures including the liver, heart, eyes, bones and face. Targeted region capture sequencing focuses on a panel of known pathogenic genes and provides a rapid, cost‑effective and accurate method for molecular diagnosis. In a Chinese family, this method was used on the proband and Sanger sequencing was applied to validate the candidate mutation. A de novo heterozygous mutation (c.3254_3255insT p.Leu1085PhefsX24) of the jagged 1 gene was identified as the potential disease‑causing gene mutation. In conclusion, the present study suggested that target region capture sequencing is an efficient, reliable and accurate approach for the clinical diagnosis of AGS. Furthermore, these results expand on the understanding of the pathogenesis of AGS.
Single-cell analysis of transcription kinetics across the cell cycle
Skinner, Samuel O; Xu, Heng; Nagarkar-Jaiswal, Sonal; Freire, Pablo R; Zwaka, Thomas P; Golding, Ido
2016-01-01
Transcription is a highly stochastic process. To infer transcription kinetics for a gene-of-interest, researchers commonly compare the distribution of mRNA copy-number to the prediction of a theoretical model. However, the reliability of this procedure is limited because the measured mRNA numbers represent integration over the mRNA lifetime, contribution from multiple gene copies, and mixing of cells from different cell-cycle phases. We address these limitations by simultaneously quantifying nascent and mature mRNA in individual cells, and incorporating cell-cycle effects in the analysis of mRNA statistics. We demonstrate our approach on Oct4 and Nanog in mouse embryonic stem cells. Both genes follow similar two-state kinetics. However, Nanog exhibits slower ON/OFF switching, resulting in increased cell-to-cell variability in mRNA levels. Early in the cell cycle, the two copies of each gene exhibit independent activity. After gene replication, the probability of each gene copy to be active diminishes, resulting in dosage compensation. DOI: http://dx.doi.org/10.7554/eLife.12175.001 PMID:26824388
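The link between slower ON/OFF switching and greater cell-to-cell variability can be seen in a minimal simulation of the standard two-state (telegraph) model. This sketch is not the authors' analysis: it ignores nascent mRNA, gene copy number, and cell-cycle phase, and all rate values are arbitrary assumptions. For this model the stationary Fano factor (variance/mean) is known in closed form, $1 + k_{tx} k_{off} / [(k_{on}+k_{off})(k_{on}+k_{off}+k_{deg})]$, which the code evaluates alongside the simulation.

```python
import random

def simulate_cell(k_on, k_off, k_tx, k_deg, t_end, rng):
    """Gillespie simulation of one gene copy; returns the mRNA count at t_end."""
    t = 0.0
    on = rng.random() < k_on / (k_on + k_off)  # start from the stationary promoter state
    m = 0
    while True:
        r_tx = k_tx if on else 0.0             # transcription only while ON
        r_deg = k_deg * m                      # first-order mRNA decay
        r_switch = k_off if on else k_on       # promoter state flip
        total = r_tx + r_deg + r_switch
        t += rng.expovariate(total)
        if t > t_end:
            return m
        u = rng.random() * total
        if u < r_tx:
            m += 1
        elif u < r_tx + r_deg:
            m -= 1
        else:
            on = not on

def fano(counts):
    """Variance-to-mean ratio of a list of counts."""
    mean = sum(counts) / len(counts)
    var = sum((c - mean) ** 2 for c in counts) / len(counts)
    return var / mean

def analytic_fano(k_on, k_off, k_tx, k_deg):
    """Stationary Fano factor of the two-state model."""
    s = k_on + k_off
    return 1.0 + k_tx * k_off / (s * (s + k_deg))

rng = random.Random(1)
n_cells, t_end = 200, 30.0
# Same mean expression (half the time ON), different switching speed.
slow = [simulate_cell(0.1, 0.1, 20.0, 1.0, t_end, rng) for _ in range(n_cells)]
fast = [simulate_cell(10.0, 10.0, 20.0, 1.0, t_end, rng) for _ in range(n_cells)]
fano_slow, fano_fast = fano(slow), fano(fast)
print("slow switching:", round(fano_slow, 2), "theory", round(analytic_fano(0.1, 0.1, 20.0, 1.0), 2))
print("fast switching:", round(fano_fast, 2), "theory", round(analytic_fano(10.0, 10.0, 20.0, 1.0), 2))
```

Both parameter sets keep the promoter ON half of the time, so the mean mRNA level is the same; only the switching speed differs, and the slowly switching gene shows the much broader distribution.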
Uptake, Results, and Outcomes of Germline Multiple-Gene Sequencing After Diagnosis of Breast Cancer.
Kurian, Allison W; Ward, Kevin C; Hamilton, Ann S; Deapen, Dennis M; Abrahamse, Paul; Bondarenko, Irina; Li, Yun; Hawley, Sarah T; Morrow, Monica; Jagsi, Reshma; Katz, Steven J
2018-05-10
Low-cost sequencing of multiple genes is increasingly available for cancer risk assessment. Little is known about uptake or outcomes of multiple-gene sequencing after breast cancer diagnosis in community practice. The objective was to examine the effect of multiple-gene sequencing on the experience and treatment outcomes for patients with breast cancer. For this population-based retrospective cohort study, patients with breast cancer diagnosed from January 2013 to December 2015 and accrued from SEER registries across Georgia and in Los Angeles, California, were surveyed (n = 5080, response rate = 70%). Responses were merged with SEER data and results of clinical genetic tests, either BRCA1 and BRCA2 (BRCA1/2) sequencing only or including additional other genes (multiple-gene sequencing), provided by 4 laboratories. Measures were type of testing (multiple-gene sequencing vs BRCA1/2-only sequencing), test results (negative, variant of unknown significance, or pathogenic variant), patient experiences with testing (timing of testing, who discussed results), treatment (strength of patient consideration of, and surgeon recommendation for, prophylactic mastectomy), and prophylactic mastectomy receipt. We defined a patient subgroup with higher pretest risk of carrying a pathogenic variant according to practice guidelines. Among 5026 patients (mean [SD] age, 59.9 [10.7]), 1316 (26.2%) were linked to genetic results from any laboratory. Multiple-gene sequencing increasingly replaced BRCA1/2-only testing over time: in 2013, the rate of multiple-gene sequencing was 25.6% and BRCA1/2-only testing, 74.4%; in 2015, the rate of multiple-gene sequencing was 66.5% and BRCA1/2-only testing, 33.5%. Multiple-gene sequencing was more often ordered by genetic counselors (multiple-gene sequencing, 25.5% and BRCA1/2-only testing, 15.3%) and delayed until after surgery (multiple-gene sequencing, 32.5% and BRCA1/2-only testing, 19.9%). 
Multiple-gene sequencing substantially increased the rate of detection of any pathogenic variant (multiple-gene sequencing: higher-risk patients, 12%; average-risk patients, 4.2% and BRCA1/2-only testing: higher-risk patients, 7.8%; average-risk patients, 2.2%) and variants of uncertain significance, especially in minorities (multiple-gene sequencing: white patients, 23.7%; black patients, 44.5%; and Asian patients, 50.9% and BRCA1/2-only testing: white patients, 2.2%; black patients, 5.6%; and Asian patients, 0%). Multiple-gene sequencing was not associated with an increase in the rate of prophylactic mastectomy use, which was highest with pathogenic variants in BRCA1/2 (BRCA1/2, 79.0%; other pathogenic variant, 37.6%; variant of uncertain significance, 30.2%; negative, 35.3%). Multiple-gene sequencing rapidly replaced BRCA1/2-only testing for patients with breast cancer in the community and enabled 2-fold higher detection of clinically relevant pathogenic variants without an associated increase in prophylactic mastectomy. However, important targets for improvement in the clinical utility of multiple-gene sequencing include postsurgical delay and racial/ethnic disparity in variants of uncertain significance.
From mild ataxia to huntington disease phenocopy: the multiple faces of spinocerebellar ataxia 17.
Koutsis, Georgios; Panas, Marios; Paraskevas, George P; Bougea, Anastasia M; Kladi, Athina; Karadima, Georgia; Kapaki, Elisabeth
2014-01-01
Introduction. Spinocerebellar ataxia 17 (SCA 17) is a rare autosomal dominant cerebellar ataxia (ADCA) caused by a CAG/CAA expansion in the TBP gene, reported from a limited number of countries. It is a very heterogeneous ADCA characterized by ataxia, cognitive decline, psychiatric symptoms, and involuntary movements, with some patients presenting with Huntington disease (HD) phenocopies. The SCA 17 expansion is stable during parent-child transmission and intrafamilial phenotypic homogeneity has been reported. However, significant phenotypic variability within families has also been observed. Report of the Family. We presently report a Greek family with a pathological expansion of 54 repeats at the SCA 17 locus that displayed remarkable phenotypic variability. Among 3 affected members, one presented with HD phenocopy; one with progressive ataxia, dementia, chorea, dystonia, and seizures, and one with mild slowly progressive ataxia with minor cognitive and affective symptoms. Conclusions. This is the first family with SCA 17 identified in Greece and highlights the multiple faces of this rare disorder, even within the same family.
From Mild Ataxia to Huntington Disease Phenocopy: The Multiple Faces of Spinocerebellar Ataxia 17
Panas, Marios; Paraskevas, George P.; Bougea, Anastasia M.; Karadima, Georgia; Kapaki, Elisabeth
2014-01-01
Introduction. Spinocerebellar ataxia 17 (SCA 17) is a rare autosomal dominant cerebellar ataxia (ADCA) caused by a CAG/CAA expansion in the TBP gene, reported from a limited number of countries. It is a very heterogeneous ADCA characterized by ataxia, cognitive decline, psychiatric symptoms, and involuntary movements, with some patients presenting with Huntington disease (HD) phenocopies. The SCA 17 expansion is stable during parent-child transmission and intrafamilial phenotypic homogeneity has been reported. However, significant phenotypic variability within families has also been observed. Report of the Family. We presently report a Greek family with a pathological expansion of 54 repeats at the SCA 17 locus that displayed remarkable phenotypic variability. Among 3 affected members, one presented with HD phenocopy; one with progressive ataxia, dementia, chorea, dystonia, and seizures, and one with mild slowly progressive ataxia with minor cognitive and affective symptoms. Conclusions. This is the first family with SCA 17 identified in Greece and highlights the multiple faces of this rare disorder, even within the same family. PMID:25349749
High frequency, spontaneous motA mutations in Campylobacter jejuni strain 81-176.
Mohawk, Krystle L; Poly, Frédéric; Sahl, Jason W; Rasko, David A; Guerry, Patricia
2014-01-01
Campylobacter jejuni is an important cause of bacterial diarrhea worldwide. The pathogenesis of C. jejuni is poorly understood and complicated by phase variation of multiple surface structures including lipooligosaccharide, capsule, and flagellum. When C. jejuni strain 81-176 was plated on blood agar for single colonies, the presence of translucent, non-motile colonial variants was noted among the majority of opaque, motile colonies. High-throughput genomic sequencing of two flagellated translucent and two opaque variants as well as the parent strain revealed multiple genetic changes compared to the published genome. However, the only mutated open reading frame common between the two translucent variants and absent from the opaque variants and the parent was motA, encoding a flagellar motor protein. A total of 18 spontaneous motA mutations were found that mapped to four distinct sites in the gene, with only one class of mutation present in a phase variable region. This study exemplifies the mutative/adaptive properties of C. jejuni and demonstrates additional variability in C. jejuni beyond phase variation.
Smith, Maria W.; Herfort, Lydie; Tyrol, Kaitlin; Suciu, Dominic; Campbell, Victoria; Crump, Byron C.; Peterson, Tawnya D.; Zuber, Peter; Baptista, Antonio M.; Simon, Holly M.
2010-01-01
Through their metabolic activities, microbial populations mediate the impact of high gradient regions on ecological function and productivity of the highly dynamic Columbia River coastal margin (CRCM). A 2226-probe oligonucleotide DNA microarray was developed to investigate expression patterns for microbial genes involved in nitrogen and carbon metabolism in the CRCM. Initial experiments with the environmental microarrays were directed toward validation of the platform and yielded high reproducibility in multiple tests. Bioinformatic and experimental validation also indicated that >85% of the microarray probes were specific for their corresponding target genes and for a few homologs within the same microbial family. The validated probe set was used to query gene expression responses by microbial assemblages to environmental variability. Sixty-four samples from the river, estuary, plume, and adjacent ocean were collected in different seasons and analyzed to correlate the measured variability in chemical, physical and biological water parameters to differences in global gene expression profiles. The method produced robust seasonal profiles corresponding to pre-freshet spring (April) and late summer (August). Overall relative gene expression was high in both seasons and was consistent with high microbial abundance measured by total RNA, heterotrophic bacterial production, and chlorophyll a. Both seasonal patterns involved large numbers of genes that were highly expressed relative to background, yet each produced very different gene expression profiles. April patterns revealed high differential gene expression in the coastal margin samples (estuary, plume and adjacent ocean) relative to freshwater, while little differential gene expression was observed along the river-to-ocean transition in August. Microbial gene expression profiles appeared to relate, in part, to seasonal differences in nutrient availability and potential resource competition. 
Furthermore, our results suggest that highly active particle-attached microbiota in the Columbia River water column may perform dissimilatory nitrate reduction (both denitrification and DNRA) within anoxic particle microniches. PMID:20967204
Slattery, Martha L.; Lundgreen, Abbie; Herrick, Jennifer S.; Caan, Bette J.; Potter, John D.; Wolff, Roger K.
2012-01-01
There is considerable biologic plausibility to the hypothesis that genetic variability in pathways involved in insulin signaling and energy homeostasis may modulate dietary risk associated with colorectal cancer. We utilized data from 2 population-based case-control studies of colon (n = 1,574 cases, 1,970 controls) and rectal (n = 791 cases, 999 controls) cancer to evaluate genetic variation in candidate SNPs identified from 9 genes in a candidate pathway: PDK1, RPS6KA1, RPS6KA2, RPS6KB1, RPS6KB2, PTEN, FRAP1 (mTOR), TSC1, TSC2, Akt1, PIK3CA, and PRKAG2 with dietary intake of total energy, carbohydrates, fat, and fiber. We employed SNP, haplotype, and multiple-gene analysis to evaluate associations. PDK1 interacted with dietary fat for both colon and rectal cancer and with dietary carbohydrates for colon cancer. Statistically significant interaction with dietary carbohydrates and rectal cancer was detected by haplotype analysis of PDK1. Evaluation of dietary interactions with multiple genes in this candidate pathway showed several interactions with pairs of genes: Akt1 and PDK1, PDK1 and PTEN, PDK1 and TSC1, and PRKAG2 and PTEN. Analyses show that genetic variation influences risk of colorectal cancer associated with diet and illustrate the importance of evaluating dietary interactions beyond the level of single SNPs or haplotypes when a biologically relevant candidate pathway is examined. PMID:21999454
Yang, Mingxing; Li, Xiumin; Li, Zhibin; Ou, Zhimin; Liu, Ming; Liu, Suhuan; Li, Xuejun; Yang, Shuyu
2013-01-01
DNA microarray analysis is characterized by obtaining a large number of gene variables from a small number of observations. Cluster analysis is widely used to analyze DNA microarray data to make classification and diagnosis of disease. Because there are so many irrelevant and insignificant genes in a dataset, a feature selection approach must be employed in data analysis. The performance of cluster analysis of this high-throughput data depends on whether the feature selection approach chooses the most relevant genes associated with disease classes. Here we proposed a new method using multiple Orthogonal Partial Least Squares-Discriminant Analysis (mOPLS-DA) models and S-plots to select the most relevant genes to conduct three-class disease classification and prediction. We tested our method using Golub's leukemia microarray data. For three classes with subtypes, we proposed hierarchical orthogonal partial least squares-discriminant analysis (OPLS-DA) models and S-plots to select features for two main classes and their subtypes. For three classes in parallel, we employed three OPLS-DA models and S-plots to choose marker genes for each class. The power of feature selection to classify and predict three-class disease was evaluated using cluster analysis. Further, the general performance of our method was tested using four public datasets and compared with those of four other feature selection methods. The results revealed that our method effectively selected the most relevant features for disease classification and prediction, and its performance was better than that of the other methods.
Systems genetic analysis of multivariate response to iron deficiency in mice
Yin, Lina; Unger, Erica L.; Jellen, Leslie C.; Earley, Christopher J.; Allen, Richard P.; Tomaszewicz, Ann; Fleet, James C.
2012-01-01
The aim of this study was to identify genes that influence iron regulation under varying dietary iron availability. Male and female mice from 20+ BXD recombinant inbred strains were fed iron-poor or iron-adequate diets from weaning until 4 mo of age. At death, the spleen, liver, and blood were harvested for the measurement of hemoglobin, hematocrit, total iron binding capacity, transferrin saturation, and liver, spleen and plasma iron concentration. For each measure and diet, we found large, strain-related variability. A principal-components analysis (PCA) was performed on the strain means for the seven parameters under each dietary condition for each sex, followed by quantitative trait loci (QTL) analysis on the factors. Compared with the iron-adequate diet, iron deficiency altered the factor structure of the principal components. QTL analysis, combined with PosMed (a candidate gene searching system) published gene expression data and literature citations, identified seven candidate genes, Ptprd, Mdm1, Picalm, lip1, Tcerg1, Skp2, and Frzb based on PCA factor, diet, and sex. Expression of each of these is cis-regulated, significantly correlated with the corresponding PCA factor, and previously reported to regulate iron, directly or indirectly. We propose that polymorphisms in multiple genes underlie individual differences in iron regulation, especially in response to dietary iron challenge. This research shows that iron management is a highly complex trait, influenced by multiple genes. Systems genetics analysis of iron homeostasis holds promise for developing new methods for prevention and treatment of iron deficiency anemia and related diseases. PMID:22461179
Tsai, Yu-Shuen; Aguan, Kripamoy; Pal, Nikhil R.; Chung, I-Fang
2011-01-01
Informative genes from microarray data can be used to construct prediction models and investigate biological mechanisms. Differentially expressed genes, the main targets of most gene selection methods, can be classified as single- and multiple-class specific signature genes. Here, we present a novel gene selection algorithm based on a Group Marker Index (GMI), which is intuitive, of low computational complexity, and efficient in identifying both types of genes. Most gene selection methods identify only single-class specific signature genes and cannot identify multiple-class specific signature genes easily. Our algorithm can detect de novo certain conditions of multiple-class specificity of a gene and makes use of a novel non-parametric indicator to assess the discrimination ability between classes. Our method is effective even when the sample size is small as well as when the class sizes are significantly different. To compare the effectiveness and robustness we formulate an intuitive template-based method and use four well-known datasets. We demonstrate that our algorithm outperforms the template-based method in difficult cases with unbalanced distribution. Moreover, the multiple-class specific genes are good biomarkers and play important roles in biological pathways. Our literature survey supports that the proposed method identifies unique multiple-class specific marker genes (not reported earlier to be related to cancer) in the Central Nervous System data. It also discovers unique biomarkers indicating the intrinsic difference between subtypes of lung cancer. We also associate the pathway information with the multiple-class specific signature genes and cross-reference to published studies. We find that the identified genes participate in the pathways directly involved in cancer development in leukemia data.
Our method offers a promising way to find genes that may be involved in pathways of multiple diseases, and hence opens up the possibility of using an existing drug on other diseases as well as designing a single drug for multiple diseases. PMID:21909426
NASA Astrophysics Data System (ADS)
Ward, Nancy E.; Pellis, Neal R.; Risin, Diana; Risin, Semyon A.; Liu, Wenbin
2006-09-01
Space flights result in remarkable effects on various physiological systems, including a decline in cellular immune functions. Previous studies have shown that exposure to microgravity, both true and modeled, can cause significant changes in numerous lymphocyte functions. The purpose of this study was to search for microgravity-sensitive genes, and specifically for apoptotic genes influenced by the microgravity environment and other genes related to immune response. The experiments were performed on anti-CD3 and IL-2 activated human T cells. To model microgravity conditions we used the NASA rotating wall vessel bioreactor. Control lymphocytes were cultured in static 1g conditions. To assess gene expression we used DNA microarray chip technology. We have shown that multiple genes (approximately 3-8% of tested genes) respond to microgravity conditions with a 1.5-fold or greater change in expression. There is significant variability in the response; however, a certain reproducible pattern in gene response could be identified. Among the genes showing reproducible changes in expression in modeled microgravity, several genes involved in apoptosis as well as in immune response were identified. These are IL-7 receptor, Granzyme B, Beta-3-endonexin, Apo2 ligand and STAT1. Possible functional consequences of these changes are discussed.
Ho, Pak Leung; Lo, Wai U.; Yeung, Man Kiu; Lin, Chi Ho; Chow, Kin Hung; Ang, Irene; Tong, Amy Hin Yan; Bao, Jessie Yun-Juan; Lok, Si; Lo, Janice Yee Chi
2011-01-01
Background The emergence of plasmid-mediated carbapenemases, such as NDM-1 in Enterobacteriaceae, is a major public health issue. Since they mediate resistance to virtually all β-lactam antibiotics and there is often co-resistance to other antibiotic classes, the therapeutic options for infections caused by these organisms are very limited. Methodology We characterized the first NDM-1 producing E. coli isolate recovered in Hong Kong. The plasmid encoding the metallo-β-lactamase gene was sequenced. Principal Findings The plasmid, pNDM-HK, readily transferred to E. coli J53 at high frequencies. It belongs to the broad host range IncL/M incompatibility group and is 88803 bp in size. Sequence alignment showed that pNDM-HK has a 55 kb backbone which shared 97% homology with pEL60 originating from the plant pathogen, Erwinia amylovora in Lebanon and a 28.9 kb variable region. The plasmid backbone includes the mucAB genes mediating ultraviolet light resistance. The 28.9 kb region has a composite transposon-like structure which includes intact or truncated genes associated with resistance to β-lactams (bla TEM-1, bla NDM-1, Δbla DHA-1), aminoglycosides (aacC2, armA), sulphonamides (sul1) and macrolides (mel, mph2). It also harbors the following mobile elements: IS26, ISCR1, tnpU, tnpAcp2, tnpD, ΔtnpATn1 and insL. Certain blocks within the 28.9 kb variable region had homology with the corresponding sequences in the widely disseminated plasmids, pCTX-M3, pMUR050 and pKP048 originating from bacteria in Poland in 1996, in Spain in 2002 and in China in 2006, respectively. Significance The genetic support of the NDM-1 gene suggests that it has evolved through complex pathways. The association with a broad host range plasmid and multiple mobile genetic elements explains its observed horizontal mobility in multiple bacterial taxa. PMID:21445317
Berenger, Byron M; Berry, Chrystal; Peterson, Trevor; Fach, Patrick; Delannoy, Sabine; Li, Vincent; Tschetter, Lorelee; Nadon, Celine; Honish, Lance; Louie, Marie; Chui, Linda
2015-01-01
A standardised method for determining Escherichia coli O157:H7 strain relatedness using whole genome sequencing or virulence gene profiling is not yet established. We sought to assess the capacity of either high-throughput polymerase chain reaction (PCR) of 49 virulence genes, core-genome single nt variants (SNVs) or k-mer clustering to discriminate between outbreak-associated and sporadic E. coli O157:H7 isolates. Three outbreaks and multiple sporadic isolates from the province of Alberta, Canada were included in the study. Two of the outbreaks occurred concurrently in 2014 and one occurred in 2012. Pulsed-field gel electrophoresis (PFGE) and multilocus variable-number tandem repeat analysis (MLVA) were employed as comparator typing methods. The virulence gene profiles of isolates from the 2012 and 2014 Alberta outbreak events and contemporary sporadic isolates were mostly identical; therefore the set of virulence genes chosen in this study were not discriminatory enough to distinguish between outbreak clusters. Concordant with PFGE and MLVA results, core genome SNV and k-mer phylogenies clustered isolates from the 2012 and 2014 outbreaks as distinct events. k-mer phylogenies demonstrated increased discriminatory power compared with core SNV phylogenies. Prior to the widespread implementation of whole genome sequencing for routine public health use, issues surrounding cost, technical expertise, software standardisation, and data sharing/comparisons must be addressed.
Postoperative Pain and Analgesia: Is There a Genetic Basis to the Opioid Crisis?
Elmallah, Randa K; Ramkumar, Prem N; Khlopas, Anton; Ramkumar, Rathika R; Chughtai, Morad; Sodhi, Nipun; Sultan, Assem A; Mont, Michael A
2018-06-01
Multiple factors have been implicated in determining why certain patients have increased postoperative pain, with the potential to develop chronic pain. The purpose of this study was to: 1) identify and describe genes that affect postoperative pain perception and control; 2) address modifiable risk factors that result in epigenetic altered responses to pain; and 3) characterize differences in pain sensitivity and thresholds between opioid-naïve and opioid-dependent patients. Three electronic databases were used to conduct the literature search: Pubmed, EBSCO host, and SCOPUS. A total of 372 abstracts were reviewed, of which 46 studies were deemed relevant and are included in this review. Specific gene alterations that were shown to affect postoperative pain control included single nucleotide polymorphisms in the mu, kappa, and delta opioid receptors, ion channel genes, cytotoxic T-cells, glutamate receptors and cytokine genes, among others. Alcoholism, obesity, and smoking were all linked with genetic polymorphisms that altered pain sensitivity. Opioid abuse was found to be associated with a poorer response to analgesics postoperatively, as well as a risk for prescription overdose. Although pain perception has multiple complex influences, the greatest variability seen in response to opioids among postoperative patients known to date can be traced to genetic differences in opioid metabolism. Further study is needed to determine the clinical significance of these genetic associations.
Disruptions in Energy Balance: Does Nature overcome Nurture?
Fernández, José R.; Casazza, Krista; Divers, Jasmin; López-Alarcón, Mardya
2008-01-01
Fat accumulation, in general, is the result of a breakdown in the homeostatic regulation of energy balance. Although the specific factors influencing the disruption of energy balance and why these factors affect individuals differently are not completely understood, numerous studies have identified multiple contributors. Environmental components influence food acquisition, eating, and lifestyle habits. However, the variability in obesity-related outcomes observed among individuals placed in similar controlled environments supports the notion that genetic components also wield some control. Multiple genetic regions have been associated with measures related to energy balance; however, the replication of these genetic contributors to energy intake and energy expenditure in humans is relatively limited, perhaps because of the heterogeneity of human populations. Genetic tools such as genetic admixture account for an individual’s genetic background in gene association studies, reducing the confounding effect of population stratification, and promise to be a relevant tool for the identification of genetic contributions to energy balance, particularly among individuals of diverse racial/ethnic backgrounds. Although it has been recognized that genes are expressed according to environmental influences, the search toward the understanding of nature and nurture in obesity will require the detailed study of the effect of genes under diverse physiologic and behavioral environments. It is evident that more research is needed to elucidate the methodological and statistical issues that underlie the interactions between genes and environments in obesity and its related comorbidities. PMID:18096193
Galfalvy, Hanga C; Erraji-Benchekroun, Loubna; Smyrniotopoulos, Peggy; Pavlidis, Paul; Ellis, Steven P; Mann, J John; Sibille, Etienne; Arango, Victoria
2003-01-01
Background Genomic studies of complex tissues pose unique analytical challenges for assessment of data quality, performance of statistical methods used for data extraction, and detection of differentially expressed genes. Ideally, to assess the accuracy of gene expression analysis methods, one needs a set of genes which are known to be differentially expressed in the samples and which can be used as a "gold standard". We introduce the idea of using sex-chromosome genes as an alternative to spiked-in control genes or simulations for assessment of microarray data and analysis methods. Results Expression of sex-chromosome genes was used as a true internal biological control to compare alternate probe-level data extraction algorithms (Microarray Suite 5.0 [MAS5.0], Model Based Expression Index [MBEI] and Robust Multi-array Average [RMA]), to assess microarray data quality and to establish some statistical guidelines for analyzing large-scale gene expression. These approaches were implemented on a large new dataset of human brain samples. RMA-generated gene expression values were markedly less variable and more reliable than MAS5.0 and MBEI-derived values. A statistical technique controlling the false discovery rate was applied to adjust for multiple testing, as an alternative to the Bonferroni method, and showed no evidence of false negative results. Fourteen probesets, representing nine Y- and two X-chromosome linked genes, displayed significant sex differences in brain prefrontal cortex gene expression. Conclusion In this study, we have demonstrated the use of sex genes as true biological internal controls for genomic analysis of complex tissues, and suggested analytical guidelines for testing alternate oligonucleotide microarray data extraction protocols and for adjusting multiple statistical analysis of differentially expressed genes.
Our results also provided evidence for sex differences in gene expression in the brain prefrontal cortex, supporting the notion of a putative direct role of sex-chromosome genes in differentiation and maintenance of sexual dimorphism of the central nervous system. Importantly, these analytical approaches are applicable to all microarray studies that include male and female human or animal subjects. PMID:12962547
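The multiple-testing adjustment described in the abstract above can be sketched as follows. This is a minimal illustration, assuming the false discovery rate control is the Benjamini-Hochberg step-up procedure (the abstract does not name the procedure); the function names and the p-values are hypothetical and are not data from the study.

```python
def bonferroni(p_values, alpha=0.05):
    """Reject H0 where p <= alpha / m (controls the family-wise error rate)."""
    m = len(p_values)
    return [p <= alpha / m for p in p_values]


def benjamini_hochberg(p_values, q=0.05):
    """Reject H0 for the k smallest p-values, where k is the largest rank
    with p_(k) <= (k / m) * q (controls the false discovery rate)."""
    m = len(p_values)
    # Indices of the p-values, ordered from smallest to largest p.
    order = sorted(range(m), key=lambda i: p_values[i])
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * q:
            k_max = rank
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= k_max:
            rejected[idx] = True
    return rejected


# Invented p-values for ten hypothetical probesets.
example_p = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]
n_bonferroni = sum(bonferroni(example_p))
n_bh = sum(benjamini_hochberg(example_p))
```

On these invented values the step-up procedure rejects at least as many hypotheses as Bonferroni, which is the usual motivation for preferring false discovery rate control when many genes are tested at once.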
Galfalvy, Hanga C; Erraji-Benchekroun, Loubna; Smyrniotopoulos, Peggy; Pavlidis, Paul; Ellis, Steven P; Mann, J John; Sibille, Etienne; Arango, Victoria
2003-09-08
Genomic studies of complex tissues pose unique analytical challenges for assessment of data quality, performance of statistical methods used for data extraction, and detection of differentially expressed genes. Ideally, to assess the accuracy of gene expression analysis methods, one needs a set of genes which are known to be differentially expressed in the samples and which can be used as a "gold standard". We introduce the idea of using sex-chromosome genes as an alternative to spiked-in control genes or simulations for assessment of microarray data and analysis methods. Expression of sex-chromosome genes was used as a true internal biological control to compare alternate probe-level data extraction algorithms (Microarray Suite 5.0 [MAS5.0], Model Based Expression Index [MBEI] and Robust Multi-array Average [RMA]), to assess microarray data quality and to establish some statistical guidelines for analyzing large-scale gene expression. These approaches were implemented on a large new dataset of human brain samples. RMA-generated gene expression values were markedly less variable and more reliable than MAS5.0 and MBEI-derived values. A statistical technique controlling the false discovery rate was applied to adjust for multiple testing, as an alternative to the Bonferroni method, and showed no evidence of false negative results. Fourteen probesets, representing nine Y- and two X-chromosome linked genes, displayed significant sex differences in brain prefrontal cortex gene expression. In this study, we have demonstrated the use of sex genes as true biological internal controls for genomic analysis of complex tissues, and suggested analytical guidelines for testing alternate oligonucleotide microarray data extraction protocols and for adjusting multiple statistical analysis of differentially expressed genes.
Our results also provided evidence for sex differences in gene expression in the brain prefrontal cortex, supporting the notion of a putative direct role of sex-chromosome genes in differentiation and maintenance of sexual dimorphism of the central nervous system. Importantly, these analytical approaches are applicable to all microarray studies that include male and female human or animal subjects.
Evidence of oligogenic sex determination in the apple snail Pomacea canaliculata.
Yusa, Yoichi; Kumagai, Natsumi
2018-06-01
A small number of genes may interact to determine sex, but few such examples have been demonstrated in animals, especially through comprehensive mating experiments. The highly invasive apple snail Pomacea canaliculata is gonochoristic and shows a large variation in brood sex ratio, and the involvement of multiple genes has been suggested for this phenomenon. We conducted mating experiments to determine whether their sex determination involves a few or many genes (i.e., oligogenic or polygenic sex determination, respectively). Full-sib females or males that were born from the same parents were mated to an adult of the opposite sex, and the brood sex ratios of the parents and their offspring were investigated. Analysis of a total of 4288 offspring showed that the sex ratios of offspring from the full-sib females were variable but clustered into only a few values. Similar patterns were observed for the full-sib males, although the effect was less clear because fewer offspring were used (n = 747). Notably, the offspring sex ratios of all full-sib females in some families were nearly 0.5 (proportion of males) with little variation. These results indicate that the number of genotypes of the full-sibs, and hence genes involved in sex determination, is small in this snail. Such oligogenic systems may be a major sex-determining system among animals, especially those with variable sex ratios.
2011-01-01
Background The aim of this study was to describe a novel trimethoprim resistance gene cassette, designated dfrA30, within a class 1 integron in a facultatively oligotrophic, multiple antibiotic and human serum resistant test strain, MB45, in a population of oligotrophic bacteria isolated from the river Mahananda; and to test the efficiency of surface bound acetate on zinc oxide quantum dots (ZnO QDs) as bactericidal agent on MB45. Methods Diluted Luria broth/Agar (10^-3) media was used to cultivate the oligotrophic bacteria from water sample. Multiple antibiotic resistant bacteria were selected by employing replica plate method. A rapid assay was performed to determine the sensitivity/resistance of the test strain to human serum. The variable region of the class 1 integron was cloned and sequenced, and the gene coding for antibiotic resistance was expressed in Escherichia coli JM 109. Identity of culture was determined by biochemical phenotyping and 16S rRNA gene sequence analyses. A phylogenetic tree was constructed based on representative trimethoprim resistance-mediating DfrA proteins retrieved from GenBank. Growth kinetic studies for the strain MB45 were performed in presence of varied concentrations of ZnO QDs. Results and conclusions The facultatively oligotrophic strain, MB45, resistant to human serum and ten antibiotics (trimethoprim, cotrimoxazole, ampicillin, gentamycin, netilmicin, tobramycin, chloramphenicol, cefotaxime, kanamycin and streptomycin), has been identified as a new strain of Klebsiella pneumoniae. A novel dfr gene, designated as dfrA30, found integrated in class 1 integron was responsible for resistance to trimethoprim in Klebsiella pneumoniae strain MB45. The growth of wild strain MB45 was 100% arrested at 500 mg/L concentration of ZnO QDs. To our knowledge this is the first report on application of ZnO quantum dots to kill a multiple-antibiotic- and serum-resistant K. pneumoniae strain. PMID:21595893
Rearrangement of Immunoglobulin Genes in Shark Germ Cells
Lee, Susan S.; Fitch, David; Flajnik, Martin F.; Hsu, Ellen
2000-01-01
The variable (V), (diversity [D]), and joining (J) region recombinases (recombination activating genes [RAGs]) can perform like transposases and are thought to have initiated development of the adaptive immune system in early vertebrates by splitting archaic V genes with transposable elements. In cartilaginous fishes, the immunoglobulin (Ig) light chain genes are organized as multiple VJ-constant (C) clusters; some loci are capable of rearrangement while others contain fused VJ. The latter may be key to understanding the evolutionary role of RAG. Are they relics of the archaic genes, or are they results of rearrangement in germ cells? Our data suggest that some fused VJ genes are not only recently rearranged, but also resulted from RAG-like activity involving hairpin intermediates. Expression studies show that these, like some other germline-joined Ig sequences, are expressed at significant levels only early in ontogeny. We suggest that a rejoined Ig gene may not merely be a sequence restricting antibody diversity, but is potentially a novel receptor no longer tied to somatic RAG expression and rearrangement. From the combined data, we arrived at the unexpected conclusion that, in some vertebrates, RAG is still an active force in changing the genome. PMID:10811858
de Haas, Sanne; Delmar, Paul; Bansal, Aruna T; Moisse, Matthieu; Miles, David W; Leighl, Natasha; Escudier, Bernard; Van Cutsem, Eric; Carmeliet, Peter; Scherer, Stefan J; Pallaud, Celine; Lambrechts, Diether
2014-10-01
Despite extensive translational research, no validated biomarkers predictive of bevacizumab treatment outcome have been identified. We performed a meta-analysis of individual patient data from six randomized phase III trials in colorectal, pancreatic, lung, renal, breast, and gastric cancer to explore the potential relationships between 195 common genetic variants in the vascular endothelial growth factor (VEGF) pathway and bevacizumab treatment outcome. The analysis included 1,402 patients (716 bevacizumab-treated and 686 placebo-treated). Twenty variants were associated (P < 0.05) with progression-free survival (PFS) in bevacizumab-treated patients. Of these, 4 variants in EPAS1 survived correction for multiple testing (q < 0.05). Genotype-by-treatment interaction tests revealed that, across these 20 variants, 3 variants in VEGF-C (rs12510099), EPAS1 (rs4953344), and IL8RA (rs2234671) were potentially predictive (P < 0.05), but not resistant to multiple testing (q > 0.05). A weak genotype-by-treatment interaction effect was also observed for rs699946 in VEGF-A, whereas Bayesian genewise analysis revealed that genetic variability in VHL was associated with PFS in the bevacizumab arm (q < 0.05). Variants in VEGF-A, EPAS1, and VHL were located in expression quantitative trait loci derived from lymphoblastoid cell lines, indicating that they affect the expression levels of their respective genes. This large genetic analysis suggests that variants in VEGF-A, EPAS1, IL8RA, VHL, and VEGF-C have potential value in predicting bevacizumab treatment outcome across tumor types. Although these associations did not survive correction for multiple testing in a genotype-by-treatment interaction analysis, they are among the strongest predictive effects reported to date for genetic variants and bevacizumab efficacy.
Epigenetic Variation in the Mu-opioid Receptor Gene in Infants with Neonatal Abstinence Syndrome
Wachman, Elisha M; Hayes, Marie J; Lester, Barry M; Terrin, Norma; Brown, Mark S; Nielsen, David A; Davis, Jonathan M
2014-01-01
Objective Neonatal abstinence syndrome (NAS) from in utero opioid exposure is highly variable with genetic factors appearing to play an important role. Epigenetic changes in cytosine:guanine (CpG) dinucleotide methylation can occur after drug exposure and may help to explain NAS variability. We measured DNA methylation levels in the mu-opioid receptor (OPRM1) promoter in opioid-exposed infants and correlated them with NAS outcomes. Study design DNA samples from cord blood or saliva were analyzed for 86 infants being treated for NAS according to institutional protocol. Methylation levels at 16 OPRM1 CpG sites were determined and correlated with NAS outcome measures, including need for treatment, treatment with ≥2 medications, and length of hospital stay. We adjusted for co-variates and multiple genetic testing. Results Sixty-five percent of infants required treatment for NAS, and 24% required ≥2 medications. Hypermethylation of the OPRM1 promoter was measured at the −10 CpG in treated versus non-treated infants [adjusted difference δ=3.2% (95% CI 0.3–6.0%), p=0.03; NS after multiple testing correction]. There was hypermethylation at the −14 [δ=4.9% (95% CI 1.8–8.1%), p=0.003], −10 [δ=5.0% (95% CI 2.3–7.7%), p=0.0005], and +84 [δ=3.5% (95% CI 0.6–6.4%), p=0.02] CpG sites in infants requiring ≥2 medications, which remained significant for −14 and −10 after multiple testing correction. Conclusions Increased methylation within the OPRM1 promoter is associated with worse NAS outcomes, consistent with gene silencing. PMID:24996986
DOE Office of Scientific and Technical Information (OSTI.GOV)
Terespolsky, D.; Siegel-Bartelt, J.; Weksberg, R.
Simpson-Golabi-Behmel syndrome (SGBS) is an X-linked disorder characterized by pre- and postnatal macrosomia, minor facial anomalies, and variable visceral, skeletal, and neurological abnormalities. Since its first description by Simpson et al., a wide clinical range of cases has been reported. There is great variability in severity, ranging from a mild form associated with long-term survival to an early lethal form with multiple congenital anomalies and severe mental retardation. In 8 reported families, affected individuals died in infancy. Here we present 4 maternally related, male cousins with a severe variant of SGBS. One of these males was aborted therapeutically at 19 weeks of gestation following the detection of multicystic kidneys on ultrasound. The 3 liveborn males were hydropic at birth with a combination of craniofacial anomalies including macrocephaly; apparently low-set, posteriorly angulated ears; hypertelorism; short, broad nose with anteverted nares; large mouth with thin upper vermilion border; prominent philtrum; high-arched or cleft palate; short neck; redundant skin; hypoplastic nails; skeletal defects involving upper and lower limbs; gastrointestinal and genitourinary anomalies. All 3 patients were hypotonic and neurologically impaired from birth. With the exception of a trilobate left lung in one patient, the cardiorespiratory system was structurally normal. All patients died within the first 8 weeks of life of multiple complications including pneumonia and sepsis. Two SGBS kindreds, with moderate expression of the condition, have been mapped to Xq27. It is not known whether severe, familial cases, such as ours, are genetically distinct from and map to another locus. Final resolution of the genetic basis of the phenotypic variability in SGBS must await cloning and mutation analysis of the SGBS gene(s). 21 refs., 4 figs., 1 tab.
Evolving phenotypic networks in silico.
François, Paul
2014-11-01
Evolved gene networks are constrained by natural selection. Their structures and functions are consequently far from being random, as exemplified by the multiple instances of parallel/convergent evolution. One can thus ask if features of actual gene networks can be recovered from evolutionary first principles. I review a method for in silico evolution of small models of gene networks aiming at performing predefined biological functions. I summarize the current implementation of the algorithm, insisting on the construction of a proper "fitness" function. I illustrate the approach on three examples: biochemical adaptation, ligand discrimination and vertebrate segmentation (somitogenesis). While the structure of the evolved networks is variable, dynamics of our evolved networks are usually constrained and present many similar features to actual gene networks, including properties that were not explicitly selected for. In silico evolution can thus be used to predict biological behaviours without a detailed knowledge of the mapping between genotype and phenotype.
Selection of higher order regression models in the analysis of multi-factorial transcription data.
Prazeres da Costa, Olivia; Hoffman, Arthur; Rey, Johannes W; Mansmann, Ulrich; Buch, Thorsten; Tresch, Achim
2014-01-01
Many studies examine gene expression data that has been obtained under the influence of multiple factors, such as genetic background, environmental conditions, or exposure to diseases. The interplay of multiple factors may lead to effect modification and confounding. Higher order linear regression models can account for these effects. We present a new methodology for linear model selection and apply it to microarray data of bone marrow-derived macrophages. This experiment investigates the influence of three variable factors: the genetic background of the mice from which the macrophages were obtained, Yersinia enterocolitica infection (two strains, and a mock control), and treatment/non-treatment with interferon-γ. We set up four different linear regression models in a hierarchical order. We introduce the eruption plot as a new practical tool for model selection complementary to global testing. It visually compares the size and significance of effect estimates between two nested models. Using this methodology we were able to select the most appropriate model by keeping only relevant factors showing additional explanatory power. Application to experimental data allowed us to qualify the interaction of factors as either neutral (no interaction), alleviating (co-occurring effects are weaker than expected from the single effects), or aggravating (stronger than expected). We find a biologically meaningful gene cluster of putative C2TA target genes that appear to be co-regulated with MHC class II genes. We introduced the eruption plot as a tool for visual model comparison to identify relevant higher order interactions in the analysis of expression data obtained under the influence of multiple factors. We conclude that model selection in higher order linear regression models should generally be performed for the analysis of multi-factorial microarray data.
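The three interaction categories named in the abstract above (neutral, alleviating, aggravating) can be illustrated with a small sketch. This is not the authors' method: they select among nested higher order regression models and judge significance, whereas the sketch below simply compares group means for two factors against an additive expectation. The function name, the fixed tolerance `tol`, and the example values are assumptions made for illustration only.

```python
def classify_interaction(y00, y10, y01, y11, tol=0.1):
    """Classify a two-factor interaction from four mean log-expression values.

    y00: neither factor present, y10: factor A only,
    y01: factor B only,          y11: both factors present.
    tol is a hypothetical tolerance standing in for a significance test.
    """
    effect_a = y10 - y00
    effect_b = y01 - y00
    expected = effect_a + effect_b   # additive expectation from the single effects
    observed = y11 - y00             # combined effect actually seen
    interaction = observed - expected
    if abs(interaction) <= tol:
        return "neutral"             # no interaction beyond the tolerance
    if abs(observed) < abs(expected):
        return "alleviating"         # combined effect weaker than expected
    return "aggravating"             # combined effect stronger than expected


# Hypothetical mean log2 expression values for one gene.
label_additive = classify_interaction(0.0, 1.0, 1.0, 2.0)
label_weaker = classify_interaction(0.0, 1.0, 1.0, 1.2)
label_stronger = classify_interaction(0.0, 1.0, 1.0, 3.0)
```

In a regression setting the quantity `interaction` corresponds to the coefficient of the A×B product term in a two-factor linear model, which is the kind of term a nested model comparison decides whether to keep.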
Ge, Tian; Nichols, Thomas E.; Ghosh, Debashis; Mormino, Elizabeth C.
2015-01-01
Measurements derived from neuroimaging data can serve as markers of disease and/or healthy development, are largely heritable, and have been increasingly utilized as (intermediate) phenotypes in genetic association studies. To date, imaging genetic studies have mostly focused on discovering isolated genetic effects, typically ignoring potential interactions with non-genetic variables such as disease risk factors, environmental exposures, and epigenetic markers. However, identifying significant interaction effects is critical for revealing the true relationship between genetic and phenotypic variables, and shedding light on disease mechanisms. In this paper, we present a general kernel machine based method for detecting effects of interaction between multidimensional variable sets. This method can model the joint and epistatic effect of a collection of single nucleotide polymorphisms (SNPs), accommodate multiple factors that potentially moderate genetic influences, and test for nonlinear interactions between sets of variables in a flexible framework. As a demonstration of application, we applied the method to data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) to detect the effects of the interactions between candidate Alzheimer's disease (AD) risk genes and a collection of cardiovascular disease (CVD) risk factors, on hippocampal volume measurements derived from structural brain magnetic resonance imaging (MRI) scans. Our method identified that two genes, CR1 and EPHA1, demonstrate significant interactions with CVD risk factors on hippocampal volume, suggesting that CR1 and EPHA1 may play a role in influencing AD-related neurodegeneration in the presence of CVD risks. PMID:25600633
Fabre, Michel; Koeck, Jean-Louis; Le Flèche, Philippe; Simon, Fabrice; Hervé, Vincent; Vergnaud, Gilles; Pourcel, Christine
2004-01-01
We have analyzed, using complementary molecular methods, the diversity of 43 strains of “Mycobacterium canettii” originating from the Republic of Djibouti, on the Horn of Africa, from 1998 to 2003. Genotyping by multiple-locus variable-number tandem repeat analysis shows that all the strains belong to a single but very distant group when compared to strains of the Mycobacterium tuberculosis complex (MTBC). Thirty-one strains cluster into one large group with little variability and five strains form another group, whereas the other seven are more diverged. In total, 14 genotypes are observed. The DR locus analysis reveals additional variability, some strains being devoid of a direct repeat locus and others having unique spacers. The hsp65 gene polymorphism was investigated by restriction enzyme analysis and sequencing of PCR amplicons. Four new single nucleotide polymorphisms were discovered. One strain was characterized by three nucleotide changes in 441 bp, creating new restriction enzyme polymorphisms. As no sequence variability was found for hsp65 in the whole MTBC, and as a single point mutation separates M. tuberculosis from the closest “M. canettii” strains, this diversity within “M. canettii” subspecies strongly suggests that it is the most probable source species of the MTBC rather than just another branch of the MTBC. PMID:15243089
Willemsen, Marjolein H; Fernandez, Bridget A; Bacino, Carlos A; Gerkes, Erica; de Brouwer, Arjan PM; Pfundt, Rolph; Sikkema-Raddatz, Birgit; Scherer, Stephen W; Marshall, Christian R; Potocki, Lorraine; van Bokhoven, Hans; Kleefstra, Tjitske
2010-01-01
The clinical use of array comparative genomic hybridization in the evaluation of patients with multiple congenital anomalies and/or mental retardation has recently led to the discovery of a number of novel microdeletion and microduplication syndromes. We present four male patients with overlapping molecularly defined de novo microdeletions of 16q24.3. The clinical features observed in these patients include facial dysmorphisms comprising prominent forehead, large ears, smooth philtrum, pointed chin and wide mouth, variable cognitive impairment, autism spectrum disorder, structural anomalies of the brain, seizures and neonatal thrombocytopenia. Although deletions vary in size, the common region of overlap is only 90 kb and comprises two known genes, Ankyrin Repeat Domain 11 (ANKRD11) (MIM 611192) and Zinc Finger 778 (ZNF778), and is located approximately 10 kb distally to Cadherin 15 (CDH15) (MIM 114019). This region is not found as a copy number variation in controls. We propose that these patients represent a novel and distinctive microdeletion syndrome, characterized by autism spectrum disorder, variable cognitive impairment, facial dysmorphisms and brain abnormalities. We suggest that haploinsufficiency of ANKRD11 and/or ZNF778 contributes to this phenotype and speculate that further investigation of non-deletion patients who have features suggestive of this 16q24.3 microdeletion syndrome might uncover other mutations in one or both of these genes. PMID:19920853
Brown, Allan F; Yousef, Gad G; Reid, Robert W; Chebrolu, Kranthi K; Thomas, Aswathy; Krueger, Christopher; Jeffery, Elizabeth; Jackson, Eric; Juvik, John A
2015-07-01
The identification of genetic factors influencing the accumulation of individual glucosinolates in broccoli florets provides novel insight into the regulation of glucosinolate levels in Brassica vegetables and will accelerate the development of vegetables with glucosinolate profiles tailored to promote human health. Quantitative trait loci analysis of glucosinolate (GSL) variability was conducted with a B. oleracea (broccoli) mapping population, saturated with single nucleotide polymorphism markers from a high-density array designed for rapeseed (Brassica napus). In 4 years of analysis, 14 QTLs were associated with the accumulation of aliphatic, indolic, or aromatic GSLs in floret tissue. The accumulation of 3-carbon aliphatic GSLs (2-propenyl and 3-methylsulfinylpropyl) was primarily associated with a single QTL on C05, but common regulation of 4-carbon aliphatic GSLs was not observed. A single locus on C09, associated with up to 40 % of the phenotypic variability of 2-hydroxy-3-butenyl GSL over multiple years, was not associated with the variability of precursor compounds. Similarly, QTLs on C02, C04, and C09 were associated with 4-methylsulfinylbutyl GSL concentration over multiple years but were not significantly associated with downstream compounds. Genome-specific SNP markers were used to identify candidate genes that co-localized to marker intervals and previously sequenced Brassica oleracea BAC clones containing known GSL genes (GSL-ALK, GSL-PRO, and GSL-ELONG) were aligned to the genomic sequence, providing support that at least three of our 14 QTLs likely correspond to previously identified GSL loci. The results demonstrate that previously identified loci do not fully explain GSL variation in broccoli. 
The identification of additional genetic factors influencing the accumulation of GSL in broccoli florets provides novel insight into the regulation of GSL levels in Brassicaceae and will accelerate development of vegetables with modified or enhanced GSL profiles.
A Risk Stratification Model for Lung Cancer Based on Gene Coexpression Network and Deep Learning
2018-01-01
Risk stratification models for lung cancer based on gene expression profiles are of great interest. Instead of previous models based on individual prognostic genes, we aimed to develop a novel system-level risk stratification model for lung adenocarcinoma based on gene coexpression networks. Using multiple microarray datasets, gene coexpression network analysis was performed to identify survival-related networks. A deep learning based risk stratification model was constructed with representative genes of these networks. The model was validated in two test sets. Survival analysis was performed using the output of the model to evaluate whether it could predict patients' survival independent of clinicopathological variables. Five networks were significantly associated with patients' survival. Considering prognostic significance and representativeness, genes of the two survival-related networks were selected for input of the model. The output of the model was significantly associated with patients' survival in the training set and the two test sets (p < 0.00001, p < 0.0001 and p = 0.02 for the training set and test sets 1 and 2, respectively). In multivariate analyses, the model was associated with patients' prognosis independent of other clinicopathological features. Our study presents a new perspective on incorporating gene coexpression networks into the gene expression signature and clinical application of deep learning in genomic data science for prognosis prediction. PMID:29581968
Zheng, Chunfang; Santos Muñoz, Daniella; Albert, Victor A; Sankoff, David
2015-01-01
Following whole genome duplication (WGD), there is a compact distribution of gene similarities within the genome reflecting duplicate pairs of all the genes in the genome. With time, the distribution broadens and loses volume due to variable decay of duplicate gene similarity and to the process of duplicate gene loss. If there are two WGD, the older one becomes so reduced and broad that it merges with the tail of the distributions resulting from more recent events, and it becomes difficult to distinguish them. The goal of this paper is to advance statistical methods of identifying, or at least counting, the WGD events in the lineage of a given genome. For a set of 15 angiosperm genomes, we analyze all 15 × 14 = 210 ordered pairs of target genome versus reference genome, using SynMap to find syntenic blocks. We consider all sets of B ≥ 2 syntenic blocks in the target genome that overlap in the reference genome as evidence of WGD activity in the target, whether it be one event or several. We hypothesize that in fitting an exponential function to the tail of the empirical distribution f (B) of block multiplicities, the size of the exponent will reflect the amount of WGD in the history of the target genome. By amalgamating the results from all reference genomes, a range of values of SynMap parameters, and alternative cutoff points for the tail, we find a clear pattern whereby multiple-WGD core eudicots have the smallest (negative) exponents, followed by core eudicots with only the single "γ" triplication in their history, followed by a non-core eudicot with a single WGD, followed by the monocots, with a basal angiosperm, the WGD-free Amborella having the largest exponent. The hypothesis that the exponent of the fit to the tail of the multiplicity distribution is a signature of the amount of WGD is verified, but there is also a clear complicating factor in the monocot clade, where a history of multiple WGD is not reflected in a small exponent.
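The tail fit described in the abstract above can be illustrated with a minimal sketch. It assumes the exponential form f(B) ≈ C·exp(λB) and estimates λ by ordinary least squares on log f(B); the abstract does not state the fitting procedure actually used, so this is an assumption. The function name tail_exponent, the cutoff parameter and the toy data are hypothetical and are not taken from the paper or from SynMap.

```python
import math
from collections import Counter


def tail_exponent(multiplicities, cutoff=2):
    """Estimate the exponent of an exponential fit to the tail of f(B).

    multiplicities: one block-multiplicity value B per overlapping set.
    cutoff: smallest B included in the tail (hypothetical parameter).
    Returns the least-squares slope of log f(B) against B.
    """
    counts = Counter(b for b in multiplicities if b >= cutoff)
    xs = sorted(counts)
    if len(xs) < 2:
        raise ValueError("need at least two distinct multiplicities in the tail")
    total = sum(counts.values())
    # Empirical relative frequencies on a log scale.
    ys = [math.log(counts[b] / total) for b in xs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Toy data (not from the paper): counts halve with each step in B.
toy = [2] * 800 + [3] * 400 + [4] * 200 + [5] * 100
exponent = tail_exponent(toy)
```

In this toy case the frequencies halve at each step, so the fitted exponent equals -ln 2. A more negative exponent corresponds to a faster-decaying tail.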
Kim, Kyong-Chol; Chun, Hyejin; Lai, ChaoQiang; Parnell, Laurence D; Jang, Yangsoo; Lee, Jongho; Ordovas, Jose M
2015-03-01
Contrary to the traditional belief that obesity acts as a protective factor for bone, recent epidemiologic studies have shown that body fat might be a risk factor for osteoporosis and bone fracture. Accordingly, we evaluated the association between the phenotypes of osteoporosis or vertebral fracture and variants of obesity-related genes, peroxisome proliferator-activated receptor-gamma (PPARG), runt-related transcription factor 2 (RUNX2), leptin receptor (LEPR), and adiponectin (ADIPOQ). In total, 907 postmenopausal healthy women, aged 60-79 years, were included in this study. Bone mineral density (BMD) and biomarkers of bone health and adiposity were measured. We genotyped four single nucleotide polymorphisms (SNPs) from four genes (PPARG, RUNX2, LEPR, ADIPOQ). A general linear model for continuous dependent variables and a logistic regression model for categorical dependent variables were used to analyze the statistical differences among genotype groups. Compared with the TT subjects at rs7771980 in RUNX2, C-carrier (TC + CC) subjects had a lower vertebral fracture risk after adjusting for age, smoking, alcohol, total calorie intake, total energy expenditure, total calcium intake, total fat intake, weight, and body fat. The odds ratio (OR) for vertebral fracture risk was 0.55 (95% confidence interval [CI] 0.32-0.94). After adjusting for multiple variables, the prevalence of vertebral fracture was highest in GG subjects at rs1501299 in ADIPOQ (p = 0.0473). A high calcium intake (>1000 mg/day) contributed to a high BMD in GT + TT subjects at rs1501299 in ADIPOQ (p for interaction = 0.0295). Even if the mechanisms between obesity-related genes and bone health are not fully established, the results of our study revealed the association of certain SNPs from obesity-related genes with BMD or vertebral fracture risk in postmenopausal Korean women.
Adewoye, L O; Worobec, E A
1999-12-01
In response to low extracellular glucose concentration, Pseudomonas aeruginosa induces the expression of the outer membrane carbohydrate-selective OprB porin. The promoter region of the oprB gene was cloned into a lacZ transcriptional fusion vector, and the construct was mobilized into P. aeruginosa OprB-deficient strain, WW100, to evaluate additional environmental factors that influence OprB porin gene expression. Growth temperature, pH of the growth medium, salicylate concentration, and carbohydrate source were found to differentially influence porin expression. This expression pattern was compared to those of whole-cell [14C]glucose uptake under conditions of high osmolarity, ionicity, variable pH, growth temperatures, and carbohydrate source. These studies revealed that the high-affinity glucose transport genes are down-regulated by salicylic acid, differentially regulated by pH and temperature, and are specifically responsive to exogenous glucose induction.
Rebelo, Ana Cristina; Verlengia, Rozangela; Kunz, Vandeni; Tamburus, Nayara; Cerda, Alvaro; Hirata, Rosario; Hirata, Mario; Silva, Ester
2012-01-01
This study examined the association of estrogen receptor alpha gene (ESR1) polymorphisms with cardiorespiratory and metabolic parameters in young women. In total, 354 healthy women were selected for cardiopulmonary exercise testing and short-term heart rate (HR) variability (HRV) evaluation. HRV was assessed using the temporal indices rMSSD (square root of the mean squared differences of successive R–R intervals (RRi) divided by the number of RRi minus one) and SDNN (root mean square of differences from mean RRi, divided by the number of RRi), and using the power spectrum components low frequency (LF), high frequency (HF) and the LF/HF ratio. Blood samples were obtained for serum lipids, estradiol and DNA extraction. ESR1 rs2234693 and rs9340799 polymorphisms were analyzed by PCR and fragment restriction analysis. HR and oxygen uptake (VO2) values did not differ between the ESR1 polymorphisms with respect to autonomic modulation. We did not find a relationship between ESR1 T–A, T–G, C–A and C–G haplotypes and cardiorespiratory and metabolic variables. Multiple linear regression analysis demonstrated that VO2, total cholesterol and triglycerides influence HRV (p < 0.05). The results suggest that ESR1 variants have no effect on cardiorespiratory and metabolic variables, while HRV indices are influenced by aerobic capacity and lipids in healthy women. PMID:23202974
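The two time-domain HRV indices defined in the abstract above can be written out as a short sketch. It follows the verbal definitions given there (rMSSD normalised by the number of RRi minus one, SDNN by the number of RRi); the function names and the example intervals are hypothetical and are not data from the study.

```python
import math


def rmssd(rr_intervals):
    """Root mean square of successive R-R interval differences.

    The sum of squared successive differences is divided by the
    number of intervals minus one, as in the abstract's definition.
    """
    if len(rr_intervals) < 2:
        raise ValueError("need at least two R-R intervals")
    diffs = [b - a for a, b in zip(rr_intervals, rr_intervals[1:])]
    return math.sqrt(sum(d * d for d in diffs) / (len(rr_intervals) - 1))


def sdnn(rr_intervals):
    """Root mean square deviation of R-R intervals from their mean.

    Divides by the number of intervals, as in the abstract's
    definition (some tools divide by N - 1 instead).
    """
    if not rr_intervals:
        raise ValueError("need at least one R-R interval")
    mean_rr = sum(rr_intervals) / len(rr_intervals)
    return math.sqrt(sum((x - mean_rr) ** 2 for x in rr_intervals) / len(rr_intervals))


# Illustrative R-R intervals in milliseconds (made up for this example).
example_rr = [800, 810, 790, 800]
example_rmssd = rmssd(example_rr)
example_sdnn = sdnn(example_rr)
```

rMSSD responds to beat-to-beat changes, whereas SDNN reflects the overall spread of the intervals, so the two indices can differ substantially on the same recording.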
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen
2017-01-01
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing. PMID:29280736
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
Pai, Athma A.; Henriques, Telmo; McCue, Kayla; ...
2017-12-27
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.
Talkowski, Michael E.; Rosenfeld, Jill A.; Blumenthal, Ian; Pillalamarri, Vamsee; Chiang, Colby; Heilbut, Adrian; Ernst, Carl; Hanscom, Carrie; Rossin, Elizabeth; Lindgren, Amelia; Pereira, Shahrin; Ruderfer, Douglas; Kirby, Andrew; Ripke, Stephan; Harris, David; Lee, Ji-Hyun; Ha, Kyungsoo; Kim, Hyung-Goo; Solomon, Benjamin D.; Gropman, Andrea L.; Lucente, Diane; Sims, Katherine; Ohsumi, Toshiro K.; Borowsky, Mark L.; Loranger, Stephanie; Quade, Bradley; Lage, Kasper; Miles, Judith; Wu, Bai-Lin; Shen, Yiping; Neale, Benjamin; Shaffer, Lisa G.; Daly, Mark J.; Morton, Cynthia C.; Gusella, James F.
2012-01-01
Balanced chromosomal abnormalities (BCAs) represent a reservoir of single gene disruptions in neurodevelopmental disorders (NDD). We sequenced BCAs in autism and related NDDs, revealing disruption of 33 loci in four general categories: 1) genes associated with abnormal neurodevelopment (e.g., AUTS2, FOXP1, CDKL5), 2) single gene contributors to microdeletion syndromes (MBD5, SATB2, EHMT1, SNURF-SNRPN), 3) novel risk loci (e.g., CHD8, KIRREL3, ZNF507), and 4) genes associated with later onset psychiatric disorders (e.g., TCF4, ZNF804A, PDE10A, GRIN2B, ANK3). We also discovered profoundly increased burden of copy number variants among 19,556 neurodevelopmental cases compared to 13,991 controls (p = 2.07 × 10⁻⁴⁷) and enrichment of polygenic risk alleles from autism and schizophrenia genome-wide association studies (p = 0.0018 and 0.0009, respectively). Our findings suggest a polygenic risk model of autism incorporating loci of strong effect and indicate that some neurodevelopmental genes are sensitive to perturbation by multiple mutational mechanisms, leading to variable phenotypic outcomes that manifest at different life stages. PMID:22521361
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pai, Athma A.; Henriques, Telmo; McCue, Kayla
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.
Flexible CRISPR library construction using parallel oligonucleotide retrieval
Read, Abigail; Gao, Shaojian; Batchelor, Eric
2017-01-01
CRISPR/Cas9-based gene knockout libraries have emerged as a powerful tool for functional screens. We present here a set of pre-designed human and mouse sgRNA sequences that are optimized for both high on-target potency and low off-target effect. To maximize the chance of target gene inactivation, sgRNAs were curated to target both 5′ constitutive exons and exons that encode conserved protein domains. We describe here a robust and cost-effective method to construct multiple small CRISPR libraries from a single oligo pool generated by array synthesis using parallel oligonucleotide retrieval. Together, these resources provide a convenient means for individual labs to generate customized CRISPR libraries of variable size and coverage depth for functional genomics applications. PMID:28334828
Pharmacogenomics of high-density lipoprotein-cholesterol-raising therapies
Aslibekyan, Stella; Straka, Robert J.; Irvin, Marguerite R.; Claas, Steven A.; Arnett, Donna K.
2017-01-01
High levels of HDL cholesterol (HDL-C) have traditionally been linked to lower incidence of cardiovascular disease, prompting the search for effective and safe HDL-C raising pharmaceutical agents. Although drugs such as niacin and fibrates represent established therapeutic approaches, HDL-C response to such therapies is variable and heritable, suggesting a role for pharmacogenomic determinants. Multiple genetic polymorphisms, located primarily in genes encoding lipoproteins, cholesteryl ester transfer protein, transporters and CYP450 genes have been shown to associate with HDL-C drug response in vitro and in epidemiologic studies. However, few of the pharmacogenomic findings have been independently validated, precluding the development of clinical tools that can be used to predict HDL-C response and leaving the goal of personalized medicine to future efforts. PMID:23469915
2014-01-01
Background: Fanconi anemia (FA) is a rare inherited genetic syndrome with highly variable clinical manifestations. Fifteen genetic subtypes of FA have been identified. Traditional complementation tests for grouping studies have been used generally in FA patients and in stepwise methods to identify the FA type, which can result in incomplete genetic information from FA patients. Methods: We diagnosed five pediatric patients with FA based on clinical manifestations, and we performed exome sequencing of peripheral blood specimens from these patients and their family members. The related sequencing data were then analyzed by bioinformatics, and the FANC gene mutations identified by exome sequencing were confirmed by PCR re-sequencing. Results: Homozygous and compound heterozygous mutations of FANC genes were identified in all of the patients. The FA subtypes of the patients included FANCA, FANCM and FANCD2. Interestingly, four FA patients harbored multiple mutations in at least two FA genes, and some of these mutations have not been previously reported. These patients’ clinical manifestations were vastly different from each other, as were their treatment responses to androstanazol and prednisone. This finding suggests that heterozygous mutation(s) in FA genes could also have diverse biological and/or pathophysiological effects on FA patients or FA gene carriers. Interestingly, we were not able to identify de novo mutations in the genes implicated in DNA repair pathways when the sequencing data of patients were compared with those of their parents. Conclusions: Our results indicate that Chinese FA patients and carriers might have higher and more complex mutation rates in FANC genes than have been conventionally recognized. Testing of the fifteen FANC genes in FA patients and their family members should be a regular clinical practice to determine the optimal care for the individual patient, to counsel the family and to obtain a better understanding of FA pathophysiology.
PMID:24885126
Chang, Lixian; Yuan, Weiping; Zeng, Huimin; Zhou, Quanquan; Wei, Wei; Zhou, Jianfeng; Li, Miaomiao; Wang, Xiaomin; Xu, Mingjiang; Yang, Fengchun; Yang, Yungui; Cheng, Tao; Zhu, Xiaofan
2014-05-15
Fanconi anemia (FA) is a rare inherited genetic syndrome with highly variable clinical manifestations. Fifteen genetic subtypes of FA have been identified. Traditional complementation tests for grouping studies have been used generally in FA patients and in stepwise methods to identify the FA type, which can result in incomplete genetic information from FA patients. We diagnosed five pediatric patients with FA based on clinical manifestations, and we performed exome sequencing of peripheral blood specimens from these patients and their family members. The related sequencing data were then analyzed by bioinformatics, and the FANC gene mutations identified by exome sequencing were confirmed by PCR re-sequencing. Homozygous and compound heterozygous mutations of FANC genes were identified in all of the patients. The FA subtypes of the patients included FANCA, FANCM and FANCD2. Interestingly, four FA patients harbored multiple mutations in at least two FA genes, and some of these mutations have not been previously reported. These patients' clinical manifestations were vastly different from each other, as were their treatment responses to androstanazol and prednisone. This finding suggests that heterozygous mutation(s) in FA genes could also have diverse biological and/or pathophysiological effects on FA patients or FA gene carriers. Interestingly, we were not able to identify de novo mutations in the genes implicated in DNA repair pathways when the sequencing data of patients were compared with those of their parents. Our results indicate that Chinese FA patients and carriers might have higher and more complex mutation rates in FANC genes than have been conventionally recognized. Testing of the fifteen FANC genes in FA patients and their family members should be a regular clinical practice to determine the optimal care for the individual patient, to counsel the family and to obtain a better understanding of FA pathophysiology.
GeneNetFinder2: Improved Inference of Dynamic Gene Regulatory Relations with Multiple Regulators.
Han, Kyungsook; Lee, Jeonghoon
2016-01-01
A gene involved in complex regulatory interactions may have multiple regulators since gene expression in such interactions is often controlled by more than one gene. Another thing that makes gene regulatory interactions complicated is that regulatory interactions are not static, but change over time during the cell cycle. Most research so far has focused on identifying gene regulatory relations between individual genes in a particular stage of the cell cycle. In this study we developed a method for identifying dynamic gene regulations of several types from the time-series gene expression data. The method can find gene regulations with multiple regulators that work in combination or individually as well as those with single regulators. The method has been implemented as the second version of GeneNetFinder (hereafter called GeneNetFinder2) and tested on several gene expression datasets. Experimental results with gene expression data revealed the existence of genes that are not regulated by individual genes but rather by a combination of several genes. Such gene regulatory relations cannot be found by conventional methods. Our method finds such regulatory relations as well as those with multiple, independent regulators or single regulators, and represents gene regulatory relations as a dynamic network in which different gene regulatory relations are shown in different stages of the cell cycle. GeneNetFinder2 is available at http://bclab.inha.ac.kr/GeneNetFinder and will be useful for modeling dynamic gene regulations with multiple regulators.
Devos, Nicolas; Szövényi, Péter; Weston, David J; Rothfels, Carl J; Johnson, Matthew G; Shaw, A Jonathan
2016-07-01
The goal of this research was to investigate whether there has been a whole-genome duplication (WGD) in the ancestry of Sphagnum (peatmoss) or the class Sphagnopsida, and to determine if the timing of any such duplication(s) and patterns of paralog retention could help explain the rapid radiation and current ecological dominance of peatmosses. RNA sequencing (RNA-seq) data were generated for nine taxa in Sphagnopsida (Bryophyta). Analyses of frequency plots for synonymous substitutions per synonymous site (Ks) between paralogous gene pairs and reconciliation of 578 gene trees were conducted to assess evidence of large-scale or genome-wide duplication events in each transcriptome. Both Ks frequency plots and gene tree-based analyses indicate multiple duplication events in the history of the Sphagnopsida. The most recent WGD event predates divergence of Sphagnum from the two other genera of Sphagnopsida. Duplicate retention is highly variable across species, which might be best explained by local adaptation. Our analyses indicate that the last WGD could have been an important factor underlying the diversification of peatmosses and facilitated their rise to ecological dominance in peatlands. The timing of the duplication events and their significance in the evolutionary history of peat mosses are discussed.
Pharmacogenetics of the β2-Adrenergic Receptor Gene
Ortega, Victor E.; Hawkins, Gregory A.; Peters, Stephen P.; Bleecker, Eugene R.
2009-01-01
Asthma is a complex genetic disease with multiple genetic and environmental determinants contributing to the observed variability in response to common anti-asthma therapies. Asthma pharmacogenetic research has focused on multiple candidate genes including the β2-adrenergic receptor gene (ADRβ2) and its effect on individual responses to beta agonist therapy. At present, knowledge about the effects of ADRβ2 variation on therapeutic responses is evolving and should not alter current Asthma Guideline approaches consisting of the use of short acting beta agonists for as-needed symptom based therapy and the use of a regular long-acting beta agonist in combination with inhaled corticosteroid therapy for optimal control of asthma symptoms in those asthmatics who are not controlled on inhaled corticosteroid alone. This approach is based upon studies showing a consistent pharmacogenetic response to regular use of short acting beta agonists (SABA) and less consistent findings in studies evaluating long acting beta agonist (LABA). While emerging pharmacogenetic studies are provocative and should lead to functional approaches, conflicting data with responses to LABA therapy may be caused by factors that include small sample sizes of study populations and differences in experimental design that may limit the conclusions that may be drawn from these clinical trials at the present time. PMID:17996583
Michelacci, Valeria; Orsini, Massimiliano; Knijn, Arnold; Delannoy, Sabine; Fach, Patrick; Caprioli, Alfredo; Morabito, Stefano
2016-01-01
Shiga-toxin producing Escherichia coli (STEC) strains possess a large accessory genome composed of virulence genes existing in multiple allelic variants, which sometimes segregate with specific STEC subpopulations. We analyzed the allelic variability of 91 virulence genes of STEC by Real Time PCR followed by melting curves analysis in 713 E. coli strains including 358 STEC. The 91 genes investigated were located on the locus of enterocyte effacement (LEE), OI-57, and OI-122 pathogenicity islands and displayed a total of 476 alleles in the study population. The combinations of the 91 alleles of each strain were termed allelic signatures and used to perform cluster analyses. We termed such an approach High Resolution Virulence Allelic Profiling (HReVAP) and used it to investigate the phylogeny of STEC of multiple serogroups. The dendrograms obtained identified groups of STEC segregating approximately with the serogroups and allowed the identification of subpopulations within the single groups. The study of the allelic signatures provided further evidence of the coevolution of the LEE and OI-122, reflecting the occurrence of their acquisition through a single event. The HReVAP analysis represents a sensitive tool for studying the evolution of LEE-positive STEC. PMID:26941726
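The idea of an allelic signature used for cluster analysis, as described in the abstract above, can be sketched in a few lines. The sketch assumes each strain is represented by a tuple of allele identifiers (one per gene), that distance is the fraction of genes with differing alleles, and that groups are obtained by single-linkage clustering cut at a fixed threshold. The abstract does not specify the distance or clustering method, so all of these are assumptions, and the strain names, allele codes and function names are hypothetical.

```python
def mismatch_fraction(sig_a, sig_b):
    """Fraction of genes at which two allelic signatures differ."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must cover the same genes")
    return sum(a != b for a, b in zip(sig_a, sig_b)) / len(sig_a)


def single_linkage_groups(signatures, threshold):
    """Group strains linked by chains of pairwise distances <= threshold.

    signatures: dict mapping strain name -> tuple of allele identifiers.
    Returns a sorted list of sorted strain-name lists.
    """
    names = list(signatures)
    parent = {name: name for name in names}

    def find(name):
        # Follow parent links to the group representative (with path halving).
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for i, first in enumerate(names):
        for second in names[i + 1:]:
            if mismatch_fraction(signatures[first], signatures[second]) <= threshold:
                parent[find(first)] = find(second)

    groups = {}
    for name in names:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(members) for members in groups.values())


# Toy signatures over four genes (hypothetical allele codes).
toy_signatures = {
    "strain_A": (1, 1, 2, 3),
    "strain_B": (1, 1, 2, 4),
    "strain_C": (5, 6, 7, 8),
    "strain_D": (5, 6, 7, 9),
}
toy_groups = single_linkage_groups(toy_signatures, threshold=0.25)
```

With a threshold of 0.25, strains that differ at no more than one of the four toy genes fall into the same group. A full dendrogram would require recording the merge distances rather than cutting at a single threshold.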
MEN1, MEN4, and Carney Complex: Pathology and Molecular Genetics
Schernthaner-Reiter, Marie Helene; Trivellin, Giampaolo; Stratakis, Constantine A.
2015-01-01
Pituitary adenomas are a common feature of a subset of endocrine neoplasia syndromes, which have otherwise highly variable disease manifestations. We provide here a review of the clinical features and human molecular genetics of multiple endocrine neoplasia type 1 and 4 (MEN1 and MEN4, respectively) and Carney complex (CNC). MEN1, MEN4 and CNC are hereditary autosomal dominant syndromes that can present with pituitary adenomas. MEN1 is caused by inactivating mutations in the MEN1 gene, whose product menin is involved in multiple intracellular pathways contributing to transcriptional control and cell proliferation. MEN1 clinical features include primary hyperparathyroidism, pancreatic neuroendocrine tumours and prolactinomas and other pituitary adenomas. A subset of patients with pituitary adenomas and other MEN1 features have mutations in the CDKN1B gene; their disease has been called MEN type 4 (MEN4). Inactivating mutations in the type 1α regulatory subunit of protein kinase A (PKA) (the PRKAR1A gene), which lead to dysregulation and activation of the PKA pathway, are the main genetic cause of CNC, which is clinically characterised by primary pigmented nodular adrenocortical disease (PPNAD), spotty skin pigmentation (lentigines), cardiac and other myxomas and acromegaly due to somatotropinomas or somatotrope hyperplasia. PMID:25592387
Picker-Minh, Sylvie; Mignot, Cyril; Doummar, Diane; Hashem, Mais; Faqeih, Eissa; Josset, Patrice; Dubern, Béatrice; Alkuraya, Fowzan S; Kraemer, Nadine; Kaindl, Angela M
2016-04-29
Infantile-onset multisystem neurologic, endocrine, and pancreatic disease (IMNEPD) has been recently linked to biallelic mutation of the peptidyl-tRNA hydrolase 2 gene PTRH2. Two index patients with IMNEPD in the original report had multiple neurological symptoms such as postnatal microcephaly, intellectual disability, developmental delay, sensorineural deafness, cerebellar atrophy, ataxia, and peripheral neuropathy. In addition, distal muscle weakness and abnormalities of thyroid, pancreas, and liver were found. Here, we report five further IMNEPD patients with a different homozygous PTRH2 mutation, broaden the phenotypic spectrum of the disease and differentiate common symptoms and interindividual variability in IMNEPD associated with a unique mutation. We thereby hope to better define IMNEPD and promote recognition and diagnosis of this novel disease entity.
Sritara, C; Thakkinstian, A; Ongphiphadhanakul, B; Chailurkit, L; Chanprasertyothin, S; Ratanachaiwong, W; Vathesatogkit, P; Sritara, P
2014-05-01
Using mediation analysis, a causal relationship between the AHSG gene and bone mineral density (BMD) through fetuin-A and body mass index (BMI) mediators was suggested. Fetuin-A, a multifunctional protein of hepatic origin, is associated with bone mineral density. It is unclear if this association is causal. This study aimed at clarification of this issue. A cross-sectional study was conducted among 1,741 healthy workers from the Electricity Generating Authority of Thailand (EGAT) cohort. The alpha-2-Heremans-Schmid glycoprotein (AHSG) rs2248690 gene was genotyped. Three mediation models were constructed using seemingly unrelated regression analysis. First, the ln[fetuin-A] group was regressed on the AHSG gene. Second, the BMI group was regressed on the AHSG gene and the ln[fetuin-A] group. Finally, the BMD model was constructed by fitting BMD on two mediators (ln[fetuin-A] and BMI) and the independent AHSG variable. All three analyses were adjusted for confounders. The prevalence of the minor T allele for the AHSG locus was 15.2%. The AHSG locus was highly related to serum fetuin-A levels (P < 0.001). Multiple mediation analyses showed that AHSG was significantly associated with BMD through the ln[fetuin-A] and BMI pathway, with beta coefficients of 0.0060 (95% CI 0.0038, 0.0083) and 0.0030 (95% CI 0.0020, 0.0045) at the total hip and lumbar spine, respectively. About 27.3 and 26.0% of total genetic effects on hip and spine BMD, respectively, were explained by the mediation effects of fetuin-A and BMI. Our study suggested evidence of a causal relationship between the AHSG gene and BMD through fetuin-A and BMI mediators.
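The mediation figures reported above can be cross-checked with simple arithmetic. The sketch below assumes that the quoted beta coefficients are the indirect (mediated) effects and that the quoted percentages are the ratio of indirect to total effect; under those assumptions the implied total effect is the indirect effect divided by the proportion mediated. The function name is hypothetical, and because the published values are rounded the results are only approximate.

```python
def implied_total_effect(indirect_effect, proportion_mediated):
    """Total effect implied by an indirect effect and the share it explains."""
    if not 0 < proportion_mediated <= 1:
        raise ValueError("proportion_mediated must be in (0, 1]")
    return indirect_effect / proportion_mediated


# Values quoted in the abstract: (beta coefficient, share of total genetic effect).
reported = {
    "total_hip": (0.0060, 0.273),
    "lumbar_spine": (0.0030, 0.260),
}
implied = {
    site: implied_total_effect(beta, share)
    for site, (beta, share) in reported.items()
}
```

Under these assumptions the implied total genetic effects are roughly 0.022 at the total hip and 0.012 at the lumbar spine.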
Keyhaninejad, Neda; Curry, Jeanne; Romero, Joslynn; O'Connell, Mary A
2014-02-01
Accumulation of capsaicinoids in the placental tissue of ripening chile (Capsicum spp.) fruit follows the coordinated expression of multiple biosynthetic enzymes producing the substrates for capsaicin synthase. Transcription factors are likely agents to regulate expression of these biosynthetic genes. Placental RNAs from habanero fruit (Capsicum chinense) were screened for expression of candidate transcription factors; with two candidate genes identified, both in the ERF family of transcription factors. Characterization of these transcription factors, Erf and Jerf, in nine chile cultivars with distinct capsaicinoid contents demonstrated a correlation of expression with pungency. Amino acid variants were observed in both ERF and JERF from different chile cultivars; none of these changes involved the DNA binding domains. Little to no transcription of Erf was detected in non-pungent Capsicum annuum or C. chinense mutants. This correlation was characterized at an individual fruit level in a set of jalapeño (C. annuum) lines again with distinct and variable capsaicinoid contents. Both Erf and Jerf are expressed early in fruit development, 16-20 days post-anthesis, at times prior to the accumulation of capsaicinoids in the placental tissues. These data support the hypothesis that these two members of the complex ERF family participate in regulation of the pungency phenotype in chile. Copyright © 2013. Published by Elsevier Ireland Ltd.
Keyhaninejad, Neda; Curry, Jeanne; Romero, Joslynn; O’Connell, Mary A.
2013-01-01
Accumulation of capsaicinoids in the placental tissue of ripening chile (Capsicum spp.) fruit follows the coordinated expression of multiple biosynthetic enzymes producing the substrates for capsaicin synthase. Transcription factors are likely agents to regulate expression of these biosynthetic genes. Placental RNAs from habanero fruit (C. chinense) were screened for expression of candidate transcription factors; with two candidate genes identified, both in the ERF family of transcription factors. Characterization of these transcription factors, Erf and Jerf, in nine chile cultivars with distinct capsaicinoid contents demonstrated a correlation of expression with pungency. Amino acid variants were observed in both ERF and JERF from different chile cultivars; none of these changes involved the DNA binding domains. Little to no transcription of Erf was detected in non-pungent C. annuum or C. chinense mutants. This correlation was characterized at an individual fruit level in a set of jalapeño (C. annuum) lines again with distinct and variable capsaicinoid contents. Both Erf and Jerf are expressed early in fruit development, 16–20 days post-anthesis, at times prior to the accumulation of capsaicinoids in the placental tissues. These data support the hypothesis that these two members of the complex ERF family participate in regulation of the pungency phenotype in chile. PMID:24388515
Muscatelli, F; Abrous, D N; Massacrier, A; Boccaccio, I; Le Moal, M; Cau, P; Cremer, H
2000-12-12
Prader-Willi syndrome (PWS) is a complex neurogenetic disorder with considerable clinical variability that is thought in large part to be the result of a hypothalamic defect. PWS results from the absence of paternal expression of imprinted genes localized in the 15q11-q13 region; however, none of the characterized genes has so far been shown to be involved in the etiology of PWS. Here, we provide a detailed investigation of a mouse model deficient for Necdin. Linked to the mutation, a neonatal lethality of variable penetrance is observed. Viable Necdin mutants show a reduction in both oxytocin-producing and luteinizing hormone-releasing hormone (LHRH)-producing neurons in hypothalamus. This represents the first evidence of a hypothalamic deficiency in a mouse model of PWS. Necdin-deficient mice also display increased skin scraping activity in the open field test and improved spatial learning and memory in the Morris water maze. The latter features are reminiscent of the skin picking and improved spatial memory that are characteristics of the PWS phenotype. These striking parallels in hypothalamic structure, emotional and cognitive-related behaviors strongly suggest that NECDIN is responsible for at least a subset of the multiple clinical manifestations of PWS.
Computational Tools and Algorithms for Designing Customized Synthetic Genes
Gould, Nathan; Hendy, Oliver; Papamichail, Dimitris
2014-01-01
Advances in DNA synthesis have enabled the construction of artificial genes, gene circuits, and genomes of bacterial scale. Freedom in de novo design of synthetic constructs provides significant power in studying the impact of mutations in sequence features, and verifying hypotheses on the functional information that is encoded in nucleic and amino acids. To aid this goal, a large number of software tools of variable sophistication have been implemented, enabling the design of synthetic genes for sequence optimization based on rationally defined properties. The first generation of tools dealt predominantly with singular objectives such as codon usage optimization and unique restriction site incorporation. Recent years have seen the emergence of sequence design tools that aim to evolve sequences toward combinations of objectives. The design of optimal protein-coding sequences adhering to multiple objectives is computationally hard, and most tools rely on heuristics to sample the vast sequence design space. In this review, we study some of the algorithmic issues behind gene optimization and the approaches that different tools have adopted to redesign genes and optimize desired coding features. We utilize test cases to demonstrate the efficiency of each approach, as well as identify their strengths and limitations. PMID:25340050
The Core and Accessory Genomes of Burkholderia pseudomallei: Implications for Human Melioidosis
Lin, Chi Ho; Karuturi, R. Krishna M.; Wuthiekanun, Vanaporn; Tuanyok, Apichai; Chua, Hui Hoon; Ong, Catherine; Paramalingam, Sivalingam Suppiah; Tan, Gladys; Tang, Lynn; Lau, Gary; Ooi, Eng Eong; Woods, Donald; Feil, Edward; Peacock, Sharon J.; Tan, Patrick
2008-01-01
Natural isolates of Burkholderia pseudomallei (Bp), the causative agent of melioidosis, can exhibit significant ecological flexibility that is likely reflective of a dynamic genome. Using whole-genome Bp microarrays, we examined patterns of gene presence and absence across 94 South East Asian strains isolated from a variety of clinical, environmental, or animal sources. 86% of the Bp K96243 reference genome was common to all the strains representing the Bp “core genome”, comprising genes largely involved in essential functions (eg amino acid metabolism, protein translation). In contrast, 14% of the K96243 genome was variably present across the isolates. This Bp accessory genome encompassed multiple genomic islands (GIs), paralogous genes, and insertions/deletions, including three distinct lipopolysaccharide (LPS)-related gene clusters. Strikingly, strains recovered from cases of human melioidosis clustered on a tree based on accessory gene content, and were significantly more likely to harbor certain GIs compared to animal and environmental isolates. Consistent with the inference that the GIs may contribute to pathogenesis, experimental mutation of BPSS2053, a GI gene, reduced microbial adherence to human epithelial cells. Our results suggest that the Bp accessory genome is likely to play an important role in microbial adaptation and virulence. PMID:18927621
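The core/accessory split described above amounts to a simple set operation on a gene presence/absence table. The following sketch assumes a gene counts as "core" only when it is detected in every strain; the gene and strain names are hypothetical, and the thresholds the authors applied to their microarray signals are not given in the abstract.

```python
def split_core_accessory(presence, strains):
    """Split genes into core (present in every strain) and accessory sets.

    `presence` maps a gene name to the set of strains in which it was detected.
    `strains` is the full set of strains that were surveyed.
    """
    core = {gene for gene, found in presence.items() if strains <= found}
    accessory = set(presence) - core
    return core, accessory


# Hypothetical presence/absence calls for four genes across three strains.
strains = {"s1", "s2", "s3"}
presence = {
    "gene_1": {"s1", "s2", "s3"},
    "gene_2": {"s1", "s3"},
    "gene_3": {"s1", "s2", "s3"},
    "gene_4": {"s2"},
}
core, accessory = split_core_accessory(presence, strains)
core_fraction = len(core) / len(presence)
```

In the study itself the analogous fractions were 86% core and 14% variably present, measured against the K96243 reference genome.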
Deconstructing transcriptional heterogeneity in pluripotent stem cells
Shalek, Alex K.; Satija, Rahul; DaleyKeyser, AJay; Li, Hu; Zhang, Jin; Pardee, Keith; Gennert, David; Trombetta, John J.; Ferrante, Thomas C.; Regev, Aviv; Daley, George Q.; Collins, James J.
2014-01-01
SUMMARY Pluripotent stem cells (PSCs) are capable of dynamic interconversion between distinct substates, but the regulatory circuits specifying these states and enabling transitions between them are not well understood. We set out to characterize transcriptional heterogeneity in PSCs by single-cell expression profiling under different chemical and genetic perturbations. Signaling factors and developmental regulators show highly variable expression, with expression states for some variable genes heritable through multiple cell divisions. Expression variability and population heterogeneity can be influenced by perturbation of signaling pathways and chromatin regulators. Strikingly, either removal of mature miRNAs or pharmacologic blockage of signaling pathways drives PSCs into a low-noise ground state characterized by a reconfigured pluripotency network, enhanced self-renewal, and a distinct chromatin state, an effect mediated by opposing miRNA families acting on the c-myc / Lin28 / let-7 axis. These data illuminate the nature of transcriptional heterogeneity in PSCs. PMID:25471879
Jung, Seung H.; Brownlow, Milene L.; Pellegrini, Matteo; Jankord, Ryan
2017-01-01
Individual susceptibility determines the magnitude of stress effects on cognitive function. The hippocampus, a brain region of memory consolidation, is vulnerable to stressful environments, and the impact of stress on hippocampus may determine individual variability in cognitive performance. Therefore, the purpose of this study was to define the relationship between the divergence in spatial memory performance under chronically unpredictable stress and an associated transcriptomic alteration in hippocampus, the brain region of spatial memory consolidation. Multiple strains of BXD (B6 × D2) recombinant inbred mice went through a 4-week chronic variable stress (CVS) paradigm, and the Morris water maze (MWM) test was conducted during the last week of CVS to assess hippocampal-dependent spatial memory performance and to group animals into low and high performing groups based on their cognitive performance. Using hippocampal whole transcriptome RNA-sequencing data, differential expression, PANTHER analysis, WGCNA, Ingenuity's upstream regulator analysis in the Ingenuity Pathway Analysis® and phenotype association analysis were conducted. Our data identified multiple genes and pathways that were significantly associated with chronic stress-associated cognitive modification and the divergence in hippocampal dependent memory performance under chronic stress. Biological pathways associated with memory performance following chronic stress included metabolism, neurotransmitter and receptor regulation, immune response and cellular process. Ingenuity's upstream regulator analysis identified 247 upstream transcriptional regulators from 16 different molecule types. Transcripts predictive of cognitive performance under high stress included genes that are associated with a high occurrence of Alzheimer's and cognitive impairments (e.g., Ncl, Eno1, Scn9a, Slc19a3, Ncstn, Fos, Eif4h, Copa, etc.).
Our results show that the variable effects of chronic stress on the hippocampal transcriptome are related to the ability to complete the MWM task and that the modulations of specific pathways are indicative of hippocampal dependent memory performance. Thus, the divergence in spatial memory performance following chronic stress is related to the unique pattern of gene expression within the hippocampus. PMID:28912681
Dixit, Shalabh; Kumar Biswal, Akshaya; Min, Aye; Henry, Amelia; Oane, Rowena H.; Raorane, Manish L.; Longkumer, Toshisangba; Pabuayon, Isaiah M.; Mutte, Sumanth K.; Vardarajan, Adithi R.; Miro, Berta; Govindan, Ganesan; Albano-Enriquez, Blesilda; Pueffeld, Mandy; Sreenivasulu, Nese; Slamet-Loedin, Inez; Sundarvelpandian, Kalaipandian; Tsai, Yuan-Ching; Raghuvanshi, Saurabh; Hsing, Yue-Ie C.; Kumar, Arvind; Kohli, Ajay
2015-01-01
Sub-QTLs and multiple intra-QTL genes are hypothesized to underpin large-effect QTLs. Known QTLs over gene families, biosynthetic pathways or certain traits represent functional gene-clusters of genes of the same gene ontology (GO). Gene-clusters containing genes of different GO have not been elaborated, except in silico as coexpressed genes within QTLs. Here we demonstrate the requirement of multiple intra-QTL genes for the full impact of QTL qDTY12.1 on rice yield under drought. Multiple lines of evidence are presented for the need of the transcription factor ‘no apical meristem’ (OsNAM12.1) and its co-localized target genes of separate GO categories for qDTY12.1 function, raising a regulon-like model of genetic architecture. The molecular underpinnings of qDTY12.1 support its effectiveness in further improving a drought tolerant genotype and for its validity in multiple genotypes/ecosystems/environments. Resolving the combinatorial value of OsNAM12.1 with individual intra-QTL genes notwithstanding, identification and analyses of qDTY12.1 has fast-tracked rice improvement towards food security. PMID:26507552
Multiple homologous genes knockout (KO) by CRISPR/Cas9 system in rabbit.
Liu, Huan; Sui, Tingting; Liu, Di; Liu, Tingjun; Chen, Mao; Deng, Jichao; Xu, Yuanyuan; Li, Zhanjun
2018-03-20
The CRISPR/Cas9 system is a highly efficient and convenient genome editing tool, which has been widely used for single or multiple gene mutation in a variety of organisms. Disruption of multiple homologous genes, which have similar DNA sequences and gene function, is required for the study of the desired phenotype. In this study, to test whether the CRISPR/Cas9 system works on the mutation of multiple homologous genes, a single guide RNA (sgRNA) targeting three fucosyltransferases encoding genes (FUT1, FUT2 and SEC1) was designed. As expected, triple gene mutation of FUT1, FUT2 and SEC1 could be achieved simultaneously via a sgRNA mediated CRISPR/Cas9 system. Besides, significantly reduced serum fucosyltransferases enzymes activity was also determined in those triple gene mutation rabbits. Thus, we provide the first evidence that multiple homologous genes knockout (KO) could be achieved efficiently by a sgRNA mediated CRISPR/Cas9 system in mammals, which could facilitate the genotype to phenotype studies of homologous genes in future. Copyright © 2018 Elsevier B.V. All rights reserved.
Male-Mediated Gene Flow in Patrilocal Primates
Schubert, Grit; Stoneking, Colin J.; Arandjelovic, Mimi; Boesch, Christophe; Eckhardt, Nadin; Hohmann, Gottfried; Langergraber, Kevin; Lukas, Dieter; Vigilant, Linda
2011-01-01
Background Many group–living species display strong sex biases in dispersal tendencies. However, gene flow mediated by apparently philopatric sex may still occur and potentially alters population structure. In our closest living evolutionary relatives, dispersal of adult males seems to be precluded by high levels of territoriality between males of different groups in chimpanzees, and has only been observed once in bonobos. Still, male–mediated gene flow might occur through rare events such as extra–group matings leading to extra–group paternity (EGP) and female secondary dispersal with offspring, but the extent of this gene flow has not yet been assessed. Methodology/Principal Findings Using autosomal microsatellite genotyping of samples from multiple groups of wild western chimpanzees (Pan troglodytes verus) and bonobos (Pan paniscus), we found low genetic differentiation among groups for both males and females. Characterization of Y–chromosome microsatellites revealed levels of genetic differentiation between groups in bonobos almost as high as those reported previously in eastern chimpanzees, but lower levels of differentiation in western chimpanzees. By using simulations to evaluate the patterns of Y–chromosomal variation expected under realistic assumptions of group size, mutation rate and reproductive skew, we demonstrate that the observed presence of multiple and highly divergent Y–haplotypes within western chimpanzee and bonobo groups is best explained by successful male–mediated gene flow. Conclusions/Significance The similarity of inferred rates of male–mediated gene flow and published rates of EGP in western chimpanzees suggests this is the most likely mechanism of male–mediated gene flow in this subspecies. In bonobos more data are needed to refine the estimated rate of gene flow. Our findings suggest that dispersal patterns in these closely related species, and particularly for the chimpanzee subspecies, are more variable than previously appreciated. 
This is consistent with growing recognition of extensive behavioral variation in chimpanzees and bonobos. PMID:21747938
Thompson, Bryony A.; Greenblatt, Marc S.; Vallee, Maxime P.; Herkert, Johanna C.; Tessereau, Chloe; Young, Erin L.; Adzhubey, Ivan A.; Li, Biao; Bell, Russell; Feng, Bingjian; Mooney, Sean D.; Radivojac, Predrag; Sunyaev, Shamil R.; Frebourg, Thierry; Hofstra, Robert M.W.; Sijmons, Rolf H.; Boucher, Ken; Thomas, Alun; Goldgar, David E.; Spurdle, Amanda B.; Tavtigian, Sean V.
2015-01-01
Classification of rare missense substitutions observed during genetic testing for patient management is a considerable problem in clinical genetics. The Bayesian integrated evaluation of unclassified variants is a solution originally developed for BRCA1/2. Here, we take a step toward an analogous system for the mismatch repair (MMR) genes (MLH1, MSH2, MSH6, and PMS2) that confer colon cancer susceptibility in Lynch syndrome by calibrating in silico tools to estimate prior probabilities of pathogenicity for MMR gene missense substitutions. A qualitative five-class classification system was developed and applied to 143 MMR missense variants. This identified 74 missense substitutions suitable for calibration. These substitutions were scored using six different in silico tools (Align-Grantham Variation Grantham Deviation, multivariate analysis of protein polymorphisms [MAPP], Mut-Pred, PolyPhen-2.1, Sorting Intolerant From Tolerant, and Xvar), using curated MMR multiple sequence alignments where possible. The output from each tool was calibrated by regression against the classifications of the 74 missense substitutions; these calibrated outputs are interpretable as prior probabilities of pathogenicity. MAPP was the most accurate tool and MAPP + PolyPhen-2.1 provided the best-combined model (R2 = 0.62 and area under receiver operating characteristic = 0.93). The MAPP + PolyPhen-2.1 output is sufficiently predictive to feed as a continuous variable into the quantitative Bayesian integrated evaluation for clinical classification of MMR gene missense substitutions. PMID:22949387
Biasogram: Visualization of Confounding Technical Bias in Gene Expression Data
Krzystanek, Marcin; Szallasi, Zoltan; Eklund, Aron C.
2013-01-01
Gene expression profiles of clinical cohorts can be used to identify genes that are correlated with a clinical variable of interest such as patient outcome or response to a particular drug. However, expression measurements are susceptible to technical bias caused by variation in extraneous factors such as RNA quality and array hybridization conditions. If such technical bias is correlated with the clinical variable of interest, the likelihood of identifying false positive genes is increased. Here we describe a method to visualize an expression matrix as a projection of all genes onto a plane defined by a clinical variable and a technical nuisance variable. The resulting plot indicates the extent to which each gene is correlated with the clinical variable or the technical variable. We demonstrate this method by applying it to three clinical trial microarray data sets, one of which identified genes that may have been driven by a confounding technical variable. This approach can be used as a quality control step to identify data sets that are likely to yield false positive results. PMID:23613961
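The idea of placing every gene on a plane spanned by a clinical variable and a technical variable can be sketched as follows. This is not the published Biasogram implementation: it simply assumes that each gene's two coordinates are its Pearson correlations with the clinical variable and with the technical variable. The function names and the toy expression values are hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


def bias_coordinates(expression, clinical, technical):
    """Map each gene to (correlation with clinical, correlation with technical)."""
    return {
        gene: (pearson(values, clinical), pearson(values, technical))
        for gene, values in expression.items()
    }


# Hypothetical data for four samples.
clinical = [0, 0, 1, 1]    # e.g. a binary outcome
technical = [1, 2, 3, 4]   # e.g. an RNA quality score
expression = {
    "gene_x": [1, 1, 2, 2],
    "gene_y": [2, 4, 6, 8],
}
coords = bias_coordinates(expression, clinical, technical)
```

Plotting these pairs as a scatter would show genes that track the technical variable more closely than the clinical one, which is the kind of pattern the abstract describes as a warning sign for false positives.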
Short and long-term genome stability analysis of prokaryotic genomes.
Brilli, Matteo; Liò, Pietro; Lacroix, Vincent; Sagot, Marie-France
2013-05-08
Gene organization dynamics is actively studied because it provides useful evolutionary information, makes functional annotation easier and often enables the characterization of pathogens. There is therefore a strong interest in understanding the variability of this trait and the possible correlations with life-style. Two kinds of events affect genome organization: on one hand translocations and recombinations change the relative position of genes shared by two genomes (i.e. the backbone gene order); on the other, insertions and deletions leave the backbone gene order unchanged but they alter the gene neighborhoods by breaking the syntenic regions. A complete picture of genome organization evolution therefore requires accounting for both kinds of events. We developed an approach where we model chromosomes as graphs on which we compute different stability estimators; we consider genome rearrangements as well as the effect of gene insertions and deletions. In the first part of the paper, we fit a measure of backbone gene order conservation (hereinafter called backbone stability) against phylogenetic distance for over 3000 genome comparisons, improving existing models for the divergence in time of backbone stability. Intra- and inter-specific comparisons were treated separately to focus on different time-scales. The use of multiple genomes of the same species allowed us to identify genomes whose gene order diverges from that of their conspecifics. The inter-species analysis indicates that pathogens are more often unstable than non-pathogens. In the second part of the text, we show that in pathogens, gene content dynamics (insertions and deletions) have a much more dramatic effect on genome organization stability than backbone rearrangements. In this work, we studied genome organization divergence taking into account the contribution of both genome order rearrangements and genome content dynamics.
By studying species with multiple sequenced genomes available, we were able to explore genome organization stability at different time-scales and to find significant differences between pathogen and non-pathogen species. The output of our framework also allows the identification of conserved gene clusters and/or partial occurrences thereof, making it possible to explore how gene clusters assembled during evolution.
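The notion of backbone gene order conservation can be made concrete with a small sketch. The abstract does not define the stability estimators, so the measure below is an assumption: each chromosome is reduced to the genes shared by both genomes (the backbone), neighbouring genes are treated as graph edges, and the score is the Jaccard index of the two edge sets. Gene orientation is ignored, and all names are hypothetical.

```python
def adjacencies(gene_order, circular=True):
    """Return the set of unordered neighbouring gene pairs in a chromosome."""
    pairs = set()
    n = len(gene_order)
    last = n if circular else n - 1
    for i in range(last):
        a, b = gene_order[i], gene_order[(i + 1) % n]
        if a != b:
            pairs.add(frozenset((a, b)))
    return pairs


def backbone_conservation(order_a, order_b, circular=True):
    """Jaccard index of backbone adjacencies shared by two gene orders."""
    shared = set(order_a) & set(order_b)
    # Dropping unshared genes removes the effect of insertions and deletions.
    backbone_a = [g for g in order_a if g in shared]
    backbone_b = [g for g in order_b if g in shared]
    adj_a = adjacencies(backbone_a, circular)
    adj_b = adjacencies(backbone_b, circular)
    union = adj_a | adj_b
    if not union:
        raise ValueError("genomes share too few genes to compare")
    return len(adj_a & adj_b) / len(union)


# Hypothetical circular genomes: genome_2 has an inserted gene X and swaps C and D.
genome_1 = ["A", "B", "C", "D", "E"]
genome_2 = ["A", "B", "X", "D", "C", "E"]
score = backbone_conservation(genome_1, genome_2)
identical = backbone_conservation(genome_1, genome_1)
```

Here three of the seven distinct backbone adjacencies are shared, so the score is 3/7, while a genome compared with itself scores 1.0.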
Ye, Fei; Lan, Xu-E; Zhu, Wen-Bo; You, Ping
2016-05-09
Insect mitochondrial genomes (mitogenomes) contain a conserved set of 37 genes for an extensive diversity of lineages. Previously reported dictyopteran mitogenomes share this conserved mitochondrial gene arrangement, although surprisingly little is known about the mitogenome of Mantodea. We sequenced eight mantodean mitogenomes including the first representatives of two families: Hymenopodidae and Liturgusidae. Only two of these genomes retain the typical insect gene arrangement. In three Liturgusidae species, the trnM genes have translocated. Four species of mantis (Creobroter gemmata, Mantis religiosa, Statilia sp., and Theopompa sp.-HN) have multiple identical tandem duplication of trnR, and Statilia sp. additionally includes five extra duplicate trnW. These extra trnR and trnW in Statilia sp. are erratically arranged and form another novel gene order. Interestingly, the extra trnW is converted from trnR by the process of point mutation at anticodon, which is the first case of tRNA reassignment for an insect. Furthermore, no significant differences were observed amongst mantodean mitogenomes with variable copies of tRNA according to comparative analysis of codon usage. Combined with phylogenetic analysis, the characteristics of tRNA only possess limited phylogenetic information in this research. Nevertheless, these features of gene rearrangement, duplication, and reassignment provide valuable information toward understanding mitogenome evolution in insects.
Ye, Fei; Lan, Xu-e; Zhu, Wen-bo; You, Ping
2016-01-01
Insect mitochondrial genomes (mitogenomes) contain a conserved set of 37 genes for an extensive diversity of lineages. Previously reported dictyopteran mitogenomes share this conserved mitochondrial gene arrangement, although surprisingly little is known about the mitogenome of Mantodea. We sequenced eight mantodean mitogenomes including the first representatives of two families: Hymenopodidae and Liturgusidae. Only two of these genomes retain the typical insect gene arrangement. In three Liturgusidae species, the trnM genes have translocated. Four species of mantis (Creobroter gemmata, Mantis religiosa, Statilia sp., and Theopompa sp.-HN) have multiple identical tandem duplication of trnR, and Statilia sp. additionally includes five extra duplicate trnW. These extra trnR and trnW in Statilia sp. are erratically arranged and form another novel gene order. Interestingly, the extra trnW is converted from trnR by the process of point mutation at anticodon, which is the first case of tRNA reassignment for an insect. Furthermore, no significant differences were observed amongst mantodean mitogenomes with variable copies of tRNA according to comparative analysis of codon usage. Combined with phylogenetic analysis, the characteristics of tRNA only possess limited phylogenetic information in this research. Nevertheless, these features of gene rearrangement, duplication, and reassignment provide valuable information toward understanding mitogenome evolution in insects. PMID:27157299
Polymorphisms of vitamin K-related genes (EPHX1 and VKORC1L1) and stable warfarin doses.
Chung, Jee-Eun; Lee, Kyung Eun; Chang, Byung Chul; Gwak, Hye Sun
2018-01-30
The aim of this study was to investigate the possible effects of EPHX1 and VKORC1L1 polymorphisms on variability of responses to warfarin. Sixteen single nucleotide polymorphisms (SNPs) in 201 patients with stable warfarin doses were analyzed including genes of VKORC1, CYP2C9, CYP4F2, GGCX, EPHX1 and VKORC1L1. Univariate analysis was conducted for the association of genotypes with stable warfarin doses. Multiple linear regression analysis was used to investigate factors that independently affected the inter-individual variability of warfarin dose requirements. The rs4072879 of VKORC1L1 (A>G) was significantly associated with stable warfarin doses; wild homozygote carriers (AA) required significantly lower stable warfarin doses than those with the variant G allele (5.02±1.56 vs. 5.96±2.01mg; p=0.001). Multivariate analysis showed that EPHX1 rs1877724 and VKORC1L1 rs4072879 accounted for 1.5% and 1.3% of the warfarin dose variability. Adding EPHX1 and VKORC1L1 SNPs to the base model including non-genetic variables (operation age, body weight and the therapy of ACEI or ARB) and genetic variables (VKORC1 rs9934438, CYP2C9 rs1057910, and CYP4F2 rs2108622) gave a number needed to genotype of 34. This study showed that polymorphisms of EPHX1 and VKORC1L1 could be determinants of stable warfarin doses. Copyright © 2017. Published by Elsevier B.V.
Ge, Tian; Nichols, Thomas E; Ghosh, Debashis; Mormino, Elizabeth C; Smoller, Jordan W; Sabuncu, Mert R
2015-04-01
Measurements derived from neuroimaging data can serve as markers of disease and/or healthy development, are largely heritable, and have been increasingly utilized as (intermediate) phenotypes in genetic association studies. To date, imaging genetic studies have mostly focused on discovering isolated genetic effects, typically ignoring potential interactions with non-genetic variables such as disease risk factors, environmental exposures, and epigenetic markers. However, identifying significant interaction effects is critical for revealing the true relationship between genetic and phenotypic variables, and shedding light on disease mechanisms. In this paper, we present a general kernel machine based method for detecting effects of the interaction between multidimensional variable sets. This method can model the joint and epistatic effect of a collection of single nucleotide polymorphisms (SNPs), accommodate multiple factors that potentially moderate genetic influences, and test for nonlinear interactions between sets of variables in a flexible framework. As a demonstration of application, we applied the method to the data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) to detect the effects of the interactions between candidate Alzheimer's disease (AD) risk genes and a collection of cardiovascular disease (CVD) risk factors, on hippocampal volume measurements derived from structural brain magnetic resonance imaging (MRI) scans. Our method identified that two genes, CR1 and EPHA1, demonstrate significant interactions with CVD risk factors on hippocampal volume, suggesting that CR1 and EPHA1 may play a role in influencing AD-related neurodegeneration in the presence of CVD risks. Copyright © 2015 Elsevier Inc. All rights reserved.
Dong, Yun-Wei; Han, Guo-Dong; Huang, Xiong-Wei
2014-09-01
In the natural environment, organisms are exposed to large variations in physical conditions. Quantifying such physiological responses is, however, often performed in laboratory acclimation studies, in which usually only a single factor is varied. In contrast, field acclimatization may expose organisms to concurrent changes in several environmental variables. The interactions of these factors may have strong effects on organismal function. In particular, rare events that occur stochastically and have relatively short duration may have strong effects. The present experiments studied levels of expression of several genes associated with cellular stress and metabolic regulation in a field population of limpet Cellana toreuma that encountered a wide range of temperatures plus periodic rain events. Physiological responses to these variable conditions were quantified by measuring levels of mRNA of genes encoding heat-shock proteins (Hsps) and metabolic sensors (AMPKs and Sirtuin 1). Our results reveal a high proportion of individuals with upregulated stress-related gene expression at high temperatures and on rainy days, indicating the occurrence of stress from both prevailing high summer temperatures and occasional rainfall during periods of emersion. At high temperature, stress due to exposure to rainfall may be more challenging than heat stress alone. The highly variable physiological performances of limpets in their natural habitats indicate possible differences in the capability for physiological regulation among individuals. Our results emphasize the importance of studies of field acclimatization in unravelling the effects of environmental change on organisms, notably in the context of multiple changes in abiotic factors that are accompanying global change. © 2014 John Wiley & Sons Ltd.
Levran, Orna; Randesi, Matthew; Peles, Einat; Correa da Rosa, Joel; Ott, Jurg; Rotrosen, John; Adelson, Miriam; Kreek, Mary Jeanne
2016-06-01
This study was designed to determine whether polymorphisms in acetylcholine receptors contribute to opioid dependence and/or cocaine dependence. The sample (n = 1860) was divided by drug and ancestry, and 55 polymorphisms (nine genes) were analyzed. Of the 20 SNPs that showed nominally significant associations, the association of the African-specific CHRM4 SNP rs2229163 (Asn417=) with cocaine dependence survived correction for multiple testing (Pcorrected = 0.047). CHRM4 is located in a region of strong linkage disequilibrium on chromosome 11 that includes genes associated with schizophrenia. CHRM4 SNP rs2229163 is in strong linkage disequilibrium with several African-specific SNPs in DGKZ and AMBRA1. Cholinergic receptors' variants may contribute to drug addiction and have a potential role as pharmacogenetic markers.
Hypertension, dyslipidemia, and insulin resistance: links in a chain or spokes on a wheel?
Hopkins, P N; Hunt, S C; Wu, L L; Williams, G H; Williams, R R
1996-08-01
Rather than a link in a causal chain leading to hypertension, insulin resistance and resultant hyperinsulinemia may be 'spokes on a wheel', with central or visceral obesity as the postulated hub of the wheel. Hypertension, hypertriglyceridemia and high density lipoprotein cholesterol are depicted as other spokes. Newly identified metabolic pathways in adipose tissue or the modulating effects of various predisposing genes may lead to variable expression of various components of the multiple metabolic syndrome in individuals with a predisposition to the collection of visceral fat.
Genetic Structure and Gene Flows within Horses: A Genealogical Study at the French Population Scale
Pirault, Pauline; Danvy, Sophy; Verrier, Etienne; Leroy, Grégoire
2013-01-01
Since horse breeds constitute populations submitted to variable and multiple outcrossing events, we analyzed the genetic structure and gene flows considering horses raised in France. We used genealogical data, with a reference population of 547,620 horses born in France between 2002 and 2011, grouped according to 55 breed origins. On average, individuals had 6.3 equivalent generations known. Considering different population levels, fixation index decreased from an overall species FIT of 1.37%, to an average of −0.07% when considering the 55 origins, showing that most horse breeds constitute populations without genetic structure. We illustrate the complexity of gene flows existing among horse breeds, a few populations being closed to foreign influence, most, however, being submitted to various levels of introgression. In particular, Thoroughbred and Arab breeds are largely used as introgression sources, since those two populations explain together 26% of founder origins within the overall horse population. When compared with molecular data, breeds with a small level of coancestry also showed low genetic distance; the gene pool of the breeds was probably impacted by their reproducer exchanges. PMID:23630596
Flatworms have lost the right open reading frame kinase 3 gene during evolution
Breugelmans, Bert; Ansell, Brendan R. E.; Young, Neil D.; Amani, Parisa; Stroehlein, Andreas J.; Sternberg, Paul W.; Jex, Aaron R.; Boag, Peter R.; Hofmann, Andreas; Gasser, Robin B.
2015-01-01
All multicellular organisms studied to date have three right open reading frame kinase genes (designated riok-1, riok-2 and riok-3). Current evidence indicates that riok-1 and riok-2 have essential roles in ribosome biosynthesis, and that the riok-3 gene assists this process. In the present study, we conducted a detailed bioinformatic analysis of the riok gene family in 25 parasitic flatworms (platyhelminths) for which extensive genomic and transcriptomic data sets are available. We found that none of the flatworms studied have a riok-3 gene, which is unprecedented for multicellular organisms. We propose that, unlike in other eukaryotes, the loss of RIOK-3 from flatworms does not result in an evolutionary disadvantage due to the unique biology and physiology of this phylum. We show that the loss of RIOK-3 coincides with a loss of particular proteins associated with essential cellular pathways linked to cell growth and apoptosis. These findings indicate multiple, key regulatory functions of RIOK-3 in other metazoan species. Taking advantage of a known partial crystal structure of human RIOK-1, molecular modelling revealed variability in nucleotide binding sites between flatworm and human RIOK proteins. PMID:25976756
Banerjee, Bodhisattwa; Koner, Debaprasad; Bhuyan, Gitalee; Saha, Nirmalendu
2018-06-01
The present study demonstrates the unique presence of three different gs genes (cmgs01, cmgs02, and cmgs03) in the air-breathing ureogenic magur catfish (Clarias magur), whereas GS is otherwise reported to be encoded by a single gene in higher vertebrates. Of these three genes, two (cmgs01 and cmgs03) were identified as the 'liver' form, predominantly expressed in liver cells, and the third (cmgs02) as the 'brain' form, expressed chiefly in brain cells. Molecular characterization studies have revealed conservation of homologous active site residues in all three gs genes. In silico analysis, accompanied by GS enzyme assay and Western blot analysis of different GS isoforms in different subcellular fractions, indicated the mitochondrial localization of cmGS01 and cmGS03 in liver and kidney cells and cytosolic localization of cmGS02 in brain cells. Further, exposure of magur catfish to high external ammonia (HEA; 25 mM NH4Cl) led to a significant induction of multiple gs genes, as evidenced by higher expression of different gs mRNAs at variable levels in different tissues. The cmgs01 and cmgs03 mRNA levels were significantly elevated in liver, kidney, muscle, and gills, whereas the cmgs02 mRNA level increased considerably in the brain after 14 days of exposure to HEA. These increases in mRNA levels were associated with a significant rise in cmGS01 and cmGS03 proteins in liver, kidney, muscle, and gills, and the cmGS02 protein in the brain after 14 days of exposure to HEA. Therefore, it can be concluded that the unique differential expression of three gs genes and their induction under high ammonia levels probably help in detoxification of ammonia to glutamine and further to urea via the ornithine-urea cycle in ureogenic as well as non-ureogenic tissues of these magur catfish. Copyright © 2017. Published by Elsevier B.V.
Hartnett, M Elizabeth; Morrison, Margaux A; Smith, Silvia; Yanovitch, Tammy L; Young, Terri L; Colaizy, Tarah; Momany, Allison; Dagle, John; Carlo, Waldemar A; Clark, Erin A S; Page, Grier; Murray, Jeff; DeAngelis, Margaret M; Cotten, C Michael
2014-08-12
To determine genetic variants associated with severe retinopathy of prematurity (ROP) in a candidate gene cohort study of US preterm infants. Preterm infants in the discovery cohort were enrolled through the Eunice Kennedy Shriver National Institute of Child Health and Human Development Neonatal Research Network, and those in the replication cohort were from the University of Iowa. All infants were phenotyped for ROP severity. Because of differences in the durations of enrollment between cohorts, severe ROP was defined as threshold disease in the discovery cohort and as threshold disease or type 1 ROP in the replication cohort. Whole genome amplified DNA from stored blood spot samples from the Neonatal Research Network biorepository was genotyped using an Illumina GoldenGate platform for candidate gene single nucleotide polymorphisms (SNPs) involving angiogenic, developmental, inflammatory, and oxidative pathways. Three analyses were performed to determine significant epidemiologic variables and SNPs associated with levels of ROP severity. Analyses controlled for multiple comparisons, ancestral eigenvalues, family relatedness, and significant epidemiologic variables. Single nucleotide polymorphisms significantly associated with ROP severity from the discovery cohort were analyzed in the replication cohort and in meta-analysis. Eight hundred seventeen infants in the discovery cohort and 543 in the replication cohort were analyzed. Severe ROP occurred in 126 infants in the discovery cohort and in 14 in the replication cohort. In both cohorts, ventilation days and seizure occurrence were associated with severe ROP. After controlling for significant factors and multiple comparisons, two intronic SNPs in the gene BDNF (rs7934165 and rs2049046, P < 3.1 × 10⁻⁵) were associated with severe ROP in the discovery cohort but were not associated with severe ROP in the replication cohort. However, when the cohorts were analyzed together in an exploratory meta-analysis, the significance of the association of rs7934165 with severe ROP increased (P = 2.9 × 10⁻⁷). Variants in BDNF encoding brain-derived neurotrophic factor were associated with severe ROP in a large candidate gene study of infants with threshold ROP. Copyright 2014 The Association for Research in Vision and Ophthalmology, Inc.
Wojciechowski, Robert; Yee, Stephanie S.; Simpson, Claire L.; Bailey-Wilson, Joan E.; Stambolian, Dwight
2012-01-01
Purpose: A previous study of Old Order Amish families has shown association of ocular refraction with markers proximal to matrix metalloproteinase (MMP) genes MMP1 and MMP10 and intragenic to MMP2. We conducted a candidate gene replication study of association between refraction and single nucleotide polymorphisms (SNPs) within these genomic regions. Design: Candidate gene genetic association study. Participants: 2,000 participants drawn from the Age Related Eye Disease Study (AREDS) were chosen for genotyping. After quality control filtering, 1912 individuals were available for analysis. Methods: Microarray genotyping was performed using the HumanOmni 2.5 bead array. SNPs originally typed in the previous Amish association study were extracted for analysis. In addition, haplotype tagging SNPs were genotyped using TaqMan assays. Quantitative trait association analyses of mean spherical equivalent refraction (MSE) were performed on 30 markers using linear regression models and an additive genetic risk model, while adjusting for age, sex, education, and population substructure. Post-hoc analyses were performed after stratifying on a dichotomous education variable. Pointwise (P-emp) and multiple-test study-wise (P-multi) significance levels were calculated empirically through permutation. Main outcome measures: MSE was used as a quantitative measure of ocular refraction. Results: The mean age and ocular refraction were 68 years (SD=4.7) and +0.55 D (SD=2.14), respectively. Pointwise statistical significance was obtained for rs1939008 (P-emp=0.0326). No SNP attained statistical significance after correcting for multiple testing. In stratified analyses, multiple SNPs reached pointwise significance in the lower-education group: 2 of these were statistically significant after multiple testing correction. The two highest-ranking SNPs in Amish families (rs1939008 and rs9928731) showed pointwise P-emp<0.01 in the lower-education stratum of AREDS participants. Conclusions: We show suggestive evidence of replication of an association signal for ocular refraction to a marker between MMP1 and MMP10. We also provide evidence of a gene-environment interaction between previously-reported markers and education on refractive error. Variants in MMP1-MMP10 and MMP2 regions appear to affect population variation in ocular refraction in environmental conditions less favorable for myopia development. PMID:23098370
Kanoun, Houda; Jarraya, Faiçal; Maalej, Bayen; Lahiani, Amina; Mahfoudh, Hichem; Makni, Fatma; Hachicha, Jamil; Fakhfakh, Faiza
2017-10-02
Primary hyperoxaluria type 1 (PH1) is an autosomal recessive inherited disorder of glyoxylate metabolism in which excessive oxalates are formed by the liver and excreted by the kidneys. Calcium oxalate crystallizes in the urine, leading to urolithiasis, nephrocalcinosis, and consequent renal failure if treatment is not initiated promptly. Mutations in the AGXT gene, which encodes the hepatic peroxisomal enzyme alanine:glyoxylate aminotransferase, are responsible for PH1. In the present work, we aimed to analyze the AGXT gene, together with in silico investigations, in four patients with PH1 from two non-consanguineous families. Exhaustive gene sequencing was performed after PCR amplification of coding exons and intron boundaries. Bioinformatic tools were used to predict the impact of AGXT variants on gene expression as well as on protein structure and function. Direct sequencing of all exons of the AGXT gene revealed multiple mutations in the compound heterozygous state in the two studied families. Two patients were compound heterozygous for c.731 T > C, c.32C > T, c.1020A > G and c.33_34insC and presented clinically with recurrent urinary tract infection, multiple urolithiasis and nephrocalcinosis under the age of 1 year and a persistent hyperoxaluria at the age of diagnosis. The two other patients, who presented less severe phenotypes, were heterozygous for c.731 T > C and homozygous for c.32C > T and c.1020A > G, or compound heterozygous for the c.26C > A and c.65A > G variants. In summary, we highlight the relevance of compound heterozygous mutations in non-consanguineous PH1 families with variable severity.
Exploring seascape genetics and kinship in the reef sponge Stylissa carteri in the Red Sea
Giles, Emily C; Saenz-Agudelo, Pablo; Hussey, Nigel E; Ravasi, Timothy; Berumen, Michael L
2015-01-01
A main goal of population geneticists is to study patterns of gene flow to gain a better understanding of the population structure in a given organism. To date most efforts have been focused on studying gene flow at either broad scales to identify barriers to gene flow and isolation by distance or at fine spatial scales in order to gain inferences regarding reproduction and local dispersal. Few studies have measured connectivity at multiple spatial scales and have utilized novel tools to test the influence of both environment and geography on shaping gene flow in an organism. Here a seascape genetics approach was used to gain insight regarding geographic and ecological barriers to gene flow of a common reef sponge, Stylissa carteri in the Red Sea. Furthermore, a small-scale (<1 km) analysis was also conducted to infer reproductive potential in this organism. At the broad scale, we found that sponge connectivity is not structured by geography alone, but rather, genetic isolation in the southern Red Sea correlates strongly with environmental heterogeneity. At the scale of a 50-m transect, spatial autocorrelation analyses and estimates of full-siblings revealed that there is no deviation from random mating. However, at slightly larger scales (100–200 m) encompassing multiple transects at a given site, a greater proportion of full-siblings was found within sites versus among sites in a given location suggesting that mating and/or dispersal are constrained to some extent at this spatial scale. This study adds to the growing body of literature suggesting that environmental and ecological variables play a major role in the genetic structure of marine invertebrate populations. PMID:26257865
Selection of Reference Gene Expression in a Schizophrenia Brain Cohort
Weickert, Cynthia Shannon; Sheedy, Donna; Rothmond, Debora A.; Dedova, Irina; Fung, Samantha; Garrick, Therese; Wong, Jenny; Harding, Antony J.; Sivagnanansundaram, Sinthuja; Hunt, Clare; Duncan, Carlotta; Sundqvist, Nina; Tsai, Shan-Yuan; Anand, Jasna; Draganic, Daren; Harper, Clive
2010-01-01
Objective: To conduct postmortem human brain research into the neuropathological basis of schizophrenia, it is critical to establish cohorts that are well-characterised and well-matched. Our objective was to determine if specimen characteristics, including diagnosis, age, postmortem interval (PMI), brain acidity (pH), and/or the agonal state of the subject at death, were related to RNA quality, and to determine the most appropriate reference gene mRNAs. Methods: We selected a matched cohort of 74 cases (37 schizophrenia / schizoaffective disorder cases and 37 control cases). Middle frontal gyrus tissue was pulverised, tissue pH was measured, RNA was isolated for cDNA from each case, and RNA integrity number (RIN) measurements were assessed. Using RT-PCR, we measured nine housekeeper genes and calculated a geomean in each diagnostic group. Results: We found that the RINs were very good (mean 7.3) and all nine housekeeper control genes were significantly correlated with RIN. Seven of nine housekeeper genes were also correlated with pH, and two clinical variables, agonal state and duration of illness, did have an effect on some control mRNAs. No major impact of PMI or freezer time on housekeeper mRNAs was detected. Our results show that people with schizophrenia had significantly less PPIA and SDHA mRNA and tended to have less GUSB and B2M mRNA, suggesting that these control genes may not be good candidates for normalisation. Conclusions: In our cohort, less than 10% variability in RIN values was detected and the diagnostic groups were well matched overall. Our cohort was adequately powered (0.80–0.90) to detect mRNA differences (25%) due to disease. Our study suggests that multiple factors should be considered in mRNA expression studies of human brain tissues. When schizophrenia cases are adequately matched to control cases, subtle differences in gene expression can be reliably detected. PMID:20073568
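The geomean step mentioned in the Methods can be illustrated with a minimal sketch. The function names and the expression values below are hypothetical and are not taken from the study; the sketch only shows how a geometric mean of reference-gene levels is typically computed and used as a normalisation factor.

```python
import math


def geometric_mean(values):
    """Geometric mean of strictly positive numbers, computed in log space."""
    if not values:
        raise ValueError("need at least one value")
    if any(v <= 0 for v in values):
        raise ValueError("all values must be positive")
    return math.exp(sum(math.log(v) for v in values) / len(values))


def normalise_to_reference(target_level, reference_levels):
    """Divide a target mRNA level by the geomean of the reference-gene levels."""
    return target_level / geometric_mean(reference_levels)


# Hypothetical relative expression levels of three reference genes in one sample.
reference_levels = [2.0, 8.0, 4.0]
print(geometric_mean(reference_levels))
print(normalise_to_reference(12.0, reference_levels))
```

A geometric rather than arithmetic mean is the usual choice here because a single highly expressed reference gene then dominates the normalisation factor less.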
Evolution of genes and repeats in the Nimrod superfamily.
Somogyi, Kálmán; Sipos, Botond; Pénzes, Zsolt; Kurucz, Eva; Zsámboki, János; Hultmark, Dan; Andó, István
2008-11-01
The recently identified Nimrod superfamily is characterized by the presence of a special type of EGF repeat, the NIM repeat, located right after a typical CCXGY/W amino acid motif. On the basis of structural features, nimrod genes can be divided into three types. The proteins encoded by Draper-type genes have an EMI domain at the N-terminal part and only one copy of the NIM motif, followed by a variable number of EGF-like repeats. The products of Nimrod B-type and Nimrod C-type genes (including the eater gene) have different kinds of N-terminal domains, and lack EGF-like repeats but contain a variable number of NIM repeats. Draper and Nimrod C-type (but not Nimrod B-type) proteins carry a transmembrane domain. Several members of the superfamily were claimed to function as receptors in phagocytosis and/or binding of bacteria, which indicates an important role in cellular immunity and the elimination of apoptotic cells. In this paper, the evolution of the Nimrod superfamily is studied with various methods on the level of genes and repeats. A hypothesis is presented in which the NIM repeat, along with the EMI domain, emerged by structural reorganizations at the end of an EGF-like repeat chain, suggesting a mechanism for the formation of novel types of repeats. The analyses revealed diverse evolutionary patterns in the sequences containing multiple NIM repeats. Although the Nimrod B and Nimrod C proteins show characteristics of independent evolution, many internal NIM repeats in Eater sequences seem to have undergone concerted evolution. An analysis of the nimrod genes has been performed using phylogenetic and other methods and an evolutionary scenario of the origin and diversification of the Nimrod superfamily is proposed. Our study presents an intriguing example of how the evolution of multigene families may contribute to the complexity of the innate immune response.
Gene set analysis using variance component tests.
Huang, Yen-Tsung; Lin, Xihong
2013-06-28
Gene set analyses have become increasingly important in genomic research, as many complex diseases are contributed jointly by alterations of numerous genes. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). We develop a gene set analyses method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and global test in both simulation and a diabetes microarray data.
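The permutation step used to obtain p-values can be sketched generically. The code below is not TEGS: the test statistic (a sum of squared differences in group means, which ignores correlation among genes) and all function names and toy data are assumptions chosen only to show how a permutation p-value for a gene-set statistic is computed.

```python
import random


def set_statistic(expr, labels):
    """Sum over genes of the squared difference in group means.

    expr: list of samples, each a list of expression values (one per gene).
    labels: list of 0/1 exposure labels, one per sample.
    """
    exposed = [row for row, lab in zip(expr, labels) if lab == 1]
    unexposed = [row for row, lab in zip(expr, labels) if lab == 0]
    stat = 0.0
    for j in range(len(expr[0])):
        mean_exposed = sum(row[j] for row in exposed) / len(exposed)
        mean_unexposed = sum(row[j] for row in unexposed) / len(unexposed)
        stat += (mean_exposed - mean_unexposed) ** 2
    return stat


def permutation_p_value(expr, labels, n_perm=2000, seed=0):
    """Estimate a p-value by shuffling the exposure labels across samples."""
    rng = random.Random(seed)
    observed = set_statistic(expr, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if set_statistic(expr, shuffled) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_perm + 1)


# Hypothetical toy data: 6 samples, 2 genes, first 3 samples exposed.
expr = [[5, 6], [6, 5], [5, 5], [1, 2], [2, 1], [1, 1]]
labels = [1, 1, 1, 0, 0, 0]
print(permutation_p_value(expr, labels))
```

With only six samples there are few distinct labelings, so the smallest attainable p-value is bounded well above zero; real analyses need enough samples for the permutation distribution to be informative.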
HIV promoter integration site primarily modulates transcriptional burst size rather than frequency.
Skupsky, Ron; Burnett, John C; Foley, Jonathan E; Schaffer, David V; Arkin, Adam P
2010-09-30
Mammalian gene expression patterns, and their variability across populations of cells, are regulated by factors specific to each gene in concert with its surrounding cellular and genomic environment. Lentiviruses such as HIV integrate their genomes into semi-random genomic locations in the cells they infect, and the resulting viral gene expression provides a natural system to dissect the contributions of genomic environment to transcriptional regulation. Previously, we showed that expression heterogeneity and its modulation by specific host factors at HIV integration sites are key determinants of infected-cell fate and a possible source of latent infections. Here, we assess the integration context dependence of expression heterogeneity from diverse single integrations of a HIV-promoter/GFP-reporter cassette in Jurkat T-cells. Systematically fitting a stochastic model of gene expression to our data reveals an underlying transcriptional dynamic, by which multiple transcripts are produced during short, infrequent bursts, that quantitatively accounts for the wide, highly skewed protein expression distributions observed in each of our clonal cell populations. Interestingly, we find that the size of transcriptional bursts is the primary systematic covariate over integration sites, varying from a few to tens of transcripts across integration sites, and correlating well with mean expression. In contrast, burst frequencies are scattered about a typical value of several per cell-division time and demonstrate little correlation with the clonal means. This pattern of modulation generates consistently noisy distributions over the sampled integration positions, with large expression variability relative to the mean maintained even for the most productive integrations, and could contribute to specifying heterogeneous, integration-site-dependent viral production patterns in HIV-infected cells. 
Genomic environment thus emerges as a significant control parameter for gene expression variation that may contribute to structuring mammalian genomes, as well as be exploited for survival by integrating viruses.
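The distinction between burst size and burst frequency can be made concrete with a textbook simplification. The sketch below is not the authors' fitting procedure: it assumes a one-stage bursty model in which bursts arrive at a constant rate and burst sizes are geometrically distributed, so that the steady-state mean is (frequency × size) and the Fano factor (variance / mean) is (1 + size). Function names and the example numbers are hypothetical.

```python
def burst_parameters(mean, variance):
    """Method-of-moments estimates under the assumed one-stage bursty model.

    Returns (burst_size, burst_frequency), where frequency is the number of
    bursts per molecule lifetime.
    """
    fano = variance / mean
    if fano <= 1.0:
        raise ValueError("Fano factor must exceed 1 for a bursty model")
    burst_size = fano - 1.0
    burst_frequency = mean / burst_size
    return burst_size, burst_frequency


def squared_cv(mean, variance):
    """Noise measured as the squared coefficient of variation."""
    return variance / mean ** 2


# Two hypothetical clones with the same burst frequency but different burst sizes.
clones = {"low-expressing": (10.0, 60.0), "high-expressing": (100.0, 5100.0)}
for name, (mean, variance) in clones.items():
    size, freq = burst_parameters(mean, variance)
    print(name, size, freq, squared_cv(mean, variance))
```

In this simplified picture, raising the burst size raises the mean while leaving the relative noise large, which matches the qualitative pattern the abstract describes for the most productive integrations.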
Xia, Zheng; Donehower, Lawrence A; Cooper, Thomas A.; Neilson, Joel R.; Wheeler, David A.; Wagner, Eric J.; Li, Wei
2015-01-01
Alternative polyadenylation (APA) is a pervasive mechanism in the regulation of most human genes, and its implication in diseases including cancer is only beginning to be appreciated. Since conventional APA profiling has not been widely adopted, global cancer APA studies are very limited. Here we develop a novel bioinformatics algorithm (DaPars) for the de novo identification of dynamic APAs from standard RNA-seq. When applied to 358 TCGA Pan-Cancer tumor/normal pairs across 7 tumor types, DaPars reveals 1,346 genes with recurrent and tumor-specific APAs. Most APA genes (91%) have shorter 3′ UTRs in tumors that can avoid miRNA-mediated repression, including glutaminase (GLS), a key metabolic enzyme for tumor proliferation. Interestingly, selected APA events add strong prognostic power beyond common clinical and molecular variables, suggesting their potential as novel prognostic biomarkers. Finally, our results implicate CstF64, an essential polyadenylation factor, as a master regulator of 3′ UTR shortening across multiple tumor types. PMID:25409906
Zhai, Yiqian; Zhang, Lichao; Xia, Chuan; Fu, Silu; Zhao, Guangyao; Jia, Jizeng; Kong, Xiuying
2016-05-13
Although bHLH transcription factors play important roles regulating plant development and abiotic stress response and tolerance, few functional studies have been performed in wheat. In this study, we isolated and characterized a bHLH gene, TabHLH39, from wheat. The TabHLH39 gene is located on wheat chromosome 5DL, and the protein localized to the nucleus and activated transcription. TabHLH39 showed variable expression in roots, stems, leaves, glumes, pistils and stamens and was induced by polyethylene glycol, salt and cold treatments. Further analysis revealed that TabHLH39 overexpression in Arabidopsis significantly enhanced tolerance to drought, salt and freezing stress during the seedling stage, which was also demonstrated by enhanced abiotic stress-response gene expression and changes to several physiological indices. Therefore, TabHLH39 has potential in transgenic breeding applications to improve abiotic stress tolerance in crops. Copyright © 2016 Elsevier Inc. All rights reserved.
Genomic insights into the uncultured genus 'Candidatus Magnetobacterium' in the phylum Nitrospirae.
Lin, Wei; Deng, Aihua; Wang, Zhang; Li, Ying; Wen, Tingyi; Wu, Long-Fei; Wu, Martin; Pan, Yongxin
2014-12-01
Magnetotactic bacteria (MTB) of the genus 'Candidatus Magnetobacterium' in phylum Nitrospirae are of great interest because of the formation of hundreds of bullet-shaped magnetite magnetosomes in multiple bundles of chains per cell. These bacteria are distributed worldwide in aquatic environments and have important roles in the biogeochemical cycles of iron and sulfur. However, except for a few short genomic fragments, no genome data are available for this ecologically important genus, and little is known about their metabolic capacity owing to the lack of pure cultures. Here we report the first draft genome sequence of 3.42 Mb from an uncultivated strain tentatively named 'Ca. Magnetobacterium casensis' isolated from Lake Miyun, China. The genome sequence indicates an autotrophic lifestyle using the Wood-Ljungdahl pathway for CO2 fixation, which has not been described in any previously known MTB or Nitrospirae organisms. Pathways involved in denitrification, sulfur oxidation and sulfate reduction have been predicted, indicating its considerable capacity for adaptation to variable geochemical conditions and roles in local biogeochemical cycles. Moreover, we have identified a complete magnetosome gene island containing mam, mad and a set of novel genes (named man genes) putatively responsible for the formation of bullet-shaped magnetite magnetosomes and the arrangement of multiple magnetosome chains. This first comprehensive genomic analysis sheds light on the physiology, ecology and biomineralization of the poorly understood 'Ca. Magnetobacterium' genus.
Deng, Wenping; Zhang, Kui; Liu, Sanzhen; Zhao, Patrick; Xu, Shizhong; Wei, Hairong
2018-04-30
Joint reconstruction of multiple gene regulatory networks (GRNs) using gene expression data from multiple tissues/conditions is very important for understanding common and tissue/condition-specific regulation. However, there are currently no computational models and methods available for directly constructing such multiple GRNs that not only share some common hub genes but also possess tissue/condition-specific regulatory edges. In this paper, we proposed a new graphical Gaussian model for joint reconstruction of multiple gene regulatory networks (JRmGRN), which highlights hub genes, using gene expression data from several tissues/conditions. Under the framework of the Gaussian graphical model, the JRmGRN method constructs the GRNs by maximizing a penalized log-likelihood function. We formulated it as a convex optimization problem, and then solved it with an alternating direction method of multipliers (ADMM) algorithm. The performance of JRmGRN was first evaluated with synthetic data, and the results showed that JRmGRN outperformed several other methods for reconstruction of GRNs. We also applied our method to real Arabidopsis thaliana RNA-seq data from two light regime conditions in comparison with other methods, and both common hub genes and some condition-specific hub genes were identified with higher accuracy and precision. Availability: JRmGRN is available as an R program from: https://github.com/wenpingd. Contact: hairong@mtu.edu. Supplementary information: Proof of theorem, derivation of algorithm and supplementary data are available at Bioinformatics online.
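For orientation, a penalized log-likelihood for jointly estimating K Gaussian graphical models can be written in the generic form below. This is a sketch of the general shape of such objectives only; the abstract does not state the JRmGRN penalty, so the penalty term P and the symbols \Theta^{(k)}, S^{(k)}, n_k and \lambda are assumptions introduced here for illustration.

```latex
\max_{\Theta^{(1)},\dots,\Theta^{(K)} \succ 0}\;
\sum_{k=1}^{K} n_k \left[ \log\det \Theta^{(k)}
  - \operatorname{tr}\!\left( S^{(k)} \Theta^{(k)} \right) \right]
\;-\; \lambda\, P\!\left( \Theta^{(1)},\dots,\Theta^{(K)} \right)
```

Here \Theta^{(k)} is the precision (inverse covariance) matrix for tissue/condition k, S^{(k)} is the corresponding sample covariance matrix, and n_k is the sample size; constants and an overall scaling are omitted. Because log det is concave, the problem is convex whenever P is convex, which is the setting in which ADMM-type solvers are commonly applied.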
RNAi for functional genomics in plants.
McGinnis, Karen M
2010-03-01
RNAi refers to several different types of gene silencing mediated by small, dsRNA molecules. Over the course of 20 years, the scientific understanding of RNAi has developed from the initial observation of unexpected expression patterns to a sophisticated understanding of a multi-faceted, evolutionarily conserved network of mechanisms that regulate gene expression in many organisms. It has also been developed as a genetic tool that can be exploited in a wide range of species. Because transgene-induced RNAi has been effective at silencing one or more genes in a wide range of plants, this technology also bears potential as a powerful functional genomics tool across the plant kingdom. Transgene-induced RNAi has indeed been shown to be an effective mechanism for silencing many genes in many organisms, but the results from multiple projects which attempted to exploit RNAi on a genome-wide scale suggest that there is a great deal of variation in the silencing efficacy between transgenic events, silencing targets and silencing-induced phenotype. The results from these projects indicate several important variables that should be considered in experimental design prior to the initiation of functional genomics efforts based on RNAi silencing. In recent years, alternative strategies have been developed for targeted gene silencing, and a combination of approaches may also enhance the use of targeted gene silencing for functional genomics.
Bergsveinson, Jordyn; Ziola, Barry
2017-12-01
Beer-spoilage-related lactic acid bacteria (BSR LAB) belong to multiple genera and species; however, beer-spoilage capacity is isolate-specific and partially acquired via horizontal gene transfer within the brewing environment. Thus, the extent to which genus-, species-, or environment- (i.e., brewery-) level genetic variability influences beer-spoilage phenotype is unknown. Publicly available Lactobacillus brevis genomes were analyzed via BlAst Diagnostic Gene findEr (BADGE) for BSR genes and assessed for pangenomic relationships. Also analyzed were functional coding capacities of plasmids of LAB inhabiting extreme niche environments. Considerable genetic variation was observed in L. brevis isolated from clinical samples, whereas 16 candidate genes distinguish BSR and non-BSR L. brevis genomes. These genes are related to nutrient scavenging of gluconate or pentoses, mannose, and metabolism of pectin. BSR L. brevis isolates also have higher average nucleotide identity and stronger pangenome association with one another, though isolation source (i.e., specific brewery) also appears to influence the plasmid coding capacity of BSR LAB. Finally, it is shown that niche-specific adaptation and phenotype are plasmid-encoded for both BSR and non-BSR LAB. The ultimate combination of plasmid-encoded genes dictates the ability of L. brevis to survive in the most extreme beer environment, namely, gassed (i.e., pressurized) beer.
Trifonova, E A; Eremina, E R; Urnov, F D; Stepanov, V A
2012-01-01
The structure of the haplotypes and linkage disequilibrium (LD) of the methylenetetrahydrofolate reductase gene (MTHFR) in 9 population groups from Northern Eurasia and populations of the international HapMap project was investigated in the present study. The data suggest that the architecture of LD in the human genome is largely determined by the evolutionary history of populations; however, the results of phylogenetic and haplotype analyses seem to suggest that there may in fact be a common "old" mechanism for the formation of certain patterns of LD. Variability in the structure of LD and in the level of diversity of MTHFR haplotypes results in a specific set of tagSNPs with an established prognostic significance for each population. In our opinion, the results obtained in the present study are of considerable interest for understanding multiple genetic phenomena: namely, the association of interpopulation differences in the patterns of LD with structures possessing a genetic susceptibility to complex diseases, and the functional significance of the pleiotropic MTHFR gene effect. Summarizing the results of this study, a conclusion can be made that the genetic variability analysis with emphasis on the structure of LD in human populations is a powerful tool that can make a significant contribution to such areas of biomedical science as human evolutionary biology, functional genomics, genetics of complex diseases, and pharmacogenomics.
Brennan, Paul M; Barlow, Antonio; Geraghty, Alistair; Summers, David; Fitzpatrick, Michael M
2011-06-01
The most common genetic predisposition to multiple schwannoma growth is mutation of the neurofibromatosis type 2 gene. We describe a patient with multiple schwannomas and mutation in the recently described INI1 gene, which also predisposes to the disease. We explore the implications for prognosis and outcome.
Gcn4-Mediator Specificity Is Mediated by a Large and Dynamic Fuzzy Protein-Protein Complex.
Tuttle, Lisa M; Pacheco, Derek; Warfield, Linda; Luo, Jie; Ranish, Jeff; Hahn, Steven; Klevit, Rachel E
2018-03-20
Transcription activation domains (ADs) are inherently disordered proteins that often target multiple coactivator complexes, but the specificity of these interactions is not understood. Efficient transcription activation by yeast Gcn4 requires its tandem ADs and four activator-binding domains (ABDs) on its target, the Mediator subunit Med15. Multiple ABDs are a common feature of coactivator complexes. We find that the large Gcn4-Med15 complex is heterogeneous and contains nearly all possible AD-ABD interactions. Gcn4-Med15 forms via a dynamic fuzzy protein-protein interface, where ADs bind the ABDs in multiple orientations via hydrophobic regions that gain helicity. This combinatorial mechanism allows individual low-affinity and specificity interactions to generate a biologically functional, specific, and higher affinity complex despite lacking a defined protein-protein interface. This binding strategy is likely representative of many activators that target multiple coactivators, as it allows great flexibility in combinations of activators that can cooperate to regulate genes with variable coactivator requirements. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Kasher, Paul R; Schertz, Katherine E; Thomas, Megan; Jackson, Adam; Annunziata, Silvia; Ballesta-Martinez, María J; Campeau, Philippe M; Clayton, Peter E; Eaton, Jennifer L; Granata, Tiziana; Guillén-Navarro, Encarna; Hernando, Cristina; Laverriere, Caroline E; Liedén, Agne; Villa-Marcos, Olaya; McEntagart, Meriel; Nordgren, Ann; Pantaleoni, Chiara; Pebrel-Richard, Céline; Sarret, Catherine; Sciacca, Francesca L; Wright, Ronnie; Kerr, Bronwyn; Glasgow, Eric; Banka, Siddharth
2016-02-04
Genetic studies of intellectual disability and identification of monogenic causes of obesity in humans have made immense contribution toward the understanding of the brain and control of body mass. The leptin > melanocortin > SIM1 pathway is dysregulated in multiple monogenic human obesity syndromes but its downstream targets are still unknown. In ten individuals from six families, with overlapping 6q16.1 deletions, we describe a disorder of variable developmental delay, intellectual disability, and susceptibility to obesity and hyperphagia. The 6q16.1 deletions segregated with the phenotype in multiplex families and were shown to be de novo in four families, and there was dramatic phenotypic overlap among affected individuals who were independently ascertained without bias from clinical features. Analysis of the deletions revealed a ∼350 kb critical region on chromosome 6q16.1 that encompasses a gene for proneuronal transcription factor POU3F2, which is important for hypothalamic development and function. Using morpholino and mutant zebrafish models, we show that POU3F2 lies downstream of SIM1 and controls oxytocin expression in the hypothalamic neuroendocrine preoptic area. We show that this finding is consistent with the expression patterns of POU3F2 and related genes in the human brain. Our work helps to further delineate the neuro-endocrine control of energy balance/body mass and demonstrates that this molecular pathway is conserved across multiple species. Copyright © 2016 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Molecular study on some antibiotic resistant genes in Salmonella spp. isolates
NASA Astrophysics Data System (ADS)
Nabi, Ari Q.
2017-09-01
Studying the genes related to antimicrobial resistance in Salmonella spp. is a crucial step toward a correct and faster treatment of infections caused by the pathogen. In this work, the integron-mediated antibiotic resistance gene IntI1 (Class I Integrase IntI1) and some plasmid-mediated antibiotic resistance genes (Qnr) were screened, using SYBR Green real-time PCR, in isolated non-typhoid Salmonella strains with known resistance to some important antimicrobial drugs. The aim of the study was to correlate the multiple antibiotic and antimicrobial resistance of Salmonella spp. with the presence of the integrase (IntI1) gene and plasmid-mediated quinolone resistance genes. Results revealed the presence of the Class I Integrase gene in 76% of the isolates with confirmed multiple antibiotic resistance. Moreover, about 32% of the multiple antibiotic resistant serotypes showed a positive R-PCR for the plasmid-mediated qnrA gene encoding nalidixic acid and ciprofloxacin resistance. No positive results were obtained from R-PCRs targeting qnrB or qnrS. In light of these results we can conclude that the presence of at least one of the qnr genes and/or the presence of the Integrase Class I gene was responsible for the multiple antibiotic resistance to nalidixic acid and ciprofloxacin in the studied Salmonella spp., and that further studies are required to identify the genes related to multiple antibiotic resistance of the pathogen.
Classes and continua of hippocampal CA1 inhibitory neurons revealed by single-cell transcriptomics.
Harris, Kenneth D; Hochgerner, Hannah; Skene, Nathan G; Magno, Lorenza; Katona, Linda; Bengtsson Gonzales, Carolina; Somogyi, Peter; Kessaris, Nicoletta; Linnarsson, Sten; Hjerling-Leffler, Jens
2018-06-18
Understanding any brain circuit will require a categorization of its constituent neurons. In hippocampal area CA1, at least 23 classes of GABAergic neuron have been proposed to date. However, this list may be incomplete; additionally, it is unclear whether discrete classes are sufficient to describe the diversity of cortical inhibitory neurons or whether continuous modes of variability are also required. We studied the transcriptomes of 3,663 CA1 inhibitory cells, revealing 10 major GABAergic groups that divided into 49 fine-scale clusters. All previously described and several novel cell classes were identified, with three previously described classes unexpectedly found to be identical. A division into discrete classes, however, was not sufficient to describe the diversity of these cells, as continuous variation also occurred between and within classes. Latent factor analysis revealed that a single continuous variable could predict the expression levels of several genes, which correlated similarly with it across multiple cell types. Analysis of the genes correlating with this variable suggested it reflects a range from metabolically highly active faster-spiking cells that proximally target pyramidal cells to slower-spiking cells targeting distal dendrites or interneurons. These results elucidate the complexity of inhibitory neurons in one of the simplest cortical structures and show that characterizing these cells requires continuous modes of variation as well as discrete cell classes.
Song, Minghui; Shi, Chunlei; Xu, Xuebing; Shi, Xianming
2016-11-01
The enterotoxin gene cluster (egc) has been proposed to contribute to the Staphylococcus aureus colonization, which highlights the need to evaluate genetic diversity and virulence gene profiles of the egc-positive population. Here, a total of 43 egc-positive isolates (16.2%) were identified from 266 S. aureus isolates that were obtained from various food and clinical specimens in Shanghai. Seven different egc profiles were found based on the polymerase chain reaction (PCR) result for egc genes. Then, these 43 egc-positive isolates were further typed by multilocus sequence typing, pulsed-field gel electrophoresis (PFGE), multiple-locus variable-number tandem-repeat analysis (MLVA), and accessory gene regulatory (agr) typing. It showed that the 43 egc-positive isolates displayed 17 sequence types, 28 PFGE patterns, 29 MLVA types, and 4 agr types, respectively. Among them, the dominant clonal lineage was CC5-agr II (48.84%). Thirty toxin and 20 adhesion-associated genes were detected by PCR in egc-positive isolates. Notably, invasive toxin genes showed a high prevalence, such as 76.7% for Panton-Valentine leukocidin encoding genes, 27.9% for sec, and 23.3% for tsst-1. Most of the examined adhesion-associated genes were found to be conserved (76.7-100%), whereas the fnbB gene was only found in 8 (18.6%) isolates. In addition, 33 toxin gene profiles and 13 adhesion gene profiles were identified, respectively. Our results imply that isolates belonging to the same clonal lineage harbored similar adhesion gene profiles but diverse toxin gene profiles. Overall, the high prevalence of invasive virulence genes increases the potential risk of egc-positive isolates in S. aureus infection.
Krebes, Lukas; Zeidler, Lisza; Frankowski, Jens; Bastrop, Ralf
2014-01-01
Microsporidia are single-celled, intracellular eukaryotes that parasitise a wide range of animals. The Nosema/Vairimorpha group includes some putative asexual species, and asexuality is proposed to have originated multiple times from sexual ancestors. Here, we studied the variation in the ribosomal DNA (rDNA) of 14 isolates of the presumed apomictic and vertically transmitted Nosema granulosis to evaluate its sexual status. The analysed DNA fragment contained a part of the small-subunit ribosomal gene (SSU) and the entire intergenic spacer (IGS). The mitochondrial cox1 gene of the host Gammarus duebeni (Crustacea) was analysed to temporally calibrate the system and to test the expectation of cophylogeny of host and parasite genealogies. Genetic variability of the SSU gene was very low within and between the isolates. In contrast, intraisolate (within a single host) variability of the IGS fell into two categories, because 12 isolates possessed a very high IGS genetic diversity and two isolates were almost invariable in the IGS. This difference suggests variable models of rDNA evolution involving birth-and-death and unexpectedly concerted evolution. An alternative explanation could be a likewise unattended mixed infection of host individuals by more than one parasite strain. Despite considerable genetic divergence between associated host mitochondrial haplotypes, some N. granulosis 'IGS populations' seem not to belong to different gene pools; the relevant tests failed to show significant differences between populations. A set of recombinant IGS sequences made our data incompatible with the model of a solely maternally inherited, asexual species. In line with recent reports, our study supports the hypothesis that some assumed apomictic Microsporidia did not entirely abstain from the evolutionary advantages of sex. In addition, the presented data indicate that horizontal transmission may occur occasionally. This transmission mode could be a survival strategy of N. granulosis whose host often populates ephemeral habitats. Copyright © 2013 Elsevier B.V. All rights reserved.
Dynamic karyotype evolution and unique sex determination systems in Leptidea wood white butterflies.
Šíchová, Jindra; Voleníková, Anna; Dincă, Vlad; Nguyen, Petr; Vila, Roger; Sahara, Ken; Marec, František
2015-05-19
Chromosomal rearrangements have the potential to limit the rate and pattern of gene flow within and between species and thus play a direct role in promoting and maintaining speciation. Wood white butterflies of the genus Leptidea are excellent models to study the role of chromosome rearrangements in speciation because they show karyotype variability not only among but also within species. In this work, we investigated genome architecture of three cryptic Leptidea species (L. juvernica, L. sinapis and L. reali) by standard and molecular cytogenetic techniques in order to reveal causes of the karyotype variability. Chromosome numbers ranged from 2n = 85 to 91 in L. juvernica and 2n = 69 to 73 in L. sinapis (both from Czech populations) to 2n = 51 to 55 in L. reali (Spanish population). We observed significant differences in chromosome numbers and localization of cytogenetic markers (rDNA and H3 histone genes) within the offspring of individual females. Using FISH with the (TTAGG)n telomeric probe we also documented the presence of multiple chromosome fusions and/or fissions and other complex rearrangements. Thus, the intraspecific karyotype variability is likely due to irregular chromosome segregation of multivalent meiotic configurations. The analysis of female meiotic chromosomes by GISH and CGH revealed multiple sex chromosomes: W1W2W3Z1Z2Z3Z4 in L. juvernica, W1W2W3Z1Z2Z3 in L. sinapis and W1W2W3W4Z1Z2Z3Z4 in L. reali. Our results suggest a dynamic karyotype evolution and point to the role of chromosomal rearrangements in the speciation of Leptidea butterflies. Moreover, our study revealed a curious sex determination system with 3-4 W and 3-4 Z chromosomes, which is unique in the Lepidoptera and which could also have played a role in the speciation process of the three Leptidea species.
Korf, Bruce R
2013-01-01
The "neurofibromatoses" are a set of distinct genetic disorders that have in common the occurrence of tumors of the nerve sheath. They include NF1, NF2, and schwannomatosis. All are dominantly inherited with a high rate of new mutation and variable expression. NF1 includes effects on multiple systems of the body. The major NF1-associated tumor is the neurofibroma. In addition, clinical manifestations include bone dysplasia, learning disabilities, and an increased risk of malignancy. NF2 includes schwannomas of multiple cranial and spinal nerves, especially the vestibular nerve, as well as other tumors such as meningiomas and ependymomas. The schwannomatosis phenotype is limited to multiple schwannomas, and usually presents with pain. The genes that underlie each of the disorders are known: NF1 for neurofibromatosis type 1, NF2 for neurofibromatosis type 2, and INI1/SMARCB1 for schwannomatosis. Genetic testing is possible to identify mutations. Insights into pathogenesis are beginning to suggest new treatment strategies, and therapeutic trials with several new forms of treatment are underway. Copyright © 2013 Elsevier B.V. All rights reserved.
HIV-1 Genetic Variability in Cuba and Implications for Transmission and Clinical Progression.
Blanco, Madeline; Machado, Liuber Y; Díaz, Héctor; Ruiz, Nancy; Romay, Dania; Silva, Eladio
2015-10-01
INTRODUCTION Serological and molecular HIV-1 studies in Cuba have shown very low prevalence of seropositivity, but an increasing genetic diversity attributable to introduction of many HIV-1 variants from different areas, exchange of such variants among HIV-positive people with several coinciding routes of infection and other epidemiologic risk factors in the seropositive population. The high HIV-1 genetic variability observed in Cuba has possible implications for transmission and clinical progression. OBJECTIVE Study genetic variability for the HIV-1 env, gag and pol structural genes in Cuba; determine the prevalence of B and non-B subtypes according to epidemiologic and behavioral variables and determine whether a relationship exists between genetic variability and transmissibility, and between genetic variability and clinical disease progression in people living with HIV/AIDS. METHODS Using two molecular assays (heteroduplex mobility assay and nucleic acid sequencing), structural genes were characterized in 590 people with HIV-1 (480 men and 110 women), accounting for 3.4% of seropositive individuals in Cuba as of December 31, 2013. Nonrandom sampling, proportional to HIV prevalence by province, was conducted. Relationships between molecular results and viral factors, host characteristics, and patients' clinical, epidemiologic and behavioral variables were studied for molecular epidemiology, transmission, and progression analyses. RESULTS Molecular analysis of the three HIV-1 structural genes classified 297 samples as subtype B (50.3%), 269 as non-B subtypes (45.6%) and 24 were not typeable. Subtype B prevailed overall and in men, mainly in those who have sex with men. Non-B subtypes were prevalent in women and heterosexual men, showing multiple circulating variants and recombinant forms. Sexual transmission was the predominant form of infection for all. B and non-B subtypes were encountered throughout Cuba. 
No association was found between subtypes and transmission or clinical progression, although the proportion of deaths was higher for subtype B. Among those who died during the study period, there were no differences between subtypes in the mean time from HIV or AIDS diagnosis to death. CONCLUSIONS Our results suggest that B and non-B HIV-1 subtypes found in Cuba do not differ in transmissibility and in clinical disease progression. KEYWORDS HIV-1, AIDS, molecular epidemiology, transmissibility, clinical progression, subtypes, circulating recombinant forms, pathogenesis, Cuba.
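As a quick consistency check on the figures reported in the abstract above, the sample composition and subtype shares can be recomputed directly. This is a minimal sketch using only the counts stated in the text; the variable names and the `percent` helper are illustrative assumptions, not part of the study.

```python
# Counts exactly as reported in the abstract (names are illustrative).
men, women = 480, 110
subtype_counts = {"B": 297, "non-B": 269, "not typeable": 24}

sampled = men + women  # total people with HIV-1 in the sample


def percent(part, whole):
    """Share of `whole` made up by `part`, rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Percentage of the sample falling in each subtype category.
shares = {name: percent(count, sampled) for name, count in subtype_counts.items()}

print(sampled)
print(shares)
```

The three categories account for every sampled individual, and the recomputed shares for subtype B and non-B subtypes agree with the 50.3% and 45.6% quoted in the RESULTS section.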
Database of cattle candidate genes and genetic markers for milk production and mastitis
Ogorevc, J; Kunej, T; Razpet, A; Dovc, P
2009-01-01
A cattle database of candidate genes and genetic markers for milk production and mastitis has been developed to provide an integrated research tool incorporating different types of information supporting a genomic approach to study lactation, udder development and health. The database contains 943 genes and genetic markers involved in mammary gland development and function, representing candidates for further functional studies. The candidate loci were drawn on a genetic map to reveal positional overlaps. For identification of candidate loci, data from seven different research approaches were exploited: (i) gene knockouts or transgenes in mice that result in specific phenotypes associated with mammary gland (143 loci); (ii) cattle QTL for milk production (344) and mastitis related traits (71); (iii) loci with sequence variations that show specific allele-phenotype interactions associated with milk production (24) or mastitis (10) in cattle; (iv) genes with expression profiles associated with milk production (207) or mastitis (107) in cattle or mouse; (v) cattle milk protein genes that exist in different genetic variants (9); (vi) miRNAs expressed in bovine mammary gland (32) and (vii) epigenetically regulated cattle genes associated with mammary gland function (1). Forty-four genes found by multiple independent analyses were suggested as the most promising candidates and were further analysed in silico for expression levels in lactating mammary gland, genetic variability and top biological functions in functional networks. A miRNA target search for mammary gland expressed miRNAs identified 359 putative binding sites in 3′UTRs of candidate genes. PMID:19508288
Li, Linlin; Gao, Kaiping; Zhao, Jingzhi; Feng, Tianping; Yin, Lei; Wang, Jinjin; Wang, Chongjian; Li, Chunyang; Wang, Yan; Wang, Qian; Zhai, Yujia; You, Haifei; Ren, Yongcheng; Wang, Bingyuan; Hu, Dongsheng
2014-01-25
Few genome-wide association studies have considered interactions between multiple genetic variants and environmental factors associated with disease. The interaction was examined between a glucagon gene (GCG) polymorphism and smoking, alcohol consumption and physical activity and the association with risk of type 2 diabetes mellitus (T2DM) in a case-control study of Chinese Han subjects. The rs12104705 polymorphism of GCG and interactions with environmental variables were analyzed for 9619 participants by binary multiple logistic regression. Smoking with the C-C haplotype of rs12104705 was associated with increased risk of T2DM (OR=1.174, 95% CI=1.013-1.361). Moderate and high physical activity with the C-C genotype was associated with decreased risk of T2DM as compared with low physical activity with the genotype (OR=0.251, 95% CI=0.206-0.306 and OR=0.190, 95% CI=0.164-0.220). However, the interaction of drinking and genotype was not associated with risk of T2DM. Genetic polymorphism in rs12104705 of GCG may interact with smoking and physical activity to modify the risk of T2DM. © 2013.
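Odds ratios from logistic regression are normally reported with Wald confidence intervals that are symmetric on the log scale, so the standard error of the log odds ratio can be recovered from a published interval. The sketch below applies that to the smoking result quoted above. It assumes a Wald-type 95% interval, which the abstract does not state explicitly, and the function name and returned fields are hypothetical.

```python
import math


def log_or_summary(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover log-scale quantities from a reported OR and its 95% CI.

    Assumes a Wald interval: log(OR) +/- z_crit * SE.
    """
    log_or = math.log(odds_ratio)
    # Half the log-scale interval width equals z_crit * SE.
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    # For a Wald interval the log-scale midpoint should match log(OR).
    midpoint = (math.log(ci_low) + math.log(ci_high)) / 2
    return {"log_or": log_or, "se": se, "z": log_or / se, "midpoint": midpoint}


# Smoking with the C-C haplotype, as reported: OR=1.174, 95% CI=1.013-1.361.
smoking = log_or_summary(1.174, 1.013, 1.361)
print(smoking)
```

The log-scale midpoint of the interval lands very close to log(1.174), which is what a Wald interval would produce, and the implied z statistic exceeds 1.96, consistent with an interval that excludes 1.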
McKay, Fiona C; Gatt, Prudence N; Fewings, Nicole; Parnell, Grant P; Schibeci, Stephen D; Basuki, Monica A I; Powell, Joseph E; Goldinger, Anita; Fabis-Pedrini, Marzena J; Kermode, Allan G; Burke, Therese; Vucic, Steve; Stewart, Graeme J; Booth, David R
2016-02-01
Multiple Sclerosis (MS) is an autoimmune disease treated by therapies targeting peripheral blood cells. We previously identified that expression of two MS-risk genes, the transcription factors EOMES and TBX21 (ET), was low in blood from MS and stable over time. Here we replicated the low ET expression in a new MS cohort (p<0.0007 for EOMES, p<0.028 for TBX21) and demonstrate longitudinal stability (p<10⁻⁴) and high heritability (h²=0.48 for EOMES) for this molecular phenotype. Genes whose expression correlated with ET, especially those controlling cell migration, further defined the phenotype. CD56+ cells and other subsets expressed lower levels of Eomes or T-bet protein and/or were under-represented in MS. EOMES and TBX21 risk SNP genotypes, and serum EBNA-1 titres were not correlated with ET expression, but HLA-DRB1*1501 genotype was. ET expression was normalised to healthy control levels with natalizumab, and was highly variable for glatiramer acetate, fingolimod, interferon-beta, dimethyl fumarate. Crown Copyright © 2016. Published by Elsevier Inc. All rights reserved.
Cell cycle gene expression networks discovered using systems biology: Significance in carcinogenesis
Scott, RE; Ghule, PN; Stein, JL; Stein, GS
2015-01-01
The early stages of carcinogenesis are linked to defects in the cell cycle. A series of cell cycle checkpoints are involved in this process. The G1/S checkpoint that serves to integrate the control of cell proliferation and differentiation is linked to carcinogenesis and the mitotic spindle checkpoint with the development of chromosomal instability. This paper presents the outcome of systems biology studies designed to evaluate if networks of covariate cell cycle gene transcripts exist in proliferative mammalian tissues including mice, rats and humans. The GeneNetwork website that contains numerous gene expression datasets from different species, sexes and tissues represents the foundational resource for these studies (www.genenetwork.org). In addition, WebGestalt, a gene ontology tool, facilitated the identification of expression networks of genes that co-vary with key cell cycle targets, especially Cdc20 and Plk1 (www.bioinfo.vanderbilt.edu/webgestalt). Cell cycle expression networks of such covariate mRNAs exist in multiple proliferative tissues including liver, lung, pituitary, adipose and lymphoid tissues among others but not in brain or retina that have low proliferative potential. Sixty-three covariate cell cycle gene transcripts (mRNAs) compose the average cell cycle network with p = e−13 to e−36. Cell cycle expression networks show species, sex and tissue variability and they are enriched in mRNA transcripts associated with mitosis many of which are associated with chromosomal instability. PMID:25808367
Poole, William; Leinonen, Kalle; Shmulevich, Ilya
2017-01-01
Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C. PMID:28170390
Poole, William; Leinonen, Kalle; Shmulevich, Ilya; Knijnenburg, Theo A; Bernard, Brady
2017-02-01
Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C.
Wikswo, J P; Prokop, A; Baudenbacher, F; Cliffel, D; Csukas, B; Velkovsky, M
2006-08-01
Systems biology, i.e. quantitative, postgenomic, postproteomic, dynamic, multiscale physiology, addresses in an integrative, quantitative manner the shockwave of genetic and proteomic information using computer models that may eventually have 10⁶ dynamic variables with non-linear interactions. Historically, single biological measurements are made over minutes, suggesting the challenge of specifying 10⁶ model parameters. Except for fluorescence and micro-electrode recordings, most cellular measurements have inadequate bandwidth to discern the time course of critical intracellular biochemical events. Micro-array expression profiles of thousands of genes cannot determine quantitative dynamic cellular signalling and metabolic variables. Major gaps must be bridged between the computational vision and experimental reality. The analysis of cellular signalling dynamics and control requires, first, micro- and nano-instruments that measure simultaneously multiple extracellular and intracellular variables with sufficient bandwidth; secondly, the ability to open existing internal control and signalling loops; thirdly, external BioMEMS micro-actuators that provide high bandwidth feedback and externally addressable intracellular nano-actuators; and, fourthly, real-time, closed-loop, single-cell control algorithms. The unravelling of the nested and coupled nature of cellular control loops requires simultaneous recording of multiple single-cell signatures. Externally controlled nano-actuators, needed to effect changes in the biochemical, mechanical and electrical environment both outside and inside the cell, will provide a major impetus for nanoscience.
Li, Ziyi; Safo, Sandra E; Long, Qi
2017-07-11
Sparse principal component analysis (PCA) is a popular tool for dimensionality reduction, pattern recognition, and visualization of high dimensional data. It has been recognized that complex biological mechanisms occur through concerted relationships of multiple genes working in networks that are often represented by graphs. Recent work has shown that incorporating such biological information improves feature selection and prediction performance in regression analysis, but there has been limited work on extending this approach to PCA. In this article, we propose two new sparse PCA methods called Fused and Grouped sparse PCA that enable incorporation of prior biological information in variable selection. Our simulation studies suggest that, compared to existing sparse PCA methods, the proposed methods achieve higher sensitivity and specificity when the graph structure is correctly specified, and are fairly robust to misspecified graph structures. Application to a glioblastoma gene expression dataset identified pathways that are suggested in the literature to be related with glioblastoma. The proposed sparse PCA methods Fused and Grouped sparse PCA can effectively incorporate prior biological information in variable selection, leading to improved feature selection and more interpretable principal component loadings and potentially providing insights on molecular underpinnings of complex diseases.
Multiple Site-Directed and Saturation Mutagenesis by the Patch Cloning Method.
Taniguchi, Naohiro; Murakami, Hiroshi
2017-01-01
Constructing protein-coding genes with desired mutations is a basic step for protein engineering. Herein, we describe a multiple site-directed and saturation mutagenesis method, termed MUPAC. This method has been used to introduce multiple site-directed mutations in the green fluorescent protein gene and in the moloney murine leukemia virus reverse transcriptase gene. Moreover, this method was also successfully used to introduce randomized codons at five desired positions in the green fluorescent protein gene, and for simple DNA assembly for cloning.
Population ecology of nitrifying archaea and bacteria in the Southern California Bight.
Beman, J Michael; Sachdeva, Rohan; Fuhrman, Jed A
2010-05-01
Marine Crenarchaeota are among the most abundant microbial groups in the ocean, and although relatively little is currently known about their biogeochemical roles in marine ecosystems, recognition that Crenarchaeota possess ammonia monooxygenase (amoA) genes and may act as ammonia-oxidizing archaea (AOA) offers another means of probing the ecology of these microorganisms. Here we use a time series approach combining quantification of archaeal and bacterial ammonia oxidizers with bacterial community fingerprints and biogeochemistry to explore the population and community ecology of nitrification. At multiple depths (150, 500 and 890 m) in the Southern California Bight sampled monthly from 2003 to 2006, AOA were enumerated via quantitative PCR of archaeal amoA and marine group 1 Crenarchaeota 16S rRNA genes. Based on amoA genes, AOA were highly variable in time, a consistent feature of marine Crenarchaeota; however, average values were similar at different depths and ranged from 2.20 to 2.76 x 10^4 amoA copies ml^-1. Archaeal amoA genes were correlated with Crenarchaeota 16S rRNA genes (r^2 = 0.79) and the slope of this relationship was 1.02, demonstrating that the majority of marine group 1 Crenarchaeota present over the dates and depths sampled possessed amoA. Two AOA clades were specifically quantified and compared with betaproteobacterial ammonia-oxidizing bacteria (beta-AOB) amoA genes at 150 m; these AOA groups were found to strongly co-vary in time (r^2 = 0.70, P < 0.001) whereas AOA : beta-AOB ratios ranged from 13 to 5630. Increases in the AOA : beta-AOB ratio correlated with the accumulation of nitrite (r^2 = 0.87, P < 0.001), and may be indicative of differences in substrate affinities and activities leading to periodic decoupling between ammonia and nitrite oxidation. These data capture a dynamic nitrogen cycle in which multiple microbial groups appear to be active participants.
Meta-analysis identifies gene-by-environment interactions as demonstrated in a study of 4,965 mice.
Kang, Eun Yong; Han, Buhm; Furlotte, Nicholas; Joo, Jong Wha J; Shih, Diana; Davis, Richard C; Lusis, Aldons J; Eskin, Eleazar
2014-01-01
Identifying environmentally-specific genetic effects is a key challenge in understanding the structure of complex traits. Model organisms play a crucial role in the identification of such gene-by-environment interactions, as a result of the unique ability to observe genetically similar individuals across multiple distinct environments. Many model organism studies examine the same traits but under varying environmental conditions. For example, knock-out or diet-controlled studies are often used to examine cholesterol in mice. These studies, when examined in aggregate, provide an opportunity to identify genomic loci exhibiting environmentally-dependent effects. However, the straightforward application of traditional methodologies to aggregate separate studies suffers from several problems. First, environmental conditions are often variable and do not fit the standard univariate model for interactions. Additionally, applying a multivariate model results in increased degrees of freedom and low statistical power. In this paper, we jointly analyze multiple studies with varying environmental conditions using a meta-analytic approach based on a random effects model to identify loci involved in gene-by-environment interactions. Our approach is motivated by the observation that methods for discovering gene-by-environment interactions are closely related to random effects models for meta-analysis. We show that interactions can be interpreted as heterogeneity and can be detected without utilizing the traditional uni- or multi-variate approaches for discovery of gene-by-environment interactions. We apply our new method to combine 17 mouse studies containing in aggregate 4,965 distinct animals. We identify 26 significant loci involved in High-density lipoprotein (HDL) cholesterol, many of which are consistent with previous findings. Several of these loci show significant evidence of involvement in gene-by-environment interactions. 
An additional advantage of our meta-analysis approach is that our combined study has significantly higher power and improved resolution compared to any single study thus explaining the large number of loci discovered in the combined study.
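The link between interaction and heterogeneity can be illustrated with the textbook random-effects quantities: Cochran's Q and the DerSimonian-Laird between-study variance. This is a sketch of those standard statistics only, not the authors' actual test; the function name and the per-study effect sizes and standard errors below are made up for illustration.

```python
def heterogeneity(betas, ses):
    """Return (fixed-effect estimate, Cochran's Q, DerSimonian-Laird tau^2).

    betas: per-study effect estimates at one locus (hypothetical inputs)
    ses:   their standard errors
    """
    if len(betas) != len(ses) or len(betas) < 2:
        raise ValueError("need at least two studies with matching inputs")
    weights = [1.0 / se ** 2 for se in ses]  # inverse-variance weights
    w_sum = sum(weights)
    beta_fixed = sum(w * b for w, b in zip(weights, betas)) / w_sum
    # Q measures how far the studies scatter around the pooled effect.
    q = sum(w * (b - beta_fixed) ** 2 for w, b in zip(weights, betas))
    k = len(betas)
    c = w_sum - sum(w ** 2 for w in weights) / w_sum
    # Method-of-moments estimate of between-study variance, floored at zero.
    tau2 = max(0.0, (q - (k - 1)) / c)
    return beta_fixed, q, tau2

# Three toy studies whose effects differ by far more than their errors.
beta_fixed, q, tau2 = heterogeneity([0.0, 2.0, 4.0], [1.0, 1.0, 1.0])
```

A large Q relative to its degrees of freedom (number of studies minus one), or equivalently a positive tau^2, indicates that the genetic effect varies across studies. When the studies differ mainly in environment, that variation is what the abstract interprets as a gene-by-environment interaction.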
Meta-Analysis Identifies Gene-by-Environment Interactions as Demonstrated in a Study of 4,965 Mice
Joo, Jong Wha J.; Shih, Diana; Davis, Richard C.; Lusis, Aldons J.; Eskin, Eleazar
2014-01-01
Identifying environmentally-specific genetic effects is a key challenge in understanding the structure of complex traits. Model organisms play a crucial role in the identification of such gene-by-environment interactions, as a result of the unique ability to observe genetically similar individuals across multiple distinct environments. Many model organism studies examine the same traits but under varying environmental conditions. For example, knock-out or diet-controlled studies are often used to examine cholesterol in mice. These studies, when examined in aggregate, provide an opportunity to identify genomic loci exhibiting environmentally-dependent effects. However, the straightforward application of traditional methodologies to aggregate separate studies suffers from several problems. First, environmental conditions are often variable and do not fit the standard univariate model for interactions. Additionally, applying a multivariate model results in increased degrees of freedom and low statistical power. In this paper, we jointly analyze multiple studies with varying environmental conditions using a meta-analytic approach based on a random effects model to identify loci involved in gene-by-environment interactions. Our approach is motivated by the observation that methods for discovering gene-by-environment interactions are closely related to random effects models for meta-analysis. We show that interactions can be interpreted as heterogeneity and can be detected without utilizing the traditional uni- or multi-variate approaches for discovery of gene-by-environment interactions. We apply our new method to combine 17 mouse studies containing in aggregate 4,965 distinct animals. We identify 26 significant loci involved in High-density lipoprotein (HDL) cholesterol, many of which are consistent with previous findings. Several of these loci show significant evidence of involvement in gene-by-environment interactions. 
An additional advantage of our meta-analysis approach is that our combined study has significantly higher power and improved resolution compared to any single study thus explaining the large number of loci discovered in the combined study. PMID:24415945
Diversity of the human intestinal microbial flora.
Eckburg, Paul B; Bik, Elisabeth M; Bernstein, Charles N; Purdom, Elizabeth; Dethlefsen, Les; Sargent, Michael; Gill, Steven R; Nelson, Karen E; Relman, David A
2005-06-10
The human endogenous intestinal microflora is an essential "organ" in providing nourishment, regulating epithelial development, and instructing innate immunity; yet, surprisingly, basic features remain poorly described. We examined 13,355 prokaryotic ribosomal RNA gene sequences from multiple colonic mucosal sites and feces of healthy subjects to improve our understanding of gut microbial diversity. A majority of the bacterial sequences corresponded to uncultivated species and novel microorganisms. We discovered significant intersubject variability and differences between stool and mucosa community composition. Characterization of this immensely diverse ecosystem is the first step in elucidating its role in health and disease.
The Challenges of Measuring Glycemic Variability
Rodbard, David
2012-01-01
This commentary reviews several of the challenges encountered when attempting to quantify glycemic variability and correlate it with risk of diabetes complications. These challenges include (1) immaturity of the field, including problems of data accuracy, precision, reliability, cost, and availability; (2) larger relative error in the estimates of glycemic variability than in the estimates of the mean glucose; (3) high correlation between glycemic variability and mean glucose level; (4) multiplicity of measures; (5) correlation of the multiple measures; (6) duplication or reinvention of methods; (7) confusion of measures of glycemic variability with measures of quality of glycemic control; (8) the problem of multiple comparisons when assessing relationships among multiple measures of variability and multiple clinical end points; and (9) differing needs for routine clinical practice and clinical research applications. PMID:22768904
Yancopoulos, G D; Blackwell, T K; Suh, H; Hood, L; Alt, F W
1986-01-31
We have recently proposed that a common recombinase performs all of the many variable region gene assembly events in B and T cells, and that the specificity of these joining events is mediated by regulating the "accessibility" of the involved gene segments. To test this possibility, we have introduced "accessible" T cell receptor (TCR) variable region gene segments into a pre-B cell line capable of recombining endogenous and transfected immunoglobulin (Ig) variable region gene segments. Although the corresponding "inaccessible" endogenous TCR gene segments do not rearrange in this line or in B cells in general, the introduced TCR gene segments join very frequently and, in fact, closely resemble introduced Ig gene segments in their recombination characteristics. These observations suggest a new role for conventional Ig transcriptional enhancers--recombinational enhancement. Our studies provide insight into additional aspects of the joining mechanism such as N region insertion, aberrant joining, and recombination-recognition sequence requirements for joining.
Novel mutations in IBA57 are associated with leukodystrophy and variable clinical phenotypes.
Torraco, Alessandra; Ardissone, Anna; Invernizzi, Federica; Rizza, Teresa; Fiermonte, Giuseppe; Niceta, Marcello; Zanetti, Nadia; Martinelli, Diego; Vozza, Angelo; Verrigni, Daniela; Di Nottia, Michela; Lamantea, Eleonora; Diodato, Daria; Tartaglia, Marco; Dionisi-Vici, Carlo; Moroni, Isabella; Farina, Laura; Bertini, Enrico; Ghezzi, Daniele; Carrozzo, Rosalba
2017-01-01
Defects of the Fe/S cluster biosynthesis represent a subgroup of diseases affecting the mitochondrial energy metabolism. In the last years, mutations in four genes (NFU1, BOLA3, ISCA2 and IBA57) have been related to a new group of multiple mitochondrial dysfunction syndromes characterized by lactic acidosis, hyperglycinemia, multiple defects of the respiratory chain complexes, and impairment of four lipoic acid-dependent enzymes: α-ketoglutarate dehydrogenase complex, pyruvic dehydrogenase, branched-chain α-keto acid dehydrogenase complex and the H protein of the glycine cleavage system. Few patients have been reported with mutations in IBA57 and with variable clinical phenotype. Herein, we describe four unrelated patients carrying novel mutations in IBA57. All patients presented with combined or isolated defect of complex I and II. Clinical features varied widely, ranging from fatal infantile onset of the disease to acute and severe psychomotor regression after the first year of life. Brain MRI was characterized by cavitating leukodystrophy. The identified mutations were never reported previously and all had a dramatic effect on IBA57 stability. Our study contributes to expand the array of the genotypic variation of IBA57 and delineates the leukodystrophic pattern of IBA57 deficient patients.
Heterogeneous Stock Rat: A Unique Animal Model for Mapping Genes Influencing Bone Fragility
Alam, Imranul; Koller, Daniel L.; Sun, Qiwei; Roeder, Ryan K.; Cañete, Toni; Blázquez, Gloria; López-Aumatell, Regina; Martínez-Membrives, Esther; Vicens-Costa, Elia; Mont, Carme; Díaz, Sira; Tobeña, Adolf; Fernández-Teruel, Alberto; Whitley, Adam; Strid, Pernilla; Diez, Margarita; Johannesson, Martina; Flint, Jonathan; Econs, Michael J.; Turner, Charles H.; Foroud, Tatiana
2011-01-01
Previously, we demonstrated that skeletal mass, structure and biomechanical properties vary considerably among 11 different inbred rat strains. Subsequently, we performed quantitative trait loci (QTL) analysis in 4 inbred rat strains (F344, LEW, COP and DA) for different bone phenotypes and identified several candidate genes influencing various bone traits. The standard approach to narrowing QTL intervals down to a few candidate genes typically employs the generation of congenic lines, which is time consuming and often not successful. A potential alternative approach is to use a highly genetically informative animal model resource capable of delivering very high-resolution gene mapping such as Heterogeneous stock (HS) rat. HS rat was derived from eight inbred progenitors: ACI/N, BN/SsN, BUF/N, F344/N, M520/N, MR/N, WKY/N and WN/N. The genetic recombination pattern generated across 50 generations in these rats has been shown to deliver ultra-high even gene-level resolution for complex genetic studies. The purpose of this study is to investigate the usefulness of the HS rat model for fine mapping and identification of genes underlying bone fragility phenotypes. We compared bone geometry, density and strength phenotypes at multiple skeletal sites in HS rats with those obtained from 5 of the 8 progenitor inbred strains. In addition, we estimated the heritability for different bone phenotypes in these rats and employed principal component analysis to explore relationships among bone phenotypes in the HS rats. Our study demonstrates that significant variability exists for different skeletal phenotypes in HS rats compared with their inbred progenitors. In addition, we estimated high heritability for several bone phenotypes and biologically interpretable factors explaining significant overall variability, suggesting that the HS rat model could be a unique genetic resource for rapid and efficient discovery of the genetic determinants of bone fragility. PMID:21334473
Heterogeneous stock rat: a unique animal model for mapping genes influencing bone fragility.
Alam, Imranul; Koller, Daniel L; Sun, Qiwei; Roeder, Ryan K; Cañete, Toni; Blázquez, Gloria; López-Aumatell, Regina; Martínez-Membrives, Esther; Vicens-Costa, Elia; Mont, Carme; Díaz, Sira; Tobeña, Adolf; Fernández-Teruel, Alberto; Whitley, Adam; Strid, Pernilla; Diez, Margarita; Johannesson, Martina; Flint, Jonathan; Econs, Michael J; Turner, Charles H; Foroud, Tatiana
2011-05-01
Previously, we demonstrated that skeletal mass, structure and biomechanical properties vary considerably among 11 different inbred rat strains. Subsequently, we performed quantitative trait loci (QTL) analysis in four inbred rat strains (F344, LEW, COP and DA) for different bone phenotypes and identified several candidate genes influencing various bone traits. The standard approach to narrowing QTL intervals down to a few candidate genes typically employs the generation of congenic lines, which is time consuming and often not successful. A potential alternative approach is to use a highly genetically informative animal model resource capable of delivering very high resolution gene mapping such as Heterogeneous stock (HS) rat. HS rat was derived from eight inbred progenitors: ACI/N, BN/SsN, BUF/N, F344/N, M520/N, MR/N, WKY/N and WN/N. The genetic recombination pattern generated across 50 generations in these rats has been shown to deliver ultra-high even gene-level resolution for complex genetic studies. The purpose of this study is to investigate the usefulness of the HS rat model for fine mapping and identification of genes underlying bone fragility phenotypes. We compared bone geometry, density and strength phenotypes at multiple skeletal sites in HS rats with those obtained from five of the eight progenitor inbred strains. In addition, we estimated the heritability for different bone phenotypes in these rats and employed principal component analysis to explore relationships among bone phenotypes in the HS rats. Our study demonstrates that significant variability exists for different skeletal phenotypes in HS rats compared with their inbred progenitors. In addition, we estimated high heritability for several bone phenotypes and biologically interpretable factors explaining significant overall variability, suggesting that the HS rat model could be a unique genetic resource for rapid and efficient discovery of the genetic determinants of bone fragility. 
Ishii, Jun; Kondo, Takashi; Makino, Harumi; Ogura, Akira; Matsuda, Fumio; Kondo, Akihiko
2014-05-01
Yeast has the potential to be used in bulk-scale fermentative production of fuels and chemicals due to its tolerance for low pH and robustness for autolysis. However, expression of multiple external genes in one host yeast strain is considerably labor-intensive due to the lack of polycistronic transcription. To promote the metabolic engineering of yeast, we generated systematic and convenient genetic engineering tools to express multiple genes in Saccharomyces cerevisiae. We constructed a series of multi-copy and integration vector sets for concurrently expressing two or three genes in S. cerevisiae by embedding three classical promoters. The comparative expression capabilities of the constructed vectors were monitored with green fluorescent protein, and the concurrent expression of genes was monitored with three different fluorescent proteins. Our multiple gene expression tool will be helpful to the advanced construction of genetically engineered yeast strains in a variety of research fields other than metabolic engineering. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Delvaux, Elaine; Mastroeni, Diego; Nolz, Jennifer; Chow, Nienwen; Sabbagh, Marwan; Caselli, Richard J; Reiman, Eric M; Marshall, Frederick J; Coleman, Paul D
2017-10-01
The need for a reliable, simple, and inexpensive blood test for Alzheimer's disease (AD) suitable for use in a primary care setting is widely recognized. This has led to a large number of publications describing blood tests for AD, which have, for the most part, not been replicable. We have chosen to examine transcripts expressed by the cellular, leukocyte compartment of blood. We have used hypothesis-based cDNA arrays and quantitative PCR to quantify the expression of selected sets of genes followed by multivariate analyses in multiple independent samples. Rather than a single study with no replicates, we chose an experimental design in which there were multiple replicates using different platforms and different sample populations. We have divided 177 blood samples and 27 brain samples into multiple replicates to demonstrate the ability to distinguish early clinical AD (Clinical Dementia Rating scale 0.5), Parkinson's disease (PD), and cognitively unimpaired APOE4 homozygotes, as well as to determine persons at risk for future cognitive impairment with significant accuracy. We assess our methods in a training/test set and also show that the variables we use distinguish AD, PD, and control brain. Importantly, we describe the variability of the weights assigned to individual transcripts in multivariate analyses in repeated studies and suggest that the variability we describe may be the cause of inability to repeat many earlier studies. Our data constitute a proof of principle that multivariate analysis of the transcriptome related to cell stress and inflammation of peripheral blood leukocytes has significant potential as a minimally invasive and inexpensive diagnostic tool for diagnosis and early detection of risk for AD. Copyright © 2017 Elsevier Inc. All rights reserved.
Salanti, Ali; Lavstsen, Thomas; Nielsen, Morten A.; Theander, Thor G.; Leke, Rose G. F.; Lo, Yeung Y.; Bobbili, Naveen; Arnot, David E.; Taylor, Diane W.
2011-01-01
Placental malaria infections are caused by Plasmodium falciparum–infected red blood cells sequestering in the placenta by binding to chondroitin sulfate A, mediated by VAR2CSA, a variant of the PfEMP1 family of adhesion antigens. Recent studies have shown that many P. falciparum genomes have multiple genes coding for different VAR2CSA proteins, and parasites with >1 var2csa gene appear to be more common in pregnant women with placental malaria than in nonpregnant individuals. We present evidence that, in pregnant women, parasites containing multiple var2csa-type genes possess a selective advantage over parasites with a single var2csa gene. Accumulation of parasites with multiple copies of the var2csa gene during the course of pregnancy was also correlated with the development of antibodies involved in blocking VAR2CSA adhesion. The data suggest that multiplicity of var2csa-type genes enables P. falciparum parasites to persist for a longer period of time during placental infections, probably because of their greater capacity for antigenic variation and evasion of variant-specific immune responses. PMID:21592998
Rodríguez-Blanco, Arturo; Lemos, Manuel L; Osorio, Carlos R
2016-08-01
Integrating conjugative elements (ICEs) of the SXT/R391 family have been identified in fish-isolated bacterial strains collected from marine aquaculture environments of the northwestern Iberian Peninsula. Here we analysed the variable regions of two ICEs, one preliminarily characterised in a previous study (ICEVscSpa3) and one newly identified (ICEPspSpa1). Bacterial strains harboring these ICEs were phylogenetically assigned to Vibrio scophthalmi and Pseudoalteromonas sp., thus constituting the first evidence of SXT/R391-like ICEs in the genus Pseudoalteromonas to date. Variable DNA regions, which confer element-specific properties to ICEs of this family, were characterised. Interestingly, the two ICEs contained 29 genes not found in variable DNA insertions of previously described ICEs. Most notably, variable gene content for ICEVscSpa3 showed similarity to genes potentially involved in housekeeping functions of replication, nucleotide metabolism and transcription. For these genes, closest homologues were found clustered in the genome of Pseudomonas psychrotolerans L19, suggesting a transfer as a block to ICEVscSpa3. Genes encoding antibiotic resistance, restriction modification systems and toxin/antitoxin systems were absent from hotspots of ICEVscSpa3. In contrast, the variable gene content of ICEPspSpa1 included genes involved in restriction/modification functions in two different hotspots and genes related to ICE maintenance. The present study unveils a relatively large number of novel genes in SXT/R391-ICEs, and demonstrates the major role of ICE elements as contributors to horizontal gene transfer.
Health-related disparities: influence of environmental factors.
Olden, Kenneth; White, Sandra L
2005-07-01
Racial disparities in health cannot be explained solely on the basis of poverty, access to health care, behavior, or environmental factors. Their complex etiology is dependent on interactions between all these factors plus genetics. Scientists have been slow to consider genetics as a risk factor because genetic polymorphisms tend to be more variable within a race than between races. Now that studies are demonstrating the existence of racial differences in allelic frequencies for multiple genes affecting a single biologic mechanism, the present argument for a significant genetic role in contributing to health disparities is gaining support. Individuals vary, often significantly, in their response to environmental agents. This variability provides a high "background noise" when scientists examine human populations to identify environmental links to disease. This variability often masks important environmental contributors to disease risk and is a major impediment to efforts to investigate the causes of diseases. Fortunately, investments in the various genome projects have led to the development of tools and databases that can be used to help identify the genetic variations in environmental response genes that can lead to such wide differences in disease susceptibility. NIEHS developed the environmental genome project to catalog these genetic variants (polymorphisms) and to identify the ones that play a major role in human susceptibility to environmental agents. This information is being used in epidemiologic studies to pinpoint environmental contributors to disease better. The research summarized in this article is critically important for tying genetics and the environment to health disparities, and for the development of a rational approach to gauge environmental threats. Common variants in genes play pivotal roles in determining if or when illness or death result from exposure to drugs or environmental xenobiotics. 
Most common variants exist in all human populations, but their frequency can vary substantially, rendering individuals or groups more or less susceptible to particular environmental exposures. Such findings are consistent with the highly publicized analogy, "genetics loads the gun, but the environment pulls the trigger." That is, one can inherit the genetic predisposition to develop a disease but will do so only if or when exposed to the environmental trigger. Poor people have approximately the same genetic makeup as everyone else, but they have the unfortunate experience of living and working in environments containing multiple and high levels of carcinogens or other toxicants capable of interacting with susceptibility genes to cause disease. Furthermore, certain disadvantaged ethnic groups may have a higher incidence of certain susceptible genes that render them more vulnerable to adverse effects of the environments they inhabit. For both of these reasons, much of the nation's disease burden could likely be reduced through better environmental protection practices, especially in low-income and minority communities. Of the many implications of polymorphisms and frequency variations for public health and the practice of medicine, however, none is more urgent than the choice of drugs in therapy. Using such knowledge, randomized trials have identified race-specific drug response differences between blacks and whites [42]. To date, most knowledge of the health effects of environmental factors is derived from studies of single agents. The reality, though, is that environmental contributions to health disparities are mostly from multiple agents. These simultaneous exposures to multiple risk factors, which may accumulate or interact synergistically, remain to be fully explained and defined. Finally, health disparity is a significant public health problem that cannot be solved using "business as usual" approaches for funding and priority setting. 
The current emphasis on basic and clinical research at the exclusion of public health and the social sciences does not provide the interdisciplinary research teams necessary to address such a complex problem as health disparities. Although the poor will always be with us, their health could be greatly improved if social, environmental, and genetic scientists could find ways to collaborate and develop more insightful and relevant ways to address the health of disadvantaged communities.
Repeat-Associated Plasticity in the Helicobacter pylori RD Gene Family
Shak, Joshua R.; Dick, Jonathan J.; Meinersmann, Richard J.; Perez-Perez, Guillermo I.; Blaser, Martin J.
2009-01-01
The bacterium Helicobacter pylori is remarkable for its ability to persist in the human stomach for decades without provoking sterilizing immunity. Since repetitive DNA can facilitate adaptive genomic flexibility via increased recombination, insertion, and deletion, we searched the genomes of two H. pylori strains for nucleotide repeats. We discovered a family of genes with extensive repetitive DNA that we have termed the H. pylori RD gene family. Each gene of this family is composed of a conserved 3′ region, a variable mid-region encoding 7 and 11 amino acid repeats, and a 5′ region containing one of two possible alleles. Analysis of five complete genome sequences and PCR genotyping of 42 H. pylori strains revealed extensive variation between strains in the number, location, and arrangement of RD genes. Furthermore, examination of multiple strains isolated from a single subject's stomach revealed intrahost variation in repeat number and composition. Despite prior evidence that the protein products of this gene family are expressed at the bacterial cell surface, enzyme-linked immunosorbent assay and immunoblot studies revealed no consistent seroreactivity to a recombinant RD protein by H. pylori-positive hosts. The pattern of repeats uncovered in the RD gene family appears to reflect slipped-strand mispairing or domain duplication, allowing for redundancy and subsequent diversity in genotype and phenotype. This novel family of hypervariable genes with conserved, repetitive, and allelic domains may represent an important locus for understanding H. pylori persistence in its natural host. PMID:19749042
Repeat-associated plasticity in the Helicobacter pylori RD gene family.
Shak, Joshua R; Dick, Jonathan J; Meinersmann, Richard J; Perez-Perez, Guillermo I; Blaser, Martin J
2009-11-01
The bacterium Helicobacter pylori is remarkable for its ability to persist in the human stomach for decades without provoking sterilizing immunity. Since repetitive DNA can facilitate adaptive genomic flexibility via increased recombination, insertion, and deletion, we searched the genomes of two H. pylori strains for nucleotide repeats. We discovered a family of genes with extensive repetitive DNA that we have termed the H. pylori RD gene family. Each gene of this family is composed of a conserved 3' region, a variable mid-region encoding 7 and 11 amino acid repeats, and a 5' region containing one of two possible alleles. Analysis of five complete genome sequences and PCR genotyping of 42 H. pylori strains revealed extensive variation between strains in the number, location, and arrangement of RD genes. Furthermore, examination of multiple strains isolated from a single subject's stomach revealed intrahost variation in repeat number and composition. Despite prior evidence that the protein products of this gene family are expressed at the bacterial cell surface, enzyme-linked immunosorbent assay and immunoblot studies revealed no consistent seroreactivity to a recombinant RD protein by H. pylori-positive hosts. The pattern of repeats uncovered in the RD gene family appears to reflect slipped-strand mispairing or domain duplication, allowing for redundancy and subsequent diversity in genotype and phenotype. This novel family of hypervariable genes with conserved, repetitive, and allelic domains may represent an important locus for understanding H. pylori persistence in its natural host.
Promoter architecture dictates cell-to-cell variability in gene expression.
Jones, Daniel L; Brewster, Robert C; Phillips, Rob
2014-12-19
Variability in gene expression among genetically identical cells has emerged as a central preoccupation in the study of gene regulation; however, a divide exists between the predictions of molecular models of prokaryotic transcriptional regulation and genome-wide experimental studies suggesting that this variability is indifferent to the underlying regulatory architecture. We constructed a set of promoters in Escherichia coli in which promoter strength, transcription factor binding strength, and transcription factor copy numbers are systematically varied, and used messenger RNA (mRNA) fluorescence in situ hybridization to observe how these changes affected variability in gene expression. Our parameter-free models predicted the observed variability; hence, the molecular details of transcription dictate variability in mRNA expression, and transcriptional noise is specifically tunable and thus represents an evolutionarily accessible phenotypic parameter. Copyright © 2014, American Association for the Advancement of Science.
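Cell-to-cell variability of this kind is commonly summarized by the Fano factor (variance divided by mean) and the squared coefficient of variation. The sketch below computes both from per-cell mRNA counts. The abstract does not state which noise measure the authors report, so the choice of statistics, the function name and the toy counts are assumptions.

```python
from statistics import mean, variance

def noise_summary(counts):
    """Summarize expression noise from per-cell molecule counts.

    Returns the mean, the Fano factor (variance / mean) and the squared
    coefficient of variation (variance / mean^2), using the sample variance.
    """
    m = mean(counts)
    if m == 0:
        raise ValueError("mean expression is zero; noise is undefined")
    var = variance(counts)  # needs at least two cells
    return {"mean": m, "fano": var / m, "cv2": var / m ** 2}

# Hypothetical mRNA counts from three cells.
summary = noise_summary([1, 3, 5])
```

A Fano factor of 1 is the Poisson expectation for an unregulated, constitutively transcribed gene, so values above 1 are usually read as excess noise attributable to regulation or bursting.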
Aguiar, Bruno; Vieira, Jorge; Cunha, Ana E; Fonseca, Nuno A; Reboiro-Jato, David; Reboiro-Jato, Miguel; Fdez-Riverola, Florentino; Raspé, Olivier; Vieira, Cristina P
2013-05-01
S-RNase-based gametophytic self-incompatibility evolved once before the split of the Asteridae and Rosidae. In Prunus (tribe Amygdaloideae of Rosaceae), the self-incompatibility S-pollen is a single F-box gene that presents the expected evolutionary signatures. In Malus and Pyrus (subtribe Pyrinae of Rosaceae), however, clusters of F-box genes (called SFBBs) have been described that are expressed in pollen only and are linked to the S-RNase gene. Although polymorphic, SFBB genes present levels of diversity lower than those of the S-RNase gene. They have been suggested as putative S-pollen genes, in a system of non-self recognition by multiple factors. Subsets of allelic products of the different SFBB genes interact with non-self S-RNases, marking them for degradation, and allowing compatible pollinations. This study performed a detailed characterization of SFBB genes in Sorbus aucuparia (Pyrinae) to address three predictions of the non-self recognition by multiple factors model. As predicted, the number of SFBB genes was large to account for the many S-RNase specificities. Secondly, like the S-RNase gene, the SFBB genes were old. Thirdly, amino acids under positive selection (those that could be involved in specificity determination) were identified when intra-haplotype SFBB genes were analysed using codon models. Overall, the findings reported here support the non-self recognition by multiple factors model.
Multiconstrained gene clustering based on generalized projections
2010-01-01
Background: Gene clustering for annotating gene functions is one of the fundamental issues in bioinformatics. The best clustering solution is often regularized by multiple constraints such as gene expressions, Gene Ontology (GO) annotations and gene network structures. How to integrate multiple constraints into an optimal clustering solution remains an unsolved problem. Results: We propose a novel multiconstrained gene clustering (MGC) method within the generalized projection onto convex sets (POCS) framework used widely in image reconstruction. Each constraint is formulated as a corresponding set. The generalized projector iteratively projects the clustering solution onto these sets in order to find a consistent solution included in the intersection set that satisfies all constraints. Compared with previous MGC methods, POCS can integrate multiple constraints of different natures without distorting the original constraints. To evaluate the clustering solution, we also propose a new performance measure referred to as Gene Log Likelihood (GLL) that considers genes having more than one function and hence belonging to more than one cluster. Comparative experimental results show that our POCS-based gene clustering method outperforms current state-of-the-art MGC methods. Conclusions: The POCS-based MGC method can successfully combine multiple constraints of different natures for gene clustering. Also, the proposed GLL is an effective performance measure for soft clustering solutions. PMID:20356386
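The iterative projection idea described in this abstract can be illustrated with a minimal sketch. This is not the authors' MGC method: the two constraint sets (a box and a fixed-sum hyperplane), the function names, and the starting point are all assumptions chosen only to show how alternating projections settle on a point that satisfies every constraint at once.

```python
# Generic projection-onto-convex-sets (POCS) sketch, standard library only.
# The two sets below are illustrative assumptions, not the constraints
# (expression, GO annotation, network structure) used in the paper.

def project_box(x, lo, hi):
    """Project x onto the box {lo <= x_i <= hi} by clipping each coordinate."""
    return [min(max(v, lo), hi) for v in x]

def project_sum(x, total):
    """Project x onto the hyperplane {sum(x) == total} by a uniform shift."""
    shift = (total - sum(x)) / len(x)
    return [v + shift for v in x]

def pocs(x, steps=200):
    """Alternate the two projections for a fixed number of steps."""
    for _ in range(steps):
        x = project_sum(project_box(x, 0.0, 1.0), 1.0)
    return x

if __name__ == "__main__":
    # Hypothetical starting point that violates both constraints.
    print(pocs([2.0, -1.0, 0.5]))
```

Because both sets are convex and their intersection is non-empty, the alternating scheme converges to a point lying in both; richer constraints would be handled by adding one projector per set to the loop.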
Recurrent Rearrangements of Human Amylase Genes Create Multiple Independent CNV Series.
Shwan, Nzar A A; Louzada, Sandra; Yang, Fengtang; Armour, John A L
2017-05-01
The human amylase gene cluster includes the human salivary (AMY1) and pancreatic amylase genes (AMY2A and AMY2B), and is a highly variable and dynamic region of the genome. Copy number variation (CNV) of AMY1 has been implicated in human dietary adaptation, and in population association with obesity, but neither of these findings has been independently replicated. Despite these functional implications, the structural genomic basis of CNV has only been defined in detail very recently. In this work, we use high-resolution analysis of copy number, and analysis of segregation in trios, to define new, independent allelic series of amylase CNVs in sub-Saharan Africans, including a series of higher-order expansions of a unit consisting of one copy each of AMY1, AMY2A, and AMY2B. We use fiber-FISH (fluorescence in situ hybridization) to define unexpected complexity in the accompanying rearrangements. These findings demonstrate recurrent involvement of the amylase gene region in genomic instability, involving at least five independent rearrangements of the pancreatic amylase genes (AMY2A and AMY2B). Structural features shared by fundamentally distinct lineages strongly suggest that the common ancestral state for the human amylase cluster contained more than one, and probably three, copies of AMY1. © 2017 WILEY PERIODICALS, INC.
Benayahu, Dafna; Socher, Rina; Shur, Irena
2008-01-01
Laser capture microdissection (LCM) method allows selection of individual or clustered cells from intact tissues. This technology enables one to pick cells from tissues that are difficult to study individually, sort the anatomical complexity of these tissues, and make the cells available for molecular analyses. Following the cells' extraction, the nucleic acids and proteins can be isolated and used for multiple applications that provide an opportunity to uncover the molecular control of cellular fate in the natural microenvironment. Utilization of LCM for the molecular analysis of cells from skeletal tissues will enable one to study differential patterns of gene expression in the native intact skeletal tissue with reliable interpretation of function for known genes as well as to discover novel genes. Variability between samples may be caused either by differences in the tissue samples (different areas isolated from the same section) or some variances in sample handling. LCM is a multi-task technology that combines histology, microscopy work, and dedicated molecular biology. The LCM application will provide results that will pave the way toward high throughput profiling of tissue-specific gene expression using Gene Chip arrays. Detailed description of in vivo molecular pathways will make it possible to elaborate on control systems to apply for the repair of genetic or metabolic diseases of skeletal tissues.
Assessment of brain reference genes for RT-qPCR studies in neurodegenerative diseases
Rydbirk, Rasmus; Folke, Jonas; Winge, Kristian; Aznar, Susana; Pakkenberg, Bente; Brudek, Tomasz
2016-01-01
Evaluation of gene expression levels by reverse transcription quantitative real-time PCR (RT-qPCR) has for many years been the favourite approach for discovering disease-associated alterations. Normalization of results to stably expressed reference genes (RGs) is pivotal to obtain reliable results. This is especially important in relation to neurodegenerative diseases where disease-related structural changes may affect the most commonly used RGs. We analysed 15 candidate RGs in 98 brain samples from two brain regions from Alzheimer’s disease (AD), Parkinson’s disease (PD), Multiple System Atrophy, and Progressive Supranuclear Palsy patients. Using RefFinder, a web-based tool for evaluating RG stability, we identified the most stable RGs to be UBE2D2, CYC1, and RPL13 which we recommend for future RT-qPCR studies on human brain tissue from these patients. None of the investigated genes were affected by experimental variables such as RIN, PMI, or age. Findings were further validated by expression analyses of a target gene GSK3B, known to be affected by AD and PD. We obtained high variations in GSK3B levels when contrasting the results using different sets of common RG underlining the importance of a priori validation of RGs for RT-qPCR studies. PMID:27853238
Assessment of brain reference genes for RT-qPCR studies in neurodegenerative diseases.
Rydbirk, Rasmus; Folke, Jonas; Winge, Kristian; Aznar, Susana; Pakkenberg, Bente; Brudek, Tomasz
2016-11-17
Evaluation of gene expression levels by reverse transcription quantitative real-time PCR (RT-qPCR) has for many years been the favourite approach for discovering disease-associated alterations. Normalization of results to stably expressed reference genes (RGs) is pivotal to obtain reliable results. This is especially important in relation to neurodegenerative diseases where disease-related structural changes may affect the most commonly used RGs. We analysed 15 candidate RGs in 98 brain samples from two brain regions from Alzheimer's disease (AD), Parkinson's disease (PD), Multiple System Atrophy, and Progressive Supranuclear Palsy patients. Using RefFinder, a web-based tool for evaluating RG stability, we identified the most stable RGs to be UBE2D2, CYC1, and RPL13 which we recommend for future RT-qPCR studies on human brain tissue from these patients. None of the investigated genes were affected by experimental variables such as RIN, PMI, or age. Findings were further validated by expression analyses of a target gene GSK3B, known to be affected by AD and PD. We obtained high variations in GSK3B levels when contrasting the results using different sets of common RG underlining the importance of a priori validation of RGs for RT-qPCR studies.
Weiler, K S; Wakimoto, B T
1998-01-01
In Drosophila melanogaster, chromosome rearrangements that juxtapose euchromatin and heterochromatin can result in position effect variegation (PEV), the variable expression of heterochromatic and euchromatic genes in the vicinity of the novel breakpoint. We examined PEV of the heterochromatic light (lt) and concertina (cta) genes in order to investigate potential tissue or developmental differences in chromosome structure that might be informative for comparing the mechanisms of PEV of heterochromatic and euchromatic genes. We employed tissue pigmentation and in situ hybridization to RNA to assess expression of lt in individual cells of multiple tissues during development. Variegation of lt was induced in the adult eye, larval salivary glands and larval Malpighian tubules for each of three different chromosome rearrangements. The relative severity of the effect in these tissues was not tissue-specific but rather was characteristic of each rearrangement. Surprisingly, larval imaginal discs did not exhibit variegated lt expression. Instead, a uniform reduction of the lt transcript was observed, which correlated in magnitude with the degree of variegation. The same results were obtained for cta expression. These two distinct effects of rearrangements on heterochromatic gene expression correlated with the developmental stage of the tissue. These results have implications for models of heterochromatin formation and the nuclear organization of chromosomes during development and differentiation. PMID:9649533
Amplification of a Gene Related to Mammalian mdr Genes in Drug-Resistant Plasmodium falciparum
NASA Astrophysics Data System (ADS)
Wilson, Craig M.; Serrano, Adelfa E.; Wasley, Annemarie; Bogenschutz, Michael P.; Shankar, Anuraj H.; Wirth, Dyann F.
1989-06-01
The malaria parasite Plasmodium falciparum contains at least two genes related to the mammalian multiple drug resistance genes, and at least one of the P. falciparum genes is expressed at a higher level and is present in higher copy number in a strain that is resistant to multiple drugs than in a strain that is sensitive to the drugs.
The gene for replication factor C subunit 2 (RFC2) is within the 7q11.23 Williams syndrome deletion
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peoples, R.; Perez-Jurado, L.; Francke, U.
1996-06-01
Williams syndrome (WS) is a developmental disorder with multiple system manifestations, including supravalvar aortic stenosis (SVAS), peripheral pulmonic stenosis, connective tissue abnormalities, short stature, characteristic personality profile and cognitive deficits, and variable hypercalcemia in infancy. It is caused by heterozygosity for a chromosomal deletion of part of band 7q11.23 including the elastin locus (ELN). Since disruption of the ELN gene causes autosomal dominant SVAS, it is assumed that ELN haploinsufficiency is responsible for the cardiovascular features of WS. The deletion that extends from the ELN locus in both directions is ≥200 kb in size, although estimates of ≥2 Mb are suggested by high-resolution chromosome banding and physical mapping studies. We have searched for additional dosage-sensitive genes within the deletion that may be responsible for the noncardiovascular features. We report here that the gene for replication factor C subunit 2 (RFC2) maps within the WS deletion region and was found to be deleted in all of 18 WS patients studied. The protein product of RFC2 is part of a multimeric complex involved in DNA elongation during replication. 14 refs., 3 figs.
Thollesson, M.
1999-01-01
The phylogeny of Euthyneura is analysed by using DNA sequences of the mitochondrial 16S rRNA gene. Despite the common notion that this gene is too variable to provide useful information at high taxonomic levels, such as in the present study, bootstrap proportions are high for several clades in the study. This indicates that there is a useful amount of variation despite the noise due to multiple substitutions. The analyses furthermore indicate that (i) Gymnosomata (represented by Clione) is not a part of Euthyneura, but Clione forms a clade with the caenogastropods; (ii) Acteon is the sister group to the remaining euthyneuran taxa in the study; (iii) the nudibranch taxa form two clades, one comprising Dendronotoidea, Arminoidea and Aeolidoidea (together Cladobranchia) with Notaspidea (represented by Berthella) as sister group, while the fourth nudibranch taxon, Doridoidea, forms a separate clade; (iv) Cephalaspidea s.s. and Anaspidea form clades that are each other's sister groups (together Pleurocoela). Finally, there is no clade present in the analyses corresponding to the taxon Opisthobranchia in the traditional sense, and the use of this name is probably better abandoned altogether.
Dystrophic Cardiomyopathy: Complex Pathobiological Processes to Generate Clinical Phenotype
Tsuda, Takeshi; Fitzgerald, Kristi K.
2017-01-01
Duchenne muscular dystrophy (DMD), Becker muscular dystrophy (BMD), and X-linked dilated cardiomyopathy (XL-DCM) consist of a unique clinical entity, the dystrophinopathies, which are due to variable mutations in the dystrophin gene. Dilated cardiomyopathy (DCM) is a common complication of dystrophinopathies, but the onset, progression, and severity of heart disease differ among these subgroups. Extensive molecular genetic studies have been conducted to assess genotype-phenotype correlation in DMD, BMD, and XL-DCM to understand the underlying mechanisms of these diseases, but the results are not always conclusive, suggesting the involvement of complex multi-layers of pathological processes that generate the final clinical phenotype. Dystrophin protein is a part of dystrophin-glycoprotein complex (DGC) that is localized in skeletal muscles, myocardium, smooth muscles, and neuronal tissues. Diversity of cardiac phenotype in dystrophinopathies suggests multiple layers of pathogenetic mechanisms in forming dystrophic cardiomyopathy. In this review article, we review the complex molecular interactions involving the pathogenesis of dystrophic cardiomyopathy, including primary gene mutations and loss of structural integrity, secondary cellular responses, and certain epigenetic and other factors that modulate gene expressions. Involvement of epigenetic gene regulation appears to lead to specific cardiac phenotypes in dystrophic hearts. PMID:29367543
Chen, Jiang; Du, Yinan; He, Xueyan; Huang, Xingxu; Shi, Yun S
2017-03-31
The most powerful way to probe protein function is to characterize the consequence of its deletion. Compared to conventional gene knockout (KO), conditional knockout (cKO) provides an advanced gene targeting strategy with which gene deletion can be performed in a spatially and temporally restricted manner. However, for most species that are amphiploid, the widely used Cre-flox cKO system requires the targeted loci in both alleles to be loxP flanked, which in practice demands time- and labor-consuming breeding. This burden becomes considerable when one is dealing with multiple genes. The CRISPR/Cas9 genome modulation system has the advantage of being able to target multiple sites simultaneously. Here we propose a strategy that could achieve conditional KO of multiple genes in mouse with Cre recombinase dependent Cas9 expression. By transgenic construction of loxP-stop-loxP (LSL) controlled Cas9 (LSL-Cas9) together with sgRNAs targeting EGFP, we showed that the fluorescence molecule could be eliminated in a Cre-dependent manner. We further verified the efficacy of this novel strategy to target multiple sites by deleting c-Maf and MafB simultaneously and specifically in macrophages. Compared to the traditional Cre-flox cKO strategy, this sgRNAs-LSL-Cas9 cKO system is simpler and faster, and would make conditional manipulation of multiple genes feasible.
Should "Multiple Imputations" Be Treated as "Multiple Indicators"?
ERIC Educational Resources Information Center
Mislevy, Robert J.
1993-01-01
Multiple imputations for latent variables are constructed so that analyses treating them as true variables have the correct expectations for population characteristics. Analyzing multiple imputations in accordance with their construction yields correct estimates of population characteristics, whereas analyzing them as multiple indicators generally…
Klein, Hans-Ulrich; Ruckert, Christian; Kohlmann, Alexander; Bullinger, Lars; Thiede, Christian; Haferlach, Torsten; Dugas, Martin
2009-12-15
Multiple gene expression signatures derived from microarray experiments have been published in the field of leukemia research. A comparison of these signatures with results from new experiments is useful for verification as well as for interpretation of the results obtained. Currently, the percentage of overlapping genes is frequently used to compare published gene signatures against a signature derived from a new experiment. However, it has been shown that the percentage of overlapping genes is of limited use for comparing two experiments due to the variability of gene signatures caused by different array platforms or assay-specific influencing parameters. Here, we present a robust approach for a systematic and quantitative comparison of published gene expression signatures with an exemplary query dataset. A database storing 138 leukemia-related published gene signatures was designed. Each gene signature was manually annotated with terms according to a leukemia-specific taxonomy. Two analysis steps are implemented to compare a new microarray dataset with the results from previous experiments stored and curated in the database. First, the global test method is applied to assess gene signatures and to constitute a ranking among them. In a subsequent analysis step, the focus is shifted from single gene signatures to chromosomal aberrations or molecular mutations as modeled in the taxonomy. Potentially interesting disease characteristics are detected based on the ranking of gene signatures associated with these aberrations stored in the database. Two example analyses are presented. An implementation of the approach is freely available as web-based application. The presented approach helps researchers to systematically integrate the knowledge derived from numerous microarray experiments into the analysis of a new dataset. 
By means of example leukemia datasets we demonstrate that this approach detects related experiments as well as related molecular mutations and may help to interpret new microarray data.
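The simple baseline that this abstract describes as being of limited use, the percentage of overlapping genes between two signatures, can be sketched as follows. The abstract does not state which denominator is used, so normalizing by the smaller signature is an assumption here, and the gene symbols in the usage line are hypothetical placeholders rather than data from the study.

```python
# Percentage-overlap comparison of two gene signatures (baseline measure only).
# Assumption: overlap is expressed relative to the smaller of the two signatures.

def percent_overlap(signature_a, signature_b):
    """Return the shared genes as a percentage of the smaller signature."""
    a, b = set(signature_a), set(signature_b)
    if not a or not b:
        return 0.0
    return 100.0 * len(a & b) / min(len(a), len(b))

if __name__ == "__main__":
    # Hypothetical gene identifiers, used only to exercise the function.
    query = {"GENE_A", "GENE_B", "GENE_C", "GENE_D"}
    published = {"GENE_B", "GENE_C", "GENE_E"}
    print(round(percent_overlap(query, published), 2))
```

A measure of this kind depends on list lengths and on which genes each array platform can report, which is consistent with the abstract's argument for a test-based ranking instead.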
Kujoth, Gregory C.; Sullivan, Thomas D.; Merkhofer, Richard; Lee, Taek-Jin; Wang, Huafeng; Brandhorst, Tristan; Wüthrich, Marcel
2018-01-01
Blastomyces dermatitidis is a human fungal pathogen of the lung that can lead to disseminated disease in healthy and immunocompromised individuals. Genetic analysis of this fungus is hampered by the relative inefficiency of traditional recombination-based gene-targeting approaches. Here, we demonstrate the feasibility of applying CRISPR/Cas9-mediated gene editing to Blastomyces, including to simultaneously target multiple genes. We created targeting plasmid vectors expressing Cas9 and either one or two single guide RNAs and introduced these plasmids into Blastomyces via Agrobacterium gene transfer. We succeeded in disrupting several fungal genes, including PRA1 and ZRT1, which are involved in scavenging and uptake of zinc from the extracellular environment. Single-gene-targeting efficiencies varied by locus (median, 60% across four loci) but were approximately 100-fold greater than traditional methods of Blastomyces gene disruption. Simultaneous dual-gene targeting proceeded with efficiencies similar to those of single-gene-targeting frequencies for the respective targets. CRISPR/Cas9 disruption of PRA1 or ZRT1 had a variable impact on growth under zinc-limiting conditions, showing reduced growth at early time points in low-passage-number cultures and growth similar to wild-type levels by later passage. Individual impairment of PRA1 or ZRT1 resulted in a reduction of the fungal burden in a mouse model of Blastomyces infection by a factor of ~1 log (range, up to 3 logs), and combined disruption of both genes had no additional impact on the fungal burden. These results underscore the utility of CRISPR/Cas9 for efficient gene disruption in dimorphic fungi and reveal a role for zinc metabolism in Blastomyces fitness in vivo. PMID:29615501
Unifying measures of gene function and evolution.
Wolf, Yuri I; Carmel, Liran; Koonin, Eugene V
2006-06-22
Recent genome analyses revealed intriguing correlations between variables characterizing the functioning of a gene, such as expression level (EL), connectivity of genetic and protein-protein interaction networks, and knockout effect, and variables describing gene evolution, such as sequence evolution rate (ER) and propensity for gene loss. Typically, variables within each of these classes are positively correlated, e.g. products of highly expressed genes also have a propensity to be involved in many protein-protein interactions, whereas variables between classes are negatively correlated, e.g. highly expressed genes, on average, evolve slower than weakly expressed genes. Here, we describe principal component (PC) analysis of seven genome-related variables and propose biological interpretations for the first three PCs. The first PC reflects a gene's 'importance', or the 'status' of a gene in the genomic community, with positive contributions from knockout lethality, EL, number of protein-protein interaction partners and the number of paralogues, and negative contributions from sequence ER and gene loss propensity. The next two PCs define a plane that seems to reflect the functional and evolutionary plasticity of a gene. Specifically, PC2 can be interpreted as a gene's 'adaptability' whereby genes with high adaptability readily duplicate, have many genetic interaction partners and tend to be non-essential. PC3 also might reflect the role of a gene in organismal adaptation albeit with a negative rather than a positive contribution of genetic interactions; we provisionally designate this PC 'reactivity'. The interpretation of PC2 and PC3 as measures of a gene's plasticity is compatible with the observation that genes with high values of these PCs tend to be expressed in a condition- or tissue-specific manner. 
Functional classes of genes substantially vary in status, adaptability and reactivity, with the highest status characteristic of the translation system and cytoskeletal proteins, highest adaptability seen in cellular processes and signalling genes, and top reactivity characteristic of metabolic enzymes.
Evolutionary dynamics and genetic diversity from three genes of Anguillid rhabdovirus.
Bellec, Laure; Cabon, Joelle; Bergmann, Sven; de Boisséson, Claire; Engelsma, Marc; Haenen, Olga; Morin, Thierry; Olesen, Niels Jørgen; Schuetze, Heike; Toffan, Anna; Way, Keith; Bigarré, Laurent
2014-11-01
Wild freshwater eel populations have dramatically declined in recent decades in Europe and America, partially through the impact of several factors including the wide spread of infectious diseases. The anguillid rhabdoviruses eel virus European X (EVEX) and eel virus American (EVA) potentially play a role in this decline, even if their real contribution is still unclear. In this study, we investigate the evolutionary dynamics and genetic diversity of anguillid rhabdoviruses by analysing sequences from the glycoprotein, nucleoprotein and phosphoprotein (P) genes of 57 viral strains collected from seven countries over 40 years using maximum-likelihood and Bayesian approaches. Phylogenetic trees from the three genes are congruent and allow two monophyletic groups, European and American, to be clearly distinguished. Results of nucleotide substitution rates per site per year indicate that the P gene is expected to evolve most rapidly. The nucleotide diversity observed is low (2-3 %) for the three genes, with a significantly higher variability within the P gene, which encodes multiple proteins from a single genomic RNA sequence, particularly a small C protein. This putative C protein is a potential molecular marker suitable for characterization of distinct genotypes within anguillid rhabdoviruses. This study provides, to our knowledge, the first molecular characterization of EVA, brings new insights to the evolutionary dynamics of two genotypes of Anguillid rhabdovirus, and is a baseline for further investigations on the tracking of its spread.
Sang, Yanmei; Zong, Wei; Yan, Jie; Liu, Min
2012-01-01
Objective. The CLEC16A gene is related to the genetic susceptibility to T1DM with racial variability. This study investigated the association between CLEC16A gene polymorphisms and T1DM in Chinese children. Methods. 131 Chinese children with T1DM were selected for study, and 121 healthy adult blood donors were selected as normal controls. PCR and mass spectrometry were used to study the distributions of 17 CLEC16A alleles in patients and controls. The relationship between CLEC16A gene polymorphisms and T1DM was studied. Results. The distributions of two polymorphisms (rs12921922, rs12931878) of CLEC16A in T1DM patients and healthy controls were significantly different, while the distributions of the other CLEC16A polymorphisms showed no significant differences. The alleles of rs12921922 are C and T. The frequency of the T allele was significantly increased in patients versus healthy controls. The alleles of rs12931878 are A and C. The frequency of the A allele was significantly increased in T1DM patients versus healthy controls. Conclusion. Two polymorphisms in the CLEC16A gene correlate with increased susceptibility to T1DM in Chinese children, indicating that CLEC16A is another gene that correlates with susceptibility to T1DM in multiple populations.
Mueller-Spitz, Sabrina R.; Stewart, Lisa B.; Klump, J. Val; McLellan, Sandra L.
2010-01-01
The release of fecal pollution into surface waters may create environmental reservoirs of feces-derived microorganisms, including pathogens. Clostridium perfringens is a commonly used fecal indicator that represents a human pathogen. The pathogenicity of this bacterium is associated with its expression of multiple toxins; however, the prevalence of C. perfringens with various toxin genes in aquatic environments is not well characterized. In this study, C. perfringens spores were used to measure the distribution of fecal pollution associated with suspended sediments in the nearshore waters of Lake Michigan. Particle-associated C. perfringens levels were greatest adjacent to the Milwaukee harbor and diminished in the nearshore waters. Species-specific PCR and toxin gene profiles identified 174 isolates collected from the suspended sediments, surface water, and sewage influent as C. perfringens type A. Regardless of the isolation source, the beta2 and enterotoxin genes were common among isolates. The suspended sediments yielded the highest frequency of cpe-carrying C. perfringens (61%) compared to sewage (38%). Gene arrangement of enterotoxin was investigated using PCR to target known insertion sequences associated with this gene. Amplification products were detected in only 9 of 90 strains, which suggests there is greater variability in cpe gene arrangement than previously described. This work presents evidence that freshwater suspended sediments and sewage influent are reservoirs for potentially pathogenic cpe-carrying C. perfringens spores. PMID:20581181
Rublee, Parke A; Remington, David L; Schaefer, Eric F; Marshall, Michael M
2005-01-01
Molecular methods, including conventional PCR, real-time PCR, denaturing gradient gel electrophoresis, fluorescent fragment detection PCR, and fluorescent in situ hybridization, have all been developed for use in identifying and studying the distribution of the toxic dinoflagellates Pfiesteria piscicida and P. shumwayae. Application of the methods has demonstrated a worldwide distribution of both species and provided insight into their environmental tolerance range and temporal changes in distribution. Genetic variability among geographic locations generally appears low in rDNA genes, and detection of the organisms in ballast water is consistent with rapid dispersal or high gene flow among populations, but additional sequence data are needed to verify this hypothesis. The rapid development and application of these tools serves as a model for study of other microbial taxa and provides a basis for future development of tools that can simultaneously detect multiple targets.
Egg phenotype differentiation in sympatric cuckoo Cuculus canorus gentes.
Antonov, Anton; Stokke, B G; Vikan, J R; Fossøy, F; Ranke, P S; Røskaft, E; Moksnes, A; Møller, A P; Shykoff, J A
2010-06-01
The brood parasitic common cuckoo Cuculus canorus consists of gentes, which typically parasitize only a single host species whose eggs they often mimic. Where multiple cuckoo gentes co-exist in sympatry, we may expect variable but generally poorer mimicry because of host switches or inter-gens gene flow via males if these also contribute to egg phenotypes. Here, we investigated egg trait differentiation and mimicry in three cuckoo gentes parasitizing great reed warblers Acrocephalus arundinaceus, marsh warblers Acrocephalus palustris and corn buntings Miliaria calandra breeding in close sympatry in partially overlapping habitat types. The three cuckoo gentes showed a remarkable degree of mimicry to their three host species in some but not all egg features, including egg size, a hitherto largely ignored feature of egg mimicry. Egg phenotype matching for both background and spot colours as well as for egg size has been maintained in close sympatry despite the possibility for gene flow.
Evans, Jessica J; Gygli, Patrick E; McCaskill, Julienne; DeVeaux, Linda C
2018-04-20
The haloarchaea are unusual in possessing genes for multiple homologs to the ubiquitous single-stranded DNA binding protein (SSB or replication protein A, RPA) found in all three domains of life. Halobacterium salinarum contains five homologs: two are eukaryotic in organization, two are prokaryotic and are encoded on the minichromosomes, and one is uniquely euryarchaeal. Radiation-resistant mutants previously isolated show upregulation of one of the eukaryotic-type RPA genes. Here, we have created deletions in the five RPA operons. These deletion mutants were exposed to DNA-damaging conditions: ionizing radiation, UV radiation, and mitomycin C. Deletion of the euryarchaeal homolog, although not lethal as in Haloferax volcanii, causes severe sensitivity to all of these agents. Deletion of the other RPA/SSB homologs imparts a variable sensitivity to these DNA-damaging agents, suggesting that the different RPA homologs have specialized roles depending on the type of genomic insult encountered.
Walters, Alison D; Chong, James P J
2017-05-01
The single minichromosome maintenance (MCM) protein found in most archaea has been widely studied as a simplified model for the MCM complex that forms the catalytic core of the eukaryotic replicative helicase. Organisms of the order Methanococcales are unusual in possessing multiple MCM homologues. The Methanococcus maripaludis S2 genome encodes four MCM homologues, McmA-McmD. DNA helicase assays reveal that the unwinding activity of the three MCM-like proteins is highly variable despite sequence similarities and suggests additional motifs that influence MCM function are yet to be identified. While the gene encoding McmA could not be deleted, strains harbouring individual deletions of genes encoding each of the other MCMs display phenotypes consistent with these proteins modulating DNA damage responses. M. maripaludis S2 is the first archaeon in which MCM proteins have been shown to influence the DNA damage response.
Moderating role of the MAOA genotype in antisocial behaviour.
Fergusson, David M; Boden, Joseph M; Horwood, L John; Miller, Allison; Kennedy, Martin A
2012-02-01
Recent studies have examined gene×environment (G×E) interactions involving the monoamine oxidase A (MAOA) gene in moderating the associations between exposure to adversity and antisocial behaviour. The present study examined a novel method for assessing interactions between a single gene and multiple risk factors related to environmental and personal adversity. The aim was to test the hypothesis that the presence of the low-activity MAOA genotype was associated with an increased response to a series of risk factors. Participants were 399 males from the Christchurch Health and Development Study who had complete data on: (a) MAOA promoter region variable number tandem repeat genotype; (b) antisocial behaviour (criminal offending) to age 30 and convictions to age 21; and (c) maternal smoking during pregnancy, IQ, childhood maltreatment and school failure. Poisson regression models were fitted to three antisocial behaviour outcomes (property/violent offending ages 15-30; and convictions ages 17-21), using measures of exposure to adverse childhood circumstances. The analyses revealed consistent evidence of G×E interactions, such that those with the low-activity MAOA variant who were exposed to adversity in childhood were significantly more likely to report offending in late adolescence and early adulthood. The present findings add to the evidence suggesting that there is a stable G×E interaction involving MAOA, a range of adverse environmental and personal factors, and antisocial behaviour across the life course. These analyses also demonstrate the utility of using multiple environmental/personal exposures to test G×E interactions.
The TERT gene harbors multiple variants associated with pancreatic cancer susceptibility
Campa, Daniele; Rizzato, Cosmeri; Stolzenberg-Solomon, Rachael; Pacetti, Paola; Vodicka, Pavel; Cleary, Sean P.; Capurso, Gabriele; Bueno-de-Mesquita, H. Bas; Werner, Jens; Gazouli, Maria; Butterbach, Katja; Ivanauskas, Audrius; Giese, Nathalia; Petersen, Gloria M.; Fogar, Paola; Wang, Zhaoming; Bassi, Claudio; Ryska, Miroslav; Theodoropoulos, George E.; Kooperberg, Charles; Li, Donghui; Greenhalf, William; Pasquali, Claudio; Hackert, Thilo; Fuchs, Charles S.; Mohelnikova-Duchonova, Beatrice; Sperti, Cosimo; Funel, Niccola; Dieffenbach, Aida Karina; Wareham, Nicholas J.; Buring, Julie; Holcátová, Ivana; Costello, Eithne; Zambon, Carlo-Federico; Kupcinskas, Juozas; Risch, Harvey A.; Kraft, Peter; Bracci, Paige M.; Pezzilli, Raffaele; Olson, Sara H.; Sesso, Howard D.; Hartge, Patricia; Strobel, Oliver; Małecka-Panas, Ewa; Visvanathan, Kala; Arslan, Alan A.; Pedrazzoli, Sergio; Souček, Pavel; Gioffreda, Domenica; Key, Timothy J.; Talar-Wojnarowska, Renata; Scarpa, Aldo; Mambrini, Andrea; Jacobs, Eric J.; Jamroziak, Krzysztof; Klein, Alison; Tavano, Francesca; Bambi, Franco; Landi, Stefano; Austin, Melissa A.; Vodickova, Ludmila; Brenner, Hermann; Chanock, Stephen J.; Fave, Gianfranco Delle; Piepoli, Ada; Cantore, Maurizio; Zheng, Wei; Wolpin, Brian M.; Amundadottir, Laufey T.; Canzian, Federico
2015-01-01
A small number of common susceptibility loci have been identified for pancreatic cancer, one of which is marked by rs401681 in the TERT-CLPTM1L gene region on chr5p15.33. Since this region is characterized by low linkage disequilibrium (LD), we sought to identify additional SNPs that could be related to pancreatic cancer risk, independently of rs401681. We performed an in-depth analysis of genetic variability of the telomerase reverse transcriptase (TERT) and the telomerase RNA component (TERC) genes, in 5,550 subjects with pancreatic cancer and 7,585 controls from the PANcreatic Disease ReseArch (PANDoRA) and the PanScan consortia. We identified a significant association between a variant in TERT and pancreatic cancer risk (rs2853677, OR=0.85; 95% CI=0.80–0.90, P=8.3×10⁻⁸). Additional analysis adjusting rs2853677 for rs401681 indicated that the two SNPs are independently associated with pancreatic cancer risk, as suggested by the low LD between them (r²=0.07, D′=0.28). Three additional SNPs in TERT reached statistical significance after correction for multiple testing: rs2736100 (P=3.0×10⁻⁵), rs4583925 (P=4.0×10⁻⁵) and rs2735948 (P=5.0×10⁻⁵). In conclusion, we confirmed that the TERT locus is associated with pancreatic cancer risk, possibly through several independent variants. PMID:25940397
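As a rough consistency check on the rs2853677 result quoted above, the standard error of the log odds ratio can be recovered from the 95% confidence interval and turned into a Wald z-score and two-sided P value. This is a minimal sketch only: it assumes a symmetric Wald interval on the log scale, the function name `wald_from_ci` is hypothetical, and the rounded OR and CI mean the result can only be expected to agree with the reported P value in order of magnitude.

```python
import math

def wald_from_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover SE, Wald z and two-sided P from an OR and its 95% CI.

    Assumes the interval is exp(log(OR) +/- z_crit * SE), which is an
    assumption about how the interval was built, not a statement from
    the abstract.
    """
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * z_crit)
    z = math.log(odds_ratio) / se
    # Two-sided normal tail probability: P = erfc(|z| / sqrt(2))
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return se, z, p

# Values as printed in the abstract (rounded to two decimals there).
se, z, p = wald_from_ci(0.85, 0.80, 0.90)
print(f"SE(log OR) = {se:.4f}, z = {z:.2f}, two-sided P = {p:.1e}")
```

Because the inputs are rounded, small differences from the published P value are expected and do not indicate an inconsistency.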
Emerick, Mark C; Stein, Rebecca; Kunze, Robin; McNulty, Megan M; Regan, Melissa R; Hanck, Dorothy A; Agnew, William S
2006-08-01
We describe the regulated transcriptome of CACNA1G, a human gene for T-type Ca(v)3.1 calcium channels that is subject to extensive alternative RNA splicing. Fifteen sites of transcript variation include 2 alternative 5'-UTR promoter sites, 2 alternative 3'-UTR polyadenylation sites, and 11 sites of alternative splicing within the open reading frame. A survey of 1580 fetal and adult human brain full-length complementary DNAs reveals a family of 30 distinct transcripts, including multiple functional forms that vary in expression with development. Statistical analyses of fetal and adult transcript populations reveal patterns of linkages among intramolecular splice site configurations that change dramatically with development. A shift from nearly independent, biased splicing in fetal transcripts to strongly concerted splicing in adult transcripts suggests progressive activation of multiple "programs" of splicing regulation that reorganize molecular structures in differentiating cells. Patch-clamp studies of nine selected variants help relate splicing regulation to permutations of the gating parameters most likely to modify T-channel physiology in expressing neurons. Gating behavior reflects combinatorial interactions between variable domains so that molecular phenotype depends on ensembles of coselected domains, consistent with the observed emergence of concerted splicing during development. We conclude that the structural gene and networks of splicing regulatory factors define an integrated system for the phenotypic variation of Ca(v)3.1 biophysics during nervous system development. Copyright 2006 Wiley-Liss, Inc.
Human genetics of infectious diseases: a unified theory
Casanova, Jean-Laurent; Abel, Laurent
2007-01-01
Since the early 1950s, the dominant paradigm in the human genetics of infectious diseases postulates that rare monogenic immunodeficiencies confer vulnerability to multiple infectious diseases (one gene, multiple infections), whereas common infections are associated with the polygenic inheritance of multiple susceptibility genes (one infection, multiple genes). Recent studies, since 1996 in particular, have challenged this view. A newly recognised group of primary immunodeficiencies predisposing the individual to a principal or single type of infection is emerging. In parallel, several common infections have been shown to reflect the inheritance of one major susceptibility gene, at least in some populations. This novel causal relationship (one gene, one infection) blurs the distinction between patient-based Mendelian genetics and population-based complex genetics, and provides a unified conceptual frame for exploring the molecular genetic basis of infectious diseases in humans. PMID:17255931
Anti-inflammatory genes associated with multiple sclerosis: a gene expression study.
Perga, S; Montarolo, F; Martire, S; Berchialla, P; Malucchi, S; Bertolotto, A
2015-02-15
Multiple sclerosis (MS) is an autoimmune inflammatory disease of the central nervous system caused by a complex interaction between multiple genes and environmental factors. The HLA region is the strongest susceptibility locus, but recent large genome-wide association studies have identified new susceptibility genes. Among these, BACH2, PTGER4, RGS1 and ZFP36L1 were highlighted. Here, a gene expression analysis revealed that three of them, namely BACH2, PTGER4 and ZFP36L1, are down-regulated in MS patients' blood cells compared to healthy subjects. Interestingly, all these genes are involved in immune system regulation with a predominantly anti-inflammatory role, and their reduction could predispose to MS development. Copyright © 2015 Elsevier B.V. All rights reserved.
Early developmental gene enhancers affect subcortical volumes in the adult human brain.
Becker, Martin; Guadalupe, Tulio; Franke, Barbara; Hibar, Derrek P; Renteria, Miguel E; Stein, Jason L; Thompson, Paul M; Francks, Clyde; Vernes, Sonja C; Fisher, Simon E
2016-05-01
Genome-wide association screens aim to identify common genetic variants contributing to the phenotypic variability of complex traits, such as human height or brain morphology. The identified genetic variants are mostly within noncoding genomic regions and the biology of the genotype-phenotype association typically remains unclear. In this article, we propose a complementary targeted strategy to reveal the genetic underpinnings of variability in subcortical brain volumes, by specifically selecting genomic loci that are experimentally validated forebrain enhancers, active in early embryonic development. We hypothesized that genetic variation within these enhancers may affect the development and ultimately the structure of subcortical brain regions in adults. We tested whether variants in forebrain enhancer regions showed an overall enrichment of association with volumetric variation in subcortical structures of >13,000 healthy adults. We observed significant enrichment of genomic loci that affect the volume of the hippocampus within forebrain enhancers (empirical P = 0.0015), a finding which robustly passed the adjusted threshold for testing of multiple brain phenotypes (cutoff of P < 0.0083 at an alpha of 0.05). In analyses of individual single nucleotide polymorphisms (SNPs), we identified an association upstream of the ID2 gene with rs7588305 and variation in hippocampal volume. This SNP-based association survived multiple-testing correction for the number of SNPs analyzed but not for the number of subcortical structures. Targeting known regulatory regions offers a way to understand the underlying biology that connects genotypes to phenotypes, particularly in the context of neuroimaging genetics. This biology-driven approach generates testable hypotheses regarding the functional biology of identified associations. Hum Brain Mapp 37:1788-1800, 2016. © 2016 Wiley Periodicals, Inc.
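The adjusted cutoff quoted above (P < 0.0083 at an alpha of 0.05) is what a Bonferroni correction gives when alpha is divided across six tests. The sketch below illustrates that arithmetic; the count of six phenotypes is an assumption inferred from 0.05 / 0.0083, not a figure stated in the abstract, and the function name is hypothetical.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance cutoff under a Bonferroni correction."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests

# Assumption: six subcortical phenotypes were tested (inferred, not stated).
N_PHENOTYPES = 6
threshold = bonferroni_threshold(0.05, N_PHENOTYPES)

empirical_p = 0.0015  # enrichment P value reported for hippocampal volume
print(f"cutoff = {threshold:.4f}")               # cutoff = 0.0083
print("passes cutoff:", empirical_p < threshold)  # passes cutoff: True
```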
Noonan like appearance and familial deletion of the 22q11 Shprintzen-DiGeorge critical region
DOE Office of Scientific and Technical Information (OSTI.GOV)
Piussan, C.; Mathieu, M.; Boudailliez, B.
1994-09-01
Shprintzen velocardiofacial syndrome (VCFS) and reported cases of autosomal dominant DiGeorge sequence (DGS) both belong to a heterogeneous developmental field defect due to the familial segregation of a 22q11 deletion. Two sisters present with mental retardation, dysmorphia and multiple congenital anomalies. The eldest has a Noonan-like appearance: short stature, short webbed neck, low posterior hairline, widely spaced nipples, hemivertebrae, speech disability and mild hypoparathyroidism. Her younger sister has prominent eyes, floppy ears, pulmonary valvular stenosis, hypoplastic right kidney, left multicystic kidney, hypoparathyroidism and renal failure causing death at age 3. Their retarded mother has a typical Shprintzen phenotype and no hypoparathyroidism. A deletion of the critical DiGeorge-Shprintzen conotruncal malformation region was found by FISH in the mother and her Noonan-like daughter. In the mother's family there are 3 cleft palates, an imperforate anus, a stillbirth and one infant who died at age 3 months because of heart malformation. To our knowledge, another case of Noonan-like appearance in a DG patient affected with monosomy 22q11 was reported in 1992 by Wilson et al. Whether resulting from the hemizygosity of a gene or from the deletion of contiguous genes, the wide DGS-VCFS spectrum encompasses quite variable phenotypes, discordant for palatal and conotruncal defects as well as for hypoparathyroidism, dysmorphic features and multiple congenital anomalies. Physical mapping of both the large 22q11 region commonly lost and the smallest deletion sufficient to produce DGS has been done and may account for the broadening spectrum, the variable expression and the frequently delayed diagnosis of this syndrome.
Ghorbanoghli, Z; Nieuwenhuis, M H; Houwing-Duistermaat, J J; Jagmohan-Changur, S; Hes, F J; Tops, C M; Wagner, A; Aalfs, C M; Verhoef, S; Gómez García, E B; Sijmons, R H; Menko, F H; Letteboer, T G; Hoogerbrugge, N; van Wezel, T; Vasen, H F A; Wijnen, J T
2016-10-01
Familial adenomatous polyposis (FAP) is a dominantly inherited syndrome caused by germline mutations in the APC gene and characterized by the development of multiple colorectal adenomas and a high risk of developing colorectal cancer (CRC). The severity of polyposis is correlated with the site of the APC mutation. However, there is also phenotypic variability within families with the same underlying APC mutation, suggesting that additional factors influence the severity of polyposis. Genome-wide association studies identified several single nucleotide polymorphisms (SNPs) that are associated with CRC. We assessed whether these SNPs are associated with polyp multiplicity in proven APC mutation carriers. Sixteen CRC-associated SNPs were analysed in a cohort of 419 APC germline mutation carriers from 182 families. Clinical data were retrieved from the Dutch Polyposis Registry. Allele frequencies of the SNPs were compared for patients with <100 colorectal adenomas versus patients with ≥100 adenomas, using generalized estimating equations with the APC genotype as a covariate. We found a trend of association of two of the tested SNPs with the ≥100 adenoma phenotype: the C alleles of rs16892766 at 8q23.3 (OR 1.71, 95 % CI 1.05-2.76, p = 0.03, dominant model) and rs3802842 at 11q23.1 (OR 1.51, 95 % CI 1.03-2.22, p = 0.04, dominant model). We identified two risk variants that are associated with a more severe phenotype in APC mutation carriers. These risk variants may partly explain the phenotypic variability in families with the same APC gene defect. Further studies with a larger sample size are recommended to evaluate and confirm the phenotypic effect of these SNPs in FAP.
Zuo, Erwei; Cai, Yi-Jun; Li, Kui; Wei, Yu; Wang, Bang-An; Sun, Yidi; Liu, Zhen; Liu, Jiwei; Hu, Xinde; Wei, Wei; Huo, Xiaona; Shi, Linyu; Tang, Cheng; Liang, Dan; Wang, Yan; Nie, Yan-Hong; Zhang, Chen-Chen; Yao, Xuan; Wang, Xing; Zhou, Changyang; Ying, Wenqin; Wang, Qifang; Chen, Ren-Chao; Shen, Qi; Xu, Guo-Liang; Li, Jinsong; Sun, Qiang; Xiong, Zhi-Qi; Yang, Hui
2017-07-01
The CRISPR/Cas9 system is an efficient gene-editing method, but the majority of gene-edited animals showed mosaicism, with editing occurring only in a portion of cells. Here we show that a single gene or multiple genes can be completely knocked out in mouse and monkey embryos by zygotic injection of Cas9 mRNA and multiple adjacent single-guide RNAs (spaced 10-200 bp apart) that target only a single key exon of each gene. Phenotypic analysis of F0 mice following targeted deletion of eight genes on the Y chromosome individually demonstrated the robustness of this approach in generating knockout mice. Importantly, this approach delivers complete gene knockout at high efficiencies (100% on Arntl and 91% on Prrt2) in monkey embryos. Finally, we could generate a complete Prrt2 knockout monkey in a single step, demonstrating the usefulness of this approach in rapidly establishing gene-edited monkey models.
Podgoreanu, M V; White, W D; Morris, R W; Mathew, J P; Stafford-Smith, M; Welsby, I J; Grocott, H P; Milano, C A; Newman, M F; Schwinn, D A
2006-07-04
The inflammatory response triggered by cardiac surgery with cardiopulmonary bypass (CPB) is a primary mechanism in the pathogenesis of postoperative myocardial infarction (PMI), a multifactorial disorder with significant inter-patient variability poorly predicted by clinical and procedural factors. We tested the hypothesis that candidate gene polymorphisms in inflammatory pathways contribute to risk of PMI after cardiac surgery. We genotyped 48 polymorphisms from 23 candidate genes in a prospective cohort of 434 patients undergoing elective cardiac surgery with CPB. PMI was defined as creatine kinase-MB isoenzyme level ≥10× the upper limit of normal at 24 hours postoperatively. A 2-step analysis strategy was used: marker selection, followed by model building. To minimize false-positive associations, we adjusted for multiple testing by permutation analysis, Bonferroni correction, and controlling the false discovery rate; 52 patients (12%) experienced PMI. After adjusting for multiple comparisons and clinical risk factors, 3 polymorphisms were found to be independent predictors of PMI (adjusted P<0.05; false discovery rate <10%). These gene variants encode the proinflammatory cytokine interleukin 6 (IL6 -572G>C; odds ratio [OR], 2.47), and 2 adhesion molecules: intercellular adhesion molecule-1 (ICAM1 Lys469Glu; OR, 1.88), and E-selectin (SELE 98G>T; OR, 0.16). The inclusion of genotypic information from these polymorphisms improved prediction models for PMI based on traditional risk factors alone (C-statistic 0.764 versus 0.703). Functional genetic variants in cytokine and leukocyte-endothelial interaction pathways are independently associated with severity of myonecrosis after cardiac surgery. This may aid in preoperative identification of high-risk cardiac surgical patients and development of novel cardioprotective strategies.
IL-17A Mediates a Selective Gene Expression Profile in Asthmatic Human Airway Smooth Muscle Cells
Dragon, Stéphane; Hirst, Stuart J.; Lee, Tak H.
2014-01-01
Airway smooth muscle (ASM) cells are thought to contribute to the pathogenesis of allergic asthma by orchestrating and perpetuating airway inflammation and remodeling responses. In this study, we evaluated the IL-17RA signal transduction and gene expression profile in ASM cells from subjects with mild asthma and healthy individuals. Human primary ASM cells were treated with IL-17A and probed by the Affymetrix GeneChip array, and gene targets were validated by real-time quantitative RT-PCR. Genomic analysis underlined the proinflammatory nature of IL-17A, as multiple NF-κB regulatory factors and chemokines were induced in ASM cells. Transcriptional regulators consisting of primary response genes were overrepresented and displayed dynamic expression profiles. IL-17A poorly enhanced IL-1β or IL-22 gene responses in ASM cells from both subjects with mild asthma and healthy donors. Interestingly, protein modifications to the NF-κB regulatory network were not observed after IL-17A stimulation, although oscillations in IκBε expression were detected. ASM cells from subjects with mild asthma up-regulated more genes with greater overall variability in response to IL-17A than those from healthy donors. Finally, in response to IL-17A, ASM cells displayed rapid activation of the extracellular signal–regulated kinase/ribosomal S6 kinase signaling pathway and increased nuclear levels of phosphorylated extracellular signal–regulated kinase. Taken together, our results suggest that IL-17A mediates a modest gene expression response, which, in cooperation with the NF-κB signaling network, may regulate the gene expression profile in ASM cells. PMID:24393021
The evolution of highly variable immunity genes across a passerine bird radiation.
O'Connor, E A; Strandh, M; Hasselquist, D; Nilsson, J-Å; Westerdahl, H
2016-02-01
To survive, individuals must be able to recognize and eliminate pathogens. The genes of the major histocompatibility complex (MHC) play an essential role in this process in vertebrates as their diversity affects the repertoire of pathogens that can be recognized by the immune system. Emerging evidence suggests that birds within the parvorder Passerida possess an exceptionally high number of MHC genes. However, this has yet to be directly investigated using a consistent framework, and the question of how this MHC diversity has evolved has not been addressed. We used next-generation sequencing to investigate how MHC class I gene copy number and sequence diversity varies across the Passerida radiation using twelve species chosen to represent the phylogenetic range of this group. Additionally, we performed phylogenetic analyses on this data to identify, for the first time, the evolutionary model that best describes how MHC class I gene diversity has evolved within Passerida. We found evidence of multiple MHC class I genes in every family tested, with an extremely broad range in gene copy number across Passerida. There was a strong phylogenetic signal in MHC gene copy number and diversity, and these traits appear to have evolved through a process of Brownian motion in the species studied, that is following the pattern of genetic drift or fluctuating selection, as opposed to towards a single optimal value or through evolutionary 'bursts'. By characterizing MHC class I gene diversity across Passerida in a systematic framework, this study provides a first step towards understanding this huge variation. © 2016 John Wiley & Sons Ltd.
Sources of Variance in Baseline Gene Expression in the Rodent Liver
Corton, J. Christopher; Bushel, Pierre R.; Fostel, Jennifer; O'Lone, Raegan B.
2012-01-01
The use of gene expression profiling in both clinical and laboratory settings would be enhanced by better characterization of variation due to individual, environmental, and technical factors. Analysis of microarray data from untreated or vehicle-treated animals within the control arm of toxicogenomics studies has yielded useful information on baseline fluctuations in liver gene expression in the rodent. Here, studies which highlight contributions of different factors to gene expression variability in the rodent liver are discussed including a large meta-analysis of rat liver, which identified genes that vary in control animals in the absence of chemical treatment. Genes and their pathways that are the most and least variable were identified in a number of these studies. Life stage, fasting, sex, diet, circadian rhythm and liver lobe source can profoundly influence gene expression in the liver. Recognition of biological and technical factors that contribute to variability of background gene expression can help the investigator in the design of an experiment that maximizes sensitivity and reduces the influence of confounders that may lead to misinterpretation of genomic changes. The factors that contribute to variability in liver gene expression in rodents are likely analogous to those contributing to human interindividual variability in drug response and chemical toxicity. Identification of batteries of genes that are altered in a variety of background conditions could be used to predict responses to drugs and chemicals in appropriate models of the human liver. PMID:22230429
Ji, S C; Pan, Y T; Lu, Q Y; Sun, Z Y; Liu, Y Z
2014-03-17
The purpose of this study was to identify critical genes associated with septic multiple trauma by comparing peripheral whole blood samples from multiple trauma patients with and without sepsis. A microarray data set was downloaded from the Gene Expression Omnibus (GEO) database. This data set included 70 samples, 36 from multiple trauma patients with sepsis and 34 from multiple trauma patients without sepsis (as a control set). The data were preprocessed, and differentially expressed genes (DEGs) were then screened for using packages of the R language. Functional analysis of DEGs was performed with DAVID. Interaction networks were then established for the most up- and down-regulated genes using HitPredict. Pathway-enrichment analysis was conducted for genes in the networks using WebGestalt. Fifty-eight DEGs were identified. The expression levels of PLAU (down-regulated) and MMP8 (up-regulated) presented the largest fold-changes, and interaction networks were established for these genes. Further analysis revealed that PLAT (plasminogen activator, tissue) and SERPINF2 (serpin peptidase inhibitor, clade F, member 2), which interact with PLAU, play important roles in the pathway of the complement and coagulation cascade. We hypothesize that PLAU is a major regulator of the complement and coagulation cascade, and down-regulation of PLAU results in dysfunction of the pathway, causing sepsis.
Moore, L.; Grobárová, V.; Shen, H.; Man, H. B.; Míčová, J.; Ledvina, M.; Štursa, J.; Nesladek, M.
2015-01-01
Nanodiamonds (NDs) are versatile nanoparticles that are currently being investigated for a variety of applications in drug delivery, biomedical imaging and nanoscale sensing. Although initial studies indicate that these small gems are biocompatible, there is a great deal of variability in synthesis methods and surface functionalization that has yet to be evaluated. Here we present a comprehensive analysis of the cellular compatibility of an array of nanodiamond subtypes and surface functionalization strategies. These results demonstrate that NDs are well tolerated by multiple cell types at both functional and gene expression levels. In addition, ND-mediated delivery of daunorubicin is less toxic to multiple cell types than treatment with daunorubicin alone, demonstrating the ability of the ND agent to improve drug tolerance and decrease therapeutic toxicity. Overall, the results here indicate that ND biocompatibility serves as a promising foundation for continued preclinical investigation. PMID:25037888
Moore, Laura; Grobárová, Valéria; Shen, Helen; Man, Han Bin; Míčová, Júlia; Ledvina, Miroslav; Štursa, Jan; Nesladek, Milos; Fišerová, Anna; Ho, Dean
2014-10-21
Nanodiamonds (NDs) are versatile nanoparticles that are currently being investigated for a variety of applications in drug delivery, biomedical imaging and nanoscale sensing. Although initial studies indicate that these small gems are biocompatible, there is a great deal of variability in synthesis methods and surface functionalization that has yet to be evaluated. Here we present a comprehensive analysis of the cellular compatibility of an array of nanodiamond subtypes and surface functionalization strategies. These results demonstrate that NDs are well tolerated by multiple cell types at both functional and gene expression levels. In addition, ND-mediated delivery of daunorubicin is less toxic to multiple cell types than treatment with daunorubicin alone, thus demonstrating the ability of the ND agent to improve drug tolerance and decrease therapeutic toxicity. Overall, the results here indicate that ND biocompatibility serves as a promising foundation for continued preclinical investigation.
NASA Astrophysics Data System (ADS)
Moore, Laura; Grobárová, Valéria; Shen, Helen; Man, Han Bin; Míčová, Júlia; Ledvina, Miroslav; Štursa, Jan; Nesladek, Milos; Fišerová, Anna; Ho, Dean
2014-09-01
Nanodiamonds (NDs) are versatile nanoparticles that are currently being investigated for a variety of applications in drug delivery, biomedical imaging and nanoscale sensing. Although initial studies indicate that these small gems are biocompatible, there is a great deal of variability in synthesis methods and surface functionalization that has yet to be evaluated. Here we present a comprehensive analysis of the cellular compatibility of an array of nanodiamond subtypes and surface functionalization strategies. These results demonstrate that NDs are well tolerated by multiple cell types at both functional and gene expression levels. In addition, ND-mediated delivery of daunorubicin is less toxic to multiple cell types than treatment with daunorubicin alone, thus demonstrating the ability of the ND agent to improve drug tolerance and decrease therapeutic toxicity. Overall, the results here indicate that ND biocompatibility serves as a promising foundation for continued preclinical investigation.
Effects of BDNF polymorphisms on antidepressant action.
Tsai, Shih-Jen; Hong, Chen-Jee; Liou, Ying-Jay
2010-12-01
Evidence suggests that the down-regulation of the signaling pathway involving brain-derived neurotrophic factor (BDNF), a molecular element known to regulate neuronal plasticity and survival, plays an important role in the pathogenesis of major depression. The restoration of BDNF activity induced by antidepressant treatment has been implicated in the antidepressant therapeutic mechanism. Because there is variability among patients with major depressive disorder in terms of response to antidepressant treatment and since genetic factors may contribute to this inter-individual variability in antidepressant response, pharmacogenetic studies have tested the associations between genetic polymorphisms in candidate genes related to antidepressant therapeutic action. In the human BDNF gene, there is a common functional polymorphism (Val66Met) in the pro-region of BDNF, which affects the intracellular trafficking of proBDNF. Because of the potentially important role of BDNF in the antidepressant mechanism, many pharmacogenetic studies have tested the association between this polymorphism and the antidepressant therapeutic response, but they have produced inconsistent results. A recent meta-analysis of eight studies, which included data from 1,115 subjects, suggested that the Val/Met carriers have increased antidepressant response in comparison to Val/Val homozygotes, particularly in the Asian population. The positive molecular heterosis effect (subjects heterozygous for a specific genetic polymorphism show a significantly greater effect) is compatible with animal studies showing that, although BDNF exerts an antidepressant effect, too much BDNF may have a detrimental effect on mood. Several recommendations are proposed for future antidepressant pharmacogenetic studies of BDNF, including the consideration of multiple polymorphisms and a haplotype approach, gene-gene interaction, a single antidepressant regimen, controlling for age and gender interactions, and pharmacogenetic effects on specific depressive symptom-clusters.
Statistical inference of the generation probability of T-cell receptors from sequence repertoires.
Murugan, Anand; Mora, Thierry; Walczak, Aleksandra M; Callan, Curtis G
2012-10-02
Stochastic rearrangement of germline V-, D-, and J-genes to create variable coding sequence for certain cell surface receptors is at the origin of immune system diversity. This process, known as "VDJ recombination", is implemented via a series of stochastic molecular events involving gene choices and random nucleotide insertions between, and deletions from, genes. We use large sequence repertoires of the variable CDR3 region of human CD4+ T-cell receptor beta chains to infer the statistical properties of these basic biochemical events. Because any given CDR3 sequence can be produced in multiple ways, the probability distribution of hidden recombination events cannot be inferred directly from the observed sequences; we therefore develop a maximum likelihood inference method to achieve this end. To separate the properties of the molecular rearrangement mechanism from the effects of selection, we focus on nonproductive CDR3 sequences in T-cell DNA. We infer the joint distribution of the various generative events that occur when a new T-cell receptor gene is created. We find a rich picture of correlation (and absence thereof), providing insight into the molecular mechanisms involved. The generative event statistics are consistent between individuals, suggesting a universal biochemical process. Our probabilistic model predicts the generation probability of any specific CDR3 sequence by the primitive recombination process, allowing us to quantify the potential diversity of the T-cell repertoire and to understand why some sequences are shared between individuals. We argue that the use of formal statistical inference methods, of the kind presented in this paper, will be essential for quantitative understanding of the generation and evolution of diversity in the adaptive immune system.
Evidence for polymorphism in the cytochrome P450 2D50 gene in horses.
Corado, C R; McKemie, D S; Young, A; Knych, H K
2016-06-01
Metabolism is an essential factor in the clearance of many drugs and as such plays a major role in the establishment of dosage regimens and withdrawal times. CYP2D6, the human orthologue of equine CYP2D50, is a drug-metabolizing enzyme that is highly polymorphic in humans, leading to widely differing levels of metabolic activity. As CYP2D6 is highly polymorphic, in this study it was hypothesized that the gene coding for the equine orthologue, CYP2D50, may also be prone to polymorphism. Blood samples were collected from 150 horses; the CYP2D50 gene was cloned and sequenced, and full-length sequences were analyzed for single nucleotide polymorphisms (SNPs), deletions, or insertions. Pharmacokinetic data were collected from a subset of horses following the administration of a single oral dose of tramadol, and probit analysis was used to calculate metabolic ratios. Prior to drug administration, the ability of recombinant CYP2D50 to metabolize tramadol to O-desmethyltramadol was confirmed. Sequencing of CYP2D50 identified 126 exonic SNPs, with 31 of those appearing in multiple horses. Oral administration of tramadol to a subset of these horses revealed variable metabolic ratios (tramadol: O-desmethyltramadol) in individual horses and separation into three metabolic groups. While a limited number of horses of primarily a single breed were studied, the variability in tramadol metabolism to O-desmethyltramadol between horses and preliminary evidence of what appear to be poor, extensive, and ultra-rapid metabolizers support further study of the potential for genetic polymorphisms in the CYP2D50 gene in horses. © 2015 John Wiley & Sons Ltd.
Howard, Timothy D.; Hsu, Fang-Chi; Grzywacz, Joseph G.; Chen, Haiying; Quandt, Sara A.; Vallejos, Quirina M.; Whalley, Lara E.; Cui, Wei; Padilla, Stephanie; Arcury, Thomas A.
2010-01-01
Background Organophosphate pesticides act as cholinesterase inhibitors. For those with agricultural exposure to these chemicals, risk of potential exposure-related health effects may be modified by genetic variability in cholinesterase metabolism. Cholinesterase activity is a useful, indirect measurement of pesticide exposure, especially in high-risk individuals such as farmworkers. To understand fully the links between pesticide exposure and potential human disease, analyses must be able to consider genetic variability in pesticide metabolism. Objectives We studied participants in the Community Participatory Approach to Measuring Farmworker Pesticide Exposure (PACE3) study to determine whether cholinesterase levels are associated with single-nucleotide polymorphisms (SNPs) involved in pesticide metabolism. Methods Cholinesterase levels were measured from blood samples taken from 287 PACE3 participants at up to four time points during the 2007 growing season. We performed association tests of cholinesterase levels and 256 SNPs in 30 candidate genes potentially involved in pesticide metabolism. A false discovery rate (FDR) p-value was used to account for multiple testing. Results Thirty-five SNPs were associated (unadjusted p < 0.05) based on at least one of the genetic models tested (general, additive, dominant, and recessive). The strongest evidence of association with cholinesterase levels was observed with two SNPs, rs2668207 and rs2048493, in the butyrylcholinesterase (BCHE) gene (FDR adjusted p = 0.15 for both; unadjusted p = 0.00098 and 0.00068, respectively). In participants with at least one minor allele, cholinesterase levels were lower by 4.3–9.5% at all time points, consistent with an effect that is independent of pesticide exposure. Conclusions Common genetic variation in the BCHE gene may contribute to subtle changes in cholinesterase levels. PMID:20529763
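The abstract above reports FDR-adjusted p-values without naming the procedure. As a minimal sketch, assuming the widely used Benjamini-Hochberg step-up adjustment (an assumption, not something the abstract confirms), the adjustment can be written as follows; the function name `benjamini_hochberg` is illustrative.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

A small unadjusted p-value can therefore end up with a much larger adjusted value when many tests are performed, which is the pattern reported for the two BCHE SNPs.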
Waardenburg syndrome type 4: report of two new cases caused by SOX10 mutations in Spain.
Fernández, Raquel M; Núñez-Ramos, Raquel; Enguix-Riego, M Valle; Román-Rodríguez, Francisco José; Galán-Gómez, Enrique; Blesa-Sánchez, Emilio; Antiñolo, Guillermo; Núñez-Núñez, Ramón; Borrego, Salud
2014-02-01
Shah-Waardenburg syndrome or Waardenburg syndrome type 4 (WS4) is a neurocristopathy characterized by the association of deafness, depigmentation and Hirschsprung disease. Three disease-causing genes have been identified so far for WS4: EDNRB, EDN3, and SOX10. SOX10 mutations, found in 45-55% of WS4 patients, are inherited in an autosomal dominant manner. In addition, mutations in SOX10 are also responsible for an extended syndrome involving peripheral and central neurological phenotypes, referred to as PCWH (peripheral demyelinating neuropathy, central dysmyelinating leucodystrophy, Waardenburg syndrome, Hirschsprung disease). Such mutations are mostly private, and a high intra- and inter-familial variability exists. In this report, we present one patient with WS4 and a second with PCWH due to SOX10 mutations, again supporting the genetic and phenotypic heterogeneity of these syndromes. Interestingly, the WS4 family carries an insertion of 19 nucleotides in exon 5 of SOX10, which results in distinct phenotypes across three generations: hypopigmentation in the maternal grandmother, hearing loss in the mother, and WS4 in the proband. Since mosaicism cannot explain the three different WS-related features observed in this family, we propose as the most plausible explanation the existence of additional molecular events, acting in an additive or multiplicative fashion, in genes or regulatory regions unidentified so far. On the other hand, the PCWH case was due to a de novo deletion in exon 5 of the gene. Efforts should be devoted to unravelling the mechanisms underlying the intrafamilial phenotypic variability observed in the affected families, and to identifying new genes responsible for the still unsolved WS4 cases. © 2013 Wiley Periodicals, Inc.
Aguiar, Bruno; Vieira, Jorge; Cunha, Ana E.; Fonseca, Nuno A.; Reboiro-Jato, David; Reboiro-Jato, Miguel; Fdez-Riverola, Florentino; Raspé, Olivier; Vieira, Cristina P.
2013-01-01
S-RNase-based gametophytic self-incompatibility evolved once before the split of the Asteridae and Rosidae. In Prunus (tribe Amygdaloideae of Rosaceae), the self-incompatibility S-pollen is a single F-box gene that presents the expected evolutionary signatures. In Malus and Pyrus (subtribe Pyrinae of Rosaceae), however, clusters of F-box genes (called SFBBs) have been described that are expressed in pollen only and are linked to the S-RNase gene. Although polymorphic, SFBB genes present levels of diversity lower than those of the S-RNase gene. They have been suggested as putative S-pollen genes, in a system of non-self recognition by multiple factors. Subsets of allelic products of the different SFBB genes interact with non-self S-RNases, marking them for degradation, and allowing compatible pollinations. This study performed a detailed characterization of SFBB genes in Sorbus aucuparia (Pyrinae) to address three predictions of the non-self recognition by multiple factors model. As predicted, the number of SFBB genes was large to account for the many S-RNase specificities. Secondly, like the S-RNase gene, the SFBB genes were old. Thirdly, amino acids under positive selection—those that could be involved in specificity determination—were identified when intra-haplotype SFBB genes were analysed using codon models. Overall, the findings reported here support the non-self recognition by multiple factors model. PMID:23606363
Zhang, Zhonghui; Wu, Elise; Qian, Zhijian; Wu, Wen-Shu
2014-01-01
Stable and efficient knockdown of multiple gene targets is highly desirable for dissection of molecular pathways. Because it allows sequence-specific DNA binding, transcription activator-like effector (TALE) offers a new genetic perturbation technique that allows for gene-specific repression. Here, we constructed a multicolor lentiviral TALE-Kruppel-associated box (KRAB) expression vector platform that enables knockdown of multiple gene targets. This platform is fully compatible with the Golden Gate TALEN and TAL Effector Kit 2.0, a widely used and efficient method for TALE assembly. We showed that this multicolor TALE-KRAB vector system when combined together with bone marrow transplantation could quickly knock down c-kit and PU.1 genes in hematopoietic stem and progenitor cells of recipient mice. Furthermore, our data demonstrated that this platform simultaneously knocked down both c-Kit and PU.1 genes in the same primary cell populations. Together, our results suggest that this multicolor TALE-KRAB vector platform is a promising and versatile tool for knockdown of multiple gene targets and could greatly facilitate dissection of molecular pathways. PMID:25475013
State Space Model with hidden variables for reconstruction of gene regulatory networks.
Wu, Xi; Li, Peng; Wang, Nan; Gong, Ping; Perkins, Edward J; Deng, Youping; Zhang, Chaoyang
2011-01-01
The State Space Model (SSM) is a relatively new approach to inferring gene regulatory networks. It requires less computational time than Dynamic Bayesian Networks (DBN). There are two types of variables in the linear SSM: observed variables and hidden variables. SSM uses an iterative method, namely Expectation-Maximization, to infer regulatory relationships from microarray datasets. The hidden variables cannot be directly observed from experiments, and the choice of the number of hidden variables has a significant impact on the accuracy of network inference. In this study, we used SSM to infer gene regulatory networks (GRNs) from synthetic time series datasets, investigated Bayesian Information Criterion (BIC) and Principal Component Analysis (PCA) approaches to determining the number of hidden variables in SSM, and evaluated the performance of SSM in comparison with DBN. True GRNs and synthetic gene expression datasets were generated using GeneNetWeaver. Both DBN and linear SSM were used to infer GRNs from the synthetic datasets. The inferred networks were compared with the true networks. Our results show that inference precision varied with the number of hidden variables. For some regulatory networks, the inference precision of DBN was higher, but SSM performed better in other cases. Although the overall performance of the two approaches is comparable, SSM is much faster and capable of inferring much larger networks than DBN. This study provides useful information for handling the hidden variables and improving the inference precision.
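To make the BIC-based choice of the number of hidden variables concrete, here is a minimal sketch. It assumes the standard definition BIC = -2 log L + k log n and, as a simplification that is not taken from the paper, counts only the entries of the state-transition and observation matrices as free parameters. The function names and the log-likelihood values are hypothetical; in practice the log-likelihoods would come from fitting the SSM by Expectation-Maximization at each candidate dimension.

```python
import math


def bic(log_likelihood, n_params, n_obs):
    """Standard Bayesian Information Criterion (lower is better)."""
    return -2.0 * log_likelihood + n_params * math.log(n_obs)


def select_hidden_dim(loglik_by_dim, n_genes, n_obs):
    """Pick the hidden dimension with the lowest BIC.

    loglik_by_dim maps a candidate number of hidden variables h to the
    maximized log-likelihood of a model fitted with that h.
    """
    scores = {}
    for h, loglik in loglik_by_dim.items():
        # Assumption: h*h transition entries plus n_genes*h observation entries.
        n_params = h * h + n_genes * h
        scores[h] = bic(loglik, n_params, n_obs)
    best = min(scores, key=scores.get)
    return best, scores
```

With made-up log-likelihoods of -500, -420 and -415 for one, two and three hidden variables (10 genes, 50 observations), the small gain from the third hidden variable does not offset its parameter penalty, so two hidden variables would be selected.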
Tettelin, Hervé; Masignani, Vega; Cieslewicz, Michael J.; Donati, Claudio; Medini, Duccio; Ward, Naomi L.; Angiuoli, Samuel V.; Crabtree, Jonathan; Jones, Amanda L.; Durkin, A. Scott; DeBoy, Robert T.; Davidsen, Tanja M.; Mora, Marirosa; Scarselli, Maria; Margarit y Ros, Immaculada; Peterson, Jeremy D.; Hauser, Christopher R.; Sundaram, Jaideep P.; Nelson, William C.; Madupu, Ramana; Brinkac, Lauren M.; Dodson, Robert J.; Rosovitz, Mary J.; Sullivan, Steven A.; Daugherty, Sean C.; Haft, Daniel H.; Selengut, Jeremy; Gwinn, Michelle L.; Zhou, Liwei; Zafar, Nikhat; Khouri, Hoda; Radune, Diana; Dimitrov, George; Watkins, Kisha; O'Connor, Kevin J. B.; Smith, Shannon; Utterback, Teresa R.; White, Owen; Rubens, Craig E.; Grandi, Guido; Madoff, Lawrence C.; Kasper, Dennis L.; Telford, John L.; Wessels, Michael R.; Rappuoli, Rino; Fraser, Claire M.
2005-01-01
The development of efficient and inexpensive genome sequencing methods has revolutionized the study of human bacterial pathogens and improved vaccine design. Unfortunately, the sequence of a single genome does not reflect how genetic variability drives pathogenesis within a bacterial species and also limits genome-wide screens for vaccine candidates or for antimicrobial targets. We have generated the genomic sequence of six strains representing the five major disease-causing serotypes of Streptococcus agalactiae, the main cause of neonatal infection in humans. Analysis of these genomes and those available in databases showed that the S. agalactiae species can be described by a pan-genome consisting of a core genome shared by all isolates, accounting for ≈80% of any single genome, plus a dispensable genome consisting of partially shared and strain-specific genes. Mathematical extrapolation of the data suggests that the gene reservoir available for inclusion in the S. agalactiae pan-genome is vast and that unique genes will continue to be identified even after sequencing hundreds of genomes. PMID:16172379
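The pan-genome decomposition described above reduces to set operations on per-strain gene content. The following sketch uses hypothetical strain and gene identifiers; it illustrates the definitions of core and dispensable genome only and does not reproduce the authors' pipeline or their extrapolation model.

```python
def pan_and_core(genomes):
    """Split gene content into pan-genome, core genome and dispensable genome.

    genomes maps a strain name to the set of gene identifiers it carries.
    """
    gene_sets = list(genomes.values())
    pan = set().union(*gene_sets)            # every gene seen in any strain
    core = set.intersection(*gene_sets)      # genes shared by all strains
    dispensable = pan - core                 # partially shared or strain-specific
    return pan, core, dispensable


def core_fraction(genomes, strain):
    """Fraction of one strain's genes that belong to the core genome."""
    _, core, _ = pan_and_core(genomes)
    return len(core) / len(genomes[strain])
```

Calling `core_fraction` on each strain gives the per-genome share of core genes, the quantity the abstract reports as roughly 80% for S. agalactiae.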
Rapid Assembly of Customized TALENs into Multiple Delivery Systems
Zhang, Zhengxing; Zhang, Siliang; Huang, Xin; Orwig, Kyle E.; Sheng, Yi
2013-01-01
Transcriptional activator-like effector nucleases (TALENs) have become a powerful tool for genome editing. Here we present an efficient TALEN assembly approach in which TALENs are assembled by direct Golden Gate ligation into Gateway® Entry vectors from a repeat variable di-residue (RVD) plasmid array. We constructed TALEN pairs targeted to mouse Ddx3 subfamily genes, and demonstrated that our modified TALEN assembly approach efficiently generates accurate TALEN moieties that effectively introduce mutations into target genes. We generated “user friendly” TALEN Entry vectors containing TALEN expression cassettes with fluorescent reporter genes that can be efficiently transferred via Gateway (LR) recombination into different delivery systems. We demonstrated that the TALEN Entry vectors can be easily transferred to an adenoviral delivery system to expand application to cells that are difficult to transfect. Since TALENs work in pairs, we also generated a TALEN Entry vector set that combines a TALEN pair into one PiggyBac transposon-based destination vector. The approach described here can also be modified for construction of TALE transcriptional activators, repressors or other functional domains. PMID:24244669
Estimating the probability for major gene Alzheimer disease
DOE Office of Scientific and Technical Information (OSTI.GOV)
Farrer, L.A.; Cupples, L.A.
1994-02-01
Alzheimer disease (AD) is a neuropsychiatric illness caused by multiple etiologies. Prediction of whether AD is genetically based in a given family is problematic because of censoring bias among unaffected relatives as a consequence of the late onset of the disorder, diagnostic uncertainties, heterogeneity, and limited information in a single family. The authors have developed a method based on Bayesian probability to compute values for a continuous variable that ranks AD families as having a major gene form of AD (MGAD). In addition, they have compared the Bayesian method with a maximum-likelihood approach. These methods incorporate sex- and age-adjusted risk estimates and allow for phenocopies and familial clustering of age at onset. Agreement is high between the two approaches for ranking families as MGAD (Spearman rank [r] = .92). When either method is used, the numerical outcomes are sensitive to assumptions of the gene frequency and cumulative incidence of the disease in the population. Consequently, risk estimates should be used cautiously for counseling purposes; however, there are numerous valid applications of these procedures in genetic and epidemiological studies. 41 refs., 4 figs., 3 tabs.
Namgoong, Suhg; Cheong, Hyun Sub; Kim, Ji On; Kim, Lyoung Hyo; Na, Han Sung; Koh, In Song; Chung, Myeon Woo; Shin, Hyoung Doo
2015-11-01
Organic anion-transporting polypeptide (OATP; gene symbol, SLCO) transporters are generally involved in the uptake of multiple drugs and their metabolites at most epithelial barriers. The pattern of single-nucleotide polymorphisms (SNPs) in these transporters may be a determinant of interindividual variability in drug disposition and response. The objective of this study was to define the distribution of SNPs of three SLCO genes, SLCO1B1, SLCO1B3, and SLCO2B1, in a Korean population and other ethnic groups. Genomic DNA from 450 interethnic subjects was screened using the Illumina GoldenGate assay for 11 pharmacogenetic core variants and 76 HapMap tagging SNPs. The genotype distribution of the Korean population was similar to that of East Asian populations, but significantly different from those of African American and European American cohorts. These interethnic differences will be useful information for prospective studies, including genetic association and pharmacogenetic studies of drug metabolism by SLCO families. Copyright © 2015 Elsevier B.V. All rights reserved.
Gillot, Guillaume; Jany, Jean-Luc; Dominguez-Santos, Rebeca; Poirier, Elisabeth; Debaets, Stella; Hidalgo, Pedro I; Ullán, Ricardo V; Coton, Emmanuel; Coton, Monika
2017-04-01
Mycophenolic acid (MPA) is a secondary metabolite produced by various Penicillium species including Penicillium roqueforti. The MPA biosynthetic pathway was recently described in Penicillium brevicompactum. In this study, an in silico analysis of the P. roqueforti FM164 genome sequence localized a 23.5-kb putative MPA gene cluster. The cluster contains seven genes putatively coding for seven proteins (MpaA, MpaB, MpaC, MpaDE, MpaF, MpaG, MpaH) and is highly similar (i.e. gene synteny, sequence homology) to the P. brevicompactum cluster. To confirm the involvement of this gene cluster in MPA biosynthesis, gene silencing using RNA interference targeting mpaC, encoding a putative polyketide synthase, was performed in a high MPA-producing P. roqueforti strain (F43-1). In the obtained transformants, decreased MPA production (measured by LC-Q-TOF/MS) was correlated with reduced mpaC gene expression measured by Q-RT-PCR. In parallel, mycotoxin quantification on multiple P. roqueforti strains suggested strain-dependent MPA production. Thus, the entire MPA cluster was sequenced for P. roqueforti strains with contrasting MPA production, and a 174-bp deletion in mpaC was observed in low MPA producers. PCRs directed towards the deleted region among 55 strains showed an excellent correlation with MPA quantification. Our results indicated the clear involvement of the mpaC gene, as well as the surrounding cluster, in P. roqueforti MPA biosynthesis. Copyright © 2016 Elsevier Ltd. All rights reserved.
The king cobra genome reveals dynamic gene evolution and adaptation in the snake venom system.
Vonk, Freek J; Casewell, Nicholas R; Henkel, Christiaan V; Heimberg, Alysha M; Jansen, Hans J; McCleary, Ryan J R; Kerkkamp, Harald M E; Vos, Rutger A; Guerreiro, Isabel; Calvete, Juan J; Wüster, Wolfgang; Woods, Anthony E; Logan, Jessica M; Harrison, Robert A; Castoe, Todd A; de Koning, A P Jason; Pollock, David D; Yandell, Mark; Calderon, Diego; Renjifo, Camila; Currier, Rachel B; Salgado, David; Pla, Davinia; Sanz, Libia; Hyder, Asad S; Ribeiro, José M C; Arntzen, Jan W; van den Thillart, Guido E E J M; Boetzer, Marten; Pirovano, Walter; Dirks, Ron P; Spaink, Herman P; Duboule, Denis; McGlinn, Edwina; Kini, R Manjunatha; Richardson, Michael K
2013-12-17
Snakes are limbless predators, and many species use venom to help overpower relatively large, agile prey. Snake venoms are complex protein mixtures encoded by several multilocus gene families that function synergistically to cause incapacitation. To examine venom evolution, we sequenced and interrogated the genome of a venomous snake, the king cobra (Ophiophagus hannah), and compared it, together with our unique transcriptome, microRNA, and proteome datasets from this species, with data from other vertebrates. In contrast to the platypus, the only other venomous vertebrate with a sequenced genome, we find that snake toxin genes evolve through several distinct co-option mechanisms and exhibit surprisingly variable levels of gene duplication and directional selection that correlate with their functional importance in prey capture. The enigmatic accessory venom gland shows a very different pattern of toxin gene expression from the main venom gland and seems to have recruited toxin-like lectin genes repeatedly for new nontoxic functions. In addition, tissue-specific microRNA analyses suggested the co-option of core genetic regulatory components of the venom secretory system from a pancreatic origin. Although the king cobra is limbless, we recovered coding sequences for all Hox genes involved in amniote limb development, with the exception of Hoxd12. Our results provide a unique view of the origin and evolution of snake venom and reveal multiple genome-level adaptive responses to natural selection in this complex biological weapon system. More generally, they provide insight into mechanisms of protein evolution under strong selection.
Hoppman-Chaney, N; Wain, K; Seger, P R; Superneau, D W; Hodge, J C
2013-04-01
The 15q13.3 microdeletion syndrome (OMIM #612001) is characterized by a wide range of phenotypic features, including intellectual disability, seizures, autism, and psychiatric conditions. This deletion is inherited in approximately 75% of cases and has been found in mildly affected and normal parents, consistent with variable expressivity and incomplete penetrance. The common deletion is approximately 2 Mb and contains several genes; however, the gene(s) responsible for the resulting clinical features have not been clearly defined. Recently, four probands were reported with small deletions including only the CHRNA7 gene. These patients showed a wide range of phenotypic features similar to those associated with the larger 15q13.3 microdeletion. To further correlate genotype and phenotype, we queried our database of >15,000 patients tested in the Mayo Clinic Cytogenetics Laboratory from 2008 to 2011 and identified 19 individuals (10 probands and 9 family members) with isolated heterozygous CHRNA7 gene deletions. All but two infants displayed multiple features consistent with 15q13.3 microdeletion syndrome. We also identified the first de novo deletion confined to CHRNA7 as well as the second known case with homozygous deletion of CHRNA7 only. These results provide further evidence implicating CHRNA7 as the gene responsible for the clinical findings associated with 15q13.3 microdeletion. © 2012 John Wiley & Sons A/S. Published by Blackwell Publishing Ltd.
Zhou, Haibo; Liu, Junlai; Zhou, Changyang; Gao, Ni; Rao, Zhiping; Li, He; Hu, Xinde; Li, Changlin; Yao, Xuan; Shen, Xiaowen; Sun, Yidi; Wei, Yu; Liu, Fei; Ying, Wenqin; Zhang, Junming; Tang, Cheng; Zhang, Xu; Xu, Huatai; Shi, Linyu; Cheng, Leping; Huang, Pengyu; Yang, Hui
2018-03-01
Despite rapid progress in the genome-editing field, in vivo simultaneous overexpression of multiple genes remains challenging. We generated a transgenic mouse using an improved dCas9 system that enables simultaneous and precise in vivo transcriptional activation of multiple genes and long noncoding RNAs in the nervous system. As proof of concept, we were able to use targeted activation of endogenous neurogenic genes in these transgenic mice to directly and efficiently convert astrocytes into functional neurons in vivo. This system provides a flexible and rapid screening platform for studying complex gene networks and gain-of-function phenotypes in the mammalian brain.
Adebola, Adijat A; Di Castri, Theo; He, Chui-Zhen; Salvatierra, Laura A; Zhao, Jian; Brown, Kristy; Lin, Chyuan-Sheng; Worman, Howard J; Liem, Ronald K H
2015-04-15
Charcot-Marie-Tooth disease (CMT) is the most commonly inherited neurological disorder with a prevalence of 1 in 2500 people worldwide. Patients suffer from degeneration of the peripheral nerves that control sensory information of the foot/leg and hand/arm. Multiple mutations in the neurofilament light polypeptide gene, NEFL, cause CMT2E. Previous studies in transfected cells showed that expression of disease-associated neurofilament light chain variants results in abnormal intermediate filament networks associated with defects in axonal transport. We have now generated knock-in mice with two different point mutations in Nefl: P8R that has been reported in multiple families with variable age of onset and N98S that has been described as an early-onset, sporadic mutation in multiple individuals. Nefl(P8R/+) and Nefl(P8R/P8R) mice were indistinguishable from Nefl(+/+) in terms of behavioral phenotype. In contrast, Nefl(N98S/+) mice had a noticeable tremor, and most animals showed a hindlimb clasping phenotype. Immunohistochemical analysis revealed multiple inclusions in the cell bodies and proximal axons of spinal cord neurons, disorganized processes in the cerebellum and abnormal processes in the cerebral cortex and pons. Abnormal processes were observed as early as post-natal day 7. Electron microscopic analysis of sciatic nerves showed a reduction in the number of neurofilaments, an increase in the number of microtubules and a decrease in the axonal diameters. The Nefl(N98S/+) mice provide an excellent model to study the pathogenesis of CMT2E and should prove useful for testing potential therapies. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Liu, Li-Zhi; Wu, Fang-Xiang; Zhang, Wen-Jun
2014-01-01
As an abstract mapping of the gene regulations in the cell, a gene regulatory network is important to both biological research and practical applications. The reverse engineering of gene regulatory networks from microarray gene expression data is a challenging research problem in systems biology. With the development of biological technologies, multiple time-course gene expression datasets might be collected for a specific gene network under different circumstances. The inference of a gene regulatory network can be improved by integrating these multiple datasets. It is also known that gene expression data may be contaminated with large errors or outliers, which may affect the inference results. A novel method, Huber group LASSO, is proposed to infer the same underlying network topology from multiple time-course gene expression datasets while also being robust to large errors or outliers. To solve the optimization problem involved in the proposed method, an efficient algorithm that combines the ideas of auxiliary function minimization and block descent is developed. A stability selection method is adapted to our method to find a network topology consisting of edges with scores. The proposed method is applied to both simulation datasets and real experimental datasets. The results show that Huber group LASSO outperforms the group LASSO in terms of both areas under receiver operating characteristic curves and areas under the precision-recall curves. The convergence analysis of the algorithm theoretically shows that the sequence generated by the algorithm converges to the optimal solution of the problem. The simulation and real data examples demonstrate the effectiveness of the Huber group LASSO in integrating multiple time-course gene expression datasets and improving the resistance to large errors or outliers.
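The two ingredients named in the abstract above, a Huber loss for robustness and a group penalty that ties each edge's coefficients together across datasets, can be sketched as follows. This uses the textbook definitions of both terms; the exact objective, weighting and solver in the paper are not given in the abstract, so the function names, the threshold `delta` and the way the terms are combined are assumptions.

```python
import math


def huber(r, delta=1.0):
    """Huber loss: quadratic for small residuals, linear for large ones."""
    a = abs(r)
    return 0.5 * r * r if a <= delta else delta * (a - 0.5 * delta)


def group_penalty(coefs_by_dataset):
    """Sum over regulators of the L2 norm of that regulator's coefficients
    across datasets; coefs_by_dataset[k][j] is regulator j in dataset k."""
    n_reg = len(coefs_by_dataset[0])
    return sum(
        math.sqrt(sum(coefs[j] ** 2 for coefs in coefs_by_dataset))
        for j in range(n_reg)
    )


def objective(residuals_by_dataset, coefs_by_dataset, lam, delta=1.0):
    """Robust data-fit term plus lam times the group penalty (assumed form)."""
    loss = sum(huber(r, delta) for res in residuals_by_dataset for r in res)
    return loss + lam * group_penalty(coefs_by_dataset)
```

Because the penalty acts on the norm of a whole group, an edge is either kept in all datasets or shrunk to zero in all of them, which is how a single shared topology emerges. The linear tail of the Huber loss keeps a single outlying residual from dominating the fit.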
Kassambara, Alboukadel; Hose, Dirk; Moreaux, Jérôme; Walker, Brian A.; Protopopov, Alexei; Reme, Thierry; Pellestor, Franck; Pantesco, Véronique; Jauch, Anna; Morgan, Gareth; Goldschmidt, Hartmut; Klein, Bernard
2012-01-01
Background Genetic abnormalities are common in patients with multiple myeloma, and may deregulate gene products involved in tumor survival, proliferation, metabolism and drug resistance. In particular, translocations may result in a high expression of targeted genes (termed spike expression) in tumor cells. We identified spike genes in multiple myeloma cells of patients with newly-diagnosed myeloma and investigated their prognostic value. Design and Methods Genes with a spike expression in multiple myeloma cells were selected using box plot probe set signal distribution and two selection filters. Results In a cohort of 206 newly diagnosed patients with multiple myeloma, 2587 genes/expressed sequence tags with a spike expression were identified. As expected, some spike genes were associated with transcription factors such as MAF or MMSET and with known recurrent translocations. Spike genes were not associated with increased DNA copy number and, for the majority of them, the mechanisms involved are unknown. Of spiked genes, 36.7% clustered significantly in 149 out of 862 documented chromosome (sub)bands, of which 53 had prognostic value (35 bad, 18 good). Their prognostic value was summarized with a spike band score that delineated 23.8% of patients with a poor median overall survival (27.4 months versus not reached, P<0.001) using the training cohort of 206 patients. The spike band score was independent of other gene expression profiling-based risk scores, t(4;14), or del17p in an independent validation cohort of 345 patients. Conclusions We present a new approach to identify spike genes and their relationship to patients’ survival. PMID:22102711
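A box plot criterion for spike expression can be sketched as flagging patients whose signal for a probe set lies above the upper fence of the distribution. The abstract does not state the fence multiplier or the two selection filters, so the multiplier `k`, the quantile method and the function name below are assumptions for illustration only.

```python
import statistics


def spike_samples(signals, k=3.0):
    """Return indices of samples whose signal exceeds Q3 + k * IQR.

    signals: expression values of one probe set across patients.
    k: fence multiplier (hypothetical; not given in the abstract).
    """
    q1, _, q3 = statistics.quantiles(signals, n=4, method="inclusive")
    upper_fence = q3 + k * (q3 - q1)
    return [i for i, value in enumerate(signals) if value > upper_fence]
```

A probe set would then count as a spike gene candidate when only a small subset of patients is flagged, which matches the idea of high expression confined to a few tumors.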
Lê Cao, Kim-Anh; Boitard, Simon; Besse, Philippe
2011-06-22
Variable selection on high throughput biological data, such as gene expression or single nucleotide polymorphisms (SNPs), becomes inevitable to select relevant information and, therefore, to better characterize diseases or assess genetic structure. There are different ways to perform variable selection in large data sets. Statistical tests are commonly used to identify differentially expressed features for explanatory purposes, whereas Machine Learning wrapper approaches can be used for predictive purposes. In the case of multiple highly correlated variables, another option is to use multivariate exploratory approaches to give more insight into cell biology, biological pathways or complex traits. A simple extension of a sparse PLS exploratory approach is proposed to perform variable selection in a multiclass classification framework. sPLS-DA has a classification performance similar to other wrapper or sparse discriminant analysis approaches on public microarray and SNP data sets. More importantly, sPLS-DA is clearly competitive in terms of computational efficiency and superior in terms of interpretability of the results via valuable graphical outputs. sPLS-DA is available in the R package mixOmics, which is dedicated to the analysis of large biological data sets.
Keel, Brittney N; Zarek, Christina M; Keele, John W; Kuehn, Larry A; Snelling, Warren M; Oliver, William T; Freetly, Harvey C; Lindholm-Perry, Amanda K
2018-06-04
Feed intake and body weight gain are economically important inputs and outputs of beef production systems. The purpose of this study was to discover differentially expressed genes that will be robust for feed intake and gain across a large segment of the cattle industry. Transcriptomic studies often suffer from issues with reproducibility and cross-validation. One way to improve reproducibility is by integrating multiple datasets via meta-analysis. RNA sequencing (RNA-Seq) was performed on longissimus dorsi muscle from 80 steers (5 cohorts, each with 16 animals) selected from the outside fringe of a bivariate gain and feed intake distribution to understand the genes and pathways involved in feed efficiency. In each cohort, 16 steers were selected from one of four gain and feed intake phenotypes (n = 4 per phenotype) in a 2 × 2 factorial arrangement with gain and feed intake as main effect variables. Each cohort was analyzed as a single experiment using a generalized linear model and results from the 5 cohort analyses were combined in a meta-analysis to identify differentially expressed genes (DEG) across the cohorts. A total of 51 genes were differentially expressed for the main effect of gain, 109 genes for the intake main effect, and 11 genes for the gain x intake interaction (P corrected < 0.05). A jackknife sensitivity analysis showed that, in general, the meta-analysis produced robust DEGs for the two main effects and their interaction. Pathways identified from over-represented genes included mitochondrial energy production and oxidative stress pathways for the main effect of gain due to DEG including GPD1, NDUFA6, UQCRQ, ACTC1, and MGST3. For intake, metabolic pathways including amino acid biosynthesis and degradation were identified, and for the interaction analysis the pathways identified included GADD45, pyridoxal 5'phosphate salvage, and caveolar mediated endocytosis signaling. 
Variation among DEG identified by cohort suggests that environment and breed may play large roles in the expression of genes associated with feed efficiency in the muscle of beef cattle. Meta-analyses of transcriptome data from groups of animals over multiple cohorts may be necessary to elucidate the genetics contributing to these types of biological phenotypes.
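The cohort-level meta-analysis and jackknife sensitivity check described above can be illustrated with a minimal sketch. The abstract does not say which combination rule was used, so Fisher's method is an assumption here, and the function names and example p-values are hypothetical.

```python
import math

def fisher_combined_p(p_values):
    """Combine independent p-values for one gene with Fisher's method (assumed rule)."""
    k = len(p_values)
    # Half of the Fisher statistic: -sum(ln p); the full statistic is chi-square with 2k df
    half_stat = -sum(math.log(p) for p in p_values)
    # Closed-form chi-square survival function, valid for even degrees of freedom
    term, total = 1.0, 1.0
    for i in range(1, k):
        term *= half_stat / i
        total += term
    return math.exp(-half_stat) * total

def jackknife_combined_p(p_values):
    """Recompute the combined p-value with each cohort left out in turn."""
    return [
        fisher_combined_p(p_values[:i] + p_values[i + 1:])
        for i in range(len(p_values))
    ]

# Hypothetical per-cohort p-values for a single gene across 5 cohorts
cohort_p = [0.01, 0.04, 0.20, 0.03, 0.50]
print(fisher_combined_p(cohort_p))
print(jackknife_combined_p(cohort_p))
```

A gene whose combined p-value stays small in every leave-one-out run is not driven by a single cohort, which is the sense in which a jackknife analysis indicates robustness.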
Gene variants associated with antisocial behaviour: A latent variable approach
Bentley, Mary Jane; Lin, Haiqun; Fernandez, Thomas V.; Lee, Maria; Yrigollen, Carolyn M.; Pakstis, Andrew J.; Katsovich, Liliya; Olds, David L.; Grigorenko, Elena L.; Leckman, James F.
2013-01-01
Objective The aim of this study was to determine if a latent variable approach might be useful in identifying shared variance across genetic risk alleles that is associated with antisocial behaviour at age 15 years. Methods Using a conventional latent variable approach, we derived an antisocial phenotype in 328 adolescents utilizing data from a 15-year follow-up of a randomized trial of a prenatal and infancy nurse-home visitation program in Elmira, New York. We then investigated, via a novel latent variable approach, 450 informative genetic polymorphisms in 71 genes previously associated with antisocial behaviour, drug use, affiliative behaviours, and stress response in 241 consenting individuals for whom DNA was available. Haplotype and Pathway analyses were also performed. Results Eight single-nucleotide polymorphisms (SNPs) from 8 genes contributed to the latent genetic variable that in turn accounted for 16.0% of the variance within the latent antisocial phenotype. The number of risk alleles was linearly related to the latent antisocial variable scores. Haplotypes that included the putative risk alleles for all 8 genes were also associated with higher latent antisocial variable scores. In addition, 33 SNPs from 63 of the remaining genes were also significant when added to the final model. Many of these genes interact on a molecular level, forming molecular networks. The results support a role for genes related to dopamine, norepinephrine, serotonin, glutamate, opioid, and cholinergic signaling as well as stress response pathways in mediating susceptibility to antisocial behaviour. Conclusions This preliminary study supports use of relevant behavioural indicators and latent variable approaches to study the potential “co-action” of gene variants associated with antisocial behaviour. It also underscores the cumulative relevance of common genetic variants for understanding the etiology of complex behaviour. 
If replicated in future studies, this approach may allow the identification of a ‘shared’ variance across genetic risk alleles associated with complex neuropsychiatric dimensional phenotypes using relatively small numbers of well-characterized research participants. PMID:23822756
Using msa-2b as a molecular marker for genotyping Mexican isolates of Babesia bovis.
Genis, Alma D; Perez, Jocelin; Mosqueda, Juan J; Alvarez, Antonio; Camacho, Minerva; Muñoz, Maria de Lourdes; Rojas, Carmen; Figueroa, Julio V
2009-12-01
Variable merozoite surface antigens of Babesia bovis are exposed glycoproteins having a role in erythrocyte invasion. Members of this gene family include msa-1 and msa-2 (msa-2c, msa-2a(1), msa-2a(2) and msa-2b). To determine the sequence variation among B. bovis Mexican isolates using msa-2b as a genetic marker, PCR amplicons corresponding to msa-2b were cloned and plasmids carrying the corresponding inserts were purified and sequenced. Comparative analysis of nucleotide and deduced amino acid sequences revealed distinct degrees of variability and identity among the coding gene sequences obtained from 16 geographically different Mexican B. bovis isolates and a reference strain. Clustal-W multiple alignments of the MSA-2b deduced amino acid sequences, performed with the 17 B. bovis Mexican isolates, identified three genotypes, each with a distinct set of amino acid residues present at the variable region: Genotype I represented by the MO7 strain (in vitro culture-derived from the Mexico isolate) as well as RAD, Chiapas-1, Tabasco and Veracruz-3 isolates; Genotype II, represented by the Jalisco, Mexico and Veracruz-2 isolates; and Genotype III comprising the sequences from most of the isolates studied, Tamaulipas-1, Chiapas-2, Guerrero-1, Nayarit, Quintana Roo, Nuevo Leon, Tamaulipas-2, Yucatan and Guerrero-2. Moreover, these three genotypes could be discriminated from each other by using a PCR-RFLP approach. The results suggest that indels within the variable region of msa-2b sequences can be useful markers for identifying a particular genotype present in field populations of B. bovis isolated from infected cattle in Mexico.
Lodh, Nilanjan; Kerans, Billie L; Stevens, Lori
2012-01-01
Understanding the genetic structure of parasite populations on the natural landscape can reveal important aspects of disease ecology and epidemiology and can indicate parasite dispersal across the landscape. Myxobolus cerebralis (Myxozoa: Myxosporea), the causative agent of whirling disease in the definitive host Tubifex tubifex, is native to Eurasia and has spread to more than 25 states in the USA. The small amounts of data available to date suggest that M. cerebralis has little genetic variability. We examined the genetic variability of parasites infecting the definitive host T. tubifex in the Madison River, MT, and also from other parts of North America and Europe. We cloned and sequenced 18S ribosomal DNA and the internal transcribed spacer-1 (ITS-1) gene. Five oligochaetes were examined for 18S and five for ITS-1; only one individual was examined for both genes. We found two different 18S rRNA haplotypes of M. cerebralis from five worms and both intra- and interworm genetic variation for ITS-1, which showed 16 different haplotypes from among 20 clones. Comparison of our sequences with those from other studies revealed that M. cerebralis from MT was similar to the parasite collected from Alaska, Oregon, California, and Virginia in the USA and from Munich, Germany, based on 18S, whereas parasite sequences from West Virginia were very different. Combined with the high haplotype diversity of ITS-1 and uniqueness of ITS-1 haplotypes, our results show that M. cerebralis is more variable than previously thought and raise the possibility of multiple introductions of the parasite into North America. © 2011 The Author(s) Journal of Eukaryotic Microbiology © 2011 International Society of Protistologists.
Kühne, Annett; Kaiser, Rolf; Schirmer, Markus; Heider, Ulrike; Muhlke, Sabine; Niere, Wiebke; Overbeck, Tobias; Hohloch, Karin; Trümper, Lorenz; Sezer, Orhan; Brockmöller, Jürgen
2007-07-01
Melphalan is widely used in the treatment of multiple myeloma. Pharmacokinetics of this alkylating drug shows high inter-individual variability. As melphalan is a phenylalanine derivative, the pharmacokinetic variability may be determined by genetic polymorphisms in the L-type amino acid transporters LAT1 (SLC7A5) and LAT2 (SLC7A8). Pharmacokinetics were analysed in 64 patients after first administration of intravenous melphalan. Severity of side effects was documented according to WHO criteria. Genomic DNA was analysed for polymorphisms in LAT1 and LAT2 by sequencing of the entire coding region, intron-exon boundaries and 2 kb upstream promoter region. Selected polymorphisms in the common heavy chain of both transporters, the protein 4F2hc (SLC3A2), were analysed by single nucleotide primer extension. Melphalan pharmacokinetics was highly variable with up to 6.2-fold differences in total clearance. A total of 44 polymorphisms were identified in LAT1 and 21 polymorphisms in LAT2. From all variants, only five were in the coding region and only one heterozygous non-synonymous polymorphism (Ala94Thr) was found in LAT2. Numerous polymorphisms were found in the LAT1 and LAT2 5'-flanking regions but did not correlate with expression of the respective genes. No significant correlations could be observed between the polymorphisms in 4F2hc, LAT1, and LAT2 with melphalan pharmacokinetics or with melphalan side effects. The study confirmed that these transporter genes are highly conserved, particularly in the coding sequences. Genetic variation in 4F2hc, LAT1, and LAT2 does not appear to be a major cause of inter-individual variability in pharmacokinetics and of adverse reactions to melphalan.
Hovel-Miner, Galadriel; Pampou, Sergey; Faucher, Sebastien P; Clarke, Margaret; Morozova, Irina; Morozov, Pavel; Russo, James J; Shuman, Howard A; Kalachikov, Sergey
2009-04-01
Legionella pneumophila is the causative agent of the severe and potentially fatal pneumonia Legionnaires' disease. L. pneumophila is able to replicate within macrophages and protozoa by establishing a replicative compartment in a process that requires the Icm/Dot type IVB secretion system. The signals and regulatory pathways required for Legionella infection and intracellular replication are poorly understood. Mutation of the rpoS gene, which encodes sigma(S), does not affect growth in rich medium but severely decreases L. pneumophila intracellular multiplication within protozoan hosts. To gain insight into the intracellular multiplication defect of an rpoS mutant, we examined its pattern of gene expression during exponential and postexponential growth. We found that sigma(S) affects distinct groups of genes that contribute to Legionella intracellular multiplication. We demonstrate that rpoS mutants have a functional Icm/Dot system yet are defective for the expression of many genes encoding Icm/Dot-translocated substrates. We also show that sigma(S) affects the transcription of the cpxR and pmrA genes, which encode two-component response regulators that directly affect the transcription of Icm/Dot substrates. Our characterization of the L. pneumophila small RNA csrB homologs, rsmY and rsmZ, introduces a link between sigma(S) and the posttranscriptional regulator CsrA. We analyzed the network of sigma(S)-controlled genes by mutational analysis of transcriptional regulators affected by sigma(S). One of these, encoding the L. pneumophila arginine repressor homolog gene, argR, is required for maximal intracellular growth in amoebae. These data show that sigma(S) is a key regulator of multiple pathways required for L. pneumophila intracellular multiplication.
Targeted and efficient transfer of multiple value-added genes into wheat varieties
USDA-ARS's Scientific Manuscript database
With an objective to optimize an approach to transfer multiple value added genes to a wheat variety while maintaining and improving agronomic performance, two alleles with mutations in the acetolactate synthase (ALS) gene located on wheat chromosomes 6B and 6D providing tolerance to imidazolinone (I...
Li, Qin; Li, Jing; Sun, Jin-Long; Ma, Xian-Feng; Wang, Ting-Ting; Berkey, Robert; Yang, Hui; Niu, Ying-Ze; Fan, Jing; Li, Yan; Xiao, Shunyuan; Wang, Wen-Ming
2016-01-01
The Resistance to Powdery Mildew 8 (RPW8) locus confers broad-spectrum resistance to powdery mildew in Arabidopsis thaliana. There are four Homologous to RPW8s (BrHRs) in Brassica rapa and three in Brassica oleracea (BoHRs). Brassica napus (Bn) is derived from diploidization of a hybrid between B. rapa and B. oleracea, thus should have seven homologs of RPW8 (BnHRs). It is unclear whether these genes are still maintained or lost in B. napus after diploidization and how they might have evolved. Here, we reported the identification and sequence polymorphisms of BnHRs from a set of B. napus accessions. Our data indicated that while the BoHR copy from B. oleracea is highly conserved, the BrHR copy from B. rapa is relatively variable in the B. napus genome owing to multiple evolutionary events, such as gene loss, point mutation, insertion, deletion, and intragenic recombination. Given the overall high sequence homology of BnHR genes, it is not surprising that intragenic recombination both between two orthologs and between two paralogs was detected in B. napus, which may explain the loss of BoHR genes in some B. napus accessions. When ectopically expressed in Arabidopsis, a C-terminally truncated version of BnHRa and BnHRb, as well as the full length BnHRd fused with YFP at their C-termini could trigger cell death in the absence of pathogens and enhanced resistance to powdery mildew disease. Moreover, subcellular localization analysis showed that both BnHRa-YFP and BnHRb-YFP were mainly localized to the extra-haustorial membrane encasing the haustorium of powdery mildew. Taken together, our data suggest that the duplicated BnHR genes might have been subjected to differential selection and at least some may play a role in defense and could serve as a resistance resource in engineering disease-resistant plants.
Wang, Yi-Ting; Sung, Pei-Yuan; Lin, Peng-Lin; Yu, Ya-Wen; Chung, Ren-Hua
2015-05-15
Genome-wide association studies (GWAS) have become a common approach to identifying single nucleotide polymorphisms (SNPs) associated with complex diseases. As complex diseases are caused by the joint effects of multiple genes, while the effect of individual gene or SNP is modest, a method considering the joint effects of multiple SNPs can be more powerful than testing individual SNPs. The multi-SNP analysis aims to test association based on a SNP set, usually defined based on biological knowledge such as gene or pathway, which may contain only a portion of SNPs with effects on the disease. Therefore, a challenge for the multi-SNP analysis is how to effectively select a subset of SNPs with promising association signals from the SNP set. We developed the Optimal P-value Threshold Pedigree Disequilibrium Test (OPTPDT). The OPTPDT uses general nuclear families. A variable p-value threshold algorithm is used to determine an optimal p-value threshold for selecting a subset of SNPs. A permutation procedure is used to assess the significance of the test. We used simulations to verify that the OPTPDT has correct type I error rates. Our power studies showed that the OPTPDT can be more powerful than the set-based test in PLINK, the multi-SNP FBAT test, and the p-value based test GATES. We applied the OPTPDT to a family-based autism GWAS dataset for gene-based association analysis and identified MACROD2-AS1 with genome-wide significance (p-value = 2.5 × 10⁻⁶). Our simulation results suggested that the OPTPDT is a valid and powerful test. The OPTPDT will be helpful for gene-based or pathway association analysis. The method is ideal for the secondary analysis of existing GWAS datasets, which may identify a set of SNPs with joint effects on the disease.
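The idea of a variable p-value threshold assessed by permutation can be sketched as follows. This is not the OPTPDT implementation: the per-family score, the sum-of-squares set statistic, the sign-flipping permutation, the threshold grid, and all function names are assumptions chosen only to keep the illustration short.

```python
import math
import random

def snp_z_scores(family_diffs, signs):
    """Per-SNP z-scores from per-family transmitted-minus-untransmitted counts."""
    n_snps = len(family_diffs[0])
    z = []
    for j in range(n_snps):
        total = sum(s * fam[j] for s, fam in zip(signs, family_diffs))
        scale = math.sqrt(sum(fam[j] ** 2 for fam in family_diffs))
        z.append(total / scale if scale > 0 else 0.0)
    return z

def threshold_stats(z, thresholds):
    """For each threshold, sum z^2 over SNPs whose two-sided normal p-value passes it."""
    p = [math.erfc(abs(v) / math.sqrt(2)) for v in z]
    return [sum(v * v for v, pv in zip(z, p) if pv <= t) for t in thresholds]

def optimal_threshold_test(family_diffs, thresholds=(0.05, 0.2, 1.0),
                           n_perm=499, seed=0):
    """Permutation p-value for the best-performing threshold (min-p over thresholds)."""
    rng = random.Random(seed)
    n_fam = len(family_diffs)
    # Replicate 0 is the observed data; the others flip each family's sign at random,
    # which keeps the correlation between SNPs within a family intact.
    sign_sets = [[1] * n_fam] + [
        [rng.choice((-1, 1)) for _ in range(n_fam)] for _ in range(n_perm)
    ]
    stats = [threshold_stats(snp_z_scores(family_diffs, s), thresholds)
             for s in sign_sets]
    n_rep = len(stats)
    # Empirical p-value of every replicate at every threshold, then its minimum
    min_p = []
    for row in stats:
        ps = [sum(other[k] >= row[k] for other in stats) / n_rep
              for k in range(len(thresholds))]
        min_p.append(min(ps))
    # Overall p-value: share of replicates whose best threshold does at least as well
    return sum(m <= min_p[0] for m in min_p) / n_rep

# Hypothetical data: 40 families, SNP 0 always over-transmitted, SNP 1 balanced
families = [[1, 1 if i % 2 else -1] for i in range(40)]
print(optimal_threshold_test(families))
```

Taking the minimum p-value across thresholds and then comparing it with the same quantity in every permuted replicate corrects for having searched over several thresholds, so the final p-value is not inflated by that search.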
Bennett, Matthew S.; Triemer, Richard E.; Preisfeld, Angelika
2017-01-01
Background Over the last few years multiple studies have been published showing a great diversity in size of chloroplast genomes (cpGenomes), and in the arrangement of gene clusters, in the Euglenales. However, while these genomes provided important insights into the evolution of cpGenomes across the Euglenales and within their genera, only two genomes were analyzed in regard to genomic variability between and within Euglenales and Eutreptiales. To better understand the dynamics of chloroplast genome evolution in early evolving Eutreptiales, this study focused on the cpGenome of Eutreptiella pomquetensis, and the spread and peculiarities of introns. Methods The Etl. pomquetensis cpGenome was sequenced, annotated and afterwards examined in structure, size, gene order and intron content. These features were compared with other euglenoid cpGenomes as well as those of prasinophyte green algae, including Pyramimonas parkeae. Results and Discussion With about 130,561 bp the chloroplast genome of Etl. pomquetensis, a basal taxon in the phototrophic euglenoids, was considerably larger than the two other Eutreptiales cpGenomes sequenced so far. Although the detected quadripartite structure resembled most green algae and plant chloroplast genomes, the gene content of the single copy regions in Etl. pomquetensis was completely different from those observed in green algae and plants. The gene composition of Etl. pomquetensis was extensively changed and turned out to be almost identical to other Eutreptiales and Euglenales, and not to P. parkeae. Furthermore, the cpGenome of Etl. pomquetensis was unexpectedly permeated by a high number of introns, which led to a substantially larger genome. The 51 identified introns of Etl. pomquetensis showed two major unique features: (i) more than half of the introns displayed a high level of pairwise identities; (ii) no group III introns could be identified in the protein coding genes. 
These findings support the hypothesis that group III introns are degenerated group II introns and evolved later. PMID:28852596
Multi-layered mutation in hedgehog-related genes in Gorlin syndrome may affect the phenotype.
Onodera, Shoko; Saito, Akiko; Hasegawa, Daigo; Morita, Nana; Watanabe, Katsuhito; Nomura, Takeshi; Shibahara, Takahiko; Ohba, Shinsuke; Yamaguchi, Akira; Azuma, Toshifumi
2017-01-01
Gorlin syndrome is a genetic disorder of autosomal dominant inheritance that predisposes the affected individual to a variety of disorders that are attributed largely to heterozygous germline patched1 (PTCH1) mutations. PTCH1 is a hedgehog (Hh) receptor as well as a repressor, mutation of which leads to constitutive activation of Hh pathway. Hh pathway encompasses a wide variety of cellular signaling cascades, which involve several molecules; however, no associated genotype-phenotype correlations have been reported. Recently, mutations in Suppressor of fused homolog (SUFU) or PTCH2 were reported in patients with Gorlin syndrome. These facts suggest that multi-layered mutations in Hh pathway may contribute to the development of Gorlin syndrome. We demonstrated multiple mutations of Hh-related genes in addition to PTCH1, which possibly act in an additive or multiplicative manner and lead to Gorlin syndrome. High-throughput sequencing was performed to analyze exome sequences in four unrelated Gorlin syndrome patient genomes. Mutations in PTCH1 gene were detected in all four patients. Specific nucleotide variations or frameshift variations of PTCH1 were identified along with the inferred amino acid changes in all patients. We further filtered 84 different genes which are closely related to Hh signaling. Fifty three of these had enough coverage of over ×30. The sequencing results were filtered and compared to reduce the number of sequence variants identified in each of the affected individuals. We discovered three genes, PTCH2, BOC, and WNT9b, with mutations with a predicted functional impact assessed by MutationTaster2 or PolyPhen-2 (Polymorphism Phenotyping v2) analysis. It is noticeable that PTCH2 and BOC are Hh receptor molecules. No significant mutations were observed in SUFU. Multi-layered mutations in Hh pathway may change the activation level of the Hh signals, which may explain the wide phenotypic variability of Gorlin syndrome.
Nolden, T; Pfaff, F; Nemitz, S; Freuling, C M; Höper, D; Müller, T; Finke, Stefan
2016-04-05
Reverse genetics approaches are indispensable tools for proof of concepts in virus replication and pathogenesis. For negative strand RNA viruses (NSVs) the limited number of infectious cDNA clones represents a bottleneck as clones are often generated from cell culture adapted or attenuated viruses, with limited potential for pathogenesis research. We developed a system in which cDNA copies of complete NSV genomes were directly cloned into reverse genetics vectors by linear-to-linear RedE/T recombination. Rapid cloning of multiple rabies virus (RABV) full length genomes and identification of clones identical to field virus consensus sequence confirmed the approach's reliability. Recombinant viruses were recovered from field virus cDNA clones. Similar growth kinetics of parental and recombinant viruses, preservation of field virus characters in cell type specific replication and virulence in the mouse model were confirmed. Reduced titers after reporter gene insertion indicated that the low level of field virus replication is affected by gene insertions. The flexibility of the strategy was demonstrated by cloning multiple copies of an orthobunyavirus L genome segment. This important step in reverse genetics technology development opens novel avenues for the analysis of virus variability combined with phenotypical characterization of recombinant viruses at a clonal level.
Cassidy, Suzanne B; Driscoll, Daniel J
2009-01-01
Prader-Willi syndrome (PWS) is a highly variable genetic disorder affecting multiple body systems whose most consistent major manifestations include hypotonia with poor suck and poor weight gain in infancy; mild mental retardation, hypogonadism, growth hormone insufficiency causing short stature for the family, early childhood-onset hyperphagia and obesity, characteristic appearance, and behavioral and sometimes psychiatric disturbance. Many more minor characteristics can be helpful in diagnosis and important in management. PWS is an example of a genetic condition involving genomic imprinting. It can occur by three main mechanisms, which lead to absence of expression of paternally inherited genes in the 15q11.2-q13 region: paternal microdeletion, maternal uniparental disomy, and imprinting defect.
Haack, S.K.; Duris, J.W.; Fogarty, L.R.; Kolpin, D.W.; Focazio, M.J.; Furlong, E.T.; Meyer, M.T.
2009-01-01
The objective of this study was to compare fecal indicator bacteria (FIB) (fecal coliforms, Escherichia coli [EC], and enterococci [ENT]) concentrations with a wide array of typical organic wastewater chemicals and selected bacterial genes as indicators of fecal pollution in water samples collected at or near 18 surface water drinking water intakes. Genes tested included esp (indicating human-pathogenic ENT) and nine genes associated with various animal sources of shiga-toxin-producing EC (STEC). Fecal pollution was indicated by genes and/or chemicals for 14 of the 18 tested samples, with little relation to FIB standards. Of 13 samples with <50 EC 100 mL⁻¹, human pharmaceuticals or chemical indicators of wastewater treatment plant effluent occurred in six, veterinary antibiotics were detected in three, and stx1 or stx2 genes (indicating varying animal sources of STEC) were detected in eight. Only the EC eaeA gene was positively correlated with FIB concentrations. Human-source fecal pollution was indicated by the esp gene and the human pharmaceutical carbamazepine in one of the nine samples that met all FIB recreational water quality standards. Escherichia coli rfbO157 and stx2c genes, which are typically associated with cattle sources and are of potential human health significance, were detected in one sample in the absence of tested chemicals. Chemical and gene-based indicators of fecal contamination may be present even when FIB standards are met, and some may, unlike FIB, indicate potential sources. Application of multiple water quality indicators with variable environmental persistence and fate may yield greater confidence in fecal pollution assessment and may inform remediation decisions. Copyright © 2009 by the American Society of Agronomy, Crop Science Society of America, and Soil Science Society of America. All rights reserved.
Reynolds, Chandra A; Gatz, Margaret; Christensen, Kaare; Christiansen, Lene; Dahl Aslan, Anna K; Kaprio, Jaakko; Korhonen, Tellervo; Kremen, William S; Krueger, Robert; McGue, Matt; Neiderhiser, Jenae M; Pedersen, Nancy L
2016-01-01
Despite emerging interest in gene-environment interaction (G×E) effects, there is a dearth of studies evaluating its potential relevance apart from specific hypothesized environments and biometrical variance trends. Using a monozygotic within-pair approach, we evaluated evidence of G×E for body mass index (BMI), depressive symptoms, and cognition (verbal, spatial, attention, working memory, perceptual speed) in twin studies from four countries. We also evaluated whether APOE is a 'variability gene' across these measures and whether it partly represents the 'G' in G×E effects. In all three domains, G×E effects were pervasive across country and gender, with small-to-moderate effects. Age-cohort trends were generally stable for BMI and depressive symptoms; however, they were variable, with both increasing and decreasing age-cohort trends, for different cognitive measures. Results also suggested that APOE may represent a 'variability gene' for depressive symptoms and spatial reasoning, but not for BMI or other cognitive measures. Hence, additional genes are salient beyond APOE.
Modise, David M.; Gemeildien, Junaid; Ndimba, Bongani K.; Christoffels, Alan
2018-01-01
Background Crop response to the changing climate and unpredictable effects of global warming with adverse conditions such as drought stress has brought concerns about food security to the fore; crop yield loss is a major cause of concern in this regard. Identification of genes with multiple responses across environmental stresses is the genetic foundation that leads to crop adaptation to environmental perturbations. Methods In this paper, we introduce an integrated approach to assess candidate genes for multiple stress responses across-species. The approach combines ontology based semantic data integration with expression profiling, comparative genomics, phylogenomics, functional gene enrichment and gene enrichment network analysis to identify genes associated with plant stress phenotypes. Five different ontologies, viz., Gene Ontology (GO), Trait Ontology (TO), Plant Ontology (PO), Growth Ontology (GRO) and Environment Ontology (EO) were used to semantically integrate drought related information. Results Target genes linked to Quantitative Trait Loci (QTLs) controlling yield and stress tolerance in sorghum (Sorghum bicolor (L.) Moench) and closely related species were identified. Based on the enriched GO terms of the biological processes, 1116 sorghum genes with potential responses to 5 different stresses, such as drought (18%), salt (32%), cold (20%), heat (8%) and oxidative stress (25%) were identified to be over-expressed. Out of 169 sorghum drought responsive QTLs associated genes that were identified based on expression datasets, 56% were shown to have multiple stress responses. On the other hand, out of 168 additional genes that have been evaluated for orthologous pairs, 90% were conserved across species for drought tolerance. Over 50% of identified maize and rice genes were responsive to drought and salt stresses and were co-located within multifunctional QTLs. 
Among the total identified multi-stress responsive genes, 272 targets were shown to be co-localized within QTLs associated with different traits that are responsive to multiple stresses. Ontology mapping was used to validate the identified genes, while reconstruction of the phylogenetic tree was instrumental to infer the evolutionary relationship of the sorghum orthologs. The results also show specific genes responsible for various interrelated components of drought response mechanism such as drought tolerance, drought avoidance and drought escape. Conclusions We submit that this approach is novel and to our knowledge, has not been used previously in any other research; it enables us to perform cross-species queries for genes that are likely to be associated with multiple stress tolerance, as a means to identify novel targets for engineering stress resistance in sorghum and possibly, in other crop species. PMID:29590108
Detection of susceptibility genes as modifiers due to subgroup differences in complex disease.
Bergen, Sarah E; Maher, Brion S; Fanous, Ayman H; Kendler, Kenneth S
2010-08-01
Complex diseases invariably involve multiple genes and often exhibit variable symptom profiles. The extent to which disease symptoms, course, and severity differ between affected individuals may result from underlying genetic heterogeneity. Genes with modifier effects may or may not also influence disease susceptibility. In this study, we have simulated data in which a subset of cases differ by some effect size (ES) on a quantitative trait and are also enriched for a risk allele. Power to detect this 'pseudo-modifier' gene in case-only and case-control designs was explored blind to case substructure. Simulations involved 1000 iterations and calculations for 80% power at P<0.01 while varying the risk allele frequency (RAF), sample size (SS), ES, odds ratio (OR), and proportions of the case subgroups. With realistic values for the RAF (0.20), SS (3000) and ES (1), an OR of 1.7 is necessary to detect a pseudo-modifier gene. Unequal numbers of subjects in the case groups result in little decrement in power until the group enriched for the risk allele is <30% or >70% of the total case population. In practice, greater numbers of subjects and selection of a quantitative trait with a large range will provide researchers with greater power to detect a pseudo-modifier gene. However, even under ideal conditions, studies involving alleles with low frequencies or low ORs are usually underpowered for detection of a modifier or susceptibility gene. This may explain some of the inconsistent association results for many candidate gene studies of complex diseases.
Vavougios, George D; Zarogiannis, Sotirios G; Krogfelt, Karen Angeliki; Gourgoulianis, Konstantinos; Mitsikostas, Dimos Dimitrios; Hadjigeorgiou, Georgios
2018-01-01
Currently, only 4 studies have explored the potential role of PARK7's dysregulation in MS pathophysiology, and no study has evaluated the potential role of the PARK7 interactome in MS. The aim of our study was to assess the differential expression of PARK7 mRNA in peripheral blood mononuclear cells (PBMCs) donated by MS patients versus healthy controls using data mining techniques. The PARK7 interactome data from the GDS3920 profile were scrutinized for differentially expressed genes (DEGs); Gene Enrichment Analysis (GEA) was used to detect significantly enriched biological functions. 27 differentially expressed genes in the MS dataset were detected; 12 of these (NDUFA4, UBA2, TDP2, NPM1, NDUFS3, SUMO1, PIAS2, KIAA0101, RBBP4, NONO, RBBP7 and HSPA4) are reported for the first time in MS. Stepwise Linear Discriminant Function Analysis constructed a predictive model (Wilks' λ = 0.176, χ² = 45.204, p = 1.5275 × 10⁻¹⁰) with 2 variables (TDP2, RBBP4) that achieved 96.6% accuracy when discriminating between patients and controls. Gene Enrichment Analysis revealed that induction and regulation of programmed / intrinsic cell death represented the most salient Gene Ontology annotations. Cross-validation on systemic lupus erythematosus and ischemic stroke datasets revealed that these functions are unique to the MS dataset. Based on our results, novel potential target genes are revealed; these differentially expressed genes regulate epigenetic and apoptotic pathways that may further elucidate underlying mechanisms of autoreactivity in MS. Copyright © 2017 Elsevier B.V. All rights reserved.
Genome-Wide Analysis of Gene-Gene and Gene-Environment Interactions Using Closed-Form Wald Tests.
Yu, Zhaoxia; Demetriou, Michael; Gillen, Daniel L
2015-09-01
Despite the successful discovery of hundreds of variants for complex human traits using genome-wide association studies, the degree to which genes and environmental risk factors jointly affect disease risk is largely unknown. One obstacle toward this goal is that the computational effort required for testing gene-gene and gene-environment interactions is enormous. As a result, numerous computationally efficient tests were recently proposed. However, the validity of these methods often relies on unrealistic assumptions such as additive main effects, main effects at only one variable, no linkage disequilibrium between the two single-nucleotide polymorphisms (SNPs) in a pair or gene-environment independence. Here, we derive closed-form and consistent estimates for interaction parameters and propose to use Wald tests for testing interactions. The Wald tests are asymptotically equivalent to the likelihood ratio tests (LRTs), largely considered to be the gold standard tests but generally too computationally demanding for genome-wide interaction analysis. Simulation studies show that the proposed Wald tests have very similar performances with the LRTs but are much more computationally efficient. Applying the proposed tests to a genome-wide study of multiple sclerosis, we identify interactions within the major histocompatibility complex region. In this application, we find that (1) focusing on pairs where both SNPs are marginally significant leads to more significant interactions when compared to focusing on pairs where at least one SNP is marginally significant; and (2) parsimonious parameterization of interaction effects might decrease, rather than increase, statistical power. © 2015 WILEY PERIODICALS, INC.
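As a small illustration of what a closed-form Wald test for an interaction looks like, the sketch below handles the simplest case only: a binary genotype indicator and a binary exposure in a case-control sample. This coding and the function name are assumptions and are not the estimators derived in the paper. In this saturated setting the interaction estimate is the difference of two log odds ratios, and its variance is the sum of the reciprocals of the eight cell counts.

```python
import math

def wald_interaction_test(cells):
    """Wald test for a multiplicative G x E interaction with binary G and E.

    cells[e][g] = (cases, controls) for exposure level e and genotype level g,
    each coded 0 or 1. All eight counts must be positive.
    """
    def log_odds_ratio(e):
        (cases0, controls0), (cases1, controls1) = cells[e][0], cells[e][1]
        return math.log(cases1 * controls0 / (cases0 * controls1))

    # Interaction estimate: genotype log odds ratio among exposed minus unexposed
    estimate = log_odds_ratio(1) - log_odds_ratio(0)
    # Large-sample variance: sum of reciprocals of all eight cell counts
    variance = sum(1.0 / n for e in (0, 1) for g in (0, 1) for n in cells[e][g])
    z = estimate / math.sqrt(variance)
    # Two-sided p-value from the standard normal distribution
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return estimate, z, p_value

# Hypothetical counts: the genotype raises risk only among the exposed
counts = [
    [(50, 50), (50, 50)],    # unexposed: genotype 0, genotype 1
    [(50, 50), (100, 25)],   # exposed:   genotype 0, genotype 1
]
print(wald_interaction_test(counts))
```

Because the estimate and its variance come directly from cell counts, no iterative model fitting is needed, which is what makes this kind of test cheap enough to repeat over very many SNP pairs or SNP-exposure pairs.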
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bozza, M.; Gerard, C.; Kolakowski, L.F. Jr.
1995-06-10
Macrophage migration inhibitory factor, MIF, is a cytokine released by T-lymphocytes, macrophages, and the pituitary gland that serves to integrate peripheral and central inflammatory responses. Ubiquitous expression and developmental regulation suggest that MIF may have additional roles outside of the immune system. Here we report the structure and chromosomal location of the mouse Mif gene and the partial characterization of five Mif pseudogenes. The mouse Mif gene spans less than 0.7 kb of chromosomal DNA and is composed of three exons. A comparison between the mouse and the human genes shows a similar gene structure and common regulatory elements in both promoter regions. The mouse Mif gene maps to the middle region of chromosome 10, between Bcr and S100b, which have been mapped to human chromosomes 22q11 and 21q22.3, respectively. The entire sequence of two pseudogenes demonstrates the absence of introns, the presence of the 5′ untranslated region of the cDNA, a 3′ poly(A) tail, and the lack of sequence similarity with untranscribed regions of the gene. The five pseudogenes are highly homologous to the cDNA, but contain a variable number of mutations that would produce mutated or truncated MIF-like proteins. Phylogenetic analyses of MIF genes and pseudogenes indicate several independent genetic events that can account for multiple genomic integrations. Three of the Mif pseudogenes were also mapped by interspecific backcross to chromosomes 1, 9, and 17. These results suggest that Mif pseudogenes originated by retrotransposition. 46 refs., 5 figs., 1 tab.
Kutschera, Verena E.; Bidon, Tobias; Hailer, Frank; Rodi, Julia L.; Fain, Steven R.; Janke, Axel
2014-01-01
Ursine bears are a mammalian subfamily that comprises six morphologically and ecologically distinct extant species. Previous phylogenetic analyses of concatenated nuclear genes could not resolve all relationships among bears, and appeared to conflict with the mitochondrial phylogeny. Evolutionary processes such as incomplete lineage sorting and introgression can cause gene tree discordance and complicate phylogenetic inferences, but are not accounted for in phylogenetic analyses of concatenated data. We generated a high-resolution data set of autosomal introns from several individuals per species and of Y-chromosomal markers. Incorporating intraspecific variability in coalescence-based phylogenetic and gene flow estimation approaches, we traced the genealogical history of individual alleles. Considerable heterogeneity among nuclear loci and discordance between nuclear and mitochondrial phylogenies were found. A species tree with divergence time estimates indicated that ursine bears diversified within less than 2 My. Consistent with a complex branching order within a clade of Asian bear species, we identified unidirectional gene flow from Asian black into sloth bears. Moreover, gene flow detected from brown into American black bears can explain the conflicting placement of the American black bear in mitochondrial and nuclear phylogenies. These results highlight that both incomplete lineage sorting and introgression are prominent evolutionary forces even on time scales up to several million years. Complex evolutionary patterns are not adequately captured by strictly bifurcating models, and can only be fully understood when analyzing multiple independently inherited loci in a coalescence framework. Phylogenetic incongruence among gene trees hence needs to be recognized as a biologically meaningful signal. PMID:24903145
Wu, Wentao; Liu, Yaxue; Wang, Yuqian; Li, Huimin; Liu, Jiaxi; Tan, Jiaxin; He, Jiadai; Bai, Jingwen; Ma, Haoli
2017-10-08
The plant hormone auxin plays pivotal roles in many aspects of plant growth and development. The auxin/indole-3-acetic acid (Aux/IAA) gene family encodes short-lived nuclear proteins acting on auxin perception and signaling, but the evolutionary history of this gene family remains to be elucidated. In this study, the Aux/IAA gene family in 17 plant species covering all major lineages of plants is identified and analyzed by using multiple bioinformatics methods. A total of 434 Aux/IAA genes was found among these plant species, and the gene copy number ranges from three (Physcomitrella patens) to 63 (Glycine max). The phylogenetic analysis shows that the canonical Aux/IAA proteins can be generally divided into five major clades, and the origin of Aux/IAA proteins could be traced back to the common ancestor of land plants and green algae. Many truncated Aux/IAA proteins were found, and some of these truncated Aux/IAA proteins may be generated from the C-terminal truncation of auxin response factor (ARF) proteins. Our results indicate that tandem and segmental duplications play dominant roles for the expansion of the Aux/IAA gene family mainly under purifying selection. The putative nuclear localization signals (NLSs) in Aux/IAA proteins are conserved, and two kinds of new primordial bipartite NLSs in P. patens and Selaginella moellendorffii were discovered. Our findings not only give insights into the origin and expansion of the Aux/IAA gene family, but also provide a basis for understanding their functions during the course of evolution.
Wu, Wentao; Liu, Yaxue; Wang, Yuqian; Li, Huimin; Liu, Jiaxi; Tan, Jiaxin; He, Jiadai; Bai, Jingwen
2017-01-01
The plant hormone auxin plays pivotal roles in many aspects of plant growth and development. The auxin/indole-3-acetic acid (Aux/IAA) gene family encodes short-lived nuclear proteins acting on auxin perception and signaling, but the evolutionary history of this gene family remains to be elucidated. In this study, the Aux/IAA gene family in 17 plant species covering all major lineages of plants is identified and analyzed by using multiple bioinformatics methods. A total of 434 Aux/IAA genes was found among these plant species, and the gene copy number ranges from three (Physcomitrella patens) to 63 (Glycine max). The phylogenetic analysis shows that the canonical Aux/IAA proteins can be generally divided into five major clades, and the origin of Aux/IAA proteins could be traced back to the common ancestor of land plants and green algae. Many truncated Aux/IAA proteins were found, and some of these truncated Aux/IAA proteins may be generated from the C-terminal truncation of auxin response factor (ARF) proteins. Our results indicate that tandem and segmental duplications play dominant roles for the expansion of the Aux/IAA gene family mainly under purifying selection. The putative nuclear localization signals (NLSs) in Aux/IAA proteins are conserved, and two kinds of new primordial bipartite NLSs in P. patens and Selaginella moellendorffii were discovered. Our findings not only give insights into the origin and expansion of the Aux/IAA gene family, but also provide a basis for understanding their functions during the course of evolution. PMID:28991190
Effects of electrofishing gear type on spatial and temporal variability in fish community sampling
Meador, M.R.; McIntyre, J.P.
2003-01-01
Fish community data collected from 24 major river basins between 1993 and 1998 as part of the U.S. Geological Survey's National Water-Quality Assessment Program were analyzed to assess multiple-reach (three consecutive reaches) and multiple-year (three consecutive years) variability in samples collected at a site. Variability was assessed using the coefficient of variation (CV; SD/mean) of species richness, the Jaccard index (JI), and the percent similarity index (PSI). Data were categorized by three electrofishing sample collection methods: backpack, towed barge, and boat. Overall, multiple-reach CV values were significantly lower than those for multiple years, whereas multiple-reach JI and PSI values were significantly greater than those for multiple years. Multiple-reach and multiple-year CV values did not vary significantly among electrofishing methods, although JI and PSI values were significantly greatest for backpack electrofishing across multiple reaches and multiple years. The absolute difference between mean species richness for multiple-reach samples and mean species richness for multiple-year samples was 0.8 species (9.5% of total species richness) for backpack samples, 1.7 species (10.1%) for towed-barge samples, and 4.5 species (24.4%) for boat-collected samples. Review of boat-collected fish samples indicated that representatives of four taxonomic families - Catostomidae, Centrarchidae, Cyprinidae, and Ictaluridae - were collected at all sites. Of these, catostomids exhibited greater interannual variability than centrarchids, cyprinids, or ictalurids. Caution should be exercised when combining boat-collected fish community data from different years because of relatively high interannual variability, which is primarily due to certain relatively mobile species. Such variability may obscure longer-term trends.
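The three variability measures named in this abstract can be sketched as follows. Only the CV definition (SD/mean) comes from the abstract; the Jaccard index and percent similarity index are written here in their common textbook forms, the use of the sample standard deviation is an assumption, and all function names and inputs are hypothetical.

```python
from statistics import mean, stdev

def coefficient_of_variation(richness_values):
    """CV = SD / mean of species richness (sample SD assumed)."""
    return stdev(richness_values) / mean(richness_values)

def jaccard_index(species_a, species_b):
    """Shared species divided by all species seen in either sample."""
    a, b = set(species_a), set(species_b)
    return len(a & b) / len(a | b)

def percent_similarity(counts_a, counts_b):
    """PSI: sum over species of the smaller relative abundance, in percent."""
    total_a, total_b = sum(counts_a.values()), sum(counts_b.values())
    species = set(counts_a) | set(counts_b)
    return 100 * sum(
        min(counts_a.get(s, 0) / total_a, counts_b.get(s, 0) / total_b)
        for s in species
    )
```

Lower CV and higher JI or PSI both indicate more repeatable samples, which is why the abstract reports them moving in opposite directions between the multiple-reach and multiple-year comparisons.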
Tong, Zheng; Wang, Dan; Sun, Yong; Yang, Qian; Meng, Xueru; Wang, Limin; Feng, Weiqiang; Li, Ling; Wurtele, Eve Syrkin; Wang, Xuchu
2017-05-02
Rubber elongation factor (REF) and small rubber particle protein (SRPP) are two key factors for natural rubber biosynthesis. To further understand the roles of these proteins in rubber formation, six different genes for latex abundant REF or SRPP proteins, including REF138, REF175, REF258 and SRPP117, SRPP204, SRPP243, were characterized from Hevea brasiliensis Reyan (RY) 7-33-97. Sequence analysis showed that REFs have a variable and long N-terminal, whereas SRPPs have a variable and long C-terminal beyond the REF domain, and REF258 has a β subunit of ATPase in its N-terminal. Through two-dimensional electrophoresis (2-DE), each REF/SRPP protein was separated into multiple protein spots on 2-DE gels, indicating they have multiple protein species. The abundance of REF/SRPP proteins was compared between ethylene and control treatments or among rubber tree clones with different levels of latex productivity by analyzing 2-DE gels. The total abundance of each REF/SRPP protein decreased or changed little upon ethylene stimulation, whereas the abundance of multiple protein species of the same REF/SRPP changed diversely. Among the three rubber tree clones, the abundance of the protein species also differed significantly. Especially, two protein species of REF175 or REF258 were ethylene-responsive only in the high latex productivity clone RY 8-79 instead of in RY 7-33-97 and PR 107. Some individual protein species were positively related to ethylene stimulation and latex productivity. These results suggested that the specific protein species could be more important than others for rubber production and post-translational modifications might play important roles in rubber biosynthesis.
Killgore, George; Thompson, Angela; Johnson, Stuart; Brazier, Jon; Kuijper, Ed; Pepin, Jacques; Frost, Eric H; Savelkoul, Paul; Nicholson, Brad; van den Berg, Renate J; Kato, Haru; Sambol, Susan P; Zukowski, Walter; Woods, Christopher; Limbago, Brandi; Gerding, Dale N; McDonald, L Clifford
2008-02-01
Using 42 isolates contributed by laboratories in Canada, The Netherlands, the United Kingdom, and the United States, we compared the results of analyses done with seven Clostridium difficile typing techniques: multilocus variable-number tandem-repeat analysis (MLVA), amplified fragment length polymorphism (AFLP), surface layer protein A gene sequence typing (slpAST), PCR-ribotyping, restriction endonuclease analysis (REA), multilocus sequence typing (MLST), and pulsed-field gel electrophoresis (PFGE). We assessed the discriminating ability and typeability of each technique as well as the agreement among techniques in grouping isolates by allele profile A (AP-A) through AP-F, which are defined by toxinotype, the presence of the binary toxin gene, and deletion in the tcdC gene. We found that all isolates were typeable by all techniques and that discrimination index scores for the techniques tested ranged from 0.964 to 0.631 in the following order: MLVA, REA, PFGE, slpAST, PCR-ribotyping, MLST, and AFLP. All the techniques were able to distinguish the current epidemic strain of C. difficile (BI/027/NAP1) from other strains. All of the techniques showed multiple types for AP-A (toxinotype 0, binary toxin negative, and no tcdC gene deletion). REA, slpAST, MLST, and PCR-ribotyping all included AP-B (toxinotype III, binary toxin positive, and an 18-bp deletion in tcdC) in a single group that excluded other APs. PFGE, AFLP, and MLVA grouped two, one, and two different non-AP-B isolates, respectively, with their AP-B isolates. All techniques appear to be capable of detecting outbreak strains, but only REA and MLVA showed sufficient discrimination to distinguish strains from different outbreaks.
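The abstract does not say which formula underlies its discrimination index scores. A minimal sketch, assuming the Simpson-based Hunter-Gaston index that is commonly used to compare typing methods, is given below; the function name and the example type labels are hypothetical.

```python
from collections import Counter

def discrimination_index(type_labels):
    """Hunter-Gaston index: probability that two isolates drawn at random
    without replacement are assigned to different types."""
    n = len(type_labels)
    if n < 2:
        raise ValueError("need at least two isolates")
    counts = Counter(type_labels)
    # Number of ordered isolate pairs that share a type.
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1 - same_type_pairs / (n * (n - 1))
```

A value of 1 means every isolate received its own type and 0 means all isolates were lumped together, so a method scoring 0.964 separates the 42 isolates far more finely than one scoring 0.631.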
Zhu, Luchang; Olsen, Randall J; Horstmann, Nicola; Shelburne, Samuel A; Fan, Jia; Hu, Ye; Musser, James M
2016-07-01
Variable-number tandem-repeat (VNTR) polymorphisms are ubiquitous in bacteria. However, only a small fraction of them has been functionally studied. Here, we report an intergenic VNTR polymorphism that confers an altered level of toxin production and increased virulence in Streptococcus pyogenes. The nature of the polymorphism is a one-unit deletion in a three-tandem-repeat locus upstream of the rocA gene encoding a sensor kinase. S. pyogenes strains with this type of polymorphism cause human infection and produce significantly larger amounts of the secreted cytotoxins S. pyogenes NADase (SPN) and streptolysin O (SLO). Using isogenic mutant strains, we demonstrate that deleting one or more units of the tandem repeats abolished RocA production, reduced CovR phosphorylation, derepressed multiple CovR-regulated virulence factors (such as SPN and SLO), and increased virulence in a mouse model of necrotizing fasciitis. The phenotypic effect of the VNTR polymorphism was nearly the same as that of inactivating the rocA gene. In summary, we identified and characterized an intergenic VNTR polymorphism in S. pyogenes that affects toxin production and virulence. These new findings enhance understanding of rocA biology and the function of VNTR polymorphisms in S. pyogenes. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Highly variable penetrance of abnormal phenotypes in embryonic lethal knockout mice
Wilson, Robert; Geyer, Stefan H.; Reissig, Lukas; Rose, Julia; Szumska, Dorota; Hardman, Emily; Prin, Fabrice; McGuire, Christina; Ramirez-Solis, Ramiro; White, Jacqui; Galli, Antonella; Tudor, Catherine; Tuck, Elizabeth; Mazzeo, Cecilia Icoresi; Smith, James C.; Robertson, Elizabeth; Adams, David J.; Mohun, Timothy; Weninger, Wolfgang J.
2017-01-01
Background: Identifying genes that are essential for mouse embryonic development and survival through term is a powerful and unbiased way to discover possible genetic determinants of human developmental disorders. Characterising the changes in mouse embryos that result from ablation of lethal genes is a necessary first step towards uncovering their role in normal embryonic development and establishing any correlates amongst human congenital abnormalities. Methods: Here we present results gathered to date in the Deciphering the Mechanisms of Developmental Disorders (DMDD) programme, cataloguing the morphological defects identified from comprehensive imaging of 220 homozygous mutant and 114 wild type embryos from 42 lethal and subviable lines, analysed at E14.5. Results: Virtually all mutant embryos show multiple abnormal phenotypes and amongst the 42 lines these affect most organ systems. Within each mutant line, the phenotypes of individual embryos form distinct but overlapping sets. Subcutaneous edema, malformations of the heart or great vessels, abnormalities in forebrain morphology and the musculature of the eyes are all prevalent phenotypes, as is loss or abnormal size of the hypoglossal nerve. Conclusions: Overall, the most striking finding is that no matter how profound the malformation, each phenotype shows highly variable penetrance within a mutant line. These findings have challenging implications for efforts to identify human disease correlates. PMID:27996060
RNA-seq mixology: designing realistic control experiments to compare protocols and analysis methods
Holik, Aliaksei Z.; Law, Charity W.; Liu, Ruijie; Wang, Zeya; Wang, Wenyi; Ahn, Jaeil; Asselin-Labat, Marie-Liesse; Smyth, Gordon K.
2017-01-01
Carefully designed control experiments provide a gold standard for benchmarking different genomics research tools. A shortcoming of many gene expression control studies is that replication involves profiling the same reference RNA sample multiple times. This leads to low, pure technical noise that is atypical of regular studies. To achieve a more realistic noise structure, we generated an RNA-sequencing mixture experiment using two cell lines of the same cancer type. Variability was added by extracting RNA from independent cell cultures and degrading particular samples. The systematic gene expression changes induced by this design allowed benchmarking of different library preparation kits (standard poly-A versus total RNA with Ribozero depletion) and analysis pipelines. Data generated using the total RNA kit had more signal for introns and various RNA classes (ncRNA, snRNA, snoRNA) and less variability after degradation. For differential expression analysis, voom with quality weights marginally outperformed other popular methods, while for differential splicing, DEXSeq was simultaneously the most sensitive and the most inconsistent method. For sample deconvolution analysis, DeMix outperformed IsoPure convincingly. Our RNA-sequencing data set provides a valuable resource for benchmarking different protocols and data pre-processing workflows. The extra noise mimics routine lab experiments more closely, ensuring any conclusions are widely applicable. PMID:27899618
ERIC Educational Resources Information Center
Woolley, Kristin K.
Many researchers are unfamiliar with suppressor variables and how they operate in multiple regression analyses. This paper describes the role suppressor variables play in a multiple regression model and provides practical examples that explain how they can change research results. A variable that when added as another predictor increases the total…
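A small numerical sketch may help show how a suppressor variable operates. It relies on the standard two-predictor identity for R² in terms of pairwise correlations; the correlation values are invented for illustration and do not come from the paper.

```python
def r_squared_two_predictors(r_y1, r_y2, r_12):
    """R^2 of y regressed on x1 and x2, from the three pairwise correlations."""
    return (r_y1 ** 2 + r_y2 ** 2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12 ** 2)

# Hypothetical values: x2 is uncorrelated with y but correlated with x1.
r2_alone = 0.5 ** 2                                           # x1 only
r2_with_suppressor = r_squared_two_predictors(0.5, 0.0, 0.6)  # x1 and x2
```

With these made-up numbers, x1 alone explains 25% of the variance in y, while adding x2 raises R² to about 39% even though x2 has zero correlation with y: x2 absorbs the part of x1 that is irrelevant to y.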
Sahakyan, Aleksandr B; Balasubramanian, Shankar
2016-03-12
The role of random mutations and genetic errors in defining the etiology of cancer and other multigenic diseases has recently received much attention. With the view that complex genes should be particularly vulnerable to such events, here we explore the link between the simple properties of the human genes, such as transcript length, number of splice variants, exon/intron composition, and their involvement in the pathways linked to cancer and other multigenic diseases. We reveal a substantial enrichment of cancer pathways with long genes and genes that have multiple splice variants. Although the latter two factors are interdependent, we show that the overall gene length and splicing complexity increase in cancer pathways in a partially decoupled manner. Our systematic survey for the pathways enriched with the longest genes and with genes that have multiple splice variants reveals, along with cancer pathways, the pathways involved in various neuronal processes, cardiomyopathies and type II diabetes. We outline a correlation between the gene length and the number of somatic mutations. Our work is a step forward in the assessment of the role of simple gene characteristics in cancer and a wider range of multigenic diseases. We demonstrate a significant accumulation of long genes and genes with multiple splice variants in pathways of multigenic diseases that have already been associated with de novo mutations. Unlike the cancer pathways, we note that the pathways of neuronal processes, cardiomyopathies and type II diabetes contain genes long enough for topoisomerase-dependent gene expression to also be a potential contributing factor in the emergence of pathologies, should topoisomerases become impaired.
Bao, Shaopan; Lu, Qicong; Dai, Heping; Zhang, Chao
2015-01-01
To develop applicable and susceptible models to evaluate the toxicity of nanoparticles, the antimicrobial effects of CuO nanoparticles (CuO-NPs) on various Saccharomyces cerevisiae (S. cerevisiae) strains (wild type, single-gene-deleted mutants, and multiple-gene-deleted mutants) were determined and compared. Further experiments were also conducted to analyze the mechanisms associated with toxicity using copper salt, bulk CuO (bCuO), carbon-shelled copper nanoparticles (C/Cu-NPs), and carbon nanoparticles (C-NPs) for comparisons. The results indicated that the growth inhibition rates of CuO-NPs for the wild-type and the single-gene-deleted strains were comparable, while for the multiple-gene deletion mutant, significantly higher toxicity was observed (P < 0.05). When the toxicity of the CuO-NPs to yeast cells was compared with the toxicities of copper salt and bCuO, we concluded that the toxicity of CuO-NPs should be attributed to soluble copper rather than to the nanoparticles. The striking difference in adverse effects of C-NPs and C/Cu-NPs with equivalent surface areas also proved this. A toxicity assay revealed that the multiple-gene-deleted mutant was significantly more sensitive to CuO-NPs than the wild type. Specifically, compared with the wild-type strain, copper was readily taken up by mutant strains when cell permeability genes were knocked out, and the mutants with deletions of genes regulated under oxidative stress (OS) were likely producing more reactive oxygen species (ROS). Hence, as mechanism-based gene inactivation could increase the susceptibility of yeast, the multiple-gene-deleted mutants should be improved model organisms to investigate the toxicity of nanoparticles. PMID:26386067
Shafie, Suraiya M.; Barria von-Bischhoffshausen, Fernando R.; Bateman, J. Bronwyn
2006-01-01
PURPOSE: To document intrafamilial and interocular phenotypic variability of autosomal dominant cataract (ADC). DESIGN: Prospective observational case series. METHODS: We performed ophthalmologic examination in four Chilean ADC families. RESULTS: The families exhibited variability with respect to morphology, location within the lens, color and density of cataracts among affected members. We documented asymmetry between eyes in the morphology, location within the lens, color and density of cataracts, and a variable rate of progression. CONCLUSIONS: The cataracts in these families exhibit wide intrafamilial and interocular phenotypic variability, supporting the premise that the mutated genes are expressed differentially in individuals and between eyes; other genes or environmental factors may be the bases for this variability. Marked progression among some family members underscores the variable clinical course of a common mutation within a family. Like retinitis pigmentosa, classification of ADC will be most useful if based on the gene and specific mutation. PMID:16564818
Taguchi, Y-H
2018-05-08
Even though coexistence of multiple phenotypes sharing the same genomic background is interesting, it remains incompletely understood. Epigenomic profiles may represent key factors, with unknown contributions to the development of multiple phenotypes, and social-insect castes are a good model for elucidation of the underlying mechanisms. Nonetheless, previous studies have failed to identify genes associated with aberrant gene expression and methylation profiles because of the lack of suitable methodology that can address this problem properly. A recently proposed principal component analysis (PCA)-based and tensor decomposition (TD)-based unsupervised feature extraction (FE) can solve this problem because these two approaches can deal with gene expression and methylation profiles even when a small number of samples is available. PCA-based and TD-based unsupervised FE methods were applied to the analysis of gene expression and methylation profiles in the brains of two social insects, Polistes canadensis and Dinoponera quadriceps. Genes associated with differential expression and methylation between castes were identified, and analysis of enrichment of Gene Ontology terms confirmed reliability of the obtained sets of genes from the biological standpoint. Biologically relevant genes, shown to be associated with significant differential gene expression and methylation between castes, were identified here for the first time. The identification of these genes may help understand the mechanisms underlying epigenetic control of development of multiple phenotypes under the same genomic conditions.
Mets, David G; Brainard, Michael S
2018-01-01
Background: Vocal learning in songbirds has emerged as a powerful model for sensorimotor learning. Neurobehavioral studies of Bengalese finch (Lonchura striata domestica) song, naturally more variable and plastic than songs of other finch species, have demonstrated the importance of behavioral variability for initial learning, maintenance, and plasticity of vocalizations. However, the molecular and genetic underpinnings of this variability and the learning it supports are poorly understood. Findings: To establish a platform for the molecular analysis of behavioral variability and plasticity, we generated an initial draft assembly of the Bengalese finch genome from a single male animal to 151× coverage and an N50 of 3.0 MB. Furthermore, we developed an initial set of gene models using RNA-seq data from 8 samples that comprise liver, muscle, cerebellum, brainstem/midbrain, and forebrain tissue from juvenile and adult Bengalese finches of both sexes. Conclusions: We provide a draft Bengalese finch genome and gene annotation to facilitate the study of the molecular-genetic influences on behavioral variability and the process of vocal learning. These data will directly support many avenues for the identification of genes involved in learning, including differential expression analysis, comparative genomic analysis (through comparison to existing avian genome assemblies), and derivation of genetic maps for linkage analysis. Bengalese finch gene models and sequences will be essential for subsequent manipulation (molecular or genetic) of genes and gene products, enabling novel mechanistic investigations into the role of variability in learned behavior. PMID:29618046
Colquitt, Bradley M; Mets, David G; Brainard, Michael S
2018-03-01
Vocal learning in songbirds has emerged as a powerful model for sensorimotor learning. Neurobehavioral studies of Bengalese finch (Lonchura striata domestica) song, naturally more variable and plastic than songs of other finch species, have demonstrated the importance of behavioral variability for initial learning, maintenance, and plasticity of vocalizations. However, the molecular and genetic underpinnings of this variability and the learning it supports are poorly understood. To establish a platform for the molecular analysis of behavioral variability and plasticity, we generated an initial draft assembly of the Bengalese finch genome from a single male animal to 151× coverage and an N50 of 3.0 MB. Furthermore, we developed an initial set of gene models using RNA-seq data from 8 samples that comprise liver, muscle, cerebellum, brainstem/midbrain, and forebrain tissue from juvenile and adult Bengalese finches of both sexes. We provide a draft Bengalese finch genome and gene annotation to facilitate the study of the molecular-genetic influences on behavioral variability and the process of vocal learning. These data will directly support many avenues for the identification of genes involved in learning, including differential expression analysis, comparative genomic analysis (through comparison to existing avian genome assemblies), and derivation of genetic maps for linkage analysis. Bengalese finch gene models and sequences will be essential for subsequent manipulation (molecular or genetic) of genes and gene products, enabling novel mechanistic investigations into the role of variability in learned behavior.
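For readers unfamiliar with the N50 statistic quoted for this assembly, a minimal sketch of the usual definition follows: the largest length L such that contigs or scaffolds of length at least L together contain at least half of the assembled bases. The function name and the toy lengths are hypothetical and unrelated to the finch assembly itself.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down until half the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("no contig lengths given")
```

For example, with lengths 2 through 10 (54 bases in total), the three longest sequences (10, 9, 8) already cover 27 bases, so the N50 of that toy set is 8.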
Global population-specific variation in miRNA associated with cancer risk and clinical biomarkers.
Rawlings-Goss, Renata A; Campbell, Michael C; Tishkoff, Sarah A
2014-08-28
MiRNA expression profiling is being actively investigated as a clinical biomarker and diagnostic tool to detect multiple cancer types and stages as well as other complex diseases. Initial investigations, however, have not comprehensively taken into account genetic variability affecting miRNA expression and/or function in populations of different ethnic backgrounds. Therefore, more complete surveys of miRNA genetic variability are needed to assess global patterns of miRNA variation within and between diverse human populations and their effect on clinically relevant miRNA genes. Genetic variation in 1524 miRNA genes was examined using whole genome sequencing (60x coverage) in a panel of 69 unrelated individuals from 14 global populations, including European, Asian and African populations. We identified 33 previously undescribed miRNA variants, and 31 miRNA containing variants that are globally population-differentiated in frequency between African and non-African populations (PD-miRNA). The top 1% of PD-miRNA were significantly enriched for regulation of genes involved in glucose/insulin metabolism and cell division (p < 10⁻⁷), most significantly the mitosis pathway, which is strongly linked to cancer onset. Overall, we identify 7 PD-miRNAs that are currently implicated as cancer biomarkers or diagnostics: hsa-mir-202, hsa-mir-423, hsa-mir-196a-2, hsa-mir-520h, hsa-mir-647, hsa-mir-943, and hsa-mir-1908. Notably, hsa-mir-202, a potential breast cancer biomarker, was found to show significantly high allele frequency differentiation at SNP rs12355840, which is known to affect miRNA expression levels in vivo and subsequently breast cancer mortality. MiRNA expression profiles represent a promising new category of disease biomarkers. However, population specific genetic variation can affect the prevalence and baseline expression of these miRNAs in diverse populations. Consequently, miRNA genetic and expression level variation among ethnic groups may be contributing in part to health disparities observed in multiple forms of cancer, specifically breast cancer, and will be an essential consideration when assessing the utility of miRNA biomarkers for the clinic.
Developing Pedagogical Tools to Improve Teaching Multiple Models of the Gene in High School
ERIC Educational Resources Information Center
Auckaraaree, Nantaya
2013-01-01
Multiple models of the gene are used to explore genetic phenomena in scientific practices and in the classroom. In genetics curricula, the classical and molecular models are presented in disconnected domains. Research demonstrates that, without explicit connections, students have difficulty developing an understanding of the gene that spans…
Array data extractor (ADE): a LabVIEW program to extract and merge gene array data.
Kurtenbach, Stefan; Kurtenbach, Sarah; Zoidl, Georg
2013-12-01
Large data sets from gene expression array studies are publicly available offering information highly valuable for research across many disciplines ranging from fundamental to clinical research. Highly advanced bioinformatics tools have been made available to researchers, but a demand for user-friendly software allowing researchers to quickly extract expression information for multiple genes from multiple studies persists. Here, we present a user-friendly LabVIEW program to automatically extract gene expression data for a list of genes from multiple normalized microarray datasets. Functionality was tested for 288 class A G protein-coupled receptors (GPCRs) and expression data from 12 studies comparing normal and diseased human hearts. Results confirmed known regulation of a beta 1 adrenergic receptor and further indicate novel research targets. Although existing software allows for complex data analyses, the LabVIEW based program presented here, "Array Data Extractor (ADE)", provides users with a tool to retrieve meaningful information from multiple normalized gene expression datasets in a fast and easy way. Further, the graphical programming language used in LabVIEW allows applying changes to the program without the need of advanced programming knowledge.
Tanyimboh, Tiku T; Seyoum, Alemtsehay G
2016-12-01
This article investigates the computational efficiency of constraint handling in multi-objective evolutionary optimization algorithms for water distribution systems. The methodology investigated here encourages the co-existence and simultaneous development, including crossbreeding, of subpopulations of cost-effective feasible and infeasible solutions based on Pareto dominance. This yields a boundary search approach that also promotes diversity in the gene pool throughout the progress of the optimization by exploiting the full spectrum of non-dominated infeasible solutions. The relative effectiveness of small and moderate population sizes with respect to the number of decision variables is also investigated. The results reveal the optimization algorithm to be efficient, stable and robust. It found optimal and near-optimal solutions reliably and efficiently. The real-world system based optimization problem involved multiple variable head supply nodes, 29 fire-fighting flows, extended period simulation and multiple demand categories including water loss. The least cost solutions found satisfied the flow and pressure requirements consistently. The best solutions achieved indicative savings of 48.1% and 48.2% based on the cost of the pipes in the existing network, for populations of 200 and 1000, respectively. The population of 1000 achieved slightly better results overall. Copyright © 2016 The Authors. Published by Elsevier Ltd. All rights reserved.
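Pareto dominance, on which the approach in this abstract is based, can be sketched in a few lines. The sketch below assumes two objectives that are both minimized (for example, cost and a constraint-violation measure); the objective pairs are invented for illustration and the functions are not the authors' algorithm.

```python
from typing import List, Sequence, Tuple


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if a is no worse than b in every objective and strictly better
    in at least one (all objectives are minimized)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def non_dominated(points: List[Tuple[float, ...]]) -> List[Tuple[float, ...]]:
    """Return the points that no other point dominates, in input order."""
    return [p for p in points if not any(dominates(q, p) for q in points if q != p)]


# Hypothetical (cost, constraint violation) pairs; violation 0.0 means feasible.
candidates = [(100.0, 0.0), (80.0, 0.5), (90.0, 0.5), (120.0, 0.0)]
front = non_dominated(candidates)
```

In this toy set the cheaper infeasible candidate (80.0, 0.5) stays on the front alongside the feasible (100.0, 0.0), which is the sense in which non-dominated infeasible solutions can be retained next to feasible ones.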
The influence of talker and foreign-accent variability on spoken word identification.
Bent, Tessa; Holt, Rachael Frush
2013-03-01
In spoken word identification and memory tasks, stimulus variability from numerous sources impairs performance. In the current study, the influence of foreign-accent variability on spoken word identification was evaluated in two experiments. Experiment 1 used a between-subjects design to test word identification in noise in single-talker and two multiple-talker conditions: multiple talkers with the same accent and multiple talkers with different accents. Identification performance was highest in the single-talker condition, but there was no difference between the single-accent and multiple-accent conditions. Experiment 2 further explored word recognition for multiple talkers in single-accent versus multiple-accent conditions using a mixed design. A detriment to word recognition was observed in the multiple-accent condition compared to the single-accent condition, but the effect differed across the language backgrounds tested. These results demonstrate that the processing of foreign-accent variation may influence word recognition in ways similar to other sources of variability (e.g., speaking rate or style) in that the inclusion of multiple foreign accents can result in a small but significant performance decrement beyond the multiple-talker effect.
Strand, Tanja; Westerdahl, Helena; Höglund, Jacob; V Alatalo, Rauno; Siitari, Heli
2007-09-01
We found that the Black grouse (Tetrao tetrix) possess low numbers of Mhc class II B (BLB) and Y (YLB) genes with variable diversity and expression. We have therefore shown, for the first time, that another bird species (in this case, a wild lek-breeding galliform) shares several features of the simple Mhc of the domestic chicken (Gallus gallus). The Black grouse BLB genes showed the same level of polymorphism that has been reported in chicken, and we also found indications of balancing selection in the peptide-binding regions. The YLB genes were less variable than the BLB genes, also in accordance with earlier studies in chicken, although their functional significance remains obscure. We hypothesize that the YLB genes could have been under purifying selection, just like the mammalian Mhc-E gene cluster.
A large-scale study of the random variability of a coding sequence: a study on the CFTR gene.
Modiano, Guido; Bombieri, Cristina; Ciminelli, Bianca Maria; Belpinati, Francesca; Giorgi, Silvia; Georges, Marie des; Scotet, Virginie; Pompei, Fiorenza; Ciccacci, Cinzia; Guittard, Caroline; Audrézet, Marie Pierre; Begnini, Angela; Toepfer, Michael; Macek, Milan; Ferec, Claude; Claustres, Mireille; Pignatti, Pier Franco
2005-02-01
Coding single nucleotide substitutions (cSNSs) have been studied on hundreds of genes using small samples (n(g) approximately 100-150 genes). In the present investigation, a large random European population sample (average n(g) approximately 1500) was studied for a single gene, the CFTR (Cystic Fibrosis Transmembrane conductance Regulator). The nonsynonymous (NS) substitutions exhibited, in accordance with previous reports, a mean probability of being polymorphic (q > 0.005), much lower than that of the synonymous (S) substitutions, but they showed a similar rate of subpolymorphic (q < 0.005) variability. This indicates that, in autosomal genes that may have harmful recessive alleles (nonduplicated genes with important functions), genetic drift overwhelms selection in the subpolymorphic range of variability, making disadvantageous alleles behave as neutral. These results imply that the majority of the subpolymorphic nonsynonymous alleles of these genes are selectively negative or even pathogenic.
Kuo, Kevin H M
2017-01-01
The issue of multiple testing, also termed multiplicity, is ubiquitous in studies where multiple hypotheses are tested simultaneously. Genome-wide association study (GWAS), a type of genetic association study that has gained popularity in the past decade, is most susceptible to the issue of multiple testing. Different methodologies have been employed to address the issue of multiple testing in GWAS. The purpose of the review is to examine the methodologies employed in dealing with multiple testing in the context of gene discovery using GWAS in sickle cell disease complications.
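As a minimal illustration of the multiple-testing problem, the sketch below implements two widely used corrections, Bonferroni and Benjamini-Hochberg. The abstract does not say which methods the review covers, so these are assumptions chosen as common examples, and the p-values are invented.

```python
from typing import List


def bonferroni(pvals: List[float], alpha: float = 0.05) -> List[bool]:
    """Reject a hypothesis when its p-value is at most alpha / m (controls FWER)."""
    m = len(pvals)
    return [p <= alpha / m for p in pvals]


def benjamini_hochberg(pvals: List[float], alpha: float = 0.05) -> List[bool]:
    """Step-up procedure controlling the false discovery rate at level alpha."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    # Find the largest rank k such that p_(k) <= (k / m) * alpha.
    max_k = 0
    for rank, i in enumerate(order, start=1):
        if pvals[i] <= rank / m * alpha:
            max_k = rank
    # Reject every hypothesis whose rank is at most max_k.
    rejected = [False] * m
    for rank, i in enumerate(order, start=1):
        if rank <= max_k:
            rejected[i] = True
    return rejected


# Hypothetical p-values for five tests.
pvals = [0.001, 0.012, 0.025, 0.035, 0.2]
bonf = bonferroni(pvals)
bh = benjamini_hochberg(pvals)
```

With these invented values Bonferroni rejects only the smallest p-value, while Benjamini-Hochberg rejects the four smallest, showing how the choice of error criterion changes the number of reported associations.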
Response variables for evaluation of the effectiveness of conservation corridors.
Gregory, Andrew J; Beier, Paul
2014-06-01
Many studies have evaluated effectiveness of corridors by measuring species presence in and movement through small structural corridors. However, few studies have assessed whether these response variables are adequate for assessing whether the conservation goals of the corridors have been achieved or considered the costs or lag times involved in measuring the response variables. We examined 4 response variables (presence of the focal species in the corridor, interpatch movement via the corridor, gene flow, and patch occupancy) with respect to 3 criteria: relevance to conservation goals, lag time (fewest generations at which a positive response to the corridor might be evident with a particular variable), and the cost of a study when applying a particular variable. The presence variable had the least relevance to conservation goals, no lag time advantage compared with interpatch movement, and only a moderate cost advantage over interpatch movement or gene flow. Movement of individual animals between patches was the most appropriate response variable for a corridor intended to provide seasonal migration, but it was not an appropriate response variable for corridor dwellers, and for passage species it was only moderately relevant to the goals of gene flow, demographic rescue, and recolonization. Response variables related to gene flow provided a good trade-off among cost, relevance to conservation goals, and lag time. Nonetheless, the lag time of 10-20 generations means that evaluation of conservation corridors cannot occur until a few decades after a corridor has been established. Response variables related to occupancy were most relevant to conservation goals, but the lag time and costs to detect corridor effects on occupancy were much greater than the lag time and costs to detect corridor effects on gene flow. © 2014 Society for Conservation Biology.
Powers, T. O.; Harris, T. S.; Hyman, B. C.
1993-01-01
Mitochondrial DNA sequences were obtained from the NADH dehydrogenase subunit 3 (ND3), large rRNA, and cytochrome b genes from Meloidogyne incognita and Romanomermis culicivorax. Both species show considerable genetic distance within these same genes when compared with Caenorhabditis elegans or Ascaris suum, two species previously analyzed. Caenorhabditis, Ascaris, and Meloidogyne were selected as representatives of three subclasses in the nematode class Secernentea: Rhabditia, Spiruria, and Diplogasteria, respectively. Romanomermis served as a representative out-group of the class Adenophorea. The divergence between the phytoparasitic lineage (represented by Meloidogyne) and the three other species is so great that virtually every variable position in these genes appears to have accumulated multiple mutations, obscuring the phylogenetic information obtainable from these comparisons. The 39 and 42% amino acid similarity between the M. incognita and C. elegans ND3 and cytochrome b coding sequences, respectively, are approximately the same as those of C. elegans-mouse comparisons for the same genes (26 and 44%). This discovery calls into question the feasibility of employing cloned C. elegans probes as reagents to isolate phytoparasitic nematode genes. The genetic distance between the phytoparasitic nematode lineage and C. elegans markedly contrasts with the 79% amino acid similarity between C. elegans and A. suum for the same sequences. The molecular data suggest that Caenorhabditis and Ascaris belong to the same subclass. PMID:19279810
Curtis, Ross E; Kim, Seyoung; Woolford, John L; Xu, Wenjie; Xing, Eric P
2013-03-21
Association analysis using genome-wide expression quantitative trait locus (eQTL) data investigates the effect that genetic variation has on cellular pathways and leads to the discovery of candidate regulators. Traditional analysis of eQTL data via pairwise statistical significance tests or linear regression does not leverage the availability of the structural information of the transcriptome, such as presence of gene networks that reveal correlation and potentially regulatory relationships among the study genes. We employ a new eQTL mapping algorithm, GFlasso, which we have previously developed for sparse structured regression, to reanalyze a genome-wide yeast dataset. GFlasso fully takes into account the dependencies among expression traits to suppress false positives and to enhance the signal/noise ratio. Thus, GFlasso leverages the gene-interaction network to discover the pleiotropic effects of genetic loci that perturb the expression level of multiple (rather than individual) genes, which enables us to gain more power in detecting previously neglected signals that are marginally weak but pleiotropically significant. While eQTL hotspots in yeast have been reported previously as genomic regions controlling multiple genes, our analysis reveals additional novel eQTL hotspots and, more interestingly, uncovers groups of multiple contributing eQTL hotspots that affect the expression level of functional gene modules. To our knowledge, our study is the first to report this type of gene regulation stemming from multiple eQTL hotspots. Additionally, we report the results from in-depth bioinformatics analysis for three groups of these eQTL hotspots: ribosome biogenesis, telomere silencing, and retrotransposon biology. We suggest candidate regulators for the functional gene modules that map to each group of hotspots. 
Not only do we find that many of these candidate regulators contain mutations in the promoter and coding regions of the genes, but in the case of the Ribi group we also provide experimental evidence suggesting that the identified candidates do regulate the target genes predicted by GFlasso. Thus, this structured association analysis of a yeast eQTL dataset via GFlasso, coupled with extensive bioinformatics analysis, discovers a novel regulation pattern between multiple eQTL hotspots and functional gene modules. Furthermore, this analysis demonstrates the potential of GFlasso as a powerful computational tool for eQTL studies that exploit the rich structural information among expression traits due to correlation, regulation, or other forms of biological dependencies.
Arashiro, Patricia; Eisenberg, Iris; Kho, Alvin T.; Cerqueira, Antonia M. P.; Canovas, Marta; Silva, Helga C. A.; Pavanello, Rita C. M.; Verjovski-Almeida, Sergio; Kunkel, Louis M.; Zatz, Mayana
2009-01-01
Facioscapulohumeral muscular dystrophy (FSHD) is a progressive muscle disorder that has been associated with a contraction of 3.3-kb repeats on chromosome 4q35. FSHD is characterized by a wide clinical inter- and intrafamilial variability, ranging from wheelchair-bound patients to asymptomatic carriers. Our study is unique in comparing the gene expression profiles from related affected, asymptomatic carrier, and control individuals. Our results suggest that the expression of genes on chromosome 4q is altered in affected and asymptomatic individuals. Remarkably, the changes seen in asymptomatic samples are largely in products of genes encoding several chemokines, whereas the changes seen in affected samples are largely in genes governing the synthesis of GPI-linked proteins and histone acetylation. Besides this, the affected patient and related asymptomatic carrier share the 4qA161 haplotype. Thus, these polymorphisms by themselves do not explain the pathogenicity of the contracted allele. Interestingly, our results also suggest that the miRNAs might mediate the regulatory network in FSHD. Together, our results support the previous evidence that FSHD may be caused by transcriptional dysregulation of multiple genes, in cis and in trans, and suggest some factors potentially important for FSHD pathogenesis. The study of the gene expression profiles from asymptomatic carriers and related affected patients is a unique approach to try to enhance our understanding of the missing link between the contraction in D4Z4 repeats and muscle disease, while minimizing the effects of differences resulting from genetic background. PMID:19339494
Zhang, Bo; Peng, Yu; Zheng, Jincheng; Liang, Lina; Hoffmann, Ary A; Ma, Chun-Sen
2016-07-01
Heat shock protein gene (Hsp) families are thought to be important in thermal adaptation, but their expression patterns under various thermal stresses remain poorly characterized outside of model systems. We have therefore characterized Hsp genes and their stress responses in the oriental fruit moth (OFM), Grapholita molesta, a widespread global orchard pest, and compared patterns of expression in this species to those of other insects. Genes from four Hsp families showed variable expression levels among tissues and developmental stages. Members of the Hsp40, 70, and 90 families were highly expressed under short exposures to heat and cold. Expression of Hsp40, 70, and Hsc70 family members increased in OFM undergoing diapause, while Hsp90 was downregulated. We found that there was strong sequence conservation of members of large Hsp families (Hsp40, Hsp60, Hsp70, Hsc70) across taxa, but this was not always matched by conservation of expression patterns. When the large Hsps as well as small Hsps from OFM were compared under acute and ramping heat stress, two groups of sHsp expression patterns were apparent, depending on whether expression increased or decreased immediately after stress exposure. These results highlight potential differences in conservation of function as opposed to sequence in this gene family and also point to Hsp genes potentially useful as bioindicators of diapause and thermal stress in OFM.
Millet, Antoine; Kristjánsson, Bjarni K; Einarsson, Árni; Räsänen, Katja
2013-09-01
Eco-evolutionary responses of natural populations to spatial environmental variation strongly depend on the relative strength of environmental differences/natural selection and dispersal/gene flow. In the absence of geographic barriers, as is often the case in lake ecosystems, gene flow is expected to constrain adaptive divergence between environments, favoring phenotypic plasticity or high trait variability. However, if divergent natural selection is sufficiently strong, adaptive divergence can occur in the face of gene flow. The extent of divergence is most often studied between two contrasting environments, whereas the potential for multimodal divergence is little explored. We investigated phenotypic (body size, defensive structures, and feeding morphology) and genetic (microsatellites) structure in threespine stickleback (Gasterosteus aculeatus) across five habitat types and two basins (North and South) within the geologically young and highly heterogeneous Lake Mývatn, North East Iceland. We found that (1) North basin stickleback were, on average, larger and had relatively longer spines than South basin stickleback, whereas (2) feeding morphology (gill raker number and gill raker gap width) differed among three of five habitat types, and (3) there was only subtle genetic differentiation across the lake. Overall, our results indicate predator and prey mediated phenotypic divergence across multiple habitats in the lake, in the face of gene flow. PMID:24223263
Clustering "N" Objects into "K" Groups under Optimal Scaling of Variables.
ERIC Educational Resources Information Center
van Buuren, Stef; Heiser, Willem J.
1989-01-01
A method based on homogeneity analysis (multiple correspondence analysis or multiple scaling) is proposed to reduce many categorical variables to one variable with "k" categories. The method is a generalization of the sum of squared distances cluster analysis problem to the case of mixed measurement level variables. (SLD)
The nuclear 18S ribosomal RNA gene as a source of phylogenetic information in the genus Taenia.
Yan, Hongbin; Lou, Zhongzi; Li, Li; Ni, Xingwei; Guo, Aijiang; Li, Hongmin; Zheng, Yadong; Dyachenko, Viktor; Jia, Wanzhong
2013-03-01
Most species of the genus Taenia are of considerable medical and veterinary significance. In this study, complete nuclear 18S rRNA gene sequences were obtained from seven members of genus Taenia [Taenia multiceps, Taenia saginata, Taenia asiatica, Taenia solium, Taenia pisiformis, Taenia hydatigena, and Taenia taeniaeformis] and a phylogeny inferred using these sequences. Most of the variable sites fall within the variable regions, V1-V5. We show that sequences from the nuclear 18S ribosomal RNA gene have considerable promise as sources of phylogenetic information within the genus Taenia. Furthermore, given that almost all the variable sites lie within defined variable portions of that gene, it will be appropriate and economical to sequence only those regions for additional species of Taenia.
Ferraris, Alessandro; Bernardini, Laura; Sabolic Avramovska, Vesna; Zanni, Ginevra; Loddo, Sara; Sukarova-Angelovska, Elena; Parisi, Valentina; Capalbo, Anna; Tumini, Stefano; Travaglini, Lorena; Mancini, Francesca; Duma, Filip; Barresi, Sabina; Novelli, Antonio; Mercuri, Eugenio; Tarani, Luigi; Bertini, Enrico; Dallapiccola, Bruno; Valente, Enza Maria
2013-05-16
The Dandy-Walker malformation (DWM) is one of the commonest congenital cerebellar defects, and can be associated with multiple congenital anomalies and chromosomal syndromes. The occurrence of overlapping 3q deletions including the ZIC1 and ZIC4 genes in a few patients, along with data from mouse models, has implicated both genes in the pathogenesis of DWM. Using a SNP-array approach, we recently identified three novel patients carrying heterozygous 3q deletions encompassing ZIC1 and ZIC4. Magnetic resonance imaging showed that only two had a typical DWM, while the third did not present any defect of the DWM spectrum. SNP-array analysis in a further eleven children diagnosed with DWM failed to identify deletions of ZIC1-ZIC4. The clinical phenotype of the three 3q deleted patients included multiple congenital anomalies and peculiar facial appearance, related to the localization and extension of each deletion. In particular, phenotypes resulted from the variable combination of three recognizable patterns: DWM (with incomplete penetrance); blepharophimosis, ptosis, and epicanthus inversus syndrome; and Wisconsin syndrome (WS), recently mapped to 3q. Our data indicate that the 3q deletion is a rare defect associated with DWM, and suggest that the hemizygosity of ZIC1-ZIC4 genes is neither necessary nor sufficient per se to cause this condition. Furthermore, based on a detailed comparison of clinical features and molecular data from 3q deleted patients, we propose clinical diagnostic criteria and refine the critical region for WS. PMID:23679990
Islam, Nazrul; Woo, Sun-Hee; Tsujimoto, Hisashi; Kawasaki, Hiroshi; Hirano, Hisashi
2002-09-01
Changes in the protein composition of the wheat endosperm proteome were investigated in 39 ditelocentric chromosome lines of common wheat (Triticum aestivum L.) cv. Chinese Spring. Two-dimensional gel electrophoresis followed by Coomassie Brilliant Blue staining resolved a total of 105 protein spots in a gel. Quantitative image analysis of protein spots was performed by PDQuest. Variations in protein spots between the euploid and the 39 ditelocentric lines were evaluated by spot number, appearance, disappearance and intensity. A specific spot present in all gels was taken as an internal standard, and the intensity of all other spots was calculated as a ratio to the internal standard. Out of the 1755 major spots detected in 39 ditelocentric lines, 1372 (78%) spots were found to vary in different spot parameters: 147 (11%) disappeared, 978 (71%) were up-regulated and 247 (18%) were down-regulated. Correlations in the changes in protein intensities among 24 protein spots across the ditelocentric lines were examined. High correlations in changes of protein intensities were observed among the proteins encoded by genes located in the homoeologous arms. Locations of structural genes controlling 26 spots were identified in 10 chromosomal arms. Multiple regulators of the same protein located at various chromosomal arms were also noticed. Identification of structural genes for most of the proteins proved difficult due to multiple regulators encoding the same protein. Two novel subunits (1B(Z), 1BDz), the structures of which are very similar to the high molecular weight glutenin subunit 12, were identified, and the chromosome arm locations of these subunits were assigned.
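The spot counts reported in this abstract can be checked with a few lines of arithmetic. The sketch below uses only the numbers given above; the variable names are illustrative. It shows that the 78% figure is a share of all 1755 major spots, whereas the 11%, 71% and 18% figures are shares of the 1372 variable spots.

```python
# Counts taken from the abstract.
total_spots = 1755
variable_spots = 1372
disappeared, up_regulated, down_regulated = 147, 978, 247

# The three categories should account for every variable spot.
categories_sum = disappeared + up_regulated + down_regulated

# Share of variable spots among all major spots, in whole percent.
pct_variable = round(100 * variable_spots / total_spots)

# Shares of each category among the variable spots, in whole percent.
pct_by_category = [
    round(100 * n / variable_spots)
    for n in (disappeared, up_regulated, down_regulated)
]
```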
Moderating role of the MAOA genotype in antisocial behaviour
Fergusson, David M.; Boden, Joseph M.; Horwood, L. John; Miller, Allison; Kennedy, Martin A.
2012-01-01
Background: Recent studies have examined gene×environment (G×E) interactions involving the monoamine oxidase A (MAOA) gene in moderating the associations between exposure to adversity and antisocial behaviour. The present study examined a novel method for assessing interactions between a single gene and multiple risk factors related to environmental and personal adversity. Aims: To test the hypothesis that the presence of the low-activity MAOA genotype was associated with an increased response to a series of risk factors. Method: Participants were 399 males from the Christchurch Health and Development Study who had complete data on: (a) MAOA promoter region variable number tandem repeat genotype; (b) antisocial behaviour (criminal offending) to age 30 and convictions to age 21; and (c) maternal smoking during pregnancy, IQ, childhood maltreatment and school failure. Results: Poisson regression models were fitted to three antisocial behaviour outcomes (property/violent offending ages 15–30; and convictions ages 17–21), using measures of exposure to adverse childhood circumstances. The analyses revealed consistent evidence of G×E interactions, such that those with the low-activity MAOA variant who were exposed to adversity in childhood were significantly more likely to report offending in late adolescence and early adulthood. Conclusions: The present findings add to the evidence suggesting that there is a stable G×E interaction involving MAOA, a range of adverse environmental and personal factors, and antisocial behaviour across the life course. These analyses also demonstrate the utility of using multiple environmental/personal exposures to test G×E interactions. PMID:22297589
Foraita, R; Günther, F; Gwozdz, W; Reisch, L A; Russo, P; Lauria, F; Siani, A; Veidebaum, T; Tornaritis, M; Iacoviello, L; Vyncke, K; Pitsiladis, Y; Mårild, S; Molnár, D; Moreno, L A; Bammann, K; Pigeot, I
2015-01-01
Various twin studies revealed that the influence of genetic factors on psychological diseases or behaviour is more expressed in socioeconomically advantaged environments. Other studies predominantly show an inverse association between socioeconomic status (SES) and childhood obesity in Western developed countries. The aim of this study is to investigate whether the fat mass and obesity-associated (FTO) gene interacts with the SES on childhood obesity in a subsample (N = 4406) of the IDEFICS (Identification and prevention of Dietary- and lifestyle-induced health EFfects In Children and infantS) cohort. A structural equation model (SEM) is applied with the latent constructs obesity, dietary intakes, physical activity and fitness habits, and parental SES to estimate the main effects of the latter three variables and an FTO polymorphism on childhood obesity. Further, a multiple group SEM is used to explore whether an interaction effect exists between the single nucleotide polymorphism rs9939609 within the FTO gene and SES. Significant main effects are shown for physical activity and fitness (standardised β̂(s) = -0.113), SES (β̂(s) = -0.057) and the FTO homozygous AA risk genotype (β̂(s) = -0.177). The explained variance of obesity is ~9%. According to the multiple group approach of SEM, we see an interaction between SES and FTO with respect to their effect on childhood obesity (Δχ² = 7.3, df = 2, P = 0.03). Children carrying the protective FTO genotype TT seem to be more protected by a favourable social environment regarding the development of obesity than children carrying the AT or AA genotype.
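The reported interaction test (a chi-square difference of 7.3 with 2 degrees of freedom, P = 0.03) can be checked by hand, because the chi-square survival function has a simple closed form when df = 2: P(X > x) = exp(-x/2). The sketch below applies that identity; the function name is illustrative, and the calculation is only a consistency check on the reported figures, not a reproduction of the authors' SEM analysis.

```python
import math


def chi2_sf_df2(x: float) -> float:
    """Upper-tail probability of a chi-square variable with 2 degrees of freedom.

    For df = 2 the distribution is exponential with mean 2, so the
    survival function is exp(-x / 2).
    """
    return math.exp(-x / 2.0)


# Chi-square difference reported in the abstract (df = 2).
p_value = chi2_sf_df2(7.3)
```

The result is roughly 0.026, which agrees with the reported P = 0.03 after rounding to two decimals.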
Moghadam, Samira; Erfanmanesh, Maryam; Esmaeilzadeh, Abdolreza
2017-11-01
Multiple Sclerosis (MS), an autoimmune demyelinating disease of the Central Nervous System, is a chronic inflammatory condition that mostly affects young adults. Patients face functional loss with severe pain. Most current MS treatments focus on suppressing the immune response. Approved drugs suppress the inflammatory process, but there is no definite cure for Multiple Sclerosis. Recent work has shown gene and cell therapy to be a promising approach in tissue regeneration. The authors propose a novel combined immune gene therapy for Multiple Sclerosis treatment using the anti-inflammatory and remyelinating properties of Interleukin-35 and Hepatocyte Growth Factor, respectively. In this hypothesis, Interleukin-35 and Hepatocyte Growth Factor genes are introduced into Mesenchymal Stem Cells (MSCs) of the EAE mouse model via an adenovirus-based vector. It is expected that Interleukin-35 and Hepatocyte Growth Factor genes expressed from MSCs could perform effectively in immunotherapy of Multiple Sclerosis. Copyright © 2017. Published by Elsevier Ltd.
Law, Sheran Hiu Wan; Redelings, Benjamin David; Kullman, Seth William
2012-01-15
The availability of multiple teleost (bony fish) genomes is providing unprecedented opportunities to understand the diversity and function of gene duplication events using comparative genomics. Here we examine multiple paralogous genes of γ-glutamyl transferase (GGT) in several distantly related teleost species including medaka, stickleback, green spotted pufferfish, fugu, and zebrafish. Through mining genome databases, we have identified multiple GGT orthologs. Duplicate (paralogous) GGT sequences for GGT1 (GGT1 a and b), GGTL1 (GGTL1 a and b), and GGTL3 (GGTL3 a and b) were identified for each species. Phylogenetic analysis suggests that GGTs are ancient proteins conserved across most metazoan phyla and those paralogous GGTs in teleosts likely arose from the serial 3R genome duplication events. A third GGTL1 gene (GGTL1c) was found in green spotted pufferfish; however, this gene is not present in medaka, stickleback, or fugu. Similarly, one or both paralogs of GGTL3 appear to have been lost in green spotted pufferfish, fugu, and zebrafish. Syntenic relationships were highly maintained between duplicated teleost chromosomes, among teleosts and across ray-finned (Actinopterygii) and lobe-finned (Sarcopterygii) species. To assess subfunction partitioning, six medaka GGT genes were cloned and assessed for developmental and tissue-specific expression. On the basis of these data, we propose a modification of the "duplication-degeneration-complementation" model of subfunction partitioning where quantitative differences rather than absolute differences in gene expression are observed between gene paralogs. Our results demonstrate that multiple GGT genes have been retained within teleost genomes. Questions remain, however, regarding the functional roles of multiple GGTs in these species. Copyright © 2011 Wiley Periodicals, Inc., A Wiley Company.
Dwivedi, Bhakti; Kowalski, Jeanne
2018-01-01
While many methods exist for integrating multi-omics data or defining gene sets, there is no one single tool that defines gene sets based on merging of multiple omics data sets. We present shinyGISPA, an open-source application with a user-friendly web-based interface to define genes according to their similarity in several molecular changes that are driving a disease phenotype. This tool was developed to help facilitate the usability of a previously published method, Gene Integrated Set Profile Analysis (GISPA), among researchers with limited computer-programming skills. The GISPA method allows the identification of multiple gene sets that may play a role in the characterization, clinical application, or functional relevance of a disease phenotype. The tool provides an automated workflow that is highly scalable and adaptable to applications that go beyond genomic data merging analysis. It is available at http://shinygispa.winship.emory.edu/shinyGISPA/. PMID:29415010
Palau, Montserrat; Kulmann, Marcos; Ramírez-Lázaro, María José; Lario, Sergio; Quilez, María Elisa; Campo, Rafael; Piqué, Núria; Calvet, Xavier; Miñana-Galbis, David
2016-12-01
Helicobacter pylori infects the stomachs of over half the world's population, evades the immune response and establishes a chronic infection. Although most people remain asymptomatic, duodenal and gastric ulcers, MALT lymphoma and progression to gastric cancer can develop. Several virulence factors such as flagella, lipopolysaccharide, adhesins and especially the vacuolating cytotoxin VacA and the oncoprotein CagA have been described for H. pylori. Despite the extensive published data on H. pylori, more research is needed to determine new virulence markers, the exact mode of transmission or the role of multiple infection. Amplification and sequencing of six housekeeping genes (amiA, cgt, cpn60, cpn70, dnaJ, and luxS) related to H. pylori pathogenesis were performed in order to evaluate their usefulness for the specific detection of H. pylori, genetic discrimination at strain level and the detection of multiple infection. A total of 52 H. pylori clones, isolated from 14 gastric biopsies from 11 patients, were analyzed for this purpose. All genes were specifically amplified for H. pylori and all clones isolated from different patients were discriminated, with gene distances ranging from 0.9 to 7.8%. Although most clones isolated from the same patient showed identical gene sequences, an event of multiple infection was detected in all the genes and microevolution events were shown for the amiA and cpn60 genes. These results suggest that housekeeping genes could be useful for H. pylori detection and for elucidating the mode of transmission and the relevance of multiple infection. © 2016 John Wiley & Sons Ltd.
Comparative genome analysis of 19 Ureaplasma urealyticum and Ureaplasma parvum strains
2012-01-01
Background Ureaplasma urealyticum (UUR) and Ureaplasma parvum (UPA) are sexually transmitted bacteria among humans implicated in a variety of disease states including but not limited to: nongonococcal urethritis, infertility, adverse pregnancy outcomes, chorioamnionitis, and bronchopulmonary dysplasia in neonates. There are 10 distinct serotypes of UUR and 4 of UPA. Efforts to determine whether difference in pathogenic potential exists at the ureaplasma serovar level have been hampered by limitations of antibody-based typing methods, multiple cross-reactions and poor discriminating capacity in clinical samples containing two or more serovars. Results We determined the genome sequences of the American Type Culture Collection (ATCC) type strains of all UUR and UPA serovars as well as four clinical isolates of UUR for which we were not able to determine serovar designation. UPA serovars had 0.75−0.78 Mbp genomes and UUR serovars were 0.84−0.95 Mbp. The original classification of ureaplasma isolates into distinct serovars was largely based on differences in the major ureaplasma surface antigen called the multiple banded antigen (MBA) and reactions of human and animal sera to the organisms. Whole genome analysis of the 14 serovars and the 4 clinical isolates showed the mba gene was part of a large superfamily, which is a phase variable gene system, and that some serovars have identical sets of mba genes. Most of the differences among serovars are hypothetical genes, and in general the two species and 14 serovars are extremely similar at the genome level. Conclusions Comparative genome analysis suggests UUR is more capable of acquiring genes horizontally, which may contribute to its greater virulence for some conditions. 
The overwhelming evidence of extensive horizontal gene transfer among these organisms from our previous studies combined with our comparative analysis indicates that ureaplasmas exist as quasi-species rather than as stable serovars in their native environment. Therefore, differential pathogenicity and clinical outcome of a ureaplasmal infection is most likely not on the serovar level, but rather may be due to the presence or absence of potential pathogenicity factors in an individual ureaplasma clinical isolate and/or patient to patient differences in terms of autoimmunity and microbiome. PMID:22646228
Comparative genome analysis of 19 Ureaplasma urealyticum and Ureaplasma parvum strains.
Paralanov, Vanya; Lu, Jin; Duffy, Lynn B; Crabb, Donna M; Shrivastava, Susmita; Methé, Barbara A; Inman, Jason; Yooseph, Shibu; Xiao, Li; Cassell, Gail H; Waites, Ken B; Glass, John I
2012-05-30
Ureaplasma urealyticum (UUR) and Ureaplasma parvum (UPA) are sexually transmitted bacteria among humans implicated in a variety of disease states including but not limited to: nongonococcal urethritis, infertility, adverse pregnancy outcomes, chorioamnionitis, and bronchopulmonary dysplasia in neonates. There are 10 distinct serotypes of UUR and 4 of UPA. Efforts to determine whether difference in pathogenic potential exists at the ureaplasma serovar level have been hampered by limitations of antibody-based typing methods, multiple cross-reactions and poor discriminating capacity in clinical samples containing two or more serovars. We determined the genome sequences of the American Type Culture Collection (ATCC) type strains of all UUR and UPA serovars as well as four clinical isolates of UUR for which we were not able to determine serovar designation. UPA serovars had 0.75-0.78 Mbp genomes and UUR serovars were 0.84-0.95 Mbp. The original classification of ureaplasma isolates into distinct serovars was largely based on differences in the major ureaplasma surface antigen called the multiple banded antigen (MBA) and reactions of human and animal sera to the organisms. Whole genome analysis of the 14 serovars and the 4 clinical isolates showed the mba gene was part of a large superfamily, which is a phase variable gene system, and that some serovars have identical sets of mba genes. Most of the differences among serovars are hypothetical genes, and in general the two species and 14 serovars are extremely similar at the genome level. Comparative genome analysis suggests UUR is more capable of acquiring genes horizontally, which may contribute to its greater virulence for some conditions. The overwhelming evidence of extensive horizontal gene transfer among these organisms from our previous studies combined with our comparative analysis indicates that ureaplasmas exist as quasi-species rather than as stable serovars in their native environment. 
Therefore, differential pathogenicity and clinical outcome of a ureaplasmal infection is most likely not on the serovar level, but rather may be due to the presence or absence of potential pathogenicity factors in an individual ureaplasma clinical isolate and/or patient to patient differences in terms of autoimmunity and microbiome.
Benmansour, A.; Bascuro, B.; Monnier, A.F.; Vende, P.; Winton, J.R.; de Kinkelin, P.
1997-01-01
To evaluate the genetic diversity of viral haemorrhagic septicaemia virus (VHSV), the sequences of the glycoprotein (G) genes of 11 North American and European isolates were determined. Comparison with the G protein of representative members of the family Rhabdoviridae suggested that VHSV was a different virus species from infectious haemorrhagic necrosis virus (IHNV) and Hirame rhabdovirus (HIRRV). At a higher taxonomic level, VHSV, IHNV and HIRRV formed a group which was genetically closest to the genus Lyssavirus. Compared with each other, the G genes of VHSV displayed a dissimilar overall genetic diversity which correlated with differences in geographical origin. The multiple sequence alignment of the complete G protein showed that the divergent positions were not uniformly distributed along the sequence. A central region (amino acid positions 245-300) accumulated substitutions and appeared to be highly variable. The genetic heterogeneity within a single isolate was high, with an apparent internal mutation frequency of 1.2 × 10⁻³ per nucleotide site, attesting to the quasispecies nature of the viral population. The phylogeny separated VHSV strains according to the major geographical area of isolation: genotype I for continental Europe, genotype II for the British Isles, and genotype III for North America. Isolates from continental Europe exhibited the highest genetic variability, with sub-groups correlated partially with the serological classification. Neither neutralizing polyclonal sera nor monoclonal antibodies were able to discriminate between the genotypes. The overall structure of the phylogenetic tree suggests that VHSV genetic diversity and evolution fit within the model of random change and positive selection operating on quasispecies.
Kuan, Lisa; Schaffer, Jessica N.; Zouzias, Christos D.
2014-01-01
Proteus mirabilis is a Gram-negative enteric bacterium that causes complicated urinary tract infections, particularly in patients with indwelling catheters. Sequencing of clinical isolate P. mirabilis HI4320 revealed the presence of 17 predicted chaperone-usher fimbrial operons. We classified these fimbriae into three groups by their genetic relationship to other chaperone-usher fimbriae. Sixteen of these fimbriae are encoded by all seven currently sequenced P. mirabilis genomes. The predicted protein sequence of the major structural subunit for 14 of these fimbriae was highly conserved (≥95 % identity), whereas three other structural subunits (Fim3A, UcaA and Fim6A) were variable. Further examination of 58 clinical isolates showed that 14 of the 17 predicted major structural subunit genes of the fimbriae were present in most strains (>85 %). Transcription of the predicted major structural subunit genes for all 17 fimbriae was measured under different culture conditions designed to mimic conditions in the urinary tract. The majority of the fimbrial genes were induced during stationary phase, static culture or colony growth when compared to exponential-phase aerated culture. Major structural subunit proteins for six of these fimbriae were detected using MS of proteins sheared from the surface of broth-cultured P. mirabilis, demonstrating that this organism may produce multiple fimbriae within a single culture. The high degree of conservation of P. mirabilis fimbriae stands in contrast to uropathogenic Escherichia coli and Salmonella enterica, which exhibit greater variability in their fimbrial repertoires. These findings suggest there may be evolutionary pressure for P. mirabilis to maintain a large fimbrial arsenal. PMID:24809384
Jarvi, S.I.; Tarr, C.L.; Mcintosh, C.E.; Atkinson, C.T.; Fleischer, R.C.
2004-01-01
The native Hawaiian honeycreepers represent a classic example of adaptive radiation and speciation, but currently face one of the highest extinction rates in the world. Although multiple factors have likely influenced the fate of Hawaiian birds, the relatively recent introduction of avian malaria is thought to be a major factor limiting honeycreeper distribution and abundance. We have initiated genetic analyses of class II β chain Mhc genes in four species of honeycreepers using methods that eliminate the possibility of sequencing mosaic variants formed by cloning heteroduplexed polymerase chain reaction products. Phylogenetic analyses group the honeycreeper Mhc sequences into two distinct clusters. Variation within one cluster is high, with dN > dS and levels of diversity similar to other studies of Mhc (B system) genes in birds. The second cluster is nearly invariant and includes sequences from honeycreepers (Fringillidae), a sparrow (Emberizidae) and a blackbird (Emberizidae). This highly conserved cluster appears reminiscent of the independently segregating Rfp-Y system of genes defined in chickens. The notion that balancing selection operates at the Mhc in the honeycreepers is supported by trans-species polymorphism and strikingly high dN/dS ratios at codons putatively involved in peptide interaction. Mitochondrial DNA control region sequences were invariant in the i'iwi, but were highly variable in the 'amakihi. By contrast, levels of variability of class II β chain Mhc sequence codons that are hypothesized to be directly involved in peptide interactions appear comparable between i'iwi and 'amakihi. In the i'iwi, natural selection may have maintained variation within the Mhc, even in the face of what appears to be a genetic bottleneck.
ERIC Educational Resources Information Center
McDonough, Janet; Goudsouzian, Lara K.; Papaj, Agllai; Maceli, Ashley R.; Klepac-Ceraj, Vanja; Peterson, Celeste N.
2017-01-01
Course-based undergraduate research experiences (CUREs) have been shown to increase student retention and learning in the biological sciences. Most CUREs cover only one aspect of gene regulation, such as transcriptional control. Here we present a new inquiry-based lab that engages understanding of gene expression from multiple perspectives.…
BASiCS: Bayesian Analysis of Single-Cell Sequencing Data
Vallejos, Catalina A.; Marioni, John C.; Richardson, Sylvia
2015-01-01
Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell’s lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated to high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable supports the efficacy of our approach. PMID:26107944
BASiCS: Bayesian Analysis of Single-Cell Sequencing Data.
Vallejos, Catalina A; Marioni, John C; Richardson, Sylvia
2015-06-01
Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell's lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated to high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable supports the efficacy of our approach.
Loeza-Quintana, Tzitziki; Adamowicz, Sarah J
2018-02-01
During the past 50 years, the molecular clock has become one of the main tools for providing a time scale for the history of life. In the era of robust molecular evolutionary analysis, clock calibration is still one of the most basic steps needing attention. When fossil records are limited, well-dated geological events are the main resource for calibration. However, biogeographic calibrations have often been used in a simplistic manner, for example assuming simultaneous vicariant divergence of multiple sister lineages. Here, we propose a novel iterative calibration approach to define the most appropriate calibration date by seeking congruence between the dates assigned to multiple allopatric divergences and the geological history. Exploring patterns of molecular divergence in 16 trans-Bering sister clades of echinoderms, we demonstrate that the iterative calibration is predominantly advantageous when using complex geological or climatological events, such as the opening/reclosure of the Bering Strait, providing a powerful tool for clock dating that can be applied to other biogeographic calibration systems and further taxa. Using Bayesian analysis, we observed that evolutionary rate variability in the COI-5P gene is generally distributed in a clock-like fashion for Northern echinoderms. The results reveal a large range of genetic divergences, consistent with multiple pulses of trans-Bering migrations. A resulting rate of 2.8% pairwise Kimura-2-parameter sequence divergence per million years is suggested for the COI-5P gene in Northern echinoderms. Given that molecular rates may vary across latitudes and taxa, this study provides a new context for dating the evolutionary history of Arctic marine life.
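To make the quoted rate concrete, the following is a minimal sketch of how a pairwise Kimura 2-parameter (K2P) distance could be turned into a divergence time using the suggested 2.8% per million years. It uses the standard K2P formula and a strict-clock division; it is not the iterative Bayesian calibration described in the abstract. The function names and the toy sequences are hypothetical and chosen only for illustration.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq_a: str, seq_b: str) -> float:
    """Kimura 2-parameter distance between two aligned DNA sequences.

    Sites containing gaps or ambiguous bases are skipped.
    """
    transitions = transversions = compared = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # ignore gaps and ambiguity codes
        compared += 1
        if a == b:
            continue
        pair = {a, b}
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1    # A<->G or C<->T
        else:
            transversions += 1  # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared    # proportion of transitions
    q = transversions / compared  # proportion of transversions
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


def divergence_time_myr(distance: float, pairwise_rate_per_myr: float = 0.028) -> float:
    """Strict-clock divergence time (million years) from a pairwise distance.

    The default rate is the 2.8% pairwise divergence per million years
    quoted in the abstract; a strict clock is an assumption of this sketch.
    """
    return distance / pairwise_rate_per_myr


# Toy example (hypothetical sequences): one transition in ten sites.
d = k2p_distance("AAAAAAAAAA", "GAAAAAAAAA")
t = divergence_time_myr(d)
```

Because the rate is expressed as pairwise divergence, the distance is divided by the rate directly rather than by twice a per-lineage rate.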
Pritchard, Antonia L; Johansson, Peter A; Nathan, Vaishnavi; Howlie, Madeleine; Symmons, Judith; Palmer, Jane M; Hayward, Nicholas K
2018-01-01
While a number of autosomal dominant and autosomal recessive cancer syndromes have an associated spectrum of cancers, the prevalence and variety of cancer predisposition mutations in patients with multiple primary cancers have not been extensively investigated. An understanding of the variants predisposing to more than one cancer type could improve patient care, including screening and genetic counselling, as well as advancing the understanding of tumour development. A cohort of 57 patients ascertained due to their cutaneous melanoma (CM) diagnosis and with a history of two or more additional non-cutaneous independent primary cancer types were recruited for this study. Patient blood samples were assessed by whole exome or whole genome sequencing. We focussed on variants in 525 pre-selected genes, including 65 autosomal dominant and 31 autosomal recessive cancer predisposition genes, 116 genes involved in the DNA repair pathway, and 313 commonly somatically mutated in cancer. The same genes were analysed in exome sequence data from 1358 control individuals collected as part of non-cancer studies (UK10K). The identified variants were classified for pathogenicity using online databases, literature and in silico prediction tools. No known pathogenic autosomal dominant or previously described compound heterozygous mutations in autosomal recessive genes were observed in the multiple cancer cohort. Variants typically found somatically in haematological malignancies (in JAK1, JAK2, SF3B1, SRSF2, TET2 and TYK2) were present in lymphocyte DNA of patients with multiple primary cancers, all of whom had a history of haematological malignancy and cutaneous melanoma, as well as colorectal cancer and/or prostate cancer. Other potentially pathogenic variants were discovered in BUB1B, POLE2, ROS1 and DNMT3A. 
Compared to controls, multiple cancer cases had significantly more likely damaging mutations (nonsense, frameshift ins/del) in tumour suppressor and tyrosine kinase genes and higher overall burden of mutations in all cancer genes. We identified several pathogenic variants that likely predispose to at least one of the tumours in patients with multiple cancers. We additionally present evidence that there may be a higher burden of variants of unknown significance in 'cancer genes' in patients with multiple cancer types. Further screens of this nature need to be carried out to build evidence to show if the cancers observed in these patients form part of a cancer spectrum associated with single germline variants in these genes, whether multiple layers of susceptibility exist (oligogenic or polygenic), or if the occurrence of multiple different cancers is due to random chance.
Parvari, R; Avivi, A; Lentner, F; Ziv, E; Tel-Or, S; Burstein, Y; Schechter, I
1988-03-01
cDNA clones encoding the variable and constant regions of chicken immunoglobulin (Ig) gamma-chains were obtained from spleen cDNA libraries. Southern blots of kidney DNA show that the variable region sequences of eight cDNA clones reveal the same set of bands corresponding to approximately 30 cross-hybridizing VH genes of one subgroup. Since the VH clones were randomly selected, it is likely that the bulk of chicken H-chains are encoded by a single VH subgroup. Nucleotide sequence determinations of two cDNA clones reveal VH, D, JH and the constant region. The VH segments are closely related to each other (83% homology) as expected for VH of the same subgroup. The JHs are 15 residues long and differ by one amino acid. The Ds differ markedly in sequence (20% homology) and size (10 and 20 residues). These findings strongly indicate multiple (at least two) D genes which by a combinatorial joining mechanism diversify the H-chains, a mechanism which is not operative in the chicken L-chain locus. The most notable among the chicken Igs is the so-called 7S IgG because its H-chain differs in many important aspects from any mammalian IgG. The sequence of the C gamma cDNA reported here resolves this issue. The chicken C gamma is 426 residues long with four CH domains (unlike mammalian C gamma which has three CH domains) and it shows 25% homology to the chicken C mu. The chicken C gamma is most related to the mammalian C epsilon in length, the presence of four CH domains and the distribution of cysteines in the CH1 and CH2 domains. We propose that the unique chicken C gamma is the ancestor of the mammalian C epsilon and C gamma subclasses, and discuss the evolution of the H-chain locus from that of chicken with presumably three genes (mu, gamma, alpha) to the mammalian loci with 8-10 H-chain genes.
Sasayama, Daimei; Hori, Hiroaki; Iijima, Yoshimi; Teraishi, Toshiya; Hattori, Kotaro; Ota, Miho; Fujii, Takashi; Higuchi, Teruhiko; Amano, Naoji; Kunugi, Hiroshi
2011-07-05
Recently, hypothalamus-pituitary-adrenal (HPA) axis function assessed with the combined dexamethasone (DEX)/corticotropin releasing hormone (CRH) test has been shown to be associated with response to antidepressant treatment. A polymorphism (rs16944) in the interleukin-1beta (IL-1β) gene has also been reported to be associated with the medication response in depression. These findings prompted us to examine the possible association between IL-1β gene polymorphisms and HPA axis function assessed with the DEX/CRH test. DEX/CRH test was performed in 179 healthy volunteers (45 males: mean age 40.5 ± 15.8 years; 134 females: mean age 47.1 ± 13.2 years). Five tagging single nucleotide polymorphisms (SNPs) of IL-1β gene (rs2853550, rs1143634, rs1143633, rs1143630, rs16944) were selected at an r2 threshold of 0.80 with a minor allele frequency > 0.1. Genotyping was performed by the TaqMan allelic discrimination assay. A two-way factorial analysis of variance (ANOVA) was performed with the DEX/CRH test results as the dependent variable and genotype and gender as independent variables. To account for multiple testing, P values < 0.01 were considered statistically significant for associations between the genotypes and the cortisol levels. The cortisol levels after DEX administration (DST-Cortisol) showed significant associations with the genotypes of rs16944 (P = 0.00049) and rs1143633 (P = 0.0060), with no significant gender effect or genotype × gender interaction. On the other hand, cortisol levels after CRH administration (DEX/CRH-Cortisol) were affected by gender but were not significantly influenced by the genotype of the examined SNPs, with no significant genotype × gender interaction. Our results suggest that genetic variations in the IL-1β gene contribute to the HPA axis alteration assessed by DST-Cortisol in healthy subjects. On the other hand, no significant associations of the IL-1β gene polymorphisms with the DEX/CRH-Cortisol were observed. 
Confirmation of our findings in future studies may add new insight into the communication between the immune system and the HPA axis.
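The significance threshold used above can be illustrated with a short sketch. The abstract does not say how the P < 0.01 cutoff was derived; the code below assumes, purely for illustration, a Bonferroni correction of a 0.05 family-wise level across the five tagging SNPs, which happens to give the same value. The P values are the two reported in the abstract; the function name is hypothetical.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests


# Assumption: family-wise alpha of 0.05 spread over the five tagging SNPs.
threshold = bonferroni_threshold(alpha=0.05, n_tests=5)

# DST-Cortisol P values reported in the abstract.
reported_p = {"rs16944": 0.00049, "rs1143633": 0.0060}

# Flag each SNP whose P value falls below the corrected threshold.
significant = {snp: p < threshold for snp, p in reported_p.items()}
```

Under this assumption both reported associations fall below the corrected threshold, consistent with the abstract describing them as significant.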
2011-01-01
Background Recently, hypothalamus-pituitary-adrenal (HPA) axis function assessed with the combined dexamethasone (DEX)/corticotropin releasing hormone (CRH) test has been shown to be associated with response to antidepressant treatment. A polymorphism (rs16944) in the interleukin-1beta (IL-1β) gene has also been reported to be associated with the medication response in depression. These findings prompted us to examine the possible association between IL-1β gene polymorphisms and HPA axis function assessed with the DEX/CRH test. Methods DEX/CRH test was performed in 179 healthy volunteers (45 males: mean age 40.5 ± 15.8 years; 134 females: mean age 47.1 ± 13.2 years). Five tagging single nucleotide polymorphisms (SNPs) of IL-1β gene (rs2853550, rs1143634, rs1143633, rs1143630, rs16944) were selected at an r2 threshold of 0.80 with a minor allele frequency > 0.1. Genotyping was performed by the TaqMan allelic discrimination assay. A two-way factorial analysis of variance (ANOVA) was performed with the DEX/CRH test results as the dependent variable and genotype and gender as independent variables. To account for multiple testing, P values < 0.01 were considered statistically significant for associations between the genotypes and the cortisol levels. Results The cortisol levels after DEX administration (DST-Cortisol) showed significant associations with the genotypes of rs16944 (P = 0.00049) and rs1143633 (P = 0.0060), with no significant gender effect or genotype × gender interaction. On the other hand, cortisol levels after CRH administration (DEX/CRH-Cortisol) were affected by gender but were not significantly influenced by the genotype of the examined SNPs, with no significant genotype × gender interaction. Conclusions Our results suggest that genetic variations in the IL-1β gene contribute to the HPA axis alteration assessed by DST-Cortisol in healthy subjects. 
On the other hand, no significant associations of the IL-1β gene polymorphisms with the DEX/CRH-Cortisol were observed. Confirmation of our findings in future studies may add new insight into the communication between the immune system and the HPA axis. PMID:21726461
Neville, B. Anne; Sheridan, Paul O.; Harris, Hugh M. B.; Coughlan, Simone; Flint, Harry J.; Duncan, Sylvia H.; Jeffery, Ian B.; Claesson, Marcus J.; Ross, R. Paul; Scott, Karen P.; O'Toole, Paul W.
2013-01-01
Some Eubacterium and Roseburia species are among the most prevalent motile bacteria present in the intestinal microbiota of healthy adults. These flagellate species contribute “cell motility” category genes to the intestinal microbiome and flagellin proteins to the intestinal proteome. We reviewed and revised the annotation of motility genes in the genomes of six Eubacterium and Roseburia species that occur in the human intestinal microbiota and examined their respective locus organization by comparative genomics. Motility gene order was generally conserved across these loci. Five of these species harbored multiple genes for predicted flagellins. Flagellin proteins were isolated from R. inulinivorans strain A2-194 and from E. rectale strains A1-86 and M104/1. The amino-termini sequences of the R. inulinivorans and E. rectale A1-86 proteins were almost identical. These protein preparations stimulated secretion of interleukin-8 (IL-8) from human intestinal epithelial cell lines, suggesting that these flagellins were pro-inflammatory. Flagellins from the other four species were predicted to be pro-inflammatory on the basis of alignment to the consensus sequence of pro-inflammatory flagellins from the β- and γ- proteobacteria. Many fliC genes were deduced to be under the control of σ28. The relative abundance of the target Eubacterium and Roseburia species varied across shotgun metagenomes from 27 elderly individuals. Genes involved in the flagellum biogenesis pathways of these species were variably abundant in these metagenomes, suggesting that the current depth of coverage used for metagenomic sequencing (3.13–4.79 Gb total sequence in our study) insufficiently captures the functional diversity of genomes present at low (≤1%) relative abundance. E. rectale and R. inulinivorans thus appear to synthesize complex flagella composed of flagellin proteins that stimulate IL-8 production. 
A greater depth of sequencing, improved evenness of sequencing and improved metagenome assembly from short reads will be required to facilitate in silico analyses of complete complex biochemical pathways for low-abundance target species from shotgun metagenomes. PMID:23935906
The king cobra genome reveals dynamic gene evolution and adaptation in the snake venom system
Vonk, Freek J.; Casewell, Nicholas R.; Henkel, Christiaan V.; Heimberg, Alysha M.; Jansen, Hans J.; McCleary, Ryan J. R.; Kerkkamp, Harald M. E.; Vos, Rutger A.; Guerreiro, Isabel; Calvete, Juan J.; Wüster, Wolfgang; Woods, Anthony E.; Logan, Jessica M.; Harrison, Robert A.; Castoe, Todd A.; de Koning, A. P. Jason; Pollock, David D.; Yandell, Mark; Calderon, Diego; Renjifo, Camila; Currier, Rachel B.; Salgado, David; Pla, Davinia; Sanz, Libia; Hyder, Asad S.; Ribeiro, José M. C.; Arntzen, Jan W.; van den Thillart, Guido E. E. J. M.; Boetzer, Marten; Pirovano, Walter; Dirks, Ron P.; Spaink, Herman P.; Duboule, Denis; McGlinn, Edwina; Kini, R. Manjunatha; Richardson, Michael K.
2013-01-01
Snakes are limbless predators, and many species use venom to help overpower relatively large, agile prey. Snake venoms are complex protein mixtures encoded by several multilocus gene families that function synergistically to cause incapacitation. To examine venom evolution, we sequenced and interrogated the genome of a venomous snake, the king cobra (Ophiophagus hannah), and compared it, together with our unique transcriptome, microRNA, and proteome datasets from this species, with data from other vertebrates. In contrast to the platypus, the only other venomous vertebrate with a sequenced genome, we find that snake toxin genes evolve through several distinct co-option mechanisms and exhibit surprisingly variable levels of gene duplication and directional selection that correlate with their functional importance in prey capture. The enigmatic accessory venom gland shows a very different pattern of toxin gene expression from the main venom gland and seems to have recruited toxin-like lectin genes repeatedly for new nontoxic functions. In addition, tissue-specific microRNA analyses suggested the co-option of core genetic regulatory components of the venom secretory system from a pancreatic origin. Although the king cobra is limbless, we recovered coding sequences for all Hox genes involved in amniote limb development, with the exception of Hoxd12. Our results provide a unique view of the origin and evolution of snake venom and reveal multiple genome-level adaptive responses to natural selection in this complex biological weapon system. More generally, they provide insight into mechanisms of protein evolution under strong selection. PMID:24297900
Ohm-Laursen, Line; Nielsen, Morten; Larsen, Stine R; Barington, Torben
2006-01-01
Antibody diversity is created by imprecise joining of the variable (V), diversity (D) and joining (J) gene segments of the heavy and light chain loci. Analysis of rearrangements is complicated by somatic hypermutations and uncertainty concerning the sources of gene segments and the precise way in which they recombine. It has been suggested that D genes with irregular recombination signal sequences (DIR) and chromosome 15 open reading frames (OR15) can replace conventional D genes, that two D genes or inverted D genes may be used and that the repertoire can be further diversified by heavy chain V gene (VH) replacement. Safe conclusions require large, well-defined sequence samples and algorithms minimizing stochastic assignment of segments. Two computer programs were developed for analysis of heavy chain joints. JointHMM is a profile hidden Markov model, while JointML is a maximum-likelihood-based method taking the lengths of the joint and the mutational status of the VH gene into account. The programs were applied to a set of 6329 clonally unrelated rearrangements. A conventional D gene was found in 80% of unmutated sequences and 64% of mutated sequences, while D-gene assignment was kept below 5% in artificial (randomly permutated) rearrangements. No evidence for the use of DIR, OR15, multiple D genes or VH replacements was found, while inverted D genes were used in less than 1‰ of the sequences. JointML was shown to have a higher predictive performance for D-gene assignment in mutated and unmutated sequences than four other publicly available programs. An online version 1·0 of JointML is available at http://www.cbs.dtu.dk/services/VDJsolver. PMID:17005006
Jena, Kshirod K; Hechanova, Sherry Lou; Verdeprado, Holden; Prahalada, G D; Kim, Sung-Ryul
2017-11-01
A first set of 25 NILs carrying ten BPH resistance genes and their pyramids was developed in the background of indica variety IR24 for insect resistance breeding in rice. Brown planthopper (Nilaparvata lugens Stål) is one of the most destructive insect pests in rice. Development of near-isogenic lines (NILs) is an important strategy for genetic analysis of brown planthopper (BPH) resistance (R) genes and their deployment against diverse BPH populations. A set of 25 NILs with 9 single R genes and 16 multiple R gene combinations consisting of 11 two-gene pyramids and 5 three-gene pyramids in the genetic background of the susceptible indica rice cultivar IR24 was developed through marker-assisted selection. The linked DNA markers for each of the R genes were used for foreground selection and confirming the introgressed regions of the BPH R genes. Modified seed box screening and feeding rate of BPH were used to evaluate the spectrum of resistance. BPH reaction of each of the NILs carrying different single genes was variable at the antibiosis level with the four BPH populations of the Philippines. The NILs with two- to three-pyramided genes showed a stronger level of antibiosis (49.3-99.0%) against BPH populations compared with NILs with a single R gene (42.0-83.5%) and IR24 (10.0%). Background genotyping by high-density SNP markers revealed that most of the chromosome regions of the NILs (BC3F5) had IR24 genome recovery of 82.0-94.2%. Data on six major agronomic traits showed that the NILs had agronomic performance phenotypically comparable with IR24. These newly developed NILs will be useful as new genetic resources for BPH resistance breeding and are valuable sources of genes in monitoring against the emerging BPH biotypes in different rice-growing countries.
dbCPG: A web resource for cancer predisposition genes.
Wei, Ran; Yao, Yao; Yang, Wu; Zheng, Chun-Hou; Zhao, Min; Xia, Junfeng
2016-06-21
Cancer predisposition genes (CPGs) are genes in which inherited mutations confer highly or moderately increased risks of developing cancer. Identification of these genes and understanding the biological mechanisms that underlie them is crucial for the prevention, early diagnosis, and optimized management of cancer. Over the past decades, great efforts have been made to identify CPGs through multiple strategies. However, information on these CPGs and their molecular functions is scattered. To address this issue and provide a comprehensive resource for researchers, we developed the Cancer Predisposition Gene Database (dbCPG, Database URL: http://bioinfo.ahu.edu.cn:8080/dbCPG/index.jsp), the first literature-based gene resource for exploring human CPGs. It contains 827 human (724 protein-coding, 23 non-coding, and 80 unknown type genes), 637 rat, and 658 mouse CPGs. Furthermore, data mining was performed to gain insights into the CPG data, including functional annotation, gene prioritization, network analysis of prioritized genes and overlap analysis across multiple cancer types. A user-friendly web interface with multiple browse, search, and upload functions was also developed to facilitate access to the latest information on CPGs. Taken together, the dbCPG database provides a comprehensive data resource for further studies of cancer predisposition genes.
Katz Sand, Ilana B.; Honce, Justin M.; Lublin, Fred D.
2015-01-01
Several single gene disorders share clinical and radiologic characteristics with multiple sclerosis and have the potential to be overlooked in the differential diagnostic evaluation of both adult and paediatric patients with multiple sclerosis. This group includes lysosomal storage disorders, various mitochondrial diseases, other neurometabolic disorders, and several other miscellaneous disorders. Recognition of a single-gene disorder as causal for a patient’s ‘multiple sclerosis-like’ phenotype is critically important for accurate direction of patient management, and evokes broader genetic counselling implications for affected families. Here we review single gene disorders that have the potential to mimic multiple sclerosis, provide an overview of clinical and investigational characteristics of each disorder, and present guidelines for when clinicians should suspect an underlying heritable disorder that requires diagnostic confirmation in a patient with a definite or probable diagnosis of multiple sclerosis. PMID:25636970
Methods for simultaneous control of lignin content and composition, and cellulose content in plants
Chiang, Vincent Lee C.; Li, Laigeng
2005-02-15
A method of concurrently introducing multiple genes into plants and trees is provided. The method includes simultaneous transformation of plants with multiple genes from the phenylpropanoid pathways including 4CL, CAld5H, AldOMT, SAD and CAD genes and combinations thereof to produce various lines of transgenic plants displaying altered agronomic traits. The agronomic traits of the plants are regulated by the orientation of the specific genes and the selected gene combinations, which are incorporated into the plant genome.
Ma, Quan-Ping; Su, Liang; Liu, Jing-Wen; Yao, Ming-Xiao; Yuan, Guang-Ying
2018-06-01
The aim of the present study was to investigate the correlation between the multi‑drug resistance of Shigella flexneri and the drug‑resistant gene cassette carried by integrons, and to detect the associations between drug‑resistance and gene mutations of the active efflux pump acrAB‑tolC gene and its regulatory genes, including marOR, acrR and soxS. A total of 158 isolates were obtained from the stool samples of 1,026 children with diarrhoea aged 14 years old between May 2012 and October 2015 in Henan. The K‑B method was applied for the determination of drug resistance of Shigella flexneri, and polymerase chain reaction amplification was used for class 1, 2 and 3 integrase genes. Enzyme digestion and sequence analysis were performed for the variable regions of positive strains. Based on the drug sensitivity assessment, multi‑drug resistant strains that were resistant to five or more antibiotics, and sensitive strains were selected for amplification. Their active efflux pump genes, acrA and acrB, and regulatory genes, marOR, acrR and soxS, were selected for sequencing. The results revealed that 91.1% of the 158 strains were multi‑resistant to ampicillin, chloramphenicol, tetracycline and streptomycin, and 69.6% of the strains were multi‑resistant to sulfamethoxazole/trimethoprim. The resistance to ceftazidime, ciprofloxacin and levofloxacin was <32.9%. All strains (100%) were sensitive to cefoxitin, cefoperazone/sulbactam and imipenem. The rate of the class 1 integron positivity was 91.9% (144/158). Among these class 1 integron‑positive strains, 18 strains exhibited the resistance gene cassette dfrV in the variable region of the strain, four strains exhibited dfrA17‑aadA5 in the variable region and 140 strains exhibited blaOXA‑30‑aadA1 in the variable region. Four strains showed no resistance gene in the variable regions.
The rate of class 2 integron positivity was 86.1% (136/158), and all positive strains harboured the dfrA1‑sat1‑aadA resistance gene cassette in the variable region. The class 3 integrase gene was not detected in these strains. The gene sequencing showed the deletion of the bases CATT at sites 36, 37, 38 and 39 in the marOR gene, which is a regulatory gene of the active efflux pump, AcrAB‑TolC. Taken together, the multi‑drug resistance of Shigella flexneri was closely associated with gene mutations of class 1 and 2 integrons and the marOR gene.
Xie, Ping; Wu, Zi Yi; Zhao, Jiang Yan; Sang, Yan Fang; Chen, Jie
2018-04-01
A stochastic hydrological process is influenced by both stochastic and deterministic factors. A hydrological time series contains not only pure random components reflecting its inheritance characteristics, but also deterministic components reflecting variability characteristics, such as jump, trend, period, and stochastic dependence. As a result, the stochastic hydrological process presents complicated evolution phenomena and rules. To better understand these complicated phenomena and rules, this study described the inheritance and variability characteristics of an inconsistent hydrological series from two aspects: stochastic process simulation and time series analysis. In addition, several frequency analysis approaches for inconsistent time series were compared to reveal the main problems in inconsistency study. Then, we proposed a new concept of hydrological genes, derived by analogy from biological genes, to describe inconsistent hydrological processes. The hydrological genes were constructed using moment methods, such as general moments, weight function moments, probability weight moments and L-moments. Meanwhile, the five components of a stochastic hydrological process (jump, trend, periodic, dependence and pure random components) were defined as five hydrological bases. With this method, the inheritance and variability of inconsistent hydrological time series were synthetically considered and the inheritance, variability and evolution principles were fully described. Our study contributes to revealing the inheritance, variability and evolution principles in the probability distribution of hydrological elements.
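As an illustration of one of the moment methods named in the abstract above, the following is a minimal sketch of the first two sample L-moments computed from probability-weighted moments. The function name and the toy series are hypothetical; the abstract does not state how the authors construct their hydrological genes from such moments, so this shows only the standard estimator, not their procedure.

```python
def sample_l_moments(values):
    """Return the first two sample L-moments (l1, l2) of a series.

    l1 is the mean (location); l2 is the L-scale, equal to half the
    mean absolute difference between pairs of observations.
    """
    x = sorted(values)
    n = len(x)
    if n < 2:
        raise ValueError("need at least two observations")
    # Unbiased probability-weighted moments b0 and b1.
    # With 0-based rank i, the weight of the i-th order statistic is i / (n - 1).
    b0 = sum(x) / n
    b1 = sum(i * xi for i, xi in enumerate(x)) / (n * (n - 1))
    l1 = b0
    l2 = 2 * b1 - b0
    return l1, l2


if __name__ == "__main__":
    # Hypothetical annual series, for illustration only.
    print(sample_l_moments([1.0, 2.0, 3.0, 4.0]))
```

Because L-moments are linear combinations of order statistics, they are less sensitive to outliers than conventional moments, which is one common reason for using them in hydrological frequency analysis.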
Shivange, Amol V; Hoeffken, Hans Wolfgang; Haefner, Stefan; Schwaneberg, Ulrich
2016-12-01
Protein consensus-based surface engineering (ProCoS) is a simple and efficient method for directed protein evolution combining computational analysis and molecular biology tools to engineer protein surfaces. ProCoS is based on the hypothesis that conserved residues originated from a common ancestor and that these residues are crucial for the function of a protein, whereas highly variable regions (situated on the surface of a protein) can be targeted for surface engineering to maximize performance. ProCoS comprises four main steps: (i) identification of conserved and highly variable regions; (ii) protein sequence design by substituting residues in the highly variable regions, and gene synthesis; (iii) in vitro DNA recombination of synthetic genes; and (iv) screening for active variants. ProCoS is a simple method for surface mutagenesis in which multiple sequence alignment is used for selection of surface residues based on a structural model. To demonstrate the technique's utility for directed evolution, the surface of a phytase enzyme from Yersinia mollaretii (Ymphytase) was subjected to ProCoS. Screening just 1050 clones from ProCoS engineering-guided mutant libraries yielded an enzyme with 34 amino acid substitutions. The surface-engineered Ymphytase exhibited 3.8-fold higher pH stability (at pH 2.8 for 3 h) and retained 40% of the enzyme's specific activity (400 U/mg) compared with the wild-type Ymphytase. The pH stability might be attributed to a significantly increased (20 percentage points; from 9% to 29%) number of negatively charged amino acids on the surface of the engineered phytase.
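Step (i) above, identifying conserved and highly variable regions from a multiple sequence alignment, can be sketched with a per-column Shannon entropy score. This is an assumed scoring choice for illustration only: the abstract does not say which conservation measure ProCoS uses, and the function names, the threshold, and the toy alignment below are all hypothetical.

```python
import math
from collections import Counter


def column_entropies(alignment):
    """Shannon entropy (in bits) of each column of an equal-length alignment."""
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    entropies = []
    for column in zip(*alignment):
        counts = Counter(column)
        total = len(column)
        h = -sum((c / total) * math.log2(c / total) for c in counts.values())
        entropies.append(h)
    return entropies


def classify_columns(alignment, threshold=1.0):
    """Label each column 'conserved', 'variable', or 'intermediate'.

    A column is 'conserved' if every sequence has the same residue, and
    'variable' if its entropy is at least `threshold` bits (assumed cutoff).
    """
    labels = []
    for h in column_entropies(alignment):
        if h < 1e-9:
            labels.append("conserved")
        elif h >= threshold:
            labels.append("variable")
        else:
            labels.append("intermediate")
    return labels


if __name__ == "__main__":
    toy_alignment = ["MKTA", "MKSA", "MRGA", "MKDA"]  # made-up sequences
    print(classify_columns(toy_alignment))
```

In a workflow like the one described, columns labelled variable would then be filtered against a structural model so that only surface-exposed positions are considered for substitution.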
Sylvatic plague reduces genetic variability in black-tailed prairie dogs.
Trudeau, Kristie M; Britten, Hugh B; Restani, Marco
2004-04-01
Small, isolated populations are vulnerable to loss of genetic diversity through inbreeding and genetic drift. Sylvatic plague due to infection by the bacterium Yersinia pestis caused an epizootic in the early 1990s resulting in declines and extirpations of many black-tailed prairie dog (Cynomys ludovicianus) colonies in north-central Montana, USA. Plague-induced population bottlenecks may contribute to significant reductions in genetic variability. In contrast, gene flow maintains genetic variability within colonies. We investigated the impacts of the plague epizootic and distance to nearest colony on levels of genetic variability in six prairie dog colonies sampled between June 1999 and July 2001 using 24 variable randomly amplified polymorphic DNA (RAPD) markers. Number of effective alleles per locus (n(e)) and gene diversity (h) were significantly decreased in the three colonies affected by plague that were recovering from the resulting bottlenecks compared with the three colonies that did not experience plague. Genetic variability was not significantly affected by geographic distance between colonies. The majority of variance in gene frequencies was found within prairie dog colonies. Conservation of genetic variability in black-tailed prairie dogs will require the preservation of both large and small colony complexes and the gene flow among them.
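The two summary statistics reported above, effective number of alleles n(e) and gene diversity h, have standard single-locus definitions in terms of allele frequencies p_i: n(e) = 1 / Σ p_i² and h = 1 − Σ p_i². The sketch below applies those textbook formulas to made-up frequencies. How the authors estimated allele frequencies from dominant RAPD bands is not stated in the abstract, so that step is left out, and the function names are hypothetical.

```python
def _check_frequencies(freqs):
    """Validate that allele frequencies are non-negative and sum to 1."""
    if any(p < 0 for p in freqs):
        raise ValueError("allele frequencies must be non-negative")
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")


def gene_diversity(freqs):
    """Gene diversity h = 1 - sum(p_i^2) at a single locus."""
    _check_frequencies(freqs)
    return 1.0 - sum(p * p for p in freqs)


def effective_alleles(freqs):
    """Effective number of alleles n_e = 1 / sum(p_i^2) at a single locus."""
    _check_frequencies(freqs)
    return 1.0 / sum(p * p for p in freqs)


if __name__ == "__main__":
    # Illustrative two-allele loci: balanced vs. skewed after a bottleneck.
    for freqs in ([0.5, 0.5], [0.9, 0.1]):
        print(freqs, gene_diversity(freqs), effective_alleles(freqs))
```

Both statistics fall as frequencies become more uneven, which is the direction of change the abstract reports for the colonies recovering from plague.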
Bowen, Lizabeth; Miles, A. Keith; Murray, Michael; Haulena, Martin; Tuttle, Judy; van Bonn, William; Adams, Lance; Bodkin, James L.; Ballachey, Brenda E.; Estes, James A.; Tinker, M. Tim; Keister, Robin; Stott, Jeffrey L.
2012-01-01
Gene transcription analysis for diagnosing or monitoring wildlife health requires the ability to distinguish pathophysiological change from natural variation. Herein, we describe methodology for the development of quantitative real-time polymerase chain reaction (qPCR) assays to measure differential transcript levels of multiple immune function genes in the sea otter (Enhydra lutris); sea otter-specific qPCR primer sequences for the genes of interest are defined. We establish a ‘reference’ range of transcripts for each gene in a group of clinically healthy captive and free-ranging sea otters. The 10 genes of interest represent multiple physiological systems that play a role in immuno-modulation, inflammation, cell protection, tumour suppression, cellular stress response, xenobiotic metabolizing enzymes, antioxidant enzymes and cell–cell adhesion. The cycle threshold (CT) measures for most genes were normally distributed; the complement cytolysis inhibitor was the exception. The relative enumeration of multiple gene transcripts in simple peripheral blood samples expands the diagnostic capability currently available to assess the health of sea otters in situ and provides a better understanding of the state of their environment.
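A reference range of the kind described above can be sketched as mean ± k standard deviations of the cycle threshold values from healthy animals, which is a common convention when values are approximately normally distributed. The abstract does not state how the authors defined their range, so the choice of k = 2, the function names, and the CT values below are assumptions for illustration only.

```python
import statistics


def reference_range(ct_values, k=2.0):
    """Return (low, high) as mean +/- k sample standard deviations.

    Assumes the CT values are approximately normally distributed.
    """
    if len(ct_values) < 2:
        raise ValueError("need at least two CT values")
    mean = statistics.mean(ct_values)
    sd = statistics.stdev(ct_values)
    return mean - k * sd, mean + k * sd


def is_within_range(ct, ct_range):
    """True if a single CT measurement lies inside the reference range."""
    low, high = ct_range
    return low <= ct <= high


if __name__ == "__main__":
    healthy_ct = [20.0, 21.0, 22.0, 23.0, 24.0]  # made-up values
    rng = reference_range(healthy_ct)
    print(rng, is_within_range(22.5, rng), is_within_range(30.0, rng))
```

For a gene whose CT values are not normally distributed, as the abstract notes for the complement cytolysis inhibitor, a percentile-based interval would be a more defensible choice than this parametric one.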
Wagner-Schuman, Melissa; Neitz, Jay; Rha, Jungtae; Williams, David R.; Neitz, Maureen; Carroll, Joseph
2010-01-01
Our understanding of the etiology of red-green color vision defects is evolving. While missense mutations within the long- (L-) and middle-wavelength sensitive (M-) photopigments and gross rearrangements within the L/M-opsin gene array are commonly associated with red-green defects, recent work using adaptive optics retinal imaging has shown that different genotypes can have distinct consequences for the cone mosaic. Here we examined the cone mosaic in red-green color deficient individuals with multiple X-chromosome opsin genes that encode L opsin, as well as individuals with a single X-chromosome opsin gene that encodes L opsin and a single patient with a novel premature termination codon in his M-opsin gene and a normal L-opsin gene. We observed no difference in cone density between normal trichromats and multiple or single gene dichromats. In addition, we demonstrate different phenotypic effects of a nonsense mutation versus the previously described deleterious polymorphism, (LIAVA), both of which differ from multiple and single gene dichromats. Our results help refine the relationship between opsin genotype and cone photoreceptor mosaic phenotype. PMID:20854834
Bao, Zehua; Xiao, Han; Liang, Jing; Zhang, Lu; Xiong, Xiong; Sun, Ning; Si, Tong; Zhao, Huimin
2015-05-15
One-step multiple gene disruption in the model organism Saccharomyces cerevisiae is a highly useful tool for both basic and applied research, but it remains a challenge. Here, we report a rapid, efficient, and potentially scalable strategy based on the type II Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-CRISPR associated proteins (Cas) system to generate multiple gene disruptions simultaneously in S. cerevisiae. A 100 bp dsDNA mutagenizing homologous recombination donor is inserted between two direct repeats for each target gene in a CRISPR array consisting of multiple donor and guide sequence pairs. An ultrahigh copy number plasmid carrying iCas9, a variant of wild-type Cas9, trans-encoded RNA (tracrRNA), and a homology-integrated crRNA cassette is designed to greatly increase the gene disruption efficiency. As proof of concept, three genes, CAN1, ADE2, and LYP1, were simultaneously disrupted in 4 days with an efficiency ranging from 27 to 87%. Another three genes involved in an artificial hydrocortisone biosynthetic pathway, ATF2, GCY1, and YPR1, were simultaneously disrupted in 6 days with 100% efficiency. This homology-integrated CRISPR (HI-CRISPR) strategy represents a powerful tool for creating yeast strains with multiple gene knockouts.
Combining lipophilic dye, in situ hybridization, immunohistochemistry, and histology.
Duncan, Jeremy; Kersigo, Jennifer; Gray, Brian; Fritzsch, Bernd
2011-03-17
Going beyond single gene function to cut deeper into gene regulatory networks requires multiple mutations combined in a single animal. Such analysis of two or more genes needs to be complemented with in situ hybridization of other genes, or immunohistochemistry of their proteins, both in whole mounted developing organs or sections for detailed resolution of the cellular and tissue expression alterations. Combining multiple gene alterations requires the use of cre or flipase to conditionally delete genes and avoid embryonic lethality. Required breeding schemes dramatically enhance effort and cost proportional to the number of genes mutated, with an outcome of very few animals with the full repertoire of genetic modifications desired. Amortizing the vast amount of effort and time to obtain these few precious specimens that are carrying multiple mutations necessitates tissue optimization. Moreover, investigating a single animal with multiple techniques makes it easier to correlate gene deletion defects with expression profiles. We have developed a technique to obtain a more thorough analysis of a given animal; with the ability to analyze several different histologically recognizable structures as well as gene and protein expression all from the same specimen in both whole mounted organs and sections. Although mice have been utilized to demonstrate the effectiveness of this technique it can be applied to a wide array of animals. To do this we combine lipophilic dye tracing, whole mount in situ hybridization, immunohistochemistry, and histology to extract the maximal possible amount of data.
Combining Lipophilic dye, in situ Hybridization, Immunohistochemistry, and Histology
Duncan, Jeremy; Kersigo, Jennifer; Gray, Brian; Fritzsch, Bernd
2011-01-01
Going beyond single gene function to cut deeper into gene regulatory networks requires multiple mutations combined in a single animal. Such analysis of two or more genes needs to be complemented with in situ hybridization of other genes, or immunohistochemistry of their proteins, both in whole mounted developing organs or sections for detailed resolution of the cellular and tissue expression alterations. Combining multiple gene alterations requires the use of cre or flipase to conditionally delete genes and avoid embryonic lethality. Required breeding schemes dramatically enhance effort and cost proportional to the number of genes mutated, with an outcome of very few animals with the full repertoire of genetic modifications desired. Amortizing the vast amount of effort and time to obtain these few precious specimens that are carrying multiple mutations necessitates tissue optimization. Moreover, investigating a single animal with multiple techniques makes it easier to correlate gene deletion defects with expression profiles. We have developed a technique to obtain a more thorough analysis of a given animal; with the ability to analyze several different histologically recognizable structures as well as gene and protein expression all from the same specimen in both whole mounted organs and sections. Although mice have been utilized to demonstrate the effectiveness of this technique it can be applied to a wide array of animals. To do this we combine lipophilic dye tracing, whole mount in situ hybridization, immunohistochemistry, and histology to extract the maximal possible amount of data. PMID:21445047
Comparative analysis and visualization of multiple collinear genomes
2012-01-01
Background Genome browsers are a common tool used by biologists to visualize genomic features including genes, polymorphisms, and many others. However, existing genome browsers and visualization tools are not well-suited to perform meaningful comparative analysis among a large number of genomes. With the increasing quantity and availability of genomic data, there is an increased burden to provide useful visualization and analysis tools for comparison of multiple collinear genomes such as the large panels of model organisms which are the basis for much of the current genetic research. Results We have developed a novel web-based tool for visualizing and analyzing multiple collinear genomes. Our tool illustrates genome-sequence similarity through a mosaic of intervals representing local phylogeny, subspecific origin, and haplotype identity. Comparative analysis is facilitated through reordering and clustering of tracks, which can vary throughout the genome. In addition, we provide local phylogenetic trees as an alternate visualization to assess local variations. Conclusions Unlike previous genome browsers and viewers, ours allows for simultaneous and comparative analysis. Our browser provides intuitive selection and interactive navigation about features of interest. Dynamic visualizations adjust to scale and data content making analysis at variable resolutions and of multiple data sets more informative. We demonstrate our genome browser for an extensive set of genomic data sets composed of almost 200 distinct mouse laboratory strains. PMID:22536897
Tian, Xin; Xin, Mingyuan; Luo, Jian; Liu, Mingyao; Jiang, Zhenran
2017-02-01
The selection of relevant genes for breast cancer metastasis is critical for the treatment and prognosis of cancer patients. Although much effort has been devoted to gene selection procedures using different statistical analysis methods or computational techniques, the interpretation of the variables in the resulting survival models has been limited so far. This article proposes a new Random Forest (RF)-based algorithm to identify important variables strongly associated with breast cancer metastasis, which is based on the importance scores of two variable selection algorithms: the mean decrease Gini (MDG) criterion of Random Forest and the GeneRank algorithm with protein-protein interaction (PPI) information. The new gene selection algorithm is called PPIRF. The improved prediction accuracy illustrated the reliability and high interpretability of the gene list selected by the PPIRF approach.
Dean, C; Jones, J; Favreau, M; Dunsmuir, P; Bedbrook, J
1988-01-01
The petunia rbcS gene SSU301 was introduced into tobacco using Agrobacterium tumefaciens-mediated transformation. The time at which rbcS expression was maximal after transfer of the tobacco plants to the greenhouse was determined. The expression level of the SSU301 gene varied up to 9-fold between individual tobacco plants which had been standardized physiologically as much as possible. The presence of adjacent pUC plasmid sequences did not affect the expression of the SSU301 gene. In an attempt to reduce the between-transformant variability in expression, the SSU301 gene was introduced into tobacco surrounded by 10 kb of 5' and 13 kb of 3' DNA sequences which normally flank SSU301 in petunia. The longer flanking regions did not reduce the between-transformant variability of SSU301 gene expression. PMID:3174450
Array data extractor (ADE): a LabVIEW program to extract and merge gene array data
2013-01-01
Background Large data sets from gene expression array studies are publicly available offering information highly valuable for research across many disciplines ranging from fundamental to clinical research. Highly advanced bioinformatics tools have been made available to researchers, but a demand for user-friendly software allowing researchers to quickly extract expression information for multiple genes from multiple studies persists. Findings Here, we present a user-friendly LabVIEW program to automatically extract gene expression data for a list of genes from multiple normalized microarray datasets. Functionality was tested for 288 class A G protein-coupled receptors (GPCRs) and expression data from 12 studies comparing normal and diseased human hearts. Results confirmed known regulation of a beta 1 adrenergic receptor and further indicate novel research targets. Conclusions Although existing software allows for complex data analyses, the LabVIEW based program presented here, “Array Data Extractor (ADE)”, provides users with a tool to retrieve meaningful information from multiple normalized gene expression datasets in a fast and easy way. Further, the graphical programming language used in LabVIEW allows applying changes to the program without the need of advanced programming knowledge. PMID:24289243
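The core task described above, pulling expression values for a list of genes out of several normalized datasets and merging them into one table, can be sketched in a few lines. This is not the ADE program, which is written in LabVIEW; it is a hypothetical Python analogue, and the function name, data layout, study names, and expression values are all invented for illustration.

```python
def extract_genes(datasets, genes):
    """Merge expression values for selected genes across studies.

    datasets: {study_name: {gene_symbol: expression_value}}
    genes:    iterable of gene symbols to extract
    Returns   {gene_symbol: {study_name: value or None}}; None marks a gene
              that is absent from a given study.
    """
    return {
        gene: {study: table.get(gene) for study, table in datasets.items()}
        for gene in genes
    }


if __name__ == "__main__":
    # Made-up normalized expression values for two hypothetical studies.
    datasets = {
        "study_A": {"ADRB1": 7.2, "ADRB2": 6.1, "GAPDH": 12.4},
        "study_B": {"ADRB1": 6.5, "GAPDH": 12.1},
    }
    merged = extract_genes(datasets, ["ADRB1", "ADRB2"])
    for gene, by_study in merged.items():
        print(gene, by_study)
```

Keeping missing genes as explicit None entries, rather than dropping them, makes it visible which studies did not measure a given gene when the merged table is inspected.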
Chaw, R. Crystal; Collin, Matthew; Wimmer, Marjorie; Helmrick, Kara-Leigh; Hayashi, Cheryl Y.
2017-01-01
Spiders swath their eggs with silk to protect developing embryos and hatchlings. Egg case silks, like other fibrous spider silks, are primarily composed of proteins called spidroins (spidroin = spider-fibroin). Silks, and thus spidroins, are important throughout the lives of spiders, yet the evolution of spidroin genes has been relatively understudied. Spidroin genes are notoriously difficult to sequence because they are typically very long (≥ 10 kb of coding sequence) and highly repetitive. Here, we investigate the evolution of spider silk genes through long-read sequencing of Bacterial Artificial Chromosome (BAC) clones. We demonstrate that the silver garden spider Argiope argentata has multiple egg case spidroin loci with a loss of function at one locus. We also use degenerate PCR primers to search the genomic DNA of congeneric species and find evidence for multiple egg case spidroin loci in other Argiope spiders. Comparative analyses show that these multiple loci are more similar at the nucleotide level within a species than between species. This pattern is consistent with concerted evolution homogenizing gene copies within a genome. More complicated explanations include convergent evolution or recent independent gene duplications within each species. PMID:29127108
Mark Welch, David B; Cummings, Michael P; Hillis, David M; Meselson, Matthew
2004-02-10
Rotifers of the asexual class Bdelloidea are unusual in possessing two or more divergent copies of every gene that has been examined. Phylogenetic analysis of the heat-shock gene hsp82 and the TATA-box-binding protein gene tbp in multiple bdelloid species suggested that for each gene, each copy belonged to one of two lineages that began to diverge before the bdelloid radiation. Such gene trees are consistent with the two lineages having descended from former alleles that began to diverge after meiotic segregation ceased or from subgenomes of an alloploid ancestor of the bdelloids. However, the original analyses of bdelloid gene-copy divergence used only a single outgroup species and were based on parsimony and neighbor joining. We have now used maximum likelihood and Bayesian inference methods and, for hsp82, multiple outgroups in an attempt to produce more robust gene trees. Here we report that the available data do not unambiguously discriminate between gene trees that root the origin of hsp82 and tbp copy divergence before the bdelloid radiation and those which indicate that the gene copies began to diverge within bdelloid families. The remarkable presence of multiple diverged gene copies in individual genomes is nevertheless consistent with the loss of sex in an ancient ancestor of bdelloids.
Gene panel testing for hereditary breast cancer.
Winship, Ingrid; Southey, Melissa C
2016-03-21
Inherited predisposition to breast cancer is explained only in part by mutations in the BRCA1 and BRCA2 genes. Most families with an apparent familial clustering of breast cancer who are investigated through Australia's network of genetic services and familial cancer centres do not have mutations in either of these genes. More recently, additional breast cancer predisposition genes, such as PALB2, have been identified. New genetic technology allows a panel of multiple genes to be tested for mutations in a single test. This enables more women and their families to have risk assessment and risk management, in a preventive approach to predictable breast cancer. Predictive testing for a known family-specific mutation in a breast cancer predisposition gene provides personalised risk assessment and evidence-based risk management. Breast cancer predisposition gene panel tests have a greater diagnostic yield than conventional testing of only the BRCA1 and BRCA2 genes. The clinical validity and utility of some of the putative breast cancer predisposition genes is not yet clear. Ethical issues warrant consideration, as multiple gene panel testing has the potential to identify secondary findings not originally sought by the test requested. Multiple gene panel tests may provide an affordable and effective way to investigate the heritability of breast cancer.
Speranskaya, Anna S; Krinitsina, Anastasia A; Kudryavtseva, Anna V; Poltronieri, Palmiro; Santino, Angelo; Oparina, Nina Y; Dmitriev, Alexey A; Belenikin, Maxim S; Guseva, Marina A; Shevelev, Alexei B
2012-08-01
The group of Kunitz-type protease inhibitors (KPI) from potato is encoded by a polymorphic family of multiple allelic and non-allelic genes. The previous explanations of the KPI variability were based on the hypothesis of random mutagenesis as a key factor of KPI polymorphism. KPI-A genes from the genomes of Solanum tuberosum cv. Istrinskii and the wild species Solanum palustre were amplified by PCR with subsequent cloning in plasmids. True KPI sequences were derived from comparison of the cloned copies. "Hot spots" of recombination in KPI genes were independently identified by DnaSP 4.0 and TOPALi v2.5 software. The KPI-A sequence from potato cv. Istrinskii was found to be 100% identical to the gene from Solanum nigrum. This fact illustrates a high degree of similarity of KPI genes in the genus Solanum. Pairwise comparison of KPI A and B genes unambiguously showed a non-uniform extent of polymorphism at different nt positions. Moreover, the occurrence of substitutions was not random along the strand. Taken together, these facts contradict the traditional hypothesis of random mutagenesis as a principal source of KPI gene polymorphism. The experimentally found mosaic structure of KPI genes in both plants studied is consistent with the hypothesis suggesting recombination of ancestral genes. The same mechanism was proposed earlier for other resistance-conferring genes in the nightshade family (Solanaceae). Based on the data obtained, we searched for potential motifs of site-specific binding with plant DNA recombinases. During this work, we analyzed the sequencing data reported by the Potato Genome Sequencing Consortium (PGSC), 2011 and found considerable inconsistencies in their data concerning the number, location, and orientation of KPI genes of groups A and B. The key role of recombination rather than random point mutagenesis in KPI polymorphism was demonstrated for the first time. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Natural Killer Cell Receptor Genes in the Family Equidae: Not only Ly49
Futas, Jan; Horin, Petr
2013-01-01
Natural killer (NK) cells have important functions in immunity. NK recognition in mammals can be mediated through killer cell immunoglobulin-like receptors (KIR) and/or killer cell lectin-like Ly49 receptors. Genes encoding highly variable NK cell receptors (NKR) represent rapidly evolving genomic regions. No single conservative model of NKR genes was observed in mammals. Single-copy low polymorphic NKR genes present in one mammalian species may expand into highly polymorphic multigene families in other species. In contrast to other non-rodent mammals, multiple Ly49-like genes appear to exist in the horse, while no functional KIR genes were observed in this species. In this study, Ly49 and KIR were sought and their evolution was characterized in the entire family Equidae. Genomic sequences retrieved showed the presence of at least five highly conserved polymorphic Ly49 genes in horses, asses and zebras. These findings confirmed that the expansion of Ly49 occurred in the entire family. Several KIR-like sequences were also identified in the genome of Equids. Besides a previously identified non-functional KIR-Immunoglobulin-like transcript fusion gene (KIR-ILTA) and two putative pseudogenes, a KIR3DL-like sequence was analyzed. In contrast to previous observations made in the horse, the KIR3DL sequence, genomic organization and mRNA expression suggest that all Equids might produce a functional KIR receptor protein molecule with a single non-mutated immune tyrosine-based inhibition motif (ITIM) domain. No evidence for positive selection in the KIR3DL gene was found. Phylogenetic analysis including rhinoceros and tapir genomic DNA and deduced amino acid KIR-related sequences showed differences between families and even between species within the order Perissodactyla. 
The results suggest that the order Perissodactyla and its family Equidae with expanded Ly49 genes and with a potentially functional KIR gene may represent an interesting model for evolutionary biology of NKR genes. PMID:23724088
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chaiboonchoe, Amphun; Ghamsari, Lila; Dohai, Bushra
Metabolic networks, which are mathematical representations of organismal metabolism, are reconstructed to provide computational platforms to guide metabolic engineering experiments and explore fundamental questions on metabolism. Systems level analyses, such as interrogation of phylogenetic relationships within the network, can provide further guidance on the modification of metabolic circuitries. Chlamydomonas reinhardtii, a biofuel relevant green alga that has retained key genes with plant, animal, and protist affinities, serves as an ideal model organism to investigate the interplay between gene function and phylogenetic affinities at multiple organizational levels. Here, using detailed topological and functional analyses, coupled with transcriptomics studies on a metabolic network that we have reconstructed for C. reinhardtii, we show that network connectivity has a significant concordance with the co-conservation of genes; however, a distinction between topological and functional relationships is observable within the network. Dynamic and static modes of co-conservation were defined and observed in a subset of gene-pairs across the network topologically. In contrast, genes with predicted synthetic interactions, or genes involved in coupled reactions, show significant enrichment for both shorter and longer phylogenetic distances. Based on our results, we propose that the metabolic network of C. reinhardtii is assembled with an architecture to minimize phylogenetic profile distances topologically, while it includes an expansion of such distances for functionally interacting genes. This arrangement may increase the robustness of C. reinhardtii's network in dealing with varied environmental challenges that the species may face. As a result, the defined evolutionary constraints within the network, which identify important pairings of genes in metabolism, may offer guidance on synthetic biology approaches to optimize the production of desirable metabolites.
Kutschera, Verena E; Bidon, Tobias; Hailer, Frank; Rodi, Julia L; Fain, Steven R; Janke, Axel
2014-08-01
Ursine bears are a mammalian subfamily that comprises six morphologically and ecologically distinct extant species. Previous phylogenetic analyses of concatenated nuclear genes could not resolve all relationships among bears, and appeared to conflict with the mitochondrial phylogeny. Evolutionary processes such as incomplete lineage sorting and introgression can cause gene tree discordance and complicate phylogenetic inferences, but are not accounted for in phylogenetic analyses of concatenated data. We generated a high-resolution data set of autosomal introns from several individuals per species and of Y-chromosomal markers. Incorporating intraspecific variability in coalescence-based phylogenetic and gene flow estimation approaches, we traced the genealogical history of individual alleles. Considerable heterogeneity among nuclear loci and discordance between nuclear and mitochondrial phylogenies were found. A species tree with divergence time estimates indicated that ursine bears diversified within less than 2 My. Consistent with a complex branching order within a clade of Asian bear species, we identified unidirectional gene flow from Asian black into sloth bears. Moreover, gene flow detected from brown into American black bears can explain the conflicting placement of the American black bear in mitochondrial and nuclear phylogenies. These results highlight that both incomplete lineage sorting and introgression are prominent evolutionary forces even on time scales up to several million years. Complex evolutionary patterns are not adequately captured by strictly bifurcating models, and can only be fully understood when analyzing multiple independently inherited loci in a coalescence framework. Phylogenetic incongruence among gene trees hence needs to be recognized as a biologically meaningful signal. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Chaiboonchoe, Amphun; Ghamsari, Lila; Dohai, Bushra; Ng, Patrick; Khraiwesh, Basel; Jaiswal, Ashish; Jijakli, Kenan; Koussa, Joseph; Nelson, David R; Cai, Hong; Yang, Xinping; Chang, Roger L; Papin, Jason; Yu, Haiyuan; Balaji, Santhanam; Salehi-Ashtiani, Kourosh
2016-07-19
Metabolic networks, which are mathematical representations of organismal metabolism, are reconstructed to provide computational platforms to guide metabolic engineering experiments and explore fundamental questions on metabolism. Systems level analyses, such as interrogation of phylogenetic relationships within the network, can provide further guidance on the modification of metabolic circuitries. Chlamydomonas reinhardtii, a biofuel relevant green alga that has retained key genes with plant, animal, and protist affinities, serves as an ideal model organism to investigate the interplay between gene function and phylogenetic affinities at multiple organizational levels. Here, using detailed topological and functional analyses, coupled with transcriptomics studies on a metabolic network that we have reconstructed for C. reinhardtii, we show that network connectivity has a significant concordance with the co-conservation of genes; however, a distinction between topological and functional relationships is observable within the network. Dynamic and static modes of co-conservation were defined and observed in a subset of gene-pairs across the network topologically. In contrast, genes with predicted synthetic interactions, or genes involved in coupled reactions, show significant enrichment for both shorter and longer phylogenetic distances. Based on our results, we propose that the metabolic network of C. reinhardtii is assembled with an architecture to minimize phylogenetic profile distances topologically, while it includes an expansion of such distances for functionally interacting genes. This arrangement may increase the robustness of C. reinhardtii's network in dealing with varied environmental challenges that the species may face. The defined evolutionary constraints within the network, which identify important pairings of genes in metabolism, may offer guidance on synthetic biology approaches to optimize the production of desirable metabolites.
Erickson, Keesha E; Otoupal, Peter B; Chatterjee, Anushree
2017-01-01
Antibiotic-resistant bacteria are an increasingly serious public health concern, as strains emerge that demonstrate resistance to almost all available treatments. One factor that contributes to the crisis is the adaptive ability of bacteria, which exhibit remarkable phenotypic and gene expression heterogeneity in order to gain a survival advantage in damaging environments. This high degree of variability in gene expression across biological populations makes it a challenging task to identify key regulators of bacterial adaptation. Here, we research the regulation of adaptive resistance by investigating transcriptome profiles of Escherichia coli upon adaptation to disparate toxins, including antibiotics and biofuels. We locate potential target genes via conventional gene expression analysis as well as using a new analysis technique examining differential gene expression variability. By investigating trends across the diverse adaptation conditions, we identify a focused set of genes with conserved behavior, including those involved in cell motility, metabolism, membrane structure, and transport, and several genes of unknown function. To validate the biological relevance of the observed changes, we synthetically perturb gene expression using clustered regularly interspaced short palindromic repeat (CRISPR)-dCas9. Manipulation of select genes in combination with antibiotic treatment promotes adaptive resistance as demonstrated by an increased degree of antibiotic tolerance and heterogeneity in MICs. We study the mechanisms by which identified genes influence adaptation and find that select differentially variable genes have the potential to impact metabolic rates, mutation rates, and motility. Overall, this work provides evidence for a complex nongenetic response, encompassing shifts in gene expression and gene expression variability, which underlies adaptive resistance. 
IMPORTANCE Even initially sensitive bacteria can rapidly thwart antibiotic treatment through stress response processes known as adaptive resistance. Adaptive resistance fosters transient tolerance increases and the emergence of mutations conferring heritable drug resistance. In order to extend the applicable lifetime of new antibiotics, we must seek to hinder the occurrence of bacterial adaptive resistance; however, the regulation of adaptation is difficult to identify due to immense heterogeneity emerging during evolution. This study specifically seeks to generate heterogeneity by adapting bacteria to different stresses and then examines gene expression trends across the disparate populations in order to pinpoint key genes and pathways associated with adaptive resistance. The targets identified here may eventually inform strategies for impeding adaptive resistance and prolonging the effectiveness of antibiotic treatment.
Functional and mechanistic diversity of distal transcription enhancers
Bulger, Michael; Groudine, Mark
2013-01-01
Biological differences among metazoans, and between cell types in a given organism, arise in large part due to differences in gene expression patterns. The sequencing of multiple metazoan genomes, coupled with recent advances in genome-wide analysis of histone modifications and transcription factor binding, has revealed that among regulatory DNA sequences, gene-distal enhancers appear to exhibit the greatest diversity and cell-type specificity. Moreover, such elements are emerging as important targets for mutations that can give rise to disease and to genetic variability that underlies evolutionary change. Studies of long-range interactions between distal genomic sequences in the nucleus indicate that enhancers are often important determinants of nuclear organization, contributing to a general model for enhancer function that involves direct enhancer-promoter contact. In a number of systems, however, mechanisms for enhancer function are emerging that do not fit solely within such a model, suggesting that enhancers as a class of DNA regulatory element may be functionally and mechanistically diverse. PMID:21295696
Adaptability of non-genetic diversity in bacterial chemotaxis
Frankel, Nicholas W; Pontius, William; Dufour, Yann S; Long, Junjiajia; Hernandez-Nunez, Luis; Emonet, Thierry
2014-01-01
Bacterial chemotaxis systems are as diverse as the environments that bacteria inhabit, but how much environmental variation can cells tolerate with a single system? Diversification of a single chemotaxis system could serve as an alternative, or even evolutionary stepping-stone, to switching between multiple systems. We hypothesized that mutations in gene regulation could lead to heritable control of chemotactic diversity. By simulating foraging and colonization of E. coli using a single-cell chemotaxis model, we found that different environments selected for different behaviors. The resulting trade-offs show that populations facing diverse environments would ideally diversify behaviors when time for navigation is limited. We show that advantageous diversity can arise from changes in the distribution of protein levels among individuals, which could occur through mutations in gene regulation. We propose experiments to test our prediction that chemotactic diversity in a clonal population could be a selectable trait that enables adaptation to environmental variability. DOI: http://dx.doi.org/10.7554/eLife.03526.001 PMID:25279698
Kuwahara, Tomomi; Yamashita, Atsushi; Hirakawa, Hideki; Nakayama, Haruyuki; Toh, Hidehiro; Okada, Natsumi; Kuhara, Satoru; Hattori, Masahira; Hayashi, Tetsuya; Ohnishi, Yoshinari
2004-01-01
Bacteroides are predominant human colonic commensals, but the principal pathogenic species, Bacteroides fragilis (BF), lives closely associated with the mucosal surface, whereas a second major species, Bacteroides thetaiotaomicron (BT), concentrates within the colon. We find corresponding differences in their genomes, based on determination of the genome sequence of BF and comparative analysis with BT. Both species have acquired two mechanisms that contribute to their dominance among the colonic microbiota: an exceptional capability to use a wide range of dietary polysaccharides by gene amplification and the capacity to create variable surface antigenicities by multiple DNA inversion systems. However, the gene amplification for polysaccharide assimilation is more developed in BT, in keeping with its internal localization. In contrast, external antigenic structures can be changed more systematically in BF. Thereby, at the mucosal surface, where microbes encounter continuous attack by host defenses, BF evasion of the immune system is favored, and its colonization and infectious potential are increased. PMID:15466707
Jiang, Haiqin; Jin, Yali; Vissa, Varalakshmi; Zhang, Liangfen; Liu, Weijun; Qin, Lianhua; Wan, Kanglin; Wu, Xiaocui; Wang, Hongsheng; Liu, Weida; Wang, Baoxi
2017-04-06
Cutaneous tuberculosis (CTB) is probably underreported due to difficulties in detection and diagnosis. To address this issue, genotypes of Mycobacterium tuberculosis strains isolated from 30 patients with CTB were mapped at multiple loci, namely, RD105 deletions, spacer oligonucleotides, and Mycobacterial Interspersed Repetitive Unit-Variable Number Tandem Repeats (MIRU-VNTRs). Fifty-eight strains of pulmonary tuberculosis (PTB) were mapped as experimental controls. Drug resistance-associated gene mutations were determined by amplicon sequencing of target regions within 7 genes. Beijing family isolates were the most prevalent strains in CTB and PTB. MIRU-VNTR typing separated the Beijing strains from the non-Beijing strains, and the majority of CTB could be separated from PTB counterparts. Drug resistance determining regions showed only one CTB strain expressing isoniazid resistance. Thus, while the CTB strains belonged to the same phylogenetic lineages and sub-lineages as the PTB strains, they differed at the level of several MIRU-VNTRs and in the proportion of drug resistance.
Bacteriophage P2 ogr and P4 delta genes act independently and are essential for P4 multiplication.
Halling, C; Calendar, R
1990-01-01
Satellite bacteriophage P4 requires the products of the late genes of a helper phage such as P2 for lytic growth. Expression of the P2 late genes is positively regulated by the P2 ogr gene in a process requiring P2 DNA replication. Transactivation of P2 late gene expression by P4 requires the P4 delta gene product and works even in the absence of P2 DNA replication. We have made null mutants of the P2 ogr and P4 delta genes. In the absence of the P4 delta gene product, P4 multiplication required both the P2 ogr protein and P2 DNA replication. In the absence of the P2 ogr gene product, P4 multiplication required the P4 delta protein. In complementation experiments, we found that the P2 ogr protein was made in the absence of P2 DNA replication but could not function unless P2 DNA replicated. We produced P4 delta protein from a plasmid and found that it complemented the null P4 delta and P2 ogr mutants. PMID:2193911
Castro-Mujica, María Del Carmen; Barletta-Carrillo, Claudia; Poterico, Julio A; Acosta, Marisa; Valer, Jesús; Cruz, Miguel De La
2017-01-01
Gorlin syndrome (GS) is a genetic disorder with an autosomal dominant inheritance pattern, with complete penetrance and variable expressivity. GS is caused by germline mutations in the genes PTCH1 or SUFU, which are components of the Sonic hedgehog molecular pathway. GS is characterized by the presence of multiple nevoid basal cell carcinomas, odontogenic cysts, calcification of the falx cerebri, and lesions in the palms and soles. This study is the first to report cases in Peru of patients with GS who underwent genetic evaluation and counseling. We present two GS cases that meet the clinical criteria for the syndrome and review the literature.
Murphree, Colin A; Li, Qing; Heist, E Patrick; Moe, Luke A
2014-09-17
An Enterobacter cloacae strain (E. cloacae F3S3) that was collected as part of a project to assess antibiotic resistance among bacteria isolated from bioethanol fermentation facilities demonstrated high levels of resistance to antibiotics added prophylactically to bioethanol fermentors. PCR assays revealed the presence of canonical genes encoding resistance to penicillin (ampC) and erythromycin (ermG). Assays measuring biofilm formation under antibiotic stress indicated that erythromycin induced biofilm formation in E. cloacae F3S3. Planktonic growth and biofilm formation were observed at a high ethanol content, indicating E. cloacae F3S3 can persist in a bioethanol fermentor under the highly variable environmental conditions found in fermentors.
Multifocal Epithelial Hyperplasia of Oral Cavity Expressing HPV 16 Gene: A Rare Entity
Prabhat, M. P. V.; Raja Lakshmi, Chintamaneni; Sai Madhavi, N.; Bhavana, Sujana Mulk; Sarat, Gummadapu; Ramamohan, Kodali
2013-01-01
Focal epithelial hyperplasia is a rare contagious disease caused by human papilloma virus. Usually HPV involves either cutaneous or mucosal surfaces, whereas concomitant mucocutaneous involvement is extremely rare. We report such a unique case of multifocal epithelial hyperplasia involving multiple sites of oral cavity along with skin lesions in a 65-year-old female. We also discuss the probable multifactorial etiology and variable clinical presentations of the lesions, including evidence of HPV 16 expression, as detected by polymerase chain reaction. The present report illustrates the need for careful examination and prompt diagnosis of the disease, as it might be associated with high risk genotypes such as HPV 16 and 18. PMID:24455323
Parmar, Jyotsana J; Das, Dibyendu; Padinhateeri, Ranjith
2016-02-29
It is being increasingly realized that nucleosome organization on DNA crucially regulates DNA-protein interactions and the resulting gene expression. While the spatial character of the nucleosome positioning on DNA has been experimentally and theoretically studied extensively, the temporal character is poorly understood. Accounting for ATPase activity and DNA-sequence effects on nucleosome kinetics, we develop a theoretical method to estimate the time of continuous exposure of binding sites of non-histone proteins (e.g. transcription factors and TATA binding proteins) along any genome. Applying the method to Saccharomyces cerevisiae, we show that the exposure timescales are determined by cooperative dynamics of multiple nucleosomes, and their behavior is often different from expectations based on static nucleosome occupancy. Examining exposure times in the promoters of GAL1 and PHO5, we show that our theoretical predictions are consistent with known experiments. We apply our method genome-wide and discover huge gene-to-gene variability of mean exposure times of TATA boxes and patches adjacent to TSS (+1 nucleosome region); the resulting timescale distributions have non-exponential tails. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Smith, Jennifer A; Zhao, Wei; Yasutake, Kalyn; August, Carmella; Ratliff, Scott M; Faul, Jessica D; Boerwinkle, Eric; Chakravarti, Aravinda; Diez Roux, Ana V; Gao, Yan; Griswold, Michael E; Heiss, Gerardo; Kardia, Sharon L R; Morrison, Alanna C; Musani, Solomon K; Mwasongwe, Stanford; North, Kari E; Rose, Kathryn M; Sims, Mario; Sun, Yan V; Weir, David R; Needham, Belinda L
2017-12-18
Inter-individual variability in blood pressure (BP) is influenced by both genetic and non-genetic factors including socioeconomic and psychosocial stressors. A deeper understanding of the gene-by-socioeconomic/psychosocial factor interactions on BP may help to identify individuals that are genetically susceptible to high BP in specific social contexts. In this study, we used a genomic region-based method for longitudinal analysis, Longitudinal Gene-Environment-Wide Interaction Studies (LGEWIS), to evaluate the effects of interactions between known socioeconomic/psychosocial and genetic risk factors on systolic and diastolic BP in four large epidemiologic cohorts of European and/or African ancestry. After correction for multiple testing, two interactions were significantly associated with diastolic BP. In European ancestry participants, outward/trait anger score had a significant interaction with the C10orf107 genomic region (p = 0.0019). In African ancestry participants, depressive symptom score had a significant interaction with the HFE genomic region (p = 0.0048). This study provides a foundation for using genomic region-based longitudinal analysis to identify subgroups of the population that may be at greater risk of elevated BP due to the combined influence of genetic and socioeconomic/psychosocial risk factors.
Population and genomic analysis of the genus Halorubrum
Fullmer, Matthew S.; Soucy, Shannon M.; Swithers, Kristen S.; Makkay, Andrea M.; Wheeler, Ryan; Ventosa, Antonio; Gogarten, J. Peter; Papke, R. Thane
2014-01-01
The Halobacteria are known to engage in frequent gene transfer and homologous recombination. For stably diverged lineages to persist some checks on the rate of between lineage recombination must exist. We surveyed a group of isolates from the Aran-Bidgol endorheic lake in Iran and sequenced a selection of them. Multilocus Sequence Analysis (MLSA) and Average Nucleotide Identity (ANI) revealed multiple clusters (phylogroups) of organisms present in the lake. Patterns of intein and Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) presence/absence and their sequence similarity, GC usage along with the ANI and the identities of the genes used in the MLSA revealed that two of these clusters share an exchange bias toward others in their phylogroup while showing reduced rates of exchange with other organisms in the environment. However, a third cluster, composed in part of named species from other areas of central Asia, displayed many indications of variability in exchange partners, from within the lake as well as outside the lake. We conclude that barriers to gene exchange exist between the two purely Aran-Bidgol phylogroups, and that the third cluster with members from other regions is not a single population and likely reflects an amalgamation of several populations. PMID:24782836
A new fast method for inferring multiple consensus trees using k-medoids.
Tahiri, Nadia; Willems, Matthieu; Makarenkov, Vladimir
2018-04-05
Gene trees carry important information about specific evolutionary patterns which characterize the evolution of the corresponding gene families. However, a reliable species consensus tree cannot be inferred from a multiple sequence alignment of a single gene family or from the concatenation of alignments corresponding to gene families having different evolutionary histories. These evolutionary histories can be quite different due to horizontal transfer events or to ancient gene duplications which cause the emergence of paralogs within a genome. Many methods have been proposed to infer a single consensus tree from a collection of gene trees. Still, the application of these tree merging methods can lead to the loss of specific evolutionary patterns which characterize some gene families or some groups of gene families. Thus, the problem of inferring multiple consensus trees from a given set of gene trees becomes relevant. We describe a new fast method for inferring multiple consensus trees from a given set of phylogenetic trees (i.e. additive trees or X-trees) defined on the same set of species (i.e. objects or taxa). The traditional consensus approach yields a single consensus tree. We use the popular k-medoids partitioning algorithm to divide a given set of trees into several clusters of trees. We propose novel versions of the well-known Silhouette and Caliński-Harabasz cluster validity indices that are adapted for tree clustering with k-medoids. The efficiency of the new method was assessed using both synthetic and real data, such as a well-known phylogenetic dataset consisting of 47 gene trees inferred for 14 archaeal organisms. The method described here allows inference of multiple consensus trees from a given set of gene trees. It can be used to identify groups of gene trees having similar intragroup and different intergroup evolutionary histories. 
The main advantage of our method is that it is much faster than the existing tree clustering approaches, while providing similar or better clustering results in most cases. This makes it particularly well suited for the analysis of large genomic and phylogenetic datasets.
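The partitioning step described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes a precomputed symmetric matrix of tree-to-tree distances (for example Robinson-Foulds distances; the numbers below are hypothetical), uses a naive initialisation, and scores the partition with the standard Silhouette index rather than the adapted versions proposed in the paper.

```python
def k_medoids(dist, k, max_iter=100):
    """Alternating k-medoids on a precomputed symmetric distance matrix."""
    n = len(dist)
    medoids = list(range(k))  # naive start: the first k items (an assumption)
    labels = [0] * n
    for _ in range(max_iter):
        # Assignment step: each tree joins the cluster of its nearest medoid.
        labels = [min(range(k), key=lambda c: dist[i][medoids[c]])
                  for i in range(n)]
        # Update step: the new medoid minimises total distance within its cluster.
        new_medoids = []
        for c in range(k):
            members = [i for i in range(n) if labels[i] == c]
            if not members:  # empty cluster: keep the previous medoid
                new_medoids.append(medoids[c])
                continue
            new_medoids.append(
                min(members, key=lambda m: sum(dist[m][j] for j in members)))
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return medoids, labels


def mean_silhouette(dist, labels):
    """Standard mean Silhouette width; requires at least two clusters."""
    clusters = {}
    for i, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(i)
    total = 0.0
    for i, lab in enumerate(labels):
        own = [j for j in clusters[lab] if j != i]
        if not own:  # singleton cluster contributes 0 by convention
            continue
        a = sum(dist[i][j] for j in own) / len(own)
        b = min(sum(dist[i][j] for j in members) / len(members)
                for other, members in clusters.items() if other != lab)
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(labels)


# Hypothetical distances between six gene trees forming two groups.
D = [
    [0, 2, 2, 8, 8, 8],
    [2, 0, 2, 8, 8, 8],
    [2, 2, 0, 8, 8, 8],
    [8, 8, 8, 0, 2, 2],
    [8, 8, 8, 2, 0, 2],
    [8, 8, 8, 2, 2, 0],
]
medoids, labels = k_medoids(D, 2)
print(medoids, labels, mean_silhouette(D, labels))
```

In practice one would run this for several values of k and keep the partition with the best validity index; a consensus tree would then be built separately for each cluster.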
Oshiki, Mamoru; Segawa, Takahiro; Ishii, Satoshi
2018-02-02
Various microorganisms play key roles in the nitrogen (N) cycle. Quantitative PCR (qPCR) and PCR-amplicon sequencing of the N cycle functional genes allow us to analyze the abundance and diversity of microbes responsible for the N-transforming reactions in various environmental samples. However, analysis of multiple target genes can be cumbersome and expensive. PCR-independent analysis, such as metagenomics and metatranscriptomics, is useful but expensive, especially when we analyze multiple samples and try to detect N cycle functional genes present at relatively low abundance. Here, we present the application of microfluidic qPCR chip technology to simultaneously quantify and prepare amplicon sequence libraries for multiple N cycle functional genes as well as taxon-specific 16S rRNA gene markers for many samples. This approach, named the N cycle evaluation (NiCE) chip, was evaluated by using DNA from pure and artificially mixed bacterial cultures and by comparing the results with those obtained by conventional qPCR and amplicon sequencing methods. Quantitative results obtained by the NiCE chip were comparable to those obtained by conventional qPCR. In addition, the NiCE chip was successfully applied to examine abundance and diversity of N cycle functional genes in wastewater samples. Although non-specific amplification was detected on the NiCE chip, this could be overcome by optimizing the primer sequences in the future. As the NiCE chip can provide a high-throughput format to quantify and prepare sequence libraries for multiple N cycle functional genes, this tool should advance our ability to explore N cycling in various samples. Importance. We report a novel approach, namely the Nitrogen Cycle Evaluation (NiCE) chip, which uses microfluidic qPCR chip technology. By sequencing the amplicons recovered from the NiCE chip, we can assess diversities of the N cycle functional genes.
The NiCE chip technology can be applied to analyze the temporal dynamics of N cycle gene transcription in wastewater treatment bioreactors. The NiCE chip can provide a high-throughput format to quantify and prepare sequence libraries for multiple N cycle functional genes. While there is room for future improvement, this tool should significantly advance our ability to explore the N cycle in various environmental samples. Copyright © 2018 American Society for Microbiology.
Ensemble positive unlabeled learning for disease gene identification.
Yang, Peng; Li, Xiaoli; Chua, Hon-Nian; Kwoh, Chee-Keong; Ng, See-Kiong
2014-01-01
An increasing number of genes have been experimentally confirmed in recent years as causative genes for various human diseases. The newly available knowledge can be exploited by machine learning methods to discover additional unknown genes that are likely to be associated with diseases. In particular, positive unlabeled learning (PU learning) methods, which require only a positive training set P (confirmed disease genes) and an unlabeled set U (the unknown candidate genes) instead of a negative training set N, have been shown to be effective in uncovering new disease genes in the current scenario. Prediction from only a single source of data can be susceptible to bias due to incompleteness and noise in the genomic data, and a single machine learning predictor is prone to bias caused by inherent limitations of individual methods. In this paper, we propose an effective PU learning framework that integrates multiple biological data sources and an ensemble of powerful machine learning classifiers for disease gene identification. Our proposed method integrates data from multiple biological sources for training PU learning classifiers. A novel ensemble-based PU learning method EPU is then used to integrate multiple PU learning classifiers to achieve accurate and robust disease gene predictions. Our evaluation experiments across six disease groups showed that EPU achieved significantly better results compared with various state-of-the-art prediction methods as well as ensemble learning classifiers. Through integrating multiple biological data sources for training and the outputs of an ensemble of PU learning classifiers for prediction, we are able to minimize the potential bias and errors in individual data sources and machine learning algorithms to achieve more accurate and robust disease gene predictions. In the future, our EPU method can provide an effective framework for integrating additional biological and computational resources for better disease gene predictions.
Luan, Jun-Bo; Chen, Wenbo; Hasegawa, Daniel K; Simmons, Alvin M; Wintermantel, William M; Ling, Kai-Shu; Fei, Zhangjun; Liu, Shu-Sheng; Douglas, Angela E
2015-09-15
Genomic decay is a common feature of intracellular bacteria that have entered into symbiosis with plant sap-feeding insects. This study of the whitefly Bemisia tabaci and two bacteria (Portiera aleyrodidarum and Hamiltonella defensa) cohoused in each host cell investigated whether the decay of Portiera metabolism genes is complemented by host and Hamiltonella genes, and compared the metabolic traits of the whitefly symbiosis with other sap-feeding insects (aphids, psyllids, and mealybugs). Parallel genomic and transcriptomic analysis revealed that the host genome contributes multiple metabolic reactions that complement or duplicate Portiera function, and that Hamiltonella may contribute multiple cofactors and one essential amino acid, lysine. Homologs of the Bemisia metabolism genes of insect origin have also been implicated in essential amino acid synthesis in other sap-feeding insect hosts, indicative of parallel coevolution of shared metabolic pathways across multiple symbioses. Further metabolism genes coded in the Bemisia genome are of bacterial origin, but phylogenetically distinct from Portiera, Hamiltonella and horizontally transferred genes identified in other sap-feeding insects. Overall, 75% of the metabolism genes of bacterial origin are functionally unique to one symbiosis, indicating that the evolutionary history of metabolic integration in these symbioses is strongly contingent on the pattern of horizontally acquired genes. Our analysis, further, shows that bacteria with genomic decay enable host acquisition of complex metabolic pathways by multiple independent horizontal gene transfers from exogenous bacteria. Specifically, each horizontally acquired gene can function with other genes in the pathway coded by the symbiont, while facilitating the decay of the symbiont gene coding the same reaction. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Individual and social determinants of multiple chronic disease behavioral risk factors among youth.
Alamian, Arsham; Paradis, Gilles
2012-03-22
Behavioral risk factors are known to co-occur among youth, and to increase the risk of chronic disease morbidity and mortality later in life. However, little is known about determinants of multiple chronic disease behavioral risk factors, particularly among youth. Previous studies have been cross-sectional and carried out without a sound theoretical framework. Using longitudinal data (n = 1135) from Cycle 4 (2000-2001), Cycle 5 (2002-2003) and Cycle 6 (2004-2005) of the National Longitudinal Survey of Children and Youth, a nationally representative sample of Canadian children who are followed biennially, the present study examines the influence of a set of conceptually-related individual/social distal variables (variables situated at an intermediate distance from behaviors), and individual/social ultimate variables (variables situated at an utmost distance from behaviors) on the rate of occurrence of multiple behavioral risk factors (physical inactivity, sedentary behavior, tobacco smoking, alcohol drinking, and high body mass index) in a sample of children aged 10-11 years at baseline. Multiple behavioral risk factors were assessed using a multiple risk factor score. All statistical analyses were performed using SAS, version 9.1, and SUDAAN, version 9.01. Multivariate longitudinal Poisson models showed that social distal variables including parental/peer smoking and peer drinking (Log-likelihood ratio (LLR) = 187.86, degrees of freedom (DF) = 8, p < .001), as well as individual distal variables including low self-esteem (LLR = 76.94, DF = 4, p < .001) increased the rate of occurrence of multiple behavioral risk factors. Individual ultimate variables including age, sex, and anxiety (LLR = 9.34, DF = 3, p < .05), as well as social ultimate variables including family socioeconomic status, and family structure (LLR = 10.93, DF = 5, p = .05) contributed minimally to the rate of co-occurrence of behavioral risk factors. 
The results suggest targeting individual/social distal variables in prevention programs of multiple chronic disease behavioral risk factors among youth.
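The reported p-values can be related to the LLR statistics with a short calculation. The sketch below assumes that each LLR is the usual likelihood ratio statistic, referred to a chi-square distribution with the stated degrees of freedom (the standard large-sample approximation; the abstract does not say how the p-values were obtained). The function name `chi2_sf` is illustrative.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variable with integer df >= 1.

    Uses the recurrence Q(x; nu + 2) = Q(x; nu) + (x/2)^(nu/2) e^(-x/2) / Gamma(nu/2 + 1),
    starting from the closed forms for nu = 1 (odd df) or nu = 2 (even df).
    """
    half = x / 2.0
    if df % 2 == 1:
        q, nu = math.erfc(math.sqrt(half)), 1
    else:
        q, nu = math.exp(-half), 2
    while nu < df:
        q += half ** (nu / 2.0) * math.exp(-half) / math.gamma(nu / 2.0 + 1.0)
        nu += 2
    return q


# (LLR, DF) pairs as reported in the abstract.
reported = [(187.86, 8), (76.94, 4), (9.34, 3), (10.93, 5)]
for llr, df in reported:
    print(f"LLR = {llr}, DF = {df}, p = {chi2_sf(llr, df):.3g}")
```

Under this assumption the first two statistics give p-values far below .001, while the last two land near the .05 threshold, which matches the abstract's reading that the ultimate variables contributed minimally.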
The Role of Multiple Transcription Factors In Archaeal Gene Expression
DOE Office of Scientific and Technical Information (OSTI.GOV)
Charles J. Daniels
2008-09-23
Since the inception of this research program, the project has focused on two central questions: What is the relationship between the 'eukaryal-like' transcription machinery of archaeal cells and its counterparts in eukaryal cells? And, how does the archaeal cell control gene expression using its mosaic of eukaryal core transcription machinery and its bacterial-like transcription regulatory proteins? During the grant period we have addressed these questions using a variety of in vivo approaches and have sought to specifically define the roles of the multiple TATA binding protein (TBP) and TFIIB-like (TFB) proteins in controlling gene expression in Haloferax volcanii. H. volcanii was initially chosen as a model for the Archaea based on the availability of suitable genetic tools; however, later studies showed that all haloarchaea possessed multiple tbp and tfb genes, which led to the proposal that multiple TBP and TFB proteins may function in a manner similar to alternative sigma factors in bacterial cells. In vivo transcription and promoter analysis established a clear relationship between the promoter requirements of haloarchaeal genes and those of the eukaryal RNA polymerase II promoter. Studies on heat shock gene promoters, and the demonstration that specific tfb genes were induced by heat shock, provided the first indication that TFB proteins may direct expression of specific gene families. The construction of strains lacking tbp or tfb genes, coupled with the finding that many of these genes are differentially expressed under varying growth conditions, provided further support for this model. Genetic tools were also developed that led to the construction of insertion and deletion mutants, and a novel gene expression scheme was designed that allowed the controlled expression of these genes in vivo. 
More recent studies have used a whole genome array to examine the expression of these genes and we have established a linkage between the expression of specific tfb genes and the regulation of nitrogen metabolism and other global cellular responses.
Ma, Xingliang; Zhang, Qunyu; Zhu, Qinlong; Liu, Wei; Chen, Yan; Qiu, Rong; Wang, Bin; Yang, Zhongfang; Li, Heying; Lin, Yuru; Xie, Yongyao; Shen, Rongxin; Chen, Shuifu; Wang, Zhi; Chen, Yuanling; Guo, Jingxin; Chen, Letian; Zhao, Xiucai; Dong, Zhicheng; Liu, Yao-Guang
2015-08-01
CRISPR/Cas9 genome targeting systems have been applied to a variety of species. However, most CRISPR/Cas9 systems reported for plants can only modify one or a few target sites. Here, we report a robust CRISPR/Cas9 vector system, utilizing a plant codon optimized Cas9 gene, for convenient and high-efficiency multiplex genome editing in monocot and dicot plants. We designed PCR-based procedures to rapidly generate multiple sgRNA expression cassettes, which can be assembled into the binary CRISPR/Cas9 vectors in one round of cloning by Golden Gate ligation or Gibson Assembly. With this system, we edited 46 target sites in rice with an average 85.4% rate of mutation, mostly in biallelic and homozygous status. We reasoned that about 16% of the homozygous mutations in rice were generated through the non-homologous end-joining mechanism followed by homologous recombination-based repair. We also obtained uniform biallelic, heterozygous, homozygous, and chimeric mutations in Arabidopsis T1 plants. The targeted mutations in both rice and Arabidopsis were heritable. We provide examples of loss-of-function gene mutations in T0 rice and T1 Arabidopsis plants by simultaneous targeting of multiple (up to eight) members of a gene family, multiple genes in a biosynthetic pathway, or multiple sites in a single gene. This system has provided a versatile toolbox for studying functions of multiple genes and gene families in plants for basic research and genetic improvement. Copyright © 2015 The Author. Published by Elsevier Inc. All rights reserved.
Morata, Jordi; Puigdomènech, Pere
2017-02-08
Cucurbitaceae species contain a significantly lower number of genes coding for proteins with similarity to plant resistance genes belonging to the NBS-LRR family than other plant species of similar genome size. A large proportion of these genes are organized in clusters that appear to be hotspots of variability. The genomes of the Cucurbitaceae species measured until now are intermediate in size (between 350 and 450 Mb) and they apparently have not undergone any genome duplications besides those at the origin of eudicots. The cluster containing the largest number of NBS-LRR genes has previously been analyzed in melon and related species and showed a high degree of interspecific and intraspecific variability. It was of interest to study whether similar behavior occurred in other clusters of the same family of genes. The cluster of NBS-LRR genes located in melon chromosome 9 was analyzed and compared with the syntenic regions in other cucurbit genomes. This is the second-largest cluster by gene number in this species and it contains nine sequences with an NBS-LRR annotation including two genes, Fom1 and Prv, providing resistance against Fusarium and Papaya ring-spot virus (PRSV). The variability within the melon species appears to consist essentially of single nucleotide polymorphisms. Clusters of similar genes are present in the syntenic regions of the two species of Cucurbitaceae that were sequenced, cucumber and watermelon. Most of the genes in the syntenic clusters can be aligned between species and a hypothesis of generation of the cluster is proposed. The number of genes in the watermelon cluster is similar to that in melon while a higher number of genes (12) is present in cucumber, a species with a smaller genome than melon. After comparing genome resequencing data of 115 cucumber varieties, deletion of a group of genes is observed in a group of varieties of Indian origin. 
Clusters of genes coding for NBS-LRR proteins in cucurbits appear to have specific variability in different regions of the genome and between different species. This observation is in favour of considering that the adaptation of plant species to changing environments is based upon the variability that may occur at any location in the genome and that has been produced by specific mechanisms of sequence variation acting on plant genomes. This information could be useful both to understand the evolution of species and for plant breeding.
Fractional populations in multiple gene inheritance.
Chung, Myung-Hoon; Kim, Chul Koo; Nahm, Kyun
2003-01-22
With complete knowledge of the human genome sequence, one of the most interesting tasks remaining is to understand the functions of individual genes and how they communicate. Using the information about genes (locus, allele, mutation rate, fitness, etc.), we attempt to explain population demographic data. This population evolution study could complement and enhance biologists' understanding about genes. We present a general approach to study population genetics in complex situations. In the present approach, multiple allele inheritance, multiple loci inheritance, natural selection and mutations are allowed simultaneously in order to consider a more realistic situation. A simulation program is presented so that readers can readily carry out studies with their own parameters. It is shown that the multiplicity of the loci greatly affects the demographic results of fractional population ratios. Furthermore, the study indicates that some high infant mortality rates due to congenital anomalies can be attributed to multiple loci inheritance. The simulation program can be downloaded from http://won.hongik.ac.kr/~mhchung/index_files/yapop.htm. In order to run this program, one needs Visual Studio.NET platform, which can be downloaded from http://msdn.microsoft.com/netframework/downloads/default.asp.
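The kind of recursion such a simulation iterates can be illustrated with a much simpler case. The sketch below is not the authors' program: it assumes a single locus with two alleles (A and a), random mating, viability selection and one-way mutation from A to a, whereas the paper treats multiple alleles and multiple loci simultaneously. All function and parameter names are hypothetical.

```python
def next_generation(q, w_AA, w_Aa, w_aa, mu):
    """Advance the frequency q of allele a by one generation.

    Selection acts on genotype fitnesses (w_AA, w_Aa, w_aa) under random mating,
    then a fraction mu of the remaining A alleles mutates to a.
    """
    p = 1.0 - q
    mean_w = p * p * w_AA + 2.0 * p * q * w_Aa + q * q * w_aa
    q_sel = q * (p * w_Aa + q * w_aa) / mean_w
    return q_sel + mu * (1.0 - q_sel)


def simulate(q0, generations, **params):
    """Iterate next_generation from initial frequency q0."""
    q = q0
    for _ in range(generations):
        q = next_generation(q, **params)
    return q


# Recessive lethal allele maintained by recurrent mutation (illustrative values).
mu = 1e-4
q_final = simulate(0.5, 5000, w_AA=1.0, w_Aa=1.0, w_aa=0.0, mu=mu)
print(q_final, q_final ** 2)  # allele frequency and frequency of affected homozygotes
```

For a recessive lethal the recursion settles where q squared equals mu, the classical mutation-selection balance, so the fraction of affected homozygotes is set by the mutation rate. Extending this to several loci is what changes the fractional population ratios discussed in the abstract.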
Fusagene vectors: a novel strategy for the expression of multiple genes from a single cistron.
Gäken, J; Jiang, J; Daniel, K; van Berkel, E; Hughes, C; Kuiper, M; Darling, D; Tavassoli, M; Galea-Lauri, J; Ford, K; Kemeny, M; Russell, S; Farzaneh, F
2000-12-01
Transduction of cells with multiple genes, allowing their stable and co-ordinated expression, is difficult with the available methodologies. A method has been developed for expression of multiple gene products, as fusion proteins, from a single cistron. The encoded proteins are post-synthetically cleaved and processed into each of their constituent proteins as individual, biologically active factors. Specifically, linkers encoding cleavage sites for the Golgi expressed endoprotease, furin, have been incorporated between in-frame cDNA sequences encoding different secreted or membrane bound proteins. With this strategy we have developed expression vectors encoding multiple proteins (IL-2 and B7.1, IL-4 and B7.1, IL-4 and IL-2, IL-12 p40 and p35, and IL-12 p40, p35 and IL-2). Transduction and analysis of over 100 individual clones, derived from murine and human tumour cell lines, demonstrate the efficient expression and biological activity of each of the encoded proteins. Fusagene vectors enable the co-ordinated expression of multiple gene products from a single, monocistronic, expression cassette.
Methylation of HPA axis related genes in men with hypersexual disorder.
Jokinen, Jussi; Boström, Adrian E; Chatzittofis, Andreas; Ciuculete, Diana M; Öberg, Katarina Görts; Flanagan, John N; Arver, Stefan; Schiöth, Helgi B
2017-06-01
Hypersexual Disorder (HD), defined as a non-paraphilic sexual desire disorder with components of compulsivity, impulsivity and behavioral addiction, and proposed as a diagnosis in the DSM 5, shares some overlapping features with substance use disorder including common neurotransmitter systems and dysregulated hypothalamic-pituitary-adrenal (HPA) axis function. In this study, comprising 67 HD male patients and 39 male healthy volunteers, we aimed to identify HPA-axis coupled CpG-sites, in which modifications of the epigenetic profile are associated with hypersexuality. The genome-wide methylation pattern was measured in whole blood using the Illumina Infinium Methylation EPIC BeadChip, measuring the methylation state of over 850K CpG sites. Prior to analysis, the global DNA methylation pattern was pre-processed according to standard protocols and adjusted for white blood cell type heterogeneity. We included CpG sites located within 2000 bp of the transcriptional start site of the following HPA-axis coupled genes: Corticotropin releasing hormone (CRH), corticotropin releasing hormone binding protein (CRHBP), corticotropin releasing hormone receptor 1 (CRHR1), corticotropin releasing hormone receptor 2 (CRHR2), FKBP5 and the glucocorticoid receptor (NR3C1). We fitted multiple linear regression models of methylation M-values on a categorical variable of hypersexuality, adjusting for depression, dexamethasone non-suppression status, Childhood Trauma Questionnaire total score and plasma levels of TNF-alpha and IL-6. Of 76 tested individual CpG sites, four were nominally significant (p<0.05), associated with the genes CRH, CRHR2 and NR3C1. cg23409074, located 48 bp upstream of the transcription start site of the CRH gene, was significantly hypomethylated in hypersexual patients after corrections for multiple testing using the FDR-method. 
Methylation levels of cg23409074 were positively correlated with gene expression of the CRH gene in an independent cohort of 11 healthy male subjects. The methylation levels at the identified CRH site, cg23409074, were significantly correlated between blood and four different brain regions. CRH is an important integrator of neuroendocrine stress responses in the brain, with a key role in the addiction processes. Our results show epigenetic changes in the CRH gene related to hypersexual disorder in men. Copyright © 2017 Elsevier Ltd. All rights reserved.
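The multiple-testing correction mentioned above can be illustrated with the Benjamini-Hochberg step-up procedure, the most common FDR method. This is an assumption: the abstract says only "the FDR-method". The p-values below are hypothetical and are not the study's data.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # indices by ascending p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotone adjusted values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical nominal p-values for a handful of CpG sites.
nominal = [0.01, 0.04, 0.03, 0.005]
adjusted = benjamini_hochberg(nominal)
significant = [q < 0.05 for q in adjusted]
print(adjusted, significant)
```

A site is declared significant when its adjusted value falls below the chosen FDR level. With 76 tests, as in the study, a nominal p-value must be considerably smaller than 0.05 to survive, which is consistent with only one of the four nominally significant sites remaining.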
Takano, Oona M; Mitchell, Preston S; Gustafsson, Daniel R; Adite, Alphonse; Voelker, Gary; Light, Jessica E
2017-04-01
Host associations of highly host-specific chewing lice (Insecta: Phthiraptera) across multiple avian species remain fairly undocumented in the West African country of Benin. Two hundred and seventeen bird specimens collected from multiple localities across Benin and housed at the Texas A&M University Biodiversity Research and Teaching Collections were examined for lice. Lice were identified and genetic data (mitochondrial COI and nuclear EF-1α genes) were obtained and phylogenetically analyzed. In total, we found 15 host associations, 7 of which were new to science. Genetically, most lice from Benin were unique and could represent new species. Based on host associations and unique genetic lineages, we estimate we discovered a minimum of 4 and possibly as many as 8 new chewing louse species. Given the lack of current data on chewing louse species distributions in Benin, this study adds to the knowledge of host associations, geographic distribution, and genetic variability of avian chewing louse species in West Africa.
Amodio, Nicola; D'Aquila, Patrizia; Passarino, Giuseppe; Tassone, Pierfrancesco; Bellizzi, Dina
2017-01-01
Multiple Myeloma (MM) is a clonal late B-cell disorder accounting for about 13% of hematological cancers and 1% of all neoplastic diseases. Recent studies on the molecular pathogenesis and biology of MM have highlighted a complex epigenomic landscape contributing to MM onset, prognosis and high individual variability. Areas covered: We describe here the current knowledge on epigenetic events characterizing MM initiation and progression, focusing on the role of DNA and histone methylation and on the most promising epi-therapeutic approaches targeting the methylation pathway. Expert opinion: Data published so far indicate that alterations of the epigenetic framework, which include aberrant global or gene/non-coding RNA specific methylation profiles, feature prominently in the pathobiology of MM. Indeed, the aberrant expression of components of the epigenetic machinery as well as the reversibility of the epigenetic marks make this pathway druggable, providing the basis for the design of epigenetic therapies against this still fatal malignancy.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Czarnecki, Olaf; Bryan, Anthony C.; Jawdy, Sara S.
2016-02-17
Genetic engineering of plants that results in successful establishment of new biochemical or regulatory pathways requires stable introduction of one or more genes into the plant genome. It might also be necessary to down-regulate or turn off expression of endogenous genes in order to reduce activity of competing pathways. An established way to knock down gene expression in plants is expressing a hairpin-RNAi construct, eventually leading to degradation of a specifically targeted mRNA. Knockdown of multiple genes that do not share homologous sequences is still challenging and involves either sophisticated cloning strategies to create vectors with different serial expression constructs or multiple transformation events, which are often restricted by a lack of available transformation markers. Synthetic RNAi fragments were assembled in yeast carrying homologous sequences to six or seven non-family genes and introduced into pAGRIKOLA. Transformation of Arabidopsis thaliana and subsequent expression analysis of targeted genes proved efficient knockdown of all target genes. In conclusion, we present a simple and cost-effective method to create constructs to simultaneously knock down multiple non-family genes or genes that do not share sequence homology. The presented method can be applied in plant and animal synthetic biology as well as traditional plant and animal genetic engineering.
dbCPG: A web resource for cancer predisposition genes
Wei, Ran; Yao, Yao; Yang, Wu; Zheng, Chun-Hou; Zhao, Min; Xia, Junfeng
2016-01-01
Cancer predisposition genes (CPGs) are genes in which inherited mutations confer highly or moderately increased risks of developing cancer. Identification of these genes and understanding the biological mechanisms that underlie them is crucial for the prevention, early diagnosis, and optimized management of cancer. Over the past decades, great efforts have been made to identify CPGs through multiple strategies. However, information on these CPGs and their molecular functions is scattered. To address this issue and provide a comprehensive resource for researchers, we developed the Cancer Predisposition Gene Database (dbCPG, Database URL: http://bioinfo.ahu.edu.cn:8080/dbCPG/index.jsp), the first literature-based gene resource for exploring human CPGs. It contains 827 human (724 protein-coding, 23 non-coding, and 80 unknown type genes), 637 rat, and 658 mouse CPGs. Furthermore, data mining was performed to gain insights into the CPG data, including functional annotation, gene prioritization, network analysis of prioritized genes and overlap analysis across multiple cancer types. A user-friendly web interface with multiple browse, search, and upload functions was also developed to facilitate access to the latest information on CPGs. Taken together, the dbCPG database provides a comprehensive data resource for further studies of cancer predisposition genes. PMID:27192119
Node-Based Learning of Multiple Gaussian Graphical Models
Mohan, Karthik; London, Palma; Fazel, Maryam; Witten, Daniela; Lee, Su-In
2014-01-01
We consider the problem of estimating high-dimensional Gaussian graphical models corresponding to a single set of variables under several distinct conditions. This problem is motivated by the task of recovering transcriptional regulatory networks on the basis of gene expression data containing heterogeneous samples, such as different disease states, multiple species, or different developmental stages. We assume that most aspects of the conditional dependence networks are shared, but that there are some structured differences between them. Rather than assuming that similarities and differences between networks are driven by individual edges, we take a node-based approach, which in many cases provides a more intuitive interpretation of the network differences. We consider estimation under two distinct assumptions: (1) differences between the K networks are due to individual nodes that are perturbed across conditions, or (2) similarities among the K networks are due to the presence of common hub nodes that are shared across all K networks. Using a row-column overlap norm penalty function, we formulate two convex optimization problems that correspond to these two assumptions. We solve these problems using an alternating direction method of multipliers algorithm, and we derive a set of necessary and sufficient conditions that allows us to decompose the problem into independent subproblems so that our algorithm can be scaled to high-dimensional settings. Our proposal is illustrated on synthetic data, a webpage data set, and a brain cancer gene expression data set. PMID:25309137
Liu, Yanyan; Xiong, Sican; Sun, Wei; Zou, Fei
2018-02-02
Multiparent populations (MPP) have become popular resources for complex trait mapping because of their wider allelic diversity and larger population size compared with traditional two-way recombinant inbred (RI) strains. In mice, the collaborative cross (CC) is one of the most popular MPP and is derived from eight genetically diverse inbred founder strains. The strategy of generating RI intercrosses (RIX) from MPP in general and from the CC in particular can produce a large number of completely reproducible heterozygote genomes that better represent the (outbred) human population. Since both maternal and paternal haplotypes of each RIX are readily available, RIX is a powerful resource for studying both standing genetic and epigenetic variations of complex traits, in particular, the parent-of-origin (PoO) effects, which are important contributors to many complex traits. Furthermore, most complex traits are affected by more than one gene, so multiple quantitative trait locus mapping could be more advantageous. In this paper, for MPP-RIX data but taking CC-RIX as a working example, we propose a general Bayesian variable selection procedure to simultaneously search for multiple genes with founder allelic effects and PoO effects. The proposed model respects the complex relationship among RIX samples, and the performance of the proposed method is examined by extensive simulations. Copyright © 2018 Liu et al.
Selection for avian immune response: a commercial breeding company challenge.
Fulton, J E
2004-04-01
Selection for immune function in the commercial breeding environment is a challenging proposition for commercial breeding companies. Immune response is only one of many traits that are under intensive selection, thus selection pressure needs to be carefully balanced across multiple traits. The selection environment (single bird cages, biosecure facilities, controlled environment) is a very different environment than the commercial production facilities (multiple bird cages, potential disease exposure, variable environment) in which birds are to produce. The testing of individual birds is difficult, time consuming, and expensive. It is essential that the results of any tests be relevant to actual disease or environmental challenge in the commercial environment. The use of genetic markers as indicators of immune function is being explored by breeding companies. Use of genetic markers would eliminate many of the limitations in enhancing immune function currently encountered by commercial breeding companies. Information on genetic markers would allow selection to proceed without subjecting breeding stock to disease conditions and could be done before production traits are measured. These markers could be candidate genes with known interaction or involvement with disease pathology or DNA markers that are closely linked to genetic regions that influence the immune response. The current major limitation to this approach is the paucity of mapped chicken immune response genes and the limited number of DNA markers mapped on the chicken genome. These limitations should be eliminated once the chicken genome is sequenced.
Rydenfelt, Mattias; Cox, Robert Sidney; Garcia, Hernan; Phillips, Rob
2014-01-01
Transcription factors (TFs) with regulatory action at multiple promoter targets are the rule rather than the exception, with examples ranging from the cAMP receptor protein (CRP) in E. coli that regulates hundreds of different genes simultaneously to situations involving multiple copies of the same gene, such as plasmids, retrotransposons, or highly replicated viral DNA. When the number of TFs heavily exceeds the number of binding sites, TF binding to each promoter can be regarded as independent. However, when the number of TF molecules is comparable to the number of binding sites, TF titration will result in correlation (“promoter entanglement”) between transcription of different genes. We develop a statistical mechanical model which takes the TF titration effect into account and use it to predict both the level of gene expression for a general set of promoters and the resulting correlation in transcription rates of different genes. Our results show that the TF titration effect could be important for understanding gene expression in many regulatory settings. PMID:24580252
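As a rough illustration of the titration effect described in this abstract, the Python sketch below enumerates the states of a toy system in which a fixed pool of TF molecules is shared between a few identical specific sites and a reservoir of nonspecific sites. It is not the authors' model: the function name `titration_stats`, the nonspecific reservoir, the assumption of identical sites, and all parameter values are hypothetical choices made only for this example.

```python
from math import exp, lgamma


def log_comb(n, k):
    """Natural log of the binomial coefficient C(n, k)."""
    return lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)


def titration_stats(n_tf, n_sites, n_ns, delta_e_kt):
    """Occupancy statistics for a toy TF-titration model (illustrative only).

    n_tf       -- total number of TF molecules (hypothetical parameter)
    n_sites    -- number of identical specific binding sites
    n_ns       -- number of nonspecific reservoir sites (must be >= n_tf)
    delta_e_kt -- specific minus nonspecific binding energy in units of kT
                  (negative means specific binding is favoured)

    Returns (mean occupancy of one site, covariance between the
    occupancies of two distinct sites).
    """
    if n_ns < n_tf or n_sites < 2:
        raise ValueError("need n_ns >= n_tf and at least two specific sites")
    k_max = min(n_tf, n_sites)
    # Log statistical weight of having exactly k specific sites occupied:
    # choose which sites are bound, place the remaining TFs in the reservoir,
    # and apply the Boltzmann factor for k specific bonds.
    log_w = [
        log_comb(n_sites, k) + log_comb(n_ns, n_tf - k) - k * delta_e_kt
        for k in range(k_max + 1)
    ]
    shift = max(log_w)  # subtract the maximum to avoid overflow
    w = [exp(lw - shift) for lw in log_w]
    z = sum(w)
    occupancy = sum(k * wk for k, wk in enumerate(w)) / (n_sites * z)
    pair = sum(k * (k - 1) * wk for k, wk in enumerate(w)) / (
        n_sites * (n_sites - 1) * z
    )
    return occupancy, pair - occupancy ** 2


if __name__ == "__main__":
    # Scarce TFs (comparable to the number of sites) versus abundant TFs.
    print(titration_stats(10, 10, 1000, -5.0))
    print(titration_stats(500, 10, 1000, -5.0))
```

In this toy setting the covariance between two sites is larger in magnitude when the number of TF molecules is comparable to the number of sites than when TFs are abundant, which is the qualitative point the abstract makes about independent versus correlated promoters.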
Anthony, Kim; More, Abhijit; Zhang, Xiaoliu
2014-01-01
Recent work has shown that the combinatorial use of multiple TALE activators can selectively activate certain cellular genes in inaccessible chromatin regions. In this study, we aimed to interrogate the activation potential of TALEs upon transcriptionally silenced immune genes in the context of non-immune cells. We designed a unique strategy, in which a single TALE fused to the TATA-box binding protein (TBP-TALE) is coupled with multiple VP64-TALE activators. We found that our strategy is significantly more potent than multiple TALE activators alone in activating expression of IL-2 and GM-CSF in diverse cell origins in which both genes are otherwise completely silenced. Chromatin analysis revealed that the gene activation was due in part to displacement of a distinctly positioned nucleosome. These studies provide a novel epigenetic mechanism for artificial gene induction and have important implications for targeted cancer immunotherapy, DNA vaccine development, as well as rational design of TALE activators. PMID:24755922
Silver, Matt; Chen, Peng; Li, Ruoying; Cheng, Ching-Yu; Wong, Tien-Yin; Tai, E-Shyong; Teo, Yik-Ying; Montana, Giovanni
2013-01-01
Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast, gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune function. PMID:24278029
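To make the idea of a dual-level sparse penalty more concrete, the sketch below shows one standard building block, the proximal step of a sparse group lasso penalty, which zeroes individual coefficients and whole groups at once. It is offered as an assumption about how two-level sparsity can be obtained, not as the authors' algorithm: the function names, the use of non-overlapping groups, and the unweighted group penalty are all simplifications introduced here (the abstract notes that variants may overlap multiple pathways, which this sketch does not handle).

```python
from math import sqrt


def soft_threshold(x, t):
    """Shrink x towards zero by t; values within [-t, t] become exactly 0."""
    if x > t:
        return x - t
    if x < -t:
        return x + t
    return 0.0


def sparse_group_prox(beta, groups, lam_gene, lam_pathway):
    """Proximal step for lam_gene * ||beta||_1 + lam_pathway * sum_g ||beta_g||_2.

    beta        -- list of coefficients (one per SNP or gene)
    groups      -- list of index lists, one per pathway; assumed NOT to overlap
    lam_gene    -- coefficient-level threshold (hypothetical tuning parameter)
    lam_pathway -- group-level threshold (hypothetical tuning parameter)

    Coefficients that belong to no group are returned unchanged.
    """
    out = list(beta)
    for idx in groups:
        # Level 1: sparsity within the pathway.
        shrunk = [soft_threshold(beta[i], lam_gene) for i in idx]
        # Level 2: shrink the whole pathway, dropping it if its norm is small.
        norm = sqrt(sum(s * s for s in shrunk))
        scale = max(0.0, 1.0 - lam_pathway / norm) if norm > 0 else 0.0
        for i, s in zip(idx, shrunk):
            out[i] = scale * s
    return out


if __name__ == "__main__":
    coefficients = [3.0, -0.5, 0.2, 0.1]
    pathways = [[0, 1], [2, 3]]
    print(sparse_group_prox(coefficients, pathways, 1.0, 1.0))
```

With these example values the second pathway is removed entirely and only one coefficient survives in the first, which is the kind of simultaneous pathway and gene selection the abstract describes. A resampling scheme could then rank pathways and genes by how often they survive across subsamples.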
McDaniel, Lauren D.; Young, Elizabeth C.; Ritchie, Kimberly B.; Paul, John H.
2012-01-01
Microbial genomic sequence analyses have indicated widespread horizontal gene transfer (HGT). However, an adequate mechanism accounting for the ubiquity of HGT has been lacking. Recently, high frequencies of interspecific gene transfer have been documented, catalyzed by Gene Transfer Agents (GTAs) of marine α-Proteobacteria. It has been proposed that the presence of bacterial genes in highly purified viral metagenomes may be due to GTAs. However, factors influencing GTA-mediated gene transfer in the environment have not yet been determined. Several genomically sequenced strains containing complete GTA sequences similar to Rhodobacter capsulatus (RcGTA, type strain) were screened to ascertain if they produced putative GTAs, and at what abundance. Five of nine marine strains screened to date spontaneously produced virus-like particles (VLPs) in stationary phase. Three of these strains have demonstrated gene transfer activity, two of which were documented by this lab. These two strains, Roseovarius nubinhibens ISM and Nitratireductor 44B9s, were utilized to produce GTAs designated RnGTA and NrGTA, and gene transfer activity was verified in culture. Cell-free preparations of purified RnGTA and NrGTA particles from marked donor strains were incubated with natural microbial assemblages to determine the level of GTA-mediated gene transfer. In conjunction, several ambient environmental parameters were measured, including lysogeny indicated by prophage induction. GTA production in culture systems indicated that approximately half of the strains produced GTA-like particles and maximal GTA counts ranged from 10–30% of host abundance. Modeling of GTA-mediated gene transfer frequencies in natural samples, along with other measured environmental variables, indicated a strong relationship between GTA-mediated gene transfer and the combined factors of salinity, multiplicity of infection (MOI) and ambient bacterial abundance. These results indicate that GTA-mediated HGT in the marine environment with the strains examined is favored during times of elevated bacterial and GTA abundance as well as in areas of higher salinity. PMID:22905268
van Wieringen, Wessel N; van de Wiel, Mark A
2011-05-01
Realizing that genes often operate together, studies into the molecular biology of cancer shift focus from individual genes to pathways. In order to understand the regulatory mechanisms of a pathway, one must study its genes at all molecular levels. To facilitate such study at the genomic level, we developed exploratory factor analysis for the characterization of the variability of a pathway's copy number data. A latent variable model that describes the call probability data of a pathway is introduced and fitted with an EM algorithm. In two breast cancer data sets, it is shown that the first two latent variables of GO nodes, which inherit a clear interpretation from the call probabilities, are often related to the proportion of aberrations and a contrast of the probabilities of a loss and of a gain. Linking the latent variables to the node's gene expression data suggests that they capture the "global" effect of genomic aberrations on these transcript levels. In all, the proposed method provides a possibly insightful characterization of pathway copy number data, which may be fruitfully exploited to study the interaction between the pathway's DNA copy number aberrations and data from other molecular levels like gene expression.
Non-Syndromic Recurrent Multiple Odontogenic Keratocysts: A Case Report
Bartake, AR.; Shreekanth, NG.; Prabhu, S.; Gopalkrishnan, K.
2011-01-01
Odontogenic keratocysts (OKCs) are one of the most frequent features of nevoid basal cell carcinoma syndrome (NBS). The syndrome is linked with mutation in the PTCH gene. Partial expression of the gene may result in the occurrence of only multiple recurring OKCs. Our patient presented with nine cysts with multiple recurrences over a period of 11 years without any other manifestation of the syndrome. PMID:21998815